llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 13:33:37 +02:00

Author	SHA1	Message	Date
Chandler Carruth	4096a52c28	[x86] In the new vector shuffle lowering, when trying to do another layer of tie-breaking sorting, it really helps to check that you're in a tie first. =] Otherwise the whole thing cycles infinitely. Test case added, another one found through fuzz testing. llvm-svn: 218523	2014-09-26 17:24:26 +00:00
Chandler Carruth	9381e0fa4c	[x86] Fix a large collection of bugs that crept in as I fleshed out the AVX support. New test cases included. Note that none of the existing test cases covered these buggy code paths. =/ Also, it is clear from this that SHUFPS and SHUFPD are the most bug prone shuffle instructions in x86. =[ These were all detected by fuzz-testing. (I <3 fuzz testing.) llvm-svn: 218522	2014-09-26 17:11:02 +00:00
Renato Golin	eb17383852	Elide repeated register operand in Thumb1 instructions This patch makes the ARM backend transform 3 operand instructions such as 'adds/subs' to the 2 operand version of the same instruction if the first two register operands are the same. Example: 'adds r0, r0, #1' will is transformed to 'adds r0, #1'. Currently for some instructions such as 'adds' if you try to assemble 'adds r0, r0, #8' for thumb v6m the assembler would throw an error message because the immediate cannot be encoded using 3 bits. The backend should be smart enough to transform the instruction to 'adds r0, #8', which allows for larger immediate constants. Patch by Ranjeet Singh. llvm-svn: 218521	2014-09-26 16:14:29 +00:00
Andrea Di Biagio	c61458a223	[X86][SchedModel] SSE reciprocal square root instruction latencies. The SSE rsqrt instruction (a fast reciprocal square root estimate) was grouped in the same scheduling IIC_SSE_SQRT* class as the accurate (but very slow) SSE sqrt instruction. For code which uses rsqrt (possibly with newton-raphson iterations) this poor scheduling was affecting performances. This patch splits off the rsqrt instruction from the sqrt instruction scheduling classes and creates new IIC_SSE_RSQER* classes with latency values based on Agner's table. Differential Revision: http://reviews.llvm.org/D5370 Patch by Simon Pilgrim. llvm-svn: 218517	2014-09-26 12:56:44 +00:00
Frederic Riss	2e26931cef	Revert "Store TypeUnits in a SmallVector<DWARFUnitSection> instead of a single DWARFUnitSection." This reverts commit r218513. Buildbots using libstdc++ issue an error when trying to copy SmallVector<std::unique_ptr<>>. Revert the commit until we have a fix. llvm-svn: 218514	2014-09-26 12:34:06 +00:00
Frederic Riss	ea404b3a9c	Store TypeUnits in a SmallVector<DWARFUnitSection> instead of a single DWARFUnitSection. Summary: There will be multiple TypeUnits in an unlinked object that will be extracted from different sections. Now that we have DWARFUnitSection that is supposed to represent an input section, we need a DWARFUnitSection<TypeUnit> per input .debug_types section. Once this is done, the interface is homogenous and we can move the Section parsing code into DWARFUnitSection. Reviewers: samsonov, dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5482 llvm-svn: 218513	2014-09-26 12:15:40 +00:00
Daniel Sanders	31d5b7be78	Fix unused variable warning added in r218509 llvm-svn: 218510	2014-09-26 10:45:26 +00:00
Daniel Sanders	4ed8535c04	[mips] Generalize the handling of f128 return values to support f128 arguments. Summary: This will allow us to handle f128 arguments without duplicating code from CCState::AnalyzeFormalArguments() or CCState::AnalyzeCallOperands(). No functional change. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5292 llvm-svn: 218509	2014-09-26 10:06:12 +00:00
Robert Khasanov	3e0cfb4887	[AVX512] Added load/store from BW/VL subsets to Register2Memory opcode tables. Added lowering tests for these instructions. llvm-svn: 218508	2014-09-26 09:48:50 +00:00
David Majnemer	58cb17481c	llvm-vtabledump: Small cleanup llvm-svn: 218505	2014-09-26 08:01:23 +00:00
Jyoti Allur	b6f615c3fd	fix a typo in doumentation index. llvm-svn: 218504	2014-09-26 06:59:15 +00:00
David Majnemer	e51e7a3fa8	llvm-vtabledump: strip trailing NUL bytes llvm-svn: 218502	2014-09-26 05:50:45 +00:00
David Majnemer	2bce899754	Fix build breakage on MSVC 2013 llvm-svn: 218499	2014-09-26 04:47:54 +00:00
David Majnemer	032b6bd999	llvm-vtabledump: Dump RTTI structures for the MS ABI llvm-svn: 218498	2014-09-26 04:21:51 +00:00
David Majnemer	f21a7fecf5	Target: Fix build breakage. No functional change intended. llvm-svn: 218497	2014-09-26 02:57:05 +00:00
David Majnemer	b3c0e72987	Support: Remove undefined behavior from &raw_ostream::operator<< Don't negate signed integer types in &raw_ostream::operator<<(const FormattedNumber &FN). llvm-svn: 218496	2014-09-26 02:48:14 +00:00
David Xu	a2ad12fe59	Revert patch of r218493, delete the test case llvm-svn: 218495	2014-09-26 02:40:54 +00:00
David Xu	c351ac640d	Revert patch ofr218493 llvm-svn: 218494	2014-09-26 02:28:03 +00:00
David Xu	43a5d5bdc1	Redundant store instructions should be removed as dead code llvm-svn: 218493	2014-09-26 02:02:09 +00:00
Eric Christopher	a4eb9e140f	Add the first backend support for on demand subtarget creation based on the Function. This is currently used to implement mips16 support in the mips backend via the existing module pass resetting the subtarget. Things to note: a) This involved running resetTargetOptions before creating a new subtarget so that code generation options like soft-float could be recognized when creating the new subtarget. This is to deal with initialization code in isel lowering that only paid attention to the initial value. b) Many of the existing testcases weren't using the soft-float feature correctly. I've corrected these based on the check values assuming that was the desired behavior. c) The mips port now pays attention to the target-cpu and target-features strings when generating code for a particular function. I've removed these from one function where the requested cpu and features didn't match the check lines in the testcase. llvm-svn: 218492	2014-09-26 01:44:08 +00:00
Eric Christopher	cafa73c1ce	Add a FIXME to TargetMachine to remove the function specific code generation options from TargetMachine. This will depend upon Function + TargetSubtargetInfo based code generation at which point resetTargetOptions and this code can be removed. llvm-svn: 218491	2014-09-26 01:44:05 +00:00
Eric Christopher	74f3138425	Have setSubtarget take a const subtarget. llvm-svn: 218490	2014-09-26 01:28:13 +00:00
Eric Christopher	3b65e1ff31	Move resetTargetOptions from taking a MachineFunction to a Function since we are accessing the TargetMachine that we're a member function of. llvm-svn: 218489	2014-09-26 01:28:10 +00:00
Matt Arsenault	7bca916f1f	R600: Avoid repeated check lines llvm-svn: 218487	2014-09-26 01:12:36 +00:00
Matt Arsenault	5def7ffc65	R600/SI: Fix emitting trailing whitespace after s_waitcnt llvm-svn: 218486	2014-09-26 01:09:46 +00:00
Adam Nemet	75bf4d85fb	[AVX512] Simplify use of !con() No change in X86.td.expanded. llvm-svn: 218485	2014-09-26 00:53:12 +00:00
Adam Nemet	2f9330edf6	[AVX512] Pull pattern for subvector extract into the instruction definition No functional change. I initially thought that pulling the Pat<> into the instruction pattern was not possible because it was doing a transform on the index in order to convert it from a per-element (extract_subvector) index into a per-chunk (vextract*x4) index. Turns out this also works inside the pattern because the vextract_extract PatFrag has an OperandTransform EXTRACT_get_vextract{128,256}_imm, so the index in $idx goes through the same conversion. The existing test CodeGen/X86/avx512-insert-extract.ll extended in the previous commit provides coverage for this change. llvm-svn: 218480	2014-09-25 23:48:49 +00:00
Adam Nemet	ffc86b5e73	[AVX512] Make vextractx4/vinsertx4 tests check for the index as well Extend test so that it provides coverage for the next commit. llvm-svn: 218479	2014-09-25 23:48:47 +00:00
Adam Nemet	d8b4967294	[AVX512] Refactor subvector extracts No functional change. These are now implemented as two levels of multiclasses heavily relying on the new X86VectorVTInfo class. The multiclass at the first level that is called with float or int provides the 128 or 256 bit subvector extracts. The second level provides the register and memory variants and some more Pat<>s. I've compared the td.expanded files before and after. One change is that ExeDomain for 64x4 is SSEPackedDouble now. I think this is correct, i.e. a bugfix. (BTW, this is the change that was blocked on the recent tablegen fix. The class-instance values X86VectorVTInfo inside vextract_for_type weren't properly evaluated.) Part of <rdar://problem/17688758> llvm-svn: 218478	2014-09-25 23:48:45 +00:00
Adam Nemet	1dca2e1268	[AVX512] Fix typo F->I in VEXTRACTF32x4rr. llvm-svn: 218477	2014-09-25 23:48:42 +00:00
Hal Finkel	ed9bc0fad0	Add SDAG TableGen definitions for BR_CC Add SelectionDAG TableGen definitions for BR_CC so that targets can instruction-select BR_CC using TableGen pattern matching. Patch by deadal nix. llvm-svn: 218476	2014-09-25 23:34:18 +00:00
Matt Arsenault	d5fabe634e	R600: Fix some missing conversion testcases llvm-svn: 218474	2014-09-25 23:16:18 +00:00
Matt Arsenault	5eae8e006b	Remove duplicated RUN lines in middle of test llvm-svn: 218473	2014-09-25 23:16:14 +00:00
Bruno Cardoso Lopes	d68585f076	[MachineSink+PGO] Teach MachineSink to use BlockFrequencyInfo Machine Sink uses loop depth information to select between successors BBs to sink machine instructions into, where BBs within smaller loop depths are preferable. This patch adds support for choosing between successors by using profile information from BlockFrequencyInfo instead, whenever the information is available. Tested it under SPEC2006 train (average of 30 runs for each program); ~1.5% execution speedup in average on x86-64 darwin. <rdar://problem/18021659> llvm-svn: 218472	2014-09-25 23:14:26 +00:00
David Majnemer	2d4b17ffff	Object: Add range iterators for Archive children No functional change intended. llvm-svn: 218471	2014-09-25 22:56:54 +00:00
Nick Kledzik	e78cdda62d	[Support] Fix Format.h to build on Windows llvm-svn: 218467	2014-09-25 21:00:38 +00:00
Nick Kledzik	2ac9102fff	[Support] Add type-safe alternative to llvm::format() llvm::format() is somewhat unsafe. The compiler does not check that integer parameter size matches the %x or %d size and it does not complain when a StringRef is passed for a %s. And correctly using a StringRef with format() is ugly because you have to convert it to a std::string then call c_str(). The cases where llvm::format() is useful is controlling how numbers and strings are printed, especially when you want fixed width output. This patch adds some new formatting functions to raw_streams to format numbers and StringRefs in a type safe manner. Some examples: OS << format_hex(255, 6) => "0x00ff" OS << format_hex(255, 4) => "0xff" OS << format_decimal(0, 5) => " 0" OS << format_decimal(255, 5) => " 255" OS << right_justify(Str, 5) => " foo" OS << left_justify(Str, 5) => "foo " llvm-svn: 218463	2014-09-25 20:30:58 +00:00
Anton Yartsev	ab00cb374c	Refactoring: raw pointer -> unique_ptr llvm-svn: 218462	2014-09-25 19:55:58 +00:00
Tom Stellard	eaa04c42f6	ARM: Remove unneeded check for MI->hasPostISelHook() llvm-svn: 218459	2014-09-25 18:59:23 +00:00
Tom Stellard	fbc414f7ff	SelectionDAG: Remove #if NDEBUG from check for a post-isel hook The InstrEmitter will skip the check of MI.hasPostISelHook() before calling AdjustInstrPostInstrSelection() when NDEBUG is not defined. This was added in r140228, and I'm not sure if it is intentional or not, but it is a likely source for bugs, because it means with Release+Asserts builds you can forget to set the hasPostISelHook flag on TableGen definitions and AdjustInstrPostInstrSelection() will still be called. llvm-svn: 218458	2014-09-25 18:59:22 +00:00
Tom Stellard	d53a286419	R600/SI: Add support for global atomic add llvm-svn: 218457	2014-09-25 18:30:26 +00:00
Robin Morisset	98b0bed638	Lower idempotent RMWs to fence+load Summary: I originally tried doing this specifically for X86 in the backend in D5091, but it was rather brittle and generally running too late to be general. Furthermore, other targets may want to implement similar optimizations. So I reimplemented it at the IR-level, fitting it into AtomicExpandPass as it interacts with that pass (which could not be cleanly done before at the backend level). This optimization relies on a new target hook, which is only used by X86 for now, as the correctness of the optimization on other targets remains an open question. If it is found correct on other targets, it should be trivial to enable for them. Details of the optimization are discussed in D5091. Test Plan: make check-all + a new test Reviewers: jfb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5422 llvm-svn: 218455	2014-09-25 17:27:43 +00:00
Aaron Ballman	30858ed18e	Since the DisasmMemoryObject only operates on const data, it now only accepts a const data pointer. This silences a -Wcast-qual warning. llvm-svn: 218454	2014-09-25 14:02:43 +00:00
Sid Manning	f1133d5d3f	Add missing attributes !cmp.[eq,gt,gtu] instructions. These instructions do not indicate they are extendable or the number of bits in the extendable operand. Rename to match architected names. Add a testcase for the intrinsics. llvm-svn: 218453	2014-09-25 13:09:54 +00:00
Daniel Sanders	f02986683b	Add llvm_unreachables() for [ASZ]ExtUpper to X86FastISel.cpp to appease the buildbots. llvm-svn: 218452	2014-09-25 13:08:51 +00:00
Daniel Sanders	c3ccff7583	[mips] Add CCValAssign::[ASZ]ExtUpper and CCPromoteToUpperBitsInType and handle struct's correctly on big-endian N32/N64 return values. Summary: The N32/N64 ABI's require that structs passed in registers are laid out such that spilling the register with 'sd' places the struct at the lowest address. For little endian this is trivial but for big-endian it requires that structs are shifted into the upper bits of the register. We also require that structs passed in registers have the 'inreg' attribute for big-endian N32/N64 to work correctly. This is because the tablegen-erated calling convention implementation only has access to the lowered form of struct arguments (one or more integers of up to 64-bits each) and is unable to determine the original type. Reviewers: vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5286 llvm-svn: 218451	2014-09-25 12:15:05 +00:00
Renato Golin	075db70d6a	Add aliases for VAND imm to VBIC ~imm On ARM NEON, VAND with immediate (16/32 bits) is an alias to VBIC ~imm with the same type size. Adding that logic to the parser, and generating VBIC instructions from VAND asm files. This patch also fixes the validation routines for NEON splat immediates which were wrong. Fixes PR20702. llvm-svn: 218450	2014-09-25 11:31:24 +00:00
Chandler Carruth	65ea352f53	[x86] Teach the new vector shuffle lowering to use AVX2 instructions for v4f64 and v8f32 shuffles when they are lane-crossing. We have fully general lane-crossing permutation functions in AVX2 that make this easy. Part of this also changes exactly when and how these vectors are split up when we don't have AVX2. This isn't always a win but it usually is a win, so on the balance I think its better. The primary regressions are all things that just need to be fixed anyways such as modeling when a blend can be completely accomplished via VINSERTF128, etc. Also, this highlights one of the few remaining big features: we do a really poor job of inserting elements into AVX registers efficiently. This completes almost all of the big tricks I have in mind for AVX2. The only things left that I plan to add: 1) element insertion smarts 2) palignr and other fairly specialized lowerings when they happen to apply llvm-svn: 218449	2014-09-25 11:03:55 +00:00
Sylvestre Ledru	7bf0243f90	Update my previous commit to fit 80 cols... llvm-svn: 218448	2014-09-25 10:58:16 +00:00
Sylvestre Ledru	6742db270f	Details that -debug-only is not available when LLVM is built with --enable-optimized llvm-svn: 218447	2014-09-25 10:57:00 +00:00

1 2 3 4 5 ...

108080 Commits