llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Justin Holewinski	7663d1fd5a	[NVPTX] Implement fma and imad contraction as target DAGCombiner patterns This also introduces DAGCombiner patterns for mul.wide to multiply two smaller integers and produce a larger integer llvm-svn: 211935	2014-06-27 18:35:37 +00:00
Justin Holewinski	1c8d752df2	[NVPTX] Add support for efficient rotate instructions on SM 3.2+ llvm-svn: 211934	2014-06-27 18:35:33 +00:00
Justin Holewinski	b7ecf6b4d6	[NVPTX] Add missing isel patterns for 64-bit atomics llvm-svn: 211933	2014-06-27 18:35:30 +00:00
Justin Holewinski	2ffa2d24b0	[NVPTX] Add isel patterns for bit-field extract (bfe) llvm-svn: 211932	2014-06-27 18:35:27 +00:00
Justin Holewinski	7c9cd5f566	[NVPTX] Add support for isspacep instruction llvm-svn: 211931	2014-06-27 18:35:24 +00:00
Justin Holewinski	ab34c507cf	[NVPTX] Add support for envreg reads llvm-svn: 211930	2014-06-27 18:35:21 +00:00
Justin Holewinski	6eaef88634	[NVPTX] Add target options for PTX 3.2/4.0 and SM 5.0 (Maxwell) Default PTX version is set to PTX 3.2 llvm-svn: 211929	2014-06-27 18:35:18 +00:00
Justin Holewinski	1c2cdcb292	[NVPTX] Update sub-target feature detection llvm-svn: 211928	2014-06-27 18:35:16 +00:00
Justin Holewinski	7d536fecc0	[NVPTX] Directly control the Machine SSA passes that are invoked for NVPTX. NVPTX is a bit special in the optimizations it requires, so this gives us better control over the backend optimization pipeline. llvm-svn: 211927	2014-06-27 18:35:14 +00:00
Justin Holewinski	a3b8653f47	[NVPTX] Emit .weak when linkage is not external, internal, or private llvm-svn: 211926	2014-06-27 18:35:10 +00:00
Justin Holewinski	b137c23ee9	[NVPTX] Just use getTypeAllocSize() when computing return value size for structures and vectors llvm-svn: 211925	2014-06-27 18:35:08 +00:00
Tyler Nowicki	8327f835c8	Vectorization documentation for loop hint pragmas and Rpass diagnostics. llvm-svn: 211924	2014-06-27 18:30:08 +00:00
Aaron Ballman	172ae5e180	Silencing some -Wcast-qual warnings. No functional changes intended. llvm-svn: 211923	2014-06-27 18:25:49 +00:00
Chandler Carruth	ec4fb8a59b	[x86] Fix a miscompile in the new shuffle lowering uncovered by a bootstrap. I managed to mis-remember how PACKUS worked on x86, and was using undef for the high bytes instead of zero. The fix is fairly obvious. llvm-svn: 211922	2014-06-27 18:25:23 +00:00
David Majnemer	abf7854d05	IR: Add COMDATs to the IR This new IR facility allows us to represent the object-file semantic of a COMDAT group. COMDATs allow us to tie together sections and make the inclusion of one dependent on another. This is required to implement features like MS ABI VFTables and optimizing away certain kinds of initialization in C++. This functionality is only representable in COFF and ELF, Mach-O has no similar mechanism. Differential Revision: http://reviews.llvm.org/D4178 llvm-svn: 211920	2014-06-27 18:19:56 +00:00
Reid Kleckner	e91e249372	cmake: Don't do anything for LLVM_ENABLE_ASSERTIONS=OFF By default, CMake will set NDEBUG in Rel* builds and leave it off in debug builds, so we shouldn't need to do anything ourselves. Before this change, it was possible to a Debug build without assertions (aka Debug-Asserts in the autoconf system) by configuring with -DLLVM_ENABLE_ASSERTIONS=OFF, but this configuration isn't very useful. You can still get the same effect by explicitly adding -DNDEBUG to CFLAGS. Differential Revision: http://reviews.llvm.org/D4257 Patch by Janusz Sobczak! llvm-svn: 211919	2014-06-27 18:17:30 +00:00
Julien Lerouge	5dbf44035a	lldb can interrupt waitpid, so EINTR shouldn't be an error. This fixes the case where there is no timeout. In the case where there is a timeout though, the code is still wrong since it doesn't check that the alarm really went off. Without this patch, I cannot debug a program that forks itself using sys::ExecuteAndWait with lldb. llvm-svn: 211918	2014-06-27 18:02:54 +00:00
Matt Arsenault	5e70db4151	R600: Move trivial getters into header, use initializer list llvm-svn: 211917	2014-06-27 17:57:00 +00:00
David Blaikie	06bae5c2fe	Fix test so it doesn't try to write out temporary files into the test tree. llvm-svn: 211916	2014-06-27 17:45:43 +00:00
Logan Chien	f308787727	Avoid non-ascii character in the source code. llvm-svn: 211914	2014-06-27 17:25:54 +00:00
David Majnemer	c2cdc28730	MC: Fix associative sections on COFF COFF sections in MC were represented by a tuple of section-name and COMDAT-name. This is not sufficient to represent a .text section associated with another .text section; we need a way to distinguish between the key section and the one marked associative. llvm-svn: 211913	2014-06-27 17:19:44 +00:00
Juergen Ributzka	f7b405f472	[FastISel][X86] Fix typos. llvm-svn: 211911	2014-06-27 17:16:34 +00:00
Matt Arsenault	128df7aaf1	R600: Don't crash on unhandled instruction in promote alloca llvm-svn: 211906	2014-06-27 16:52:49 +00:00
Ed Maste	2e5df25fce	llvm-objdump: don't assert if ELF file has no sections FreeBSD core files, for example, have no sections (only program headers). llvm.org/pr20139 Differential Revision: http://reviews.llvm.org/D4323 llvm-svn: 211904	2014-06-27 16:37:20 +00:00
Alexander Kornienko	3be4bd19cf	Clean up unused variable warning in release build. llvm-svn: 211902	2014-06-27 15:30:55 +00:00
Chandler Carruth	35b7259047	Re-apply r211287: Remove support for LLVM runtime multi-threading. I'll fix the problems in libclang and other projects in ways that don't require <mutex> until we sort out the cygwin situation. llvm-svn: 211900	2014-06-27 15:13:01 +00:00
Ulrich Weigand	85c764c15a	[PowerPC] Constrain base register in PPCRegisterInfo::resolveFrameIndex I've run into a bug where current LLVM at -O0 (with fast-isel) generated invalid code like: ld 0, 20936(1) # 8-byte Folded Reload stw 12, 10348(0) stw 12, 10344(0) The underlying vreg had been introduced as base register by the Local Stack Slot Allocation pass. That register was constrained to G8RC by PPCRegisterInfo::materializeFrameBaseRegister to match the ADDI instruction used to set it, but it was not constrained to G8RC_NOX0 to fit the use of the register in an address. That should have happened in PPCRegisterInfo::resolveFrameIndex. This patch adds an appropriate constrainRegClass call. Reviewed by Hal Finkel. llvm-svn: 211897	2014-06-27 13:04:12 +00:00
Chandler Carruth	608f397a25	[x86] Clean up some unused variables, especially in release builds. llvm-svn: 211894	2014-06-27 12:04:18 +00:00
Chandler Carruth	e7aaf337d0	[x86] Teach the target combine step to aggressively fold pshufd insturcions. Summary: This allows it to fold pshufd instructions across intervening half-shuffles and other noise. This pattern actually shows up in the generic lowering tests, but I've also added direct tests using intrinsics to make sure that the specific desired functionality is working even if the lowering stuff changes in the future. Differential Revision: http://reviews.llvm.org/D4292 llvm-svn: 211892	2014-06-27 11:40:13 +00:00
Simon Atanasyan	fb5fe85156	[ELF][Mips] Fix recognition of MIPS 64-bit arch in the ELFObjectFile:getArch() method. llvm-svn: 211891	2014-06-27 11:36:45 +00:00
Chandler Carruth	127f1b532c	[x86] Teach the target-specific combining how to aggressively fold half-shuffles, even looking through intervening instructions in a chain. Summary: This doesn't happen to show up with any test cases I've found for the current shuffle lowering, but previous attempts would benefit from this and it seems generally useful. I've tested it directly using intrinsics, which also shows that it will work with hand vectorized code as well. Note that even though pshufd isn't directly used in these tests, it gets exercised because we combine some of the half shuffles into a pshufd first, and then merge them. Differential Revision: http://reviews.llvm.org/D4291 llvm-svn: 211890	2014-06-27 11:34:40 +00:00
Chandler Carruth	c62b16ce89	[x86] Teach the X86 backend to DAG-combine SSE2 shuffles that are trivially redundant. This fixes several cases in the new vector shuffle lowering algorithm which would generate redundant shuffle instructions for the sake of simplicity. I'm also deleting a testcase which was somewhat ridiculous. It was checking for a bug in 2007 about incorrectly transforming shuffles by looking for the string "-86" in the output of a pretty substantial function. This test case doesn't seem to have any value at this point. Differential Revision: http://reviews.llvm.org/D4240 llvm-svn: 211889	2014-06-27 11:27:52 +00:00
Chandler Carruth	e28f70fce3	[x86] Begin a significant overhaul of how vector lowering is done in the x86 backend. This sketches out a new code path for vector lowering, hidden behind an off-by-default flag while it is under development. The fundamental idea behind the new code path is to aggressively break down the problem space in ways that ease selecting the odd set of instructions available on x86, and carefully avoid scalarizing code even when forced to use older ISAs. Notably, this starts off restricting itself to SSE2 and implements the complete vector shuffle and blend space for 128-bit vectors in SSE2 without scalarizing. The plan is to layer on top of this ISA extensions where we can bail out of the complex SSE2 lowering and opt for a cheaper, specialized instruction (or set of instructions). It also needs to be generalized to AVX and AVX512 vector widths. Currently, this does a decent but not perfect job for SSE2. There are some specific shortcomings that I plan to address: - We need a peephole combine to fold together shuffles where possible. There are cases where a previous shuffle could be modified slightly to arrange for elements to be in the correct position and a later shuffle eliminated. Doing this eagerly added quite a bit of complexity, and so my plan is to combine away these redundancies afterward. - There are a lot more clever ways to use unpck and pack that need to be added. This is essential for real world shuffles as it turns out... Once SSE2 is polished a bit I should be able to get interesting numbers on performance improvements on benchmarks conducive to vectorization. All of this will be off by default until it is functionally equivalent of course. Differential Revision: http://reviews.llvm.org/D4225 llvm-svn: 211888	2014-06-27 11:23:44 +00:00
Ulrich Weigand	275191ff0a	[RuntimeDyld, PowerPC] Fix/improve handling of TOC relocations Current PPC64 RuntimeDyld code to handle TOC relocations has two problems: - With recent linkers, in addition to the relocations that implicitly refer to the TOC base (R_PPC64_TOC), you can now also use the .TOC. magic symbol with any other relocation to refer to the TOC base explicitly. This isn't currently used much in ELFv1 code (although it could be), but it is essential in ELFv2 code. - In a complex JIT environment with multiple modules, each module may have its own .toc section, and TOC relocations in one module must refer to its own* TOC section. The current findPPC64TOC implementation does not correctly implement this; in fact, it will always return the address of the first TOC section it finds anywhere. (Note that at the time findPPC64TOC is called, we don't even know which module the relocation originally resided in, so it is not even possible to fix this routine as-is.) This commit fixes both problems by handling TOC relocations earlier, in processRelocationRef. To do this, I've removed the findPPC64TOC routine and replaced it by a new routine findPPC64TOCSection, which works analogously to findOPDEntrySection in scanning the sections of the ObjImage provided by its caller, processRelocationRef. This solves the issue of finding the correct TOC section associated with the current module. This makes it straightforward to implement both R_PPC64_TOC relocations, and relocations explicitly refering to the .TOC. symbol, directly in processRelocationRef. There is now a new problem in implementing the R_PPC64_TOC16* relocations, because those can now in theory involve three different sections: the relocation may be applied in section A, refer explicitly to a symbol in section B, and refer implicitly to the TOC section C. The final processing of the relocation thus may only happen after all three of these sections have been assigned final addresses. There is currently no obvious means to implement this in its general form with the common-code RuntimeDyld infrastructure. Fortunately, ppc64 code usually makes no use of this most general form; in fact, TOC16 relocations are only ever generated by LLVM for symbols residing themselves in the TOC, which means "section B" == "section C" in the above terminology. This special case can easily be handled with the current infrastructure, and that is what this patch does. [ Unhandled cases result in an explicit error, unlike the current code which silently returns the wrong TOC base address ... ] This patch makes the JIT work on both BE and LE (ELFv2 requires additional patches, of course), and allowed me to successfully run complex JIT scenarios (via mesa/llvmpipe). Reviewed by Hal Finkel. llvm-svn: 211885	2014-06-27 10:32:14 +00:00
Alp Toker	f228194c3e	IRReader: don't mark MemoryBuffers const llvm-svn: 211883	2014-06-27 09:19:14 +00:00
Dinesh Dwivedi	73e3709b2c	Added instruction combine to transform few more negative values addition to subtraction (Part 3) This patch enables transforms for (x + (~(y \| c) + 1) --> x - (y \| c) if c is odd Differential Revision: http://reviews.llvm.org/D4210 llvm-svn: 211881	2014-06-27 07:47:35 +00:00
Eric Christopher	666067451e	Remove the caching of the target machine from SystemZTargetLowering. Update all callers and uses accordingly. llvm-svn: 211880	2014-06-27 07:38:01 +00:00
David Majnemer	be156a8772	GlobalOpt: Fix constantfold-initializers.ll test The test added in r211762 was sloppy, the correct initializer wasn't added to @llvm.global_ctors Spotted by Pasi Parviainen! llvm-svn: 211879	2014-06-27 07:36:26 +00:00
Eric Christopher	fe4c033d5f	Remove target machine caching from SystemZInstrInfo and SystemZRegisterInfo and replace it with the subtarget as that's all they needed in the first place. Update all uses and calls accordingly. llvm-svn: 211877	2014-06-27 07:01:17 +00:00
David Blaikie	d6c9082735	Revert "Revert "Revert "PR20038: DebugInfo: Inlined call sites where the caller has debug info but the call itself has no debug location.""" Reverting this again, didn't mean to commit it - while r211872 fixes one of the issues here, there are still others to figure out and address. This reverts commit r211871. llvm-svn: 211873	2014-06-27 05:34:05 +00:00
David Blaikie	232726db30	ArgumentPromotion: Propagate debug locations on calls for which arguments are promoted. llvm-svn: 211872	2014-06-27 05:32:09 +00:00
David Blaikie	a4e2150a25	Revert "Revert "PR20038: DebugInfo: Inlined call sites where the caller has debug info but the call itself has no debug location."" This reverts commit r211724. llvm-svn: 211871	2014-06-27 05:31:49 +00:00
Eric Christopher	702459da8e	Have SystemZSelectionDAGInfo constructor take a DataLayout rather than a target machine since it doesn't need anything past the DataLayout. llvm-svn: 211870	2014-06-27 05:26:28 +00:00
Craig Topper	9586f4812f	Rename getX86ConditonCode -> getX86ConditionCode llvm-svn: 211869	2014-06-27 05:18:21 +00:00
Andrew Trick	424d4c474f	Left out the NDEBUG in the previous checkin. llvm-svn: 211867	2014-06-27 05:09:36 +00:00
Andrew Trick	a8cb3163c0	MachineScheduler: add some book-keeping to fix an assert. Fixe for Bug 20057 - Assertion failied in llvm::SUnit* llvm::SchedBoundary::pickOnlyChoice(): Assertion `i <= (HazardRec->getMaxLookAhead() + MaxObservedStall) && "permanent hazard"' Thanks to Chad for the test case. llvm-svn: 211865	2014-06-27 04:57:05 +00:00
Alp Toker	efad9949fa	Propagate const-correctness into parseBitcodeFile() llvm-svn: 211864	2014-06-27 04:48:32 +00:00
Eric Christopher	75468540c0	Have MipsSelectionDAGInfo constructor take a DataLayout rather than a target machine since it doesn't need anything past the DataLayout. llvm-svn: 211863	2014-06-27 04:38:30 +00:00
Alp Toker	9208e7b6ba	ParseIR: don't take ownership of the MemoryBuffer clang was needlessly duplicating whole memory buffer contents in an attempt to satisfy unclear ownership semantics. Let's just hide internal LLVM quirks and present a simple non-owning interface. The public C API preserves previous behaviour for stability. llvm-svn: 211861	2014-06-27 04:33:58 +00:00
Eric Christopher	d9c871322b	Move NVPTX subtarget dependent variables from the target machine to the subtarget. llvm-svn: 211860	2014-06-27 04:33:14 +00:00

1 2 3 4 5 ...

105040 Commits