llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Toma Tabacu	d8b3aeeabe	[mips] Remove a redundant semicolon and add space before curly brackets. NFC. llvm-svn: 226269	2015-01-16 10:45:15 +00:00
Simon Pilgrim	f787eeaa8f	[X86] Refactored stack memory folding tests to explicitly force register spilling The current 'big vectors' stack folded reload testing pattern is very bulky and makes it difficult to test all instructions as big vectors will tend to use only the ymm instruction implementations. This patch changes the tests to use a nop call that lists explicit xmm registers as sideeffects, with this we can force a partial register spill of the relevant registers and then check that the reload is correctly folded. The asm generated only adds the forced spill, a nop instruction and a couple of extra labels (a fraction of the current approach). More exhaustive tests will follow shortly, I've added some extra tests (the xmm versions of some of the existing folding tests) as a starting point. Differential Revision: http://reviews.llvm.org/D6932 llvm-svn: 226264	2015-01-16 09:32:54 +00:00
Timur Iskhodzhanov	6d120e1a54	Revert r226242 - Revert Revert Don't create new comdats in CodeGen This breaks AddressSanitizer (ninja check-asan) on Windows llvm-svn: 226251	2015-01-16 08:38:45 +00:00
Filipe Cabecinhas	47d5b20f32	Use report_fatal_error instead of llvm_unreachable, so we don't crash on user input llvm-svn: 226248	2015-01-16 04:54:12 +00:00
Hal Finkel	04316a019c	[PowerPC] Adjust PatchPoints for ppc64le Bill Schmidt pointed out that some adjustments would be needed to properly support powerpc64le (using the ELF V2 ABI). For one thing, R11 is not available as a scratch register, so we need to use R12. R12 is also available under ELF V1, so to maintain consistency, I flipped the order to make R12 the first scratch register in the array under both ABIs. llvm-svn: 226247	2015-01-16 04:40:58 +00:00
Mehdi Amini	a1e86a9849	Fix Reassociate handling of constant in presence of undef float http://reviews.llvm.org/D6993 llvm-svn: 226245	2015-01-16 03:00:58 +00:00
Rafael Espindola	f1394d41f0	Revert "Revert Don't create new comdats in CodeGen" This reverts commit r226173, adding r226038 back. No change in this commit, but clang was changed to also produce trivial comdats for costructors, destructors and vtables when needed. Original message: Don't create new comdats in CodeGen. This patch stops the implicit creation of comdats during codegen. Clang now sets the comdat explicitly when it is required. With this patch clang and gcc now produce the same result in pr19848. llvm-svn: 226242	2015-01-16 02:22:55 +00:00
Kevin Enderby	454bb27b57	Work around to get the build bot clang-cmake-armv7-a15-full green by removing the macho-archive-headers.test added with r226228 that it is failing on for now while I try to figure out what is going on. llvm-svn: 226241	2015-01-16 02:08:11 +00:00
Kevin Enderby	769159a05f	Another attempt to fix the build bot clang-cmake-armv7-a15-full failing on the macho-archive-headers.test added with r226228. llvm-svn: 226239	2015-01-16 01:09:54 +00:00
Sanjoy Das	8ce28789d0	Add a new pass "inductive range check elimination" IRCE eliminates range checks of the form 0 <= A * I + B < Length by splitting a loop's iteration space into three segments in a way that the check is completely redundant in the middle segment. As an example, IRCE will convert len = < known positive > for (i = 0; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } to len = < known positive > limit = smin(n, len) // no first segment for (i = 0; i < limit; i++) { if (0 <= i && i < len) { // this check is fully redundant do_something(); } else { throw_out_of_bounds(); } } for (i = limit; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } IRCE can deal with multiple range checks in the same loop (it takes the intersection of the ranges that will make each of them redundant individually). Currently IRCE does not do any profitability analysis. That is a TODO. Please note that the status of this pass is experimental, and it is not part of any default pass pipeline. Having said that, I will love to get feedback and general input from people interested in trying this out. This pass was originally r226201. It was reverted because it used C++ features not supported by MSVC 2012. Differential Revision: http://reviews.llvm.org/D6693 llvm-svn: 226238	2015-01-16 01:03:22 +00:00
Kevin Enderby	c79090cb32	This should fix the build bot clang-cmake-armv7-a15-full failing on the macho-archive-headers.test added with r226228. llvm-svn: 226232	2015-01-16 00:27:31 +00:00
Matt Arsenault	5500d52bba	R600/SI: Add patterns for v_cvt_{flr\|rpi}_i32_f32 llvm-svn: 226230	2015-01-15 23:58:35 +00:00
Filipe Cabecinhas	f9b20bcc5c	Fix edge case when Start overflowed in 32 bit mode llvm-svn: 226229	2015-01-15 23:50:44 +00:00
Kevin Enderby	c8e3a999b3	Add the option, -archive-headers, used with -macho to print the Mach-O archive headers to llvm-objdump. llvm-svn: 226228	2015-01-15 23:19:11 +00:00
Matt Arsenault	2f04c34f62	R600/SI: Fix trailing comma with modifiers Instructions with 1 operand can still use source modifiers, so make sure we don't print an extra comma afterwards. llvm-svn: 226226	2015-01-15 23:17:03 +00:00
Colin LeMahieu	ef6291ec41	[Hexagon] Adding new-value store and bit reverse instructions. llvm-svn: 226224	2015-01-15 23:10:29 +00:00
Filipe Cabecinhas	f8083bfc50	Report fatal errors instead of segfaulting/asserting on a few invalid accesses while reading MachO files. Summary: Shift an older “invalid file” test to get a consistent naming for these tests. Bugs found by afl-fuzz Reviewers: rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6945 llvm-svn: 226219	2015-01-15 22:52:38 +00:00
Lang Hames	9510b7f6fd	[Object] Add SF_Exported flag. This flag will be set on all symbols that would be exported from a dylib if their containing object file were linked into one. No test case: No command line tools query this flag, and there are no Object unit tests. llvm-svn: 226217	2015-01-15 22:33:30 +00:00
Sanjoy Das	618e939258	Revert r226201 (Add a new pass "inductive range check elimination") The change used C++11 features not supported by MSVC 2012. I will fix the change to use things supported MSVC 2012 and recommit shortly. llvm-svn: 226216	2015-01-15 22:18:10 +00:00
David Majnemer	3affad4957	InductiveRangeCheckElimination: Remove extra ';' This silences a GCC warning. llvm-svn: 226215	2015-01-15 21:55:16 +00:00
Andrew Kaylor	c921a5c529	Fixing pedantic build warnings. llvm-svn: 226214	2015-01-15 21:50:53 +00:00
Colin LeMahieu	d02bf117bc	[Hexagon] Fix 226206 by uncommenting required pattern and changing patterns for simple load-extends. llvm-svn: 226210	2015-01-15 21:35:49 +00:00
Hal Finkel	dcf8b14857	[PowerPC] Loosen ELFv1 PPC64 func descriptor loads for indirect calls Function pointers under PPC64 ELFv1 (which is used on PPC64/Linux on the POWER7, A2 and earlier cores) are really pointers to a function descriptor, a structure with three pointers: the actual pointer to the code to which to jump, the pointer to the TOC needed by the callee, and an environment pointer. We used to chain these loads, and make them opaque to the rest of the optimizer, so that they'd always occur directly before the call. This is not necessary, and in fact, highly suboptimal on embedded cores. Once the function pointer is known, the loads can be performed ahead of time; in fact, they can be hoisted out of loops. Now these function descriptors are almost always generated by the linker, and thus the contents of the descriptors are invariant. As a result, by default, we'll mark the associated loads as invariant (allowing them to be hoisted out of loops). I've added a target feature to turn this off, however, just in case someone needs that option (constructing an on-stack descriptor, casting it to a function pointer, and then calling it cannot be well-defined C/C++ code, but I can imagine some JIT-compilation system doing so). Consider this simple test: $ cat call.c typedef void (fp)(); void bar(fp x) { for (int i = 0; i < 1600000000; ++i) x(); } $ cat main.c typedef void (fp)(); void bar(fp x); void foo() {} int main() { bar(foo); } On the PPC A2 (the BG/Q supercomputer), marking the function-descriptor loads as invariant brings the execution time down to ~8 seconds from ~32 seconds with the loads in the loop. The difference on the POWER7 is smaller. Compiling with: gcc -std=c99 -O3 -mcpu=native call.c main.c : ~6 seconds [this is 4.8.2] clang -O3 -mcpu=native call.c main.c : ~5.3 seconds clang -O3 -mcpu=native call.c main.c -mno-invariant-function-descriptors : ~4 seconds (looks like we'd benefit from additional loop unrolling here, as a first guess, because this is faster with the extra loads) The -mno-invariant-function-descriptors will be added to Clang shortly. llvm-svn: 226207	2015-01-15 21:17:34 +00:00
Colin LeMahieu	ad558c9627	[Hexagon] Updating indexed load-extend patterns and changing test to new expected output. llvm-svn: 226206	2015-01-15 21:07:52 +00:00
Sanjoy Das	a7eb1a0b3d	Add a new pass "inductive range check elimination" IRCE eliminates range checks of the form 0 <= A * I + B < Length by splitting a loop's iteration space into three segments in a way that the check is completely redundant in the middle segment. As an example, IRCE will convert len = < known positive > for (i = 0; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } to len = < known positive > limit = smin(n, len) // no first segment for (i = 0; i < limit; i++) { if (0 <= i && i < len) { // this check is fully redundant do_something(); } else { throw_out_of_bounds(); } } for (i = limit; i < n; i++) { if (0 <= i && i < len) { do_something(); } else { throw_out_of_bounds(); } } IRCE can deal with multiple range checks in the same loop (it takes the intersection of the ranges that will make each of them redundant individually). Currently IRCE does not do any profitability analysis. That is a TODO. Please note that the status of this pass is experimental, and it is not part of any default pass pipeline. Having said that, I will love to get feedback and general input from people interested in trying this out. Differential Revision: http://reviews.llvm.org/D6693 llvm-svn: 226201	2015-01-15 20:45:46 +00:00
Hal Finkel	d48111840e	Revert "r226086 - Revert "r226071 - [RegisterCoalescer] Remove copies to reserved registers"" Reapply r226071 with fixes. Two fixes: 1. We need to manually remove the old and create the new 'deaf defs' associated with physical register definitions when we move the definition of the physical register from the copy point to the point of the original vreg def. This problem was picked up by the machinstr verifier, and could trigger a verification failure on test/CodeGen/X86/2009-02-12-DebugInfoVLA.ll, so I've turned on the verifier in the tests. 2. When moving the def point of the phys reg up, we need to make sure that it is neither defined nor read in between the two instructions. We don't, however, extend the live ranges of phys reg defs to cover uses, so just checking for live-range overlap between the pair interval and the phys reg aliases won't pick up reads. As a result, we manually iterate over the range and check for reads. A test soon to be committed to the PowerPC backend will test this change. Original commit message: [RegisterCoalescer] Remove copies to reserved registers This allows the RegisterCoalescer to join "non-flipped" range pairs with a physical destination register -- which allows the RegisterCoalescer to remove copies like this: <vreg> = something (maybe a load, for example) ... (things that don't use PHYSREG) PHYSREG = COPY <vreg> (with all of the restrictions normally applied by the RegisterCoalescer: having compatible register classes, etc. ) Previously, the RegisterCoalescer handled only the opposite case (copying from a physical register). I don't handle the problem fully here, but try to get the common case where there is only one use of <vreg> (the COPY). An upcoming commit to the PowerPC backend will make this pattern much more common on PPC64/ELF systems. llvm-svn: 226200	2015-01-15 20:32:09 +00:00
Philip Reames	b8aacc1dea	Style cleanup of old gc.root lowering code Use static functions for helpers rather than static member functions. a) this changes the linking (minor at best), and b) this makes it obvious no object state is involved. llvm-svn: 226198	2015-01-15 19:49:25 +00:00
Matt Arsenault	1851a2058d	R600/SI: Improve fpext / fptrunc test coverage llvm-svn: 226197	2015-01-15 19:39:42 +00:00
Philip Reames	38c9dcbb7c	clang-format GCStrategy.cpp & GCRootLowering.cpp (NFC) llvm-svn: 226196	2015-01-15 19:39:17 +00:00
Philip Reames	2743d53c9e	Split GCStrategy.cpp into two files (NFC) This preparation for an update to http://reviews.llvm.org/D6811. GCStrategy.cpp will hopefully be moving into IR/, where as the lowering logic needs to stay in CodeGen/ llvm-svn: 226195	2015-01-15 19:29:42 +00:00
Colin LeMahieu	14474ecfe7	[Hexagon] Removing old versions of vsplice, valign, cl0, ct0 and updating references to new versions. llvm-svn: 226194	2015-01-15 19:28:32 +00:00
Marek Olsak	5cf742495d	R600/SI: Unify VOP2 instructions which are VOP3-only on VI This removes some duplicated classes and definitions. These instructions are defined: _e32 // pseudo _e32_si _e64 // pseudo _e64_si _e64_vi llvm-svn: 226191	2015-01-15 18:43:06 +00:00
Marek Olsak	6dc07f6738	R600/SI: Use 64-bit encoding by default for opcodes that are VOP3-only on VI llvm-svn: 226190	2015-01-15 18:43:01 +00:00
Marek Olsak	9dde9a5c29	R600/SI: Add V_READLANE_B32 and V_WRITELANE_B32 for VI These are VOP3-only on VI. The new multiclass doesn't define VOP3 versions of VOP2 instructions. llvm-svn: 226189	2015-01-15 18:42:55 +00:00
Marek Olsak	e2d9d70da1	R600/SI: Don't shrink instructions whose e32 encoding doesn't exist v2: modify hasVALU32BitEncoding instead v3: - add pseudoToMCOpcode helper to AMDGPUInstInfo, which is used by both hasVALU32BitEncoding and AMDGPUMCInstLower::lower - report an error if a pseudo can't be lowered llvm-svn: 226188	2015-01-15 18:42:51 +00:00
Marek Olsak	d720b76451	R600/SI: Add common class VOPAnyCommon llvm-svn: 226187	2015-01-15 18:42:44 +00:00
Marek Olsak	d751f41714	R600/SI: Don't select SI-only VOP3 opcodes on VI llvm-svn: 226186	2015-01-15 18:42:40 +00:00
Colin LeMahieu	39bcb9a1e4	[Hexagon] Adding vmux instruction. Removing old transfer instructions and updating references. llvm-svn: 226184	2015-01-15 18:16:00 +00:00
Ramkumar Ramachandra	6658c23924	statepoint tests: use statepoint-example gc Mechanical conversion of statepoint tests to use the example-statepoint gc. llvm-svn: 226183	2015-01-15 18:10:44 +00:00
Joerg Sonnenberger	76e84ed36b	Support @PLT loads on 32bit x86. llvm-svn: 226182	2015-01-15 17:59:02 +00:00
Colin LeMahieu	55264a56e6	[Hexagon] Deleting old float comparison instruction and updating references to new ones. llvm-svn: 226179	2015-01-15 17:28:14 +00:00
Colin LeMahieu	6ce6625967	[Hexagon] Replacing old fadd/fsub instructions and updating references. llvm-svn: 226176	2015-01-15 16:30:07 +00:00
Timur Iskhodzhanov	7b5eababde	Revert Don't create new comdats in CodeGen It breaks AddressSanitizer on Windows. llvm-svn: 226173	2015-01-15 16:14:34 +00:00
Daniel Sanders	ebcb17dfb5	[mips] Fix a typo in the compare patterns for MIPS32r6/MIPS64r6. Summary: The patterns intended for the SETLE node were actually matching the SETLT node. Reviewers: atanasyan, sstankovic, vmedic Reviewed By: vmedic Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D6997 llvm-svn: 226171	2015-01-15 15:41:03 +00:00
Vasileios Kalintiris	b3567820d5	Fix the C-API MCJIT test for 32-bit big endian machines. Avoid using unions for storing the return value from LLVMGetGlobalValueAddress() and LLVMGetFunctionAddress() and accessing it as a pointer through another pointer member. This causes problems on 32-bit big endian machines since the pointer gets the higher part of the return value of the aforementioned functions. llvm-svn: 226170	2015-01-15 15:36:04 +00:00
Vladimir Medic	68db5bdb63	Add disassembler tests for mips64r6 platform. There are no functional changes. llvm-svn: 226166	2015-01-15 14:18:12 +00:00
Vladimir Medic	d7462ab6c7	Add disassembler tests for mips32r6 platform. There are no functional changes. llvm-svn: 226165	2015-01-15 14:11:38 +00:00
Vladimir Medic	4f5cb92c60	Add disassembler tests for mips64r2 platform. There are no functional changes. llvm-svn: 226164	2015-01-15 14:06:34 +00:00
Mehdi Amini	be6225a8f6	Fix SelectionDAG -view-*-dags filtering llvm-svn: 226163	2015-01-15 12:03:32 +00:00
Alexander Kornienko	66580103e2	Replace size method call of containers to empty method where appropriate This patch was generated by a clang tidy checker that is being open sourced. The documentation of that checker is the following: /// The emptiness of a container should be checked using the empty method /// instead of the size method. It is not guaranteed that size is a /// constant-time function, and it is generally more efficient and also shows /// clearer intent to use empty. Furthermore some containers may implement the /// empty method but not implement the size method. Using empty whenever /// possible makes it easier to switch to another container in the future. Patch by Gábor Horváth! llvm-svn: 226161	2015-01-15 11:41:30 +00:00

1 2 3 4 5 ...

111848 Commits