llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Andrew Trick	7350fe079d	Move a unit test into the correct dir. Sorry if it broke Mips-only builds. llvm-svn: 199911	2014-01-23 17:47:57 +00:00
Rafael Espindola	adb277286a	Remove tail marker when changing an argument to an alloca. Argument promotion can replace an argument of a call with an alloca. This requires clearing the tail marker as it is very likely that the callee is now using an alloca in the caller. This fixes pr14710. llvm-svn: 199909	2014-01-23 17:19:42 +00:00
Tom Stellard	6f13c22a7a	R600: Recommit 199842: Add work-around for the CF stack entry HW bug The unit test is now disabled on non-asserts builds. The CF stack can be corrupted if you use CF_ALU_PUSH_BEFORE, CF_ALU_ELSE_AFTER, CF_ALU_BREAK, or CF_ALU_CONTINUE when the number of sub-entries on the stack is greater than or equal to the stack entry size and sub-entries modulo 4 is either 0 or 3 (on cedar the bug is present when number of sub-entries module 8 is either 7 or 0) We choose to be conservative and always apply the work-around when the number of sub-enries is greater than or equal to the stack entry size, so that we can safely over-allocate the stack when we are unsure of the stack allocation rules. reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 199905	2014-01-23 16:18:02 +00:00
Simon Atanasyan	f2ddc8cbe5	[Object][ELF][Mips] Print symbol name for MIPS ELF relocations. llvm-svn: 199898	2014-01-23 15:05:45 +00:00
Elena Demikhovsky	6f951ffaa0	AVX-512: added VPERM2D VPERM2Q VPERM2PS VPERM2PD instructions, they give better sequences than VPERMI llvm-svn: 199893	2014-01-23 14:27:26 +00:00
Tim Northover	cfdf1357ee	ARM: use litpools for normal i32 imms when compiling minsize. With constant-sharing, litpool loads consume 4 + N2 bytes of code, but movw/movt pairs consume 8N. This means litpools are better than movw/movt even with just one use. Other materialisation strategies can still be better though, so the logic is a little odd. llvm-svn: 199891	2014-01-23 13:43:47 +00:00
Artyom Skrobov	6e789cecc9	Prevent repetitive warnings for unrecognized processors and features llvm-svn: 199886	2014-01-23 11:31:38 +00:00
Chandler Carruth	46bbc995de	[LPM] Make LoopSimplify no longer a LoopPass and instead both a utility function and a FunctionPass. This has many benefits. The motivating use case was to be able to compute function analysis passes after running LoopSimplify (to avoid invalidating them) and then to run other passes which require LoopSimplify. Specifically passes like unrolling and vectorization are critical to wire up to BranchProbabilityInfo and BlockFrequencyInfo so that they can be profile aware. For the LoopVectorize pass the only things in the way are LoopSimplify and LCSSA. This fixes LoopSimplify and LCSSA is next on my list. There are also a bunch of other benefits of doing this: - It is now very feasible to make more passes preserve LoopSimplify because they can simply run it after changing a loop. Because subsequence passes can assume LoopSimplify is preserved we can reduce the runs of this pass to the times when we actually mutate a loop structure. - The new pass manager should be able to more easily support loop passes factored in this way. - We can at long, long last observe that LoopSimplify is preserved across SCEV. This halves the number of times we run LoopSimplify!!! Now, getting here wasn't trivial. First off, the interfaces used by LoopSimplify are all over the map regarding how analysis are updated. We end up with weird "pass" parameters as a consequence. I'll try to clean at least some of this up later -- I'll have to have it all clean for the new pass manager. Next up I discovered a really frustrating bug. LoopUnroll claims to preserve LoopSimplify. That's actually a lie. But the way the LoopPassManager ends up running the passes, it always ran LoopSimplify on the unrolled-into loop, rectifying this oversight before any verification could kick in and point out that in fact nothing was preserved. So I've added code to the unroller to actually simplify the surrounding loop when it succeeds at unrolling. The only functional change in the test suite is that we now catch a case that was previously missed because SCEV and other loop transforms see their containing loops as simplified and thus don't miss some opportunities. One test case has been converted to check that we catch this case rather than checking that we miss it but at least don't get the wrong answer. Note that I have #if-ed out all of the verification logic in LoopSimplify! This is a temporary workaround while extracting these bits from the LoopPassManager. Currently, there is no way to have a pass in the LoopPassManager which preserves LoopSimplify along with one which does not. The LPM will try to verify on each loop in the nest that LoopSimplify holds but the now-Function-pass cannot distinguish what loop is being verified and so must try to verify all of them. The inner most loop is clearly no longer simplified as there is a pass which didn't even attempt to preserve it. =/ Once I get LCSSA out (and maybe LoopVectorize and some other fixes) I'll be able to re-enable this check and catch any places where we are still failing to preserve LoopSimplify. If this causes problems I can back this out and try to commit all of this at once, but so far this seems to work and allow much more incremental progress. llvm-svn: 199884	2014-01-23 11:23:19 +00:00
Hao Liu	80b39e0b02	[AArch64]Add CHECK for two test cases testing scalar_to_vector committed in r199461. llvm-svn: 199861	2014-01-23 02:09:30 +00:00
Jack Carter	710434d0c0	[Mips] TargetStreamer Support for .set mips16. This patch updates .set mips16 support which affects the ELF ABI and its flags. In addition the patch uses a common interface for both the MipsTargetSteamer and MipsObjectStreamer that the assembler uses for both ELF and ASCII output for these directives. llvm-svn: 199851	2014-01-22 23:08:42 +00:00
Owen Anderson	2224c08928	Revert r162101 and replace it with a solution that works for targets where the pointer type is illegal. This is a horrible bit of code. We're calling a simplification routine in the middle of type legalization. We tell the simplification routine that it's running after legalization, but some of the types it will encounter will be illegal! The fix is only to invoke the simplification if the types in question were legal, so that none of its invariants will be violated. llvm-svn: 199847	2014-01-22 22:34:17 +00:00
Matt Arsenault	5eede68ba6	Add CHECK-LABELs llvm-svn: 199846	2014-01-22 22:32:58 +00:00
Tom Stellard	d5181ee67d	Revert "R600: Add work-around for the CF stack entry HW bug" This reverts commit 35b8331cad6eb512a2506adbc394201181da94ba. The -debug-only flag for llc doesn't appear to be available in all build configurations. llvm-svn: 199845	2014-01-22 22:20:54 +00:00
Rafael Espindola	00a0fd2714	Provide a dummy section to fix a crash with inline assembly in LTO. Fixes pr18508. llvm-svn: 199843	2014-01-22 22:11:14 +00:00
Tom Stellard	cd874ab98c	R600: Add work-around for the CF stack entry HW bug The CF stack can be corrupted if you use CF_ALU_PUSH_BEFORE, CF_ALU_ELSE_AFTER, CF_ALU_BREAK, or CF_ALU_CONTINUE when the number of sub-entries on the stack is greater than or equal to the stack entry size and sub-entries modulo 4 is either 0 or 3 (on cedar the bug is present when number of sub-entries module 8 is either 7 or 0) We choose to be conservative and always apply the work-around when the number of sub-enries is greater than or equal to the stack entry size, so that we can safely over-allocate the stack when we are unsure of the stack allocation rules. reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 199842	2014-01-22 21:55:46 +00:00
Tom Stellard	ae477cc774	R600: Refactor stack size calculation reviewed-by: Vincent Lejeune <vljn at ovi.com> llvm-svn: 199840	2014-01-22 21:55:43 +00:00
Matt Arsenault	52e557deb2	Handle an addrspacecast case in memcpyopt llvm-svn: 199836	2014-01-22 21:53:19 +00:00
Alp Toker	c7f817a228	Eliminate inappropriate use of FindProgramByName() from lli llvm-svn: 199835	2014-01-22 21:52:35 +00:00
Quentin Colombet	f6ddfeb084	Add a testcase for r199430. llvm-svn: 199831	2014-01-22 20:11:50 +00:00
Tom Stellard	19af07fe92	R600: MOVA is vector only llvm-svn: 199827	2014-01-22 19:24:24 +00:00
Tom Stellard	0971c460b5	R600: Take alignment into account when calculating the stack offset llvm-svn: 199826	2014-01-22 19:24:23 +00:00
Tom Stellard	d424fe57e4	R600: Add support for global addresses with constant initializers llvm-svn: 199825	2014-01-22 19:24:21 +00:00
Tom Stellard	452996a15e	R600: Begin private memory at the second GPR. This way private memory does not over-write work group information stored in GPRs 0 and 1. llvm-svn: 199824	2014-01-22 19:24:19 +00:00
Tom Stellard	369c33de20	R600/SI: Add support for i8 and i16 private loads/stores llvm-svn: 199823	2014-01-22 19:24:14 +00:00
Matt Arsenault	bd62448a58	Bug 18228 - Fix accepting bitcasts between vectors of pointers with a different number of elements. Bitcasts were passing with vectors of pointers with different number of elements since the number of elements was checking SrcTy->getVectorNumElements() == SrcTy->getVectorNumElements() which isn't helpful. The addrspacecast was also wrong, but that case at least is caught by the verifier. Refactor bitcast and addrspacecast handling in castIsValid to be more readable and fix this problem. llvm-svn: 199821	2014-01-22 19:21:33 +00:00
Greg Fitzgerald	d54e246d6a	Fix inline assembly that switches between ARM and Thumb modes This patch restores the ARM mode if the user's inline assembly does not. In the object streamer, it ensures that instructions following the inline assembly are encoded correctly and that correct mapping symbols are emitted. For the asm streamer, it emits a .arm or .thumb directive. This patch does not ensure that the inline assembly contains the ADR instruction to switch modes at runtime. The problem we need to solve is code like this: int foo(int a, int b) { int r = a + b; asm volatile( ".align 2 \n" ".arm \n" "add r0,r0,r0 \n" : : "r"(r)); return r+1; } If we compile this function in thumb mode then the inline assembly will switch to arm mode. We need to make sure that we switch back to thumb mode after emitting the inline assembly or we will incorrectly encode the instructions that follow (i.e. the assembly instructions for return r+1). Based on patch by David Peixotto Change-Id: Ib57f6d2d78a22afad5de8693fba6230ff56ba48b llvm-svn: 199818	2014-01-22 18:32:35 +00:00
David Woodhouse	fa88b9de95	[x86] Allow segment and address-size overrides for INS[BWLQ] (PR9385) llvm-svn: 199809	2014-01-22 15:08:55 +00:00
David Woodhouse	b7c155c55a	[x86] Allow segment and address-size overrides for OUTS[BWLQ] (PR9385) llvm-svn: 199808	2014-01-22 15:08:49 +00:00
David Woodhouse	39833d37a3	[x86] Allow segment and address-size overrides for MOVS[BWLQ] (PR9385) llvm-svn: 199807	2014-01-22 15:08:42 +00:00
David Woodhouse	4515fd303f	]x86] Allow segment and address-size overrides for CMPS[BWLQ] (PR9385) llvm-svn: 199806	2014-01-22 15:08:36 +00:00
David Woodhouse	02c50a95e8	[x86] Allow address-size overrides for SCAS{8,16,32,64} (PR9385) llvm-svn: 199805	2014-01-22 15:08:27 +00:00
David Woodhouse	59ef208820	[x86] Allow address-size overrides for STOS[BWLQ] (PR9385) llvm-svn: 199804	2014-01-22 15:08:21 +00:00
David Woodhouse	e01fc03be8	[x86] Allow segment and address-size overrides for LODS[BWLQ] (PR9385) llvm-svn: 199803	2014-01-22 15:08:08 +00:00
Elena Demikhovsky	45e0f9e6b6	AVX512: combining setcc and zext is wrong on AVX512 because vector compare instruction puts result in mask register. llvm-svn: 199798	2014-01-22 12:26:19 +00:00
James Molloy	2e64527bc0	MachineCopyPropagation has special logic for removing COPY instructions. It will remove plain COPYs using eraseFromParent(), but if the COPY has imp-defs/imp-uses it will convert it to a KILL, to keep the imp-def around. This actually totally breaks and causes the machine verifier to cry in several cases, one of which being: %RAX<def> = COPY %RCX<kill> %ECX<def> = COPY %EAX<kill>, %RAX<imp-use,kill> These subregister copies are together identified as noops, so are both removed. However, the second one as it has an imp-use gets converted into a kill: %ECX<def> = KILL %EAX<kill>, %RAX<imp-use,kill> As the original COPY has been removed, the verifier goes into tears at the use of undefined EAX and RAX. There are several hacky solutions to this hacky problem (which is all to do with imp-use/def weirdnesses), but the least hacky I've come up with is to always remove COPYs by converting to KILLs. KILLs are no-ops to the code generator so the generated code doesn't change (which is why they were partially used in the first place), but using them also keeps the def/use and imp-def/imp-use chains alive: %RAX<def> = KILL %RCX<kill> %ECX<def> = KILL %EAX<kill>, %RAX<imp-use,kill> The patch passes all test cases including the ones that check the removal of MOVs in this circumstance, along with an extra test I added to check subregister behaviour (which made the machine verifier fall over before my patch). The patch also adds some DEBUG() statements because the file hadn't got any. llvm-svn: 199797	2014-01-22 09:12:27 +00:00
Kevin Qin	9a631f3af4	[AArch64 NEON] Try to generate CONCAT_VECTOR when lowering BUILD_VECTOR or SHUFFLE_VECTOR. llvm-svn: 199791	2014-01-22 06:11:03 +00:00
Venkatraman Govindaraju	2dcfaac1b8	[Sparc] Add support for inline assembly constraints which specify registers by their aliases. llvm-svn: 199786	2014-01-22 03:18:42 +00:00
Venkatraman Govindaraju	6c498f0e2f	[Sparc] Add support for inline assembly constraint 'I'. llvm-svn: 199781	2014-01-22 01:29:51 +00:00
Venkatraman Govindaraju	0a2da12ffd	[Sparc] Do not add PC to _GLOBAL_OFFSET_TABLE_ address to access GOT in absolute code. Fixes PR#18521 llvm-svn: 199775	2014-01-22 00:13:18 +00:00
Duncan P. N. Exon Smith	6d0186fb05	CodeGen: Stop treating vectors as aggregates Fix a crash in SjLjEHPrepare::lowerIncomingArguments caused by treating VectorType like an aggregate. It's first-class! <rdar://problem/15854596> llvm-svn: 199768	2014-01-21 22:46:46 +00:00
Chandler Carruth	a4347f800e	Tweak the spelling of the asserts requirement a bit more. This makes it match the (reasonably prevelant) usage in Clang's test suite and so seems more "canonical". llvm-svn: 199767	2014-01-21 22:39:19 +00:00
Andrew Trick	e67db7b7b2	Fix PR18572 - llc crash during GenericScheduler::initPolicy(). Generalized the heuristic that looks at the (very rough) size of the register file before enabling regpressure tracking. llvm-svn: 199766	2014-01-21 21:27:37 +00:00
David Majnemer	4537541170	Forgot to add testcase for r198590 llvm-svn: 199765	2014-01-21 20:39:11 +00:00
Hal Finkel	ca5b9beeb1	Fix pointer info on PPC byval stores For PPC64 SVR (and Darwin), the stores that take byval aggregate parameters from registers into the stack frame had MachinePointerInfo objects with incorrect offsets. These offsets are relative to the object itself, not to the stack frame base. This fixes self hosting on PPC64 when compiling with -enable-aa-sched-mi. llvm-svn: 199763	2014-01-21 20:15:58 +00:00
Justin Holewinski	dbdc639118	[NVPTX] Add missing patterns for div.approx with immediate denominator llvm-svn: 199746	2014-01-21 14:40:05 +00:00
Saleem Abdulrasool	aae6a31cf3	tools: support decoding ARM EHABI opcodes in readobj Add support to llvm-readobj to decode the actual opcodes. The ARM EHABI opcodes are a variable length instruction set that describe the operations required for properly unwinding stack frames. The primary motivation for this change is to ease the creation of tests for the ARM EHABI object emission as well as the unwinding directive handling in the ARM IAS. Thanks to Logan Chien for an extra test case! llvm-svn: 199708	2014-01-21 02:33:15 +00:00
Saleem Abdulrasool	d7349ac01d	ARM IAS: add support for .unwind_raw directive This implements the unwind_raw directive for the ARM IAS. The unwind_raw directive takes the form of a stack offset value followed by one or more bytes representing the opcodes to be emitted. The opcode emitted will interpreted as if it were assembled by the opcode assembler via the standard unwinding directives. Thanks to Logan Chien for an extra test! llvm-svn: 199707	2014-01-21 02:33:10 +00:00
Saleem Abdulrasool	4a5175ebb3	ARM IAS: support .personalityindex The .personalityindex directive is equivalent to the .personality directive with the ARM EABI personality with the specific index (0, 1, 2). Both of these directives indicate personality routines, so enhance the personality directive handling to take into account personalityindex. Bonus fix: flush the UnwindContext at the beginning of a new function. Thanks to Logan Chien for additional tests! llvm-svn: 199706	2014-01-21 02:33:02 +00:00
Kevin Qin	d925e0a953	[AArch64 NEON] Fix a bug caused by undef lane when generating VEXT. It was commited as r199628 but reverted in r199628 as causing regression test failed. It's because of old vervsion of patch I used to commit. Sorry for mistake. llvm-svn: 199704	2014-01-21 01:48:52 +00:00
Andrea Di Biagio	5b5e7e3cb5	[X86] Teach how to combine a vselect into a movss/movsd Add target specific rules for combining vselect dag nodes into movss/movsd when possible. If the vector type of the vselect dag node in input is either MVT::v4i13 or MVT::v4f32, then try to fold according to rules: 1) fold (vselect (build_vector (0, -1, -1, -1)), A, B) -> (movss A, B) 2) fold (vselect (build_vector (-1, 0, 0, 0)), A, B) -> (movss B, A) If the vector type of the vselect dag node in input is either MVT::v2i64 or MVT::v2f64 (and we have SSE2), then try to fold according to rules: 3) fold (vselect (build_vector (0, -1)), A, B) -> (movsd A, B) 4) fold (vselect (build_vector (-1, 0)), A, B) -> (movsd B, A) llvm-svn: 199683	2014-01-20 19:35:22 +00:00

1 2 3 4 5 ...

22474 Commits