llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Bill Wendling	7546bed590	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Bill Wendling	ef47ace92f	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	2799e916c3	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Duncan Sands	d2ba02aa87	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dan Gohman	66fc2f0a88	Fix this test so that it doesn't spuriously fail due to some unrelated debugging output happening to contain the string "store". llvm-svn: 67849	2009-03-27 16:17:22 +00:00
Mon P Wang	3d7fb6738a	Added missing support for widening when splitting an unary op (PR3683) and expanding a bit convert (PR3711). In both cases, we extract the valid part of the widen vector and then do the conversion. llvm-svn: 67175	2009-03-18 06:24:04 +00:00
Evan Cheng	2a51157172	Add another test case for r64440. llvm-svn: 67156	2009-03-18 02:43:01 +00:00
Evan Cheng	71ca3e2bdb	Private linkage support for PPC / Darwin. llvm-svn: 62955	2009-01-25 06:32:01 +00:00
Evan Cheng	d7cc550900	Fix PPC ISD::Declare isel and eliminate the need for PPCTargetLowering::LowerGlobalAddress to check if isVerifiedDebugInfoDesc() is true. Given the recent changes, it would falsely return true for a lot of GlobalAddressSDNode's. llvm-svn: 62373	2009-01-16 22:57:32 +00:00
Rafael Espindola	0aba6c9435	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Chris Lattner	4166afffa7	this test should not run opt -std-compile-opts, it should run just llc. llvm-svn: 61979	2009-01-09 05:32:00 +00:00
Chris Lattner	2d3e57c337	rename a file to follow naming conventions. llvm-svn: 61550	2009-01-02 01:52:35 +00:00
Evan Cheng	da55c4ffb7	Fix PR3149. If an early clobber def is a physical register and it is tied to an input operand, it effectively extends the live range of the physical register. Currently we do not have a good way to represent this. 172 %ECX<def> = MOV32rr %reg1039<kill> 180 INLINEASM <es:subl $5,$1 sbbl $3,$0>, 10, %EAX<def>, 14, %ECX<earlyclobber,def>, 9, %EAX<kill>, 36, <fi#0>, 1, %reg0, 0, 9, %ECX<kill>, 36, <fi#1>, 1, %reg0, 0 188 %EAX<def> = MOV32rr %EAX<kill> 196 %ECX<def> = MOV32rr %ECX<kill> 204 %ECX<def> = MOV32rr %ECX<kill> 212 %EAX<def> = MOV32rr %EAX<kill> 220 %EAX<def> = MOV32rr %EAX 228 %reg1039<def> = MOV32rr %ECX<kill> The early clobber operand ties ECX input to the ECX def. The live interval of ECX is represented as this: %reg20,inf = [46,47:1)[174,230:0) 0@174-(230) 1@46-(47) The right way to represent this is something like %reg20,inf = [46,47:2)[174,182:1)[181:230:0) 0@174-(182) 1@181-230 @2@46-(47) Of course that won't work since that means overlapping live ranges defined by two val#. The workaround for now is to add a bit to val# which says the val# is redefined by an early clobber def somewhere. This prevents the move at 228 from being optimized away by SimpleRegisterCoalescing::AdjustCopiesBackFrom. llvm-svn: 61259	2008-12-19 20:58:01 +00:00
Rafael Espindola	7593f0004f	Fix bug 3202. The EH_frame and .eh symbols are now private, except for darwin9 and earlier. The patch also fixes the definition of PrivateGlobalPrefix on ppc linux. llvm-svn: 61242	2008-12-19 10:55:56 +00:00
Evan Cheng	96d87db03b	We have decided not to support inline asm where an output operand has a matching input operand of an incompatible type (i.e. either one is a floating point and the other is an integer or the sizes of the types differ). SelectionDAGBuild will catch these and exit with an error. llvm-svn: 61092	2008-12-16 18:21:39 +00:00
Evan Cheng	1b795803dd	Re-did 60519. It turns out Darwin's handling of hidden visibility symbols is a bit more complicated than I expected. Both declarations and weak definitions still need a stub indirection. However, the stubs are in the data section and they contain the addresses of the actual symbols. llvm-svn: 60571	2008-12-05 01:06:39 +00:00
Evan Cheng	a77559c870	Remove a (what appears to be) overly strict assertion. Here is what happened: 1. ppcf128 select is expanded to f64 select's. 2. f64 select operand 0 is an i1 truncate, it's promoted to i32 zero_extend. 3. f64 select is updated. It's changed back to a "NewNode" and being re-analyzed. 4. f64 select operands are being processed. Operand 0 is a "NewNode". It's being expunged out of ReplacedValues map. 5. ExpungeNode tries to remap f64 select and notice it's a "NewNode" and assert. Duncan, please take a look. Thanks. llvm-svn: 60443	2008-12-02 21:57:09 +00:00
Duncan Sands	f64dd4b09c	Check that running the DAG combiner between type and operation legalization does something useful. llvm-svn: 60108	2008-11-26 16:44:30 +00:00
Dale Johannesen	652c29e68d	Remove these, which test for optimizations that are not currently done (cf PowerPC/README.txt). llvm-svn: 59456	2008-11-17 18:57:45 +00:00
Dale Johannesen	2487d3100b	Generated code for generic expansion of SETUGT etc. is noticeably worse than previous PPC-specific code. Since the latter was also wrong in some cases and correctness is more important than efficiency, I'm disabling this test temporarily while I fix it. llvm-svn: 58876	2008-11-08 00:49:19 +00:00
Dale Johannesen	5c10f4178e	Xfail an incorrect test. llvm-svn: 58875	2008-11-08 00:40:24 +00:00
Dan Gohman	aeaf83cfb8	Make ISel ignore dead nodes. The DAGCombiner normally eliminates dead nodes, but in this case it's missing one. Fixing the DAGCombiner is desirable, but it's somewhat involved. llvm-svn: 58777	2008-11-05 22:56:47 +00:00
Duncan Sands	58ebf09772	Fix PR3011: LegalizeTypes support for scalarizing SELECT_CC. llvm-svn: 58706	2008-11-04 17:31:08 +00:00
Duncan Sands	d2500010a3	Add a bunch of libcalls for ppcf128 that were somehow completely forgotten about when writing LegalizeTypes. llvm-svn: 58508	2008-10-31 14:06:52 +00:00
Duncan Sands	1903629c49	Testcase for PR2986. llvm-svn: 58456	2008-10-30 20:34:30 +00:00
Chris Lattner	a99dc2692a	add testcase for PR2964 llvm-svn: 58393	2008-10-29 18:42:22 +00:00
Duncan Sands	a64641fbd2	Fix darwin ppc llvm-gcc build breakage: intercept ppcf128 to i32 conversion and expand it into a code sequence like in LegalizeDAG. This needs custom ppc lowering of FP_ROUND_INREG, so turn that on and make it work with LegalizeTypes. Probably PPC should simply custom lower the original conversion. llvm-svn: 58329	2008-10-28 15:00:32 +00:00
Duncan Sands	da35d6f7d6	Turn off LegalizeTypes for this test for the moment, while waiting for a proper solution. llvm-svn: 58324	2008-10-28 09:55:04 +00:00
Duncan Sands	ce82e0aa82	Fix a testcase provided by Bill in which the node id could end up being wrong mostly because of forgetting to remap new nodes that morphed into processed nodes through CSE. llvm-svn: 58323	2008-10-28 09:38:36 +00:00
Chris Lattner	3722193550	rename vec_spat -> vec_splat, pointed out by duncan llvm-svn: 58260	2008-10-27 18:28:24 +00:00
Chris Lattner	9737bef5a1	remove eh output from this test. llvm-svn: 58196	2008-10-26 18:53:07 +00:00
Chris Lattner	c369db13cc	Reapply r57699 with a fix to not crash on asms with multiple results. Unlike the previous patch this one actually passes make check. "Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand." llvm-svn: 57771	2008-10-18 18:49:30 +00:00
Dan Gohman	2eaf4f1c48	Revert r57699. It's causing regressions in test/CodeGen/X86/2008-09-17-inline-asm-1.ll and a few others, and it breaks the llvm-gcc build. llvm-svn: 57747	2008-10-18 01:03:45 +00:00
Chris Lattner	e2342cd790	Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand. llvm-svn: 57699	2008-10-17 17:52:49 +00:00
Bill Wendling	8d26b9c07a	Testcase for PR1638. llvm-svn: 57590	2008-10-15 18:27:15 +00:00
Dan Gohman	be88cf29b6	Split this test and move it into target-specific directories. This fixes failures on configurations that don't have one or the other targets enabled. llvm-svn: 56926	2008-10-01 19:46:30 +00:00
Dale Johannesen	4184c23365	Remove SelectionDag early allocation of registers for earlyclobbers. Teach Local RA about earlyclobber, and add some tests for it. llvm-svn: 56592	2008-09-24 23:13:09 +00:00
Bill Wendling	b2701e541c	Add trampoline support to PPC. GCC simply calls the "__trampoline_setup" function with appropriate parameters. This allows us to support blocks on PPC. llvm-svn: 56267	2008-09-17 00:30:57 +00:00
Evan Cheng	07046dae41	On some targets, non-move instructions can become move instructions because of coalescing. e.g. vr2 = OR vr0, vr1 => vr2 = OR vr1, vr1 // after coalescing vr0 with vr1 Update the value# of the destination register with the copy instruction if that happens. llvm-svn: 56165	2008-09-12 18:13:14 +00:00
Dale Johannesen	0124a7db51	Testcases for ppc atomics. llvm-svn: 55556	2008-08-30 00:54:31 +00:00
Mon P Wang	7566974359	In lowering SELECT_CC, removed cases where we can't flip the true and false when the compare value has a NaN llvm-svn: 55499	2008-08-28 21:04:05 +00:00
Dale Johannesen	6431c39237	Adjust grep's for new code sequence. llvm-svn: 55320	2008-08-25 18:53:58 +00:00
Evan Cheng	9c8cac5fd7	Fix a catastrophic PPC64 ABI bug: i32 operands which are passed in memory (all of the parameter registers are used) are loaded from sp offsets that were off by 4. llvm-svn: 53979	2008-07-24 08:17:07 +00:00
Dan Gohman	8981962672	Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk replacement of multiple values. This is slightly more efficient than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically could be optimized even further. However, an important property of this new function is that it handles the case where the source value set and destination value set overlap. This makes it feasible for isel to use SelectNodeTo in many very common cases, which is advantageous because SelectNodeTo avoids a temporary node and it doesn't require CSEMap updates for users of values that don't change position. Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to handle operand lists more efficiently, and to correctly handle a number of corner cases to which its new wider use exposes it. This commit also includes a change to the encoding of post-isel opcodes in SDNodes; now instead of being sandwiched between the target-independent pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel opcodes are now represented as negative values. This makes it possible to test if an opcode is pre-isel or post-isel without having to know the size of the current target's post-isel instruction set. These changes speed up llc overall by 3% and reduce memory usage by 10% on the InstructionCombining.cpp testcase with -fast and -regalloc=local. llvm-svn: 53728	2008-07-17 19:10:17 +00:00
Duncan Sands	c3331602f9	LegalizeTypes support for what seems to be the only missing ppc long double operations: FNEG and FP_EXTEND. llvm-svn: 53723	2008-07-17 17:35:14 +00:00
Duncan Sands	35d3e774ed	The atomic.cmp.swap promotion logic is wrong: it simply does the atomic.cmp.swap on the larger type, which means it blows away whatever is sitting in the bytes just after the memory location, i.e. causes a buffer overflow. This really requires target specific code, which is why LegalizeTypes doesn't try to handle this case generically. The existing (wrong) code in LegalizeDAG will go away automatically once the type legalization code is removed from LegalizeDAG so I'm leaving it there for the moment. Meanwhile, don't test for this feature. llvm-svn: 53669	2008-07-16 08:09:48 +00:00
Duncan Sands	7ca2df2319	LegalizeTypes support for fabs on ppc long double. llvm-svn: 53613	2008-07-15 15:02:44 +00:00
Duncan Sands	58eb5e35da	LegalizeTypes support for promotion of bswap. In LegalizeDAG the value is zero-extended to the new type before byte swapping. It doesn't matter how the extension is done since the new bits are shifted off anyway after the swap, so extend by any old rubbish bits. This results in the final assembler for the testcase being one line shorter. llvm-svn: 53604	2008-07-15 10:18:22 +00:00
Duncan Sands	710be60c23	LegalizeTypes support for promotion of SIGN_EXTEND_INREG. llvm-svn: 53603	2008-07-15 10:14:24 +00:00
Chris Lattner	5f3c587276	Fix an altivec constant miscompilation that Duncan found through his work on legalizetypes. llvm-svn: 53410	2008-07-10 16:33:38 +00:00
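
The message for commit 58eb5e35da above argues that, when a bswap is promoted to a wider type, it does not matter how the value is extended, because the extra bits are shifted off after the swap. A minimal C++ sketch of that reasoning follows. The function names `bswap32` and `bswap16_promoted` and the `high_garbage` parameter are hypothetical and chosen for illustration; this is not the LegalizeTypes code itself.

```cpp
#include <cassert>
#include <cstdint>

// Portable 32-bit byte swap (no compiler builtins assumed).
inline std::uint32_t bswap32(std::uint32_t v) {
    return ((v & 0x000000FFu) << 24) |
           ((v & 0x0000FF00u) << 8)  |
           ((v & 0x00FF0000u) >> 8)  |
           ((v & 0xFF000000u) >> 24);
}

// Byte-swap a 16-bit value by promoting it to 32 bits.
// `high_garbage` (hypothetical) models whatever bits the extension
// happens to leave in the upper half of the wide value.
inline std::uint16_t bswap16_promoted(std::uint16_t v,
                                      std::uint16_t high_garbage) {
    // "Any-extend": the upper 16 bits are arbitrary.
    std::uint32_t wide = (static_cast<std::uint32_t>(high_garbage) << 16) | v;
    // After the wide swap, the original bytes sit in the upper half
    // and the garbage bytes sit in the lower half.
    std::uint32_t swapped = bswap32(wide);
    // Shifting right by the width difference discards the garbage.
    return static_cast<std::uint16_t>(swapped >> 16);
}

// Small self-check: the result is independent of the garbage bits.
inline void self_check() {
    assert(bswap16_promoted(0x1234, 0x0000) == 0x3412);
    assert(bswap16_promoted(0x1234, 0xABCD) == 0x3412);
}
```

Because the result does not depend on `high_garbage`, a zero-extension before the swap is unnecessary work, which is consistent with the commit's note that the generated assembly became one line shorter.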


198 Commits