llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 22:42:52 +01:00

Author	SHA1	Message	Date
Duncan Sands	778e45e748	Turn LegalizeTypes back off again for the moment: it is breaking Darwin bootstrap due to missing functionality. llvm-svn: 53721	2008-07-17 17:06:03 +00:00
Duncan Sands	3448d4087f	Add support for promoting and expanding AssertZext and AssertSext. Needed when passing huge integer parameters with the zeroext or signext attributes. llvm-svn: 53684	2008-07-16 16:03:07 +00:00
Duncan Sands	a8b538544a	Test passing of integer parameters for integers of all sizes from i1 to i256. The code is not always that great, for example (x86) movw %di, %ax movw %ax, i17_s where the store could be directly from %di. llvm-svn: 53677	2008-07-16 13:37:36 +00:00
Duncan Sands	be15f51092	Test codegen of loads and stores of all integer sizes from i1 to i256. The generated code is like one huge bug report of things that the DAG combiner fails to simplify! llvm-svn: 53676	2008-07-16 13:10:20 +00:00
Duncan Sands	b2e1ddbd0b	Turn on LegalizeTypes by default. llvm-svn: 53671	2008-07-16 11:36:51 +00:00
Duncan Sands	35d3e774ed	The atomic.cmp.swap promotion logic is wrong: it simply does the atomic.cmp.swap on the larger type, which means it blows away whatever is sitting in the bytes just after the memory location, i.e. causes a buffer overflow. This really requires target specific code, which is why LegalizeTypes doesn't try to handle this case generically. The existing (wrong) code in LegalizeDAG will go away automatically once the type legalization code is removed from LegalizeDAG so I'm leaving it there for the moment. Meanwhile, don't test for this feature. llvm-svn: 53669	2008-07-16 08:09:48 +00:00
Duncan Sands	7ca2df2319	LegalizeTypes support for fabs on ppc long double. llvm-svn: 53613	2008-07-15 15:02:44 +00:00
Duncan Sands	58eb5e35da	LegalizeTypes support for promotion of bswap. In LegalizeDAG the value is zero-extended to the new type before byte swapping. It doesn't matter how the extension is done since the new bits are shifted off anyway after the swap, so extend by any old rubbish bits. This results in the final assembler for the testcase being one line shorter. llvm-svn: 53604	2008-07-15 10:18:22 +00:00
Duncan Sands	710be60c23	LegalizeTypes support for promotion of SIGN_EXTEND_INREG. llvm-svn: 53603	2008-07-15 10:14:24 +00:00
Evan Cheng	05e5317cab	Fix PR2536: a nasty spiller bug. If a two-address instruction uses a register but the use portion of its live range is not part of its liveinterval, it must be defined by an implicit_def. In that case, do not spill the use. e.g. 8 %reg1024<def> = IMPLICIT_DEF 12 %reg1024<def> = INSERT_SUBREG %reg1024<kill>, %reg1025, 2 The live range [12, 14) is not part of the r1024 live interval since it's defined by an implicit def. It will not conflict with the live interval of r1025. Now suppose both registers are spilled, you can easily see a situation where both registers are reloaded before the INSERT_SUBREG and both target registers that would overlap. llvm-svn: 53503	2008-07-12 01:56:02 +00:00
Duncan Sands	52f1dbf139	Port a shift-by-1 optimization from LegalizeDAG: it was presumably added after the rest of the code was copied to LegalizeTypes. llvm-svn: 53459	2008-07-11 16:54:57 +00:00
Bill Wendling	9f17caa9a9	The frame address on an x86-64 box needs to be offset by -8, not -4. llvm-svn: 53450	2008-07-11 07:18:52 +00:00
Bill Wendling	3be8dca83f	Put CPPBackend tests into their own directory and run them only if they're supported. llvm-svn: 53427	2008-07-10 22:35:32 +00:00
Chris Lattner	5f3c587276	Fix an altivec constant miscompilation that Duncan found through his work on legalizetypes. llvm-svn: 53410	2008-07-10 16:33:38 +00:00
Evan Cheng	02a618dc56	Fix for PR2472. Use movss to set lower 32-bits of a zero XMM vector. llvm-svn: 53386	2008-07-10 01:08:23 +00:00
Anton Korobeynikov	f710ada483	Testcase for PR2024 llvm-svn: 53327	2008-07-09 14:09:41 +00:00
Dan Gohman	6057cf766c	Refactor the tablegen DAGISelEmitter code for outputting calls to getTargetNode and SelectNodeTo to reduce duplication, and to make some of the getTargetNode code available to SelectNodeTo. Use SelectNodeTo instead of getTargetNode in several new interesting cases, as it mutates nodes in place instead of creating new ones. This triggers some scheduling behavior differences due to nodes being presented to the scheduler in a different order. Some of the arbitrary scheduling decisions it makes are now arbitrarily made differently. This is visible in CodeGen/PowerPC/LargeAbsoluteAddr.ll, where a trivial scheduling difference led to a trivial register allocation difference. llvm-svn: 53203	2008-07-07 21:00:17 +00:00
Evan Cheng	cf3a4ad46d	Fix two serious LSR bugs. 1. LSR runOnLoop always returns false regardless of whether any transformation is made. 2. AddUsersIfInteresting can create new instructions that are added to DeadInsts. But there is a later early exit which prevents them from being freed. llvm-svn: 53193	2008-07-07 19:51:32 +00:00
Dale Johannesen	51edab312c	Considering predecessors of exit blocks gets us a little more tail merging. llvm-svn: 52986	2008-07-01 21:50:49 +00:00
Chris Lattner	95fecdd63a	Implement split and scalarize for SELECT_CC, fixing PR2504 llvm-svn: 52887	2008-06-30 02:43:01 +00:00
Chris Lattner	153b6695b8	test doesn't need eh info llvm-svn: 52811	2008-06-27 03:14:20 +00:00
Dale Johannesen	76f5dc0cc4	Allow for rounding up of stack frame. llvm-svn: 52751	2008-06-26 01:55:32 +00:00
Chris Lattner	2b67ff8632	when we know the signbit of an input to uint_to_fp is zero, change it to sint_to_fp on targets where that is cheaper (and vice versa of course). This allows us to compile uint_to_fp to: _test: movl 4(%esp), %eax shrl $23, %eax cvtsi2ss %eax, %xmm0 movl 8(%esp), %eax movss %xmm0, (%eax) ret instead of: .align 3 LCPI1_0: ## double .long 0 ## double least significant word 4.5036e+15 .long 1127219200 ## double most significant word 4.5036e+15 .text .align 4,0x90 .globl _test _test: subl $12, %esp movl 16(%esp), %eax shrl $23, %eax movl %eax, (%esp) movl $1127219200, 4(%esp) movsd (%esp), %xmm0 subsd LCPI1_0, %xmm0 cvtsd2ss %xmm0, %xmm0 movl 20(%esp), %eax movss %xmm0, (%eax) addl $12, %esp ret llvm-svn: 52747	2008-06-26 00:16:49 +00:00
Evan Cheng	71fbfe73c1	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Mon P Wang	7d89d61387	Added MemOperands to Atomic operations since Atomics touch memory. Added abstract class MemSDNode for any Node that has an associated MemOperand. Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub. llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	bab5925a0b	Enable two-address remat by default. llvm-svn: 52701	2008-06-25 01:16:38 +00:00
Dale Johannesen	244433ebb1	v2f32 is now a valid (MMX) type which breaks this test (doesn't work for any MMX vector types, it's not me). Rewritten to use v2i16 which is generic and going to stay that way; I think that preserves the point of the test. llvm-svn: 52692	2008-06-24 22:03:36 +00:00
Evan Cheng	a62f5f0f82	If it's determined safe, remat MOV32r0 (i.e. xor r, r) and others as it is instead of using the longer MOV32ri instruction. llvm-svn: 52670	2008-06-24 07:10:51 +00:00
Bill Wendling	2501066409	This situation can occur: ,------. \| \| \| v \| t2 = phi ... t1 ... \| \| \| v \| t1 = ... \| ... = ... t1 ... \| \| `------' where there is a use in a PHI node that's a predecessor to the defining block. We don't want to mark all predecessors as having the value "alive" in this case. Also, the assert was too restrictive and didn't handle this case. llvm-svn: 52655	2008-06-23 23:41:14 +00:00
Bill Wendling	d6b7d457cf	Make test work on non-x86 machines (like my G4 PPC). llvm-svn: 52619	2008-06-23 06:16:31 +00:00
Duncan Sands	1dd6ef8f8e	Support for load/store of expanded float types. I don't know if a truncating store is possible here, but added support for it anyway. llvm-svn: 52577	2008-06-21 17:00:47 +00:00
Evan Cheng	1d07cd32c2	Undo spill weight tweak. Need to investigate the performance regressions. llvm-svn: 52572	2008-06-21 06:45:54 +00:00
Evan Cheng	4006f4cdf0	ISD::UNDEF should be expanded recursively / iteratively. llvm-svn: 52508	2008-06-19 22:01:11 +00:00
Eli Friedman	570aa6f801	Fix a bug with <8 x i16> shuffle lowering on X86 where parts of the shuffle could be skipped. The check is invalid because the loop index i doesn't correspond to the element actually inserted. The correct check is already done a few lines earlier, for whether the element is already in the right spot, so this shouldn't have any effect on the codegen for code that was already correct. llvm-svn: 52486	2008-06-19 06:09:51 +00:00
Evan Cheng	919b735586	New test case. llvm-svn: 52483	2008-06-19 01:50:24 +00:00
Evan Cheng	ee801276b3	This also got better (55 - 51 instructions). But doing one more re-materialization. llvm-svn: 52482	2008-06-19 01:50:13 +00:00
Evan Cheng	56e17b525c	This got better. llvm-svn: 52481	2008-06-19 01:46:43 +00:00
Evan Cheng	8cfd1d39a1	Do not issue identity copies. llvm-svn: 52373	2008-06-16 22:52:53 +00:00
Evan Cheng	d27948e716	- Add "Commutative" property to intrinsics. This allows tblgen to generate the commuted variants for dagisel matching code. - Mark lots of X86 intrinsics as "Commutative" to allow load folding. llvm-svn: 52353	2008-06-16 20:29:38 +00:00
Evan Cheng	2e99c9cbf8	Teach the spiller to commute instructions in order to fold a reload. This hits 410 times on 444.namd and 122 times on 252.eon. llvm-svn: 52266	2008-06-13 23:58:02 +00:00
Duncan Sands	40c8db881a	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. 
However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Evan Cheng	66ce588b87	Fix some tests. llvm-svn: 52245	2008-06-12 21:23:38 +00:00
Matthijs Kooijman	d07ffc50fa	Don't try to compile tests for the ev56 alpha subtarget, which hasn't been supported since r33492. llvm-svn: 52237	2008-06-12 13:44:26 +00:00
Dale Johannesen	47cee90b57	Fix parameter spelling: sse not sse1 llvm-svn: 52185	2008-06-10 17:57:58 +00:00
Matthijs Kooijman	00a807266e	Fix some more quoting issues in RUN lines, this time regarding unintended variable expansions involving the $ character. This fixes 4 tests that were not running properly before. llvm-svn: 52183	2008-06-10 16:10:32 +00:00
Matthijs Kooijman	281711dc95	Remove double pipes in RUN commandlines. This fixes 5 testcases that were not being run properly before. llvm-svn: 52180	2008-06-10 15:11:36 +00:00
Matthijs Kooijman	c638fe5b8b	For all RUN lines starting with "not", redirect stderr to /dev/null so tests don't fail when (expected) error output is produced. This fixes 17 tests. While I was there, I also made all RUN lines of the form "not llvm-as..." a bit more consistent, they now all redirect stderr and stdout to /dev/null and use input redirect to read their input. llvm-svn: 52174	2008-06-10 12:57:32 +00:00
Dan Gohman	f5602924ae	Convert several tests to use temporary files instead of redundantly executing the test commands. llvm-svn: 52163	2008-06-10 00:36:41 +00:00
Rafael Espindola	feaadb1e05	add support for PIC on linux x86-64 llvm-svn: 52139	2008-06-09 09:52:31 +00:00
Anton Korobeynikov	aed2cbb0a1	Remove invalid test llvm-svn: 52093	2008-06-08 16:59:10 +00:00
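
Commit 58eb5e35da above notes that when bswap is promoted to a wider type, it does not matter how the value is extended, because the extra bits are shifted off after the swap. The following is a minimal Python sketch of that argument for an i16 promoted to i32. It is not LLVM code; the helper names `bswap` and `promoted_bswap16` are hypothetical and chosen only for illustration.

```python
def bswap(value: int, bits: int) -> int:
    """Reverse the byte order of an unsigned integer that is `bits` wide."""
    nbytes = bits // 8
    return int.from_bytes(value.to_bytes(nbytes, "little"), "big")


def promoted_bswap16(x: int, high_garbage: int) -> int:
    """Byte-swap an i16 by promoting it to i32 with arbitrary upper bits.

    `high_garbage` stands in for whatever ends up in the upper half
    after an "any extend"; it is an illustrative parameter only.
    """
    # Promote: low 16 bits hold x, high 16 bits hold arbitrary data.
    promoted = ((high_garbage & 0xFFFF) << 16) | (x & 0xFFFF)
    # After the 32-bit swap, the bytes of x sit in the upper half.
    swapped = bswap(promoted, 32)
    # A logical shift right by 16 discards the garbage bytes.
    return swapped >> 16


if __name__ == "__main__":
    for garbage in (0x0000, 0xFFFF, 0xDEAD):
        assert promoted_bswap16(0x1234, garbage) == bswap(0x1234, 16)
    print(hex(promoted_bswap16(0x1234, 0xDEAD)))
```

Whatever value is passed as `high_garbage`, the result matches a direct 16-bit swap, which is why the promotion is free to use any extension rather than a zero extension.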


978 Commits