llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-31 16:02:52 +01:00

Author	SHA1	Message	Date
Evan Cheng	f3eb9e3262	Re-enable branchfolding common code hoisting optimization. Fixed a liveness test bug and also taught it to update liveins. llvm-svn: 131241	2011-05-12 20:30:01 +00:00
Devang Patel	344808fbe5	Identify end of prologue (and beginning of function body) using DW_LNS_set_prologue_end line table opcode. llvm-svn: 131194	2011-05-11 19:22:19 +00:00
Nadav Rotem	57dd315a3b	Fixes a bug in the DAGCombiner. LoadSDNodes have two values (data, chain). If there is a store after the load node, then there is a chain, which means that there is another user. Thus, asking hasOneUser would fail. Instead we ask hasNUsesOfValue on the 'data' value. llvm-svn: 131183	2011-05-11 14:40:50 +00:00
Nadav Rotem	2a654a69ed	Add custom lowering of X86 vector SRA/SRL/SHL when the shift amount is a splat vector. llvm-svn: 131179	2011-05-11 08:12:09 +00:00
Rafael Espindola	dfc30289f1	Revert 131172 as it is causing clang to miscompile itself. I will try to provide a reduced testcase. llvm-svn: 131176	2011-05-11 03:27:17 +00:00
Evan Cheng	271e0ebf0a	Add a late optimization to BranchFolding that hoist common instruction sequences at the start of basic blocks to their common predecessor. It's actually quite common (e.g. about 50 times in JM/lencod) and has shown to be a nice code size benefit. e.g. pushq %rax testl %edi, %edi jne LBB0_2 ## BB#1: xorb %al, %al popq %rdx ret LBB0_2: xorb %al, %al callq _foo popq %rdx ret => pushq %rax xorb %al, %al testl %edi, %edi je LBB0_2 ## BB#1: callq _foo LBB0_2: popq %rdx ret rdar://9145558 llvm-svn: 131172	2011-05-11 01:03:01 +00:00
Benjamin Kramer	ba7c9948e8	X86: Add a bunch of peeps for add and sub of SETB. "b + ((a < b) ? 1 : 0)" compiles into cmpl %esi, %edi adcl $0, %esi instead of cmpl %esi, %edi sbbl %eax, %eax andl $1, %eax addl %esi, %eax This saves a register, a false dependency on %eax (Intel's CPUs still don't ignore it) and it's shorter. llvm-svn: 131070	2011-05-08 18:36:07 +00:00
Jakob Stoklund Olesen	bb09bbccb8	Emit a proper error message when register allocators run out of registers. This can't be just an assertion, users can always write impossible inline assembly. Such an assembly statement should be included in the error message. llvm-svn: 131024	2011-05-06 21:58:30 +00:00
Eli Friedman	f7b4d848ae	Re-revert r130877; it's apparently causing a regression on 197.parser, possibly related to cbnz formation. llvm-svn: 130977	2011-05-06 05:23:07 +00:00
Rafael Espindola	ab39b8319b	Don't produce a __debug_frame. I tested both gdb on a bootstrapped clang and and the gdb testsuite on OS X (snow leopard) and both are happy using __eh_frame. llvm-svn: 130937	2011-05-05 18:43:39 +00:00
Eli Friedman	09ec41fcde	Avoid extra vreg copies for arguments passed in registers. Specifically, this can make MachineCSE more effective in some cases (especially in small functions). PR8361 / part of rdar://problem/8259436 . llvm-svn: 130928	2011-05-05 16:53:34 +00:00
Jakob Stoklund Olesen	f27731bf40	Prepare remaining tests for -join-physreg going away. llvm-svn: 130893	2011-05-04 23:54:59 +00:00
Jakob Stoklund Olesen	e964058440	Fix a batch of x86 tests to be coalescer independent. Most of these tests require a single mov instruction that can come either before or after a 2-addr instruction. -join-physregs changes the behavior, but the results are equivalent. llvm-svn: 130891	2011-05-04 23:54:51 +00:00
Eli Friedman	5b78092546	Re-commit r130862 with a minor change to avoid an iterator running off the edge in some cases. Original message: Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130877	2011-05-04 22:10:36 +00:00
Eli Friedman	cc74616be6	Back out r130862; it appears to be breaking bootstrap. llvm-svn: 130867	2011-05-04 20:48:42 +00:00
Eli Friedman	e086e00208	Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130862	2011-05-04 19:54:24 +00:00
Jakob Stoklund Olesen	4d020cd8e5	Don't depend on the physreg coalescing order. llvm-svn: 130818	2011-05-04 01:01:47 +00:00
Bill Wendling	67f5e8f0a7	Replace the "movnt" intrinsics with a native store + nontemporal metadata bit. <rdar://problem/8460511> llvm-svn: 130791	2011-05-03 21:11:17 +00:00
Rafael Espindola	d49e7769a7	Add r130623 back now that ELF has been fixed to work with -fno-dwarf2-cfi-asm. llvm-svn: 130658	2011-05-01 15:44:13 +00:00
Rafael Espindola	eb5d0cb4f4	GCC uses a different encoding of pointers in the FDE when using -fno-dwarf2-cfi-asm. Implement the same behavior. llvm-svn: 130637	2011-05-01 04:49:54 +00:00
Rafael Espindola	886aa563be	Revert the previous patch while I figure out how to make llvm-gcc less agressive about disabling cfi on linux :-( llvm-svn: 130626	2011-04-30 23:03:44 +00:00
Rafael Espindola	9455887b10	Enable CFI on OS X. Currently the output should be almost identical to the one produced by CodeGen to make the transition easier. The only two differences I know of are: * Some files get an extra advance loc of size 0. This will be fixed when relaxations are enabled. * The optimization of declaring an EH symbol as an external variable is not implemented. This is a subset of adding the nounwind attribute, so we if really this at -O0 we should probably do it at the IL level. llvm-svn: 130623	2011-04-30 22:29:54 +00:00
Jakob Stoklund Olesen	c140b5ac84	Allow folded spills in test. llvm-svn: 130599	2011-04-30 08:00:50 +00:00
Jakob Stoklund Olesen	2db84c62f6	Weekly fix of register allocation dependent unit tests. llvm-svn: 130567	2011-04-30 01:37:52 +00:00
Rafael Espindola	16455286cb	Change DwarfCFIException's member variables to track what it actually emmits: .cfi_personality, .cfi_lsda and the moves. llvm-svn: 130503	2011-04-29 14:48:51 +00:00
Eli Friedman	afc21e9be2	fast-isel sret calls, try 2. We actually do need to do something on x86-32. rdar://problem/9303592 . llvm-svn: 130429	2011-04-28 20:19:12 +00:00
Eli Friedman	73c94e2b24	Actually revert r130348 correctly. llvm-svn: 130418	2011-04-28 18:20:24 +00:00
Eli Friedman	7b68473355	Revert r130348; causing buildbot issues on x86-32. llvm-svn: 130412	2011-04-28 18:06:10 +00:00
Devang Patel	900ceb725b	Teach dwarf writer to handle complex address expression for .debug_loc entries. This fixes clang generated blocks' variables' debug info. Radar 9279956. llvm-svn: 130373	2011-04-28 02:22:40 +00:00
Eli Friedman	86181251f3	Fix a silly mistake in r130338. llvm-svn: 130360	2011-04-28 00:42:03 +00:00
Eli Friedman	bcb7cd335d	fast-isel sret. We actually don't need to do anything special on x86. :) rdar://problem/9303592 . llvm-svn: 130348	2011-04-27 23:58:52 +00:00
Eli Friedman	c5406cdb50	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Eli Friedman	00b153c2eb	Fix an edge case involving branches in fast-isel on x86. rdar://problem/9303306 . llvm-svn: 130272	2011-04-27 01:34:27 +00:00
Evan Cheng	dea3347167	Be careful about scheduling nodes above previous calls. It increase usages of more callee-saved registers and introduce copies. Only allows it if scheduling a node above calls would end up lessen register pressure. Call operands also has added ABI restrictions for register allocation, so be extra careful with hoisting them above calls. rdar://9329627 llvm-svn: 130245	2011-04-26 21:31:35 +00:00
Benjamin Kramer	249006aad2	Force a triple on this test to unbreak windows buildbots. llvm-svn: 130226	2011-04-26 18:47:43 +00:00
Dan Gohman	fbb7ade7ae	Fast-isel support for simple inline asms. llvm-svn: 130205	2011-04-26 17:18:34 +00:00
Rafael Espindola	7503d15062	Add test for PR9743. llvm-svn: 130198	2011-04-26 14:17:42 +00:00
Devang Patel	83eac5e134	A dbg.declare may not be in entry block, even if it is referring to an incoming argument. However, It is appropriate to emit DBG_VALUE referring to this incoming argument in entry block in MachineFunction. llvm-svn: 130129	2011-04-25 16:33:52 +00:00
Benjamin Kramer	b2992c34b5	Make tests more useful. lit needs a linter ... llvm-svn: 130126	2011-04-25 10:12:01 +00:00
NAKAMURA Takumi	6efe7518bf	test/CodeGen/X86/shrink-compare.ll: Relax expressions for Win64. llvm-svn: 130039	2011-04-23 00:15:45 +00:00
Chris Lattner	d9c0db9bd7	Recommit the fix for rdar://9289512 with a couple tweaks to fix bugs exposed by the gcc dejagnu testsuite: 1. The load may actually be used by a dead instruction, which would cause an assert. 2. The load may not be used by the current chain of instructions, and we could move it past a side-effecting instruction. Change how we process uses to define the problem away. llvm-svn: 130018	2011-04-22 21:59:37 +00:00
Benjamin Kramer	f6eab5f86e	DAGCombine: fold "(zext x) == C" into "x == (trunc C)" if the trunc is lossless. On x86 this allows to fold a load into the cmp, greatly reducing register pressure. movzbl (%rdi), %eax cmpl $47, %eax -> cmpb $47, (%rdi) This shaves 8k off gcc.o on i386. I'll leave applying the patch in README.txt to Chris :) llvm-svn: 130005	2011-04-22 18:47:44 +00:00
Benjamin Kramer	7feae20986	X86: Try to use a smaller encoding by transforming (X << C1) & C2 into (X & (C2 >> C1)) & C1. (Part of PR5039) This tends to happen a lot with bitfield code generated by clang. A simple example for x86_64 is uint64_t foo(uint64_t x) { return (x&1) << 42; } which used to compile into bloated code: shlq $42, %rdi ## encoding: [0x48,0xc1,0xe7,0x2a] movabsq $4398046511104, %rax ## encoding: [0x48,0xb8,0x00,0x00,0x00,0x00,0x00,0x04,0x00,0x00] andq %rdi, %rax ## encoding: [0x48,0x21,0xf8] ret ## encoding: [0xc3] with this patch we can fold the immediate into the and: andq $1, %rdi ## encoding: [0x48,0x83,0xe7,0x01] movq %rdi, %rax ## encoding: [0x48,0x89,0xf8] shlq $42, %rax ## encoding: [0x48,0xc1,0xe0,0x2a] ret ## encoding: [0xc3] It's possible to save another byte by using 'andl' instead of 'andq' but I currently see no way of doing that without making this code even more complicated. See the TODOs in the code. llvm-svn: 129990	2011-04-22 15:30:40 +00:00
Daniel Dunbar	3a96439b36	Revert r1296656, "Fix rdar://9289512 - not folding load into compare at -O0...", which broke a couple GCC test suite tests at -O0. llvm-svn: 129914	2011-04-21 16:14:46 +00:00
Daniel Dunbar	d11dec1469	llc: Eliminate a use of getDarwinMajorNumber(). - As before, there is a minor semantic change here (evidenced by the test change) for Darwin triples that have no version component. I debated changing the default behavior of isOSVersionLT, but decided it made more sense for triples to be explicit. llvm-svn: 129805	2011-04-19 20:46:13 +00:00
Eli Friedman	01f94bd648	Add support for FastISel'ing varargs calls. llvm-svn: 129765	2011-04-19 17:22:22 +00:00
Chris Lattner	f15db6c86f	Implement support for x86 fastisel of small fixed-sized memcpys, which are generated en-mass for C++ PODs. On my c++ test file, this cuts the fast isel rejects by 10x and shrinks the generated .s file by 5% llvm-svn: 129755	2011-04-19 05:52:03 +00:00
Chris Lattner	7d07af0bf2	Implement support for fast isel of calls of i1 arguments, even though they are illegal, when they are a truncate from something else. This eliminates fully half of all the fastisel rejections on a test c++ file I'm working with, which should make a substantial improvement for -O0 compile of c++ code. This fixed rdar://9297003 - fast isel bails out on all functions taking bools llvm-svn: 129752	2011-04-19 05:09:50 +00:00
Chris Lattner	3c4af7bfee	Handle i1/i8/i16 constant integer arguments to calls by prepromoting them. Before we would bail out on i1 arguments all together, now we just bail on non-constant ones. Also, we used to emit extraneous code. e.g. test12 was: movb $0, %al movzbl %al, %edi callq _test12 and test13 was: movb $0, %al xorl %edi, %edi movb %al, 7(%rsp) callq _test13f Now we get: movl $0, %edi callq _test12 and: movl $0, %edi callq _test13f llvm-svn: 129751	2011-04-19 04:42:38 +00:00
Chris Lattner	87b2a0ab2a	be layout aware, to produce: testb $1, %al je LBB0_2 ## BB#1: ## %if.then movb $0, %al instead of: testb $1, %al jne LBB0_1 jmp LBB0_2 LBB0_1: ## %if.then movb $0, %al how 'bout that. llvm-svn: 129749	2011-04-19 04:26:32 +00:00

1 2 3 4 5 ...

2467 Commits