llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Jim Laskey	d72f4cfe04	Can't move a load node if it's chain is not used. llvm-svn: 30609	2006-09-26 07:37:42 +00:00
Chris Lattner	b826d65e01	Various random and minor code cleanups. llvm-svn: 30608	2006-09-26 03:57:53 +00:00
Chris Lattner	c628ee3402	print the preds of each MBB llvm-svn: 30606	2006-09-26 03:41:59 +00:00
Chris Lattner	09176ab54f	Compile: int x __attribute__((used)); to: .data .comm _x,4 ; 'x' .no_dead_strip _x on both x86 and ppc darwin targets. llvm-svn: 30605	2006-09-26 03:39:53 +00:00
Chris Lattner	167aa73273	Add support for targets that want to do something with the llvm.used list, because they have an aggressive linker that does dead code stripping. llvm-svn: 30604	2006-09-26 03:38:18 +00:00
Jim Laskey	6ae9f53d2c	Accidental enable of bad code llvm-svn: 30601	2006-09-25 21:11:32 +00:00
Jim Laskey	640b7dbed5	Fix chain dropping in load and drop unused stores in ret blocks. llvm-svn: 30600	2006-09-25 19:32:58 +00:00
Chris Lattner	2281c3f6ca	more notes llvm-svn: 30598	2006-09-25 17:12:14 +00:00
Jim Laskey	ba2f6127b2	Core antialiasing for load and store. llvm-svn: 30597	2006-09-25 16:29:54 +00:00
Andrew Lenharth	55851a4bfd	Fix jump tables to match gcc (and the ABI and whatnot) llvm-svn: 30594	2006-09-24 19:46:56 +00:00
Andrew Lenharth	58f5a24f0c	Add support for other relocation bases to jump tables, as well as custom asm directives llvm-svn: 30593	2006-09-24 19:45:58 +00:00
Andrew Lenharth	f6b4462853	jump table note llvm-svn: 30591	2006-09-24 13:13:10 +00:00
Evan Cheng	2086ffb27b	PIC jump table entries are always 32-bit. This fixes PIC jump table support on X86-64. llvm-svn: 30590	2006-09-24 05:22:38 +00:00
Nick Lewycky	800fff3067	Style changes only. Remove dead code, fix a comment. llvm-svn: 30588	2006-09-23 15:13:08 +00:00
Chris Lattner	c0f674b9fd	Be far more careful when splitting a loop header, either to form a preheader or when splitting loops with a common header into multiple loops. In particular the old code would always insert the preheader before the old loop header. This is disasterous in cases where the loop hasn't been rotated. For example, it can produce code like: .. outside the loop... jmp LBB1_2 #bb13.outer LBB1_1: #bb1 movsd 8(%esp,%esi,8), %xmm1 mulsd (%edi), %xmm1 addsd %xmm0, %xmm1 addl $24, %edi incl %esi jmp LBB1_3 #bb13 LBB1_2: #bb13.outer leal (%edx,%eax,8), %edi pxor %xmm1, %xmm1 xorl %esi, %esi LBB1_3: #bb13 movapd %xmm1, %xmm0 cmpl $4, %esi jl LBB1_1 #bb1 Note that the loop body is actually LBB1_1 + LBB1_3, which means that the loop now contains an uncond branch WITHIN it to jump around the inserted loop header (LBB1_2). Doh. This patch changes the preheader insertion code to insert it in the right spot, producing this code: ... outside the loop, fall into the header ... LBB1_1: #bb13.outer leal (%edx,%eax,8), %esi pxor %xmm0, %xmm0 xorl %edi, %edi jmp LBB1_3 #bb13 LBB1_2: #bb1 movsd 8(%esp,%edi,8), %xmm0 mulsd (%esi), %xmm0 addsd %xmm1, %xmm0 addl $24, %esi incl %edi LBB1_3: #bb13 movapd %xmm0, %xmm1 cmpl $4, %edi jl LBB1_2 #bb1 Totally crazy, no branch in the loop! :) llvm-svn: 30587	2006-09-23 08:19:21 +00:00
Chris Lattner	56c1c10ca1	Teach UpdateDomInfoForRevectoredPreds to handle revectored preds that are not reachable, making it general purpose enough for use by InsertPreheaderForLoop. Eliminate custom dominfo updating code in InsertPreheaderForLoop, using UpdateDomInfoForRevectoredPreds instead. llvm-svn: 30586	2006-09-23 07:40:52 +00:00
Chris Lattner	bf0b610dfa	add method, correct comment llvm-svn: 30584	2006-09-23 04:03:45 +00:00
Evan Cheng	02e193e2ff	Delete dead code; fix 80 col violations. llvm-svn: 30583	2006-09-22 21:43:59 +00:00
Rafael Espindola	9cfd72a3d1	add a note llvm-svn: 30581	2006-09-22 11:36:17 +00:00
Nate Begeman	7bcce1a7f6	Fold AND and ROTL more often llvm-svn: 30577	2006-09-22 05:01:56 +00:00
Devang Patel	b34118f8bd	remove extra white spaces. llvm-svn: 30576	2006-09-22 01:07:57 +00:00
Devang Patel	8248ba3afc	Use iterative algorith to assign DFS number. This reduces call stack depth. llvm-svn: 30575	2006-09-22 01:05:33 +00:00
Evan Cheng	ce6a660148	Make it work for DAG combine of multi-value nodes. llvm-svn: 30573	2006-09-21 19:04:05 +00:00
Jim Laskey	231343018b	core corrections llvm-svn: 30570	2006-09-21 17:35:47 +00:00
Jim Laskey	50750cf500	Basic "in frame" alias analysis. llvm-svn: 30568	2006-09-21 16:28:59 +00:00
Rafael Espindola	a51ec7153c	more condition codes llvm-svn: 30567	2006-09-21 13:06:26 +00:00
Rafael Espindola	4de4f87be5	if a constant can't be an immediate, add it to the constant pool llvm-svn: 30566	2006-09-21 11:29:52 +00:00
Chris Lattner	c17b86ef22	fold (aext (and (trunc x), cst)) -> (and x, cst). llvm-svn: 30561	2006-09-21 06:40:43 +00:00
Chris Lattner	d9fca453f1	Check the right value type. This fixes 186.crafty on x86 llvm-svn: 30560	2006-09-21 06:17:39 +00:00
Chris Lattner	ba7013ca78	implemented llvm-svn: 30559	2006-09-21 06:14:54 +00:00
Chris Lattner	34768d5361	Compile: int %test(ulong %tmp) { %tmp = load ulong %tmp ; <ulong> [#uses=1] %tmp.mask = shr ulong %tmp, ubyte 50 ; <ulong> [#uses=1] %tmp.mask = cast ulong %tmp.mask to ubyte %tmp2 = and ubyte %tmp.mask, 3 ; <ubyte> [#uses=1] %tmp2 = cast ubyte %tmp2 to int ; <int> [#uses=1] ret int %tmp2 } to: _test: movl 4(%esp), %eax movl 4(%eax), %eax shrl $18, %eax andl $3, %eax ret instead of: _test: movl 4(%esp), %eax movl 4(%eax), %eax shrl $18, %eax # TRUNCATE movb %al, %al andb $3, %al movzbl %al, %eax ret llvm-svn: 30558	2006-09-21 06:14:31 +00:00
Chris Lattner	eb12877970	Generalize (zext (truncate x)) and (sext (truncate x)) folding to work when the src/dst are not the same size. This catches things like "truncate 32-bit X to 8 bits, then zext to 16", which happens a bit on X86. llvm-svn: 30557	2006-09-21 06:00:20 +00:00
Chris Lattner	437703d4c9	Fit in 80-cols llvm-svn: 30556	2006-09-21 05:46:00 +00:00
Chris Lattner	e87cf1c708	Fix Transforms/IndVarsSimplify/2006-09-20-LFTR-Crash.ll llvm-svn: 30555	2006-09-21 05:12:20 +00:00
Nick Lewycky	401794f2a7	Fix compile error. llvm-svn: 30553	2006-09-21 02:08:31 +00:00
Nick Lewycky	2aff202559	Don't rewrite ConstantExpr::get. llvm-svn: 30552	2006-09-21 01:05:35 +00:00
Nick Lewycky	eb301d20a6	Once we're down to "setcc type constant1, constant2", at least come up with the right answer. llvm-svn: 30550	2006-09-20 23:02:24 +00:00
Anton Korobeynikov	59ef7e94eb	Adding codegeneration for StdCall & FastCall calling conventions llvm-svn: 30549	2006-09-20 22:03:51 +00:00
Andrew Lenharth	ce3954cac0	Account for pseudo-ops correctly llvm-svn: 30548	2006-09-20 20:08:52 +00:00
Chris Lattner	663748827c	The DarwinAsmPrinter need not check for isDarwin. createPPCAsmPrinterPass should create the right asmprinter subclass. llvm-svn: 30542	2006-09-20 17:12:19 +00:00
Chris Lattner	6d66264a5f	Wrap some darwin'isms with isDarwin checks. llvm-svn: 30541	2006-09-20 17:07:15 +00:00
Nick Lewycky	99b3c50130	Use a total ordering to compare instructions. Fixes infinite loop in resolve(). llvm-svn: 30540	2006-09-20 17:04:01 +00:00
Andrew Lenharth	cf0746ba2a	simplify llvm-svn: 30535	2006-09-20 15:37:57 +00:00
Andrew Lenharth	d12f2d614a	catch constants more often llvm-svn: 30534	2006-09-20 15:05:49 +00:00
Andrew Lenharth	3be0c58274	clarify with test case llvm-svn: 30531	2006-09-20 14:48:00 +00:00
Andrew Lenharth	2ccefe5b91	Add Note llvm-svn: 30530	2006-09-20 14:40:01 +00:00
Chris Lattner	6b434ee662	item done llvm-svn: 30518	2006-09-20 06:41:56 +00:00
Chris Lattner	a0243b3ad3	Compile: int test3(int a, int b) { return (a < 0) ? a : 0; } to: _test3: srawi r2, r3, 31 and r3, r2, r3 blr instead of: _test3: cmpwi cr0, r3, 1 li r2, 0 blt cr0, LBB2_2 ;entry LBB2_1: ;entry mr r3, r2 LBB2_2: ;entry blr This implements: PowerPC/select_lt0.ll:seli32_a_a llvm-svn: 30517	2006-09-20 06:41:35 +00:00
Chris Lattner	f9c4e07bf7	add a note llvm-svn: 30515	2006-09-20 06:32:10 +00:00
Chris Lattner	e78d019082	Fold the full generality of (any_extend (truncate x)) llvm-svn: 30514	2006-09-20 06:29:17 +00:00
Chris Lattner	6440707b6f	Two things: 1. teach SimplifySetCC that '(srl (ctlz x), 5) == 0' is really x != 0. 2. Teach visitSELECT_CC to use SimplifySetCC instead of calling it and ignoring the result. This allows us to compile: bool %test(ulong %x) { %tmp = setlt ulong %x, 4294967296 ret bool %tmp } to: _test: cntlzw r2, r3 cmplwi cr0, r3, 1 srwi r2, r2, 5 li r3, 0 beq cr0, LBB1_2 ; LBB1_1: ; mr r3, r2 LBB1_2: ; blr instead of: _test: addi r2, r3, -1 cntlzw r2, r2 cntlzw r3, r3 srwi r2, r2, 5 cmplwi cr0, r2, 0 srwi r2, r3, 5 li r3, 0 bne cr0, LBB1_2 ; LBB1_1: ; mr r3, r2 LBB1_2: ; blr This isn't wonderful, but it's an improvement. llvm-svn: 30513	2006-09-20 06:19:26 +00:00
Chris Lattner	102718b1b2	This is already done llvm-svn: 30512	2006-09-20 04:59:33 +00:00
Chris Lattner	6ddcf6bba8	We went through all that trouble to compute whether it was safe to transform this comparison, but never checked it. Whoops, no wonder we miscompiled 177.mesa! llvm-svn: 30511	2006-09-20 04:44:59 +00:00
Chris Lattner	4d97247875	Improve PPC64 equality comparisons like PPC32 comparisons. llvm-svn: 30510	2006-09-20 04:33:27 +00:00
Chris Lattner	69390a3f80	Two improvements: 1. Codegen this comparison: if (X == 0x8000) as: cmplwi cr0, r3, 32768 bne cr0, LBB1_2 ;cond_next instead of: lis r2, 0 ori r2, r2, 32768 cmpw cr0, r3, r2 bne cr0, LBB1_2 ;cond_next 2. Codegen this comparison: if (X == 0x12345678) as: xoris r2, r3, 4660 cmplwi cr0, r2, 22136 bne cr0, LBB1_2 ;cond_next instead of: lis r2, 4660 ori r2, r2, 22136 cmpw cr0, r3, r2 bne cr0, LBB1_2 ;cond_next llvm-svn: 30509	2006-09-20 04:25:47 +00:00
Chris Lattner	ee42b9ae24	Add a note that we should match rlwnm better llvm-svn: 30508	2006-09-20 03:59:25 +00:00
Chris Lattner	3057944738	Legalize is no longer limited to cleverness with just constant shift amounts. Allow it to be clever when possible and fall back to the gross code when needed. This allows us to compile: long long foo1(long long X, int C) { return X << (C\|32); } long long foo2(long long X, int C) { return X << (C&~32); } to: _foo1: rlwinm r2, r5, 0, 27, 31 slw r3, r4, r2 li r4, 0 blr .globl _foo2 .align 4 _foo2: rlwinm r2, r5, 0, 27, 25 subfic r5, r2, 32 slw r3, r3, r2 srw r5, r4, r5 or r3, r3, r5 slw r4, r4, r2 blr instead of: _foo1: ori r2, r5, 32 subfic r5, r2, 32 addi r6, r2, -32 srw r5, r4, r5 slw r3, r3, r2 slw r6, r4, r6 or r3, r3, r5 slw r4, r4, r2 or r3, r3, r6 blr .globl _foo2 .align 4 _foo2: rlwinm r2, r5, 0, 27, 25 subfic r5, r2, 32 addi r6, r2, -32 srw r5, r4, r5 slw r3, r3, r2 slw r6, r4, r6 or r3, r3, r5 slw r4, r4, r2 or r3, r3, r6 blr llvm-svn: 30507	2006-09-20 03:47:40 +00:00
Chris Lattner	644c6814ae	Expand 64-bit shifts more optimally if we know that the high bit of the shift amount is one or zero. For example, for: long long foo1(long long X, int C) { return X << (C\|32); } long long foo2(long long X, int C) { return X << (C&~32); } we get: _foo1: movb $31, %cl movl 4(%esp), %edx andb 12(%esp), %cl shll %cl, %edx xorl %eax, %eax ret _foo2: movb $223, %cl movl 4(%esp), %eax movl 8(%esp), %edx andb 12(%esp), %cl shldl %cl, %eax, %edx shll %cl, %eax ret instead of: _foo1: subl $4, %esp movl %ebx, (%esp) movb $32, %bl movl 8(%esp), %eax movl 12(%esp), %edx movb %bl, %cl orb 16(%esp), %cl shldl %cl, %eax, %edx shll %cl, %eax xorl %ecx, %ecx testb %bl, %bl cmovne %eax, %edx cmovne %ecx, %eax movl (%esp), %ebx addl $4, %esp ret _foo2: subl $4, %esp movl %ebx, (%esp) movb $223, %cl movl 8(%esp), %eax movl 12(%esp), %edx andb 16(%esp), %cl shldl %cl, %eax, %edx shll %cl, %eax xorl %ecx, %ecx xorb %bl, %bl testb %bl, %bl cmovne %eax, %edx cmovne %ecx, %eax movl (%esp), %ebx addl $4, %esp ret llvm-svn: 30506	2006-09-20 03:38:48 +00:00
Evan Cheng	a7347758f5	Back out Chris' last set of changes. This breaks 177.mesa and povray somehow. llvm-svn: 30505	2006-09-20 01:39:40 +00:00
Evan Cheng	8652c13f13	80 col. llvm-svn: 30504	2006-09-20 01:10:02 +00:00
Andrew Lenharth	0240d56eb6	If we have an add, do it in the pointer realm, not the int realm. This is critical in the linux kernel for pointer analysis correctness llvm-svn: 30496	2006-09-19 18:24:51 +00:00
Chris Lattner	66029d909b	Fix UnitTests/2005-05-12-Int64ToFP.c with llc-beta. In particular, do not allow it to go into an infinite loop, filling up the disk! llvm-svn: 30494	2006-09-19 18:02:01 +00:00
Rafael Espindola	cd52f85028	fix header add comments untabify llvm-svn: 30486	2006-09-19 16:41:40 +00:00
Rafael Espindola	6c7627e002	Implement a MachineFunctionPass to fix the mul instruction llvm-svn: 30485	2006-09-19 15:49:25 +00:00
Chris Lattner	2b076f26b7	item done llvm-svn: 30483	2006-09-19 06:19:03 +00:00
Chris Lattner	2d2d80a4c2	implement select.ll:test19-22 llvm-svn: 30482	2006-09-19 06:18:21 +00:00
Chris Lattner	92c8924309	Fold the PPCISD shifts when presented with 0 inputs. This occurs for code like: long long test(long long X, int Y) { return 1ULL << Y; } long long test2(long long X, int Y) { return -1LL << Y; } which we used to compile to: _test: li r2, 1 subfic r3, r5, 32 li r4, 0 addi r6, r5, -32 srw r3, r2, r3 slw r4, r4, r5 slw r6, r2, r6 or r3, r4, r3 slw r4, r2, r5 or r3, r3, r6 blr _test2: li r2, -1 subfic r3, r5, 32 addi r6, r5, -32 srw r3, r2, r3 slw r4, r2, r5 slw r2, r2, r6 or r3, r4, r3 or r3, r3, r2 blr Now we produce: _test: li r2, 1 addi r3, r5, -32 subfic r4, r5, 32 slw r3, r2, r3 srw r4, r2, r4 or r3, r4, r3 slw r4, r2, r5 blr _test2: li r2, -1 subfic r3, r5, 32 addi r6, r5, -32 srw r3, r2, r3 slw r4, r2, r5 slw r2, r2, r6 or r3, r4, r3 or r3, r3, r2 blr llvm-svn: 30479	2006-09-19 05:22:59 +00:00
Chris Lattner	61d08597df	Fold extract_element(cst) to cst llvm-svn: 30478	2006-09-19 05:02:39 +00:00
Chris Lattner	556f869e88	Minor speedup for legalize by avoiding some malloc traffic llvm-svn: 30477	2006-09-19 04:51:23 +00:00
Evan Cheng	65afc6af9f	Fix a typo. llvm-svn: 30474	2006-09-18 23:28:33 +00:00
Evan Cheng	67b248dbc6	Allow i32 UDIV, SDIV, UREM, SREM to be expanded into libcalls. llvm-svn: 30470	2006-09-18 21:49:04 +00:00
Nick Lewycky	96939f2d94	Walk down the dominator tree instead of the control flow graph. That means that we can't modify the CFG any more, at least not until it's possible to update the dominator tree (PR217). llvm-svn: 30469	2006-09-18 21:09:35 +00:00
Andrew Lenharth	6d43749a47	A pass to remove the worst of the replay trap offenders, and as a bonus, align basic blocks when it is free to do so llvm-svn: 30467	2006-09-18 19:44:29 +00:00
Chris Lattner	1efde528d6	Fix an infinite loop building the CFE llvm-svn: 30465	2006-09-18 18:27:05 +00:00
Andrew Lenharth	5d958d3405	Jump tables on Alpha llvm-svn: 30463	2006-09-18 18:01:03 +00:00
Andrew Lenharth	9c54a925e8	oops llvm-svn: 30462	2006-09-18 18:00:18 +00:00
Andrew Lenharth	00bbd5641b	absolute addresses must match pointer size llvm-svn: 30461	2006-09-18 17:59:35 +00:00
Jim Laskey	07ac577a34	Sort out mangled names for globals llvm-svn: 30460	2006-09-18 14:47:26 +00:00
Chris Lattner	39218c2b0c	Implement a trivial optzn: of vastart is never called in a function that takes ... args, remove the '...'. This is Transforms/DeadArgElim/dead_vaargs.ll llvm-svn: 30459	2006-09-18 07:02:31 +00:00
Chris Lattner	68009a61c5	add a note. Our 64-bit shifts are ~30% slower than gcc's llvm-svn: 30457	2006-09-18 05:36:54 +00:00
Chris Lattner	9c8bffb5e8	Implement InstCombine/cast.ll:test31. This speeds up 462.libquantum by 26%. llvm-svn: 30456	2006-09-18 05:27:43 +00:00
Chris Lattner	be102d68c2	add a helper method llvm-svn: 30452	2006-09-18 04:54:57 +00:00
Chris Lattner	a1349de598	This is closer to what we really want. llvm-svn: 30451	2006-09-18 04:54:35 +00:00
Chris Lattner	f7e8879212	Implement Transforms/InstCombine/shift-sra.ll:test0 llvm-svn: 30450	2006-09-18 04:31:40 +00:00
Chris Lattner	6ee34e89bc	Rewrite shift/and/compare sequences to promote better licm of the RHS. Use isLogicalShift/isArithmeticShift to simplify code. llvm-svn: 30448	2006-09-18 04:22:48 +00:00
Anton Korobeynikov	7c2118575c	Added some eye-candy for Subtarget type checking Added X86 StdCall & FastCall calling conventions. Codegen will follow. llvm-svn: 30446	2006-09-17 20:25:45 +00:00
Chris Lattner	547b62a967	Add ShiftInst::isLogical/ArithmeticShift methods. llvm-svn: 30445	2006-09-17 19:29:56 +00:00
Chris Lattner	8aa718b0ed	Add new SetCondInst::isRelational/isEquality methods. Rename Instruction::isRelational to Instruction::isComparison. llvm-svn: 30444	2006-09-17 19:14:47 +00:00
Nick Lewycky	8fbfe60cee	Explain change with a comment. llvm-svn: 30443	2006-09-17 17:51:00 +00:00
Nick Lewycky	22b1a725ae	Fix PR912. The input to erase() must not be a reference to the data being erased. llvm-svn: 30442	2006-09-17 16:23:36 +00:00
Anton Korobeynikov	b2b7c2f8b9	Small fixes for supporting dll* linkage types llvm-svn: 30441	2006-09-17 13:06:18 +00:00
Chris Lattner	563785bc55	add a note noticed through source inspection llvm-svn: 30418	2006-09-16 23:57:51 +00:00
Chris Lattner	73f5ad9f38	Oh yeah, this is needed too llvm-svn: 30407	2006-09-16 05:08:34 +00:00
Chris Lattner	22b3d6fba9	add a note llvm-svn: 30406	2006-09-16 03:30:19 +00:00
Chris Lattner	a4689e489e	Fix Transforms/InstCombine/2006-09-15-CastToBool.ll and PR913 llvm-svn: 30405	2006-09-16 03:14:10 +00:00
Chris Lattner	594d4d9483	simplify control flow, no functionality change llvm-svn: 30403	2006-09-16 00:21:44 +00:00
Chris Lattner	4318df13d7	Allow custom expand of mul llvm-svn: 30402	2006-09-16 00:09:24 +00:00
Chris Lattner	bf14089e11	add a nate note llvm-svn: 30399	2006-09-15 20:31:36 +00:00
Chris Lattner	ce8928eed5	revert previous two patches. They cause miscompilation of MultiSource/Applications/Burg llvm-svn: 30397	2006-09-15 17:24:45 +00:00
Owen Anderson	d55cc3f6d8	Revert my previous work on ArgumentPromotion. Further investigation has revealed these changes to be incorrect. They just weren't showing up in any of our current testcases. llvm-svn: 30385	2006-09-15 05:22:51 +00:00

1 2 3 4 5 ...

15210 Commits