llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Chris Lattner	50d16c2939	Fix a crash when threading a block that includes a MRV call result. DemoteRegToStack doesn't work with MRVs yet, because it relies on the ability to load/store things. This fixes PR2285. llvm-svn: 50667	2008-05-05 20:21:22 +00:00
Torok Edwin	a68816f876	processStore may delete the instruction, avoid using dyn_cast<> on already freed memory. llvm-svn: 50618	2008-05-04 08:51:25 +00:00
Devang Patel	b3112b4417	Do not sink getresult. llvm-svn: 50600	2008-05-03 00:36:30 +00:00
Dan Gohman	27156711ef	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Chris Lattner	96467cc665	strength reduce exp2 into ldexp, rdar://5852514 llvm-svn: 50586	2008-05-02 18:43:35 +00:00
Chris Lattner	e45796cb56	add a FIXME so we remember to eventually remove this code. llvm-svn: 50582	2008-05-02 17:18:31 +00:00
Bill Wendling	277f2a205d	Porting r50563 from Tak to mainline. llvm-svn: 50564	2008-05-02 00:43:20 +00:00
Dale Johannesen	d758175889	Don't try to create PHIs of struct types. Fallout from x86-64 calling convention work. llvm-svn: 50545	2008-05-01 22:27:44 +00:00
Dan Gohman	793c9fed45	Fix an overaggressive SimplifyDemandedBits optimization on urem. This fixes the 254.gap regression on x86 and the 403.gcc regression on x86-64. llvm-svn: 50537	2008-05-01 19:13:24 +00:00
Chris Lattner	ab0c73a95c	1) add '-debug' output 2) Return NULL instead of false in several places for tidiness. 3) fix a bug optimizing sprintf(p, "%c", x) llvm-svn: 50521	2008-05-01 06:39:12 +00:00
Chris Lattner	be2bafbe92	Delete the IPO simplify-libcalls and completely reimplement it as a FunctionPass. This makes it simpler, fixes dozens of bugs, adds a couple of minor features, and shrinks is considerably: from 2214 to 1437 lines. llvm-svn: 50520	2008-05-01 06:25:24 +00:00
Owen Anderson	aaa86b3249	This condition got inverted accidentally. llvm-svn: 50473	2008-04-30 07:16:33 +00:00
Chris Lattner	15195e00ee	move lowering of llvm.memset -> store from simplify libcalls to instcombine. llvm-svn: 50472	2008-04-30 06:39:11 +00:00
Owen Anderson	cc69e3444e	Revert r50441. The original code was correct. Add some more comments so that I don't make the same mistake in the future. llvm-svn: 50446	2008-04-29 21:51:00 +00:00
Owen Anderson	2caa79ae70	Fix a bug in memcpyopt where the memcpy-memcpy transform was never being applied because we were checking for it in the wrong order. This caused a miscompilation because the return slot optimization assumes that the call it is dealing with is NOT a memcpy. llvm-svn: 50444	2008-04-29 21:26:06 +00:00
Owen Anderson	8150660ba3	We should be returning true here since we've changed the function. llvm-svn: 50442	2008-04-29 21:02:46 +00:00
Owen Anderson	9db9df7329	A lot of cleanups and documentation improvements, as well as a few corner case fixes. Most of this was suggested by Chris. llvm-svn: 50441	2008-04-29 20:59:33 +00:00
Owen Anderson	5b7928f3d2	Rename DeadLoopElimination to LoopDeletion, part 2. llvm-svn: 50437	2008-04-29 20:06:54 +00:00
Owen Anderson	f9e11b8327	Rename DeadLoopElimination to LoopDeletion, part one. llvm-svn: 50436	2008-04-29 19:58:07 +00:00
Chris Lattner	5bd55b0885	don't eliminate load from volatile value on paths where the load is dead. This fixes the second half of PR2262 llvm-svn: 50430	2008-04-29 17:28:22 +00:00
Chris Lattner	7099f3c400	fix a subtle volatile handling bug. llvm-svn: 50428	2008-04-29 17:13:43 +00:00
Owen Anderson	dc1c838b4d	Clarify what we mean by a dead loop. llvm-svn: 50406	2008-04-29 06:34:55 +00:00
Chris Lattner	51fe8415da	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Owen Anderson	45b160745b	Add some more comments. llvm-svn: 50384	2008-04-29 00:45:15 +00:00
Owen Anderson	e0f9e8446b	Remove debugging code. llvm-svn: 50383	2008-04-29 00:39:24 +00:00
Owen Anderson	4cc52fd657	Add dead loop elimination, which removes dead loops for which we can compute the trip count. llvm-svn: 50382	2008-04-29 00:38:34 +00:00
Dan Gohman	9e4db7f0bd	Fix DSE to not eliminate volatile loads with no uses. llvm-svn: 50370	2008-04-28 19:51:27 +00:00
Dan Gohman	1b7238e6e4	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Chris Lattner	39a4281deb	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	b83aaaa855	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Dale Johannesen	cfba8d51b8	change comments per review llvm-svn: 50300	2008-04-25 21:16:07 +00:00
Dan Gohman	c4b6768db4	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nick Lewycky	1f831c0f57	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Chris Lattner	1a6268f776	Don't infininitely thread branches when a threaded edge goes back to the block, e.g.: Threading edge through bool from 'bb37.us.thread3829' to 'bb37.us' with cost: 1, across block: bb37.us: ; preds = %bb37.us.thread3829, %bb37.us, %bb33 %D1361.1.us = phi i32 [ %tmp36, %bb33 ], [ %D1361.1.us, %bb37.us ], [ 0, %bb37.us.thread3829 ] ; <i32> [#uses=2] %tmp39.us = icmp eq i32 %D1361.1.us, 0 ; <i1> [#uses=1] br i1 %tmp39.us, label %bb37.us, label %bb42.us llvm-svn: 50251	2008-04-25 04:12:29 +00:00
Chris Lattner	a646a59205	code restructuring, not functionality change. llvm-svn: 50203	2008-04-24 00:21:50 +00:00
Chris Lattner	10ffed0ed0	Don't replace multiple result of calls with undef, sccp tracks getresult values, not call values in this case. llvm-svn: 50202	2008-04-24 00:19:54 +00:00
Chris Lattner	8c6e641cf4	code cleanup, no functionality change. llvm-svn: 50201	2008-04-24 00:16:28 +00:00
Dale Johannesen	d70ea13581	Rewrite previous patch to suit Chris's preference. llvm-svn: 50174	2008-04-23 18:34:37 +00:00
Chris Lattner	721ea7ca10	Rewrite multiple return value handling in SCCP. Before, the -sccp pass would turn every getresult instruction into undef. This helps with rdar://5778210 llvm-svn: 50140	2008-04-23 05:38:20 +00:00
Dale Johannesen	3007fc4e1b	Do not change the type of a ByVal argument to a type of a different size. llvm-svn: 50121	2008-04-23 01:03:05 +00:00
Evan Cheng	680839e258	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there are more than one uses of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	e304ae5621	Start doing the significantly useful part of jump threading: handle cases where a comparison has a phi input and that phi is a constant. For example, stuff like: Threading edge through bool from 'bb2149' to 'bb2231' with cost: 1, across block: bb2237: ; preds = %bb2231, %bb2149 %tmp2328.rle = phi i32 [ %tmp2232, %bb2231 ], [ %tmp2232439, %bb2149 ] ; <i32> [#uses=2] %done.0 = phi i32 [ %done.2, %bb2231 ], [ 0, %bb2149 ] ; <i32> [#uses=1] %tmp2239 = icmp eq i32 %done.0, 0 ; <i1> [#uses=1] br i1 %tmp2239, label %bb2231, label %bb2327 or bb38.i298: ; preds = %bb33.i295, %bb1693 %tmp39.i296.rle = phi %struct.ibox* [ null, %bb1693 ], [ %tmp39.i296.rle1109, %bb33.i295 ] ; <%struct.ibox> [#uses=2] %minspan.1.i291.reg2mem.1 = phi i32 [ 32000, %bb1693 ], [ %minspan.0.i288, %bb33.i295 ] ; <i32> [#uses=1] %tmp40.i297 = icmp eq %struct.ibox %tmp39.i296.rle, null ; <i1> [#uses=1] br i1 %tmp40.i297, label %implfeeds.exit311, label %bb43.i301 This triggers thousands of times in spec. llvm-svn: 50110	2008-04-22 21:40:39 +00:00
Chris Lattner	c59cf9c8da	Dig through multiple levels of AND to thread jumps if needed. llvm-svn: 50106	2008-04-22 20:46:09 +00:00
Chris Lattner	dcbc6443ae	Teach jump threading to thread through blocks like: br (and X, phi(Y, Z, false)), label L1, label L2 This triggers once on 252.eon and 6 times on 176.gcc. Blocks in question often look like this: bb262: ; preds = %bb261, %bb248 %iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ] ; <i1> [#uses=4] %tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null ; <i1> [#uses=1] %bothcond = or i1 %iftmp.251.0, %tmp270 ; <i1> [#uses=1] br i1 %bothcond, label %bb288, label %bb273 In this case, it is clear that it doesn't matter if tmp.0.i is null when coming from bb261. When coming from bb248, it is all that matters. Another random example: check_asm_operands.exit: ; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413 %tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ; <i1> [#uses=1] call void @llvm.stackrestore( i8* %savedstack ) nounwind %tmp4389 = icmp eq i32 %added_sets_1.0, 0 ; <i1> [#uses=1] %tmp4394 = icmp eq i32 %added_sets_2.0, 0 ; <i1> [#uses=1] %bothcond80 = and i1 %tmp4389, %tmp4394 ; <i1> [#uses=1] %bothcond81 = and i1 %bothcond80, %tmp.0.i420 ; <i1> [#uses=1] br i1 %bothcond81, label %bb4398, label %bb4397 Here is the case from 252.eon: bb290.i.i: ; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110 %myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ] ; <i1> [#uses=2] %i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ] ; <i32> [#uses=3] %tmp292.i.i = load i8* %tmp16.i.i100, align 1 ; <i8> [#uses=1] %tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0 ; <i1> [#uses=1] %bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i ; <i1> [#uses=1] br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i Factoring out 3 common predecessors. On the path from any blocks other than bb23.i57.i.i, the load and compare are dead. llvm-svn: 50096	2008-04-22 07:05:46 +00:00
Chris Lattner	003d69adef	refactor some code, no functionality change. llvm-svn: 50094	2008-04-22 06:36:15 +00:00
Chris Lattner	8837037473	remove dead code. llvm-svn: 50080	2008-04-22 03:21:48 +00:00
Chris Lattner	14be19cf1e	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Chris Lattner	6e88cf849b	fix grammar-o, thanks to Duncan for noticing. llvm-svn: 50047	2008-04-21 18:25:01 +00:00
Owen Anderson	b171c54227	Remove unneeded #include's. llvm-svn: 50035	2008-04-21 07:47:38 +00:00
Owen Anderson	bc6046416f	Refactor memcpyopt based on Chris' suggestions. Consolidate several functions and simplify code that was fallout from the separation of memcpyopt and gvn. llvm-svn: 50034	2008-04-21 07:45:10 +00:00

1 2 3 4 5 ...

2448 Commits