llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Chris Lattner	51fe8415da	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Owen Anderson	45b160745b	Add some more comments. llvm-svn: 50384	2008-04-29 00:45:15 +00:00
Owen Anderson	e0f9e8446b	Remove debugging code. llvm-svn: 50383	2008-04-29 00:39:24 +00:00
Owen Anderson	4cc52fd657	Add dead loop elimination, which removes dead loops for which we can compute the trip count. llvm-svn: 50382	2008-04-29 00:38:34 +00:00
Dan Gohman	9e4db7f0bd	Fix DSE to not eliminate volatile loads with no uses. llvm-svn: 50370	2008-04-28 19:51:27 +00:00
Dan Gohman	1b7238e6e4	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Chris Lattner	ede7e89144	Fix PR2256, yet another miscompilation in simplifycfg of i multiple return values. Bill, please pull this into Tak. llvm-svn: 50332	2008-04-28 00:19:07 +00:00
Chris Lattner	39a4281deb	Implement a signficant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	b83aaaa855	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	2798e42a9f	When SRoA'ing a global variable, make sure the new globals get the appropriate alignment. This fixes a miscompilation of 252.eon on x86-64 (rdar://5891920). Bill, please pull this into Tak. llvm-svn: 50308	2008-04-26 07:40:11 +00:00
Dale Johannesen	cfba8d51b8	change comments per review llvm-svn: 50300	2008-04-25 21:16:07 +00:00
Dan Gohman	c4b6768db4	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Nate Begeman	928566cbfe	Feedback from chris llvm-svn: 50271	2008-04-25 17:45:52 +00:00
Nick Lewycky	1f831c0f57	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Nate Begeman	26b6f640a1	Teach the PruningFunctionCloner how to look through loads with ConstantExpression GEPs pointing into constant globals. llvm-svn: 50256	2008-04-25 06:37:06 +00:00
Chris Lattner	1a6268f776	Don't infininitely thread branches when a threaded edge goes back to the block, e.g.: Threading edge through bool from 'bb37.us.thread3829' to 'bb37.us' with cost: 1, across block: bb37.us: ; preds = %bb37.us.thread3829, %bb37.us, %bb33 %D1361.1.us = phi i32 [ %tmp36, %bb33 ], [ %D1361.1.us, %bb37.us ], [ 0, %bb37.us.thread3829 ] ; <i32> [#uses=2] %tmp39.us = icmp eq i32 %D1361.1.us, 0 ; <i1> [#uses=1] br i1 %tmp39.us, label %bb37.us, label %bb42.us llvm-svn: 50251	2008-04-25 04:12:29 +00:00
Evan Cheng	c532dca878	Adjust inline cost computation to be less aggressive. llvm-svn: 50222	2008-04-24 18:42:47 +00:00
Chris Lattner	a646a59205	code restructuring, not functionality change. llvm-svn: 50203	2008-04-24 00:21:50 +00:00
Chris Lattner	10ffed0ed0	Don't replace multiple result of calls with undef, sccp tracks getresult values, not call values in this case. llvm-svn: 50202	2008-04-24 00:19:54 +00:00
Chris Lattner	8c6e641cf4	code cleanup, no functionality change. llvm-svn: 50201	2008-04-24 00:16:28 +00:00
Chris Lattner	be35a0c224	Split some code out of the main SimplifyCFG loop into its own function. Fix said code to handle merging return instructions together correctly when handling multiple return values. llvm-svn: 50199	2008-04-24 00:01:19 +00:00
Devang Patel	88251a96f1	Check type instead of no. of operands. llvm-svn: 50179	2008-04-23 20:18:29 +00:00
Dale Johannesen	d70ea13581	Rewrite previous patch to suit Chris's preference. llvm-svn: 50174	2008-04-23 18:34:37 +00:00
Chris Lattner	bc79442313	simplify code for propagation of constant arguments into callees. llvm-svn: 50142	2008-04-23 06:16:27 +00:00
Chris Lattner	408b51d214	Fix a number of bugs in ipconstantprop, simplify the code, fit in 80 cols, fix read after free bug (PR2238). llvm-svn: 50141	2008-04-23 05:59:23 +00:00
Chris Lattner	721ea7ca10	Rewrite multiple return value handling in SCCP. Before, the -sccp pass would turn every getresult instruction into undef. This helps with rdar://5778210 llvm-svn: 50140	2008-04-23 05:38:20 +00:00
Dale Johannesen	3007fc4e1b	Do not change the type of a ByVal argument to a type of a different size. llvm-svn: 50121	2008-04-23 01:03:05 +00:00
Evan Cheng	680839e258	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there are more than one uses of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	e304ae5621	Start doing the significantly useful part of jump threading: handle cases where a comparison has a phi input and that phi is a constant. For example, stuff like: Threading edge through bool from 'bb2149' to 'bb2231' with cost: 1, across block: bb2237: ; preds = %bb2231, %bb2149 %tmp2328.rle = phi i32 [ %tmp2232, %bb2231 ], [ %tmp2232439, %bb2149 ] ; <i32> [#uses=2] %done.0 = phi i32 [ %done.2, %bb2231 ], [ 0, %bb2149 ] ; <i32> [#uses=1] %tmp2239 = icmp eq i32 %done.0, 0 ; <i1> [#uses=1] br i1 %tmp2239, label %bb2231, label %bb2327 or bb38.i298: ; preds = %bb33.i295, %bb1693 %tmp39.i296.rle = phi %struct.ibox* [ null, %bb1693 ], [ %tmp39.i296.rle1109, %bb33.i295 ] ; <%struct.ibox> [#uses=2] %minspan.1.i291.reg2mem.1 = phi i32 [ 32000, %bb1693 ], [ %minspan.0.i288, %bb33.i295 ] ; <i32> [#uses=1] %tmp40.i297 = icmp eq %struct.ibox %tmp39.i296.rle, null ; <i1> [#uses=1] br i1 %tmp40.i297, label %implfeeds.exit311, label %bb43.i301 This triggers thousands of times in spec. llvm-svn: 50110	2008-04-22 21:40:39 +00:00
Chris Lattner	c59cf9c8da	Dig through multiple levels of AND to thread jumps if needed. llvm-svn: 50106	2008-04-22 20:46:09 +00:00
Chris Lattner	dcbc6443ae	Teach jump threading to thread through blocks like: br (and X, phi(Y, Z, false)), label L1, label L2 This triggers once on 252.eon and 6 times on 176.gcc. Blocks in question often look like this: bb262: ; preds = %bb261, %bb248 %iftmp.251.0 = phi i1 [ true, %bb261 ], [ false, %bb248 ] ; <i1> [#uses=4] %tmp270 = icmp eq %struct.rtx_def* %tmp.0.i, null ; <i1> [#uses=1] %bothcond = or i1 %iftmp.251.0, %tmp270 ; <i1> [#uses=1] br i1 %bothcond, label %bb288, label %bb273 In this case, it is clear that it doesn't matter if tmp.0.i is null when coming from bb261. When coming from bb248, it is all that matters. Another random example: check_asm_operands.exit: ; preds = %check_asm_operands.exit.thr_comm, %bb30.i, %bb12.i, %bb6.i413 %tmp.0.i420 = phi i1 [ true, %bb6.i413 ], [ true, %bb12.i ], [ true, %bb30.i ], [ false, %check_asm_operands.exit.thr_comm ; <i1> [#uses=1] call void @llvm.stackrestore( i8* %savedstack ) nounwind %tmp4389 = icmp eq i32 %added_sets_1.0, 0 ; <i1> [#uses=1] %tmp4394 = icmp eq i32 %added_sets_2.0, 0 ; <i1> [#uses=1] %bothcond80 = and i1 %tmp4389, %tmp4394 ; <i1> [#uses=1] %bothcond81 = and i1 %bothcond80, %tmp.0.i420 ; <i1> [#uses=1] br i1 %bothcond81, label %bb4398, label %bb4397 Here is the case from 252.eon: bb290.i.i: ; preds = %bb23.i57.i.i, %bb8.i39.i.i, %bb100.i.i, %bb100.i.i, %bb85.i.i110 %myEOF.1.i.i = phi i1 [ true, %bb100.i.i ], [ true, %bb100.i.i ], [ true, %bb85.i.i110 ], [ true, %bb8.i39.i.i ], [ false, %bb23.i57.i.i ] ; <i1> [#uses=2] %i.4.i.i = phi i32 [ %i.1.i.i, %bb85.i.i110 ], [ %i.0.i.i, %bb100.i.i ], [ %i.0.i.i, %bb100.i.i ], [ %i.3.i.i, %bb8.i39.i.i ], [ %i.3.i.i, %bb23.i57.i.i ] ; <i32> [#uses=3] %tmp292.i.i = load i8* %tmp16.i.i100, align 1 ; <i8> [#uses=1] %tmp293.not.i.i = icmp ne i8 %tmp292.i.i, 0 ; <i1> [#uses=1] %bothcond.i.i = and i1 %tmp293.not.i.i, %myEOF.1.i.i ; <i1> [#uses=1] br i1 %bothcond.i.i, label %bb202.i.i, label %bb301.i.i Factoring out 3 common predecessors. On the path from any blocks other than bb23.i57.i.i, the load and compare are dead. llvm-svn: 50096	2008-04-22 07:05:46 +00:00
Chris Lattner	003d69adef	refactor some code, no functionality change. llvm-svn: 50094	2008-04-22 06:36:15 +00:00
Chris Lattner	8837037473	remove dead code. llvm-svn: 50080	2008-04-22 03:21:48 +00:00
Chris Lattner	14be19cf1e	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Chris Lattner	6e88cf849b	fix grammar-o, thanks to Duncan for noticing. llvm-svn: 50047	2008-04-21 18:25:01 +00:00
Owen Anderson	b171c54227	Remove unneeded #include's. llvm-svn: 50035	2008-04-21 07:47:38 +00:00
Owen Anderson	bc6046416f	Refactor memcpyopt based on Chris' suggestions. Consolidate several functions and simplify code that was fallout from the separation of memcpyopt and gvn. llvm-svn: 50034	2008-04-21 07:45:10 +00:00
Chris Lattner	ca82082080	don't assume that the argument passed to fprintf("%s" is a string. This fixes a crash in opt on 433.milc. llvm-svn: 50023	2008-04-21 03:18:33 +00:00
Chris Lattner	9a21fbcf81	Use the new SplitBlockPredecessors to implement a todo. llvm-svn: 50022	2008-04-21 02:57:57 +00:00
Chris Lattner	57e11a167d	Move SplitBlockPredecessors out of loopsimplify into BasicBlockUtils.h as a global helper function. At the same type, switch it from taking a vector of predecessors to an arbitrary sequential input. This allows us to switch LoopSimplify to use a SmallVector for various temporary vectors that it passed into SplitBlockPredecessors. llvm-svn: 50020	2008-04-21 01:28:02 +00:00
Chris Lattner	ba4567c335	Move domtree/frontier updating earlier, allowing us to use it to update phi nodes, removing a hack. llvm-svn: 50019	2008-04-21 01:05:08 +00:00
Chris Lattner	53fa4554be	Factor dominator tree and frontier updating into SplitBlockPredecessors instead of doing it after every call. llvm-svn: 50018	2008-04-21 00:54:38 +00:00
Chris Lattner	34894abd7f	fit some more code in 80 cols. llvm-svn: 50016	2008-04-21 00:25:49 +00:00
Chris Lattner	7da61d5766	simplify code, fit in 80 cols. llvm-svn: 50015	2008-04-21 00:23:14 +00:00
Chris Lattner	45b3ace5a3	fit in 80 cols llvm-svn: 50014	2008-04-21 00:19:16 +00:00
Chris Lattner	56fbe56c25	finish the first cut of a jump threading pass implementation. llvm-svn: 50006	2008-04-20 22:39:42 +00:00
Chris Lattner	4037996a0c	replace a slow and verbose version of Instruction::isUsedOutsideOfBlock with a call to Instruction::isUsedOutsideOfBlock. llvm-svn: 50005	2008-04-20 22:18:22 +00:00
Chris Lattner	c714a8cee8	we can only thread blocks when there is a pred we can determine the succ of. llvm-svn: 50003	2008-04-20 21:18:09 +00:00
Chris Lattner	4fcfb808fb	improve comments, infrastructure, and add some validity checks for threading. Add a cost function. llvm-svn: 50002	2008-04-20 21:13:06 +00:00
Chris Lattner	c8c74f39db	Add a new Jump Threading pass, which will handle cases such as those in PR2235. Right now the pass is not very effective. :) llvm-svn: 50000	2008-04-20 20:35:01 +00:00

1 2 3 4 5 ...

4028 Commits