llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Chris Lattner	ac2b465cb4	Fix a bug in the previous checkin: if the exit block is not the same as the back-edge block, we must check the preincremented value. llvm-svn: 12968	2004-04-15 20:26:22 +00:00
Chris Lattner	dcf2ca93e6	Change the canonical induction variable that we insert. Instead of producing code like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X != N-1) goto Loop We now generate code that looks like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X2 != N) goto Loop This has two big advantages: 1. The trip count of the loop is now explicit in the code, allowing the direct implementation of Loop::getTripCount() 2. This reduces register pressure in the loop, and allows X and X2 to be put into the same register. As a consequence of the second point, the code we generate for loops went from: .LBB2: # no_exit.1 ... mov %EDI, %ESI inc %EDI cmp %ESI, 2 mov %ESI, %EDI jne .LBB2 # PC rel: no_exit.1 To: .LBB2: # no_exit.1 ... inc %ESI cmp %ESI, 3 jne .LBB2 # PC rel: no_exit.1 ... which has two fewer moves, and uses one less register. llvm-svn: 12961	2004-04-15 15:21:43 +00:00
Chris Lattner	6fcf8c7402	ADd a trivial instcombine: load null -> null llvm-svn: 12940	2004-04-14 03:28:36 +00:00
Chris Lattner	545e77c9d5	Add SCCP support for constant folding calls, implementing: test/Regression/Transforms/SCCP/calltest.ll llvm-svn: 12921	2004-04-13 19:43:54 +00:00
Chris Lattner	8c0e9c95e9	Constant propagation should remove the dead instructions llvm-svn: 12917	2004-04-13 19:28:20 +00:00
Chris Lattner	70f6a0ddcf	Fix LoopSimplify/2004-04-13-LoopSimplifyUpdateDomFrontier.ll LoopSimplify was not updating dominator frontiers correctly in some cases. llvm-svn: 12890	2004-04-13 16:23:25 +00:00
Chris Lattner	ae63c235f9	Refactor code a bit to make it simpler and eliminate the goto llvm-svn: 12888	2004-04-13 15:21:18 +00:00
Chris Lattner	faf377df58	This patch addresses PR35: Loop simplify should reconstruct nested loops. This is fairly straight-forward, but was a real nightmare to get just perfect. aarg. :) llvm-svn: 12884	2004-04-13 05:05:33 +00:00
Chris Lattner	603d821596	Add support for removing invoke instructions llvm-svn: 12858	2004-04-12 05:15:13 +00:00
Chris Lattner	ca1428e01c	Fix a bug in my select transformation llvm-svn: 12826	2004-04-11 01:39:19 +00:00
Chris Lattner	777977bf2e	Update the value numbering interface. llvm-svn: 12824	2004-04-10 22:33:34 +00:00
Chris Lattner	96fca8de3d	Implement InstCombine/select.ll:test13* llvm-svn: 12821	2004-04-10 22:21:27 +00:00
Chris Lattner	618c89d5eb	Implement InstCombine/add.ll:test20 Canonicalize add of sign bit constant into a xor llvm-svn: 12819	2004-04-10 22:01:55 +00:00
Chris Lattner	6d569b52ed	Rewrite the GCSE pass to be substantially simpler, a bit more efficient, and a bit more powerful llvm-svn: 12817	2004-04-10 21:11:11 +00:00
Chris Lattner	22c22de2f0	Fix spurious warning in release mode llvm-svn: 12816	2004-04-10 19:15:56 +00:00
Chris Lattner	924b6c173c	Simplify code a bit, and fix a bug that was breaking perlbmk llvm-svn: 12814	2004-04-10 18:06:21 +00:00
Chris Lattner	f126f03878	Fix a bug in my checkin last night that was breaking programs using invoke. llvm-svn: 12813	2004-04-10 16:53:29 +00:00
Chris Lattner	d4979e2904	Fix previous patch llvm-svn: 12811	2004-04-10 07:27:48 +00:00
Chris Lattner	3b211f0432	Correctly update counters llvm-svn: 12810	2004-04-10 07:02:02 +00:00
Chris Lattner	1676188024	Simplify code a bit, and use alias analysis to allow us to delete unused call and invoke instructions that are known to not write to memory. llvm-svn: 12807	2004-04-10 06:53:09 +00:00
Chris Lattner	306540a2f4	Implement select.ll:test12* This transforms code like this: %C = or %A, %B %D = select %cond, %C, %A into: %C = select %cond, %B, 0 %D = or %A, %C Since B is often a constant, the select can often be eliminated. In any case, this reduces the usage count of A, allowing subsequent optimizations to happen. This xform applies when the operator is any of: add, sub, mul, or, xor, and, shl, shr llvm-svn: 12800	2004-04-09 23:46:01 +00:00
Chris Lattner	3a6e4b9a35	Fold binary operators with a constant operand into select instructions that have a constant operand. This implements add.ll:test19, shift.ll:test15*, and others that are not tested llvm-svn: 12794	2004-04-09 19:05:30 +00:00
Chris Lattner	0e1f5553df	Implement select.ll:test11 llvm-svn: 12793	2004-04-09 18:19:44 +00:00
Chris Lattner	0ca3cbfa5e	Implement InstCombine/cast-propagate.ll llvm-svn: 12784	2004-04-08 20:39:49 +00:00
Chris Lattner	cf8117ccbd	Implement InstCombine/select.ll:test[7-10] llvm-svn: 12769	2004-04-08 04:43:23 +00:00
Chris Lattner	2e89e48999	Implement test/Regression/Transforms/InstCombine/getelementptr_index.ll llvm-svn: 12762	2004-04-07 18:38:20 +00:00
Chris Lattner	c92af54ed5	Fix a bug in yesterdays checkins which broke siod. siod is a great testcase! :) llvm-svn: 12659	2004-04-05 16:02:41 +00:00
Chris Lattner	6c961339a3	Fix InstCombine/2004-04-04-InstCombineReplaceAllUsesWith.ll llvm-svn: 12658	2004-04-05 02:10:19 +00:00
Chris Lattner	9236135e8f	Support getelementptr instructions which use uint's to index into structure types and can have arbitrary 32- and 64-bit integer types indexing into sequential types. llvm-svn: 12653	2004-04-05 01:30:19 +00:00
Chris Lattner	cf5bd8a9ab	Rewrite the indvars pass to use the ScalarEvolution analysis. This also implements some new features for the indvars pass, including linear function test replacement, exit value substitution, and it works with a much more general class of induction variables and loops. llvm-svn: 12620	2004-04-02 20:24:31 +00:00
Chris Lattner	441ab4b903	Remove some assertions that are now bogus with the last patch I put in llvm-svn: 12595	2004-04-01 19:21:46 +00:00
Chris Lattner	eda638b0be	Fix PR306: Loop simplify incorrectly updates dominator information Testcase: LoopSimplify/2004-04-01-IncorrectDomUpdate.ll llvm-svn: 12592	2004-04-01 19:06:07 +00:00
Chris Lattner	6aaea5f86b	Add warning llvm-svn: 12573	2004-03-31 22:00:30 +00:00
Chris Lattner	b6612acb18	Implement select.ll:test[3-6] llvm-svn: 12544	2004-03-30 19:37:13 +00:00
Chris Lattner	58a6a4d57a	Add a simple select instruction lowering pass llvm-svn: 12540	2004-03-30 18:41:10 +00:00
Chris Lattner	d191e5625c	X % -1 == X % 1 == 0 llvm-svn: 12520	2004-03-26 16:11:24 +00:00
Chris Lattner	e15fb6ac61	Two changes: #1 is to unconditionally strip constantpointerrefs out of instruction operands where they are absolutely pointless and inhibit optimization. GRRR! #2 is to implement InstCombine/getelementptr_const.ll llvm-svn: 12519	2004-03-25 22:59:29 +00:00
Chris Lattner	078f97b50d	Teach the optimizer to delete zero sized alloca's (but not mallocs!) llvm-svn: 12507	2004-03-19 06:08:10 +00:00
Chris Lattner	eccc0e01b2	Be more accurate llvm-svn: 12464	2004-03-17 01:59:27 +00:00
Chris Lattner	6fdcd7174b	Fix bug in previous checkin llvm-svn: 12458	2004-03-16 23:36:49 +00:00
Chris Lattner	59342a757e	Okay, so there is no reasonable way for tail duplication to update SSA form, as it is making effectively arbitrary modifications to the CFG and we don't have a domset/domfrontier implementations that can handle the dynamic updates. Instead of having a bunch of code that doesn't actually work in practice, just demote any potentially tricky values to the stack (causing the problem to go away entirely). Later invocations of mem2reg will rebuild SSA for us. This fixes all of the major performance regressions with tail duplication from LLVM 1.1. For example, this loop: --- int popcount(int x) { int result = 0; while (x != 0) { result = result + (x & 0x1); x = x >> 1; } return result; } --- Used to be compiled into: int %popcount(int %X) { entry: br label %loopentry loopentry: ; preds = %entry, %no_exit %x.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ] ; <int> [#uses=3] %result.1.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ] ; <int> [#uses=2] %tmp.1 = seteq int %x.0, 0 ; <bool> [#uses=1] br bool %tmp.1, label %loopexit, label %no_exit no_exit: ; preds = %loopentry %tmp.4 = and int %x.0, 1 ; <int> [#uses=1] %tmp.6 = add int %tmp.4, %result.1.0 ; <int> [#uses=1] %tmp.9 = shr int %x.0, ubyte 1 ; <int> [#uses=1] br label %loopentry loopexit: ; preds = %loopentry ret int %result.1.0 } And is now compiled into: int %popcount(int %X) { entry: br label %no_exit no_exit: ; preds = %entry, %no_exit %x.0.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ] ; <int> [#uses=2] %result.1.0.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ] ; <int> [#uses=1] %tmp.4 = and int %x.0.0, 1 ; <int> [#uses=1] %tmp.6 = add int %tmp.4, %result.1.0.0 ; <int> [#uses=2] %tmp.9 = shr int %x.0.0, ubyte 1 ; <int> [#uses=2] %tmp.1 = seteq int %tmp.9, 0 ; <bool> [#uses=1] br bool %tmp.1, label %loopexit, label %no_exit loopexit: ; preds = %no_exit ret int %tmp.6 } llvm-svn: 12457	2004-03-16 23:29:09 +00:00
Chris Lattner	dc22f37eb5	Do not try to optimize PHI nodes with incredibly high degree. This reduces SCCP time from 615s to 1.49s on a large testcase that has a gigantic switch statement that all of the blocks in the function go to (an intepreter). llvm-svn: 12442	2004-03-16 19:49:59 +00:00
Chris Lattner	2175b35b46	Do not copy gigantic switch instructions llvm-svn: 12441	2004-03-16 19:45:22 +00:00
Chris Lattner	9ae2b6acd7	Fix a regression from this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20040308/013095.html Basically, this patch only updated the immediate dominatees of the header node to tell them that the preheader also dominated them. In practice, ALL dominatees of the header node are also dominated by the preheader. This fixes: LoopSimplify/2004-03-15-IncorrectDomUpdate. and PR293 llvm-svn: 12434	2004-03-16 06:00:15 +00:00
Chris Lattner	d8905ad834	Add counters for the number of calls elimianted llvm-svn: 12420	2004-03-15 05:46:59 +00:00
Chris Lattner	8a82176edc	Implement LICM of calls in simple cases. This is sufficient to move around sin/cos/strlen calls and stuff. This implements: LICM/call_sink_pure_function.ll LICM/call_sink_const_function.ll llvm-svn: 12415	2004-03-15 04:11:30 +00:00
Chris Lattner	7e7c3332b8	Do not create empty basic blocks when the lowerswitch pass expects blocks to be non-empty! This fixes LowerSwitch/2004-03-13-SwitchIsDefaultCrash.ll llvm-svn: 12384	2004-03-14 04:14:31 +00:00
Chris Lattner	39b3ae34bd	If a block is dead, dominators will not be calculated for it. Because of this loop information won't see it, and we could have unreachable blocks pointing to the non-header node of blocks in a natural loop. This isn't tidy, so have the loopsimplify pass clean it up. llvm-svn: 12380	2004-03-14 03:59:22 +00:00
Chris Lattner	52ac108b28	Add some debugging output Fix InstCombine/2004-03-13-InstCombineInfLoop.ll which caused an infinite loop compiling (I think) povray. llvm-svn: 12365	2004-03-13 23:54:27 +00:00
Chris Lattner	8d45aeaff1	This little patch speeds up the loop used to update the dominator set analysis. On the testcase from GCC PR12440, which has a LOT of loops (1392 of which require preheaders to be inserted), this speeds up the loopsimplify pass from 1.931s to 0.1875s. The loop in question goes from 1.65s -> 0.0097s, which isn't bad. All of these times are a debug build. This adds a dependency on DominatorTree analysis that was not there before, but we always had dominatortree available anyway, because LICM requires both loop simplify and DT, so this doesn't add any extra analysis in practice. llvm-svn: 12362	2004-03-13 22:01:26 +00:00

1 2 3 4 5 ...

605 Commits