llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Chris Lattner	710a51d72b	It's not just a printer, it's actually an analysis too llvm-svn: 13064	2004-04-19 03:42:32 +00:00
Chris Lattner	69d4611250	Remove code to update loop depths llvm-svn: 13058	2004-04-19 03:02:09 +00:00
Chris Lattner	1849aa8b1f	Add #include llvm-svn: 13057	2004-04-19 03:01:23 +00:00
Chris Lattner	ab6502f058	Move isLoopInvariant to the Loop class llvm-svn: 13051	2004-04-18 22:46:08 +00:00
Chris Lattner	509116ec78	Add new method llvm-svn: 13050	2004-04-18 22:45:27 +00:00
Chris Lattner	5a0ed18724	Correct rewriting of exit blocks after my last patch llvm-svn: 13048	2004-04-18 22:27:10 +00:00
Chris Lattner	06e17bb6f7	Fix computation of exit blocks llvm-svn: 13047	2004-04-18 22:21:41 +00:00
Chris Lattner	8e42c6f409	Loop exit sets are no longer explicitly held, they are dynamically computed on demand. llvm-svn: 13046	2004-04-18 22:15:13 +00:00
Chris Lattner	7174acca00	Change the ExitBlocks list from being explicitly contained in the Loop structure to being dynamically computed on demand. This makes updating loop information MUCH easier. llvm-svn: 13045	2004-04-18 22:14:10 +00:00
Chris Lattner	13140766df	Reduce the unrolling limit llvm-svn: 13040	2004-04-18 18:06:14 +00:00
Chris Lattner	430968ac2f	If the preheader of the loop was the entry block of the function, make sure that the exit block of the loop becomes the new entry block of the function. This was causing a verifier assertion on 252.eon. llvm-svn: 13039	2004-04-18 17:38:42 +00:00
Chris Lattner	199b58db3f	Be much more careful about how we update instructions outside of the loop using instructions inside of the loop. This should fix the MishaTest failure from last night. llvm-svn: 13038	2004-04-18 17:32:39 +00:00
Chris Lattner	08232425a0	Implement method llvm-svn: 13036	2004-04-18 06:54:48 +00:00
Chris Lattner	33ec7f2f9f	After unrolling our single basic block loop, fold it into the preheader and exit block. The primary motivation for doing this is that we can now unroll nested loops. This makes a pretty big difference in some cases. For example, in 183.equake, we are now beating the native compiler with the CBE, and we are a lot closer with LLC. I'm now going to play around a bit with the unroll factor and see what effect it really has. llvm-svn: 13034	2004-04-18 06:27:43 +00:00
Chris Lattner	f2045a8c05	Fix a bug: this does not preserve the CFG! While we're at it, add support for updating loop information correctly. llvm-svn: 13033	2004-04-18 05:38:37 +00:00
Chris Lattner	6606b526f6	Add a new method, add a check missing that caused a segfault if a loop didn't have a canonical indvar llvm-svn: 13032	2004-04-18 05:38:05 +00:00
Chris Lattner	b0d23bf99d	Initial checkin of a simple loop unroller. This pass is extremely basic and limited. Even in it's extremely simple state (it can only fully unroll single basic block loops that execute a constant number of times), it already helps improve performance a LOT on some benchmarks, particularly with the native code generators. llvm-svn: 13028	2004-04-18 05:20:17 +00:00
Chris Lattner	e0f56972f0	Make the tail duplication threshold accessible from the command line instead of hardcoded llvm-svn: 13025	2004-04-18 00:52:43 +00:00
Chris Lattner	22ca3df5b1	Fix a memory leak. We leaked the vector holding the entries in switch tables. llvm-svn: 13023	2004-04-17 23:49:15 +00:00
Chris Lattner	b5ee2bcb62	Add the ability to compute exit values for complex loop using unanalyzable operations. This allows us to compile this testcase: int main() { int h = 1; do h = 3 * h + 1; while (h <= 256); printf("%d\n", h); return 0; } into this: int %main() { entry: call void %__main( ) %tmp.6 = call int (sbyte, ...) %printf( sbyte* getelementptr ([4 x sbyte]* %.str_1, long 0, long 0), int 364 ) ; <int> [#uses=0] ret int 0 } This testcase was taken directly from 256.bzip2, believe it or not. This code is not as general as I would like. Next up is to refactor it a bit to handle more cases. llvm-svn: 13019	2004-04-17 22:58:41 +00:00
Chris Lattner	740ae78ae6	If the loop executes a constant number of times, try a bit harder to replace exit values. llvm-svn: 13018	2004-04-17 18:44:09 +00:00
Chris Lattner	9a73de2ba2	Add the ability to compute trip counts that are only controlled by constants even if the loop is using expressions that we can't compute as a closed-form. This allows us to calculate that this function always returns 55: int test() { double X; int Count = 0; for (X = 100; X > 1; X = sqrt(X), ++Count) /empty/; return Count; } And allows us to compute trip counts for loops like: int h = 1; do h = 3 * h + 1; while (h <= 256); (which occurs in bzip2), and for this function, which occurs after inlining and other optimizations: int popcount() { int x = 666; int result = 0; while (x != 0) { result = result + (x & 0x1); x = x >> 1; } return result; } We still cannot compute the exit values of result or h in the two loops above, which means we cannot delete the loop, but we are getting closer. Being able to compute a constant trip count for these two loops will allow us to unroll them completely though. llvm-svn: 13017	2004-04-17 18:36:24 +00:00
Chris Lattner	bcb690dc9b	Fix a HUGE pessimization on X86. The indvars pass was taking this (familiar) function: int _strlen(const char str) { int len = 0; while (str++) len++; return len; } And transforming it to use a ulong induction variable, because the type of the pointer index was left as a constant long. This is obviously very bad. The fix is to shrink long constants in getelementptr instructions to intptr_t, making the indvars pass insert a uint induction variable, which is much more efficient. Here's the before code for this function: int %_strlen(sbyte* %str) { entry: %tmp.13 = load sbyte* %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit * %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=2] * %indvar = phi ulong [ %indvar.next, %no_exit ], [ 0, %entry ] ; <ulong> [#uses=2] %indvar1 = cast ulong %indvar to uint ; <uint> [#uses=1] %inc.02.sum = add uint %indvar1, 1 ; <uint> [#uses=1] %inc.0.0 = getelementptr sbyte* %str, uint %inc.02.sum ; <sbyte> [#uses=1] %tmp.1 = load sbyte %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add ulong %indvar, 1 ; <ulong> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit.loopexit, label %no_exit loopexit.loopexit: ; preds = %no_exit %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] ret int %inc.1 loopexit: ; preds = %entry ret int 0 } Here's the after code: int %_strlen(sbyte* %str) { entry: %inc.02 = getelementptr sbyte* %str, uint 1 ; <sbyte> [#uses=1] %tmp.13 = load sbyte %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit *** %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=3] %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.0.0 = getelementptr sbyte* %inc.02, uint %indvar ; <sbyte> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] %tmp.1 = load sbyte %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit, label %no_exit loopexit: ; preds = %entry, %no_exit %len.0.1 = phi int [ 0, %entry ], [ %inc.1, %no_exit ] ; <int> [#uses=1] ret int %len.0.1 } llvm-svn: 13016	2004-04-17 18:16:10 +00:00
Chris Lattner	5c85946417	Even if there are not any induction variables in the loop, if we can compute the trip count for the loop, insert one so that we can canonicalize the exit condition. llvm-svn: 13015	2004-04-17 18:08:33 +00:00
Chris Lattner	6d5decd7d4	Add support for evaluation of exp/log/log10/pow llvm-svn: 13011	2004-04-16 22:35:33 +00:00
Chris Lattner	ed423cc09d	Fix some really nasty dominance bugs that were exposed by my patch to make the verifier more strict. This fixes building zlib llvm-svn: 13002	2004-04-16 18:08:07 +00:00
Misha Brukman	cb5de6bca6	Fix retriving parent Function. llvm-svn: 13001	2004-04-16 17:37:12 +00:00
Brian Gaeke	4b9f67c638	Include <cmath> for compatibility with gcc 3.0.x (the system compiler on Debian.) llvm-svn: 12986	2004-04-16 15:57:32 +00:00
Misha Brukman	aadcd46d25	Assert if deleting BasicBlock before removing it from Function. llvm-svn: 12983	2004-04-16 15:47:21 +00:00
Chris Lattner	bc458be5f9	Fix some of the strange CBE-only failures that happened last night. llvm-svn: 12980	2004-04-16 06:03:17 +00:00
Chris Lattner	644ad23b21	Make sure to check for a very bad class of errors: an instruction that does not dominate all of its users, but is in the same basic block as its users. This class of error is what caused the mysterious CBE only failures last night. llvm-svn: 12979	2004-04-16 05:51:47 +00:00
Chris Lattner	8f8bd7daac	Bugpoint was not correctly capturing stderr! This caused it to "find" bugs that didn't exist, missing the ones that do :( llvm-svn: 12978	2004-04-16 05:35:58 +00:00
Chris Lattner	06eda01d1b	Fix Inline/2004-04-15-InlineDeletesCall.ll Basically we were using SimplifyCFG as a huge sledgehammer for a simple optimization. Because simplifycfg does so many things, we can't use it for this purpose. llvm-svn: 12977	2004-04-16 05:17:59 +00:00
Chris Lattner	ac2b465cb4	Fix a bug in the previous checkin: if the exit block is not the same as the back-edge block, we must check the preincremented value. llvm-svn: 12968	2004-04-15 20:26:22 +00:00
Brian Gaeke	e708b1d5ef	Give SparcV9CodeEmitter a head-of-file comment and a PassName. llvm-svn: 12967	2004-04-15 20:23:13 +00:00
Chris Lattner	dcf2ca93e6	Change the canonical induction variable that we insert. Instead of producing code like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X != N-1) goto Loop We now generate code that looks like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X2 != N) goto Loop This has two big advantages: 1. The trip count of the loop is now explicit in the code, allowing the direct implementation of Loop::getTripCount() 2. This reduces register pressure in the loop, and allows X and X2 to be put into the same register. As a consequence of the second point, the code we generate for loops went from: .LBB2: # no_exit.1 ... mov %EDI, %ESI inc %EDI cmp %ESI, 2 mov %ESI, %EDI jne .LBB2 # PC rel: no_exit.1 To: .LBB2: # no_exit.1 ... inc %ESI cmp %ESI, 3 jne .LBB2 # PC rel: no_exit.1 ... which has two fewer moves, and uses one less register. llvm-svn: 12961	2004-04-15 15:21:43 +00:00
Chris Lattner	a86cf626b5	add some helpful methods. Rearrange #includes to proper order llvm-svn: 12960	2004-04-15 15:16:02 +00:00
Chris Lattner	e0156bd979	Factor a bunch of classes out into a public header llvm-svn: 12958	2004-04-15 15:07:24 +00:00
Chris Lattner	ff600e280d	Unbreak the build llvm-svn: 12956	2004-04-15 14:17:43 +00:00
Chris Lattner	276a6e102c	Implement a FIXME: if we're going to insert a cast, we might as well only insert it once! llvm-svn: 12955	2004-04-14 22:01:22 +00:00
John Criswell	8a4525ae64	Remove code to adjust the iterator for llvm.readio and llvm.writeio. The iterator is pointing at the next instruction which should not disappear when doing the load/store replacement. llvm-svn: 12954	2004-04-14 21:27:56 +00:00
Brian Gaeke	8e2fb33172	Fix typo. llvm-svn: 12953	2004-04-14 21:21:56 +00:00
Chris Lattner	7f5e4b6d55	This is a trivial tweak to the addrec insertion code: insert the increment at the bottom of the loop instead of the top. This reduces the number of overlapping live ranges a lot, for example, eliminating a spill in an important loop in 183.equake with linear scan. I still need to make the exit comparison of the loop use the post-incremented version of this variable, but this is an easy first step. llvm-svn: 12952	2004-04-14 21:11:25 +00:00
Brian Gaeke	2c02798e86	Add a TargetData to the PassManager regardless of the TargetMachine. This should unbreak the Sparc JIT again. llvm-svn: 12949	2004-04-14 17:45:52 +00:00
John Criswell	bed6463449	Remove the return type check for llvm.readio. This check is done for all functions and is not needed here. Simplify the pointer type check per Chris's suggestions. llvm-svn: 12945	2004-04-14 15:06:48 +00:00
John Criswell	e00ecd7e84	Added code to verify that llvm.readio's pointer argument returns something that matches its return type. llvm-svn: 12944	2004-04-14 14:49:36 +00:00
John Criswell	11f7f60028	Finish adding the llvm.readio and llvm.writeio intrinsics. Sorry these didn't get in yesterday. llvm-svn: 12942	2004-04-14 13:46:52 +00:00
Chris Lattner	6fcf8c7402	ADd a trivial instcombine: load null -> null llvm-svn: 12940	2004-04-14 03:28:36 +00:00
Chris Lattner	64431dbce7	This is the real fix for Codegen/X86/2004-04-13-FPCMOV-Crash.llx which works even when the "optimization" I added before is turned off. It generates this extremely pointless code: test: fld QWORD PTR [%ESP + 4] mov %AL, 0 test %AL, %AL fcmove %ST(0), %ST(0) ret Good thing the optimizer will have removed this before code generation anyway. :) llvm-svn: 12939	2004-04-14 02:42:32 +00:00
John Criswell	94de925685	Added support for the llvm.readio and llvm.writeio intrinsics. On x86, memory operations occur in-order, so these are just lowered into volatile loads and stores. llvm-svn: 12936	2004-04-13 22:13:14 +00:00

1 2 3 4 5 ...

6105 Commits