llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 15:32:52 +01:00

Author	SHA1	Message	Date
Brian Gaeke	ab7dd80200	Add accessor function. llvm-svn: 16622	2004-09-30 20:14:29 +00:00
Brian Gaeke	65540b3e58	Correct type of accessor functions. llvm-svn: 16621	2004-09-30 20:14:18 +00:00
Brian Gaeke	90a286872c	Namespacify. Add accessor function. llvm-svn: 16620	2004-09-30 20:14:07 +00:00
Chris Lattner	af68e9a012	Disable the 'WARNING: Found global types that are not compatible' warning that always prints when linking programs to libstdc++ :( llvm-svn: 16603	2004-09-30 00:12:29 +00:00
Chris Lattner	8341306cba	Hrm, debugging printouts do not need to be in here llvm-svn: 16598	2004-09-29 21:21:14 +00:00
Chris Lattner	79ceb6ba53	* Pull range optimization code out into new InsertRangeTest function. * SubOne/AddOne functions always return ConstantInt, declare them as such * Pull code for handling setcc X, cst, where cst is at the end of the range, or cc is LE or GE up earlier in visitSetCondInst. This reduces #iterations in some cases. * Fold: (div X, C1) op C2 -> range check, implementing div.ll:test6 - test9. llvm-svn: 16588	2004-09-29 17:40:11 +00:00
Chris Lattner	778a49acfd	Do not insert trivially dead select instructions, which allows us to potentially fold more in one pass. llvm-svn: 16583	2004-09-29 05:43:32 +00:00
Chris Lattner	572652718c	Fold binary expressions and casts into PHI nodes that have all constant inputs. This takes something like this: %A = phi int [ 3, %cond_false.0 ], [ 2, %endif.0.i ], [ 2, %endif.1.i ] %B = div int %tmp.243, 4 and turns it into: %A = phi int [ 3/4, %cond_false.0 ], [ 2/4, %endif.0.i ], [ 2/4, %endif.1.i ] which is later simplified (in this case) into %A = 0. This triggers thousands of times in spec, for example, 269 times in 176.gcc. This is tested by InstCombine/add.ll:test23 and set.ll:test18. llvm-svn: 16582	2004-09-29 05:07:12 +00:00
Chris Lattner	4ea03eea49	Hrm, really, all tests passed without this, but it is scary to think how... llvm-svn: 16568	2004-09-29 03:16:24 +00:00
Chris Lattner	1ad393b186	Remove debugging printout Instcombine (setcc (truncate X), C1). This occurs THOUSANDS of times in many benchmarks. Particularly common seem to be things like (seteq (cast bool X to int), int 0) This turns it into (seteq bool %X, false), which then becomes (not %X). llvm-svn: 16567	2004-09-29 03:09:18 +00:00
Chris Lattner	0046cec2a2	Fold (X setcc C1) | (X setcc C2) This implements or.ll:test1[89] llvm-svn: 16561	2004-09-28 22:33:08 +00:00
Chris Lattner	d3cfa5aba5	Fold (and (setcc X, C1), (setcc X, C2)) This is important for several reasons: 1. Benchmarks have lots of code that looks like this (perlbmk in particular): %tmp.2.i = setne int %tmp.0.i, 128 ; <bool> [#uses=1] %tmp.6343 = seteq int %tmp.0.i, 1 ; <bool> [#uses=1] %tmp.63 = and bool %tmp.2.i, %tmp.6343 ; <bool> [#uses=1] we now fold away the setne, a clear improvement. 2. In the more important cases, such as (X >= 10) & (X < 20), we now produce smaller code: (X-10) < 10. 3. Perhaps the nicest effect of this patch is that it really helps out the code generators. In particular, for a 'range test' like the above, instead of generating this on X86 (the difference on PPC is even more pronounced): cmp %EAX, 50 setge %CL cmp %EAX, 100 setl %AL and %CL, %AL cmp %CL, 0 we now generate this: add %EAX, -50 cmp %EAX, 50 Furthermore, this causes setcc's to be folded into branches more often. These combinations trigger dozens of times in the spec benchmarks, particularly in 176.gcc, 186.crafty, 253.perlbmk, 254.gap, & 099.go. llvm-svn: 16559	2004-09-28 21:48:02 +00:00
Chris Lattner	d7b9ededb4	Implement X / C1 / C2 folding Implement (setcc (shl X, C1), C2) folding. The second one occurs several dozen times in spec. The first was added just in case. :) These are tested by shift.ll:test2[12], and div.ll:test5 llvm-svn: 16549	2004-09-28 18:22:15 +00:00
Chris Lattner	a4e0ed87bc	shl is always zero extending, so always use a zero extending shift right. This latent bug was exposed by recent changes, and is tested as: llvm/test/Regression/Transforms/InstCombine/2004-09-28-BadShiftAndSetCC.llx llvm-svn: 16546	2004-09-28 17:54:07 +00:00
Alkis Evlogimenos	4f5920aaef	Add includes and use std:: for standard library calls to make code compile on windows. This patch was contributed by Paolo Invernizzi. llvm-svn: 16539	2004-09-28 14:42:44 +00:00
Alkis Evlogimenos	7ff66b2884	Pull assignment out of for loop conditional in order for this to compile under windows. Patch contributed by Paolo Invernizzi! llvm-svn: 16534	2004-09-28 02:40:37 +00:00
Chris Lattner	f953091075	Fix two bugs: one where a condition was mistakenly swapped, and another where we folded (X & 254) -> X < 1 instead of X < 2. These problems were latent problems exposed by the latest patch. llvm-svn: 16528	2004-09-27 19:29:18 +00:00
Chris Lattner	a715ffded3	Fold: (setcc (shr X, ShAmt), CI), where 'cc' is eq or ne. This xform triggers often, for example: 6x in povray, 1x in gzip, 279x in gcc, 1x in crafty, 8x in eon, 11x in perlbmk, 362x in gap, 4x in vortex, 14 in m88ksim, 211x in 126.gcc, 1x in compress, 11x in ijpeg, and 4x in 147.vortex. llvm-svn: 16521	2004-09-27 16:18:50 +00:00
Chris Lattner	9d4748d32d	Implement shift-and combinations, implementing InstCombine/and.ll:test19-21 These combinations trigger 4 times in povray, 7x in gcc, 4x in gap, and 2x in bzip2. llvm-svn: 16508	2004-09-24 15:21:34 +00:00
Chris Lattner	7e603bfc67	Move LHSI->hasOneUse() into the arms of the conditional, reindenting code. No functionality changes here. llvm-svn: 16505	2004-09-23 21:52:49 +00:00
Chris Lattner	00ea30c3c5	Implement Transforms/InstCombine/and.ll:test18, a case that occurs 20 times in perlbmk llvm-svn: 16504	2004-09-23 21:46:38 +00:00
Chris Lattner	6409a166e8	Implement select.ll:test16: fold load (select C, X, null) -> load X llvm-svn: 16499	2004-09-23 15:46:00 +00:00
Chris Lattner	537636bb55	Do not fold (X + C1 != C2) if there are other users of the add. Doing this transformation used to take a loop like this: int Array[1000]; void test(int X) { int i; for (i = 0; i < 1000; ++i) Array[i] += X; } Compiled to LLVM is: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=2] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=2] * %exitcond = seteq uint %indvar.next, 1000 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit and turn it into a loop like this: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=3] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] * %exitcond = seteq uint %indvar, 999 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit Note that indvar.next and indvar can no longer be coalesced. In machine code terms, this patch changes this code: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4*%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4*%ECX], %ESI mov %EDX, %ECX inc %EDX cmp %ECX, 999 mov %ECX, %EDX jne .LBBtest_1 # no_exit into this: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4*%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4*%ECX], %ESI inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit We need better instruction selection to get this: .LBBtest_1: # no_exit add DWORD PTR [Array + 4*%ECX], EAX inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit ... but at least there is less register juggling llvm-svn: 16473	2004-09-21 21:35:23 +00:00
Chris Lattner	b64bfebc25	Fix potential miscompilations: InstCombine/2004-09-20-BadLoadCombine*.llx llvm-svn: 16447	2004-09-20 10:15:10 +00:00
Alkis Evlogimenos	a3a9fa1d80	Fix loop condition so that we don't decrement off the beginning of the list. llvm-svn: 16440	2004-09-20 06:42:58 +00:00
Chris Lattner	43c0372c0b	'Pass' should now not be derived from by clients. Instead, they should derive from ModulePass. Instead of implementing Pass::run, they should implement ModulePass::runOnModule. llvm-svn: 16436	2004-09-20 04:48:05 +00:00
Chris Lattner	b65d7c65d6	Prototype more accurately llvm-svn: 16433	2004-09-20 04:43:57 +00:00
Chris Lattner	c137c9ac39	Prototype these functions more accurately llvm-svn: 16432	2004-09-20 04:43:15 +00:00
Chris Lattner	f607a26457	Make isSafeToLoadUnconditionally a bit smarter, implementing PR362 and Regression/Transforms/InstCombine/CPP_min_max.llx llvm-svn: 16409	2004-09-19 19:18:10 +00:00
Chris Lattner	98d434988e	Remove a whole bunch of horrible hacky code that was used to promote allocas whose addresses were used by trivial phi nodes and select instructions. This is now performed by the instcombine pass, which is more powerful, is much simpler, and is faster. This allows the deletion of a bunch of code, two FIXME's and two gotos. llvm-svn: 16406	2004-09-19 18:51:51 +00:00
Chris Lattner	f45dc6dae6	Make instruction combining a bit more aggressive in the face of volatile loads, and implement two new transforms: InstCombine/load.ll:test[56]. llvm-svn: 16404	2004-09-19 18:43:46 +00:00
Chris Lattner	188b4e4983	Add comment llvm-svn: 16400	2004-09-19 01:05:16 +00:00
Chris Lattner	12bcdf2e01	Fix the inliner to always delete any edges from the external call node to a function being deleted. Due to optimizations done while inlining, there can be edges from the external call node to a function node that were not apparent any longer. This fixes the compiler crash while compiling 175.vpr llvm-svn: 16399	2004-09-18 21:37:03 +00:00
Chris Lattner	223e9d38b5	Convert this pass to be a CallGraphSCCPass instead of a Pass, which eliminates the worklist and makes it more efficient. This does not change functionality at all. llvm-svn: 16390	2004-09-18 00:34:13 +00:00
Chris Lattner	f31ae4da07	Make sure to remove the Select instruction as well llvm-svn: 16389	2004-09-18 00:32:40 +00:00
Chris Lattner	228e66d208	Fix typo in comment llvm-svn: 16384	2004-09-17 03:58:39 +00:00
Chris Lattner	e54e70266e	Add a newline llvm-svn: 16369	2004-09-15 17:53:52 +00:00
Reid Spencer	c6a8d70cff	Convert code to compile with vc7.1. Patch contributed by Paolo Invernizzi. Thanks Paolo! llvm-svn: 16368	2004-09-15 17:06:42 +00:00
Chris Lattner	5751f19d86	Fix a bug in the previous checkin that broke 255.vortex llvm-svn: 16355	2004-09-15 02:34:40 +00:00
Chris Lattner	6833186048	Make sure to update alias analysis information as we transform the function. This fixes PR420 and Regression/Transforms/LICM/2004-09-14-AliasAnalysisInvalidate.llx llvm-svn: 16348	2004-09-15 01:04:07 +00:00
Chris Lattner	1d9b957384	If given an AliasSetTracker object to update, update it. llvm-svn: 16347	2004-09-15 01:02:54 +00:00
Chris Lattner	a7635e78c2	Remove a long-dead pass. Actually, this pass was never used at all. llvm-svn: 16337	2004-09-14 16:33:01 +00:00
Alkis Evlogimenos	0c50e0f211	Fixes to make LLVM compile with vc7.1. Patch contributed by Paolo Invernizzi! llvm-svn: 16152	2004-09-03 18:19:51 +00:00
Reid Spencer	c4abcbefb1	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Reid Spencer	7117a132ea	Initial checkin of a pass to lower packed operations to scalars operations. This also registers the pass with opt with a -lower-packed command line option. Patch contributed by Brad Jones. llvm-svn: 15987	2004-08-21 21:39:24 +00:00
Chris Lattner	9f60c755f8	If we are linking two global variables and they have the same size, do not spew warnings, even if the types don't match. llvm-svn: 15933	2004-08-20 00:30:39 +00:00
Chris Lattner	8e2dc1a98a	Implement test/Regression/Transforms/GlobalConstifier/phi-select.llx This allows more globals to be marked constant, particularly global arrays. llvm-svn: 15735	2004-08-14 20:57:17 +00:00
Chris Lattner	84445ee674	If we are extracting a block that has multiple successors that are the same block (common in a switch), make sure to remove extra edges in successor blocks. This fixes CodeExtractor/2004-08-12-BlockExtractPHI.ll and should be pulled into LLVM 1.3 (though the regression test need not be, as that would require pulling in the LoopExtract.cpp changes). llvm-svn: 15717	2004-08-13 03:27:07 +00:00
Chris Lattner	2c3ad7902d	When we code extract some stuff, leave the codeRepl block in the place where the extracted code was, instead of putting it at the end of the function llvm-svn: 15716	2004-08-13 03:17:39 +00:00
Chris Lattner	8537ae6e2c	"extract" the block extractor pass from bugpoint (haha) llvm-svn: 15714	2004-08-13 03:05:17 +00:00
Chris Lattner	7abc1c2473	Add value mapper support for select constant exprs. This should fix a bug Nate ran into when bugpointing siod. This fix should go into LLVM 1.3 llvm-svn: 15712	2004-08-13 02:43:19 +00:00
Chris Lattner	32ad5d0bab	This patch makes the inliner refuse to inline functions that have alloca instructions in the body of the function (not the entry block). This fixes test/Programs/SingleSource/Regression/C/2004-08-12-InlinerAndAllocas.c and test/Programs/External/SPEC/CINT2000/176.gcc on zion. This should obviously be pulled into 1.3. llvm-svn: 15684	2004-08-12 05:45:09 +00:00
Chris Lattner	85e2339cfa	Fix code extraction of unwind blocks. This fixed bugs that bugpoint can run into. This should go into 1.3 llvm-svn: 15679	2004-08-12 03:17:02 +00:00
Chris Lattner	430578a835	Hrm, this pass didn't compile. This bugfix should go into 1.3! llvm-svn: 15676	2004-08-12 02:44:23 +00:00
Chris Lattner	2f98c58e84	Fix InstCombine/2004-08-10-BoolSetCC.ll, a bug that is miscompiling 176.gcc. Note that this is apparently not the only bug miscompiling gcc though. :( llvm-svn: 15639	2004-08-11 00:50:51 +00:00
Chris Lattner	32b5c4960c	Fix InstCombine/2004-08-09-RemInfLoop.llx This should go into the 1.3 branch llvm-svn: 15593	2004-08-09 21:05:48 +00:00
Chris Lattner	c5a25532c7	Fix another really nasty regression that Anshu pointed out. In cases where dangling constant users were removed from a function, causing it to be dead, we never removed the call graph edge from the external node to the function. In most cases, this didn't cause a problem (by luck). This should definitely go into 1.3 llvm-svn: 15570	2004-08-08 03:29:50 +00:00
Chris Lattner	0c29b326fb	Two fixes: 1. Fix a REALLY nasty cyclic replacement issue that Anshu discovered, causing nondeterministic crashes and memory corruption. 2. For performance, don't go inserting constantexpr casts of GV pointers. This should definitely go into 1.3 llvm-svn: 15568	2004-08-08 01:30:07 +00:00
Chris Lattner	7f088ed899	This DEBUG is buggy. comment it out because it's not worth fixing. This should go into 1.3 llvm-svn: 15567	2004-08-08 01:27:56 +00:00
Alkis Evlogimenos	f853362a44	Stop using getValues(). llvm-svn: 15487	2004-08-04 08:44:43 +00:00
Chris Lattner	41c8b70624	Fix a regression in InstCombine/xor.ll llvm-svn: 15410	2004-08-01 19:42:59 +00:00
Chris Lattner	170b31f44d	Expose this as a functionpass llvm-svn: 15369	2004-07-31 10:01:58 +00:00
Misha Brukman	4b70aa2e78	Fix De Morgan's name. llvm-svn: 15343	2004-07-30 12:50:08 +00:00
Chris Lattner	e63c404df2	Start using the PatternMatcher a bit. llvm-svn: 15342	2004-07-30 07:50:03 +00:00
Misha Brukman	8760d70159	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15337	2004-07-29 17:30:57 +00:00
Misha Brukman	58104df77b	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15334	2004-07-29 17:30:56 +00:00
Misha Brukman	2a80e53645	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15328	2004-07-29 17:05:13 +00:00
Alkis Evlogimenos	fb27f702ca	Merge i*.h headers into Instructions.h as part of bug403. llvm-svn: 15325	2004-07-29 12:17:34 +00:00
Robert Bocchino	4325ca6606	This change fixed a bug in the function visitMul. The prior version assumed that a constant on the RHS of a multiplication was either an IntConstant or an FPConstant. It checked for an IntConstant and then, if it did not find one, did a hard cast to an FPConstant. That code would crash if the RHS were a ConstantExpr that was neither an IntConstant nor an FPConstant. This version replaces the hard cast with a dyn_cast. It performs the same way for IntConstants and FPConstants but does nothing, instead of crashing, for constant expressions. The regression test for this change is 2004-07-27-ConstantExprMul.ll. llvm-svn: 15291	2004-07-27 21:02:21 +00:00
Brian Gaeke	45adb41f46	Make the create...() functions for some of these passes return a FunctionPass *. llvm-svn: 15276	2004-07-27 17:43:21 +00:00
Chris Lattner	5cfb85064e	Fix hoisting of void typed values, e.g. calls llvm-svn: 15263	2004-07-27 07:38:32 +00:00
Chris Lattner	59aff88abe	Implement DeadStoreElim/alloca.llx by observing that allocas are dead at the end of the function (either return or unwind) llvm-svn: 15232	2004-07-26 06:14:11 +00:00
Chris Lattner	c2e9c56906	Throttle back indvar substitution from creating multiplies in loops. This is bad bad bad. llvm-svn: 15227	2004-07-26 02:47:12 +00:00
Chris Lattner	a259f9201c	* Substantially simplify how free instructions are handled (potentially fixing a bug in DSE). * Delete dead operand uses iteratively instead of recursively, using a SetVector. * Defer deletion of dead operand uses until the end of processing, which means we don't have to bother with updating the AliasSetTracker. This speeds up DSE substantially. llvm-svn: 15204	2004-07-25 11:09:56 +00:00
Chris Lattner	0a9d5e6f14	Free instructions kill values too. This implements DeadStoreElim/free.llx llvm-svn: 15199	2004-07-25 07:58:38 +00:00
Chris Lattner	29e97f36bf	obvious fix llvm-svn: 15162	2004-07-24 07:51:27 +00:00
Chris Lattner	7e9731bc4f	This is a trivial dead store elimination pass. It is very, very simple and can be improved in many ways. But: stop laughing, even with -basicaa it deletes 15% of the stores in 252.eon :) llvm-svn: 15101	2004-07-22 08:00:28 +00:00
Chris Lattner	7b301dfa9d	Update GC intrinsics to take a pointer to the object as well as a pointer to the field being updated. Patch contributed by Tobias Nurmiranta llvm-svn: 15097	2004-07-22 05:51:13 +00:00
Brian Gaeke	f18cdca667	These files don't need to include <iostream> since they include "Support/Debug.h". llvm-svn: 15089	2004-07-21 20:50:33 +00:00
Chris Lattner	b77bda4432	* Further cleanup. * Test for whether bits are shifted out during the optzn. If so, the fold is illegal, though it can be handled explicitly for setne/seteq This fixes the miscompilation of 254.gap last night, which was a latent bug exposed by other optimizer improvements. llvm-svn: 15085	2004-07-21 20:14:10 +00:00
Chris Lattner	a3c10c2012	Make cast-cast code a bit more defensive "simplify" a bit of code for comparison/and folding llvm-svn: 15082	2004-07-21 19:50:44 +00:00
Chris Lattner	62c40d3982	Remove special casing of pointers and treat them generically as integers of the appropriate size. This gives us the ability to eliminate int -> ptr -> int llvm-svn: 15063	2004-07-21 04:27:24 +00:00
Chris Lattner	4e4b8b0ad2	Fix a serious code pessimization problem. If an inlined function has a single return, clone the 'ret' BB code into the block AFTER the inlined call, not the other way around. llvm-svn: 15030	2004-07-20 05:45:24 +00:00
Chris Lattner	f8c20caf25	Implement Transforms/InstCombine/IntPtrCast.ll llvm-svn: 15029	2004-07-20 05:21:00 +00:00
Chris Lattner	da83200d72	Ignore instructions that are in trivially dead functions. This allows us to constify 14 globals instead of 4 in a trivial C++ testcase. llvm-svn: 15027	2004-07-20 03:58:07 +00:00
Chris Lattner	f6eb30e9a8	Implement InstCombine/GEPIdxCanon.ll llvm-svn: 15024	2004-07-20 01:48:15 +00:00
Chris Lattner	8277141f09	Implement SimplifyCFG/BrUnwind.ll llvm-svn: 15022	2004-07-20 01:17:38 +00:00
Chris Lattner	d9c41a82a3	Rewrite cast->cast elimination code completely based on the information we actually care about. Someday when the cast instruction is gone, we can do better here, but this will do for now. This implements instcombine/cast.ll:test17/18 as well. llvm-svn: 15018	2004-07-20 00:59:32 +00:00
Chris Lattner	ffc1df7399	Fix a performance regression from the CPR patch, simplify code llvm-svn: 14974	2004-07-18 21:34:16 +00:00
Chris Lattner	9de817e13e	Strip out and simplify some code. This also fixes the regression last night compiling cfrac. It did not realize that code like this: int G; int *H = &G; takes the address of G. llvm-svn: 14973	2004-07-18 19:56:20 +00:00
Chris Lattner	8c4d6aa7e8	Minor cleanup, no functionality change llvm-svn: 14972	2004-07-18 18:59:44 +00:00
Reid Spencer	7b03169e0e	Remove an if statement that would never be reached. llvm-svn: 14968	2004-07-18 08:41:47 +00:00
Reid Spencer	6d26720976	Delete a redundant if branch. llvm-svn: 14967	2004-07-18 08:34:52 +00:00
Reid Spencer	3a18547fcb	Expand the coercion of constants to include the newly constant Globals. llvm-svn: 14966	2004-07-18 08:34:19 +00:00
Reid Spencer	2b7bae6b4f	Delete a no-op loop. llvm-svn: 14965	2004-07-18 08:32:43 +00:00
Reid Spencer	7236678c32	Expand the scope to include global values because they are now constants too. llvm-svn: 14964	2004-07-18 08:32:10 +00:00
Reid Spencer	90795f0825	Avoid an unnecessary isa<Constant>. llvm-svn: 14963	2004-07-18 08:31:18 +00:00
Chris Lattner	71f281984d	Remove useless statistic, fix some slightly broken logic llvm-svn: 14958	2004-07-18 07:22:58 +00:00
Chris Lattner	99e46b2e81	Fix a rather serious bug in previous checkin llvm-svn: 14957	2004-07-18 06:56:58 +00:00
Reid Spencer	7f33869f9b	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage llvm-svn: 14953	2004-07-18 00:44:37 +00:00
Reid Spencer	14243817ec	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Minimize redundant isa<GlobalValue> usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14950	2004-07-18 00:38:32 +00:00
Reid Spencer	51149979ce	bug 122: - Minimize redundant isa<GlobalValue> usage llvm-svn: 14948	2004-07-18 00:32:14 +00:00
Reid Spencer	2a8914b64b	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14947	2004-07-18 00:31:05 +00:00
Reid Spencer	66f4e3dc24	bug 122: - Minimize redundant isa<GlobalValue> usage - Correct isa<Constant> for GlobalValue subclass llvm-svn: 14946	2004-07-18 00:29:57 +00:00
Reid Spencer	2bfe4ec3cf	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Rename methods to get rid of ConstantPointerRef usage llvm-svn: 14945	2004-07-18 00:25:04 +00:00
Reid Spencer	55d436cc07	bug 122: - Excise dead CPR processing. llvm-svn: 14944	2004-07-18 00:23:51 +00:00
Reid Spencer	56ec516cb4	bug 122: - Replace ConstantPointerRef usage with GlobalValue usage - Correct test ordering for GlobalValue subclass llvm-svn: 14943	2004-07-18 00:19:45 +00:00
Chris Lattner	952bbd726b	This patch was contributed by Daniel Berlin! Speed up SCCP substantially by processing overdefined values quickly. This patch speeds up SCCP by about 30-40% on large testcases. llvm-svn: 14861	2004-07-15 23:36:43 +00:00
Chris Lattner	0193bbef1f	Fix PR404 try #2 This version takes about 1s longer than the previous one (down to 2.35s), but on the positive side, it actually works :) llvm-svn: 14856	2004-07-15 08:20:22 +00:00
Chris Lattner	fde0e6bb95	Revert previous patch until I get a bug fixed llvm-svn: 14853	2004-07-15 05:36:31 +00:00
Chris Lattner	9896dfb8e7	Fix PR404: Loop simplify is really slow on 252.eon This eliminates an N*N*logN algorithm from the loop simplify pass, replacing it with a much simpler and faster alternative. In a debug build, this reduces gccas time on eon from 85s to 42s. llvm-svn: 14851	2004-07-15 04:27:04 +00:00
Chris Lattner	2db1894038	Progress on PR341 llvm-svn: 14840	2004-07-15 02:06:12 +00:00
Chris Lattner	0f61d55197	Fixes working towards PR341 llvm-svn: 14839	2004-07-15 01:50:47 +00:00
Chris Lattner	de4449ef34	Now that we codegen the portable "sizeof" efficiently, we can use it for malloc lowering. This means that lowerallocations doesn't need targetdata anymore. yaay. llvm-svn: 14835	2004-07-15 01:08:08 +00:00
Chris Lattner	fca5fee9d5	Factor some code to handle "load (constantexpr cast foo)" just like "load (cast foo)". This allows us to compile C++ code like this: class Bclass { public: virtual int operator()() { return 666; } }; class Dclass: public Bclass { public: virtual int operator()() { return 667; } } ; int main(int argc, char **argv) { Dclass x; return x(); } Into this: int %main(int %argc, sbyte** %argv) { entry: call void %__main( ) ret int 667 } Instead of this: int %main(int %argc, sbyte** %argv) { entry: %x = alloca "struct.std::bad_typeid" ; <"struct.std::bad_typeid"> [#uses=3] call void %__main( ) %tmp.1.i.i = getelementptr "struct.std::bad_typeid" %x, uint 0, uint 0, uint 0 ; <int (...)*> [#uses=1] store int (...) getelementptr ([3 x int (...)] %vtable for Bclass, int 0, long 2), int (...)*** %tmp.1.i.i %tmp.3.i = getelementptr "struct.std::bad_typeid"* %x, int 0, uint 0, uint 0 ; <int (...)*> [#uses=1] store int (...) getelementptr ([3 x int (...)] %vtable for Dclass, int 0, long 2), int (...)*** %tmp.3.i %tmp.5 = load int ("struct.std::bad_typeid")* cast (int (...)** getelementptr ([3 x int (...)] %vtable for Dclass, int 0, long 2) to int ("struct.std::bad_typeid")) ; <int ("struct.std::bad_typeid")> [#uses=1] %tmp.6 = call int %tmp.5( "struct.std::bad_typeid" %x ) ; <int> [#uses=1] ret int %tmp.6 ret int 0 } In other words, we now resolve the virtual function call. llvm-svn: 14783	2004-07-13 01:49:43 +00:00
Chris Lattner	ed776e0c06	Check to make sure types are sized before calling getTypeSize on them. llvm-svn: 14649	2004-07-06 19:28:42 +00:00
Brian Gaeke	f9c3f6178a	It doesn't matter what the 2nd operand is; if the GEP has 2 operands and the first is a zero, we should leave it alone. llvm-svn: 14648	2004-07-06 19:24:47 +00:00
Brian Gaeke	ef192b1a19	Add helper function. Don't touch GEPs for which DecomposeArrayRef is not going to do anything special (e.g., < 2 indices, or 2 indices and the last one is a constant.) llvm-svn: 14647	2004-07-06 18:15:39 +00:00
Chris Lattner	560992669f	Implement rem.ll:test3 llvm-svn: 14640	2004-07-06 07:38:18 +00:00
Chris Lattner	97ebe25e6c	Fix a minor bug where we would go into infinite loops on some constants llvm-svn: 14638	2004-07-06 07:11:42 +00:00
Chris Lattner	4d6c479f08	Implement InstCombine/sub.ll:test15: X % -Y === X % Y Also, remove X % -1 = 0, because it's not true for unsigneds, and the signed case is superseded by this new handling. llvm-svn: 14637	2004-07-06 07:01:22 +00:00
Reid Spencer	50ec3f9325	Add #include <iostream> since Value.h does not #include it any more. llvm-svn: 14622	2004-07-04 12:19:56 +00:00
Chris Lattner	aafe5aea0d	Implement add.ll:test22, a common case in MSIL files llvm-svn: 14587	2004-07-03 00:26:11 +00:00
Chris Lattner	1120899069	Do not call getTypeSize on a type that has no size llvm-svn: 14584	2004-07-02 22:55:47 +00:00
Brian Gaeke	96e0f832f8	Get rid of a dead variable, and fix a typo in a comment. llvm-svn: 14560	2004-07-02 05:30:01 +00:00
Brian Gaeke	dbf4f13e85	Make this pass use a more specific debug message than "Processing:". llvm-svn: 14541	2004-07-01 19:27:10 +00:00
Vikram S. Adve	5ad9c7dd34	Restoring this file. llvm-svn: 14478	2004-06-29 14:20:27 +00:00
Chris Lattner	7d0108c315	Remove unused file llvm-svn: 14460	2004-06-28 00:46:58 +00:00
Chris Lattner	975c95e99a	These passes are long dead/obsolete. They never worked in the first place and are a maintenance burden. Nuke nuke nuke llvm-svn: 14457	2004-06-28 00:44:18 +00:00
Chris Lattner	8cfa7ad533	Implement InstCombine/add.ll:test21 llvm-svn: 14443	2004-06-27 22:51:36 +00:00
Chris Lattner	11f2a4542e	New constant expression lowering pass to simplify your instruction selection needs. Contributed by Vladimir Prus! llvm-svn: 14399	2004-06-25 07:48:09 +00:00
Vikram S. Adve	be2d02dd92	This file is unused, and duplicates functionality in TraceValues.cpp. llvm-svn: 14369	2004-06-24 20:16:22 +00:00
Chris Lattner	b4df848a58	Two fixes. First, stop using the ugly shouldSubstituteIndVar method. Second, disable substitution of quadratic addrec expressions to avoid putting multiplies in loops! llvm-svn: 14358	2004-06-24 06:49:18 +00:00
Misha Brukman	cf1f753730	Moved to lib/VMCore llvm-svn: 14348	2004-06-23 17:21:17 +00:00
Brian Gaeke	5da0956b27	Use new IsNAN() wrapper. llvm-svn: 14340	2004-06-23 00:25:35 +00:00
Misha Brukman	43f0a951fe	File depends on DSA, moved to lib/Analysis/DataStructure llvm-svn: 14325	2004-06-22 18:11:38 +00:00
Chris Lattner	97ab2cec69	FINALLY Fix a really nasty nondeterministic bug that has been haunting us since May 1st. In this code, the pred iterator was being invalidated sometimes causing the wrong entries to be added to PHI nodes. The fix for this is to dereference and save the *PI value before we hack on branch instructions, which changes use/def chains, which SOMETIMES invalidates the iterator. llvm-svn: 14278	2004-06-21 07:19:01 +00:00
Chris Lattner	892df61ed3	Comment out the isnan stuff until we get a proper autoconf test for it breaking the build on sparc is not acceptable. llvm-svn: 14277	2004-06-21 06:17:21 +00:00
Chris Lattner	8723083f3b	Make order of argument addition deterministic. In particular, the layout of ConstantInt objects in memory used to determine which order arguments were added in in some cases. llvm-svn: 14276	2004-06-21 00:07:58 +00:00
Chris Lattner	27578fd2f5	Make use of BinaryOperator::create* methods to shrinkify code. llvm-svn: 14262	2004-06-20 05:04:01 +00:00
Chris Lattner	6324d6d00b	Fix the inliner to be deterministic, not letting its output depend on the relative location of Function objects in memory. llvm-svn: 14260	2004-06-20 04:11:48 +00:00
Chris Lattner	58b3f16797	Add some DEBUG output to the simplifycfg routines Fix another non-deterministic behavior, this one should actually speed up the code though as it was doing silly things. llvm-svn: 14258	2004-06-20 01:13:18 +00:00
Chris Lattner	39a31b56ac	Now that dominator tree children are built in deterministic order, this horrible code can go away llvm-svn: 14254	2004-06-19 20:23:35 +00:00
Chris Lattner	ef8a52f263	This will hopefully fix a heisenbug that Vladimir Merzliakov is running into valiantly trying to compile stuff on freebsd. llvm-svn: 14251	2004-06-19 19:01:26 +00:00
Chris Lattner	3d68124555	Fix a nasty bug, noticed by Reid llvm-svn: 14249	2004-06-19 18:15:50 +00:00
Chris Lattner	185f2b060b	Fix one source of nondeterminism in the -licm pass: the hoist pass was processing blocks in whatever order they happened to end up in the dominator tree data structure. Force an ordering. llvm-svn: 14248	2004-06-19 08:56:43 +00:00
Chris Lattner	d2720878ec	Change to use the StableBasicBlockNumbering class llvm-svn: 14247	2004-06-19 08:42:40 +00:00
Chris Lattner	3aed3a8bf2	Do not let the numbering of PHI nodes placed in the function depend on non-deterministic things like the ordering of blocks in the dominance frontier of a BB. Unfortunately, I don't know of a better way to solve this problem than to explicitly sort the BB's in function-order before processing them. This is guaranteed to slow the pass down a bit, but is absolutely necessary to get usable diffs between two different tools executing the mem2reg or scalarrepl pass. Before this, bazillions of spurious diff failures occurred all over the place due to the different order of processing PHIs: - %tmp.111 = getelementptr %struct.Connector_struct* %upcon.0.0, uint 0, uint 0 + %tmp.111 = getelementptr %struct.Connector_struct* %upcon.0.1, uint 0, uint 0 Now, the diffs match. llvm-svn: 14244	2004-06-19 07:40:14 +00:00
Chris Lattner	6e7b76ce94	Do not sort by the address of LLVM ConstantInt* objects. This produces nondeterministic results that depend on where these objects land in memory. Instead, sort by the value of the constant, which is stable. Before this patch, the -simplifycfg pass run from two different compilers could cause different code to be generated, though it was semantically the same: @@ -12258,8 +12258,8 @@ %s_addr.1 = phi sbyte* [ %s, %entry ], [ %inc.0, %no_exit ] ; <sbyte*> [#uses=5] %tmp.1 = load sbyte* %s_addr.1 ; <sbyte> [#uses=1] switch sbyte %tmp.1, label %no_exit [ - sbyte 0, label %loopexit sbyte 46, label %loopexit + sbyte 0, label %loopexit ] We need to stomp all of this stuff out. llvm-svn: 14243	2004-06-19 07:02:14 +00:00
Chris Lattner	5b4da6dd16	Do not loop over uses as we delete them. This causes iterators to be invalidated out from under us. This bug goes back to revision 1.1: scary. llvm-svn: 14242	2004-06-19 02:02:22 +00:00
Chris Lattner	91b27e01b3	Implement Transforms/InstCombine/and.ll:test17, a common case that occurs due to unordered comparison macros in math.h llvm-svn: 14221	2004-06-18 06:07:51 +00:00
Chris Lattner	8f9fb1d2ea	Do not function resolve intrinsics. This prevents warnings and possible bad things from happening due to declare bool %llvm.isunordered(double, double) declare bool %llvm.isunordered(float, float) llvm-svn: 14219	2004-06-18 05:50:48 +00:00
Brian Gaeke	4a904bfdcc	I love the smell of a freshly broken PowerPC build in the morning. llvm-svn: 14206	2004-06-17 22:27:04 +00:00
Chris Lattner	13cea4ef6f	Fix compilation problem on freebsd. Problem noted by Vladimir Merzliakov in PR371 llvm-svn: 14203	2004-06-17 21:20:52 +00:00
Chris Lattner	0cd29ae2cd	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() llvm-svn: 14201	2004-06-17 18:19:28 +00:00
Chris Lattner	00b0a866dd	Rename Type::PrimitiveID to TypeId and ::getPrimitiveID() to ::getTypeID() Delete two functions that are now methods on the Type class llvm-svn: 14200	2004-06-17 18:16:02 +00:00
Brian Gaeke	a46100448f	Fix typo in DEBUG printout. llvm-svn: 14196	2004-06-17 07:26:52 +00:00
Brian Gaeke	cf3cb8944a	Um, did someone make a typo or something? llvm-svn: 14192	2004-06-15 23:09:50 +00:00
Chris Lattner	1adcf0441d	Remove support for the isnan intrinsic llvm-svn: 14186	2004-06-15 21:37:54 +00:00
Brian Gaeke	a4353cabab	Quick hack to get this file compiling again on Mac OS X. The right thing to do is write an autoconf macro that checks whether __isnan or isnan actually works using the C++ compiler after #include <cmath>, instead of doing it the easy way with AC_CHECK_FUNCS(). llvm-svn: 14171	2004-06-14 06:33:19 +00:00
Alkis Evlogimenos	dd550dc9cd	Add constant folding capabilities to the isunordered intrinsic. llvm-svn: 14168	2004-06-13 01:23:56 +00:00
Chris Lattner	66790212ba	Constant fold the isnan intrinsic llvm-svn: 14150	2004-06-11 06:16:23 +00:00
Chris Lattner	52654bf072	Fix a bug in my checkin from last night that caused miscompilations of 186.crafty, fhourstones and 132.ijpeg. Bugpoint makes really nasty miscompilations embarrassingly easy to find. It narrowed it down to the instcombiner and this testcase (from fhourstones): bool %l7153_l4706_htstat_loopentry_2E_4_no_exit_2E_4(int* %i, [32 x int]* %works, int* %tmp.98.out) { newFuncRoot: %tmp.96 = load int* %i ; <int> [#uses=1] %tmp.97 = getelementptr [32 x int]* %works, long 0, int %tmp.96 ; <int*> [#uses=1] %tmp.98 = load int* %tmp.97 ; <int> [#uses=2] %tmp.99 = load int* %i ; <int> [#uses=1] %tmp.100 = and int %tmp.99, 7 ; <int> [#uses=1] %tmp.101 = seteq int %tmp.100, 7 ; <bool> [#uses=2] %tmp.102 = cast bool %tmp.101 to int ; <int> [#uses=0] br bool %tmp.101, label %codeRepl4.exitStub, label %codeRepl3.exitStub codeRepl4.exitStub: ; preds = %newFuncRoot store int %tmp.98, int* %tmp.98.out ret bool true codeRepl3.exitStub: ; preds = %newFuncRoot store int %tmp.98, int* %tmp.98.out ret bool false } ... which only has one combination performed on it: $ llvm-as < t.ll | opt -instcombine -debug | llvm-dis IC: Old = %tmp.101 = seteq int %tmp.100, 7 ; <bool> [#uses=1] New = setne int %tmp.100, 0 ; <bool>:<badref> [#uses=0] IC: MOD = br bool %tmp.101, label %codeRepl3.exitStub, label %codeRepl4.exitStub IC: MOD = %tmp.97 = getelementptr [32 x int]* %works, uint 0, int %tmp.96 ; <int*> [#uses=1] It doesn't get much better than this. :) llvm-svn: 14109	2004-06-10 02:33:20 +00:00
Chris Lattner	18cab818db	More minor cleanups llvm-svn: 14108	2004-06-10 02:12:35 +00:00
Chris Lattner	ba89f54271	Eliminate many occurrences of Instruction:: llvm-svn: 14107	2004-06-10 02:07:29 +00:00
Chris Lattner	b2434f1222	Implement InstCombine/select.ll:test15* llvm-svn: 14095	2004-06-09 07:59:58 +00:00
Chris Lattner	90a2bbc74c	Be more careful about the order we put stuff onto the worklist. This allows us to collapse this: bool %le(int %A, int %B) { %c1 = setgt int %A, %B %tmp = select bool %c1, int 1, int 0 %c2 = setlt int %A, %B %result = select bool %c2, int -1, int %tmp %c3 = setle int %result, 0 ret bool %c3 } into: bool %le(int %A, int %B) { %c3 = setle int %A, %B ; <bool> [#uses=1] ret bool %c3 } which is handy, because the Java FE makes these sequences all over the place. This is tested as: test/Regression/Transforms/InstCombine/JavaCompare.ll llvm-svn: 14086	2004-06-09 05:08:07 +00:00
Chris Lattner	a7b01d4467	Implement select.ll:test14* llvm-svn: 14083	2004-06-09 04:24:29 +00:00
Brian Gaeke	68eb7345df	Expand head-of-file comment. llvm-svn: 13982	2004-06-03 05:03:02 +00:00
Brian Gaeke	56ae0e9023	Use new form of unconditional branch constructor. llvm-svn: 13930	2004-06-01 20:06:10 +00:00
Chris Lattner	d702e10ea0	Fix one of the major things that is causing the C Backend to infinite loop llvm-svn: 13872	2004-05-28 05:02:13 +00:00
John Criswell	e56868d266	Fix a bug in the -deadtypeelim pass. The SymbolTable re-write changed it to eliminate the wrong type. llvm-svn: 13855	2004-05-27 21:16:46 +00:00
Chris Lattner	97bd391386	Fix InstCombine/load.ll & PR347. This code hadn't been updated after the "structs with more than 256 elements" related changes to the GEP instruction. Also it was not handling the ConstantAggregateZero class. Now it does! llvm-svn: 13834	2004-05-27 17:30:27 +00:00
Chris Lattner	c65ff058f3	Implement constant folding of fmod, which is used a lot in povray llvm-svn: 13823	2004-05-27 07:25:00 +00:00
Chris Lattner	e91dc2545c	Restructure call constant folding code a bit to make it simpler. Add support for acos/asin/atan. 188.ammp contains three calls to acos with constant arguments. Constant folding it allows elimination of those 3 calls and three FP divisions of the results. llvm-svn: 13821	2004-05-27 06:26:28 +00:00
Alkis Evlogimenos	7542a978fc	Do not pass a null pointer if this instruction is not prepended or appended anywhere. llvm-svn: 13798	2004-05-26 22:50:28 +00:00
Alkis Evlogimenos	3c45e590c5	Use one destination constructor for the unconditional branch. llvm-svn: 13792	2004-05-26 21:38:14 +00:00
Reid Spencer	fec48b0d9d	Convert to SymbolTable's new iteration interface. llvm-svn: 13754	2004-05-25 08:53:40 +00:00
Reid Spencer	08e75b22a5	Convert to SymbolTable's new lookup and iteration interfaces. llvm-svn: 13751	2004-05-25 08:52:20 +00:00
Reid Spencer	09c1dbce28	Remove unused header file. llvm-svn: 13750	2004-05-25 08:51:36 +00:00
Reid Spencer	169865d831	Make this pass simply invoke SymbolTable::strip(). llvm-svn: 13749	2004-05-25 08:51:25 +00:00
Chris Lattner	66f55c8e78	Implement InstCombine:shift.ll:test16, which turns (X >> C1) & C2 != C3 into (X & (C2 << C1)) != (C3 << C1), where the shift may be either left or right and the compare may be any one. This triggers 1546 times in 176.gcc alone, as it is a common pattern that occurs for bitfield accesses. llvm-svn: 13740	2004-05-25 06:32:08 +00:00
Chris Lattner	b8fb5e789b	Implement instcombine/cast.ll:test16: Canonicalize cast X to bool into a setne instruction llvm-svn: 13736	2004-05-25 04:29:21 +00:00
Chris Lattner	79409ebc27	Fix a bug in my previous checkin llvm-svn: 13717	2004-05-24 06:24:46 +00:00
Chris Lattner	52e7345268	Spelling people's names right is kinda important llvm-svn: 13702	2004-05-23 21:27:29 +00:00
Chris Lattner	fbdf40f86a	Fix cases where we missed inlining some more obvious candidates because the caller was in an SCC. llvm-svn: 13693	2004-05-23 21:22:17 +00:00
Chris Lattner	bf8b81252f	Simplify the interface and remove an unneeded #include llvm-svn: 13692	2004-05-23 21:21:35 +00:00
Chris Lattner	fee8ce6131	Fairly substantial changes to update the alias analysis we are querying as we make the transformation. This allows us to use interprocedural alias analyses successfully. llvm-svn: 13691	2004-05-23 21:21:17 +00:00
Chris Lattner	cf81f974b8	Adjust to the changes in the AliasSetTracker interface llvm-svn: 13690	2004-05-23 21:20:19 +00:00
Chris Lattner	33de90e8c6	Add support for replacement of formal arguments with simpler expressions. llvm-svn: 13689	2004-05-23 21:19:55 +00:00
Chris Lattner	f113f6a630	Implement the -lowergc pass which is used by code generators (like the CBE) that do not have builtin support for garbage collection. llvm-svn: 13688	2004-05-23 21:19:22 +00:00
Brian Gaeke	16bd3c5d34	Add CloneTraceInto(), which is based on (and has mostly the same effects as) CloneFunctionInto(). llvm-svn: 13601	2004-05-19 09:08:14 +00:00
Brian Gaeke	9adf9d8bfc	Move RemapInstruction() to ValueMapper, so that it can be shared with CloneTrace, and because it is primarily an operation on ValueMaps. It is now a global (non-static) function which can be pulled in using ValueMapper.h. llvm-svn: 13600	2004-05-19 09:08:12 +00:00
Brian Gaeke	b41b628afd	Clean up this pass somewhat: Add better comments, including a better head-of-file comment. Prune #includes. Fix a FIXME that Chris put here by using doInitialization(). Use DEBUG() to print out debug msgs. Give names to basic blocks inserted by this pass. Expand tabs. Use InsertProfilingInitCall() from ProfilingUtils to insert the initialize call. llvm-svn: 13581	2004-05-14 21:21:52 +00:00
Chris Lattner	729d1ba904	This was not meant to be committed llvm-svn: 13565	2004-05-13 20:56:34 +00:00
Chris Lattner	b296747100	Fix a nasty bug that caused us to unroll EXTREMELY large loops due to overflow in the size calculation. This is not something you want to see: Loop Unroll: F[main] Loop %no_exit Loop Size = 2 Trip Count = 2147483648 - UNROLLING! The problem was that 2*2147483648 == 0. Now we get: Loop Unroll: F[main] Loop %no_exit Loop Size = 2 Trip Count = 2147483648 - TOO LARGE: 4294967296>100 Thanks to some anonymous person playing with the demo page that repeatedly caused zion to go into swapping land. That's one way to ensure you'll get a quick bugfix. :) Testcase here: Transforms/LoopUnroll/2004-05-13-DontUnrollTooMuch.ll llvm-svn: 13564	2004-05-13 20:43:31 +00:00
Chris Lattner	f97ef5191a	Do not pass in the same argument to the extracted function more than once, and give the extracted function a more useful name than just foo_code. llvm-svn: 13493	2004-05-12 16:26:18 +00:00
Chris Lattner	645130ed0e	Implement support for code extracting basic blocks that have a return instruction in them. llvm-svn: 13490	2004-05-12 16:07:41 +00:00
Chris Lattner	ed40ce44d6	Implement splitting of PHI nodes, allowing block extraction of BB's that have PHI node entries from multiple outside-the-region blocks. This also fixes extraction of the entry block in a function. Yaay. This has successfully block extracted all (but one) block from the score_move function in obsequi (out of 33). Hrm, I wonder which block the bug is in. :) llvm-svn: 13489	2004-05-12 15:29:13 +00:00
Chris Lattner	3ee79b93d7	* Pull some code out into the definedInRegion/definedInCaller methods * Add a stub for the severSplitPHINodes which will allow us to bbextract bb's with PHI nodes in them soon. * Remove unused arguments from findInputsOutputs * Dramatically simplify the code in findInputsOutputs. In particular, nothing really cares whether or not a PHI node is using something. * Move moveCodeToFunction to after emitCallAndSwitchStatement as that's the order they get called. * Fix a bug where we would code extract a region that included a call to vastart. Like 'alloca', calls to vastart must stay in the function that they are defined in. * Add some comments. llvm-svn: 13482	2004-05-12 06:01:40 +00:00
Chris Lattner	67e58adb41	Generate substantially better code when there are a limited number of exits from the extracted region. If the return has 0 or 1 exit blocks, the new function returns void. If it has 2 exits, it returns bool, otherwise it returns a ushort as before. This allows us to use a conditional branch instruction when there are two exit blocks, as often happens during block extraction. llvm-svn: 13481	2004-05-12 04:14:24 +00:00
Chris Lattner	7f4cd3b0be	Two minor improvements: 1. Get rid of the silly abort block. When doing bb extraction, we get one abort block for every block extracted, which is kinda annoying. 2. If the switch ends up having a single destination, turn it into an unconditional branch. I would like to add support for conditional branches, but to do this we will want to have the function return a bool instead of a ushort. llvm-svn: 13478	2004-05-12 03:22:33 +00:00
Chris Lattner	0efd1cb264	Fix stupid bug in my checkin yesterday llvm-svn: 13429	2004-05-08 22:41:42 +00:00
Chris Lattner	e3b3e333b0	Implement folding of GEP's like: %tmp.0 = getelementptr [50 x sbyte]* %ar, uint 0, int 5 ; <sbyte*> [#uses=2] %tmp.7 = getelementptr sbyte* %tmp.0, int 8 ; <sbyte*> [#uses=1] together. This patch actually allows us to simplify and generalize the code. llvm-svn: 13415	2004-05-07 22:09:22 +00:00
Chris Lattner	6340390476	Fix PR336: The instcombine pass asserts when visiting load instruction llvm-svn: 13400	2004-05-07 15:35:56 +00:00
Chris Lattner	05f657f5c2	Do not mark instructions in unreachable sections of the function as live. This fixes PR332 and ADCE/2004-05-04-UnreachableBlock.llx llvm-svn: 13349	2004-05-04 17:00:46 +00:00
Chris Lattner	7896144611	Minor efficiency tweak, suggested by Patrick Meredith llvm-svn: 13341	2004-05-04 15:19:33 +00:00
Brian Gaeke	a5b32230db	Fix typo llvm-svn: 13340	2004-05-03 23:52:07 +00:00
Brian Gaeke	dcfc3c580e	In InsertProfilingInitCall(), make it legal to pass in a null array, in which case you'll get a null array and zero passed to the profiling function. llvm-svn: 13336	2004-05-03 22:06:33 +00:00
Brian Gaeke	c8cd0e9092	Add initial implementation of basic-block tracing instrumentation pass. llvm-svn: 13335	2004-05-03 22:06:32 +00:00
Chris Lattner	d8345001fa	Do not clone arbitrary condition instructions. llvm-svn: 13316	2004-05-02 05:19:36 +00:00
Chris Lattner	da2d746a3b	Do not infinitely "unroll" single BB loops. llvm-svn: 13315	2004-05-02 05:02:03 +00:00
Chris Lattner	5f393764c8	Don't merge terminators that are needed to select PHI node values. llvm-svn: 13312	2004-05-02 01:00:44 +00:00
Chris Lattner	bd705d7776	Implement SimplifyCFG/branch-cond-merge.ll Turning "if (A < B && B < C)" into "if (A < B & B < C)" llvm-svn: 13311	2004-05-01 23:35:43 +00:00
Chris Lattner	eb59aec632	Make sure to reprocess instructions used by deleted instructions to avoid missing opportunities for combination. llvm-svn: 13309	2004-05-01 23:27:23 +00:00
Chris Lattner	f5a5668cf6	Make sure the instruction combiner doesn't lose track of instructions when replacing them, missing the opportunity to do simplifications llvm-svn: 13308	2004-05-01 23:19:52 +00:00
Chris Lattner	911e21e8ca	Fix my missing parens llvm-svn: 13307	2004-05-01 22:41:51 +00:00
Chris Lattner	82278b599b	Implement SimplifyCFG/branch-cond-prop.ll llvm-svn: 13306	2004-05-01 22:36:37 +00:00
Chris Lattner	9b53c7c797	Fix a major pessimization in the instcombiner. If an allocation instruction is only used by a cast, and the casted type is the same size as the original allocation, it would eliminate the cast by folding it into the allocation. Unfortunately, it was placing the new allocation instruction right before the cast, which could pull (for example) alloca instructions into the body of a function. This turns statically allocatable allocas into expensive dynamically allocated allocas, which is bad bad bad. This fixes the problem by placing the new allocation instruction at the same place the old one was, duh. :) llvm-svn: 13289	2004-04-30 04:37:52 +00:00
Chris Lattner	02c65b5395	Changes to fix up the inst_iterator to pass to boost iterator checks. This patch was graciously contributed by Vladimir Prus. llvm-svn: 13185	2004-04-27 15:13:33 +00:00
Chris Lattner	5bad19bde7	Instcombine X/-1 --> 0-X llvm-svn: 13172	2004-04-26 14:01:59 +00:00
Misha Brukman	144b5572e1	* Allow aggregating extracted function arguments (controlled by flag) * Commandline option (for now) controls that flag that is passed in llvm-svn: 13141	2004-04-23 23:54:17 +00:00
Chris Lattner	b72fa5f541	Move the scev expansion code into this pass, where it belongs. There is still room for cleanup, but at least the code modification is out of the analysis now. llvm-svn: 13135	2004-04-23 21:29:48 +00:00
Misha Brukman	1dc8e19185	Clarify the logic: the flag is renamed to `deleteFn' to signify it will delete the function instead of isolating it. This also means the condition is reversed. llvm-svn: 13112	2004-04-22 23:00:51 +00:00
Misha Brukman	ddace6ecbe	Add a flag to choose between isolating a function or deleting the function from the Module. The default behavior keeps functionality as before: the chosen function is the one that remains. llvm-svn: 13111	2004-04-22 22:52:22 +00:00
Chris Lattner	6716bcd0cc	Disable a previous patch that was causing indvars to loop infinitely :( llvm-svn: 13108	2004-04-22 15:12:36 +00:00
Chris Lattner	507ba1c3c3	Fix an extremely serious thinko I made in revision 1.60 of this file. llvm-svn: 13106	2004-04-22 14:59:40 +00:00
Chris Lattner	d1906e2ace	Implement a todo, rewriting all possible scev expressions inside of the loop. This eliminates the extra add from the previous case, but it's not clear that this will be a performance win overall. Tomorrow's test results will tell. :) llvm-svn: 13103	2004-04-21 23:36:08 +00:00
Chris Lattner	96752d27f4	This code really wants to iterate over the OPERANDS of an instruction, not over its USES. If it's dead it doesn't have any uses! :) Thanks to the fabulous and mysterious Bill Wendling for pointing this out. :) llvm-svn: 13102	2004-04-21 22:29:37 +00:00
Chris Lattner	a3e2004609	Implement a fixme. This helps loops that have induction variables of different types in them. Instead of creating an induction variable for all types, it creates a single induction variable and casts to the other sizes. This generates this code: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=4] *** %j.0.0 = cast uint %indvar to short ; <short> [#uses=1] %indvar = cast uint %indvar to int ; <int> [#uses=1] %tmp.7 = getelementptr short* %P, uint %indvar ; <short*> [#uses=1] store short %j.0.0, short* %tmp.7 %inc.0 = add int %indvar, 1 ; <int> [#uses=2] %tmp.2 = setlt int %inc.0, %N ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %no_exit, label %loopexit instead of: no_exit: ; preds = %entry, %no_exit %indvar = phi ushort [ %indvar.next, %no_exit ], [ 0, %entry ] ; <ushort> [#uses=2] *** %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=3] %indvar = cast uint %indvar to int ; <int> [#uses=1] %indvar = cast ushort %indvar to short ; <short> [#uses=1] %tmp.7 = getelementptr short* %P, uint %indvar ; <short*> [#uses=1] store short %indvar, short* %tmp.7 %inc.0 = add int %indvar, 1 ; <int> [#uses=2] %tmp.2 = setlt int %inc.0, %N ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 *** %indvar.next = add ushort %indvar, 1 br bool %tmp.2, label %no_exit, label %loopexit This is an improvement in register pressure, but probably doesn't happen that often. The more important fix will be to get rid of the redundant add. llvm-svn: 13101	2004-04-21 22:22:01 +00:00
Chris Lattner	7d02bae5d8	Fix an incredibly nasty iterator invalidation problem. I am too spoiled by ilists :) Eventually it would be nice if CallGraph maintained an ilist of CallGraphNode's instead of a vector of pointers to them, but today is not that day. llvm-svn: 13100	2004-04-21 20:44:33 +00:00
Alkis Evlogimenos	904f4f9a21	Include cerrno (gcc-3.4 fix) llvm-svn: 13091	2004-04-21 16:11:40 +00:00
Chris Lattner	ade6ddc694	Fix typeo llvm-svn: 13089	2004-04-21 14:23:18 +00:00
Chris Lattner	15eb3c1f39	REALLY fix PR324: don't delete linkonce functions until after the SCC traversal is done, which avoids invalidating iterators in the SCC traversal routines llvm-svn: 13088	2004-04-20 22:06:53 +00:00
Chris Lattner	602146eea1	Fix PR325 llvm-svn: 13081	2004-04-20 20:26:03 +00:00
Chris Lattner	5a1e3f099f	Fix PR324 and testcase: Inline/2004-04-20-InlineLinkOnce.llx llvm-svn: 13080	2004-04-20 20:20:59 +00:00
Chris Lattner	29f69938e7	Initial checkin of a simple loop unswitching pass. It still needs work, but it's a start, and seems to do its basic job. llvm-svn: 13068	2004-04-19 18:07:02 +00:00
Chris Lattner	1849aa8b1f	Add #include llvm-svn: 13057	2004-04-19 03:01:23 +00:00
Chris Lattner	ab6502f058	Move isLoopInvariant to the Loop class llvm-svn: 13051	2004-04-18 22:46:08 +00:00
Chris Lattner	5a0ed18724	Correct rewriting of exit blocks after my last patch llvm-svn: 13048	2004-04-18 22:27:10 +00:00
Chris Lattner	8e42c6f409	Loop exit sets are no longer explicitly held, they are dynamically computed on demand. llvm-svn: 13046	2004-04-18 22:15:13 +00:00
Chris Lattner	7174acca00	Change the ExitBlocks list from being explicitly contained in the Loop structure to being dynamically computed on demand. This makes updating loop information MUCH easier. llvm-svn: 13045	2004-04-18 22:14:10 +00:00
Chris Lattner	13140766df	Reduce the unrolling limit llvm-svn: 13040	2004-04-18 18:06:14 +00:00
Chris Lattner	430968ac2f	If the preheader of the loop was the entry block of the function, make sure that the exit block of the loop becomes the new entry block of the function. This was causing a verifier assertion on 252.eon. llvm-svn: 13039	2004-04-18 17:38:42 +00:00
Chris Lattner	199b58db3f	Be much more careful about how we update instructions outside of the loop using instructions inside of the loop. This should fix the MishaTest failure from last night. llvm-svn: 13038	2004-04-18 17:32:39 +00:00
Chris Lattner	33ec7f2f9f	After unrolling our single basic block loop, fold it into the preheader and exit block. The primary motivation for doing this is that we can now unroll nested loops. This makes a pretty big difference in some cases. For example, in 183.equake, we are now beating the native compiler with the CBE, and we are a lot closer with LLC. I'm now going to play around a bit with the unroll factor and see what effect it really has. llvm-svn: 13034	2004-04-18 06:27:43 +00:00
Chris Lattner	f2045a8c05	Fix a bug: this does not preserve the CFG! While we're at it, add support for updating loop information correctly. llvm-svn: 13033	2004-04-18 05:38:37 +00:00
Chris Lattner	b0d23bf99d	Initial checkin of a simple loop unroller. This pass is extremely basic and limited. Even in its extremely simple state (it can only fully unroll single basic block loops that execute a constant number of times), it already helps improve performance a LOT on some benchmarks, particularly with the native code generators. llvm-svn: 13028	2004-04-18 05:20:17 +00:00
Chris Lattner	e0f56972f0	Make the tail duplication threshold accessible from the command line instead of hardcoded llvm-svn: 13025	2004-04-18 00:52:43 +00:00
Chris Lattner	740ae78ae6	If the loop executes a constant number of times, try a bit harder to replace exit values. llvm-svn: 13018	2004-04-17 18:44:09 +00:00
Chris Lattner	bcb690dc9b	Fix a HUGE pessimization on X86. The indvars pass was taking this (familiar) function: int _strlen(const char *str) { int len = 0; while (*str++) len++; return len; } And transforming it to use a ulong induction variable, because the type of the pointer index was left as a constant long. This is obviously very bad. The fix is to shrink long constants in getelementptr instructions to intptr_t, making the indvars pass insert a uint induction variable, which is much more efficient. Here's the before code for this function: int %_strlen(sbyte* %str) { entry: %tmp.13 = load sbyte* %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit * %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=2] * %indvar = phi ulong [ %indvar.next, %no_exit ], [ 0, %entry ] ; <ulong> [#uses=2] %indvar1 = cast ulong %indvar to uint ; <uint> [#uses=1] %inc.02.sum = add uint %indvar1, 1 ; <uint> [#uses=1] %inc.0.0 = getelementptr sbyte* %str, uint %inc.02.sum ; <sbyte*> [#uses=1] %tmp.1 = load sbyte* %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add ulong %indvar, 1 ; <ulong> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit.loopexit, label %no_exit loopexit.loopexit: ; preds = %no_exit %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] ret int %inc.1 loopexit: ; preds = %entry ret int 0 } Here's the after code: int %_strlen(sbyte* %str) { entry: %inc.02 = getelementptr sbyte* %str, uint 1 ; <sbyte*> [#uses=1] %tmp.13 = load sbyte* %str ; <sbyte> [#uses=1] %tmp.24 = seteq sbyte %tmp.13, 0 ; <bool> [#uses=1] br bool %tmp.24, label %loopexit, label %no_exit no_exit: ; preds = %entry, %no_exit *** %indvar = phi uint [ %indvar.next, %no_exit ], [ 0, %entry ] ; <uint> [#uses=3] %indvar = cast uint %indvar to int ; <int> [#uses=1] %inc.0.0 = getelementptr sbyte* %inc.02, uint %indvar ; <sbyte*> [#uses=1] %inc.1 = add int %indvar, 1 ; <int> [#uses=1] %tmp.1 = load sbyte* %inc.0.0 ; <sbyte> [#uses=1] %tmp.2 = seteq sbyte %tmp.1, 0 ; <bool> [#uses=1] %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] br bool %tmp.2, label %loopexit, label %no_exit loopexit: ; preds = %entry, %no_exit %len.0.1 = phi int [ 0, %entry ], [ %inc.1, %no_exit ] ; <int> [#uses=1] ret int %len.0.1 } llvm-svn: 13016	2004-04-17 18:16:10 +00:00
Chris Lattner	5c85946417	Even if there are not any induction variables in the loop, if we can compute the trip count for the loop, insert one so that we can canonicalize the exit condition. llvm-svn: 13015	2004-04-17 18:08:33 +00:00
Chris Lattner	6d5decd7d4	Add support for evaluation of exp/log/log10/pow llvm-svn: 13011	2004-04-16 22:35:33 +00:00
Chris Lattner	ed423cc09d	Fix some really nasty dominance bugs that were exposed by my patch to make the verifier more strict. This fixes building zlib llvm-svn: 13002	2004-04-16 18:08:07 +00:00
Brian Gaeke	4b9f67c638	Include <cmath> for compatibility with gcc 3.0.x (the system compiler on Debian.) llvm-svn: 12986	2004-04-16 15:57:32 +00:00
Chris Lattner	bc458be5f9	Fix some of the strange CBE-only failures that happened last night. llvm-svn: 12980	2004-04-16 06:03:17 +00:00
Chris Lattner	06eda01d1b	Fix Inline/2004-04-15-InlineDeletesCall.ll Basically we were using SimplifyCFG as a huge sledgehammer for a simple optimization. Because simplifycfg does so many things, we can't use it for this purpose. llvm-svn: 12977	2004-04-16 05:17:59 +00:00
Chris Lattner	ac2b465cb4	Fix a bug in the previous checkin: if the exit block is not the same as the back-edge block, we must check the preincremented value. llvm-svn: 12968	2004-04-15 20:26:22 +00:00
Chris Lattner	dcf2ca93e6	Change the canonical induction variable that we insert. Instead of producing code like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X != N-1) goto Loop We now generate code that looks like this: Loop: X = phi 0, X2 ... X2 = X + 1 if (X2 != N) goto Loop This has two big advantages: 1. The trip count of the loop is now explicit in the code, allowing the direct implementation of Loop::getTripCount() 2. This reduces register pressure in the loop, and allows X and X2 to be put into the same register. As a consequence of the second point, the code we generate for loops went from: .LBB2: # no_exit.1 ... mov %EDI, %ESI inc %EDI cmp %ESI, 2 mov %ESI, %EDI jne .LBB2 # PC rel: no_exit.1 To: .LBB2: # no_exit.1 ... inc %ESI cmp %ESI, 3 jne .LBB2 # PC rel: no_exit.1 ... which has two fewer moves, and uses one less register. llvm-svn: 12961	2004-04-15 15:21:43 +00:00
Chris Lattner	6fcf8c7402	Add a trivial instcombine: load null -> null llvm-svn: 12940	2004-04-14 03:28:36 +00:00
Chris Lattner	545e77c9d5	Add SCCP support for constant folding calls, implementing: test/Regression/Transforms/SCCP/calltest.ll llvm-svn: 12921	2004-04-13 19:43:54 +00:00
Chris Lattner	778f09027f	Add a simple call constant propagation interface. llvm-svn: 12919	2004-04-13 19:28:52 +00:00
Chris Lattner	8c0e9c95e9	Constant propagation should remove the dead instructions llvm-svn: 12917	2004-04-13 19:28:20 +00:00
Chris Lattner	70f6a0ddcf	Fix LoopSimplify/2004-04-13-LoopSimplifyUpdateDomFrontier.ll LoopSimplify was not updating dominator frontiers correctly in some cases. llvm-svn: 12890	2004-04-13 16:23:25 +00:00
Chris Lattner	ae63c235f9	Refactor code a bit to make it simpler and eliminate the goto llvm-svn: 12888	2004-04-13 15:21:18 +00:00
Chris Lattner	faf377df58	This patch addresses PR35: Loop simplify should reconstruct nested loops. This is fairly straight-forward, but was a real nightmare to get just perfect. aarg. :) llvm-svn: 12884	2004-04-13 05:05:33 +00:00
Chris Lattner	a3d3872a88	Actually update the call graph as the inliner changes it. This allows us to execute other CallGraphSCCPasses after the inliner without crashing. llvm-svn: 12861	2004-04-12 05:37:29 +00:00
Chris Lattner	603d821596	Add support for removing invoke instructions llvm-svn: 12858	2004-04-12 05:15:13 +00:00
Chris Lattner	af22e5f826	Stop printing Function* llvm-svn: 12857	2004-04-12 04:06:56 +00:00
Chris Lattner	1c83ee0436	Simplify code a bit, and be sure to mark the external node as potentially throwing llvm-svn: 12856	2004-04-12 04:06:38 +00:00
Chris Lattner	ca1428e01c	Fix a bug in my select transformation llvm-svn: 12826	2004-04-11 01:39:19 +00:00
Chris Lattner	777977bf2e	Update the value numbering interface. llvm-svn: 12824	2004-04-10 22:33:34 +00:00
Chris Lattner	96fca8de3d	Implement InstCombine/select.ll:test13* llvm-svn: 12821	2004-04-10 22:21:27 +00:00
Chris Lattner	618c89d5eb	Implement InstCombine/add.ll:test20 Canonicalize add of sign bit constant into a xor llvm-svn: 12819	2004-04-10 22:01:55 +00:00
Chris Lattner	6d569b52ed	Rewrite the GCSE pass to be substantially simpler, a bit more efficient, and a bit more powerful llvm-svn: 12817	2004-04-10 21:11:11 +00:00
Chris Lattner	22c22de2f0	Fix spurious warning in release mode llvm-svn: 12816	2004-04-10 19:15:56 +00:00
Chris Lattner	924b6c173c	Simplify code a bit, and fix a bug that was breaking perlbmk llvm-svn: 12814	2004-04-10 18:06:21 +00:00
Chris Lattner	f126f03878	Fix a bug in my checkin last night that was breaking programs using invoke. llvm-svn: 12813	2004-04-10 16:53:29 +00:00
Chris Lattner	d4979e2904	Fix previous patch llvm-svn: 12811	2004-04-10 07:27:48 +00:00
Chris Lattner	3b211f0432	Correctly update counters llvm-svn: 12810	2004-04-10 07:02:02 +00:00
Chris Lattner	1676188024	Simplify code a bit, and use alias analysis to allow us to delete unused call and invoke instructions that are known to not write to memory. llvm-svn: 12807	2004-04-10 06:53:09 +00:00
Chris Lattner	306540a2f4	Implement select.ll:test12* This transforms code like this: %C = or %A, %B %D = select %cond, %C, %A into: %C = select %cond, %B, 0 %D = or %A, %C Since B is often a constant, the select can often be eliminated. In any case, this reduces the usage count of A, allowing subsequent optimizations to happen. This xform applies when the operator is any of: add, sub, mul, or, xor, and, shl, shr llvm-svn: 12800	2004-04-09 23:46:01 +00:00
Chris Lattner	8ccddbd123	Fold code like: if (C) V1 |= V2; into: Vx = V1 | V2; V1 = select C, V1, Vx when the expression can be evaluated unconditionally and is cheap to execute. This limited form of if conversion is quite handy in lots of cases. For example, it turns this testcase into straight-line code: int in0 ; int in1 ; int in2 ; int in3 ; int in4 ; int in5 ; int in6 ; int in7 ; int in8 ; int in9 ; int in10; int in11; int in12; int in13; int in14; int in15; long output; void mux(void) { output = (in0 ? 0x00000001 : 0) | (in1 ? 0x00000002 : 0) | (in2 ? 0x00000004 : 0) | (in3 ? 0x00000008 : 0) | (in4 ? 0x00000010 : 0) | (in5 ? 0x00000020 : 0) | (in6 ? 0x00000040 : 0) | (in7 ? 0x00000080 : 0) | (in8 ? 0x00000100 : 0) | (in9 ? 0x00000200 : 0) | (in10 ? 0x00000400 : 0) | (in11 ? 0x00000800 : 0) | (in12 ? 0x00001000 : 0) | (in13 ? 0x00002000 : 0) | (in14 ? 0x00004000 : 0) | (in15 ? 0x00008000 : 0) ; } llvm-svn: 12798	2004-04-09 22:50:22 +00:00
Chris Lattner	3a6e4b9a35	Fold binary operators with a constant operand into select instructions that have a constant operand. This implements add.ll:test19, shift.ll:test15*, and others that are not tested llvm-svn: 12794	2004-04-09 19:05:30 +00:00
Chris Lattner	0e1f5553df	Implement select.ll:test11 llvm-svn: 12793	2004-04-09 18:19:44 +00:00
Chris Lattner	0ca3cbfa5e	Implement InstCombine/cast-propagate.ll llvm-svn: 12784	2004-04-08 20:39:49 +00:00
Chris Lattner	d8efae05fe	Implement ScalarRepl/select_promote.ll llvm-svn: 12779	2004-04-08 19:59:34 +00:00
Chris Lattner	77beb73ce2	Remove the "really gross hacks" that are there to deal with recursive functions. Now we collect all of the call sites we are interested in inlining, then inline them. This entirely avoids issues with trying to inline a call site we got by inlining another call site. This also eliminates iterator invalidation issues. llvm-svn: 12770	2004-04-08 06:34:31 +00:00
Chris Lattner	cf8117ccbd	Implement InstCombine/select.ll:test[7-10] llvm-svn: 12769	2004-04-08 04:43:23 +00:00
Chris Lattner	2e89e48999	Implement test/Regression/Transforms/InstCombine/getelementptr_index.ll llvm-svn: 12762	2004-04-07 18:38:20 +00:00
Chris Lattner	c92af54ed5	Fix a bug in yesterday's checkins which broke siod. siod is a great testcase! :) llvm-svn: 12659	2004-04-05 16:02:41 +00:00
Chris Lattner	6c961339a3	Fix InstCombine/2004-04-04-InstCombineReplaceAllUsesWith.ll llvm-svn: 12658	2004-04-05 02:10:19 +00:00
Chris Lattner	9236135e8f	Support getelementptr instructions which use uint's to index into structure types and can have arbitrary 32- and 64-bit integer types indexing into sequential types. llvm-svn: 12653	2004-04-05 01:30:19 +00:00
Chris Lattner	cf5bd8a9ab	Rewrite the indvars pass to use the ScalarEvolution analysis. This also implements some new features for the indvars pass, including linear function test replacement, exit value substitution, and it works with a much more general class of induction variables and loops. llvm-svn: 12620	2004-04-02 20:24:31 +00:00
Chris Lattner	3f202e3a54	Fix the obvious bug in my previous checkin llvm-svn: 12618	2004-04-02 18:15:10 +00:00
Chris Lattner	bca948c99d	Implement Transforms/SimplifyCFG/return-merge.ll This actually causes us to turn code like: return C ? A : B; into a select instruction. llvm-svn: 12617	2004-04-02 18:13:43 +00:00
Chris Lattner	973cb73b4f	Fix PR310 and TailDup/2004-04-01-DemoteRegToStack.llx llvm-svn: 12597	2004-04-01 20:28:45 +00:00
Chris Lattner	441ab4b903	Remove some assertions that are now bogus with the last patch I put in llvm-svn: 12595	2004-04-01 19:21:46 +00:00
Chris Lattner	eda638b0be	Fix PR306: Loop simplify incorrectly updates dominator information Testcase: LoopSimplify/2004-04-01-IncorrectDomUpdate.ll llvm-svn: 12592	2004-04-01 19:06:07 +00:00
Chris Lattner	6aaea5f86b	Add warning llvm-svn: 12573	2004-03-31 22:00:30 +00:00
Chris Lattner	1c0ddbfb7d	Fix linking of constant expr casts due to type resolution changes. With this and the other patches 253.perlbmk links again. llvm-svn: 12565	2004-03-31 02:58:28 +00:00
Brian Gaeke	59c80cfd05	Start cleaning up this pass so that I can debug it. llvm-svn: 12548	2004-03-30 19:53:46 +00:00
Chris Lattner	145aea5c4c	Now that all the code generators support the select instruction, and the instcombine pass can eliminate many nasty cases of them, start generating them in the optimizers llvm-svn: 12545	2004-03-30 19:44:05 +00:00
Chris Lattner	b6612acb18	Implement select.ll:test[3-6] llvm-svn: 12544	2004-03-30 19:37:13 +00:00
Chris Lattner	58a6a4d57a	Add a simple select instruction lowering pass llvm-svn: 12540	2004-03-30 18:41:10 +00:00
Chris Lattner	d191e5625c	X % -1 == X % 1 == 0 llvm-svn: 12520	2004-03-26 16:11:24 +00:00
Chris Lattner	e15fb6ac61	Two changes: #1 is to unconditionally strip constantpointerrefs out of instruction operands where they are absolutely pointless and inhibit optimization. GRRR! #2 is to implement InstCombine/getelementptr_const.ll llvm-svn: 12519	2004-03-25 22:59:29 +00:00
Chris Lattner	078f97b50d	Teach the optimizer to delete zero sized alloca's (but not mallocs!) llvm-svn: 12507	2004-03-19 06:08:10 +00:00
Chris Lattner	0f0a253571	Fix bug: CodeExtractor/2004-03-17-MissedLiveIns.ll With this fix we now successfully extract all 149 loops from 256.bzip2 without crashing or miscompiling the program! llvm-svn: 12493	2004-03-18 05:56:32 +00:00
Chris Lattner	521d687d11	Add statistics to the loop extractor. The loop extractor has successfully extracted all 63 loops for Olden/bh without crashing and without miscompiling the program!!! llvm-svn: 12491	2004-03-18 05:46:10 +00:00
Chris Lattner	c835211d82	Fix problem with PHI nodes having multiple predecessors from different exit nodes llvm-svn: 12490	2004-03-18 05:43:18 +00:00
Chris Lattner	b1bc514730	Fix CodeExtractor/2004-03-17-UpdatePHIsOutsideRegion.ll llvm-svn: 12489	2004-03-18 05:38:31 +00:00
Chris Lattner	69fdd9f14a	Seriously simplify and correct the PHI node handling code. llvm-svn: 12487	2004-03-18 05:28:49 +00:00
Chris Lattner	345cf6f177	Fix CodeExtractor/2004-03-17-OutputMismatch.ll llvm-svn: 12486	2004-03-18 04:12:05 +00:00
Chris Lattner	0d233c03fc	Fix several bugs in the extractor: 1. Names were not put on the new arguments created (ok, this just helps sanity :) 2. Fix outgoing pointer values 3. Do not insert stores for values that had not been computed 4. Fix some weird problems with the outset calculation This fixes CodeExtractor/2004-03-14-DominanceProblem.ll, making the extractor work on at least one simple case! llvm-svn: 12484	2004-03-18 03:49:40 +00:00
Chris Lattner	55114016ea	The code extractor needs dominator info. Provide it llvm-svn: 12483	2004-03-18 03:48:06 +00:00
Chris Lattner	7c0d39dcd6	Prune #includes, moving the module interface to the front. Note that this exposed the fact that the header was not self-contained. There is a reason we do things :) llvm-svn: 12481	2004-03-18 03:15:29 +00:00
Chris Lattner	849234af99	Fix compilation of mesa, which I broke earlier today llvm-svn: 12465	2004-03-17 02:02:47 +00:00
Chris Lattner	eccc0e01b2	Be more accurate llvm-svn: 12464	2004-03-17 01:59:27 +00:00
Chris Lattner	6fdcd7174b	Fix bug in previous checkin llvm-svn: 12458	2004-03-16 23:36:49 +00:00
Chris Lattner	59342a757e	Okay, so there is no reasonable way for tail duplication to update SSA form, as it is making effectively arbitrary modifications to the CFG and we don't have a domset/domfrontier implementations that can handle the dynamic updates. Instead of having a bunch of code that doesn't actually work in practice, just demote any potentially tricky values to the stack (causing the problem to go away entirely). Later invocations of mem2reg will rebuild SSA for us. This fixes all of the major performance regressions with tail duplication from LLVM 1.1. For example, this loop: --- int popcount(int x) { int result = 0; while (x != 0) { result = result + (x & 0x1); x = x >> 1; } return result; } --- Used to be compiled into: int %popcount(int %X) { entry: br label %loopentry loopentry: ; preds = %entry, %no_exit %x.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ] ; <int> [#uses=3] %result.1.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ] ; <int> [#uses=2] %tmp.1 = seteq int %x.0, 0 ; <bool> [#uses=1] br bool %tmp.1, label %loopexit, label %no_exit no_exit: ; preds = %loopentry %tmp.4 = and int %x.0, 1 ; <int> [#uses=1] %tmp.6 = add int %tmp.4, %result.1.0 ; <int> [#uses=1] %tmp.9 = shr int %x.0, ubyte 1 ; <int> [#uses=1] br label %loopentry loopexit: ; preds = %loopentry ret int %result.1.0 } And is now compiled into: int %popcount(int %X) { entry: br label %no_exit no_exit: ; preds = %entry, %no_exit %x.0.0 = phi int [ %X, %entry ], [ %tmp.9, %no_exit ] ; <int> [#uses=2] %result.1.0.0 = phi int [ 0, %entry ], [ %tmp.6, %no_exit ] ; <int> [#uses=1] %tmp.4 = and int %x.0.0, 1 ; <int> [#uses=1] %tmp.6 = add int %tmp.4, %result.1.0.0 ; <int> [#uses=2] %tmp.9 = shr int %x.0.0, ubyte 1 ; <int> [#uses=2] %tmp.1 = seteq int %tmp.9, 0 ; <bool> [#uses=1] br bool %tmp.1, label %loopexit, label %no_exit loopexit: ; preds = %no_exit ret int %tmp.6 } llvm-svn: 12457	2004-03-16 23:29:09 +00:00
Chris Lattner	e04883605a	This code was both incredibly complex and incredibly broken. Fix it. llvm-svn: 12456	2004-03-16 23:23:11 +00:00
Chris Lattner	c260cfab09	Punt if we see gigantic PHI nodes. This improves a huge interpreter loop testcase from 32.5s in -raise to take .3s llvm-svn: 12443	2004-03-16 19:52:53 +00:00
Chris Lattner	dc22f37eb5	Do not try to optimize PHI nodes with incredibly high degree. This reduces SCCP time from 615s to 1.49s on a large testcase that has a gigantic switch statement that all of the blocks in the function go to (an interpreter). llvm-svn: 12442	2004-03-16 19:49:59 +00:00
Chris Lattner	2175b35b46	Do not copy gigantic switch instructions llvm-svn: 12441	2004-03-16 19:45:22 +00:00
Chris Lattner	9ae2b6acd7	Fix a regression from this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20040308/013095.html Basically, this patch only updated the immediate dominatees of the header node to tell them that the preheader also dominated them. In practice, ALL dominatees of the header node are also dominated by the preheader. This fixes: LoopSimplify/2004-03-15-IncorrectDomUpdate. and PR293 llvm-svn: 12434	2004-03-16 06:00:15 +00:00
Chris Lattner	b9c53cdb65	Restore old inlining heuristic. As the comment indicates, this is a nasty horrible hack. llvm-svn: 12423	2004-03-15 06:38:14 +00:00
Chris Lattner	d8905ad834	Add counters for the number of calls eliminated llvm-svn: 12420	2004-03-15 05:46:59 +00:00
Chris Lattner	8a82176edc	Implement LICM of calls in simple cases. This is sufficient to move around sin/cos/strlen calls and stuff. This implements: LICM/call_sink_pure_function.ll LICM/call_sink_const_function.ll llvm-svn: 12415	2004-03-15 04:11:30 +00:00
Chris Lattner	781ede7382	Mostly cosmetic improvements. Do fix the bug where a global value was considered an input. llvm-svn: 12406	2004-03-15 01:26:44 +00:00
Chris Lattner	3933f4660f	Assert that input blocks meet the invariants we expect Simplify the input/output finder. All elements of a basic block are instructions. Any used arguments are also inputs. An instruction can only be used by another instruction. llvm-svn: 12405	2004-03-15 01:18:23 +00:00
Chris Lattner	49038b7708	Fix several bugs in the loop extractor. In particular, subloops were never extracted, and a function that contained a single top-level loop never had the loop extracted, regardless of how much non-loop code there was. llvm-svn: 12403	2004-03-15 00:02:02 +00:00
Chris Lattner	cfe9af750b	No correctness fixes here, just minor qoi fixes: * Don't insert a branch to the switch instruction after the call, just make it a single block. * Insert the new alloca instructions in the entry block of the original function instead of having them execute dynamically * Don't make the default edge of the switch instruction go back to the switch. The loop extractor shouldn't create new loops! * Give meaningful names to the alloca slots and the reload instructions * Some minor code simplifications llvm-svn: 12402	2004-03-14 23:43:24 +00:00
Chris Lattner	17c2c776c3	Simplify code a bit, and fix bug CodeExtractor/2004-03-14-NoSwitchSupport.ll This also implements two minor improvements: * Don't insert live-out stores IN the region, insert them on the code path that exits the region * If the region is exited to the same block from multiple paths, share the switch statement entry, live-out store code, and the basic block. llvm-svn: 12401	2004-03-14 23:05:49 +00:00
Chris Lattner	f8d6c1a252	Simplify the code a bit by making the collection of basic blocks to extract a member of the class. While we're at it, turn the collection into a set instead of a vector to improve efficiency and make queries simpler. llvm-svn: 12400	2004-03-14 22:34:55 +00:00
Chris Lattner	95c238ef5b	Split into two passes. Now there is the general loop extractor, usable on the command line, and the single loop extractor, usable by bugpoint llvm-svn: 12390	2004-03-14 20:01:36 +00:00
Chris Lattner	cf5d48e8af	Passes don't print stuff! llvm-svn: 12385	2004-03-14 04:17:53 +00:00
Chris Lattner	7e7c3332b8	Do not create empty basic blocks when the lowerswitch pass expects blocks to be non-empty! This fixes LowerSwitch/2004-03-13-SwitchIsDefaultCrash.ll llvm-svn: 12384	2004-03-14 04:14:31 +00:00
Chris Lattner	e52c176a7c	Minor random cleanups llvm-svn: 12382	2004-03-14 04:01:47 +00:00
Chris Lattner	f3b0377169	FunctionPass's should not define their own 'run' method. Require 'simplified' loops, not just raw natural loops. This fixes CodeExtractor/2004-03-13-LoopExtractorCrash.ll llvm-svn: 12381	2004-03-14 04:01:06 +00:00
Chris Lattner	39b3ae34bd	If a block is dead, dominators will not be calculated for it. Because of this loop information won't see it, and we could have unreachable blocks pointing to the non-header node of blocks in a natural loop. This isn't tidy, so have the loopsimplify pass clean it up. llvm-svn: 12380	2004-03-14 03:59:22 +00:00
Chris Lattner	b23b38259a	Verify functions as they are produced if -debug is specified. Reduce curly braceage llvm-svn: 12378	2004-03-14 03:17:22 +00:00
Chris Lattner	46c006bb19	Move prototype to IPO.h instead of Scalar.h Make sure that the file interface header (IPO.h) is included first remove dead #include llvm-svn: 12375	2004-03-14 02:37:16 +00:00
Chris Lattner	3d96322890	Indent anon namespace properly, add copyright block llvm-svn: 12373	2004-03-14 02:34:07 +00:00
Chris Lattner	1685a3af78	Move to the IPO library. Utils shouldn't contain passes. llvm-svn: 12372	2004-03-14 02:32:27 +00:00
Chris Lattner	5003f9e473	DemoteRegToStack got moved from DemoteRegToStack.h to Local.h llvm-svn: 12368	2004-03-14 02:13:38 +00:00
Chris Lattner	52ac108b28	Add some debugging output Fix InstCombine/2004-03-13-InstCombineInfLoop.ll which caused an infinite loop compiling (I think) povray. llvm-svn: 12365	2004-03-13 23:54:27 +00:00
Chris Lattner	763b6c41d4	This change makes two big adjustments. * Be a lot more accurate about what the effects will be when inlining a call to a function when an argument is an alloca. * Dramatically reduce the penalty for inlining a call in a large function. This heuristic made it almost impossible to inline a function into a large function, no matter how small the callee is. llvm-svn: 12363	2004-03-13 23:15:45 +00:00
Chris Lattner	8d45aeaff1	This little patch speeds up the loop used to update the dominator set analysis. On the testcase from GCC PR12440, which has a LOT of loops (1392 of which require preheaders to be inserted), this speeds up the loopsimplify pass from 1.931s to 0.1875s. The loop in question goes from 1.65s -> 0.0097s, which isn't bad. All of these times are a debug build. This adds a dependency on DominatorTree analysis that was not there before, but we always had dominatortree available anyway, because LICM requires both loop simplify and DT, so this doesn't add any extra analysis in practice. llvm-svn: 12362	2004-03-13 22:01:26 +00:00
Chris Lattner	e10c6cd509	Implement sub.ll:test14 llvm-svn: 12355	2004-03-13 00:11:49 +00:00
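
Two of the InstCombine rewrites described in the log above (618c89d5eb, "Canonicalize add of sign bit constant into a xor", and 306540a2f4, the select.ll:test12* transform) are simple enough to check numerically. The sketch below is not LLVM code: it models 32-bit integers with a mask in Python, and every function name in it is hypothetical, chosen only for illustration. It assumes `select cond, a, b` yields `a` when `cond` is true.

```python
import random

MASK = 0xFFFFFFFF       # model 32-bit wraparound arithmetic (assumption)
SIGN_BIT = 0x80000000   # the "sign bit constant" for a 32-bit integer


def select(cond, if_true, if_false):
    """Stand-in for a select instruction: pick if_true when cond holds."""
    return if_true if cond else if_false


def add_sign_bit(x):
    """x + signbit, wrapped to 32 bits."""
    return (x + SIGN_BIT) & MASK


def xor_sign_bit(x):
    """x ^ signbit, the canonical form named in the commit message."""
    return x ^ SIGN_BIT


def select_of_or_before(cond, a, b):
    """%C = or %A, %B ; %D = select %cond, %C, %A"""
    c = a | b
    return select(cond, c, a)


def select_of_or_after(cond, a, b):
    """%C = select %cond, %B, 0 ; %D = or %A, %C"""
    c = select(cond, b, 0)
    return a | c


def check_identities(trials=1000, seed=0):
    """Return True if both rewrites agree on random 32-bit inputs."""
    rng = random.Random(seed)
    for _ in range(trials):
        a = rng.getrandbits(32)
        b = rng.getrandbits(32)
        if add_sign_bit(a) != xor_sign_bit(a):
            return False
        for cond in (False, True):
            if select_of_or_before(cond, a, b) != select_of_or_after(cond, a, b):
                return False
    return True
```

Random testing like this only builds confidence; it does not prove the rewrites. The first identity holds because adding the top bit either sets it (no carry out) or clears it (carry discarded), which is exactly what xor does. The second holds because `A | 0` is `A`.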

1899 Commits