llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 07:22:55 +01:00

Author	SHA1	Message	Date
Chris Lattner	2be8c4a870	Optimize div/rem + select combinations more. In particular, implement div.ll:test10 and rem.ll:test4. llvm-svn: 18838	2004-12-12 21:48:58 +00:00
Chris Lattner	0a2feabdc9	Properly implement copying of a global, fixing the 255.vortex & povray failures from last night. llvm-svn: 18832	2004-12-12 19:34:41 +00:00
Chris Lattner	9e7ce53b82	Simplify code and do not invalidate iterators. This fixes a crash compiling TimberWolfMC that was exposed due to recent optimizer changes. llvm-svn: 18831	2004-12-12 18:23:20 +00:00
Chris Lattner	97adae1fa4	Though the previous xform applies to literally dozens (hundreds?) of variables in SPEC, the subsequent optimziations that we are after don't play with with FP values, so disable this xform for them. Really we just don't want stuff like: double G; (always 0 or 412312.312) = G; turning into: bool G_b; = G_b ? 412312.312 : 0; We'd rather just do the load. -Chris llvm-svn: 18819	2004-12-12 06:03:06 +00:00
Chris Lattner	f125dc0e49	If a variable can only hold two values, and is not already a bool, shrink it down to actually BE a bool. This allows simple value range propagation stuff work harder, deleting comparisons in bzip2 in some hot loops. This implements GlobalOpt/integer-bool.ll, which is the essence of the loop condition distilled into a testcase. llvm-svn: 18817	2004-12-12 05:53:50 +00:00
Chris Lattner	02c04bbf45	If one side of and/or is known to be 0/-1, it doesn't matter if the other side is overdefined. This allows us to fold conditions like: if (X < Y \|\| Y > Z) in some cases. llvm-svn: 18807	2004-12-11 23:15:19 +00:00
Chris Lattner	59ce936426	Only cound if we actually made a change. llvm-svn: 18800	2004-12-11 17:00:14 +00:00
Chris Lattner	8378361f0c	The split bb is really the exit of the old function llvm-svn: 18799	2004-12-11 16:59:54 +00:00
Chris Lattner	06bfa390f6	Two bug fixes: 1. Actually increment the Statistic for the GV elim optzn 2. When resolving undef branches, only resolve branches in executable blocks, avoiding marking a bunch of completely dead blocks live. This has a big impact on the quality of the generated code. With this patch, we positively rip up vortex, compiling Ut_MoveBytes to a single memcpy call. In vortex we get this: 12 ipsccp - Number of globals found to be constant 986 ipsccp - Number of arguments constant propagated 1378 ipsccp - Number of basic blocks unreachable 8919 ipsccp - Number of instructions removed llvm-svn: 18796	2004-12-11 06:05:53 +00:00
Chris Lattner	5af0ef5c44	Do not delete the entry block to a function. llvm-svn: 18795	2004-12-11 05:32:19 +00:00
Chris Lattner	943f94d2b3	Implement Transforms/SCCP/ipsccp-gvar.ll, by tracking values stored to non-address-taken global variables. llvm-svn: 18790	2004-12-11 05:15:59 +00:00
Chris Lattner	beeab3a124	Fix a bug where we could delete dead invoke instructions with uses. In functions where we fully constant prop the return value, replace all ret instructions with 'ret undef'. llvm-svn: 18786	2004-12-11 02:53:57 +00:00
Chris Lattner	d1d00e017b	Implement SCCP/ipsccp-conditional.ll, by totally deleting dead blocks. llvm-svn: 18781	2004-12-10 22:29:08 +00:00
Chris Lattner	5be2c2e299	Fix SCCP/2004-12-10-UndefBranchBug.ll llvm-svn: 18776	2004-12-10 20:41:50 +00:00
Chris Lattner	40e97884f6	Fix Regression/Transforms/SimplifyCFG/2004-12-10-SimplifyCFGCrash.ll, and the failure on make_dparser last night. llvm-svn: 18766	2004-12-10 17:42:31 +00:00
Chris Lattner	ead42a768e	This is the initial implementation of IPSCCP, as requested by Brian. This implements SCCP/ipsccp-basic.ll, rips apart Olden/mst (as described in PR415), and does other nice things. There is still more to come with this, but it's a start. llvm-svn: 18752	2004-12-10 08:02:06 +00:00
Chris Lattner	f5962a7d6c	note to self: Do not check in debugging code! llvm-svn: 18693	2004-12-09 07:15:52 +00:00
Chris Lattner	1e2b3fd343	Implement trivial sinking for load instructions. This causes us to sink 567 loads in spec llvm-svn: 18692	2004-12-09 07:14:34 +00:00
Chris Lattner	8529319568	Do extremely simple sinking of instructions when they are only used in a successor block. This turns cases like this: x = a op b if (c) { use x } into: if (c) { x = a op b use x } This triggers 3965 times in spec, and is tested by Regression/Transforms/InstCombine/sink_instruction.ll This appears to expose a bug in the X86 backend for 177.mesa, which I'm looking in to. llvm-svn: 18677	2004-12-08 23:43:58 +00:00
Alkis Evlogimenos	fe549f46e1	Fix this regression and remove the XFAIL from this test. llvm-svn: 18674	2004-12-08 23:10:30 +00:00
Chris Lattner	62e3f533f3	Fix Transforms/InstCombine/2004-12-08-RemInfiniteLoop.ll llvm-svn: 18670	2004-12-08 22:20:34 +00:00
Chris Lattner	8ecbc96420	Add support for compilers without argument dependent name lookup, contributed by Bjørn Wennberg llvm-svn: 18627	2004-12-08 16:12:20 +00:00
Chris Lattner	2f52add966	Remove unneeded class qualifier, contributed by Bjørn Wennberg llvm-svn: 18625	2004-12-08 16:05:02 +00:00
Reid Spencer	f929fb661d	For PR387:\ Add doInitialization method to avoid overloaded virtuals llvm-svn: 18602	2004-12-07 08:11:36 +00:00
Chris Lattner	a5a08f5484	Implement stripping of debug symbols, making the --strip-debug options in gccas/gccld more than just a noop. llvm-svn: 18456	2004-12-03 16:22:08 +00:00
Chris Lattner	45529acebd	Initial reimplementation of the -strip pass, with a stub for implementing -S llvm-svn: 18440	2004-12-02 21:25:03 +00:00
Chris Lattner	d742c10964	This pass is moving to lib IPO llvm-svn: 18439	2004-12-02 21:24:40 +00:00
Chris Lattner	afb31cf723	Implement a FIXME by checking to make sure that a malloc is not being used in scary and unknown ways before we promote it. This fixes the miscompilation of 188.ammp that has been plauging us since a globalopt patch went in. Thanks a ton to Tanya for helping me diagnose the problem! llvm-svn: 18418	2004-12-02 07:11:07 +00:00
Chris Lattner	fd541d4228	Fix a minor bug where we set a var to initialized on malloc, not on store. This doesn't fix anything that I'm aware of, just noticed it by inspection llvm-svn: 18417	2004-12-02 06:25:58 +00:00
Chris Lattner	23c9c1bb62	This pass is completely broken. llvm-svn: 18387	2004-11-30 17:09:06 +00:00
Chris Lattner	c58bc8d96e	Squelch warning llvm-svn: 18381	2004-11-30 07:47:34 +00:00
Chris Lattner	537e5a6b50	Fix test/Regression/Transforms/LICM/2004-09-14-AliasAnalysisInvalidate.llx This only fails on darwin or on X86 under valgrind. llvm-svn: 18377	2004-11-30 07:01:15 +00:00
Chris Lattner	25b93fea2a	Alkis noticed that this variable is dead. Thanks! llvm-svn: 18369	2004-11-30 04:01:44 +00:00
Chris Lattner	b237a19a78	If we have something like this: if (x) { code ... } else { code ... } Turn it into: code if (x) { ... } else { ... } This reduces code size and in some common cases allows us to completely eliminate the conditional. This turns several if/then/else blocks in loops into straightline code in 179.art, turning the loops into single basic blocks (good for modsched even!). Maybe now brg will leave me alone ;-) llvm-svn: 18366	2004-11-30 00:29:14 +00:00
Chris Lattner	ba782edfb7	Allow hoisting loads of globals and alloca's in conditionals. llvm-svn: 18363	2004-11-29 21:26:12 +00:00
Reid Spencer	dd287f7758	Fix for PR454: * Make sure we handle signed to unsigned conversion correctly * Move this visitSetCondInst case to its own method. llvm-svn: 18312	2004-11-28 21:31:15 +00:00
Chris Lattner	7301399232	Make DSE potentially more aggressive by being more specific about alloca sizes. llvm-svn: 18309	2004-11-28 20:44:37 +00:00
Chris Lattner	2c4161fc57	Implement Regression/Transforms/InstCombine/getelementptr_cast.ll, which occurs many times in crafty llvm-svn: 18273	2004-11-27 17:55:46 +00:00
Chris Lattner	8715ef738f	Provide size information when checking to see if we can LICM a load, this allows us to hoist more loads in some cases. llvm-svn: 18265	2004-11-26 21:20:09 +00:00
Chris Lattner	ee2f552b0f	Do not count debugger intrinsics in size estimation. llvm-svn: 18110	2004-11-22 17:23:57 +00:00
Chris Lattner	9d1117ed96	Ignore debugger intrinsics when doing inlining size computations. llvm-svn: 18109	2004-11-22 17:21:44 +00:00
Chris Lattner	4b58b75683	Do not consider debug intrinsics in the size computations for loop unrolling. Patch contributed by Michael McCracken! llvm-svn: 18108	2004-11-22 17:18:36 +00:00
Misha Brukman	9dd523842a	Allow constructor parameter to override aggregating args; fix spacing llvm-svn: 18028	2004-11-20 02:20:27 +00:00
Chris Lattner	73b5c56fc1	Fix the exposed prototype for the lower packed pass, thanks to Morten Ofstad. llvm-svn: 17996	2004-11-19 16:49:34 +00:00
Chris Lattner	a973b1a1a4	CPR is dead. llvm-svn: 17992	2004-11-19 16:24:57 +00:00
Chris Lattner	ce8249f570	Delete stoppoints that occur for the same source line. llvm-svn: 17970	2004-11-18 21:41:39 +00:00
Chris Lattner	7fb7c81ebf	Check in hook that I forgot llvm-svn: 17956	2004-11-18 17:24:20 +00:00
Chris Lattner	92e712b00f	Do not delete dead invoke instructions! llvm-svn: 17897	2004-11-16 16:32:28 +00:00
Reid Spencer	e986ef23d7	Remove unused variable for compilation by VC++. Patch contributed by Morten Ofstad. llvm-svn: 17830	2004-11-15 17:29:41 +00:00
Chris Lattner	f95f7e05a5	Minor cleanups. There is no reason for SCCP to derive from instvisitor anymore. llvm-svn: 17825	2004-11-15 07:15:04 +00:00
Chris Lattner	4aa7dc02bf	Count more accurately llvm-svn: 17824	2004-11-15 07:02:42 +00:00
Chris Lattner	20a9efa189	Quiet warnings on the persephone tester llvm-svn: 17821	2004-11-15 05:54:07 +00:00
Chris Lattner	e87a1360b3	Two minor improvements: 1. Speedup getValueState by having it not consider Arguments. It's better to just add them before we start SCCP'ing. 2. SCCP can delete the contents of dead blocks. No really, it's ok! This reduces the size of the IR for subsequent passes, even though simplifycfg would do the same job. In practice, simplifycfg does not run until much later than sccp in gccas llvm-svn: 17820	2004-11-15 05:45:33 +00:00
Chris Lattner	4ad574191b	rename InstValue to LatticeValue, as it holds for more than instructions. llvm-svn: 17818	2004-11-15 05:03:30 +00:00
Chris Lattner	bde8da9e43	Substantially refactor the SCCP class into an SCCP pass and an SCCPSolver class. The only changes are minor: * Do not try to SCCP instructions that return void in the rewrite loop. This is silly and fool hardy, wasting a map lookup and adding an entry to the map which is never used. * If we decide something has an undefined value, rewrite it to undef, potentially leading to further simplications. llvm-svn: 17816	2004-11-15 04:44:20 +00:00
Chris Lattner	e4a97f4bee	If a global is just loaded and restored, realize that it is not changing value. This allows us to turn more globals into constants and eliminate them. This patch implements GlobalOpt/load-store-global.llx. Note that this patch speeds up 255.vortex from: Output/255.vortex.out-cbe.time:program 7.640000 Output/255.vortex.out-llc.time:program 9.810000 to: Output/255.vortex.out-cbe.time:program 7.250000 Output/255.vortex.out-llc.time:program 9.490000 Which isn't bad at all! llvm-svn: 17746	2004-11-14 20:50:30 +00:00
Chris Lattner	3d61b688a9	This optimization makes MANY phi nodes that all have the same incoming value. If this happens, detect it early instead of relying on instcombine to notice it later. This can be a big speedup, because PHI nodes can have many incoming values. llvm-svn: 17741	2004-11-14 19:29:34 +00:00
Chris Lattner	1e4cad9176	Implement instcombine/phi.ll:test6 - pulling operations through PHI nodes. This exposes subsequent optimization possiblities and reduces code size. This triggers 1423 times in spec. llvm-svn: 17740	2004-11-14 19:13:23 +00:00
Chris Lattner	fdd41995d8	Transform this: %X = alloca ... %Y = alloca ... X == Y into false. This allows us to simplify some stuff in eon (and probably many other C++ programs) where operator= was checking for self assignment. Folding this allows us to SROA several additional structs. llvm-svn: 17735	2004-11-14 07:33:16 +00:00
Chris Lattner	d6d9a04344	Remove note to self llvm-svn: 17734	2004-11-14 06:57:47 +00:00
Chris Lattner	363e78c357	If a function always returns a constant, replace all calls sites with that constant value. This makes the return value dead and allows for simplification in the caller. This implements IPConstantProp/return-constant.ll This triggers several dozen times throughout SPEC. llvm-svn: 17730	2004-11-14 06:10:11 +00:00
Chris Lattner	d0a0af0818	Teach SROA how to promote an array index that is variable, if the dimension of the array is just two. This occurs 8 times in gcc, 6 times in crafty, and 12 times in 099.go. This implements ScalarRepl/sroa_two.ll llvm-svn: 17727	2004-11-14 05:00:19 +00:00
Chris Lattner	bc35272f86	Rearrange some code, no functionality changes. llvm-svn: 17724	2004-11-14 04:24:28 +00:00
Chris Lattner	9b2f0f93e3	Remove debugging code llvm-svn: 17719	2004-11-13 23:32:53 +00:00
Chris Lattner	9749bf21f8	Argument promotion transforms functions to unconditionally load their argument pointers. This is only valid to do if the function already unconditionally loaded an argument or if the pointer passed in is known to be valid. Make sure to do the required checks. This fixed ArgumentPromotion/control-flow.ll and the Burg program. llvm-svn: 17718	2004-11-13 23:31:34 +00:00
Chris Lattner	70e351fb1c	Simplify handling of shifts to be the same as we do for adds. Add support for (X * C1) + (X * C2) (where * can be mul or shl), allowing us to fold: Y+Y+Y+Y+Y+Y+Y+Y into %tmp.8 = shl long %Y, ubyte 3 ; <long> [#uses=1] instead of %tmp.4 = shl long %Y, ubyte 2 ; <long> [#uses=1] %tmp.12 = shl long %Y, ubyte 2 ; <long> [#uses=1] %tmp.8 = add long %tmp.4, %tmp.12 ; <long> [#uses=1] This implements add.ll:test25 Also add support for (XC1)-(XC2) -> X*(C1-C2), implementing sub.ll:test18 llvm-svn: 17704	2004-11-13 19:50:12 +00:00
Chris Lattner	7a8d26a581	Fold: (X + (X << C2)) --> X * ((1 << C2) + 1) ((X << C2) + X) --> X * ((1 << C2) + 1) This means that we now canonicalize "Y+Y+Y" into: %tmp.2 = mul long %Y, 3 ; <long> [#uses=1] instead of: %tmp.10 = shl long %Y, ubyte 1 ; <long> [#uses=1] %tmp.6 = add long %Y, %tmp.10 ; <long> [#uses=1] llvm-svn: 17701	2004-11-13 19:31:40 +00:00
Chris Lattner	d348f5b9fb	Lazily create the abort message, so only translation units that use unwind will actually get it. llvm-svn: 17700	2004-11-13 19:07:32 +00:00
Chris Lattner	ab917141d0	Fix: CodeExtractor/2004-11-12-InvokeExtract.ll llvm-svn: 17699	2004-11-13 00:06:45 +00:00
Chris Lattner	c6b1d7a081	Fix a bug where the code extractor would get a bit confused handling invoke instructions, setting DefBlock to a block it did not have dom info for. llvm-svn: 17697	2004-11-12 23:50:44 +00:00
Chris Lattner	922a1b4601	Simplify handling of constant initializers llvm-svn: 17696	2004-11-12 22:42:57 +00:00
Chris Lattner	1a469385bd	Actually, leave the check in. This prevents us from counting dead arguments as IPCP opportunities. llvm-svn: 17680	2004-11-11 07:47:54 +00:00
Chris Lattner	dba9c2b0ef	Fix bug: IPConstantProp/deadarg.ll llvm-svn: 17679	2004-11-11 07:46:29 +00:00
Chris Lattner	d920b5b770	Make IP Constant prop more aggressive about handling self recursive calls. This implements IPConstantProp/recursion.ll llvm-svn: 17666	2004-11-10 19:43:59 +00:00
Chris Lattner	b214a04a16	Do not let dead constant expressions hanging off of functions prevent IPCP. This allows to elimination of a bunch of global pool descriptor args from programs being pool allocated (and is also generally useful!) llvm-svn: 17657	2004-11-09 20:47:30 +00:00
Chris Lattner	9f035c9fdb	Change this back so that I get stable numbers to reflect the change from the nightly testers llvm-svn: 17646	2004-11-09 08:05:23 +00:00
Chris Lattner	b924e8be62	Fix bug: 2004-11-08-FreeUseCrash.ll llvm-svn: 17642	2004-11-09 05:10:56 +00:00
Chris Lattner	a9e56ddaa3	VERY large functions that are only called from one place are not really exciting to inline. Only inline medium or small sized functions with a single call site. llvm-svn: 17588	2004-11-07 21:46:47 +00:00
Chris Lattner	a060b55dea	This is V9 specific, move it there. llvm-svn: 17545	2004-11-07 00:39:26 +00:00
Chris Lattner	b5ff07e46e	Remove dead vars llvm-svn: 17482	2004-11-05 04:46:22 +00:00
Chris Lattner	064071600d	Fix some warnings on VC++ llvm-svn: 17481	2004-11-05 04:45:43 +00:00
Chris Lattner	33ed2c8984	* Rearrange code slightly * Disable broken transforms for simplifying (setcc (cast X to larger), CI) where CC is not != or == llvm-svn: 17422	2004-11-02 03:50:32 +00:00
Chris Lattner	b81520727e	Speed up the tail duplication pass on the testcase below from 68.2s to 1.23s: #define CL0(a) case a: f(); goto c; #define CL1(a) CL0(a##0) CL0(a##1) CL0(a##2) CL0(a##3) CL0(a##4) CL0(a##5) \ CL0(a##6) CL0(a##7) CL0(a##8) CL0(a##9) #define CL2(a) CL1(a##0) CL1(a##1) CL1(a##2) CL1(a##3) CL1(a##4) CL1(a##5) \ CL1(a##6) CL1(a##7) CL1(a##8) CL1(a##9) #define CL3(a) CL2(a##0) CL2(a##1) CL2(a##2) CL2(a##3) CL2(a##4) CL2(a##5) \ CL2(a##6) CL2(a##7) CL2(a##8) CL2(a##9) #define CL4(a) CL3(a##0) CL3(a##1) CL3(a##2) CL3(a##3) CL3(a##4) CL3(a##5) \ CL3(a##6) CL3(a##7) CL3(a##8) CL3(a##9) void f(); void a() { int b; c: switch (b) { CL4(1) } } This comes from GCC PR 15524 llvm-svn: 17390	2004-11-01 07:05:07 +00:00
Chris Lattner	3083f2a424	Do not compute the predecessor list for a block unless we need it. This speeds up simplifycfg on this program, from 44.87s to 0.29s (with a profiled build): #define CL0(a) case a: goto c; #define CL1(a) CL0(a##0) CL0(a##1) CL0(a##2) CL0(a##3) CL0(a##4) CL0(a##5) \ CL0(a##6) CL0(a##7) CL0(a##8) CL0(a##9) #define CL2(a) CL1(a##0) CL1(a##1) CL1(a##2) CL1(a##3) CL1(a##4) CL1(a##5) \ CL1(a##6) CL1(a##7) CL1(a##8) CL1(a##9) #define CL3(a) CL2(a##0) CL2(a##1) CL2(a##2) CL2(a##3) CL2(a##4) CL2(a##5) \ CL2(a##6) CL2(a##7) CL2(a##8) CL2(a##9) #define CL4(a) CL3(a##0) CL3(a##1) CL3(a##2) CL3(a##3) CL3(a##4) CL3(a##5) \ CL3(a##6) CL3(a##7) CL3(a##8) CL3(a##9) void f(); void a() { int b; c: switch (b) { CL4(1) } } This testcase is contrived to expose N^2 behavior, but this patch should speedup simplifycfg on any programs that use large switch statements. This testcase comes from GCC PR17895. llvm-svn: 17389	2004-11-01 06:53:58 +00:00
Reid Spencer	d3f7233495	Change Library Names Not To Conflict With Others When Installed llvm-svn: 17286	2004-10-27 23:18:45 +00:00
Chris Lattner	dd0094e4ed	Convert 'struct' to 'class' in various places to adhere to the coding standards and work better with VC++. Patch contributed by Morten Ofstad! llvm-svn: 17281	2004-10-27 16:14:51 +00:00
Chris Lattner	000424b69e	Hrm, this code was severely botched. As it turns out, this patch: http://mail.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20041018/019708.html exposed ANOTHER latent bug in this xform, which caused Prolangs-C/bison to fill the zion nightly tester disk up and make the tester barf. This is obviously not a good thing, so lets fix this bug shall we? :) llvm-svn: 17276	2004-10-27 05:57:15 +00:00
Chris Lattner	abfae1c72e	Initialize with the correct constant type llvm-svn: 17270	2004-10-27 03:55:24 +00:00
Chris Lattner	895f16ebfc	Fix compatibility with MSVC, patch by Morten Ofstad llvm-svn: 17218	2004-10-25 18:45:16 +00:00
Reid Spencer	9f6f128630	Eliminate compilation warning on uninitialized variable. llvm-svn: 17163	2004-10-22 16:10:39 +00:00
Chris Lattner	f0e2ce2279	* empty log message * llvm-svn: 17161	2004-10-22 06:43:28 +00:00
Chris Lattner	6321852143	Fix a bug Nate noticed, where we miscompiled a simple testcase llvm-svn: 17157	2004-10-22 04:53:16 +00:00
Reid Spencer	e48ba34fd4	We won't use automake llvm-svn: 17155	2004-10-22 03:35:04 +00:00
Brian Gaeke	71ff3efdd5	Explain what this pass does. llvm-svn: 17146	2004-10-20 19:38:58 +00:00
Chris Lattner	e555b5c5cb	Hrm, some people complain when the compiler cheerfully tells them what it's doing... I guess they're right. llvm-svn: 17142	2004-10-19 06:33:16 +00:00
Reid Spencer	ce514b1c2c	Initial automake generated Makefile template llvm-svn: 17136	2004-10-18 23:55:41 +00:00
Nate Begeman	49e38c2185	Initial implementation of the strength reduction for GEP instructions in loops. This optimization is not turned on by default yet, but may be run with the opt tool's -loop-reduce flag. There are many FIXMEs listed in the code that will make it far more applicable to a wide range of code, but you have to start somewhere :) This limited version currently triggers on the following tests in the MultiSource directory: pcompress2: 7 times cfrac: 5 times anagram: 2 times ks: 6 times yacr2: 2 times llvm-svn: 17134	2004-10-18 21:08:22 +00:00
Chris Lattner	9e2649a7be	Get this file compiling with VC++, patch contributed by Morten Ofstad. Thanks Morten! llvm-svn: 17125	2004-10-18 15:43:46 +00:00
Reid Spencer	2b5cfe8be8	Correction to allow compilation with Visual C++. Patch contributed by Morten Ofstad. Thanks Morten! llvm-svn: 17123	2004-10-18 14:38:48 +00:00
Chris Lattner	b1827a765a	Simplify code by deleting instructions that preceed unreachable instructions. Simplify code by simplifying terminators that branch to blocks that start with an unreachable instruction. llvm-svn: 17116	2004-10-18 04:07:22 +00:00
Chris Lattner	3c1be63b41	Turn store -> null/undef into the LLVM unreachable instruction! This simple change hacks off 10K of bytecode from perlbmk (.5%) even though the front-end is not generating them yet and we are not optimizing the resultant code. This isn't too bad. llvm-svn: 17111	2004-10-18 03:00:50 +00:00
Chris Lattner	18333c64de	Turn things with obviously undefined semantics into 'store -> null' llvm-svn: 17110	2004-10-18 02:59:09 +00:00
Chris Lattner	dbfa9a5e9c	My friend the invoke instruction does not dominate all basic blocks if it occurs in the entry node of a function llvm-svn: 17109	2004-10-18 01:48:31 +00:00
Chris Lattner	f7bf129037	Fix a bug that occurs when the constant value is the result of an invoke. In particular, invoke ret values are only live in the normal dest of the invoke not in the unwind dest. llvm-svn: 17108	2004-10-18 01:21:17 +00:00
Chris Lattner	44a7cf480d	Getting ADCE to interact well with unreachable instructions seems like a nontrivial exercise that I'm not interested in tackling right now. Just punt and treat them like unwind's. This 'fixes' test/Regression/Transforms/ADCE/unreachable-function.ll llvm-svn: 17106	2004-10-17 23:45:06 +00:00
Chris Lattner	9e7ddf8596	Fix Regression/Transforms/Inline/2004-10-17-InlineFunctionWithoutReturn.ll If a function had no return instruction in it, and the result of the inlined call instruction was used, we would crash. llvm-svn: 17104	2004-10-17 23:21:07 +00:00
Chris Lattner	795ae57cde	Remove printout, realize that instructions in the entry block dominate all other blocks. llvm-svn: 17099	2004-10-17 21:31:34 +00:00
Chris Lattner	ac92e5fcd0	When inserting PHI nodes, don't insert any phi nodes that are obviously unneccesary. This allows us to delete several hundred phi nodes of the form PHI(x,x,x,undef) from 253.perlbmk and probably other programs as well. This implements Mem2Reg/UndefValuesMerge.ll llvm-svn: 17098	2004-10-17 21:25:56 +00:00
Chris Lattner	306dd8a44a	Enhance hasConstantValue to ignore undef values in phi nodes. This allows it to think that PHI[4, undef] == 4. llvm-svn: 17096	2004-10-17 21:23:26 +00:00
Chris Lattner	ef0888e493	hasConstantValue will soon return instructions that don't dominate the PHI node, so prepare for this. llvm-svn: 17095	2004-10-17 21:22:38 +00:00
Chris Lattner	68f14bc09c	Fix a type violation llvm-svn: 17069	2004-10-16 23:28:04 +00:00
Chris Lattner	d12442c206	Kill the bogon that slipped into my buffer before I committed. llvm-svn: 17067	2004-10-16 19:46:33 +00:00
Chris Lattner	b55574181d	Implement InstCombine/getelementptr.ll:test9, which is the source of many ugly and giant constnat exprs in some programs. llvm-svn: 17066	2004-10-16 19:44:59 +00:00
Chris Lattner	2fae8a1ef9	Add support for unreachable llvm-svn: 17056	2004-10-16 18:21:33 +00:00
Chris Lattner	3ebca6fb19	Optimize instructions involving undef values. For example X+undef == undef. llvm-svn: 17047	2004-10-16 18:11:37 +00:00
Chris Lattner	4fca8caaee	Add support for UndefValue llvm-svn: 17046	2004-10-16 18:10:31 +00:00
Chris Lattner	ca01f160ee	When promoting mem2reg, make uninitialized values become undef isntead of 0. llvm-svn: 17045	2004-10-16 18:10:06 +00:00
Chris Lattner	80f963c30b	Handle undef values as undefined on the constant lattice ignore unreachable instructions llvm-svn: 17044	2004-10-16 18:09:41 +00:00
Chris Lattner	c630ba08cf	Add note llvm-svn: 17043	2004-10-16 18:09:25 +00:00
Chris Lattner	4a37579191	Add support for the undef value. Implement a new optimization based on globals that are initialized with undef. When promoting malloc to a global, start out initialized to undef llvm-svn: 17042	2004-10-16 18:09:00 +00:00
Chris Lattner	7319d9289f	Fix a bug John tracked down in libstdc++ where we were incorrectly deleting weak functions. Thanks for finding this John! llvm-svn: 16997	2004-10-14 19:53:50 +00:00
Chris Lattner	88ad718c75	When converting phi nodes into select instructions, we shouldn't promote PHI nodes unless we KNOW that we are able to promote all of them. This fixes: test/Regression/Transforms/SimplifyCFG/PhiNoEliminate.ll llvm-svn: 16973	2004-10-14 05:13:36 +00:00
Reid Spencer	e6418ec30f	Update to reflect changes in Makefile rules. llvm-svn: 16950	2004-10-13 11:46:52 +00:00
Chris Lattner	63f15158f4	Transform memmove -> memcpy when the source is obviously constant memory. llvm-svn: 16932	2004-10-12 04:52:52 +00:00
Chris Lattner	b605c860fc	Fix a REALLY obscure bug in my previous checkin, which was splicing the END marker from one ilist into the middle of another basic block! llvm-svn: 16925	2004-10-12 01:02:29 +00:00
Chris Lattner	a6d0e41b72	Handle a common case more carefully. In particular, instead of transforming pointer recurrences into expressions from this: %P_addr.0.i.0 = phi sbyte* [ getelementptr ([8 x sbyte]* %.str_1, int 0, int 0), %entry ], [ %inc.0.i, %no_exit.i ] %inc.0.i = getelementptr sbyte* %P_addr.0.i.0, int 1 ; <sbyte> [#uses=2] into this: %inc.0.i = getelementptr sbyte getelementptr ([8 x sbyte]* %.str_1, int 0, int 0), int %inc.0.i.rec Actually create something nice, like this: %inc.0.i = getelementptr [8 x sbyte]* %.str_1, int 0, int %inc.0.i.rec llvm-svn: 16924	2004-10-11 23:06:50 +00:00
Chris Lattner	f9e79568c2	Reenable the transform, turning X/-10 < 1 into X > -10 llvm-svn: 16918	2004-10-11 19:40:04 +00:00
Chris Lattner	9d662982e0	This patch implements two things (sorry). First, it allows SRA of globals that have embedded arrays, implementing GlobalOpt/globalsra-partial.llx. This comes up infrequently, but does allow, for example, deleting several stores to dead parts of globals in dhrystone. Second, this implements GlobalOpt/malloc-promote-.llx, which is the following nifty transformation: Basically if a global pointer is initialized with malloc, and we can tell that the program won't notice, we transform this: struct foo FooPtr; ... FooPtr = malloc(sizeof(struct foo)); ... FooPtr->A FooPtr->B Into: struct foo FooPtrBody; ... FooPtrBody.A FooPtrBody.B This comes up occasionally, for example, the 'disp' global in 183.equake (where the xform speeds the CBE version of the program up from 56.16s to 52.40s (7%) on apoc), and the 'desired_accept', 'fixLRBT', 'macroArray', & 'key_queue' globals in 300.twolf (speeding it up from 22.29s to 21.55s (3.4%)). The nice thing about this xform is that it exposes the resulting global to global variable optimization and makes alias analysis easier in addition to eliminating a few loads. llvm-svn: 16916	2004-10-11 05:54:41 +00:00
Chris Lattner	21dcce4e33	Just because we cannot completely eliminate all uses of a global, we can still optimize away all of the indirect calls and loads, etc from it. This turns code like this: if (G != 0) G(); into if (G != 0) ActualCallee(); This triggers a couple of times in gcc and libstdc++. llvm-svn: 16901	2004-10-10 23:14:11 +00:00
Reid Spencer	1b7459b29d	Initial version of automake Makefile.am file. llvm-svn: 16893	2004-10-10 22:20:40 +00:00
Chris Lattner	d10f9ed455	Fix 2004-10-10-CastStoreOnce.llx, by adjusting types back if we strip off a cast llvm-svn: 16878	2004-10-10 17:07:12 +00:00
Chris Lattner	cd4a7a5293	Implement GlobalOpt/deadglobal-2.llx, deletion of globals that are only stored to, but are stored at variable indexes. This occurs at least in 176.gcc, but probably others, and we should handle it for completeness. llvm-svn: 16876	2004-10-10 16:47:33 +00:00
Chris Lattner	b0d3511aa6	Avoid calling use_size() which could (in theory) be expensive if the global has a large number of users. Instead, just keep track of whether we're making changes as we do so. This patch has no functionlity changes. llvm-svn: 16874	2004-10-10 16:43:46 +00:00
Chris Lattner	6b0cd15f9c	Eliminate global pointers that are only stored a single value and null if we know that all uses of the global will trap if the pointer contained is null. In this case, we forward substitute the stored value to any uses. This has the effect of devirtualizing trivial globals in trivial cases. For example, 164.gzip contains this: gzip.h:extern int (read_buf) OF((char buf, unsigned size)); bits.c: read_buf = file_read; deflate.c: lookahead = read_buf((char)window, deflate.c: n = read_buf((char)window+strstart+lookahead, more); Since read_buf has to point to file_read at every use, we just replace the calls through read_buf with a direct call to file_read. This occurs in several benchmarks, including 176.gcc and 164.gzip. Direct calls are good and stuff. llvm-svn: 16871	2004-10-09 21:48:45 +00:00
Chris Lattner	f8a6e4402e	Use DEBUG instead of DebugFlag directly, as DebugFlag does not respect -debug-only! llvm-svn: 16868	2004-10-09 19:30:36 +00:00
Chris Lattner	51a762c414	Fix infinite loop due to iteration llvm-svn: 16864	2004-10-09 03:32:52 +00:00
Chris Lattner	c5085d212c	Implement sub.ll:test17, -X/C -> X/-C llvm-svn: 16863	2004-10-09 02:50:40 +00:00
Chris Lattner	4f504b0751	If we found a dead global, we should at least delete it... llvm-svn: 16858	2004-10-08 22:05:31 +00:00
Chris Lattner	7fc483bf28	* Pull out the meat of runOnModule into another function for clarity. * Do not lead dangling dead constants prevent optimization * Iterate global optimization while we're making progress. These changes allow us to be more aggressive, handling cases like GlobalOpt/iterate.llx without a problem (turning it into 'ret int 0'). llvm-svn: 16857	2004-10-08 20:59:28 +00:00
Chris Lattner	d777571d0c	We might as well delete the known-dead global sooner rather than later since we know it is dead. llvm-svn: 16855	2004-10-08 20:25:55 +00:00
Chris Lattner	816a8a5e1e	Temporarily disable a buggy transformation until it can be fixed. This fixes 254.gap. llvm-svn: 16853	2004-10-08 19:15:44 +00:00
Chris Lattner	71aecc5006	Implement SRA for global variables. This allows the other global variable optimizations to trigger much more often. This allows the elimination of several dozen more global variables in Programs/External. Note that we only do this for non-constant globals: constant globals will already be optimized out if the accesses to them permit it. This implements Transforms/GlobalOpt/globalsra.llx llvm-svn: 16842	2004-10-08 17:32:09 +00:00
Chris Lattner	5839d93b51	Instcombine (X & FF00) + xx00 -> (X+xx00) & FF00, implementing and.ll:test27 This comes up when doing adds to bitfield elements. llvm-svn: 16836	2004-10-08 05:07:56 +00:00
Chris Lattner	87259c3ce9	Little patch to turn (shl (add X, 123), 4) -> (add (shl X, 4), 123 << 4) This triggers in cases of bitfield additions, opening opportunities for future improvements. llvm-svn: 16834	2004-10-08 03:46:20 +00:00
Chris Lattner	7882b54197	Improve comments, no functionality changes llvm-svn: 16814	2004-10-07 21:30:30 +00:00
Chris Lattner	e1d5d599bd	Fix a bug in the safety analysis routine llvm-svn: 16804	2004-10-07 06:01:25 +00:00
Chris Lattner	e7ec24c63e	Comment cleanups llvm-svn: 16803	2004-10-07 06:00:24 +00:00
Chris Lattner	ad9fe72e72	* Rename pass to globalopt, since we do more than just constify * Instead of handling dead functions specially, just nuke them. * Be more aggressive about cleaning up after constification, in particular, handle getelementptr instructions and constantexprs. * Be a little bit more structured about how we process globals. *** Delete globals that are only stored to, and never read. These are clearly not useful, so they should go. This implements deadglobal.llx This last one triggers quite a few times. In particular, 2208 in the external tests, 1865 of which are in 252.eon. This shrinks eon from 1995094 to 1732341 bytes of bytecode. llvm-svn: 16802	2004-10-07 04:16:33 +00:00
Chris Lattner	4a19983f2d	Implement GlobalConstifier/trivialstore.llx, and also do some simplifications of the resultant program to avoid making later passes do it all. This allows us to constify globals that just have the same constant that they are initialized stored into them. Suprisingly this comes up ALL of the freaking time, dozens of times in SPEC, 30 times in vortex alone. For example, on 256.bzip2, it allows us to constify these two globals: %smallMode = internal global ubyte 0 ; <ubyte> [#uses=8] %verbosity = internal global int 0 ; <int> [#uses=49] Which (with later optimizations) results in the bytecode file shrinking from 82286 to 69686 bytes! Lets hear it for IPO :) For the record, it's nuking lots of "if (verbosity > 2) { do lots of stuff }" code. llvm-svn: 16793	2004-10-06 20:57:02 +00:00
Chris Lattner	ff8cbd01e7	Instcombine: -(X sdiv C) -> (X sdiv -C), tested by sub.ll:test16 llvm-svn: 16769	2004-10-06 15:08:25 +00:00
Chris Lattner	5f0c904ec0	Reduce code growth implied by the tail duplication pass by not duplicating an instruction if it can be hoisted to a common dominator of the block. This implements: test/Regression/Transforms/TailDup/MergeTest.ll llvm-svn: 16758	2004-10-06 03:27:37 +00:00
Brian Gaeke	ab7dd80200	Add accessor function. llvm-svn: 16622	2004-09-30 20:14:29 +00:00
Brian Gaeke	65540b3e58	Correct type of accessor functions. llvm-svn: 16621	2004-09-30 20:14:18 +00:00
Brian Gaeke	90a286872c	Namespacify. Add accessor function. llvm-svn: 16620	2004-09-30 20:14:07 +00:00
Chris Lattner	af68e9a012	Disable the 'WARNING: Found global types that are not compatible' warning that always prints when linking programs to libstdc++ :( llvm-svn: 16603	2004-09-30 00:12:29 +00:00
Chris Lattner	8341306cba	Hrm, debugging printouts do not need to be in here llvm-svn: 16598	2004-09-29 21:21:14 +00:00
Chris Lattner	79ceb6ba53	* Pull range optimization code out into new InsertRangeTest function. * SubOne/AddOne functions always return ConstantInt, declare them as such * Pull code for handling setcc X, cst, where cst is at the end of the range, or cc is LE or GE up earlier in visitSetCondInst. This reduces #iterations in some cases. * Fold: (div X, C1) op C2 -> range check, implementing div.ll:test6 - test9. llvm-svn: 16588	2004-09-29 17:40:11 +00:00
Chris Lattner	778a49acfd	Do not insert trivially dead select instructions, which allows us to potentially fold more in one pass. llvm-svn: 16583	2004-09-29 05:43:32 +00:00
Chris Lattner	572652718c	Fold binary expressions and casts into PHI nodes that have all constant inputs. This takes something like this: %A = phi int [ 3, %cond_false.0 ], [ 2, %endif.0.i ], [ 2, %endif.1.i ] %B = div int %tmp.243, 4 and turns it into: %A = phi int [ 3/4, %cond_false.0 ], [ 2/4, %endif.0.i ], [ 2/4, %endif.1.i ] which is later simplified (in this case) into %A = 0. This triggers thousands of times in spec, for example, 269 times in 176.gcc. This is tested by InstCombine/add.ll:test23 and set.ll:test18. llvm-svn: 16582	2004-09-29 05:07:12 +00:00
Chris Lattner	4ea03eea49	Hrm, really, all tests passed without this, but it is scary to think how... llvm-svn: 16568	2004-09-29 03:16:24 +00:00
Chris Lattner	1ad393b186	Remove debugging printout Instcombine (setcc (truncate X), C1). This occurs THOUSANDS of times in many benchmarks. Particularlly common seem to be things like (seteq (cast bool X to int), int 0) This turns it into (seteq bool %X, false), which then becomes (not %X). llvm-svn: 16567	2004-09-29 03:09:18 +00:00
Chris Lattner	0046cec2a2	Fold (X setcc C1) \| (X setcc C2) This implements or.ll:test1[89] llvm-svn: 16561	2004-09-28 22:33:08 +00:00
Chris Lattner	d3cfa5aba5	Fold (and (setcc X, C1), (setcc X, C2)) This is important for several reasons: 1. Benchmarks have lots of code that looks like this (perlbmk in particular): %tmp.2.i = setne int %tmp.0.i, 128 ; <bool> [#uses=1] %tmp.6343 = seteq int %tmp.0.i, 1 ; <bool> [#uses=1] %tmp.63 = and bool %tmp.2.i, %tmp.6343 ; <bool> [#uses=1] we now fold away the setne, a clear improvement. 2. In the more important cases, such as (X >= 10) & (X < 20), we now produce smaller code: (X-10) < 10. 3. Perhaps the nicest effect of this patch is that it really helps out the code generators. In particular, for a 'range test' like the above, instead of generating this on X86 (the difference on PPC is even more pronounced): cmp %EAX, 50 setge %CL cmp %EAX, 100 setl %AL and %CL, %AL cmp %CL, 0 we now generate this: add %EAX, -50 cmp %EAX, 50 Furthermore, this causes setcc's to be folded into branches more often. These combinations trigger dozens of times in the spec benchmarks, particularly in 176.gcc, 186.crafty, 253.perlbmk, 254.gap, & 099.go. llvm-svn: 16559	2004-09-28 21:48:02 +00:00
Chris Lattner	d7b9ededb4	Implement X / C1 / C2 folding Implement (setcc (shl X, C1), C2) folding. The second one occurs several dozen times in spec. The first was added just in case. :) These are tested by shift.ll:test2[12], and div.ll:test5 llvm-svn: 16549	2004-09-28 18:22:15 +00:00
Chris Lattner	a4e0ed87bc	shl is always zero extending, so always use a zero extending shift right. This latent bug was exposed by recent changes, and is tested as: llvm/test/Regression/Transforms/InstCombine/2004-09-28-BadShiftAndSetCC.llx llvm-svn: 16546	2004-09-28 17:54:07 +00:00
Alkis Evlogimenos	4f5920aaef	Add includes and use std:: for standard library calls to make code compile on windows. This patch was contributed by Paolo Invernizzi. llvm-svn: 16539	2004-09-28 14:42:44 +00:00
Alkis Evlogimenos	7ff66b2884	Pull assignment out of for loop conditional in order for this to compile under windows. Patch contributed by Paolo Invernizzi! llvm-svn: 16534	2004-09-28 02:40:37 +00:00
Chris Lattner	f953091075	Fix two bugs: one where a condition was mistakenly swapped, and another where we folded (X & 254) -> X < 1 instead of X < 2. These problems were latent problems exposed by the latest patch. llvm-svn: 16528	2004-09-27 19:29:18 +00:00
Chris Lattner	a715ffded3	Fold: (setcc (shr X, ShAmt), CI), where 'cc' is eq or ne. This xform triggers often, for example: 6x in povray, 1x in gzip, 279x in gcc, 1x in crafty, 8x in eon, 11x in perlbmk, 362x in gap, 4x in vortex, 14 in m88ksim, 211x in 126.gcc, 1x in compress, 11x in ijpeg, and 4x in 147.vortex. llvm-svn: 16521	2004-09-27 16:18:50 +00:00
Chris Lattner	9d4748d32d	Implement shift-and combinations, implementing InstCombine/and.ll:test19-21 These combinations trigger 4 times in povray, 7x in gcc, 4x in gap, and 2x in bzip2. llvm-svn: 16508	2004-09-24 15:21:34 +00:00
Chris Lattner	7e603bfc67	Move LHSI->hasOneUse() into the arms of the conditional, reindenting code. No functionality changes here. llvm-svn: 16505	2004-09-23 21:52:49 +00:00
Chris Lattner	00ea30c3c5	Implement Transforms/InstCombine/and.ll:test18, a case that occurs 20 times in perlbmk llvm-svn: 16504	2004-09-23 21:46:38 +00:00
Chris Lattner	6409a166e8	Implement select.ll:test16: fold load (select C, X, null) -> load X llvm-svn: 16499	2004-09-23 15:46:00 +00:00
Chris Lattner	537636bb55	Do not fold (X + C1 != C2) if there are other users of the add. Doing this transformation used to take a loop like this: int Array[1000]; void test(int X) { int i; for (i = 0; i < 1000; ++i) Array[i] += X; } Compiled to LLVM is: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=2] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=2] * %exitcond = seteq uint %indvar.next, 1000 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit and turn it into a loop like this: no_exit: ; preds = %entry, %no_exit %indvar = phi uint [ 0, %entry ], [ %indvar.next, %no_exit ] ; <uint> [#uses=3] %tmp.4 = getelementptr [1000 x int]* %Array, int 0, uint %indvar ; <int> [#uses=2] %tmp.7 = load int %tmp.4 ; <int> [#uses=1] %tmp.9 = add int %tmp.7, %X ; <int> [#uses=1] store int %tmp.9, int* %tmp.4 * %indvar.next = add uint %indvar, 1 ; <uint> [#uses=1] * %exitcond = seteq uint %indvar, 999 ; <bool> [#uses=1] br bool %exitcond, label %return, label %no_exit Note that indvar.next and indvar can no longer be coallesced. In machine code terms, this patch changes this code: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4%ECX], %ESI mov %EDX, %ECX inc %EDX cmp %ECX, 999 mov %ECX, %EDX jne .LBBtest_1 # no_exit into this: .LBBtest_1: # no_exit mov %EDX, OFFSET Array mov %ESI, %EAX add %ESI, DWORD PTR [%EDX + 4%ECX] mov %EDX, OFFSET Array mov DWORD PTR [%EDX + 4%ECX], %ESI inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit We need better instruction selection to get this: .LBBtest_1: # no_exit add DWORD PTR [Array + 4*%ECX], EAX inc %ECX cmp %ECX, 1000 jne .LBBtest_1 # no_exit ... but at least there is less register juggling llvm-svn: 16473	2004-09-21 21:35:23 +00:00
Chris Lattner	b64bfebc25	Fix potential miscompilations: InstCombine/2004-09-20-BadLoadCombine*.llx llvm-svn: 16447	2004-09-20 10:15:10 +00:00
Alkis Evlogimenos	a3a9fa1d80	Fix loop condition so that we don't decrement off the beginning of the list. llvm-svn: 16440	2004-09-20 06:42:58 +00:00
Chris Lattner	43c0372c0b	'Pass' should now not be derived from by clients. Instead, they should derive from ModulePass. Instead of implementing Pass::run, then should implement ModulePass::runOnModule. llvm-svn: 16436	2004-09-20 04:48:05 +00:00
Chris Lattner	b65d7c65d6	Prototype more accurately llvm-svn: 16433	2004-09-20 04:43:57 +00:00
Chris Lattner	c137c9ac39	Prototype these functions more accurately llvm-svn: 16432	2004-09-20 04:43:15 +00:00
Chris Lattner	f607a26457	Make isSafeToLoadUnconditionally a bit smarter, implementing PR362 and Regression/Transforms/InstCombine/CPP_min_max.llx llvm-svn: 16409	2004-09-19 19:18:10 +00:00
Chris Lattner	98d434988e	Remove a whole bunch of horrible hacky code that was used to promote allocas whose addresses where used by trivial phi nodes and select instructions. This is now performed by the instcombine pass, which is more powerful, is much simpler, and is faster. This allows the deletion of a bunch of code, two FIXME's and two gotos. llvm-svn: 16406	2004-09-19 18:51:51 +00:00
Chris Lattner	f45dc6dae6	Make instruction combining a bit more aggressive in the face of volatile loads, and implement two new transforms: InstCombine/load.ll:test[56]. llvm-svn: 16404	2004-09-19 18:43:46 +00:00
Chris Lattner	188b4e4983	Add comment llvm-svn: 16400	2004-09-19 01:05:16 +00:00
Chris Lattner	12bcdf2e01	Fix the inliner to always delete any edges from the external call node to a function being deleted. Due to optimizations done while inlining, there can be edges from the external call node to a function node that were not apparent any longer. This fixes the compiler crash while compiling 175.vpr llvm-svn: 16399	2004-09-18 21:37:03 +00:00
Chris Lattner	223e9d38b5	Convert this pass to be a CallGraphSCCPass instead of a Pass, which eliminates the worklist and makes it more efficient. This does not change functionality at all. llvm-svn: 16390	2004-09-18 00:34:13 +00:00
Chris Lattner	f31ae4da07	Make sure to remove the Select instruction as well llvm-svn: 16389	2004-09-18 00:32:40 +00:00
Chris Lattner	228e66d208	Fix typo in comment llvm-svn: 16384	2004-09-17 03:58:39 +00:00
Chris Lattner	e54e70266e	Add a newline llvm-svn: 16369	2004-09-15 17:53:52 +00:00
Reid Spencer	c6a8d70cff	Convert code to compile with vc7.1. Patch contributed by Paolo Invernizzi. Thanks Paolo! llvm-svn: 16368	2004-09-15 17:06:42 +00:00
Chris Lattner	5751f19d86	Fix a bug in the previous checkin that broke 255.vortex llvm-svn: 16355	2004-09-15 02:34:40 +00:00
Chris Lattner	6833186048	Make sure to update alias analysis information as we transform the function. This fixes PR420 and Regression/Transforms/LICM/2004-09-14-AliasAnalysisInvalidate.llx llvm-svn: 16348	2004-09-15 01:04:07 +00:00
Chris Lattner	1d9b957384	If given an AliasSetTracker object to update, update it. llvm-svn: 16347	2004-09-15 01:02:54 +00:00
Chris Lattner	a7635e78c2	Remove a long-dead pass. Actually, this pass was never used at all. llvm-svn: 16337	2004-09-14 16:33:01 +00:00
Alkis Evlogimenos	0c50e0f211	Fixes to make LLVM compile with vc7.1. Patch contributed by Paolo Invernizzi! llvm-svn: 16152	2004-09-03 18:19:51 +00:00
Reid Spencer	c4abcbefb1	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Reid Spencer	7117a132ea	Initial checkin of a pass to lower packed operations to scalars operations. This also registers the pass with opt with a -lower-packed command line option. Patch contributed by Brad Jones. llvm-svn: 15987	2004-08-21 21:39:24 +00:00
Chris Lattner	9f60c755f8	If we are linking two global variables and they have the same size, do not spew warnings, even if the types don't match. llvm-svn: 15933	2004-08-20 00:30:39 +00:00
Chris Lattner	8e2dc1a98a	Implement test/Regression/Transforms/GlobalConstifier/phi-select.llx This allows more globals to be marked constant, particularly global arrays. llvm-svn: 15735	2004-08-14 20:57:17 +00:00
Chris Lattner	84445ee674	If we are extracting a block that has multiple successors that are the same block (common in a switch), make sure to remove extra edges in successor blocks. This fixes CodeExtractor/2004-08-12-BlockExtractPHI.ll and should be pulled into LLVM 1.3 (though the regression test need not be, as that would require pulling in the LoopExtract.cpp changes). llvm-svn: 15717	2004-08-13 03:27:07 +00:00
Chris Lattner	2c3ad7902d	When we code extract some stuff, leave the codeRepl block in the place where the extracted code was, instead of putting it at the end of the function llvm-svn: 15716	2004-08-13 03:17:39 +00:00
Chris Lattner	8537ae6e2c	"extract" the block extractor pass from bugpoint (haha) llvm-svn: 15714	2004-08-13 03:05:17 +00:00
Chris Lattner	7abc1c2473	Add value mapper support for select constant exprs. This should fix a bug Nate ran into when bugpointing siod. This fix should go into LLVM 1.3 llvm-svn: 15712	2004-08-13 02:43:19 +00:00
Chris Lattner	32ad5d0bab	This patch makes the inliner refuse to inline functions that have alloca instructions in the body of the function (not the entry block). This fixes test/Programs/SingleSource/Regression/C/2004-08-12-InlinerAndAllocas.c and test/Programs/External/SPEC/CINT2000/176.gcc on zion. This should obviously be pulled into 1.3. llvm-svn: 15684	2004-08-12 05:45:09 +00:00
Chris Lattner	85e2339cfa	Fix code extraction of unwind blocks. This fixed bugs that bugpoint can run into. This should go into 1.3 llvm-svn: 15679	2004-08-12 03:17:02 +00:00
Chris Lattner	430578a835	Hrm, this pass didn't compile. This bugfix should go into 1.3! llvm-svn: 15676	2004-08-12 02:44:23 +00:00
Chris Lattner	2f98c58e84	Fix InstCombine/2004-08-10-BoolSetCC.ll, a bug that is miscompiling 176.gcc. Note that this is apparently not the only bug miscompiling gcc though. :( llvm-svn: 15639	2004-08-11 00:50:51 +00:00
Chris Lattner	32b5c4960c	Fix InstCombine/2004-08-09-RemInfLoop.llx This should go into the 1.3 branch llvm-svn: 15593	2004-08-09 21:05:48 +00:00
Chris Lattner	c5a25532c7	Fix another really nasty regression that Anshu pointed out. In cases where dangling constant users were removed from a function, causing it to be dead, we never removed the call graph edge from the external node to the function. In most cases, this didn't cause a problem (by luck). This should definitely go into 1.3 llvm-svn: 15570	2004-08-08 03:29:50 +00:00
Chris Lattner	0c29b326fb	Two fixes: 1. Fix a REALLY nasty cyclic replacement issue that Anshu discovered, causing nondeterminstic crashes and memory corruption. 2. For performance, don't go inserting constantexpr casts of GV pointers. This should definitely go into 1.3 llvm-svn: 15568	2004-08-08 01:30:07 +00:00
Chris Lattner	7f088ed899	This DEBUG is buggy. comment it out because it's not worth fixing. This should go into 1.3 llvm-svn: 15567	2004-08-08 01:27:56 +00:00
Alkis Evlogimenos	f853362a44	Stop using getValues(). llvm-svn: 15487	2004-08-04 08:44:43 +00:00
Chris Lattner	41c8b70624	Fix a regression in InstCombine/xor.ll llvm-svn: 15410	2004-08-01 19:42:59 +00:00
Chris Lattner	170b31f44d	Expose this as a functionpass llvm-svn: 15369	2004-07-31 10:01:58 +00:00
Misha Brukman	4b70aa2e78	Fix De Morgan's name. llvm-svn: 15343	2004-07-30 12:50:08 +00:00
Chris Lattner	e63c404df2	Start using the PatternMatcher a bit. llvm-svn: 15342	2004-07-30 07:50:03 +00:00
Misha Brukman	8760d70159	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15337	2004-07-29 17:30:57 +00:00
Misha Brukman	58104df77b	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15334	2004-07-29 17:30:56 +00:00
Misha Brukman	2a80e53645	Fix #includes of i*.h => Instructions.h as per PR403. llvm-svn: 15328	2004-07-29 17:05:13 +00:00
Alkis Evlogimenos	fb27f702ca	Merge i*.h headers into Instructions.h as part of bug403. llvm-svn: 15325	2004-07-29 12:17:34 +00:00
Robert Bocchino	4325ca6606	This change fixed a bug in the function visitMul. The prior version assumed that a constant on the RHS of a multiplication was either an IntConstant or an FPConstant. It checked for an IntConstant and then, if it did not find one, did a hard cast to an FPConstant. That code would crash if the RHS were a ConstantExpr that was neither an IntConstant nor an FPConstant. This version replaces the hard cast with a dyn_cast. It performs the same way for IntConstants and FPConstants but does nothing, instead of crashing, for constant expressions. The regression test for this change is 2004-07-27-ConstantExprMul.ll. llvm-svn: 15291	2004-07-27 21:02:21 +00:00
Brian Gaeke	45adb41f46	Make the create...() functions for some of these passes return a FunctionPass *. llvm-svn: 15276	2004-07-27 17:43:21 +00:00
Chris Lattner	5cfb85064e	Fix hoisting of void typed values, e.g. calls llvm-svn: 15263	2004-07-27 07:38:32 +00:00
Chris Lattner	59aff88abe	Implement DeadStoreElim/alloca.llx by observing that allocas are dead at the end of the function (either return or unwind) llvm-svn: 15232	2004-07-26 06:14:11 +00:00
Chris Lattner	c2e9c56906	Throttle back indvar substitution from creating multiplies in loops. This is bad bad bad. llvm-svn: 15227	2004-07-26 02:47:12 +00:00
Chris Lattner	a259f9201c	* Substantially simplify how free instructions are handled (potentially fixing a bug in DSE). * Delete dead operand uses iteratively instead of recursively, using a SetVector. * Defer deletion of dead operand uses until the end of processing, which means we don't have to bother with updating the AliasSetTracker. This speeds up DSE substantially. llvm-svn: 15204	2004-07-25 11:09:56 +00:00
Chris Lattner	0a9d5e6f14	Free instructions kill values too. This implements DeadStoreElim/free.llx llvm-svn: 15199	2004-07-25 07:58:38 +00:00
Chris Lattner	29e97f36bf	obvious fix llvm-svn: 15162	2004-07-24 07:51:27 +00:00
Chris Lattner	7e9731bc4f	This is a trivial dead store elimination pass. It very very simple and can be improved in many ways. But: stop laughing, even with -basicaa it deletes 15% of the stores in 252.eon :) llvm-svn: 15101	2004-07-22 08:00:28 +00:00
Chris Lattner	7b301dfa9d	Update GC intrinsics to take a pointer to the object as well as a pointer to the field being updated. Patch contributed by Tobias Nurmiranta llvm-svn: 15097	2004-07-22 05:51:13 +00:00
Brian Gaeke	f18cdca667	These files don't need to include <iostream> since they include "Support/Debug.h". llvm-svn: 15089	2004-07-21 20:50:33 +00:00
Chris Lattner	b77bda4432	* Further cleanup. * Test for whether bits are shifted out during the optzn. If so, the fold is illegal, though it can be handled explicitly for setne/seteq This fixes the miscompilation of 254.gap last night, which was a latent bug exposed by other optimizer improvements. llvm-svn: 15085	2004-07-21 20:14:10 +00:00
Chris Lattner	a3c10c2012	Make cast-cast code a bit more defensive "simplify" a bit of code for comparison/and folding llvm-svn: 15082	2004-07-21 19:50:44 +00:00
Chris Lattner	62c40d3982	Remove special casing of pointers and treat them generically as integers of the appopriate size. This gives us the ability to eliminate int -> ptr -> int llvm-svn: 15063	2004-07-21 04:27:24 +00:00
Chris Lattner	4e4b8b0ad2	Fix a serious code pessimization problem. If an inlined function has a single return, clone the 'ret' BB code into the block AFTER the inlined call, not the other way around. llvm-svn: 15030	2004-07-20 05:45:24 +00:00
Chris Lattner	f8c20caf25	Implement Transforms/InstCombine/IntPtrCast.ll llvm-svn: 15029	2004-07-20 05:21:00 +00:00
Chris Lattner	da83200d72	Ignore instructions that are in trivially dead functions. This allows us to constify 14 globals instead of 4 in a trivial C++ testcase. llvm-svn: 15027	2004-07-20 03:58:07 +00:00
Chris Lattner	f6eb30e9a8	Implement InstCombine/GEPIdxCanon.ll llvm-svn: 15024	2004-07-20 01:48:15 +00:00
Chris Lattner	8277141f09	Implement SimplifyCFG/BrUnwind.ll llvm-svn: 15022	2004-07-20 01:17:38 +00:00
Chris Lattner	d9c41a82a3	Rewrite cast->cast elimination code completely based on the information we actually care about. Someday when the cast instruction is gone, we can do better here, but this will do for now. This implements instcombine/cast.ll:test17/18 as well. llvm-svn: 15018	2004-07-20 00:59:32 +00:00
Chris Lattner	ffc1df7399	Fix a performance regression from the CPR patch, simplify code llvm-svn: 14974	2004-07-18 21:34:16 +00:00
Chris Lattner	9de817e13e	Strip out and simplify some code. This also fixes the regression last night compiling cfrac. It did not realize that code like this: int G; int *H = &G; takes the address of G. llvm-svn: 14973	2004-07-18 19:56:20 +00:00
Chris Lattner	8c4d6aa7e8	Minor cleanup, no functionality change llvm-svn: 14972	2004-07-18 18:59:44 +00:00
Reid Spencer	7b03169e0e	Remove an if statement that would never be reached. llvm-svn: 14968	2004-07-18 08:41:47 +00:00
Reid Spencer	6d26720976	Delete a redundant if branch. llvm-svn: 14967	2004-07-18 08:34:52 +00:00
Reid Spencer	3a18547fcb	Expand the coercion of constants to include the newly constant Globals. llvm-svn: 14966	2004-07-18 08:34:19 +00:00
Reid Spencer	2b7bae6b4f	Delete a no-op loop. llvm-svn: 14965	2004-07-18 08:32:43 +00:00
Reid Spencer	7236678c32	Expand the scope to include global values because they are now constants too. llvm-svn: 14964	2004-07-18 08:32:10 +00:00
Reid Spencer	90795f0825	Avoid an unnecessary isa<Constant>. llvm-svn: 14963	2004-07-18 08:31:18 +00:00
Chris Lattner	71f281984d	Remove useless statistic, fix some slightly broken logic llvm-svn: 14958	2004-07-18 07:22:58 +00:00
Chris Lattner	99e46b2e81	Fix a rather serious bug in previous checkin llvm-svn: 14957	2004-07-18 06:56:58 +00:00

... 3 4 5 6 7 ...

1950 Commits