llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 16:33:37 +01:00

Author	SHA1	Message	Date
Daniel Dunbar	8efbce934e	Add suggested parentheses. llvm-svn: 91853	2009-12-21 23:27:57 +00:00
Chris Lattner	8bd3f74d82	Add a fastpath to Load GVN to special case when we have exactly one dominating load to avoid even messing around with SSAUpdate at all. In this case (which is very common, we can just use the input value directly). This speeds up GVN time on gcc.c-torture/20001226-1.c from 36.4s to 16.3s, which still isn't great, but substantially better and this is a simple speedup that applies to lots of different cases. llvm-svn: 91851	2009-12-21 23:15:48 +00:00
Chris Lattner	41cad2092d	refactor some code out to a new helper method. llvm-svn: 91849	2009-12-21 23:04:33 +00:00
Chris Lattner	1cbad45619	improve indentation avoid a pointless conversion from weakvh to trackingvh, no functionality change. llvm-svn: 91848	2009-12-21 22:43:03 +00:00
Bob Wilson	eb77079db5	Remove special-case SROA optimization of variable indexes to one-element and two-element arrays. After restructuring the SROA code, it was not safe to do this without adding more checking. It is not clear that this special-case has really been useful, and removing this simplifies the code quite a bit. llvm-svn: 91828	2009-12-21 18:39:47 +00:00
Chris Lattner	cd9fb503c8	revert r89298, which was committed without a testcase. I think the underlying PHI node insertion issue in SSAUpdate is fixed. llvm-svn: 91821	2009-12-21 07:45:57 +00:00
Chris Lattner	c54fd1e777	fix PR5837 by having SSAUpdate reuse phi nodes for the 'GetValueInMiddleOfBlock' case, instead of inserting duplicates. A similar fix is almost certainly needed by the machine-level SSAUpdate implementation. llvm-svn: 91820	2009-12-21 07:16:11 +00:00
Chris Lattner	c9bfe8679e	give instcombine some helper functions for matching MIN and MAX, and implement some optimizations for MIN(MIN()) and MAX(MAX()) and MIN(MAX()) etc. This substantially improves the code in PR5822 but doesn't kick in much elsewhere. 2 max's were optimized in pairlocalalign and one in smg2000. llvm-svn: 91814	2009-12-21 06:03:05 +00:00
Chris Lattner	f1474e1761	enhance x-(-A) -> x+A to preserve NUW/NSW. Use the presence of NSW/NUW to fold "icmp (x+cst), x" to a constant in cases where it would otherwise be undefined behavior. Surprisingly (to me at least), this triggers hundreds of the times in a few benchmarks: lencode, ldecode, and 466.h264ref seem to really like this. llvm-svn: 91812	2009-12-21 04:04:05 +00:00
Chris Lattner	d34eb29977	Optimize all cases of "icmp (X+Cst), X" to something simpler. This triggers a bunch in lencode, ldecod, spass, 176.gcc, 252.eon, among others. It is also the first part of PR5822 llvm-svn: 91811	2009-12-21 03:19:28 +00:00
Douglas Gregor	f39dd74a3f	Fix a bunch of little errors that Clang complains about when its being pedantic llvm-svn: 91764	2009-12-19 07:05:23 +00:00
Chris Lattner	d9bf69f1a5	fix PR5827 by disabling the phi slicing transformation in a case where instcombine would have to split a critical edge due to a phi node of an invoke. Since instcombine can't change the CFG, it has to bail out from doing the transformation. llvm-svn: 91763	2009-12-19 07:01:15 +00:00
Bob Wilson	067a1be3e9	Update my SROA changes in response to review. * change FindElementAndOffset to return a uint64_t instead of unsigned, and to identify the type to be used for that result in a GEP instruction. * move "isa<ConstantInt>" to be first in conditional. * replace some dyn_casts with casts. * add a comment about handling mem intrinsics. llvm-svn: 91762	2009-12-19 06:53:17 +00:00
Bob Wilson	03b6955c7f	Reapply 91459 with a simple fix for the problem that broke the x86_64-darwin bootstrap. This also replaces the WeakVH references that Chris objected to with normal Value references. llvm-svn: 91711	2009-12-18 20:14:40 +00:00
Eli Friedman	c8ab298dbd	Optimize icmp of null and select of two constants even if the select has multiple uses. (The construct in question was found in gcc.) llvm-svn: 91675	2009-12-18 08:22:35 +00:00
Dan Gohman	87a4da3d59	Eliminte unnecessary uses of <cstdio>. llvm-svn: 91666	2009-12-18 03:25:51 +00:00
Dan Gohman	0f8f7f179d	Add Loop contains utility methods for testing whether a loop contains another loop, or an instruction. The loop form is substantially more efficient on large loops than the typical code it replaces. llvm-svn: 91654	2009-12-18 01:24:09 +00:00
Dan Gohman	66a17c5ce2	Minor code simplification. llvm-svn: 91653	2009-12-18 01:20:44 +00:00
Dan Gohman	3ea233d1f2	Don't pass const pointers by reference. llvm-svn: 91647	2009-12-18 00:38:08 +00:00
Dan Gohman	dcf5423e51	Update a comment. llvm-svn: 91645	2009-12-18 00:28:43 +00:00
Dan Gohman	c483bed5e8	Reapply LoopStrengthReduce and IVUsers cleanups, excluding the part of 91296 that caused trouble -- the Processed list needs to be preserved for the livetime of the pass, as AddUsersIfInteresting is called from other passes. llvm-svn: 91641	2009-12-18 00:06:20 +00:00
Eli Friedman	9543d05079	Allow instcombine to combine "sext(a) >u const" to "a >u trunc(const)". llvm-svn: 91631	2009-12-17 22:42:29 +00:00
Eli Friedman	8afea2d095	Make the ptrtoint comparison simplification work if one side is a global. llvm-svn: 91624	2009-12-17 21:27:47 +00:00
Eli Friedman	ff5c248066	Slightly generalize transformation of memmove(a,a,n) so that it also applies to memcpy. (Such a memcpy is technically illegal, but in practice is safe and is generated by struct self-assignment in C code.) llvm-svn: 91621	2009-12-17 21:07:31 +00:00
Bob Wilson	9d4b46c0e6	Re-revert 91459. It's breaking the x86_64 darwin bootstrap. llvm-svn: 91607	2009-12-17 18:34:24 +00:00
Evan Cheng	18e334195d	Revert 91280-91283, 91286-91289, 91291, 91293, 91295-91296. It apparently introduced a non-deterministic behavior in the optimizer somewhere. llvm-svn: 91598	2009-12-17 09:39:49 +00:00
Daniel Dunbar	c4abbc0ab6	Reapply r91459, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91559	2009-12-16 20:09:53 +00:00
Daniel Dunbar	929f303477	Revert "Reapply 91184 with fixes and an addition to the testcase to cover the problem", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. This reverts commit db22309800b224a9f5f51baf76071d7a93ce59c9. llvm-svn: 91534	2009-12-16 10:56:17 +00:00
Chris Lattner	2d5fc1649a	reapply my strstr optimization. I have reproduced the x86-64 bootstrap miscompile (i386.o miscompares) but it happens both with and without this patch. llvm-svn: 91532	2009-12-16 09:32:05 +00:00
Chris Lattner	8751050a34	revert my strstr optimization, I'm told it breaks x86-64 bootstrap. Will reapply with a fix when I get a chance. llvm-svn: 91486	2009-12-16 00:46:02 +00:00
Bob Wilson	8505aa648d	Reapply 91184 with fixes and an addition to the testcase to cover the problem found last time. Instead of trying to modify the IR while iterating over it, I've change it to keep a list of WeakVH references to dead instructions, and then delete those instructions later. I also added some special case code to detect and handle the situation when both operands of a memcpy intrinsic are referencing the same alloca. llvm-svn: 91459	2009-12-15 22:00:51 +00:00
Chris Lattner	fa960751b1	optimize strstr, PR5783 llvm-svn: 91438	2009-12-15 19:14:40 +00:00
Dan Gohman	a1003e40f7	Delete an unused function. llvm-svn: 91432	2009-12-15 16:30:09 +00:00
Chris Lattner	284fe3f14c	add some other xforms that should be done as part of PR5783 llvm-svn: 91428	2009-12-15 09:05:13 +00:00
Chris Lattner	587962c667	Remove isPod() from DenseMapInfo, splitting it out to its own isPodLike type trait. This is a generally useful type trait for more than just DenseMap, and we really care about whether something acts like a pod, not whether it really is a pod. llvm-svn: 91421	2009-12-15 07:26:43 +00:00
Dan Gohman	abb2ea84d9	Fix a thinko; isNotAlreadyContainedIn had a built-in negative, so the condition was inverted when the code was converted to contains(). llvm-svn: 91295	2009-12-14 17:31:01 +00:00
Dan Gohman	ea5d4d2354	Remove unnecessary #includes. llvm-svn: 91293	2009-12-14 17:19:06 +00:00
Dan Gohman	5066c25e62	Instead of having a ScalarEvolution pointer member in BasedUser, just pass the ScalarEvolution pointer into the functions which need it. llvm-svn: 91289	2009-12-14 17:12:51 +00:00
Dan Gohman	b001599b67	Don't bother cleaning up if there's nothing to clean up. llvm-svn: 91288	2009-12-14 17:10:44 +00:00
Dan Gohman	178e26adbf	Delete an unused variable. llvm-svn: 91287	2009-12-14 17:08:09 +00:00
Dan Gohman	44a2a502f2	LSR itself doesn't need LoopInfo. llvm-svn: 91283	2009-12-14 17:02:34 +00:00
Dan Gohman	449298eb08	LSR itself doesn't need DominatorTree. llvm-svn: 91282	2009-12-14 16:57:08 +00:00
Dan Gohman	bf3406f85f	Remove the code in LSR that manually hoists expansions out of loops; SCEVExpander does this automatically. llvm-svn: 91281	2009-12-14 16:52:55 +00:00
Dan Gohman	0a0423c492	Minor code cleanups. llvm-svn: 91280	2009-12-14 16:37:29 +00:00
Chris Lattner	6603d21f13	revert r91184, because it causes a crash on a .bc file I just sent to Bob. llvm-svn: 91268	2009-12-14 05:11:02 +00:00
Chandler Carruth	f5a2d328af	Don't leave pointers uninitialized in the default constructor. GCC complains about the potential use of these uninitialized members under certain conditions. llvm-svn: 91239	2009-12-13 07:04:45 +00:00
Bob Wilson	8486cae4ce	Revise scalar replacement to be more flexible about handle bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Eric Christopher	70ca0b3d58	Make sure the immediate dominator isn't NULL through iterations of the loop. We could get to this condition via indirect branches. llvm-svn: 91009	2009-12-10 00:25:41 +00:00
Chris Lattner	ffedf37584	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep inprecisely returns clobber, which is do to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa' llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Chris Lattner	af8114cef6	allow this to build when the #if 0's are enabled. No functionality change. llvm-svn: 90999	2009-12-10 00:04:46 +00:00
Dan Gohman	3ae2ed72a7	Dereference loopHeader after checking for null rather than before. llvm-svn: 90990	2009-12-09 22:55:01 +00:00
Chris Lattner	bf3d03b576	fix hte last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	a97922837e	change GetStoreValueForLoad to use IRBuilder, which is cleaner and implicitly constant folds. llvm-svn: 90977	2009-12-09 18:13:28 +00:00
Bob Wilson	9e68616e49	Fix a comment. llvm-svn: 90975	2009-12-09 18:05:27 +00:00
Chris Lattner	db0baa713d	change AnalyzeLoadFromClobberingMemInst/AnalyzeLoadFromClobberingStore to require the load ty/ptr to be passed in, no functionality change. llvm-svn: 90960	2009-12-09 07:37:07 +00:00
Chris Lattner	b3587059c8	change AnalyzeLoadFromClobberingWrite and clients to pass in type and pointer instead of the load. No functionality change. llvm-svn: 90959	2009-12-09 07:34:10 +00:00
Chris Lattner	e0207b46d2	change NonLocalDepEntry from being a typedef for an std::pair to be its own small class. No functionality change. llvm-svn: 90956	2009-12-09 07:08:01 +00:00
Chris Lattner	4645bb977a	add some aborts to #if 0's. llvm-svn: 90929	2009-12-09 02:41:54 +00:00
Chris Lattner	dda5ca59e2	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Bob Wilson	04cc375a1a	Some superficial cleanups. llvm-svn: 90866	2009-12-08 18:27:03 +00:00
Bob Wilson	d673b32280	Clean up dead operands left around after SROA replaces a mem intrinsic. I'm not aware that this does anything significant on its own, but it's needed for another patch that I'm working on. llvm-svn: 90864	2009-12-08 18:22:03 +00:00
Duncan Sands	897f9579d6	Teach GlobalOpt to delete aliases with internal linkage (after forwarding any uses). GlobalDCE can also do this, but is only run at -O3. llvm-svn: 90850	2009-12-08 10:10:20 +00:00
Nick Lewycky	5a00cea348	Remove unnecessary #include "llvm/LLVMContext.h". llvm-svn: 90836	2009-12-08 05:45:41 +00:00
Chris Lattner	7066a138ff	fix PR5698 llvm-svn: 90708	2009-12-06 17:17:23 +00:00
Chris Lattner	ea3007ddb8	constant fold loads from memcpy's from global constants. This is important because clang lowers nontrivial automatic struct/array inits to memcpy from a global array. llvm-svn: 90698	2009-12-06 05:29:56 +00:00
Chris Lattner	8885e71303	add support for forwarding mem intrinsic values to non-local loads. llvm-svn: 90697	2009-12-06 04:54:31 +00:00
Chris Lattner	5eba6ee969	Handle forwarding local memsets to loads. For example, we optimize this: short x(short A) { memset(A, 1, sizeof(A)*100); return A[42]; } to 'return 257' instead of doing the load. llvm-svn: 90695	2009-12-06 01:57:02 +00:00
Nick Lewycky	10693e2bb0	Generalize this optimization to work on equality comparisons between any two integers that are constant except for a single bit (the same n-th bit in each). llvm-svn: 90646	2009-12-05 05:00:00 +00:00
Bob Wilson	8c1617ed73	Fix up some comments. llvm-svn: 90603	2009-12-04 21:57:37 +00:00
Bob Wilson	514e0d319a	Fix 80-column violations. llvm-svn: 90601	2009-12-04 21:51:35 +00:00
Chris Lattner	422a3ff7d5	add an assert to make it really clear what this is doing. Return singularval as a compile time perf optimization to avoid a load. llvm-svn: 90507	2009-12-04 01:03:32 +00:00
Bob Wilson	c717c5e7ae	Fix a comment typo. llvm-svn: 90487	2009-12-03 21:47:07 +00:00
Owen Anderson	251cb28a25	Fix this crasher, and add a FIXME for a missed optimization. llvm-svn: 90408	2009-12-03 03:43:29 +00:00
Chris Lattner	9ce833945e	improve portability to avoid conflicting with std::next in c++'0x. Patch by Howard Hinnant! llvm-svn: 90365	2009-12-03 00:50:42 +00:00
Jim Grosbach	ccb304105a	Move EliminateDuplicatePHINodes() from SimplifyCFG.cpp to Local.cpp llvm-svn: 90324	2009-12-02 17:06:45 +00:00
Andreas Neustifter	e70972f8d5	Cheap, mostly strict, stable sorting. This is necessary for tests so the results are comparable. llvm-svn: 90320	2009-12-02 15:57:15 +00:00
Owen Anderson	f47cde694f	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	b541c63e60	factor some code better. llvm-svn: 90299	2009-12-02 06:44:58 +00:00
Chris Lattner	90aff65316	formatting cleanups. llvm-svn: 90298	2009-12-02 06:35:55 +00:00
Chris Lattner	cb2c4e9f42	tidy up, remove dependence on order of evaluation of function args from EmitMemCpy. llvm-svn: 90297	2009-12-02 06:05:42 +00:00
Chris Lattner	7c0c90df97	fix PR5640 by tracking whether a block is the header of a loop more precisely, which prevents us from infinitely peeling the loop. llvm-svn: 90211	2009-12-01 06:04:43 +00:00
Benjamin Kramer	d9780ec7c5	Revert r90089 for now, it's breaking selfhost. llvm-svn: 90097	2009-11-29 21:17:48 +00:00
Benjamin Kramer	a1d24b5a8d	Fix two FIXMEs. llvm-svn: 90089	2009-11-29 20:29:30 +00:00
Chris Lattner	cd6fed25d5	add testcases for the foo_with_overflow op xforms added recently and fix bugs exposed by the tests. Testcases from Alastair Lynn! llvm-svn: 90056	2009-11-29 02:57:29 +00:00
Chris Lattner	d48ff7ea6a	Implement PR5634. llvm-svn: 90046	2009-11-29 00:51:17 +00:00
Chris Lattner	83284453a1	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	f8d8142a06	Enhance InsertPHITranslatedPointer to be able to return a list of newly inserted instructions. No functionality change until someone starts using it. llvm-svn: 90039	2009-11-28 15:39:14 +00:00
Chris Lattner	f3e5cbfc99	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	73b425ba51	Rework InsertPHITranslatedPointer to handle the recursive case, this fixes PR5630 and sets the stage for the next phase of goodness (testcase pending). llvm-svn: 90019	2009-11-27 22:05:15 +00:00
Chris Lattner	bdaed088ea	factor some logic out of instcombine into a new SimplifyAddInst method. llvm-svn: 90011	2009-11-27 17:42:22 +00:00
Chris Lattner	cdfa9dadf1	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	a466dbe80a	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	0971e6da1f	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	6611a6f733	factor some instcombine simplifications for getelementptr out to a new SimplifyGEPInst method in InstructionSimplify.h. No functionality change. llvm-svn: 89980	2009-11-27 00:29:05 +00:00
Chris Lattner	e949f49b23	fix crash on Transforms/InstCombine/intrinsics.ll introduced by r89970 llvm-svn: 89972	2009-11-26 22:08:06 +00:00
Chris Lattner	cf7665b0c8	Fix PR5471 by removing an instcombine xform. Some pieces of the code generates store to undef and some generates store to null as the idiom for undefined behavior. Since simplifycfg zaps both, don't remove the undefined behavior in instcombine. llvm-svn: 89971	2009-11-26 22:04:42 +00:00
Chris Lattner	08e20f453d	implement a bunch of xforms for overflow intrinsics, based on a patch by Alastair Lynn. llvm-svn: 89970	2009-11-26 21:42:47 +00:00
Edward O'Callaghan	4b197b8908	Reverting patch in revision 89758, initial attempt at fixing PR5373 has proven to be bogus. llvm-svn: 89844	2009-11-25 05:38:41 +00:00
Edward O'Callaghan	8c1cd4fdbc	Fix for PR5373, Credit to Jakub Staszak. llvm-svn: 89758	2009-11-24 11:51:52 +00:00
Dan Gohman	58bb87921b	Make ConstantFoldConstantExpression recursively visit the entire ConstantExpr, not just the top-level operator. This allows it to fold many more constants. Also, make GlobalOpt call ConstantFoldConstantExpression on GlobalVariable initializers. llvm-svn: 89659	2009-11-23 16:22:21 +00:00
Dan Gohman	0ef3e7cf76	Fix a use of an invalidated iterator in the case where there are multiple adjacent uses of a dead basic block from the same user. This fixes PR5596. llvm-svn: 89658	2009-11-23 16:13:39 +00:00
Nick Lewycky	b3bedf4b2d	Pull LLVMContext out of PromoteMemToReg. llvm-svn: 89645	2009-11-23 03:50:44 +00:00
Nick Lewycky	76fbcdaaa7	Remove LLVMContext and its include. llvm-svn: 89644	2009-11-23 03:34:29 +00:00
Nick Lewycky	8cbd0c3156	Remove unused LLVMContext. llvm-svn: 89642	2009-11-23 03:29:18 +00:00
Nick Lewycky	9d1ee635e3	Reapply r88830 with a bugfix: this transform only applies to icmp eq/ne. This fixes part of PR5438. llvm-svn: 89639	2009-11-23 03:17:33 +00:00
Eric Christopher	455f4d8400	Add more optimizations for object size checking, enable handling of object size intrinsic and verify return type is correct. Collect various code in one place. llvm-svn: 89523	2009-11-21 01:01:30 +00:00
Dan Gohman	d115107ef2	Make Loop::getLoopLatch() work on loops which don't have preheaders, as it may be used in contexts where preheader insertion may have failed due to an indirectbr. Make LoopSimplify's LoopSimplify::SeparateNestedLoop properly fail in the case that it would require splitting an indirectbr edge. These fix PR5502. llvm-svn: 89484	2009-11-20 20:51:18 +00:00
Dan Gohman	94cca19d9d	Fix IPSCCP's code for deleting dead blocks to tolerate outstanding blockaddress users. This fixes PR5569. llvm-svn: 89483	2009-11-20 20:19:14 +00:00
Daniel Dunbar	09a7f92b02	Revert "Add some rough optimizations for checking routines.", it buildeth not. llvm-svn: 89482	2009-11-20 20:17:30 +00:00
Eric Christopher	61485dfd00	Add some rough optimizations for checking routines. llvm-svn: 89479	2009-11-20 19:57:37 +00:00
Duncan Sands	5f5ec2a6ec	Fix PR5563, an expensive checks failure when running on tests/Transforms/InstCombine/shufflemask-undef.ll. If anyone cares, the use of 2*e here (and the equivalent all over the place in instcombine) seems wrong, though harmless: it should really be twice the length of the input vector. I think shufflevector used to require that the mask have the same length as the input, but I don't think that's true any more. I don't care enough about vectors to do anything about this... llvm-svn: 89456	2009-11-20 13:19:51 +00:00
Dan Gohman	44ddc50043	Extend CaptureTracking to indicate when a value is never stored, even if it is not ultimately captured. Teach BasicAliasAnalysis that a local object address which does not escape and is never stored does not alias with a value resulting from a load. llvm-svn: 89398	2009-11-19 21:57:48 +00:00
Dan Gohman	026230b0a9	Enable hoisting of loads from constant memory by default. In cases where they are lowered to instruction sequences more complex than a simple load, such that CodeGen cannot rematerialize them, a reload from a spill slot is likely to be cheaper than the complex sequence. llvm-svn: 89374	2009-11-19 19:00:10 +00:00
Jim Grosbach	f5f954f888	Eliminate duplicate phi nodes in loops. Loop rotation, for example, can introduce these, and it's beneficial to later passes to clean them up. llvm-svn: 89298	2009-11-19 02:03:18 +00:00
Jim Grosbach	5787c6eb88	Make EliminateDuplicatePHINodes() available as a utility function llvm-svn: 89297	2009-11-19 02:02:10 +00:00
Jim Grosbach	2853c74dc9	grammar llvm-svn: 89145	2009-11-17 21:37:04 +00:00
Jim Grosbach	1aa8f6c5c7	80-column violations llvm-svn: 89123	2009-11-17 19:05:35 +00:00
Evan Cheng	aaa58b7653	Generalize OptimizeLoopTermCond to optimize more loop terminating icmp to use postinc iv. llvm-svn: 89116	2009-11-17 18:10:11 +00:00
Jim Grosbach	5851dbf184	Remove trailing whitespace llvm-svn: 89110	2009-11-17 17:53:56 +00:00
Devang Patel	04a40c8ff7	Remove debug info attached with an instruction. llvm-svn: 89016	2009-11-17 00:47:06 +00:00
David Greene	47e8728c22	Fix an expensive-checks error. The Mask and LHSMask may not be of the same size, so don't do the transformation if they're different. llvm-svn: 88972	2009-11-16 21:52:23 +00:00
Duncan Sands	03e15012ed	CreateIntCast takes an "isSigned" parameter. Pass "true" for it, rather than a name. llvm-svn: 88908	2009-11-16 12:32:28 +00:00
Chris Lattner	15cd19dddb	make PRE of loads preserve the alignment of the moved load instruction. llvm-svn: 88865	2009-11-15 19:58:31 +00:00
Chris Lattner	59f69de88f	fix a bug handling 'not x' when x is undef. llvm-svn: 88864	2009-11-15 19:57:43 +00:00
Nick Lewycky	f05946faff	Revert r88830 and r88831 which appear to have caused a selfhost buildbot some grief. I suspect this patch merely exposed a bug else. llvm-svn: 88841	2009-11-15 07:47:32 +00:00
Nick Lewycky	14a2122db3	Teach instcombine to look for booleans in wider integers when it encounters a zext(icmp). It may be able to optimize that away. This fixes one of the cases in PR5438. llvm-svn: 88830	2009-11-15 05:55:17 +00:00
Nick Lewycky	316e082216	Remove LLVMContext from reassociate. It was threaded through every function but ultimately never used. llvm-svn: 88763	2009-11-14 07:25:54 +00:00
Dan Gohman	406baaac43	Add an option for running GVN with redundant load processing disabled. llvm-svn: 88742	2009-11-14 02:27:51 +00:00
Owen Anderson	81f2ff1d61	Re-enable this code, since redundant PHIs are now being better nuked. llvm-svn: 87042	2009-11-12 23:22:41 +00:00
Chris Lattner	9a1f2aaff0	use isInstructionTriviallyDead, as pointed out by Duncan llvm-svn: 87035	2009-11-12 21:58:18 +00:00
Chris Lattner	00a9240c9c	implement a nice little efficiency hack in the inliner. Since we're now running IPSCCP early, and we run functionattrs interlaced with the inliner, we often (particularly for small or noop functions) completely propagate all of the information about a call to its call site in IPSSCP (making a call dead) and functionattrs is smart enough to realize that the function is readonly (because it is interlaced with inliner). To improve compile time and make the inliner threshold more accurate, realize that we don't have to inline dead readonly function calls. Instead, just delete the call. This happens all the time for C++ codes, here are some counters from opt/llvm-ld counting the number of times calls were deleted vs inlined on various apps: Tramp3d opt: 5033 inline - Number of call sites deleted, not inlined 24596 inline - Number of functions inlined llvm-ld: 667 inline - Number of functions deleted because all callers found 699 inline - Number of functions inlined 483.xalancbmk opt: 8096 inline - Number of call sites deleted, not inlined 62528 inline - Number of functions inlined llvm-ld: 217 inline - Number of allocas merged together 2158 inline - Number of functions inlined 471.omnetpp: 331 inline - Number of call sites deleted, not inlined 8981 inline - Number of functions inlined llvm-ld: 171 inline - Number of functions deleted because all callers found 629 inline - Number of functions inlined Deleting a call is much faster than inlining it, and is insensitive to the size of the callee. :) llvm-svn: 86975	2009-11-12 07:56:08 +00:00
Evan Cheng	b0a193db31	- Teach LSR to avoid changing cmp iv stride if it will create an immediate that cannot be folded into target cmp instruction. - Avoid a phase ordering issue where early cmp optimization would prevent the later count-to-zero optimization. - Add missing checks which could cause LSR to reuse stride that does not have users. - Fix a bug in count-to-zero optimization code which failed to find the pre-inc iv's phi node. - Remove, tighten, loosen some incorrect checks disable valid transformations. - Quite a bit of code clean up. llvm-svn: 86969	2009-11-12 07:35:05 +00:00
Chris Lattner	01fddcec53	use getPredicateOnEdge to fold comparisons through PHI nodes, which implements GCC PR18046. This also gets us 360 more jump threads on 176.gcc. llvm-svn: 86953	2009-11-12 05:24:05 +00:00
Chris Lattner	3e63fb7318	various fixes to the lattice transfer functions. llvm-svn: 86952	2009-11-12 04:57:13 +00:00
Chris Lattner	c1619b4fe9	switch jump threading to use getPredicateOnEdge in one place making the new LVI stuff smart enough to subsume some special cases in the old code. Disable them when LVI is around, the testcase still passes. llvm-svn: 86951	2009-11-12 04:37:50 +00:00
Daniel Dunbar	e01ea92e28	Add the braces gcc suggested. llvm-svn: 86933	2009-11-12 02:52:56 +00:00
Chris Lattner	68f3b53ddc	with the new code we can thread non-instruction values. This allows us to handle the test10 testcase. llvm-svn: 86924	2009-11-12 01:41:34 +00:00
Chris Lattner	ea8b237a74	this argument can be an arbitrary value, it doesn't need to be an instruction. llvm-svn: 86923	2009-11-12 01:37:43 +00:00
Chris Lattner	b5bb115ece	expose edge information and switch j-t to use it. llvm-svn: 86920	2009-11-12 01:29:10 +00:00
Chris Lattner	73b7ed2d9c	pass TD into a SimplifyCmpInst call. Add another case that uses LVI info when -enable-jump-threading-lvi is passed. llvm-svn: 86886	2009-11-11 22:31:38 +00:00
Duncan Sands	f0d9823d0b	Don't trivially delete unused calls to llvm.invariant.start. This allows llvm.invariant.start to be used without necessarily being paired with a call to llvm.invariant.end. If you run the entire optimization pipeline then such calls are in fact deleted (adce does it), but that's actually a good thing since we probably do want them to be zapped late in the game. There should really be an integration test that checks that the llvm.invariant.start call lasts long enough that all passes that do interesting things with it get to do their stuff before it is deleted. But since no passes do anything interesting with it yet this will have to wait for later. llvm-svn: 86840	2009-11-11 15:34:13 +00:00
Chris Lattner	36009e416c	remove the now dead condprop pass, PR3906. llvm-svn: 86810	2009-11-11 05:56:35 +00:00
Chris Lattner	b45381c3f0	stub out some LazyValueInfo interfaces, and have JumpThreading start using them in a trivial way when -enable-jump-threading-lvi is passed. enable-jump-threading-lvi will be my playground for awhile. llvm-svn: 86789	2009-11-11 02:08:33 +00:00
Chris Lattner	c1709a798a	add a fixme llvm-svn: 86766	2009-11-11 00:21:58 +00:00
Evan Cheng	ea76ec6720	Block terminator may be a switch. llvm-svn: 86761	2009-11-11 00:00:21 +00:00
Devang Patel	5c983cb2ab	Implement support to debug inlined functions. llvm-svn: 86748	2009-11-10 23:06:00 +00:00
Chris Lattner	f66a81aecd	implement a TODO by teaching jump threading about "xor x, 1". llvm-svn: 86739	2009-11-10 22:39:16 +00:00
Chris Lattner	ec4264fbb0	move some generally useful functions out of jump threading into libanalysis and transformutils. llvm-svn: 86735	2009-11-10 22:26:15 +00:00
Chris Lattner	a163be92fc	fix a crash in SCCP handling extractvalue of an array, pointed out and tracked down by Stephan Reiter! llvm-svn: 86726	2009-11-10 22:02:09 +00:00
Chris Lattner	f48b199c43	improve comment. llvm-svn: 86723	2009-11-10 21:45:09 +00:00
Chris Lattner	fca84b3dff	Make jump threading eliminate blocks that just contain phi nodes, debug intrinsics, and an unconditional branch when possible. This reuses the TryToSimplifyUncondBranchFromEmptyBlock function split out of simplifycfg. llvm-svn: 86722	2009-11-10 21:40:01 +00:00
Evan Cheng	f5e85bec73	Generalize lsr code that optimize loop to count down towards zero. llvm-svn: 86715	2009-11-10 21:14:05 +00:00
Duncan Sands	1053bb18c6	Add defensive break. llvm-svn: 86705	2009-11-10 19:36:40 +00:00
Duncan Sands	bfba3451b2	Fix obvious typo. llvm-svn: 86694	2009-11-10 18:21:37 +00:00
Chris Lattner	dc0722e39a	clarify logic. llvm-svn: 86689	2009-11-10 17:00:47 +00:00
Duncan Sands	732a2ed037	Teach DSE to eliminate useless trampolines. llvm-svn: 86683	2009-11-10 13:49:50 +00:00
Duncan Sands	a25c87ef1f	Add brackets to make gcc-4.4 happy. llvm-svn: 86681	2009-11-10 09:32:10 +00:00
Victor Hernandez	3c98070f2c	Update computeArraySize() to use ComputeMultiple() to determine the array size associated with a malloc; also extend PerformHeapAllocSRoA() to check if the optimized malloc's arg had its highest bit set, so that it is safe for ComputeMultiple() to look through sext instructions while determining the optimized malloc's array size llvm-svn: 86676	2009-11-10 08:32:25 +00:00
Chris Lattner	562cc40dbb	unify the code that determines whether it is a good idea to change the type of a computation. This fixes some infinite loops when dealing with TD that has no native types. llvm-svn: 86670	2009-11-10 07:23:37 +00:00
Nick Lewycky	4939d449e1	Simplify. llvm-svn: 86668	2009-11-10 07:00:43 +00:00
Nick Lewycky	f6be02e523	Reapply r86359, "Teach dead store elimination that certain intrinsics write to memory just like a store" with bug fixed (partial-overwrite.ll is the regression test). llvm-svn: 86667	2009-11-10 06:46:40 +00:00
Chris Lattner	9428f34d89	refactor TryToSimplifyUncondBranchFromEmptyBlock out of SimplifyCFG. llvm-svn: 86666	2009-11-10 05:59:26 +00:00
Oscar Fuentes	ce417d0ac2	CMake: Support for building llvm loadable modules. llvm-svn: 86656	2009-11-10 02:45:37 +00:00
Chris Lattner	f3fc70a936	make jump threading recursively simplify expressions instead of doing it just one level deep. On the testcase we go from getting this: F1: ; preds = %T2 %F = and i1 true, %cond ; <i1> [#uses=1] br i1 %F, label %X, label %Y to a fully threaded: F1: ; preds = %T2 br label %Y This changes gets us to the point where we're forming (too many) switch instructions on doug's strswitch testcase. llvm-svn: 86646	2009-11-10 01:57:31 +00:00
Chris Lattner	a087a1ca04	don't invalidate PN, rewrite of this code is in progress anyway. llvm-svn: 86639	2009-11-10 01:19:06 +00:00
Chris Lattner	a279728372	add a new SimplifyInstruction API, which is like ConstantFoldInstruction, except that the result may not be a constant. Switch jump threading to use it so that it gets things like (X & 0) -> 0, which occur when phi preds are deleted and the remaining phi pred was a zero. llvm-svn: 86637	2009-11-10 01:08:51 +00:00
Jeffrey Yasskin	23ac706aab	Fix DenseMap iterator constness. This patch forbids implicit conversion of DenseMap::const_iterator to DenseMap::iterator which was possible because DenseMapIterator inherited (publicly) from DenseMapConstIterator. Conversion the other way around is now allowed as one may expect. The template DenseMapConstIterator is removed and the template parameter IsConst which specifies whether the iterator is constant is added to DenseMapIterator. Actually IsConst parameter is not necessary since the constness can be determined from KeyT but this is not relevant to the fix and can be addressed later. Patch by Victor Zverovich! llvm-svn: 86636	2009-11-10 01:02:17 +00:00
Chris Lattner	3730cf6fef	factor simplification logic for AND and OR out to InstSimplify from instcombine. llvm-svn: 86635	2009-11-10 00:55:12 +00:00
Chris Lattner	9941f27797	pull a bunch of logic out of instcombine into instsimplify for compare simplification, this handles the foldable fcmp x,x cases among many others. llvm-svn: 86627	2009-11-09 23:55:12 +00:00
Chris Lattner	9aa69f2205	inline a simple function. llvm-svn: 86625	2009-11-09 23:31:49 +00:00
Chris Lattner	25700676d4	rename SimplifyCompare -> SimplifyCmpInst and split it into Simplify[IF]Cmp pieces. Add some predicates to CmpInst to determine whether a predicate is fp or int. llvm-svn: 86624	2009-11-09 23:28:39 +00:00
Chris Lattner	131172dc76	fix ConstantFoldCompareInstOperands to take the LHS/RHS as individual operands instead of taking a temporary array llvm-svn: 86619	2009-11-09 23:06:58 +00:00
Chris Lattner	29db7d6abe	use instructionsimplify instead of a weak clone of ad-hoc folding stuff. llvm-svn: 86616	2009-11-09 23:00:14 +00:00
Chris Lattner	8cb0f89bbd	stub out a new form of BasicBlock::RemovePredecessorAndSimplify which simplifies instruction users of PHIs when the phi is eliminated. This will be moved to transforms/utils after some other refactoring. llvm-svn: 86603	2009-11-09 22:32:36 +00:00
Dan Gohman	7010cad26a	Fix a comment in a typo that Duncan noticed. llvm-svn: 86575	2009-11-09 18:59:22 +00:00
Dan Gohman	457b8bad4e	Generalize LCSSA to handle loops with exits with predecessors outside the loop. This is needed because with indirectbr it may not be possible for LoopSimplify to guarantee that all loop exit predecessors are inside the loop. This fixes PR5437. LCCSA no longer actually requires LoopSimplify form, but for now it must still have the dependency because the PassManager doesn't know how to schedule LoopSimplify otherwise. llvm-svn: 86569	2009-11-09 18:28:24 +00:00
Chris Lattner	f2b3c795fd	if a 'with overflow' intrinsic just has the normal result used, simplify it to a normal binop. Patch by Alastair Lynn, testcase by me. llvm-svn: 86524	2009-11-09 07:07:56 +00:00
Chris Lattner	aaa7706cd5	fix PR5104: when printing a single character, return the result of putchar in case there is an error. llvm-svn: 86515	2009-11-09 04:57:04 +00:00
Chris Lattner	5a3a41a757	enhance PHI slicing to handle the case when a slicable PHI is begin used by a chain of other PHIs. llvm-svn: 86503	2009-11-09 01:38:00 +00:00
Owen Anderson	bbbb62a090	Small cleanups. llvm-svn: 86499	2009-11-09 00:48:15 +00:00
Owen Anderson	7ac0e198c3	Revert my previous patch to ABCD and fix things the right way. There are two problems addressed here: 1) We need to avoid processing sigma nodes as phi nodes for constraint generation. 2) We need to generate constraints for comparisons against constants properly. This includes our first working ABCD test! llvm-svn: 86498	2009-11-09 00:44:44 +00:00
Chris Lattner	fe6b17a13a	comment typos pointed out by Duncan llvm-svn: 86497	2009-11-09 00:41:49 +00:00
Owen Anderson	045615edfb	Fix an issue where the ordering of blocks within a function could lead to different constraint graphs being produced. The cause was that we were incorrectly marking sigma instructions as processed after handling the sigma-specific constraints for them, potentially neglecting to process them as normal instructions as well. Unfortunately, the testcase that inspired this still doesn't work because of a bug in the solver, which is next on the list to debug. llvm-svn: 86486	2009-11-08 22:36:55 +00:00
Chris Lattner	6c67b00026	Teach an instcombine to not pull trunc instructions through PHI nodes when both the source and dest are illegal types, since it would cause the phi to grow (for example, we shouldn't transform test14b's phi to a phi on i320). This fixes an infinite loop on i686 bootstrap with phi slicing turned on, so turn it back on. llvm-svn: 86483	2009-11-08 21:20:06 +00:00
Chris Lattner	11b6e3c1eb	reapply r8644[3-5] with only the scary part (SliceUpIllegalIntegerPHI) disabled. llvm-svn: 86480	2009-11-08 19:23:30 +00:00
Daniel Dunbar	1543f2c26f	Speculatively revert r8644[3-5], they seem to be leading to infinite loops in llvm-gcc bootstrap. llvm-svn: 86478	2009-11-08 17:52:47 +00:00
Chris Lattner	cddc8aa1b8	teach a couple of instcombine transformations involving PHIs to not turn a PHI in a legal type into a PHI of an illegal type, and add a new optimization that breaks up insane integer PHI nodes into small pieces (PR3451). llvm-svn: 86443	2009-11-08 08:21:13 +00:00
Nick Lewycky	2b3ac2b1a7	Improve tail call elimination to handle the switch statement. llvm-svn: 86403	2009-11-07 21:10:15 +00:00
Chris Lattner	c6bb31e5ea	make instcombine only rewrite a chain of computation (eliminating some extends) if the new type of the computation is legal or if both the source and dest are illegal. This prevents instcombine from changing big chains of computation into i64 on 32-bit targets for example. llvm-svn: 86398	2009-11-07 19:11:46 +00:00
Chris Lattner	15b00179d0	Revert r86359, it is breaking the self host on the llvm-gcc-i386-darwin9 build bot. llvm-svn: 86391	2009-11-07 17:59:32 +00:00
Nick Lewycky	80180a0497	Teach dead store elimination that certain intrinsics write to memory just like a store. llvm-svn: 86359	2009-11-07 08:34:40 +00:00
Chris Lattner	3482ad7de0	reapply 86289, 86278, 86270, 86267, 86266 & 86264 plus a fix (making pred factoring only happen if threading is guaranteed to be successful). This now survives an X86-64 bootstrap of llvm-gcc. llvm-svn: 86355	2009-11-07 08:05:03 +00:00
Nick Lewycky	f49c373c13	Oops, FunctionContainsEscapingAllocas is really used to mean two different things. Back out part of r86349 for a moment. llvm-svn: 86353	2009-11-07 07:42:38 +00:00
Nick Lewycky	a2b0965613	Dust off tail recursion elimination. Fix a fixme by applying CaptureTracking and add a .ll to demo the new capability. llvm-svn: 86349	2009-11-07 07:10:01 +00:00
Devang Patel	84b2af870e	Revert following patches to fix llvmgcc bootstrap. 86289, 86278, 86270, 86267, 86266 & 86264 Chris, please take a look. llvm-svn: 86321	2009-11-07 01:32:59 +00:00
Victor Hernandez	4613fc8629	- new SROA mallocs should have the mallocs running-or'ed, not the malloc's bitcast - fix ProcessInternalGlobal() debug output llvm-svn: 86317	2009-11-07 00:41:19 +00:00
Jeffrey Yasskin	66008446d5	Avoid "ambiguous 'else'" warning from gcc. llvm-svn: 86314	2009-11-07 00:26:47 +00:00
Victor Hernandez	8736a8fca4	Re-commit r86077 now that r86290 fixes the 179.art and 175.vpr ARM regressions. Here is the original commit message: This commit updates malloc optimizations to operate on malloc calls that have constant int size arguments. Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86311	2009-11-07 00:16:28 +00:00
Chris Lattner	411461dd99	Fix a bug where we'd call SplitBlockPredecessors with a pred in the set only once even if it has multiple edges to BB. llvm-svn: 86299	2009-11-06 23:19:58 +00:00
Eli Friedman	98daf610fb	Remove function left over from other jump threading cleanup. llvm-svn: 86289	2009-11-06 21:24:57 +00:00
Chris Lattner	93a3e74486	Fix a problem discovered on self host. llvm-svn: 86278	2009-11-06 19:21:48 +00:00
Chris Lattner	17dc911808	remove more code subsumed by r86264 llvm-svn: 86270	2009-11-06 18:24:32 +00:00
Chris Lattner	8b677402d3	eliminate some more code subsumed by r86264 llvm-svn: 86267	2009-11-06 18:22:54 +00:00
Chris Lattner	79534a05af	remove now redundant code, r86264 handles this case. llvm-svn: 86266	2009-11-06 18:20:58 +00:00
Chris Lattner	b688868418	Extend jump threading to support much more general threading predicates. This allows us to jump thread things like: _ZN12StringSwitchI5ColorE4CaseILj7EEERS1_RAT__KcRKS0_.exit119: %tmp1.i24166 = phi i8 [ 1, %bb5.i117 ], [ %tmp1.i24165, %_Z....exit ], [ %tmp1.i24165, %bb4.i114 ] %toBoolnot.i87 = icmp eq i8 %tmp1.i24166, 0 ; <i1> [#uses=1] %tmp4.i90 = icmp eq i32 %tmp2.i, 6 ; <i1> [#uses=1] %or.cond173 = and i1 %toBoolnot.i87, %tmp4.i90 ; <i1> [#uses=1] br i1 %or.cond173, label %bb4.i96, label %_ZN12... Where it is "obvious" that when coming from %bb5.i117 that the 'and' is always false. This triggers a surprisingly high number of times in the testsuite, and gets us closer to generating good code for doug's strswitch testcase. This also make a bunch of other code in jump threading redundant, I'll rip out in the next patch. This survived an enable-checking llvm-gcc bootstrap. llvm-svn: 86264	2009-11-06 18:15:14 +00:00
Chris Lattner	6a31c1141c	remove some more Context arguments. llvm-svn: 86235	2009-11-06 05:59:53 +00:00
Chris Lattner	903ae55e1c	remove a bunch of extraneous LLVMContext arguments from various APIs, addressing PR5325. llvm-svn: 86231	2009-11-06 04:27:31 +00:00
Victor Hernandez	a5a12cd62e	Revert r86077 because it caused crashes in 179.art and 175.vpr on ARM llvm-svn: 86213	2009-11-06 01:33:24 +00:00
Dan Gohman	54aa68b309	Teach LSR to avoid calling SplitCriticalEdge on edges with indirectbr. llvm-svn: 86193	2009-11-05 23:34:59 +00:00
Dan Gohman	e93b2dbc0f	Avoid calling getUniqueExitBlocks from within LoopSimplify, as it depends on loops having dedicated exits, which LoopSimplify can no longer always guarantee. llvm-svn: 86181	2009-11-05 21:48:32 +00:00
Dan Gohman	6a3eefbfeb	LoopDeletion depends on loops having dedicated exits. llvm-svn: 86180	2009-11-05 21:47:04 +00:00
Dan Gohman	f65735f0c5	The introduction of indirectbr meant the introduction of unsplittable critical edges, which means the introduction of loops which cannot be transformed to LoopSimplify form. Fix LoopSimplify to avoid transforming such loops into invalid code. llvm-svn: 86176	2009-11-05 21:14:46 +00:00
Dan Gohman	3c155aa3cd	Update various Loop optimization passes to cope with the possibility that LoopSimplify form may not be available. llvm-svn: 86175	2009-11-05 21:11:53 +00:00
Dan Gohman	cf44615b89	Teach LoopUnroll how to bail if LoopSimplify can't give it what it needs. llvm-svn: 86164	2009-11-05 19:44:06 +00:00
Dan Gohman	0c3b2f8419	Call getAnalysis<LoopInfo> the normal way, instead of asking passed-in LoopPassManager for it. llvm-svn: 86163	2009-11-05 19:43:25 +00:00
Dan Gohman	a25cd27300	Delete an unused member variable. llvm-svn: 86160	2009-11-05 19:33:15 +00:00
Dan Gohman	7e9c38c364	Add an assertion to catch indirectbr in SplitBlockPredecessors. This makes several optimization passes abort in cases where they're currently silently miscompiling code. Remove the indirectbr assertion from SplitEdge. Indirectbr is only a problem for critical edges, and SplitEdge defers to SplitCriticalEdge to handle those, and SplitCriticalEdge has its own assertion for indirectbr. llvm-svn: 86147	2009-11-05 18:25:44 +00:00
Benjamin Kramer	a38019a3de	Teach SimplifyLibCalls to fold memcmp calls with constant arguments. llvm-svn: 86141	2009-11-05 17:44:22 +00:00
Benjamin Kramer	f10e3edefb	Do map insert+find in one step. TODO -= 2. llvm-svn: 86133	2009-11-05 14:33:27 +00:00
Victor Hernandez	21ec158c23	Update CreateMalloc so that its callers specify the size to allocate: MallocInst-autoupgrade users use non-TargetData-computed allocation sizes. Optimization uses use TargetData to compute the allocation size. Now that malloc calls can have constant sizes, update isArrayMallocHelper() to use TargetData to determine the size of the malloced type and the size of malloced arrays. Extend getMallocType() to support malloc calls that have non-bitcast uses. Update OptimizeGlobalAddressOfMalloc() to optimize malloc calls that have non-bitcast uses. The bitcast use of a malloc call has to be treated specially here because the uses of the bitcast need to be replaced and the bitcast needs to be erased (just like the malloc call) for OptimizeGlobalAddressOfMalloc() to work correctly. Update PerformHeapAllocSRoA() to optimize malloc calls that have non-bitcast uses. The bitcast use of the malloc is not handled specially here because ReplaceUsesOfMallocWithGlobal replaces through the bitcast use. Update OptimizeOnceStoredGlobal() to not care about the malloc calls' bitcast use. Update all globalopt malloc tests to not rely on autoupgraded-MallocInsts, but instead use explicit malloc calls with correct allocation sizes. llvm-svn: 86077	2009-11-05 00:03:03 +00:00
Chris Lattner	a003aee613	improve DSE when TargetData is not around, based on work by Hans Wennborg! llvm-svn: 86067	2009-11-04 23:20:12 +00:00
Chris Lattner	4864198071	Fix an iterator invalidation bug that happens when a hashtable resizes in IPSCCP. This fixes PR5394. llvm-svn: 86036	2009-11-04 18:57:42 +00:00
Chris Lattner	d8ab8fbe9f	move two functions up higher in the file. Delete a useless argument to EmitGEPOffset. Implement some new transforms for optimizing subtracts of two pointer to ints into the same vector. This happens for C++ iterator idioms for example, stringmap takes a const char* that points to the start and end of a string. Once inlined, we want the pointer difference to turn back into a length. This is rdar://7362831. llvm-svn: 86021	2009-11-04 08:05:20 +00:00
Chris Lattner	b634c6cdb5	reimplement multiple return value handling in IPSCCP, making it more aggressive an correct. This survives building llvm in 64-bit mode with optimizations and the built llvm passes make check. llvm-svn: 85973	2009-11-03 23:40:48 +00:00
Chris Lattner	10c341e8fa	finish half thunk thought llvm-svn: 85937	2009-11-03 20:52:57 +00:00
Chris Lattner	2aa5962af6	fix an IPSCCP bug I introduced when I changed IPSCCP to start working on functions that don't have local linkage. Basically, we need to be more careful about propagating argument information to functions whose results we aren't tracking. This fixes a miscompilation of LLVMCConfigurationEmitter.cpp when built with an llvm-gcc that has ipsccp enabled. llvm-svn: 85923	2009-11-03 19:24:51 +00:00
Chris Lattner	bfe755788f	fix a subtle bug I introduced when refactoring SCCP. Testcase to follow. llvm-svn: 85903	2009-11-03 16:50:11 +00:00
Benjamin Kramer	c6b59b889b	Eliminate some temporaries. llvm-svn: 85896	2009-11-03 12:52:50 +00:00
Chris Lattner	9d0f925cf0	remove a isFreeCall check: it is a callinst that can write to memory already. llvm-svn: 85863	2009-11-03 05:33:46 +00:00
Ted Kremenek	cd7ab8bfa0	Alphabetize. llvm-svn: 85859	2009-11-03 04:01:53 +00:00
Chris Lattner	b1dfdadabd	turn IPSCCP back on now that the iterator invalidation bug is fixed. llvm-svn: 85858	2009-11-03 03:42:51 +00:00
Chris Lattner	00c9eb665d	fix a nasty iterator invalidation bug from my conversion from std::map to DenseMap, exposed on release llvm-gcc bootstrap. llvm-svn: 85840	2009-11-02 23:25:39 +00:00
Chris Lattner	6f515d4ba8	revert r8579[56], which are causing unhappiness in buildbot land. llvm-svn: 85818	2009-11-02 19:31:10 +00:00
Chris Lattner	a1776913ab	disable IPSCCP support for multiple return values, it is buggy, so just disable it until I can fix it. llvm-svn: 85810	2009-11-02 18:22:51 +00:00
Chris Lattner	f1afb57935	improve IPSCCP to be able to propagate the result of "!mayBeOverridden" function to calls of that function, regardless of whether it has local linkage or has its address taken. Not escaping should only affect whether we make an aggressive assumption about the arguments to a function, not whether we can track the result of it. llvm-svn: 85795	2009-11-02 07:33:59 +00:00
Chris Lattner	c9279a9b4d	don't mark the arguments of prototype overdefined, they will never be queried. llvm-svn: 85793	2009-11-02 06:34:04 +00:00
Chris Lattner	9c7e443b07	restore some code I removed in r85788, refactor it into a shared place instead of duplicating it 4 times. llvm-svn: 85792	2009-11-02 06:28:16 +00:00
Chris Lattner	23f5603692	remove some confused code that dates from when we had "multiple return values" but not "first class aggregates" llvm-svn: 85791	2009-11-02 06:17:06 +00:00
Chris Lattner	c670f866ce	avoid redundant lookups in BBExecutable, and make it a SmallPtrSet. llvm-svn: 85790	2009-11-02 06:11:23 +00:00
Chris Lattner	16b825e51f	Use the libanalysis 'ConstantFoldLoadFromConstPtr' function instead of reinventing SCCP-specific logic. This gives us new powers. llvm-svn: 85789	2009-11-02 06:06:14 +00:00
Chris Lattner	32acebedf7	switch the main 'ValueState' map from being an std::map to being a DenseMap. Doing this required being aware of subtle iterator invalidation issues, but it provides a big speedup. In a release-asserts build, this sped up optimizing 403.gcc from 1.34s -> 0.79s (IPSCCP) and 1.11s -> 0.44s (SCCP). This commit also conflates in a bunch of general cleanups, sorry. llvm-svn: 85788	2009-11-02 05:55:40 +00:00
Chris Lattner	9fc809ca55	fix a bug exposed by moving SRoA earlier which caused a crash building kc++ llvm-svn: 85786	2009-11-02 04:37:17 +00:00
Chris Lattner	7e74831e52	only IPSCCP incoming arguments if the function is executable, this fixes an assertion on the buildbot. llvm-svn: 85784	2009-11-02 03:25:55 +00:00
Chris Lattner	0a8c553eb2	add a new ValueState::getConstantInt() helper, use it to simplify some code. llvm-svn: 85783	2009-11-02 03:21:36 +00:00
Chris Lattner	4372b58329	tidy up some more: remove some extraneous inline specifiers, return harder. llvm-svn: 85780	2009-11-02 03:03:42 +00:00
Chris Lattner	1c8cb53667	eliminate the SCCPSolver::getValueMapping method. llvm-svn: 85778	2009-11-02 02:54:24 +00:00
Chris Lattner	f02562df08	fix failures introduced in r85774 llvm-svn: 85777	2009-11-02 02:48:17 +00:00
Chris Lattner	d4f286cb0c	factor duplicated code into a new DeleteInstructionInBlock function, eliminate temporary (and pointless) smallvector. llvm-svn: 85776	2009-11-02 02:47:51 +00:00
Chris Lattner	a806b18c99	Chris used to use '...' instead of proper grammar. llvm-svn: 85775	2009-11-02 02:33:50 +00:00
Chris Lattner	37dc1cb0fb	remove some extraneous llvmcontext stuff. llvm-svn: 85774	2009-11-02 02:30:06 +00:00

... 3 4 5 6 7 ...

6292 Commits