llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Rafael Espindola	f5dbddfa0e	Use the new range metadata in computeMaskedBits and add a new optimization to instruction simplify that lets us remove an and when loading a boolean value. llvm-svn: 153423	2012-03-26 01:44:11 +00:00
Chandler Carruth	276dad7263	Teach instsimplify how to simplify comparisons of pointers which are constant-offsets of a common base using the generic GEP-walking logic I added for computing pointer differences in the same situation. llvm-svn: 153419	2012-03-25 21:28:14 +00:00
Chandler Carruth	6c38813ec1	Switch the pointer-difference simplification logic to only work with inbounds GEPs. This isn't really necessary for simplifying pointer differences, but I'm planning to re-use the same code to simplify pointer comparisons where it is necessary. Since real code almost exclusively uses inbounds GEPs, it doesn't seem worth it to support the extra complexity of turning it on and off. If anyone would like that back, feel free to shout. Note that instcombine will still catch any of these patterns. llvm-svn: 153418	2012-03-25 20:43:07 +00:00
Chandler Carruth	fc1ee5b5d6	Teach the function cloner (and thus the inliner) to simplify PHINodes aggressively. There are lots of dire warnings about this being expensive that seem to predate switching to the TrackingVH-based value remapper that is automatically updated on RAUW. This makes it easy to not just prune single-entry PHIs, but to fully simplify PHIs, and to recursively simplify the newly inlined code to propagate PHINode simplifications. This introduces a bit of a thorny problem though. We may end up simplifying a branch condition to a constant when we fold PHINodes, and we would like to nuke any dead blocks resulting from this so that time isn't wasted continually analyzing them, but this isn't easy. Deleting basic blocks after they are fully cloned and mapped into the new function currently requires manually updating the value map. The last piece of the simplification-during-inlining puzzle will require either switching to WeakVH mappings or some other piece of refactoring. I've left a FIXME in the testcase about this. llvm-svn: 153410	2012-03-25 10:34:54 +00:00
Eli Bendersky	3ef88c1833	Continue cleanup of LIT, getting rid of the remaining artifacts from dejagnu * Removed test/lib/llvm.exp - it is no longer needed * Deleted the dg.exp reading code from test/lit.cfg. There are no dg.exp files left in the test suite so this code is no longer required. test/lit.cfg is now much shorter and clearer * Removed a lot of duplicate code in lit.local.cfg files that need access to the root configuration, by adding a "root" attribute to the TestingConfig object. This attribute is dynamically computed to provide the same information as was previously provided by the custom getRoot functions. * Documented the config.root attribute in docs/CommandGuide/lit.pod llvm-svn: 153408	2012-03-25 09:02:19 +00:00
Chandler Carruth	2a69d3eac5	Move the instruction simplification of callsite arguments in the inliner to instead rely on much more generic and powerful instruction simplification in the function cloner (and thus inliner). This teaches the pruning function cloner to use instsimplify rather than just the constant folder to fold values during cloning. This can simplify a large number of things that constant folding alone cannot begin to touch. For example, it will realize that 'or' and 'and' instructions with certain constant operands actually become constants regardless of what their other operand is. It also can thread back through the caller to perform simplifications that are only possible by looking up a few levels. In particular, GEPs and pointer testing tend to fold much more heavily with this change. This should (in some cases) have a positive impact on compile times with optimizations on because the inliner itself will simply avoid cloning a great deal of code. It already attempted to prune proven-dead code, but now it will use the stronger simplifications to prove more code dead. llvm-svn: 153403	2012-03-25 04:03:40 +00:00
Chandler Carruth	c626d97320	FileCheck-ize this test. Note the FIXME I've introduced here: we've regressed seriously here, we are no longer removing allocas during inline cleanup. This appears to be because of lifetime markers "using" them. =/ I'll look into this shortly. llvm-svn: 153394	2012-03-24 21:24:19 +00:00
Dan Gohman	ef28237798	Don't convert objc_retainAutoreleasedReturnValue to objc_retain if it is retaining the return value of an invoke that it immediately follows. llvm-svn: 153344	2012-03-23 18:09:00 +00:00
Dan Gohman	0c02608af1	It's not possible to insert code immediately after an invoke in the same basic block, and it's not safe to insert code in the successor blocks if the edges are critical edges. Splitting those edges is possible, but undesirable, especially on the unwind side. Instead, make the bottom-up code motion consider invokes to be part of their successor blocks, rather than part of their parent blocks, so that it doesn't push code past them and onto the edges. This fixes PR12307. llvm-svn: 153343	2012-03-23 17:47:54 +00:00
Andrew Trick	bc649f32f0	Convert -indvars tests that rely on SCEV expansion to -loop-reduce tests. llvm-svn: 153259	2012-03-22 17:10:07 +00:00
Andrew Trick	c9948561ef	Remove tests: indvars trivially preserves GEPs now. llvm-svn: 153258	2012-03-22 17:09:46 +00:00
Andrew Trick	f1a4ea97fc	Remove test: trivial canonical IV test which is covered by other SCEV tests. llvm-svn: 153257	2012-03-22 17:09:34 +00:00
Andrew Trick	30b64bcc90	Remove redundant -enable-iv-rewrite=false flags from test cases. llvm-svn: 153255	2012-03-22 17:09:04 +00:00
Andrew Trick	83eb659a9f	LoopSimplify bug fix. Handle indirect loop back edges. Do not call SplitBlockPredecessors on a loop preheader when one of the predecessors is an indirectbr. Otherwise, you will hit this assert: !isa<IndirectBrInst>(Preds[i]->getTerminator()) && "Cannot split an edge from an IndirectBrInst" llvm-svn: 153134	2012-03-20 21:24:52 +00:00
Andrew Trick	3dd6e9db31	LSR: teach isSimplifiedLoopNest to handle PHI IVUsers. llvm-svn: 153132	2012-03-20 21:24:44 +00:00
Andrew Trick	9995b30fe2	LSR: fix IVUsers isSimplifiedLoopNest to perform a full domtree walk instead of skipping the current loop. My prior fix was incomplete because of an overzealous compile-time optimization: Better fix for: <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 153131	2012-03-20 21:24:40 +00:00
Nick Lewycky	92a7d87ceb	Factor out the multiply analysis code in ComputeMaskedBits and apply it to the overflow checking multiply intrinsic as well. Add a test for this, updating the test from grep to FileCheck. llvm-svn: 153028	2012-03-18 23:28:48 +00:00
Bill Wendling	9343ed10c6	Revert r152907. llvm-svn: 152935	2012-03-16 18:20:54 +00:00
Bill Wendling	3c44ed8385	The pointer part of the store instruction may have an alignment. If that's the case, then we want to make sure that we don't increase the alignment of the store instruction, because if we increase it to be "more aligned" than the pointer, code-gen may use instructions which require a greater alignment than the pointer guarantees. <rdar://problem/11043589> llvm-svn: 152907	2012-03-16 07:40:08 +00:00
Chandler Carruth	e0a21944a1	Rip out support for 'llvm.noinline'. This thing has a strange history... It was added in 2007 as the first cut at supporting no-inline attributes, but we didn't have function attributes of any form at the time. However, it was added without any mention in the LangRef or other documentation. Later on, in 2008, Devang added function notes for 'inline=never' and then turned them into proper function attributes. From that point onward, as far as I can tell, the world moved on, and no one has touched 'llvm.noinline' in any meaningful way since. Its time has now come. We have had better mechanisms for doing this for a long time, all the frontends I'm aware of use them, and this is just holding back progress. Given that it was never a documented feature of the IR, I've provided no auto-upgrade support. If people know of real, in-the-wild bitcode that relies on this, yell at me and I'll add it, but I seriously doubt anyone cares. llvm-svn: 152904	2012-03-16 06:10:15 +00:00
Andrew Trick	719339e40f	LSR fix: Add isSimplifiedLoopNest to IVUsers analysis. Only record IVUsers that are dominated by simplified loop headers. Otherwise SCEVExpander will crash while looking for a preheader. I previously tried to work around this in LSR itself, but that was insufficient. This way, LSR can continue to run if some uses are not in simple loops, as long as we don't attempt to analyze those users. Fixes <rdar://problem/11049788> Segmentation fault: 11 in LoopStrengthReduce llvm-svn: 152892	2012-03-16 03:16:56 +00:00
Eli Friedman	0763584d78	In InstCombiner::visitOr, make sure we reverse the operand swap used for checking for or-of-xor operations after those checks; a later check expects that any constant will be in Op1. PR12234. llvm-svn: 152884	2012-03-16 00:52:42 +00:00
Matt Beaumont-Gay	7f3db984b3	line endings llvm-svn: 152832	2012-03-15 20:24:29 +00:00
Rafael Espindola	ac42573389	Short term fix for pr12270 before we change dominates to handle unreachable code. While here, reduce indentation. llvm-svn: 152803	2012-03-15 15:52:59 +00:00
Aaron Ballman	bf6eebde21	Fixed a transform crash when setting a negative size value for memset. Fixes PR12202. llvm-svn: 152756	2012-03-15 00:05:31 +00:00
Chandler Carruth	889ecbc0f8	Extend the inline cost calculation to account for bonuses due to correlated pairs of pointer arguments at the callsite. This is designed to recognize the common C++ idiom of begin/end pointer pairs when the end pointer is a constant offset from the begin pointer. With the C-based idiom of a pointer and size, the inline cost saw the constant size calculation, and this provides the same level of information for begin/end pairs. In order to propagate this information we have to search for candidate operations on a pair of pointer function arguments (or derived from them) which would be simplified if the pointers had a known constant offset. Then the callsite analysis looks for such pointer pairs in the argument list, and applies the appropriate bonus. This helps LLVM detect that half of bounds-checked STL algorithms (such as hash_combine_range, and some hybrid sort implementations) disappear when inlined with a constant size input. However, it's not a complete fix due to the inaccuracy of our cost metric for constants in general. I'm looking into that next. Benchmarks showed no significant code size change, and very minor performance changes. However, specific code such as hashing is showing significantly cleaner inlining decisions. llvm-svn: 152752	2012-03-14 23:19:53 +00:00
Dan Gohman	a30e1f4576	When an invoke is marked with metadata indicating its unwind edge should be ignored by ARC optimization, don't insert new ARC runtime calls in the unwind destination. llvm-svn: 152748	2012-03-14 23:05:06 +00:00
Chris Lattner	84f83c2727	enhance jump threading to preserve TBAA information when PRE'ing loads, fixing rdar://11039258, an issue that came up when inspecting clang's bootstrapped codegen. llvm-svn: 152635	2012-03-13 18:07:41 +00:00
Dan Gohman	fa43b599ac	Teach globalopt how to evaluate an invoke with a non-void return type. llvm-svn: 152634	2012-03-13 18:01:37 +00:00
Duncan Sands	60c339c405	Generalize the "trunc(ptrtoint(x)) - trunc(ptrtoint(y)) -> trunc(ptrtoint(x-y))" optimization introduced by Chandler. llvm-svn: 152626	2012-03-13 14:07:05 +00:00
Eli Friedman	77682009bc	Fix regression from r151466: we can't replace uses of an instruction reachable from the entry block with uses of an instruction not reachable from the entry block. PR12231. llvm-svn: 152595	2012-03-13 01:06:07 +00:00
Chandler Carruth	015ff468c2	When inlining a function and adding its inner call sites to the candidate set for subsequent inlining, try to simplify the arguments to the inner call site now that inlining has been performed. The goal here is to propagate and fold constants through deeply nested call chains. Without doing this, we lose the inliner bonus that should be applied because the arguments don't match the exact pattern the cost estimator uses. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152556	2012-03-12 11:19:33 +00:00
Chandler Carruth	d1c1c98162	Teach instsimplify how to constant fold pointer differences. Typically instcombine has handled this, but pointer differences show up in several contexts where we would like to get constant folding, and cannot afford to run instcombine. Specifically, I'm working on improving the constant folding of arguments used in inline cost analysis with instsimplify. Doing this in instsimplify implies some algorithm changes. We have to handle multiple layers of all-constant GEPs because instsimplify cannot fold them into a single GEP the way instcombine can. Also, we're only interested in all-constant GEPs. The result is that this doesn't really replace the instcombine logic, it's just complementary and focused on constant folding. Reviewed on IRC by Benjamin Kramer. llvm-svn: 152555	2012-03-12 11:19:31 +00:00
Chandler Carruth	98464723a5	FileCheck-ize this test. llvm-svn: 152554	2012-03-12 11:19:28 +00:00
Andrew Trick	db66ee17be	Move llc + target triple tests into X86 llvm-svn: 152502	2012-03-10 19:03:51 +00:00
Benjamin Kramer	dbfa526afc	Don't try to filecheck bitcode. llvm-svn: 152498	2012-03-10 18:07:46 +00:00
Bill Wendling	5f16e35eed	Make this transformation slightly less aggressive and more correct. The 'CmpInst::isFalseWhenEqual' function returns 'false' for values other than simply equality. For instance, it returns 'false' for <= or >=. This isn't the correct behavior for this transformation, which is checking for strict equality and non-equality. It was causing the gcc.c-torture/execute/frame-address.c test to fail because it would completely (and incorrectly) optimize a whole function into a 'ret i32 0'. llvm-svn: 152497	2012-03-10 17:56:03 +00:00
Dan Gohman	784659a39f	When identifying exit nodes for the reverse-CFG reverse-post-order traversal, consider nodes for which the only successors are backedges which the traversal is ignoring to be exit nodes. This fixes a problem where the bottom-up traversal was failing to visit split blocks along split loop backedges. This fixes rdar://10989035. llvm-svn: 152421	2012-03-09 18:50:52 +00:00
Duncan Sands	8139573edf	Eliminate switch cases that can never match, for example removes all negative switch cases if the branch condition is known to be positive. Inspired by a recent improvement to GCC's VRP. llvm-svn: 152405	2012-03-09 13:45:18 +00:00
Chandler Carruth	63f95ab839	Undo a previous restriction on the inline cost calculation which Nick introduced. Specifically, there are cost reductions for all constant-operand icmp instructions against an alloca, regardless of whether the alloca will in fact be eligible for SROA. That means we don't want to abort the icmp reduction computation when we abort the SROA reduction computation. That in turn frees us from the need to keep a separate worklist and defer the ICmp calculations. Use this new-found freedom and some judicious function boundaries to factor the innards of computing the cost factor of any given instruction out of the loop over the instructions and into static helper functions. This greatly simplifies the code, and hopefully makes it more clear what is happening here. Reviewed by Eric Christopher. There is some concern that we'd like to ensure this doesn't get out of hand, and I plan to benchmark the effects of this change over the next few days along with some further fixes to the inline cost. llvm-svn: 152368	2012-03-09 02:49:36 +00:00
Eli Friedman	59cebb7902	Make sure we don't return bits outside the mask in ComputeMaskedBits. PR12189. llvm-svn: 152066	2012-03-05 23:09:40 +00:00
Duncan Sands	ccc56e1071	Nick pointed out on IRC that GVN's propagateEquality wasn't propagating equalities into phi node operands for which the equality is known to hold in the incoming basic block. That's because replaceAllDominatedUsesWith wasn't handling phi nodes correctly in general (that this didn't give wrong results was just luck: the specific way GVN uses replaceAllDominatedUsesWith precluded wrong changes to phi nodes). llvm-svn: 152006	2012-03-04 13:25:19 +00:00
Benjamin Kramer	2a3719125f	LVI: Recognize the form instcombine canonicalizes range checks into when forming constant ranges. This could probably be made a lot smarter, but this is a common case and doesn't require LVI to scan a lot of code. With this change CVP can optimize away the "shift == 0" case in Hashing.h that only gets hit when "shift" is in a range not containing 0. llvm-svn: 151919	2012-03-02 15:34:43 +00:00
Duncan Sands	207ee17589	Have GVN also do condition propagation when the right-hand side is not a constant. This fixes PR1768. llvm-svn: 151713	2012-02-29 11:12:03 +00:00
Bill Wendling	690b72d2b3	Testcase for r151691. llvm-svn: 151694	2012-02-29 01:53:13 +00:00
Pete Cooper	ab5f2302dc	Reverted r151620 - DSE: Shorten memset when a later store overwrites the start of it. There were all sorts of buildbot issues. llvm-svn: 151621	2012-02-28 05:06:24 +00:00
Pete Cooper	93352dcd53	DSE: Shorten memset when a later store overwrites the start of it llvm-svn: 151620	2012-02-28 04:27:10 +00:00
Duncan Sands	9e95178a81	When performing a conditional branch depending on the value of a comparison %cmp (eg: A==B) we already replace %cmp with "true" under the true edge, and with "false" under the false edge. This change enhances this to replace the negated compare (A!=B) with "false" under the true edge and "true" under the false edge. Reported to improve perlbench results by 1%. llvm-svn: 151517	2012-02-27 08:14:30 +00:00
Rafael Espindola	2d9b864afe	Fix this assert. IP can point to an instruction with strange dominance properties (invoke). Just assert that the instruction we return dominates the insertion point. llvm-svn: 151511	2012-02-27 02:13:03 +00:00
Rafael Espindola	868ea25522	Add testcase for the previous commit. llvm-svn: 151475	2012-02-26 05:49:57 +00:00
Rafael Espindola	34b7c064cb	Change the implementation of dominates(inst, inst) to one based on what the verifier does. This correctly handles invoke. Thanks to Duncan, Andrew and Chris for the comments. Thanks to Joerg for the early testing. llvm-svn: 151469	2012-02-26 02:19:19 +00:00
Nick Lewycky	a93c874757	Reinstate the optimization from r151449 with a fix to not turn 'gep %x' into 'gep null' when the icmp predicate is unsigned (or is signed without inbounds). llvm-svn: 151467	2012-02-26 02:09:49 +00:00
Nick Lewycky	849715d31f	Roll these back to r151448 until I figure out how they're breaking MultiSource/Applications/lua. llvm-svn: 151463	2012-02-25 23:01:19 +00:00
Nick Lewycky	1636c6eaef	An argument and a local identified object (eg. a noalias call) could turn out equal if both are null. In the test, scope type %t and global @y by adding a 'gep' prefix to them. llvm-svn: 151452	2012-02-25 20:19:07 +00:00
Nick Lewycky	94be1c7d95	Teach instsimplify to be more aggressive when analyzing comparisons of pointers by using llvm::isIdentifiedObject. Also teach it to handle GEPs that have the same base pointer and constant operands. Fixes PR11238! llvm-svn: 151449	2012-02-25 19:07:42 +00:00
Chris Lattner	b01936f21a	fix PR12075, a regression in a recent transform I added. In unreachable code, gep chains can be infinite. Just like "stripPointerCasts", use a set to keep track of visited instructions so we don't recurse infinitely. llvm-svn: 151383	2012-02-24 19:01:58 +00:00
Duncan Sands	30c1ce0834	Teach GVN that x+y is the same as y+x and that x<y is the same as y>x. llvm-svn: 151365	2012-02-24 15:16:31 +00:00
Rafael Espindola	23cd372dbf	Semantically revert 151015. Add a comment on why we should be able to assert the dominance once the dominates method is fixed and why we can use the builder's insertion point. Fixes pr12048. llvm-svn: 151125	2012-02-22 03:21:39 +00:00
Nick Lewycky	664d5b131f	Use the target-aware constant folder on expressions to improve the chance they'll be simple enough to simulate, and to reduce the chance we'll encounter equal but different simple pointer constants. This removes the symptoms from PR11352 but is not a full fix. A proper fix would either require a guarantee that two constant objects we simulate are folded when equal, or a different way of handling equal pointers (ie., trying a constantexpr icmp on them to see whether we know they're equal or non-equal or unsure). llvm-svn: 151093	2012-02-21 22:08:06 +00:00
Benjamin Kramer	dacc2e8edb	InstCombine: Don't transform a signed icmp of two GEPs into a signed compare of the indices. This transformation is not safe in some pathological cases (signed icmp of pointers should be an extremely rare thing, but it's valid IR!). Add an explanatory comment. Kudos to Duncan for pointing out this edge case (and not giving up explaining it until I finally got it). llvm-svn: 151055	2012-02-21 13:31:09 +00:00
Nick Lewycky	b9cf2477b9	Check for the correct size in the invariant marker. llvm-svn: 151003	2012-02-20 23:32:26 +00:00
Benjamin Kramer	64719820cf	Test case for r150978. llvm-svn: 150979	2012-02-20 19:00:28 +00:00
Benjamin Kramer	9ade8e4d79	InstCombine: When comparing two GEPs that were derived from the same base pointer but use different types, expand the offset calculation and do the compare on the offset if profitable. This came up in SmallVector code. llvm-svn: 150962	2012-02-20 15:07:47 +00:00
Benjamin Kramer	3d87f26b44	InstCombine: Make OptimizePointerDifference more aggressive. - Ignore pointer casts. - Also expand GEPs that aren't constantexprs when they have one use or only constant indices. - We now compile "&foo[i] - &foo[j]" into "i - j". llvm-svn: 150961	2012-02-20 14:34:57 +00:00
Chris Lattner	50ad7c3f54	fold comparisons of gep'd alloca points with null to false, implementing PR12013. We now compile the testcase to: __Z4testv: ## @_Z4testv ## BB#0: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit pushq %rbx subq $64, %rsp leaq 32(%rsp), %rbx movq %rbx, (%rsp) leaq 64(%rsp), %rax movq %rax, 16(%rsp) movl $1, 32(%rsp) leaq 36(%rsp), %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_2 ## BB#1: callq _free LBB0_2: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret instead of: __Z4testv: ## @_Z4testv ## BB#0: pushq %rbx subq $64, %rsp xorl %eax, %eax leaq (%rsp), %rbx addq $32, %rbx movq %rbx, (%rsp) movq %rbx, 8(%rsp) leaq 64(%rsp), %rcx movq %rcx, 16(%rsp) je LBB0_2 ## BB#1: movl $1, 32(%rsp) movq %rbx, %rax LBB0_2: ## %_ZN4llvm15SmallVectorImplIiE9push_backERKi.exit addq $4, %rax movq %rax, 8(%rsp) leaq (%rsp), %rdi callq __Z1gRN4llvm11SmallVectorIiLj8EEE movq (%rsp), %rdi cmpq %rbx, %rdi je LBB0_4 ## BB#3: callq _free LBB0_4: ## %_ZN4llvm11SmallVectorIiLj8EED1Ev.exit addq $64, %rsp popq %rbx ret This doesn't shrink clang noticeably though. llvm-svn: 150944	2012-02-20 00:42:49 +00:00
Rafael Espindola	5154b9bedb	Don't skip debug instructions when looking for the insertion point of the cast. If we do, we can end up with inst1 --------------- < Insertion point dbg inst new inst instead of the desired inst1 new inst --------------- < Insertion point dbg inst Another option would be for InsertNoopCastOfTo (or its callers) to move the insertion point and we would end up with inst1 dbg inst new inst --------------- < Insertion point but that complicates the callers. This fixes PR12018 (and firefox's build). llvm-svn: 150884	2012-02-18 17:22:58 +00:00
Eli Friedman	be89455c98	Fix a rather nasty regression from r150690: LHS != RHS does not imply LHS->stripPointerCasts() != RHS->stripPointerCasts(). llvm-svn: 150863	2012-02-18 03:29:25 +00:00
Dan Gohman	71b80f9e8c	Calls and invokes with the new clang.arc.no_objc_arc_exceptions metadata may still unwind, but only in ways that the ARC optimizer doesn't need to consider. This permits more aggressive optimization. llvm-svn: 150829	2012-02-17 18:59:53 +00:00
Nick Lewycky	a37a7e5a0f	Remove question. llvm-svn: 150809	2012-02-17 09:55:20 +00:00
Nick Lewycky	a5a53772d9	Add support for invariant.start inside the static constructor evaluator. This is useful to represent a variable that is const in the source but can't be constant in the IR because of a non-trivial constructor. If globalopt evaluates the constructor, and there was an invariant.start with no matching invariant.end possible, it will mark the global constant afterwards. llvm-svn: 150794	2012-02-17 06:59:21 +00:00
Benjamin Kramer	8c809e592f	InstSimplify: Ignore pointer casts when constant folding compares between pointers. llvm-svn: 150690	2012-02-16 13:49:39 +00:00
Eli Bendersky	4afdeeb682	Replace all instances of dg.exp file with lit.local.cfg, since all tests are run with LIT now and not Dejagnu. dg.exp is no longer needed. Patch reviewed by Daniel Dunbar. It will be followed by additional cleanup patches. llvm-svn: 150664	2012-02-16 06:28:33 +00:00
Eli Friedman	18f18c7618	loop-rotate shouldn't hoist alloca instructions out of a loop. Patch by Patrik Hägglund, with slightly modified test. Issue reported by Patrik Hägglund on llvmdev. llvm-svn: 150642	2012-02-16 00:41:10 +00:00
Andrew Trick	c1482c669a	Add simplifyLoopLatch to LoopRotate pass. This folds a simple loop tail into a loop latch. It covers the common (in fortran) case of postincrement loops. It's a "free" way to expose this type of loop to downstream loop optimizations that bail out on non-canonical loops (getLoopLatch is a heavily used check). llvm-svn: 150439	2012-02-14 00:00:23 +00:00
Devang Patel	7f07d60411	Check against umin while converting fcmp into an icmp. llvm-svn: 150425	2012-02-13 23:05:18 +00:00
Dan Gohman	20fd978e4b	Just like in regular escape analysis, loads and stores through (but not of) a block pointer do not cause the block pointer to escape. This fixes rdar://10803830. llvm-svn: 150424	2012-02-13 22:57:02 +00:00
Hal Finkel	56c6162a55	Update BBVectorize to use aliasesUnknownInst. This allows BBVectorize to check the "unknown instruction" list in the alias sets. This is important to prevent instruction fusing from reordering function calls. Resolves PR11920. llvm-svn: 150250	2012-02-10 15:52:40 +00:00
Duncan Sands	931ce8ee15	Fix PR11948: the result type of an icmp may be a vector of boolean - don't assume it is a boolean. llvm-svn: 150247	2012-02-10 14:31:24 +00:00
Duncan Sands	205d9394e8	Revert commit 149912 (lattner) and add a testcase that shows the problem (which is that patterns no longer match for vectors of booleans, because you only get ConstantDataVector when the vector element type is i8, i16, etc, not when it is i1). Original commit message: Remove some dead code and tidy things up now that vectors use ConstantDataVector instead of always using ConstantVector. llvm-svn: 150246	2012-02-10 14:26:42 +00:00
Benjamin Kramer	1a2b069bb9	GlobalOpt: Be more aggressive about eliminating side-effect free static dtors. GlobalOpt runs early in the pipeline (before inlining) and complex class hierarchies often introduce bitcasts or GEPs which weren't optimized away. Teach it to ignore side-effect free instructions instead of depending on other passes to remove them. llvm-svn: 150174	2012-02-09 14:26:06 +00:00
Bill Wendling	2fbed70727	The 'unwind' instruction is deprecated and will be removed, making this test obsolete. llvm-svn: 149880	2012-02-06 18:18:47 +00:00
Nick Lewycky	bad48a142a	Teach GlobalOpt to handle atomic accesses to globals. * Most of the transforms come through intact by having each transformed load or store copy the ordering and synchronization scope of the original. * The transform that turns a global only accessed in main() into an alloca (since main is non-recursive) with a store of the initial value uses an unordered store, since it's guaranteed to be the first thing to happen in main. (Threads may have started before main (!) but they can't have the address of a function local before the point in the entry block we insert our code.) * The heap-SRoA transforms are disabled in the face of atomic operations. This can probably be improved; it seems odd to have atomic accesses to an alloca that doesn't have its address taken. AnalyzeGlobal keeps track of the strongest ordering found in any use of the global. This is more information than we need right now, but it's cheap to compute and likely to be useful. llvm-svn: 149847	2012-02-05 19:56:38 +00:00
Duncan Sands	eb56d51cfb	Reduce the number of dom queries made by GVN's conditional propagation logic by half: isOnlyReachableViaThisEdge was trying to be clever and handle the case of a branch to a basic block which is contained in a loop. This costs a domtree lookup and is completely useless due to GVN's position in the pass pipeline: all loops have preheaders at this point, which means it is enough for isOnlyReachableViaThisEdge to check that Dst has only one predecessor. (I checked this theoretical argument by running over the entire nightly testsuite, and indeed it is so!). llvm-svn: 149838	2012-02-05 18:25:50 +00:00
Hal Finkel	34ae699943	Boost the effective chain depth of loads and stores. By default, boost the chain depth contribution of loads and stores. This will allow a load/store pair to vectorize even when it would not otherwise be long enough to satisfy the chain depth requirement. llvm-svn: 149761	2012-02-04 04:14:04 +00:00
Dan Gohman	d18622bd02	Fix SSAUpdaterImpl's RecordMatchingPHI to record exactly the PHI nodes which were matched, rather than climbing up the original PHI node's operands to rediscover PHI nodes for recording, since the PHI nodes found that way are not necessarily part of the matched set. This fixes rdar://10589171. llvm-svn: 149654	2012-02-03 01:07:01 +00:00
Jim Grosbach	bc7e9b3c96	Revert "Disable InstCombine unsafe folding bitcasts of calls w/ varargs." This reverts commit d0e277d272d517ca1cda368267d199f0da7cad95. llvm-svn: 149647	2012-02-03 00:00:50 +00:00
Hal Finkel	8cf5de5774	Add a basic-block autovectorization pass. This is the initial checkin of the basic-block autovectorization pass along with some supporting vectorization infrastructure. Special thanks to everyone who helped review this code over the last several months (especially Tobias Grosser). llvm-svn: 149468	2012-02-01 03:51:43 +00:00
Jim Grosbach	6186319c3f	Disable InstCombine unsafe folding bitcasts of calls w/ varargs. Changing arguments from being passed as fixed to varargs is unsafe, as the ABI may require they be handled differently (stack vs. register, for example). Remove two tests which rely on the bitcast being folded into the direct call, which is exactly the transformation that's unsafe. llvm-svn: 149457	2012-02-01 00:08:17 +00:00
Bill Wendling	7761976036	Remove all references to the old EH. There was always the current EH. -- Ministry of Truth llvm-svn: 149335	2012-01-31 02:09:07 +00:00
Bill Wendling	76beba7841	Update test to new EH model. llvm-svn: 149333	2012-01-31 02:05:13 +00:00
Rafael Espindola	7bddde2b49	Add r149110 back with a fix for when the vector and the int have the same width. llvm-svn: 149151	2012-01-27 23:33:07 +00:00
Rafael Espindola	7800e62486	Revert r149110 and add a testcase that was crashing since that revision. Unfortunately I also had to disable constant-pool-sharing.ll, since the code it tests has been updated to use the IL logic. llvm-svn: 149148	2012-01-27 22:42:48 +00:00
Chris Lattner	929f66cdfa	enhance constant folding to be able to constant fold bitcast of ConstantVector's to integer type. llvm-svn: 149110	2012-01-27 01:44:03 +00:00
Nick Lewycky	7693b940b6	Support pointer comparisons against constants, when looking at the inline-cost savings from a pointer argument becoming an alloca. Sometimes callees will even compare a pointer to null and then branch to an otherwise unreachable block! Detect these cases and compute the number of saved instructions, instead of bailing out and reporting no savings. llvm-svn: 148941	2012-01-25 08:27:40 +00:00
Nick Lewycky	54e62d71e0	Make Value::isDereferenceablePointer() handle unreachable code blocks. (This returns false in the event the computation feeding into the pointer is unreachable, which maybe ought to be true -- but this is at least consistent with undef->isDereferenceablePointer().) Fixes PR11825! llvm-svn: 148671	2012-01-23 00:05:17 +00:00
Andrew Trick	207780ec8e	Handle a corner case with IV chain collection with bailout instead of assert. Fixes PR11783: bad cast to AddRecExpr. llvm-svn: 148572	2012-01-20 21:23:40 +00:00
Andrew Trick	343a9ff799	Test case comments missing from my previous checkin. llvm-svn: 148571	2012-01-20 21:21:27 +00:00
Nick Lewycky	55814f0b32	Fix CountCodeReductionForAlloca to more accurately represent what SROA can and can't handle. Also don't produce non-zero results for things which won't be transformed by SROA at all just because we saw the loads/stores before we saw the use of the address. llvm-svn: 148536	2012-01-20 08:35:20 +00:00
Andrew Trick	be3e9530e1	SCEVExpander fixes. Affects LSR and indvars. LSR has gradually been improved to more aggressively reuse existing code, particularly existing phi cycles. This exposed problems with the SCEVExpander's sloppy treatment of its insertion point. I applied some rigor to the insertion point problem that will hopefully avoid an endless bug cycle in this area. Changes: - Always used properlyDominates to check safe code hoisting. - The insertion point provided to SCEV is now considered a lower bound. This is usually a block terminator or the use itself. Under no circumstance may SCEVExpander insert below this point. - LSR is responsible for finding a "canonical" insertion point across expansion of different expressions. - Robust logic to determine whether IV increments are in "expanded" form and/or can be safely hoisted above some insertion point. Fixes PR11783: SCEVExpander assert. llvm-svn: 148535	2012-01-20 07:41:13 +00:00
Dan Gohman	9bb84ffb6c	Set the "tail" flag on pattern-matched objc_storeStrong calls. rdar://10531041. llvm-svn: 148490	2012-01-19 19:14:36 +00:00
Dan Gohman	48f4e5752e	Use llvm.global_ctors to locate global constructors instead of recognizing them by name. llvm-svn: 148416	2012-01-18 21:19:38 +00:00
Andrew Trick	345fe24275	Test case rename llvm-svn: 148344	2012-01-17 22:27:45 +00:00
Dan Gohman	9b37a5592c	Add a new ObjC ARC optimization pass to eliminate unneeded autorelease push+pop pairs. llvm-svn: 148330	2012-01-17 20:52:24 +00:00
Andrew Trick	f2988aa6f4	LSR fix: broaden the check for loop preheaders. It's becoming clear that LoopSimplify needs to unconditionally create loop preheaders. But that is a bigger fix. For now, continuing to hack LSR. Fixes rdar://10701050 "Cannot split an edge from an IndirectBrInst" assert. llvm-svn: 148288	2012-01-17 06:45:52 +00:00
Andrew Trick	071cb0a076	Fix a corner case hit by redundant phi elimination running after LSR. Fixes PR11761: bad IR w/ redundant Phi elim llvm-svn: 148177	2012-01-14 03:17:23 +00:00
Dan Gohman	922244c634	Implement proper ObjC ARC objc_retainBlock "escape" analysis, so that the optimizer doesn't eliminate objc_retainBlock calls which are needed for their side effect of copying blocks onto the heap. This implements rdar://10361249. llvm-svn: 148076	2012-01-13 00:39:07 +00:00
Duncan Sands	b545beb54e	Don't try to create a GEP when the pointee type is unsized (such GEPs are invalid). Fixes a crash on array1.C from the GCC testsuite when compiled with dragonegg. llvm-svn: 147946	2012-01-11 12:20:08 +00:00
Stepan Dyatkovskiy	7ba274153a	Improved compile time: 1. Size heuristics changed. Now we calculate number of unswitching branches only once per loop. 2. Some checks were moved from UnswitchIfProfitable to processCurrentLoop, since it is not changed during processCurrentLoop iteration. This allows deciding to skip some loops at an early stage. Extended statistics: - Added total number of instructions analyzed. llvm-svn: 147935	2012-01-11 08:40:51 +00:00
Bill Wendling	2a03f15116	If the global variable is removed by the linker, then don't constant merge it with other symbols. An object in the __cfstring section is supposed to be filled with CFString objects, which have a pointer to ___CFConstantStringClassReference followed by a pointer to a __cstring. If we allow the object in the __cstring section to be merged with another global, then it could end up in any section. Because the linker is going to remove these symbols in the final executable, we shouldn't bother to merge them. <rdar://problem/10564621> llvm-svn: 147899	2012-01-11 00:13:08 +00:00
Andrew Trick	db66631fb3	Enable LSR IV Chains with sufficient heuristics. These heuristics are sufficient for enabling IV chains by default. Performance analysis has been done for i386, x86_64, and thumbv7. The optimization is rarely important, but can significantly speed up certain cases by eliminating spill code within the loop. Unrolled loops are prime candidates for IV chains. In many cases, the final code could still be improved with more target specific optimization following LSR. The goal of this feature is for LSR to make the best choice of induction variables. Instruction selection may not completely take advantage of this feature yet. As a result, there could be cases of slight code size increase. Code size can be worse on x86 because it doesn't support postincrement addressing. In fact, when chains are formed, you may see redundant address plus stride addition in the addressing mode. GenerateIVChains tries to compensate for the common cases. On ARM, code size increase can be mitigated by using postincrement addressing, but downstream codegen currently misses some opportunities. llvm-svn: 147826	2012-01-10 01:45:08 +00:00
Andrew Trick	09d73ea35b	Adding IV chain generation to LSR. After collecting chains, check if any should be materialized. If so, hide the chained IV users from the LSR solver. LSR will only solve for the head of the chain. GenerateIVChains will then materialize the chained IV users by computing the IV relative to its previous value in the chain. In theory, chained IV users could be exposed to LSR's solver. This would be considerably complicated to implement and I'm not aware of a case where we need it. In practice it's more important to intelligently prune the search space of nontrivial loops before running the solver, otherwise the solver is often forced to prune the most optimal solutions. Hiding the chained users does this well, so that LSR is more likely to find the best IV for the chain as a whole. llvm-svn: 147801	2012-01-09 21:18:52 +00:00
Benjamin Kramer	f9cefbfed0	InstCombine: Teach foldLogOpOfMaskedICmpsHelper that sign bit tests are bit tests. This subsumes several other transforms while enabling us to catch more cases. llvm-svn: 147777	2012-01-09 17:23:27 +00:00
Benjamin Kramer	e1321329f4	Tweak my last commit to be less conservative about uses. We still save an instruction when just the "and" part is replaced. Also change the code to match comments more closely. llvm-svn: 147753	2012-01-08 21:12:51 +00:00
Benjamin Kramer	e94856c8c4	InstCombine: If we have a bit test and a sign test anded/ored together, merge the sign bit into the bit test. This is common in bit field code, e.g. checking if the first or the last bit of a bit field is set. llvm-svn: 147749	2012-01-08 18:32:24 +00:00
Andrew Trick	d9eb9c8780	LSR: Don't optimize loops if an outer loop has no preheader. LoopSimplify may not run on some outer loops, e.g. because of indirect branches. SCEVExpander simply cannot handle outer loops with no preheaders. Fixes rdar://10655343 SCEVExpander segfault. llvm-svn: 147718	2012-01-07 03:16:50 +00:00
Andrew Trick	8a5a1e603e	Extended replaceCongruentPhis to handle mixed phi types. llvm-svn: 147707	2012-01-07 01:12:09 +00:00
Andrew Trick	0ec80535d9	comment typo llvm-svn: 147701	2012-01-07 00:29:20 +00:00
Dan Gohman	a4fde8485d	Fix SpeculativelyExecuteBB to either speculate all or none of the phis present in the bottom of the CFG triangle, as the transformation isn't ever valuable if the branch can't be eliminated. Also, unify some heuristics between SimplifyCFG's multiple if-converters, for consistency. This fixes rdar://10627242. llvm-svn: 147630	2012-01-05 23:58:56 +00:00
Eli Friedman	5af9c3cbbb	PR11705, part 2: globalopt shouldn't put inttoptr/ptrtoint operations into global initializers if there's an implied extension or truncation. llvm-svn: 147625	2012-01-05 23:03:32 +00:00
Dan Gohman	4fc691d9ef	Revert r56315. When the instruction to speculate is a load, this code can incorrectly move the load across a store. This never happens in practice today, but only because the current heuristics accidentally preclude it. llvm-svn: 147623	2012-01-05 22:54:35 +00:00
Benjamin Kramer	e5589bccdd	FileCheck hygiene. llvm-svn: 147580	2012-01-05 00:43:34 +00:00
Nick Lewycky	d6260dc3cb	Teach instcombine all sorts of great stuff about shifts that have exact, nuw or nsw bits on them. llvm-svn: 147528	2012-01-04 09:28:29 +00:00
Andrew Trick	6839d66ab3	Fix SCEVExpander to handle loops with no preheader when LSR gives it a "phony" insertion point. Fixes rdar://10619599: "SelectionDAGBuilder shouldn't visit PHI nodes!" assert llvm-svn: 147439	2012-01-02 21:25:10 +00:00
Nick Lewycky	c7e12f7dbf	Make use of the exact bit when optimizing '(X >>exact 3) << 1' to eliminate the 'and' that would zero out the trailing bits, and to produce an exact shift ourselves. llvm-svn: 147391	2011-12-31 21:30:22 +00:00
Nick Lewycky	7425820374	Change CaptureTracking to pass a Use* instead of a Value* when a value is captured. This allows the tracker to look at the specific use, which may be especially interesting for function calls. Use this to fix 'nocapture' deduction in FunctionAttrs. The existing one does not iterate until a fixpoint and does not guarantee that it produces the same result regardless of iteration order. The new implementation builds up a graph of how arguments are passed from function to function, and uses a bottom-up walk on the argument-SCCs to assign nocapture. This gets us nocapture more often, and does so rather efficiently and independent of iteration order. llvm-svn: 147327	2011-12-28 23:24:21 +00:00
Nick Lewycky	f4c21901a3	Turn cos(-x) into cos(x). Patch by Alexander Malyshev! llvm-svn: 147291	2011-12-27 18:25:50 +00:00
Nick Lewycky	295e397220	Teach simplifycfg to recompute branch weights when merging some branches, and to discard weights when appropriate. Still more to do (and a new TODO), but it's a start! llvm-svn: 147286	2011-12-27 04:31:52 +00:00
Nick Lewycky	56e04db381	Update the branch weight metadata when reversing the order of a branch. llvm-svn: 147280	2011-12-26 20:54:14 +00:00
Chandler Carruth	a012c64ced	Add an explicit test that we now fold cttz.i32(..., true) >> 5 -> 0. This is a result of Benjamin's work on ValueTracking. llvm-svn: 147259	2011-12-24 22:34:15 +00:00
Benjamin Kramer	94f07f8c2c	InstCombine: Add a combine that turns (2^n)-1 ^ x back into (2^n)-1 - x iff x is smaller than 2^n and it fuses with a following add. This was intended to undo the sub canonicalization in cases where it's not profitable, but it also finds some cases on its own. llvm-svn: 147256	2011-12-24 17:31:53 +00:00
Benjamin Kramer	b5e584392b	ComputeMaskedBits: Make knownzero computation more aggressive for ctlz with undef zero. unsigned foo(unsigned x) { return 31 - __builtin_clz(x); } now compiles into a single "bsrl" instruction on x86. llvm-svn: 147255	2011-12-24 17:31:46 +00:00
Benjamin Kramer	0b4d2e3d2a	InstCombine: Canonicalize (2^n)-1 - x into (2^n)-1 ^ x iff x is known to be smaller than 2^n. This has the obvious advantage of being commutable and is always a win on x86 because const - x wastes a register there. On less weird architectures this may lead to a regression because other arithmetic doesn't fuse with it anymore. I'll address that problem in a followup. llvm-svn: 147254	2011-12-24 17:31:38 +00:00
Nick Lewycky	0c92d31b61	Move this test from date-name to feature-name, and port it to FileCheck. llvm-svn: 147223	2011-12-23 18:41:31 +00:00
Chad Rosier	d16131e35c	Reinstate r146578; it doesn't appear to be the cause of some recent execution- time regressions. In general, it is beneficial to compile-time. Original commit message: Fix for bug #11429: Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 147175	2011-12-22 21:06:36 +00:00
Benjamin Kramer	5d07d63540	Give string constants generated by IRBuilder private linkage. Fixes PR11640. llvm-svn: 147144	2011-12-22 14:22:14 +00:00
Chad Rosier	4ab165f664	Speculatively revert r146578 to determine if it is the cause of a number of performance regressions (both execution-time and compile-time) on our nightly testers. Original commit message: Fix for bug #11429: Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 147131	2011-12-22 02:40:57 +00:00
Nick Lewycky	9adbd36737	Make some intrinsics safe to speculatively execute. llvm-svn: 147036	2011-12-21 05:52:02 +00:00
Andrew Trick	a1c4f73f87	Unit test for r146950: LSR postinc expansion, PR11571. llvm-svn: 146951	2011-12-20 01:43:20 +00:00
Joerg Sonnenberger	8cf8d64d19	Allow inlining of functions with returns_twice calls, if they have the attribute themselves. llvm-svn: 146851	2011-12-18 20:35:43 +00:00
Kevin Enderby	42fffe915a	Revert r146822 at Pete Cooper's request as it broke clang self hosting. Hope I did this correctly :) llvm-svn: 146834	2011-12-17 19:48:52 +00:00
Pete Cooper	0ec73f6e98	SimplifyCFG now predicts some conditional branches to true or false depending on previous branch on same comparison operands. For example, if (a == b) { if (a > b) // this is false Fixes some of the issues on <rdar://problem/10554090> llvm-svn: 146822	2011-12-17 06:32:38 +00:00
Pete Cooper	550b96ab46	Added InstCombine for "select cond, ~cond, x" type patterns. These can be reduced to "~cond & x" or "~cond | x". llvm-svn: 146624	2011-12-15 00:56:45 +00:00
Eli Friedman	5dd57bb40a	Make loop preheader insertion in LoopSimplify handle the case where the loop header is a landing pad correctly (by splitting the landingpad out of the loop header). Make some adjustments to the rest of LoopSimplify to make it clear that the rest of LoopSimplify isn't making bad assumptions about the presence of landing pads. PR11575. llvm-svn: 146621	2011-12-15 00:50:34 +00:00
Dan Gohman	1add31cc93	Move Instruction::isSafeToSpeculativelyExecute out of VMCore and into Analysis as a standalone function, since there's no need for it to be in VMCore. Also, update it to use isKnownNonZero and other goodies available in Analysis, making it more precise, enabling more aggressive optimization. llvm-svn: 146610	2011-12-14 23:49:11 +00:00
Andrew Trick	9c88f32f94	LSR: Fold redundant bitcasts on-the-fly. llvm-svn: 146597	2011-12-14 22:07:19 +00:00
Stepan Dyatkovskiy	14cb78c6fb	Fix for bug #11429 : Wrong behaviour for switches. Small improvement for code size heuristics. llvm-svn: 146578	2011-12-14 19:19:17 +00:00
Dan Gohman	e9572aa680	It turns out that clang does use pointer-to-function types to point to ARC-managed pointers sometimes. This fixes rdar://10551239. llvm-svn: 146577	2011-12-14 19:10:53 +00:00
Joerg Sonnenberger	5b25b4d437	Only replace fwrite with fputc, if the return value is unused. llvm-svn: 146411	2011-12-12 20:18:31 +00:00
Chandler Carruth	2bedf185c9	Manually upgrade the test suite to specify the flag to cttz and ctlz. I followed three heuristics for deciding whether to set 'true' or 'false': - Everything target independent got 'true' as that is the expected common output of the GCC builtins. - If the target arch only has one way of implementing this operation, set the flag in the way that exercises the most of codegen. For most architectures this is also the likely path from a GCC builtin, with 'true' being set. It will (eventually) require lowering away that difference, and then lowering to the architecture's operation. - Otherwise, set the flag differently depending on which target operation should be tested. Let me know if anyone has any issue with this pattern or would like specific tests of another form. This should allow the x86 codegen to just iteratively improve as I teach the backend how to differentiate between the two forms, and everything else should remain exactly the same. llvm-svn: 146370	2011-12-12 11:59:10 +00:00
Andrew Trick	4f0b3bb42b	Add -unroll-runtime for unrolling loops with run-time trip counts. Patch by Brendon Cahoon! This extends the existing LoopUnroll and LoopUnrollPass. Brendon measured no regressions in the llvm test suite with -unroll-runtime enabled. This implementation works by using the existing loop unrolling code to unroll the loop by a power-of-two (default 8). It generates an if-then-else sequence of code prior to the loop to execute the extra iterations before entering the unrolled loop. llvm-svn: 146245	2011-12-09 06:19:40 +00:00
Nick Lewycky	d2c1661e9f	Fix infinite loop in DSE when deleting a free in a reachable loop that's also trivially infinite. llvm-svn: 146197	2011-12-08 22:36:35 +00:00
Andrew Trick	04c98888bc	LSR: prune undesirable formulae early. It's always good to prune early, but formulae that are unsatisfactory in their own right need to be removed before running any other pruning heuristics. We easily avoid generating such formulae, but we need them as an intermediate basis for forming other good formulae. llvm-svn: 145906	2011-12-06 03:13:31 +00:00
Chad Rosier	7096fea51c	Probably not a good idea to convert a single vector load into a memcpy. We don't do this now, but add a test case to prevent this from happening in the future. Additional test for rdar://9892684 llvm-svn: 145879	2011-12-06 00:19:08 +00:00
Chad Rosier	c50cbc5a65	Make the MemCpyOptimizer a bit more aggressive. I can't think of a scenario where this would be bad as the backend shouldn't have a problem inlining small memcpys. rdar://10510150 llvm-svn: 145865	2011-12-05 22:37:00 +00:00
Nadav Rotem	1a91e4381d	Add support for vectors of pointers. llvm-svn: 145801	2011-12-05 06:29:09 +00:00
Pete Cooper	32e376f7e1	Fixed deadstoreelimination bug where negative indices were incorrectly causing the optimisation to occur. Turns out long long + unsigned long long is unsigned. Doh! Fixes http://llvm.org/bugs/show_bug.cgi?id=11455 llvm-svn: 145731	2011-12-03 00:04:30 +00:00
Chad Rosier	d830d783e2	Add support for constant folding the pow intrinsic. rdar://10514247 llvm-svn: 145730	2011-12-03 00:00:03 +00:00
Chad Rosier	4d25975a28	Prevent library calls from being folded if -fno-builtin has been specified. rdar://10500969 llvm-svn: 145639	2011-12-01 22:14:50 +00:00
Pete Cooper	c708e83499	Improved fix for abs(val) != 0 to check other similar case. Also fixed style issues and confusing comment llvm-svn: 145618	2011-12-01 19:13:26 +00:00
Pete Cooper	d4569610df	Removed use of grep from test and moved it to be with other icmp tests llvm-svn: 145570	2011-12-01 04:35:26 +00:00
Pete Cooper	7e03b7250d	Added instcombine pattern to spot comparing -val or val against 0. (val != 0) == (-val != 0) so "abs(val) != 0" becomes "val != 0" Fixes <rdar://problem/10482509> llvm-svn: 145563	2011-12-01 03:58:40 +00:00
Andrew Trick	8da55f9048	Better test case found in duplicate PR10570. llvm-svn: 145484	2011-11-30 06:26:42 +00:00
Andrew Trick	247f749767	LSR: handle the expansion of phi operands that use postinc forms of the IV. Fixes PR11431: SCEVExpander::expandAddRecExprLiterally(const llvm::SCEVAddRecExpr*): Assertion `(!isa<Instruction>(Result) || SE.DT->dominates(cast<Instruction>(Result), Builder.GetInsertPoint())) && "postinc expansion does not dominate use"' failed. llvm-svn: 145482	2011-11-30 06:07:54 +00:00
Chad Rosier	c5fa9f413a	Add support for sqrt, sqrtl, and sqrtf in TargetLibraryInfo. Disable (fptrunc (sqrt (fpext x))) -> (sqrtf x) transformation if -fno-builtin is specified. rdar://10466410 llvm-svn: 145460	2011-11-29 23:57:10 +00:00
Duncan Sands	97cc6da56c	Fix a theoretical problem (not seen in the wild): if different instances of a weak variable are compiled by different compilers, such as GCC and LLVM, while LLVM may increase the alignment to the preferred alignment there is no reason to think that GCC will use anything more than the ABI alignment. Since it is the GCC version that might end up in the final program (as the linkage is weak), it is wrong to increase the alignment of loads from the global up to the preferred alignment as the alignment might only be the ABI alignment. Increasing alignment up to the ABI alignment might be OK, but I'm not totally convinced that it is. It seems better to just leave the alignment of weak globals alone. llvm-svn: 145413	2011-11-29 18:26:38 +00:00
Andrew Trick	4c2bb1ade3	Reenable this IndVars unit test. SCEV can't optimize undef in all cases, which is a separate issue from this test case. llvm-svn: 145343	2011-11-29 00:52:04 +00:00
Eli Friedman	bc47555417	Add a missing safety check to ProcessUGT_ADDCST_ADD. Fixes PR11438. llvm-svn: 145316	2011-11-28 23:32:19 +00:00
Eli Friedman	473a76a0df	Make SelectionDAG::InferPtrAlignment use llvm::ComputeMaskedBits instead of duplicating the logic for globals. Make llvm::ComputeMaskedBits handle GlobalVariables slightly more aggressively, to match what InferPtrAlignment knew how to do. llvm-svn: 145304	2011-11-28 22:48:22 +00:00
Chris Lattner	c050288d89	remove a test that is using old-style llvm.dbg intrinsics, apparently only fails on ppc and arm hosts. llvm-svn: 145188	2011-11-27 18:13:47 +00:00
Chris Lattner	84bf52737a	remove autoupgrade support for old forms of llvm.prefetch and the old trampoline forms. Both of these were correct in LLVM 3.0, and we don't need to support LLVM 2.9 and earlier in mainline. llvm-svn: 145174	2011-11-27 07:42:04 +00:00
Chris Lattner	9d1e8420ff	Upgrade syntax of tests using volatile instructions to use 'load volatile' instead of 'volatile load', which is archaic. llvm-svn: 145171	2011-11-27 06:54:59 +00:00
Chris Lattner	011a5bf0aa	remove autoupgrade support for really old-style debug info intrinsics. I think this is the last of autoupgrade that can be removed in 3.1. Can the atomic upgrade stuff also go? llvm-svn: 145169	2011-11-27 06:18:33 +00:00
Chandler Carruth	2bf0dccc04	FileCheck-ize this test and make it more precise. This is in preparation for adding other tests. llvm-svn: 145143	2011-11-26 08:24:25 +00:00
Richard Smith	d647537b9c	Correctly byte-swap APInts with bit-widths greater than 64. llvm-svn: 145111	2011-11-23 21:33:37 +00:00
Duncan Sands	3c1878ef53	Fix a crash in which a multiplication was being reported as being both negative and positive: positive, because it could be directly computed to be positive; negative, because the nsw flags means it is either negative or undefined (the multiplication always overflowed). llvm-svn: 145104	2011-11-23 16:26:47 +00:00
Nick Lewycky	566ea855fd	Fix crasher in GVN due to my recent capture tracking changes. llvm-svn: 145047	2011-11-21 19:42:56 +00:00
Benjamin Kramer	1967d4e20b	XFAIL this test until I figure out what indvars is doing here (or find someone who does) llvm-svn: 145008	2011-11-20 11:10:03 +00:00
Andrew Trick	fe5f7fc3b8	Fix a corner case in updating LoopInfo after fully unrolling an outer loop. The loop tree's inclusive block lists are painful and expensive to update. (I have no idea why they're inclusive). The design was supposed to handle this case but the implementation missed it and my unit tests weren't thorough enough. Fixes PR11335: loop unroll update. llvm-svn: 144970	2011-11-18 03:42:41 +00:00
Andrew Trick	7dc21d8c0e	Fix an overly general check in SimplifyIndvar to handle useless phi cycles. The right way to check for a binary operation is cast<BinaryOperator>. The original check: cast<Instruction> && numOperands() == 2 would match phi "instructions", leading to an infinite loop in extreme corner case: a useless phi with operands [self, constant] that prior optimization passes failed to remove, being used in the loop by another useless phi, in turn being used by an lshr or udiv. Fixes PR11350: runaway iteration assertion. llvm-svn: 144935	2011-11-17 23:36:35 +00:00
Eli Friedman	d02d82d355	Add support for custom names for library functions in TargetLibraryInfo. Add a custom name for fwrite and fputs on x86-32 OSX. Make SimplifyLibCalls honor the custom names for fwrite and fputs. Fixes <rdar://problem/9815881>. llvm-svn: 144876	2011-11-17 01:27:36 +00:00
Nick Lewycky	29efc8f15d	Fix typo in test. llvm-svn: 144774	2011-11-16 03:56:38 +00:00
Nick Lewycky	ff690249a9	Merge isObjectPointerWithTrustworthySize with getPointerSize. Use it when looking at the size of the pointee. Fixes PR11390! llvm-svn: 144773	2011-11-16 03:49:48 +00:00
Andrew Trick	fe618116fc	Fix SCEV overly optimistic back edge taken count for multi-exit loops. Fixes PR11375: Different results for 'clang++ huh.cpp'... llvm-svn: 144746	2011-11-16 00:52:40 +00:00
Nick Lewycky	a0b2f7ca1d	Refactor capture tracking (which already had a couple flags for whether returns and stores capture) to permit the caller to see each capture point and decide whether to continue looking. Use this inside memdep to do an analysis that basicaa won't do. This lets us solve another devirtualization case, fixing PR8908! llvm-svn: 144580	2011-11-14 22:49:42 +00:00
Nick Lewycky	772024a00d	Don't try to loop on iterators that are potentially invalidated inside the loop. Fixes PR11361! llvm-svn: 144454	2011-11-12 03:09:12 +00:00
Eli Friedman	a83fbaff5f	Make sure scalarrepl picks the correct alloca when it rewrites a bitcast. Fixes PR11353. llvm-svn: 144442	2011-11-12 02:07:50 +00:00
Eli Friedman	127d98ab35	Get rid of an optimization in SCCP which appears to have many issues. Specifically, it doesn't handle many cases involving undef correctly, and it is missing other checks which lead to it trying to re-mark a value marked as a constant with a different value. It also appears to trigger very rarely. Fixes PR11357. llvm-svn: 144352	2011-11-11 01:16:15 +00:00
Pete Cooper	38700a1201	DeadStoreElimination can now trim the size of a store if the end of the store is dead. Currently checks alignment and killing stores on a power of 2 boundary as this is likely to trim the size of the earlier store without breaking large vector stores into scalar ones. Fixes <rdar://problem/10140300> llvm-svn: 144239	2011-11-09 23:07:35 +00:00
Eli Friedman	6bda990650	Fix code to match comment. Fixes PR11340, a regression from r143209. llvm-svn: 144121	2011-11-08 21:08:02 +00:00
Pete Cooper	a85aa24d64	LICM pass now understands invariant load metadata. Nothing generates this yet so it will currently never get used in real tests llvm-svn: 144107	2011-11-08 19:30:00 +00:00
Bill Wendling	a855903bda	Convert to the new EH model. llvm-svn: 144050	2011-11-08 00:23:01 +00:00
Nick Lewycky	7ea3dd8ae5	Do simple cross-block DSE when we encounter a free statement. Fixes PR11240. llvm-svn: 143808	2011-11-05 10:48:42 +00:00
Dan Gohman	e689158987	Add tests for existing InstSimplify features. llvm-svn: 143721	2011-11-04 18:39:16 +00:00
Dan Gohman	19a8523a2f	Teach instsimplify to simplify calls to undef. llvm-svn: 143719	2011-11-04 18:32:42 +00:00
Daniel Dunbar	0193e03f99	Speculatively revert "DeadStoreElimination can now trim the size of a store if the end of it is dead.", which appears to break bootstrapping LLVM. llvm-svn: 143668	2011-11-04 00:48:26 +00:00
Pete Cooper	4902705b5f	DeadStoreElimination can now trim the size of a store if the end of it is dead. Only currently done if the later store is writing to a power of 2 address or has the same alignment as the earlier store, as then it's likely to not break up large stores into smaller ones. Fixes <rdar://problem/10140300> llvm-svn: 143630	2011-11-03 18:01:56 +00:00
Andrew Trick	3c1e831108	Rewrite LinearFunctionTestReplace to handle pointer-type IVs. We've been hitting asserts in this code due to the many supported combinations of modes (iv-rewrite/no-iv-rewrite) and IV types. This second rewrite of the code attempts to deal with these cases systematically. llvm-svn: 143546	2011-11-02 17:19:57 +00:00
Andrew Trick	c9baf3a7a1	Broaden an assert to handle enable-iv-rewrite=true following r143183. Narrowest possible fix for PR11279. llvm-svn: 143522	2011-11-02 00:02:45 +00:00
Eli Friedman	676558ae92	Make sure we use the right insertion point when instcombine replaces a PHI with another instruction. (Specifically, don't insert an arbitrary instruction before a PHI.) Fixes PR11275. llvm-svn: 143437	2011-11-01 04:49:29 +00:00
Duncan Sands	1077c1fa88	Reapply commit 143214 with a fix: m_ICmp doesn't match conditions with the given predicate, it matches any condition and returns the predicate - d'oh! Original commit message: The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143318	2011-10-30 19:56:36 +00:00
Benjamin Kramer	d32c541fe4	SimplifyLibCalls: Use IRBuilder.CreateGlobalString when creating a string for printf->puts, which correctly sets the unnamed_addr bit on the resulting GlobalVariable. Fixes PR11264. llvm-svn: 143289	2011-10-29 19:43:31 +00:00
Eli Friedman	7c9bef9ba8	Revert r143214; it's breaking a bunch of stuff. llvm-svn: 143265	2011-10-29 00:56:07 +00:00
Duncan Sands	7791a854c3	The expression icmp eq (select (icmp eq x, 0), 1, x), 0 folds to false. Spotted by my super-optimizer in 186.crafty and 450.soplex. We really need a proper infrastructure for handling generalizations of this kind of thing (which occur a lot), however this case is so simple that I decided to go ahead and implement it directly. llvm-svn: 143214	2011-10-28 19:01:20 +00:00
Duncan Sands	3483c23658	A shift of a power of two is a power of two or zero. For completeness - not spotted in the wild. llvm-svn: 143211	2011-10-28 18:30:05 +00:00
Duncan Sands	5730fe6a31	Fold icmp ugt (udiv X, Y), X to false. Spotted by my super-optimizer in 186.crafty. llvm-svn: 143209	2011-10-28 18:17:44 +00:00
Andrew Trick	77532be5e0	LFTR should avoid a type mismatch with null pointer IVs. Fixes rdar://10359193 Indvar LinearFunctionTestReplace assertion llvm-svn: 143183	2011-10-28 03:45:11 +00:00
Duncan Sands	ca325638c8	Reapply commit 143028 with a fix: the problem was casting a ConstantExpr Mul using BinaryOperator (which only works for instructions) when it should have been a cast to OverflowingBinaryOperator (which also works for constants). While there, correct a few other dubious looking uses of BinaryOperator. Thanks to Chad Rosier for the testcase. Original commit message: My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y | 1). This occurs in 464.h264ref. llvm-svn: 143125	2011-10-27 19:16:21 +00:00
Bob Wilson	2ca603d9b7	Revert Duncan's r143028 expression folding which appears to be the culprit behind a compile failure on 483.xalancbmk. llvm-svn: 143102	2011-10-27 15:47:25 +00:00
Eli Friedman	e6918ac01a	It is not safe to sink an alloca into a stacksave/stackrestore pair, so don't do that. <rdar://problem/10352360> llvm-svn: 143093	2011-10-27 01:33:51 +00:00
Duncan Sands	5c8fa99c32	The maximum power of 2 dividing a power of 2 is itself. This occurs in 403.gcc and was spotted by my super-optimizer. llvm-svn: 143054	2011-10-26 20:55:21 +00:00
Duncan Sands	c463f54342	My super-optimizer noticed that we weren't folding this expression to true: (x *nsw x) sgt 0, where x = (y | 1). This occurs in 464.h264ref. llvm-svn: 143028	2011-10-26 15:31:51 +00:00
Nick Lewycky	4d47e224d7	A dead malloc, a free(NULL) and a free(undef) are all trivially dead instructions. This doesn't introduce any optimizations we weren't doing before (except potentially due to pass ordering issues), now passes will eliminate them sooner as part of their own cleanups. llvm-svn: 142787	2011-10-24 04:35:36 +00:00
Cameron Zwarich	2dd06afcf5	The element insertion code in scalar replacement doesn't handle incorrect element types, even though the element extraction code does. It is surprising that this bug has been here for so long. Fixes <rdar://problem/10318778>. llvm-svn: 142740	2011-10-23 07:02:10 +00:00
Nick Lewycky	1d759dcde7	Oops! Fix test I forgot to submit as part of r142735. llvm-svn: 142736	2011-10-22 22:07:31 +00:00
Nick Lewycky	25e5f6896b	A non-escaping malloc in the entry block is not unlike an alloca. Do dead-store elimination on them too. llvm-svn: 142735	2011-10-22 21:59:35 +00:00
Eli Friedman	5012ac7cc0	Remap blockaddress correctly when inlining a function. Fixes PR10162. llvm-svn: 142684	2011-10-21 20:45:19 +00:00
Eli Friedman	fb0b9216e1	Extend instcombine's shufflevector simplification to handle more cases where the input and output vectors have different sizes. Patch by Xiaoyi Guo. llvm-svn: 142671	2011-10-21 19:06:29 +00:00
Eli Friedman	e8f8cf1f33	Refactor code from inlining and globalopt that checks whether a function definition is unused, and enhance it so it can tell that functions which are only used by a blockaddress are in fact dead. This probably doesn't happen much on most code, but the Linux kernel's _THIS_IP_ can trigger this issue with blockaddress. (GlobalDCE can also handle the given testcase, but we only run that at -O3.) Found while looking at PR11180. llvm-svn: 142572	2011-10-20 05:23:42 +00:00
Nick Lewycky	21a67a1454	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Dan Gohman	5e2d8538d7	Teach the ARC optimizer about the !clang.arc.copy_on_escape metadata tag on objc_retainBlock calls, which indicates that they may be optimized away. rdar://10211286. llvm-svn: 142298	2011-10-17 22:53:25 +00:00
Lang Hames	5ef0a146b9	Fixed quoting on default data layout option. llvm-svn: 142286	2011-10-17 21:54:43 +00:00
Bill Wendling	2c5486d770	Add support for the Objective-C personality function to the instruction combining of the landingpad instruction. The ObjC personality function acts almost identically to the C++ personality function. In particular, it uses "null" as a "catch-all" value. llvm-svn: 142256	2011-10-17 21:20:24 +00:00
Dan Gohman	13624a6c83	Suppress partial retain+release elimination when there's a possibility that it will span multiple CFG diamonds/triangles which could have different controlling predicates. rdar://10282956 llvm-svn: 142222	2011-10-17 18:48:25 +00:00
Bill Wendling	584c5f9c62	Correct over-zealous removal of hack. Some code wants to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	c5372de48f	Temporarily XFAIL waiting for a fix. llvm-svn: 142215	2011-10-17 18:25:32 +00:00
Chandler Carruth	9c33ff8a8b	Add a routine to swap branch instruction operands, and update any profile metadata at the same time. Use it to preserve metadata attached to a branch when re-writing it in InstCombine. Add metadata to the canonicalize_branch InstCombine test, and check that it is transformed correctly. Reviewed by Nick Lewycky! llvm-svn: 142168	2011-10-17 01:11:57 +00:00
Nick Lewycky	f590cdf15e	Oops! Fix testcase. llvm-svn: 142151	2011-10-16 20:20:15 +00:00
Nick Lewycky	c8b7f776e6	When looking for dependencies on the src pointer, scan the src pointer. Scanning on the memcpy call will pull up other unrelated stuff. Fixes PR11142. llvm-svn: 142150	2011-10-16 20:13:32 +00:00
Andrew Trick	0ef2965563	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Andrew Trick	923129b028	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	109f7dbd1e	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	05d7cb17d5	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Lang Hames	069669eb13	Removed colons from some target datalayout strings in test, since they don't match the required format. llvm-svn: 141825	2011-10-12 22:24:17 +00:00
Cameron Zwarich	fac176ac51	Fix PR11106 by correcting a typo that has been in the code for over a year. This would have never worked, since the element type of a vector type is never a vector type. Also fix the conditional to be more direct in checking whether EltTy is a vector type. llvm-svn: 141713	2011-10-11 21:26:40 +00:00
Cameron Zwarich	211901eb9f	Add a test for PR10565. llvm-svn: 141647	2011-10-11 06:10:37 +00:00
Cameron Zwarich	a34d748f83	Remove a lot of the fancy scalar replacement code for dealing with llvm-gcc's lowering of NEON code. It provides little-to-no benefit now and only introduces additional complexity. llvm-svn: 141646	2011-10-11 06:10:30 +00:00
Andrew Trick	d36852e6b1	Move replaceCongruentIVs into SCEVExpander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Lang Hames	386b01379a	Added a testcase for r141599, rdar://problem/10063881. llvm-svn: 141628	2011-10-11 01:32:10 +00:00
Andrew Trick	9d4d1281ad	Unit test for LSR phi reuse in r141442. llvm-svn: 141472	2011-10-08 02:34:51 +00:00
Duncan Sands	559ef2f491	Teach GVN to also propagate switch cases. For example, in this code switch (n) { case 27: do_something(x); ... } the call do_something(x) will be replaced with do_something(27). In gcc-as-one-big-file this results in the removal of about 500 lines of bitcode (about 0.02%), so has about 1/10 of the effect of propagating branch conditions. llvm-svn: 141360	2011-10-07 08:29:06 +00:00
Eli Friedman	dd48bb30de	PR11061: Make simplifylibcalls fold strcmp("", x) correctly. While I'm here, fix the related issue with strncmp, add some actual tests for strcmp and strncmp, and start using StringRef::compare for constant folding instead of using strcmp/strncmp so that the optimized IR isn't dependent on the host's implementation of strcmp. llvm-svn: 141227	2011-10-05 22:27:16 +00:00
Jim Grosbach	254b9ed208	Revert 141203. InstCombine is looping on unit tests. llvm-svn: 141209	2011-10-05 20:44:29 +00:00
Rafael Espindola	8247f7a5dd	Check for the returns_twice attribute in callsFunctionThatReturnsTwice. This fixes PR11038, but there are still some cleanups to be done. llvm-svn: 141204	2011-10-05 20:05:13 +00:00
Jim Grosbach	a03dd9189f	Update InstCombine worklist after instruction transform is complete. When updating the worklist for InstCombine, the Add/AddUsersToWorklist functions may access the instruction(s) being added, for debug output for example. If the instructions aren't yet added to the basic block, this can result in a crash. Finish the instruction transformation before adjusting the worklist instead. rdar://10238555 llvm-svn: 141203	2011-10-05 20:05:00 +00:00
Duncan Sands	f7df28c1f5	GVN does simple propagation of conditions: when it sees a conditional branch "br i1 %x, label %if_true, label %if_false" then it replaces "%x" with "true" in places only reachable via the %if_true arm, and with "false" in places only reachable via the %if_false arm. Except that actually it doesn't: if value numbering shows that %y is equal to %x then, yes, %y will be turned into true/false in this way, but any occurrences of %x itself are not transformed. Fix this. What's more, it's often the case that %x is an equality comparison such as "%x = icmp eq %A, 0", in which case every occurrence of %A that is only reachable via the %if_true arm can be replaced with 0. Implement this and a few other variations on this theme. This reduces the number of lines of LLVM IR in "GCC as one big file" by 0.2%. It has a bigger impact on Ada code, typically reducing the number of lines of bitcode by around 0.4% by removing repeated compiler generated checks. Passes the LLVM nightly testsuite and the Ada ACATS testsuite. llvm-svn: 141177	2011-10-05 14:28:49 +00:00
Duncan Sands	348e8c285a	Generalize GVN's conditional propagation logic slightly: it's OK for the false/true destination to have multiple predecessors as long as the extra ones are dominated by the branch destination. llvm-svn: 141176	2011-10-05 14:17:01 +00:00
Andrew Trick	c60e2addd9	LSR should avoid redundant edge splitting. This handles the case in which LSR rewrites an IV user that is a phi and splits critical edges originating from a switch. Fixes <rdar://problem/6453893> LSR is not splitting edges "nicely" llvm-svn: 141059	2011-10-04 03:50:44 +00:00
Andrew Trick	f9b98a3c3e	Unit test for r140919, loop unroll heuristics. llvm-svn: 141049	2011-10-04 00:07:02 +00:00
Rafael Espindola	4700f53cee	Add the returns_twice attribute to LLVM. llvm-svn: 141001	2011-10-03 14:45:37 +00:00
Nick Lewycky	7cd1bfb89d	Add a new icmp+select optz'n. Also shows off the load(cst) folding added in r140966. llvm-svn: 140969	2011-10-02 10:37:37 +00:00
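
Two of the folds listed above (r143209, `icmp ugt (udiv X, Y), X` to false, and r143028, `(x *nsw x) sgt 0` to true where `x = (y | 1)`) can be sanity-checked by brute force at a small bit width. The following is a minimal sketch, not code from the LLVM tree: the 8-bit width and the helper names `to_signed`, `udiv_never_exceeds_dividend` and `odd_square_is_positive` are assumptions made for illustration. It treats division by zero and signed overflow as undefined cases and skips them, which is how the folds are presumably justified.

```python
BITS = 8                      # assumed width; small enough to enumerate
MASK = (1 << BITS) - 1


def to_signed(v):
    """Reinterpret the low BITS bits of v as a two's-complement integer."""
    v &= MASK
    return v - (1 << BITS) if v >> (BITS - 1) else v


def udiv_never_exceeds_dividend():
    """Check that (X udiv Y) ugt X is false whenever the division is defined."""
    return all(
        (x // y) <= x
        for x in range(MASK + 1)
        for y in range(1, MASK + 1)   # Y == 0 is undefined, so it is skipped
    )


def odd_square_is_positive():
    """Check that x * x > 0 for x = (y | 1) whenever the product does not overflow."""
    lo, hi = -(1 << (BITS - 1)), (1 << (BITS - 1)) - 1
    for y in range(MASK + 1):
        x = to_signed(y | 1)          # x is odd, hence nonzero
        product = x * x
        # nsw: only products that fit in the signed range are defined
        if lo <= product <= hi and not product > 0:
            return False
    return True


if __name__ == "__main__":
    print(udiv_never_exceeds_dividend(), odd_square_is_positive())
```

An exhaustive check at one width is evidence rather than a proof for all widths; the general argument is that unsigned division by a nonzero value never increases the dividend, and that the square of a nonzero integer is positive when it does not overflow.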

3010 Commits