llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 16:33:37 +01:00

Author	SHA1	Message	Date
Benjamin Kramer	21a49e9375	simplify-libcalls: fold strstr(a, b) == a -> strncmp(a, b, strlen(b)) == 0 llvm-svn: 106047	2010-06-15 21:34:25 +00:00
Rafael Espindola	ab5183047b	Remove the arm_aapcscc marker from the tests. It is the default for the linux targets. llvm-svn: 106029	2010-06-15 19:04:29 +00:00
Chris Lattner	88d51b0f4c	jump threading can't split a critical edge from an indirectbr. This fixes PR7356. llvm-svn: 105950	2010-06-14 19:45:43 +00:00
Benjamin Kramer	443f74025b	Test case for r105914. llvm-svn: 105915	2010-06-13 16:16:54 +00:00
Kenneth Uildriks	73367eb575	Partial specialization was not checking the callsite to make sure it was using the same constants as the specialization, leading to calls to the wrong specialization. Patch by Takumi Nakamura\! llvm-svn: 105528	2010-06-05 14:50:21 +00:00
Devang Patel	8bf4434e6e	Copy location info for current function argument from dbg.declare if respective store instruction does not have any location info. llvm-svn: 105490	2010-06-04 22:27:30 +00:00
Duncan Sands	da677f56f2	Fix PR7272: when inlining through a callsite with byval arguments, the newly created allocas may be used by inlined calls, so these need to have their tail call flags cleared. Fixes PR7272. llvm-svn: 105255	2010-05-31 21:00:26 +00:00
Nick Lewycky	418d80e555	The memcpy intrinsic only takes i8* for %src and %dst, so cast them to that first. Fixes PR7265. llvm-svn: 105206	2010-05-31 06:16:35 +00:00
Dale Johannesen	6b20aa3751	Add missing space; works for me. llvm-svn: 104992	2010-05-28 18:45:59 +00:00
Dan Gohman	22d22caaed	Teach instcombine to promote alloca array sizes. llvm-svn: 104945	2010-05-28 15:09:00 +00:00
Dan Gohman	bab79afa29	Add a testcase for getelementptr index promotion. llvm-svn: 104944	2010-05-28 15:07:59 +00:00
Devang Patel	8d4eb26f24	Do not drop location info for inlined function args. llvm-svn: 104884	2010-05-27 20:25:04 +00:00
Duncan Sands	32d3986765	Teach instCombine to remove malloc+free if malloc's only uses are comparisons to null. Patch by Matti Niemenmaa. llvm-svn: 104871	2010-05-27 19:09:06 +00:00
Benjamin Kramer	0acbf37982	Properly promote operands when optimizing a single-character memcmp. llvm-svn: 104648	2010-05-25 22:53:43 +00:00
Nick Lewycky	fc4c30e9e3	Actually run the test. Thanks Daniel Dunbar! llvm-svn: 103720	2010-05-13 17:41:06 +00:00
Nick Lewycky	38e49fbf52	Add testcase for r103653. llvm-svn: 103699	2010-05-13 06:00:14 +00:00
Chris Lattner	e74d980a02	make simplifycfg insert an llvm.trap before the 'unreachable' it introduces when it detects undefined behavior. llvm.trap generally codegens into some thing really small (e.g. a 2 byte ud2 instruction on x86) and debugging this sort of thing is "nontrivial". For example, we now compile: void foo() { (int)0 = 42; } into: _foo: pushl %ebp movl %esp, %ebp ud2 Some may even claim that this is a security hole, though that seems dubious to me. This addresses rdar://7958343 - Optimizing away null dereference potentially allows arbitrary code execution llvm-svn: 103356	2010-05-08 22:15:59 +00:00
Chris Lattner	0b442d35da	Teach instcombine to transform a bitcast/(zext\|trunc)/bitcast sequence with a vector input and output into a shuffle vector. This sort of sequence happens when the input code stores with one type and reloads with another type and then SROA promotes to i96 integers, which make everyone sad. This fixes rdar://7896024 llvm-svn: 103354	2010-05-08 21:50:26 +00:00
Chris Lattner	1037630863	Fix PR7052, patch by Jakub Staszak! llvm-svn: 103347	2010-05-08 20:01:44 +00:00
Devang Patel	9290f59fb8	Update test to use valid debug info. llvm-svn: 103287	2010-05-07 20:34:00 +00:00
Dan Gohman	1512bd9998	Add an LLVM IR version of code sinking. This uses the same simple algorithm as MachineSink, but it isn't constrained by MachineInstr-level details. llvm-svn: 103257	2010-05-07 15:40:13 +00:00
Duncan Sands	7db9873b74	Use llvm.foo as the intrinsic, rather than llvm.dbg.value. Since the values passed to llvm.dbg.value were not valid for the intrinsic, it might have caused trouble one day if the verifier ever started checking for valid debug info. llvm-svn: 103038	2010-05-04 20:09:25 +00:00
Duncan Sands	a3857d3d9a	Fix a variant of PR6112 found by thinking about it: when doing RAUW of a global variable with a local variable in function F, if function local metadata M in function G was using the global then M would become function-local to both F and G, which is not allowed. See the testcase for an example. Fixed by detecting this situation and zapping the metadata operand when it occurs. llvm-svn: 103007	2010-05-04 12:43:36 +00:00
Devang Patel	fa560fdfc1	Check for side effects before splitting loop. Patch by Jakub Staszak! llvm-svn: 102928	2010-05-03 18:06:58 +00:00
Chris Lattner	afaee8e110	revert r102831. We already delete dead readonly calls in other places, killing a valid transformation is not the right answer. llvm-svn: 102850	2010-05-01 17:19:38 +00:00
Owen Anderson	443d813b45	Disable the call-deletion transformation introduced in r86975. Without halting analysis, it is illegal to delete a call to a read-only function. The correct solution is almost certainly to add a "must halt" attribute and only allow deletions in its presence. XFAIL the relevant testcase for now. llvm-svn: 102831	2010-05-01 08:34:28 +00:00
Chris Lattner	61a8beaae0	fix PR5009 by making CGSCCPM realize that a call was devirtualized if an indirect call site was removed and a direct one was added, not just if an indirect call site was modified to be direct. llvm-svn: 102830	2010-05-01 06:38:43 +00:00
Chris Lattner	660cc3ac57	rename test llvm-svn: 102829	2010-05-01 06:34:13 +00:00
Chris Lattner	9ee72a47c2	Implement rdar://6295824 and PR6724 with two tiny changes that can have a big effect :). The first is to enable the iterative SCC passmanager juice that kicks in when the scc passmgr detects that a function pass has devirtualized a call. In this case, it will rerun all the passes it manages on the SCC, up to the iteration count limit (4). This is useful because a function pass may devirualize a call, and we want the inliner to inline it, or pruneeh to infer stuff about it, etc. The second patch is to add all call sites to the DevirtualizedCalls list the inliner uses. This list is about to get renamed, but the jist of this is that the inliner now reconsiders all inlined call sites as candidates for further inlining. The intuition is this that in cases like this: f() { g(1); } g(int x) { h(x); } We analyze this bottom up, and may decide that it isn't profitable to inline H into G. Next step, we decide that it is profitable to inline G into F, and do so, which means that F now calls H. Even though the call from G -> H may not have been profitable to inline, the call from F -> H may be (in this case because a constant allows folding etc). In my spot checks, this doesn't have a big impact on code. For example, the LLC output for 252.eon grew from 0.02% (from 317252 to 317308) and 176.gcc actually shrunk by .3% (from 1525612 to 1520964 bytes). 252.eon never iterated in the SCC Passmgr, 176.gcc iterated at most 1 time. llvm-svn: 102823	2010-05-01 01:15:56 +00:00
Chris Lattner	b893cedad2	The inliner has traditionally not considered call sites that appear due to inlining a callee as candidates for futher inlining, but a recent patch made it do this if those call sites were indirect and became direct. Unfortunately, in bizarre cases (see testcase) doing this can cause us to infinitely inline mutually recursive functions into callers not in the cycle. Fix this by keeping track of the inline history from which callsite inline candidates got inlined from. This shouldn't affect any "real world" code, but is required for a follow on patch that is coming up next. llvm-svn: 102822	2010-05-01 01:05:10 +00:00
Chris Lattner	3eb6a9f076	Dan recently disabled recursive inlining within a function, but we were still inlining self-recursive functions into other functions. Inlining a recursive function into itself has the potential to reduce recursion depth by a factor of 2, inlining a recursive function into something else reduces recursion depth by exactly 1. Since inlining a recursive function into something else is a weird form of loop peeling, turn this off. The deleted testcase was added by Dale in r62107, since then we're leaning towards not inlining recursive stuff ever. In any case, if we like inlining recursive stuff, it should be done within the recursive function itself to get the algorithm recursion depth win. llvm-svn: 102798	2010-04-30 22:37:22 +00:00
Devang Patel	8146cb492f	Preserve debug info attached with call instruction while eliminating dead argument. Radar 7927803 llvm-svn: 102760	2010-04-30 20:23:54 +00:00
Chris Lattner	45c337c939	fix this to work with objdir != srcdir llvm-svn: 102547	2010-04-28 22:34:35 +00:00
Chris Lattner	4629370fa2	fix PR6112 - When globalopt (or any other pass) does RAUW(@G, %G), metadata references in non-function-local MDNodes should drop to null. llvm-svn: 102519	2010-04-28 20:16:12 +00:00
Chris Lattner	9065710fcf	fix PR6940: sitofp(undef) folds to 0.0, not undef. llvm-svn: 102358	2010-04-26 18:21:23 +00:00
Chris Lattner	ace5b97b5c	no longer xfail llvm-svn: 102220	2010-04-23 22:39:33 +00:00
Chris Lattner	790231f95e	fix some failures my callgraph dump format change broke. llvm-svn: 102197	2010-04-23 18:38:40 +00:00
Chris Lattner	775c94002d	testcase for the bug that required a patch to be reverted. llvm-svn: 102195	2010-04-23 18:31:01 +00:00
Chris Lattner	85dd1e42b6	disable my previous inliner patch, it appears to be busting self-host. llvm-svn: 102153	2010-04-23 00:41:03 +00:00
Chris Lattner	5d87e1be44	The inliner was choosing to not consider call sites that appear in the SCC as a result of inlining as candidates for inlining. Change this so that it does consider call sites that change from being indirect to being direct as a result of inlining. This allows it to completely "devirtualize" the testcase. llvm-svn: 102146	2010-04-22 23:37:35 +00:00
Chris Lattner	66e308198d	add a DEBUG call so that -debug lists when CGSCCPM iterates. Fix RefreshCallGraph to use CGN->replaceCallEdge instead of hand rolling its own loop. replaceCallEdge properly maintains the reference counts of the nodes, fixing a crash exposed by the iterative callgraph stuff. llvm-svn: 102120	2010-04-22 20:42:33 +00:00
Chris Lattner	c840cfe5c9	Implement (but don't enable) PR6724 and rdar://6295824. In short, we have RefreshCallGraph detect when a function pass devirtualizes a call, and have CGSCCPassMgr iterate (up to a count) when this happens. This allows (in the example) GVN to devirtualize the call in foo, then the inliner to inline it away. This is not currently enabled because I haven't done any analysis on the (potentially substantial) code size or performance impact of doing this, and guess what, it exposes callgraph updating bugs in various passes. This is progress though, and you can play with it by passing -max-cg-scc-iterations=5 to opt. llvm-svn: 101973	2010-04-21 00:47:40 +00:00
Dan Gohman	4d1724c3e8	Revert r101471. For tight recursive functions which have multiple recursive callsites, inlining can reduce the number of calls by exponential factors, as it does in MultiSource/Benchmarks/Olden/treeadd. More involved heuristics will be needed. llvm-svn: 101969	2010-04-21 00:43:30 +00:00
Chris Lattner	e5a995a834	RewriteLoopBodyWithConditionConstant can end up rewriting the condition we're unswitching on. In this case, don't try to simplify the second copy of the loop which may be dead or not, but is probably a constant now. This fixes PR6879 llvm-svn: 101870	2010-04-20 05:09:16 +00:00
Chris Lattner	994155dc91	Fix rdar://7879828 - crash in CallGraph, a self host issue. Arg promotion was deleting call graph nodes that still had references from the 'indirect' CGN. Like the inliner, it should only delete the function if all references are gone. llvm-svn: 101845	2010-04-20 00:46:50 +00:00
Dan Gohman	e52396cb52	Remove the Expr member from IVUsers. Instead of remembering the expression, just ask ScalarEvolution for it on demand. This helps IVUsers be more robust in the case of expressions changing underneath it. This fixes PR6862. llvm-svn: 101819	2010-04-19 21:48:58 +00:00
Nick Lewycky	c639c07492	Fix declarations in a few more tests. llvm-svn: 101676	2010-04-17 21:29:25 +00:00
Nick Lewycky	7abefa1195	Fix intrinsic signature in this test. llvm-svn: 101674	2010-04-17 21:12:55 +00:00
Bob Wilson	ad00f21093	Re-commit my previous SSAUpdater changes. The previous version naively tried to determine where to place PHIs by iteratively comparing reaching definitions at each block. That was just plain wrong. This version now computes the dominator tree within the subset of the CFG where PHIs may need to be placed, and then places the PHIs in the iterated dominance frontier of each definition. The rest of the patch is mostly the same, with a few more performance improvements added in. llvm-svn: 101612	2010-04-17 03:08:24 +00:00
Dan Gohman	58add81e7d	Disable inlining of recursive calls. It can complicate tailcallelim and dependent analyses, and increase code size, so doing it profitably would require more complex heuristics. llvm-svn: 101471	2010-04-16 16:01:18 +00:00
Dan Gohman	f0457a82fa	Refine the detection of seemingly infinitely recursive calls where the callee is expected to be expanded to something else by codegen, so that normal infinitely recursive calls are still transformed. llvm-svn: 101468	2010-04-16 15:57:50 +00:00
Chris Lattner	16e1226366	move comment. llvm-svn: 101433	2010-04-16 01:05:52 +00:00
Chris Lattner	f90092185c	fix PR6832: we were using the alignment of a pointer when we wanted the alignment of the pointee. llvm-svn: 101432	2010-04-16 01:05:38 +00:00
Evan Cheng	a05db75002	Trim tests and convert to FileCheck. llvm-svn: 101277	2010-04-14 20:22:17 +00:00
Nick Lewycky	cc67a7f57e	Revert r101213. llvm-svn: 101231	2010-04-14 04:51:58 +00:00
Nick Lewycky	49c5407ceb	Commit testcase for r101213. llvm-svn: 101214	2010-04-14 03:46:42 +00:00
Dan Gohman	1521831bff	Teach ScalarEvolution to simplify smax and umax when it can prove that one operand is always greater than another. llvm-svn: 101142	2010-04-13 16:51:03 +00:00
Dan Gohman	a83b45d7b5	Teach IndVarSimplify how to eliminate remainder operators where the numerator is an induction variable. For example, with code like this: for (i=0;i<n;++i) x[i%n] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the remainder. llvm-svn: 101113	2010-04-13 01:46:36 +00:00
Dan Gohman	ba53e85cc3	Suppress LinearFunctionTestReplace when the computed backedge-taken expression is a UDiv and it doesn't appear that the UDiv came from the user's source. ScalarEvolution has recently figured out how to compute a tripcount expression for the inner loop in SingleSource/Benchmarks/Shootout/sieve.c, using a udiv. Emitting a udiv instruction dramatically slows down the enclosing loop. llvm-svn: 101068	2010-04-12 21:13:43 +00:00
Eric Christopher	6b38179ee2	Verify function prototypes before trying to optimize functions. We also need TargetData, just return false if we don't have it. Update testcases accordingly. Fixes PR6807. llvm-svn: 101011	2010-04-12 04:48:00 +00:00
Dan Gohman	dff42439b2	Re-apply r101000, with a fix: Don't eliminate an icmp which is part of the loop exit test. This usually doesn't come up for a variety of reasons, but it isn't impossible, so make IndVarSimplify handle it conservatively. llvm-svn: 101008	2010-04-12 02:21:50 +00:00
Dan Gohman	97a1bdfafc	Revert 101000, which is breaking self-host builds. llvm-svn: 101002	2010-04-12 00:17:10 +00:00
Dan Gohman	7e250afcfa	Teach IndVarSimplify how to eliminate comparisons involving induction variables. For example, with code like this: for (i=0;i<n;++i) if (i<n) x[i] = 0; IndVarSimplify will now recognize that i is always less than n inside the loop, and eliminate the if. llvm-svn: 101000	2010-04-11 23:10:12 +00:00
Chris Lattner	92b4e858d0	fix PR6743, a case where we'd delete an instruction before using it in some cases. llvm-svn: 100937	2010-04-10 18:26:57 +00:00
Chris Lattner	c5ee900be8	fix PR6760, a missing check in heap SRoA. llvm-svn: 100936	2010-04-10 18:19:22 +00:00
Dan Gohman	67e02ffd79	When determining a canonical insert position, don't climb deeper into adjacent loops. Also, ensure that the insert position is dominated by the loop latch of any loop in the post-inc set which has a latch. llvm-svn: 100906	2010-04-09 22:07:05 +00:00
Dan Gohman	e36761b7d0	When emitting code for an add, don't force a SCEVUnknown wrapper around a hoisted intermediate result if the intermediate result isn't an Instruction. llvm-svn: 100884	2010-04-09 19:14:31 +00:00
Dan Gohman	853ff6b580	Fix a bug in IVUsers which was permitting non-affine addrecs to be sent to LSR, which it isn't prepared to handle. llvm-svn: 100839	2010-04-09 01:22:56 +00:00
Chris Lattner	08fed5e338	fix a SCCP miscompilation that could happen when a forced constant is changed to a constant, we would end up adding the instruction to the wrong worklist, preventing it from being properly revisited. This fixes rdar://7832370 llvm-svn: 100837	2010-04-09 01:14:31 +00:00
Dan Gohman	5bf62639ed	Print empty structs as {} rather than { }. llvm-svn: 100787	2010-04-08 18:03:05 +00:00
Chris Lattner	23334439e9	add newlines at the end of files. llvm-svn: 100705	2010-04-07 22:53:17 +00:00
Dan Gohman	b5210c934f	Generalize IVUsers to track arbitrary expressions rather than expressions explicitly split into stride-and-offset pairs. Also, add the ability to track multiple post-increment loops on the same expression. This refines the concept of "normalizing" SCEV expressions used for to post-increment uses, and introduces a dedicated utility routine for normalizing and denormalizing expressions. This fixes the expansion of expressions which are post-increment users of more than one loop at a time. More broadly, this takes LSR another step closer to being able to reason about more than one loop at a time. llvm-svn: 100699	2010-04-07 22:27:08 +00:00
Mon P Wang	484bbe6aa9	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Chris Lattner	065798cd82	add integer overflow check for the fp induction variable checker. Amusingly, we already had tests that we should have rejects because they would be miscompiled in the testsuite. The remaining issue with this is that we don't check that the branch causes us to exit the loop if it fails, so we don't actually know if we remain in bounds. llvm-svn: 100284	2010-04-03 07:18:48 +00:00
Chris Lattner	a8c9d26ea1	fix PR6761, a miscompilation due to the fp->int IV conversion stuff. More bugs remain though. llvm-svn: 100282	2010-04-03 06:30:03 +00:00
Chris Lattner	c13dd4f035	convert to filecheck llvm-svn: 100281	2010-04-03 06:27:56 +00:00
Chris Lattner	e1f0cf628b	rename feature test. llvm-svn: 100279	2010-04-03 06:24:28 +00:00
Chris Lattner	4d6928d1c7	actually just remove this, will move the real feature test here. llvm-svn: 100278	2010-04-03 06:24:03 +00:00
Chris Lattner	17259c40aa	rename test since it is a feature test. llvm-svn: 100277	2010-04-03 06:22:52 +00:00
Chris Lattner	5c2f2edd08	first half of a pass through IndVarSimplify::HandleFloatingPointIV, this cleans up a bunch of code and also fixes several crashes and miscompiles. More to come unfortunately, this optimization is quite broken. llvm-svn: 100270	2010-04-03 05:54:59 +00:00
Bob Wilson	5a3200f750	Revert all my SSAUpdater patches. The PHI placement algorithm is not correct (what was I thinking?) and there's also a problem with LCSSA. I'll try again later with fixes. --- Reverse-merging r100263 into '.': U lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100177 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100148 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100147 into '.': U include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100131 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100130 into '.': G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100126 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp --- Reverse-merging r100050 into '.': D test/Transforms/GVN/2010-03-31-RedundantPHIs.ll --- Reverse-merging r100047 into '.': G include/llvm/Transforms/Utils/SSAUpdater.h G lib/Transforms/Utils/SSAUpdater.cpp llvm-svn: 100264	2010-04-03 03:50:38 +00:00
Mon P Wang	0ccf050ca3	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a01350755e	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Dan Gohman	8835a0200a	Manually notify ScalarEvolution before making an operand replacement, since it can't currently observe such changes automatically. llvm-svn: 100186	2010-04-02 14:48:31 +00:00
Dan Gohman	7051600e3f	Revert the recent alignment changes. They're broken for -Os because, in particular, they end up aligning strings at 16-byte boundaries, and there's no way for GlobalOpt to check OptForSize. llvm-svn: 100172	2010-04-02 03:04:37 +00:00
Dan Gohman	ece7e3c015	Make globalopt refine global variable alignment. llvm-svn: 100160	2010-04-02 00:14:16 +00:00
Bob Wilson	b66bed7e3c	Add a redundant PHI testcase for SSAUpdater to go with svn r100047. llvm-svn: 100050	2010-03-31 21:38:43 +00:00
Gabor Greif	b7ecfb134b	testcase for r99914, provided by baldrick! llvm-svn: 100043	2010-03-31 20:37:13 +00:00
Bob Wilson	aae933cc81	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	9351ea594a	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Chris Lattner	1d47291927	fix PR6642, GVN forwarding from memset to load of the base of the memset. llvm-svn: 99488	2010-03-25 05:58:19 +00:00
Eric Christopher	b3abae8852	Reapply r99451 with a fix to move the NoInline check to the cost functions instead of InlineFunction. llvm-svn: 99483	2010-03-25 04:49:10 +00:00
Eric Christopher	e293604548	Temporarily revert this, it's causing an issue with an internal project. llvm-svn: 99451	2010-03-24 23:35:21 +00:00
Chris Lattner	a9eb6a6987	add some accessors to callsite/callinst/invokeinst to check for the noinline attribute, and make the inliner refuse to inline a call site when the call site is marked noinline even if the callee isn't. This fixes PR6682. llvm-svn: 99341	2010-03-23 22:59:07 +00:00
Evan Cheng	1144c23330	Teach simplify libcall to transform __strcpy_chk to __memcpy_chk to enable optimizations down stream. llvm-svn: 99282	2010-03-23 15:48:04 +00:00
Evan Cheng	a544f02286	Fix an incorrect logic causing instcombine to miss some _chk -> non-chk transformations. llvm-svn: 99263	2010-03-23 06:06:09 +00:00
Evan Cheng	34c5c9af6f	Fix a typo in ValueTracking that's causing instcombine to delete needed shift instructions. llvm-svn: 98416	2010-03-13 02:20:29 +00:00
Duncan Sands	01532e804a	When constant folding GEP of GEP, do not crash if an index of the inner GEP is not a ConstantInt. llvm-svn: 98359	2010-03-12 17:55:20 +00:00
Dan Gohman	c421549beb	Make isLCSSA ignore uses in blocks not reachable from the entry block, as LCSSA no longer transforms such uses. llvm-svn: 98033	2010-03-09 01:53:33 +00:00
Evan Cheng	db47eab2a3	Re-commit 97860 with fix. getMallocAllocatedType may return null. llvm-svn: 98000	2010-03-08 22:54:36 +00:00
Eric Christopher	a671e4f7aa	Migrate _chk call lowering from SimplifyLibCalls to InstCombine. Stub out the remainder of the calls that we should lower in some way and move the tests to the new correct directory. Fix up tests that are now optimized more than they were before by -instcombine. llvm-svn: 97875	2010-03-06 10:50:38 +00:00
Eric Christopher	b3f0926f84	Temporarily revert: Log: Transform @llvm.objectsize to integer if the argument is a result of malloc of known size. Modified: llvm/trunk/lib/Transforms/InstCombine/InstCombineCalls.cpp llvm/trunk/test/Transforms/InstCombine/objsize.ll It appears to be causing swb and nightly test failures. llvm-svn: 97866	2010-03-06 03:11:35 +00:00
Evan Cheng	071e007ce6	Transform @llvm.objectsize to integer if the argument is a result of malloc of known size. llvm-svn: 97860	2010-03-06 01:01:42 +00:00
Evan Cheng	3346ea1d4e	Safely turn memset_chk etc. to non-chk variant if the known object size is >= memset / memcpy / memmove size. llvm-svn: 97828	2010-03-05 20:59:47 +00:00
Evan Cheng	782183fe4a	Instcombine should turn llvm.objectsize of a alloca with static size to an integer. llvm-svn: 97827	2010-03-05 20:47:23 +00:00
Chris Lattner	789121d6e2	fix PR6512, a case where instcombine would incorrectly merge loads from different addr spaces. llvm-svn: 97813	2010-03-05 18:53:28 +00:00
Chris Lattner	617f774b4e	Fix PR6503. This turned into a much more interesting and nasty bug. Various parts of the cmp\|cmp and cmp&cmp folding logic wasn't prepared for vectors (unrelated to the bug but noticed while in the code) and the code was definitely not safe to use by the (cast icmp)\|(cast icmp) handling logic that I added in r95855. Fix all this up by changing the various routines to more consistently use IRBuilder and not pass in the I which had the wrong type. llvm-svn: 97801	2010-03-05 08:46:26 +00:00
Chris Lattner	f44ffcbc2c	make these less sensitive to temporary naming. llvm-svn: 97799	2010-03-05 08:43:33 +00:00
Chris Lattner	3d5ab3df06	remove this testcase, it isn't clear what it was testing and it is subsumed by or.ll llvm-svn: 97798	2010-03-05 08:43:06 +00:00
Chris Lattner	e457488032	fix a nice subtle reassociate bug which would only occur in a very specific use pattern embodied in the carefully reduced testcase. llvm-svn: 97794	2010-03-05 07:18:54 +00:00
Nick Lewycky	58ab63e179	Make the 'icmp pred trunc(ext(X)), CST --> icmp pred X, ext(trunc(CST))' transformation much more careful. Truncating binary '01' to '1' sounds like it's safe until you realize that it switched from positive to negative under a signed interpretation, and that depends on the icmp predicate. Also a few miscellaneous cleanups. llvm-svn: 97721	2010-03-04 06:54:10 +00:00
Chris Lattner	9e230fb6b2	fix incorrect folding of icmp with undef, PR6481. llvm-svn: 97659	2010-03-03 19:46:03 +00:00
Bill Wendling	d1f658563d	This test case: long test(long x) { return (x & 123124) \| 3; } Currently compiles to: _test: orl $3, %edi movq %rdi, %rax andq $123127, %rax ret This is because instruction and DAG combiners canonicalize (or (and x, C), D) -> (and (or, D), (C \| D)) However, this is only profitable if (C & D) != 0. It gets in the way of the 3-addressification because the input bits are known to be zero. llvm-svn: 97616	2010-03-03 00:35:56 +00:00
Dan Gohman	1625456786	Non-affine post-inc SCEV expansions have more code which must be emitted after the increment. Make sure the insert position reflects this. This fixes PR6453. llvm-svn: 97537	2010-03-02 01:59:21 +00:00
Dan Gohman	37bf232609	Floating-point add, sub, and mul are now spelled fadd, fsub, and fmul, respectively. llvm-svn: 97531	2010-03-02 01:11:08 +00:00
Devang Patel	9f858ad942	Remove tests that checks @llvm.dbg.stoppoint handling. llvm-svn: 97493	2010-03-01 20:33:48 +00:00
Chris Lattner	04209058b9	stop using anders-aa llvm-svn: 97492	2010-03-01 20:24:50 +00:00
Devang Patel	6dd4084f57	@llvm.dbg.stoppoint intrinsic is not used anymore. Delete dead testcase. llvm-svn: 97489	2010-03-01 19:46:08 +00:00
Devang Patel	ef282ea4c2	Update to use new debug info encoding scheme. As a bonus, now the test passes! llvm-svn: 97487	2010-03-01 19:41:26 +00:00
Devang Patel	66fd0f6b4b	Remove this test because it checks wheter optimizer handled @llvm.dbg.global_variable appropriately or not. LLVM does not use this scheme to encode debug info for global variables any more. llvm-svn: 97480	2010-03-01 19:14:25 +00:00
Dan Gohman	5e58ab0b56	LLVM instruction syntax doesn't have trailing semicolons. llvm-svn: 97456	2010-03-01 17:53:15 +00:00
John McCall	69bc985550	Teach APFloat how to create both QNaNs and SNaNs and with arbitrary-width payloads. APFloat's internal folding routines always make QNaNs now, instead of sometimes making QNaNs and sometimes SNaNs depending on the type. llvm-svn: 97364	2010-02-28 02:51:25 +00:00
Chris Lattner	93fd3ddf24	fix PR6414, a nondeterminism issue in IPSCCP which was because of a subtle interation in a loop operating in densemap order. llvm-svn: 97288	2010-02-27 00:07:42 +00:00
Chris Lattner	3832b527d8	fix PR6435 another bug from the MallocInst elimination work. llvm-svn: 97231	2010-02-26 18:23:13 +00:00
Chris Lattner	8a60e7080e	this file lacks a run line! llvm-svn: 97208	2010-02-26 02:40:57 +00:00
Chris Lattner	8cb1dc746d	rewrite OptimizeGlobalAddressOfMalloc to fix PR6422, some bugs introduced when mallocinst was eliminated. llvm-svn: 97178	2010-02-25 22:33:52 +00:00
Dan Gohman	52ed61204b	Make LoopSimplify change conditional branches in loop exiting blocks which branch on undef to branch on a boolean constant for the edge exiting the loop. This helps ScalarEvolution compute trip counts for loops. Teach ScalarEvolution to recognize single-value PHIs, when safe, and ForgetSymbolicName to forget such single-value PHI nodes as apprpriate in ForgetSymbolicName. llvm-svn: 97126	2010-02-25 06:57:05 +00:00
Dan Gohman	b3bf4992c5	Don't do (X != Y) ? X : Y -> X for floating-point values; it doesn't handle NaN properly. Do (X une Y) ? X : Y -> X if one of X and Y is not zero. llvm-svn: 96955	2010-02-23 17:17:57 +00:00
Dan Gohman	5a6b9ad7db	Remove the code which constant-folded ptrtoint(inttoptr(x)+c) to getelementptr. Despite only doing so in the case where x is a known array object and c can be converted to an index within range, this could still be invalid if c is actually the address of an object allocated outside of LLVM. Also, SCEVExpander, the original motivation for this code, has since been improved to avoid inttoptr+ptroint in more cases. llvm-svn: 96950	2010-02-23 16:35:41 +00:00
Dan Gohman	0891078790	Convert this test to FileCheck and add a testcase for PR3574. llvm-svn: 96851	2010-02-23 01:28:09 +00:00
Evan Cheng	d9816ef946	Instcombine constant folding can normalize gep with negative index to index with large offset. When instcombine objsize checking transformation sees these geps where the offset seemingly point out of bound, it should just return "i don't know" rather than asserting. llvm-svn: 96825	2010-02-22 23:34:00 +00:00
Dan Gohman	fa70dbe3f7	Add a test for canonicalizing ConstantExpr operands. llvm-svn: 96820	2010-02-22 23:07:52 +00:00
Dan Gohman	dcc3634e46	Constant-fold certain comparisons with infinity and negative infinity. llvm-svn: 96777	2010-02-22 04:06:03 +00:00
Dan Gohman	9268d67079	Teach ScalarEvolution how to compute a tripcount for a loop with true or false as its exit condition. These are usually eliminated by SimplifyCFG, but the may be left around during a pass which wishes to preserve the CFG. llvm-svn: 96683	2010-02-19 18:12:07 +00:00
Dan Gohman	2a4208c74e	Fold bswap(undef) to undef. llvm-svn: 96432	2010-02-17 00:54:58 +00:00
Bob Wilson	7bb549dc8e	Testcase for critical edge splitting with load PRE. llvm-svn: 96385	2010-02-16 20:48:55 +00:00
Chris Lattner	a8505609fe	fix PR6305 by handling BlockAddress in a helper function called by jump threading. llvm-svn: 96263	2010-02-15 20:47:49 +00:00
Eric Christopher	96f3c4222f	Fix a problem where we had bitcasted operands that gave us odd offsets since the bitcasted pointer size and the offset pointer size are going to be different types for the GEP vs base object. llvm-svn: 96134	2010-02-13 23:38:01 +00:00
Chris Lattner	ecb203898a	1. modernize the constantmerge pass, using densemap/smallvector. 2. don't bother trying to merge globals in non-default sections, doing so is quite dubious at best anyway. 3. fix a bug reported by Arnaud de Grandmaison where we'd try to merge two globals in different address spaces. llvm-svn: 95995	2010-02-12 18:17:23 +00:00
Chris Lattner	fbf6ef7c34	rename test llvm-svn: 95993	2010-02-12 18:05:00 +00:00
Dan Gohman	c40eb525ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Eric Christopher	2e0201ee18	Make sure that ConstantExpr offsets also aren't off of extern symbols. Thanks to Duncan Sands for the testcase! llvm-svn: 95877	2010-02-11 17:44:04 +00:00
Chris Lattner	a59eb7c09c	Rename ValueRequiresCast to ShouldOptimizeCast, to better reflect what it does. Enhance it to return false to optimizing vector sign extensions from vector comparisions, which is the idiom used to get a splatted vector for a vector comparison. Doing this breaks vector-casts.ll, add some compensating transformations to handle the important case they cover without depending on this canonicalization. This fixes rdar://7434900 a serious pessimization of vector compares. llvm-svn: 95855	2010-02-11 06:26:33 +00:00
Chris Lattner	ef91e752a6	convert to filecheck. llvm-svn: 95854	2010-02-11 06:24:37 +00:00
Chris Lattner	a087e6e82f	Make DSE only scan blocks that are reachable from the entry block. Other blocks may have pointer cycles that will crash basicaa and other alias analyses. In any case, there is no point wasting cycles optimizing dead blocks. This fixes rdar://7635088 llvm-svn: 95852	2010-02-11 05:11:54 +00:00
Chris Lattner	199f4187b6	a testcase that doesn't crash GVN but could someday. llvm-svn: 95851	2010-02-11 05:08:05 +00:00
Chris Lattner	733ffcdb1f	Make jump threading honor x\|undef -> true and x&undef -> false, instead of considering x\|undef -> x, which may not be true. llvm-svn: 95850	2010-02-11 04:40:44 +00:00
Eric Christopher	9516309f55	Add ConstantExpr handling to Intrinsic::objectsize lowering. Update testcase accordingly now that we can optimize another section. llvm-svn: 95846	2010-02-11 01:48:54 +00:00
Eric Christopher	6691a59247	Move Intrinsic::objectsize lowering back to InstCombineCalls and enable constant 0 offset lowering. llvm-svn: 95691	2010-02-09 21:24:27 +00:00
Eric Christopher	871cf7bce2	Pull these back out, they're a little too aggressive and time consuming for a simple optimization. llvm-svn: 95671	2010-02-09 17:29:18 +00:00
Chris Lattner	26b712379f	fix PR6193, only considering sign extensions from i1 for this xform. llvm-svn: 95642	2010-02-09 01:12:41 +00:00
Eric Christopher	428b385575	Add a new pass to do llvm.objsize lowering using SCEV. Initial skeleton and SCEVUnknown lowering implemented, the rest should come relatively quickly. Move testcase to new directory. Move pass to right before SimplifyLibCalls - which is moved down a bit so we can take advantage of a few opts. llvm-svn: 95628	2010-02-09 00:35:38 +00:00
Bob Wilson	60fb5a2446	Add a test for my change to disable reassociation for i1 types. llvm-svn: 95465	2010-02-06 01:16:25 +00:00
Jakob Stoklund Olesen	670458b3be	Teach SimplifyCFG about magic pointer constants. Weird code sometimes uses pointer constants other than null. This patch teaches SimplifyCFG to build switch instructions in those cases. Code like this: void f(const char x) { if (!x) puts("null"); else if ((uintptr_t)x == 1) puts("one"); else if (x == (char)2 \|\| x == (char)3) puts("two"); else if ((intptr_t)x == 4) puts("four"); else puts(x); } Now becomes a switch: define void @f(i8 %x) nounwind ssp { entry: %magicptr23 = ptrtoint i8* %x to i64 ; <i64> [#uses=1] switch i64 %magicptr23, label %if.else16 [ i64 0, label %if.then i64 1, label %if.then2 i64 2, label %if.then9 i64 3, label %if.then9 i64 4, label %if.then14 ] Note that LLVM's own DenseMap uses magic pointers. llvm-svn: 95439	2010-02-05 22:03:18 +00:00
Chris Lattner	44965f1107	fix logical-select to invoke filecheck right, and fix hte instcombine xform it is checking to actually pass. There is no need to match m_SelectCst<0, -1> since instcombine canonicalizes that into not(sext). Add matches for sext(not(x)) in addition to not(sext(x)). llvm-svn: 95420	2010-02-05 19:53:02 +00:00
Eric Christopher	f89979ce6a	Remove this code for now. I have a better idea and will rewrite with that in mind. llvm-svn: 95402	2010-02-05 19:04:06 +00:00
Eric Christopher	ee4a176739	Temporarily revert this since it appears to have caused a build failure. llvm-svn: 95294	2010-02-04 06:41:27 +00:00
Eric Christopher	9b3e42f09e	Rework constant expr and array handling for objectsize instcombining. Fix bugs where we would compute out of bounds as in bounds, and where we couldn't know that the linker could override the size of an array. Add a few new testcases, change existing testcase to use a private global array instead of extern. llvm-svn: 95283	2010-02-04 02:55:34 +00:00
Eric Christopher	fe6ab1518e	If we're dealing with a zero-length array, don't lower to any particular size, we just don't know what the length is yet. llvm-svn: 95266	2010-02-03 23:56:07 +00:00
Evan Cheng	e273e42195	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Eric Christopher	ac28e14b77	Recommit this, looks like it wasn't the cause. llvm-svn: 95165	2010-02-03 00:21:58 +00:00
Eric Christopher	f070aae6f7	Hopefully temporarily revert this. llvm-svn: 95154	2010-02-02 23:01:31 +00:00
Eric Christopher	575fe8690d	Re-add strcmp and known size object size checking optimization. Passed bootstrap and nightly test run here. llvm-svn: 95145	2010-02-02 22:10:43 +00:00
Chris Lattner	9f50341a96	don't turn (A & (C0?-1:0)) \| (B & ~(C0?-1:0)) -> C0 ? A : B for vectors. Codegen is generating awful code or segfaulting in various cases (e.g. PR6204). llvm-svn: 95058	2010-02-02 02:43:51 +00:00
Chris Lattner	e471d94f91	fix a crash in loop unswitch on a loop invariant vector condition. llvm-svn: 95055	2010-02-02 02:26:54 +00:00
Chris Lattner	5371fc3f06	remove an unreduced testcase, rename another. llvm-svn: 95054	2010-02-02 02:23:37 +00:00
Chris Lattner	18e6b4eb6b	fix PR6195, a bug constant folding scalar -> vector compares. llvm-svn: 94997	2010-02-01 20:04:40 +00:00
Chris Lattner	8e4042108e	fix PR6197 - infinite recursion in ipsccp due to block addresses evaluateICmpRelation wasn't handling blockaddress. llvm-svn: 94993	2010-02-01 19:35:08 +00:00
Dan Gohman	0b2c2769ba	Generalize target-independent folding rules for sizeof to handle more cases, and implement target-independent folding rules for alignof and offsetof. Also, reassociate reassociative operators when it leads to more folding. Generalize ScalarEvolution's isOffsetOf to recognize offsetof on arrays. Rename getAllocSizeExpr to getSizeOfExpr, and getFieldOffsetExpr to getOffsetOfExpr, for consistency with analagous ConstantExpr routines. Make the target-dependent folder promote GEP array indices to pointer-sized integers, to make implicit casting explicit and exposed to subsequent folding. And add a bunch of testcases for this new functionality, and a bunch of related existing functionality. llvm-svn: 94987	2010-02-01 18:27:38 +00:00
Chris Lattner	5f10919836	fix rdar://7590304, a miscompilation of objc apps on arm. The caller of objc message send was getting marked arm_apcscc, but the prototype isn't. This is fine at runtime because objcmsgsend is implemented in assembly. Only turn a mismatched caller and callee into 'unreachable' if the callee is a definition. llvm-svn: 94986	2010-02-01 18:11:34 +00:00
Chris Lattner	a336497d3f	fix rdar://7590304, an infinite loop in instcombine. In the invoke case, instcombine can't zap the invoke for fear of changing the CFG. However, we have to do something to prevent the next iteration of instcombine from inserting another store -> undef before the invoke thereby getting into infinite iteration between dead store elim and store insertion. Just zap the callee to null, which will prevent the next iteration from doing anything. llvm-svn: 94985	2010-02-01 18:04:58 +00:00
Eli Friedman	0babc63336	Remove test which is no longer relevant. llvm-svn: 94944	2010-01-31 04:40:45 +00:00
Eli Friedman	19c5c57885	Simplify/generalize the xor+add->sign-extend instcombine. llvm-svn: 94943	2010-01-31 04:29:12 +00:00
Eli Friedman	58c7936637	Add a small transform: transform -(X<<Y) to (-X<<Y) when the shift has a single use and X is free to negate. llvm-svn: 94941	2010-01-31 02:30:23 +00:00
Evan Cheng	c2f3c20680	Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know. llvm-svn: 94937	2010-01-31 00:59:31 +00:00
Bob Wilson	0f04082970	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Bob Wilson	ccd1585ba8	Remove ARM-specific calling convention from this test. Target data is needed for this test, but otherwise, there's nothing ARM-specific about it and no need to specify the calling convention. llvm-svn: 94862	2010-01-30 00:40:23 +00:00
Eric Christopher	47d90f7adb	Revert my last couple of patches. They appear to have broken bison. llvm-svn: 94841	2010-01-29 21:16:24 +00:00
Bob Wilson	f897b7b37e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Eric Christopher	f01379e6c2	Make strcpy_chk lower to strcpy if we have a safe size. llvm-svn: 94783	2010-01-29 01:37:11 +00:00
Eric Christopher	7d74af1824	Add constant support to object size handling and remove default lowering. We'll either figure it out, or not and be lowered by SelectionDAGBuild. Add test. llvm-svn: 94775	2010-01-29 01:09:57 +00:00
Duncan Sands	a3395c61b5	Fix PR6165. The bug was that LHSKnownZero was being and'd with DemandedMask when it should have been and'd with LowBits. Fix that and while there beef up the logic in the case of a negative LHS. llvm-svn: 94745	2010-01-28 17:22:42 +00:00
Bob Wilson	2e1a609654	Avoid creating redundant PHIs in SSAUpdater::GetValueInMiddleOfBlock. This was already being done in SSAUpdater::GetValueAtEndOfBlock so I've just changed SSAUpdater to check for existing PHIs in both places. llvm-svn: 94690	2010-01-27 22:01:02 +00:00
Victor Hernandez	e6321dc910	When converting dbg.declare to dbg.value, attach promoted store's debug metadata to dbg.value llvm-svn: 94634	2010-01-27 00:44:36 +00:00
Dan Gohman	71fc5e8fce	-disable-output is no longer needed with -analyze. llvm-svn: 94574	2010-01-26 19:25:59 +00:00
Victor Hernandez	f12e8a120f	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare instrinsic, insert an llvm.dbg.var intrinsic before each store. llvm-svn: 94493	2010-01-26 02:42:15 +00:00
Victor Hernandez	253c09eb8f	Revert r94260 until findDbgDeclare() is made more efficient llvm-svn: 94432	2010-01-25 17:52:13 +00:00
Chris Lattner	91fafbd4e8	change the canonical form of "cond ? -1 : 0" to be "sext cond" instead of a select. This simplifies some instcombine code, matches the policy for zext (cond ? 1 : 0 -> zext), and allows us to generate better code for a testcase on ppc. llvm-svn: 94339	2010-01-24 00:09:49 +00:00
Nick Lewycky	f9a681fb61	Speculatively revert r94322 to see if it fixes darwin selfhost buildbot. llvm-svn: 94331	2010-01-23 20:32:12 +00:00
Chris Lattner	b444bc0234	third bug from PR6119: the xor dupe extension allows for arbitrary terminators in predecessors, don't assume it is a conditional or uncond branch. The testcase shows an example where they can happen with switches. llvm-svn: 94323	2010-01-23 19:21:31 +00:00
Nick Lewycky	8bbb754c7c	Teach DAE that even though it can't modify the function signature of an externally visible function, it can still find all callers of it and replace the parameters to a dead argument with undef. llvm-svn: 94322	2010-01-23 19:19:34 +00:00
Chris Lattner	7130788ea2	add an early out to ProcessBranchOnXOR to speed it up, handle the case when we can infer an input to the xor from all inputs that agree, instead of going into an infinite loop. Another part of PR6199 llvm-svn: 94321	2010-01-23 19:16:25 +00:00
Chris Lattner	e4391a1adb	fix a crash in jump threading, PR6119 llvm-svn: 94319	2010-01-23 18:56:07 +00:00
Chris Lattner	8909d5aca5	implement a simple instcombine xform that has been in the readme forever. llvm-svn: 94318	2010-01-23 18:49:30 +00:00
Mon P Wang	b7fce13b78	InstCombine should not fold sext/zext of a vector and a bitcast to a scalar to a sext/zext llvm-svn: 94280	2010-01-23 04:35:57 +00:00
Victor Hernandez	7ce9006ba0	In mem2reg, for all alloca/stores that get promoted where the alloca has an associated llvm.dbg.declare instrinsic, insert an llvm.dbg.var intrinsic before each store llvm-svn: 94260	2010-01-23 00:17:34 +00:00
Dan Gohman	525f7d7833	Revert LoopStrengthReduce.cpp to pre-r94061 for now. llvm-svn: 94123	2010-01-22 00:46:49 +00:00
Nick Lewycky	938b8b195c	Fix a crasher trying to fold each element in a comparison between two vectors if one of the vectors didn't have elements (such as undef). Fixes PR 6096. Fix an issue in the constant folder where fcmp (<2 x %ty>, <2 x %ty>) would have <2 x i1> type if constant folding was successful and i1 type if it wasn't. This exposed a related issue in the bitcode reader. llvm-svn: 94069	2010-01-21 07:03:21 +00:00
Dan Gohman	be34c35f32	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Dan Gohman	190fee462e	Add nounwinds. llvm-svn: 93919	2010-01-19 21:51:51 +00:00

... 2 3 4 5 6 ...

1889 Commits