llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 05:23:45 +02:00

Author	SHA1	Message	Date
Chris Lattner	1c8bd32732	consolodate various GEP tests into getelementptr.ll using filecheck. llvm-svn: 80514	2009-08-30 21:02:36 +00:00
Chris Lattner	ac17c19a06	another huge testcase, this time from 'gs' in llvm-test. llvm-svn: 80513	2009-08-30 21:02:02 +00:00
Chris Lattner	6c6cd82568	remove another poorly-reduced testcase which came from ldecod in llvm-test. llvm-svn: 80512	2009-08-30 21:01:14 +00:00
Chris Lattner	94ea2fb674	this testcase is 500 lines long and is distilled from bzip2, just remove it. llvm-svn: 80511	2009-08-30 21:00:11 +00:00
Chris Lattner	22c8be162d	convert to filecheck llvm-svn: 80510	2009-08-30 20:48:15 +00:00
Chris Lattner	d21a94f8d6	Fix PR4748: don't fold gep(bitcast(x)) into bitcast(gep) when x is itself a bitcast. Since we have gep(bitcast(bitcast(y))) in this case, just wait for the two bitcasts to get zapped. This prevents instcombine from confusing some aliasing stuff, and allows it to directly eliminate the load in the testcase. llvm-svn: 80508	2009-08-30 20:38:21 +00:00
Devang Patel	fbaeda732e	Reapply 79977. Use MDNodes to encode debug info in llvm IR. llvm-svn: 80406	2009-08-28 23:24:31 +00:00
Chris Lattner	f37893e7a1	Fix PR3913, patch by Jakub Staszak! llvm-svn: 80327	2009-08-28 00:43:14 +00:00
Chris Lattner	7785b6000e	Implement a new optimization in the inliner: if inlining multiple calls into a function and if the calls bring in arrays, try to merge them together to reduce stack size. For example, in the testcase we'd previously end up with 4 allocas, now we end up with 2 allocas. As described in the comments, this is not really the ideal solution to this problem, but it is surprisingly effective. For example, on 176.gcc, we end up eliminating 67 arrays at "gccas" time and another 24 at "llvm-ld" time. One piece of concern that I didn't look into: at -O0 -g with forced inlining this will almost certainly result in worse debug info. I think this is acceptable though given that this is a case of "debugging optimized code", and we don't want debug info to prevent the optimizer from doing things anyway. llvm-svn: 80215	2009-08-27 06:29:33 +00:00
Chris Lattner	93d567a70d	the inliner shouldn't crash on this. llvm-svn: 80214	2009-08-27 06:20:45 +00:00
Devang Patel	10c075a316	Revert 79977. It causes llvm-gcc bootstrap failures on some platforms. llvm-svn: 80073	2009-08-26 05:01:18 +00:00
Dan Gohman	e298d55930	Special-case static allocas in IndVarSimplify's loop invariant sinking code, since they are special. If the loop preheader happens to be the entry block of a function, don't sink static allocas out of it. This fixes PR4775. llvm-svn: 80010	2009-08-25 17:42:10 +00:00
Dan Gohman	bf08e82d8e	Remove obsolete -f flags. llvm-svn: 79992	2009-08-25 15:38:29 +00:00
Devang Patel	7d42bfab6c	Update DebugInfo interface to use metadata, instead of special named llvm.dbg.... global variables, to encode debugging information in llvm IR. This is mostly a mechanical change that tests metadata support very well. This change speeds up llvm-gcc by more then 6% at "-O0 -g" (measured by compiling InstructionCombining.cpp!) llvm-svn: 79977	2009-08-25 05:24:07 +00:00
Dan Gohman	d240c19451	Change getelementptr folding to use APInt instead of uint64_t for offset computations. This fixes a truncation bug on targets that don't have 64-bit pointers. llvm-svn: 79639	2009-08-21 16:52:54 +00:00
Dan Gohman	e3245061a9	Add targetdata strings to these tests, since SimplifyLibCalls uses TargetData to find the pointer size. llvm-svn: 79490	2009-08-19 23:18:49 +00:00
Dan Gohman	cc511acf87	Fix a bug in the over-index constant folding. When over-indexing an array member of a struct, it's possible to land in an arbitrary position inside that struct, such that attempting to find further getelementptr indices will fail. In such cases, folding cannot be done. llvm-svn: 79485	2009-08-19 22:46:59 +00:00
Dan Gohman	bc59c24278	Canonicalize indices in a constantexpr GEP. If Indices exceed the static extents of the static array type, it causes GlobalOpt and other passes to be more conservative. This canonicalization also allows the constant folder to add "inbounds" to GEPs. llvm-svn: 79440	2009-08-19 18:18:36 +00:00
Nick Lewycky	47bc7e0bd0	Fix up PHI nodes correctly in the presence of unreachable BBs, part two. Also delete a newed pointer, and improve readability a little bit. llvm-svn: 79411	2009-08-19 07:16:57 +00:00
Dan Gohman	807652ac3a	Fix SimplifyLibcalls and ValueTracking to check mayBeOverridden before performing optimizations based on constant string values. llvm-svn: 79384	2009-08-19 00:11:12 +00:00
Dan Gohman	b0cf049a1e	Generalize ScalarEvolution to be able to analyze GEPs when TargetData is not present. It still uses TargetData when available. This generalization also fixed some limitations in the TargetData case; the attached testcase covers this. llvm-svn: 79344	2009-08-18 16:46:41 +00:00
Dan Gohman	0b1af29372	Fix a bug that caused globalopt to miscompile tramp3d: don't miss unruly indices for arrays that are members of structs. llvm-svn: 79337	2009-08-18 14:58:19 +00:00
Nick Lewycky	db9fe40304	Test the pass the test is actually for, instead of one that doesn't exist. llvm-svn: 79257	2009-08-17 17:41:29 +00:00
Nick Lewycky	afaae7957e	Don't crash on critical edge. Patch by Andre Tavares. llvm-svn: 79252	2009-08-17 17:00:57 +00:00
Nick Lewycky	3e2d98a039	Add a test that shows that SSI is working correctly. llvm-svn: 79230	2009-08-17 07:32:08 +00:00
Nick Lewycky	e791519ea3	Don't crash trying to promote VLAs. llvm-svn: 79226	2009-08-17 05:37:31 +00:00
Eli Friedman	ca19f19760	Fix for PR3016: detect the tricky case, where there are unfoldable references to a PHI node in the block being folded, and disable the transformation in that case. The correct transformation of such PHI nodes depends on whether BB dominates Succ, and dominance is expensive to compute here. (Alternatively, it's possible to check whether any uses are live, but that's also essentially a dominance calculation. Another alternative is to use reg2mem, but it probably isn't a good idea to use that in simplifycfg.) Also, remove some incorrect code from CanPropagatePredecessorsForPHIs which is made unnecessary with this patch: it didn't consider the case where a PHI node in BB has multiple uses. llvm-svn: 79174	2009-08-16 04:23:49 +00:00
Nick Lewycky	de61ef6c5e	SSI construction should just go ahead and ignore instructions in unreachable blocks. llvm-svn: 79132	2009-08-15 20:12:18 +00:00
Mon P Wang	d56e4482fc	When InstCombine simplifies a load -> extract element to gep -> load, place the new load by the old load instead of by the extract element because a store could have occurred between the load and extract element. llvm-svn: 78891	2009-08-13 05:12:13 +00:00
Dan Gohman	9f80d2be6b	Make LLVM Assembly dramatically easier to read by aligning the comments, using formatted_raw_ostream's PadToColumn. Before: bb1: ; preds = %bb %2 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %3 = getelementptr double* %p, i64 %2 ; <double> [#uses=1] %4 = load double %3, align 8 ; <double> [#uses=1] %5 = fmul double %4, 1.100000e+00 ; <double> [#uses=1] %6 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %7 = getelementptr double* %p, i64 %6 ; <double> [#uses=1] After: bb1: ; preds = %bb %2 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %3 = getelementptr double %p, i64 %2 ; <double> [#uses=1] %4 = load double %3, align 8 ; <double> [#uses=1] %5 = fmul double %4, 1.100000e+00 ; <double> [#uses=1] %6 = sext i32 %i.01 to i64 ; <i64> [#uses=1] %7 = getelementptr double* %p, i64 %6 ; <double*> [#uses=1] Several tests required whitespace adjustments. llvm-svn: 78816	2009-08-12 17:23:50 +00:00
Dan Gohman	00ee3a9a1a	Transform -X/C to X/-C, implementing a README.txt entry. llvm-svn: 78812	2009-08-12 16:37:02 +00:00
Dan Gohman	d5b6e35080	Optimize (x/C)*C to x if the division is exact. llvm-svn: 78811	2009-08-12 16:33:09 +00:00
Dan Gohman	8d69df5773	Optimize exact sdiv by a constant power of 2 to ashr. llvm-svn: 78714	2009-08-11 20:47:47 +00:00
Dan Gohman	08e747855c	Don't assume that external global variables are aligned at their preferred alignment. Only the minimum alignment guaranteed by the ABI may be assumed. llvm-svn: 78668	2009-08-11 15:50:03 +00:00
Dan Gohman	aba682a290	Add -disable-output. Thanks Bill! llvm-svn: 78009	2009-08-03 22:24:22 +00:00
Dan Gohman	39f93f6443	Add a new Constant::getIntegerValue helper function, and convert a few places in InstCombine to use it, to fix problems handling pointer types. This fixes the recent llvm-gcc bootstrap error. llvm-svn: 78005	2009-08-03 22:07:33 +00:00
Eli Friedman	7bb0485237	PR4662: Fix a crash introduced by the recent LLVMContext changes. llvm-svn: 77716	2009-07-31 19:36:47 +00:00
Daniel Dunbar	89cb72a6bc	Fix PR4645 which was fallout from the fix for PR4641. - Call RAUW to delete all instructions (this is a patch from Nick Lewycky). llvm-svn: 77512	2009-07-29 22:00:43 +00:00
Nick Lewycky	1961298b63	Just discard the output, no need to turn it back into text. llvm-svn: 77439	2009-07-29 06:14:52 +00:00
Chris Lattner	e5f1099d05	don't dump .bc file to stdout, and simplify this to a trivial testcase. llvm-svn: 77436	2009-07-29 05:32:07 +00:00
Nick Lewycky	e0524c1795	Bulk erasing instructions without RAUWing them is unsafe. Instead, break them into a new BB that has no predecessors. llvm-svn: 77433	2009-07-29 05:17:50 +00:00
Dan Gohman	0d0dd7b732	Teach instcombine to respect and preserve inbounds. Add inbounds to a few tests where it is required for the expected transformation. llvm-svn: 77290	2009-07-28 01:40:03 +00:00
Chris Lattner	0426853d67	merge vector-casts-0.ll into vector-casts.ll llvm-svn: 76864	2009-07-23 05:33:39 +00:00
Chris Lattner	c687344f0c	Make some existing optimizations that would only trigger on scalars also apply to vectors. This allows us to compile this: #include <emmintrin.h> __m128i a(__m128 a, __m128 b) { return a==a & b==b; } __m128i b(__m128 a, __m128 b) { return a!=a \| b!=b; } to: _a: cmpordps %xmm1, %xmm0 ret _b: cmpunordps %xmm1, %xmm0 ret with clang instead of to a ton of horrible code. llvm-svn: 76863	2009-07-23 05:32:17 +00:00
Chris Lattner	f4474da353	convert a test to filecheck format. This fixes an endemic problem with negative tests: this test wasn't checking what it thought it was because it was grepping .bc, not .ll. llvm-svn: 76861	2009-07-23 05:27:48 +00:00
Chris Lattner	7061a4300d	rename test llvm-svn: 76860	2009-07-23 05:25:12 +00:00
Dan Gohman	f2c6e6a1bd	Add a testcase for PR2831. llvm-svn: 76527	2009-07-21 01:02:18 +00:00
Dan Gohman	74a435e9f1	The upper argument of ConstantRange is exclusive, not inclusive. llvm-svn: 76492	2009-07-20 22:34:18 +00:00
Dan Gohman	00b05492f1	Revert the addition of hasNoPointerOverflow to GEPOperator. Getelementptrs that are defined to wrap are virtually useless to optimization, and getelementptrs that are undefined on any kind of overflow are too restrictive -- it's difficult to ensure that all intermediate addresses are within bounds. I'm going to take a different approach. Remove a few optimizations that depended on this flag. llvm-svn: 76437	2009-07-20 17:43:30 +00:00
Eli Friedman	e507c1afaa	Canonicalize bitcasts between types like <1 x i64> and i64 to insertelement/extractelement. I'm not entirely sure this is precisely what we want to do: should we prefer bitcast(insertelement) or insertelement(bitcast)? Similarly. should we prefer extractelement(bitcast) or bitcast(extractelement)? llvm-svn: 76345	2009-07-18 23:06:53 +00:00
Eli Friedman	debc43cb11	Back out 76300; apparently the preference is to canonicalize the other way (bitcast -> insert/extractelement). llvm-svn: 76325	2009-07-18 19:04:16 +00:00
Eli Friedman	65a5fe312a	Add combine: X sdiv (1 << Y) -> X udiv (1 << Y) when X doesn't have the sign bit set. llvm-svn: 76304	2009-07-18 09:53:21 +00:00
Eli Friedman	f1878fcda1	Canonicalize insert/extractelement from single-element vectors into bitcasts. It would also be possible to canonicalize the other way; does anyone have a preference? llvm-svn: 76300	2009-07-18 09:07:47 +00:00
Eli Friedman	7b1597133d	Fix simplifylibcalls memset recognition to work on 64-bit platforms where int is 32 bits. llvm-svn: 76293	2009-07-18 08:34:51 +00:00
Dan Gohman	50e65d8c93	Fill in some holes in ScalarEvolution's loop iteration condition analysis. This allows indvars to emit a simpler loop trip count expression. llvm-svn: 76085	2009-07-16 17:34:36 +00:00
Eli Friedman	6aa39dcd93	Switch invars away from using isTrapping when it really shouldn't be using it. llvm-svn: 75852	2009-07-15 22:48:29 +00:00
Eli Friedman	048d13f9bb	Don't restrict the set of instructions where we try to constant-fold the operands; it's possible to end up with a constant-foldable operand to most instructions, even those which can't trap. llvm-svn: 75845	2009-07-15 22:13:34 +00:00
Dan Gohman	5329511fae	Fix the expansion of umax and smax in the case where one or more of the operands have pointer type, so that the resulting type matches the original SCEV type, and so that unnecessary ptrtoints are avoided in common cases. llvm-svn: 75680	2009-07-14 20:57:04 +00:00
Dan Gohman	9525cf679f	Add a testcase for a bug fixed by r75634. llvm-svn: 75644	2009-07-14 18:15:00 +00:00
Dale Johannesen	35fc3243a8	Revert 75571; I'm convinced this isn't the right thing to do. llvm-svn: 75642	2009-07-14 17:48:25 +00:00
Eli Friedman	63028801b8	Fix trivial todo in instcombine. llvm-svn: 75586	2009-07-14 02:01:53 +00:00
Dan Gohman	493855541b	Update LoopSimplify and LoopUnswitch to use the new makeLoopInvariant function. llvm-svn: 75584	2009-07-14 01:37:59 +00:00
Dan Gohman	b9f3a3c96b	Fix indvars to not assume that a loop with a single unique exit block has a single unique exiting block. llvm-svn: 75579	2009-07-14 01:09:02 +00:00
Dale Johannesen	de1ed58935	Don't delete asm's just because their inputs are undefined; xor R, R is a common and valid idiom for zeroing a register, for example. llvm-svn: 75571	2009-07-14 00:45:38 +00:00
Eli Friedman	a6c7a3d44e	PR4548: optimize zext+udiv+trunc to udiv. llvm-svn: 75539	2009-07-13 22:46:01 +00:00
Eli Friedman	47839d3dec	Fix bug in run-line. llvm-svn: 75534	2009-07-13 22:31:30 +00:00
Eli Friedman	6b51ac6728	Canonicalize boolean +/- a constant to a select. (I think it's reasonably clear that we want to have a canonical form for constructs like this; if anyone thinks that a select is not the best canonical form, please tell me.) llvm-svn: 75531	2009-07-13 22:27:52 +00:00
Dan Gohman	a9953c0a28	Reapply 75252, with a fix to avoid the infinite recursion case. The check for avoiding re-analyzing a widening cast needed to happen earlier, as getSCEV itself may result in a isLoopGuardedByCond query. llvm-svn: 75511	2009-07-13 21:35:55 +00:00
Chris Lattner	b4bd955891	Move the re-sort of invalidated NonLocalPointerDeps cache earlier so that all code paths get it. PR4256 was about a case where the phi translation loop would find all preds in the Visited cache, so it could get by without re-sorting the NonLocalPointerDeps cache. Fix this by resorting it earlier, there is no reason not to do this. This patch inspired by Jakub Staszak's patch. llvm-svn: 75476	2009-07-13 17:14:23 +00:00
Nick Lewycky	d3d5cfa475	Revert r75252 which was causing some crashes at compile time. llvm-svn: 75384	2009-07-11 20:38:25 +00:00
Dan Gohman	404a92e330	Generalize ScalarEvolution's cast-folding code to support more kinds of loops. Add several new functions to for working with ScalarEvolution's add-hoc value-range analysis functionality. llvm-svn: 75252	2009-07-10 16:42:52 +00:00
Nick Lewycky	c707d9c60d	There's no need to consider PHI nodes in the same block as the instruction we're inserting sigma/phi functions for. Patch by Andre Tavares. llvm-svn: 75138	2009-07-09 15:59:27 +00:00
Nick Lewycky	d46a7b2d22	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Chris Lattner	54c0359890	do not try to analyze bitcasts from i64 to <2 x i32> in ComputedMaskedBits. While we could do this, doing so requires adjusting the demanded mask and the code isn't doing that yet. This fixes PR4495 llvm-svn: 74699	2009-07-02 16:04:08 +00:00
Dan Gohman	e3b1f9e14b	Fix an instcombine abort on a scalar-to-vector bitcast. This fixes PR4487. llvm-svn: 74646	2009-07-01 21:38:46 +00:00
Dan Gohman	d496b35af0	Don't cache PHI exit values from exhaustive evaluations, because an individual exhaustive evaluation reflects only the exit value implied by an individual exit, which may differ from the actual exit value of the loop if there are other exits. This fixes PR4477. llvm-svn: 74447	2009-06-29 20:34:13 +00:00
Dan Gohman	28702fab4e	Don't try to split a loop when the controlling icmp instruction doesn't have an IV-based operand. This fixes PR4471. llvm-svn: 74399	2009-06-27 22:58:27 +00:00
Dan Gohman	8d2a45fadb	Teach LoopSimplify how to merge multiple loop exits into a single exit, when one of them can be converted to a trivial icmp and conditional branch. This addresses what is essentially a phase ordering problem. SimplifyCFG knows how to do this transformation, but it doesn't do so if the primary block has any instructions in it other than an icmp and a branch. In the given testcase, the block contains other instructions, however they are loop-invariant and can be hoisted. SimplifyCFG doesn't have LoopInfo though, so it can't hoist them. And, it's important that the blocks be merged before LoopRotation, as it doesn't support multiple-exit loops. llvm-svn: 74396	2009-06-27 21:30:38 +00:00
Dan Gohman	4acfd5098d	When a value is used multiple times within a single PHI, instructions inserted to replace that value must dominate all of of the basic blocks associated with the uses of the value in the PHI, not just one of them. llvm-svn: 74376	2009-06-27 05:16:57 +00:00
Dan Gohman	49b2ecafe7	Add some testcases for some of the recent ScalarEvolution bug fixes. llvm-svn: 74353	2009-06-26 22:54:11 +00:00
Dan Gohman	ba8760719f	Fix LCSSA to avoid emitting a PHI node for the unwind destination of an invoke instruction, since the value isn't really live across that edge. llvm-svn: 74242	2009-06-26 00:31:13 +00:00
Dan Gohman	b4e1f166e1	Simplify [su]max(MAX, n) to MAX. This comes up in loop tripcount computations in loops with multiple exits. Adjust the testcase for PR4436 so that the relevant portion isn't optimized away. llvm-svn: 74073	2009-06-24 14:46:22 +00:00
Dan Gohman	c2c1e1ff38	When inserting code into a loop preheader, insert it before the terminator, instead of after the last phi. This fixes a bug exposed by ScalarEvolution analyzing more kinds of loops. This fixes PR4436. llvm-svn: 74072	2009-06-24 14:31:06 +00:00
Dan Gohman	eb0a67278b	Fix ScalarEvolution's backedge-taken count computations to check for overflow when computing a integer division to round up. Thanks to Nick Lewycky for noticing this! llvm-svn: 73862	2009-06-21 23:46:38 +00:00
Nick Lewycky	4020821885	Expand this test to handle more cases (remainder and shifts) of zero. llvm-svn: 73839	2009-06-21 01:56:41 +00:00
Chris Lattner	affcc71da2	implement PR4424: 0/x is always 0 for integer division. llvm-svn: 73835	2009-06-21 01:15:55 +00:00
Dan Gohman	b60dedbf0a	Tweak this test to be a little less unusual. llvm-svn: 73808	2009-06-20 00:40:56 +00:00
Dan Gohman	29100270c0	Generalize isLoopGuardedByCond's checking to consider two SCEVUnknowns with identical Instructions to be equal. This allows it to analze cases such as the attached testcase, where the front-end has cloned the loop controlling expression. Along with r73805, this lets IndVarSimplify eliminate all the sign-extend casts in the loop in the attached testcase. llvm-svn: 73807	2009-06-20 00:35:32 +00:00
Dan Gohman	d920fdb643	Don't (unconditionally) use getSCEVAtScope to simplify the step expression in IVUsers, because in the case of a use of a non-linear addrec outside of a loop, this causes the addrec to be evaluated as a linear addrec. llvm-svn: 73774	2009-06-19 17:33:15 +00:00
Chris Lattner	8f6f044afd	make jump threading handle lexically identical compare instructions as if they were multiple uses of the same instruction. This interacts well with the existing loadpre that j-t does to open up many new jump threads earlier. llvm-svn: 73768	2009-06-19 16:27:56 +00:00
Nick Lewycky	a5f89b09c6	Teach jump threading to look at comparisons between phi nodes and non-constants. llvm-svn: 73755	2009-06-19 04:56:29 +00:00
Chris Lattner	8ddc06469c	Improve tail call elim to move loads above readonly calls when it allows forming a tail call. Patch by Frits van Bommel. This implements PR4323. llvm-svn: 73752	2009-06-19 04:22:16 +00:00
Chris Lattner	3a683c551f	part of PR4405: disable a contentious optimization for strcmp -> memcmp when the lengths of the strings are unknown. Patch by Nick Lewycky! llvm-svn: 73751	2009-06-19 04:17:36 +00:00
Dan Gohman	fd857b0406	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Dan Gohman	dc884a7830	Generalize the zext(trunc(t) & C) instcombine to work even with C is not a low-bits mask, and add a similar instcombine for zext((trunc(t) & C) ^ C). llvm-svn: 73705	2009-06-18 16:30:21 +00:00
Dan Gohman	1530824138	Instcombine zext(trunc(x) & mask) to x&mask, even if the trunc has multiple users. llvm-svn: 73656	2009-06-17 23:17:05 +00:00
Dan Gohman	50b7d0d843	Add -disable-output to a bunch of tests that don't care about the output. llvm-svn: 73633	2009-06-17 20:56:26 +00:00
Dale Johannesen	26f0dd9021	This fixes a bug introduced in 72661, which can move loads back past a check that the load address is valid, see new testcase. The test that went in with 72661 has exactly this case, except that the conditional it's moving past is checking something else; I've settled for changing that test to reference a global, not a pointer. It may be possible to scan all the tests you pass and make sure none of them are checking any component of the address, but it's not trivial and I'm not trying to do that here. llvm-svn: 73632	2009-06-17 20:48:23 +00:00
Eli Friedman	36d7ca738e	Correct an accidental duplication of the test (patch doesn't handle creating new files very well). llvm-svn: 73599	2009-06-17 03:05:00 +00:00
Eli Friedman	b3947071ff	PR3439: Correct a silly mistake in the SimplifyDemandedUseBits code for SRem. llvm-svn: 73598	2009-06-17 02:57:36 +00:00
Dan Gohman	54bbef1525	Generalize a few more instcombines to be vector/scalar-independent. llvm-svn: 73541	2009-06-16 19:55:29 +00:00
Dan Gohman	56b5a88785	Instcombine's ShrinkDemandedConstant may strip bits out of constants, obscuring what would otherwise be a low-bits mask. Use ComputeMaskedBits to compute what ShrinkDemandedConstant knew about to reconstruct a low-bits mask value. llvm-svn: 73540	2009-06-16 19:52:01 +00:00
Chris Lattner	f54c97c579	Testcase for r73506 llvm-svn: 73508	2009-06-16 17:23:25 +00:00
Dan Gohman	2e737ac21f	Support vector casts in more places, fixing a variety of assertion failures. To support this, add some utility functions to Type to help support vector/scalar-independent code. Change ConstantInt::get and ConstantFP::get to support vector types, and add an overload to ConstantInt::get that uses a static IntegerType type, for convenience. Introduce a new getConstant method for ScalarEvolution, to simplify common use cases. llvm-svn: 73431	2009-06-15 22:12:54 +00:00
Dale Johannesen	2d0be306fb	Fix the crash in this test. This is basically the same problem addressed in 31284, but the patch there only addressed the case where an invoke is the first thing in a block. llvm-svn: 73416	2009-06-15 20:59:27 +00:00
Chris Lattner	52510b0788	fix testcase to properly check for the patch in r73195. llvm-svn: 73380	2009-06-15 05:46:02 +00:00
Dan Gohman	d3a8d79c0d	Implement more aggressive folding of add operand lists when they contain multiplications of constants with add operations. This helps simplify several kinds of things; in particular it helps simplify expressions like ((-1 * (%a + %b)) + %a) to %b, as expressions like this often come up in loop trip count computations. llvm-svn: 73361	2009-06-14 22:58:51 +00:00
Dan Gohman	37fef35e88	Teach SCEVExpander's visitAddRecExpr to reuse an existing canonical induction variable when the addrec to be expanded does not require a wider type. This eliminates the need for IndVarSimplify to micro-manage SCEV expansions, because SCEVExpander now automatically expands them in the form that IndVarSimplify considers to be canonical. (LSR still micro-manages its SCEV expansions, because it's optimizing for the target, rather than for other optimizations.) Also, this uses the new getAnyExtendExpr, which has more clever expression simplification logic than the IndVarSimplify code it replaces, and this cleans up some ugly expansions in code such as the included masked-iv.ll testcase. llvm-svn: 73294	2009-06-13 16:25:49 +00:00
Dan Gohman	f9b0419cd8	Don't do (x - (y - z)) --> (x + (z - y)) on floating-point types, because it may round differently. This fixes PR4374. llvm-svn: 73243	2009-06-12 19:23:25 +00:00
Nick Lewycky	1e36649f95	Given two identical weak functions, produce one internal function and two weak thunks. llvm-svn: 73230	2009-06-12 15:56:56 +00:00
Nick Lewycky	cc239d7680	This test is wrong. If you have two weak functions F and G you can't make either one call the other since either one can be replaced at link time, and they need to be independent. llvm-svn: 73225	2009-06-12 13:24:41 +00:00
Chris Lattner	e0360f8ae8	Fix 4366: store to null in non-default addr space should not be turned into unreachable. llvm-svn: 73195	2009-06-11 17:54:56 +00:00
Eli Friedman	770f633389	PR4340: Run SimplifyDemandedVectorElts on insertelement instructions; sometimes it can find simplifications that won't be found otherwise. llvm-svn: 73006	2009-06-06 20:08:03 +00:00
Dan Gohman	5f6f8101d5	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Dan Gohman	05fe1217c7	Check in test changes that I accidentally left out of r72872. llvm-svn: 72875	2009-06-04 18:22:31 +00:00
Dan Gohman	6e9ad19ef7	Don't attempt to simplify an non-affine IV expression if it can't be simplified to a loop-invariant value. This fixes PR4315. llvm-svn: 72798	2009-06-03 19:11:31 +00:00
Evan Cheng	7875093e82	Avoid infinite looping in AllGlobalLoadUsesSimpleEnoughForHeapSRA(). This can happen when PHI uses are recursively dependent on each other. llvm-svn: 72710	2009-06-02 00:56:07 +00:00
Eli Friedman	2b0edc3327	PR4286: Make RewriteLoadUserOfWholeAlloca and RewriteStoreUserOfWholeAlloca deal with tail padding because isSafeUseOfBitCastedAllocation expects them to. Otherwise, we crash trying to erase the bitcast. llvm-svn: 72688	2009-06-01 09:14:32 +00:00
Owen Anderson	928040c625	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Chris Lattner	8ac63163fe	fix PR4284, a bug in simplifylibcalls handling memcmp. Patch by Benjamin Kramer! llvm-svn: 72625	2009-05-30 18:43:04 +00:00
Nick Lewycky	3dd0d690f3	Use Operands.data() instead of &Operands[0] where Operands is a potentially empty SmallVector. llvm-svn: 72512	2009-05-28 04:08:10 +00:00
Dan Gohman	2884c5153c	Revert 72493 and replace it with a more conservative fix, for now: don't rewrite the comparison if there is any implicit extension or truncation on the induction variable. I'm planning for IVUsers to eventually take over some of the work of this code, and for it to be generalized. llvm-svn: 72496	2009-05-27 21:10:47 +00:00
Dan Gohman	994001e5ef	In ChangeCompareStride, when the stride to be reused is truncated to a smaller type, promoted its offset back up to the type of the new comparison. This fixes PR4222. llvm-svn: 72493	2009-05-27 20:00:18 +00:00
Dan Gohman	0124c21ba0	Teach SCEVExpander to avoid creating over-indexed GEP indices when possible. For example, it now emits %p.2.ip.1 = getelementptr [3 x [3 x double]]* %p, i64 2, i64 %tmp, i64 1 instead of the equivalent but less obvious %p.2.ip.1 = getelementptr [3 x [3 x double]]* %p, i64 0, i64 %tmp, i64 19 llvm-svn: 72452	2009-05-27 02:00:53 +00:00
Dan Gohman	fb34a67498	In cases where a pointer value is an operand of a multiplication or division operation, don't attempt to use the operation's value as the base of a getelementptr. This fixes PR4271. llvm-svn: 72422	2009-05-26 17:41:16 +00:00
Chris Lattner	8f4210d099	make memdep use the getModRefInfo method for stores instead of the low-level alias() method, allowing it to reason more aggressively about pointers into constant memory. PR4189 llvm-svn: 72403	2009-05-25 21:28:56 +00:00
Dan Gohman	eb3ddbb1ac	When rewriting the loop exit test with the canonical induction variable, leave the original comparison in place if it has other uses, since the other uses won't be dominated by the new comparison instruction. llvm-svn: 72369	2009-05-24 19:11:38 +00:00
Dan Gohman	fdba9c8fce	Generalize SCEVExpander::visitAddRecExpr's GEP persuit, and avoid sending SCEVUnknowns to expandAddToGEP. This avoids the need for expandAddToGEP to bend the rules and peek into SCEVUnknown expressions. Factor out the code for testing whether a SCEV can be factored by a constant for use in a GEP index. This allows it to handle SCEVAddRecExprs, by recursing. As a result, SCEVExpander can now put more things in GEP indices, so it emits fewer explicit mul instructions. llvm-svn: 72366	2009-05-24 18:06:31 +00:00
Torok Edwin	8936fc2e28	The rewriter may hold references to instructions that are deleted because they are trivially dead. Fix by clearing the rewriter cache before deleting the trivially dead instructions. Also make InsertedExpressions use an AssertingVH to catch these bugs easier. llvm-svn: 72364	2009-05-24 14:23:16 +00:00
Evan Cheng	77529302a6	Fix bug in FoldFCmp_IntToFP_Cst. If inttofp is a uintofp, use unsigned instead of signed integer constant. llvm-svn: 72300	2009-05-22 23:10:53 +00:00
Dan Gohman	d5fc3518d5	Teach IndVarSimplify's FixUsesBeforeDefs to handle InvokeInsts by assuming that the use of the value is in a block dominated by the "normal" destination. LangRef.html and other documentation sources don't explicitly guarantee this, but it seems to be assumed in other places in LLVM at least. This fixes an assertion failure on the included testcase, which is derived from the Ada testsuite. FixUsesBeforeDefs is a temporary measure which I'm looking to replace with a more capable solution. llvm-svn: 72266	2009-05-22 16:47:11 +00:00
Dan Gohman	82df35a657	Fix a thinko in the code that adapted SCEVMulExpr operands for use in expanding SCEVAddExprs with GEPs. The operands of a SCEVMulExpr need to be multiplied together, not added. llvm-svn: 72250	2009-05-22 07:14:20 +00:00
Eli Friedman	b32b64b5b4	Fix broken logic in DominatorTreeBase::Split. Part of PR4238. llvm-svn: 72231	2009-05-21 21:47:54 +00:00
Eli Friedman	d4f9668eb7	Fix some incorrect logic in DominanceFrontier::splitBlock. Part of PR4238. llvm-svn: 72223	2009-05-21 20:40:30 +00:00
Dan Gohman	fc28858d91	Teach ValueTracking a new way to analyze PHI nodes, and and teach Instcombine to be more aggressive about using SimplifyDemandedBits on shift nodes. This allows a shift to be simplified to zero in the included test case. llvm-svn: 72204	2009-05-21 02:28:33 +00:00
Dan Gohman	9e0f5a28dc	Suppress the IV reversal transformation in the case that the RHS of the comparison is defined inside the loop. This fixes a use-before-def problem, because the transformation puts a use of the RHS outside the loop. llvm-svn: 72149	2009-05-20 00:34:08 +00:00
Dan Gohman	922033d119	Teach SCEVExpander to expand arithmetic involving pointers into GEP instructions. It attempts to create high-level multi-operand GEPs, though in cases where this isn't possible it falls back to casting the pointer to i8* and emitting a GEP with that. Using GEP instructions instead of ptrtoint+arithmetic+inttoptr helps pointer analyses that don't use ScalarEvolution, such as BasicAliasAnalysis. Also, make the AddrModeMatcher more aggressive in handling GEPs. Previously it assumed that operand 0 of a GEP would require a register in almost all cases. It now does extra checking and can do more matching if operand 0 of the GEP is foldable. This fixes a problem that was exposed by SCEVExpander using GEPs. llvm-svn: 72093	2009-05-19 02:15:55 +00:00
Dan Gohman	904f081ce7	Add nounwind to a few tests. llvm-svn: 72002	2009-05-18 15:16:49 +00:00
Dale Johannesen	da2e1e314b	Testcase for 71688. llvm-svn: 71691	2009-05-13 18:33:24 +00:00
Chris Lattner	eb2f327449	calls in nothrow functions can be marked nothrow even if the callee is not known to be nothrow. This allows readnone/readonly functions to be deleted even if we don't know whether the callee can throw. llvm-svn: 71676	2009-05-13 17:39:14 +00:00
Chris Lattner	927ebd34e2	Fix PR4206 - crash in simplify lib calls llvm-svn: 71644	2009-05-13 06:26:11 +00:00
Dan Gohman	d13f674130	Factor the code for collecting IV users out of LSR into an IVUsers class, and generalize it so that it can be used by IndVarSimplify. Implement the base IndVarSimplify transformation code using IVUsers. This removes TestOrigIVForWrap and associated code, as ScalarEvolution now has enough builtin overflow detection and folding logic to handle all the same cases, and more. Run "opt -iv-users -analyze -disable-output" on your favorite loop for an example of what IVUsers does. This lets IndVarSimplify eliminate IV casts and compute trip counts in more cases. Also, this happens to finally fix the remaining testcases in PR1301. Now that IndVarSimplify is being more aggressive, it occasionally runs into the problem where ScalarEvolutionExpander's code for avoiding duplicate expansions makes it difficult to ensure that all expanded instructions dominate all the instructions that will use them. As a temporary measure, IndVarSimplify now uses a FixUsesBeforeDefs function to fix up instructions inserted by SCEVExpander. Fortunately, this code is contained, and can be easily removed once a more comprehensive solution is available. llvm-svn: 71535	2009-05-12 02:17:14 +00:00
Dan Gohman	cac9b5c5be	When forgetting SCEVs for loop PHIs, don't forget SCEVUnknown values. These values aren't analyzable, so they don't care if more information about the loop trip count can be had. Also, SCEVUnknown is used for a PHI while the PHI itself is being analyzed, so it needs to be left in the Scalars map. This fixes a variety of subtle issues. llvm-svn: 71533	2009-05-12 01:27:58 +00:00
Chris Lattner	0fd5aea274	fix RewriteStoreUserOfWholeAlloca to use the correct type size method, fixing a crash on PR4146. While the store will ultimately overwrite the "padded size" number of bits in memory, the stored value may be a subset of this size. This function only wants to handle the case where all bits are stored. llvm-svn: 71224	2009-05-08 15:54:41 +00:00
Eli Friedman	a280375b23	PR4123: don't crash when inlining a call which uses its own result. llvm-svn: 71199	2009-05-08 00:22:04 +00:00
Dan Gohman	ebacd61d7d	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Duncan Sands	e90202e388	Revert r70876 and add a testcase (@c7) showing the problem: bits captured, but the pointer marked nocapture. In fact I now recall that this problem is why only readnone functions returning void were considered before! However keep a small fix that was also in r70876: a readnone function returning void can result in bits being captured if it unwinds, so test for this. llvm-svn: 71168	2009-05-07 18:08:34 +00:00
Bill Wendling	9f97e4a3dc	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decUtility.o] Error 1 make[4]: * Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decNumber.o] Error 1 make[3]: * [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Dan Gohman	9a6a882979	Constant-fold ptrtoint+add+inttoptr to gep when the pointer is an array and the add is within range. This helps simplify expressions expanded by ScalarEvolutionExpander. llvm-svn: 71158	2009-05-07 14:24:56 +00:00
Duncan Sands	b71ad70b4e	Fix PR3754: don't mark functions that wrap MallocInst with the readnone. Since MallocInst is scheduled for deletion it doesn't seem worth doing anything more subtle, such as having mayWriteToMemory return true for MallocInst. llvm-svn: 71077	2009-05-06 08:42:00 +00:00
Duncan Sands	880eaf5278	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Bill Wendling	5f4fcbeb10	Temporarily reverting r71008. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/change-compare-stride-1.ll Failed with exit(1) at line 2 while running: grep {cmpq $-478,} change-compare-stride-1.ll.tmp child process exited abnormally llvm-svn: 71013	2009-05-05 20:49:46 +00:00
David Greene	2bb2b3840e	Handle overflow of 64-bit loop conditions. llvm-svn: 71008	2009-05-05 20:22:36 +00:00
Duncan Sands	4c7021febf	Teach capture tracking that readonly functions can only capture their arguments by returning them or throwing an exception or not based on the argument value. Patch essentially by Frits van Bommel. llvm-svn: 70876	2009-05-04 16:50:29 +00:00
Chris Lattner	6807ddd3d9	* Sink 4 duplicates of edge threading validity checks and DOUT prints into ThreadEdge directly. This shares the code, but is just a refactoring. * Make JumpThreading compute the set of loop headers and avoid threading across them. This prevents jump threading from forming irreducible loops (goodness) but also prevents it from threading in other cases that are beneficial (see the comment above FindFunctionBackedges). llvm-svn: 70820	2009-05-04 02:28:08 +00:00
Dan Gohman	a79cce4aef	Previously, RecursivelyDeleteDeadInstructions provided an option of returning a list of pointers to Values that are deleted. This was unsafe, because the pointers in the list are, by nature of what RecursivelyDeleteDeadInstructions does, always dangling. Replace this with a simple callback mechanism. This may eventually be removed if all clients can reasonably be expected to use CallbackVH. Use this to factor out the dead-phi-cycle-elimination code from LSR utility function, and generalize it to use the RecursivelyDeleteTriviallyDeadInstructions utility function. This makes LSR more aggressive about eliminating dead PHI cycles; adjust tests to either be less trivial or to simply expect fewer instructions. llvm-svn: 70636	2009-05-02 18:29:22 +00:00
Dan Gohman	25d21786d3	Don't try to mix integers and pointers in an icmp instruction in getSCEVAtScope. llvm-svn: 70495	2009-04-30 16:40:30 +00:00
Dale Johannesen	15486ddd95	Fix recent regression in gcc.dg/pr26719.c (6835035). llvm-svn: 70386	2009-04-29 16:38:47 +00:00
Dan Gohman	346c77f79d	As with r70333, give the primary induction variable a use so that it can't be trivially eliminated. llvm-svn: 70334	2009-04-28 22:05:13 +00:00
Dan Gohman	5bb06cda1e	Make this testcase slightly less trivial, so that it doesn't fail if indvars happens to optimize away the unused primary induction variable. llvm-svn: 70333	2009-04-28 22:03:26 +00:00
Dale Johannesen	626b0a32f7	Fix PR 4086, a bug in FP IV elimination. llvm-svn: 70247	2009-04-27 21:03:15 +00:00
Dan Gohman	ff30ebd710	Teach getZeroExtendExpr and getSignExtendExpr to use trip-count information to simplify [sz]ext({a,+,b}) to {zext(a),+,[zs]ext(b)}, as appropriate. These functions and the trip count code each call into the other, so this requires careful handling to avoid infinite recursion. During the initial trip count computation, conservative SCEVs are used, which are subsequently discarded once the trip count is actually known. Among other benefits, this change lets LSR automatically eliminate some unnecessary zext-inreg and sext-inreg operation where the operand is an induction variable. llvm-svn: 70241	2009-04-27 20:16:15 +00:00
Dan Gohman	820b45049b	Handle ands with ~0 correctly too. This fixes PR4052. llvm-svn: 70176	2009-04-27 01:41:10 +00:00
Dan Gohman	a7fae1f865	Add several more icmp simplifications. Transform signed comparisons into unsigned ones when the operands are known to have the same sign bit value. llvm-svn: 70053	2009-04-25 17:12:48 +00:00
Dan Gohman	9eb5ba6eb7	Handle ands with 0 and shifts by 0 correctly. These aren't common, but indvars shouldn't crash on them. This fixes PR4054. llvm-svn: 70051	2009-04-25 17:05:40 +00:00
Dan Gohman	ea9a6d22d3	Fix an error in this test. llvm-svn: 69893	2009-04-23 15:22:28 +00:00
Dan Gohman	c0f47d6ec1	Change SCEVExpander's expandCodeFor to provide more flexibility with the persistent insertion point, and change IndVars to make use of it. This fixes a bug where IndVars was holding on to a stale insertion point and forcing the SCEVExpander to continue to use it. This fixes PR4038. llvm-svn: 69892	2009-04-23 15:16:49 +00:00
Owen Anderson	caa90b2561	Testcase for PR2639. llvm-svn: 69867	2009-04-23 04:30:52 +00:00
Owen Anderson	bf7354995a	Testcase for PR2537. llvm-svn: 69866	2009-04-23 04:26:42 +00:00
Owen Anderson	f04f0e15c7	Fix typo. llvm-svn: 69865	2009-04-23 04:24:19 +00:00
Owen Anderson	a1a09bc01f	Testcase for PR3085. llvm-svn: 69863	2009-04-23 04:21:14 +00:00
Owen Anderson	d4b3279a3f	Add testcase from PR3086. llvm-svn: 69862	2009-04-23 04:14:03 +00:00
Evan Cheng	bdfff0ba69	Make sure both operands have binary instructions have the same type. llvm-svn: 69844	2009-04-22 23:39:28 +00:00
Evan Cheng	2af546d5fa	Avoid deferencing use_begin() if value does not have a use. llvm-svn: 69836	2009-04-22 22:45:37 +00:00
Dan Gohman	0ab6ecf6a1	SCEVExpander's InsertCastOfTo knows how to move existing cast instructions in order to avoid inserting new ones. However, if the cast instruction is the SCEVExpander's InsertPt, this causes subsequently emitted instructions to be inserted near the cast, and not at the location of the original insert point. Fix this by adjusting the insert point in such cases. This fixes PR4009. llvm-svn: 69808	2009-04-22 16:11:16 +00:00
Chris Lattner	95aad4d625	fix a crash on a pointless but valid zero-length memset, rdar://6808691 llvm-svn: 69680	2009-04-21 16:52:12 +00:00
Dale Johannesen	040d118b17	Another testcase for IV shortening. llvm-svn: 69247	2009-04-16 00:45:21 +00:00
Dale Johannesen	427e9aade9	Enhance induction variable code to remove the sext around sext(shorter IV + constant), using a longer IV instead, when it can figure out the add can't overflow. This comes up a lot in subscripting; mainly affects 64 bit. llvm-svn: 69123	2009-04-15 01:10:12 +00:00
Devang Patel	7323064183	While inlining, clone llvm.dbg.func.start intrinsic and adjust llvm.dbg.region.end instrinsic. This nested llvm.dbg.func.start/llvm.dbg.region.end pair now enables DW_TAG_inlined_subroutine support in code generator. llvm-svn: 69118	2009-04-15 00:17:06 +00:00
Evan Cheng	dba98a0669	Optimize conditional branch on i1 phis with non-constant inputs. This turns: eq: %3 = icmp eq i32 %1, %2 br label %join ne: %4 = icmp ne i32 %1, %2 br label %join join: %5 = phi i1 [%3, %eq], [%4, %ne] br i1 %5, label %yes, label %no => eq: %3 = icmp eq i32 %1, %2 br i1 %3, label %yes, label %no ne: %4 = icmp ne i32 %1, %2 br i1 %4, label %yes, label %no llvm-svn: 69102	2009-04-14 23:40:03 +00:00
Chris Lattner	c1bfdc9bb2	Add a new "available_externally" linkage type. This is intended to support C99 inline, GNU extern inline, etc. Related bugzilla's include PR3517, PR3100, & PR2933. Nothing uses this yet, but it appears to work. llvm-svn: 68940	2009-04-13 05:44:34 +00:00
Chris Lattner	f03202e76d	add some optimizations for strncpy/strncat and factor some code. Patch by Benjamin Kramer! llvm-svn: 68885	2009-04-12 05:06:39 +00:00
Chris Lattner	7d75f78b92	Instcombine should not promote whole computation trees to "strange" integer types, unless they are already strange. This prevents it from turning the code produced by SROA into crazy libcalls and stuff that the code generator can't handle. In the attached example, the result was an i96 multiply that caused the x86 backend to assert. Note that if TargetData had an idea of what the legal types are for a target that this could be used to stop instcombine from introducing i64 muls, as Scott wanted. llvm-svn: 68598	2009-04-08 05:41:03 +00:00
Chris Lattner	2f520929d4	fix rdar://6762290, a crash compiling cxx filt with clang. llvm-svn: 68500	2009-04-07 05:03:34 +00:00
Ed Schouten	ff25f858fd	Let the strcat optimizer return the pointer to the start of the buffer, instead of the place where it started to perform the string copy. - PR3661 - Patch by Benjamin Kramer! llvm-svn: 68443	2009-04-06 13:06:48 +00:00
Owen Anderson	851ce6d1d5	Reapply r68211, with the miscompilations it caused fixed. llvm-svn: 68262	2009-04-01 23:53:49 +00:00
Dan Gohman	a134448980	Revert r68172. It caused regressions in Applications/Burg/burg Applications/ClamAV/clamscan and many other tests. llvm-svn: 68211	2009-04-01 16:37:47 +00:00
Owen Anderson	d7c837bb4b	Enhance GVN to propagate simple conditionals. This fixes PR3921. llvm-svn: 68172	2009-04-01 01:20:45 +00:00
Evan Cheng	c419350132	Throttle back "fold select into operand" transformation. InstCombine should not generate selects of two constants unless they are selects of 0 and 1. e.g. define i32 @t1(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = lshr i32 %x, 18 %t3 = select i1 %t1, i32 %t2, i32 %x ret i32 %t3 } was turned into define i32 @t2(i32 %c, i32 %x) nounwind { %t1 = icmp eq i32 %c, 0 %t2 = select i1 %t1, i32 18, i32 0 %t3 = lshr i32 %x, %t2 ret i32 %t3 } For most targets, that means materializing two constants and then a select. e.g. On x86-64 movl %esi, %eax shrl $18, %eax testl %edi, %edi cmovne %esi, %eax ret => xorl %eax, %eax testl %edi, %edi movl $18, %ecx cmovne %eax, %ecx movl %esi, %eax shrl %cl, %eax ret Also, the optimizer and codegen can reason about shl / and / add, etc. by a constant. This optimization will hinder optimizations using ComputeMaskedBits. llvm-svn: 68142	2009-03-31 20:42:45 +00:00
Devang Patel	ec65625744	Loop Index Split can eliminate a loop if it can determin if loop body is executed only once. There was a bug in determining IV based value of the iteration for which the loop body is executed. Fix it. llvm-svn: 68071	2009-03-30 22:24:10 +00:00
Devang Patel	8c31ea5290	Before deleting a basic block, give other loop passes a chance cleanup analysis values, related to the instructions in the basic block. llvm-svn: 67719	2009-03-25 23:57:48 +00:00
Chris Lattner	c055403764	Fix PR3874 by restoring a condition I removed, but making it more precise than it used to be. llvm-svn: 67662	2009-03-25 00:28:58 +00:00
Chris Lattner	aabd3eeeff	canonicalize inttoptr and ptrtoint instructions which cast pointers to/from integer types that are not intptr_t to convert to intptr_t then do an integer conversion to the dest type. This exposes the cast to the optimizer. llvm-svn: 67638	2009-03-24 18:35:40 +00:00
Chris Lattner	51a4134e1c	two changes: 1. Make instcombine always canonicalize trunc x to i1 into an icmp(x&1). This exposes the AND to other instcombine xforms and is more of what the code generator expects. 2. Rewrite the remaining trunc pattern match to use 'match', which simplifies it a lot. llvm-svn: 67635	2009-03-24 18:15:30 +00:00
Chris Lattner	623662e8e1	Fix instcombine to not introduce undefined shifts when merging two shifts together. This fixes PR3851. llvm-svn: 67411	2009-03-20 22:41:15 +00:00
Chris Lattner	6dce8d4135	aha, DAE does have to think about PHI nodes. Many thanks to "Dr Evil" (aka Duncan) for pointing this out :) llvm-svn: 67212	2009-03-18 16:48:45 +00:00
Chris Lattner	0542f9f1ba	Fix PR3826 - InstComb assert with vector shift, by not calling ComputeNumSignBits on a vector. llvm-svn: 67211	2009-03-18 16:32:19 +00:00
Zhou Sheng	90fc23d03d	Fix a bug. If I->use_empty(), this method should return false. llvm-svn: 67180	2009-03-18 07:56:13 +00:00
Chris Lattner	7bef74e92f	Fix PR3807 by inserting 'insertelement' instructions in the normal dest of an invoke instead of after the invoke (in its block), which is invalid. llvm-svn: 67139	2009-03-18 00:31:45 +00:00
Chris Lattner	120540fec6	remove a test that depends on -debug. llvm-svn: 66937	2009-03-13 20:31:48 +00:00

... 2 3 4 5 6 ...

1265 Commits