llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 22:42:46 +02:00

Author	SHA1	Message	Date
Dan Gohman	c97817aac3	Make DenseMap's insert return a pair, to more closely resemble std::map. llvm-svn: 53177	2008-07-07 17:46:23 +00:00
Nick Lewycky	94f9c5a42e	Fix missed optimization opportunity when analyzing cast of mul and select. llvm-svn: 53151	2008-07-05 21:19:34 +00:00
Evan Cheng	2005804de6	- Re-apply 52748 and friends with fix. GetConstantStringInfo() returns an empty string for ConstantAggregateZero case which surprises selectiondag. - Correctly handle memcpy from constant string which is zero-initialized. llvm-svn: 52891	2008-06-30 07:31:25 +00:00
Anton Korobeynikov	6f260767ec	Revert (52748 and friends): Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. This unbreaks llvm-gcc bootstrap. llvm-svn: 52884	2008-06-29 17:57:03 +00:00
Eric Christopher	4f05c48718	Move GetConstantStringInfo to lib/Analysis. Remove string output routine from Constant. Update all callers. Change debug intrinsic api slightly to accomodate move of routine, these now return values instead of strings. llvm-svn: 52748	2008-06-26 00:31:12 +00:00
Chris Lattner	73b52018e9	Fix PR2488, a case where we deleted stack restores too aggressively. llvm-svn: 52702	2008-06-25 05:59:28 +00:00
Eli Friedman	369401ef95	Fix for PR2479: correctly optimize expressions like (a > 13) & (a == 15). See also PR1800, which is about the signed case. llvm-svn: 52608	2008-06-21 23:36:13 +00:00
Chris Lattner	e588f546c5	Fix PR2471, which is a bug involving an invalid promotion from a conditional load. llvm-svn: 52525	2008-06-20 05:12:56 +00:00
Bill Wendling	68bdf9f6d4	Remove dead code causing a warning. llvm-svn: 52502	2008-06-19 18:00:44 +00:00
Dan Gohman	a18aa3f3a2	Use Instruction::moveBefore instead of manipulating the instruction list directly. llvm-svn: 52498	2008-06-19 17:47:47 +00:00
Chris Lattner	f9d9f0ec4c	Fix the regressions on sext-misc.ll my patch yesterday caused. llvm-svn: 52466	2008-06-18 18:11:55 +00:00
Chris Lattner	93da79f7a1	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	7e403da191	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Duncan Sands	ec220a7c48	Fix typo that changed the logic to something wrong. Spotted by Nick Lewycky. llvm-svn: 52411	2008-06-17 15:55:30 +00:00
Matthijs Kooijman	238b1e8d69	Pass around Instruction* instead of Instruction& in FindInsertedValue and friends. llvm-svn: 52318	2008-06-16 13:13:08 +00:00
Matthijs Kooijman	dedcf00fcc	80 column fixes. llvm-svn: 52316	2008-06-16 12:57:37 +00:00
Matthijs Kooijman	1dd7d9cdc1	Move FindScalarValue from InstructionCombining.cpp to ValueTracking.cpp. While I'm at it, rename it to FindInsertedValue. The only functional change is that newly created instructions are no longer added to instcombine's worklist, but that is not really necessary anyway (and I'll commit some improvements next that will completely remove the need). llvm-svn: 52315	2008-06-16 12:48:21 +00:00
Eli Friedman	11d4c94933	Don't skip over instructions other than loads that might read memory when trying to sink stores. llvm-svn: 52259	2008-06-13 22:02:12 +00:00
Eli Friedman	d38a639deb	Make sure SimplifyStoreAtEndOfBlock doesn't mess with loops; the structure checks are incorrect if the blocks aren't distinct. Fixes PR2435. llvm-svn: 52257	2008-06-13 21:17:49 +00:00
Gabor Greif	10de8c6c59	fix a minor deviation from the original in my previous commit llvm-svn: 52247	2008-06-12 21:51:29 +00:00
Gabor Greif	509b3a75f4	op_iterator-ify some loops, low hanging fruit only, there is more llvm-svn: 52246	2008-06-12 21:37:33 +00:00
Matthijs Kooijman	0f9df32e12	Teach instruction combining about the extractvalue. It can succesfully fold useless insert-extract chains, similar to how it folds them for vectors. Add a testcase for this. llvm-svn: 52217	2008-06-11 14:05:05 +00:00
Matthijs Kooijman	511d6a5cd3	Clarify a comment. llvm-svn: 52212	2008-06-11 09:00:12 +00:00
Chris Lattner	4a896996cb	Limit the icmp+phi merging optimization to the cases where it is profitable: don't make i1 phis when it won't be possible to eliminate them. llvm-svn: 52097	2008-06-08 20:52:11 +00:00
Zhou Sheng	06fc769e52	As Chris suggested, handle the situation if ShAmt larger than BitWidth, otherwise, opt might crash. llvm-svn: 52041	2008-06-06 08:32:05 +00:00
Zhou Sheng	eaa93efd52	If BitWidth equals to ShtAmt, the RHSKnownZero[BitWidth-ShiftAmt-1] will crash the opt. Just fix this. Test case in llvm/test/Transforms/InstCombine/2008-06-05-ashr-crash.ll llvm-svn: 52003	2008-06-05 14:23:44 +00:00
Chris Lattner	ea60f0ccc3	move CannotBeNegativeZero to ValueTracking. Simplify some signbit comparisons. llvm-svn: 51864	2008-06-02 01:29:46 +00:00
Chris Lattner	4960857273	move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits out of instcombine into a new file in libanalysis. This also teaches ComputeNumSignBits about the number of sign bits in a constantint. llvm-svn: 51863	2008-06-02 01:18:21 +00:00
Duncan Sands	d14212a3e1	When simplifying a call to a bitcast function, tighten up the conditions for performing the transform when only the function declaration is available: no longer allow turning i32 into i64 for example. Only allow changing between pointer types, and between pointer types and integers of the same size. For return values ptr -> intptr was already allowed; I added ptr -> ptr and intptr -> ptr while there. As shown by a recent objc testcase, changing the way parameters/return values are passed can be fatal when calling code written in assembler that directly manipulates call arguments and return values unless the transform has no impact on the way they are passed at the codegen level. While it is possible to imagine an ABI that treats integers of pointer size differently to pointers, I don't think LLVM supports any so the transform should now be safe while still being useful. llvm-svn: 51834	2008-06-01 07:38:42 +00:00
Nick Lewycky	1bcd80adf7	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	b30afdb62b	Add more i1 optimizations. add, sub, mul, s/udiv on i1 are now simplified away. llvm-svn: 51817	2008-05-31 17:59:52 +00:00
Nick Lewycky	cdcdcddc85	Adding i1 is always Xor. llvm-svn: 51816	2008-05-31 17:10:28 +00:00
Dan Gohman	d8b84813d5	const-ify getOpcode. llvm-svn: 51698	2008-05-29 19:53:46 +00:00
Chris Lattner	7a7da4f9c3	Implement PR2370: memmove(x,x,size) -> noop. llvm-svn: 51636	2008-05-28 05:30:41 +00:00
Nick Lewycky	744dad8004	"ret (constexpr)" can't be folded into a Constant. Add a method to Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it to try to use targetdata to fold constant expressions on void instructions. Also extend the icmp(inttoptr, inttoptr) folding to handle the case where int size != ptr size. llvm-svn: 51559	2008-05-25 20:56:15 +00:00
Chris Lattner	3def8b4e53	Fix a serious brain-o. Obviously no-one reviewed my patch :( This fixes PR2359 llvm-svn: 51536	2008-05-24 04:06:28 +00:00
Dan Gohman	8b6f4366ae	Tidy up BasicBlock::getFirstNonPHI, and change a bunch of places to use it instead of duplicating its functionality. llvm-svn: 51499	2008-05-23 21:05:58 +00:00
Matthijs Kooijman	e9217fe486	Replace some weird usage of UserOp1 introduced in r49492 by a plain if. llvm-svn: 51482	2008-05-23 16:17:48 +00:00
Nick Lewycky	6a16ace643	Constant integer vectors may also be negated. llvm-svn: 51476	2008-05-23 04:54:45 +00:00
Nick Lewycky	16773d5239	Typo. llvm-svn: 51475	2008-05-23 04:39:38 +00:00
Nick Lewycky	bd2da8098d	Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. llvm-svn: 51474	2008-05-23 04:34:58 +00:00
Nick Lewycky	427209006f	Implement X + X for vectors. llvm-svn: 51472	2008-05-23 04:14:51 +00:00
Nick Lewycky	e62259c369	Fix a recently added optimization to not crash on vectors. llvm-svn: 51471	2008-05-23 03:26:47 +00:00
Dan Gohman	67e1a58e22	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	eafccb7d8f	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51467	2008-05-23 01:52:21 +00:00
Dan Gohman	b48d4a75f6	Port SelectionDAG's ComputeNumSignBits-using code to instcombine, now that instcombine also has ComputeNumSignBits. llvm-svn: 51350	2008-05-20 21:01:12 +00:00
Chris Lattner	b387fd90fc	Teach instcombine 4 new xforms: (add (sext x), cst) --> (sext (add x, cst')) (add (sext x), (sext y)) --> (sext (add int x, y)) (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This generally reduces conversions. For example MiBench/telecomm-gsm gets these simplifications: HACK2: %tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32 ; <i32> [#uses=1] %tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32 ; <i32> [#uses=1] %tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i ; <i32> [#uses=3] HACK2: %tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32 ; <i32> [#uses=1] %tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32 ; <i32> [#uses=1] %tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32 ; <i32> [#uses=1] %tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32 ; <i32> [#uses=1] %tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i ; <i32> [#uses=3] HACK2: %tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i ; <i32> [#uses=3] This also fixes a bug in ComputeNumSignBits handling select and makes it more aggressive with and/or. llvm-svn: 51302	2008-05-20 05:46:13 +00:00
Chris Lattner	323a985507	fix two issues Neil noticed, thanks! llvm-svn: 51296	2008-05-20 03:50:52 +00:00
Dan Gohman	3f6b53dba0	Make AssociativeOpt static. llvm-svn: 51290	2008-05-20 01:14:05 +00:00
Dan Gohman	2d2351f037	Add a ComputeNumSignBits function for use by instcombine, based on the code in SelectionDAG. llvm-svn: 51279	2008-05-19 22:14:15 +00:00
Chris Lattner	859485412b	switch to Type::getFPMantissaWidth instead of reinventing it. llvm-svn: 51275	2008-05-19 21:17:23 +00:00
Chris Lattner	92599bcc72	minor cleanups, teach instcombine that sitofp/uitofp cannot produce a negative zero. llvm-svn: 51272	2008-05-19 20:27:56 +00:00
Chris Lattner	63c384df1e	convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa to accurately represent the integer. This triggers 9 times in 471.omnetpp, though 8 of those seem to be inlined from the same place. llvm-svn: 51271	2008-05-19 20:25:04 +00:00
Chris Lattner	1435b94f62	Fold FP comparisons where one operand is converted from an integer type and the other operand is a constant into integer comparisons. This happens surprisingly frequently (e.g. 10 times in 471.omnetpp), which are things like this: %tmp8283 = sitofp i32 %tmp82 to double %tmp1013 = fcmp ult double %tmp8283, 0.0 Clearly comparing tmp82 against i32 0 is cheaper here. this also triggers 8 times in gobmk, including this one: %tmp375376 = sitofp i32 %tmp375 to double %tmp377 = fcmp ogt double %tmp375376, 8.150000e+01 which is comparing an integer against 81.5 :). llvm-svn: 51268	2008-05-19 20:18:56 +00:00
Chris Lattner	ad02ff166e	remove debug output llvm-svn: 51264	2008-05-19 20:03:53 +00:00
Chris Lattner	510a6b249c	be more aggressive about transforming add -> or when the operands have no intersecting bits. This triggers all over the place, for example in lencode, with adds of stuff like: %tmp580 = mul i32 %tmp579, 2 %tmp582 = and i32 %b8, 1 and %tmp28 = shl i32 %abs.i, 1 %sign.0 = select i1 %tmp23, i32 1, i32 0 and %tmp344 = shl i32 %tmp343, 2 %tmp346 = and i32 %tmp96, 3 etc. llvm-svn: 51263	2008-05-19 20:01:56 +00:00
Chris Lattner	8c0f0a0e6c	Fix PR2339 llvm-svn: 51226	2008-05-18 04:11:26 +00:00
Nick Lewycky	6f3744c685	Move isTrueWhenEqual to ICmpInst. llvm-svn: 51215	2008-05-17 07:33:39 +00:00
Gabor Greif	d61f20217a	API change for {BinaryOperator\|CmpInst\|CastInst}::create*() --> Create. Legacy interfaces will be in place for some time. (Merge from use-diet branch.) llvm-svn: 51200	2008-05-16 19:29:10 +00:00
Chris Lattner	00e8e1e258	implement PR2328. llvm-svn: 51176	2008-05-16 02:59:42 +00:00
Gabor Greif	48ffb6c7dc	Fix a bunch of 80col violations that arose from the Create API change. Tweak makefile targets to find these better. llvm-svn: 51143	2008-05-15 10:04:30 +00:00
Bill Wendling	c1d9f9604b	Situations can arise when you have a function called that returns a 'void', but is bitcast to return a floating point value. The result of the instruction may not be used by the program afterwards, and LLVM will happily remove all instructions except the call. But, on some platforms, if a value is returned as a floating point, it may need to be removed from the stack (like x87). Thus, we can't get rid of the bitcast even if there isn't a use of the value. llvm-svn: 51134	2008-05-14 22:45:20 +00:00
Dan Gohman	bab18cae46	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Chris Lattner	b6db834a47	don't sink invokes, even if they are readonly. This fixes a crash on kimwitu++. llvm-svn: 50901	2008-05-09 15:07:33 +00:00
Chris Lattner	02ca137915	Implement PR2298. This transforms: ~x < ~y --> y < x -x == -y --> x == y llvm-svn: 50882	2008-05-09 05:19:28 +00:00
Chris Lattner	4c1ef3628b	More than just loads can read from memory: readonly calls like strlen also need to be checked for memory modifying instructions before we can sink them. THis fixes the second half of PR2297. llvm-svn: 50860	2008-05-08 17:37:37 +00:00
Chris Lattner	cba8b4c7e8	Make instcombine's DSE respect loads as well as stores. It is not safe to delete the first store in: store x -> p load p store y -> p This is for PR2297. llvm-svn: 50859	2008-05-08 17:20:30 +00:00
Anton Korobeynikov	ddb93e7a02	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Dan Gohman	6ea87fa437	Fix a bug in the ComputeMaskedBits logic for multiply. llvm-svn: 50793	2008-05-07 00:35:55 +00:00
Anton Korobeynikov	90ee6d6616	Make StripPointerCast a common function (should we mak it method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Devang Patel	c5aee6c84d	Fix typo. llvm-svn: 50713	2008-05-06 05:40:11 +00:00
Dan Gohman	faf9df7227	Correct the value of LowBits in srem and urem handling in ComputeMaskedBits. llvm-svn: 50692	2008-05-06 00:51:48 +00:00
Devang Patel	b3112b4417	Do not sink getresult. llvm-svn: 50600	2008-05-03 00:36:30 +00:00
Dan Gohman	27156711ef	Fix a mistake in the computation of leading zeros for udiv. llvm-svn: 50591	2008-05-02 21:30:02 +00:00
Dan Gohman	793c9fed45	Fix an overaggressive SimplifyDemandedBits optimization on urem. This fixes the 254.gap regression on x86 and the 403.gcc regression on x86-64. llvm-svn: 50537	2008-05-01 19:13:24 +00:00
Chris Lattner	15195e00ee	move lowering of llvm.memset -> store from simplify libcalls to instcombine. llvm-svn: 50472	2008-04-30 06:39:11 +00:00
Chris Lattner	5bd55b0885	don't eliminate load from volatile value on paths where the load is dead. This fixes the second half of PR2262 llvm-svn: 50430	2008-04-29 17:28:22 +00:00
Chris Lattner	7099f3c400	fix a subtle volatile handling bug. llvm-svn: 50428	2008-04-29 17:13:43 +00:00
Chris Lattner	51fe8415da	don't delete the last store to an alloca if the store is volatile. llvm-svn: 50390	2008-04-29 04:58:38 +00:00
Dan Gohman	1b7238e6e4	Teach InstCombine's ComputeMaskedBits what SelectionDAG's ComputeMaskedBits knows about cttz, ctlz, and ctpop. Teach SelectionDAG's ComputeMaskedBits what InstCombine's knows about SRem. And teach them both some things about high bits in Mul, UDiv, URem, and Sub. This allows instcombine and dagcombine to eliminate sign-extension operations in several new cases. llvm-svn: 50358	2008-04-28 17:02:21 +00:00
Dale Johannesen	cfba8d51b8	change comments per review llvm-svn: 50300	2008-04-25 21:16:07 +00:00
Nick Lewycky	1f831c0f57	Remove 'unwinds to' support from mainline. This patch undoes r47802 r47989 r48047 r48084 r48085 r48086 r48088 r48096 r48099 r48109 and r48123. llvm-svn: 50265	2008-04-25 16:53:59 +00:00
Dale Johannesen	d70ea13581	Rewrite previous patch to suit Chris's preference. llvm-svn: 50174	2008-04-23 18:34:37 +00:00
Dale Johannesen	3007fc4e1b	Do not change the type of a ByVal argument to a type of a different size. llvm-svn: 50121	2008-04-23 01:03:05 +00:00
Evan Cheng	680839e258	Don't do: "(X & 4) >> 1 == 2 --> (X & 4) == 4" if there are more than one uses of the shift result. llvm-svn: 50118	2008-04-23 00:38:06 +00:00
Chris Lattner	8837037473	remove dead code. llvm-svn: 50080	2008-04-22 03:21:48 +00:00
Chris Lattner	14be19cf1e	optimize "p != gep p, ..." better. This allows us to compile getelementptr-seteq.ll into: define i1 @test(i64 %X, %S* %P) { %C = icmp eq i64 %X, -1 ; <i1> [#uses=1] ret i1 %C } instead of: define i1 @test(i64 %X, %S* %P) { %A.idx.mask = and i64 %X, 4611686018427387903 ; <i64> [#uses=1] %C = icmp eq i64 %A.idx.mask, 4611686018427387903 ; <i1> [#uses=1] ret i1 %C } And fixes the second half of PR2235. This speeds up the insertion sort case by 45%, from 1.12s to 0.77s. In practice, this will significantly speed up for loops structured like: for (double *P = Base + N; P != Base; --P) ... Which happens frequently for C++ iterators. llvm-svn: 50079	2008-04-22 02:53:33 +00:00
Torok Edwin	e038c595c1	g++-4.3 build-fix: CHAR_BIT requires <climits>. llvm-svn: 49989	2008-04-20 08:33:11 +00:00
Chris Lattner	f390d62b7f	Switch to using Simplified ConstantFP::get API. llvm-svn: 49977	2008-04-20 00:41:09 +00:00
Dan Gohman	318d9a6605	Teach InstCombine's ComputeMaskedBits to handle pointer expressions in addition to integer expressions. Rewrite GetOrEnforceKnownAlignment as a ComputeMaskedBits problem, moving all of its special alignment knowledge to ComputeMaskedBits as low-zero-bits knowledge. Also, teach ComputeMaskedBits a few basic things about Mul and PHI instructions. This improves ComputeMaskedBits-based simplifications in a few cases, but more noticeably it significantly improves instcombine's alignment detection for loads, stores, and memory intrinsics. llvm-svn: 49492	2008-04-10 18:43:06 +00:00
Gabor Greif	6c6b8a57f3	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Nate Begeman	610aa2511c	Don't eliminate bitcast instructions that change the type of a pointer llvm-svn: 48971	2008-03-31 00:22:16 +00:00
Evan Cheng	563b265f37	Handle a special case xor undef, undef -> 0. Technically this should be transformed to undef. But this is such a common idiom (misuse) we are going to handle it. llvm-svn: 48791	2008-03-25 20:07:13 +00:00
Evan Cheng	1d63708523	Transform (zext (or (icmp), (icmp))) to (or (zext (cimp), (zext icmp))) if at least one of the (zext icmp) can be transformed to eliminate an icmp. llvm-svn: 48715	2008-03-24 00:21:34 +00:00
Duncan Sands	530554ab0a	Fix the build for gcc-4.2. llvm-svn: 48639	2008-03-21 08:32:17 +00:00
Chris Lattner	96cdf21ed4	Teach masked value is zero about add and sub, and use MVIZ to simplify things like (X & 4) >> 1 == 2 --> (X & 4) == 4. since it is obvious that the shift doesn't remove any bits. llvm-svn: 48631	2008-03-21 05:19:58 +00:00
Bill Wendling	7d054f8b3f	The inst combining of inttoptr into GEP with one index was using the bit size of the type instead of the byte size. This was causing troublesome mis-compilations. True to form, this took 2 days to find and is a one-line fix. :-P llvm-svn: 48354	2008-03-14 05:12:19 +00:00
Chris Lattner	7925cc72c0	Reimplement the parameter attributes support, phase #1 . hilights: 1. There is now a "PAListPtr" class, which is a smart pointer around the underlying uniqued parameter attribute list object, and manages its refcount. It is now impossible to mess up the refcount. 2. PAListPtr is now the main interface to the underlying object, and the underlying object is now completely opaque. 3. Implementation details like SmallVector and FoldingSet are now no longer part of the interface. 4. You can create a PAListPtr with an arbitrary sequence of ParamAttrsWithIndex's, no need to make a SmallVector of a specific size (you can just use an array or scalar or vector if you wish). 5. All the client code that had to check for a null pointer before dereferencing the pointer is simplified to just access the PAListPtr directly. 6. The interfaces for adding attrs to a list and removing them is a bit simpler. Phase #2 will rename some stuff (e.g. PAListPtr) and do other less invasive changes. llvm-svn: 48289	2008-03-12 17:45:29 +00:00
Devang Patel	0c7fb89803	Skip functions that return multiple values. llvm-svn: 48233	2008-03-11 18:04:06 +00:00
Nick Lewycky	1d6b50743f	Don't eliminate blocks that are only reachable by unwind_to. llvm-svn: 48106	2008-03-09 08:50:23 +00:00
Nick Lewycky	f249c5d5ad	Don't try to simplify urem and srem using arithmetic rules that don't work under modulo (overflow). Fixes PR1933. llvm-svn: 47987	2008-03-06 06:48:30 +00:00
Chris Lattner	5aeccb7353	Folding or(fcmp,fcmp) only works if the operands of the fcmps are the same fp type. llvm-svn: 47750	2008-02-29 06:09:11 +00:00
Bill Wendling	bd1f1ae160	De-tabify. llvm-svn: 47599	2008-02-26 10:53:30 +00:00
Dale Johannesen	ae08bdb4cf	Split ParameterAttributes.h, putting the complicated stuff into ParamAttrsList.h. Per feedback from ParamAttrs changes. llvm-svn: 47504	2008-02-22 22:17:59 +00:00
Zhou Sheng	0742fbfedf	Fixed a typo. llvm-svn: 47478	2008-02-22 10:00:35 +00:00
Anton Korobeynikov	c41f5b6af4	Fix newly-introduced 4.3 warnings llvm-svn: 47375	2008-02-20 12:07:57 +00:00
Anton Korobeynikov	fd6b669c80	Make Transforms to be 4.3 warnings-clean llvm-svn: 47371	2008-02-20 11:26:25 +00:00
Dale Johannesen	ecb2b233b1	Expand ParameterAttributes to 32 bits (in preparation for adding alignment info, not there yet). Clean up interfaces to reference ParameterAttributes consistently. llvm-svn: 47342	2008-02-19 21:38:47 +00:00
Chris Lattner	6bb889cf84	fdiv/frem of undef can produce undef, because the undef operand can be a SNaN. We could be more aggressive and turn this into unreachable, but that is less nice, and not really worth it. llvm-svn: 47313	2008-02-19 06:12:18 +00:00
Nick Lewycky	1f3c58df08	Correctly fold divide-by-constant, even when faced with overflow. llvm-svn: 47287	2008-02-18 22:48:05 +00:00
Chris Lattner	a378b03483	Transforming -A + -B --> -(A + B) isn't safe for FP, thanks to Dale for noticing this! llvm-svn: 47276	2008-02-18 17:50:16 +00:00
Chris Lattner	9851db050b	optimize away stackrestore calls that have no intervening alloca or call. llvm-svn: 47258	2008-02-18 06:12:38 +00:00
Chris Lattner	2fa904b3af	Fold (-x + -y) -> -(x+y) which promotes better association, fixing the second half of PR2047 llvm-svn: 47244	2008-02-17 21:03:36 +00:00
Dan Gohman	588498082a	Rename APInt's isPositive to isNonNegative, to reflect what it actually does. llvm-svn: 47090	2008-02-13 22:09:18 +00:00
Chris Lattner	96deed5d4d	Fix a bug compiling PR1978 (perhaps not the only one though) which was incorrectly simplifying "x == (gep x, 1, i)" into false, even though i could be negative. As it turns out, all the code to handle this already existed, we just need to disable the incorrect optimization case and let the general case handle it. llvm-svn: 46739	2008-02-05 04:45:32 +00:00
Nick Lewycky	febd3642ce	There are some cases where icmp(add) can be folded into a new icmp. Handle them. llvm-svn: 46687	2008-02-03 16:33:09 +00:00
Nick Lewycky	29cd604126	Hack on vectors too. llvm-svn: 46684	2008-02-03 08:19:11 +00:00
Nick Lewycky	ce4c4698d7	Fold away one multiply in instcombine. This would normally be caught in reassociate anyways, but they could be generated during instcombine's run. llvm-svn: 46683	2008-02-03 07:42:09 +00:00
Chris Lattner	83f411c586	eliminate additions of 0.0 when they are obviously dead. This has to be careful to avoid turning -0.0 + 0.0 -> -0.0 which is incorrect. llvm-svn: 46499	2008-01-29 06:52:45 +00:00
Nick Lewycky	6b070b1b93	Handle some more combinations of extend and icmp. Fixes PR1940. llvm-svn: 46431	2008-01-28 03:48:02 +00:00
Chris Lattner	359756ea4b	Fix PR1932 by disabling an xform invalid for fdiv. llvm-svn: 46429	2008-01-28 00:58:18 +00:00
Chris Lattner	aa553aa0c1	Fold fptrunc(add (fpextend x), (fpextend y)) -> add(x,y), as GCC does. llvm-svn: 46406	2008-01-27 05:29:54 +00:00
Nick Lewycky	13b6bc91d6	Enable the fix I just checked in, silly me. llvm-svn: 46247	2008-01-22 05:42:02 +00:00
Nick Lewycky	78780f175b	Multiply can be evaluated in a different type, so long as the target type has a smaller bitwidth. llvm-svn: 46244	2008-01-22 05:08:48 +00:00
Duncan Sands	81e35b4d47	I noticed that the trampoline straightening transformation could drop attributes on varargs call arguments. Also, it could generate invalid IR if the transformed call already had the 'nest' attribute somewhere (this can never happen for code coming from llvm-gcc, but it's a theoretical possibility). Fix both problems. llvm-svn: 45973	2008-01-14 19:52:09 +00:00
Chris Lattner	d22a5f6314	Turn a memcpy from a double* into a load/store of double instead of a load/store of i64. The later prevents promotion/scalarrepl of the source and dest in many cases. This fixes the 300% performance regression of the byval stuff on stepanov_v1p2. llvm-svn: 45945	2008-01-14 00:28:35 +00:00
Chris Lattner	8560bb9d98	factor memcpy/memmove simplification out to its own SimplifyMemTransfer method, no functionality change. llvm-svn: 45944	2008-01-13 23:50:23 +00:00
Chris Lattner	5fbf76aaf4	simplify some code. If we can infer alignment for source and dest that are greater than memcpy alignment, and if we lower to load/store, use the best alignment info we have. llvm-svn: 45943	2008-01-13 22:30:28 +00:00
Chris Lattner	4f69f1a721	simplify some code by adding a InsertBitCastBefore method, make memmove->memcpy conversion a bit simpler. llvm-svn: 45942	2008-01-13 22:23:22 +00:00
Chris Lattner	32eae5daa5	Fix PR1907, a nasty miscompilation because instcombine didn't realize that ne & sgt was a signed comparison (it was only looking at whether the left compare was signed). llvm-svn: 45937	2008-01-13 20:59:02 +00:00
Duncan Sands	7414cc131b	When turning a call to a bitcast function into a direct call, if this becomes a varargs call then deal correctly with any parameter attributes on the newly vararg call arguments. llvm-svn: 45931	2008-01-13 08:02:44 +00:00
Chris Lattner	67f581b344	Implement PR1795, an instcombine hack for forming GEPs with integer pointer arithmetic. llvm-svn: 45745	2008-01-08 07:23:51 +00:00
Duncan Sands	7955cf0cd7	Small cleanup for handling of type/parameter attribute incompatibility. llvm-svn: 45704	2008-01-07 17:16:06 +00:00
Duncan Sands	fd975e4b3d	The transform that tries to turn calls to bitcast functions into direct calls bails out unless caller and callee have essentially equivalent parameter attributes. This is illogical - the callee's attributes should be of no relevance here. Rework the logic, which incidentally fixes a crash when removed arguments have attributes. llvm-svn: 45658	2008-01-06 18:27:01 +00:00
Duncan Sands	b8489f09a2	When transforming a call to a bitcast function into a direct call with cast parameters and cast return value (if any), instcombine was prepared to cast any non-void return value into any other, whether castable or not. Add a new predicate for testing whether casting is valid, and check it both for the return value and (as a cleanup) for the parameters. llvm-svn: 45657	2008-01-06 10:12:28 +00:00
Chris Lattner	7e1c3aa702	remove a couple more unsafe xforms in the face of overflow. llvm-svn: 45613	2008-01-05 01:22:42 +00:00
Chris Lattner	983697dfac	remove the (x-y) < 0 comparison xform, it miscompiles things that are not equality comparisons, for example: (2147479553+4096)-2147479553 < 0 != (2147479553+4096) < 2147479553 llvm-svn: 45612	2008-01-05 01:18:20 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Christopher Lamb	dfad5f19b4	Disable null pointer folding transforms for non-generic address spaces. This should probably be a target-specific predicate based on address space. That way for targets where this isn't applicable the predicate can be optimized away. llvm-svn: 45403	2007-12-29 07:56:53 +00:00
Owen Anderson	ebd3e9c500	Repair a transform that Chris noticed a bug in. Thanks to Nicholas for pointing out my stupid mistakes when writing this patch. :-) llvm-svn: 45384	2007-12-28 07:42:12 +00:00
Chris Lattner	2456399ce5	disable this instcombine xform, it miscompiles: define i32 @main() { entry: %z = alloca i32 ; <i32> [#uses=2] store i32 0, i32 %z %tmp = load i32* %z ; <i32> [#uses=1] %sub = sub i32 %tmp, 1 ; <i32> [#uses=1] %cmp = icmp ult i32 %sub, 0 ; <i1> [#uses=1] %retval = select i1 %cmp, i32 1, i32 0 ; <i32> [#uses=1] ret i32 %retval } into ret 1, instead of ret 0. Christopher, please investigate. llvm-svn: 45383	2007-12-28 06:24:31 +00:00
Chris Lattner	d64df490ca	implement InstCombine/shift-trunc-shift.ll. This allows us to compile: #include <math.h> int t1(double d) { return signbit(d); } into: _t1: movd %xmm0, %rax shrq $63, %rax ret instead of: _t1: movd %xmm0, %rax shrq $32, %rax shrl $31, %eax ret on x86-64. llvm-svn: 45311	2007-12-22 09:07:47 +00:00
Christopher Lamb	7ca648a7b1	Implement review feedback, including additional transforms (icmp slt (sub A B) 1) -> (icmp sle A B) icmp sgt (sub A B) -1) -> (icmp sge A B) and add testcase. llvm-svn: 45256	2007-12-20 07:21:11 +00:00
Chris Lattner	1a386cbdae	simplify this code with the new m_Zero() pattern. Make sure the select only has a single use, and generalize it to not require N to be a constant. llvm-svn: 45250	2007-12-20 01:56:58 +00:00
Duncan Sands	56f3add5b7	When inlining through an 'nounwind' call, mark inlined calls 'nounwind'. It is important for correct C++ exception handling that nounwind markings do not get lost, so this transformation is actually needed for correctness. llvm-svn: 45218	2007-12-19 21:13:37 +00:00
Christopher Lamb	be0cbc7e92	Fold subtracts into integer compares vs. zero. This improves generate code for this case on X86 from _foo: movl $99, %ecx movl 4(%esp), %eax subl %eax, %ecx xorl %edx, %edx testl %ecx, %ecx cmovs %edx, %eax ret to _foo: xorl %ecx, %ecx movl 4(%esp), %eax cmpl $99, %eax cmovg %ecx, %eax ret llvm-svn: 45173	2007-12-18 21:32:20 +00:00
Christopher Lamb	051a1320e8	Fix comments llvm-svn: 45170	2007-12-18 20:33:11 +00:00
Christopher Lamb	d56318b885	Remove an orthogonal transformation of the selection condition from my most recent submission. llvm-svn: 45169	2007-12-18 20:30:28 +00:00
Duncan Sands	242f80be86	Rename isNoReturn to doesNotReturn, and isNoUnwind to doesNotThrow. llvm-svn: 45160	2007-12-18 09:59:50 +00:00
Christopher Lamb	437b4d229e	Fix typos. llvm-svn: 45159	2007-12-18 09:45:40 +00:00
Christopher Lamb	aeb76743dc	Fold certain additions through selects (and their compares) so as to eliminate subtractions. This code is often produced by the SMAX expansion in SCEV. This implements test/Transforms/InstCombine/2007-12-18-AddSelCmpSub.ll llvm-svn: 45158	2007-12-18 09:34:41 +00:00
Christopher Lamb	a608afb52e	Change the PointerType api for creating pointer types. The old functionality of PointerType::get() has become PointerType::getUnqual(), which returns a pointer in the generic address space. The new prototype of PointerType::get() requires both a type and an address space. llvm-svn: 45082	2007-12-17 01:12:55 +00:00
Duncan Sands	bf62f62058	Make instcombine promote inline asm calls to 'nounwind' calls. Remove special casing of inline asm from the inliner. There is a potential problem: the verifier rejects invokes of inline asm (not sure why). If an asm call is not marked "nounwind" in some .ll, and instcombine is not run, but the inliner is run, then an illegal module will be created. This is bad but I'm not sure what the best approach is. I'm tempted to remove the check in the verifier... llvm-svn: 45073	2007-12-16 15:51:49 +00:00
Wojciech Matyjewicz	8bb1d9e67c	1. "Upgrage" comments. 2. Using zero-extended value of Scale and unsigned division is safe provided that Scale doesn't have the sign bit set. Previously these 2 instructions: %p = bitcast [100 x {i8,i8,i8}]* %x to i8* %q = getelementptr i8* %p, i32 -4 were combined into: %q = getelementptr [100 x { i8, i8, i8 }]* %x, i32 0, i32 1431655764, i32 0 what was incorrect. llvm-svn: 44936	2007-12-12 15:21:32 +00:00
Chris Lattner	861df2f4e9	simplify some code. llvm-svn: 44655	2007-12-06 06:25:04 +00:00
Chris Lattner	1f2a96f9c1	move some ashr-specific code out of commonShiftTransforms into visitAShr. llvm-svn: 44650	2007-12-06 01:59:46 +00:00
Duncan Sands	3602011bec	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	d1e03b5387	Implement PR1822 llvm-svn: 44318	2007-11-25 21:27:53 +00:00
Duncan Sands	114968a3e8	Fix PR1816. If a bitcast of a function only exists because of a trivial difference in function attributes, allow calls to it to be converted to direct calls. Based on a patch by Török Edwin. While there, move the various lists of mutually incompatible parameters etc out of the verifier and into ParameterAttributes.h. llvm-svn: 44315	2007-11-25 14:10:56 +00:00
Chris Lattner	642ae99085	add a comment. llvm-svn: 44293	2007-11-23 22:35:18 +00:00
Chris Lattner	bff48b8f0d	Fix PR1817. llvm-svn: 44284	2007-11-22 23:47:13 +00:00
Chris Lattner	5574ba5ce6	Fix PR1800 by correcting mistaken logic. llvm-svn: 44188	2007-11-16 06:04:17 +00:00
Andrew Lenharth	83adca5075	Better check llvm-svn: 43897	2007-11-08 18:45:15 +00:00
Andrew Lenharth	310a65171d	Fix PR1780 llvm-svn: 43893	2007-11-08 17:39:28 +00:00
Chris Lattner	907b9b92fe	Implement PR1777 by detecting dependent phis that all compute the same value. llvm-svn: 43777	2007-11-06 21:52:06 +00:00
Chris Lattner	e194944b29	wrap long lines llvm-svn: 43745	2007-11-06 01:15:27 +00:00
Dan Gohman	adc21ed938	Fix an abort in instcombine when folding creates a vector rem instruction. llvm-svn: 43743	2007-11-05 23:16:33 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Chris Lattner	d29624e11a	Fix InstCombine/2007-10-31-RangeCrash.ll llvm-svn: 43596	2007-11-01 02:18:41 +00:00
Chris Lattner	ae9cfd2fb0	simplify some code by using the new isNaN predicate llvm-svn: 43305	2007-10-24 18:54:45 +00:00
Chris Lattner	483c471daa	Implement a couple of foldings for ordered and unordered comparisons, implementing cases related to PR1738. llvm-svn: 43289	2007-10-24 05:38:08 +00:00
Devang Patel	eff4619cc8	Try again. Instead of loading small global string from memory, use integer constant. llvm-svn: 43148	2007-10-18 19:52:32 +00:00
Evan Cheng	1c34d807ce	Reverting r43070 for now. It's causing llc test failures. llvm-svn: 43103	2007-10-17 23:51:13 +00:00
Devang Patel	cf2f9d6daa	Apply "Instead of loading small c string constant, use integer constant directly" transformation while processing load instruction. llvm-svn: 43070	2007-10-17 07:24:40 +00:00
Devang Patel	c3d0477a0e	Use immediate stores. llvm-svn: 43055	2007-10-16 23:44:18 +00:00
Devang Patel	7d1d5d6bf6	Achieve same result but use fewer lines of code. llvm-svn: 42985	2007-10-15 15:31:35 +00:00
Devang Patel	f65c028dad	Dest type is always i8 *. This allows some simplification. Do not filter memmove. llvm-svn: 42930	2007-10-12 20:10:21 +00:00
Chris Lattner	3af877f26a	Fix a bug in my patch last night that broke InstCombine/2007-10-12-Crash.ll llvm-svn: 42920	2007-10-12 18:05:47 +00:00
Gabor Greif	cbfb655705	eliminate warning llvm-svn: 42892	2007-10-12 07:44:54 +00:00
Chris Lattner	3c23c37233	Fix some 80 column violations. Fix DecomposeSimpleLinearExpr to handle simple constants better. Don't nuke gep(bitcast(allocation)) if the bitcast(allocation) will fold the allocation. This fixes PR1728 and Instcombine/malloc3.ll llvm-svn: 42891	2007-10-12 05:30:59 +00:00
Devang Patel	15d6257fa8	Lower memcpy if it makes sense. llvm-svn: 42864	2007-10-11 17:21:57 +00:00
Dale Johannesen	529cc16893	Tone down an overzealous optimization. llvm-svn: 42582	2007-10-03 17:45:27 +00:00
Duncan Sands	f7abe75944	Improve comment. llvm-svn: 42132	2007-09-19 10:25:38 +00:00
Duncan Sands	d88f60ed32	A global variable with external weak linkage can be null, while an alias could alias such a global variable. llvm-svn: 42130	2007-09-19 10:10:31 +00:00
Dan Gohman	2de5779a99	Instcombine x-((x/y)*y) into a remainder operator. llvm-svn: 42035	2007-09-17 17:31:57 +00:00
Duncan Sands	901cb2662d	Factor the trampoline transformation into a subroutine. llvm-svn: 42021	2007-09-17 10:26:40 +00:00
Dale Johannesen	575bd6070a	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Chris Lattner	9c3cd36dd0	silence a bogus gcc warning. llvm-svn: 41949	2007-09-14 03:07:24 +00:00
Duncan Sands	c63fd15cd9	Turn calls to trampolines into calls to the underlying nested function. llvm-svn: 41844	2007-09-11 14:35:41 +00:00
Chris Lattner	3ace09794b	remove some dead code, this is handled by constant folding. llvm-svn: 41819	2007-09-10 23:46:29 +00:00
Chris Lattner	8e6c39d961	Don't zap back to back volatile load/stores llvm-svn: 41759	2007-09-07 05:33:03 +00:00
Dale Johannesen	86f367a6b7	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Nick Lewycky	79e179ff1f	Use isTrueWhenEqual. Thanks Chris! llvm-svn: 41741	2007-09-06 02:40:25 +00:00
Nick Lewycky	2f66503c0a	When the two operands of an icmp are equal, there are five possible predicates that would make the icmp true. Fixes PR1637. llvm-svn: 41740	2007-09-06 01:10:22 +00:00
Chuck Rose III	4f602f5eba	Forgot to obey 80 column rule. Fixing that. llvm-svn: 41725	2007-09-05 20:36:41 +00:00
Chuck Rose III	a1061872a7	Added default parameters to GetElementPtrInstr constructor call. Visual Studio 2k5 was getting confused and was unable to compile it. Suspected compiler error. llvm-svn: 41721	2007-09-05 16:54:38 +00:00
David Greene	8cda5af2e7	Update GEP constructors to use an iterator interface to fix GLIBCXX_DEBUG issues. llvm-svn: 41697	2007-09-04 15:46:09 +00:00
Chris Lattner	73aa3d62dc	Cut off crazy computation. This helps PR1622 slightly. llvm-svn: 41522	2007-08-28 04:23:55 +00:00
David Greene	5b85021be8	Update InvokeInst to work like CallInst llvm-svn: 41506	2007-08-27 19:04:21 +00:00
Chris Lattner	50f25115cd	Transform a load from an undef/zero global into an undef/global even if we have complex pointer manipulation going on. This allows us to compile stuff like this: __m128i foo(__m128i x){ static const unsigned int c_0[4] = { 0, 0, 0, 0 }; __m128i v_Zero = _mm_loadu_si128((__m128i*)c_0); x = _mm_unpacklo_epi8(x, v_Zero); return x; } into: _foo: xorps %xmm1, %xmm1 punpcklbw %xmm1, %xmm0 ret llvm-svn: 41022	2007-08-11 18:48:48 +00:00
Chris Lattner	3548932573	when we see a unaligned load from an insufficiently aligned global or alloca, increase the alignment of the load, turning it into an aligned load. This allows us to compile: #include <xmmintrin.h> __m128i foo(__m128i x){ static const unsigned int c_0[4] = { 0, 0, 0, 0 }; __m128i v_Zero = _mm_loadu_si128((__m128i*)c_0); x = _mm_unpacklo_epi8(x, v_Zero); return x; } into: _foo: punpcklbw _c_0.5944, %xmm0 ret .data .lcomm _c_0.5944,16,4 # c_0.5944 instead of: _foo: movdqu _c_0.5944, %xmm1 punpcklbw %xmm1, %xmm0 ret .data .lcomm _c_0.5944,16,2 # c_0.5944 llvm-svn: 40971	2007-08-09 19:05:49 +00:00
Nick Lewycky	34cf98c558	It's safe to fold not of fcmp. llvm-svn: 40870	2007-08-06 20:04:16 +00:00
Chris Lattner	6d8e77a703	at the end of instcombine, explicitly clear WorklistMap. This shrinks it down to something small. On the testcase from PR1432, this speeds up instcombine from 0.7959s to 0.5000s, (59%) llvm-svn: 40840	2007-08-05 08:47:58 +00:00
Chandler Carruth	00e56b0e81	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	5ca7348fc4	Replacing a cast with another one does not reduce the number of casts in the input. llvm-svn: 40741	2007-08-02 17:23:38 +00:00
Chris Lattner	7c6e8f735f	Disable an xform that causes an infinite loop. This fixes PR1594 llvm-svn: 40739	2007-08-02 16:56:32 +00:00
Chris Lattner	25a8bfdedb	wrap some long lines. Major offenders that are left include gvn, gvnpre, dse, and predsimplify. To see these, use: make check-line-length llvm-svn: 40738	2007-08-02 16:53:43 +00:00
Chris Lattner	0111f62050	Enhance instcombine to be more aggressive about folding casts of operations of casts. This implements InstCombine/zext-fold.ll llvm-svn: 40726	2007-08-02 06:11:14 +00:00
David Greene	f06a395bb9	New CallInst interface to address GLIBCXX_DEBUG errors caused by indexing an empty std::vector. Updates to all clients. llvm-svn: 40660	2007-08-01 03:43:44 +00:00
Lauro Ramos Venancio	abf6c6d469	Fix a bug in GetKnownAlignment of packed structs. llvm-svn: 40649	2007-07-31 20:13:21 +00:00
Reid Spencer	eb6f2d338a	Fix a typo/thinko. llvm-svn: 40599	2007-07-30 19:53:57 +00:00
Chris Lattner	914de64a0a	completely remove a transformation that is unsafe in the face of undefs. llvm-svn: 40439	2007-07-23 17:10:17 +00:00
Devang Patel	f45fc256e1	Apply temporary work around to fix llvm mis-compilation reported in PR 1556. llvm-svn: 40133	2007-07-21 00:34:29 +00:00
Chris Lattner	9663eb4a5b	this xform is already done by the constant folder. llvm-svn: 40124	2007-07-20 22:06:41 +00:00
Dan Gohman	87107326f6	Optimize alignment of loads and stores. llvm-svn: 40102	2007-07-20 16:34:21 +00:00
Dan Gohman	0ba554c0c8	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Chris Lattner	66b7f0c956	Repair a regression in Transforms/InstCombine/mul.ll that Reid noticed. llvm-svn: 39896	2007-07-16 04:15:34 +00:00
Chris Lattner	f6a91d74e0	Implement shift-simplify.ll:test[45]. First teach instcombine that sign bit checks only demand the sign bit, this allows simplify demanded bits to hack on expressions better. Second, teach instcombine that ashr is useless if only the sign bit is demanded. llvm-svn: 39880	2007-07-15 20:54:51 +00:00
Chris Lattner	9cb8da1cb8	Implement shift-simplify.ll:test3, turning: (X << 31) <s 0 --> (X&1) != 0 This happens dozens of times in the CFE. llvm-svn: 39879	2007-07-15 20:42:37 +00:00
Chris Lattner	4624e16f16	Significantly improve the documentation of the instcombine divide/compare transformation. Also, keep track of which end of the integer interval overflows occur on. This fixes Transforms/InstCombine/2007-06-21-DivCompareMiscomp.ll and rdar://5278853, a miscompilation of perl. llvm-svn: 37692	2007-06-21 18:11:19 +00:00
Chris Lattner	26537049eb	refactor a bunch of code out of visitICmpInstWithInstAndIntCst into its own routine. llvm-svn: 37679	2007-06-20 23:46:26 +00:00
Chris Lattner	6809b4b4fa	silence a bogus warning Duraid ran into. llvm-svn: 37649	2007-06-19 05:43:49 +00:00
Chris Lattner	3a979f2fa5	Generalize many transforms to work on ~ of vectors in addition to ~ of integer ops. This implements Transforms/InstCombine/and-or-not.ll test3/test4, and finishes off PR1510 llvm-svn: 37589	2007-06-15 06:23:19 +00:00
Chris Lattner	6c32b44b4c	Implement two xforms: 1. ~(~X \| Y) === (X & ~Y) 2. (A\|B) & ~(A&B) -> A^B This allows us to transform ~(~(a\|b) \| (a&b)) -> a^b. This implements PR1510 for scalar values. llvm-svn: 37584	2007-06-15 05:58:24 +00:00
Chris Lattner	96c6cf8b89	delete some obviously dead vector operations, which deletes a few thousand operations from Duraids example. llvm-svn: 37582	2007-06-15 05:26:55 +00:00
Lauro Ramos Venancio	27fa43f343	Fix PR1499. llvm-svn: 37472	2007-06-06 17:08:48 +00:00
Chris Lattner	4cc07421ee	fix a miscompilation when passing a float through varargs llvm-svn: 37297	2007-05-23 01:17:04 +00:00
Chris Lattner	dda5066a5e	Fix Transforms/InstCombine/2007-05-18-CastFoldBug.ll, a bug that devastates objc code due to the way the FE lowers objc message sends. llvm-svn: 37256	2007-05-19 06:51:32 +00:00
Chris Lattner	dfc6f4a06c	Fix Transforms/InstCombine/2007-05-14-Crash.ll llvm-svn: 37057	2007-05-15 00:16:00 +00:00
Dan Gohman	167379e73a	Fix typos. llvm-svn: 36994	2007-05-11 21:10:54 +00:00
Chris Lattner	3936efb43b	fix regressions from my previous checking, including Transforms/InstCombine/2006-12-08-ICmp-Combining.ll llvm-svn: 36989	2007-05-11 16:58:45 +00:00
Chris Lattner	f7adc33cbd	fix Transforms/InstCombine/2007-05-10-icmp-or.ll llvm-svn: 36984	2007-05-11 05:55:56 +00:00
Nick Lewycky	c2306ff5b4	Fix typo in comment. llvm-svn: 36873	2007-05-06 13:37:16 +00:00
Chris Lattner	619ffa3881	Fix a bug in my previous patch llvm-svn: 36857	2007-05-06 07:24:03 +00:00
Chris Lattner	26c55e2dda	Implement Transforms/InstCombine/cast_ptr.ll llvm-svn: 36809	2007-05-05 22:41:33 +00:00
Chris Lattner	46b06d19d4	wrap long lines llvm-svn: 36807	2007-05-05 22:32:24 +00:00
Chris Lattner	750321383f	Fix InstCombine/2007-05-04-Crash.ll and PR1384 llvm-svn: 36775	2007-05-05 01:59:31 +00:00
Devang Patel	cd45427a87	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Devang Patel	8ee9065162	Use 'static const char' instead of 'static const int'. Due to darwin gcc bug, one version of darwin linker coalesces static const int, which defauts PassID based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	38a66bc82e	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Chris Lattner	5cb586acf5	fix a bug triggered by 403.gcc llvm-svn: 36527	2007-04-28 05:27:36 +00:00
Chris Lattner	13b1e24f41	Fix several latent bugs in EmitGEPOffset that didn't manifest with its previous clients. This fixes MallocBench/gs llvm-svn: 36525	2007-04-28 04:52:43 +00:00
Chris Lattner	22d3205e80	uhn zap cvs llvm-svn: 36523	2007-04-28 03:50:56 +00:00
Chris Lattner	56ac7ba950	Implement PR1345 and Transforms/InstCombine/bitcast-gep.ll llvm-svn: 36521	2007-04-28 00:57:34 +00:00
Chris Lattner	e0fe3b5c35	refactor some code relating to pointer cast xforms, pulling it out of the codepath for unrelated casts. llvm-svn: 36511	2007-04-27 17:44:50 +00:00
Zhou Sheng	9339daf407	Make use of ConstantInt::isZero instead of ConstantInt::isNullValue. llvm-svn: 36261	2007-04-19 05:39:12 +00:00
Chris Lattner	fe00dd8315	Extend store merging to support the 'if/then' version in addition to if/then/else. This sinks the two stores in this example into a single store in cond_next. In this case, it allows elimination of the load as well: store double 0.000000e+00, double* @s.3060 %tmp3 = fcmp ogt double %tmp1, 5.000000e-01 ; <i1> [#uses=1] br i1 %tmp3, label %cond_true, label %cond_next cond_true: ; preds = %entry store double 1.000000e+00, double* @s.3060 br label %cond_next cond_next: ; preds = %entry, %cond_true %tmp6 = load double* @s.3060 ; <double> [#uses=1] This implements Transforms/InstCombine/store-merge.ll:test2 llvm-svn: 36040	2007-04-15 01:02:18 +00:00
Chris Lattner	ecd0fda993	refactor some code, no functionality change. llvm-svn: 36037	2007-04-15 00:07:55 +00:00
Chris Lattner	022c2bc0c3	fix long lines llvm-svn: 36031	2007-04-14 23:32:02 +00:00
Chris Lattner	9764a3cf09	Implement Transforms/InstCombine/vec_extract_elt.ll, transforming: define i32 @test(float %f) { %tmp7 = insertelement <4 x float> undef, float %f, i32 0 %tmp17 = bitcast <4 x float> %tmp7 to <4 x i32> %tmp19 = extractelement <4 x i32> %tmp17, i32 0 ret i32 %tmp19 } into: define i32 @test(float %f) { %tmp19 = bitcast float %f to i32 ; <i32> [#uses=1] ret i32 %tmp19 } On PPC, this is the difference between: _test: mfspr r2, 256 oris r3, r2, 8192 mtspr 256, r3 stfs f1, -16(r1) addi r3, r1, -16 addi r4, r1, -32 lvx v2, 0, r3 stvx v2, 0, r4 lwz r3, -32(r1) mtspr 256, r2 blr and: _test: stfs f1, -4(r1) nop nop nop lwz r3, -4(r1) blr llvm-svn: 36025	2007-04-14 23:02:14 +00:00

... 3 4 5 6 7 ...

1194 Commits