llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Dan Gohman	8c51905154	This cast<Instruction> is unnecessary. llvm-svn: 96771	2010-02-22 02:07:36 +00:00
Dan Gohman	220c2212c1	Rename getSDiv to getExactSDiv to reflect its behavior in cases where the division would have a remainder. llvm-svn: 96693	2010-02-19 19:35:48 +00:00
Dan Gohman	9db0689627	Check for overflow when scaling up an add or an addrec for scaled reuse. llvm-svn: 96692	2010-02-19 19:32:49 +00:00
Dale Johannesen	0f58c73229	recommit 96626, evidence that it broke things appears to be spurious llvm-svn: 96662	2010-02-19 07:14:22 +00:00
Dale Johannesen	d73b81bf12	Revert 96626, which causes build failure on ppc Darwin. llvm-svn: 96653	2010-02-19 01:54:37 +00:00
Dan Gohman	58199e30dc	When determining the set of interesting reuse factors, consider strides in foreign loops. This helps locate reuse opportunities with existing induction variables in foreign loops and reduces the need for inserting new ones. This fixes rdar://7657764. llvm-svn: 96629	2010-02-19 00:05:23 +00:00
Dan Gohman	2f0615ee5e	Indvars needs to explicitly notify ScalarEvolution when it is replacing a loop exit value, so that if a loop gets deleted, ScalarEvolution isn't stick holding on to dangling SCEVAddRecExprs for that loop. This fixes PR6339. llvm-svn: 96626	2010-02-18 23:26:33 +00:00
Dan Gohman	f0d7b5df4b	Hoist this loop-invariant logic out of the loop. llvm-svn: 96614	2010-02-18 21:34:02 +00:00
Dan Gohman	82f46afcdd	Delete some unneeded casts. llvm-svn: 96429	2010-02-17 00:42:19 +00:00
Dan Gohman	493a1fcbe0	Don't attempt to divide INT_MIN by -1; consider such cases to have overflowed. llvm-svn: 96428	2010-02-17 00:41:53 +00:00
Bob Wilson	86dded571f	Rename SuccessorNumber to GetSuccessorNumber. llvm-svn: 96387	2010-02-16 21:06:42 +00:00
Dan Gohman	e2da5181db	Refactor rewriting for PHI nodes into a separate function. llvm-svn: 96382	2010-02-16 20:25:07 +00:00
Bob Wilson	a866c660db	Split critical edges as needed for load PRE. llvm-svn: 96378	2010-02-16 19:51:59 +00:00
Bob Wilson	1115365fcf	Refactor to share code to find the position of a basic block successor in the terminator's list of successors. llvm-svn: 96377	2010-02-16 19:49:17 +00:00
Dan Gohman	815d966a5a	Fix whitespace. llvm-svn: 96372	2010-02-16 19:42:34 +00:00
Duncan Sands	1b33dd3c83	There are two ways of checking for a given type, for example isa<PointerType>(T) and T->isPointerTy(). Convert most instances of the first form to the second form. Requested by Chris. llvm-svn: 96344	2010-02-16 11:11:14 +00:00
Dan Gohman	d19ecedc40	Split the main for-each-use loop again, this time for GenerateTruncates, as it also peeks at which registers are being used by other uses. This makes LSR less sensitive to use-list order. llvm-svn: 96308	2010-02-16 01:42:53 +00:00
Chris Lattner	a8505609fe	fix PR6305 by handling BlockAddress in a helper function called by jump threading. llvm-svn: 96263	2010-02-15 20:47:49 +00:00
Duncan Sands	2acaf3609c	Uniformize the names of type predicates: rather than having isFloatTy and isInteger, we now have isFloatTy and isIntegerTy. Requested by Chris! llvm-svn: 96223	2010-02-15 16:12:20 +00:00
Dan Gohman	c954fc3333	Fix whitespace. llvm-svn: 96179	2010-02-14 18:51:39 +00:00
Dan Gohman	900be69752	Fix a comment. llvm-svn: 96178	2010-02-14 18:51:20 +00:00
Dan Gohman	de32b2fac0	When complicated expressions are broken down into subexpressions with multiplication by constants distributed through, occasionally those subexpressions can include both x and -x. For now, if this condition is discovered within LSR, just prune such cases away, as they won't be profitable. This fixes a "zero allocated in a base register" assertion failure. llvm-svn: 96177	2010-02-14 18:50:49 +00:00
Dan Gohman	dbffcb6c92	Actually, this code doesn't have to be quite so conservative in the no-TLI case. But it should still default to declining the transformation. llvm-svn: 96152	2010-02-14 03:21:49 +00:00
Dan Gohman	3b065f43af	Don't attempt aggressive post-inc uses if TargetLowering is not available, because profitability can't be sufficiently approximated. llvm-svn: 96148	2010-02-14 02:45:21 +00:00
John McCall	5089836939	Make LSR not crash if invoked without target lowering info, e.g. if invoked from opt. llvm-svn: 96135	2010-02-13 23:40:16 +00:00
Eric Christopher	96f3c4222f	Fix a problem where we had bitcasted operands that gave us odd offsets since the bitcasted pointer size and the offset pointer size are going to be different types for the GEP vs base object. llvm-svn: 96134	2010-02-13 23:38:01 +00:00
Chris Lattner	23368e55d7	remove dead code. llvm-svn: 96109	2010-02-13 19:07:06 +00:00
Chris Lattner	0874dc6dab	Split some code out to a helper function (FindReusablePredBB) and add a doxygen comment. Cache the phi entry to avoid doing tons of PHINode::getBasicBlockIndex calls in the common case. On my insane testcase from re2c, this speeds up CGP from 617.4s to 7.9s (78x). llvm-svn: 96083	2010-02-13 05:35:08 +00:00
Chris Lattner	9c797fdf1d	Speed up codegen prepare from 3.58s to 0.488s. llvm-svn: 96081	2010-02-13 05:01:14 +00:00
Chris Lattner	975da20868	PHINode::getBasicBlockIndex is O(n) in the number of inputs to a PHI, avoid it in the common case where the BB occurs in the same index for multiple phis. This speeds up CGP on an insane testcase from 8.35 to 3.58s. llvm-svn: 96080	2010-02-13 04:24:19 +00:00
Chris Lattner	1a067d53b0	iterate over preds using PHI information when available instead of using pred_begin/end. It is much faster. llvm-svn: 96079	2010-02-13 04:15:26 +00:00
Chris Lattner	9be0b06335	speed up CGP a bit by scanning predecessors through phi operands instead of with pred_begin/end. llvm-svn: 96078	2010-02-13 04:04:42 +00:00
Dan Gohman	25722fd3fc	Fix a pruning heuristic which implicitly assumed that SmallPtrSet is deterministically sorted. llvm-svn: 96071	2010-02-13 02:06:02 +00:00
Jakob Stoklund Olesen	8ce1b3d280	Enable the inlinehint attribute in the Inliner. Functions explicitly marked inline will get an inlining threshold slightly more aggressive than the default for -O3. This means than -O3 builds are mostly unaffected while -Os builds will be a bit bigger and faster. The difference depends entirely on how many 'inline's are sprinkled on the source. In the CINT2006 suite, only these tests are significantly affected under -Os: Size Time 471.omnetpp +1.63% -1.85% 473.astar +4.01% -6.02% 483.xalancbmk +4.60% 0.00% Note that 483.xalancbmk runs too quickly to give useful timing results. llvm-svn: 96066	2010-02-13 01:51:53 +00:00
Dan Gohman	cbf5771473	Reapply 95979, a compile-time speedup, now that the bug it exposed is fixed. llvm-svn: 96005	2010-02-12 19:35:25 +00:00
Dan Gohman	006952d39d	Fix this code to avoid dereferencing an end() iterator in offset distributions it doesn't expect. llvm-svn: 96002	2010-02-12 19:20:37 +00:00
Chris Lattner	ecb203898a	1. modernize the constantmerge pass, using densemap/smallvector. 2. don't bother trying to merge globals in non-default sections, doing so is quite dubious at best anyway. 3. fix a bug reported by Arnaud de Grandmaison where we'd try to merge two globals in different address spaces. llvm-svn: 95995	2010-02-12 18:17:23 +00:00
Daniel Dunbar	43ad3dfe00	Revert "Reverse the order for collecting the parts of an addrec. The order", it is breaking llvm-gcc bootstrap. llvm-svn: 95988	2010-02-12 17:27:08 +00:00
Dan Gohman	2f46f79492	Reverse the order for collecting the parts of an addrec. The order doesn't matter, except that ScalarEvolution tends to need less time to fold the results this way. llvm-svn: 95979	2010-02-12 11:08:26 +00:00
Dan Gohman	c40eb525ad	Reapply the new LoopStrengthReduction code, with compile time and bug fixes, and with improved heuristics for analyzing foreign-loop addrecs. This change also flattens IVUsers, eliminating the stride-oriented groupings, which makes it easier to work with. llvm-svn: 95975	2010-02-12 10:34:29 +00:00
Eric Christopher	2e0201ee18	Make sure that ConstantExpr offsets also aren't off of extern symbols. Thanks to Duncan Sands for the testcase! llvm-svn: 95877	2010-02-11 17:44:04 +00:00
Chris Lattner	a59eb7c09c	Rename ValueRequiresCast to ShouldOptimizeCast, to better reflect what it does. Enhance it to return false to optimizing vector sign extensions from vector comparisions, which is the idiom used to get a splatted vector for a vector comparison. Doing this breaks vector-casts.ll, add some compensating transformations to handle the important case they cover without depending on this canonicalization. This fixes rdar://7434900 a serious pessimization of vector compares. llvm-svn: 95855	2010-02-11 06:26:33 +00:00
Chris Lattner	a087e6e82f	Make DSE only scan blocks that are reachable from the entry block. Other blocks may have pointer cycles that will crash basicaa and other alias analyses. In any case, there is no point wasting cycles optimizing dead blocks. This fixes rdar://7635088 llvm-svn: 95852	2010-02-11 05:11:54 +00:00
Chris Lattner	733ffcdb1f	Make jump threading honor x\|undef -> true and x&undef -> false, instead of considering x\|undef -> x, which may not be true. llvm-svn: 95850	2010-02-11 04:40:44 +00:00
Eric Christopher	9516309f55	Add ConstantExpr handling to Intrinsic::objectsize lowering. Update testcase accordingly now that we can optimize another section. llvm-svn: 95846	2010-02-11 01:48:54 +00:00
Devang Patel	e87a8a944d	Ignore dbg info intrinsics. llvm-svn: 95828	2010-02-11 00:20:49 +00:00
Devang Patel	43cc7530be	Strip new llvm.dbg.value intrinsic. llvm-svn: 95807	2010-02-10 21:19:56 +00:00
Dan Gohman	92b6122204	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
Eric Christopher	6691a59247	Move Intrinsic::objectsize lowering back to InstCombineCalls and enable constant 0 offset lowering. llvm-svn: 95691	2010-02-09 21:24:27 +00:00
Eric Christopher	871cf7bce2	Pull these back out, they're a little too aggressive and time consuming for a simple optimization. llvm-svn: 95671	2010-02-09 17:29:18 +00:00
Chris Lattner	f6d9c1f90c	simplify this code, duh. llvm-svn: 95643	2010-02-09 01:14:06 +00:00
Chris Lattner	26b712379f	fix PR6193, only considering sign extensions from i1 for this xform. llvm-svn: 95642	2010-02-09 01:12:41 +00:00
Eric Christopher	1ff7f162e2	Add file in here too. llvm-svn: 95641	2010-02-09 01:11:03 +00:00
Eric Christopher	428b385575	Add a new pass to do llvm.objsize lowering using SCEV. Initial skeleton and SCEVUnknown lowering implemented, the rest should come relatively quickly. Move testcase to new directory. Move pass to right before SimplifyLibCalls - which is moved down a bit so we can take advantage of a few opts. llvm-svn: 95628	2010-02-09 00:35:38 +00:00
Chris Lattner	9635fe03ca	fix some problems handling large vectors reported in PR6230 llvm-svn: 95616	2010-02-08 23:56:03 +00:00
Jakob Stoklund Olesen	83ebc265b3	Reintroduce the InlineHint function attribute. This time it's for real! I am going to hook this up in the frontends as well. The inliner has some experimental heuristics for dealing with the inline hint. When given a -respect-inlinehint option, functions marked with the inline keyword are given a threshold just above the default for -O3. We need some experiments to determine if that is the right thing to do. llvm-svn: 95466	2010-02-06 01:16:28 +00:00
Jakob Stoklund Olesen	7b4c60adae	Don't unroll loops containing function calls. llvm-svn: 95454	2010-02-05 23:21:31 +00:00
Jakob Stoklund Olesen	670458b3be	Teach SimplifyCFG about magic pointer constants. Weird code sometimes uses pointer constants other than null. This patch teaches SimplifyCFG to build switch instructions in those cases. Code like this: void f(const char x) { if (!x) puts("null"); else if ((uintptr_t)x == 1) puts("one"); else if (x == (char)2 \|\| x == (char)3) puts("two"); else if ((intptr_t)x == 4) puts("four"); else puts(x); } Now becomes a switch: define void @f(i8 %x) nounwind ssp { entry: %magicptr23 = ptrtoint i8* %x to i64 ; <i64> [#uses=1] switch i64 %magicptr23, label %if.else16 [ i64 0, label %if.then i64 1, label %if.then2 i64 2, label %if.then9 i64 3, label %if.then9 i64 4, label %if.then14 ] Note that LLVM's own DenseMap uses magic pointers. llvm-svn: 95439	2010-02-05 22:03:18 +00:00
Chris Lattner	44965f1107	fix logical-select to invoke filecheck right, and fix hte instcombine xform it is checking to actually pass. There is no need to match m_SelectCst<0, -1> since instcombine canonicalizes that into not(sext). Add matches for sext(not(x)) in addition to not(sext(x)). llvm-svn: 95420	2010-02-05 19:53:02 +00:00
Dan Gohman	96cad72d2d	Implement releaseMemory in CodeGenPrepare and free the BackEdges container data. This prevents it from holding onto dangling pointers and potentially behaving unpredictably. llvm-svn: 95409	2010-02-05 19:24:11 +00:00
Dan Gohman	8142ba6bbb	Use a SmallSetVector instead of a SetVector; this code showed up as a malloc caller in a profile. llvm-svn: 95407	2010-02-05 19:20:15 +00:00
Eric Christopher	f89979ce6a	Remove this code for now. I have a better idea and will rewrite with that in mind. llvm-svn: 95402	2010-02-05 19:04:06 +00:00
Bob Wilson	c7e8107ff2	Do not reassociate expressions with i1 type. SimplifyCFG converts some short-circuited conditions to AND/OR expressions, and those expressions are often converted back to a short-circuited form in code gen. The original source order may have been optimized to take advantage of the expected values, and if we reassociate them, we change the order and subvert that optimization. Radar 7497329. llvm-svn: 95333	2010-02-04 23:32:37 +00:00
Jakob Stoklund Olesen	54b09bc819	Increase inliner thresholds by 25. This makes the inliner about as agressive as it was before my changes to the inliner cost calculations. These levels give the same performance and slightly smaller code than before. llvm-svn: 95320	2010-02-04 18:48:20 +00:00
Eric Christopher	ee4a176739	Temporarily revert this since it appears to have caused a build failure. llvm-svn: 95294	2010-02-04 06:41:27 +00:00
Eric Christopher	9b3e42f09e	Rework constant expr and array handling for objectsize instcombining. Fix bugs where we would compute out of bounds as in bounds, and where we couldn't know that the linker could override the size of an array. Add a few new testcases, change existing testcase to use a private global array instead of extern. llvm-svn: 95283	2010-02-04 02:55:34 +00:00
Eric Christopher	fe6ab1518e	If we're dealing with a zero-length array, don't lower to any particular size, we just don't know what the length is yet. llvm-svn: 95266	2010-02-03 23:56:07 +00:00
Bob Wilson	8635e08b7f	Adjust the heuristics used to decide when SROA is likely to be profitable. The SRThreshold value makes perfect sense for checking if an entire aggregate should be promoted to a scalar integer, but it is not so good for splitting an aggregate into its separate elements. A struct may contain a large embedded array along with some scalar fields that would benefit from being split apart by SROA. Even if the total aggregate size is large, it may still be good to perform SROA. Thus, the most important piece of this patch is simply moving the aggregate size comparison vs. SRThreshold so that it guards only the aggregate promotion. We have also been checking the number of elements to decide if an aggregate should be split up. The limit of "SRThreshold/4" seemed rather arbitrary, and I don't think it's very useful to derive this limit from SRThreshold anyway. I've collected some data showing that the current default limit of 32 (since SRThreshold defaults to 128) is a reasonable cutoff for struct types. One thing suggested by the data is that distinguishing between structs and arrays might be useful. There are (obviously) a lot more large arrays than large structs (as measured by the number of elements and not the total size -- a large array inside a struct still counts as a single element given the way we do SROA right now). Out of 8377 arrays where we successfully performed SROA while compiling a large set of benchmarks, only 16 of them had more than 8 elements. And, for those 16 arrays, it's not at all clear that SROA was actually beneficial. So, to offset the compile time cost of investigating more large structs for SROA, the patch lowers the limit on array elements to 8. This fixes Apple Radar 7563690. llvm-svn: 95224	2010-02-03 17:23:56 +00:00
Evan Cheng	e273e42195	Revert 94937 and move the noreturn check to codegen. llvm-svn: 95198	2010-02-03 03:55:59 +00:00
Bob Wilson	cabb8bea4c	Fix some comment typos. llvm-svn: 95170	2010-02-03 00:33:21 +00:00
Eric Christopher	ac28e14b77	Recommit this, looks like it wasn't the cause. llvm-svn: 95165	2010-02-03 00:21:58 +00:00
Eric Christopher	f070aae6f7	Hopefully temporarily revert this. llvm-svn: 95154	2010-02-02 23:01:31 +00:00
Eric Christopher	048492d669	Reformat my last patch slightly. llvm-svn: 95147	2010-02-02 22:29:26 +00:00
Eric Christopher	575fe8690d	Re-add strcmp and known size object size checking optimization. Passed bootstrap and nightly test run here. llvm-svn: 95145	2010-02-02 22:10:43 +00:00
Chris Lattner	9f50341a96	don't turn (A & (C0?-1:0)) \| (B & ~(C0?-1:0)) -> C0 ? A : B for vectors. Codegen is generating awful code or segfaulting in various cases (e.g. PR6204). llvm-svn: 95058	2010-02-02 02:43:51 +00:00
Chris Lattner	e471d94f91	fix a crash in loop unswitch on a loop invariant vector condition. llvm-svn: 95055	2010-02-02 02:26:54 +00:00
Dan Gohman	95e0161d1b	LangRef.html says that inttoptr and ptrtoint always use zero-extension when the cast is extending. llvm-svn: 95046	2010-02-02 01:44:02 +00:00
Eric Christopher	02559754dc	Don't need to check the last argument since it'll always be bool. We also don't use TargetData here. llvm-svn: 95040	2010-02-02 00:51:45 +00:00
Eric Christopher	48f2aae32f	More indentation/tabification fixes. llvm-svn: 95036	2010-02-02 00:13:06 +00:00
Eric Christopher	1004836336	Untabify previous commit. llvm-svn: 95035	2010-02-02 00:06:55 +00:00
Eric Christopher	9e9e599070	Formatting. llvm-svn: 95027	2010-02-01 23:25:03 +00:00
Bob Wilson	1f966e8ca6	Add an option to GVN to remove all partially redundant loads. This is currently disabled by default. This divides the existing load PRE code into 2 phases: first it checks that it is safe to move the load to each of the predecessors where it is unavailable, and then if it is safe, the code is changed to move the load. Radar 7571861. llvm-svn: 95007	2010-02-01 21:17:14 +00:00
Chris Lattner	333bf5abf5	cleanups. llvm-svn: 94995	2010-02-01 19:54:45 +00:00
Chris Lattner	5f10919836	fix rdar://7590304, a miscompilation of objc apps on arm. The caller of objc message send was getting marked arm_apcscc, but the prototype isn't. This is fine at runtime because objcmsgsend is implemented in assembly. Only turn a mismatched caller and callee into 'unreachable' if the callee is a definition. llvm-svn: 94986	2010-02-01 18:11:34 +00:00
Chris Lattner	a336497d3f	fix rdar://7590304, an infinite loop in instcombine. In the invoke case, instcombine can't zap the invoke for fear of changing the CFG. However, we have to do something to prevent the next iteration of instcombine from inserting another store -> undef before the invoke thereby getting into infinite iteration between dead store elim and store insertion. Just zap the callee to null, which will prevent the next iteration from doing anything. llvm-svn: 94985	2010-02-01 18:04:58 +00:00
Bob Wilson	8207d33d94	Fix pr6198 by moving the isSized() check to an outer conditional. The testcase from pr6198 does not crash for me -- I don't know what's up with that -- so I'm not adding it to the tests. llvm-svn: 94984	2010-02-01 17:41:44 +00:00
Eli Friedman	19c5c57885	Simplify/generalize the xor+add->sign-extend instcombine. llvm-svn: 94943	2010-01-31 04:29:12 +00:00
Eli Friedman	58c7936637	Add a small transform: transform -(X<<Y) to (-X<<Y) when the shift has a single use and X is free to negate. llvm-svn: 94941	2010-01-31 02:30:23 +00:00
Evan Cheng	c2f3c20680	Do not mark no-return calls tail calls. It'll screw up special calls like longjmp and it doesn't make much sense for performance reason. If my logic is faulty, please let me know. llvm-svn: 94937	2010-01-31 00:59:31 +00:00
Bob Wilson	0f04082970	Check alignment of loads when deciding whether it is safe to execute them unconditionally. Besides checking the offset, also check that the underlying object is aligned as much as the load itself. llvm-svn: 94875	2010-01-30 04:42:39 +00:00
Bob Wilson	e979c978e0	Use more specific types to avoid casts. No functionality change. llvm-svn: 94863	2010-01-30 00:41:10 +00:00
Jakob Stoklund Olesen	e3fd8b5848	Keep iterating over all uses when meeting a phi node in AllUsesOfValueWillTrapIfNull(). This bug was exposed by my inliner cost changes in r94615, and caused failures of lencod on most architectures when building with LTO. This patch fixes lencod and 464.h264ref on x86-64 (and likely others). llvm-svn: 94858	2010-01-29 23:54:14 +00:00
Bob Wilson	71bc5f0787	Preserve load alignment in instcombine transformations. I've been unable to create a testcase where this matters. The select+load transformation only occurs when isSafeToLoadUnconditionally is true, and in those situations, instcombine also changes the underlying objects to be aligned. This seems like a good idea regardless, and I've verified that it doesn't pessimize the subsequent realignment. llvm-svn: 94850	2010-01-29 22:39:21 +00:00
Eric Christopher	47d90f7adb	Revert my last couple of patches. They appear to have broken bison. llvm-svn: 94841	2010-01-29 21:16:24 +00:00
Bob Wilson	bb651db10d	Use uint64_t instead of unsigned for offsets and sizes. llvm-svn: 94835	2010-01-29 20:34:28 +00:00
Bob Wilson	f897b7b37e	Improve isSafeToLoadUnconditionally to recognize that GEPs with constant indices are safe if the result is known to be within the bounds of the underlying object. llvm-svn: 94829	2010-01-29 19:19:08 +00:00
Duncan Sands	d6baca9159	Having RHSKnownZero and RHSKnownOne be alternative names for KnownZero and KnownOne (via APInt &RHSKnownZero = KnownZero, etc) seems dangerous and confusing to me: it is easy not to notice this, and then wonder why KnownZero/RHSKnownZero changed underneath you when you modified RHSKnownZero/KnownZero etc. So get rid of this. No intended functionality change (tested with "make check" + llvm-gcc bootstrap). llvm-svn: 94802	2010-01-29 06:18:46 +00:00
Eric Christopher	f01379e6c2	Make strcpy_chk lower to strcpy if we have a safe size. llvm-svn: 94783	2010-01-29 01:37:11 +00:00
Eric Christopher	7d74af1824	Add constant support to object size handling and remove default lowering. We'll either figure it out, or not and be lowered by SelectionDAGBuild. Add test. llvm-svn: 94775	2010-01-29 01:09:57 +00:00
Bill Wendling	f6736d2bae	Generic reformatting and comment fixing. No functionality change. llvm-svn: 94771	2010-01-29 00:52:43 +00:00

1 2 3 4 5 ...

6501 Commits