llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 12:02:58 +02:00

Author	SHA1	Message	Date
Nick Lewycky	289c30130a	Now that we look at all the header PHIs, we need to consider all the header PHIs when deciding that the loop has stopped evolving. Fixes miscompile in the gcc torture testsuite! llvm-svn: 142843	2011-10-24 21:02:38 +00:00
Chandler Carruth	d04f838629	Remove return heuristics from the static branch probabilities, and introduce no-return or unreachable heuristics. The return heuristics from the Ball and Larus paper don't work well in practice as they pessimize early return paths. The only good hitrate return heuristics are those for: - NULL return - Constant return - negative integer return Only the last of these three can possibly require significant code for the returning block, and even the last is fairly rare and usually also a constant. As a consequence, even for the cold return paths, there is little code on that return path, and so little code density to be gained by sinking it. The places where sinking these blocks is valuable (inner loops) will already be weighted appropriately as the edge is a loop-exit branch. All of this aside, early returns are nearly as common as all three of these return categories, and should actually be predicted as taken! Rather than muddy the waters of the static predictions, just remain silent on returns and let the CFG itself dictate any layout or other issues. However, the return heuristic was flagging one very important case: unreachable. Unfortunately it still gave a 1/4 chance of the branch-to-unreachable occuring. It also didn't do a rigorous job of finding those blocks which post-dominate an unreachable block. This patch builds a more powerful analysis that should flag all branches to blocks known to then reach unreachable. It also has better worst-case runtime complexity by not looping through successors for each block. The previous code would perform an N^2 walk in the event of a single entry block branching to N successors with a switch where each successor falls through to the next and they finally fall through to a return. Test case added for noreturn heuristics. Also doxygen comments improved along the way. llvm-svn: 142793	2011-10-24 12:01:08 +00:00
Nick Lewycky	64d4e26aec	Reapply r142781 with fix. Original message: Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142790	2011-10-24 06:57:05 +00:00
Nick Lewycky	341bf1548e	PHI nodes not in the loop header aren't part of the loop iteration initial state. Furthermore, they might not have two operands. This fixes the underlying issue behind the crashes introduced in r142781. llvm-svn: 142788	2011-10-24 05:51:01 +00:00
Nick Lewycky	d72de74587	Speculatively revert r142781. Bots are showing Assertion `i_nocapture < OperandTraits<PHINode>::operands(this) && "getOperand() out of range!"' failed. coming out of indvars. llvm-svn: 142786	2011-10-24 04:00:25 +00:00
Chandler Carruth	c5722dbec0	Simplify the design of BranchProbabilityInfo by collapsing it into a single class. Previously it was split between two classes, one internal and one external. The concern seemed to center around exposing the weights used, but those can remain confined to the implementation file. Having a single class to maintain the state and analyses in use will also simplify several of the enhancements I want to make to our static heuristics. llvm-svn: 142783	2011-10-24 01:40:45 +00:00
Nick Lewycky	5ab7948d71	Enhance SCEV's brute force loop analysis to handle multiple PHI nodes in the loop header when computing the trip count. With this, we now constant evaluate: struct ListNode { const struct ListNode next; int i; }; static const struct ListNode node1 = {0, 1}; static const struct ListNode node2 = {&node1, 2}; static const struct ListNode node3 = {&node2, 3}; int test() { int sum = 0; for (const struct ListNode n = &node3; n != 0; n = n->next) sum += n->i; return sum; } llvm-svn: 142781	2011-10-23 23:43:14 +00:00
Chandler Carruth	93ee01158c	Tidy up a loop to be more idiomatic for LLVM's codebase, and remove some extraneous whitespace. Trying to clean-up this pass as much as I can before I start making functional changes. llvm-svn: 142780	2011-10-23 22:40:13 +00:00
Chandler Carruth	151d4fc273	Teach the BranchProbabilityInfo pass to print its results, and use that to bring it under direct test instead of merely indirectly testing it in the BlockFrequencyInfo pass. The next step is to start adding tests for the various heuristics employed, and to start fixing those heuristics once they're under test. llvm-svn: 142778	2011-10-23 21:21:50 +00:00
Benjamin Kramer	9adc582e35	Add compare operators to BranchProbability and use it to determine if an edge is hot. llvm-svn: 142751	2011-10-23 11:19:14 +00:00
Nick Lewycky	ce8bfeadff	Make SCEV's brute force analysis stronger in two ways. Firstly, we should be able to constant fold load instructions where the argument is a constant. Second, we should be able to watch multiple PHI nodes through the loop; this patch only supports PHIs in loop headers, more can be done here. With this patch, we now constant evaluate: static const int arr[] = {1, 2, 3, 4, 5}; int test() { int sum = 0; for (int i = 0; i < 5; ++i) sum += arr[i]; return sum; } llvm-svn: 142731	2011-10-22 19:58:20 +00:00
Benjamin Kramer	917737037d	Extend the floating point heuristic to consider NaN checks unlikely. llvm-svn: 142687	2011-10-21 21:13:47 +00:00
Benjamin Kramer	66a199d42f	BranchProbabilityInfo: floating point equality is unlikely. This is from the same paper from Ball and Larus as the rest of the currently implemented heuristics. llvm-svn: 142677	2011-10-21 20:12:47 +00:00
Eli Friedman	0c3b1df94e	A FIXME about block addresses and indirectbr. llvm-svn: 142569	2011-10-20 04:05:33 +00:00
Eli Friedman	6ff41ea07a	Simplify; no intended functional change. llvm-svn: 142567	2011-10-20 03:23:14 +00:00
Nick Lewycky	21a67a1454	"@string = constant i8 0" is a value i8* string of length zero. Analyze that correctly in GetStringLength, fixing PR11181! llvm-svn: 142558	2011-10-20 00:34:35 +00:00
Chandler Carruth	12a645d6f6	Generalize the reading of probability metadata to work for both branches and switches, with arbitrary numbers of successors. Still optimized for the common case of 2 successors for a conditional branch. Add a test case for switch metadata showing up in the BlockFrequencyInfo pass. llvm-svn: 142493	2011-10-19 10:32:19 +00:00
Chandler Carruth	18a382b4b6	Teach the BranchProbabilityInfo analysis pass to read any metadata encoding of probabilities. In the absense of metadata, it continues to fall back on static heuristics. This allows __builtin_expect, after lowering through llvm.expect a branch instruction's metadata, to actually enter the branch probability model. This is one component of resolving PR2577. llvm-svn: 142492	2011-10-19 10:30:30 +00:00
Chandler Carruth	13b475d4f6	Add pass printing support to BlockFrequencyInfo pass. The implementation layer already had support for printing the results of this analysis, but the wiring was missing. Now that printing the analysis works, actually bring some of this analysis, and the BranchProbabilityInfo analysis that it wraps, under test! I'm planning on fixing some bugs and doing other work here, so having a nice place to add regression tests and a way to observe the results is really useful. llvm-svn: 142491	2011-10-19 10:12:41 +00:00
Devang Patel	fb32a2ca1b	Update DebugInfoFinder to match recent debug info encoding changes. llvm-svn: 142295	2011-10-17 22:30:34 +00:00
Bill Wendling	584c5f9c62	Correct over-zealous removal of hack. Some code want to check that any call within a function has the 'returns twice' attribute, not just that the current function has one. llvm-svn: 142221	2011-10-17 18:43:40 +00:00
Bill Wendling	2ee7de36b8	Now that we have the ReturnsTwice function attribute, this method is obsolete. Check the attribute instead. <rdar://problem/8031714> llvm-svn: 142212	2011-10-17 18:22:52 +00:00
Chandler Carruth	0334d27ec8	Delete a dead member. Dunno if this was ever used, but the current code directly manipulates the weights inside of the BranchProbabilityInfo that is passed in. llvm-svn: 142163	2011-10-16 22:27:54 +00:00
Andrew Trick	0ef2965563	Fix SCEVExpander assert during LSR: "argument of incompatible type". Just because we're dealing with a GEP doesn't mean we can assert the SCEV has a pointer type. The fix is simply to ignore the SCEV pointer type, which we really didn't need. Fixes PR11138 webkit crash. llvm-svn: 142058	2011-10-15 06:19:55 +00:00
Nick Lewycky	610fc64d78	An instruction's operands aren't necessarily instructions or constants. They could be arguments, for example. No testcase because this is a bug-fix broken out of a larger optimization patch. llvm-svn: 141951	2011-10-14 09:38:46 +00:00
Eli Friedman	e702d68556	Enhance the memdep interface so that users can tell the difference between a dependency which cannot be calculated and a path reaching the entry point of the function. This patch introduces isNonFuncLocal, which replaces isUnknown in some cases. Patch by Xiaoyi Guo. llvm-svn: 141896	2011-10-13 22:14:57 +00:00
Andrew Trick	923129b028	Reapply r141870, SCEV expansion of post-inc. Speculatively reapply to see if this test case still crashes on linux. I may have fixed it in my last checkin. llvm-svn: 141895	2011-10-13 21:55:29 +00:00
Andrew Trick	03cd1b6a5d	Fix memory corruption I introduced a few checkins ago. Self-review easily caught this obvious bug. llvm-svn: 141880	2011-10-13 18:49:23 +00:00
Andrew Trick	109f7dbd1e	Revert r141870. The test case crashes on linux with data corruption. A deeper issue was exposed. llvm-svn: 141873	2011-10-13 17:58:24 +00:00
Andrew Trick	05d7cb17d5	LSR: Reuse the post-inc expansion of expressions. This avoids unnecessary expansion of expressions and allows the SCEV expander to work on expression DAGs, not just trees. Fixes PR11090. llvm-svn: 141870	2011-10-13 17:31:47 +00:00
Andrew Trick	38d4b87695	SCEV: Rewrite TrandformForPostIncUse to handle expression DAGs, not just expression trees. Partially fixes PR11090. Test case will be with the full fix. llvm-svn: 141868	2011-10-13 17:21:09 +00:00
Andrew Trick	5286636188	Slightly more useful tracing. llvm-svn: 141867	2011-10-13 17:06:38 +00:00
Eric Christopher	57c57a3260	Add a new wrapper node for a DILexicalBlock that encapsulates it and a file. Since it should only be used when necessary propagate it through the backend code generation and tweak testcases accordingly. This helps with code like in clang's test/CodeGen/debug-info-line.c where we have multiple #line directives within a single lexical block and want to generate only a single block that contains each file change. Part of rdar://10246360 llvm-svn: 141729	2011-10-11 22:59:11 +00:00
Andrew Trick	d36852e6b1	Move replaceCongruentIVs into SCEVExapander and bias toward "expanded" IVs. Indvars previously chose randomly between congruent IVs. Now it will bias the decision toward IVs that SCEVExpander likes to create. This was not done to fix any problem, it's just a welcome side effect of factoring code. llvm-svn: 141633	2011-10-11 02:28:51 +00:00
Andrew Trick	430029d79a	Add an extra safety check in front of the optimization in r141442. llvm-svn: 141470	2011-10-08 02:16:39 +00:00
Andrew Trick	75743b069e	LSR should only reuse phis that match its formula. Fixes rdar://problem/5064068 llvm-svn: 141442	2011-10-07 23:46:21 +00:00
Eli Friedman	4d63ca106a	Remove the old atomic instrinsics. autoupgrade functionality is included with this patch. llvm-svn: 141333	2011-10-06 23:20:49 +00:00
Andrew Trick	671f73b951	Fixes PR11070 - assert in SCEV getConstantEvolvingPHIOperands. llvm-svn: 141219	2011-10-05 22:06:53 +00:00
Andrew Trick	94a7b27585	Typo. Thanks Bob. llvm-svn: 141188	2011-10-05 16:52:28 +00:00
Chandler Carruth	9fc5856d54	Fix a broken assert found by -Wparentheses. llvm-svn: 141168	2011-10-05 07:02:23 +00:00
Andrew Trick	79a14ca34e	Fix disabled SCEV analysis caused r141161 and add unit test. I noticed during self-review that my previous checkin disabled some analysis. Even with the reenabled analysis the test case runs in about 5ms. Without the fix, it will take several minutes at least. llvm-svn: 141164	2011-10-05 05:58:49 +00:00
Andrew Trick	5789485111	Avoid exponential recursion in SCEV getConstantEvolvingPHI and EvaluateExpression. Note to compiler writers: never recurse on multiple instruction operands without memoization. Fixes rdar://10187945. Was taking 45s, now taking 5ms. llvm-svn: 141161	2011-10-05 03:25:31 +00:00
Nick Lewycky	6833d72fc0	The product of two chrec's can always be represented as a chrec. llvm-svn: 141066	2011-10-04 06:51:26 +00:00
Nick Lewycky	4898eef762	Reapply r140979 with fix! We never did get a testcase, but careful review of the logic by David Meyer revealed this bug. llvm-svn: 140992	2011-10-03 07:10:45 +00:00
Nick Lewycky	79fec8116f	Revert r140979 due to reports of bootstrap failure. llvm-svn: 140980	2011-10-03 05:14:59 +00:00
Nick Lewycky	a760a29395	Add one more case we compute a max trip count. llvm-svn: 140979	2011-10-03 01:03:57 +00:00
Andrew Trick	0489c5410d	Inlining and unrolling heuristics should be aware of free truncs. We want heuristics to be based on accurate data, but more importantly we don't want llvm to behave randomly. A benign trunc inserted by an upstream pass should not cause a wild swings in optimization level. See PR11034. It's a general problem with threshold-based heuristics, but we can make it less bad. llvm-svn: 140919	2011-10-01 01:39:05 +00:00
Andrew Trick	a1161d94f5	whitespace llvm-svn: 140916	2011-10-01 01:27:56 +00:00
Andrew Trick	50915b5136	indvars: generalize SCEV getPreStartForSignExtend. Handle general Add expressions to avoid leaving around redundant 32-bit IVs. llvm-svn: 140701	2011-09-28 17:02:54 +00:00
Eli Friedman	f4f4a75d2b	PR10628: Fix getModRefInfo so it queries the underlying alias() implementation correctly while checking nocapture calls. llvm-svn: 140666	2011-09-28 00:34:27 +00:00

1 2 3 4 5 ...

4020 Commits