llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 23:42:52 +01:00

Author	SHA1	Message	Date
Arnold Schwaighofer	d0c38e0586	SimplifyCFG: If convert single conditional stores This resurrects r179957, but adds code that makes sure we don't touch atomic/volatile stores: This transformation will transform a conditional store with a preceeding uncondtional store to the same location: a[i] = may-alias with a[i] load if (cond) a[i] = Y into an unconditional store. a[i] = X may-alias with a[i] load tmp = cond ? Y : X; a[i] = tmp We assume that on average the cost of a mispredicted branch is going to be higher than the cost of a second store to the same location, and that the secondary benefits of creating a bigger basic block for other optimizations to work on outway the potential case where the branch would be correctly predicted and the cost of the executing the second store would be noticably reflected in performance. hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With this change we are on par with gcc's performance (gcc also performs this transformation). There was a 1.2 % performance improvement on a ARM swift chip. Other tests in the test-suite+external seem to be mostly uninfluenced in my experiments: This optimization was triggered on 41 tests such that the executable was different before/after the patch. Only 1 out of the 40 tests (dealII) was reproducable below 100% (by about .4%). Given that hmmer benefits so much I believe this to be a fair trade off. llvm-svn: 180731	2013-04-29 21:28:24 +00:00
Arnold Schwaighofer	76cf4c753d	Revert "SimplifyCFG: If convert single conditional stores" There is the temptation to make this tranform dependent on target information as it is not going to be beneficial on all (sub)targets. Therefore, we should probably do this in MI Early-Ifconversion. This reverts commit r179957. Original commit message: "SimplifyCFG: If convert single conditional stores This transformation will transform a conditional store with a preceeding uncondtional store to the same location: a[i] = may-alias with a[i] load if (cond) a[i] = Y into an unconditional store. a[i] = X may-alias with a[i] load tmp = cond ? Y : X; a[i] = tmp We assume that on average the cost of a mispredicted branch is going to be higher than the cost of a second store to the same location, and that the secondary benefits of creating a bigger basic block for other optimizations to work on outway the potential case were the branch would be correctly predicted and the cost of the executing the second store would be noticably reflected in performance. hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With this change we are on par with gcc's performance (gcc also performs this transformation). There was a 1.2 % performance improvement on a ARM swift chip. Other tests in the test-suite+external seem to be mostly uninfluenced in my experiments: This optimization was triggered on 41 tests such that the executable was different before/after the patch. Only 1 out of the 40 tests (dealII) was reproducable below 100% (by about .4%). Given that hmmer benefits so much I believe this to be a fair trade off. I am going to watch performance numbers across the builtbots and will revert this if anything unexpected comes up." llvm-svn: 179980	2013-04-21 13:09:04 +00:00
Arnold Schwaighofer	a5ec409858	SimplifyCFG: If convert single conditional stores This transformation will transform a conditional store with a preceeding uncondtional store to the same location: a[i] = may-alias with a[i] load if (cond) a[i] = Y into an unconditional store. a[i] = X may-alias with a[i] load tmp = cond ? Y : X; a[i] = tmp We assume that on average the cost of a mispredicted branch is going to be higher than the cost of a second store to the same location, and that the secondary benefits of creating a bigger basic block for other optimizations to work on outway the potential case were the branch would be correctly predicted and the cost of the executing the second store would be noticably reflected in performance. hmmer's execution time improves by 30% on an imac12,2 on ref data sets. With this change we are on par with gcc's performance (gcc also performs this transformation). There was a 1.2 % performance improvement on a ARM swift chip. Other tests in the test-suite+external seem to be mostly uninfluenced in my experiments: This optimization was triggered on 41 tests such that the executable was different before/after the patch. Only 1 out of the 40 tests (dealII) was reproducable below 100% (by about .4%). Given that hmmer benefits so much I believe this to be a fair trade off. I am going to watch performance numbers across the builtbots and will revert this if anything unexpected comes up. llvm-svn: 179957	2013-04-20 21:42:09 +00:00
Hans Wennborg	9311589e8d	simplifycfg: Fix integer overflow converting switch into icmp. If a switch instruction has a case for every possible value of its type, with the same successor, SimplifyCFG would replace it with an icmp ult, but the computation of the bound overflows in that case, which inverts the test. Patch by Jed Davis! llvm-svn: 179587	2013-04-16 08:35:36 +00:00
Michael Gottesman	66864f3c32	Converted 8x tests of SimplifyCFG to use FileCheck instead of grep. llvm-svn: 179087	2013-04-09 05:18:53 +00:00
Bill Wendling	a225da9719	Don't remove a landing pad if the invoke requires a table entry. An invoke may require a table entry. For instance, when the function it calls is expected to throw. <rdar://problem/13360379> llvm-svn: 176827	2013-03-11 20:53:00 +00:00
Andrew Trick	861b80b6d8	SimplifyCFG fix for volatile load/store. Fixes rdar:13349374. Volatile loads and stores need to be preserved even if the language standard says they are undefined. "volatile" in this context means "get out of the way compiler, let my platform handle it". Additionally, this is the only way I know of with llvm to write to the first page (when hardware allows) without dropping to assembly. llvm-svn: 176599	2013-03-07 01:03:35 +00:00
Bill Wendling	db672f1bc8	Use references to attribute groups on the call/invoke instructions. Listing all of the attributes for the callee of a call/invoke instruction is way too much and makes the IR unreadable. Use references to attributes instead. llvm-svn: 175877	2013-02-22 09:09:42 +00:00
Michael Gottesman	bb4501dec6	Filecheckized 2x tests in SimplifyCFG and removed their date prefix to fit with current llvm style for test names. llvm-svn: 174011	2013-01-31 01:04:23 +00:00
Chandler Carruth	ce01d44307	Re-revert r173342, without losing the compile time improvements, flat out bug fixes, or functionality preserving refactorings. llvm-svn: 173610	2013-01-27 06:42:03 +00:00
Chandler Carruth	0e2f6c4f00	Switch this code away from Value::isUsedInBasicBlock. That code either loops over instructions in the basic block or the use-def list of the value, neither of which are really efficient when repeatedly querying about values in the same basic block. What's more, we already know that the CondBB is small, and so we can do a much more efficient test by counting the uses in CondBB, and seeing if those account for all of the uses. Finally, we shouldn't blanket fail on any such instruction, instead we should conservatively assume that those instructions are part of the cost. Note that this actually fixes a bug in the pass because isUsedInBasicBlock has a really terrible bug in it. I'll fix that in my next commit, but the fix for it would make this code suddenly take the compile time hit I thought it already was taking, so I wanted to go ahead and migrate this code to a faster & better pattern. The bug in isUsedInBasicBlock was also causing other tests to test the wrong thing entirely: for example we weren't actually disabling speculation for floating point operations as intended (and tested), but the test passed because we failed to speculate them due to the isUsedInBasicBlock failure. llvm-svn: 173417	2013-01-25 05:40:09 +00:00
Benjamin Kramer	78bb80941e	Reapply chandlerc's r173342 now that the miscompile it was triggering is fixed. Original commit message: Plug TTI into the speculation logic, giving it a real cost interface that can be specialized by targets. The goal here is not to be more aggressive, but to just be more accurate with very obvious cases. There are instructions which are known to be truly free and which were not being modeled as such in this code -- see the regression test which is distilled from an inner loop of zlib. Everywhere the TTI cost model is insufficiently conservative I've added explicit checks with FIXME comments to go add proper modelling of these cost factors. If this causes regressions, the likely solution is to make TTI even more conservative in its cost estimates, but test cases will help here. llvm-svn: 173357	2013-01-24 16:44:25 +00:00
Chandler Carruth	a1525efea7	Revert r173342 temporarily. It appears to cause a very late miscompile of stage2 in a bootstrap. Still investigating.... llvm-svn: 173343	2013-01-24 13:24:24 +00:00
Chandler Carruth	b10553145f	Plug TTI into the speculation logic, giving it a real cost interface that can be specialized by targets. The goal here is not to be more aggressive, but to just be more accurate with very obvious cases. There are instructions which are known to be truly free and which were not being modeled as such in this code -- see the regression test which is distilled from an inner loop of zlib. Everywhere the TTI cost model is insufficiently conservative I've added explicit checks with FIXME comments to go add proper modelling of these cost factors. If this causes regressions, the likely solution is to make TTI even more conservative in its cost estimates, but test cases will help here. llvm-svn: 173342	2013-01-24 12:39:29 +00:00
Chandler Carruth	fd4cbdd2c2	Address a large chunk of this FIXME by accumulating the cost for unfolded constant expressions rather than checking each one independently. llvm-svn: 173341	2013-01-24 12:05:17 +00:00
Chandler Carruth	bc67fc9f93	Switch the constant expression speculation cost evaluation away from a cost fuction that seems both a bit ad-hoc and also poorly suited to evaluating constant expressions. Notably, it is missing any support for trivial expressions such as 'inttoptr'. I could fix this routine, but it isn't clear to me all of the constraints its other users are operating under. The core protection that seems relevant here is avoiding the formation of a select instruction wich a further chain of select operations in a constant expression operand. Just explicitly encode that constraint. Also, update the comments and organization here to make it clear where this needs to go -- this should be driven off of real cost measurements which take into account the number of constants expressions and the depth of the constant expression tree. llvm-svn: 173340	2013-01-24 11:53:01 +00:00
Dmitri Gribenko	fa45287455	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. My previous regex was not good enough to find these. llvm-svn: 171343	2013-01-01 13:57:25 +00:00
Dmitri Gribenko	e3769d450b	Tests: rewrite 'opt ... %s' to 'opt ... < %s' so that opt does not emit a ModuleID This is done to avoid odd test failures, like the one fixed in r171243. llvm-svn: 171246	2012-12-30 01:28:40 +00:00
Evan Cheng	af9b73ef6f	Fix logic to determine whether to turn a switch into a lookup table. When the tables cannot fit in registers (i.e. bitmap), do not emit the table if it's using an illegal type. rdar://12779436 llvm-svn: 168970	2012-11-30 02:02:42 +00:00
Hans Wennborg	d166484584	Only do switch-to-lookup table transformation when TargetTransformInfo is available. llvm-svn: 167552	2012-11-07 21:35:12 +00:00
Hans Wennborg	7dd7657cec	Fix bad test IR in switch_to_lookup_table.ll llvm-svn: 167543	2012-11-07 18:38:24 +00:00
Hans Wennborg	4c6d01059c	Remove fixme about unreachable cases from SwitchToLookupTable SimplifyCFG will have removed those cases for us. llvm-svn: 167132	2012-10-31 16:15:25 +00:00
Hans Wennborg	d162380e59	Do simple constant propagation in lookup table formation for switches By propagating the value for the switch condition, LLVM can now build lookup tables for code such as: switch (x) { case 1: return 5; case 2: return 42; case 3: case 4: case 5: return x - 123; default: return 123; } Given that x is known for each case, "x - 123" becomes a constant for cases 3, 4, and 5. llvm-svn: 167115	2012-10-31 13:42:45 +00:00
Hans Wennborg	885eff267a	switch_to_lookup_table.ll: Remove some unnecessary lines, comments, function attributes, etc. llvm-svn: 167016	2012-10-30 15:11:52 +00:00
Hans Wennborg	40eb1b4055	Use TargetTransformInfo to control switch-to-lookup table transformation When the switch-to-lookup tables transform landed in SimplifyCFG, it was pointed out that this could be inappropriate for some targets. Since there was no way at the time for the pass to know anything about the target, an awkward reverse-transform was added in CodeGenPrepare that turned lookup tables back into switches for some targets. This patch uses the new TargetTransformInfo to determine if a switch should be transformed, and removes CodeGenPrepare::ConvertLoadToSwitch. llvm-svn: 167011	2012-10-30 11:23:25 +00:00
Jakob Stoklund Olesen	d69b3afa22	Also optimize large switch statements. The isValueEqualityComparison() guard at the top of SimplifySwitch() only applies to some of the possible transformations. The newer transformations work just fine on large switches, and the check on predecessor count is nonsensical. llvm-svn: 166710	2012-10-25 18:51:15 +00:00
Benjamin Kramer	bbb006ad7d	SimplifyCFG: Enhance the "remove CFG edge that leads to null pointer dereference" optimization to also handle instructions with multiple uses. We conservatively only check the first use to avoid walking long use chains. This catches the common case of having both a load and a store to a pointer supplied by a PHI node. llvm-svn: 165232	2012-10-04 16:11:49 +00:00
Benjamin Kramer	54a33840fc	SimplifyCFG: Don't crash when forming a switch bitmap with an undef default value. Fixes PR13985. llvm-svn: 164934	2012-10-01 11:31:48 +00:00
Benjamin Kramer	cb89947f87	Fix a integer overflow in SimplifyCFG's look up table formation logic. If the width is very large it gets truncated from uint64_t to uint32_t when passed to TD->fitsInLegalInteger. The truncated value can fit in a register. This manifested in massive memory usage or crashes (PR13946). llvm-svn: 164784	2012-09-27 18:29:58 +00:00
Hans Wennborg	e1a73f6ca3	Address Duncan's comments on r164684: - Put statistics in alphabetical order - Don't use getZextValue when building TableInt, just use APInts - Introduce Create{Z,S}ExtOrTrunc in IRBuilder. llvm-svn: 164696	2012-09-26 14:01:53 +00:00
Hans Wennborg	d3f44548bf	SimplifyCFG: Make the switch-to-lookup table transformation store the tables in bitmaps when they fit in a target-legal register. This saves some space, and it also allows for building tables that would otherwise be deemed too sparse. One interesting case that this hits is example 7 from http://blog.regehr.org/archives/320. We currently generate good code for this when lowering the switch to the selection DAG: we build a bitmask to decide whether to jump to one block or the other. My patch will result in the same bitmask, but it removes the need for the jump, as the return value can just be retrieved from the mask. llvm-svn: 164684	2012-09-26 09:44:49 +00:00
Manman Ren	43bb9c6bc6	SimplifyCFG: sink common codes from IF, ELSE blocks down to END block. We already have HoistThenElseCodeToIf, this patch implements SinkThenElseCodeToEnd. When END block has only two predecessors and each predecessor terminates with unconditional branches, we compare instructions in IF and ELSE blocks backwards and check whether we can sink the common instructions down. rdar://12191395 llvm-svn: 164325	2012-09-20 22:37:36 +00:00
Hans Wennborg	d260d5d0b8	SimplifyCFG: Don't generate invalid code for switch used to initialize two variables where the first variable is returned and the second ignored. I don't think this occurs in practice (other passes should have cleaned up the unused phi node), but it should still be handled correctly. Also make the logic for determining if we should return early less sketchy. llvm-svn: 164225	2012-09-19 14:24:21 +00:00
Manman Ren	e628d655cf	PGO: preserve branch-weight metadata when simplifying Switch to a sub, an icmp and a conditional branch; also when removing dead cases from a switch. llvm-svn: 164084	2012-09-18 00:47:33 +00:00
Manman Ren	77511d5a11	PGO: preserve branch-weight metadata when simplifying Switch Hanlde the case when we split the default edge if the default target has "icmp" and unconditinal branch. llvm-svn: 164076	2012-09-17 23:07:43 +00:00
Manman Ren	5cee74c6a1	PGO: preserve branch-weight metadata when simplifying SwitchOnSelect. llvm-svn: 164068	2012-09-17 22:28:55 +00:00
Manman Ren	d4fb9f6c48	PGO: preserve branch-weight metadata when simplifying two branches with a common destination in SimplifyCondBranchToCondBranch. llvm-svn: 164054	2012-09-17 21:30:40 +00:00
Manman Ren	5e4c29b751	PGO: preserve branch-weight metadata when simplifying two branches with a common destination. Updated previous implementation to fix a case not covered: // PBI: br i1 %x, TrueDest, BB // BI: br i1 %y, TrueDest, FalseDest The other case was handled correctly. // PBI: br i1 %x, BB, FalseDest // BI: br i1 %y, TrueDest, FalseDest Also tried to use 64-bit arithmetic instead of APInt with scale to simplify the computation. Let me know if you have other opinions about this. llvm-svn: 163954	2012-09-15 00:39:57 +00:00
Manman Ren	be0144b3f7	PGO: preserve branch-weight metadata when simplifying a switch with a single case to a conditional branch and when removing dead cases. llvm-svn: 163942	2012-09-14 21:53:06 +00:00
Manman Ren	5d2410ad77	PGO: preserve branch-weight metadata when merging two switches where the default target of the first switch is not the basic block the second switch is in (PredDefault != BB). llvm-svn: 163916	2012-09-14 17:29:56 +00:00
Manman Ren	9949a09bfb	PGO: preserve branch-weight metadata when removing a case which jumps to the default target. llvm-svn: 163724	2012-09-12 17:04:11 +00:00
Manman Ren	d492bca8a6	SimplifyCFG: preserve branch-weight metadata when creating a new switch from a pair of switch/branch where both depend on the value of the same variable and the default case of the first switch/branch goes to the second switch/branch. Code clean up and fixed a few issues: 1> handling the case where some cases of the 2nd switch are invalidated 2> correctly calculate the weight for the 2nd switch when it is a conditional eq Testing case is modified from Alastair's original patch. llvm-svn: 163635	2012-09-11 17:43:35 +00:00
Andrew Trick	822cf12e93	Remove an incorrect assert during branch weight propagation. Patch and test case by Alastair Murray! llvm-svn: 163437	2012-09-08 00:07:26 +00:00
Hans Wennborg	533fd843bc	Fix switch_to_lookup_table.ll test from r163302. The lookup tables did not get built in a deterministic order. This makes them get built in the order that the corresponding phi nodes were found. llvm-svn: 163305	2012-09-06 10:10:35 +00:00
Hans Wennborg	6b07d4afc7	Build lookup tables for switches (PR884) This adds a transformation to SimplifyCFG that attemps to turn switch instructions into loads from lookup tables. It works on switches that are only used to initialize one or more phi nodes in a common successor basic block, for example: int f(int x) { switch (x) { case 0: return 5; case 1: return 4; case 2: return -2; case 5: return 7; case 6: return 9; default: return 42; } This speeds up the code by removing the hard-to-predict jump, and reduces code size by removing the code for the jump targets. llvm-svn: 163302	2012-09-06 09:43:28 +00:00
Eli Friedman	449495cd62	The normal edge of an invoke is not allowed to branch to a block with a landingpad. Enforce it in the verifier, and fix the regression tests to match. llvm-svn: 161697	2012-08-10 20:55:20 +00:00
Nuno Lopes	97c381ea93	fix PR13339 (remove the predecessor from the unwind BB when removing an invoke) llvm-svn: 160325	2012-07-16 22:49:40 +00:00
Eric Christopher	185293d560	Revert "IntRange:" as it appears to be breaking self hosting. This reverts commit b2833d9dcba88c6f0520cad760619200adc0442c. llvm-svn: 159618	2012-07-02 23:22:21 +00:00
Chandler Carruth	5d3a0ce4e5	Fix the remaining TCL-style quotes found in the testsuite. This is another mechanical change accomplished though the power of terrible Perl scripts. I have manually switched some "s to 's to make escaping simpler. While I started this to fix tests that aren't run in all configurations, the massive number of tests is due to a really frustrating fragility of our testing infrastructure: things like 'grep -v', 'not grep', and 'expected failures' can mask broken tests all too easily. Essentially, I'm deeply disturbed that I can change the testsuite so radically without causing any change in results for most platforms. =/ llvm-svn: 159547	2012-07-02 19:09:46 +00:00
Nuno Lopes	e967ebe7bb	fix the regression I introduced in r159385 (it's necessary to update PHI nodes in unwind BB llvm-svn: 159534	2012-07-02 16:14:47 +00:00

1 2 3 4 5

223 Commits