llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 16:33:37 +01:00

Author	SHA1	Message	Date
Duncan Sands	7c3f34d524	Add an m_Div pattern for matching either a udiv or an sdiv and use it to simplify the "(X/Y)*Y->X when the division is exact" transform. llvm-svn: 125004	2011-02-07 09:36:32 +00:00
Chris Lattner	1c1b342a62	teach instsimplify to transform (X / Y) * Y to X when the div is an exact udiv. llvm-svn: 124994	2011-02-06 22:05:31 +00:00
Eric Christopher	b81307b728	Remove premature optimization that avoided calculating argument weights if we weren't going to inline the function. The rest of the code using this was removed. Fixes PR9154. llvm-svn: 124991	2011-02-06 21:27:46 +00:00
Anders Carlsson	61133e38a9	Simplify test, as suggested by Chris. llvm-svn: 124990	2011-02-06 20:22:49 +00:00
Anders Carlsson	1eeebf1c22	When loading from a constant, fold inttoptr if the integer type and the resulting pointer type both have the same size. llvm-svn: 124987	2011-02-06 20:11:56 +00:00
Anders Carlsson	96a35fc26e	Fix another warning. llvm-svn: 124961	2011-02-05 18:33:43 +00:00
Eric Christopher	6dbf0c6bbe	Fix cut and paste error spotted by Jakob. llvm-svn: 124930	2011-02-05 02:48:47 +00:00
Eric Christopher	ddc2157034	Rewrite how the indirect call bonus is handled. This now works by: a) Making it a per call site bonus for functions that we can move from indirect to direct calls. b) Reduces the bonus from 500 to 100 per call site. c) Subtracts the size of the possible newly inlineable call from the bonus to only add a bonus if we can inline a small function to devirtualize it. Also changes the bonus from a positive that's subtracted to a negative that's added. Fixes the remainder of rdar://8546196 by reducing the object file size after inlining by 84%. llvm-svn: 124916	2011-02-05 00:49:15 +00:00
Duncan Sands	fc33df78c1	Improve threading of comparisons over select instructions (spotted by my auto-simplifier). This has a big impact on Ada code, but not much else. Unfortunately the impact is mostly negative! This is due to PR9004 (aka SCCP failing to resolve conditional branch conditions in the destination blocks of the branch), in which simple correlated expressions are not resolved but complicated ones are, so simplifying has a bad effect! llvm-svn: 124788	2011-02-03 09:37:39 +00:00
Devang Patel	2fef292729	Fix typo in comment. llvm-svn: 124759	2011-02-03 00:13:47 +00:00
Devang Patel	71b1fadf20	Add support to describe template value parameter in debug info. llvm-svn: 124755	2011-02-02 22:35:53 +00:00
Devang Patel	89455dc7cd	Add support to describe template parameter type in debug info. llvm-svn: 124752	2011-02-02 21:38:25 +00:00
Duncan Sands	7eecb72021	Reenable the transform "(X*Y)/Y->X" when the multiplication is known not to overflow (nsw flag), which was disabled because it breaks 254.gap. I have informed the GAP authors of the mistake in their code, and arranged for the testsuite to use -fwrapv when compiling this benchmark. llvm-svn: 124746	2011-02-02 20:52:00 +00:00
Duncan Sands	c03dbe4b1c	Add a m_Undef pattern for convenience. This is so that code that uses pattern matching can also pattern match undef, creating a more uniform style. llvm-svn: 124657	2011-02-01 09:06:20 +00:00
Duncan Sands	659237307a	Add a m_SignBit pattern for convenience. llvm-svn: 124656	2011-02-01 08:50:33 +00:00
Duncan Sands	06e82c76ee	Have m_One also match constant vectors for which every element is 1. llvm-svn: 124655	2011-02-01 08:39:12 +00:00
Eric Christopher	f8b2388751	Reapply 124275 since the Dragonegg failure was unreproducible. llvm-svn: 124641	2011-02-01 01:16:32 +00:00
Duncan Sands	987c8bc759	Commit 124487 broke 254.gap. See if disabling the part that might be triggered by PR9088 fixes things. llvm-svn: 124561	2011-01-30 18:24:20 +00:00
Duncan Sands	ac01c21937	Transform (X/Y)*Y into X if the division is exact. Instcombine already knows how to do this and more, but would only do it if X/Y had only one use. Spotted as the most common missed simplification in SPEC by my auto-simplifier, now that it knows about nuw/nsw/exact flags. This removes a bunch of multiplications from 447.dealII and 483.xalancbmk. It also removes a lot from tramp3d-v4, which results in much more inlining. llvm-svn: 124560	2011-01-30 18:03:50 +00:00
Nick Lewycky	67acf52b2e	Fix comment. llvm-svn: 124544	2011-01-29 19:55:23 +00:00
Frits van Bommel	92dc04df67	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Duncan Sands	0587f785bf	Fix typo: should have been testing that X was odd, not V. llvm-svn: 124533	2011-01-29 13:27:00 +00:00
Andrew Trick	72f17d97f3	Implementation of path profiling. Modified patch by Adam Preuss. This builds on the existing framework for block tracing, edge profiling and optimal edge profiling. See -help-hidden for new flags. For documentation, see the technical report "Implementation of Path Profiling..." in llvm.org/pubs. llvm-svn: 124515	2011-01-29 01:09:53 +00:00
Duncan Sands	07617615f4	This dyn_cast should be a cast. Pointed out by Frits van Bommel. llvm-svn: 124497	2011-01-28 18:53:08 +00:00
Duncan Sands	e788a04c09	Thread divisions over selects and phis. This doesn't fire much and has basically zero effect on the testsuite (it improves two Ada testcases). llvm-svn: 124496	2011-01-28 18:50:50 +00:00
Duncan Sands	1a18d8df96	My auto-simplifier noticed that ((X/Y)Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (ZY)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "ZY" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Eric Christopher	54f709d970	Temporarily revert 124275 to see if it brings the dragonegg buildbot back. llvm-svn: 124312	2011-01-26 19:40:31 +00:00
Duncan Sands	803522ec6f	APInt has a method for determining whether a number is a power of 2 which is more efficient than countPopulation - use it. llvm-svn: 124283	2011-01-26 08:44:16 +00:00
Nick Lewycky	fc7a74c9a0	Fix memory corruption. If one of the SCEV creation functions calls another but doesn't return immediately after then the insert position in UniqueSCEVs will be out of date. No test because this is a memory corruption issue. Fixes PR9051! llvm-svn: 124282	2011-01-26 08:40:22 +00:00
Eric Christopher	cb32adbd3f	Separate out the constant bonus from the size reduction metrics. Rework a few loops accordingly. Should be no functional change. This is a step for more accurate cost/benefit analysis of devirt/inlining bonuses. llvm-svn: 124275	2011-01-26 02:58:39 +00:00
Eric Christopher	45e584b1b7	Coding style formatting changes. llvm-svn: 124260	2011-01-26 01:09:59 +00:00
Duncan Sands	017a3d76f7	In which I discover that zero+zero is zero, d'oh! llvm-svn: 124188	2011-01-25 15:14:15 +00:00
Duncan Sands	4d8a541ae2	See if this fixes llvm-gcc bootstrap. llvm-svn: 124184	2011-01-25 12:15:09 +00:00
Duncan Sands	92b081bd42	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x \| 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X\|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Eric Christopher	8c76a85e48	Reorganize this so that the early exit and special cases come early rather than interspersed. No functional change. llvm-svn: 124168	2011-01-25 01:34:31 +00:00
Dan Gohman	db0dc19c04	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several off GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Chris Lattner	1a73dcfbdd	fix PR8928 by clearing a stale map, patch by Jakub Staszak! llvm-svn: 124132	2011-01-24 18:36:51 +00:00
Dan Gohman	308677c046	Add a comment. llvm-svn: 124126	2011-01-24 17:54:18 +00:00
Nick Lewycky	13a2b8281f	Simplify some code with no functionality change. Make the test a lot more robust against smarter optimizations, using the power of FileCheck. llvm-svn: 124081	2011-01-23 20:06:05 +00:00
Ted Kremenek	880c19c032	Null initialize a few variables flagged by clang's -Wuninitialized-experimental warning. While these don't look like real bugs, clang's -Wuninitialized-experimental analysis is stricter than GCC's, and these fixes have the benefit of being general nice cleanups. llvm-svn: 124073	2011-01-23 17:05:06 +00:00
Nick Lewycky	2503c9f9c8	Use value ranges to fold ext(trunc) in SCEV when possible. llvm-svn: 124062	2011-01-23 06:20:19 +00:00
Nick Lewycky	4440e5815b	Have SCEV turn sext(x) into zext(x) when x is s>= 0. This applies many times in "make check" alone. llvm-svn: 124046	2011-01-22 22:06:21 +00:00
Eric Christopher	ee9652eceb	Add a FIXME explaining the move to a single indirect call bonus per function that we can change from indirect to direct. llvm-svn: 124045	2011-01-22 21:56:53 +00:00
Eric Christopher	5da99702ec	Only apply the devirtualization bonus once instead of per-call site in the target function. Fixes part of rdar://8546196 llvm-svn: 124044	2011-01-22 21:17:33 +00:00
Duncan Sands	1faa8712c9	At -O123 the early-cse pass is run before instcombine has run. According to my auto-simplier the transform most missed by early-cse is (zext X) != 0 -> X != 0. This patch adds this transform and some related logic to InstructionSimplify and removes some of the logic from instcombine (unfortunately not all because there are several situations in which instcombine can improve things by making new instructions, whereas instsimplify is not allowed to do this). At -O2 this often results in more than 15% more simplifications by early-cse, and results in hundreds of lines of bitcode being eliminated from the testsuite. I did see some small negative effects in the testsuite, for example a few additional instructions in three programs. One program, 483.xalancbmk, got an additional 35 instructions, which seems to be due to a function getting an additional instruction and then being inlined all over the place. llvm-svn: 123911	2011-01-20 13:21:55 +00:00
Nick Lewycky	51c13384f5	Similarly, analyze truncate through multiply. llvm-svn: 123842	2011-01-19 18:56:00 +00:00
Nick Lewycky	9867e58096	Add a missed SCEV fold that is required to continue analyzing the IR produced by indvars through the scev expander. trunc(add x, y) --> add(trunc x, y). Currently SCEV largely folds the other way which is probably wrong, but preserved to minimize churn. Instcombine doesn't do this fold either, demonstrating a missed optz'n opportunity on code doing add+trunc+add. llvm-svn: 123838	2011-01-19 16:59:46 +00:00
Nick Lewycky	5a538b62ca	Add a missing SCEV simplification sext(zext x) --> zext x. llvm-svn: 123832	2011-01-19 15:56:12 +00:00
Dan Gohman	df668227fb	Teach BasicAA to return PartialAlias in cases where both pointers are pointing to the same object, one pointer is accessing the entire object, and the other is access has a non-zero size. This prevents TBAA from kicking in and saying NoAlias in such cases. llvm-svn: 123775	2011-01-18 21:16:06 +00:00
Duncan Sands	732cb58b61	For completeness, generalize the (X + Y) - Y -> X transform and add X - (X + 1) -> -1. These were not recommended by my auto-simplifier since they don't fire often enough. However they do fire from time to time, for example they remove one subtraction from the final bitcode for 483.xalancbmk. llvm-svn: 123755	2011-01-18 11:50:19 +00:00

1 2 3 4 5 ...

3704 Commits