llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-01 08:23:21 +01:00

Author	SHA1	Message	Date
Chris Lattner	02088f3ab8	Teach instsimplify some tricks about exact/nuw/nsw shifts. improve interfaces to instsimplify to take this info. llvm-svn: 125196	2011-02-09 17:15:04 +00:00
Eric Christopher	ddc2157034	Rewrite how the indirect call bonus is handled. This now works by: a) Making it a per call site bonus for functions that we can move from indirect to direct calls. b) Reduces the bonus from 500 to 100 per call site. c) Subtracts the size of the possible newly inlineable call from the bonus to only add a bonus if we can inline a small function to devirtualize it. Also changes the bonus from a positive that's subtracted to a negative that's added. Fixes the remainder of rdar://8546196 by reducing the object file size after inlining by 84%. llvm-svn: 124916	2011-02-05 00:49:15 +00:00
Devang Patel	71b1fadf20	Add support to describe template value parameter in debug info. llvm-svn: 124755	2011-02-02 22:35:53 +00:00
Devang Patel	89455dc7cd	Add support to describe template parameter type in debug info. llvm-svn: 124752	2011-02-02 21:38:25 +00:00
Eric Christopher	f8b2388751	Reapply 124275 since the Dragonegg failure was unreproducible. llvm-svn: 124641	2011-02-01 01:16:32 +00:00
Bill Wendling	481f64ab68	This is #included by .c files. Remove C++-style comments. llvm-svn: 124552	2011-01-29 21:54:26 +00:00
Frits van Bommel	92dc04df67	Move InstCombine's knowledge of fdiv to SimplifyInstruction(). llvm-svn: 124534	2011-01-29 15:26:31 +00:00
Andrew Trick	72f17d97f3	Implementation of path profiling. Modified patch by Adam Preuss. This builds on the existing framework for block tracing, edge profiling and optimal edge profiling. See -help-hidden for new flags. For documentation, see the technical report "Implementation of Path Profiling..." in llvm.org/pubs. llvm-svn: 124515	2011-01-29 01:09:53 +00:00
Duncan Sands	1a18d8df96	My auto-simplifier noticed that ((X/Y)*Y)/Y occurs several times in SPEC benchmarks, and that it can be simplified to X/Y. (In general you can only simplify (Z*Y)/Y to Z if the multiplication did not overflow; if Z has the form "X/Y" then this is the case). This patch implements that transform and moves some Div logic out of instcombine and into InstructionSimplify. Unfortunately instcombine gets in the way somewhat, since it likes to change (X/Y)*Y into X-(X rem Y), so I had to teach instcombine about this too. Finally, thanks to the NSW/NUW flags, sometimes we know directly that "Z*Y" does not overflow, because the flag says so, so I added that logic too. This eliminates a bunch of divisions and subtractions in 447.dealII, and has good effects on some other benchmarks too. It seems to have quite an effect on tramp3d-v4 but it's hard to say if it's good or bad because inlining decisions changed, resulting in massive changes all over. llvm-svn: 124487	2011-01-28 16:51:11 +00:00
Eric Christopher	54f709d970	Temporarily revert 124275 to see if it brings the dragonegg buildbot back. llvm-svn: 124312	2011-01-26 19:40:31 +00:00
Eric Christopher	cb32adbd3f	Separate out the constant bonus from the size reduction metrics. Rework a few loops accordingly. Should be no functional change. This is a step for more accurate cost/benefit analysis of devirt/inlining bonuses. llvm-svn: 124275	2011-01-26 02:58:39 +00:00
Duncan Sands	92b081bd42	According to my auto-simplifier the most common missed simplifications in optimized code are: (non-negative number)+(power-of-two) != 0 -> true and (x | 1) != 0 -> true Instcombine knows about the second one of course, but only does it if X|1 has only one use. These fire thousands of times in the testsuite. llvm-svn: 124183	2011-01-25 09:38:29 +00:00
Eric Christopher	4a1aea2da3	Perhaps a bit too much vertical whitespace. llvm-svn: 124148	2011-01-24 22:19:14 +00:00
Dan Gohman	db0dc19c04	Give GetUnderlyingObject a TargetData, to keep it in sync with BasicAA's DecomposeGEPExpression, which recently began using a TargetData. This fixes PR8968, though the testcase is awkward to reduce. Also, update several of GetUnderlyingObject's users which happen to have a TargetData handy to pass it in. llvm-svn: 124134	2011-01-24 18:53:32 +00:00
Cameron Zwarich	739742ce7d	Convert a std::vector to a SmallVector for another 5.4% speedup on domtree. llvm-svn: 124065	2011-01-23 06:54:22 +00:00
Cameron Zwarich	79cc467078	In the simpler version of the link-eval data structure that we use in dominator computation, the Ancestor field is always set to the Parent, so we can remove the explicit link entirely and merge the Parent and Ancestor fields. Instead of checking for whether an ancestor exists for a node or not, we simply check whether the node has already been processed. This is simpler if Compress is inlined into Eval, so I did that as well. This is about a 3% speedup running -domtree on test-suite + SPEC2000 & SPEC2006, but it also opens up some opportunities for further improvement. llvm-svn: 124061	2011-01-23 06:16:06 +00:00
Cameron Zwarich	f881ff46fe	Remove useless struct fields. llvm-svn: 124058	2011-01-23 05:11:18 +00:00
Cameron Zwarich	2f271f19aa	Remove friend declaration for removed function. llvm-svn: 124057	2011-01-23 04:54:34 +00:00
Cameron Zwarich	d857dd6882	Convert a std::vector to a SmallVector. llvm-svn: 124055	2011-01-23 04:30:59 +00:00
Cameron Zwarich	1a5627dd02	Simplify some code now that we've removed the more optimal (but slower) version of the link-eval data structure from dominator computation. llvm-svn: 124053	2011-01-23 04:13:53 +00:00
Cameron Zwarich	caca9a63e6	There is no point in verifying an analysis that is never updated. llvm-svn: 123743	2011-01-18 05:44:04 +00:00
Cameron Zwarich	62a9d4d454	Remove some now-unused DominanceFrontier methods. llvm-svn: 123726	2011-01-18 04:21:57 +00:00
Chris Lattner	75599bb566	remove the partial specialization pass. It is unmaintained and has bugs. llvm-svn: 123554	2011-01-16 00:27:10 +00:00
Duncan Sands	44c273d907	Move some shift transforms out of instcombine and into InstructionSimplify. While there, I noticed that the transform "undef >>a X -> undef" was wrong. For example if X is 2 then the top two bits must be equal, so the result can not be anything. I fixed this in the constant folder as well. Also, I made the transform for "X << undef" stronger: it now folds to undef always, even though X might be zero. This is in accordance with the LangRef, but I must admit that it is fairly aggressive. Also, I added "i32 X << 32 -> undef" following the LangRef and the constant folder, likewise fairly aggressive. llvm-svn: 123417	2011-01-14 00:37:45 +00:00
Tobias Grosser	189efecfed	Add single entry / single exit accessors. Add methods for accessing the (single) entry / exit edge of a region. If no such edge exists, null is returned. Both accessors return the start block of the corresponding edge. The edge can finally be formed by utilizing Region::getEntry() or Region::getExit(); Contributed by: Andreas Simbuerger <simbuerg@fim.uni-passau.de> llvm-svn: 123410	2011-01-13 23:18:04 +00:00
Jakob Stoklund Olesen	add3ef6ca2	Put the Dominator improvements back in. They were not the cause of bootstrap miscomparisons. llvm-svn: 123273	2011-01-11 21:23:09 +00:00
Jakob Stoklund Olesen	264d2e5c08	Speculatively revert the recent improvements to Dominators.h in an attempt to track down the gcc bootstrap miscompare. llvm-svn: 123254	2011-01-11 19:26:30 +00:00
Chris Lattner	8c0a70a30c	some comment improvements. llvm-svn: 123243	2011-01-11 17:11:59 +00:00
Chris Lattner	b6a67a9068	Step #2 to improve trip count analysis for loops like this: void f(int* begin, int* end) { std::fill(begin, end, 0); } which turns into a != exit expression where one pointer is strided and (thanks to step #1) known to not overflow, and the other is loop invariant. The observation here is that, though the IV is strided by 4 in this case, that the IV has to become equal to the end value. It cannot "miss" the end value by stepping over it, because if it did, the strided IV expression would eventually wrap around. Handle this by turning A != B into "A-B != 0" where the A-B part is known to be NUW. llvm-svn: 123131	2011-01-09 22:26:35 +00:00
Tobias Grosser	8dadf82d11	DominatorTree->print() now prints the status of the DFSNumbers correctly llvm-svn: 123120	2011-01-09 16:00:09 +00:00
Chris Lattner	71c356a8b0	fit in 80 cols llvm-svn: 123085	2011-01-08 20:53:41 +00:00
Chris Lattner	50aaa34f29	make this file properly self contained. llvm-svn: 123059	2011-01-08 08:19:49 +00:00
Owen Anderson	a52e5dbdfa	Stub out a new updating interface to AliasAnalysis, allowing stateful analyses to be informed when a pointer value has potentially become escaping. Implementations can choose to either fall back to conservative responses for that value, or may recompute their analysis to accommodate the change. llvm-svn: 122777	2011-01-03 21:38:41 +00:00
Nick Lewycky	4840065424	Add spliceFunction to the CallGraph interface. This allows users to efficiently update a callGraph when performing the common operation of splicing the body to a new function and updating all callers (such as via RAUW). No users yet, though this is intended for DeadArgumentElimination as part of PR8887. llvm-svn: 122728	2011-01-03 03:19:35 +00:00
Chris Lattner	e396e846b4	split dom frontier handling stuff out to its own DominanceFrontier header, so that Dominators.h is just domtree. Also prune #includes a bit. llvm-svn: 122714	2011-01-02 22:09:33 +00:00
Cameron Zwarich	482eeb4c8e	Remove an unused member function. llvm-svn: 122693	2011-01-02 12:37:22 +00:00
Cameron Zwarich	8444a578bb	Fix a typo in a variable name. llvm-svn: 122691	2011-01-02 12:17:10 +00:00
Cameron Zwarich	72a49f9271	Move a load into the only branch where it is used and eliminate a temporary. llvm-svn: 122690	2011-01-02 10:50:14 +00:00
Cameron Zwarich	0a0e69ca0d	Add the explanatory comment from r122680's commit message to the code itself. llvm-svn: 122689	2011-01-02 10:40:14 +00:00
Cameron Zwarich	e522fd8efe	Tidy up indentation. llvm-svn: 122688	2011-01-02 10:10:02 +00:00
Cameron Zwarich	7becf43554	Fix a typo, which should also fix the failure on llvm-x86_64-linux-checks. llvm-svn: 122687	2011-01-02 10:06:44 +00:00
Cameron Zwarich	25272921bb	Remove the #ifdef'd code for balancing the eval-link data structure. It doesn't compile, and everyone's tests have shown it to be slower in practice, even for quite large graphs. I also hope to do an optimization that is only correct with the simpler data structure, which would break this even further. llvm-svn: 122684	2011-01-02 07:53:49 +00:00
Cameron Zwarich	c8a0461c46	Speed up dominator computation some more by optimizing bucket processing. When naively implemented, the Lengauer-Tarjan algorithm requires a separate bucket for each vertex. However, this is unnecessary, because each vertex is only placed into a single bucket (that of its semidominator), and each vertex's bucket is processed before it is added to any bucket itself. Instead of using a bucket per vertex, we use a single array Buckets that has two purposes. Before the vertex V with DFS number i is processed, Buckets[i] stores the index of the first element in V's bucket. After V's bucket is processed, Buckets[i] stores the index of the next element in the bucket to which V now belongs, if any. Reading from the buckets can also be optimized. Instead of processing the bucket of V's parent at the end of processing V, we process the bucket of V itself at the beginning of processing V. This means that the case of the root vertex can be simplified somewhat. It also means that we don't need to look up the DFS number of the semidominator of every node in the bucket we are processing, since we know it is the current index being processed. This is a 6.5% speedup running -domtree on test-suite + SPEC2000/2006, with larger speedups of around 12% on the larger benchmarks like GCC. llvm-svn: 122680	2011-01-02 07:03:00 +00:00
Duncan Sands	87515f1e57	Clarify that InstructionSimplify only returns values that dominate the original instruction. llvm-svn: 122601	2010-12-28 18:01:19 +00:00
Duncan Sands	707bdbd129	Small optimization to speed up replacementPreservesLCSSAForm. llvm-svn: 122600	2010-12-28 18:00:24 +00:00
Chris Lattner	d4daf9f002	implement enough of the memset inference algorithm to recognize and insert memsets. This is still missing one important validity check, but this is enough to compile stuff like this: void test0(std::vector<char> &X) { for (std::vector<char>::iterator I = X.begin(), E = X.end(); I != E; ++I) *I = 0; } void test1(std::vector<int> &X) { for (long i = 0, e = X.size(); i != e; ++i) X[i] = 0x01010101; } With: $ clang t.cpp -S -o - -O2 -emit-llvm | opt -loop-idiom | opt -O3 | llc to: __Z5test0RSt6vectorIcSaIcEE: ## @_Z5test0RSt6vectorIcSaIcEE ## BB#0: ## %entry subq $8, %rsp movq (%rdi), %rax movq 8(%rdi), %rsi cmpq %rsi, %rax je LBB0_2 ## BB#1: ## %bb.nph subq %rax, %rsi movq %rax, %rdi callq ___bzero LBB0_2: ## %for.end addq $8, %rsp ret ... __Z5test1RSt6vectorIiSaIiEE: ## @_Z5test1RSt6vectorIiSaIiEE ## BB#0: ## %entry subq $8, %rsp movq (%rdi), %rax movq 8(%rdi), %rdx subq %rax, %rdx cmpq $4, %rdx jb LBB1_2 ## BB#1: ## %for.body.preheader andq $-4, %rdx movl $1, %esi movq %rax, %rdi callq _memset LBB1_2: ## %for.end addq $8, %rsp ret llvm-svn: 122573	2010-12-26 23:42:51 +00:00
Chris Lattner	c56d20aa48	move isBytewiseValue out to ValueTracking.h/cpp llvm-svn: 122565	2010-12-26 20:15:01 +00:00
Dan Gohman	db69a33d9f	Constify. llvm-svn: 122447	2010-12-22 22:10:08 +00:00
Duncan Sands	6b0ed93a81	Pull a few more simplifications out of instcombine (there are still plenty left though!), in particular for multiplication. llvm-svn: 122330	2010-12-21 14:00:22 +00:00
Dan Gohman	295ba3ab26	Move Value::getUnderlyingObject to be a standalone function so that it can live in Analysis instead of VMCore. llvm-svn: 121885	2010-12-15 20:02:24 +00:00
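
The integer identities mentioned in the r124487 and r124183 commit messages can be checked by brute force. The following is a minimal sketch and not LLVM code: the helper names (`divMulDivFolds`, `orOneIsNonZero`, `mulDivRoundTrips8`, `countFailures`) are hypothetical and exist only for this illustration. It checks that ((X/Y)*Y)/Y equals X/Y over a small range, that (x | 1) is never zero, and that (Z*Y)/Y can differ from Z once the multiplication wraps in 8 bits.

```cpp
#include <cstdint>

// Hypothetical helper (not an LLVM API): true when ((x / y) * y) / y == x / y.
// The product q * y cannot overflow here because |q * y| <= |x|.
bool divMulDivFolds(int32_t x, int32_t y) {
  int32_t q = x / y;  // truncating signed division, y must be non-zero
  return (q * y) / y == q;
}

// Hypothetical helper: (x | 1) always has its low bit set, so it is never 0.
bool orOneIsNonZero(uint32_t x) { return (x | 1u) != 0; }

// Hypothetical helper: does (z * y) / y give back z when the product is
// truncated to 8 bits? Unsigned arithmetic keeps the wraparound well defined.
bool mulDivRoundTrips8(uint8_t z, uint8_t y) {
  uint8_t product = static_cast<uint8_t>(z * y);  // wraps modulo 256
  return product / y == z;
}

// Counts violations of the first two identities for x, y in [-limit, limit].
int countFailures(int limit) {
  int failures = 0;
  for (int x = -limit; x <= limit; ++x) {
    if (!orOneIsNonZero(static_cast<uint32_t>(x)))
      ++failures;
    for (int y = -limit; y <= limit; ++y) {
      if (y == 0)
        continue;  // division by zero is excluded
      if (!divMulDivFolds(x, y))
        ++failures;
    }
  }
  return failures;
}
```

With 8-bit unsigned values, `mulDivRoundTrips8(200, 2)` is false because 400 wraps to 144, which illustrates why the general (Z*Y)/Y fold needs a no-overflow guarantee such as an NSW/NUW flag.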


1760 Commits