llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 14:32:51 +01:00

Author	SHA1	Message	Date
Anton Korobeynikov	30085a6f51	Revert invalid r60393. It causes llvm-gcc bootstrap fails in release builds. See PR3160 for details llvm-svn: 60604	2008-12-05 19:38:49 +00:00
Chris Lattner	211146e709	Fix test/Transforms/GVN/pre-load.ll llvm-svn: 60594	2008-12-05 17:04:12 +00:00
Chris Lattner	35547ba5ca	Make IsValueFullyAvailableInBlock safe. llvm-svn: 60588	2008-12-05 07:49:08 +00:00
Chris Lattner	2a9747548e	Implement PRE of loads in the GVN pass with a pretty cheap and straight-forward implementation. This does not require any extra alias analysis queries beyond what we already do for non-local loads. Some programs really really like load PRE. For example, SPASS triggers this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc. The biggest limitation to the implementation is that it does not split critical edges. This is a huge killer on many programs and should be addressed after the initial patch is enabled by default. The implementation of this should incidentally speed up rejection of non-local loads because it avoids creating the repl densemap in cases when it won't be used for fully redundant loads. This is currently disabled by default. Before I turn this on, I need to fix a couple of miscompilations in the testsuite, look at compile time performance numbers, and look at perf impact. This is pretty close to ready though. llvm-svn: 60408	2008-12-02 08:16:11 +00:00
Owen Anderson	92e405b332	Fix an issue that Chris noticed, where local PRE was not properly instantiating a new value numbering set after splitting a critical edge. This increases the number of instances of PRE on 403.gcc from ~60 to ~570. llvm-svn: 60393	2008-12-02 04:09:22 +00:00
Chris Lattner	3b908483b7	Rename some variables, only increment BI once at the start of the loop instead of throughout it. llvm-svn: 60339	2008-12-01 07:35:54 +00:00
Chris Lattner	c6e6eaf6d3	pull the predMap densemap out of the inner loop of performPRE, so that it isn't reallocated all the time. This is a tiny speedup for GVN: 3.90->3.88s llvm-svn: 60338	2008-12-01 07:29:03 +00:00
Chris Lattner	c1adf6fc51	Make GVN be more intelligent about redundant load elimination: when finding dependent load/stores, realize that they are the same if aliasing claims must alias instead of relying on the pointers to be exactly equal. This makes load elimination more aggressive. For example, on 403.gcc, we had: < 68 gvn - Number of instructions PRE'd < 152718 gvn - Number of instructions deleted < 49699 gvn - Number of loads deleted < 6153 memdep - Number of dirty cached non-local responses < 169336 memdep - Number of fully cached non-local responses < 162428 memdep - Number of uncached non-local responses now we have: > 64 gvn - Number of instructions PRE'd > 153623 gvn - Number of instructions deleted > 49856 gvn - Number of loads deleted > 5022 memdep - Number of dirty cached non-local responses > 159030 memdep - Number of fully cached non-local responses > 162443 memdep - Number of uncached non-local responses That's an extra 157 loads deleted and extra 905 other instructions nuked. This slows down GVN very slightly, from 3.91 to 3.96s. llvm-svn: 60314	2008-12-01 01:31:36 +00:00
Chris Lattner	bd1bc4a75e	Reimplement the non-local dependency data structure in terms of a sorted vector instead of a densemap. This shrinks the memory usage of this thing substantially (the high water mark) as well as making operations like scanning it faster. This speeds up memdep slightly, gvn goes from 3.9376 to 3.9118s on 403.gcc This also splits out the statistics for the cached non-local case to differentiate between the dirty and clean cached case. Here's the stats for 403.gcc: 6153 memdep - Number of dirty cached non-local responses 169336 memdep - Number of fully cached non-local responses 162428 memdep - Number of uncached non-local responses yay for caching :) llvm-svn: 60313	2008-12-01 01:15:42 +00:00
Chris Lattner	1f8482ffc8	Cache analyses in ivars and add some useful DEBUG output. This speeds up GVN from 4.0386s to 3.9376s. llvm-svn: 60310	2008-12-01 00:40:32 +00:00
Chris Lattner	77908d9ccf	improve indentation, do cheap checks before expensive ones, remove some fixme's. This speeds up GVN very slightly on 403.gcc (4.06->4.03s) llvm-svn: 60309	2008-11-30 23:39:23 +00:00
Chris Lattner	2f7da36732	Fix a fixme by making memdep's handling of allocations more logical. If we see that a load depends on the allocation of its memory with no intervening stores, we now return a 'None' depedency instead of "Normal". This tweaks GVN to do its optimization with the new result. llvm-svn: 60267	2008-11-30 01:39:32 +00:00
Chris Lattner	3e86ec7289	Change MemDep::getNonLocalDependency to return its results as a smallvector instead of a DenseMap. This speeds up GVN by 5% on 403.gcc. llvm-svn: 60255	2008-11-29 21:33:22 +00:00
Chris Lattner	ffc1af1619	reimplement getNonLocalDependency with a simpler worklist formulation that is faster and doesn't require nonLazyHelper. Much less code. llvm-svn: 60253	2008-11-29 21:22:42 +00:00
Chris Lattner	96c72eef4b	Split getDependency into getDependency and getDependencyFrom, the former does caching, the later doesn't. This dramatically simplifies the logic in getDependency and getDependencyFrom. llvm-svn: 60234	2008-11-29 03:47:00 +00:00
Chris Lattner	6bf62f050c	Introduce and use a new MemDepResult class to hold the results of a memdep query. This makes it crystal clear what cases can escape from MemDep that the clients have to handle. This also gives the clients a nice simplified interface to it that is easy to poke at. This patch also makes DepResultTy and MemoryDependenceAnalysis::DepType private, yay. llvm-svn: 60231	2008-11-29 02:29:27 +00:00
Chris Lattner	e9295510b5	Reimplement the internal abstraction used by MemDep in terms of a pointer/int pair instead of a manually bitmangled pointer. This forces clients to think a little more about checking the appropriate pieces and will be useful for internal implementation improvements later. I'm not particularly happy with this. After going through this I don't think that the clients of memdep should be exposed to the internal type at all. I'll fix this in a subsequent commit. This has no functionality change. llvm-svn: 60230	2008-11-29 01:43:36 +00:00
Nuno Lopes	cc4f37aa68	fix memleak by cleaning the global sets on pass exit llvm-svn: 57353	2008-10-10 16:25:50 +00:00
Duncan Sands	8f296a3788	Add <cstdio> include where needed by gcc-4.4. Patch by Samuel Tardieu. llvm-svn: 57291	2008-10-08 07:23:46 +00:00
Duncan Sands	88d8323743	Factorize code: remove variants of "strip off pointer bitcasts and GEP's", and centralize the logic in Value::getUnderlyingObject. The difference with stripPointerCasts is that stripPointerCasts only strips GEPs if all indices are zero, while getUnderlyingObject strips GEPs no matter what the indices are. llvm-svn: 56922	2008-10-01 15:25:41 +00:00
Dan Gohman	e1f9be27bc	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Owen Anderson	94bd638e81	Fix a bug that prevented PRE from applying in some cases. llvm-svn: 55744	2008-09-03 23:06:07 +00:00
Owen Anderson	14510f8fee	Put a heuristic in place to prevent GVN from falling into bad cases with massively complicated CFGs. This speeds up a particular testcase from 12+ hours to 5 seconds with little perceptible loss of quality. llvm-svn: 55391	2008-08-26 22:07:42 +00:00
Chris Lattner	fd64cbf22d	consolidate DenseMapInfo implementations, and add one for std::pair. Patch contributed by m-s. llvm-svn: 55167	2008-08-22 05:08:25 +00:00
Duncan Sands	b4646d7dbe	Supress a gcc-4.3 warning. llvm-svn: 53771	2008-07-18 21:06:02 +00:00
Owen Anderson	8500aba55a	Make PRE actually handle critical edges (by splitting them). Confirmed that bootstrap passes with this change. llvm-svn: 53762	2008-07-18 18:03:38 +00:00
Owen Anderson	5524271234	Enable PRE. My last batch of changes fixed the miscompile. llvm-svn: 53730	2008-07-17 19:41:00 +00:00
Owen Anderson	7fb36c8bba	Factor MergeBlockIntoPredecessor out into BasicBlockUtils. llvm-svn: 53705	2008-07-17 00:01:40 +00:00
Owen Anderson	f91aa3b22f	There's no need to iterate block merging and PRE. In fact, iterating the latter could cause problems for memdep when it breaks critical edges. llvm-svn: 53691	2008-07-16 17:52:31 +00:00
Owen Anderson	e12f7904ff	Revert this, as it seems to still be broken. llvm-svn: 53627	2008-07-15 17:59:02 +00:00
Owen Anderson	73bdb28e89	Enable local PRE by default. llvm-svn: 53616	2008-07-15 16:28:23 +00:00
Owen Anderson	f1b3898e7f	Have GVN do a pre-pass over the CFG that folds away unconditional branches where possible. This allows local PRE to be more aggressive. llvm-svn: 53615	2008-07-15 16:28:06 +00:00
Owen Anderson	cc861329ee	Don't call lookupNumber more than we have to. llvm-svn: 53470	2008-07-11 20:05:13 +00:00
Owen Anderson	035bf4d2ca	Use information already present in the ValueTable to fast-fail when we know there won't be a value number match. This speeds up GVN on a case where there are very few redundancies by ~25%. llvm-svn: 53108	2008-07-03 17:44:33 +00:00
Owen Anderson	0b0017e514	Avoid a redundant call. llvm-svn: 53040	2008-07-02 18:15:31 +00:00
Owen Anderson	5747d627e0	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Evan Cheng	174f11c202	Disable PRE. It's breaking bootstrapping. llvm-svn: 52643	2008-06-23 21:22:35 +00:00
Owen Anderson	0dcae02ad4	Tighten the conditions under which we do PRE, remove some unneeded code, and correct our preserved analyses list, since we do now change the CFG by splitting critical edges during PRE. llvm-svn: 52631	2008-06-23 17:49:45 +00:00
Evan Cheng	2ff47edc37	Enable PRE. llvm-svn: 52574	2008-06-21 07:26:53 +00:00
Owen Anderson	5213d215e0	Really disable PRE. llvm-svn: 52531	2008-06-20 08:59:13 +00:00
Owen Anderson	3231e506cb	Change around the data structures used to store availability sets, resulting in a GVN+PRE that is faster that GVN alone was before. llvm-svn: 52521	2008-06-20 01:15:47 +00:00
Evan Cheng	d4c055e1f8	Disable PRE for now. It seems to be breaking llvm-gcc bootstrapping. llvm-svn: 52518	2008-06-20 01:01:07 +00:00
Owen Anderson	c979fcf8ff	Add a hidden -disable-pre flag for testing purposes. This should be removed once benchmarking is completed. llvm-svn: 52506	2008-06-19 19:57:25 +00:00
Owen Anderson	35fb8dbfef	PRE requires that critical edges be split. llvm-svn: 52505	2008-06-19 19:54:19 +00:00
Owen Anderson	87d761460e	Be sure to remove values from the value numbering table after we delete them. This fixes a failure on povray. llvm-svn: 52499	2008-06-19 17:53:26 +00:00
Owen Anderson	aa850d32db	Revert support for insertvalue and extractvalue instructions for the moment. GVN expects that all inputs which to an instruction fall somewhere in the value hierarchy, which isn't true for these. llvm-svn: 52496	2008-06-19 17:25:39 +00:00
Owen Anderson	5b8d39569b	Add support for extractvalue and insertvalue instructions in GVN. llvm-svn: 52472	2008-06-18 21:59:00 +00:00
Owen Anderson	3f78e260c1	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Owen Anderson	c1ac0c1c41	We don't want to find dependencies within the same block in this case. It leads to incorrect results because we're detecting something at or after the call we're querying on. llvm-svn: 52433	2008-06-17 22:27:06 +00:00
Owen Anderson	22a982f9eb	Switch GVN to use ScopedHashTable. llvm-svn: 52242	2008-06-12 19:25:32 +00:00
Matthijs Kooijman	1fd76cd396	Update comments and documentation to reflect that GCSE and ValueNumbering are deprecated by the GVN and GVNPRE passes. llvm-svn: 51983	2008-06-05 07:55:49 +00:00
Owen Anderson	264b60b69d	Remove unneeded #include. llvm-svn: 51955	2008-06-04 18:28:10 +00:00
Nate Begeman	fdedac42c7	Teach GVN to not assert on vector comparisons llvm-svn: 51230	2008-05-18 19:49:05 +00:00
Owen Anderson	6a2ee9af7a	Fix Analysis/BasicAA/pure-const-dce.ll. This turned out to be a correctness bug as well as a missed optimization. We weren't properly checking for local dependencies before moving on to non-local ones when doing non-local read-only call CSE. llvm-svn: 51082	2008-05-13 23:18:30 +00:00
Owen Anderson	7f6db08b5f	Make the non-local CSE safety checks slightly more thorough. llvm-svn: 51035	2008-05-13 13:41:23 +00:00
Owen Anderson	c54c61634c	Add support for non-local CSE of read-only calls. llvm-svn: 51024	2008-05-13 08:17:22 +00:00
Owen Anderson	0256b368f8	Go back to passing the analyses around as parameters. llvm-svn: 50995	2008-05-12 20:15:55 +00:00
Owen Anderson	3f2d12126f	Move the various analyses used by GVN into static variables so we don't have to keep passing them around or refetching them. llvm-svn: 50963	2008-05-12 08:15:27 +00:00
Owen Anderson	b171c54227	Remove unneeded #include's. llvm-svn: 50035	2008-04-21 07:47:38 +00:00
Owen Anderson	cd1b9c4b43	Make GVN able to remove unnecessary calls to read-only functions again. llvm-svn: 49842	2008-04-17 05:36:50 +00:00
Owen Anderson	f55bae07b7	Fix PR2213 by simultaneously making GVN more aggressive with the return values of calls and less aggressive with non-readnone calls. llvm-svn: 49516	2008-04-11 05:11:49 +00:00
Owen Anderson	ca7e0e21f3	Factor a bunch of functionality related to memcpy and memset transforms out of GVN and into its own pass. llvm-svn: 49419	2008-04-09 08:23:16 +00:00
Owen Anderson	0d844f6205	Remove accidentally duplicated code. llvm-svn: 49418	2008-04-09 07:55:01 +00:00
Owen Anderson	4ad5a5201c	Add operator= implementations to SparseBitVector, allowing it to be used in GVN. This results in both time and memory savings for GVN. For example, one testcase went from 10.5s to 6s with this patch. llvm-svn: 49345	2008-04-07 17:38:23 +00:00
Owen Anderson	93ab00f1d9	Make GVN more memory efficient, particularly on code that contains a large number of allocations, which GVN can't optimize anyways. llvm-svn: 49329	2008-04-07 09:59:07 +00:00
Gabor Greif	6c6b8a57f3	API changes for class Use size reduction, wave 1. Specifically, introduction of XXX::Create methods for Users that have a potentially variable number of Uses. llvm-svn: 49277	2008-04-06 20:25:17 +00:00
Chris Lattner	10e3ff7e5f	change iterator invalidation avoidance to just move the iterator backward when something changes, instead of moving forward. This allows us to simplify memset lowering, inserting the memset at the end of the range of stuff we're touching instead of at the start. This, in turn, allows us to make use of the addressing instructions already used in the function instead of inserting our own. For example, we now codegen: %tmp41 = getelementptr [8 x i8]* %ref_idx, i32 0, i32 0 ; <i8> [#uses=2] call void @llvm.memset.i64( i8 %tmp41, i8 -1, i64 8, i32 1 ) instead of: %tmp20 = getelementptr [8 x i8]* %ref_idx, i32 0, i32 7 ; <i8> [#uses=1] %ptroffset = getelementptr i8 %tmp20, i64 -7 ; <i8> [#uses=1] call void @llvm.memset.i64( i8 %ptroffset, i8 -1, i64 8, i32 1 ) llvm-svn: 48940	2008-03-29 05:15:47 +00:00
Chris Lattner	48b3859ee9	make the common case of a single store (which clearly shouldn't be turned into a memset!) faster by avoiding an allocation of an std::list node. llvm-svn: 48939	2008-03-29 04:52:12 +00:00
Chris Lattner	722c9a539f	give form-memset a significantly more sane heuristic, enable it by default. llvm-svn: 48937	2008-03-29 04:36:18 +00:00
Chris Lattner	0a18724a00	make memset inference significantly more powerful: it can now handle memsets that initialize "structs of arrays" and other store sequences that are not sequential. This is still only enabled if you pass -form-memset-from-stores. The flag is not heavily tested and I haven't analyzed the perf regressions when -form-memset-from-stores is passed either, but this causes no make check regressions. llvm-svn: 48909	2008-03-28 06:45:13 +00:00
Evan Cheng	95cc5fca5c	Temporarily disabling memset forming optimization. Add an option. llvm-svn: 48720	2008-03-24 05:28:38 +00:00
Chris Lattner	16f62d36e8	implement an initial hack at a straight-line store -> memset optimization. This fires dozens of times across spec and multisource, but I don't know if it actually speeds stuff up. Hopefully the testers will show something nice :) llvm-svn: 48680	2008-03-22 05:37:16 +00:00
Chris Lattner	9a567d824d	implement the logic for memset insertion and store deletion. llvm-svn: 48679	2008-03-22 04:13:49 +00:00
Chris Lattner	9d8b1ee347	This is a partially implemented and currently disabled start of a store merging optimization. Nothing to see here, hopefully more later :) llvm-svn: 48670	2008-03-22 00:31:52 +00:00
Chris Lattner	18f7655a45	the size of a smallvector shouldn't be part of the interface to these methods. llvm-svn: 48662	2008-03-21 22:01:16 +00:00
Chris Lattner	15d06c679b	make gvn marginally faster by reallocating the lastSeenLoad map for each basic block. llvm-svn: 48660	2008-03-21 21:33:23 +00:00
Chris Lattner	b8102d9de3	Minor cleanups and shrinkification. llvm-svn: 48658	2008-03-21 21:14:38 +00:00
Owen Anderson	6c2454d9d1	Fix a bug in GVN that Duncan noticed, where we potentially need to insert a pointer bitcast when performing return slot optimization. llvm-svn: 48343	2008-03-13 22:07:10 +00:00
Owen Anderson	5887233a3f	Improve the return slot optimization to be both more aggressive (not limited to sret parameters), and safer (when the passed pointer might be invalid). Thanks to Duncan and Chris for the idea behind this, and extra thanks to Duncan for helping me work out the trap-safety. llvm-svn: 48280	2008-03-12 07:37:44 +00:00
Owen Anderson	eadd074b22	Fix an issue where GVN had the sizes of the two memcpy's reverse, resulting in an invalid transformation. llvm-svn: 47639	2008-02-26 23:06:17 +00:00
Owen Anderson	6eafd532ab	Fix an issue where GVN was performing the return slot optimization when it was not safe. This is fixed by more aggressively checking that the return slot is not used elsewhere in the function. llvm-svn: 47544	2008-02-25 04:08:09 +00:00
Owen Anderson	432abc0479	Fix an issue where GVN would try to use an instruction before its definition when performing return slot optimization. llvm-svn: 47541	2008-02-25 00:40:41 +00:00
Anton Korobeynikov	fd6b669c80	Make Transforms to be 4.3 warnings-clean llvm-svn: 47371	2008-02-20 11:26:25 +00:00
Owen Anderson	52ed56338d	When performing return slot optimization, remember to inform memdep when we're removing the memcpy. llvm-svn: 47364	2008-02-20 08:23:02 +00:00
Owen Anderson	6196cdcb48	Refactor this method a bit, and correct a test that was completely wrong but happened to work out anyways. :-) llvm-svn: 47321	2008-02-19 07:07:51 +00:00
Chris Lattner	99e0b1c063	isa+cast -> dyncast. llvm-svn: 47320	2008-02-19 06:53:20 +00:00
Chris Lattner	0ee0f38084	simplify this code again, try 2 :) llvm-svn: 47319	2008-02-19 06:52:38 +00:00
Owen Anderson	d60bb0a64b	Fix a comment. llvm-svn: 47318	2008-02-19 06:51:23 +00:00
Owen Anderson	dbc264003e	Major improvements to yesterday's return slot optimization. Remove some unneccessary constraints, and add some others that should have been in from the first place. Document the whole thing better. llvm-svn: 47315	2008-02-19 06:35:43 +00:00
Owen Anderson	4e6f18d5bf	Factor the profitability check for return slot optimization out into a static function. At some point in the future, this check will become smarter. llvm-svn: 47310	2008-02-19 03:27:34 +00:00
Owen Anderson	3782cd74d1	An sret parameter is required to be the first parameter, so there's no need to loop over all the parameters of the callee looking for it. llvm-svn: 47309	2008-02-19 03:15:29 +00:00
Owen Anderson	ea5cdf1a83	Cleanup some of my patches from yesterday. Refactor the check for which xform to apply to a memcpy into processInstruction. Also, fix a bug in the check due to missing braces. llvm-svn: 47307	2008-02-19 03:09:45 +00:00
Owen Anderson	5c258ed93d	Fix Transforms/GVN/memcpy.ll, which Chris broke in r47275 by reordering the branches. memcpy's are a kind of CallInst. llvm-svn: 47305	2008-02-19 02:53:23 +00:00
Chris Lattner	a3318d17d4	minor code simplification, no functionality change. llvm-svn: 47275	2008-02-18 17:47:29 +00:00
Owen Anderson	7b092ea631	Add support to GVN for performing sret return slot optimization. This means that, if an sret function tail calls another sret function, it should pass its own sret parameter to the tail callee, allowing it to fill in the correct return value. llvm-gcc does not emit this by default. Instead, it allocates space in the caller for the sret of the tail call and then uses memcpy to copy the result into the caller's sret parameter. This optimization detects and optimizes that case. llvm-svn: 47265	2008-02-18 09:24:53 +00:00
Nick Lewycky	0dd6ce5d3a	Fix PR2032. Inform the alias analysis of changes to the underlying program. llvm-svn: 47111	2008-02-14 07:11:24 +00:00
Owen Anderson	274aa2846e	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Eli Friedman	69268a529e	Fix for bug 1996: optimize out loads of undef. This code basically just checks for a malloc/alloca immediately followed by a load. llvm-svn: 47006	2008-02-12 12:08:14 +00:00
Bill Wendling	8a28ab4b1f	Temporarily reverting: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20080128/057882.html This is causing a miscompilation on PPC G5 and just now seeing it on iMac x86-64. llvm-svn: 46822	2008-02-06 20:03:07 +00:00
Owen Anderson	aaba6f96da	Allow GVN to hack on memcpy's, making them open to further optimization. llvm-svn: 46693	2008-02-04 02:59:58 +00:00

1 2 3 4

187 Commits