llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Dale Johannesen	26f0dd9021	This fixes a bug introduced in 72661, which can move loads back past a check that the load address is valid, see new testcase. The test that went in with 72661 has exactly this case, except that the conditional it's moving past is checking something else; I've settled for changing that test to reference a global, not a pointer. It may be possible to scan all the tests you pass and make sure none of them are checking any component of the address, but it's not trivial and I'm not trying to do that here. llvm-svn: 73632	2009-06-17 20:48:23 +00:00
Owen Anderson	928040c625	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Chris Lattner	8f4210d099	make memdep use the getModRefInfo method for stores instead of the low-level alias() method, allowing it to reason more aggressively about pointers into constant memory. PR4189 llvm-svn: 72403	2009-05-25 21:28:56 +00:00
Owen Anderson	851ce6d1d5	Reapply r68211, with the miscompilations it caused fixed. llvm-svn: 68262	2009-04-01 23:53:49 +00:00
Dan Gohman	a134448980	Revert r68172. It caused regressions in Applications/Burg/burg Applications/ClamAV/clamscan and many other tests. llvm-svn: 68211	2009-04-01 16:37:47 +00:00
Owen Anderson	d7c837bb4b	Enhance GVN to propagate simple conditionals. This fixes PR3921. llvm-svn: 68172	2009-04-01 01:20:45 +00:00
John Criswell	d1fd7b1f72	Do not attempt to do parial redundancy elimination on void values. Also fixed a punctuation error in the header comment. This fixes PR3775. llvm-svn: 66542	2009-03-10 15:04:53 +00:00
Devang Patel	9abfbc2e14	Skip DbgInfoIntrinsic. llvm-svn: 66244	2009-03-06 02:59:27 +00:00
Owen Anderson	9d3e18ed39	Add a test for r61358, which I forgot to add way back when. llvm-svn: 64904	2009-02-18 07:50:22 +00:00
Chris Lattner	b36503c31b	fix two more cases where we could let the NLPDI cache get unsorted. With this, sqlite3 now passes. llvm-svn: 62839	2009-01-23 07:12:16 +00:00
Chris Lattner	ddc8e78d54	Fix PR3358, a really nasty bug where recursive phi translated analyses could be run without the caches properly sorted. This can fix all sorts of weirdness. Many thanks to Bill for coming up with the 'issorted' verification idea. llvm-svn: 62757	2009-01-22 07:04:01 +00:00
Chris Lattner	b3becc5776	fix PR3217: fully cached queries need to be verified against the visited set before they are used. If used, their blocks need to be added to the visited set so that subsequent queries don't use conflicting pointer values in the cache result blocks. llvm-svn: 61080	2008-12-16 07:10:09 +00:00
Chris Lattner	8119a1f70d	Add a testcase for GCC PR 23455, which lpre handles now. Add some comments about why we're not getting other cases. llvm-svn: 61032	2008-12-15 07:49:24 +00:00
Chris Lattner	30c1871282	gvn now hoists this load out of the hot non-call path. llvm-svn: 61028	2008-12-15 06:34:48 +00:00
Chris Lattner	ea2933ff07	Adjust testcase to make it more stable across visitation order changes, unbreaking it after r61024. llvm-svn: 61025	2008-12-15 04:42:00 +00:00
Chris Lattner	22cfa14eed	make GVN try to rename inputs to the resultant replaced values, which cleans up the generated code a bit. This should have the added benefit of not randomly renaming functions/globals like my previous patch did. :) llvm-svn: 61023	2008-12-15 03:46:38 +00:00
Chris Lattner	c92b131639	Implement initial support for PHI translation in memdep. This means that memdep keeps track of how PHIs affect the pointer in dep queries, which allows it to eliminate the load in cases like rle-phi-translate.ll, which basically end up being: BB1: X = load P br BB3 BB2: Y = load Q br BB3 BB3: R = phi [P] [Q] load R turning "load R" into a phi of X/Y. In addition to additional exposed opportunities, this makes memdep safe in many cases that it wasn't before (which is required for load PRE) and also makes it substantially more efficient. For example, consider: bb1: // has many predecessors. P = some_operator() load P In this example, previously memdep would scan all the predecessors of BB1 to see if they had something that would mustalias P. In some cases (e.g. test/Transforms/GVN/rle-must-alias.ll) it would actually find them and end up eliminating something. In many other cases though, it would scan and not find anything useful. MemDep now stops at a block if the pointer is defined in that block and cannot be phi translated to predecessors. This causes it to miss the (rare) cases like rle-must-alias.ll, but makes it faster by not scanning tons of stuff that is unlikely to be useful. For example, this speeds up GVN as a whole from 3.928s to 2.448s (60%)!. IMO, scalar GVN should be enhanced to simplify the rle-must-alias pointer base anyway, which would allow the loads to be eliminated. In the future, this should be enhanced to phi translate through geps and bitcasts as well (as indicated by FIXMEs) making memdep even more powerful. llvm-svn: 61022	2008-12-15 03:35:32 +00:00
Chris Lattner	8f6a8a85a3	another random testcase that shouldn't crash gvn and is good for coverage with future changes. llvm-svn: 61011	2008-12-14 21:20:46 +00:00
Chris Lattner	af4007b39f	RLE isn't smart enough to eliminate this safely yet. llvm-svn: 60994	2008-12-13 21:04:20 +00:00
Chris Lattner	cc5ee569a3	rename some tests to be more uniform in naming convention. llvm-svn: 60988	2008-12-13 18:47:40 +00:00
Chris Lattner	5cb658f43c	gvn should never crash on this. llvm-svn: 60987	2008-12-13 18:39:44 +00:00
Bill Wendling	34182ae3ae	Temporarily revert r60973. It's inexplicably causing a failure when self-hosting LLVM: llvm[2]: Linking Release executable opt (without symbols) ... Undefined symbols: "llvm::APFloat::IEEEsingle", referenced from: __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEsingleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) "llvm::APFloat::IEEEdouble", referenced from: __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(Constants.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(AsmWriter.o) __ZN4llvm7APFloat10IEEEdoubleE$non_lazy_ptr in libLLVMCore.a(ConstantFold.o) ld: symbol(s) not found This is in release mode. To replicate, compile llvm and llvm-gcc in optimized mode. Then build llvm, in optimized mode, with the newly created compiler. llvm-svn: 60977	2008-12-13 09:28:44 +00:00
Chris Lattner	8753175cd6	make RLE preserve the name of the load that it replaces. This is just a pretification of the IR. llvm-svn: 60973	2008-12-13 07:22:47 +00:00
Chris Lattner	2550938060	loosen up an assertion that isn't valid when called from invalidateCachedPointerInfo. Thanks to Bill for sending me a testcase. llvm-svn: 60805	2008-12-09 22:45:32 +00:00
Chris Lattner	6a5e9eaa36	Teach BasicAA::getModRefInfo(CallSite, CallSite) some tricks based on readnone/readonly functions. Teach memdep to look past readonly calls when analyzing deps for a readonly call. This allows elimination of a few more calls from 403.gcc: before: 63 gvn - Number of instructions PRE'd 153986 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted after: 63 gvn - Number of instructions PRE'd 153991 gvn - Number of instructions deleted 50069 gvn - Number of loads deleted 5 calls isn't much, but this adds plumbing for the next change. llvm-svn: 60794	2008-12-09 21:19:42 +00:00
Chris Lattner	211146e709	Fix test/Transforms/GVN/pre-load.ll llvm-svn: 60594	2008-12-05 17:04:12 +00:00
Chris Lattner	2a9747548e	Implement PRE of loads in the GVN pass with a pretty cheap and straight-forward implementation. This does not require any extra alias analysis queries beyond what we already do for non-local loads. Some programs really really like load PRE. For example, SPASS triggers this ~1000 times, ~300 times in 255.vortex, and ~1500 times on 403.gcc. The biggest limitation to the implementation is that it does not split critical edges. This is a huge killer on many programs and should be addressed after the initial patch is enabled by default. The implementation of this should incidentally speed up rejection of non-local loads because it avoids creating the repl densemap in cases when it won't be used for fully redundant loads. This is currently disabled by default. Before I turn this on, I need to fix a couple of miscompilations in the testsuite, look at compile time performance numbers, and look at perf impact. This is pretty close to ready though. llvm-svn: 60408	2008-12-02 08:16:11 +00:00
Owen Anderson	bd844014fa	Add a test for my previous PRE fix. llvm-svn: 60394	2008-12-02 04:25:42 +00:00
Chris Lattner	01150dce74	testcase for my previous commit. llvm-svn: 60315	2008-12-01 01:42:03 +00:00
Chris Lattner	203a3299e9	don't require GVN to work on dead values, just make the test return the loaded value. llvm-svn: 60252	2008-11-29 21:21:48 +00:00
Owen Anderson	5747d627e0	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Owen Anderson	a9fd2b7e53	Use the -enable-pre flag so this test doesn't fail. llvm-svn: 52784	2008-06-26 17:03:28 +00:00
Owen Anderson	3f78e260c1	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Gabor Greif	807c2df887	sabre brings to my attention that the 'tr' suffix is also obsolete llvm-svn: 51349	2008-05-20 21:00:03 +00:00
Gabor Greif	d8a4dbb5da	Rename the last test with .llx extension to .ll, resolve duplicate test by renaming to isnan2. Now that no test has llx ending there is no need to search for them from dg.exp too. llvm-svn: 51328	2008-05-20 19:52:04 +00:00
Owen Anderson	f67c06279b	Add a testcase for non-local CSE of read-only calls. llvm-svn: 51025	2008-05-13 08:17:44 +00:00
Owen Anderson	15e930588a	Add testcase for PR2213. llvm-svn: 49517	2008-04-11 05:13:32 +00:00
Owen Anderson	ca7e0e21f3	Factor a bunch of functionality related to memcpy and memset transforms out of GVN and into its own pass. llvm-svn: 49419	2008-04-09 08:23:16 +00:00
Chris Lattner	12cecbbb25	add a testcase for forming memset from noncontiguous stores. llvm-svn: 48938	2008-03-29 04:51:35 +00:00
Chris Lattner	97e4d98c2d	apparently tclsh doesn't lex like bash. Weird. llvm-svn: 48732	2008-03-24 17:41:57 +00:00
Chris Lattner	3a6d3372f5	pass the option so this test tests the right thing. llvm-svn: 48731	2008-03-24 17:36:38 +00:00
Owen Anderson	2f91173e40	Use normal naming convention for test. llvm-svn: 48693	2008-03-22 21:08:33 +00:00
Chris Lattner	16f62d36e8	implement an initial hack at a straight-line store -> memset optimization. This fires dozens of times across spec and multisource, but I don't know if it actually speeds stuff up. Hopefully the testers will show something nice :) llvm-svn: 48680	2008-03-22 05:37:16 +00:00
Owen Anderson	6c2454d9d1	Fix a bug in GVN that Duncan noticed, where we potentially need to insert a pointer bitcast when performing return slot optimization. llvm-svn: 48343	2008-03-13 22:07:10 +00:00
Owen Anderson	5887233a3f	Improve the return slot optimization to be both more aggressive (not limited to sret parameters), and safer (when the passed pointer might be invalid). Thanks to Duncan and Chris for the idea behind this, and extra thanks to Duncan for helping me work out the trap-safety. llvm-svn: 48280	2008-03-12 07:37:44 +00:00
Owen Anderson	a39b18223f	Add PR number to testcase. llvm-svn: 47640	2008-02-26 23:16:11 +00:00
Owen Anderson	eadd074b22	Fix an issue where GVN had the sizes of the two memcpy's reverse, resulting in an invalid transformation. llvm-svn: 47639	2008-02-26 23:06:17 +00:00
Owen Anderson	6eafd532ab	Fix an issue where GVN was performing the return slot optimization when it was not safe. This is fixed by more aggressively checking that the return slot is not used elsewhere in the function. llvm-svn: 47544	2008-02-25 04:08:09 +00:00
Owen Anderson	432abc0479	Fix an issue where GVN would try to use an instruction before its definition when performing return slot optimization. llvm-svn: 47541	2008-02-25 00:40:41 +00:00
Chris Lattner	a6dc8aaa3f	make this just a bit more strict. llvm-svn: 47274	2008-02-18 17:33:10 +00:00
Owen Anderson	7b092ea631	Add support to GVN for performing sret return slot optimization. This means that, if an sret function tail calls another sret function, it should pass its own sret parameter to the tail callee, allowing it to fill in the correct return value. llvm-gcc does not emit this by default. Instead, it allocates space in the caller for the sret of the tail call and then uses memcpy to copy the result into the caller's sret parameter. This optimization detects and optimizes that case. llvm-svn: 47265	2008-02-18 09:24:53 +00:00
Nick Lewycky	12098ea401	Testcase for PR2032. llvm-svn: 47113	2008-02-14 07:15:11 +00:00
Eli Friedman	cf6273d015	Add a note pointing to PR1996. llvm-svn: 47055	2008-02-13 07:56:04 +00:00
Eli Friedman	75220639b6	Add test for PR1996. (This is my first time adding a test for a transform, so please review.) llvm-svn: 47050	2008-02-13 06:55:57 +00:00
Owen Anderson	274aa2846e	Re-apply the patch to improve the optimizations of memcpy's, with several bugs fixed. This now passes PPC bootstrap. llvm-svn: 47026	2008-02-12 21:15:18 +00:00
Bill Wendling	8a28ab4b1f	Temporarily reverting: http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20080128/057882.html This is causing a miscompilation on PPC G5 and just now seeing it on iMac x86-64. llvm-svn: 46822	2008-02-06 20:03:07 +00:00
Owen Anderson	e2150dfe24	Make this test more aggressive, to cover recent improvements. llvm-svn: 46695	2008-02-04 04:55:24 +00:00
Owen Anderson	aaba6f96da	Allow GVN to hack on memcpy's, making them open to further optimization. llvm-svn: 46693	2008-02-04 02:59:58 +00:00
Dan Gohman	794fa1f8f7	Convert tests using "\| wc -l \| grep ..." to use the count script. llvm-svn: 41097	2007-08-15 13:36:28 +00:00
Owen Anderson	9c996b3212	Fix a failure I accidentally caused in my last commit by mishandling the removal of redundant phis. llvm-svn: 40650	2007-07-31 20:18:28 +00:00
Owen Anderson	d178c05c62	Fix a misoptimization in aha. llvm-svn: 40642	2007-07-31 17:43:14 +00:00
Owen Anderson	a8d0a6ee40	Fix a bug caused by indiscriminantly asking for the dominators of a predecessor. llvm-svn: 40595	2007-07-30 16:57:08 +00:00
Owen Anderson	ed3e63a203	Fix a bug introduced in my last commit. llvm-svn: 40542	2007-07-26 18:57:04 +00:00
Owen Anderson	f2b10d3de3	Fix a couple more bugs in the phi construction by pulling in code that does almost the same things from LCSSA. llvm-svn: 40540	2007-07-26 18:26:51 +00:00
Owen Anderson	ed75133924	Fix what is _hopefully_ the last corner case for loops. llvm-svn: 40503	2007-07-25 23:54:42 +00:00
Owen Anderson	1e53615073	My last commit was not correct for nested loops. Fix it, and add a testcase for it. llvm-svn: 40498	2007-07-25 22:19:40 +00:00
Owen Anderson	ce9d73dbe7	Fix an infinite loop on 300.twolf. llvm-svn: 40497	2007-07-25 22:03:06 +00:00
Owen Anderson	b215074820	Fix a bug in non-local memdep that was causing an infinite loop on 175.vpr. llvm-svn: 40495	2007-07-25 21:26:36 +00:00
Owen Anderson	522f7a7608	Fix a bug that was causing GVN to crash on 252.eon. llvm-svn: 40494	2007-07-25 21:13:41 +00:00
Owen Anderson	6a1a8d05b8	Add basic support for performing whole-function RLE. Note: This has not yet been thoroughly tested. Use at your own risk. llvm-svn: 40489	2007-07-25 19:57:03 +00:00
Owen Anderson	eb9f1b612c	Add a GVN pass, using the value numbering code I developed for GVNPRE and the load elimination code from RedundantLoadElimination. llvm-svn: 40469	2007-07-24 17:55:58 +00:00

1 2 3 4 5

221 Commits