llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-28 22:42:52 +01:00

Author	SHA1	Message	Date
Evan Cheng	ee801276b3	This also got better (55 - 51 instructions). But doing one more re-materialization. llvm-svn: 52482	2008-06-19 01:50:13 +00:00
Evan Cheng	56e17b525c	This got better. llvm-svn: 52481	2008-06-19 01:46:43 +00:00
Owen Anderson	597b40ed60	Remove this test until the corresponding patch is reapplied because it's causing make check to crash for some people. llvm-svn: 52473	2008-06-18 22:37:31 +00:00
Owen Anderson	3f78e260c1	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Matthijs Kooijman	b7e2818227	Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). Also add a testcase for testing various variations of (multiple) dead rerturn values. llvm-svn: 52459	2008-06-18 11:12:53 +00:00
Matthijs Kooijman	8177832230	Reapply r52397 (make IPConstProp promote returned arguments), but fixed this time. Sorry for the trouble! This time, also add a testcase, which I should have done in the first place... llvm-svn: 52455	2008-06-18 08:30:37 +00:00
Matthijs Kooijman	bb59138fa6	Reapply r52396, it was unrelated to the breakage (that was caused by r52397, my commit after this). llvm-svn: 52453	2008-06-18 08:09:27 +00:00
Chris Lattner	93da79f7a1	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	2a22e66e47	temporarily revert this testcase since its patch was reverted. llvm-svn: 52441	2008-06-18 04:03:23 +00:00
Chris Lattner	7e403da191	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Devang Patel	8f157a3670	Preserve dominance frontier while trivially unswitching loop. llvm-svn: 52438	2008-06-18 02:16:38 +00:00
Matthijs Kooijman	f0adaf34a1	Learn IPConstProp to look at individual return values and propagate them individually. Also learn IPConstProp how returning first class aggregates work, in addition to old style multiple return instructions. Modify the return-constants testscase to confirm this behaviour. llvm-svn: 52396	2008-06-17 12:02:52 +00:00
Evan Cheng	8cfd1d39a1	Do not issue identity copies. llvm-svn: 52373	2008-06-16 22:52:53 +00:00
Dan Gohman	c1fd5f170b	Refine the change in r52258 for avoiding use-before-def conditions when changing the stride of a comparison so that it's slightly more precise, by having it scan the instruction list to determine if there is a use of the condition after the point where the condition will be inserted. llvm-svn: 52371	2008-06-16 22:34:15 +00:00
Evan Cheng	d27948e716	- Add "Commutative" property to intrinsics. This allows tblgen to generate the commuted variants for dagisel matching code. - Mark lots of X86 intrinsics as "Commutative" to allow load folding. llvm-svn: 52353	2008-06-16 20:29:38 +00:00
Matthijs Kooijman	bdd5cae51c	Make testcase check for extractvalue instead of extractelement. llvm-svn: 52317	2008-06-16 13:03:44 +00:00
Matthijs Kooijman	e01197eaa9	Store the result of multiple identical run lines in a temporary file. llvm-svn: 52314	2008-06-16 12:21:25 +00:00
Matthijs Kooijman	f6b1a51a94	Fix PR numbers, I accidentally switched two digits. llvm-svn: 52311	2008-06-16 09:38:23 +00:00
Chris Lattner	e987a3bdd1	If we are checking to see if the result of a call aliases a pointer derived from a local allocation, if the local allocation never escapes, the pointers can't alias. This implements PR2436 llvm-svn: 52301	2008-06-16 06:19:11 +00:00
Chris Lattner	63195a4694	this is unneeded now. llvm-svn: 52298	2008-06-16 04:25:39 +00:00
Chris Lattner	61de4d6454	resolve PR2453 by adding a run line. llvm-svn: 52296	2008-06-16 04:22:39 +00:00
Wojciech Matyjewicz	58f5cc8864	Fix PR2434. When scanning for exising binary operator to reuse don't take into account the instrucion pointed by InsertPt. Thanks to it, returning the new value of InsertPt to the InsertBinop() caller can be avoided. The bug was, actually, in visitAddRecExpr() method which wasn't correctly handling changes of InsertPt. There shouldn't be any performance regression, as -gvn pass (run after -indvars) removes any redundant binops. llvm-svn: 52291	2008-06-15 19:07:39 +00:00
Evan Cheng	2e99c9cbf8	Teach the spiller to commute instructions in order to fold a reload. This hits 410 times on 444.namd and 122 times on 252.eon. llvm-svn: 52266	2008-06-13 23:58:02 +00:00
Eli Friedman	88cdc65941	Remove unnecessary target lines. llvm-svn: 52261	2008-06-13 22:12:16 +00:00
Eli Friedman	46782c75fe	Remove unnecessary target lines. llvm-svn: 52260	2008-06-13 22:10:32 +00:00
Eli Friedman	11d4c94933	Don't skip over instructions other than loads that might read memory when trying to sink stores. llvm-svn: 52259	2008-06-13 22:02:12 +00:00
Dan Gohman	7f1fcc4339	Protect ChangeCompareStride from situations in which it is possible for it to generate use-before-def IR, such as in this testcase. llvm-svn: 52258	2008-06-13 21:43:41 +00:00
Eli Friedman	d38a639deb	Make sure SimplifyStoreAtEndOfBlock doesn't mess with loops; the structure checks are incorrect if the blocks aren't distinct. Fixes PR2435. llvm-svn: 52257	2008-06-13 21:17:49 +00:00
Duncan Sands	40c8db881a	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Matthijs Kooijman	4736a83c41	XFAIL some tests that became failing due to the extra error reporting recently. PR's are created for these. llvm-svn: 52250	2008-06-13 16:52:35 +00:00
Nick Lewycky	0aa53f2b52	Crash less. The i64 restriction in BinomialCoefficient caused some problems with code that was expecting different bit widths for different values. Make getTruncateOrZeroExtend a method on ScalarEvolution, and use it. llvm-svn: 52248	2008-06-13 04:38:55 +00:00
Evan Cheng	66ce588b87	Fix some tests. llvm-svn: 52245	2008-06-12 21:23:38 +00:00
Evan Cheng	b0d847cf05	Revert 52223. llvm-svn: 52243	2008-06-12 20:55:39 +00:00
Matthijs Kooijman	d07ffc50fa	Don't try to compile tests for the ev56 alpha subtarget, which hasn't been supported since r33492. llvm-svn: 52237	2008-06-12 13:44:26 +00:00
Matthijs Kooijman	c2d6b13f1e	Pass -silence-passes to bugpoint in testcases, this makes two out of three bugpoint testcases work again. llvm-svn: 52236	2008-06-12 13:12:11 +00:00
Matthijs Kooijman	3a11ccc589	Add line continuation character so the avoid dup loop header test actually runs. llvm-svn: 52228	2008-06-12 08:49:04 +00:00
Evan Cheng	9b0c0a0f00	Avoid duplicating loop header which leads to unnatural loops (and just seem like general badness to me, likely to cause code explosion). Patch by Florian Brandner. llvm-svn: 52223	2008-06-11 19:07:54 +00:00
Gordon Henriksen	02d34b6a0c	Don't send checkpoints to stderr for the vmcore.ml test. llvm-svn: 52218	2008-06-11 14:58:01 +00:00
Matthijs Kooijman	0f9df32e12	Teach instruction combining about the extractvalue. It can succesfully fold useless insert-extract chains, similar to how it folds them for vectors. Add a testcase for this. llvm-svn: 52217	2008-06-11 14:05:05 +00:00
Dale Johannesen	e86cbb7893	Use %link not %llvmgxx (which includes -c) to do the link. The test still fails because an expected symbol is not present, and I don't see why it should be. llvm-svn: 52188	2008-06-10 18:01:54 +00:00
Dale Johannesen	c1d2ca1701	Suppress ObjC FE warnings, which cause the test to fail. Warnings are legitimate. llvm-svn: 52187	2008-06-10 18:00:45 +00:00
Dale Johannesen	6654d90831	Add -w to inhibit gcc warnings, which causes the harness to fail the tests. The warning all appear legitimate. llvm-svn: 52186	2008-06-10 18:00:09 +00:00
Dale Johannesen	47cee90b57	Fix parameter spelling: sse not sse1 llvm-svn: 52185	2008-06-10 17:57:58 +00:00
Matthijs Kooijman	3488e4542b	Ignore stderr for some more tests that expect warnings there. This fixes 2 testcases. llvm-svn: 52184	2008-06-10 16:13:38 +00:00
Matthijs Kooijman	00a807266e	Fix some more quoting issues in RUN lines, this time regarding unintended variable expansions involving the $ character. This fixes 4 tests that were not running properly before. llvm-svn: 52183	2008-06-10 16:10:32 +00:00
Matthijs Kooijman	e8fb62fb3c	Fix some escaping and quoting in RUN lines, mainly involving { and <. In two cases quoting of <{ didn't work out, so I changed the grep to check for }> instead. This fixes 7 testcases that were not properly running before. llvm-svn: 52182	2008-06-10 16:04:47 +00:00
Matthijs Kooijman	281711dc95	Remove double pipes in RUN commandlines. This fixes 5 testcases that were not being run properly before. llvm-svn: 52180	2008-06-10 15:11:36 +00:00
Matthijs Kooijman	98322ead14	Remove trailing whitespace after line continuations in test cases to them work. This fixes two test cases that were not being run properly before. llvm-svn: 52179	2008-06-10 15:07:07 +00:00
Matthijs Kooijman	15ab3c5f19	Let some more tests ignore expected output on stderr. Also, use > %t instead of -o %t for output in one test since that also works when %t already exists. This fixes 6 testcases. llvm-svn: 52178	2008-06-10 15:04:14 +00:00
Matthijs Kooijman	b79081c161	Fix some llvm-gcc warnings in testcases, mostly by adding includes or adding declarations. These are the fixes that I was pretty confident about, there are still a lot of other llvm-gcc warnings of which I'm not sure if they can be safely ignored or fixed, without breaking the test case. This fixes 11 testcases. llvm-svn: 52176	2008-06-10 14:37:44 +00:00
Matthijs Kooijman	c638fe5b8b	For all RUN lines starting with "not", redirect stderr to /dev/null so tests don't fail when (expected) error output is produced. This fixes 17 tests. While I was there, I also made all RUN lines of the form "not llvm-as..." a bit more consistent, they now all redirect stderr and stdout to /dev/null and use input redirect to read their input. llvm-svn: 52174	2008-06-10 12:57:32 +00:00
Matthijs Kooijman	82d762a948	Suppress the (stderr) output of -aa-eval, this fixes 5 tests. llvm-svn: 52173	2008-06-10 12:39:15 +00:00
Matthijs Kooijman	4be5c7f83e	Change llvm.exp so it no longer ignores some errors when executing dejagnu tests. This breaks 80 tests in the tree. The interesting part here is that this no longer ignores syntax errors in RUN command lines. Some tests have not been working all the time because of this. The tricky part is that it now also views any stderr output as an error. This can be suppressed in tcl 8.5, but let's not add this dependency. Instead, all testcases should be changed to redirect stderr if they expect stderr output. This holds in particular for lines like: ; RUN: not llvm-as < %s where an error is expected (but I think I can solve this by modifying the not script). Also, compilations resulting in warnings will now also fail (so the warnings should be fixed, disabled or redirected...). I'll continue with fixing the testcases that are broken now. llvm-svn: 52172	2008-06-10 12:28:43 +00:00
Dan Gohman	f5602924ae	Convert several tests to use temporary files instead of redundantly executing the test commands. llvm-svn: 52163	2008-06-10 00:36:41 +00:00
Dan Gohman	9eace09bfa	Fix two more not-grep tests that were missing llvm-dis. llvm-svn: 52159	2008-06-09 22:36:45 +00:00
Dan Gohman	68f8fbdac4	Re-apply 52002, allowing the verifier to accept non-MRV struct return types on functions, with adjustments so that it accepts both new-style aggregate returns and old-style MRV returns, including those with only a single member. llvm-svn: 52157	2008-06-09 21:26:13 +00:00
Duncan Sands	a15ae3d239	Test that prune-eh doesn't make deductions based on bodies of functions with weak linkage. llvm-svn: 52141	2008-06-09 11:28:41 +00:00
Rafael Espindola	feaadb1e05	add support for PIC on linux x86-64 llvm-svn: 52139	2008-06-09 09:52:31 +00:00
Chris Lattner	806f0a8411	lower calls to abs to inline code, PR2337 llvm-svn: 52138	2008-06-09 08:26:51 +00:00
Chris Lattner	7864575654	Fix PR2411, where ip constant prop would propagate the result of a weak function. llvm-svn: 52137	2008-06-09 07:58:07 +00:00
Chris Lattner	4a896996cb	Limit the icmp+phi merging optimization to the cases where it is profitable: don't make i1 phis when it won't be possible to eliminate them. llvm-svn: 52097	2008-06-08 20:52:11 +00:00
Anton Korobeynikov	aed2cbb0a1	Remove invalid test llvm-svn: 52093	2008-06-08 16:59:10 +00:00
Evan Cheng	c7ed1b9258	Speculatively execute a block when the the block is the then part of a triangle shape and it contains a single, side effect free, cheap instruction. The branch is eliminated by adding a select instruction. i.e. Turn BB: %t1 = icmp br i1 %t1, label %BB1, label %BB2 BB1: %t3 = add %t2, c br label BB2 BB2: => BB: %t1 = icmp %t4 = add %t2, c %t3 = select i1 %t1, %t2, %t3 llvm-svn: 52073	2008-06-07 08:52:29 +00:00
Evan Cheng	bc28ef2028	Fix run line. llvm-svn: 52072	2008-06-07 08:40:16 +00:00
Anton Korobeynikov	a9fa994d9b	Testcase for PR2418 llvm-svn: 52047	2008-06-06 16:08:56 +00:00
Dan Gohman	70fe9e347d	Revert 52002. llvm-svn: 52030	2008-06-05 23:57:06 +00:00
Zhou Sheng	d7b035ee2b	Add a test case for opt -instcombine bug fix in revision 52003. llvm-svn: 52004	2008-06-05 14:25:11 +00:00
Matthijs Kooijman	ebf00c0f65	Change the Verifier to support returning first class aggregrates. Add a testcase for functions returning first class aggregrates. llvm-svn: 52002	2008-06-05 14:00:36 +00:00
Zhou Sheng	9c9d852f08	Add a test case for APInt bug fix in r51999. llvm-svn: 52000	2008-06-05 13:42:21 +00:00
Matthijs Kooijman	6e1c286f53	Learn ScalarReplAggregrates how stores and loads of first class aggregrates work and how to replace them into individual values. Also, when trying to replace an aggregrate that is used by load or store with a single (large) integer, don't crash (but don't replace the aggregrate either). Also adds a testcase for both structs and arrays. llvm-svn: 51997	2008-06-05 12:51:53 +00:00
Matthijs Kooijman	775c91b2f5	Let StructRetPromotion check if all if its users are really calls or invokesn, not other instructions. This fixes a crash with the added testcase. llvm-svn: 51992	2008-06-05 08:57:20 +00:00
Matthijs Kooijman	df97b7b4a2	Let StructRetPromotion check if it's users are really calling it and not passing its pointer. Fixes test with added testcase. llvm-svn: 51991	2008-06-05 08:48:32 +00:00
Evan Cheng	e77d6a1a2d	Fix a memcpy lowering bug. Even though the memcpy alignment is smaller than the desired alignment, the frame destination alignment may still be larger than the desired alignment. Don't change its alignment to something smaller. llvm-svn: 51970	2008-06-04 23:37:54 +00:00
Chris Lattner	7e3db1af97	Rewrite a bunch of the CBE's inline asm code, giving it the ability to handle indirect input operands. This fixes PR2407. llvm-svn: 51952	2008-06-04 18:03:28 +00:00
Duncan Sands	5a6c6a92c1	Change packed struct layout so that field sizes are the same as in unpacked structs, only field positions differ. This only matters for structs containing x86 long double or an apint; it may cause backwards compatibility problems if someone has bitcode containing a packed struct with a field of one of those types. The issue is that only 10 bytes are needed to hold an x86 long double: the store size is 10 bytes, but the ABI size is 12 or 16 bytes (linux/ darwin) which comes from rounding the store size up by the alignment. Because it seemed silly not to pack an x86 long double into 10 bytes in a packed struct, this is what was done. I now think this was a mistake. Reserving the ABI size for an x86 long double field even in a packed struct makes things more uniform: the ABI size is now always used when reserving space for a type. This means that developers are less likely to make mistakes. It also makes life easier for the CBE which otherwise could not represent all LLVM packed structs (PR2402). Front-end people might need to adjust the way they create LLVM structs - see following change to llvm-gcc. llvm-svn: 51928	2008-06-04 08:21:45 +00:00
Owen Anderson	3f738eb65b	Testcase for LoopIndexSplit and DomFrontier. llvm-svn: 51916	2008-06-03 18:32:27 +00:00
Dan Gohman	9562f7f0c8	nounwindify. llvm-svn: 51893	2008-06-03 01:21:11 +00:00
Dan Gohman	fbf0f6cf8e	Constant folding for insertvalue and extractvalue. llvm-svn: 51889	2008-06-03 00:15:20 +00:00
Devang Patel	b1798d2be0	Update dom tree. Fix PR 2372. llvm-svn: 51887	2008-06-02 22:52:56 +00:00
Scott Michel	5323d58281	Add necessary 64-bit support so that gcc frontend compiles (mostly). Current issue is operand promotion for setcc/select... but looks like the fundamental stuff is implemented for CellSPU. llvm-svn: 51884	2008-06-02 22:18:03 +00:00
Dan Gohman	5a9c2a3434	Implement CBE support for first-class structs and array values, and insertvalue and extractvalue instructions. First-class array values are not trivial because C doesn't support them. The approach I took here is to wrap all arrays in structs. Feedback is welcome. The 2007-01-15-NamedArrayType.ll test needed to be modified because it has a "not grep" for a string that now exists, because array types now have associated struct types, and those struct types have names. llvm-svn: 51881	2008-06-02 21:30:49 +00:00
Dan Gohman	385b7d76ed	Fix the position of MemOperands in nodes that use variadic_ops in DAGISelEmitter output. This bug was recently uncovered by the addition of patterns for CALL32m and CALL64m, which are nodes that now have both MemOperands and variadic_ops. This bug was especially visible with PIC in various configurations, because the new patterns are matching the indirect call code used in many PIC configurations. llvm-svn: 51877	2008-06-02 17:40:38 +00:00
Wojciech Matyjewicz	06e4c8a420	Fixes PR2395. Looking for a constant in a GEP tail (when the first GEP is longer than the second one) should stop after finding one. Added break instruction guarantees it. It also changes difference between offsets to absolute value of this difference in the condition. llvm-svn: 51875	2008-06-02 17:26:12 +00:00
Owen Anderson	7700de3137	Fix two issues that Eli Friedman pointed out, where would misoptimized code like: char a[200]; init(a, a+200); OR int a[200]; char* b = (char)a; char c = (char*)a; foo(b, c); llvm-svn: 51850	2008-06-01 22:26:26 +00:00
Owen Anderson	d194f76cb4	Test for PR2401 llvm-svn: 51849	2008-06-01 21:55:55 +00:00
Duncan Sands	d14212a3e1	When simplifying a call to a bitcast function, tighten up the conditions for performing the transform when only the function declaration is available: no longer allow turning i32 into i64 for example. Only allow changing between pointer types, and between pointer types and integers of the same size. For return values ptr -> intptr was already allowed; I added ptr -> ptr and intptr -> ptr while there. As shown by a recent objc testcase, changing the way parameters/return values are passed can be fatal when calling code written in assembler that directly manipulates call arguments and return values unless the transform has no impact on the way they are passed at the codegen level. While it is possible to imagine an ABI that treats integers of pointer size differently to pointers, I don't think LLVM supports any so the transform should now be safe while still being useful. llvm-svn: 51834	2008-06-01 07:38:42 +00:00
Chris Lattner	da1e2c8fa3	update this patch to handle an extraneous &1. This should be pulled into the 2.3 release branch. llvm-svn: 51824	2008-05-31 19:50:53 +00:00
Nick Lewycky	1bcd80adf7	Peer through sext/zext when looking for not(cmp). llvm-svn: 51819	2008-05-31 19:01:33 +00:00
Nick Lewycky	b30afdb62b	Add more i1 optimizations. add, sub, mul, s/udiv on i1 are now simplified away. llvm-svn: 51817	2008-05-31 17:59:52 +00:00
Nick Lewycky	cdcdcddc85	Adding i1 is always Xor. llvm-svn: 51816	2008-05-31 17:10:28 +00:00
Chris Lattner	43a47ddd89	Fix the CBE's handling of instructions whose result is an i1. Previously, we did not truncate the value down to i1 with (x&1). This caused a problem when the computation of x was nontrivial, for example, "add i1 1, 1" would return 2 instead of 0. This makes the testcase compile into: ... llvm_cbe_t = (((llvm_cbe_r == 0u) + (llvm_cbe_r == 0u))&1); llvm_cbe_u = (((unsigned int )(bool )llvm_cbe_t)); ... instead of: ... llvm_cbe_t = ((llvm_cbe_r == 0u) + (llvm_cbe_r == 0u)); llvm_cbe_u = (((unsigned int )(bool )llvm_cbe_t)); ... This fixes a miscompilation of mediabench/adpcm/rawdaudio/rawdaudio and 403.gcc with the CBE, regressions from LLVM 2.2. Tanya, please pull this into the release branch. llvm-svn: 51813	2008-05-31 09:23:55 +00:00
Dan Gohman	ac5c3382fe	IR, bitcode reader, bitcode writer, and asmparser changes to insertvalue and extractvalue to use constant indices instead of Value* indices. And begin updating LangRef.html. There's definately more to come here, but I'm checking this basic support in now to make it available to people who are interested. llvm-svn: 51806	2008-05-31 00:58:22 +00:00
Mikhail Glushenkov	9db02580c5	Fix the -opt switch and add a test case for it. llvm-svn: 51784	2008-05-30 19:56:27 +00:00
Mikhail Glushenkov	4d71bea2c9	Fix: 'sink' handling was broken. llvm-svn: 51750	2008-05-30 06:23:29 +00:00
Nick Lewycky	a02482cfaa	Unbreak this test. llvm-svn: 51726	2008-05-30 05:02:37 +00:00
Dan Gohman	aa8fcd5657	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Dan Gohman	e256337a1a	Expand small memmovs using inline code. Set the X86 threshold for expanding memmove to a more plausible value, now that it's actually being used. llvm-svn: 51696	2008-05-29 19:42:22 +00:00
Anton Korobeynikov	eb3cd5e822	For PR1338: Rename test dirs llvm-svn: 51695	2008-05-29 19:17:15 +00:00
Owen Anderson	2a0090d9bc	Move these tests into the proper directory. llvm-svn: 51685	2008-05-29 16:30:29 +00:00
Owen Anderson	bd3940abc7	Replace the old ADCE implementation with a new one that more simply solves the one case that ADCE catches that normal DCE doesn't: non-induction variable loop computations. This implementation handles this problem without using postdominators. llvm-svn: 51668	2008-05-29 08:45:13 +00:00
Evan Cheng	04c0915a2f	Implement vector shift up / down and insert zero with ps{rl}lq / ps{rl}ldq. llvm-svn: 51667	2008-05-29 08:22:04 +00:00
Evan Cheng	f2e38956ff	Add nounwind. llvm-svn: 51665	2008-05-29 07:09:24 +00:00
Evan Cheng	cd45b11bc1	Fix PR2289: vr defined by multiple implicit_def as result of coalescing. llvm-svn: 51648	2008-05-28 17:40:10 +00:00
Evan Cheng	591b57edd6	Teach local register allocator to deal with landing pad MBB's. llvm-svn: 51647	2008-05-28 17:22:32 +00:00
Chris Lattner	7a7da4f9c3	Implement PR2370: memmove(x,x,size) -> noop. llvm-svn: 51636	2008-05-28 05:30:41 +00:00
Dan Gohman	568685ffa7	Specify a target so that this tests tests what it's intended to test. llvm-svn: 51600	2008-05-27 17:55:57 +00:00
Dan Gohman	3ba9d77adb	Make this test independent of the target-triple; the stack alignment is specifically what this test depends on. llvm-svn: 51599	2008-05-27 17:44:23 +00:00
Nick Lewycky	0ba4adf4ef	Whoops -- forgot PR reference on this test. llvm-svn: 51569	2008-05-26 20:23:33 +00:00
Nick Lewycky	c096899392	The Linux ABI emits an extra "movl %esp, %ebp" in function prologue and sometimes a "mov %ebp, %esp" in the epilogue. Force these tests that rely on counting 'mov' to use i686-apple-darwin8.8.0 where they were written. llvm-svn: 51568	2008-05-26 20:18:56 +00:00
Nick Lewycky	7116ad5a18	Use {} instead of "" in RUN lines. llvm-svn: 51561	2008-05-26 01:27:08 +00:00
Nick Lewycky	f24743a6bb	Don't treat values as signed when looking at loop steppings in HowForToNonZero. llvm-svn: 51560	2008-05-25 23:43:32 +00:00
Nick Lewycky	744dad8004	"ret (constexpr)" can't be folded into a Constant. Add a method to Analysis/ConstantFolding to fold ConstantExpr's, then make instcombine use it to try to use targetdata to fold constant expressions on void instructions. Also extend the icmp(inttoptr, inttoptr) folding to handle the case where int size != ptr size. llvm-svn: 51559	2008-05-25 20:56:15 +00:00
Chris Lattner	3def8b4e53	Fix a serious brain-o. Obviously no-one reviewed my patch :( This fixes PR2359 llvm-svn: 51536	2008-05-24 04:06:28 +00:00
Chris Lattner	bde5fd685d	Fix PR2358 by resolving calls with undef arguments to overdefined. llvm-svn: 51535	2008-05-24 03:59:33 +00:00
Evan Cheng	e5e0b4660d	Eliminate x86.sse2.punpckh.qdq and x86.sse2.punpckl.qdq. llvm-svn: 51533	2008-05-24 02:56:30 +00:00
Evan Cheng	564238c841	Eliminate x86.sse2.movs.d, x86.sse2.shuf.pd, x86.sse2.unpckh.pd, and x86.sse2.unpckl.pd intrinsics. These will be lowered into shuffles. llvm-svn: 51531	2008-05-24 02:14:05 +00:00
Evan Cheng	e9c1c96f7b	New loadl_pd and loadh_pd tests. llvm-svn: 51525	2008-05-24 00:10:02 +00:00
Evan Cheng	365e0f3932	Autoupgrade x86.sse2.loadh.pd and x86.sse2.loadl.pd. llvm-svn: 51523	2008-05-24 00:08:39 +00:00
Dan Gohman	abbe3d47ab	Don't silently truncate array extents to 32 bits. llvm-svn: 51505	2008-05-23 21:40:55 +00:00
Evan Cheng	4f660778f0	Use movlps / movhps to modify low / high half of 16-byet memory location. llvm-svn: 51501	2008-05-23 21:23:16 +00:00
Dan Gohman	2412469191	Remove lingering references to .llx and .tr in the tests. llvm-svn: 51500	2008-05-23 21:15:35 +00:00
Dan Gohman	6cc0b4f262	Use PMULDQ for v2i64 multiplies when SSE4.1 is available. And add load-folding table entries for PMULDQ and PMULLD. llvm-svn: 51489	2008-05-23 17:49:40 +00:00
Matthijs Kooijman	cf417144f6	Restucture a part of the SimplifyCFG pass and include a testcase. The SimplifyCFG pass looks at basic blocks that contain only phi nodes, followed by an unconditional branch. In a lot of cases, such a block (BB) can be merged into their successor (Succ). This merging is performed by TryToSimplifyUncondBranchFromEmptyBlock. It does this by taking all phi nodes in the succesor block Succ and expanding them to include the predecessors of BB. Furthermore, any phi nodes in BB are moved to Succ and expanded to include the predecessors of Succ as well. Before attempting this merge, CanPropagatePredecessorsForPHIs checks to see if all phi nodes can be properly merged. All functional changes are made to this function, only comments were updated in TryToSimplifyUncondBranchFromEmptyBlock. In the original code, CanPropagatePredecessorsForPHIs looks quite convoluted and more like stack of checks added to handle different kinds of situations than a comprehensive check. In particular the first check in the function did some value checking for the case that BB and Succ have a common predecessor, while the last check in the function simply rejected all cases where BB and Succ have a common predecessor. The first check was still useful in the case that BB did not contain any phi nodes at all, though, so it was not completely useless. Now, CanPropagatePredecessorsForPHIs is restructured to to look a lot more similar to the code that actually performs the merge. Both functions now look at the same phi nodes in about the same order. Any conflicts (phi nodes with different values for the same source) that could arise from merging or moving phi nodes are detected. If no conflicts are found, the merge can happen. Apart from only restructuring the checks, two main changes in functionality happened. Firstly, the old code rejected blocks with common predecessors in most cases. The new code performs some extra checks so common predecessors can be handled in a lot of cases. Wherever common predecessors still pose problems, the blocks are left untouched. Secondly, the old code rejected the merge when values (phi nodes) from BB were used in any other place than Succ. However, it does not seem that there is any situation that would require this check. Even more, this can be proven. Consider that BB is a block containing of a single phi node "%a" and a branch to Succ. Now, since the definition of %a will dominate all of its uses, BB will dominate all blocks that use %a. Furthermore, since the branch from BB to Succ is unconditional, Succ will also dominate all uses of %a. Now, assume that one predecessor of Succ is not dominated by BB (and thus not dominated by Succ). Since at least one use of %a (but in reality all of them) is reachable from Succ, you could end up at a use of %a without passing through it's definition in BB (by coming from X through Succ). This is a contradiction, meaning that our original assumption is wrong. Thus, all predecessors of Succ must also be dominated by BB (and thus also by Succ). This means that moving the phi node %a from BB to Succ does not pose any problems when the two blocks are merged, and any use checks are not needed. llvm-svn: 51478	2008-05-23 09:09:41 +00:00
Nick Lewycky	6a16ace643	Constant integer vectors may also be negated. llvm-svn: 51476	2008-05-23 04:54:45 +00:00
Nick Lewycky	bd2da8098d	Revert X + X --> X * 2 optz'n which pessimizes heavily on x86. llvm-svn: 51474	2008-05-23 04:34:58 +00:00
Nick Lewycky	427209006f	Implement X + X for vectors. llvm-svn: 51472	2008-05-23 04:14:51 +00:00
Nick Lewycky	e62259c369	Fix a recently added optimization to not crash on vectors. llvm-svn: 51471	2008-05-23 03:26:47 +00:00
Dan Gohman	67e1a58e22	Generalize the new code in instcombine's ComputeNumSignBits for handling and/or to handle more cases (such as this add-sitofp.ll testcase), and port it to selectiondag's ComputeNumSignBits. llvm-svn: 51469	2008-05-23 02:28:01 +00:00
Dan Gohman	c7007dd0dc	Make structs and arrays first-class types, and add assembly and bitcode support for the extractvalue and insertvalue instructions and constant expressions. Note that this does not yet include CodeGen support. llvm-svn: 51468	2008-05-23 01:55:30 +00:00
Evan Cheng	097e95b1f7	Bug: rcpps can only folds a load if the address is 16-byte aligned. Fixed many 'ps' load folding patterns in X86InstrSSE.td which are missing the proper alignment checks. Also fixed some 80 col. violations. llvm-svn: 51462	2008-05-23 00:37:07 +00:00
Evan Cheng	dc3a3d3a2c	Add a couple of test cases. llvm-svn: 51441	2008-05-22 21:19:19 +00:00
Evan Cheng	d1373cd497	Add missing patterns. llvm-svn: 51435	2008-05-22 18:56:56 +00:00
Chris Lattner	6a45cf9dd6	Add support for multiple-return values in inline asm. This should get inline asm working as well as it did previously with the CBE with the new MRV support for inline asm. llvm-svn: 51420	2008-05-22 06:19:37 +00:00
Chris Lattner	477239c56d	testcase for PR2267 llvm-svn: 51408	2008-05-22 04:45:22 +00:00
Evan Cheng	8e02953de8	Fix PR2343. An interesting coalescer bug. BB1: vr1025 = copy vr1024 .. BB2: vr1024 = op = op vr1025 <loop eventually branch back to BB1> Even though vr1025 is copied from vr1024, it's not safe to coalesced them since live range of vr1025 intersects the def of vr1024. This happens when vr1025 is assigned the value of the previous iteration of vr1024 in the loop. llvm-svn: 51394	2008-05-21 22:34:12 +00:00
Gabor Greif	4a39cea7e7	resurrect lost tests by renaming them to not end with .tr llvm-svn: 51375	2008-05-21 14:48:24 +00:00
Gabor Greif	b03785f0cd	Eliminate questionable syntax for stdin redirection. This probably also speeds things up a bit. llvm-svn: 51357	2008-05-20 22:07:21 +00:00
Chris Lattner	821dc30131	Fix PR2346 by marking vaarg as volatile so that licm doesn't try to hoist them. llvm-svn: 51356	2008-05-20 22:05:28 +00:00
Dan Gohman	7d78d53d2a	Oops, commit the version of this test that actually works. llvm-svn: 51351	2008-05-20 21:19:36 +00:00
Dan Gohman	b48d4a75f6	Port SelectionDAG's ComputeNumSignBits-using code to instcombine, now that instcombine also has ComputeNumSignBits. llvm-svn: 51350	2008-05-20 21:01:12 +00:00
Gabor Greif	807c2df887	sabre brings to my attention that the 'tr' suffix is also obsolete llvm-svn: 51349	2008-05-20 21:00:03 +00:00
Gabor Greif	d8a4dbb5da	Rename the last test with .llx extension to .ll, resolve duplicate test by renaming to isnan2. Now that no test has llx ending there is no need to search for them from dg.exp too. llvm-svn: 51328	2008-05-20 19:52:04 +00:00
Evan Cheng	55e3957c96	More local spiller complexity! If local spiller optimization turns some instruction into an identity copy, it will be removed. If the output register happens to be dead (and source is obviously killed), transfer the kill / dead information to last use / def in the same MBB. llvm-svn: 51306	2008-05-20 08:13:21 +00:00
Evan Cheng	408425f0e0	Don't spill dead def. llvm-svn: 51305	2008-05-20 08:10:37 +00:00
Chris Lattner	b387fd90fc	Teach instcombine 4 new xforms: (add (sext x), cst) --> (sext (add x, cst')) (add (sext x), (sext y)) --> (sext (add int x, y)) (add double (sitofp x), fpcst) --> (sitofp (add int x, intcst)) (add double (sitofp x), (sitofp y)) --> (sitofp (add int x, y)) This generally reduces conversions. For example MiBench/telecomm-gsm gets these simplifications: HACK2: %tmp67.i142.i.i = sext i16 %tmp6.i141.i.i to i32 ; <i32> [#uses=1] %tmp23.i139.i.i = sext i16 %tmp2.i138.i.i to i32 ; <i32> [#uses=1] %tmp8.i143.i.i = add i32 %tmp67.i142.i.i, %tmp23.i139.i.i ; <i32> [#uses=3] HACK2: %tmp67.i121.i.i = sext i16 %tmp6.i120.i.i to i32 ; <i32> [#uses=1] %tmp23.i118.i.i = sext i16 %tmp2.i117.i.i to i32 ; <i32> [#uses=1] %tmp8.i122.i.i = add i32 %tmp67.i121.i.i, %tmp23.i118.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i190.i = sext i16 %tmp6.i.i189.i to i32 ; <i32> [#uses=1] %tmp23.i.i187.i = sext i16 %tmp2.i.i186.i to i32 ; <i32> [#uses=1] %tmp8.i.i191.i = add i32 %tmp67.i.i190.i, %tmp23.i.i187.i ; <i32> [#uses=3] HACK2: %tmp67.i173.i.i.i = sext i16 %tmp6.i172.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i170.i.i.i = sext i16 %tmp2.i169.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i174.i.i.i = add i32 %tmp67.i173.i.i.i, %tmp23.i170.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i152.i.i.i = sext i16 %tmp6.i151.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i149.i.i.i = sext i16 %tmp2.i148.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i153.i.i.i = add i32 %tmp67.i152.i.i.i, %tmp23.i149.i.i.i ; <i32> [#uses=3] HACK2: %tmp67.i.i.i.i = sext i16 %tmp6.i.i.i.i to i32 ; <i32> [#uses=1] %tmp23.i.i5.i.i = sext i16 %tmp2.i.i.i.i to i32 ; <i32> [#uses=1] %tmp8.i.i7.i.i = add i32 %tmp67.i.i.i.i, %tmp23.i.i5.i.i ; <i32> [#uses=3] This also fixes a bug in ComputeNumSignBits handling select and makes it more aggressive with and/or. llvm-svn: 51302	2008-05-20 05:46:13 +00:00
Dan Gohman	7681889f75	Run vortex-bug as x86-64, which is what the original bug was triggered on. llvm-svn: 51289	2008-05-20 00:54:39 +00:00
Devang Patel	9f385d71c2	Do not erase induction variable increment if it is used outside the loop. llvm-svn: 51280	2008-05-19 22:23:55 +00:00
Chris Lattner	63c384df1e	convert fptosi(sitofp x) -> x if the fp value has enough bits in its mantissa to accurately represent the integer. This triggers 9 times in 471.omnetpp, though 8 of those seem to be inlined from the same place. llvm-svn: 51271	2008-05-19 20:25:04 +00:00
Chris Lattner	1435b94f62	Fold FP comparisons where one operand is converted from an integer type and the other operand is a constant into integer comparisons. This happens surprisingly frequently (e.g. 10 times in 471.omnetpp), which are things like this: %tmp8283 = sitofp i32 %tmp82 to double %tmp1013 = fcmp ult double %tmp8283, 0.0 Clearly comparing tmp82 against i32 0 is cheaper here. this also triggers 8 times in gobmk, including this one: %tmp375376 = sitofp i32 %tmp375 to double %tmp377 = fcmp ogt double %tmp375376, 8.150000e+01 which is comparing an integer against 81.5 :). llvm-svn: 51268	2008-05-19 20:18:56 +00:00
Chris Lattner	510a6b249c	be more aggressive about transforming add -> or when the operands have no intersecting bits. This triggers all over the place, for example in lencode, with adds of stuff like: %tmp580 = mul i32 %tmp579, 2 %tmp582 = and i32 %b8, 1 and %tmp28 = shl i32 %abs.i, 1 %sign.0 = select i1 %tmp23, i32 1, i32 0 and %tmp344 = shl i32 %tmp343, 2 %tmp346 = and i32 %tmp96, 3 etc. llvm-svn: 51263	2008-05-19 20:01:56 +00:00

1 2 3 4 5 ...

5497 Commits