llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Dan Gohman	a5a50a8853	Fix embedded CRLF characters. llvm-svn: 54125	2008-07-27 18:37:58 +00:00
Nate Begeman	1396e3d206	Fix test RUN line llvm-svn: 54040	2008-07-25 19:08:59 +00:00
Nate Begeman	5523d40e4b	Disable mov{L, LP, HP, HLP, *DUP} shuffles for mmx mmx needs its own fancy shuffle logic based on unpack; for now we get correct but awful code. Also commit Mon Ping's VSETCC patch llvm-svn: 54039	2008-07-25 19:05:58 +00:00
Dan Gohman	6d394147f2	This test needs -aggressive-remat enabled. llvm-svn: 54015	2008-07-25 15:25:32 +00:00
Evan Cheng	d4eb684258	Teach ARM isLegalAddressingMode to handle unknown type without crashing. This fixes pr2589. llvm-svn: 54004	2008-07-25 00:55:17 +00:00
Dan Gohman	680e1bd958	Enable rematerialization of constants using AliasAnalysis::pointsToConstantMemory, and knowledge of PseudoSourceValues. This unfortunately isn't sufficient to allow constants to be rematerialized in PIC mode -- the extra indirection is a complication. llvm-svn: 54000	2008-07-25 00:02:30 +00:00
Dan Gohman	1ecbcecdf3	Put the LICM of constant GlobalVariables, introduced in r53945, under a command-line option, and disable it by default. It introduced performance regressions because CodeGen is currently not able to remat such loads. llvm-svn: 53997	2008-07-24 23:57:25 +00:00
Dan Gohman	da5c2b50b8	Add target triples so these tests behave as expected on non-darwin hosts. llvm-svn: 53991	2008-07-24 18:08:01 +00:00
Evan Cheng	9c8cac5fd7	Fix a catastrophic PPC64 ABI bug: i32 operands which are passed in memory (all of the parameter registers are used) are loaded from sp offsets that were off by 4. llvm-svn: 53979	2008-07-24 08:17:07 +00:00
Evan Cheng	055f5e6ed0	New test case. llvm-svn: 53971	2008-07-24 00:22:05 +00:00
Chris Lattner	8eb899ecbc	"Allow LICM to sink or lift loads from constant memory. Also add a test case for this. This allows instructions like loads from global variables declared to be constant to be moved out of loops." Patch by Stefanus Du Toit! llvm-svn: 53945	2008-07-23 05:06:28 +00:00
Dan Gohman	6564581be0	Enable first-class aggregates support. Remove the GetResultInst instruction. It is still accepted in LLVM assembly and bitcode, where it is now auto-upgraded to ExtractValueInst. Also, remove support for return instructions with multiple values. These are auto-upgraded to use InsertValueInst instructions. The IRBuilder still accepts multiple-value returns, and auto-upgrades them to InsertValueInst instructions. llvm-svn: 53941	2008-07-23 00:34:11 +00:00
Evan Cheng	20c9cdbe69	Fix PR2485: do all 4-element SSE shuffles in max. of 2 shuffle instructions. Based on patch by Nicolas Capens. llvm-svn: 53939	2008-07-23 00:22:17 +00:00
Duncan Sands	550e0de239	LegalizeTypes support for VSETCC. Fixes PR2575. llvm-svn: 53938	2008-07-22 23:54:03 +00:00
Evan Cheng	1aa928a8e6	Fix pr2566: incorrect assumption about bit_convert. It doesn't not have to output a vector value. Patch by Nicolas Capens! llvm-svn: 53932	2008-07-22 20:42:56 +00:00
Evan Cheng	901d469e05	Fix PR2574: implement v2f32 scalar_to_vector. llvm-svn: 53927	2008-07-22 18:39:19 +00:00
Dan Gohman	693339b859	Add the PR number to the test. llvm-svn: 53880	2008-07-21 21:50:25 +00:00
Dan Gohman	8f7b6c8113	Fix a bug in LSR's dead-PHI cleanup. If a PHI has a def-use chain that leads into a cycle involving a different PHI, LSR got stuck running around that cycle looking for the original PHI. To avoid this, keep track of visited PHIs and stop searching if we see one more than once. This fixes PR2570. llvm-svn: 53879	2008-07-21 21:45:02 +00:00
Wojciech Matyjewicz	eea926ec20	Fix PR2088. Use modulo linear equation solver to compute loop iteration count. llvm-svn: 53810	2008-07-20 15:55:14 +00:00
Bill Wendling	98b6e63176	Fix for first part of PR2562. Generate the "pinsrw" instruction for inserts into v4i16 vectors. llvm-svn: 53807	2008-07-20 02:32:23 +00:00
Nick Lewycky	13166526c5	XFAIL this test. llvm-svn: 53793	2008-07-19 15:52:06 +00:00
Wojciech Matyjewicz	852a8f47f1	While testing particular algorithms to compute loop iteration count the brute force evaluation (ComputeIterationCountExhaustively) should be turned off. It doesn't apply to trip-count2.ll because this file tests the brute force evaluation. The test for PR2364 (2008-05-25-NegativeStepToZero.ll) currently fails showing that the patch for this bug doesn't work. I'll fix it in a few hours with a patch for PR2088. llvm-svn: 53792	2008-07-19 13:26:15 +00:00
Anton Korobeynikov	6f354293fe	Testcase for PR2549 llvm-svn: 53785	2008-07-19 06:31:12 +00:00
Duncan Sands	ef45c602b6	Softfloat support for FDIV. Patch by Richard Pennington. llvm-svn: 53773	2008-07-18 21:18:48 +00:00
Dan Gohman	b97c076af4	In the CBackend, use casts to force integer add, subtract, and multiply to be done as unsigned, so that they have well defined behavior on overflow. This fixes PR2408. llvm-svn: 53767	2008-07-18 18:43:12 +00:00
Evan Cheng	d26080487b	Subreg live interval valno may not have a corresponding def machineinstr since it's less precise. llvm-svn: 53734	2008-07-17 19:48:53 +00:00
Evan Cheng	48b2f3dfe9	Add nounwind. llvm-svn: 53733	2008-07-17 19:48:04 +00:00
Dan Gohman	8981962672	Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk replacement of multiple values. This is slightly more efficient than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically could be optimized even further. However, an important property of this new function is that it handles the case where the source value set and destination value set overlap. This makes it feasible for isel to use SelectNodeTo in many very common cases, which is advantageous because SelectNodeTo avoids a temporary node and it doesn't require CSEMap updates for users of values that don't change position. Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to handle operand lists more efficiently, and to correctly handle a number of corner cases to which its new wider use exposes it. This commit also includes a change to the encoding of post-isel opcodes in SDNodes; now instead of being sandwiched between the target-independent pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel opcodes are now represented as negative values. This makes it possible to test if an opcode is pre-isel or post-isel without having to know the size of the current target's post-isel instruction set. These changes speed up llc overall by 3% and reduce memory usage by 10% on the InstructionCombining.cpp testcase with -fast and -regalloc=local. llvm-svn: 53728	2008-07-17 19:10:17 +00:00
Duncan Sands	c3331602f9	LegalizeTypes support for what seems to be the only missing ppc long double operations: FNEG and FP_EXTEND. llvm-svn: 53723	2008-07-17 17:35:14 +00:00
Duncan Sands	778e45e748	Turn LegalizeTypes back off again for the moment: it is breaking Darwin bootstrap due to missing functionality. llvm-svn: 53721	2008-07-17 17:06:03 +00:00
Matthijs Kooijman	5ec5e264e4	Make GlobalOpt preserve address spaces when scalar replacing aggregate globals. llvm-svn: 53716	2008-07-17 11:59:53 +00:00
Chris Lattner	eccd57d118	Fix PR2553 llvm-svn: 53715	2008-07-17 06:07:20 +00:00
Duncan Sands	3448d4087f	Add support for promoting and expanding AssertZext and AssertSext. Needed when passing huge integer parameters with the zeroext or signext attributes. llvm-svn: 53684	2008-07-16 16:03:07 +00:00
Duncan Sands	a8b538544a	Test passing of integer parameters for integers of all sizes from i1 to i256. The code is not always that great, for example (x86) movw %di, %ax movw %ax, i17_s where the store could be directly from %di. llvm-svn: 53677	2008-07-16 13:37:36 +00:00
Duncan Sands	be15f51092	Test codegen of loads and stores of all integer sizes from i1 to i256. The generated code is like one huge bug report of things that the DAG combiner fails to simplify! llvm-svn: 53676	2008-07-16 13:10:20 +00:00
Matthijs Kooijman	c05651e3ce	Add a few cases to instcombine's extractvalue testcase. llvm-svn: 53675	2008-07-16 12:57:25 +00:00
Matthijs Kooijman	0625e0fda6	Un-XFAIL multdeadretval, since instcombine now properly handles the mess deadargelim leaves behind :-) llvm-svn: 53674	2008-07-16 12:56:52 +00:00
Duncan Sands	b2e1ddbd0b	Turn on LegalizeTypes by default. llvm-svn: 53671	2008-07-16 11:36:51 +00:00
Duncan Sands	35d3e774ed	The atomic.cmp.swap promotion logic is wrong: it simply does the atomic.cmp.swap on the larger type, which means it blows away whatever is sitting in the bytes just after the memory location, i.e. causes a buffer overflow. This really requires target specific code, which is why LegalizeTypes doesn't try to handle this case generically. The existing (wrong) code in LegalizeDAG will go away automatically once the type legalization code is removed from LegalizeDAG so I'm leaving it there for the moment. Meanwhile, don't test for this feature. llvm-svn: 53669	2008-07-16 08:09:48 +00:00
Evan Cheng	7218339189	Fix PR2296. Do not transform x86_sse2_storel_dq into a full-width store. llvm-svn: 53666	2008-07-16 07:28:14 +00:00
Matthijs Kooijman	45140a0497	XFAIL the multdeadretval test for now, I will be fixing instcombine to make it work again tomorrow. llvm-svn: 53614	2008-07-15 16:05:09 +00:00
Duncan Sands	7ca2df2319	LegalizeTypes support for fabs on ppc long double. llvm-svn: 53613	2008-07-15 15:02:44 +00:00
Matthijs Kooijman	48fd953b49	Remove a few tests which no longer hold for deadargelim (since it is now allowed to canonicalize return values). Add a test that checks if return value and function attributes are not removed. llvm-svn: 53612	2008-07-15 14:57:01 +00:00
Matthijs Kooijman	21162f1db9	Add a testcase for the canonicalizations now performed by deadargelim. llvm-svn: 53611	2008-07-15 14:42:58 +00:00
Matthijs Kooijman	f940585c1c	Make deadargelim a bit less smart, so it doesn't choke on nested structs as return values that are still (partially) live. Instead of updating all uses of a call instruction after removing some elements, it now just rebuilds the original struct (With undef gaps where the unused values were) and leaves it to instcombine to clean this up. The added testcase still fails currently, but this is due to instcombine which isn't good enough yet. I will fix that part next. llvm-svn: 53608	2008-07-15 14:03:10 +00:00
Matthijs Kooijman	02ffbaf305	Fix typo. llvm-svn: 53605	2008-07-15 13:15:10 +00:00
Duncan Sands	58eb5e35da	LegalizeTypes support for promotion of bswap. In LegalizeDAG the value is zero-extended to the new type before byte swapping. It doesn't matter how the extension is done since the new bits are shifted off anyway after the swap, so extend by any old rubbish bits. This results in the final assembler for the testcase being one line shorter. llvm-svn: 53604	2008-07-15 10:18:22 +00:00
Duncan Sands	710be60c23	LegalizeTypes support for promotion of SIGN_EXTEND_INREG. llvm-svn: 53603	2008-07-15 10:14:24 +00:00
Chris Lattner	ef7178406b	Reimplement LinkFunctionProtos in terms of GetLinkageResult. This fixes the second half of link-global-to-func.ll and causes some minor changes in messages. There are two TODOs here. First, this causes a regression in 2008-07-06-AliasWeakDest.ll, which is now failing (so I xfailed it). Anton, I would really appreciate it if you could take a look at this. It should be a matter of adding proper alias support to GetLinkageResult, and was probably already a latent bug that would manifest with globals. The second todo is to reimplement LinkAlias in the same pattern as function and global linking. This should be pretty straight-forward for someone who knows aliases, but isn't a requirement for correctness. llvm-svn: 53548	2008-07-14 07:23:24 +00:00
Chris Lattner	d31cacb5d6	implement linking of globals to functions, in one direction (replacing a function with a global). This is needed when building llvm itself with LTO on darwin, because of the EXPLICIT_SYMBOL hack in lib/system/DynamicLibrary.cpp. Implementation of linking the other way will need to wait for a cleanup of LinkFunctionProtos. llvm-svn: 53546	2008-07-14 06:49:45 +00:00
Chris Lattner	b786d147c9	Fix a bunch of bugs handling vector compare constant expressions, fixing PR2317. llvm-svn: 53544	2008-07-14 05:17:31 +00:00
Chris Lattner	14faada3a3	Fix PR2506 by being a bit more careful about reverse fact propagation when disproving a condition. This actually compiles the existing testcase (udiv_select_to_select_shift) to: define i64 @test(i64 %X, i1 %Cond) { entry: %divisor1.t = lshr i64 %X, 3 ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %divisor1.t, %quotient2 ; <i64> [#uses=1] ret i64 %sum } instead of: define i64 @test(i64 %X, i1 %Cond) { entry: %quotient1.v = select i1 %Cond, i64 3, i64 4 ; <i64> [#uses=1] %quotient1 = lshr i64 %X, %quotient1.v ; <i64> [#uses=1] %quotient2 = lshr i64 %X, 3 ; <i64> [#uses=1] %sum = add i64 %quotient1, %quotient2 ; <i64> [#uses=1] ret i64 %sum } llvm-svn: 53534	2008-07-14 00:15:52 +00:00
Chris Lattner	3444f4d4c4	Fix mishandling of the infinite loop case when merging two blocks. This fixes PR2540. llvm-svn: 53533	2008-07-13 22:23:11 +00:00
Nick Lewycky	df9e9f0b0e	Stop creating extraneous smax/umax in SCEV. This removes a regression where we started complicating many loops ('for' loops, in fact). llvm-svn: 53508	2008-07-12 07:41:32 +00:00
Nick Lewycky	3fb5816774	Enhance analysis of srem. Remove dead code analyzing urem. 'urem' of power-of-2 is canonicalized to an 'and' instruction. llvm-svn: 53506	2008-07-12 05:04:38 +00:00
Evan Cheng	05e5317cab	Fix PR2536: a nasty spiller bug. If a two-address instruction uses a register but the use portion of its live range is not part of its liveinterval, it must be defined by an implicit_def. In that case, do not spill the use. e.g. 8 %reg1024<def> = IMPLICIT_DEF 12 %reg1024<def> = INSERT_SUBREG %reg1024<kill>, %reg1025, 2 The live range [12, 14) are not part of the r1024 live interval since it's defined by an implicit def. It will not conflicts with live interval of r1025. Now suppose both registers are spilled, you can easily see a situation where both registers are reloaded before the INSERT_SUBREG and both target registers that would overlap. llvm-svn: 53503	2008-07-12 01:56:02 +00:00
Duncan Sands	52f1dbf139	Port a shift-by-1 optimization from LegalizeDAG: it was presumably added after the rest of the code was copied to LegalizeTypes. llvm-svn: 53459	2008-07-11 16:54:57 +00:00
Nick Lewycky	8cd0f2058e	Add another optimization from PR2330. Also catch some missing cases that are similar. llvm-svn: 53451	2008-07-11 07:20:53 +00:00
Bill Wendling	9f17caa9a9	The frame address on an x86-64 box needs to be offset by -8, not -4. llvm-svn: 53450	2008-07-11 07:18:52 +00:00
Chris Lattner	16b8ae98c1	Fix folding of icmp's of i1 where the comparison is signed. The code was using the algorithm for folding unsigned comparisons which is completely wrong. This has been broken since the signless types change. llvm-svn: 53444	2008-07-11 04:20:58 +00:00
Chris Lattner	f3f6b6d7af	Fix a bogus optimization: folding (slt (zext i1 A to i32), 1) -> (slt i1 A, true) This cause a regression in InstCombine/JavaCompare, which was doing the right thing on accident. To handle the missed case, generalize the comparisons based on masked bits a little bit to handle comparisons against the max value. For example, we can now xform (slt i32 (and X, 4), 4) -> (setne i32 (and X, 4), 4) llvm-svn: 53443	2008-07-11 04:09:09 +00:00
Chris Lattner	43a2b1b16d	make this condition more precise. llvm-svn: 53442	2008-07-11 03:54:57 +00:00
Chris Lattner	9dff6fbe58	Implement PR2538 llvm-svn: 53438	2008-07-11 00:30:06 +00:00
Bill Wendling	3be8dca83f	Put CPPBackend tests into their own directory and run them only if they're supported. llvm-svn: 53427	2008-07-10 22:35:32 +00:00
Chris Lattner	5f3c587276	Fix an altivec constant miscompilation that Duncan found through his work on legalizetypes. llvm-svn: 53410	2008-07-10 16:33:38 +00:00
Matthijs Kooijman	ca5124a630	Restructure dead argument elimination, try #3 :-) Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). This version fixed a few more bugs and was cleaned up a bit. It now passes all of LLVM's testing, and should still pass SPEC2006. There is still a minor bug with regard to returning nested structs. Since there is currently nothing that emits such IR, I will fix that in a seperate commit (partly because it requires a non-trivial fix). llvm-svn: 53400	2008-07-10 10:24:08 +00:00
Nick Lewycky	26ccb8e9a8	Fix overzealous optimization. Thanks to Duncan Sands for pointing out my error! llvm-svn: 53393	2008-07-10 05:51:40 +00:00
Evan Cheng	02a618dc56	Fix for PR2472. Use movss to set lower 32-bits of a zero XMM vector. llvm-svn: 53386	2008-07-10 01:08:23 +00:00
Chris Lattner	563d2c9fac	Fix a case where vector comparison constant folding would cause an infinite recursion. part of PR2529 llvm-svn: 53383	2008-07-10 00:29:28 +00:00
Chris Lattner	4fbada0bef	elementwise comparison of vector constants was completely wrong. Fix it for PR2529 llvm-svn: 53380	2008-07-10 00:08:17 +00:00
Anton Korobeynikov	f710ada483	Testcase for PR2024 llvm-svn: 53327	2008-07-09 14:09:41 +00:00
Nick Lewycky	6341c5a7ec	Fold (a < 8) && (b < 8) into (a\|b) < 8 for unsigned less or greater than. llvm-svn: 53282	2008-07-09 07:29:11 +00:00
Nick Lewycky	38fa84fa12	Fold ((1 << a) & 1) to (a == 0). llvm-svn: 53276	2008-07-09 05:20:13 +00:00
Chris Lattner	1a2c55201e	Fix a broken test. Neither load is eliminable without changing the CFG. llvm-svn: 53273	2008-07-09 05:01:02 +00:00
Nick Lewycky	2a6469c9a5	Reduce x - y to -y when we know the 'x' part will get masked off anyways. llvm-svn: 53271	2008-07-09 04:32:37 +00:00
Devang Patel	2b56d5281d	If loop induction variable's start value is less then its exit value then do not split the loop. llvm-svn: 53265	2008-07-09 00:12:01 +00:00
Dale Johannesen	fee6f32586	Testcase for debug info from data-only files. This one is x86-32-Darwin specific. llvm-svn: 53255	2008-07-08 21:57:56 +00:00
Chris Lattner	12ebc344e1	'Optimize' test llvm-svn: 53242	2008-07-08 18:33:33 +00:00
Chris Lattner	7f0adf0b34	new testcase for PR2496 llvm-svn: 53239	2008-07-08 17:18:05 +00:00
Duncan Sands	a6a427cb61	Testcase for PR2520. llvm-svn: 53230	2008-07-08 10:11:36 +00:00
Chris Lattner	d4bcc9011b	Fix three bugs: 1) evaluate [v]fcmp true/false with undefs to true or false instead of undef. 2) fix vector comparisons with undef to return a vector result instead of i1 3) fix vector comparisons with evaluatable results to return vector true/false instead of i1 true/false (PR2529) llvm-svn: 53220	2008-07-08 05:46:34 +00:00
Dan Gohman	6057cf766c	Refactor the tablegen DAGISelEmitter code for outputing calls to getTargetNode and SelectNodeTo to reduce duplication, and to make some of the getTargetNode code available to SelectNodeTo. Use SelectNodeTo instead of getTargetNode in several new interesting cases, as it mutates nodes in place instead of creating new ones. This triggers some scheduling behavior differences due to nodes being presented to the scheduler in a different order. Some of the arbitrary scheduling decisions it makes are now arbitrarily made differently. This is visible in CodeGen/PowerPC/LargeAbsoluteAddr.ll, where a trivial scheduling difference led to a trivial register allocation difference. llvm-svn: 53203	2008-07-07 21:00:17 +00:00
Evan Cheng	cf3a4ad46d	Fix two serious LSR bugs. 1. LSR runOnLoop is always returning false regardless if any transformation is made. 2. AddUsersIfInteresting can create new instructions that are added to DeadInsts. But there is a later early exit which prevents them from being freed. llvm-svn: 53193	2008-07-07 19:51:32 +00:00
Anton Korobeynikov	d1b5a2bf91	Testcase for PR2463 llvm-svn: 53157	2008-07-05 23:33:40 +00:00
Anton Korobeynikov	69c88b40ed	Testcase for PR2146 llvm-svn: 53155	2008-07-05 23:03:46 +00:00
Nick Lewycky	94f9c5a42e	Fix missed optimization opportunity when analyzing cast of mul and select. llvm-svn: 53151	2008-07-05 21:19:34 +00:00
Owen Anderson	1acfb69ad7	Remove the ability for ADCE to remove unreachable blocks in loop nests, because, as Eli pointed out, SimplifyCFG already does this. llvm-svn: 53104	2008-07-03 17:21:41 +00:00
Owen Anderson	e20158affb	Add support to ADCE for pruning unreachable blocks. This addresses the final part of PR2509. llvm-svn: 53038	2008-07-02 18:05:19 +00:00
Owen Anderson	5747d627e0	A better fix for PR2503 that doesn't pessimize GVN in the presence of unreachable blocks. llvm-svn: 53032	2008-07-02 17:20:16 +00:00
Dale Johannesen	51edab312c	Considering predecessors of exit blocks gets us a little more tail merging. llvm-svn: 52986	2008-07-01 21:50:49 +00:00
Chris Lattner	95fecdd63a	Implement split and scalarize for SELECT_CC, fixing PR2504 llvm-svn: 52887	2008-06-30 02:43:01 +00:00
Duncan Sands	307bc51955	Regression test for PR2443. llvm-svn: 52826	2008-06-27 14:22:20 +00:00
Duncan Sands	c4678a026a	Use the c modifier to tell llvm-ar not to issue a warning when creating the archive (the warning causes the test to fail). llvm-svn: 52824	2008-06-27 10:52:12 +00:00
Chris Lattner	153b6695b8	test doesn't need eh info llvm-svn: 52811	2008-06-27 03:14:20 +00:00
Chris Lattner	f40ef5f964	when linking globals, make sure to preserve the address space of the global. llvm-svn: 52810	2008-06-27 03:10:24 +00:00
Evan Cheng	407d3b820b	XFAIL for now. llvm-svn: 52795	2008-06-26 22:09:29 +00:00
Owen Anderson	a9fd2b7e53	Use the -enable-pre flag so this test doesn't fail. llvm-svn: 52784	2008-06-26 17:03:28 +00:00
Matthijs Kooijman	b1217bdbb0	Make LLVM compile on DragonFly BSD (PR2499). Patch by Hasso Tepper! llvm-svn: 52781	2008-06-26 10:36:58 +00:00
Dale Johannesen	76f5dc0cc4	Allow for rounding up of stack frame. llvm-svn: 52751	2008-06-26 01:55:32 +00:00
Chris Lattner	2b67ff8632	when we know the signbit of an input to uint_to_fp is zero, change it to sint_to_fp on targets where that is cheaper (and visaversa of course). This allows us to compile uint_to_fp to: _test: movl 4(%esp), %eax shrl $23, %eax cvtsi2ss %eax, %xmm0 movl 8(%esp), %eax movss %xmm0, (%eax) ret instead of: .align 3 LCPI1_0: ## double .long 0 ## double least significant word 4.5036e+15 .long 1127219200 ## double most significant word 4.5036e+15 .text .align 4,0x90 .globl _test _test: subl $12, %esp movl 16(%esp), %eax shrl $23, %eax movl %eax, (%esp) movl $1127219200, 4(%esp) movsd (%esp), %xmm0 subsd LCPI1_0, %xmm0 cvtsd2ss %xmm0, %xmm0 movl 20(%esp), %eax movss %xmm0, (%eax) addl $12, %esp ret llvm-svn: 52747	2008-06-26 00:16:49 +00:00
Evan Cheng	71fbfe73c1	- Fix a x86 vector isel bug: illegal transformation of a vector_shuffle into a shift. - Add a readme entry for a missing vector_shuffle optimization that results in awful codegen. llvm-svn: 52740	2008-06-25 20:52:59 +00:00
Chris Lattner	36049c026a	simplify shell syntax to work better on solaris, patch by Nathan Keynes! llvm-svn: 52721	2008-06-25 16:03:42 +00:00
Mon P Wang	7d89d61387	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Chris Lattner	73b52018e9	Fix PR2488, a case where we deleted stack restores too aggressively. llvm-svn: 52702	2008-06-25 05:59:28 +00:00
Evan Cheng	bab5925a0b	Enable two-address remat by default. llvm-svn: 52701	2008-06-25 01:16:38 +00:00
Dale Johannesen	244433ebb1	v2f32 is now a valid (MMX) type which breaks this test (doesn't work for any MMX vector types, it's not me). Rewritten to use v2i16 which is generic and going to stay that way; I think that preserves the point of the test. llvm-svn: 52692	2008-06-24 22:03:36 +00:00
Dan Gohman	b9384c5e87	Revert 52645, the loop unroller changes. It caused a regression in 252.eon. llvm-svn: 52688	2008-06-24 20:44:42 +00:00
Matthijs Kooijman	ff03ea8aeb	Commit the new DeadArgElim pass again, this time with the gcc bootstrap failures fixed. Also add a testcase to reproduce the gcc bootstrap failure in very much reduced form. llvm-svn: 52677	2008-06-24 16:30:26 +00:00
Evan Cheng	a62f5f0f82	If it's determined safe, remat MOV32r0 (i.e. xor r, r) and others as it is instead of using the longer MOV32ri instruction. llvm-svn: 52670	2008-06-24 07:10:51 +00:00
Bill Wendling	2501066409	This situation can occur: ,------. \| \| \| v \| t2 = phi ... t1 ... \| \| \| v \| t1 = ... \| ... = ... t1 ... \| \| `------' where there is a use in a PHI node that's a predecessor to the defining block. We don't want to mark all predecessors as having the value "alive" in this case. Also, the assert was too restrictive and didn't handle this case. llvm-svn: 52655	2008-06-23 23:41:14 +00:00
Dan Gohman	7f6ee1cd4b	Revamp the loop unroller, extending it to correctly update PHI nodes in the presence of out-of-loop users of in-loop values and the trip count is not a known multiple of the unroll count, and to be a bit simpler overall. This fixes PR2253. llvm-svn: 52645	2008-06-23 21:29:41 +00:00
Bill Wendling	d6b7d457cf	Make test work on non-x86 machines (like my G4 PPC). llvm-svn: 52619	2008-06-23 06:16:31 +00:00
Dan Gohman	62d8bc0480	Improve LSR's dead-phi detection to handle use-def cycles with more than two nodes. llvm-svn: 52617	2008-06-22 20:44:02 +00:00
Chris Lattner	d80c865a09	Fix PR2369 by making scalarrepl more careful about promoting structures. Its default threshold is to promote things that are smaller than 128 bytes, which is sane. However, it is not sane to do this for things that turn into 128 registers. Add a cap on the number of registers introduced, defaulting to 128/4=32. llvm-svn: 52611	2008-06-22 17:46:21 +00:00
Eli Friedman	369401ef95	Fix for PR2479: correctly optimize expressions like (a > 13) & (a == 15). See also PR1800, which is about the signed case. llvm-svn: 52608	2008-06-21 23:36:13 +00:00
Duncan Sands	dd3b6236c8	This file is empty. llvm-svn: 52596	2008-06-21 20:26:50 +00:00
Duncan Sands	1d9305bfb8	Turn off llvm-gcc warnings when running "make check". llvm-svn: 52595	2008-06-21 20:22:58 +00:00
Duncan Sands	1dd6ef8f8e	Support for load/store of expanded float types. I don't know if a truncating store is possible here, but added support for it anyway. llvm-svn: 52577	2008-06-21 17:00:47 +00:00
Evan Cheng	1d07cd32c2	Undo spill weight tweak. Need to investigate the performance regressions. llvm-svn: 52572	2008-06-21 06:45:54 +00:00
Evan Cheng	b65bceda9c	Back out Matthijs' DAE patches. It's miscompiling gcc driver. llvm-svn: 52570	2008-06-21 00:31:44 +00:00
Matthijs Kooijman	564fe9092f	Add testcase that checks that DeadArgElim doesn't touch stuff it shouldn't touch. llvm-svn: 52540	2008-06-20 15:38:22 +00:00
Matthijs Kooijman	a3222e3730	Recommit r52459, rewriting of the dead argument elimination pass. This is a fixed version that no longer uses multimap::equal_range, which resulted in a pointer invalidation problem. Also, DAE::InspectedFunctions was not really necessary, so it got removed. Lastly, this version no longer applies the extra arg hack on functions who did not have any arguments to start with. llvm-svn: 52532	2008-06-20 09:36:16 +00:00
Chris Lattner	0177b31bde	Fix a warning, closing PR2452 llvm-svn: 52529	2008-06-20 05:33:29 +00:00
Chris Lattner	873cf9817f	Fix a warning. llvm-svn: 52528	2008-06-20 05:31:04 +00:00
Chris Lattner	9ab7735924	Fix an error handling redefinition of linkonce functions where the types differ. Patch by Nathan Keynes! llvm-svn: 52527	2008-06-20 05:29:39 +00:00
Chris Lattner	0daba8a204	fix a warning. llvm-svn: 52526	2008-06-20 05:28:56 +00:00
Chris Lattner	e588f546c5	Fix PR2471, which is a bug involving an invalid promotion from a conditional load. llvm-svn: 52525	2008-06-20 05:12:56 +00:00
Evan Cheng	4006f4cdf0	ISD::UNDEF should be expanded recursively / iteratively. llvm-svn: 52508	2008-06-19 22:01:11 +00:00
Matthijs Kooijman	343ce1868f	Modify some ipconstprop tests to also test with invokes. llvm-svn: 52491	2008-06-19 09:27:44 +00:00
Eli Friedman	570aa6f801	Fix a bug with <8 x i16> shuffle lowering on X86 where parts of the shuffle could be skipped. The check is invalid because the loop index i doesn't correspond to the element actually inserted. The correct check is already done a few lines earlier, for whether the element is already in the right spot, so this shouldn't have any effect on the codegen for code that was already correct. llvm-svn: 52486	2008-06-19 06:09:51 +00:00
Evan Cheng	919b735586	New test case. llvm-svn: 52483	2008-06-19 01:50:24 +00:00
Evan Cheng	ee801276b3	This also got better (55 - 51 instructions). But doing one more re-materialization. llvm-svn: 52482	2008-06-19 01:50:13 +00:00
Evan Cheng	56e17b525c	This got better. llvm-svn: 52481	2008-06-19 01:46:43 +00:00
Owen Anderson	597b40ed60	Remove this test until the corresponding patch is reapplied because it's causing make check to crash for some people. llvm-svn: 52473	2008-06-18 22:37:31 +00:00
Owen Anderson	3f78e260c1	Add local PRE to GVN. This only operates in cases where it would not increase code size, namely when the instantiated expression would only need to be created in one predecessor. llvm-svn: 52471	2008-06-18 21:41:49 +00:00
Matthijs Kooijman	b7e2818227	Rewrite the DeadArgumentElimination pass, to use a more explicit tracking of dependencies between return values and/or arguments. Also make the handling of arguments and return values the same. The pass now looks properly inside returned structs, but only at the first level (ie, not inside nested structs). Also add a testcase for testing various variations of (multiple) dead rerturn values. llvm-svn: 52459	2008-06-18 11:12:53 +00:00
Matthijs Kooijman	8177832230	Reapply r52397 (make IPConstProp promote returned arguments), but fixed this time. Sorry for the trouble! This time, also add a testcase, which I should have done in the first place... llvm-svn: 52455	2008-06-18 08:30:37 +00:00
Matthijs Kooijman	bb59138fa6	Reapply r52396, it was unrelated to the breakage (that was caused by r52397, my commit after this). llvm-svn: 52453	2008-06-18 08:09:27 +00:00
Chris Lattner	93da79f7a1	implement some simple bswap optimizations, rdar://5992453 llvm-svn: 52442	2008-06-18 04:33:20 +00:00
Chris Lattner	2a22e66e47	temporarily revert this testcase since its patch was reverted. llvm-svn: 52441	2008-06-18 04:03:23 +00:00
Chris Lattner	7e403da191	make truncate/sext elimination capable of changing phi's. This implements rdar://6013816 and the testcase in Transforms/InstCombine/sext-misc.ll. llvm-svn: 52440	2008-06-18 04:00:49 +00:00
Devang Patel	8f157a3670	Preserve dominance frontier while trivially unswitching loop. llvm-svn: 52438	2008-06-18 02:16:38 +00:00
Matthijs Kooijman	f0adaf34a1	Learn IPConstProp to look at individual return values and propagate them individually. Also learn IPConstProp how returning first class aggregates work, in addition to old style multiple return instructions. Modify the return-constants testscase to confirm this behaviour. llvm-svn: 52396	2008-06-17 12:02:52 +00:00
Evan Cheng	8cfd1d39a1	Do not issue identity copies. llvm-svn: 52373	2008-06-16 22:52:53 +00:00
Dan Gohman	c1fd5f170b	Refine the change in r52258 for avoiding use-before-def conditions when changing the stride of a comparison so that it's slightly more precise, by having it scan the instruction list to determine if there is a use of the condition after the point where the condition will be inserted. llvm-svn: 52371	2008-06-16 22:34:15 +00:00
Evan Cheng	d27948e716	- Add "Commutative" property to intrinsics. This allows tblgen to generate the commuted variants for dagisel matching code. - Mark lots of X86 intrinsics as "Commutative" to allow load folding. llvm-svn: 52353	2008-06-16 20:29:38 +00:00
Matthijs Kooijman	bdd5cae51c	Make testcase check for extractvalue instead of extractelement. llvm-svn: 52317	2008-06-16 13:03:44 +00:00
Matthijs Kooijman	e01197eaa9	Store the result of multiple identical run lines in a temporary file. llvm-svn: 52314	2008-06-16 12:21:25 +00:00
Matthijs Kooijman	f6b1a51a94	Fix PR numbers, I accidentally switched two digits. llvm-svn: 52311	2008-06-16 09:38:23 +00:00
Chris Lattner	e987a3bdd1	If we are checking to see if the result of a call aliases a pointer derived from a local allocation, if the local allocation never escapes, the pointers can't alias. This implements PR2436 llvm-svn: 52301	2008-06-16 06:19:11 +00:00

1 2 3 4 5 ...

5628 Commits