llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 13:02:52 +02:00

Author	SHA1	Message	Date
Chris Lattner	c3905a79e4	fix bogus test llvm-svn: 93069	2010-01-09 19:24:49 +00:00
Chris Lattner	2abed51288	fix bogus test llvm-svn: 93068	2010-01-09 19:24:18 +00:00
Jeffrey Yasskin	53a8f3981c	Fix http://llvm.org/PR5729 : x86-64 tail calls were putting their targets into R11, and then asserting that the target was in R9. Since R9 isn't reserved for the target anymore, and is used as an argument, this patch changes the assertion. llvm-svn: 93065	2010-01-09 18:56:43 +00:00
Dan Gohman	771144e807	Use WriteAsOperand instead of getName() to print loop header names, so that unnamed blocks are handled. llvm-svn: 93059	2010-01-09 18:17:45 +00:00
Chris Lattner	b142e13f84	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Dan Gohman	3708af1c59	Revert an earlier change to SIGN_EXTEND_INREG for vectors. The VTSDNode really does need to be a vector type, because TargetLowering::getOperationAction for SIGN_EXTEND_INREG uses that type, and it needs to be able to distinguish between vectors and scalars. Also, fix some more issues with legalization of vector casts. llvm-svn: 93043	2010-01-09 02:13:55 +00:00
Evan Cheng	2e497d1ed4	Fix a critical bug in 64-bit atomic operation lowering for 32-bit. The results of the cmpxchg8b instructions are being thrown away when it branches back to the top of the checking loop. This means the loop always compares against the old value and this can result in a dead lock. llvm-svn: 93028	2010-01-08 23:41:50 +00:00
Chris Lattner	05ae88cc8f	teach instcombine to delete sign extending shift pairs (sra(shl X, C), C) when the input is already sign extended. llvm-svn: 93019	2010-01-08 19:04:21 +00:00
Chris Lattner	0dc48180de	fix PR5978 by peeling the loop so that we avoid shifting the result int by 8 for the first byte. While normally harmless, if the result is smaller than a byte, this shift is invalid. llvm-svn: 93018	2010-01-08 19:02:23 +00:00
Evan Cheng	f96a9ec02b	ReplaceAllUsesOfValueWith may delete other nodes that the one being replaced. Do not delete dead nodes again. llvm-svn: 92988	2010-01-08 02:36:12 +00:00
Chris Lattner	944f9c4ac1	teach ComputeNumSignBits to look through PHI nodes. llvm-svn: 92964	2010-01-07 23:44:37 +00:00
Chris Lattner	b8ec2bccaf	filecheckize llvm-svn: 92963	2010-01-07 23:42:23 +00:00
Chris Lattner	db8fa82914	Enhance instcombine to reason more strongly about promoting computation that feeds into a zext, similar to the patch I did yesterday for sext. There is a lot of room for extension beyond this patch. llvm-svn: 92962	2010-01-07 23:41:00 +00:00
Chris Lattner	e0199dff81	Fix rdar://7517201, a regression introduced by r92849. When folding a and(any_ext(load)) both the any_ext and the load have to have only a single use. This removes the anyext-uses.ll testcase which started failing because it is unreduced and unclear what it is testing. llvm-svn: 92950	2010-01-07 21:59:23 +00:00
Evan Cheng	4523041394	APInt'fy TargetLowering::SimplifySetCC to fix PR5963. llvm-svn: 92943	2010-01-07 20:58:44 +00:00
Devang Patel	10ee76f7f4	Use separate namespace for named metadata. llvm-svn: 92931	2010-01-07 19:39:36 +00:00
Chris Lattner	0b0caf0877	fix a globalopt crash on 'bullet' (handling evaluation of a store to an element of a vector in a static ctor) which occurs with an unrelated patch I'm testing. Annoyingly, EvaluateStoreInto basically does exactly the same stuff as InsertElement constant folding, but it now handles vectors, and you can't insertelement into a vector. It would be 'really nice' if GEP into a vector were not legal. llvm-svn: 92889	2010-01-07 01:16:21 +00:00
Evan Cheng	51d86260ff	Fix a minor regression from my dag combiner changes. One more place which needs to look pass truncates. llvm-svn: 92885	2010-01-07 00:54:06 +00:00
Jakob Stoklund Olesen	09012552b8	Add comments. llvm-svn: 92883	2010-01-07 00:51:04 +00:00
Jakob Stoklund Olesen	a63aa4e54b	Add Target hook to duplicate machine instructions. Some instructions refer to unique labels, and so cannot be trivially cloned with CloneMachineInstr. llvm-svn: 92873	2010-01-06 23:47:07 +00:00
Evan Cheng	25dcf9b830	Teach dag combine to fold the following transformation more aggressively: (OP (trunc x), (trunc y)) -> (trunc (OP x, y)) Unfortunately this simple change causes dag combine to infinite looping. The problem is the shrink demanded ops optimization tend to canonicalize expressions in the opposite manner. That is badness. This patch disable those optimizations in dag combine but instead it is done as a late pass in sdisel. This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look pass ISD::TRUNCATE in various places. llvm-svn: 92849	2010-01-06 19:38:29 +00:00
Duncan Sands	de0adbdf25	Fix a README item: have functionattrs look through selects and phi nodes when deciding which pointers point to local memory. I actually checked long ago how useful this is, and it isn't very: it hardly ever fires in the testsuite, but since Chris wants it here it is! llvm-svn: 92836	2010-01-06 15:37:47 +00:00
Duncan Sands	4ef1119d94	Partially address a README by having functionattrs consider calls to memcpy, memset and other intrinsics that only access their arguments to be readnone if the intrinsic's arguments all point to local memory. This improves the testcase in the README to readonly, but it could in theory be made readnone, however this would involve more sophisticated analysis that looks through the memcpy. llvm-svn: 92829	2010-01-06 08:45:52 +00:00
Duncan Sands	9ce94f877c	This is testing a darwin specific feature, so only turn it on for darwin (it fails on linux). llvm-svn: 92826	2010-01-06 05:49:26 +00:00
Chris Lattner	0b73344d8a	Teach instcombine's sext elimination logic to be more aggressive. Previously, instcombine would only promote an expression tree to the larger type if doing so eliminated two casts. This is because a need to manually do the sign extend after the promoted expression tree with two shifts. Now, we keep track of whether the result of the computation is going to be properly sign extended already. If so, we can unconditionally promote the expression, which allows us to zap more sext's. This implements rdar://6598839 (aka gcc pr38751) llvm-svn: 92815	2010-01-06 01:56:21 +00:00
Dan Gohman	93a28a6ce9	Move this test from test/Transforms/IndVarSimplify to test/CodeGen/X86, as doesn't use -indvars, and it does use llc -march=x86-64. llvm-svn: 92799	2010-01-05 22:52:54 +00:00
Bill Wendling	7e9607ab56	Don't assign the shift the same type as the variable being shifted. This could result in illegal types for the SHL operator. llvm-svn: 92797	2010-01-05 22:39:10 +00:00
Victor Hernandez	0e7561092b	Re-add parsing of function-local metadata; this time with testcase. llvm-svn: 92793	2010-01-05 22:22:14 +00:00
Chris Lattner	53b9ed70ee	more rearrangement and cleanup, fix my test failure. llvm-svn: 92792	2010-01-05 22:21:18 +00:00
Chris Lattner	2f69f6a822	remove two trunc xforms that are subsumed by EvaluateInDifferentType. The only difference is that EvaluateInDifferentType checks to ensure they are profitable before doing them :) llvm-svn: 92788	2010-01-05 22:01:41 +00:00
Chris Lattner	96e30cb44f	merge some tests. llvm-svn: 92786	2010-01-05 21:54:09 +00:00
Chris Lattner	fd23a9b6dd	merge cast2 into cast.ll llvm-svn: 92784	2010-01-05 21:48:13 +00:00
Devang Patel	311b5584e5	Allow null to be an element of NamedMDNode. e.g. !llvm.stuff = !{!0, !1 , null} llvm-svn: 92783	2010-01-05 21:47:32 +00:00
Chris Lattner	24e500eb45	remove useless test. llvm-svn: 92782	2010-01-05 21:46:22 +00:00
Chris Lattner	61de3ae41a	another example. llvm-svn: 92781	2010-01-05 21:43:08 +00:00
Chris Lattner	65d5ec781a	remove a useless negative test, add a rdar # to an xfail that I'm working on. llvm-svn: 92777	2010-01-05 21:37:44 +00:00
Chris Lattner	3e7dbaf22d	clean up tests. llvm-svn: 92776	2010-01-05 21:32:59 +00:00
Chris Lattner	13293b9738	just remove this xform which is subsumed by others. llvm-svn: 92775	2010-01-05 21:16:30 +00:00
David Greene	19324a5b81	Add an !eq() operator to TableGen. It operates on strings only. Use !cast<string>() to compare other types of objects. llvm-svn: 92754	2010-01-05 19:11:42 +00:00
Chris Lattner	f457542506	optimize comparisons against cttz/ctlz/ctpop, patch by Alastair Lynn! llvm-svn: 92745	2010-01-05 18:09:56 +00:00
Dan Gohman	5fa04f2707	Delete useless trailing semicolons. llvm-svn: 92740	2010-01-05 17:55:26 +00:00
Devang Patel	eb75073664	If a scope has only one instruction then first instruction is also the last instruction. llvm-svn: 92736	2010-01-05 16:59:17 +00:00
Chris Lattner	491e03b6ef	optimize cttz and ctlz when we can prove something about the leading/trailing bits. Patch by Alastair Lynn! llvm-svn: 92706	2010-01-05 07:23:56 +00:00
Chris Lattner	2ef4ba7cf5	fix an infinite loop in reassociate building emacs. llvm-svn: 92679	2010-01-05 04:55:35 +00:00
Devang Patel	3b08c33f33	Remove dead debug info intrinsics. Intrinsic::dbg_stoppoint Intrinsic::dbg_region_start Intrinsic::dbg_region_end Intrinsic::dbg_func_start AutoUpgrade simply ignores these intrinsics now. llvm-svn: 92557	2010-01-05 01:10:40 +00:00
Devang Patel	6915c52d56	Fix debug_inlined section entries for routines whose names are changed through __asm() extension. llvm-svn: 92533	2010-01-04 23:04:36 +00:00
Dan Gohman	73b0882c6e	Make this test more portable. llvm-svn: 92514	2010-01-04 21:23:34 +00:00
Devang Patel	e04ac70892	Remove oversimplified test case. llvm-svn: 92510	2010-01-04 20:54:06 +00:00
Dan Gohman	b71bc40eed	Add some tests and update an existing test to reflect recent x86 isel peeps. llvm-svn: 92509	2010-01-04 20:53:54 +00:00
Devang Patel	46d7029a91	The test, derived from optimzed IR, does not mention "bar" in debug info anywhere so the dwarf writer is not expected to emit any debug info for function "bar". llvm-svn: 92499	2010-01-04 19:41:13 +00:00
Chris Lattner	3b060b2d41	Truncate GEP indexes larger than the pointer size down to pointer size when doing this transform if the GEP is not inbounds. No testcase because it is very difficult to trigger this: instcombine already canonicalizes GEP indices to pointer size, so it relies specific permutations of the instcombine worklist. Thanks to Duncan for pointing this possible problem out. llvm-svn: 92495	2010-01-04 18:57:15 +00:00
Anton Korobeynikov	3915cf5ef4	Fix invalid chain folding for memory variant of sdiv / udiv llvm-svn: 92472	2010-01-04 10:31:54 +00:00
Chris Lattner	ce3f5f3448	implement an instcombine xform needed by clang's codegen on the example in PR4216. This doesn't trigger in the testsuite, so I'd really appreciate someone scrutinizing the logic for correctness. llvm-svn: 92458	2010-01-04 06:03:59 +00:00
Chris Lattner	8e83066d12	fix PR5930, allowing the asmprinter to emit difference between two labels as a truncate. llvm-svn: 92455	2010-01-03 18:33:18 +00:00
Chris Lattner	49cda26f7e	add PR# llvm-svn: 92451	2010-01-03 18:10:58 +00:00
Chris Lattner	7246a69d2b	differences between two blockaddress's don't cause a global variable initializer to require relocations. llvm-svn: 92450	2010-01-03 18:09:40 +00:00
Chris Lattner	647c629ee4	generalize the previous transformation to handle indexing into arrays of structs and other arrays, so long as all the subsequent indexes are constants. This triggers frequently for stuff like: @divisions = internal constant [29 x [2 x i32]] [[2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 1], [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 2], [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] zeroinitializer, [2 x i32] [i32 0, i32 2], [2 x i32] [i32 0, i32 1], [2 x i32] zeroinitializer, [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 0], [2 x i32] [i32 1, i32 1], [2 x i32] [i32 1, i32 2], [2 x i32] [i32 1, i32 2]], align 32 ; <[29 x [2 x i32]]> [#uses=50] %623 = getelementptr inbounds [29 x [2 x i32]] @divisions, i64 0, i64 %619, i64 0 ; <i32*> [#uses=1] %684 = icmp eq i32 %683, 999 also for the "my_defs" table in 'gs', etc. llvm-svn: 92444	2010-01-03 03:03:27 +00:00
Chris Lattner	acb0c133ec	teach instcombine to optimize idioms like A[i]&42 == 0. This occurs in 403.gcc in mode_mask_array, in safe-ctype.c (which is copied in multiple apps) in _sch_istable, etc. llvm-svn: 92427	2010-01-02 22:08:28 +00:00
Chris Lattner	4af67af013	Teach the table lookup optimization to generate range compares when a consequtive sequence of elements all satisfies the predicate. Like the double compare case, this generates better code than the magic constant case and generalizes to more than 32/64 element array lookups. Here are some examples where it triggers. From 403.gcc, most accesses to the rtx_class array are handled, e.g.: @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=547] %142 = icmp eq i8 %141, 105 @rtx_class = constant [153 x i8] c"xxxxxmmmmmmmmxxxxxxxxxxxxmxxxxxxiiixxxxxxxxxxxxxxxxxxxooxooooooxxoooooox3x2c21c2222ccc122222ccccaaaaaa<<<<<<<<<<<<<<<<<<111111111111bbooxxxxxxxxxxcc2211x", align 32 ; <[153 x i8]> [#uses=543] %165 = icmp eq i8 %164, 60 Also, most of the 59-element arrays (mode_class/rid_to_yy, etc) optimized before are actually range compares. This lets 32-bit machines optimize them. 400.perlbmk has stuff like this: 400.perlbmk: PL_regkind, even for 32-bit: @PL_regkind = constant [62 x i8] c"\00\00\02\02\02\06\06\06\06\09\09\0B\0B\0D\0E\0E\0E\11\12\12\14\14\16\16\18\18\1A\1A\1C\1C\1E\1F !!!$$&'((((,-.///88886789:;8$", align 32 ; <[62 x i8]> [#uses=4] %811 = icmp ne i8 %810, 33 @PL_utf8skip = constant [256 x i8] c"\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\01\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\02\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\03\04\04\04\04\04\04\04\04\05\05\05\05\06\06\07\0D", align 32 ; <[256 x i8]> [#uses=94] %12 = icmp ult i8 %10, 2 etc. llvm-svn: 92426	2010-01-02 21:50:18 +00:00
Nick Lewycky	cda0109ec5	Fix logic error in previous commit. The != case needs to become an or, not an and. llvm-svn: 92419	2010-01-02 16:14:56 +00:00
Nick Lewycky	3cc8fe073a	Optimize pointer comparison into the typesafe form, now that the backends will handle them efficiently. This is the opposite direction of the transformation we used to have here. llvm-svn: 92418	2010-01-02 15:25:44 +00:00
Chris Lattner	e1a2489017	Generalize the previous xform to handle cases where exactly two elements match or don't match with two comparisons. For example, the testcase compiles into: define i1 @test5(i32 %X) { %1 = icmp eq i32 %X, 2 ; <i1> [#uses=1] %2 = icmp eq i32 %X, 7 ; <i1> [#uses=1] %R = or i1 %1, %2 ; <i1> [#uses=1] ret i1 %R } This generalizes the previous xforms when the array is larger than 64 elements (and this case matches) and generates better code for cases where it overlaps with the magic bitshift case. This generalizes more cases than you might expect. For example, 400.perlbmk has: @PL_utf8skip = constant [256 x i8] c"\01\01\01\... %15 = icmp ult i8 %7, 7 403.gcc has: @rid_to_yy = internal constant [114 x i16] [i16 259, i16 260, ... %18 = icmp eq i16 %16, 295 and xalancbmk has a bunch of examples, such as _ZN11xercesc_2_5L15gCombiningCharsE and _ZN11xercesc_2_5L10gBaseCharsE. llvm-svn: 92417	2010-01-02 09:35:17 +00:00
Chris Lattner	1cdc77b8da	enhance the compare/load/index optimization to work on any load from a global with 32/64 elements or less (depending on whether i64 is native on the target), generating a bitshift idiom to determine the result. For example, on test4 we produce: define i1 @test4(i32 %X) { %1 = lshr i32 933, %X ; <i32> [#uses=1] %2 = and i32 %1, 1 ; <i32> [#uses=1] %R = icmp ne i32 %2, 0 ; <i1> [#uses=1] ret i1 %R } This triggers in a number of interesting cases, for example, here's an fp case: @A.3255 = internal constant [4 x double] [double 4.100000e+00, double -3.900000e+00, double -1.000000e+00, double 1.000000e+00], align 32 ; <[4 x double]> [#uses=7] ... %7 = fcmp olt double %3, 0.000000e+00 In this case we make the slen2_tab global dead, which is nice: @slen2_tab = internal constant [16 x i32] [i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 1, i32 2, i32 3, i32 2, i32 3], align 32 ; <[16 x i32]> [#uses=1] ... %204 = icmp eq i32 %46, 0 Perl has a bunch of these, also on the 'Perl_regkind' array: @Perl_yygindex = internal constant [51 x i16] [i16 0, i16 0, i16 0, i16 0, i16 374, i16 351, i16 0, i16 -12, i16 0, i16 946, i16 413, i16 -83, i16 0, i16 0, i16 0, i16 -311, i16 -13, i16 4007, i16 2893, i16 0, i16 0, i16 0, i16 0, i16 0, i16 372, i16 -8, i16 0, i16 0, i16 246, i16 -131, i16 43, i16 86, i16 208, i16 -45, i16 -169, i16 987, i16 0, i16 0, i16 0, i16 0, i16 308, i16 0, i16 -271, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0, i16 0], align 32 ; <[51 x i16]> [#uses=1] ... %1364 = icmp eq i16 %1361, 0 186.crafty really likes this on 64-bit machines, because it triggers on a bunch of globals like this: @white_outpost = internal constant [64 x i8] c"\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\00\02\02\00\00\00\00\00\04\05\05\04\00\00\00\00\03\06\06\03\00\00\00\00\00\01\01\00\00\00\00\00\00\00\00\00\00\00", align 32 ; <[64 x i8]> [#uses=2] However the big winner is 403.gcc, which triggers hundreds of times, eliminating all the accesses to the 57-element arrays 'mode_class', mode_unit_size, mode_bitsize, regclass_map, etc. go 64-bit machines :) llvm-svn: 92415	2010-01-02 08:56:52 +00:00
Chris Lattner	59136ba5ad	enhance the previous optimization to work with fcmp in addition to icmp. llvm-svn: 92412	2010-01-02 08:20:51 +00:00
Chris Lattner	f3f6c10218	Teach instcombine to fold compares of loads from constant arrays with variable indices into a comparison of the index with a constant. The most common occurrence of this that I see by far is stuff like: if ("foobar"[i] == '\0') ... which we compile into: if (i == 6), saving a load and materialization of the global address. This also exposes loop trip count information to later passes in many cases. This triggers hundreds of times in xalancbmk, which is where I first noticed it, but it also triggers in many other apps. Here are a few interesting ones from various apps: @must_be_connected_without = internal constant [8 x i8] [i8 getelementptr inbounds ([3 x i8]* @.str64320, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str27283, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str71327, i64 0, i64 0), i8* getelementptr inbounds ([4 x i8]* @.str72328, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str18274, i64 0, i64 0), i8* getelementptr inbounds ([6 x i8]* @.str11267, i64 0, i64 0), i8* getelementptr inbounds ([3 x i8]* @.str32288, i64 0, i64 0), i8* null], align 32 ; <[8 x i8]> [#uses=2] %scevgep.i = getelementptr [8 x i8] @must_be_connected_without, i64 0, i64 %indvar.i ; <i8*> [#uses=1] %17 = load ... %18 = icmp eq i8 %17, null ; <i1> [#uses=1] -> icmp eq i64 %indvar.i, 7 @yytable1095 = internal constant [84 x i8] c"\12\01(\05\06\07\08\09\0A\0B\0C\0D\0E1\0F\10\11266\1D: \10\11,-,0\03'\10\11B6\04\17&\18\1945\05\06\07\08\09\0A\0B\0C\0D\0E\1E\0F\10\11\1A\1B\1C$3+>#%;<IJ=ADFEGH9KL\00\00\00C", align 32 ; <[84 x i8]> [#uses=2] %57 = getelementptr inbounds [84 x i8]* @yytable1095, i64 0, i64 %56 ; <i8> [#uses=1] %mode.0.in = getelementptr inbounds [9 x i32] @mb_mode_table, i64 0, i64 %.pn ; <i32> [#uses=1] load ... %64 = icmp eq i8 %58, 4 ; <i1> [#uses=1] -> icmp eq i64 %.pn, 35 ; <i1> [#uses=0] @gsm_DLB = internal constant [4 x i16] [i16 6554, i16 16384, i16 26214, i16 32767] %scevgep.i = getelementptr [4 x i16] @gsm_DLB, i64 0, i64 %indvar.i ; <i16*> [#uses=1] %425 = load %scevgep.i %426 = icmp eq i16 %425, -32768 ; <i1> [#uses=0] -> false llvm-svn: 92411	2010-01-02 08:12:04 +00:00
Chris Lattner	cf784992da	remove the instcombine transformations that are inserting nasty pointer to int casts that confuse later optimizations. See PR3351 for details. This improves but doesn't complete fix 483.xalancbmk because llvm-gcc does this xform in GCC's "fold" routine as well. Clang++ will do better I guess. llvm-svn: 92408	2010-01-02 00:31:05 +00:00
Chris Lattner	9e64bad0da	allow this to work on linux hosts. llvm-svn: 92407	2010-01-02 00:22:15 +00:00
Chris Lattner	fe8af82cd4	Teach codegen to handle: (X != null) \| (Y != null) --> (X\|Y) != 0 (X == null) & (Y == null) --> (X\|Y) == 0 so that instcombine can stop doing this for pointers. This is part of PR3351, which is a case where instcombine doing this for pointers (inserting ptrtoint) is pessimizing code. llvm-svn: 92406	2010-01-02 00:00:03 +00:00
Chris Lattner	4e49a69ec5	rename file. llvm-svn: 92405	2010-01-01 23:55:04 +00:00
Chris Lattner	ef4fba933d	add a simple instcombine xform, simplify another one to use hasAllZeroIndices() instead of hand rolling a loop. llvm-svn: 92403	2010-01-01 23:09:08 +00:00
Chris Lattner	feb7b1af69	generalize the pointer difference optimization to handle a constantexpr gep on the 'base' side of the expression. This completes comment #4 in PR3351, which comes from 483.xalancbmk. llvm-svn: 92402	2010-01-01 22:42:29 +00:00
Chris Lattner	89b1b63bdf	teach instcombine to optimize pointer difference idioms involving constant expressions. This is a step towards comment #4 in PR3351. llvm-svn: 92401	2010-01-01 22:29:12 +00:00
Chris Lattner	ce7717e168	implement the transform requested in PR5284 llvm-svn: 92398	2010-01-01 18:34:40 +00:00
Chris Lattner	44298d184a	Teach codegen to lower llvm.powi to an efficient (but not optimal) multiply sequence when the power is a constant integer. Before, our codegen for std::pow(.., int) always turned into a libcall, which was really inefficient. This should also make many gfortran programs happier I'd imagine. llvm-svn: 92388	2010-01-01 03:32:16 +00:00
Chris Lattner	3d38dbff2a	Make this more likely to generate a libcall. llvm-svn: 92387	2010-01-01 03:26:51 +00:00
Chris Lattner	e5f5e4b151	add a few trivial instcombines for llvm.powi. llvm-svn: 92383	2010-01-01 01:52:15 +00:00
Chris Lattner	662a872e15	When factoring multiply expressions across adds, factor both positive and negative forms of constants together. This allows us to compile: int foo(int x, int y) { return (x-y) + (x-y) + (x-y); } into: _foo: ## @foo subl %esi, %edi leal (%rdi,%rdi,2), %eax ret instead of (where the 3 and -3 were not factored): _foo: imull $-3, 8(%esp), %ecx imull $3, 4(%esp), %eax addl %ecx, %eax ret this started out as: movl 12(%ebp), %ecx imull $3, 8(%ebp), %eax subl %ecx, %eax subl %ecx, %eax subl %ecx, %eax ret This comes from PR5359. llvm-svn: 92381	2010-01-01 01:13:15 +00:00
Chris Lattner	dd837a069b	test case we alredy get right. llvm-svn: 92380	2010-01-01 00:50:00 +00:00
Chris Lattner	ebe5932016	reuse negates where possible instead of always creating them from scratch. This allows us to optimize test12 into: define i32 @test12(i32 %X) { %factor = mul i32 %X, -3 ; <i32> [#uses=1] %Z = add i32 %factor, 6 ; <i32> [#uses=1] ret i32 %Z } instead of: define i32 @test12(i32 %X) { %Y = sub i32 6, %X ; <i32> [#uses=1] %C = sub i32 %Y, %X ; <i32> [#uses=1] %Z = sub i32 %C, %X ; <i32> [#uses=1] ret i32 %Z } llvm-svn: 92373	2009-12-31 20:34:32 +00:00
Chris Lattner	59379f8bb0	teach reassociate to factor x+x+x -> x*3. While I'm at it, fix RemoveDeadBinaryOp to actually do something. llvm-svn: 92368	2009-12-31 19:24:52 +00:00
Chris Lattner	4870f6a384	simple fix for an incorrect factoring which causes a miscompilation, PR5458. llvm-svn: 92354	2009-12-31 08:33:49 +00:00
Chris Lattner	d9ed31f6d0	merge some more tests in. llvm-svn: 92353	2009-12-31 08:32:22 +00:00
Chris Lattner	67d96ee04c	filecheckize llvm-svn: 92352	2009-12-31 08:29:56 +00:00
Chris Lattner	70788c5e57	add some basic named MD tests. llvm-svn: 92336	2009-12-31 03:00:49 +00:00
Chris Lattner	83d5230121	fix two bogus tests that the asmparser now rejects. llvm-svn: 92303	2009-12-30 05:54:51 +00:00
Chris Lattner	03a91d987f	reimplement insertvalue/extractvalue metadata handling to not blindly accept invalid input. Actually add a testcase. llvm-svn: 92297	2009-12-30 05:14:00 +00:00
Chris Lattner	c42b8ff24e	fix parsing of mdstring values. llvm-svn: 92290	2009-12-30 04:13:37 +00:00
Chris Lattner	c6b925c592	Each instruction is allowed to have multiple different metadata objects on them. Though the entire compiler supports this, the asmparser didn't. llvm-svn: 92270	2009-12-29 21:25:40 +00:00
Chris Lattner	86c74f4783	Do not crash when .ll printing metadata that smells like debug info, but isn't. llvm-svn: 92268	2009-12-29 21:17:33 +00:00
Sanjiv Gupta	543a6716fb	Extern declaration for unordered.f32 libcall was not being emitted. Fixed that. llvm-svn: 92242	2009-12-29 03:24:34 +00:00
Sanjiv Gupta	efad5b2a93	Fixed llc crash for zext (i1 -> i8) loads. llvm-svn: 92201	2009-12-28 04:53:24 +00:00
Dale Johannesen	a62752a06e	Testcase for llvm-gcc checkin 92108. llvm-svn: 92110	2009-12-24 01:10:43 +00:00
Chris Lattner	4e96d36f72	handle equality memcmp of 8 bytes on x86-64 with two unaligned loads and a compare. On other targets we end up with a call to memcmp because we don't want 16 individual byte loads. We should be able to use movups as well, but we're failing to select the generated icmp. llvm-svn: 92107	2009-12-24 01:07:17 +00:00
Chris Lattner	5d3919d5f9	move an optimization for memcmp out of simplifylibcalls and into SDISel. This optimization was causing simplifylibcalls to introduce type-unsafe nastiness. This is the first step, I'll be expanding the memcmp optimizations shortly, covering things that we really really wouldn't want simplifylibcalls to do. llvm-svn: 92098	2009-12-24 00:37:38 +00:00
Daniel Dunbar	3495400c8a	Remove an XFAIL. llvm-svn: 92036	2009-12-23 20:13:44 +00:00
Mikhail Glushenkov	7db8203dd2	Allow (set_option SwitchOption, true). llvm-svn: 91997	2009-12-23 12:49:30 +00:00
Sanjiv Gupta	7872817f59	Reapply 91904. llvm-svn: 91996	2009-12-23 11:19:09 +00:00
Sanjiv Gupta	1cd15ef29f	deleting empty file. llvm-svn: 91994	2009-12-23 10:35:24 +00:00
Sanjiv Gupta	70e1523215	Reverting back 91904. llvm-svn: 91993	2009-12-23 09:46:01 +00:00
Dale Johannesen	b4485fd8a9	Use more sensible type for flags in asms. PR 5570. Patch by Sylve`re Teissier (sorry, ASCII only). llvm-svn: 91988	2009-12-23 07:32:51 +00:00
Eric Christopher	ce677a909d	Update objectsize intrinsic and associated dependencies. Fix lowering code and update testcases. llvm-svn: 91979	2009-12-23 02:51:48 +00:00
Anton Korobeynikov	04878d43e1	Add testcase for PR5703 llvm-svn: 91931	2009-12-22 22:37:23 +00:00
Evan Cheng	7cd6bfe549	Remove target attribute break-sse-dep. Instead, do not fold load into sse partial update instructions unless optimizing for size. llvm-svn: 91910	2009-12-22 17:47:23 +00:00
Sanjiv Gupta	9581b4dc62	While converting one of the operands to a memory operand, we need to check if it is Legal and does not result into a cyclic dep. llvm-svn: 91904	2009-12-22 14:25:37 +00:00
Chris Lattner	de6faded57	specify a triple to use, fixing the test on non-x86-64 hosts. llvm-svn: 91900	2009-12-22 07:01:12 +00:00
Bob Wilson	0dc93264b1	Generalize SROA to allow the first index of a GEP to be non-zero. Add a missing check that an array reference doesn't go past the end of the array, and remove some redundant checks for in-bound array and vector references that are no longer needed. llvm-svn: 91897	2009-12-22 06:57:14 +00:00
Chris Lattner	226e849772	various cleanups, make the disassemble reject lines with too much data on them, for example: addb %al, (%rax) simple-tests.txt:11:5: error: excess data detected in input 0 0 0 0 0 ^ llvm-svn: 91896	2009-12-22 06:56:51 +00:00
Chris Lattner	3b333ebfb8	rewrite the file parser for the disassembler, implementing support for comments. Also, check in a simple testcase for the disassembler, including a test for r91864 llvm-svn: 91894	2009-12-22 06:37:58 +00:00
Chris Lattner	4e30207029	Implement PR5795 by merging duplicated return blocks. This could go further by merging all returns in a function into a single one, but simplifycfg currently likes to duplicate the return (an unfortunate choice!) llvm-svn: 91890	2009-12-22 06:07:30 +00:00
Chris Lattner	3f9fc699e0	convert to filecheck llvm-svn: 91889	2009-12-22 06:04:26 +00:00
David Greene	3bfe36b5d9	Fix a bug in !subst where TableGen would go and resubstitute text it had just substituted. This could cause infinite looping in certain pathological cases. llvm-svn: 91843	2009-12-21 21:21:34 +00:00
Daniel Dunbar	52b8297759	XFAIL these tests on powerpc, under the assumption that no one cares. If you care, feel free to fix. llvm-svn: 91826	2009-12-21 17:31:59 +00:00
Chris Lattner	c54fd1e777	fix PR5837 by having SSAUpdate reuse phi nodes for the 'GetValueInMiddleOfBlock' case, instead of inserting duplicates. A similar fix is almost certainly needed by the machine-level SSAUpdate implementation. llvm-svn: 91820	2009-12-21 07:16:11 +00:00
Chris Lattner	ae95fddf98	add check lines for min/max tests. llvm-svn: 91816	2009-12-21 06:08:50 +00:00
Chris Lattner	013b88ee59	really convert this to filecheck. llvm-svn: 91815	2009-12-21 06:06:10 +00:00
Chris Lattner	c9bfe8679e	give instcombine some helper functions for matching MIN and MAX, and implement some optimizations for MIN(MIN()) and MAX(MAX()) and MIN(MAX()) etc. This substantially improves the code in PR5822 but doesn't kick in much elsewhere. 2 max's were optimized in pairlocalalign and one in smg2000. llvm-svn: 91814	2009-12-21 06:03:05 +00:00
Chris Lattner	d5bdc7876d	filecheckize llvm-svn: 91813	2009-12-21 05:53:13 +00:00
Chris Lattner	f1474e1761	enhance x-(-A) -> x+A to preserve NUW/NSW. Use the presence of NSW/NUW to fold "icmp (x+cst), x" to a constant in cases where it would otherwise be undefined behavior. Surprisingly (to me at least), this triggers hundreds of the times in a few benchmarks: lencode, ldecode, and 466.h264ref seem to really like this. llvm-svn: 91812	2009-12-21 04:04:05 +00:00
Chris Lattner	d34eb29977	Optimize all cases of "icmp (X+Cst), X" to something simpler. This triggers a bunch in lencode, ldecod, spass, 176.gcc, 252.eon, among others. It is also the first part of PR5822 llvm-svn: 91811	2009-12-21 03:19:28 +00:00
Chris Lattner	072f39fd3a	convert to filecheck llvm-svn: 91810	2009-12-21 03:11:05 +00:00
Chris Lattner	07f0e8ec8a	fix an overly conservative caching issue that caused memdep to cache a pointer as being unavailable due to phi trans in the wrong place. This would cause later queries to fail even when they didn't involve phi trans. llvm-svn: 91787	2009-12-19 21:29:22 +00:00
Chris Lattner	4f562e5f14	fix inconsistent use of tabs llvm-svn: 91783	2009-12-19 20:44:43 +00:00
Sanjiv Gupta	14c9f2ed42	Emit direction operand in binary insns that stores in memory. llvm-svn: 91777	2009-12-19 13:52:01 +00:00
Sanjiv Gupta	df6eadc436	Test cases for changes done in 91768. llvm-svn: 91773	2009-12-19 11:38:14 +00:00
Chris Lattner	d9bf69f1a5	fix PR5827 by disabling the phi slicing transformation in a case where instcombine would have to split a critical edge due to a phi node of an invoke. Since instcombine can't change the CFG, it has to bail out from doing the transformation. llvm-svn: 91763	2009-12-19 07:01:15 +00:00
Evan Cheng	bc37151dea	Increase opportunities to optimize (brcond (srl (and c1), c2)). llvm-svn: 91717	2009-12-18 21:31:31 +00:00
Bob Wilson	03b6955c7f	Reapply 91459 with a simple fix for the problem that broke the x86_64-darwin bootstrap. This also replaces the WeakVH references that Chris objected to with normal Value references. llvm-svn: 91711	2009-12-18 20:14:40 +00:00
Mikhail Glushenkov	606e829658	Make 'set_option' work with list options. This works now: (set_option "list_opt", ["val_1", "val_2", "val_3"]) llvm-svn: 91679	2009-12-18 11:27:26 +00:00
Eli Friedman	c8ab298dbd	Optimize icmp of null and select of two constants even if the select has multiple uses. (The construct in question was found in gcc.) llvm-svn: 91675	2009-12-18 08:22:35 +00:00
Evan Cheng	d97d025eba	On recent Intel u-arch's, folding loads into some unary SSE instructions can be non-optimal. To be precise, we should avoid folding loads if the instructions only update part of the destination register, and the non-updated part is not needed. e.g. cvtss2sd, sqrtss. Unfolding the load from these instructions breaks the partial register dependency and it can improve performance. e.g. movss (%rdi), %xmm0 cvtss2sd %xmm0, %xmm0 instead of cvtss2sd (%rdi), %xmm0 An alternative method to break dependency is to clear the register first. e.g. xorps %xmm0, %xmm0 cvtss2sd (%rdi), %xmm0 llvm-svn: 91672	2009-12-18 07:40:29 +00:00
Dan Gohman	d97f165eb2	Tidy up this testcase and add test for tailcall optimization with unreachable. llvm-svn: 91650	2009-12-18 01:05:06 +00:00
Bob Wilson	a9f20f9f6e	Handle ARM inline asm "w" constraints with 64-bit ("d") registers. The change in SelectionDAGBuilder is needed to allow using bitcasts to convert between f64 (the default type for ARM "d" registers) and 64-bit Neon vector types. Radar 7457110. llvm-svn: 91649	2009-12-18 01:03:29 +00:00
Dan Gohman	c382d6519c	Remove "tail" keywords. These calls are not intended to be tail calls. This protects this test from depending on codegen not performing the tail call optimization by default. llvm-svn: 91648	2009-12-18 01:02:18 +00:00
Jakob Stoklund Olesen	b39930cf6d	Add test case for the phi reuse patch. llvm-svn: 91642	2009-12-18 00:11:44 +00:00
Sean Callanan	06b6feb2e1	Instruction fixes, added instructions, and AsmString changes in the X86 instruction tables. Also (while I was at it) cleaned up the X86 tables, removing tabs and 80-line violations. This patch was reviewed by Chris Lattner, but please let me know if there are any problems. * X86.td Removed tabs and fixed 80-line violations X86Instr64bit.td (IRET, POPCNT, BT_, LSL, SWPGS, PUSH_S, POP_S, L_S, SMSW) Added (CALL, CMOV) Added qualifiers (JMP) Added PC-relative jump instruction (POPFQ/PUSHFQ) Added qualifiers; renamed PUSHFQ to indicate that it is 64-bit only (ambiguous since it has no REX prefix) (MOV) Added rr form going the other way, which is encoded differently (MOV) Changed immediates to offsets, which is more correct; also fixed MOV64o64a to have to a 64-bit offset (MOV) Fixed qualifiers (MOV) Added debug-register and condition-register moves (MOVZX) Added more forms (ADC, SUB, SBB, AND, OR, XOR) Added reverse forms, which (as with MOV) are encoded differently (ROL) Made REX.W required (BT) Uncommented mr form for disassembly only (CVT__2__) Added several missing non-intrinsic forms (LXADD, XCHG) Reordered operands to make more sense for MRMSrcMem (XCHG) Added register-to-register forms (XADD, CMPXCHG, XCHG) Added non-locked forms * X86InstrSSE.td (CVTSS2SI, COMISS, CVTTPS2DQ, CVTPS2PD, CVTPD2PS, MOVQ) Added * X86InstrFPStack.td (COM_FST0, COMP_FST0, COM_FI, COM_FIP, FFREE, FNCLEX, FNOP, FXAM, FLDL2T, FLDL2E, FLDPI, FLDLG2, FLDLN2, F2XM1, FYL2X, FPTAN, FPATAN, FXTRACT, FPREM1, FDECSTP, FINCSTP, FPREM, FYL2XP1, FSINCOS, FRNDINT, FSCALE, FCOMPP, FXSAVE, FXRSTOR) Added (FCOM, FCOMP) Added qualifiers (FSTENV, FSAVE, FSTSW) Fixed opcode names (FNSTSW) Added implicit register operand * X86InstrInfo.td (opaque512mem) Added for FXSAVE/FXRSTOR (offset8, offset16, offset32, offset64) Added for MOV (NOOPW, IRET, POPCNT, IN, BTC, BTR, BTS, LSL, INVLPG, STR, LTR, PUSHFS, PUSHGS, POPFS, POPGS, LDS, LSS, LES, LFS, LGS, VERR, VERW, SGDT, SIDT, SLDT, LGDT, LIDT, LLDT, LODSD, OUTSB, OUTSW, OUTSD, HLT, RSM, FNINIT, CLC, STC, CLI, STI, CLD, STD, CMC, CLTS, XLAT, WRMSR, RDMSR, RDPMC, SMSW, LMSW, CPUID, INVD, WBINVD, INVEPT, INVVPID, VMCALL, VMCLEAR, VMLAUNCH, VMRESUME, VMPTRLD, VMPTRST, VMREAD, VMWRITE, VMXOFF, VMXON) Added (NOOPL, POPF, POPFD, PUSHF, PUSHFD) Added qualifier (JO, JNO, JB, JAE, JE, JNE, JBE, JA, JS, JNS, JP, JNP, JL, JGE, JLE, JG, JCXZ) Added 32-bit forms (MOV) Changed some immediate forms to offset forms (MOV) Added reversed reg-reg forms, which are encoded differently (MOV) Added debug-register and condition-register moves (CMOV) Added qualifiers (AND, OR, XOR, ADC, SUB, SBB) Added reverse forms, like MOV (BT) Uncommented memory-register forms for disassembler (MOVSX, MOVZX) Added forms (XCHG, LXADD) Made operand order make sense for MRMSrcMem (XCHG) Added register-register forms (XADD, CMPXCHG) Added unlocked forms * X86InstrMMX.td (MMX_MOVD, MMV_MOVQ) Added forms * X86InstrInfo.cpp: Changed PUSHFQ to PUSHFQ64 to reflect table change * X86RegisterInfo.td: Added debug and condition register sets * x86-64-pic-3.ll: Fixed testcase to reflect call qualifier * peep-test-3.ll: Fixed testcase to reflect test qualifier * cmov.ll: Fixed testcase to reflect cmov qualifier * loop-blocks.ll: Fixed testcase to reflect call qualifier * x86-64-pic-11.ll: Fixed testcase to reflect call qualifier * 2009-11-04-SubregCoalescingBug.ll: Fixed testcase to reflect call qualifier * x86-64-pic-2.ll: Fixed testcase to reflect call qualifier * live-out-reg-info.ll: Fixed testcase to reflect test qualifier * tail-opts.ll: Fixed testcase to reflect call qualifiers * x86-64-pic-10.ll: Fixed testcase to reflect call qualifier * bss-pagealigned.ll: Fixed testcase to reflect call qualifier * x86-64-pic-1.ll: Fixed testcase to reflect call qualifier * widen_load-1.ll: Fixed testcase to reflect call qualifier llvm-svn: 91638	2009-12-18 00:01:26 +00:00
Eli Friedman	9543d05079	Allow instcombine to combine "sext(a) >u const" to "a >u trunc(const)". llvm-svn: 91631	2009-12-17 22:42:29 +00:00
Eli Friedman	8afea2d095	Make the ptrtoint comparison simplification work if one side is a global. llvm-svn: 91624	2009-12-17 21:27:47 +00:00
Eli Friedman	ff5c248066	Slightly generalize transformation of memmove(a,a,n) so that it also applies to memcpy. (Such a memcpy is technically illegal, but in practice is safe and is generated by struct self-assignment in C code.) llvm-svn: 91621	2009-12-17 21:07:31 +00:00
Bob Wilson	9d4b46c0e6	Re-revert 91459. It's breaking the x86_64 darwin bootstrap. llvm-svn: 91607	2009-12-17 18:34:24 +00:00
Mikhail Glushenkov	40eeddfb23	Add a 'set_option' action for use in OptionPreprocessor. llvm-svn: 91594	2009-12-17 07:49:16 +00:00
Eli Friedman	5c7e38b936	Aggressively flip compare constant expressions where appropriate; constant folding in particular expects null to be on the RHS. llvm-svn: 91587	2009-12-17 06:07:04 +00:00
Evan Cheng	dbd8789125	Revert this dag combine change: Fold (zext (and x, cst)) -> (and (zext x), cst) DAG combiner likes to optimize expression in the other way so this would end up cause an infinite looping. llvm-svn: 91574	2009-12-17 00:40:05 +00:00
Daniel Dunbar	c4abbc0ab6	Reapply r91459, it was only unmasking the bug, and since TOT is still broken having it reverted does no good. llvm-svn: 91559	2009-12-16 20:09:53 +00:00
Daniel Dunbar	929f303477	Revert "Reapply 91184 with fixes and an addition to the testcase to cover the problem", this broke llvm-gcc bootstrap for release builds on x86_64-apple-darwin10. This reverts commit db22309800b224a9f5f51baf76071d7a93ce59c9. llvm-svn: 91534	2009-12-16 10:56:17 +00:00
Chris Lattner	2d5fc1649a	reapply my strstr optimization. I have reproduced the x86-64 bootstrap miscompile (i386.o miscompares) but it happens both with and without this patch. llvm-svn: 91532	2009-12-16 09:32:05 +00:00
Nick Lewycky	503ef79cc5	Make this test pass on Linux. llvm-svn: 91521	2009-12-16 07:35:25 +00:00
Devang Patel	537d43e757	XFAIL on ppc-darwin. llvm-svn: 91495	2009-12-16 02:11:38 +00:00
Evan Cheng	aaf2f58a04	Re-enable 91381 with fixes. llvm-svn: 91489	2009-12-16 00:53:11 +00:00
Chris Lattner	8751050a34	revert my strstr optimization, I'm told it breaks x86-64 bootstrap. Will reapply with a fix when I get a chance. llvm-svn: 91486	2009-12-16 00:46:02 +00:00
Dale Johannesen	365ae431a7	Do better with physical reg operands (typically, from inline asm) in local register allocator. If a reg-reg copy has a phys reg input and a virt reg output, and this is the last use of the phys reg, assign the phys reg to the virt reg. If a reg-reg copy has a phys reg output and we need to reload its spilled input, reload it directly into the phys reg than passing it through another reg. Following 76208, there is sometimes no dependency between the def of a phys reg and its use; this creates a window where that phys reg can be used for spilling (this is true in linear scan also). This is bad and needs to be fixed a better way, although 76208 works too well in practice to be reverted. However, there should normally be no spilling within inline asm blocks. The patch here goes a long way towards making this actually be true. llvm-svn: 91485	2009-12-16 00:29:41 +00:00
Bob Wilson	8505aa648d	Reapply 91184 with fixes and an addition to the testcase to cover the problem found last time. Instead of trying to modify the IR while iterating over it, I've change it to keep a list of WeakVH references to dead instructions, and then delete those instructions later. I also added some special case code to detect and handle the situation when both operands of a memcpy intrinsic are referencing the same alloca. llvm-svn: 91459	2009-12-15 22:00:51 +00:00
Chris Lattner	fa960751b1	optimize strstr, PR5783 llvm-svn: 91438	2009-12-15 19:14:40 +00:00
Mikhail Glushenkov	7743ee06be	Convert llvmc tests to FileCheck. llvm-svn: 91420	2009-12-15 07:21:14 +00:00
Mikhail Glushenkov	2d69ef4077	Support hook invocation from 'append_cmd'. llvm-svn: 91419	2009-12-15 07:20:50 +00:00
Kenneth Uildriks	c0ab5a6e88	For fastcc on x86, let ECX be used as a return register after EAX and EDX llvm-svn: 91410	2009-12-15 03:27:52 +00:00
Evan Cheng	4adb4acc7b	Disable 91381 for now. It's miscompiling ARMISelDAG2DAG.cpp. llvm-svn: 91405	2009-12-15 03:07:11 +00:00
Mikhail Glushenkov	93c8d86be9	Validate the generated C++ code in llvmc tests. Checks that the code generated by 'tblgen --emit-llvmc' can be actually compiled. Also fixes two bugs found in this way: - forward_transformed_value didn't work with non-list arguments - cl::ZeroOrOne is now called cl::Optional llvm-svn: 91404	2009-12-15 03:04:52 +00:00
Mikhail Glushenkov	d2373fd4dc	Pipe 'grep' output to 'count'. llvm-svn: 91403	2009-12-15 03:04:14 +00:00
Mikhail Glushenkov	39b16212f2	Allow $CALL(Hook, '$INFILE') for non-join tools. llvm-svn: 91402	2009-12-15 03:04:02 +00:00
Evan Cheng	c531da60aa	Make 91378 more conservative. 1. Only perform (zext (shl (zext x), y)) -> (shl (zext x), y) when y is a constant. This makes sure it remove at least one zest. 2. If the shift is a left shift, make sure the original shift cannot shift out bits. llvm-svn: 91399	2009-12-15 03:00:32 +00:00
Evan Cheng	cd8f0de016	Use sbb x, x to materialize carry bit in a GPR. The result is all one's or all zero's. llvm-svn: 91381	2009-12-15 00:53:42 +00:00
Evan Cheng	bd48ad16fa	Fold (zext (and x, cst)) -> (and (zext x), cst). llvm-svn: 91380	2009-12-15 00:52:11 +00:00
Evan Cheng	f3b2e55b34	Propagate zest through logical shift. llvm-svn: 91378	2009-12-15 00:41:36 +00:00
Dan Gohman	57dc006590	Fix integer cast code to handle vector types. llvm-svn: 91362	2009-12-14 23:40:38 +00:00
Eric Christopher	abf9df5e6d	Add radar fixed in comment. llvm-svn: 91312	2009-12-14 19:07:25 +00:00
Shantonu Sen	9782dd21e0	Remove empty file completely llvm-svn: 91277	2009-12-14 14:15:15 +00:00
Chris Lattner	6603d21f13	revert r91184, because it causes a crash on a .bc file I just sent to Bob. llvm-svn: 91268	2009-12-14 05:11:02 +00:00
Mikhail Glushenkov	9bad5b8fe4	Add a test for the 'init' option property. llvm-svn: 91259	2009-12-14 04:06:38 +00:00
Evan Cheng	ee5b5917fd	Disable r91104 for x86. It causes partial register stall which pessimize code in 32-bit. llvm-svn: 91223	2009-12-12 20:03:14 +00:00
Benjamin Kramer	6cd9b2ba74	Fix some CHECK lines which were ignored by accident. llvm-svn: 91214	2009-12-12 09:25:50 +00:00
Bob Wilson	8486cae4ce	Revise scalar replacement to be more flexible about handle bitcasts and GEPs. While scanning through the uses of an alloca, keep track of the current offset relative to the start of the alloca, and check memory references to see if the offset & size correspond to a component within the alloca. This has the nice benefit of unifying much of the code from isSafeUseOfAllocation, isSafeElementUse, and isSafeUseOfBitCastedAllocation. The code to rewrite the uses of a promoted alloca, after it is determined to be safe, is reorganized in the same way. Also, when rewriting GEP instructions, mark them as "in-bounds" since all the indices are known to be safe. llvm-svn: 91184	2009-12-11 23:47:40 +00:00
Anton Korobeynikov	724c82337f	Lower setcc branchless, if this is profitable. Based on the patch by Brian Lucas! llvm-svn: 91175	2009-12-11 23:01:29 +00:00
Dan Gohman	2e616e859b	Implement vector widening, splitting, and scalarizing for SIGN_EXTEND_INREG. llvm-svn: 91158	2009-12-11 21:31:27 +00:00
Dan Gohman	0a78e32f6b	Change this to the correct PR number. llvm-svn: 91148	2009-12-11 20:09:21 +00:00
Dan Gohman	c22d542754	Make getUniqueExitBlocks's precondition assert more precise, to avoid spurious failures. This fixes PR5758. llvm-svn: 91147	2009-12-11 20:05:23 +00:00
Dan Gohman	b2cbb1e37e	Fix the result type of SELECT nodes lowered from Select instructions with aggregate return values. This fixes PR5754. llvm-svn: 91145	2009-12-11 19:50:50 +00:00
Anton Korobeynikov	f8b2e2868e	Honour setHasCalls() set from isel. This is used in some weird cases like general dynamic TLS model. This fixes PR5723 llvm-svn: 91144	2009-12-11 19:39:55 +00:00
Evan Cheng	4c304eebe9	Tests for 91103 and 91104. llvm-svn: 91105	2009-12-11 06:02:21 +00:00
Eric Christopher	02ce8cd8a6	Add a test for the fix in revision 91009. llvm-svn: 91062	2009-12-10 21:11:40 +00:00
Evan Cheng	4b7cf3ed41	It's not safe to coalesce a move where src and dst registers have different subregister indices. e.g.: %reg16404:1<def> = MOV8rr %reg16412:2<kill> llvm-svn: 91061	2009-12-10 20:59:45 +00:00
Chris Lattner	ffedf37584	Fix PR5744, a case where we were getting the pointer size instead of the value size. This only manifested when memdep inprecisely returns clobber, which is do to a caching issue in the PR5744 testcase. We can 'efficiently emulate' this by using '-no-aa' llvm-svn: 91004	2009-12-10 00:11:45 +00:00
Evan Cheng	bc633478bd	Fix test. llvm-svn: 90988	2009-12-09 22:24:42 +00:00
Evan Cheng	9e2442c0be	Optimize splat of a scalar load into a shuffle of a vector load when it's legal. e.g. vector_shuffle (scalar_to_vector (i32 load (ptr + 4))), undef, <0, 0, 0, 0> => vector_shuffle (v4i32 load ptr), undef, <1, 1, 1, 1> iff ptr is 16-byte aligned (or can be made into 16-byte aligned). llvm-svn: 90984	2009-12-09 21:00:30 +00:00
Chris Lattner	bf3d03b576	fix hte last remaining known (by me) phi translation bug. When we reanalyze clobbers to forward pieces of large stores to small loads, we need to consider the properly phi translated pointer in the store block. llvm-svn: 90978	2009-12-09 18:21:46 +00:00
Chris Lattner	2f9b661ab8	Add a minor optimization: if we haven't changed the operands of an add, there is no need to scan the world to find the same add again. This invalidates the previous testcase, which wasn't wonderful anyway, because it needed a run of instcombine to permute the use-lists in just the right way to before GVN was run (so it was really fragile). Not a big loss. llvm-svn: 90973	2009-12-09 17:27:45 +00:00
Chris Lattner	e05f9a128c	fix PR5733, a case where we'd replace an add with a lexically identical binary operator that wasn't an add. In this case, a xor. Whoops. llvm-svn: 90971	2009-12-09 17:18:49 +00:00
Chris Lattner	8361f3cfc9	merge crash-2.ll into crash.ll llvm-svn: 90969	2009-12-09 17:17:26 +00:00
Chris Lattner	1f1da3a5a6	the code in GVN that tries to forward large loads to small stores is not phi translating, thus it miscompiles really crazy testcases. This is from inspection, I haven't seen this in the wild. llvm-svn: 90930	2009-12-09 02:43:05 +00:00
Chris Lattner	dda5ca59e2	Switch GVN and memdep to use PHITransAddr, which correctly handles phi translation of complex expressions like &A[i+1]. This has the following benefits: 1. The phi translation logic is all contained in its own class with a strong interface and verification that it is self consistent. 2. The logic is more correct than before. Previously, if intermediate expressions got PHI translated, we'd miss the update and scan for the wrong pointers in predecessor blocks. @phi_trans2 is a testcase for this. 3. We have a lot less code in memdep. We can handle phi translation across blocks of things like @phi_trans3, which is pretty insane :). This patch should fix the miscompiles of 255.vortex, and I tested it with a bootstrap of llvm-gcc, llvm-test and dejagnu of course. llvm-svn: 90926	2009-12-09 01:59:31 +00:00
Evan Cheng	41c13e41fe	Teach InferPtrAlignment to infer GV+cst alignment and use it to simplify x86 isl lowering code. llvm-svn: 90925	2009-12-09 01:53:58 +00:00
Devang Patel	d5a8051dea	Remove tests that are not suitable anymore. Plus they are not testing the original bugfixes anymore. These tests were inserted to check bug fixes in code that handled debug info intrinsics. These intrinsics are no longer used and now llvm parser simply ignores old .dbg intrinsics from these dead tests. llvm-svn: 90923	2009-12-09 01:46:00 +00:00
Devang Patel	11874672da	Revert 90858 90875 and 90805 for now. llvm-svn: 90898	2009-12-08 23:21:45 +00:00
Evan Cheng	edcc21919f	- Support inline asm 'w' constraint for 128-bit vector types. - Also support the 'q' NEON registers asm code. llvm-svn: 90894	2009-12-08 23:06:22 +00:00
Daniel Dunbar	1b05b09ba4	CMake/lit: Add llvm_{unit_,}site_config parameters, and always pass them when running tests from the project files. llvm-svn: 90869	2009-12-08 19:47:36 +00:00
Devang Patel	cb39ef375f	Do not try to push dead variable's debug info into namespace info. llvm-svn: 90857	2009-12-08 15:01:35 +00:00
Duncan Sands	897f9579d6	Teach GlobalOpt to delete aliases with internal linkage (after forwarding any uses). GlobalDCE can also do this, but is only run at -O3. llvm-svn: 90850	2009-12-08 10:10:20 +00:00
Anton Korobeynikov	0ace515a4c	Reduce (cmp 0, and_su (foo, bar)) into (bit foo, bar). This saves extra instruction. Patch inspired by Brian Lucas! llvm-svn: 90819	2009-12-08 01:03:04 +00:00
Evan Cheng	433b8a8753	Test case for 90787. llvm-svn: 90791	2009-12-07 19:42:22 +00:00
David Greene	73ad44c6b6	Use FileCheck and set nounwind on calls. llvm-svn: 90790	2009-12-07 19:40:26 +00:00
Dan Gohman	44e25ed254	Don't enable the post-RA scheduler on x86 except at -O3. In its current form, it is too expensive in compile time. llvm-svn: 90781	2009-12-07 19:04:31 +00:00
Mikhail Glushenkov	9f567e2e67	Implement 'forward_value' and 'forward_transformed_value'. llvm-svn: 90770	2009-12-07 17:03:05 +00:00
Anton Korobeynikov	eee906f4f0	Dynamic stack realignment use of sp register as source/dest register in "bic sp, sp, #15" leads to unpredicatble behaviour in Thumb2 mode. Emit the following code instead: mov r4, sp bic r4, r4, #15 mov sp, r4 llvm-svn: 90724	2009-12-06 22:39:50 +00:00
Chris Lattner	7066a138ff	fix PR5698 llvm-svn: 90708	2009-12-06 17:17:23 +00:00
Chris Lattner	ea3007ddb8	constant fold loads from memcpy's from global constants. This is important because clang lowers nontrivial automatic struct/array inits to memcpy from a global array. llvm-svn: 90698	2009-12-06 05:29:56 +00:00
Chris Lattner	8885e71303	add support for forwarding mem intrinsic values to non-local loads. llvm-svn: 90697	2009-12-06 04:54:31 +00:00
Chris Lattner	6d180b4a2c	gvn is optimizing this better now. llvm-svn: 90696	2009-12-06 04:16:05 +00:00
Chris Lattner	5eba6ee969	Handle forwarding local memsets to loads. For example, we optimize this: short x(short A) { memset(A, 1, sizeof(A)*100); return A[42]; } to 'return 257' instead of doing the load. llvm-svn: 90695	2009-12-06 01:57:02 +00:00
Chris Lattner	f9ff4c0fc4	merge two tests. llvm-svn: 90691	2009-12-06 01:47:24 +00:00
Bill Wendling	887646a585	Temporarily revert r90502. It was causing the llvm-gcc bootstrap on PPC to fail. llvm-svn: 90653	2009-12-05 07:30:23 +00:00
Nick Lewycky	10693e2bb0	Generalize this optimization to work on equality comparisons between any two integers that are constant except for a single bit (the same n-th bit in each). llvm-svn: 90646	2009-12-05 05:00:00 +00:00
Dan Gohman	cf29c2243b	Fix this code to use DIScope instead of DICompileUnit, as in r90181. Don't print "SrcLine"; just print the filename and line number, which is obvious enough and more informative. llvm-svn: 90631	2009-12-05 00:23:29 +00:00
Dan Gohman	e23727694c	Remove now-redundant llvm-as invocations. llvm-svn: 90626	2009-12-05 00:02:37 +00:00
Bill Wendling	87980517df	Add testcase for PR4262. llvm-svn: 90623	2009-12-04 23:29:57 +00:00
Bill Wendling	7993d94840	Temporarily revert r72620 because r72619 was reverted. llvm-svn: 90619	2009-12-04 23:16:56 +00:00
Chris Lattner	107fc93d48	Fix PR5551 by not ignoring the top level constantexpr when folding a load from constant. llvm-svn: 90545	2009-12-04 06:29:29 +00:00
Chris Lattner	0876163071	Small and carefully crafted testcase showing a miscompilation by GVN that I'm working on. This is manifesting as a miscompile of 255.vortex on some targets. No check lines yet because it fails. llvm-svn: 90520	2009-12-04 02:12:12 +00:00
Jakob Stoklund Olesen	7c5af26d12	Also attempt trivial coalescing for live intervals that end in a copy. The coalescer is supposed to clean these up, but when setting up parameters for a function call, there may be copies to physregs. If the defining instruction has been LICM'ed far away, the coalescer won't touch it. The register allocation hint does not always work - when the register allocator is backtracking, it clears the hints. This patch takes care of a few more cases that r90163 missed. llvm-svn: 90502	2009-12-04 00:16:04 +00:00
Nate Begeman	3a9c51f256	Don't pull vector sext through both hands of a logical operation, since doing so prevents the fusion of vector sext and setcc into vsetcc. Add a testcase for the above transformation. Fix a bogus use of APInt noticed while tracking this down. llvm-svn: 90423	2009-12-03 07:11:29 +00:00
Bob Wilson	b53c801366	Recognize canonical forms of vector shuffles where the same vector is used for both source operands. In the canonical form, the 2nd operand is changed to an undef and the shuffle mask is adjusted to only reference elements from the 1st operand. Radar 7434842. llvm-svn: 90417	2009-12-03 06:40:55 +00:00
Owen Anderson	251cb28a25	Fix this crasher, and add a FIXME for a missed optimization. llvm-svn: 90408	2009-12-03 03:43:29 +00:00
Chris Lattner	3bf9321d67	add a failing testcase. llvm-svn: 90380	2009-12-03 01:46:18 +00:00
Chris Lattner	851aea6ce2	fix PR5673 by being more careful about pointers to functions. llvm-svn: 90369	2009-12-03 01:05:45 +00:00
Bill Wendling	0eb481a249	Remove unnecessary check. llvm-svn: 90352	2009-12-02 22:02:20 +00:00
Owen Anderson	f47cde694f	Cleanup/remove some parts of the lifetime region handling code in memdep and GVN, per Chris' comments. Adjust testcases to match. llvm-svn: 90304	2009-12-02 07:35:19 +00:00
Chris Lattner	2d3554c3d9	merge sext-2 into sext.ll llvm-svn: 90293	2009-12-02 05:34:35 +00:00
Chris Lattner	3781027d07	rename test llvm-svn: 90292	2009-12-02 05:32:33 +00:00
Chris Lattner	2c2a69cd14	filecheckize llvm-svn: 90291	2009-12-02 05:32:16 +00:00
Mon P Wang	91ac05d480	Fixed an assertion failure for tracking sext of a vector of integers llvm-svn: 90290	2009-12-02 04:59:58 +00:00
Evan Cheng	0c687845b1	Fix PR5391: support early clobber physical register def tied with a use (ewwww) - A valno should be set HasRedefByEC if there is an early clobber def in the middle of its live ranges. It should not be set if the def of the valno is defined by an early clobber. - If a physical register def is tied to an use and it's an early clobber, it just means the HasRedefByEC is set since it's still one continuous live range. - Add a couple of missing checks for HasRedefByEC in the coalescer. In general, it should not coalesce a vr with a physical register if the physical register has a early clobber def somewhere. This is overly conservative but that's the price for using such a nasty inline asm "feature". llvm-svn: 90269	2009-12-01 22:25:00 +00:00
Jim Grosbach	7688d320c9	test case for IV-Users simplification loop improvement llvm-svn: 90260	2009-12-01 21:53:51 +00:00
Devang Patel	45858cdfe6	Clear function specific containers while processing end of a function, even if DW_TAG_subprogram for current function is not found. llvm-svn: 90247	2009-12-01 18:13:48 +00:00
Chris Lattner	ec294dac55	minimize this a bit more. llvm-svn: 90216	2009-12-01 07:30:01 +00:00
Chris Lattner	7323159b21	merge 2009-11-29-ReverseMap.ll into crash.ll llvm-svn: 90212	2009-12-01 06:22:10 +00:00
Chris Lattner	7c0c90df97	fix PR5640 by tracking whether a block is the header of a loop more precisely, which prevents us from infinitely peeling the loop. llvm-svn: 90211	2009-12-01 06:04:43 +00:00
Jakob Stoklund Olesen	f07d6129a2	Use CFG connectedness as a secondary sort key when deciding the order of copy coalescing. This means that well connected blocks are copy coalesced before the less connected blocks. Connected blocks are more difficult to coalesce because intervals are more complicated, so handling them first gives a greater chance of success. llvm-svn: 90194	2009-12-01 03:03:00 +00:00
Dan Gohman	6bb055cfcd	Add a comment about A[i+(j+1)]. llvm-svn: 90185	2009-12-01 01:38:10 +00:00
Evan Cheng	fcbc30f36e	Fix PR5614: parts of a physical register def may be killed the rest. llvm-svn: 90180	2009-12-01 00:44:45 +00:00
Devang Patel	41e98a786b	Test case for r90175. llvm-svn: 90176	2009-12-01 00:13:06 +00:00
Jakob Stoklund Olesen	ce2743a619	New virtual registers created for spill intervals should inherit allocation hints from the original register. This helps us avoid silly copies when rematting values that are copied to a physical register: leaq _.str44(%rip), %rcx movq %rcx, %rsi call _strcmp becomes: leaq _.str44(%rip), %rsi call _strcmp The coalescer will not touch the movq because that would tie down the physical register. llvm-svn: 90163	2009-11-30 22:55:54 +00:00
Bill Wendling	bacc153c6c	Debug info is disabled on PPC Darwin. llvm-svn: 90160	2009-11-30 22:23:29 +00:00
Nick Lewycky	51b973c964	Add a testcase for the current llvm-gcc build failure. llvm-svn: 90112	2009-11-30 07:02:18 +00:00
Mon P Wang	22b4e4e223	Add test case for r90108 llvm-svn: 90109	2009-11-30 02:42:27 +00:00
Nick Lewycky	bc81d0985e	Fix this test on 64-bit systems which seem to use i64 for gep indices sometimes while 32-bit gcc uses i32. llvm-svn: 90106	2009-11-30 02:23:57 +00:00
Nick Lewycky	5f9ca6b5c9	Commit r90099 made LLVM simplify one of these constant expressions a little more. Update the syntax we're checking for and filecheckize it too. This will fix the selfhost buildbots but will 'break' the others (sigh) because they're still linked against older LLVM which is emitting less optimized IR. llvm-svn: 90104	2009-11-30 00:38:56 +00:00
Nick Lewycky	116b145b02	Teach ConstantFolding to do a better job when folding gep(bitcast). This permits the devirtualization of llvm.org/PR3100#c9 when compiled by clang. llvm-svn: 90099	2009-11-29 21:40:55 +00:00
Chris Lattner	cd6fed25d5	add testcases for the foo_with_overflow op xforms added recently and fix bugs exposed by the tests. Testcases from Alastair Lynn! llvm-svn: 90056	2009-11-29 02:57:29 +00:00
Chris Lattner	5b1941cafb	add PR# llvm-svn: 90049	2009-11-29 01:28:58 +00:00
Chris Lattner	8ba0b842a2	Add a testcase for: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j] = G[j] + G[j+1] + G[j-1]; } which we now compile to one load in the loop: LBB1_2: ## %bb movsd 16(%rsi,%rax,8), %xmm2 incq %rdx addsd %xmm2, %xmm1 addsd %xmm1, %xmm0 movapd %xmm2, %xmm1 movsd %xmm0, 8(%rsi,%rax,8) incq %rax cmpq %rcx, %rax jne LBB1_2 instead of: LBB1_2: ## %bb movsd 8(%rsi,%rax,8), %xmm0 addsd 16(%rsi,%rax,8), %xmm0 addsd (%rsi,%rax,8), %xmm0 movsd %xmm0, 8(%rsi,%rax,8) incq %rax cmpq %rcx, %rax jne LBB1_2 llvm-svn: 90048	2009-11-29 01:15:43 +00:00
Chris Lattner	e7dbdc6a7e	add a testcase for void test9(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } llvm-svn: 90047	2009-11-29 01:04:40 +00:00
Chris Lattner	d48ff7ea6a	Implement PR5634. llvm-svn: 90046	2009-11-29 00:51:17 +00:00
Nick Lewycky	ff44d9d88a	Teach memdep to look for memory use intrinsics during dependency queries. Fixes PR5574. llvm-svn: 90045	2009-11-28 21:27:49 +00:00
Chris Lattner	83284453a1	reenable load address insertion in load pre. This allows us to handle cases like this: void test(int N, double* G) { long j; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } where G[1] isn't live into the loop. llvm-svn: 90041	2009-11-28 16:08:18 +00:00
Chris Lattner	f825d5d176	implement a FIXME: limit the depth that DecomposeGEPExpression goes the same way that getUnderlyingObject does it. This fixes the 'DecomposeGEPExpression and getUnderlyingObject disagree!' assertion on sqlite3. llvm-svn: 90038	2009-11-28 15:12:41 +00:00
Chris Lattner	f3e5cbfc99	disable value insertion for now, I need to figure out how to inform GVN about the newly inserted values. This fixes PR5631. llvm-svn: 90022	2009-11-27 22:50:07 +00:00
Chris Lattner	1fc57583fa	I accidentally implemented this :) llvm-svn: 90014	2009-11-27 19:56:00 +00:00
Chris Lattner	b1fceb6006	add support for recursive phi translation and phi translation of add with immediate. This allows us to optimize this function: void test(int N, double* G) { long j; G[1] = 1; for (j = 1; j < N - 1; j++) G[j+1] = G[j] + G[j+1]; } to only do one load every iteration of the loop. llvm-svn: 90013	2009-11-27 19:11:31 +00:00
Chris Lattner	6f124b48c3	add two simple test cases we now optimize (to one load in the loop each) and one we don't (corresponding to the fixme I added yesterday). llvm-svn: 90012	2009-11-27 18:08:30 +00:00
Chris Lattner	cdfa9dadf1	fix PR5436 by making the 'simple' case of SRoA not promote out of range array indexes. The "complex" case of SRoA still handles them, and correctly. This fixes a weirdness where we'd correctly avoid transforming A[0][42] if the 42 was too large, but we'd only do it if it was one gep, not two separate ones. llvm-svn: 90007	2009-11-27 16:37:41 +00:00
Chris Lattner	02211273c7	filecheckize llvm-svn: 90006	2009-11-27 16:31:59 +00:00
Duncan Sands	638c57757d	While this test is testing a problem in the generic part of codegen, the problem only shows for msp430 and pic16 which is why it specifies them using -march. But it is wrong to put such tests in CodeGen/Generic, since not everyone builds these targets. Put a copy of the test in each of the target test directories. llvm-svn: 90005	2009-11-27 16:04:14 +00:00
Chris Lattner	a466dbe80a	teach GVN's load PRE to insert computations of the address in predecessors where it is not available. It's unclear how to get this inserted computation into GVN's scalar availability sets, Owen, help? :) llvm-svn: 89997	2009-11-27 08:25:10 +00:00
Chris Lattner	9c8da17055	add some tests for memdep phi translation + PRE. llvm-svn: 89996	2009-11-27 06:42:42 +00:00
Chris Lattner	3e12a00447	this test is failing, and is expected to. llvm-svn: 89995	2009-11-27 06:36:28 +00:00
Chris Lattner	ed6850eb34	filecheckize llvm-svn: 89994	2009-11-27 06:33:09 +00:00
Chris Lattner	479eda6018	rename test. llvm-svn: 89993	2009-11-27 06:31:55 +00:00
Chris Lattner	0971e6da1f	Fix phi translation in load PRE to agree with the phi translation done by memdep, and reenable gep translation again. llvm-svn: 89992	2009-11-27 06:31:14 +00:00
Chris Lattner	16ee3226ce	redisable this, my bootstrap worked because it wasn't an optimized build, whoops. llvm-svn: 89991	2009-11-27 05:53:01 +00:00
Chris Lattner	ea3b1f2186	try again. llvm-svn: 89990	2009-11-27 05:19:56 +00:00
Chris Lattner	895214c65e	this is causing buildbot failures, disable for now. llvm-svn: 89985	2009-11-27 01:52:22 +00:00
Chris Lattner	02ffb0a608	teach phi translation of GEPs to simplify geps like 'gep x, 0'. This allows us to compile the example from PR5313 into: LBB1_2: ## %bb incl %ecx movb %al, (%rsi) movslq %ecx, %rax movb (%rdi,%rax), %al testb %al, %al jne LBB1_2 instead of: LBB1_2: ## %bb movslq %eax, %rcx incl %eax movb (%rdi,%rcx), %cl movb %cl, (%rsi) movslq %eax, %rcx cmpb $0, (%rdi,%rcx) jne LBB1_2 llvm-svn: 89981	2009-11-27 00:34:38 +00:00
Chris Lattner	4810fa619f	teach memdep to do trivial PHI translation of GEPs. More to come. llvm-svn: 89979	2009-11-27 00:07:37 +00:00
Chris Lattner	4824ebfded	Teach memdep to phi translate bitcasts. This allows us to compile the example in GCC PR16799 to: LBB1_2: ## %bb1 movl %eax, %eax subq %rax, %rdi movq %rdi, (%rcx) movl (%rdi), %eax testl %eax, %eax je LBB1_2 instead of: LBB1_2: ## %bb1 movl (%rdi), %ecx subq %rcx, %rdi movq %rdi, (%rax) cmpl $0, (%rdi) je LBB1_2 llvm-svn: 89978	2009-11-26 23:41:07 +00:00
Chris Lattner	4bf628a9ba	convert to filecheck llvm-svn: 89977	2009-11-26 23:32:59 +00:00
Chris Lattner	cf7665b0c8	Fix PR5471 by removing an instcombine xform. Some pieces of the code generates store to undef and some generates store to null as the idiom for undefined behavior. Since simplifycfg zaps both, don't remove the undefined behavior in instcombine. llvm-svn: 89971	2009-11-26 22:04:42 +00:00
Chris Lattner	911e5047d0	@test9 is a testcase for r89958. Before 89958, we misanalyzed the first expression as P+4+4i which we considered to possibly alias P+4j. Now we correctly analyze the former one as P+1+4i. @test10 is a sanity test that verfies that we know that P+4+4i != P+4*i. llvm-svn: 89960	2009-11-26 19:25:46 +00:00
Chris Lattner	ce573daf09	Implement PR1143 (at -m64) by making basicaa look through extensions. We previously already handled it at -m32 because there were no i32->i64 extensions for addressing. llvm-svn: 89959	2009-11-26 18:53:33 +00:00
Chris Lattner	d86a693b70	teach GetLinearExpression to be a bit more aggressive. llvm-svn: 89955	2009-11-26 17:00:01 +00:00
Chris Lattner	993cb8c911	update status of this. basicaa is much improved now, only missing the one form (in this testcase). Dan, do you consider this example to be important? llvm-svn: 89953	2009-11-26 16:42:00 +00:00
Chris Lattner	9c88c96b3f	Teach basicaa that x\|c == x+c when the c bits of x are clear. This allows us to compile the example in readme.txt into: LBB1_1: ## %bb movl 4(%rdx,%rax), %ecx movl %ecx, %esi imull (%rdx,%rax), %esi imull %esi, %ecx movl %esi, 8(%rdx,%rax) imull %ecx, %esi movl %ecx, 12(%rdx,%rax) movl %esi, 16(%rdx,%rax) imull %ecx, %esi movl %esi, 20(%rdx,%rax) addq $16, %rax cmpq $4000, %rax jne LBB1_1 instead of: LBB1_1: movl (%rdx,%rax), %ecx imull 4(%rdx,%rax), %ecx movl %ecx, 8(%rdx,%rax) imull 4(%rdx,%rax), %ecx movl %ecx, 12(%rdx,%rax) imull 8(%rdx,%rax), %ecx movl %ecx, 16(%rdx,%rax) imull 12(%rdx,%rax), %ecx movl %ecx, 20(%rdx,%rax) addq $16, %rax cmpq $4000, %rax jne LBB1_1 GCC (4.2) doesn't seem to be able to eliminate the loads in this testcase either, it generates: L2: movl (%rdx), %eax imull 4(%rdx), %eax movl %eax, 8(%rdx) imull 4(%rdx), %eax movl %eax, 12(%rdx) imull 8(%rdx), %eax movl %eax, 16(%rdx) imull 12(%rdx), %eax movl %eax, 20(%rdx) addl $4, %ecx addq $16, %rdx cmpl $1002, %ecx jne L2 llvm-svn: 89952	2009-11-26 16:26:43 +00:00
Chris Lattner	677b93d4c8	teach basicaa that A[i] != A[i+1]. llvm-svn: 89951	2009-11-26 16:18:10 +00:00
Chris Lattner	82257f0385	rename test llvm-svn: 89950	2009-11-26 16:08:41 +00:00
Chris Lattner	69e59e50f3	Change the other half of aliasGEP (which handles GEP differencing) to use DecomposeGEPExpression. This dramatically simplifies and shrinks the code by eliminating the horrible CheckGEPInstructions method, fixes a miscompilation (@test3) and makes the code more aggressive. In particular, we now handle the @test4 case, which is reduced from the SmallPtrSet constructor. Missing this caused us to emit a variable length memset instead of a fixed size one. llvm-svn: 89922	2009-11-26 02:17:34 +00:00
Chris Lattner	862a3532d6	add a new random feature test llvm-svn: 89921	2009-11-26 02:16:28 +00:00
Evan Cheng	dd352c2a81	Test for 89905. llvm-svn: 89906	2009-11-26 00:35:01 +00:00
Dale Johannesen	8f1aaa92b2	Test for llvm-gcc checkin 89898. llvm-svn: 89899	2009-11-25 23:50:09 +00:00
Evan Cheng	bdedf32e51	ProcessImplicitDefs should watch out for invalidated iterator and extra implicit operands on copies. llvm-svn: 89880	2009-11-25 21:13:39 +00:00
Bruno Cardoso Lopes	038281c523	Support PIC loading of constant pool entries llvm-svn: 89863	2009-11-25 12:17:58 +00:00
Edward O'Callaghan	4b197b8908	Reverting patch in revision 89758, initial attempt at fixing PR5373 has proven to be bogus. llvm-svn: 89844	2009-11-25 05:38:41 +00:00
Dale Johannesen	5809ff0e58	Do not store R31 into the caller's link area on PPC. This violates the ABI (that area is "reserved"), and while it is safe if all code is generated with current compilers, there is some very old code around that uses that slot for something else, and breaks if it is stored into. Adjust testcases looking for current behavior. I've verified that the stack frame size is right in all testcases, whether it changed or not. 7311323. llvm-svn: 89811	2009-11-24 22:59:02 +00:00
Edward O'Callaghan	8c1cd4fdbc	Fix for PR5373, Credit to Jakub Staszak. llvm-svn: 89758	2009-11-24 11:51:52 +00:00
Evan Cheng	b81878ed80	Enable predication of NEON instructions in Thumb2 mode. llvm-svn: 89748	2009-11-24 08:06:15 +00:00
Anton Korobeynikov	0f885eb7fd	Materialize global addresses via movt/movw pair, this is always better than doing the same via constpool: 1. Load from constpool costs 3 cycles on A9, movt/movw pair - just 2. 2. Load from constpool might stall up to 300 cycles due to cache miss. 3. Movt/movw does not use load/store unit. 4. Less constpool entries => better compiler performance. This is only enabled on ELF systems, since darwin does not have needed relocations (yet). llvm-svn: 89720	2009-11-24 00:44:37 +00:00
Jim Grosbach	76b545e988	move fconst[sd] to UAL. <rdar://7414913> llvm-svn: 89700	2009-11-23 21:08:25 +00:00
Jim Grosbach	b7607ee5fe	update test for 89694 llvm-svn: 89695	2009-11-23 20:39:53 +00:00
Dan Gohman	58bb87921b	Make ConstantFoldConstantExpression recursively visit the entire ConstantExpr, not just the top-level operator. This allows it to fold many more constants. Also, make GlobalOpt call ConstantFoldConstantExpression on GlobalVariable initializers. llvm-svn: 89659	2009-11-23 16:22:21 +00:00
Dan Gohman	0ef3e7cf76	Fix a use of an invalidated iterator in the case where there are multiple adjacent uses of a dead basic block from the same user. This fixes PR5596. llvm-svn: 89658	2009-11-23 16:13:39 +00:00
Nick Lewycky	9d1ee635e3	Reapply r88830 with a bugfix: this transform only applies to icmp eq/ne. This fixes part of PR5438. llvm-svn: 89639	2009-11-23 03:17:33 +00:00
Chris Lattner	632f60ccc9	remove a silly condition that doesn't make a lot of sense anymore. llvm-svn: 89601	2009-11-22 16:15:59 +00:00
Edward O'Callaghan	573a04cfbb	Miss two, PR5307. llvm-svn: 89596	2009-11-22 15:35:28 +00:00
Edward O'Callaghan	a295e7bd9b	Convert Thumb2 tests to FileCheck for PR5307. llvm-svn: 89595	2009-11-22 15:18:27 +00:00

... 4 5 6 7 8 ...

9184 Commits