llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Chris Lattner	3adae91c70	rename a function to indicate that it checks for profitability as well as legality. Make load sinking and gep sinking more careful: we only do it when it won't pessimize loads from the stack. This has the added benefit of not producing code that is unanalyzable to SROA. llvm-svn: 65209	2009-02-21 00:46:50 +00:00
Evan Cheng	c2541a4450	Fix strange logic in CollectIVUsers used to determine whether all uses are addresses, part 1. This fixes an obvious logic bug. Previously if the only in-loop use is a PHI, it would return AllUsesAreAddresses as true. llvm-svn: 65178	2009-02-20 22:16:49 +00:00
Dan Gohman	b8783d240b	Simplify code and reduce indentation. No functionality change. llvm-svn: 65167	2009-02-20 21:27:23 +00:00
Dan Gohman	4612c1d92f	Fix 80-column violations. llvm-svn: 65159	2009-02-20 21:06:57 +00:00
Dan Gohman	33c5714553	It's not necessary to check if Base is null here. llvm-svn: 65157	2009-02-20 21:05:23 +00:00
Dan Gohman	271a6f1142	Add a comment about how Imm can be used for loop-variant values. llvm-svn: 65147	2009-02-20 20:29:04 +00:00
Evan Cheng	d8aad94754	Factor address mode matcher out of codegen prepare to make it available to other passes, e.g. loop strength reduction. llvm-svn: 65134	2009-02-20 18:24:38 +00:00
Zhou Sheng	0c2e862ad0	Just roll back the previous change to -mem2reg. Will re-think about this according to Chris's comments. llvm-svn: 65126	2009-02-20 17:49:33 +00:00
Zhou Sheng	580c176f47	patch to update the line number information in pass -mem2reg. Currently this pass will delete the variable declaration info, and keep the line number info. But the kept line number info is not updated, and some is redundant or not correct, this patch just updates those info. llvm-svn: 65123	2009-02-20 16:31:35 +00:00
Dan Gohman	4e8fc41d48	Implement "superhero" strength reduction, or full strength reduction of address calculations down to basic pointer arithmetic. This is currently off by default, as it needs a few other features before it becomes generally useful. And even when enabled, full strength reduction is only performed when it doesn't increase register pressure, and when several other conditions are true. This also factors a bunch of existing LSR code out of StrengthReduceStridedIVUsers into separate functions, and tidies up IV insertion. This actually decreases register pressure even in non-superhero mode. The change in iv-users-in-other-loops.ll is an example of this; there are two more adds because there are two fewer leas, and there is less spilling. llvm-svn: 65108	2009-02-20 04:17:46 +00:00
Dan Gohman	eb7aa11e26	Use DEBUG() instead of passing *DOUT to WriteAsOperand, since the latter just passes a null reference when debugging is not enabled. llvm-svn: 65060	2009-02-19 19:32:06 +00:00
Dan Gohman	9c41f5e046	Make the debug output of LSR less cryptic and more informative. llvm-svn: 65057	2009-02-19 19:23:27 +00:00
Duncan Sands	d1fef83598	In theory the aliasee may have dead constant users here. Since we only do the transform if there is one use, strip off any such users in the hope of making the transform fire more often. llvm-svn: 64926	2009-02-18 17:55:38 +00:00
Dan Gohman	451474da4a	Use a sign-extend instead of a zero-extend when promoting a trip count value when the original loop iteration condition is signed and the canonical induction variable won't undergo signed overflow. This isn't required for correctness; it just preserves more information about original loop iteration values. Add a getTruncateOrSignExtend method to ScalarEvolution, following getTruncateOrZeroExtend. llvm-svn: 64918	2009-02-18 17:22:41 +00:00
Dan Gohman	5530918aff	Simplify by using dyn_cast instead of isa and cast. llvm-svn: 64917	2009-02-18 16:54:33 +00:00
Dan Gohman	30770ee7b3	Add explicit keywords. llvm-svn: 64915	2009-02-18 16:37:45 +00:00
Dan Gohman	0e73582689	Eliminate several more unnecessary intptr_t casts. llvm-svn: 64888	2009-02-18 05:09:16 +00:00
Dan Gohman	3fc2e67140	Fix a corner case in the new indvars promotion logic: if there are multiple IV's in a loop, some of them may undergo signed or unsigned wrapping even if the IV that's used in the loop exit condition doesn't. Restrict sign-extension-elimination and zero-extension-elimination to only those that operate on the original loop-controlling IV. llvm-svn: 64866	2009-02-18 00:52:00 +00:00
Dan Gohman	4f0fccdf9b	Fix a typo in a comment. llvm-svn: 64859	2009-02-18 00:08:39 +00:00
Duncan Sands	e605b83258	If an alias is dead and so is its aliasee, then globaldce would crash because the alias would still be using the aliasee when the aliasee was deleted. llvm-svn: 64844	2009-02-17 23:05:26 +00:00
Dan Gohman	ced54f0173	LoopIndexSplit doesn't actually use ScalarEvolution. llvm-svn: 64811	2009-02-17 20:50:11 +00:00
Dan Gohman	59b08852dc	Add a method to ScalarEvolution for telling it when a loop has been modified in a way that may affect the trip count calculation. Change IndVars to use this method when it rewrites pointer or floating-point induction variables instead of using a doInitialization method to sneak these changes in before ScalarEvolution has a chance to see the loop. This eliminates the need for LoopPass to depend on ScalarEvolution. llvm-svn: 64810	2009-02-17 20:49:49 +00:00
Chris Lattner	0837686a2a	commit a tweaked version of Daniel's patch for PR3599. We now eliminate all the extensions and all but the one required truncate from the testcase, but the or/and/shift stuff still isn't zapped. llvm-svn: 64809	2009-02-17 20:47:23 +00:00
Dan Gohman	72f656f2ef	Delete trailing whitespace. llvm-svn: 64784	2009-02-17 19:13:57 +00:00
Duncan Sands	c0436287c2	This transform also applies to private linkage. llvm-svn: 64773	2009-02-17 17:50:04 +00:00
Dan Gohman	07418e014e	Fix 80-column violation. llvm-svn: 64766	2009-02-17 15:57:39 +00:00
Evan Cheng	9a8e419015	Strengthen the "non-constant stride must dominate loop preheader" check. llvm-svn: 64703	2009-02-17 00:13:06 +00:00
Dan Gohman	36c8002915	Simplify; fix some 80-column violations. llvm-svn: 64702	2009-02-17 00:10:53 +00:00
Dan Gohman	e06ea828a2	Fix EnforceKnownAlignment so that it doesn't ever reduce the alignment of an alloca or global variable. llvm-svn: 64693	2009-02-16 23:02:21 +00:00
Nick Lewycky	6feb7523b8	Fix typo caused by too much surfing, dudes... llvm-svn: 64626	2009-02-16 04:26:53 +00:00
Dan Gohman	47a6dc9ad1	Delete this long-commented-out code. The situation it seems to have been written for is no longer relevant with the elimination of signed and unsigned types. llvm-svn: 64625	2009-02-16 02:57:42 +00:00
Dan Gohman	3d93bc5654	Change these tests to use regular loads instead of llvm.x86.sse2.loadu.dq. Enhance instcombine to use the preferred field of GetOrEnforceKnownAlignment in more cases, so that regular IR operations are optimized in the same way that the intrinsics currently are. llvm-svn: 64623	2009-02-16 00:44:23 +00:00
Nick Lewycky	9178be6059	Update the list of function annotations for nocapture. All of these came up when I was looking at functions used by python. Highlights include, better largefile support (64-bit file sizes on 32-bit systems), fputs string is nocapture, popen/pclose added (popen being noalias return), modf and frexp and friends. Also added some missing 'break' statements and combined identical sections. llvm-svn: 64615	2009-02-15 22:47:25 +00:00
Duncan Sands	6c1ce1dbd5	Make this more useful for cleaning up after the one-definition-rule llvm-gcc changes (coming soon to a tree near you!). llvm-svn: 64588	2009-02-15 11:54:49 +00:00
Duncan Sands	0e6fcb078c	If the target of an alias has internal linkage, then the alias can be morphed into the target. Implement this transform, and fix a crash in the existing transform at the same time. llvm-svn: 64583	2009-02-15 09:56:08 +00:00
Evan Cheng	02d9156a8d	Fix pr3571: If stride is a value defined by an instruction, make sure it dominates the loop preheader. When IV users are strength reduced, the stride is inserted into the preheader. It could create a use before def situation. llvm-svn: 64579	2009-02-15 06:06:15 +00:00
Evan Cheng	e0558412a4	ifdef out unneeded if statement. llvm-svn: 64575	2009-02-15 03:20:37 +00:00
Dan Gohman	3695fd42a9	Extend the IndVarSimplify support for promoting induction variables: - Test for signed and unsigned wrapping conditions, instead of just testing for non-negative induction ranges. - Handle loops with GT comparisons, in addition to LT comparisons. - Support more cases of induction variables that don't start at 0. llvm-svn: 64532	2009-02-14 02:31:09 +00:00
Dan Gohman	928d619b5e	Clarify debug output. llvm-svn: 64531	2009-02-14 02:26:50 +00:00
Dan Gohman	bd231d2e7b	Simplify some code. hasComputableLoopEvolution is overkill in this case. No functionality change. llvm-svn: 64530	2009-02-14 02:25:19 +00:00
Dan Gohman	f01c6af944	In CodeGenPrepare's debug output, use WriteAsOperand instead of printing getName(), so that unnamed values are printed correctly. llvm-svn: 64468	2009-02-13 17:45:12 +00:00
Dan Gohman	484ce19297	Complete the sentence in this comment. I have reservations about the code it describes, but at least now the comment is right. llvm-svn: 64465	2009-02-13 17:36:42 +00:00
Nick Lewycky	0a8e13fd8b	Mark strto* as readonly when the endptr is null. llvm-svn: 64460	2009-02-13 17:08:33 +00:00
Nick Lewycky	7ec551cfad	On strtod and friends, mark 'endptr' nocapture in the function prototype, and mark the first argument nocapture if endptr=NULL for each particular call. llvm-svn: 64453	2009-02-13 15:31:46 +00:00
Dan Gohman	aec5be6b01	Fix the code that checked if a SCEVAddRecExpr Start contains an addrec in a different loop to check the value being added to the accumulated Start value, not the Start value before it has the new value added to it. This prevents LSR from going crazy on the included testcase. Dale, please review. llvm-svn: 64440	2009-02-13 03:58:31 +00:00
Dan Gohman	3ade7d2346	Fix LSR's IV sorting function to explicitly sort by bitwidth after sorting by stride value. This prevents it from missing IV reuse opportunities in a host-sensitive manner. llvm-svn: 64415	2009-02-13 00:26:43 +00:00
Dan Gohman	02d4601fcf	Teach IndVarSimplify to optimize code using the C "int" type for loop induction on LP64 targets. When the induction variable is used in addressing, IndVars now is usually able to insert a 64-bit induction variable and eliminate the sign-extending cast. This is also useful for code using C "short" types for induction variables on targets with 32-bit addressing. Inserting a wider induction variable is easy; the tricky part is determining when trunc(sext(i)) expressions are no-ops. This requires range analysis of the loop trip count. A common case is when the original loop iteration starts at 0 and exits when the induction variable is signed-less-than a fixed value; this case is now handled. This replaces IndVarSimplify's OptimizeCanonicalIVType. It was doing the same optimization, but it was limited to loops with constant trip counts, because it was running after the loop rewrite, and the information about the original induction variable is lost by that point. Rename ScalarEvolution's executesAtLeastOnce to isLoopGuardedByCond, generalize it to be able to test for ICMP_NE conditions, and move it to be a public function so that IndVars can use it. llvm-svn: 64407	2009-02-12 22:19:27 +00:00
Dan Gohman	f74d17b36a	Add a utility function to LoopInfo to return the exit block when the loop has exactly one exit, and make use of it in LoopIndexSplit. llvm-svn: 64388	2009-02-12 18:08:24 +00:00
Dan Gohman	faf109b851	This code doesn't actually use the ExitingBlocks list. llvm-svn: 64376	2009-02-12 16:36:26 +00:00
Chris Lattner	5babade39e	Fix a nasty bug (PR3550) where the inline pass could incorrectly mark calls with the tail marker when inlining them through an invoke. Patch, testcase, and perfect analysis by Jay Foad! llvm-svn: 64364	2009-02-12 07:06:42 +00:00
Chris Lattner	d093b49b81	improve naming of values in GVN, patch by Jay Foad! llvm-svn: 64363	2009-02-12 07:00:35 +00:00
Chris Lattner	e5ec807aaf	fix PR3537: if resetting bbi back to the start of a block, we need to forget about already inserted expressions. llvm-svn: 64362	2009-02-12 06:56:08 +00:00
Nick Lewycky	1a40fb2473	Don't mark all args to strtod and friends as nocapture. llvm-svn: 64352	2009-02-12 03:18:34 +00:00
Nate Begeman	9b68eff12e	the two non-mask arguments to a shufflevector must be the same width, but they do not have to be the same width as the result value. llvm-svn: 64335	2009-02-11 22:36:25 +00:00
Devang Patel	dd611eac76	If llvm.dbg.region.end is disappearing then remove corresponding llvm.dbg.func.start also. llvm-svn: 64278	2009-02-11 01:29:06 +00:00
Devang Patel	60571be0de	Ignore dbg intrinsic while folding unconditional branch. llvm-svn: 64242	2009-02-10 22:14:17 +00:00
Devang Patel	db7596dbee	Use early exits. Reduce indentation. llvm-svn: 64226	2009-02-10 19:28:07 +00:00
Devang Patel	6c041de2ff	Do not clone llvm.dbg.func.start and corresponding llvm.dbg.region.end during inlining. llvm-svn: 64209	2009-02-10 07:48:18 +00:00
Devang Patel	7377e7aa89	Enable scalar replacement of an AllocaInst when one of its users is dbg info. llvm-svn: 64207	2009-02-10 07:00:59 +00:00
Dale Johannesen	ef9b8f0d4c	Fix PR 3471, and some cleanups. llvm-svn: 64177	2009-02-09 22:14:15 +00:00
Bill Wendling	22173be9c5	Mistakenly turned this on. llvm-svn: 64065	2009-02-08 01:32:00 +00:00
Bill Wendling	4ed0306d6f	Revert r63999. It was breaking self-hosting builds. llvm-svn: 64062	2009-02-08 00:58:05 +00:00
Mon P Wang	028d995112	Instcombine should not change load(cast p) to cast(load p) if the cast changes the address space of the pointer. llvm-svn: 64035	2009-02-07 22:19:29 +00:00
Mike Stump	ea0132f5bc	Insert space to avoid warning and make code more readable. llvm-svn: 64003	2009-02-07 03:36:02 +00:00
Devang Patel	85ae609834	Ignore DbgInfoIntrinsics. llvm-svn: 63923	2009-02-06 06:19:06 +00:00
Chris Lattner	5118081112	fix PR3489, use bits instead of bytes. llvm-svn: 63916	2009-02-06 04:34:07 +00:00
Devang Patel	a6f77d01c7	Ignore dbg intrinsics while propagating conditional expression info. Take 2. llvm-svn: 63898	2009-02-05 23:32:52 +00:00
Devang Patel	72f5fba371	Revert rev. 63876. It is causing llvm-gcc bootstrap failure. llvm-svn: 63888	2009-02-05 21:46:41 +00:00
Devang Patel	5b3fe253c5	Remove dead blocks in the end. llvm-svn: 63880	2009-02-05 19:59:42 +00:00
Devang Patel	66eee02024	Ignore dbg intrinsics while propagating conditional expression info. llvm-svn: 63876	2009-02-05 19:15:39 +00:00
Devang Patel	e665f78460	Ignore dbg intrinsics while folding switch instruction. llvm-svn: 63802	2009-02-05 00:30:42 +00:00
Devang Patel	10be164b28	Ignore dbg intrinsics. llvm-svn: 63781	2009-02-04 21:39:48 +00:00
Devang Patel	2fac28a8c7	While folding value comparison terminators ignore dbg intrinsics. llvm-svn: 63700	2009-02-04 01:06:11 +00:00
Devang Patel	bc5a1a7007	Ignore dbg intrinsics while hoisting common code in the two blocks up into the branch block. llvm-svn: 63687	2009-02-04 00:03:08 +00:00
Devang Patel	4b56b3c66e	Do not let dbg intrinsic block folding of two entry phi node. llvm-svn: 63671	2009-02-03 22:12:02 +00:00
Devang Patel	ffd9b999f8	If "optimize for size" attribute is set then block non-trivial loop unswitches but allow trivial loop unswitches. llvm-svn: 63670	2009-02-03 22:04:27 +00:00
Chris Lattner	4d41e7d461	teach "convert from scalar" to handle loads of fca's. llvm-svn: 63659	2009-02-03 21:08:45 +00:00
Chris Lattner	fc79cef792	refactor the interface to ConvertUsesOfLoadToScalar, renaming it to ConvertScalar_ExtractValue llvm-svn: 63658	2009-02-03 21:01:03 +00:00
Chris Lattner	e638ec187b	convert ConvertUsesOfLoadToScalar to use IRBuilder, no functionality change. llvm-svn: 63652	2009-02-03 19:45:44 +00:00
Chris Lattner	db7a4ea569	switch ConvertScalar_InsertValue to use an IRBuilder, no functionality change. llvm-svn: 63651	2009-02-03 19:41:50 +00:00
Chris Lattner	eb3d568867	make scalar conversion handle stores of first class aggregate values. loads are not yet handled (coming soon to an sroa near you). llvm-svn: 63649	2009-02-03 19:30:11 +00:00
Chris Lattner	5f3116636b	Make SROA produce a vector only when the alloca is actually accessed at least once as a vector. This prevents it from compiling the example in not-a-vector into: define double @test(double %A, double %B) { %tmp4 = insertelement <7 x double> undef, double %A, i32 0 %tmp = insertelement <7 x double> %tmp4, double %B, i32 4 %tmp2 = extractelement <7 x double> %tmp, i32 4 ret double %tmp2 } instead, producing the integer code. Producing vectors when they aren't otherwise in the program is dangerous because a lot of other code treats them carefully and doesn't want to break them down. OTOH, many things want to break down tasty i448's. llvm-svn: 63638	2009-02-03 18:15:05 +00:00
Evan Cheng	b3da5fb3a4	APInt'fy SimplifyDemandedVectorElts so it can analyze vectors with more than 64 elements. llvm-svn: 63631	2009-02-03 10:05:09 +00:00
Chris Lattner	447b5517bc	add another case of undefined behavior without crashing, PR3466. llvm-svn: 63620	2009-02-03 07:08:57 +00:00
Nick Lewycky	a676cf98e3	Revert r63600. It didn't fix the bug, it just moved it a bit. llvm-svn: 63618	2009-02-03 06:30:37 +00:00
Nick Lewycky	cd8353b6fe	Update the callgraph when replacing InvokeInst with CallInst when inlining. llvm-svn: 63600	2009-02-03 04:34:40 +00:00
Chris Lattner	b47738daab	Teach ConvertUsesToScalar to handle memset, allowing it to handle crazy cases like: struct f { int A, B, C, D, E, F; }; short test4() { struct f A; A.A = 1; memset(&A.B, 2, 12); return A.C; } llvm-svn: 63596	2009-02-03 02:01:43 +00:00
Chris Lattner	2dae393299	rearrange how SRoA handles promotion of allocas to vectors. With the new world order, it can handle cases where the first store into the alloca is an element of the vector, instead of requiring the first analyzed store to have the vector type itself. This allows us to un-xfail test/CodeGen/X86/vec_ins_extract.ll. llvm-svn: 63590	2009-02-03 01:30:09 +00:00
Chris Lattner	7ce69dfa56	inline SROA::ConvertToScalar, no functionality change. llvm-svn: 63544	2009-02-02 20:44:45 +00:00
Chris Lattner	ce09ac0c3d	Fix a bug which caused us to miscompile a couple of Ada tests. Thanks for the beautiful reduced testcase Duncan! llvm-svn: 63529	2009-02-02 18:02:59 +00:00
Duncan Sands	b469789780	Fix a comment (bytes -> bits), reformat a comment and remove trailing whitespace. No functionality change. llvm-svn: 63511	2009-02-02 10:06:20 +00:00
Duncan Sands	3d56fe0ca0	Fix an obvious thinko. llvm-svn: 63510	2009-02-02 09:53:14 +00:00
Chris Lattner	6402178a04	reduce indentation, (~XorCST->getValue()).isSignBit() -> isMaxSignedValue() llvm-svn: 63500	2009-02-02 07:15:30 +00:00
Nick Lewycky	e25b96473e	Reinstate this optimization to fold icmp of xor when possible. Don't try to turn icmp eq a+x, b+x into icmp eq a, b if a+x or b+x has other uses. This may have been increasing register pressure leading to the bzip2 slowdown. llvm-svn: 63487	2009-01-31 21:30:05 +00:00
Chris Lattner	26698a600e	Fix PR3452 (an infinite loop bootstrapping) by disabling the recent improvements to the EvaluateInDifferentType code. This code works by just inserting a bunch of new code and then seeing if it is useful. Instcombine is not allowed to do this: it can only insert new code if it is useful, and only when it is converging to a more canonical fixed point. Now that we iterate when DCE makes progress, this causes an infinite loop when the code ends up not being used. llvm-svn: 63483	2009-01-31 19:05:27 +00:00
Chris Lattner	c4729610fc	now that all the pieces are in place, teach instcombine's simplifydemandedbits to simplify instructions with multiple uses in contexts where it can get away with it. This allows it to simplify the code in multi-use-or.ll into a single 'add double'. This change is particularly interesting because it will cover up for some common codegen bugs with large integers created due to the recent SROA patch. When working on fixing those bugs, this should be disabled. llvm-svn: 63481	2009-01-31 08:40:03 +00:00
Chris Lattner	85ecfee7f3	simplify/clarify control flow and improve comments, no functionality change. llvm-svn: 63480	2009-01-31 08:24:16 +00:00
Chris Lattner	a899f8b75d	make some fairly meaty internal changes to how SimplifyDemandedBits works. Now, if it detects that "V" is the same as some other value, SimplifyDemandedBits returns the new value instead of RAUW'ing it immediately. This has two benefits: 1) simpler code in the recursive SimplifyDemandedBits routine. 2) it allows future fun stuff in instcombine where an operation has multiple uses and can be simplified in one context, but not all. #2 isn't implemented yet, this patch should have no functionality change. llvm-svn: 63479	2009-01-31 08:15:18 +00:00
Chris Lattner	95fe6579dd	minor cleanups llvm-svn: 63477	2009-01-31 07:26:06 +00:00
Chris Lattner	abf34563ec	make sure to set Changed=true when instcombine hacks on the code, not doing so prevents it from properly iterating and prevents it from deleting the entire body of dce-iterate.ll llvm-svn: 63476	2009-01-31 07:04:22 +00:00
Chris Lattner	235913be77	Simplify and generalize the SROA "convert to scalar" transformation to be able to handle ANY alloca that is poked by loads and stores of bitcasts and GEPs with constant offsets. Before the code had a number of annoying limitations and caused it to miss cases such as storing into holes in structs and complex casts (as in bitfield-sroa) where we had unions of bitfields etc. This also handles a number of important cases that are exposed due to the ABI lowering stuff we do to pass stuff by value. One case that is pretty great is that we compile 2006-11-07-InvalidArrayPromote.ll into: define i32 @func(<4 x float> %v0, <4 x float> %v1) nounwind { %tmp10 = call <4 x i32> @llvm.x86.sse2.cvttps2dq(<4 x float> %v1) %tmp105 = bitcast <4 x i32> %tmp10 to i128 %tmp1056 = zext i128 %tmp105 to i256 %tmp.upgrd.43 = lshr i256 %tmp1056, 96 %tmp.upgrd.44 = trunc i256 %tmp.upgrd.43 to i32 ret i32 %tmp.upgrd.44 } which turns into: _func: subl $28, %esp cvttps2dq %xmm1, %xmm0 movaps %xmm0, (%esp) movl 12(%esp), %eax addl $28, %esp ret Which is pretty good code all things considering :). One effect of this is that SROA will start generating arbitrary bitwidth integers that are a multiple of 8 bits. In the case above, we got a 256 bit integer, but the codegen guys assure me that it can handle the simple and/or/shift/zext stuff that we're doing on these operations. This addresses rdar://6532315 llvm-svn: 63469	2009-01-31 02:28:54 +00:00
Gabor Greif	690930270b	use precise getters llvm-svn: 63402	2009-01-30 18:21:13 +00:00
Chris Lattner	f9dd07a3c3	Fix some issues with volatility, move "CanConvertToScalar" check after the others. llvm-svn: 63227	2009-01-28 20:16:43 +00:00
Duncan Sands	aee16d4916	Rename getAnalysisToUpdate to getAnalysisIfAvailable. llvm-svn: 63198	2009-01-28 13:14:17 +00:00
Mon P Wang	80efbf07bd	Fixed optimization of combining two shuffles where the first shuffle's inputs have a different number of elements than the output. llvm-svn: 62998	2009-01-26 04:39:00 +00:00
Chris Lattner	f93b292d9b	Handle single-entry phi nodes gracefully in condprop. llvm-svn: 62985	2009-01-26 02:18:20 +00:00
Chris Lattner	5549fb4e74	Fix PR3408 by making a non-obvious assumption very obvious, and handling the flaw inherent in that assumption. :) llvm-svn: 62984	2009-01-26 02:11:30 +00:00
Chris Lattner	e9c2c98b59	More cleanups and simplifications, no functionality change. llvm-svn: 62983	2009-01-26 01:57:01 +00:00
Chris Lattner	1e52603152	tidy asserts llvm-svn: 62982	2009-01-26 01:38:24 +00:00
Nick Lewycky	459667b48d	The function that does nothing but call malloc is noalias return. llvm-svn: 62956	2009-01-25 07:59:57 +00:00
Dale Johannesen	b818fa1bd3	Revert previous change; even this mild and clearly more accurate change loses more than it gains on benchmarks. llvm-svn: 62938	2009-01-24 21:49:34 +00:00
Torok Edwin	2a7e7066b3	testcase for PR3381. Also it was an empty struct, not a void after all. llvm-svn: 62920	2009-01-24 17:16:04 +00:00
Torok Edwin	726354d4ce	void* is represented as pointer to empty struct {}. Thus we need to check whether the struct is empty before trying to index into it. This fixes PR3381. llvm-svn: 62918	2009-01-24 11:30:49 +00:00
Dale Johannesen	1867040e00	Improve the inlining cost function a bit. Little practical effect. llvm-svn: 62908	2009-01-24 01:27:33 +00:00
Chris Lattner	d386e82ec9	Make InstCombineStoreToCast handle aggregates more aggressively, handling the case in Transforms/InstCombine/cast-store-gep.ll, which is a heavily reduced testcase from Clang on x86-64. llvm-svn: 62904	2009-01-24 01:00:13 +00:00
Gabor Greif	2d7bf2a76c	use CallSite::isCallee instead of slow getOperandNo llvm-svn: 62877	2009-01-23 21:17:04 +00:00
Gabor Greif	fbd40904c1	Simplify the logic of getting hold of a PHI predecessor block. There is now a direct way from value-use-iterator to incoming block in PHINode's API. This way we avoid the iterator->index->iterator trip, and especially the costly getOperandNo() invocation. Additionally there is now an assertion that the iterator really refers to one of the PHI's Uses. llvm-svn: 62869	2009-01-23 19:40:15 +00:00
Gabor Greif	d56b0a8c03	introduce a useful abstraction to find out if a Use is in the call position of an instruction llvm-svn: 62788	2009-01-22 21:35:57 +00:00
Chris Lattner	ca83aa289a	Remove uses of uint32_t in favor of 'unsigned' for better compatibility with cygwin. Patch by Jay Foad! llvm-svn: 62695	2009-01-21 18:09:24 +00:00
Dale Johannesen	6854f86296	Make special cases (0 inf nan) work for frem. Besides APFloat, this involved removing code from two places that thought they knew the result of frem(0., x) but were wrong. llvm-svn: 62645	2009-01-21 00:35:19 +00:00
Chris Lattner	6ade48fcaa	another fix for PR3354 llvm-svn: 62561	2009-01-20 01:15:41 +00:00
Bill Wendling	5281f4fb64	Doxygen-ify comments. llvm-svn: 62546	2009-01-19 23:43:56 +00:00
Chris Lattner	e8fa6f2468	Fix a problem exposed by PR3354: simplifycfg was making a potentially trapping instruction be executed unconditionally. llvm-svn: 62541	2009-01-19 23:03:13 +00:00
Chris Lattner	45a7b5ce57	improve compatibility with cygwin, patch by Jay Foad! llvm-svn: 62535	2009-01-19 22:00:18 +00:00
Chris Lattner	b88febb5cd	Fix PR3353, infinitely jump threading an infinite loop made from switches. llvm-svn: 62529	2009-01-19 21:20:34 +00:00
Bill Wendling	bf83203ae6	Temporarily revert r62487. It's causing this error during a release bootstrap of llvm-gcc. Most likely, it's miscompiling one of the "gen*" programs: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./prev-gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.6.0/bin/ -c -g -O2 -mdynamic-no-pic -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -mdynamic-no-pic -DHAVE_CONFIG_H -DGENERATOR_FILE -I. -Ibuild -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/build -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DENABLE_LLVM -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/../llvm.src/include -D_DEBUG -D_GNU_SOURCE -D__STDC_LIMIT_MACROS -D__STDC_CONSTANT_MACROS -o build/gencondmd.o build/gencondmd.c ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: excess elements in struct initializer ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: warning: (near initialization for 'insn_conditions[4]') ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected '}' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:926: error: expected ',' or ';' before ')' token ../../llvm-gcc.src/gcc/config/i386/mmx.md:927: error: expected identifier or '(' before ',' token ../../llvm-gcc.src/gcc/config/i386/sse.md:3458: error: expected identifier or '(' before ',' token ... llvm-svn: 62506	2009-01-19 08:46:20 +00:00
Chris Lattner	bb76cc9447	Fix PR3016, a bug which can occur due to an invalid assumption: we assumed a CFG structure that would be valid when all code in the function is reachable, but not all code is necessarily reachable. Do a simple, but horrible, CFG walk to check for this case. llvm-svn: 62487	2009-01-19 02:46:28 +00:00
Chris Lattner	af6f58bbf4	reduce indentation by using 'continue', no functionality change. llvm-svn: 62477	2009-01-19 02:07:32 +00:00
Chris Lattner	c03b442e54	Fix some problems in SpeculativelyExecuteBB. Basically, because of dead code, a phi could use the speculated instruction that was not in "BB2". Make this check explicit and tighten up some other corners. This fixes PR3292. No testcase because this depends entirely on visitation order of blocks and requires a sequence of 8 passes to repro. llvm-svn: 62476	2009-01-19 00:36:37 +00:00
Chris Lattner	ead19aaccb	Make this a bit more explicit about which cases need the check. No functionality change. llvm-svn: 62474	2009-01-18 23:22:07 +00:00
Chris Lattner	6f03811071	Fix rdar://6505632, an llc crash on 483.xalancbmk llvm-svn: 62470	2009-01-18 20:35:00 +00:00
Duncan Sands	ddfeabbab7	BasicAliasAnalysis and FunctionAttrs were both doing very similar pointer capture analysis. Factor out the common logic. The new version is from FunctionAttrs since it does a better job than the version in BasicAliasAnalysis llvm-svn: 62461	2009-01-18 12:19:30 +00:00
Nick Lewycky	39fcb513ca	Fix copy and pasted typos that prevented strtok_r, realloc, getenv, ungetc, putc, puts, perror, vscanf and vsscanf from getting annotations. Add annotations for eight printf functions, memalign, pread and pwrite. On Linux, llvm-gcc sometimes renames strdup, getc, putc, strtok_r, scanf and sscanf. Match the alternate function names. Fix a crash annotating opendir. Don't mark fsetpos's second parameter as nocapture. It's supposed to be captured. Do mark fopen's path and mode strings as nocapture. Mark ferror as readonly, but not fileno which may set errno. llvm-svn: 62456	2009-01-18 04:34:36 +00:00
Gabor Greif	20f36c51bd	introduce typedef for complicated vector, and use it too llvm-svn: 62384	2009-01-17 00:09:08 +00:00
Gabor Greif	0d0908d06a	typo llvm-svn: 62377	2009-01-16 23:08:50 +00:00
Chris Lattner	5d1ed9ed1f	Fix PR3335 by not turning a store to one address space into a store to another. llvm-svn: 62351	2009-01-16 20:12:52 +00:00
Chris Lattner	59dfd7d4af	reduce indentation by using early exits, no functionality change. llvm-svn: 62350	2009-01-16 20:08:59 +00:00
Evan Cheng	e7c9310d1b	Clean up previous cast optimization a bit. Also make zext elimination a bit more aggressive: if it's not necessary to emit an AND (i.e. high bits are already zero), it's profitable to evaluate the operand at a different type. llvm-svn: 62297	2009-01-16 02:11:43 +00:00
Rafael Espindola	0aba6c9435	Add the private linkage. llvm-svn: 62279	2009-01-15 20:18:42 +00:00
Gabor Greif	b1f92de36f	avoid using iterators when they get invalidated. Potentially this fixes PR3332 llvm-svn: 62271	2009-01-15 18:40:09 +00:00
Evan Cheng	340e5fe0a6	Eliminate a redundant check. llvm-svn: 62264	2009-01-15 17:09:07 +00:00
Evan Cheng	d504f9fe27	- Teach CanEvaluateInDifferentType of this xform: sext (zext ty1), ty2 -> zext ty2 - Looking at the number of sign bits of a sext instruction to determine whether new trunc + sext pair should be added when its source is being evaluated in a different type. llvm-svn: 62263	2009-01-15 17:01:23 +00:00
Chris Lattner	fa0c0e19f6	Fix PR3325, a miscompilation of invokes by IPSCCP. Patch by Jay Foad! llvm-svn: 62244	2009-01-14 21:01:16 +00:00
Dale Johannesen	816f9bc81d	Fix the time regression I introduced in 464.h264ref with my earlier patch to this file. The issue there was that all uses of an IV inside a loop are actually references to Base[IV2], and there was one use outside that was the same but LSR didn't see the base or the scaling because it didn't recurse into uses outside the loop; thus, it used base+IVscale mode inside the loop instead of pulling base out of the loop. This was extra bad because register pressure later forced both base and IV into memory. Doing that recursion, at least enough to figure out addressing modes, is a good idea in general; the change in AddUsersIfInteresting does this. However, there were side effects.... It is also possible for recursing outside the loop to introduce another IV where there was only 1 before (if the refs inside are not scaled and the ref outside is). I don't think this is a common case, but it's in the testsuite. It is right to be very aggressive about getting rid of such introduced IVs (CheckForIVReuse and the handling of nonzero RewriteFactor in StrengthReduceStridedIVUsers). In the testcase in question the new IV produced this way has both a nonconstant stride and a nonzero base, neither of which was handled before. And when inserting new code that feeds into a PHI, it's right to put such code at the original location rather than in the PHI's immediate predecessor(s) when the original location is outside the loop (a case that couldn't happen before) (RewriteInstructionToUseNewBase); better to avoid making multiple copies of it in this case. Also, the mechanism for keeping SCEV's corresponding to GEP's no longer works, as the GEP might change after its SCEV is remembered, invalidating the SCEV, and we might get a bad SCEV value when looking up the GEP again for a later loop. This also couldn't happen before, as we weren't recursing into GEP's outside the loop. 
Also, when we build an expression that involves a (possibly non-affine) IV from a different loop as well as an IV from the one we're interested in (containsAddRecFromDifferentLoop), don't recurse into that. We can't do much with it and will get in trouble if we try to create new non-affine IVs or something. More testcases are coming. llvm-svn: 62212	2009-01-14 02:35:31 +00:00
Chris Lattner	2461d79aa9	rewrite OptimizeAwayTrappingUsesOfLoads to 1) avoid a temporary vector and extraneous loop over it, 2) not delete globals used by phis/selects etc which could actually be useful. This fixes PR3321. Many thanks to Duncan for narrowing this down. llvm-svn: 62201	2009-01-14 00:12:58 +00:00
Dale Johannesen	e458c47a74	Fix testsuite regressions from recursive inlining. llvm-svn: 62189	2009-01-13 22:43:37 +00:00
Dan Gohman	958861e65e	Make instcombine ensure that all allocas are explicitly aligned at at least their preferred alignment. llvm-svn: 62176	2009-01-13 20:18:38 +00:00
Duncan Sands	661959b54e	Correct a comment. llvm-svn: 62165	2009-01-13 13:48:44 +00:00
Dale Johannesen	12bb54e183	Enable recursive inlining. Reduce inlining threshold back to 200; 400 seems to be too high, loses more than it gains. llvm-svn: 62107	2009-01-12 22:11:50 +00:00
Duncan Sands	bcdbfb63dc	Rename getABITypeSize to getTypePaddedSize, as suggested by Chris. llvm-svn: 62099	2009-01-12 20:38:59 +00:00
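
As a rough illustration of the r63469 entry above (SROA "convert to scalar"), the sketch below mimics the quoted IR in plain Python: four 32-bit lanes are viewed as one wide integer, and lane 3 is recovered with a right shift by 96 followed by a truncation to 32 bits. The helper names are hypothetical, and the little-endian lane order is an assumption chosen to match the x86 output quoted in the log (movl 12(%esp), %eax). This is a model of the bit arithmetic only, not LLVM code.

```python
MASK32 = (1 << 32) - 1

def bitcast_v4i32_to_int(lanes):
    """View four 32-bit lanes as one 128-bit integer (lane 0 in the low bits).

    Hypothetical helper; models 'bitcast <4 x i32> ... to i128' under the
    assumed little-endian lane order.
    """
    assert len(lanes) == 4
    value = 0
    for index, lane in enumerate(lanes):
        value |= (lane & MASK32) << (32 * index)
    return value

def extract_i32(wide, bit_offset):
    """Model 'lshr wide, bit_offset' followed by 'trunc ... to i32'."""
    return (wide >> bit_offset) & MASK32

lanes = [0x11111111, 0x22222222, 0x33333333, 0x44444444]
wide = bitcast_v4i32_to_int(lanes)  # zext to i256 changes nothing for Python ints
lane3 = extract_i32(wide, 96)       # 96 bits corresponds to byte offset 12
print(hex(lane3))
```

The shift amount is simply the byte offset of the accessed field times 8, which is why a load at offset 12 becomes a shift by 96.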
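
The r64407 entry above describes replacing a narrow induction variable plus a per-iteration sign extension with a single wider induction variable. The sketch below, with hypothetical helper names and simulated fixed-width arithmetic, checks the property that makes this safe: sext(i) must equal the 64-bit counter on every iteration, which holds as long as the 32-bit counter never wraps. It is a brute-force model of the idea, not the range analysis IndVarSimplify performs.

```python
def wrap_signed(value, bits):
    """Reduce value to a two's-complement signed integer of the given width."""
    mask = (1 << bits) - 1
    value &= mask
    if value >> (bits - 1):
        return value - (1 << bits)
    return value

def widening_is_safe(start, step, trip_count):
    """Return True if sext(32-bit IV) matches a 64-bit IV on every iteration.

    Hypothetical helper: simulates both counters side by side.
    """
    narrow = wrap_signed(start, 32)
    wide = wrap_signed(start, 64)
    for _ in range(trip_count):
        # Sign-extending 'narrow' keeps its numeric value, so compare directly.
        if narrow != wide:
            return False
        narrow = wrap_signed(narrow + step, 32)
        wide = wrap_signed(wide + step, 64)
    return True

# Starts at 0 and counts up to a small bound: no 32-bit wrap, widening is safe.
print(widening_is_safe(0, 1, 1000))
# Starts near INT32_MAX: the narrow counter wraps, so the two counters diverge.
print(widening_is_safe(2**31 - 4, 1, 10))
```

The second call shows the kind of case the r64866 entry guards against: an induction variable that wraps cannot have its extension eliminated.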

5087 Commits