llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Dan Gohman	da82dc2ec1	Generalize LSR's OptimizeSMax to handle unsigned max tests as well as signed max tests. Along with r73717, this helps CodeGen avoid emitting code for a maximum operation for this class of loop. llvm-svn: 73718	2009-06-18 20:23:18 +00:00
Dan Gohman	fd857b0406	Remove the code from IVUsers that attempted to handle casted induction variables in cases where the cast isn't foldable. It ended up being a pessimization in many cases. This could be fixed, but it would require a bunch of complicated code in IVUsers' clients. The advantages of this approach aren't visible enough to justify it at this time. llvm-svn: 73706	2009-06-18 16:54:06 +00:00
Dan Gohman	dc884a7830	Generalize the zext(trunc(t) & C) instcombine to work even with C is not a low-bits mask, and add a similar instcombine for zext((trunc(t) & C) ^ C). llvm-svn: 73705	2009-06-18 16:30:21 +00:00
Dan Gohman	e72fd546a2	Teach ScalarEvolution how to recognize another xor(and(x, C), C) case. If C is a single bit and the and gets analyzed as a truncate and zero-extend, the xor can be represnted as an add. llvm-svn: 73664	2009-06-18 00:00:20 +00:00
Dan Gohman	1530824138	Instcombine zext(trunc(x) & mask) to x&mask, even if the trunc has multiple users. llvm-svn: 73656	2009-06-17 23:17:05 +00:00
Dan Gohman	50b7d0d843	Add -disable-output to a bunch of tests that don't care about the output. llvm-svn: 73633	2009-06-17 20:56:26 +00:00
Dale Johannesen	26f0dd9021	This fixes a bug introduced in 72661, which can move loads back past a check that the load address is valid, see new testcase. The test that went in with 72661 has exactly this case, except that the conditional it's moving past is checking something else; I've settled for changing that test to reference a global, not a pointer. It may be possible to scan all the tests you pass and make sure none of them are checking any component of the address, but it's not trivial and I'm not trying to do that here. llvm-svn: 73632	2009-06-17 20:48:23 +00:00
Anton Korobeynikov	7fd29c57a8	Initial support for some Thumb2 instructions. Patch by Viktor Kutuzov and Anton Korzh from Access Softek, Inc. llvm-svn: 73622	2009-06-17 18:13:58 +00:00
Eli Friedman	36d7ca738e	Correct an accidental duplication of the test (patch doesn't handle creating new files very well). llvm-svn: 73599	2009-06-17 03:05:00 +00:00
Eli Friedman	b3947071ff	PR3439: Correct a silly mistake in the SimplifyDemandedUseBits code for SRem. llvm-svn: 73598	2009-06-17 02:57:36 +00:00
Dan Gohman	473789f75a	Fix ScalarEvolution's Xor handling to not assume that an And that gets recognized with a SCEVZeroExtendExpr must be an And with a low-bits mask. With r73540, this is no longer the case. llvm-svn: 73594	2009-06-17 01:22:39 +00:00
Dale Johannesen	64c7072138	Test for llvm-gcc patch 73564. llvm-svn: 73565	2009-06-16 22:18:33 +00:00
Anton Korobeynikov	d6004a164c	Make the test target-neutral llvm-svn: 73547	2009-06-16 20:25:25 +00:00
Dan Gohman	54bbef1525	Generalize a few more instcombines to be vector/scalar-independent. llvm-svn: 73541	2009-06-16 19:55:29 +00:00
Dan Gohman	56b5a88785	Instcombine's ShrinkDemandedConstant may strip bits out of constants, obscuring what would otherwise be a low-bits mask. Use ComputeMaskedBits to compute what ShrinkDemandedConstant knew about to reconstruct a low-bits mask value. llvm-svn: 73540	2009-06-16 19:52:01 +00:00
Anton Korobeynikov	a74b8323d0	GNU as refuses to assemble "pop {}" instruction. Do not emit such (this is the case when we have thumb vararg function with single callee-saved register, which is handled separately). llvm-svn: 73529	2009-06-16 18:49:08 +00:00
Chris Lattner	f54c97c579	Testcase for r73506 llvm-svn: 73508	2009-06-16 17:23:25 +00:00
Evan Cheng	a98ff05fca	If a val# is defined by an implicit_def and it is being removed, all of the copies off the val# were removed. This causes problem later since the scavenger will see uses of registers without defs. The proper solution is to change the copies into implicit_def's instead. TurnCopyIntoImpDef turns a copy into implicit_def and remove the val# defined by it. This causes an scavenger assertion later if the def reaches other blocks. Disable the transformation if the value live interval extends beyond its def block. llvm-svn: 73478	2009-06-16 07:12:58 +00:00
Eli Friedman	6a984089f4	Add some generic expansion logic for SMULO and UMULO. Fixes UMULO support for x86, and UMULO/SMULO for many architectures, including PPC (PR4201), ARM, and Cell. The resulting expansion isn't perfect, but it's not bad. llvm-svn: 73477	2009-06-16 06:58:29 +00:00
Devang Patel	5941941827	Use MainCU if it is available. llvm-svn: 73457	2009-06-16 02:09:30 +00:00
Dan Gohman	255bcad466	Update this test to use fmul instead of mul. llvm-svn: 73436	2009-06-15 22:49:34 +00:00
Dan Gohman	2e737ac21f	Support vector casts in more places, fixing a variety of assertion failures. To support this, add some utility functions to Type to help support vector/scalar-independent code. Change ConstantInt::get and ConstantFP::get to support vector types, and add an overload to ConstantInt::get that uses a static IntegerType type, for convenience. Introduce a new getConstant method for ScalarEvolution, to simplify common use cases. llvm-svn: 73431	2009-06-15 22:12:54 +00:00
Devang Patel	1fb2606b12	Gracefully handle imbalanced inline function begin and end markers. llvm-svn: 73426	2009-06-15 21:45:50 +00:00
Evan Cheng	4b77794613	ifcvt should ignore cfg where true and false successors are the same. llvm-svn: 73423	2009-06-15 21:24:34 +00:00
Dale Johannesen	2d0be306fb	Fix the crash in this test. This is basically the same problem addressed in 31284, but the patch there only addressed the case where an invoke is the first thing in a block. llvm-svn: 73416	2009-06-15 20:59:27 +00:00
Bill Wendling	a0a5984345	This test is failing. Revert for now. llvm-svn: 73404	2009-06-15 19:10:56 +00:00
Bill Wendling	1ea00229de	Add another testcase for r71478. llvm-svn: 73399	2009-06-15 18:36:34 +00:00
Arnold Schwaighofer	6b340f9247	CheckTailCallReturnConstraints is missing a check on the incomming chain of the RETURN node. The incomming chain must be the outgoing chain of the CALL node. This causes the backend to identify tail calls that are not tail calls. This patch fixes this. llvm-svn: 73387	2009-06-15 14:43:36 +00:00
Evan Cheng	3219c7fbe5	Part 1. - Change register allocation hint to a pair of unsigned integers. The hint type is zero (which means prefer the register specified as second part of the pair) or entirely target dependent. - Allow targets to specify alternative register allocation orders based on allocation hint. Part 2. - Use the register allocation hint system to implement more aggressive load / store multiple formation. - Aggressively form LDRD / STRD. These are formed before register allocation. It has to be done this way to shorten live interval of base and offset registers. e.g. v1025 = LDR v1024, 0 v1026 = LDR v1024, 0 => v1025,v1026 = LDRD v1024, 0 If this transformation isn't done before allocation, v1024 will overlap v1025 which means it more difficult to allocate a register pair. - Even with the register allocation hint, it may not be possible to get the desired allocation. In that case, the post-allocation load / store multiple pass must fix the ldrd / strd instructions. They can either become ldm / stm instructions or back to a pair of ldr / str instructions. This is work in progress, not yet enabled. llvm-svn: 73381	2009-06-15 08:28:29 +00:00
Chris Lattner	52510b0788	fix testcase to properly check for the patch in r73195. llvm-svn: 73380	2009-06-15 05:46:02 +00:00
Dan Gohman	d3a8d79c0d	Implement more aggressive folding of add operand lists when they contain multiplications of constants with add operations. This helps simplify several kinds of things; in particular it helps simplify expressions like ((-1 * (%a + %b)) + %a) to %b, as expressions like this often come up in loop trip count computations. llvm-svn: 73361	2009-06-14 22:58:51 +00:00
Duncan Sands	3a4ae072d0	Testcase for PR4332. llvm-svn: 73353	2009-06-14 22:22:42 +00:00
Dan Gohman	37fef35e88	Teach SCEVExpander's visitAddRecExpr to reuse an existing canonical induction variable when the addrec to be expanded does not require a wider type. This eliminates the need for IndVarSimplify to micro-manage SCEV expansions, because SCEVExpander now automatically expands them in the form that IndVarSimplify considers to be canonical. (LSR still micro-manages its SCEV expansions, because it's optimizing for the target, rather than for other optimizations.) Also, this uses the new getAnyExtendExpr, which has more clever expression simplification logic than the IndVarSimplify code it replaces, and this cleans up some ugly expansions in code such as the included masked-iv.ll testcase. llvm-svn: 73294	2009-06-13 16:25:49 +00:00
Evan Cheng	d0a66e438f	Add a ARM specific pre-allocation pass that re-schedule loads / stores from consecutive addresses togther. This makes it easier for the post-allocation pass to form ldm / stm. This is step 1. We are still missing a lot of ldm / stm opportunities because of register allocation are not done in the desired order. More enhancements coming. llvm-svn: 73291	2009-06-13 09:12:55 +00:00
Devang Patel	bcc1187643	llvm.dbg.region.end() intrinsic is not required to be in _last_ basic block in a function. If that happens then any basic block that follows (lexically) the block with regin.end will not have scope info available. LexicalScopeStack relies on processing basic block in CFG order, but this processing order is not guaranteed. Things get complicated when the optimizer gets a chance to optimizer IR with dbg intrinsics. Apply defensive patch to preserve at least one lexical scope till the end of function. llvm-svn: 73282	2009-06-13 02:16:18 +00:00
Dan Gohman	67ec24b541	Adjust this test's regex strings so that they work regardless of the target's pointer size. This avoids the need for -m32 on the llvm-gcc command-line, which some targets may not support. llvm-svn: 73270	2009-06-12 23:31:14 +00:00
Dan Gohman	e27a52f9b1	Add -m32 to llvm-gcc commands, so that this test behaves as expected on systems which default to a 64-bit target. llvm-svn: 73265	2009-06-12 23:02:02 +00:00
Evan Cheng	98216808fe	If killed register is defined by implicit_def, do not clear it since it's live range may overlap another def of same register. llvm-svn: 73255	2009-06-12 21:34:26 +00:00
Evan Cheng	2f784781aa	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 73252	2009-06-12 20:46:18 +00:00
Devang Patel	8d9aa4249a	Clear AbstractInstanceRootMap at the end of the function. llvm-svn: 73244	2009-06-12 19:24:05 +00:00
Dan Gohman	f9b0419cd8	Don't do (x - (y - z)) --> (x + (z - y)) on floating-point types, because it may round differently. This fixes PR4374. llvm-svn: 73243	2009-06-12 19:23:25 +00:00
Dale Johannesen	b5be21ef41	Testcase for llvm-gcc patch 73238. llvm-svn: 73239	2009-06-12 18:41:53 +00:00
Arnold Schwaighofer	780e3addf8	Fix Bug 4278: X86-64 with -tailcallopt calling convention out of sync with regular cc. The only difference between the tail call cc and the normal cc was that one parameter register - R9 - was reserved for calling functions through a function pointer. After time the tail call cc has gotten out of sync with the regular cc. We can use R11 which is also caller saved but not used as parameter register for potential function pointers and remove the special tail call cc on x86-64. llvm-svn: 73233	2009-06-12 16:26:57 +00:00
Nick Lewycky	1e36649f95	Given two identical weak functions, produce one internal function and two weak thunks. llvm-svn: 73230	2009-06-12 15:56:56 +00:00
Nick Lewycky	cc239d7680	This test is wrong. If you have two weak functions F and G you can't make either one call the other since either one can be replaced at link time, and they need to be independent. llvm-svn: 73225	2009-06-12 13:24:41 +00:00
Nick Lewycky	61f78a2674	Fix regular expression. llvm-svn: 73221	2009-06-12 05:39:02 +00:00
Nick Lewycky	bbce41f698	Don't remove aggregate-typed module level constants before encoding functions since functions may contain aggregate constants too. llvm-svn: 73220	2009-06-12 05:20:12 +00:00
Nick Lewycky	127b1cc900	In an XFAIL line, treat "XFAIL: foobar" as a regular expression to be matched against the target triple, instead of equivalent to "XFAIL: ". llvm-svn: 73219	2009-06-12 05:18:32 +00:00
Nick Lewycky	e3b5c81cb8	XFAIL this on PPC Linux. This keeps showing up in the buildbot and isn't easy to fix, and I'd like it to stop masking real failures. llvm-svn: 73211	2009-06-11 23:43:02 +00:00
Dale Johannesen	60e261db11	Test for rev 73205 (PR 4349) llvm-svn: 73206	2009-06-11 20:48:09 +00:00
Chris Lattner	e0360f8ae8	Fix 4366: store to null in non-default addr space should not be turned into unreachable. llvm-svn: 73195	2009-06-11 17:54:56 +00:00
Daniel Dunbar	06ef64d379	Remove empty test (my DejaGNU doesn't like this) llvm-svn: 73148	2009-06-09 21:24:39 +00:00
Bill Wendling	c34ea869f5	Remove empty file. llvm-svn: 73140	2009-06-09 18:55:39 +00:00
David Greene	a51f014e59	Revert 73074 and 73099 because Windows doesn't have POSIX regular expressions. We will add an OpenBSD implementation and re-apply ASAP. llvm-svn: 73138	2009-06-09 18:31:17 +00:00
David Greene	e3c4370a47	Add a !patsubst operator. Use on string types. llvm-svn: 73099	2009-06-08 23:05:37 +00:00
Anton Korobeynikov	c82243e658	Add testcase for register scanveger assertion fix in r72755 (double def due to livevars) llvm-svn: 73096	2009-06-08 22:54:15 +00:00
David Greene	1f88852460	Add a more robust !if test. llvm-svn: 73091	2009-06-08 22:34:57 +00:00
David Greene	5b0714ad86	Fix DejaGNU run line to escape special characters. llvm-svn: 73090	2009-06-08 22:20:58 +00:00
David Greene	62a2f2fb97	Make IntInits and ListInits typed. This helps deduce types of !if and other operators. For the rare cases where a list type cannot be deduced, provide a []<type> syntax, where <type> is the list element type. llvm-svn: 73078	2009-06-08 20:23:18 +00:00
David Greene	21ba6012b2	Add a !regmatch operator to do pattern matching in TableGen. llvm-svn: 73074	2009-06-08 17:00:34 +00:00
Eli Friedman	62028b7323	Fix the run-line for this test to work correctly outside of x86. llvm-svn: 73025	2009-06-07 09:44:19 +00:00
Eli Friedman	2964aa5a38	Tweak the expansion code for BIT_CONVERT to generate better code converting from an MMX vector to an i64. llvm-svn: 73024	2009-06-07 09:41:57 +00:00
Eli Friedman	d4b463b0dc	Slightly generalize the code that handles shuffles of consecutive loads on x86 to handle more cases. Fix a bug in said code that would cause it to read past the end of an object. Rewrite the code in SelectionDAGLegalize::ExpandBUILD_VECTOR to be a bit more general. Remove PerformBuildVectorCombine, which is no longer necessary with these changes. In addition to simplifying the code, with this change, we can now catch a few more cases of consecutive loads. llvm-svn: 73012	2009-06-07 06:52:44 +00:00
Eli Friedman	2b6cb1684f	PR3628: Add patterns to match SHL/SRL/SRA to the corresponding Altivec instructions. llvm-svn: 73009	2009-06-07 01:07:55 +00:00
Eli Friedman	770f633389	PR4340: Run SimplifyDemandedVectorElts on insertelement instructions; sometimes it can find simplifications that won't be found otherwise. llvm-svn: 73006	2009-06-06 20:08:03 +00:00
Eli Friedman	2dadbd05f9	Fix the expansion for CONCAT_VECTORS so that it doesn't create illegal types. llvm-svn: 72993	2009-06-06 07:08:26 +00:00
Eli Friedman	4395222136	Avoid crashing on a variable-index insertelement with element type i16. llvm-svn: 72991	2009-06-06 06:32:50 +00:00
Eli Friedman	e546f94ef5	Get rid of some bogus patterns for X86vzmovl. Don't create VZEXT_MOVL nodes for vectors with an i16 element type. Add an optimization for building a vector which is all zeros/undef except for the bottom element, where the bottom element is an i8 or i16. llvm-svn: 72988	2009-06-06 06:05:10 +00:00
Eli Friedman	539325c8e7	Fix an obvious typo. llvm-svn: 72987	2009-06-06 05:55:37 +00:00
Eli Friedman	1227d199be	Get rid of a bogus pattern that interferes with optimization. llvm-svn: 72985	2009-06-06 04:17:04 +00:00
Eli Friedman	05eef883e8	PR2598: make sure to expand illegal forms of integer/floating-point conversions for x86, like <2 x i32> -> <2 x float> and <4 x i16> -> <4 x float>. llvm-svn: 72983	2009-06-06 03:57:58 +00:00
Devang Patel	8d170194e8	Add new function attribute - noimplicitfloat Update code generator to use this attribute and remove NoImplicitFloat target option. Update llc to set this attribute when -no-implicit-float command line option is used. llvm-svn: 72959	2009-06-05 21:57:13 +00:00
Nate Begeman	058d4eeccf	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Evan Cheng	ea31ec569b	Changing allocation ordering from r3 ... r0 back to r0 ... r3. The order change no longer make sense after the coalescing changes we have made since then. llvm-svn: 72955	2009-06-05 19:08:58 +00:00
Dan Gohman	31fc8d27b1	Fix an erroneous check for isFNeg; the FNeg case is handled a few lines later on. llvm-svn: 72904	2009-06-04 23:43:29 +00:00
Bill Wendling	60f5c8184b	Fix these so that they work on non-x86 Darwin machines. llvm-svn: 72903	2009-06-04 23:37:19 +00:00
Bill Wendling	b7c990bc90	Specify that this works for Darwin. llvm-svn: 72899	2009-06-04 22:56:29 +00:00
Dan Gohman	5f6f8101d5	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Devang Patel	9757e4f9f3	Add new function attribute - noredzone. Update code generator to use this attribute and remove DisableRedZone target option. Update llc to set this attribute when -disable-red-zone command line option is used. llvm-svn: 72894	2009-06-04 22:05:33 +00:00
Evan Cheng	dada49d18a	RALinScan::attemptTrivialCoalescing() was returning a virtual register instead of the physical register it is allocated to. This resulted in virtual register(s) being added the live-in sets. llvm-svn: 72890	2009-06-04 20:53:36 +00:00
Evan Cheng	8a6c448ab0	A value defined by an implicit_def can be liven to a use BB. This is unfortunate. But register allocator still has to add it to the live-in set of the use BB. llvm-svn: 72888	2009-06-04 20:25:48 +00:00
Dale Johannesen	a9a7e5d234	For XTARGET to work on targets not in the list, there must also be an XFAIL line. Fix a couple examples of this. llvm-svn: 72876	2009-06-04 18:27:43 +00:00
Dan Gohman	05fe1217c7	Check in test changes that I accidentally left out of r72872. llvm-svn: 72875	2009-06-04 18:22:31 +00:00
Eli Friedman	11070e275f	PR3739, part 2: Use an explicit store to spill XMM registers. (Previously, the code tried to use "push", which doesn't exist for XMM registers.) llvm-svn: 72836	2009-06-04 02:32:04 +00:00
Eli Friedman	fd27229206	PR3739, part 1: Disable the red zone on Win64. llvm-svn: 72830	2009-06-04 02:02:01 +00:00
Evan Cheng	e3a05e6690	Re-apply 72756 with fixes. One of those was introduced by we changed MachineInstrBuilder::addReg() interface. llvm-svn: 72826	2009-06-04 01:15:28 +00:00
Eli Friedman	dbf32ddf16	PR4317: Handle splits where the new block is unreachable correctly in DominatorTreeBase::Split. llvm-svn: 72810	2009-06-03 21:42:06 +00:00
Evan Cheng	b71402d6ae	For Darwin / x86_64, override -relocation-model=static to pic if the output is assembly since Darwin assembler does not really support -static codeine. I view this as a temporary workaround until the assembler / linker changes. llvm-svn: 72806	2009-06-03 21:13:54 +00:00
Dan Gohman	6e9ad19ef7	Don't attempt to simplify an non-affine IV expression if it can't be simplified to a loop-invariant value. This fixes PR4315. llvm-svn: 72798	2009-06-03 19:11:31 +00:00
Evan Cheng	4e47a019ba	Fix for PR4225: When rewriter reuse a value in a physical register , it clear the register kill operand marker and its kill ops information. However, the cleared operand may be a def of a super-register. Clear the kill ops info for the super-register's sub-registers as well. llvm-svn: 72758	2009-06-03 09:00:27 +00:00
Evan Cheng	82f8fa333e	Temporarily revert 72756 for now. llvm-svn: 72757	2009-06-03 07:40:47 +00:00
Evan Cheng	5afbef29fa	Fold preceding / trailing base inc / dec into the single load / store as well. llvm-svn: 72756	2009-06-03 06:14:58 +00:00
Dan Gohman	609f627ed7	Revert r72734. The Darwin assembler doesn't support the static relocation model on x86-64. Higher level logic should override the relocation model to PIC on x86_64-apple-darwin. llvm-svn: 72746	2009-06-03 00:37:20 +00:00
Dan Gohman	f6e6588203	Fix CodeGenPrepare's address-mode sinking to handle unusual addresses, involving Base values which do not have Pointer type. This fixes PR4297. llvm-svn: 72739	2009-06-02 21:29:13 +00:00
Evan Cheng	7e66d61bec	On Darwin x86_64 small code model doesn't guarantee code address fits in 32-bit. llvm-svn: 72734	2009-06-02 20:09:31 +00:00
Evan Cheng	7875093e82	Avoid infinite looping in AllGlobalLoadUsesSimpleEnoughForHeapSRA(). This can happen when PHI uses are recursively dependent on each other. llvm-svn: 72710	2009-06-02 00:56:07 +00:00
Eli Friedman	2b0edc3327	PR4286: Make RewriteLoadUserOfWholeAlloca and RewriteStoreUserOfWholeAlloca deal with tail padding because isSafeUseOfBitCastedAllocation expects them to. Otherwise, we crash trying to erase the bitcast. llvm-svn: 72688	2009-06-01 09:14:32 +00:00
Owen Anderson	928040c625	Be more aggressive in doing LoadPRE by tracing backwards when a block only has a single predecessor. Patch by Jakub Staszak. llvm-svn: 72661	2009-05-31 09:03:40 +00:00
Chris Lattner	8ac63163fe	fix PR4284, a bug in simplifylibcalls handling memcmp. Patch by Benjamin Kramer! llvm-svn: 72625	2009-05-30 18:43:04 +00:00
Duncan Sands	3d77d1fcfc	Adjust these tests now that "extern inline" functions are being output with bodies and available_externally linkage. llvm-svn: 72620	2009-05-30 13:57:05 +00:00
Evan Cheng	2d198e1bc2	(i64 (zext (srl GR32 8))) -> movzbl AH is not safe since srl 8 only clear the top 8 bits. llvm-svn: 72618	2009-05-30 08:43:27 +00:00
Nick Lewycky	a9de2f1c81	Give embedded metadata its own type instead of relying on EmptyStructTy. llvm-svn: 72610	2009-05-30 05:06:04 +00:00
Duncan Sands	f4fe76d46b	Dan noticed that the verifier wasn't thoroughly checking uses of invoke results (see the testcases). Tighten up the checking. llvm-svn: 72586	2009-05-29 19:39:36 +00:00
Evan Cheng	57f85a1529	Remove an accidental commit. llvm-svn: 72560	2009-05-29 05:28:52 +00:00
Evan Cheng	550fc9ba9f	More h-registers tricks: folding zext nodes. llvm-svn: 72558	2009-05-29 01:44:43 +00:00
Evan Cheng	a36a15ff66	Do not try to create a MVT type of width 0. llvm-svn: 72557	2009-05-28 23:52:18 +00:00
Eli Friedman	5a376ed43e	Add explicit test for PR4280. llvm-svn: 72539	2009-05-28 21:04:35 +00:00
Eli Friedman	8b0b7c2d6d	Add a testcase which got fixed by recent legalization work. llvm-svn: 72517	2009-05-28 05:10:20 +00:00
Nick Lewycky	3dd0d690f3	Use Operands.data() instead of &Operands[0] where Operands is a potentially empty SmallVector. llvm-svn: 72512	2009-05-28 04:08:10 +00:00
Evan Cheng	40810c4d1b	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Dan Gohman	2884c5153c	Revert 72493 and replace it with a more conservative fix, for now: don't rewrite the comparison if there is any implicit extension or truncation on the induction variable. I'm planning for IVUsers to eventually take over some of the work of this code, and for it to be generalized. llvm-svn: 72496	2009-05-27 21:10:47 +00:00
Dan Gohman	994001e5ef	In ChangeCompareStride, when the stride to be reused is truncated to a smaller type, promoted its offset back up to the type of the new comparison. This fixes PR4222. llvm-svn: 72493	2009-05-27 20:00:18 +00:00
Bill Wendling	2944bcd25c	This looks like it passes now. llvm-svn: 72485	2009-05-27 17:43:21 +00:00
Dan Gohman	0124c21ba0	Teach SCEVExpander to avoid creating over-indexed GEP indices when possible. For example, it now emits %p.2.ip.1 = getelementptr [3 x [3 x double]]* %p, i64 2, i64 %tmp, i64 1 instead of the equivalent but less obvious %p.2.ip.1 = getelementptr [3 x [3 x double]]* %p, i64 0, i64 %tmp, i64 19 llvm-svn: 72452	2009-05-27 02:00:53 +00:00
Dan Gohman	5dd6f54a5f	Teach BasicAliasAnalysis to understand constant gep indices that fall beyond their associated static array type. I believe that this fixes a legitimate bug, because BasicAliasAnalysis already has code to check for this condition that works for non-constant indices, however it was missing the case of constant indices. With this change, it checks for both. This fixes PR4267, and miscompiles of SPEC 188.ammp and 464.h264.href. llvm-svn: 72451	2009-05-27 01:48:27 +00:00
Dale Johannesen	02649aa161	Testcase for (llvm-gcc-4.2) 72442 (PR 4242). llvm-svn: 72443	2009-05-26 23:19:19 +00:00
Dan Gohman	fb34a67498	In cases where a pointer value is an operand of a multiplication or division operation, don't attempt to use the operation's value as the base of a getelementptr. This fixes PR4271. llvm-svn: 72422	2009-05-26 17:41:16 +00:00
Chris Lattner	8f4210d099	make memdep use the getModRefInfo method for stores instead of the low-level alias() method, allowing it to reason more aggressively about pointers into constant memory. PR4189 llvm-svn: 72403	2009-05-25 21:28:56 +00:00
Dan Gohman	eb3ddbb1ac	When rewriting the loop exit test with the canonical induction variable, leave the original comparison in place if it has other uses, since the other uses won't be dominated by the new comparison instruction. llvm-svn: 72369	2009-05-24 19:11:38 +00:00
Dan Gohman	fdba9c8fce	Generalize SCEVExpander::visitAddRecExpr's GEP persuit, and avoid sending SCEVUnknowns to expandAddToGEP. This avoids the need for expandAddToGEP to bend the rules and peek into SCEVUnknown expressions. Factor out the code for testing whether a SCEV can be factored by a constant for use in a GEP index. This allows it to handle SCEVAddRecExprs, by recursing. As a result, SCEVExpander can now put more things in GEP indices, so it emits fewer explicit mul instructions. llvm-svn: 72366	2009-05-24 18:06:31 +00:00
Torok Edwin	8936fc2e28	The rewriter may hold references to instructions that are deleted because they are trivially dead. Fix by clearing the rewriter cache before deleting the trivially dead instructions. Also make InsertedExpressions use an AssertingVH to catch these bugs easier. llvm-svn: 72364	2009-05-24 14:23:16 +00:00
Torok Edwin	99b1003c2e	Fix PR4254. The DAGCombiner created a negative shiftamount, stored in an unsigned variable. Later the optimizer eliminated the shift entirely as being undefined. Example: (srl (shl X, 56) 48). ShiftAmt is 4294967288. Fix it by checking that the shiftamount is positive, and storing in a signed variable. llvm-svn: 72331	2009-05-23 17:29:48 +00:00
Torok Edwin	beb86bd0b4	available_externall linkage is not local, this was confusing the codegenerator, and it wasn't generating calls through @PLT for these functions. hasLocalLinkage() is now false for available_externally, I attempted to fix the inliner and dce to handle available_externally properly. It passed make check. llvm-svn: 72328	2009-05-23 14:06:57 +00:00
Eli Friedman	262a99ffed	Fix test to account for legalization changes; I think this ends up running an extra DAGCombine pass which improves the code a bit. llvm-svn: 72326	2009-05-23 13:15:11 +00:00
Evan Cheng	77529302a6	Fix bug in FoldFCmp_IntToFP_Cst. If inttofp is a uintofp, use unsigned instead of signed integer constant. llvm-svn: 72300	2009-05-22 23:10:53 +00:00
Duncan Sands	bbd03677ee	Add a new codegen pass that normalizes dwarf exception handling code in preparation for code generation. The main thing it does is handle the case when eh.exception calls (and, in a future patch, eh.selector calls) are far away from landing pads. Right now in practice you only find eh.exception calls close to landing pads: either in a landing pad (the common case) or in a landing pad successor, due to loop passes shifting them about. However future exception handling improvements will result in calls far from landing pads: (1) Inlining of rewinds. Consider the following case: In function @f: ... invoke @g to label %normal unwind label %unwinds ... unwinds: %ex = call i8* @llvm.eh.exception() ... In function @g: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... "rethrow exception" Now inline @g into @f. Currently this is turned into: In function @f: ... invoke @something to label %continue unwind label %handler ... handler: %ex = call i8* @llvm.eh.exception() ... perform cleanups ... invoke "rethrow exception" to label %normal unwind label %unwinds unwinds: %ex = call i8* @llvm.eh.exception() ... However we would like to simplify invoke of "rethrow exception" into a branch to the %unwinds label. Then %unwinds is no longer a landing pad, and the eh.exception call there is then far away from any landing pads. (2) Using the unwind instruction for cleanups. It would be nice to have codegen handle the following case: invoke @something to label %continue unwind label %run_cleanups ... handler: ... perform cleanups ... unwind This requires turning "unwind" into a library call, which necessarily takes a pointer to the exception as an argument (this patch also does this unwind lowering). But that means you are using eh.exception again far from a landing pad. (3) Bugpoint simplifications. When bugpoint is simplifying exception handling code it often generates eh.exception calls far from a landing pad, which then causes codegen to assert. Bugpoint then latches on to this assertion and loses sight of the original problem. Note that it is currently rare for this pass to actually do anything. And in fact it normally shouldn't do anything at all given the code coming out of llvm-gcc! But it does fire a few times in the testsuite. As far as I can see this is almost always due to the LoopStrengthReduce codegen pass introducing pointless loop preheader blocks which are landing pads and only contain a branch to another block. This other block contains an eh.exception call. So probably by tweaking LoopStrengthReduce a bit this can be avoided. llvm-svn: 72276	2009-05-22 20:36:31 +00:00
Dan Gohman	d5fc3518d5	Teach IndVarSimplify's FixUsesBeforeDefs to handle InvokeInsts by assuming that the use of the value is in a block dominated by the "normal" destination. LangRef.html and other documentation sources don't explicitly guarantee this, but it seems to be assumed in other places in LLVM at least. This fixes an assertion failure on the included testcase, which is derived from the Ada testsuite. FixUsesBeforeDefs is a temporary measure which I'm looking to replace with a more capable solution. llvm-svn: 72266	2009-05-22 16:47:11 +00:00
Dan Gohman	82df35a657	Fix a thinko in the code that adapted SCEVMulExpr operands for use in expanding SCEVAddExprs with GEPs. The operands of a SCEVMulExpr need to be multiplied together, not added. llvm-svn: 72250	2009-05-22 07:14:20 +00:00
Torok Edwin	9b75edbd14	Revert this. There's no way to verifiy indirect calls, and an optimizer can turn indirect call into direct call, thus the verifier would reject something it previously accepted. llvm-svn: 72249	2009-05-22 07:12:05 +00:00
Torok Edwin	8c1af7f5be	Verify that calling conventions match function prototype. This only rejects mismatches between target specific calling convention and C/LLVM specific calling convention. There are too many fastcc/C, coldcc/cc42 mismatches in the testsuite, these are not reject by the verifier. llvm-svn: 72248	2009-05-22 06:41:43 +00:00
Eli Friedman	b32b64b5b4	Fix broken logic in DominatorTreeBase::Split. Part of PR4238. llvm-svn: 72231	2009-05-21 21:47:54 +00:00
Eli Friedman	d4f9668eb7	Fix some incorrect logic in DominanceFrontier::splitBlock. Part of PR4238. llvm-svn: 72223	2009-05-21 20:40:30 +00:00
Dan Gohman	fc28858d91	Teach ValueTracking a new way to analyze PHI nodes, and and teach Instcombine to be more aggressive about using SimplifyDemandedBits on shift nodes. This allows a shift to be simplified to zero in the included test case. llvm-svn: 72204	2009-05-21 02:28:33 +00:00
Eli Friedman	b6fe72e457	Fix for PR4235: to build a floating-point value from integer parts, build an integer and cast that to a float. This fixes a crash caused by trying to split an f32 into two f16's. This changes the behavior in test/CodeGen/XCore/fneg.ll because that testcase now triggers a DAGCombine which converts the fneg into an integer operation. If someone is interested, it's probably possible to tweak the test to generate an actual fneg. llvm-svn: 72162	2009-05-20 06:02:09 +00:00
Evan Cheng	ff129ff17f	Fix test on non-darwin hosts. llvm-svn: 72161	2009-05-20 05:45:36 +00:00
Evan Cheng	e17c02e328	Try again. Allow call to immediate address for ELF or when in static relocation mode. llvm-svn: 72160	2009-05-20 04:53:57 +00:00
Evan Cheng	8a4887572e	Cannot use immediate as call absolute target in PIC mode. llvm-svn: 72154	2009-05-20 01:11:00 +00:00
Dan Gohman	9e0f5a28dc	Suppress the IV reversal transformation in the case that the RHS of the comparison is defined inside the loop. This fixes a use-before-def problem, because the transformation puts a use of the RHS outside the loop. llvm-svn: 72149	2009-05-20 00:34:08 +00:00
Bob Wilson	c6726ecca5	Fix pr4058 and pr4059. Do not split i64 or double arguments between r3 and the stack. Patch by Sandeep Patel. llvm-svn: 72106	2009-05-19 10:02:36 +00:00
Bob Wilson	ec676a76e7	Fix pr4091: Add support for "m" constraint in ARM inline assembly. llvm-svn: 72105	2009-05-19 05:53:42 +00:00
Dan Gohman	922033d119	Teach SCEVExpander to expand arithmetic involving pointers into GEP instructions. It attempts to create high-level multi-operand GEPs, though in cases where this isn't possible it falls back to casting the pointer to i8* and emitting a GEP with that. Using GEP instructions instead of ptrtoint+arithmetic+inttoptr helps pointer analyses that don't use ScalarEvolution, such as BasicAliasAnalysis. Also, make the AddrModeMatcher more aggressive in handling GEPs. Previously it assumed that operand 0 of a GEP would require a register in almost all cases. It now does extra checking and can do more matching if operand 0 of the GEP is foldable. This fixes a problem that was exposed by SCEVExpander using GEPs. llvm-svn: 72093	2009-05-19 02:15:55 +00:00
Bill Wendling	ae8a483328	Commands beginning with '--' are converted to '-f' by gcc. Blech! llvm-svn: 72023	2009-05-18 18:09:36 +00:00
Dan Gohman	592d65ba06	Teach ScalarEvolution to recognize x^-1 in the case where non-demanded bits have been stripped out by instcombine. llvm-svn: 72010	2009-05-18 16:29:04 +00:00
Dan Gohman	2c9bd7e0cb	Make ScalarEvolution::isLoopGuardedByCond work even when the edge entering a loop is a non-split critical edge. llvm-svn: 72004	2009-05-18 15:36:09 +00:00
Dan Gohman	904f081ce7	Add nounwind to a few tests. llvm-svn: 72002	2009-05-18 15:16:49 +00:00
Duncan Sands	ead6c97920	Check that the gcc front-end is not doing inlining when not doing unit-at-a-time. llvm-svn: 71986	2009-05-17 19:37:02 +00:00
Anton Korobeynikov	85accafcba	Mark rotl/rotr as expand. This generates pretty ugly code, but this is better than nothing. llvm-svn: 71976	2009-05-17 10:16:28 +00:00
Anton Korobeynikov	8753e89b79	Typo llvm-svn: 71975	2009-05-17 10:15:22 +00:00
Jakob Stoklund Olesen	fa57451cf5	Help DejaGnu avoid pipe-jam by producing less output from certain test cases. When a test fails with more than a pipeful of output on stdout AND stderr, one of the DejaGnu programs blocks. The problem can be avoided by redirecting stdout to a file. llvm-svn: 71919	2009-05-16 00:34:42 +00:00
David Greene	9a1e15d0d0	Implement !if, analogous to $(if) in GNU make. llvm-svn: 71815	2009-05-14 23:26:46 +00:00
David Greene	5841d58c6e	Fix tests to not upset DejaGNU. llvm-svn: 71811	2009-05-14 23:21:40 +00:00
David Greene	70881bc6ae	Graduate LLVM to the big leagues by embedding a LISP processor into TableGen. Ok, not really, but do support some common LISP functions: * car * cdr * null llvm-svn: 71805	2009-05-14 22:38:31 +00:00
David Greene	fab0ee79db	Implement a !foreach operator analogous to GNU make's $(foreach). Use it on dags and lists like this: class decls { string name; } def Decls : decls; class B<list<string> names> : A<!foreach(Decls.name, names, !strconcat(Decls.name, ", Sr."))>; llvm-svn: 71803	2009-05-14 22:23:47 +00:00
David Greene	26054a566e	Implement a !subst operation simmilar to $(subst) in GNU make to do def/var/string substitution on generic pattern templates. For example: def Type; def v4f32 : Type; def TYPE : Type; class GenType<Type t> { let type = !(subst TYPE, v4f32, t); } def TheType : GenType<TYPE>; llvm-svn: 71801	2009-05-14 21:54:42 +00:00
David Greene	e2871b8c65	Implement !cast. llvm-svn: 71794	2009-05-14 21:22:49 +00:00
Dan Gohman	a09e38894a	Add nounwind to this test. llvm-svn: 71734	2009-05-13 22:29:12 +00:00
Bill Wendling	c76422f45d	Remove too large testcase. llvm-svn: 71730	2009-05-13 21:51:26 +00:00
Bill Wendling	35584a26be	Move the bookkeeping of the debug scopes back to the place where it belonged. The variable declaration stuff wasn't happy with it where it was. Sorry that the testcase is so big. Bugpoint wasn't able to reduce it successfully. llvm-svn: 71714	2009-05-13 20:33:33 +00:00
Dale Johannesen	da2e1e314b	Testcase for 71688. llvm-svn: 71691	2009-05-13 18:33:24 +00:00
Chris Lattner	eb2f327449	calls in nothrow functions can be marked nothrow even if the callee is not known to be nothrow. This allows readnone/readonly functions to be deleted even if we don't know whether the callee can throw. llvm-svn: 71676	2009-05-13 17:39:14 +00:00
Chris Lattner	927ebd34e2	Fix PR4206 - crash in simplify lib calls llvm-svn: 71644	2009-05-13 06:26:11 +00:00
Evan Cheng	e43bfc153e	If header of inner loop is aligned, do not align the outer loop header. We don't want to add nops in the outer loop for the sake of aligning the inner loop. llvm-svn: 71609	2009-05-12 23:58:14 +00:00
Evan Cheng	c7f7276825	Teach TransferDeadness to delete truly dead instructions if they do not produce side effects. llvm-svn: 71606	2009-05-12 23:07:00 +00:00
Evan Cheng	b0a4c44103	Add nounwind. llvm-svn: 71575	2009-05-12 18:35:43 +00:00
Evan Cheng	d6e3e4d746	Fixed a stack slot coloring with reg bug: do not update implicit use / def when doing forward / backward propagation. llvm-svn: 71574	2009-05-12 18:31:57 +00:00
Bob Wilson	16f684a429	Fix pr4195: When iterating through predecessor blocks, break out of the loop after finding the (unique) layout predecessor. Sometimes a block may be listed more than once, and processing it more than once in this loop can lead to inconsistent values for FtTBB/FtFBB, since the AnalyzeBranch method does not clear these values. There's no point in continuing the loop regardless. The testcase for this is reduced from the 2003-05-02-DependentPHI SingleSource test. llvm-svn: 71536	2009-05-12 03:48:10 +00:00
Dan Gohman	d13f674130	Factor the code for collecting IV users out of LSR into an IVUsers class, and generalize it so that it can be used by IndVarSimplify. Implement the base IndVarSimplify transformation code using IVUsers. This removes TestOrigIVForWrap and associated code, as ScalarEvolution now has enough builtin overflow detection and folding logic to handle all the same cases, and more. Run "opt -iv-users -analyze -disable-output" on your favorite loop for an example of what IVUsers does. This lets IndVarSimplify eliminate IV casts and compute trip counts in more cases. Also, this happens to finally fix the remaining testcases in PR1301. Now that IndVarSimplify is being more aggressive, it occasionally runs into the problem where ScalarEvolutionExpander's code for avoiding duplicate expansions makes it difficult to ensure that all expanded instructions dominate all the instructions that will use them. As a temporary measure, IndVarSimplify now uses a FixUsesBeforeDefs function to fix up instructions inserted by SCEVExpander. Fortunately, this code is contained, and can be easily removed once a more comprehensive solution is available. llvm-svn: 71535	2009-05-12 02:17:14 +00:00
Dan Gohman	cac9b5c5be	When forgetting SCEVs for loop PHIs, don't forget SCEVUnknown values. These values aren't analyzable, so they don't care if more information about the loop trip count can be had. Also, SCEVUnknown is used for a PHI while the PHI itself is being analyzed, so it needs to be left in the Scalars map. This fixes a variety of subtle issues. llvm-svn: 71533	2009-05-12 01:27:58 +00:00
Evan Cheng	9b27f3ec42	Teach LSR to optimize more loop exit compares, i.e. change them to use postinc iv value. Previously LSR would only optimize those which are in the loop latch block. However, if LSR can prove it is safe (and profitable), it's now possible to change those not in the latch blocks to use postinc values. Also, if the compare is the only use, LSR would place the iv increment instruction before the compare instead in the latch. llvm-svn: 71485	2009-05-11 22:33:01 +00:00
Dale Johannesen	dd32623987	Fix PR4188. TailMerging can't tolerate inexact sucessor info. llvm-svn: 71478	2009-05-11 21:54:13 +00:00
Dan Gohman	25ab4c185c	Make this grep line a little more specific so that it doesn't accidentally match something unrelated. llvm-svn: 71458	2009-05-11 18:49:56 +00:00
Dan Gohman	dfa39efe6d	When scalarizing a vector BITCAST, check whether the operand has vector type, rather than assume that it does. If the operand is not vector, it shouldn't be run through ScalarizeVectorOp. This fixes one of the testcases in PR3886. llvm-svn: 71453	2009-05-11 18:30:42 +00:00
Dan Gohman	0edabc8a6f	Convert a subtract into a negate and an add when it helps x86 address folding. llvm-svn: 71446	2009-05-11 18:02:53 +00:00
Dale Johannesen	f86e34065b	Reverse a loop that is counting up to a maximum to count down to 0 instead, under very restricted circumstances. Adjust 4 testcases in which this optimization fires. llvm-svn: 71439	2009-05-11 17:15:42 +00:00
Nick Lewycky	f417462ddf	Make MDNode use CallbackVH. Also change MDNode to store Value* instead of Constant* in preperation of a future change to support holding non-Constants in an MDNode. llvm-svn: 71407	2009-05-10 20:57:05 +00:00
Anton Korobeynikov	fe1c6d85b8	Add MSP430 test for PR4136 llvm-svn: 71392	2009-05-10 14:48:36 +00:00
Eli Friedman	aec1764402	Allow scalar evolution to compute iteration counts for loops with a pointer-based condition. This fixes PR3171. llvm-svn: 71354	2009-05-09 12:32:42 +00:00
Evan Cheng	06b0d3879e	Enable loop bb placement optimization. llvm-svn: 71291	2009-05-08 23:35:49 +00:00
Dan Gohman	141989d3c2	Fix bogus overflow checks by replacing them with actual overflow checks. llvm-svn: 71284	2009-05-08 23:11:16 +00:00
Dan Gohman	98da279d6d	Use .td for tablegen files, not .ll. llvm-svn: 71277	2009-05-08 23:01:28 +00:00
Dan Gohman	603f022049	Fold trunc casts into add-recurrence expressions, allowing the add-recurrence to be exposed. Add a new SCEV folding rule to help simplify expressions in the presence of these extra truncs. llvm-svn: 71264	2009-05-08 21:03:19 +00:00
Chris Lattner	7b2dabcac9	Fix PR4152: asm constraint validation happens before dag combine, so we need to work a bit to combine things like (x+c1+c2) into x+c3. llvm-svn: 71232	2009-05-08 18:23:14 +00:00
Chris Lattner	0fd5aea274	fix RewriteStoreUserOfWholeAlloca to use the correct type size method, fixing a crash on PR4146. While the store will ultimately overwrite the "padded size" number of bits in memory, the stored value may be a subset of this size. This function only wants to handle the case where all bits are stored. llvm-svn: 71224	2009-05-08 15:54:41 +00:00
Evan Cheng	2a1d20b0fb	Optimize code placement in loop to eliminate unconditional branches or move unconditional branch to the outside of the loop. e.g. /// A: /// ... /// <fallthrough to B> /// /// B: --> loop header /// ... /// jcc <cond> C, [exit] /// /// C: /// ... /// jmp B /// /// ==> /// /// A: /// ... /// jmp B /// /// C: --> new loop header /// ... /// <fallthough to B> /// /// B: /// ... /// jcc <cond> C, [exit] llvm-svn: 71209	2009-05-08 06:34:09 +00:00
Eli Friedman	a280375b23	PR4123: don't crash when inlining a call which uses its own result. llvm-svn: 71199	2009-05-08 00:22:04 +00:00
Bob Wilson	d61f4e70d8	Fix pr4100. Do not remove no-op copies when they are dead. The register scavenger gets confused about register liveness if it doesn't see them. I'm not thrilled with this solution, but it only comes up when there are dead copies in the code, which is something that hopefully doesn't happen much. Here is what happens in pr4100: As shown in the following excerpt from the debug output of llc, the source of a move gets reloaded from the stack, inserting a new load instruction before the move. Since that source operand is a kill, the physical register is free to be reused for the destination of the move. The move ends up being a no-op, copying R3 to R3, so it is deleted. But, it leaves behind the load to reload %reg1028 into R3, and that load is not updated to show that it's destination operand (R3) is dead. The scavenger gets confused by that load because it thinks that R3 is live. Starting RegAlloc of: %reg1025<def,dead> = MOVr %reg1028<kill>, 14, %reg0, %reg0 Regs have values: Reloading %reg1028 into R3 Last use of R3[%reg1028], removing it from live set Assigning R3 to %reg1025 Register R3 [%reg1025] is never used, removing it from live set Alternative solutions might be either marking the load as dead, or zapping the load along with the no-op copy. I couldn't see an easy way to do either of those, though. llvm-svn: 71196	2009-05-07 23:47:03 +00:00
Dan Gohman	ebacd61d7d	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Duncan Sands	e90202e388	Revert r70876 and add a testcase (@c7) showing the problem: bits captured, but the pointer marked nocapture. In fact I now recall that this problem is why only readnone functions returning void were considered before! However keep a small fix that was also in r70876: a readnone function returning void can result in bits being captured if it unwinds, so test for this. llvm-svn: 71168	2009-05-07 18:08:34 +00:00
Bill Wendling	9f97e4a3dc	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decUtility.o] Error 1 make[4]: * Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decNumber.o] Error 1 make[3]: * [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Dan Gohman	9a6a882979	Constant-fold ptrtoint+add+inttoptr to gep when the pointer is an array and the add is within range. This helps simplify expressions expanded by ScalarEvolutionExpander. llvm-svn: 71158	2009-05-07 14:24:56 +00:00
Bill Wendling	864cbcfc46	THis doesn't fail. llvm-svn: 71142	2009-05-07 01:41:42 +00:00
Bill Wendling	7c50dcd02e	Temporarily revert r71010. It was causing massive failures during self-hosting. llvm-svn: 71138	2009-05-07 01:27:25 +00:00
Evan Cheng	0ee6696fd8	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Duncan Sands	8478d08c36	Nounwind is not valid for function return values. llvm-svn: 71082	2009-05-06 13:51:18 +00:00
Duncan Sands	28e07fdaa2	OCaml parameter attribute bindings from PR2752. Incomplete, but better than nothing. llvm-svn: 71081	2009-05-06 12:21:17 +00:00
Duncan Sands	b71ad70b4e	Fix PR3754: don't mark functions that wrap MallocInst with the readnone. Since MallocInst is scheduled for deletion it doesn't seem worth doing anything more subtle, such as having mayWriteToMemory return true for MallocInst. llvm-svn: 71077	2009-05-06 08:42:00 +00:00
Duncan Sands	880eaf5278	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Lang Hames	fcc5ebb1d4	Renamed Spiller classes (plus uses and related files) to VirtRegRewriter. llvm-svn: 71057	2009-05-06 02:36:21 +00:00
Mikhail Glushenkov	d9ef672a0d	The 'forward_as' property did not use its second argument. See PR4159 for details. Patch by Martin Nowack! llvm-svn: 71054	2009-05-06 01:41:19 +00:00
Evan Cheng	0d781df8dc	Quotes should be printed before private prefix; some code clean up. llvm-svn: 71032	2009-05-05 22:50:29 +00:00
Dan Gohman	5e839321f2	If a MachineBasicBlock has multiple ways of reaching another block, allow it to have multiple CFG edges to that block. This is needed to allow MachineBasicBlock::isOnlyReachableByFallthrough to work correctly. This fixes PR4126. llvm-svn: 71018	2009-05-05 21:10:19 +00:00
Bill Wendling	5f4fcbeb10	Temporarily reverting r71008. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/change-compare-stride-1.ll Failed with exit(1) at line 2 while running: grep {cmpq $-478,} change-compare-stride-1.ll.tmp child process exited abnormally llvm-svn: 71013	2009-05-05 20:49:46 +00:00
Evan Cheng	984da04cd0	Enable stack coloring with regs at -O3. llvm-svn: 71010	2009-05-05 20:30:36 +00:00
David Greene	2bb2b3840e	Handle overflow of 64-bit loop conditions. llvm-svn: 71008	2009-05-05 20:22:36 +00:00
Chris Lattner	5cc9a36d1c	Add basic support for code generation of addrspace(257) -> FS relative on x86. Patch by Zoltan Varga! llvm-svn: 70992	2009-05-05 18:52:19 +00:00
David Greene	9aad2bbcf9	Allow multiclass def names to contain "#NAME"" where TableGen replaces #NAME# with the name of the defm instantiating the multiclass. This is useful for AVX instruction naming where a "V" prefix is standard throughout the ISA. For example: multiclass SSE_AVX_Inst<...> { def SS : Instr<...>; def SD : Instr<...>; def PS : Instr<...>; def PD : Instr<...>; def V#NAME#SS : Instr<...>; def V#NAME#SD : Instr<...>; def V#NAME#PS : Instr<...>; def V#NAME#PD : Instr<...>; } defm ADD : SSE_AVX_Inst<...>; Results in ADDSS ADDSD ADDPS ADDPD VADDSS VADDSD VADDPS VADDPD llvm-svn: 70979	2009-05-05 16:28:25 +00:00
Mikhail Glushenkov	2b4696b585	Fix incorrect code generation with ENV. See PR4157 for details. Patch by Martin Nowack! llvm-svn: 70973	2009-05-05 12:34:34 +00:00
Dan Gohman	2973567a95	X86FastISel doesn't support the -tailcallopt ABI. llvm-svn: 70902	2009-05-04 19:50:33 +00:00
Anton Korobeynikov	262a397978	Fix code emission for conditional branches. Patch by Collin Winter! llvm-svn: 70898	2009-05-04 19:10:38 +00:00
Bill Wendling	417e759a87	Use %llvmgcc instead of llvm-gcc. llvm-svn: 70886	2009-05-04 18:00:27 +00:00
Duncan Sands	4c7021febf	Teach capture tracking that readonly functions can only capture their arguments by returning them or throwing an exception or not based on the argument value. Patch essentially by Frits van Bommel. llvm-svn: 70876	2009-05-04 16:50:29 +00:00
Duncan Sands	b77e5b9e2e	Check that pure/const functions are marked nounwind. llvm-svn: 70875	2009-05-04 16:47:11 +00:00
Argyrios Kyrtzidis	fb958c2b09	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. (Comes with Regression-Be-Gone(tm)) llvm-svn: 70871	2009-05-04 16:23:49 +00:00
Duncan Sands	1b56ebfb59	Testcase for PR3967. llvm-svn: 70856	2009-05-04 12:54:02 +00:00
Chris Lattner	6807ddd3d9	* Sink 4 duplicates of edge threading validity checks and DOUT prints into ThreadEdge directly. This shares the code, but is just a refactoring. * Make JumpThreading compute the set of loop headers and avoid threading across them. This prevents jump threading from forming irreducible loops (goodness) but also prevents it from threading in other cases that are beneficial (see the comment above FindFunctionBackedges). llvm-svn: 70820	2009-05-04 02:28:08 +00:00
Argyrios Kyrtzidis	e68261749e	Revert r70803 for now, it causes a regression. llvm-svn: 70811	2009-05-03 23:27:19 +00:00
Argyrios Kyrtzidis	bb6e4d027c	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. llvm-svn: 70803	2009-05-03 22:03:35 +00:00
Dan Gohman	a79cce4aef	Previously, RecursivelyDeleteDeadInstructions provided an option of returning a list of pointers to Values that are deleted. This was unsafe, because the pointers in the list are, by nature of what RecursivelyDeleteDeadInstructions does, always dangling. Replace this with a simple callback mechanism. This may eventually be removed if all clients can reasonably be expected to use CallbackVH. Use this to factor out the dead-phi-cycle-elimination code from LSR utility function, and generalize it to use the RecursivelyDeleteTriviallyDeadInstructions utility function. This makes LSR more aggressive about eliminating dead PHI cycles; adjust tests to either be less trivial or to simply expect fewer instructions. llvm-svn: 70636	2009-05-02 18:29:22 +00:00
Chris Lattner	c6d561ed27	'The attached patch fixes an issue where llc -march=cpp fails with "Invalid primitive type" on input containing the x86_fp80 type.' Patch by Collin Winter! llvm-svn: 70610	2009-05-01 23:54:26 +00:00
Dan Gohman	0dc2b769b0	When printing a SCEVUnknown with pointer type, don't print an artificial "ptrtoint", as it tends to clutter up complicated expressions. The cast operators now print both source and destination types, which is usually sufficient. llvm-svn: 70554	2009-05-01 17:02:22 +00:00
Dan Gohman	3c9f4f765c	Extend ScalarEvolution's getBackedgeTakenCount to be able to compute an upper-bound value for the trip count, in addition to the actual trip count. Use this to allow getZeroExtendExpr and getSignExtendExpr to fold casts in more cases. This may eventually morph into a more general value-range analysis capability; there are certainly plenty of places where more complete value-range information would allow more folding. llvm-svn: 70509	2009-04-30 20:47:05 +00:00
Dan Gohman	25d21786d3	Don't try to mix integers and pointers in an icmp instruction in getSCEVAtScope. llvm-svn: 70495	2009-04-30 16:40:30 +00:00
Evan Cheng	b7d41a6680	Mark MOV8mr_NOREX and MOV8rm_NOREX as mayStore / mayLoad respectively. llvm-svn: 70461	2009-04-30 00:58:57 +00:00
Chris Lattner	794fb5b4b3	fix a regression handling indirect results: these need to be considered memory operands otherwise the writebacks get lost when the inline asm doesn't otherwise have side effects. This fixes rdar://6839427, though clang really shouldn't generate these anymore. llvm-svn: 70455	2009-04-30 00:48:50 +00:00
Nate Begeman	b407809122	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Dan Gohman	06aff30f01	Generalize the cast-of-addrec folding to handle folding of SCEVs like (sext i8 {-128,+,1} to i64) to i64 {-128,+,1}, where the iteration crosses from negative to positive, but is still safe if the trip count is within range. llvm-svn: 70421	2009-04-29 22:28:28 +00:00
Dan Gohman	9fa631c81a	Fix this test to match the new output from scalar-evolution. llvm-svn: 70410	2009-04-29 21:06:20 +00:00
Dan Gohman	55befacc69	Include the source type in SCEV cast expression debug output, and print sext, zext, and trunc, instead of signextend, zeroextend, and truncate, respectively, for consistency with the main IR. llvm-svn: 70405	2009-04-29 20:27:52 +00:00
Dale Johannesen	15486ddd95	Fix recent regression in gcc.dg/pr26719.c (6835035). llvm-svn: 70386	2009-04-29 16:38:47 +00:00
Evan Cheng	62fdc300dd	spillPhysRegAroundRegDefsUses() may have invalidated iterators stored in fixed_ IntervalPtrs. Reset them. llvm-svn: 70378	2009-04-29 07:16:34 +00:00
Chris Lattner	e0b97f682d	testcase for PR4082 llvm-svn: 70375	2009-04-29 06:46:27 +00:00
Chris Lattner	e1eefefdc3	Disable the load-shrinking optimization from looking at anything larger than 64-bits, avoiding a crash. This should really be fixed to use APInts, though type legalization happens to help us out and we get good code on the attached testcase at least. This fixes rdar://6836460 llvm-svn: 70360	2009-04-29 03:45:07 +00:00
Bill Wendling	7546bed590	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Dan Gohman	346c77f79d	As with r70333, give the primary induction variable a use so that it can't be trivially eliminated. llvm-svn: 70334	2009-04-28 22:05:13 +00:00
Dan Gohman	5bb06cda1e	Make this testcase slightly less trivial, so that it doesn't fail if indvars happens to optimize away the unused primary induction variable. llvm-svn: 70333	2009-04-28 22:03:26 +00:00
Dan Gohman	211c5de27d	Fix a grammaro in a comment. llvm-svn: 70331	2009-04-28 21:54:23 +00:00
Anton Korobeynikov	1799ac4b55	Properly print 'P' modifier on inline asm memory operands. This should fix PR3379 and PR4064. Patch inspired by Edwin Török! llvm-svn: 70328	2009-04-28 21:49:33 +00:00
Dale Johannesen	db6d3a77dc	Test for llvm-gcc bug fixed by 70301. llvm-svn: 70302	2009-04-28 17:16:30 +00:00
Evan Cheng	754a0d2f9e	Fix PR4034. Bug in LiveInterval::join when it's compacting new valno's. llvm-svn: 70291	2009-04-28 06:24:09 +00:00
Evan Cheng	8a9736a26c	Fix for PR4051. When 2address pass delete an instruction, update kill info when necessary. llvm-svn: 70279	2009-04-28 02:12:36 +00:00
Bill Wendling	ef47ace92f	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	2799e916c3	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Dale Johannesen	626b0a32f7	Fix PR 4086, a bug in FP IV elimination. llvm-svn: 70247	2009-04-27 21:03:15 +00:00
Evan Cheng	c315cf24e3	Fix PR4076. Correctly create live interval of physical register with two-address update. llvm-svn: 70245	2009-04-27 20:42:46 +00:00
Dan Gohman	e1a532cb4f	Permit ChangeCompareStride to rewrite a comparison when the factor between the comparison's iv stride and the candidate stride is exactly -1. llvm-svn: 70244	2009-04-27 20:35:32 +00:00
Dan Gohman	ff30ebd710	Teach getZeroExtendExpr and getSignExtendExpr to use trip-count information to simplify [sz]ext({a,+,b}) to {zext(a),+,[zs]ext(b)}, as appropriate. These functions and the trip count code each call into the other, so this requires careful handling to avoid infinite recursion. During the initial trip count computation, conservative SCEVs are used, which are subsequently discarded once the trip count is actually known. Among other benefits, this change lets LSR automatically eliminate some unnecessary zext-inreg and sext-inreg operation where the operand is an induction variable. llvm-svn: 70241	2009-04-27 20:16:15 +00:00
Dale Johannesen	2a494ee2e1	Test for (llvm-gcc) 70231. llvm-svn: 70233	2009-04-27 19:15:09 +00:00
Nate Begeman	7902a2344d	Revert accidental testcase reduction llvm-svn: 70226	2009-04-27 18:42:40 +00:00
Nate Begeman	9d121924fd	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Evan Cheng	43fc90ae59	Fix PR4056. It's possible a physical register def is dead if its implicit use is deleted by two-address pass. llvm-svn: 70213	2009-04-27 17:36:47 +00:00

... 3 4 5 6 7 ...

7260 Commits