llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Eli Friedman	b32b64b5b4	Fix broken logic in DominatorTreeBase::Split. Part of PR4238. llvm-svn: 72231	2009-05-21 21:47:54 +00:00
Eli Friedman	d4f9668eb7	Fix some incorrect logic in DominanceFrontier::splitBlock. Part of PR4238. llvm-svn: 72223	2009-05-21 20:40:30 +00:00
Dan Gohman	fc28858d91	Teach ValueTracking a new way to analyze PHI nodes, and and teach Instcombine to be more aggressive about using SimplifyDemandedBits on shift nodes. This allows a shift to be simplified to zero in the included test case. llvm-svn: 72204	2009-05-21 02:28:33 +00:00
Eli Friedman	b6fe72e457	Fix for PR4235: to build a floating-point value from integer parts, build an integer and cast that to a float. This fixes a crash caused by trying to split an f32 into two f16's. This changes the behavior in test/CodeGen/XCore/fneg.ll because that testcase now triggers a DAGCombine which converts the fneg into an integer operation. If someone is interested, it's probably possible to tweak the test to generate an actual fneg. llvm-svn: 72162	2009-05-20 06:02:09 +00:00
Evan Cheng	ff129ff17f	Fix test on non-darwin hosts. llvm-svn: 72161	2009-05-20 05:45:36 +00:00
Evan Cheng	e17c02e328	Try again. Allow call to immediate address for ELF or when in static relocation mode. llvm-svn: 72160	2009-05-20 04:53:57 +00:00
Evan Cheng	8a4887572e	Cannot use immediate as call absolute target in PIC mode. llvm-svn: 72154	2009-05-20 01:11:00 +00:00
Dan Gohman	9e0f5a28dc	Suppress the IV reversal transformation in the case that the RHS of the comparison is defined inside the loop. This fixes a use-before-def problem, because the transformation puts a use of the RHS outside the loop. llvm-svn: 72149	2009-05-20 00:34:08 +00:00
Bob Wilson	c6726ecca5	Fix pr4058 and pr4059. Do not split i64 or double arguments between r3 and the stack. Patch by Sandeep Patel. llvm-svn: 72106	2009-05-19 10:02:36 +00:00
Bob Wilson	ec676a76e7	Fix pr4091: Add support for "m" constraint in ARM inline assembly. llvm-svn: 72105	2009-05-19 05:53:42 +00:00
Dan Gohman	922033d119	Teach SCEVExpander to expand arithmetic involving pointers into GEP instructions. It attempts to create high-level multi-operand GEPs, though in cases where this isn't possible it falls back to casting the pointer to i8* and emitting a GEP with that. Using GEP instructions instead of ptrtoint+arithmetic+inttoptr helps pointer analyses that don't use ScalarEvolution, such as BasicAliasAnalysis. Also, make the AddrModeMatcher more aggressive in handling GEPs. Previously it assumed that operand 0 of a GEP would require a register in almost all cases. It now does extra checking and can do more matching if operand 0 of the GEP is foldable. This fixes a problem that was exposed by SCEVExpander using GEPs. llvm-svn: 72093	2009-05-19 02:15:55 +00:00
Bill Wendling	ae8a483328	Commands beginning with '--' are converted to '-f' by gcc. Blech! llvm-svn: 72023	2009-05-18 18:09:36 +00:00
Dan Gohman	592d65ba06	Teach ScalarEvolution to recognize x^-1 in the case where non-demanded bits have been stripped out by instcombine. llvm-svn: 72010	2009-05-18 16:29:04 +00:00
Dan Gohman	2c9bd7e0cb	Make ScalarEvolution::isLoopGuardedByCond work even when the edge entering a loop is a non-split critical edge. llvm-svn: 72004	2009-05-18 15:36:09 +00:00
Dan Gohman	904f081ce7	Add nounwind to a few tests. llvm-svn: 72002	2009-05-18 15:16:49 +00:00
Duncan Sands	ead6c97920	Check that the gcc front-end is not doing inlining when not doing unit-at-a-time. llvm-svn: 71986	2009-05-17 19:37:02 +00:00
Anton Korobeynikov	85accafcba	Mark rotl/rotr as expand. This generates pretty ugly code, but this is better than nothing. llvm-svn: 71976	2009-05-17 10:16:28 +00:00
Anton Korobeynikov	8753e89b79	Typo llvm-svn: 71975	2009-05-17 10:15:22 +00:00
Jakob Stoklund Olesen	fa57451cf5	Help DejaGnu avoid pipe-jam by producing less output from certain test cases. When a test fails with more than a pipeful of output on stdout AND stderr, one of the DejaGnu programs blocks. The problem can be avoided by redirecting stdout to a file. llvm-svn: 71919	2009-05-16 00:34:42 +00:00
David Greene	9a1e15d0d0	Implement !if, analogous to $(if) in GNU make. llvm-svn: 71815	2009-05-14 23:26:46 +00:00
David Greene	5841d58c6e	Fix tests to not upset DejaGNU. llvm-svn: 71811	2009-05-14 23:21:40 +00:00
David Greene	70881bc6ae	Graduate LLVM to the big leagues by embedding a LISP processor into TableGen. Ok, not really, but do support some common LISP functions: * car * cdr * null llvm-svn: 71805	2009-05-14 22:38:31 +00:00
David Greene	fab0ee79db	Implement a !foreach operator analogous to GNU make's $(foreach). Use it on dags and lists like this: class decls { string name; } def Decls : decls; class B<list<string> names> : A<!foreach(Decls.name, names, !strconcat(Decls.name, ", Sr."))>; llvm-svn: 71803	2009-05-14 22:23:47 +00:00
David Greene	26054a566e	Implement a !subst operation simmilar to $(subst) in GNU make to do def/var/string substitution on generic pattern templates. For example: def Type; def v4f32 : Type; def TYPE : Type; class GenType<Type t> { let type = !(subst TYPE, v4f32, t); } def TheType : GenType<TYPE>; llvm-svn: 71801	2009-05-14 21:54:42 +00:00
David Greene	e2871b8c65	Implement !cast. llvm-svn: 71794	2009-05-14 21:22:49 +00:00
Dan Gohman	a09e38894a	Add nounwind to this test. llvm-svn: 71734	2009-05-13 22:29:12 +00:00
Bill Wendling	c76422f45d	Remove too large testcase. llvm-svn: 71730	2009-05-13 21:51:26 +00:00
Bill Wendling	35584a26be	Move the bookkeeping of the debug scopes back to the place where it belonged. The variable declaration stuff wasn't happy with it where it was. Sorry that the testcase is so big. Bugpoint wasn't able to reduce it successfully. llvm-svn: 71714	2009-05-13 20:33:33 +00:00
Dale Johannesen	da2e1e314b	Testcase for 71688. llvm-svn: 71691	2009-05-13 18:33:24 +00:00
Chris Lattner	eb2f327449	calls in nothrow functions can be marked nothrow even if the callee is not known to be nothrow. This allows readnone/readonly functions to be deleted even if we don't know whether the callee can throw. llvm-svn: 71676	2009-05-13 17:39:14 +00:00
Chris Lattner	927ebd34e2	Fix PR4206 - crash in simplify lib calls llvm-svn: 71644	2009-05-13 06:26:11 +00:00
Evan Cheng	e43bfc153e	If header of inner loop is aligned, do not align the outer loop header. We don't want to add nops in the outer loop for the sake of aligning the inner loop. llvm-svn: 71609	2009-05-12 23:58:14 +00:00
Evan Cheng	c7f7276825	Teach TransferDeadness to delete truly dead instructions if they do not produce side effects. llvm-svn: 71606	2009-05-12 23:07:00 +00:00
Evan Cheng	b0a4c44103	Add nounwind. llvm-svn: 71575	2009-05-12 18:35:43 +00:00
Evan Cheng	d6e3e4d746	Fixed a stack slot coloring with reg bug: do not update implicit use / def when doing forward / backward propagation. llvm-svn: 71574	2009-05-12 18:31:57 +00:00
Bob Wilson	16f684a429	Fix pr4195: When iterating through predecessor blocks, break out of the loop after finding the (unique) layout predecessor. Sometimes a block may be listed more than once, and processing it more than once in this loop can lead to inconsistent values for FtTBB/FtFBB, since the AnalyzeBranch method does not clear these values. There's no point in continuing the loop regardless. The testcase for this is reduced from the 2003-05-02-DependentPHI SingleSource test. llvm-svn: 71536	2009-05-12 03:48:10 +00:00
Dan Gohman	d13f674130	Factor the code for collecting IV users out of LSR into an IVUsers class, and generalize it so that it can be used by IndVarSimplify. Implement the base IndVarSimplify transformation code using IVUsers. This removes TestOrigIVForWrap and associated code, as ScalarEvolution now has enough builtin overflow detection and folding logic to handle all the same cases, and more. Run "opt -iv-users -analyze -disable-output" on your favorite loop for an example of what IVUsers does. This lets IndVarSimplify eliminate IV casts and compute trip counts in more cases. Also, this happens to finally fix the remaining testcases in PR1301. Now that IndVarSimplify is being more aggressive, it occasionally runs into the problem where ScalarEvolutionExpander's code for avoiding duplicate expansions makes it difficult to ensure that all expanded instructions dominate all the instructions that will use them. As a temporary measure, IndVarSimplify now uses a FixUsesBeforeDefs function to fix up instructions inserted by SCEVExpander. Fortunately, this code is contained, and can be easily removed once a more comprehensive solution is available. llvm-svn: 71535	2009-05-12 02:17:14 +00:00
Dan Gohman	cac9b5c5be	When forgetting SCEVs for loop PHIs, don't forget SCEVUnknown values. These values aren't analyzable, so they don't care if more information about the loop trip count can be had. Also, SCEVUnknown is used for a PHI while the PHI itself is being analyzed, so it needs to be left in the Scalars map. This fixes a variety of subtle issues. llvm-svn: 71533	2009-05-12 01:27:58 +00:00
Evan Cheng	9b27f3ec42	Teach LSR to optimize more loop exit compares, i.e. change them to use postinc iv value. Previously LSR would only optimize those which are in the loop latch block. However, if LSR can prove it is safe (and profitable), it's now possible to change those not in the latch blocks to use postinc values. Also, if the compare is the only use, LSR would place the iv increment instruction before the compare instead in the latch. llvm-svn: 71485	2009-05-11 22:33:01 +00:00
Dale Johannesen	dd32623987	Fix PR4188. TailMerging can't tolerate inexact sucessor info. llvm-svn: 71478	2009-05-11 21:54:13 +00:00
Dan Gohman	25ab4c185c	Make this grep line a little more specific so that it doesn't accidentally match something unrelated. llvm-svn: 71458	2009-05-11 18:49:56 +00:00
Dan Gohman	dfa39efe6d	When scalarizing a vector BITCAST, check whether the operand has vector type, rather than assume that it does. If the operand is not vector, it shouldn't be run through ScalarizeVectorOp. This fixes one of the testcases in PR3886. llvm-svn: 71453	2009-05-11 18:30:42 +00:00
Dan Gohman	0edabc8a6f	Convert a subtract into a negate and an add when it helps x86 address folding. llvm-svn: 71446	2009-05-11 18:02:53 +00:00
Dale Johannesen	f86e34065b	Reverse a loop that is counting up to a maximum to count down to 0 instead, under very restricted circumstances. Adjust 4 testcases in which this optimization fires. llvm-svn: 71439	2009-05-11 17:15:42 +00:00
Nick Lewycky	f417462ddf	Make MDNode use CallbackVH. Also change MDNode to store Value* instead of Constant* in preperation of a future change to support holding non-Constants in an MDNode. llvm-svn: 71407	2009-05-10 20:57:05 +00:00
Anton Korobeynikov	fe1c6d85b8	Add MSP430 test for PR4136 llvm-svn: 71392	2009-05-10 14:48:36 +00:00
Eli Friedman	aec1764402	Allow scalar evolution to compute iteration counts for loops with a pointer-based condition. This fixes PR3171. llvm-svn: 71354	2009-05-09 12:32:42 +00:00
Evan Cheng	06b0d3879e	Enable loop bb placement optimization. llvm-svn: 71291	2009-05-08 23:35:49 +00:00
Dan Gohman	141989d3c2	Fix bogus overflow checks by replacing them with actual overflow checks. llvm-svn: 71284	2009-05-08 23:11:16 +00:00
Dan Gohman	98da279d6d	Use .td for tablegen files, not .ll. llvm-svn: 71277	2009-05-08 23:01:28 +00:00
Dan Gohman	603f022049	Fold trunc casts into add-recurrence expressions, allowing the add-recurrence to be exposed. Add a new SCEV folding rule to help simplify expressions in the presence of these extra truncs. llvm-svn: 71264	2009-05-08 21:03:19 +00:00
Chris Lattner	7b2dabcac9	Fix PR4152: asm constraint validation happens before dag combine, so we need to work a bit to combine things like (x+c1+c2) into x+c3. llvm-svn: 71232	2009-05-08 18:23:14 +00:00
Chris Lattner	0fd5aea274	fix RewriteStoreUserOfWholeAlloca to use the correct type size method, fixing a crash on PR4146. While the store will ultimately overwrite the "padded size" number of bits in memory, the stored value may be a subset of this size. This function only wants to handle the case where all bits are stored. llvm-svn: 71224	2009-05-08 15:54:41 +00:00
Evan Cheng	2a1d20b0fb	Optimize code placement in loop to eliminate unconditional branches or move unconditional branch to the outside of the loop. e.g. /// A: /// ... /// <fallthrough to B> /// /// B: --> loop header /// ... /// jcc <cond> C, [exit] /// /// C: /// ... /// jmp B /// /// ==> /// /// A: /// ... /// jmp B /// /// C: --> new loop header /// ... /// <fallthough to B> /// /// B: /// ... /// jcc <cond> C, [exit] llvm-svn: 71209	2009-05-08 06:34:09 +00:00
Eli Friedman	a280375b23	PR4123: don't crash when inlining a call which uses its own result. llvm-svn: 71199	2009-05-08 00:22:04 +00:00
Bob Wilson	d61f4e70d8	Fix pr4100. Do not remove no-op copies when they are dead. The register scavenger gets confused about register liveness if it doesn't see them. I'm not thrilled with this solution, but it only comes up when there are dead copies in the code, which is something that hopefully doesn't happen much. Here is what happens in pr4100: As shown in the following excerpt from the debug output of llc, the source of a move gets reloaded from the stack, inserting a new load instruction before the move. Since that source operand is a kill, the physical register is free to be reused for the destination of the move. The move ends up being a no-op, copying R3 to R3, so it is deleted. But, it leaves behind the load to reload %reg1028 into R3, and that load is not updated to show that it's destination operand (R3) is dead. The scavenger gets confused by that load because it thinks that R3 is live. Starting RegAlloc of: %reg1025<def,dead> = MOVr %reg1028<kill>, 14, %reg0, %reg0 Regs have values: Reloading %reg1028 into R3 Last use of R3[%reg1028], removing it from live set Assigning R3 to %reg1025 Register R3 [%reg1025] is never used, removing it from live set Alternative solutions might be either marking the load as dead, or zapping the load along with the no-op copy. I couldn't see an easy way to do either of those, though. llvm-svn: 71196	2009-05-07 23:47:03 +00:00
Dan Gohman	ebacd61d7d	Revert 71165. It did more than just revert 71158 and it introduced several regressions. The problem due to 71158 is now fixed. llvm-svn: 71176	2009-05-07 19:46:24 +00:00
Duncan Sands	e90202e388	Revert r70876 and add a testcase (@c7) showing the problem: bits captured, but the pointer marked nocapture. In fact I now recall that this problem is why only readnone functions returning void were considered before! However keep a small fix that was also in r70876: a readnone function returning void can result in bits being captured if it unwinds, so test for this. llvm-svn: 71168	2009-05-07 18:08:34 +00:00
Bill Wendling	9f97e4a3dc	Temporarily revert r71158. It was causing a failure during a full bootstrap: checking for bcopy... no checking for getc_unlocked... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decUtility.c:360: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decUtility.o] Error 1 make[4]: * Waiting for unfinished jobs.... Assertion failed: (0 && "Unknown SCEV kind!"), function operator(), file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmCore.roots/llvmCore~obj/src/lib/Analysis/ScalarEvolution.cpp, line 511. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvmgcc42.roots/llvmgcc42~obj/src/libdecnumber/decNumber.c:5591: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. make[4]: * [decNumber.o] Error 1 make[3]: * [all-stage2-libdecnumber] Error 2 make[3]: *** Waiting for unfinished jobs.... llvm-svn: 71165	2009-05-07 17:26:14 +00:00
Dan Gohman	9a6a882979	Constant-fold ptrtoint+add+inttoptr to gep when the pointer is an array and the add is within range. This helps simplify expressions expanded by ScalarEvolutionExpander. llvm-svn: 71158	2009-05-07 14:24:56 +00:00
Bill Wendling	864cbcfc46	THis doesn't fail. llvm-svn: 71142	2009-05-07 01:41:42 +00:00
Bill Wendling	7c50dcd02e	Temporarily revert r71010. It was causing massive failures during self-hosting. llvm-svn: 71138	2009-05-07 01:27:25 +00:00
Evan Cheng	0ee6696fd8	Do not use register as base ptr of pre- and post- inc/dec load / store nodes. llvm-svn: 71098	2009-05-06 18:25:01 +00:00
Duncan Sands	8478d08c36	Nounwind is not valid for function return values. llvm-svn: 71082	2009-05-06 13:51:18 +00:00
Duncan Sands	28e07fdaa2	OCaml parameter attribute bindings from PR2752. Incomplete, but better than nothing. llvm-svn: 71081	2009-05-06 12:21:17 +00:00
Duncan Sands	b71ad70b4e	Fix PR3754: don't mark functions that wrap MallocInst with the readnone. Since MallocInst is scheduled for deletion it doesn't seem worth doing anything more subtle, such as having mayWriteToMemory return true for MallocInst. llvm-svn: 71077	2009-05-06 08:42:00 +00:00
Duncan Sands	880eaf5278	Allow readonly functions to unwind exceptions. Teach the optimizers about this. For example, a readonly function with no uses cannot be removed unless it is also marked nounwind. llvm-svn: 71071	2009-05-06 06:49:50 +00:00
Lang Hames	fcc5ebb1d4	Renamed Spiller classes (plus uses and related files) to VirtRegRewriter. llvm-svn: 71057	2009-05-06 02:36:21 +00:00
Mikhail Glushenkov	d9ef672a0d	The 'forward_as' property did not use its second argument. See PR4159 for details. Patch by Martin Nowack! llvm-svn: 71054	2009-05-06 01:41:19 +00:00
Evan Cheng	0d781df8dc	Quotes should be printed before private prefix; some code clean up. llvm-svn: 71032	2009-05-05 22:50:29 +00:00
Dan Gohman	5e839321f2	If a MachineBasicBlock has multiple ways of reaching another block, allow it to have multiple CFG edges to that block. This is needed to allow MachineBasicBlock::isOnlyReachableByFallthrough to work correctly. This fixes PR4126. llvm-svn: 71018	2009-05-05 21:10:19 +00:00
Bill Wendling	5f4fcbeb10	Temporarily reverting r71008. It was causing this failure: Running /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/dg.exp ... FAIL: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/test/ CodeGen/X86/change-compare-stride-1.ll Failed with exit(1) at line 2 while running: grep {cmpq $-478,} change-compare-stride-1.ll.tmp child process exited abnormally llvm-svn: 71013	2009-05-05 20:49:46 +00:00
Evan Cheng	984da04cd0	Enable stack coloring with regs at -O3. llvm-svn: 71010	2009-05-05 20:30:36 +00:00
David Greene	2bb2b3840e	Handle overflow of 64-bit loop conditions. llvm-svn: 71008	2009-05-05 20:22:36 +00:00
Chris Lattner	5cc9a36d1c	Add basic support for code generation of addrspace(257) -> FS relative on x86. Patch by Zoltan Varga! llvm-svn: 70992	2009-05-05 18:52:19 +00:00
David Greene	9aad2bbcf9	Allow multiclass def names to contain "#NAME"" where TableGen replaces #NAME# with the name of the defm instantiating the multiclass. This is useful for AVX instruction naming where a "V" prefix is standard throughout the ISA. For example: multiclass SSE_AVX_Inst<...> { def SS : Instr<...>; def SD : Instr<...>; def PS : Instr<...>; def PD : Instr<...>; def V#NAME#SS : Instr<...>; def V#NAME#SD : Instr<...>; def V#NAME#PS : Instr<...>; def V#NAME#PD : Instr<...>; } defm ADD : SSE_AVX_Inst<...>; Results in ADDSS ADDSD ADDPS ADDPD VADDSS VADDSD VADDPS VADDPD llvm-svn: 70979	2009-05-05 16:28:25 +00:00
Mikhail Glushenkov	2b4696b585	Fix incorrect code generation with ENV. See PR4157 for details. Patch by Martin Nowack! llvm-svn: 70973	2009-05-05 12:34:34 +00:00
Dan Gohman	2973567a95	X86FastISel doesn't support the -tailcallopt ABI. llvm-svn: 70902	2009-05-04 19:50:33 +00:00
Anton Korobeynikov	262a397978	Fix code emission for conditional branches. Patch by Collin Winter! llvm-svn: 70898	2009-05-04 19:10:38 +00:00
Bill Wendling	417e759a87	Use %llvmgcc instead of llvm-gcc. llvm-svn: 70886	2009-05-04 18:00:27 +00:00
Duncan Sands	4c7021febf	Teach capture tracking that readonly functions can only capture their arguments by returning them or throwing an exception or not based on the argument value. Patch essentially by Frits van Bommel. llvm-svn: 70876	2009-05-04 16:50:29 +00:00
Duncan Sands	b77e5b9e2e	Check that pure/const functions are marked nounwind. llvm-svn: 70875	2009-05-04 16:47:11 +00:00
Argyrios Kyrtzidis	fb958c2b09	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. (Comes with Regression-Be-Gone(tm)) llvm-svn: 70871	2009-05-04 16:23:49 +00:00
Duncan Sands	1b56ebfb59	Testcase for PR3967. llvm-svn: 70856	2009-05-04 12:54:02 +00:00
Chris Lattner	6807ddd3d9	* Sink 4 duplicates of edge threading validity checks and DOUT prints into ThreadEdge directly. This shares the code, but is just a refactoring. * Make JumpThreading compute the set of loop headers and avoid threading across them. This prevents jump threading from forming irreducible loops (goodness) but also prevents it from threading in other cases that are beneficial (see the comment above FindFunctionBackedges). llvm-svn: 70820	2009-05-04 02:28:08 +00:00
Argyrios Kyrtzidis	e68261749e	Revert r70803 for now, it causes a regression. llvm-svn: 70811	2009-05-03 23:27:19 +00:00
Argyrios Kyrtzidis	bb6e4d027c	-Remove the DwarfWriter::RecordSourceLine calls from the instruction selectors. -Depend on DebugLocs for source line info. llvm-svn: 70803	2009-05-03 22:03:35 +00:00
Dan Gohman	a79cce4aef	Previously, RecursivelyDeleteDeadInstructions provided an option of returning a list of pointers to Values that are deleted. This was unsafe, because the pointers in the list are, by nature of what RecursivelyDeleteDeadInstructions does, always dangling. Replace this with a simple callback mechanism. This may eventually be removed if all clients can reasonably be expected to use CallbackVH. Use this to factor out the dead-phi-cycle-elimination code from LSR utility function, and generalize it to use the RecursivelyDeleteTriviallyDeadInstructions utility function. This makes LSR more aggressive about eliminating dead PHI cycles; adjust tests to either be less trivial or to simply expect fewer instructions. llvm-svn: 70636	2009-05-02 18:29:22 +00:00
Chris Lattner	c6d561ed27	'The attached patch fixes an issue where llc -march=cpp fails with "Invalid primitive type" on input containing the x86_fp80 type.' Patch by Collin Winter! llvm-svn: 70610	2009-05-01 23:54:26 +00:00
Dan Gohman	0dc2b769b0	When printing a SCEVUnknown with pointer type, don't print an artificial "ptrtoint", as it tends to clutter up complicated expressions. The cast operators now print both source and destination types, which is usually sufficient. llvm-svn: 70554	2009-05-01 17:02:22 +00:00
Dan Gohman	3c9f4f765c	Extend ScalarEvolution's getBackedgeTakenCount to be able to compute an upper-bound value for the trip count, in addition to the actual trip count. Use this to allow getZeroExtendExpr and getSignExtendExpr to fold casts in more cases. This may eventually morph into a more general value-range analysis capability; there are certainly plenty of places where more complete value-range information would allow more folding. llvm-svn: 70509	2009-04-30 20:47:05 +00:00
Dan Gohman	25d21786d3	Don't try to mix integers and pointers in an icmp instruction in getSCEVAtScope. llvm-svn: 70495	2009-04-30 16:40:30 +00:00
Evan Cheng	b7d41a6680	Mark MOV8mr_NOREX and MOV8rm_NOREX as mayStore / mayLoad respectively. llvm-svn: 70461	2009-04-30 00:58:57 +00:00
Chris Lattner	794fb5b4b3	fix a regression handling indirect results: these need to be considered memory operands otherwise the writebacks get lost when the inline asm doesn't otherwise have side effects. This fixes rdar://6839427, though clang really shouldn't generate these anymore. llvm-svn: 70455	2009-04-30 00:48:50 +00:00
Nate Begeman	b407809122	Fix infinite recursion in the C++ code which handles movddup by making it unnecessary. llvm-svn: 70425	2009-04-29 22:47:44 +00:00
Dan Gohman	06aff30f01	Generalize the cast-of-addrec folding to handle folding of SCEVs like (sext i8 {-128,+,1} to i64) to i64 {-128,+,1}, where the iteration crosses from negative to positive, but is still safe if the trip count is within range. llvm-svn: 70421	2009-04-29 22:28:28 +00:00
Dan Gohman	9fa631c81a	Fix this test to match the new output from scalar-evolution. llvm-svn: 70410	2009-04-29 21:06:20 +00:00
Dan Gohman	55befacc69	Include the source type in SCEV cast expression debug output, and print sext, zext, and trunc, instead of signextend, zeroextend, and truncate, respectively, for consistency with the main IR. llvm-svn: 70405	2009-04-29 20:27:52 +00:00
Dale Johannesen	15486ddd95	Fix recent regression in gcc.dg/pr26719.c (6835035). llvm-svn: 70386	2009-04-29 16:38:47 +00:00
Evan Cheng	62fdc300dd	spillPhysRegAroundRegDefsUses() may have invalidated iterators stored in fixed_ IntervalPtrs. Reset them. llvm-svn: 70378	2009-04-29 07:16:34 +00:00
Chris Lattner	e0b97f682d	testcase for PR4082 llvm-svn: 70375	2009-04-29 06:46:27 +00:00
Chris Lattner	e1eefefdc3	Disable the load-shrinking optimization from looking at anything larger than 64-bits, avoiding a crash. This should really be fixed to use APInts, though type legalization happens to help us out and we get good code on the attached testcase at least. This fixes rdar://6836460 llvm-svn: 70360	2009-04-29 03:45:07 +00:00
Bill Wendling	7546bed590	Second attempt: Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'll change the JIT with a follow-up patch. llvm-svn: 70343	2009-04-29 00:15:41 +00:00
Dan Gohman	346c77f79d	As with r70333, give the primary induction variable a use so that it can't be trivially eliminated. llvm-svn: 70334	2009-04-28 22:05:13 +00:00
Dan Gohman	5bb06cda1e	Make this testcase slightly less trivial, so that it doesn't fail if indvars happens to optimize away the unused primary induction variable. llvm-svn: 70333	2009-04-28 22:03:26 +00:00
Dan Gohman	211c5de27d	Fix a grammaro in a comment. llvm-svn: 70331	2009-04-28 21:54:23 +00:00
Anton Korobeynikov	1799ac4b55	Properly print 'P' modifier on inline asm memory operands. This should fix PR3379 and PR4064. Patch inspired by Edwin Török! llvm-svn: 70328	2009-04-28 21:49:33 +00:00
Dale Johannesen	db6d3a77dc	Test for llvm-gcc bug fixed by 70301. llvm-svn: 70302	2009-04-28 17:16:30 +00:00
Evan Cheng	754a0d2f9e	Fix PR4034. Bug in LiveInterval::join when it's compacting new valno's. llvm-svn: 70291	2009-04-28 06:24:09 +00:00
Evan Cheng	8a9736a26c	Fix for PR4051. When 2address pass delete an instruction, update kill info when necessary. llvm-svn: 70279	2009-04-28 02:12:36 +00:00
Bill Wendling	ef47ace92f	r70270 isn't ready yet. Back this out. Sorry for the noise. llvm-svn: 70275	2009-04-28 01:04:53 +00:00
Bill Wendling	2799e916c3	Massive check in. This changes the "-fast" flag to "-O#" in llc. If you want to use the old behavior, the flag is -O0. This change allows for finer-grained control over which optimizations are run at different -O levels. Most of this work was pretty mechanical. The majority of the fixes came from verifying that a "fast" variable wasn't used anymore. The JIT still uses a "Fast" flag. I'm not 100% sure if it's necessary to change it there... llvm-svn: 70270	2009-04-28 00:21:31 +00:00
Dale Johannesen	626b0a32f7	Fix PR 4086, a bug in FP IV elimination. llvm-svn: 70247	2009-04-27 21:03:15 +00:00
Evan Cheng	c315cf24e3	Fix PR4076. Correctly create live interval of physical register with two-address update. llvm-svn: 70245	2009-04-27 20:42:46 +00:00
Dan Gohman	e1a532cb4f	Permit ChangeCompareStride to rewrite a comparison when the factor between the comparison's iv stride and the candidate stride is exactly -1. llvm-svn: 70244	2009-04-27 20:35:32 +00:00
Dan Gohman	ff30ebd710	Teach getZeroExtendExpr and getSignExtendExpr to use trip-count information to simplify [sz]ext({a,+,b}) to {zext(a),+,[zs]ext(b)}, as appropriate. These functions and the trip count code each call into the other, so this requires careful handling to avoid infinite recursion. During the initial trip count computation, conservative SCEVs are used, which are subsequently discarded once the trip count is actually known. Among other benefits, this change lets LSR automatically eliminate some unnecessary zext-inreg and sext-inreg operation where the operand is an induction variable. llvm-svn: 70241	2009-04-27 20:16:15 +00:00
Dale Johannesen	2a494ee2e1	Test for (llvm-gcc) 70231. llvm-svn: 70233	2009-04-27 19:15:09 +00:00
Nate Begeman	7902a2344d	Revert accidental testcase reduction llvm-svn: 70226	2009-04-27 18:42:40 +00:00
Nate Begeman	9d121924fd	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Evan Cheng	43fc90ae59	Fix PR4056. It's possible a physical register def is dead if its implicit use is deleted by two-address pass. llvm-svn: 70213	2009-04-27 17:36:47 +00:00
Dan Gohman	4aeebb184b	Fix the syntax for a PR number in a test. llvm-svn: 70208	2009-04-27 15:08:34 +00:00
Dan Gohman	220d4325b0	Make this test slightly more strict. llvm-svn: 70180	2009-04-27 03:05:26 +00:00
Dan Gohman	744f455d55	When transforming sext(trunc(load(x))) into sext(smaller load(x)), the trunc is directly replaced with the smaller load, so don't try to create a new sext node. This fixes PR4050. llvm-svn: 70179	2009-04-27 02:00:55 +00:00
Dan Gohman	820b45049b	Handle ands with ~0 correctly too. This fixes PR4052. llvm-svn: 70176	2009-04-27 01:41:10 +00:00
Sanjiv Gupta	0877e1eabe	Any size of integral indices are allowed in gep for indexing into sequential types. Also adding a test case to check the indices type allowed into struct. llvm-svn: 70134	2009-04-26 17:14:35 +00:00
Chris Lattner	92716db3bb	add testcase for strange types of gep indices llvm-svn: 70085	2009-04-25 22:20:49 +00:00
Chris Lattner	f795b63fb2	testcase and asmparser fix for PR4066 llvm-svn: 70080	2009-04-25 21:26:00 +00:00
Dan Gohman	a7fae1f865	Add several more icmp simplifications. Transform signed comparisons into unsigned ones when the operands are known to have the same sign bit value. llvm-svn: 70053	2009-04-25 17:12:48 +00:00
Dan Gohman	9eb5ba6eb7	Handle ands with 0 and shifts by 0 correctly. These aren't common, but indvars shouldn't crash on them. This fixes PR4054. llvm-svn: 70051	2009-04-25 17:05:40 +00:00
Torok Edwin	285a5fb1d5	Fix g++-4.4.0 warning, it was causing llvm-nm to fail on wrapped BC files: Path.cpp:59: warning: case label value exceeds maximum value for type magic[0] is a (signed) char, but some case values are unsigned (e.g. 0xde). When magic[0] was 0xde, the switch has taken the default branch instead of case 0xde branch. Apparently this was the behaviour with older versions of gcc too, but not with g++. Now g++-4.4 behaves as gcc, and ignores unsigned case values out of range signed range. llvm-svn: 70038	2009-04-25 10:25:12 +00:00
Evan Cheng	696a04eba2	Do not share a single unknown val# for all the live ranges merged into a physical sub-register live interval. When coalescer is merging in clobbered virtaul register live interval into a physical register live interval, give each virtual register val# a separate val# in the physical register live interval. Otherwise, the coalescer would have lost track of the definitions information it needs to make correct coalescing decisions. llvm-svn: 70026	2009-04-25 09:25:19 +00:00
Dale Johannesen	493c3bcdc0	Fix PR 4057, a crash doing float->char const folding. This particular one is undefined behavior (although this isn't related to the crash), so it will no longer do it at compile time, which seems better. llvm-svn: 69990	2009-04-24 21:34:13 +00:00
David Greene	a28b42f818	Fix multiclass inheritance to limit value resolution to new defs added by base multiclasses. Do not attempt to alter defs from previous base multiclasses. This fixes multiple multiclass inheritance. llvm-svn: 69974	2009-04-24 16:55:41 +00:00
Rafael Espindola	4e7a0bf1f1	Fix PR 4004 by including the call to __tls_get_addr in X86tlsaddr. This is not very elegant, but neither is the tls specification :-( llvm-svn: 69968	2009-04-24 12:59:40 +00:00
Rafael Espindola	0b1037ad26	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	c1a09c7dfa	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
David Greene	a92a50c7c2	Make BinOps typed and require a type specifier for !nameconcat. This allows binops to be used in typed contexts such as when passing arguments to classes. llvm-svn: 69921	2009-04-23 21:25:15 +00:00
Dan Gohman	3499a53e1d	Explicitly pass -tailcallopt=false to these tests so that they work as intended no matter what the default setting of that option is. llvm-svn: 69911	2009-04-23 19:39:41 +00:00
Dale Johannesen	ab02315fdb	Testcase for 69795. llvm-svn: 69901	2009-04-23 18:04:04 +00:00
Dan Gohman	ea9a6d22d3	Fix an error in this test. llvm-svn: 69893	2009-04-23 15:22:28 +00:00
Dan Gohman	c0f47d6ec1	Change SCEVExpander's expandCodeFor to provide more flexibility with the persistent insertion point, and change IndVars to make use of it. This fixes a bug where IndVars was holding on to a stale insertion point and forcing the SCEVExpander to continue to use it. This fixes PR4038. llvm-svn: 69892	2009-04-23 15:16:49 +00:00
Nick Lewycky	32cfba44df	Simplify trunc(extend(x)) in SCEVs, just for completeness. Also fix some odd whitespace in the same file. llvm-svn: 69870	2009-04-23 05:15:08 +00:00
Owen Anderson	0c0498a365	Testcase for PR3909. llvm-svn: 69868	2009-04-23 04:33:42 +00:00
Owen Anderson	caa90b2561	Testcase for PR2639. llvm-svn: 69867	2009-04-23 04:30:52 +00:00
Owen Anderson	bf7354995a	Testcase for PR2537. llvm-svn: 69866	2009-04-23 04:26:42 +00:00
Owen Anderson	f04f0e15c7	Fix typo. llvm-svn: 69865	2009-04-23 04:24:19 +00:00
Owen Anderson	a1a09bc01f	Testcase for PR3085. llvm-svn: 69863	2009-04-23 04:21:14 +00:00
Owen Anderson	d4b3279a3f	Add testcase from PR3086. llvm-svn: 69862	2009-04-23 04:14:03 +00:00
Dan Gohman	4523a11557	Add more ulimit limits, to catch more kinds of runaway behavior. llvm-svn: 69847	2009-04-23 00:28:31 +00:00
Evan Cheng	bdfff0ba69	Make sure both operands have binary instructions have the same type. llvm-svn: 69844	2009-04-22 23:39:28 +00:00
Evan Cheng	2af546d5fa	Avoid deferencing use_begin() if value does not have a use. llvm-svn: 69836	2009-04-22 22:45:37 +00:00
David Greene	e41e6599cf	Allow defm to inherit from multiple multiclasses. llvm-svn: 69832	2009-04-22 22:17:51 +00:00
David Greene	0698602922	Implement !nameconcat to concatenate strings and look up the resulting name in the symbol table, returning an object. llvm-svn: 69822	2009-04-22 20:18:10 +00:00
Duncan Sands	bd414a0baa	Testcase for PR2958. llvm-svn: 69818	2009-04-22 18:55:17 +00:00
David Greene	9d99a33f27	Implement multiclass inheritance. llvm-svn: 69810	2009-04-22 16:42:54 +00:00
Dan Gohman	0ab6ecf6a1	SCEVExpander's InsertCastOfTo knows how to move existing cast instructions in order to avoid inserting new ones. However, if the cast instruction is the SCEVExpander's InsertPt, this causes subsequently emitted instructions to be inserted near the cast, and not at the location of the original insert point. Fix this by adjusting the insert point in such cases. This fixes PR4009. llvm-svn: 69808	2009-04-22 16:11:16 +00:00
Duncan Sands	6f29099800	These tests are x86 specific. llvm-svn: 69798	2009-04-22 10:39:51 +00:00
Evan Cheng	a36c6c6819	It has finally happened. Spiller is now using live interval info. This fixes a very subtle bug. vr defined by an implicit_def is allowed overlap with any register since it doesn't actually modify anything. However, if it's used as a two-address use, its live range can be extended and it can be spilled. The spiller must take care not to emit a reload for the vn number that's defined by the implicit_def. This is both a correctness and performance issue. llvm-svn: 69743	2009-04-21 22:46:52 +00:00
Dan Gohman	19990f2310	When turning (ashr(shl(x, n), n)) into sext(trunc(x)), the width of the type to truncate to should be the number of bits of the value that are preserved, not the number that are clobbered with sign-extension. This fixes regressions in ldecod. llvm-svn: 69704	2009-04-21 20:18:36 +00:00
Devang Patel	17f434a8f0	Test case for revision 69683. llvm-svn: 69684	2009-04-21 17:21:01 +00:00
Chris Lattner	95aad4d625	fix a crash on a pointless but valid zero-length memset, rdar://6808691 llvm-svn: 69680	2009-04-21 16:52:12 +00:00
Evan Cheng	c248188b46	Added a linearscan register allocation optimization. When the register allocator spill an interval with multiple uses in the same basic block, it creates a different virtual register for each of the reloads. e.g. %reg1498<def> = MOV32rm %reg1024, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg1024, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg1024, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] => %reg1498<def> = MOV32rm %reg2036, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg2037, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg2038, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] From linearscan's point of view, each of reg2036, 2037, and 2038 are separate registers, each is "killed" after a single use. The reloaded register is available and it's often clobbered right away. e.g. In thise case reg1498 is allocated EAX while reg2036 is allocated RAX. This means we end up with multiple reloads from the same stack slot in the same basic block. Now linearscan recognize there are other reloads from same SS in the same BB. So it'll "downgrade" RAX (and its aliases) after reg2036 is allocated until the next reload (reg2037) is done. This greatly increase the likihood reloads from SS are reused. This speeds up sha1 from OpenSSL by 5.8%. It is also an across the board win for SPEC2000 and 2006. llvm-svn: 69585	2009-04-20 08:01:12 +00:00
Chris Lattner	13a0dd0288	testcase for PR3898 llvm-svn: 69473	2009-04-18 20:49:22 +00:00
Duncan Sands	d2ba02aa87	Don't try to make BUILD_VECTOR operands have the same type as the vector element type: allow them to be of a wider integer type than the element type all the way through the system, and not just as far as LegalizeDAG. This should be safe because it used to be this way (the old type legalizer would produce such nodes), so backends should be able to handle it. In fact only targets which have legal vector types with an illegal promoted element type will ever see this (eg: <4 x i16> on ppc). This fixes a regression with the new type legalizer (vec_splat.ll). Also, treat SCALAR_TO_VECTOR the same as BUILD_VECTOR. After all, it is just a special case of BUILD_VECTOR. llvm-svn: 69467	2009-04-18 20:16:54 +00:00
Dale Johannesen	8a4446429e	Adjust XFAIL syntax, maybe that will help. The other way worked for me... llvm-svn: 69414	2009-04-18 02:01:23 +00:00
Dale Johannesen	05d46aca49	patch 69408 breaks this by removing the opportunity for the optimization it's testing to kick in (although it improves the code, getting rid of all spills). I don't understand the optimization well enough to rescue the test, so XFAILing. llvm-svn: 69409	2009-04-18 00:11:50 +00:00
Bob Wilson	b3e4773035	Rename file to have the correct suffix. llvm-svn: 69380	2009-04-17 20:40:20 +00:00
Bob Wilson	b8756b00cd	Use CallConvLower.h and TableGen descriptions of the calling conventions for ARM. Patch by Sandeep Patel. llvm-svn: 69371	2009-04-17 19:07:39 +00:00
Rafael Espindola	d74132e2c5	For general dynamic TLS access we must use leaq foo@TLSGD(%rip), %rdi as part of the instruction sequence. Using a register other than %rdi and then copying it to %rdi is not valid. llvm-svn: 69350	2009-04-17 14:35:58 +00:00
Evan Cheng	2d5be54315	Teach spiller to unfold instructions which modref spill slot when a scratch register is available and when it's profitable. e.g. xorq %r12<kill>, %r13 addq %rax, -184(%rbp) addq %r13, -184(%rbp) ==> xorq %r12<kill>, %r13 movq -184(%rbp), %r12 addq %rax, %r12 addq %r13, %r12 movq %r12, -184(%rbp) Two more instructions, but fewer memory accesses. It can also open up opportunities for more optimizations. llvm-svn: 69341	2009-04-17 01:29:40 +00:00
Rafael Espindola	a07d1c3103	fix PR3995. A scale must be 1, 2, 4 or 8. llvm-svn: 69284	2009-04-16 12:34:53 +00:00
Dan Gohman	98aa1d9693	Expand GEPs in ScalarEvolution expressions. SCEV expressions can now have pointer types, though in contrast to C pointer types, SCEV addition is never implicitly scaled. This not only eliminates the need for special code like IndVars' EliminatePointerRecurrence and LSR's own GEP expansion code, it also does a better job because it lets the normal optimizations handle pointer expressions just like integer expressions. Also, since LLVM IR GEPs can't directly index into multi-dimensional VLAs, moving the GEP analysis out of client code and into the SCEV framework makes it easier for clients to handle multi-dimensional VLAs the same way as other arrays. Some existing regression tests show improved optimization. test/CodeGen/ARM/2007-03-13-InstrSched.ll in particular improved to the point where if-conversion started kicking in; I turned it off for this test to preserve the intent of the test. llvm-svn: 69258	2009-04-16 03:18:22 +00:00
Dale Johannesen	040d118b17	Another testcase for IV shortening. llvm-svn: 69247	2009-04-16 00:45:21 +00:00
Bill Wendling	4153589196	Check for alignment. llvm-svn: 69140	2009-04-15 04:51:05 +00:00
Dale Johannesen	427e9aade9	Enhance induction variable code to remove the sext around sext(shorter IV + constant), using a longer IV instead, when it can figure out the add can't overflow. This comes up a lot in subscripting; mainly affects 64 bit. llvm-svn: 69123	2009-04-15 01:10:12 +00:00
Devang Patel	7323064183	While inlining, clone llvm.dbg.func.start intrinsic and adjust llvm.dbg.region.end instrinsic. This nested llvm.dbg.func.start/llvm.dbg.region.end pair now enables DW_TAG_inlined_subroutine support in code generator. llvm-svn: 69118	2009-04-15 00:17:06 +00:00
Bill Wendling	0861f3e874	Testcase for r69104. llvm-svn: 69110	2009-04-15 00:04:11 +00:00
Evan Cheng	dba98a0669	Optimize conditional branch on i1 phis with non-constant inputs. This turns: eq: %3 = icmp eq i32 %1, %2 br label %join ne: %4 = icmp ne i32 %1, %2 br label %join join: %5 = phi i1 [%3, %eq], [%4, %ne] br i1 %5, label %yes, label %no => eq: %3 = icmp eq i32 %1, %2 br i1 %3, label %yes, label %no ne: %4 = icmp ne i32 %1, %2 br i1 %4, label %yes, label %no llvm-svn: 69102	2009-04-14 23:40:03 +00:00
Dan Gohman	e1c4d4c5be	Fix the RUN lines so that this test actually tests. llvm-svn: 69096	2009-04-14 22:50:17 +00:00
Dan Gohman	365c457893	For the h-register addressing-mode trick, use the correct value for any non-address uses of the address value. This fixes 186.crafty. llvm-svn: 69094	2009-04-14 22:45:05 +00:00
Dan Gohman	3c19cf07d9	When the result of an EXTRACT_SUBREG, INSERT_SUBREG, or SUBREG_TO_REG operator is used by a CopyToReg to export the value to a different block, don't reuse the CopyToReg's register for the subreg operation result if the register isn't precisely the right class for the subreg operation. Also, rename the h-registers.ll test, now that there are more than one. llvm-svn: 69087	2009-04-14 22:17:14 +00:00
Evan Cheng	b64f2c1b08	Some of GR8_NOREX registers are only available in 64-bit mode. llvm-svn: 69049	2009-04-14 16:57:43 +00:00
Dale Johannesen	862ade6f10	Use the output of the asm so the optimizer won't delete it. llvm-svn: 69018	2009-04-14 01:51:40 +00:00
Evan Cheng	9f44d3148c	Fix PR3934 part 2. findOnlyInterestingUse() was not setting IsCopy and IsDstPhys which are returned by value and used by callee. This happened to work on the earlier test cases because of a logic error in the caller side. llvm-svn: 69006	2009-04-14 00:32:25 +00:00
Evan Cheng	fa48d5c8d0	PR3934: Fix a bogus two-address pass assertion. llvm-svn: 68979	2009-04-13 20:04:24 +00:00
Dan Gohman	be7227005f	Implement x86 h-register extract support. - Add patterns for h-register extract, which avoids a shift and mask, and in some cases a temporary register. - Add address-mode matching for turning (X>>(8-n))&(255<<n), where n is a valid address-mode scale value, into an h-register extract and a scaled-offset address. - Replace X86's MOV32to32_ and related instructions with the new target-independent COPY_TO_SUBREG instruction. On x86-64 there are complicated constraints on h registers, and CodeGen doesn't currently provide a high-level way to express all of them, so they are handled with a bunch of special code. This code currently only supports extracts where the result is used by a zero-extend or a store, though these are fairly common. These transformations are not always beneficial; since there are only 4 h registers, they sometimes require extra move instructions, and this sometimes increases register pressure because it can force out values that would otherwise be in one of those registers. However, this appears to be relatively uncommon. llvm-svn: 68962	2009-04-13 16:09:41 +00:00
Rafael Espindola	72347bffce	X86-64 TLS support for local exec and initial exec. llvm-svn: 68947	2009-04-13 13:02:49 +00:00
Chris Lattner	c1bfdc9bb2	Add a new "available_externally" linkage type. This is intended to support C99 inline, GNU extern inline, etc. Related bugzilla's include PR3517, PR3100, & PR2933. Nothing uses this yet, but it appears to work. llvm-svn: 68940	2009-04-13 05:44:34 +00:00
Rafael Espindola	ad8137187c	In X86DAGToDAGISel::MatchWrapper, if base or index are set, avoid matching only if symbolic addresses are RIP relatives. llvm-svn: 68924	2009-04-12 23:00:38 +00:00
Rafael Espindola	412b15f4ed	Add tests for the parts of X86-64 TLS that are already implemented. llvm-svn: 68901	2009-04-12 10:43:41 +00:00
Chris Lattner	6d6cf3ff4a	fix a cross-block fastisel crash handling overflow intrinsics. See comment for details. This fixes rdar://6772169 llvm-svn: 68890	2009-04-12 07:51:14 +00:00
Chris Lattner	f03202e76d	add some optimizations for strncpy/strncat and factor some code. Patch by Benjamin Kramer! llvm-svn: 68885	2009-04-12 05:06:39 +00:00
Chris Lattner	42b8e431b6	move a target-specific test into its directory so it isn't run if you don't configure the ARM target in. llvm-svn: 68843	2009-04-10 23:58:38 +00:00
Chris Lattner	0577b8e2ef	fix two problems with machine sinking: 1. Sinking would crash when the first instruction of a block was sunk due to iterator problems. 2. Instructions could be sunk to their current block, causing an infinite loop. This fixes PR3968 llvm-svn: 68787	2009-04-10 16:38:36 +00:00
Rafael Espindola	88986ef511	Don't fold a load if the other operand is a TLS address. With this we generate movl %gs:0, %eax leal i@NTPOFF(%eax), %eax instead of movl $i@NTPOFF, %eax addl %gs:0, %eax llvm-svn: 68778	2009-04-10 10:09:34 +00:00
Bob Wilson	c53238dff1	Fix pr3954. The register scavenger asserts for inline assembly with register destinations that are tied to source operands. The TargetInstrDescr::findTiedToSrcOperand method silently fails for inline assembly. The existing MachineInstr::isRegReDefinedByTwoAddr was very close to doing what is needed, so this revision makes a few changes to that method and also renames it to isRegTiedToUseOperand (for consistency with the very similar isRegTiedToDefOperand and because it handles both two-address instructions and inline assembly with tied registers). llvm-svn: 68714	2009-04-09 17:16:43 +00:00
Chris Lattner	301c4f39a0	reg0 references are not real registers. This fixes a crash on the attached testcase. llvm-svn: 68712	2009-04-09 16:50:43 +00:00
Dan Gohman	68de98eef3	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Rafael Espindola	7eb72dc5f2	Re-apply 68552. Tested by bootstrapping llvm-gcc and using that to build llvm. llvm-svn: 68645	2009-04-08 21:14:34 +00:00
Bob Wilson	e0e4a070da	Add testcase for PR3795. llvm-svn: 68620	2009-04-08 18:00:55 +00:00

... 2 3 4 5 6 ...

7080 Commits