llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 05:52:53 +02:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	e43aca1c39	Inflate register classes after coalescing. Coalescing can remove copy-like instructions with sub-register operands that constrained the register class. Examples are: x86: GR32_ABCD:sub_8bit_hi -> GR32 arm: DPR_VFP2:ssub0 -> DPR Recompute the register class of any virtual registers that are used by less instructions after coalescing. This affects code generation for the Cortex-A8 where we use NEON instructions for f32 operations, c.f. fp_convert.ll: vadd.f32 d16, d1, d0 vcvt.s32.f32 d0, d16 The register allocator is now free to use d16 for the temporary, and that comes first in the allocation order because it doesn't interfere with any s-registers. llvm-svn: 137133	2011-08-09 18:19:41 +00:00
Rafael Espindola	2da6e6a1d8	print st_shndx with the correct number of bits. llvm-svn: 136880	2011-08-04 15:50:13 +00:00
Rafael Espindola	c1a076eeb1	print st_other with the correct number of bits. llvm-svn: 136877	2011-08-04 15:38:19 +00:00
Rafael Espindola	368850841d	print st_type with the correct number of bits. llvm-svn: 136875	2011-08-04 15:24:00 +00:00
Rafael Espindola	e08bb3d50f	Print st_bind with the correct number of bits. llvm-svn: 136874	2011-08-04 15:10:35 +00:00
Rafael Espindola	865ab6cb05	Print r_sym with the correct number of bits. llvm-svn: 136873	2011-08-04 14:48:27 +00:00
Rafael Espindola	f65dd30907	Print r_type with the correct number of bits. llvm-svn: 136872	2011-08-04 14:39:30 +00:00
Rafael Espindola	edfafcbfb0	Change anther counter to decimal. llvm-svn: 136870	2011-08-04 14:01:03 +00:00
Rafael Espindola	3e8393e6f7	Don't print a counter in hex. llvm-svn: 136869	2011-08-04 13:39:15 +00:00
Benjamin Kramer	d93ac7d0b6	Remove underscore that's breaking linux buildbots. llvm-svn: 136833	2011-08-03 23:13:01 +00:00
Jakub Staszak	9d083611d4	Use MachineBranchProbabilityInfo in If-Conversion instead of its own heuristics. llvm-svn: 136826	2011-08-03 22:34:43 +00:00
Devang Patel	99a2f0d98c	Use byte offset, instead of element number, to access merged global. llvm-svn: 136759	2011-08-03 01:25:46 +00:00
Eric Christopher	96b31d5681	Add support for the 'Q' constraint. Fixes rdar://9866494 llvm-svn: 136523	2011-07-29 21:18:58 +00:00
Jakob Stoklund Olesen	cc29034b4c	Transfer implicit operands in NEONMoveFixPass. Later passes /are/ using this information when running the register scavenger. This fixes the second problem in PR10520. llvm-svn: 136440	2011-07-29 00:27:35 +00:00
Jakob Stoklund Olesen	f97f492104	Add -verify-arm-pseudo-expand. This hidden llc option runs the machine code verifier after expanding ARM pseudo-instructions, but before if-conversion. The machine code verifier is much better at pointing out liveness errors that can trip up the register scavenger. llvm-svn: 136439	2011-07-29 00:27:32 +00:00
Jakob Stoklund Olesen	5f429460ba	Handle REG_SEQUENCE with implicitly defined operands. Code like that would only be produced by bugpoint, but we should still handle it correctly. When a register is defined by a REG_SEQUENCE of undefs, the register itself is undef. Previously, we would create a register with uses but no defs. Fixes part of PR10520. llvm-svn: 136401	2011-07-28 21:38:51 +00:00
Jim Grosbach	906ecb46ed	FileCheck'ize test. llvm-svn: 136135	2011-07-26 20:49:44 +00:00
Jakob Stoklund Olesen	89e84069d2	Fix a crash when building 177.mesa for armv6. When splitting a live range immediately before an LDR_POST instruction that redefines the address register, make sure to use the correct value number in leaveIntvBefore. We need the value number entering the instruction. <rdar://problem/9793765> llvm-svn: 135413	2011-07-18 18:47:13 +00:00
Owen Anderson	7a380bac06	Remove VMOVDneon and VMOVQ, which are just aliases for VORR. This continues to simplify the path towards an auto-generated disassembler. llvm-svn: 135290	2011-07-15 18:46:47 +00:00
Eric Christopher	be21240f6f	Add a testcase for r135123. Part of rdar://9761830 llvm-svn: 135133	2011-07-14 06:23:09 +00:00
Evan Cheng	37ff73dfaf	Improve codegen for select's: if (x != 0) x = 1 if (x == 1) x = 1 Previous codegen looks like this: mov r1, r0 cmp r1, #1 mov r0, #0 moveq r0, #1 The naive lowering select between two different values. It should recognize the test is equality test so it's more a conditional move rather than a select: cmp r0, #1 movne r0, #0 rdar://9758317 llvm-svn: 135017	2011-07-13 00:42:17 +00:00
Jim Grosbach	93f2ebb5e7	Simplify printing of ARM shifted immediates. Print shifted immediate values directly rather than as a payload+shifter value pair. This makes for more readable output assembly code, simplifies the instruction printer, and is consistent with how Thumb immediates are displayed. llvm-svn: 134902	2011-07-11 16:48:36 +00:00
Cameron Zwarich	1efde78890	Add a missing test for r134882. llvm-svn: 134889	2011-07-11 08:35:17 +00:00
Jakob Stoklund Olesen	acaf9e9ce1	Be more aggressive about following hints. RAGreedy::tryAssign will now evict interference from the preferred register even when another register is free. To support this, add the EvictionCost struct that counts how many hints are broken by an eviction. We don't want to break one hint just to satisfy another. Rename canEvict to shouldEvict, and add the first bit of eviction policy that doesn't depend on spill weights: Always make room in the preferred register as long as the evictees can be split and aren't already assigned to their preferred register. Also make the CSR avoidance more accurate. When looking for a cheaper register it is OK to use a new volatile register. Only CSR aliases that have never been used before should be avoided. llvm-svn: 134735	2011-07-08 20:46:18 +00:00
Jim Grosbach	435ca7304c	Use ARMPseudoExpand for ARM tail calls. llvm-svn: 134719	2011-07-08 18:50:22 +00:00
Evan Cheng	952943f744	Change some ARM subtarget features to be single bit yes/no in order to sink them down to MC layer. Also fix tests. llvm-svn: 134590	2011-07-07 03:55:05 +00:00
Chandler Carruth	1926e141f1	FileCheck-ize and simplify RUN lines. llvm-svn: 134352	2011-07-02 20:43:11 +00:00
Eric Christopher	d369a9fe83	Add support for the 'j' immediate constraint. This is conditionalized on supporting the instruction that the constraint is for 'movw'. Part of rdar://9119939 llvm-svn: 134222	2011-07-01 01:00:07 +00:00
Eric Christopher	4bc6b7e1a6	Add support for the ARM 't' register constraint. And another testcase for the 'x' register constraint. Part of rdar://9119939 llvm-svn: 134220	2011-07-01 00:30:46 +00:00
Eric Christopher	d40f06b48f	Add support for the 'x' constraint. Part of rdar://9307836 and rdar://9119939 llvm-svn: 134215	2011-07-01 00:14:47 +00:00
Cameron Zwarich	2ffbcf9b96	In the ARM global merging pass, allow extraneous alignment specifiers. This pass already makes the assumption, which is correct on ARM, that a type's alignment is less than its alloc size. This improves codegen with Clang (which inserts a lot of extraneous alignment specifiers) and fixes <rdar://problem/9695089>. llvm-svn: 134106	2011-06-29 22:24:25 +00:00
Benjamin Kramer	d97872524b	Don't depend on the optimization reverted in r134067. llvm-svn: 134068	2011-06-29 14:07:18 +00:00
Eric Christopher	bb65f96b18	Allow lr in the register options here. llvm-svn: 133935	2011-06-27 20:31:01 +00:00
Chad Rosier	3127a19140	The Neon VCVT (between floating-point and fixed-point, Advanced SIMD) instructions can be used to match combinations of multiply/divide and VCVT (between floating-point and integer, Advanced SIMD). Basically the VCVT immediate operand that specifies the number of fraction bits corresponds to a floating-point multiply or divide by the corresponding power of 2. For example, VCVT (floating-point to fixed-point, Advanced SIMD) can replace a combination of VMUL and VCVT (floating-point to integer) as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vmul.f32 d16, d17, d16 vcvt.s32.f32 d16, d16 becomes: vcvt.s32.f32 d16, d16, #3 Similarly, VCVT (fixed-point to floating-point, Advanced SIMD) can replace a combinations of VCVT (integer to floating-point) and VDIV as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vcvt.f32.s32 d16, d16 vdiv.f32 d16, d17, d16 becomes: vcvt.f32.s32 d16, d16, #3 llvm-svn: 133813	2011-06-24 19:23:04 +00:00
Nick Lewycky	7f45c2bd84	Needs a triple. llvm-svn: 133634	2011-06-22 19:42:14 +00:00
Nick Lewycky	bf55e4b776	Emit trailing padding on constant vectors when TargetData says that the vector is larger than the sum of the elements (including per-element padding). llvm-svn: 133631	2011-06-22 18:55:03 +00:00
Devang Patel	f610afdefb	Test case for r133560. llvm-svn: 133585	2011-06-22 00:03:42 +00:00
Evan Cheng	40adfc21f6	Teach dag combine to match halfword byteswap patterns. 1. (((x) & 0xFF00) >> 8) \| (((x) & 0x00FF) << 8) => (bswap x) >> 16 2. ((x&0xff)<<8)\|((x&0xff00)>>8)\|((x&0xff000000)>>8)\|((x&0x00ff0000)<<8)) => (rotl (bswap x) 16) This allows us to eliminate most of the def : Pat patterns for ARM rev16 revsh instructions. It catches many more cases for ARM and x86. rdar://9609108 llvm-svn: 133503	2011-06-21 06:01:08 +00:00
Chris Lattner	ad5400fa72	rip out a ton of intrinsic modernization logic from AutoUpgrade.cpp, which is for pre-2.9 bitcode files. We keep x86 unaligned loads, movnt, crc32, and the target indep prefetch change. As usual, updating the testsuite is a PITA. llvm-svn: 133337	2011-06-18 06:05:24 +00:00
Evan Cheng	df9192b200	Add an alternative rev16 pattern. We should figure out a better way to handle these complex rev patterns. rdar://9609108 llvm-svn: 133289	2011-06-17 20:47:21 +00:00
Chris Lattner	0899957b99	make the asmparser reject function and type redefinitions. 'Merging' hasn't been needed since llvm-gcc 3.4 days. llvm-svn: 133248	2011-06-17 07:06:44 +00:00
Chris Lattner	4eb6f76fa6	Remove support for using "foo" as symbols instead of %"foo". This is ancient syntax and has been long obsolete. As usual, updating the tests is the nasty part of this. llvm-svn: 133242	2011-06-17 06:36:20 +00:00
Chris Lattner	9ec82f54d4	manually upgrade a bunch of tests to modern syntax, and remove some that are either unreduced or only test old syntax. llvm-svn: 133228	2011-06-17 03:14:27 +00:00
Cameron Zwarich	681f02ec26	Update an insertion point iterator after replacing a return instruction with a tail call pseudoinstruction. This fixes <rdar://problem/9624333>. llvm-svn: 133227	2011-06-17 02:16:43 +00:00
Eli Friedman	014d4feac5	Force a triple here so this test doesn't fail on EABI hosts (like clang-native-arm-cortex-a9). llvm-svn: 133134	2011-06-16 01:49:31 +00:00
Chad Rosier	26513932a2	Typos. llvm-svn: 133128	2011-06-16 01:24:24 +00:00
Chad Rosier	66fa658a4b	Revision r128665 added an optimization to make use of NEON multiplier accumulator forwarding. Specifically (from SVN log entry): Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 Make sure it catches cases where operand 1 is add/fadd/sub/fsub, which was intended in the original revision. llvm-svn: 133127	2011-06-16 01:21:54 +00:00
Rafael Espindola	8edd93b519	Testcase for previous commit. llvm-svn: 133089	2011-06-15 21:18:51 +00:00
Evan Cheng	30f84a59ae	Another revsh pattern. rdar://9609059 llvm-svn: 133064	2011-06-15 17:17:48 +00:00
Evan Cheng	7624839811	PerformBFICombine - (bfi A, (and B, Mask1), Mask2) -> (bfi A, B, Mask2) iff the bits being cleared by the AND are not demanded by the BFI. The previous BFI dag combine rule was actually incorrect (or used to be correct until BFI representation changed). rdar://9609030 llvm-svn: 133034	2011-06-15 01:12:31 +00:00
Tanya Lattner	5ee64fc868	Add an optimization that looks for a specific pair-wise add pattern and generates a vpaddl instruction instead of scalarizing the add. Includes a test case. llvm-svn: 133027	2011-06-14 23:48:48 +00:00
Bruno Cardoso Lopes	15b9096112	Since ARM's prefetch implementation predicted the presence of a instruction cache prefetch and now that the info from "prefetch" to "ARMPreload" is present, only add a testcase for PLI. llvm-svn: 132978	2011-06-14 05:11:46 +00:00
Bruno Cardoso Lopes	b6afc5168f	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Jakob Stoklund Olesen	2cac2ea7a1	Be less aggressive about hinting in RAFast. In particular, don't spill dirty registers only to satisfy a hint. It is not worth it. The attached test case provides an example where the fast allocator would spill a register when other registers are available. llvm-svn: 132900	2011-06-13 03:26:46 +00:00
Cameron Zwarich	af47f4a117	A CCState was being created without setting whether it is in the Call or Prologue state, causing an assertion failure downstream. This fixes <rdar://problem/9562908>. This really seems like it should always be set at CCState creation time, so mistakes like this can never happen. I'll take a look at doing that. llvm-svn: 132811	2011-06-09 22:30:07 +00:00
Eric Christopher	bd0677f8db	Another possible bug. Stopgap until we can autogenerate tables and constraint lengths. Part of rdar://9037836 and rdar://9119939 llvm-svn: 132598	2011-06-03 22:09:12 +00:00
Eric Christopher	51ff48ad30	Fix an off by one error. Part of rdar://9037836 and rdar://9119939 llvm-svn: 132590	2011-06-03 20:44:52 +00:00
Eric Christopher	e831655dd9	Make the Uv constraint a memory operand. This doesn't solve the addressing mode problem mentioned in r132559. Backend part of rdar://9037836 and part of rdar://9119939 llvm-svn: 132561	2011-06-03 17:24:37 +00:00
Eli Friedman	eae10d6163	Add ARM fast-isel support for materializing the address of a global in cases where the global uses an indirect symbol. rdar://9431157 llvm-svn: 132522	2011-06-03 01:13:19 +00:00
Devang Patel	1c30f3ac27	During post RA scheduling, do not try to chase reg defs. to preserve DBG_VALUEs. This approach has several downsides, for example, it does not work when dbg value is a constant integer, it does not work if reg is defined more than once, it places end of debug value range markers in the wrong place. It even causes misleading incorrect debug info when duplicate DBG_VALUE instructions point to same reg def. Instead, use simpler approach and let DBG_VALUE follow its predecessor instruction. After live debug value analysis pass, all DBG_VALUE instruction are placed at the right place. Thanks Jakob for the hint! llvm-svn: 132483	2011-06-02 20:07:12 +00:00
Eric Christopher	9fe91039e4	Allow bitcasts between valid types of the same size and vector types if the vector type is legal. Fixes rdar://9306086 llvm-svn: 132420	2011-06-01 19:55:10 +00:00
John McCall	64ff21faa7	On Darwin ARM, set the UNWIND_RESUME libcall to _Unwind_SjLj_Resume. This is important for the correct lowering of unwind instructions (which doesn't matter at all) and llvm.eh.resume calls (which does). Take 2, now with more basic competence. llvm-svn: 132295	2011-05-29 19:50:32 +00:00
John McCall	ffdb2d5e70	I didn't mean to commit these residues of a personal project. llvm-svn: 132293	2011-05-29 19:41:56 +00:00
John McCall	46c7b963b2	On Darwin ARM, set the UNWIND_RESUME libcall to _Unwind_SjLj_Resume. This is important for the correct lowering of unwind instructions (which doesn't matter at all) and llvm.eh.resume calls (which does). llvm-svn: 132291	2011-05-29 19:39:04 +00:00
Bruno Cardoso Lopes	6d5e369a10	Add support for ARM ldrexd/strexd intrinsics. They both use i32 register pairs to load/store i64 values. Since there's no current support to explicitly declare such restrictions, implement it by using specific hardcoded register pairs during isel. llvm-svn: 132248	2011-05-28 04:07:29 +00:00
Eric Christopher	000dd7d0e6	Implement the 'M' output modifier for arm inline asm. This is fairly register allocation dependent and will occasionally break. WIP in the register allocator to model paired/etc registers. rdar://9119939 llvm-svn: 132242	2011-05-28 01:40:44 +00:00
Cameron Zwarich	cd3c1b5829	Fix the remaining atomic intrinsics to use the right register classes on Thumb2, and add some basic tests for them. llvm-svn: 132235	2011-05-27 23:54:00 +00:00
Rafael Espindola	2230168a0f	Make size computation less brittle. llvm-svn: 132222	2011-05-27 22:05:41 +00:00
Jakob Stoklund Olesen	516eb93107	Make room for register allocation to improve. llvm-svn: 132213	2011-05-27 20:15:06 +00:00
Evan Cheng	0fcb465bab	Don't use movw / movt for iOS static codegen for now to workaround some tools issues. rdar://9514789 llvm-svn: 132211	2011-05-27 20:11:27 +00:00
Evan Cheng	4192d53d1e	Add iOS test llvm-svn: 132203	2011-05-27 19:04:21 +00:00
Eli Friedman	55343ef7bb	And fix the test in r132194. llvm-svn: 132196	2011-05-27 18:14:28 +00:00
Eli Friedman	560532051b	Fix a silly mistake (which trips over an assertion) in r132099. rdar://9515076 llvm-svn: 132194	2011-05-27 18:02:04 +00:00
Devang Patel	e0b7ab9296	During branch folding avoid inserting redundant DBG_VALUE machine instructions. llvm-svn: 132148	2011-05-26 21:47:59 +00:00
Eli Friedman	93ffb875ad	Rewrite fast-isel integer cast handling to handle more cases, and to be simpler and more consistent. The practical effects here are that x86-64 fast-isel can now handle trunc from i8 to i1, and ARM fast-isel can handle many more constructs involving integers narrower than 32 bits (including loads, stores, and many integer casts). rdar://9437928 . llvm-svn: 132099	2011-05-25 23:49:02 +00:00
Eric Christopher	807da21e47	Implement the 'm' modifier. Note that it only works for memory operands. Part of rdar://9119939 llvm-svn: 132081	2011-05-25 20:51:58 +00:00
Cameron Zwarich	beae5f20e8	Make tTAILJMPr/tTAILJMPrND emit a tBX without a preceding MOV of PC to LR. This fixes <rdar://problem/9495913> llvm-svn: 132042	2011-05-25 04:45:27 +00:00
Eric Christopher	4f193f9555	Implement the arm 'L' asm modifier. Part of rdar://9119939 llvm-svn: 132024	2011-05-24 23:27:13 +00:00
Eric Christopher	a6d7ccb170	Implement the immediate part of the 'B' modifier. Part of rdar://9119939 llvm-svn: 132023	2011-05-24 23:15:43 +00:00
Eric Christopher	03965fa3b6	Add support for the arm 'y' asm modifier. Fixes part of rdar://9444657 llvm-svn: 132011	2011-05-24 22:10:34 +00:00
Cameron Zwarich	5a416bda73	Fix <rdar://problem/9476260> by having tail calls always generate 32-bit branches in Darwin Thumb2 code. Tail calls are already disabled on Thumb1. llvm-svn: 131894	2011-05-23 01:57:17 +00:00
Renato Golin	759db3cbe3	RTABI chapter 4.3.4 specifies __eabi_mem* calls. Specifically, __eabi_memset accepts parameters (ptr, size, value) in a different order than GNU's memset (ptr, value, size), therefore the special lowering in AAPCS mode. Implementation by Evzen Muller. llvm-svn: 131868	2011-05-22 21:41:23 +00:00
Tanya Lattner	6814933ea6	Handle perfect shuffle case that generates a vrev for vectors of floats. Add test case. llvm-svn: 131582	2011-05-18 21:44:54 +00:00
Tanya Lattner	06cb9cbf98	In r131488 I misunderstood how VREV works. It splits the vector in half and splits each half. Therefore, the real problem was that we were using a VREV64 for a 4xi16, when we should have been using a VREV32. Updated test case and reverted change to the PerfectShuffle Table. llvm-svn: 131529	2011-05-18 06:42:21 +00:00
Tanya Lattner	7145d69427	vrev is incorrectly defined in the perfect shuffle table. The ordering is backwards (should be 0x3210 versus 0x1032) which exposed a bug when doing a shuffle on a 4xi16. I've attached a test case. llvm-svn: 131488	2011-05-17 20:48:40 +00:00
Jakob Stoklund Olesen	16f11212fc	Teach LiveInterval::isZeroLength about null SlotIndexes. When instructions are deleted, they leave tombstone SlotIndex entries. The isZeroLength method should ignore these null indexes. This causes RABasic to sometimes spill a callee-saved register in the abi-isel.ll test, so don't run that test with -regalloc=basic. Prioritizing register allocation according to spill weight can cause more registers to be used. llvm-svn: 131436	2011-05-16 23:50:05 +00:00
Galina Kistanova	f8f6de03c6	Correction. Use explicit target triple in the test. llvm-svn: 131252	2011-05-12 21:55:34 +00:00
Nadav Rotem	57dd315a3b	Fixes a bug in the DAGCombiner. LoadSDNodes have two values (data, chain). If there is a store after the load node, then there is a chain, which means that there is another user. Thus, asking hasOneUser would fail. Instead we ask hasNUsesOfValue on the 'data' value. llvm-svn: 131183	2011-05-11 14:40:50 +00:00
Rafael Espindola	46b0ce1b5f	Produce a __debug_frame section on darwin ARM when appropriate. llvm-svn: 131151	2011-05-10 21:04:45 +00:00
Dan Gohman	62dbd536c0	Give this test an explicit register allocator, so that it can work even if the default register allocator is changed. llvm-svn: 130883	2011-05-04 23:14:02 +00:00
Bill Wendling	279e17e523	SjLj EH could produce a machine basic block that legitimately has more than one landing pad as its successor. SjLj exception handling jumps to the correct landing pad via a switch statement that's generated right before code-gen. Loosen the constraint in the machine instruction verifier to allow for this. Note, this isn't the most rigorous check since we cannot determine where that switch statement came from. But it's marginally better than turning this check off when SjLj exceptions are used. <rdar://problem/9187612> llvm-svn: 130881	2011-05-04 22:54:05 +00:00
Galina Kistanova	3b29721cf4	This test fails on ARM. The test shouldn't explicitly specify alignment (and alignment 4 is wrong) and requires hard-float. llvm-svn: 130875	2011-05-04 21:57:44 +00:00
Devang Patel	8823e24dde	Do not emit location expression size twice. llvm-svn: 130854	2011-05-04 19:00:57 +00:00
Jakob Stoklund Olesen	4755a27704	Fix a bunch of ARM tests to be register allocation independent. llvm-svn: 130800	2011-05-03 22:31:21 +00:00
Evan Cheng	3d6402587f	Make the test less likely to fail with minor changes. llvm-svn: 130778	2011-05-03 19:09:32 +00:00
Bob Wilson	78011dcf2e	Remove test for iOS divmod function, since that is disabled for now. llvm-svn: 130769	2011-05-03 17:54:49 +00:00
Bruno Cardoso Lopes	9dd575e4a9	Add a few ARM coprocessor intrinsics. Testcases included llvm-svn: 130763	2011-05-03 17:29:22 +00:00
Dan Gohman	7beb845bab	Add an unfolded offset field to LSR's Formula record. This is used to model constants which can be added to base registers via add-immediate instructions which don't require an additional register to materialize the immediate. llvm-svn: 130743	2011-05-03 00:46:49 +00:00
Jakob Stoklund Olesen	2db84c62f6	Weekly fix of register allocation dependent unit tests. llvm-svn: 130567	2011-04-30 01:37:52 +00:00
Eli Friedman	919bf1ca71	Make FastEmit_ri_ try a bit harder to succeed for supported operations; FastEmit_i can fail for non-Thumb2 ARM. Makes ARMSimplifyAddress work correctly, and reduces the number of fast-isel bailouts on non-Thumb ARM. llvm-svn: 130560	2011-04-29 23:34:52 +00:00
Eli Friedman	1940912660	Switch to ImmLeaf (which can be used by FastISel) for a few more common ARM/Thumb2 patterns. llvm-svn: 130552	2011-04-29 22:48:03 +00:00
Eli Friedman	8f66e9361d	Fix run-line, again. :( llvm-svn: 130540	2011-04-29 21:33:03 +00:00
Eli Friedman	7d05eaa3f4	Re-committing r130454, which does not in fact break anything. Fix a rather obscure crash caused by ARM fast-isel generating code which redefines a register. rdar://problem/9338332 . llvm-svn: 130539	2011-04-29 21:22:56 +00:00
Eric Christopher	ba0cef1b0b	Add trunc->branch support, this won't help with clang's i8->i1 truncations for bools, but is a start. llvm-svn: 130534	2011-04-29 20:02:39 +00:00
Eli Friedman	a93906d0c3	Revert r130454; apparently this doesn't actually work. llvm-svn: 130462	2011-04-28 23:55:14 +00:00
Eli Friedman	83b734b444	Fix runline. llvm-svn: 130455	2011-04-28 23:12:24 +00:00
Eli Friedman	9a80f23666	Fix a rather obscure crash caused by ARM fast-isel generating code which redefines a register. rdar://problem/9338332 . llvm-svn: 130454	2011-04-28 23:03:25 +00:00
Devang Patel	900ceb725b	Teach dwarf writer to handle complex address expression for .debug_loc entries. This fixes clang generated blocks' variables' debug info. Radar 9279956. llvm-svn: 130373	2011-04-28 02:22:40 +00:00
Evan Cheng	fa34d31aa4	If converter was being too cute. It look for root BBs (which don't have successors) and use inverse depth first search to traverse the BBs. However that doesn't work when the CFG has infinite loops. Simply do a linear traversal of all BBs work just fine. rdar://9344645 llvm-svn: 130324	2011-04-27 19:32:43 +00:00
Jakob Stoklund Olesen	adb564f3cd	Also add <imp-def> operands for defined and dead super-registers when rewriting. We cannot rely on the <imp-def> operands added by LiveIntervals in all cases as demonstrated by the test case. llvm-svn: 130313	2011-04-27 17:42:31 +00:00
Evan Cheng	dea3347167	Be careful about scheduling nodes above previous calls. It increase usages of more callee-saved registers and introduce copies. Only allows it if scheduling a node above calls would end up lessen register pressure. Call operands also has added ABI restrictions for register allocation, so be extra careful with hoisting them above calls. rdar://9329627 llvm-svn: 130245	2011-04-26 21:31:35 +00:00
Evan Cheng	ffcb599719	This test should be in MC. It breaks with changes to scheduling / register allocation so it's being removed. llvm-svn: 130243	2011-04-26 21:09:04 +00:00
Chris Lattner	37fec9f729	don't emit the symbol name twice for local bss and common symbols. For example, don't emit: .comm _i,4,2 ## @i ## @i instead emit: .comm _i,4,2 ## @i llvm-svn: 130192	2011-04-26 06:14:13 +00:00
Eric Christopher	2fbd7a6280	Make this test disable fast isel as it's not needed. llvm-svn: 130165	2011-04-25 22:39:46 +00:00
Benjamin Kramer	b2992c34b5	Make tests more useful. lit needs a linter ... llvm-svn: 130126	2011-04-25 10:12:01 +00:00
Andrew Trick	a130d110d1	Thumb2 and ARM add/subtract with carry fixes. Fixes Thumb2 ADCS and SBCS lowering: <rdar://problem/9275821>. t2ADCS/t2SBCS are now pseudo instructions, consistent with ARM, so the assembly printer correctly prints the 's' suffix. Fixes Thumb2 adde -> SBC matching to check for live/dead carry flags. Fixes the internal ARM machine opcode mnemonic for ADCS/SBCS. Fixes ARM SBC lowering to check for live carry (potential bug). llvm-svn: 130048	2011-04-23 03:55:32 +00:00
Devang Patel	692ae3cdc6	Fix DWARF description of Q registers. llvm-svn: 129952	2011-04-21 23:22:35 +00:00
Devang Patel	85b3a170f5	Fix DWARF description of S registers. llvm-svn: 129947	2011-04-21 22:48:26 +00:00
Devang Patel	6f8d1e876c	Test case for r129922 llvm-svn: 129934	2011-04-21 20:16:43 +00:00
Evan Cheng	28877b11a2	Remove -use-divmod-libcall. Let targets opt in when they are available. llvm-svn: 129884	2011-04-20 22:20:12 +00:00
Eric Christopher	4c3c7c8211	Rewrite the expander for umulo/smulo to remember to sign extend the input manually and pass all (now) 4 arguments to the mul libcall. Add a new ExpandLibCall for just this (copied gratuitously from type legalization). Fixes rdar://9292577 llvm-svn: 129842	2011-04-20 01:19:45 +00:00
Daniel Dunbar	140e365c49	CodeGen: Eliminate a use of getDarwinMajorNumber(). - There is a minor semantic change here (evidenced by the test change) for Darwin triples that have no version component. I debated changing the default behavior of isOSVersionLT, but decided it made more sense for triples to be explicit. llvm-svn: 129802	2011-04-19 20:32:39 +00:00
Bob Wilson	3daeb462cb	This patch combines several changes from Evan Cheng for rdar://8659675. Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Enable these fp vmlx codegen changes for Cortex-A9. llvm-svn: 129775	2011-04-19 18:11:57 +00:00
Bob Wilson	56f64ab701	Add -mcpu=cortex-a9-mp. It's cortex-a9 with MP extension. rdar://8648637. llvm-svn: 129774	2011-04-19 18:11:52 +00:00
Bob Wilson	0cbbc50f26	Avoid some 's' 16-bit instruction which partially update CPSR (and add false dependency) when it isn't dependent on last CPSR defining instruction. rdar://8928208 llvm-svn: 129773	2011-04-19 18:11:49 +00:00
Bob Wilson	886994b683	Avoid write-after-write issue hazards for Cortex-A9. Add a avoidWriteAfterWrite() target hook to identify register classes that suffer from write-after-write hazards. For those register classes, try to avoid writing the same register in two consecutive instructions. This is currently disabled by default. We should not spill to avoid hazards! The command line flag -avoid-waw-hazard can be used to enable waw avoidance. llvm-svn: 129772	2011-04-19 18:11:45 +00:00
Jakob Stoklund Olesen	c84b16717b	Tighten test case a bit. Ideally, we would match an S-register to its containing D-register, but that requires arithmetic (divide by 2). llvm-svn: 129756	2011-04-19 06:14:45 +00:00
Jakob Stoklund Olesen	c9861cc9f6	Make tests register allocation independent again. llvm-svn: 129739	2011-04-19 00:14:43 +00:00
Evan Cheng	56c151cba9	Do not lose mem_operands while lowering VLD / VST intrinsics. llvm-svn: 129738	2011-04-19 00:04:03 +00:00
Eric Christopher	e1103d0a86	Fix a bug where we were counting the alias sets as completely used registers for fast allocation a different way. This has us updating used registers only when we're using that exact register. Fixes rdar://9207598 llvm-svn: 129711	2011-04-18 19:26:25 +00:00
Evan Cheng	b720f37282	Fix divmod libcall lowering. Convert to {S\|U}DIVREM first and then expand the node to a libcall. rdar://9280991 llvm-svn: 129633	2011-04-16 03:08:26 +00:00
Cameron Zwarich	5e9c2506d8	Add ORR and EOR to the CMP peephole optimizer. It's hard to get isel to generate a case involving EOR, so I only added a test for ORR. llvm-svn: 129610	2011-04-15 21:24:38 +00:00
Cameron Zwarich	05fb4f0c81	The AND instruction leaves the V flag unmodified, so it falls victim to the same problem as all of the other instructions we fold with CMPs. llvm-svn: 129602	2011-04-15 20:45:00 +00:00
Cameron Zwarich	ddbf79c32b	Add missing register forms of instructions to the ARM CMP-folding code. This fixes <rdar://problem/9287901>. llvm-svn: 129599	2011-04-15 20:28:28 +00:00
Evan Cheng	f33f509d45	Fix another fcopysign lowering bug. If src is f64 and destination is f32, don't forget to right shift the source by 32 first. rdar://9287902 llvm-svn: 129556	2011-04-15 01:31:00 +00:00
Cameron Zwarich	6b4e85338c	Fix a typo in an ARM-specific DAG combine. This fixes <rdar://problem/9278274>. llvm-svn: 129468	2011-04-13 21:01:19 +00:00
Cameron Zwarich	37f1db39c4	Fix an obvious problem with an alignment computation. AsmPrinter actually does the max itself, so it is not easy to write a test case for this, but I added a test case that would fail if the code in AsmPrinter were removed. llvm-svn: 129432	2011-04-13 09:02:43 +00:00
Cameron Zwarich	3f06fb96e5	If a global variable has a specified alignment that is less than the preferred alignment for its type, use the minimum of the specified alignment and the ABI alignment. This fixes <rdar://problem/9275290>. llvm-svn: 129428	2011-04-13 06:03:16 +00:00
Andrew Trick	916e01c917	Recommit r129383. PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. Additional fixes: Do something reasonable for subtargets with generic itineraries by handle node latency the same as for an empty itinerary. Now nodes default to unit latency unless an itinerary explicitly specifies a zero cycle stage or it is a TokenFactor chain. Original fixes: UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make the ndoe latency adjustments work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129421	2011-04-13 00:38:32 +00:00
Eric Christopher	147cad907a	Temporarily revert r129408 to see if it brings the bots back. llvm-svn: 129417	2011-04-13 00:20:59 +00:00
Eric Christopher	c72bd6024f	Fix a bug where we were counting the alias sets as completely used registers for fast allocation. Fixes rdar://9207598 llvm-svn: 129408	2011-04-12 23:23:14 +00:00
Andrew Trick	d83e7b6a5d	Revert 129383. It causes some targets to hit a scheduler assert. llvm-svn: 129385	2011-04-12 20:14:07 +00:00
Andrew Trick	1e0821075d	PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make these heuristic adjustments to node latency work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129383	2011-04-12 19:54:36 +00:00
Cameron Zwarich	c05412175e	Split a store of a VMOVDRR into two integer stores to avoid mixing NEON and ARM stores of arguments in the same cache line. This fixes the second half of <rdar://problem/8674845>. llvm-svn: 129345	2011-04-12 02:24:17 +00:00
Evan Cheng	ea0d287a8a	Look pass copies when determining whether hoisting would end up inserting more copies. rdar://9266679 llvm-svn: 129297	2011-04-11 21:09:18 +00:00
Chris Lattner	9fb9788a47	remove a bunch of CHECK lines that aren't checking what they thought they were, because alternation was expanding wrong in {{}}'s. llvm-svn: 129194	2011-04-09 06:31:06 +00:00
Chris Lattner	de62b962e8	don't test for codegen of 'store undef' llvm-svn: 129184	2011-04-09 02:31:26 +00:00
Evan Cheng	bc053100af	Change -arm-trap-func= into a non-arm specific option. Now Intrinsic::trap is lowered into a call to the specified trap function at sdisel time. llvm-svn: 129152	2011-04-08 21:37:21 +00:00
Evan Cheng	9049eb2113	Add option to emit @llvm.trap as a function call instead of a trap instruction. rdar://9249183. llvm-svn: 129107	2011-04-07 20:31:12 +00:00
Andrew Trick	36a1759769	Added a check in the preRA scheduler for potential interference on a induction variable. The preRA scheduler is unaware of induction vars, so we look for potential "virtual register cycles" instead. Fixes <rdar://problem/8946719> Bad scheduling prevents coalescing llvm-svn: 129100	2011-04-07 19:54:57 +00:00
Tanya Lattner	3deb96fad7	Prevent ARM DAG Combiner from doing an AND or OR combine on an illegal vector type (vectors of size 3). Also included test cases. llvm-svn: 129074	2011-04-07 15:24:20 +00:00
Evan Cheng	859dff2c87	Change -arm-divmod-libcall to a target neutral option. llvm-svn: 129045	2011-04-07 00:58:44 +00:00
Owen Anderson	37b60bdf09	Teach the ARM peephole optimizer that RSB, RSC, ADC, and SBC can be used for folded comparisons, just like ADD and SUB. llvm-svn: 129038	2011-04-06 23:35:59 +00:00
Jakob Stoklund Olesen	a0e0f8d74b	These tests no longer require linear scan because reserved register coalescing is now universal. llvm-svn: 128936	2011-04-05 21:40:41 +00:00
Johnny Chen	8b1acb8d9b	Fix test-llvm failures. llvm-svn: 128906	2011-04-05 18:41:40 +00:00
Eric Christopher	b126193e19	Fix up testcase for previous commit. llvm-svn: 128870	2011-04-05 00:56:01 +00:00
Cameron Zwarich	9573b6277e	Do some peephole optimizations to remove pointless VMOVs from Neon to integer registers that arise from argument shuffling with the soft float ABI. These instructions are particularly slow on Cortex A8. This fixes one half of <rdar://problem/8674845>. llvm-svn: 128759	2011-04-02 02:40:43 +00:00
Jim Grosbach	039844acc5	LDRD/STRD instructions should print both Rt and Rt2 in the asm string. llvm-svn: 128736	2011-04-01 20:26:57 +00:00
Evan Cheng	830f695385	Add test case. llvm-svn: 128707	2011-04-01 06:27:25 +00:00
Evan Cheng	985215c699	FileCheck'ify test. llvm-svn: 128706	2011-04-01 03:36:33 +00:00
Jakob Stoklund Olesen	33f01d005c	Fix ARM tests to be register allocator independent. llvm-svn: 128680	2011-03-31 22:14:03 +00:00
Evan Cheng	64850406cf	Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 llvm-svn: 128665	2011-03-31 19:38:48 +00:00
Jakob Stoklund Olesen	e72dfb1c45	Pick a conservative register class when creating a small live range for remat. The rematerialized instruction may require a more constrained register class than the register being spilled. In the test case, the spilled register has been inflated to the DPR register class, but we are rematerializing a load of the ssub_0 sub-register which only exists for DPR_VFP2 registers. The register class is reinflated after spilling, so the conservative choice is only temporary. llvm-svn: 128610	2011-03-31 03:54:44 +00:00
Cameron Zwarich	1b8f91d2c8	Add a ARM-specific SD node for VBSL so that forms with a constant first operand can be recognized. This fixes <rdar://problem/9183078>. llvm-svn: 128584	2011-03-30 23:01:21 +00:00
Evan Cheng	ed09135349	Add intrinsics @llvm.arm.neon.vmulls and @llvm.arm.neon.vmullu.* back. Frontends was lowering them to sext / uxt + mul instructions. Unfortunately the optimization passes may hoist the extensions out of the loop and separate them. When that happens, the long multiplication instructions can be broken into several scalar instructions, causing significant performance issue. Note the vmla and vmls intrinsics are not added back. Frontend will codegen them as intrinsics vmull* + add / sub. Also note the isel optimizations for catching mul + sext / zext are not changed either. First part of rdar://8832507, rdar://9203134 llvm-svn: 128502	2011-03-29 23:06:19 +00:00
Cameron Zwarich	95260e5ebb	Add Neon SINT_TO_FP and UINT_TO_FP lowering from v4i16 to v4f32. Fixes <rdar://problem/8875309> and <rdar://problem/9057191>. llvm-svn: 128492	2011-03-29 21:41:55 +00:00
Evan Cheng	5bcaef9cc9	Optimizing (zext A + zext B) * C, to (VMULL A, C) + (VMULL B, C) during isel lowering to fold the zero-extend's and take advantage of no-stall back to back vmul + vmla: vmull q0, d4, d6 vmlal q0, d5, d6 is faster than vaddl q0, d4, d5 vmovl q1, d6 vmul q0, q0, q1 This allows us to vmull + vmlal for: f = vmull_u8( vget_high_u8(s), c); f = vmlal_u8(f, vget_low_u8(s), c); rdar://9197392 llvm-svn: 128444	2011-03-29 01:56:09 +00:00
Devang Patel	2cea16e9bb	Enable GlobalMerge on darwin. llvm-svn: 128183	2011-03-23 23:34:19 +00:00
Evan Cheng	6e799c3c58	Cmp peephole optimization isn't always safe for signed arithmetics. int tries = INT_MAX; while (tries > 0) { tries--; } The check should be: subs r4, #1 cmp r4, #0 bgt LBB0_1 The subs can set the overflow V bit when r4 is INT_MAX+1 (which loop canonicalization apparently does in this case). cmp #0 would have cleared it while not changing the N and Z bits. Since BGT is dependent on the V bit, i.e. (N == V) && !Z, it is not safe to eliminate the cmp #0. rdar://9172742 llvm-svn: 128179	2011-03-23 22:52:04 +00:00
Rafael Espindola	b5c6ae67ac	Write the section table and the section data in the same order that gun as does. This makes it a lot easier to compare the output of both as the addresses are now a lot closer. llvm-svn: 127972	2011-03-20 18:44:20 +00:00
Evan Cheng	93d04c1c00	Match a few more obvious patterns to revsh. rdar://9147637. llvm-svn: 127913	2011-03-18 21:52:42 +00:00
Daniel Dunbar	8757b8c000	Revert r127757, "Patch to a fix dwarf relocation problem on ARM. One-line fix plus the test where it used to break.", which broke Clang self-host of a Debug+Asserts compiler, on OS X. llvm-svn: 127763	2011-03-16 22:16:39 +00:00
Renato Golin	bf788a5626	Patch to a fix dwarf relocation problem on ARM. One-line fix plus the test where it used to break. llvm-svn: 127757	2011-03-16 21:05:52 +00:00
Bill Wendling	388dad6d62	Some minor cleanups based on feedback. llvm-svn: 127694	2011-03-15 20:47:26 +00:00
Evan Cheng	59ba6777c3	Do not form thumb2 ldrd / strd if the offset is by multiple of 4. rdar://9133587 llvm-svn: 127683	2011-03-15 18:41:52 +00:00
Evan Cheng	29faaebae9	Add a peephole optimization to optimize pairs of bitcasts. e.g. v2 = bitcast v1 ... v3 = bitcast v2 ... = v3 => v2 = bitcast v1 ... = v1 if v1 and v3 are of in the same register class. bitcast between i32 and fp (and others) are often not nops since they are in different register classes. These bitcast instructions are often left because they are in different basic blocks and cannot be eliminated by dag combine. rdar://9104514 llvm-svn: 127668	2011-03-15 05:13:13 +00:00
Bill Wendling	713c4bc3ee	Testcase for r127630. llvm-svn: 127648	2011-03-15 01:49:08 +00:00
Jim Grosbach	3de97c6e32	Clean up ARM tail calls a bit. They're pseudo-instructions for normal branches. Also more cleanly separate the ARM vs. Thumb functionality. Previously, the encoding would be incorrect for some Thumb instructions (the indirect calls). llvm-svn: 127637	2011-03-15 00:30:40 +00:00
Bill Wendling	da1364d669	Generate a VTBL instruction instead of a series of loads and stores when we can. As Nate pointed out, VTBL isn't super performant, but it has to be better than this: _shuf: @ BB#0: @ %entry push {r4, r7, lr} add r7, sp, #4 sub sp, #12 mov r4, sp bic r4, r4, #7 mov sp, r4 mov r2, sp vmov d16, r0, r1 orr r0, r2, #6 orr r3, r2, #7 vst1.8 {d16[0]}, [r3] vst1.8 {d16[5]}, [r0] subs r4, r7, #4 orr r0, r2, #5 vst1.8 {d16[4]}, [r0] orr r0, r2, #4 vst1.8 {d16[4]}, [r0] orr r0, r2, #3 vst1.8 {d16[0]}, [r0] orr r0, r2, #2 vst1.8 {d16[2]}, [r0] orr r0, r2, #1 vst1.8 {d16[1]}, [r0] vst1.8 {d16[3]}, [r2] vldr.64 d16, [sp] vmov r0, r1, d16 mov sp, r4 pop {r4, r7, pc} The "illegal" testcase in vext.ll is no longer illegal. <rdar://problem/9078775> llvm-svn: 127630	2011-03-14 23:02:38 +00:00
Eric Christopher	8180806e0f	Fix this test up a bit. llvm-svn: 127621	2011-03-14 21:05:21 +00:00
Evan Cheng	cb70b9e80b	Minor optimization. sign-ext/anyext of undef is still undef. llvm-svn: 127598	2011-03-14 18:15:55 +00:00
Eric Christopher	392d8f7d08	Saving files before committing is overrated. Add a RUN line to this test. llvm-svn: 127520	2011-03-12 01:36:23 +00:00
Eric Christopher	80a45901e0	Sometimes isPredicable lies to us and tells us we don't need the operands. Go ahead and add them on when we might want to use them and let later passes remove them. Fixes rdar://9118569 llvm-svn: 127518	2011-03-12 01:09:29 +00:00
Jim Grosbach	27eaca3e0d	Properly pseudo-ize the ARM LDMIA_RET instruction. This has the nice side- effect that we get proper instruction printing using the "pop" mnemonic for it. llvm-svn: 127502	2011-03-11 22:51:41 +00:00
Cameron Zwarich	bf5c9cd119	Roll r127459 back in: Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127498	2011-03-11 21:52:04 +00:00
Daniel Dunbar	a02706c889	Revert r127459, "Optimize trivial branches in CodeGenPrepare, which often get created from the", it broke some GCC test suite tests. llvm-svn: 127477	2011-03-11 19:30:30 +00:00
Cameron Zwarich	9ed726c151	Optimize trivial branches in CodeGenPrepare, which often get created from the lowering of objectsize intrinsics. Unfortunately, a number of tests were relying on llc not optimizing trivial branches, so I had to add an option to allow them to continue to test what they originally tested. This fixes <rdar://problem/8785296> and <rdar://problem/9112893>. llvm-svn: 127459	2011-03-11 04:54:27 +00:00
Evan Cheng	d5d2d4a158	Avoid replacing the value of a directly stored load with the stored value if the load is indexed. rdar://9117613. llvm-svn: 127440	2011-03-11 00:48:56 +00:00
Jim Grosbach	1986d9ac8f	Properly pseudo-ize MOVCCr and MOVCCs. llvm-svn: 127434	2011-03-10 23:56:09 +00:00
Bob Wilson	f8c4d1ded9	Fix a compiler crash where a Glue value had multiple uses. Radar 9049552. llvm-svn: 127198	2011-03-08 01:17:20 +00:00
Joerg Sonnenberger	5f2f5fa638	Be nice to Xcore and the XMOS assembler and avoid quoting section names that contain only letters, digits and the characters "_" and ".". llvm-svn: 127028	2011-03-04 20:03:14 +00:00
Devang Patel	3efe510847	XFAIL for all. These tests are darwin specific anyway. llvm-svn: 127022	2011-03-04 19:38:10 +00:00
Devang Patel	23ee9fdba3	Disable ARMGlobalMerge on darwin. The debugger is not yet able to extract individual variable's info from merged global. llvm-svn: 127019	2011-03-04 19:11:05 +00:00
Joerg Sonnenberger	bb93506f95	Bug#9033: For the ELF assembler output, always quote the section name. llvm-svn: 126963	2011-03-03 22:31:08 +00:00
Cameron Zwarich	6a4612ba06	Eliminate the unused CodeGenPrepare option to split critical edges. llvm-svn: 126825	2011-03-02 03:31:46 +00:00
Bill Wendling	304dda7810	Narrow right shifts need to encode their immediates differently from a normal shift. 16-bit: imm6<5:3> = '001', 8 - <imm> is encded in imm6<2:0> 32-bit: imm6<5:4> = '01',16 - <imm> is encded in imm6<3:0> 64-bit: imm6<5> = '1', 32 - <imm> is encded in imm6<4:0> llvm-svn: 126723	2011-03-01 01:00:59 +00:00
Jakob Stoklund Olesen	2bec7738eb	Fix typo introduced by r126661: "Fix a typo which ..." llvm-svn: 126666	2011-02-28 19:18:59 +00:00
Evan Cheng	4e6d375744	Fix a typo which cause dag combine crash. rdar://9059537. llvm-svn: 126661	2011-02-28 18:45:27 +00:00
Bob Wilson	6bbffe19e9	Add patterns to use post-increment addressing for Neon VST1-lane instructions. llvm-svn: 126477	2011-02-25 06:42:42 +00:00
Devang Patel	bac565c8a3	Move arch specific tests in arch specific directories. llvm-svn: 126401	2011-02-24 19:06:27 +00:00

... 2 3 4 5 6 ...

1155 Commits