llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Evan Cheng	1c169777ca	Update tests. llvm-svn: 85050	2009-10-25 07:53:48 +00:00
Bob Wilson	8f4f73da55	Revert 84843. Evan, this was breaking some of the if-conversion tests. llvm-svn: 84868	2009-10-22 16:52:21 +00:00
Evan Cheng	2edd1efa46	Move if-conversion before post-regalloc scheduling so the predicated instruction get scheduled properly. llvm-svn: 84843	2009-10-22 06:48:32 +00:00
Evan Cheng	8fdd1661fa	Don't generate sbfx / ubfx with negative lsb field. Patch by David Conrad. llvm-svn: 84813	2009-10-22 00:40:00 +00:00
Evan Cheng	275a09e55d	Match more patterns to movt. llvm-svn: 84751	2009-10-21 08:15:52 +00:00
Anton Korobeynikov	7b6fe9f251	Fix invalid for vector types fneg(bitconvert(x)) => bitconvert(x ^ sign) transform. llvm-svn: 84683	2009-10-20 21:37:45 +00:00
Chris Lattner	b9bbaf7f4d	convert to filecheck syntax and make a lot more aggressive. llvm-svn: 84517	2009-10-19 18:27:56 +00:00
Chris Lattner	6fd5bc3ba0	rename test llvm-svn: 84515	2009-10-19 18:18:07 +00:00
Evan Cheng	b1580b5c48	Enable post-alloc scheduling for all ARM variants except for Thumb1. llvm-svn: 84249	2009-10-16 06:11:08 +00:00
Bob Wilson	d66a3fd73b	Revise ARM inline assembly memory operands to require the memory address to be in a register. The previous use of ARM address mode 2 was completely arbitrary and inappropriate for Thumb. Radar 7137468. llvm-svn: 84022	2009-10-13 20:50:28 +00:00
Sandeep Patel	1584038783	Add ARMv6T2 SBFX/UBFX instructions. Approved by Anton Korobeynikov. llvm-svn: 84009	2009-10-13 18:59:48 +00:00
Benjamin Kramer	34c117d8b7	Eliminate some redundant llvm-as calls. llvm-svn: 83837	2009-10-12 09:31:55 +00:00
Dan Gohman	b535009219	Update this test; the code is the same but it gets counted as one fewer remat. llvm-svn: 83690	2009-10-09 23:31:04 +00:00
Bob Wilson	011e458c11	Merge a bunch of NEON tests into larger files so they run faster. llvm-svn: 83667	2009-10-09 20:20:54 +00:00
Bob Wilson	de71518edb	Convert some ARM tests with lots of greps to use FileCheck. llvm-svn: 83651	2009-10-09 17:20:46 +00:00
Bob Wilson	d48cacb92f	Commit one last NEON test to use FileCheck. That's all of them now! llvm-svn: 83617	2009-10-09 05:31:56 +00:00
Bob Wilson	a8746e6bd1	Convert more NEON tests to use FileCheck. llvm-svn: 83616	2009-10-09 05:14:48 +00:00
Bob Wilson	8092fef09a	Add codegen support for NEON vst4lane intrinsics with 128-bit vectors. llvm-svn: 83600	2009-10-09 00:01:36 +00:00
Bob Wilson	979cb24a81	Add codegen support for NEON vst3lane intrinsics with 128-bit vectors. llvm-svn: 83598	2009-10-08 23:51:31 +00:00
Bob Wilson	233992bc56	Add codegen support for NEON vst2lane intrinsics with 128-bit vectors. llvm-svn: 83596	2009-10-08 23:38:24 +00:00
Bob Wilson	395adfabef	Convert more NEON tests to use FileCheck. llvm-svn: 83595	2009-10-08 23:33:03 +00:00
Bob Wilson	5b96a53ffe	Add codegen support for NEON vld4lane intrinsics with 128-bit vectors. Also fix some copy-and-paste errors in previous changes. llvm-svn: 83590	2009-10-08 22:53:57 +00:00
Bob Wilson	1fe8b7e27c	Convert more NEON tests to use FileCheck. llvm-svn: 83587	2009-10-08 22:33:53 +00:00
Bob Wilson	7209d78713	Add codegen support for NEON vld3lane intrinsics with 128-bit vectors. llvm-svn: 83585	2009-10-08 22:27:33 +00:00
Anton Korobeynikov	f9c811c948	Use lower16 / upper16 imm modifiers to asmprint 32-bit imms splitted via movt/movw pair. llvm-svn: 83572	2009-10-08 20:43:22 +00:00
Bob Wilson	3a55fe2105	Add codegen support for NEON vld2lane intrinsics with 128-bit vectors. llvm-svn: 83568	2009-10-08 18:56:10 +00:00
Bob Wilson	225627ec81	Convert more NEON tests to use FileCheck. llvm-svn: 83528	2009-10-08 06:02:10 +00:00
Bob Wilson	276bdabb9a	Add codegen support for NEON vst4 intrinsics with <1 x i64> vectors. llvm-svn: 83526	2009-10-08 05:18:18 +00:00
Bob Wilson	8aa1d328b5	Add codegen support for NEON vst3 intrinsics with <1 x i64> vectors. llvm-svn: 83518	2009-10-08 00:28:28 +00:00
Bob Wilson	958e4ae815	Add codegen support for NEON vst2 intrinsics with <1 x i64> vectors. llvm-svn: 83513	2009-10-08 00:21:01 +00:00
Bob Wilson	729cd181a2	Add codegen support for NEON vld4 intrinsics with <1 x i64> vectors. llvm-svn: 83508	2009-10-07 23:54:04 +00:00
Bob Wilson	c7baa2832f	Convert more NEON tests to use FileCheck. llvm-svn: 83507	2009-10-07 23:47:21 +00:00
Bob Wilson	3cbf156518	Add codegen support for NEON vld3 intrinsics with <1 x i64> vectors. llvm-svn: 83506	2009-10-07 23:39:57 +00:00
Bob Wilson	0ffa9679a5	Add codegen support for NEON vld2 intrinsics with <1 x i64> vectors. llvm-svn: 83502	2009-10-07 22:57:01 +00:00
Bob Wilson	97bab9ef32	Convert more NEON tests to use FileCheck. llvm-svn: 83497	2009-10-07 22:30:19 +00:00
Bob Wilson	a367eb439c	Convert test to FileCheck. llvm-svn: 83487	2009-10-07 20:51:42 +00:00
Bob Wilson	cee91108da	Add codegen support for NEON vst4 intrinsics with 128-bit vectors. llvm-svn: 83486	2009-10-07 20:49:18 +00:00
Bob Wilson	af14187764	Add codegen support for NEON vst3 intrinsics with 128-bit vectors. llvm-svn: 83484	2009-10-07 20:30:08 +00:00
Bob Wilson	62a3e55cea	Add codegen support for NEON vst2 intrinsics with 128-bit vectors. llvm-svn: 83482	2009-10-07 18:47:39 +00:00
Bob Wilson	9bb47b3e5d	Add codegen support for NEON vld4 intrinsics with 128-bit vectors. llvm-svn: 83479	2009-10-07 18:09:32 +00:00
Bob Wilson	b38401ccef	Add codegen support for NEON vld3 intrinsics with 128-bit vectors. llvm-svn: 83471	2009-10-07 17:24:55 +00:00
Bob Wilson	39328dad67	Add tests for vld2 of 128-bit vectors. llvm-svn: 83468	2009-10-07 17:19:13 +00:00
Bob Wilson	636d635cc1	Update NEON struct names to match llvm-gcc changes. (This is not required for correctness but might help with sanity.) llvm-svn: 83415	2009-10-06 21:16:19 +00:00
Evan Cheng	d93fbb28ed	Fix tests. llvm-svn: 83241	2009-10-02 06:53:57 +00:00
Evan Cheng	d6e64a4cfd	Move load / store multiple before post-alloc scheduling. llvm-svn: 83236	2009-10-02 04:57:15 +00:00
David Goodwin	a4b73e486e	Remove neonfp attribute and instead set default based on CPU string. Add -arm-use-neon-fp to override the default. llvm-svn: 83218	2009-10-01 22:19:57 +00:00
David Goodwin	d0edce4c0d	Restore the -post-RA-scheduler flag as an override for the target specification. Remove -mattr for setting PostRAScheduler enable and instead use CPU string. llvm-svn: 83215	2009-10-01 21:46:35 +00:00
David Goodwin	a282690f82	Remove -post-RA-schedule flag and add a TargetSubtarget method to enable post-register-allocation scheduling. By default it is off. For ARM, enable/disable with -mattr=+/-postrasched. Enable by default for cortex-a8. llvm-svn: 83122	2009-09-30 00:10:16 +00:00
David Goodwin	34aa421e3a	Post-RA regressions. llvm-svn: 83075	2009-09-29 17:10:26 +00:00
Evan Cheng	d2bf81c7af	Fix PR4687. Pre ARMv5te does not support ldrd / strd. Patch by John Tytgat. llvm-svn: 83058	2009-09-29 07:07:30 +00:00
Evan Cheng	ddc8678b00	Coalescer should not delete extract_subreg, insert_subreg, and subreg_to_reg of physical registers. This is especially critical for the later two since they start the live interval of a super-register. e.g. %DO<def> = INSERT_SUBREG %D0<undef>, %S0<kill>, 1 If this instruction is eliminated, the register scavenger will not be happy as D0 is not defined previously. This fixes PR5055. llvm-svn: 82968	2009-09-28 05:28:43 +00:00
Anton Korobeynikov	189ce11684	Use movt/movw pair to materialize 32 bit constants on ARMv6T2+. This should be better than single load from constpool. llvm-svn: 82948	2009-09-27 23:52:58 +00:00
Evan Cheng	eafd25a098	Remove this test. llvm-svn: 82869	2009-09-26 18:51:37 +00:00
Daniel Dunbar	923dc62615	"Update" tests for -disable-if-conversion removal. I think branch.ll should just be removed, but I XFAIL'd it for now. llvm-svn: 82847	2009-09-26 05:29:36 +00:00
Evan Cheng	0b55bde4b3	Convert test to filecheck. llvm-svn: 82835	2009-09-26 02:41:17 +00:00
Evan Cheng	ac17fbc5fe	Flip -disable-post-RA-scheduler to -post-RA-scheduler. llvm-svn: 82803	2009-09-25 21:38:11 +00:00
Dan Gohman	0ac693a89e	Improve MachineMemOperand handling. - Allocate MachineMemOperands and MachineMemOperand lists in MachineFunctions. This eliminates MachineInstr's std::list member and allows the data to be created by isel and live for the remainder of codegen, avoiding a lot of copying and unnecessary translation. This also shrinks MemSDNode. - Delete MemOperandSDNode. Introduce MachineSDNode which has dedicated fields for MachineMemOperands. - Change MemSDNode to have a MachineMemOperand member instead of its own fields with the same information. This introduces some redundancy, but it's more consistent with what MachineInstr will eventually want. - Ignore alignment when searching for redundant loads for CSE, but remember the greatest alignment. Target-specific code which previously used MemOperandSDNodes with generic SDNodes now use MemIntrinsicSDNodes, with opcodes in a designated range so that the SelectionDAG framework knows that MachineMemOperand information is available. llvm-svn: 82794	2009-09-25 20:36:54 +00:00
Bob Wilson	94e29af5ac	pr4926: ARM requires the stack pointer to be aligned, even for leaf functions. For the AAPCS ABI, SP must always be 4-byte aligned, and at any "public interface" it must be 8-byte aligned. For the older ARM APCS ABI, the stack alignment is just always 4 bytes. For X86, we currently align SP at entry to a function (e.g., to 16 bytes for Darwin), but no stack alignment is needed at other times, such as for a leaf function. After discussing this with Dan, I decided to go with the approach of adding a new "TransientStackAlignment" field to TargetFrameInfo. This value specifies the stack alignment that must be maintained even in between calls. It defaults to 1 except for ARM, where it is 4. (Some other targets may also want to set this if they have similar stack requirements. It's not currently required for PPC because it sets targetHandlesStackFrameRounding and handles the alignment in target-specific code.) The existing StackAlignment value specifies the alignment upon entry to a function, which is how we've been using it anyway. llvm-svn: 82767	2009-09-25 14:41:49 +00:00
Bob Wilson	4cb29f6864	Convert to FileCheck. llvm-svn: 82710	2009-09-24 20:23:02 +00:00
Evan Cheng	e9267f10c7	Fix PR5024 with a big hammer: disable the double-def assertion in the scavenger. LiveVariables add implicit kills to correctly track partial register kills. This works well enough and is fairly accurate. But coalescer can make it impossible to maintain these markers. e.g. BL <ga:sss1>, %R0<kill,undef>, %S0<kill>, %R0<imp-def>, %R1<imp-def,dead>, %R2<imp-def,dead>, %R3<imp-def,dead>, %R12<imp-def,dead>, %LR<imp-def,dead>, %D0<imp-def>, ... ... %reg1031<def> = FLDS <cp#1>, 0, 14, %reg0, Mem:LD4[ConstantPool] ... %S0<def> = FCPYS %reg1031<kill>, 14, %reg0, %D0<imp-use,kill> When reg1031 and S0 are coalesced, the copy (FCPYS) will be eliminated the the implicit-kill of D0 is lost. In this case it's possible to move the marker to the FLDS. But in many cases, this is not possible. Suppose %reg1031<def> = FOO <cp#1>, %D0<imp-def> ... %S0<def> = FCPYS %reg1031<kill>, 14, %reg0, %D0<imp-use,kill> When FCPYS goes away, the definition of S0 is the "FOO" instruction. However, transferring the D0 implicit-kill to FOO doesn't work since it is the def of D0 itself. We need to fix this in another time by introducing a "kill" pseudo instruction to track liveness. Disabling the assertion is not ideal, but machine verifier is doing that job now. It's important to know double-def is not a miscomputation since it means a register should be free but it's not tracked as free. It's a performance issue instead. llvm-svn: 82677	2009-09-24 02:27:09 +00:00
Evan Cheng	863ed2677b	Fix PR5024. LiveVariables physical register defs should commit only after all of the defs are processed. Also fix a implicit_def propagation bug: a implicit_def of a physical register should be applied to uses of the sub-registers. llvm-svn: 82616	2009-09-23 06:28:31 +00:00
Evan Cheng	97c8597450	Fix PR5024. LiveVariables::FindLastPartialDef should return a set of sub-registers that were defined by the last partial def, not just a single sub-register. llvm-svn: 82535	2009-09-22 08:34:46 +00:00
Evan Cheng	07d521ed99	Fix a pasto. Also simplify for Bill's benefit. llvm-svn: 82505	2009-09-22 01:48:19 +00:00
Evan Cheng	a6d602a5c1	Clean up spill weight computation. Also some changes to give loop induction variable increment / decrement slighter high priority. This has major impact on some micro-benchmarks. On MultiSource/Applications and spec tests, it's a minor win. It also reduce 256.bzip instruction count by 8%, 55 on 164.gzip on i386 / Darwin. llvm-svn: 82485	2009-09-21 21:12:25 +00:00
Evan Cheng	3edb8b18a5	Fix PR4986. "r1024 = insert_subreg r1024, undef, 2" cannot be turned in an implicit_def. Instead, it's an identity copy so it should be eliminated. Also make sure to update livevariable kill information. llvm-svn: 82436	2009-09-21 04:32:32 +00:00
Bob Wilson	9bcea11785	Convert more tests to FileCheck. llvm-svn: 81915	2009-09-15 20:58:02 +00:00
Sandeep Patel	7727a68464	Fix superreg use in ARMAsmPrinter. Approved by Anton Korobeynikov. llvm-svn: 81878	2009-09-15 17:53:11 +00:00
Anton Korobeynikov	fa4c7562d5	Define proper subreg sets for arm - this should fix bunch of subtle problems with subreg - superreg mapping and also fix PR4965. llvm-svn: 81657	2009-09-13 00:59:43 +00:00
Dan Gohman	8e55f0f55b	Remove an unnecessary -f. llvm-svn: 81546	2009-09-11 18:41:06 +00:00
Dan Gohman	f2c290dfa6	Convert more tests to avoid llvm-as. llvm-svn: 81545	2009-09-11 18:36:27 +00:00
Bob Wilson	7b39f31422	Don't swap the operands of a subtraction when trying to create a post-decrement load/store. llvm-svn: 81464	2009-09-10 22:09:31 +00:00
Bob Wilson	877a857b4b	Fix pr4939: Change FPCCToARMCC to translate SETOLE to ARMCC::LS. See the bug report for details. llvm-svn: 81397	2009-09-09 23:14:54 +00:00
Dan Gohman	142428ce64	Eliminate more uses of llvm-as and llvm-dis. llvm-svn: 81293	2009-09-09 00:09:15 +00:00
Anton Korobeynikov	2b6ef7724e	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Anton Korobeynikov	0b3a620d60	Add NEON 'laned' operations. This fixes another bunch of gcc testsuite fails and makes the code faster. llvm-svn: 81220	2009-09-08 15:22:32 +00:00
Daniel Dunbar	2a64e85835	Remove stale greps. llvm-svn: 80986	2009-09-04 05:07:52 +00:00
Bob Wilson	25410ac604	Convert tests to FileCheck. llvm-svn: 80983	2009-09-04 04:07:19 +00:00
Bob Wilson	9e02907942	Convert a test to FileCheck. llvm-svn: 80975	2009-09-04 00:32:31 +00:00
Evan Cheng	41e87f2f13	Reference to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Anton Korobeynikov	7125d63acf	More missed vdup patterns llvm-svn: 80838	2009-09-02 21:21:28 +00:00
Bob Wilson	6972a16bbc	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	75b2b04e1e	Fix incorrect declarations of intrinsics in this test. llvm-svn: 80705	2009-09-01 18:50:43 +00:00
Bob Wilson	d638cc8869	Add test for vld{234}_lane instructions. llvm-svn: 80658	2009-09-01 04:27:10 +00:00
Bob Wilson	03f5a5bfff	Fix pr4843: When an instruction has multiple destination registers that are tied to different source registers, the TwoAddressInstructionPass needs to be smarter. Change it to check before replacing a source register whether that source register is tied to a different destination register, and if so, defer handling it until a subsequent iteration. llvm-svn: 80654	2009-09-01 04:18:40 +00:00
Jim Grosbach	4e0e9a4870	SJLJ is arm/darwin only for now. force the triple for the test llvm-svn: 80651	2009-09-01 02:34:49 +00:00
Jim Grosbach	9a220088ac	Clean up LSDA name generation and use for SJLJ exception handling. This makes an eggregious hack somewhat more palatable. Bringing the LSDA forward and making it a GV available for reference would be even better, but is beyond the scope of what I'm looking to solve at this point. Objective C++ code could generate function names that broke the previous scheme. This fixes that. llvm-svn: 80649	2009-09-01 01:57:56 +00:00
David Goodwin	0fc3764297	Don't mark a register live at an undef use. llvm-svn: 80621	2009-08-31 20:47:02 +00:00
Anton Korobeynikov	17529667db	Add missed pattern llvm-svn: 80502	2009-08-30 19:06:39 +00:00
Anton Korobeynikov	a261afbf14	EXTRACT_VECTOR_ELEMENT can have result type different from element type. Remove the assertion and generalize the code for ARM NEON stuff. llvm-svn: 80498	2009-08-30 17:14:54 +00:00
Anton Korobeynikov	b2e6f5eed4	Do not assert on too wide splats we don't support. llvm-svn: 80409	2009-08-29 00:08:18 +00:00
Anton Korobeynikov	9fd6082c10	Add missed extract_element pattern llvm-svn: 80408	2009-08-28 23:41:26 +00:00
Evan Cheng	d7a07ab112	Let Darwin linker auto-synthesize stubs and lazy-pointers. This deletes a bunch of nasty code in ARM asm printer. llvm-svn: 80404	2009-08-28 23:18:09 +00:00
Evan Cheng	2d5d3700e9	v4, v5 does not support sxtb / sxth. llvm-svn: 80322	2009-08-28 00:31:43 +00:00
Anton Korobeynikov	cb0fdc4505	scalar_to_vector is fully legal now (implemented as subreg accesses) llvm-svn: 80249	2009-08-27 16:04:47 +00:00
Anton Korobeynikov	e17a92c545	Ok, sometimes it's profitable to turn scalar_to_vector stuff into subreg access. Add a testcase. llvm-svn: 80246	2009-08-27 14:51:42 +00:00
Evan Cheng	984f8efcaa	Fix PR4789. Teach eliminateFrameIndex how to handle VLDRQ and VSTRQ which cannot fold any immediate offset. llvm-svn: 80191	2009-08-27 01:23:50 +00:00
Bob Wilson	c7d92cfb15	Convert some more Neon tests to FileCheck. llvm-svn: 80120	2009-08-26 18:11:50 +00:00
Anton Korobeynikov	1c904039ce	Expand scalar_to_vector - we don't have any isel logic for it now llvm-svn: 80107	2009-08-26 16:26:09 +00:00
David Goodwin	047f69da86	Fixup register kills after scheduling. llvm-svn: 80002	2009-08-25 17:03:05 +00:00
Dan Gohman	bf08e82d8e	Remove obsolete -f flags. llvm-svn: 79992	2009-08-25 15:38:29 +00:00
Dale Johannesen	add8a314dd	Split test into 3. llvm-svn: 79926	2009-08-24 17:51:19 +00:00
Eli Friedman	79615641f1	Make x86 test actually test x86 code generation. Fix the construct on ARM, which was breaking by coincidence, and add a similar testcase for ARM. llvm-svn: 79719	2009-08-22 03:13:10 +00:00
Bob Wilson	79c0af15d0	Use CHECK-NEXT to make sure we're only getting one copy of each shuffle instruction. llvm-svn: 79702	2009-08-22 00:13:23 +00:00
Bob Wilson	6d4400e852	Match VTRN, VZIP, and VUZP shuffles. Restore the tests for these operations, now using shuffles instead of intrinsics. llvm-svn: 79673	2009-08-21 20:54:19 +00:00
Bob Wilson	0da4ec0046	Add some tests for vext.16 and vext.32. llvm-svn: 79638	2009-08-21 16:35:24 +00:00
Bob Wilson	c046b62f1a	Remove Neon intrinsics for VZIP, VUZP, and VTRN. We will represent these as vector shuffles. Temporarily remove the tests for these operations until the new implementation is working. llvm-svn: 79579	2009-08-21 00:01:42 +00:00
Bob Wilson	fae9057bf0	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bill Wendling	962adec4ee	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	bfebbb6477	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	5d841097a9	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Jakob Stoklund Olesen	7f4ef2d59a	Refine EarlyClobber assert in register scavenger. It is legal for an inline asm operand to use an earlyclobber register if the use operand is tied to the earlyclobber operand. The issue is discussed here: http://gcc.gnu.org/ml/gcc/1999-04n/msg00431.html We should perhaps let only the machine code verifier worry about these finer details. EarlyClobber operands are not really interesting to the scavenger. This fixes PR4528 for the third time. llvm-svn: 79122	2009-08-15 18:16:58 +00:00
Jakob Stoklund Olesen	8f6660c417	Don't setCalleeSavedInfoValid() until spills are interted. In a naked function, the flag is never set and getPristineRegs() returns an empty list. That means naked functions are able to clobber callee saved registers, but that is the whole point of naked functions. This fixes PR4716. llvm-svn: 79096	2009-08-15 13:10:46 +00:00
Bob Wilson	0cf2be2466	Generate Neon VTBL and VTBX instructions from the corresponding intrinsics. llvm-svn: 78835	2009-08-12 20:51:55 +00:00
Chris Lattner	aa0dbe5764	now that these are in file-check format, we can merge them together into one bigger test (which runs faster) llvm-svn: 78672	2009-08-11 15:54:17 +00:00
Bob Wilson	2195d82b90	Convert more Neon tests to use FileCheck. llvm-svn: 78648	2009-08-11 05:51:19 +00:00
Bob Wilson	d64e304671	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Bob Wilson	1c75a23299	Use new EVT::vAny type to combine Neon intrinsics for VPADD. llvm-svn: 78632	2009-08-11 01:15:26 +00:00
David Goodwin	fcb59a8a30	Use FileCheck. llvm-svn: 78614	2009-08-10 23:14:14 +00:00
David Goodwin	151235d75d	Use FileCheck... its good for you... llvm-svn: 78613	2009-08-10 23:06:57 +00:00
David Goodwin	7c0b4485d1	Fix test. llvm-svn: 78611	2009-08-10 22:58:08 +00:00
David Goodwin	2e2fe66e85	Fix test. llvm-svn: 78606	2009-08-10 22:31:04 +00:00
David Goodwin	36a5b02e4f	Use NEON for single-precision int<->FP conversions. llvm-svn: 78604	2009-08-10 22:17:39 +00:00
Dan Gohman	fe048746c2	Add nounwind keywords. llvm-svn: 78568	2009-08-10 16:48:40 +00:00
Chris Lattner	cc70d578be	Make the big switch: Change MCSectionMachO to represent a section semantically instead of syntactically as a string. This means that it keeps track of the segment, section, flags, etc directly and asmprints them in the right format. This also includes parsing and validation support for llvm-mc and "attribute(section)", so we should now start getting errors about invalid section attributes from the compiler instead of the assembler on darwin. Still todo: 1) Uniquing of darwin mcsections 2) Move all the Darwin stuff out to MCSectionMachO.[cpp\|h] 3) there are a few FIXMEs, for example what is the syntax to get the S_GB_ZEROFILL segment type? llvm-svn: 78547	2009-08-10 01:39:42 +00:00
Bob Wilson	8b13d5c8e3	Add tests for Neon VZIP and VUZP instructions. llvm-svn: 78529	2009-08-09 06:48:29 +00:00
Bob Wilson	06b61e2598	Add a test for Neon VTRN instructions. llvm-svn: 78528	2009-08-09 06:30:46 +00:00
Bob Wilson	a2913fe5f5	Convert more Neon tests to use FileCheck. llvm-svn: 78433	2009-08-07 23:45:02 +00:00
David Goodwin	c0fe95d8ce	Make NEON single-precision FP support the default for cortex-a8 (again). llvm-svn: 78430	2009-08-07 23:32:33 +00:00
Anton Korobeynikov	9b52601704	2 more vdup.32 cases llvm-svn: 78419	2009-08-07 22:36:50 +00:00
Bob Wilson	bd7627b23e	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
Bob Wilson	5cbc89337b	Fix incorrect intrinsic declarations. llvm-svn: 78329	2009-08-06 18:46:26 +00:00
Bob Wilson	6fb1102b9a	Add tests for new NEON vld instructions. llvm-svn: 78264	2009-08-06 00:38:31 +00:00
Bob Wilson	a12289f373	Convert more Neon tests to FileCheck. llvm-svn: 78261	2009-08-05 23:51:20 +00:00
Anton Korobeynikov	07ce0611d9	Missed pieces for ARM HardFP ABI. Patch by Sandeep Patel! llvm-svn: 78225	2009-08-05 19:04:42 +00:00
Bob Wilson	36d2cedfcb	Convert more Neon tests to use FileCheck. llvm-svn: 78111	2009-08-04 22:01:41 +00:00
Bob Wilson	423086a047	Convert a few Neon tests to use FileCheck. llvm-svn: 78108	2009-08-04 21:33:22 +00:00
Jakob Stoklund Olesen	5edb25cd45	Clean up the handling of two-address operands in RegScavenger. This fixes PR4528. llvm-svn: 78107	2009-08-04 21:30:30 +00:00
David Goodwin	648590849c	Add NEON single-precision FP support for fabs and fneg. llvm-svn: 78101	2009-08-04 20:39:05 +00:00
David Goodwin	5efde448fa	Match common pattern for FNMAC. Add NEON SP support. llvm-svn: 78085	2009-08-04 18:44:29 +00:00
David Goodwin	e034df4626	Improve tests. llvm-svn: 78083	2009-08-04 18:11:59 +00:00
David Goodwin	99adffe5f2	Initial support for single-precision FP using NEON. Added "neonfp" attribute to enable. Added patterns for some binary FP operations. llvm-svn: 78081	2009-08-04 17:53:06 +00:00
Evan Cheng	d840bf2eac	Fix PR4528. This scavenger assertion is too strict. The two-address value is killed by another operand. There is probably a better fix. Either 1) scavenger can look at other operands, or 2) livevariables can be smarter about kill markers. Patches welcome. llvm-svn: 78072	2009-08-04 16:52:44 +00:00
Bob Wilson	eb3b616a7e	Lower CONCAT_VECTOR during legalization instead of matching it during isel. Add a testcase. llvm-svn: 77992	2009-08-03 20:36:38 +00:00
Jakob Stoklund Olesen	1b274fd5f0	Fix Bug 4657: register scavenger asserts with subreg lowering When LowerSubregsInstructionPass::LowerInsert eliminates an INSERT_SUBREG instriction because it is an identity copy, make sure that the same registers are alive before and after the elimination. When the super-register is marked <undef> this requires inserting an IMPLICIT_DEF instruction to make sure the super register is live. Fix a related bug where a kill flag on the inserted sub-register was not transferred properly. Finally, clear the undef flag in MachineInstr::addRegisterKilled. Undef implies dead and kill implies live, so they cant both be valid. llvm-svn: 77989	2009-08-03 20:08:18 +00:00
Chris Lattner	1e3d2247ba	switch to filecheck format llvm-svn: 77841	2009-08-02 00:32:26 +00:00
Evan Cheng	fabbd6219a	Add VFP3 D registers to the DPR register class. llvm-svn: 77521	2009-07-29 23:03:41 +00:00
Bob Wilson	355e0b70e0	Change Neon VLDn intrinsics to return multiple values instead of really wide vectors. Likewise, change VSTn intrinsics to take separate arguments for each vector in a multi-vector struct. Adjust tests accordingly. llvm-svn: 77468	2009-07-29 16:39:22 +00:00
Bob Wilson	ec256c8938	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Evan Cheng	4a77f28c47	Use getTargetConstant instead of getConstant since it's meant as an constant operand. llvm-svn: 76803	2009-07-22 22:03:29 +00:00
Evan Cheng	88dbc00ca7	Ignore undef uses. llvm-svn: 76799	2009-07-22 21:51:42 +00:00
Evan Cheng	949c2404a2	Fix ARM isle code that optimize multiply by constants which are power-of-2 +/- 1. llvm-svn: 76520	2009-07-21 00:31:12 +00:00
Evan Cheng	919f5c5559	Forgot this test earlier. llvm-svn: 76485	2009-07-20 21:46:42 +00:00
Chris Lattner	499fe29f12	fix an arm codegen bug (the same as PR4482 on ppc) where available_externally symbols were not getting stubs. While I'm at it, add a big testcase for stub generation to make sure I don't break anything. llvm-svn: 75737	2009-07-15 04:12:33 +00:00
Evan Cheng	4249ad4c00	Remove a bogus assertion. llvm-svn: 75206	2009-07-10 00:23:48 +00:00
Bob Wilson	f5f52fa9d6	Handle 'a' modifier on inline assembly operands. This is part of the fix for pr4521. llvm-svn: 75201	2009-07-09 23:54:51 +00:00
Lang Hames	ceb80b14d3	Improved tracking of value number kills. VN kills are now represented as an (index,bool) pair. The bool flag records whether the kill is a PHI kill or not. This code will be used to enable splitting of live intervals containing PHI-kills. A slight change to live interval weights introduced an extra spill into lsr-code-insertion (outside the critical sections). The test condition has been updated to reflect this. llvm-svn: 75097	2009-07-09 03:57:02 +00:00
Bob Wilson	8d4a8b9370	Implement NEON vst1 instruction. llvm-svn: 75037	2009-07-08 20:32:02 +00:00
Bob Wilson	3809b333de	Implement NEON vld1 instructions. llvm-svn: 75019	2009-07-08 18:11:30 +00:00
Chris Lattner	2939f0a318	Change these tests to use [fi]cmp+sext instead of v[fi]cmp. No functionality change. llvm-svn: 74979	2009-07-08 00:46:57 +00:00
Evan Cheng	5a279bb4b2	Add bfc to armv6t2. llvm-svn: 74868	2009-07-06 22:23:46 +00:00
Evan Cheng	2570d8b541	Added ARM::mls for armv6t2. llvm-svn: 74866	2009-07-06 22:05:45 +00:00
Evan Cheng	f20e4fba49	Add thumb2 sign / zero extend with rotate instructions. llvm-svn: 74755	2009-07-03 01:43:10 +00:00
Evan Cheng	e6989735a6	CommuteChangesDestination() should check if to-be-commuted instruction defines any register. Also teaches the default commuteInstruction() to commute instruction without definitions (e.g. X86::test / ARM::tsp). llvm-svn: 74602	2009-07-01 08:29:08 +00:00
Evan Cheng	7d78cb531e	Remove special handling of implicit_def. Fix a couple more bugs in liveintervalanalysis and coalescer handling of implicit_def. Note, isUndef marker must be placed even on implicit_def def operand or else the scavenger will not ignore it. This is necessary because -O0 path does not use liveintervalanalysis, it treats implicit_def just like any other def. llvm-svn: 74601	2009-07-01 08:19:36 +00:00
Evan Cheng	37503e9671	Handle IMPLICIT_DEF with isUndef operand marker, part 2. This patch moves the code to annotate machineoperands to LiveIntervalAnalysis. It also add markers for implicit_def that define physical registers. The rest, is just a lot of details. llvm-svn: 74580	2009-07-01 01:59:31 +00:00
Evan Cheng	28b9e77f19	Temporarily restore the scavenger implicit_def checking code. MachineOperand isUndef mark is not being put on implicit_def of physical registers (created for parameter passing, etc.). llvm-svn: 74519	2009-06-30 09:19:42 +00:00
Evan Cheng	c6c942b70f	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile to having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Evan Cheng	093adf3ff9	Implement Thumb2 ldr. After much back and forth, I decided to deviate from ARM design and split LDR into 4 instructions (r + imm12, r + imm8, r + r << imm12, constantpool). The advantage of this is 1) it follows the latest ARM technical manual, and 2) makes it easier to reduce the width of the instruction later. The down side is this creates more inconsistency between the two sub-targets. We should split ARM LDR instruction in a similar fashion later. I've added a README entry for this. llvm-svn: 74420	2009-06-29 07:51:04 +00:00
David Goodwin	b2c485c6bd	ORN and BIC tests. llvm-svn: 74289	2009-06-26 16:20:06 +00:00
Evan Cheng	7883ae3121	Fix tests: Count -> count. llvm-svn: 74282	2009-06-26 07:05:57 +00:00
Evan Cheng	da10be895c	Fix a CodeGenDAGPatterns bug. Check if top level predicates match when it's looking for duplicates. llvm-svn: 74276	2009-06-26 05:59:16 +00:00
Evan Cheng	4ac765118d	Select ADC, SBC, and RSC instead of the ADCS, SBCS, and RSCS when the carry bit def is not used. llvm-svn: 74228	2009-06-25 23:34:10 +00:00
Evan Cheng	0cced3daa8	ISD::ADDE / ISD::SUBE updates the carry bit so they should isle to ADCS and SBCS / RSCS. llvm-svn: 74200	2009-06-25 20:59:23 +00:00
Evan Cheng	b4139189b0	Move thumb and thumb2 tests into separate directories. llvm-svn: 74068	2009-06-24 06:36:07 +00:00
Evan Cheng	eaad82627b	Proper patterns for thumb2 shift and rotate instructions. llvm-svn: 73987	2009-06-23 19:39:13 +00:00
Bob Wilson	6db76aaf10	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00
Evan Cheng	2814371831	It's coalescer, not coaleser. llvm-svn: 73902	2009-06-22 21:09:17 +00:00
Bob Wilson	0c2c5f65e2	For Darwin on ARMv6 and newer, make register r9 available for use as a caller-saved register. llvm-svn: 73901	2009-06-22 21:01:46 +00:00
Evan Cheng	2410955c62	Fix another register coalescer crash: forgot to check if the instruction being updated has already been coalesced. llvm-svn: 73898	2009-06-22 20:49:32 +00:00
Evan Cheng	b37e7e24d0	hasFP should return true if frame address is taken. llvm-svn: 73893	2009-06-22 18:38:48 +00:00
Evan Cheng	b45918e5bb	Fix PR4419: handle defs of partial uses. llvm-svn: 73816	2009-06-20 04:34:51 +00:00
Evan Cheng	f18de63563	Enable arm pre-allocation load / store multiple optimization pass. llvm-svn: 73791	2009-06-19 23:17:27 +00:00
Eli Friedman	003abaa60d	Mark a few Thumb instructions commutable; just happened to spot this while experimenting. I'm reasonably sure this is correct, but please tell me if these instructions have some strange property which makes this change unsafe. llvm-svn: 73746	2009-06-19 01:43:08 +00:00
Anton Korobeynikov	7fd29c57a8	Initial support for some Thumb2 instructions. Patch by Viktor Kutuzov and Anton Korzh from Access Softek, Inc. llvm-svn: 73622	2009-06-17 18:13:58 +00:00
Anton Korobeynikov	d6004a164c	Make the test target-neutral llvm-svn: 73547	2009-06-16 20:25:25 +00:00
Anton Korobeynikov	a74b8323d0	GNU as refuses to assemble "pop {}" instruction. Do not emit such (this is the case when we have thumb vararg function with single callee-saved register, which is handled separately). llvm-svn: 73529	2009-06-16 18:49:08 +00:00
Evan Cheng	a98ff05fca	If a val# is defined by an implicit_def and it is being removed, all of the copies off the val# were removed. This causes problem later since the scavenger will see uses of registers without defs. The proper solution is to change the copies into implicit_def's instead. TurnCopyIntoImpDef turns a copy into implicit_def and remove the val# defined by it. This causes an scavenger assertion later if the def reaches other blocks. Disable the transformation if the value live interval extends beyond its def block. llvm-svn: 73478	2009-06-16 07:12:58 +00:00
Evan Cheng	4b77794613	ifcvt should ignore cfg where true and false successors are the same. llvm-svn: 73423	2009-06-15 21:24:34 +00:00
Evan Cheng	3219c7fbe5	Part 1. - Change register allocation hint to a pair of unsigned integers. The hint type is zero (which means prefer the register specified as second part of the pair) or entirely target dependent. - Allow targets to specify alternative register allocation orders based on allocation hint. Part 2. - Use the register allocation hint system to implement more aggressive load / store multiple formation. - Aggressively form LDRD / STRD. These are formed before register allocation. It has to be done this way to shorten live interval of base and offset registers. e.g. v1025 = LDR v1024, 0 v1026 = LDR v1024, 0 => v1025,v1026 = LDRD v1024, 0 If this transformation isn't done before allocation, v1024 will overlap v1025 which means it more difficult to allocate a register pair. - Even with the register allocation hint, it may not be possible to get the desired allocation. In that case, the post-allocation load / store multiple pass must fix the ldrd / strd instructions. They can either become ldm / stm instructions or back to a pair of ldr / str instructions. This is work in progress, not yet enabled. llvm-svn: 73381	2009-06-15 08:28:29 +00:00
Evan Cheng	d0a66e438f	Add a ARM specific pre-allocation pass that re-schedule loads / stores from consecutive addresses togther. This makes it easier for the post-allocation pass to form ldm / stm. This is step 1. We are still missing a lot of ldm / stm opportunities because of register allocation are not done in the desired order. More enhancements coming. llvm-svn: 73291	2009-06-13 09:12:55 +00:00
Evan Cheng	98216808fe	If killed register is defined by implicit_def, do not clear it since it's live range may overlap another def of same register. llvm-svn: 73255	2009-06-12 21:34:26 +00:00
Evan Cheng	2f784781aa	Mark some pattern-less instructions as neverHasSideEffects. llvm-svn: 73252	2009-06-12 20:46:18 +00:00
Anton Korobeynikov	c82243e658	Add testcase for register scanveger assertion fix in r72755 (double def due to livevars) llvm-svn: 73096	2009-06-08 22:54:15 +00:00
Evan Cheng	ea31ec569b	Changing allocation ordering from r3 ... r0 back to r0 ... r3. The order change no longer make sense after the coalescing changes we have made since then. llvm-svn: 72955	2009-06-05 19:08:58 +00:00
Dan Gohman	5f6f8101d5	Split the Add, Sub, and Mul instruction opcodes into separate integer and floating-point opcodes, introducing FAdd, FSub, and FMul. For now, the AsmParser, BitcodeReader, and IRBuilder all preserve backwards compatability, and the Core LLVM APIs preserve backwards compatibility for IR producers. Most front-ends won't need to change immediately. This implements the first step of the plan outlined here: http://nondot.org/sabre/LLVMNotes/IntegerOverflow.txt llvm-svn: 72897	2009-06-04 22:49:04 +00:00
Evan Cheng	8a6c448ab0	A value defined by an implicit_def can be liven to a use BB. This is unfortunate. But register allocator still has to add it to the live-in set of the use BB. llvm-svn: 72888	2009-06-04 20:25:48 +00:00
Evan Cheng	e3a05e6690	Re-apply 72756 with fixes. One of those was introduced by we changed MachineInstrBuilder::addReg() interface. llvm-svn: 72826	2009-06-04 01:15:28 +00:00
Evan Cheng	82f8fa333e	Temporarily revert 72756 for now. llvm-svn: 72757	2009-06-03 07:40:47 +00:00
Evan Cheng	5afbef29fa	Fold preceding / trailing base inc / dec into the single load / store as well. llvm-svn: 72756	2009-06-03 06:14:58 +00:00
Bob Wilson	c6726ecca5	Fix pr4058 and pr4059. Do not split i64 or double arguments between r3 and the stack. Patch by Sandeep Patel. llvm-svn: 72106	2009-05-19 10:02:36 +00:00

... 2 3 4 5 6 ...

552 Commits