llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Owen Anderson	acda6f77a8	Reduce code duplication. llvm-svn: 127899	2011-03-18 19:46:58 +00:00
Bill Wendling	da1364d669	Generate a VTBL instruction instead of a series of loads and stores when we can. As Nate pointed out, VTBL isn't super performant, but it has to be better than this: _shuf: @ BB#0: @ %entry push {r4, r7, lr} add r7, sp, #4 sub sp, #12 mov r4, sp bic r4, r4, #7 mov sp, r4 mov r2, sp vmov d16, r0, r1 orr r0, r2, #6 orr r3, r2, #7 vst1.8 {d16[0]}, [r3] vst1.8 {d16[5]}, [r0] subs r4, r7, #4 orr r0, r2, #5 vst1.8 {d16[4]}, [r0] orr r0, r2, #4 vst1.8 {d16[4]}, [r0] orr r0, r2, #3 vst1.8 {d16[0]}, [r0] orr r0, r2, #2 vst1.8 {d16[2]}, [r0] orr r0, r2, #1 vst1.8 {d16[1]}, [r0] vst1.8 {d16[3]}, [r2] vldr.64 d16, [sp] vmov r0, r1, d16 mov sp, r4 pop {r4, r7, pc} The "illegal" testcase in vext.ll is no longer illegal. <rdar://problem/9078775> llvm-svn: 127630	2011-03-14 23:02:38 +00:00
Jim Grosbach	2226dfbea2	Remove dead code. These ARM instruction definitions no longer exist. llvm-svn: 127509	2011-03-11 23:15:02 +00:00
Bob Wilson	1497601a7b	Remove unused conditional negate operations. llvm-svn: 127090	2011-03-05 16:54:31 +00:00
Bob Wilson	6bbffe19e9	Add patterns to use post-increment addressing for Neon VST1-lane instructions. llvm-svn: 126477	2011-02-25 06:42:42 +00:00
Chris Lattner	bcf2d46d8a	Enhance ComputeMaskedBits to know that aligned frameindexes have their low bits set to zero. This allows us to optimize out explicit stack alignment code like in stack-align.ll:test4 when it is redundant. Doing this causes the code generator to start turning FI+cst into FI\|cst all over the place, which is general goodness (that is the canonical form) except that various pieces of the code generator don't handle OR aggressively. Fix this by introducing a new SelectionDAG::isBaseWithConstantOffset predicate, and using it in places that are looking for ADD(X,CST). The ARM backend in particular was missing a lot of addressing mode folding opportunities around OR. llvm-svn: 125470	2011-02-13 22:25:43 +00:00
Bob Wilson	65f4a70b82	Add codegen support for using post-increment NEON load/store instructions. The vld1-lane, vld1-dup and vst1-lane instructions do not yet support using post-increment versions, but all the rest of the NEON load/store instructions should be handled now. llvm-svn: 125014	2011-02-07 17:43:21 +00:00
Bob Wilson	46b105c6a2	Change VLD3/4 and VST3/4 for quad registers to not update the address register. These operations are expanded to pairs of loads or stores, and the first one uses the address register update to produce the address for the second one. So far, the second load/store has also updated the address register, just for convenience, since that output has never been used. In anticipation of actually supporting post-increment updates for these operations, this changes the non-updating operations to use a non-updating load/store for the second instruction. llvm-svn: 125013	2011-02-07 17:43:15 +00:00
Evan Cheng	6dc21c7358	Sorry, several patches in one. TargetInstrInfo: Change produceSameValue() to take MachineRegisterInfo as an optional argument. When in SSA form, targets can use it to make more aggressive equality analysis. Machine LICM: 1. Eliminate isLoadFromConstantMemory, use MI.isInvariantLoad instead. 2. Fix a bug which prevent CSE of instructions which are not re-materializable. 3. Use improved form of produceSameValue. ARM: 1. Teach ARM produceSameValue to look pass some PIC labels. 2. Look for operands from different loads of different constant pool entries which have same values. 3. Re-implement PIC GA materialization using movw + movt. Combine the pair with a "add pc" or "ldr [pc]" to form pseudo instructions. This makes it possible to re-materialize the instruction, allow machine LICM to hoist the set of instructions out of the loop and make it possible to CSE them. It's a bit hacky, but it significantly improve code quality. 4. Some minor bug fixes as well. With the fixes, using movw + movt to materialize GAs significantly outperform the load from constantpool method. 186.crafty and 255.vortex improved > 20%, 254.gap and 176.gcc ~10%. llvm-svn: 123905	2011-01-20 08:34:58 +00:00
Daniel Dunbar	dd1dd698f0	ARM/ISel: Factor out isScaledConstantInRange() helper. llvm-svn: 123823	2011-01-19 15:12:16 +00:00
Evan Cheng	53ec6fc591	Materialize GA addresses with movw + movt pairs for Darwin in PIC mode. e.g. movw r0, :lower16:(L_foo$non_lazy_ptr-(LPC0_0+4)) movt r0, :upper16:(L_foo$non_lazy_ptr-(LPC0_0+4)) LPC0_0: add r0, pc, r0 It's not yet enabled by default as some tests are failing. I suspect bugs in down stream tools. llvm-svn: 123619	2011-01-17 08:03:18 +00:00
Anton Korobeynikov	d37cb4cd1c	Model operand restrictions of mul-like instructions on ARMv5 via earlyclobber stuff. This should fix PRs 2313 and 8157. Unfortunately, no testcase, since it'd be dependent on register assignments. llvm-svn: 122663	2011-01-01 20:38:38 +00:00
Andrew Trick	53f4556c64	whitespace llvm-svn: 122539	2010-12-24 04:28:06 +00:00
Chris Lattner	65c5243bd6	rename MVT::Flag to MVT::Glue. "Flag" is a terrible name for something that just glues two nodes together, even if it is sometimes used for flags. llvm-svn: 122310	2010-12-21 02:38:05 +00:00
Bob Wilson	125f1a26e0	Use PairDRegs to implement ConcatVectors. No functionality change. llvm-svn: 122017	2010-12-17 01:21:08 +00:00
Jim Grosbach	84c2b29b58	Thumb1 had two patterns for the same load-from-constant-pool instruction. Canonicalize on tLDRpci and remove tLDRcp. llvm-svn: 121920	2010-12-15 23:52:36 +00:00
Bill Wendling	258739db7e	Reapply r121808 now that the missing patterns have been supplied. llvm-svn: 121820	2010-12-15 01:03:19 +00:00
Bill Wendling	ad13b53237	Revert r121808 until I can fix the build. llvm-svn: 121815	2010-12-15 00:04:00 +00:00
Bill Wendling	2883bbb8c0	Make the ISel selections for LDR/STR the same as before the LDRr/LDRi split. In particular, we want ldr r2, [r3] to be equivalent to ldr r2, [r3, #0] and not ldr r2, [r3, r0] llvm-svn: 121808	2010-12-14 23:40:49 +00:00
Bill Wendling	61720b79f9	The tLDR et al instructions were emitting either a reg/reg or reg/imm instruction based on the t_addrmode_s# mode and what it returned. There is some obvious badness to this. In particular, it's hard to do MC-encoding when the instruction may change out from underneath you after the t_addrmode_s# variable is finally resolved. The solution is to revert a long-ago change that merged the reg/reg and reg/imm versions. There is the addition of several new addressing modes. They no longer have extraneous operands associated with them. I.e., if it's reg/reg we don't have to have a dummy zero immediate tacked on to the SDNode. There are some obvious cleanups here, which will happen shortly. llvm-svn: 121747	2010-12-14 03:36:38 +00:00
Bob Wilson	5ff13f9d5c	Fix some invalid alignments for Neon vld-dup and vld/st-lane instructions. Alignments smaller than the total size of the memory being loaded or stored, unless the alignment is 8 bytes, are not allowed. Add tests for this, too. llvm-svn: 121506	2010-12-10 19:37:42 +00:00
Evan Cheng	fc78767730	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Bob Wilson	bd3d3d2937	Add support for NEON VLD3-dup instructions. The encoding for alignment in VLD4-dup instructions is still a work in progress. llvm-svn: 120356	2010-11-30 00:00:35 +00:00
Bob Wilson	aa197b07e6	Add support for NEON VLD3-dup instructions. llvm-svn: 120312	2010-11-29 19:35:29 +00:00
Bob Wilson	3bb61d1932	Add support for NEON VLD2-dup instructions. llvm-svn: 120236	2010-11-28 06:51:26 +00:00
Evan Cheng	2b30babdcc	Fix a cut-n-paste-error. llvm-svn: 119866	2010-11-19 23:01:16 +00:00
Evan Cheng	00c4672acc	Avoid isel movcc of large immediates when the large immediate is available in a register. These immediates aren't free. llvm-svn: 119558	2010-11-17 20:56:30 +00:00
Evan Cheng	a7d3c3d387	Add conditional move of large immediate. llvm-svn: 118968	2010-11-13 02:25:14 +00:00
Evan Cheng	a08372d4b4	Fix an obvious typo which inverted an immediate. llvm-svn: 118951	2010-11-13 00:27:47 +00:00
Evan Cheng	19f018a1be	Add conditional mvn instructions. llvm-svn: 118935	2010-11-12 22:42:47 +00:00
Duncan Sands	41edf30895	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
Jim Grosbach	c10d3f3d4b	Break ARM addrmode4 (load/store multiple base address) into its constituent parts. Represent the operation mode as an optional operand instead. rdar://8614429 llvm-svn: 118137	2010-11-03 01:01:43 +00:00
Bob Wilson	411b511ac0	Add support for alignment operands on VLD1-lane instructions. This is another part of the fix for Radar 8599955. llvm-svn: 117976	2010-11-01 23:40:51 +00:00
Evan Cheng	44d2802e1d	Shifter ops are not always free. Do not fold them (especially to form complex load / store addressing mode) when they have higher cost and when they have more than one use. llvm-svn: 117509	2010-10-27 23:41:30 +00:00
Jim Grosbach	30f6744f05	First part of refactoring ARM addrmode2 (load/store) instructions to be more explicit about the operands. Split out the different variants into separate instructions. This gives us the ability to, among other things, assign different scheduling itineraries to the variants. rdar://8477752. llvm-svn: 117409	2010-10-26 22:37:02 +00:00
Jim Grosbach	f35d3b2aa1	trailing whitespace llvm-svn: 117050	2010-10-21 19:38:40 +00:00
Bob Wilson	c3fb4427f4	Support alignment for NEON vld-lane and vst-lane instructions. llvm-svn: 116776	2010-10-19 00:16:32 +00:00
Jim Grosbach	c0a61c0796	Allow use of the 16-bit literal move instruction in CMOVs for Thumb2 mode. llvm-svn: 115890	2010-10-07 00:53:56 +00:00
Jim Grosbach	1e2566c20d	Allow use of the 16-bit literal move instruction in CMOVs for ARM mode. llvm-svn: 115884	2010-10-07 00:42:42 +00:00
Jim Grosbach	fd8705ab4a	Add specializations of addrmode2 that allow differentiating those forms which require the use of the shifter-operand. This will be used to split the ldr/str instructions such that those versions needing the shifter operand can get a different scheduling itenerary, as in some cases, the use of the shifter can cause different scheduling than the simpler forms. llvm-svn: 115066	2010-09-29 19:03:54 +00:00
Jim Grosbach	9939090a14	Add braces for legibility. llvm-svn: 115043	2010-09-29 17:32:29 +00:00
Bob Wilson	ff8139baa0	Set alignment operand for NEON VST instructions. llvm-svn: 114709	2010-09-23 23:42:37 +00:00
Bob Wilson	026ef4b7f8	Set alignment operand for NEON VLD instructions. llvm-svn: 114696	2010-09-23 21:43:54 +00:00
Chris Lattner	55043ef46a	fix a long standing wart: all the ComplexPattern's were being passed the root of the match, even though only a few patterns actually needed this (one in X86, several in ARM [which should be refactored anyway], and some in CellSPU that I don't feel like detangling). Instead of requiring all ComplexPatterns to take the dead root, have targets opt into getting the root by putting SDNPWantRoot on the ComplexPattern. llvm-svn: 114471	2010-09-21 20:31:19 +00:00
Eric Christopher	3d2aa0da5f	Fix QOpcode assignment to Opc. llvm-svn: 113837	2010-09-14 08:31:25 +00:00
Bob Wilson	ba02d5b620	Convert some VTBL and VTBX instructions to use pseudo instructions prior to register allocation. Remove the NEONPreAllocPass, which is no longer needed. Yeah!! llvm-svn: 113818	2010-09-13 23:55:10 +00:00
Bob Wilson	6f35180bec	Switch all the NEON vld-lane and vst-lane instructions over to the new pseudo-instruction approach. Change ARMExpandPseudoInsts to use a table to record all the NEON load/store information. llvm-svn: 113812	2010-09-13 23:01:35 +00:00
Chris Lattner	c0e5368884	remove some dead code. t2addrmode_imm8s4 is never used in a pattern, so there is no need to define a matching function. llvm-svn: 113122	2010-09-05 22:51:11 +00:00
Bob Wilson	8ef469b2a5	Finish converting the rest of the NEON VLD instructions to use pseudo- instructions prior to regalloc. Since it's getting a little close to the 2.8 branch deadline, I'll have to leave the rest of the instructions handled by the NEONPreAllocPass for now, but I didn't want to leave half of the VLD instructions converted and the other half not. llvm-svn: 112983	2010-09-03 18:16:02 +00:00
Bob Wilson	8951c7592c	Convert VLD1 and VLD2 instructions to use pseudo-instructions until after regalloc. llvm-svn: 112825	2010-09-02 16:00:54 +00:00
Chris Lattner	b74759a9fa	temporarily revert r112664, it is causing a decoding conflict, and the testcases should be merged. llvm-svn: 112711	2010-09-01 16:00:50 +00:00
Bill Wendling	bb6052cfd6	We have a chance for an optimization. Consider this code: int x(int t) { if (t & 256) return -26; return 0; } We generate this: tst.w r0, #256 mvn r0, #25 it eq moveq r0, #0 while gcc generates this: ands r0, r0, #256 it ne mvnne r0, #25 bx lr Scandalous really! During ISel time, we can look for this particular pattern. One where we have a "MOVCC" that uses the flag off of a CMPZ that itself is comparing an AND instruction to 0. Something like this (greatly simplified): %r0 = ISD::AND ... ARMISD::CMPZ %r0, 0 @ sets [CPSR] %r0 = ARMISD::MOVCC 0, -26 @ reads [CPSR] All we have to do is convert the "ISD::AND" into an "ARM::ANDS" that sets [CPSR] when it's zero. The zero value will all ready be in the %r0 register and we only need to change it if the AND wasn't zero. Easy! llvm-svn: 112664	2010-08-31 22:41:22 +00:00
Bob Wilson	956e07b985	Use pseudo instructions for VST1 and VST2. llvm-svn: 112357	2010-08-28 05:12:57 +00:00
Bob Wilson	abdcae7f20	We don't need to custom-select VLDMQ and VSTMQ anymore. llvm-svn: 112336	2010-08-28 00:20:11 +00:00
Bob Wilson	31d487d235	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Bob Wilson	efc503afd2	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Bob Wilson	e74da18e57	Use pseudo instructions for VST1d64Q. llvm-svn: 112170	2010-08-26 05:33:30 +00:00
Bob Wilson	b85b3cf91f	Start converting NEON load/stores to use pseudo instructions, beginning here with the VST4 instructions. Until after register allocation, we want to represent sets of adjacent registers by a single super-register. These VST4 pseudo instructions have a single QQ or QQQQ source register operand. They get expanded to the real VST4 instructions with 4 separate D register operands. Once this conversion is complete, we'll be able to remove the NEONPreAllocPass and avoid some fragile and hacky code elsewhere. llvm-svn: 112108	2010-08-25 23:27:42 +00:00
Jakob Stoklund Olesen	20dbe1681b	Don't call tablegen'ed Predicate_* functions in the ARM target. llvm-svn: 111277	2010-08-17 20:39:04 +00:00
Evan Cheng	ee59acf6dd	Add -disable-shifter-op to disable isel of shifter ops. On Cortex-a9 the shifts cost extra instructions so it might be better to emit them separately to take advantage of dual-issues. llvm-svn: 109934	2010-07-30 23:33:54 +00:00
Bob Wilson	822b21f0de	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Bob Wilson	ce80768ebf	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Duncan Sands	25a2d02c5c	Remove an unused and a pointless variable. llvm-svn: 107131	2010-06-29 13:00:29 +00:00
Dan Gohman	0b5b93b20d	Eliminate unnecessary uses of getZExtValue(). llvm-svn: 106279	2010-06-18 14:22:04 +00:00
Bob Wilson	d81a716d59	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Bob Wilson	2945a0ac66	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Dale Johannesen	891a19d5ae	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Jim Grosbach	f3bd81ce11	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Bob Wilson	e2ff075e4c	Add the cc_out operand for t2RSBrs instructions. I missed this when I changed the instruction class for t2RSB to add that operand in svn r104582. Radar 8033757. llvm-svn: 104907	2010-05-28 00:27:15 +00:00
Jakob Stoklund Olesen	d48a2f5afd	Fix a few places that depended on the numeric value of subreg indices. Add assertions in places that depend on consecutive indices. llvm-svn: 104510	2010-05-24 17:13:28 +00:00
Jakob Stoklund Olesen	ac6f519e79	Switch ARMRegisterInfo.td to use SubRegIndex and eliminate the parallel enums from ARMRegisterInfo.h llvm-svn: 104508	2010-05-24 16:54:32 +00:00
Evan Cheng	b7657dfa38	Target instruction selection should copy memoperands. llvm-svn: 104110	2010-05-19 06:06:09 +00:00
Evan Cheng	3bce87c79f	Turn on -neon-reg-sequence by default. Using NEON load / store multiple instructions will no longer create gobs of vmov of D registers! llvm-svn: 103960	2010-05-17 19:51:20 +00:00
Evan Cheng	7facc1504a	Model vst lane instructions with REG_SEQUENCE. llvm-svn: 103898	2010-05-16 03:27:48 +00:00
Evan Cheng	9301a88cbb	Model 128-bit vld lane with REG_SEQUENCE. llvm-svn: 103868	2010-05-15 07:53:37 +00:00
Evan Cheng	db2c1543c8	Model 64-bit lane vld with REG_SEQUENCE. llvm-svn: 103851	2010-05-15 01:36:29 +00:00
Evan Cheng	ba77bacf48	Model VST_UPD and VSToddUPD pair with REG_SEQUENCE. llvm-svn: 103833	2010-05-14 22:54:52 +00:00
Evan Cheng	ab9e0f7315	Model VLD_UPD and VLDodd_UPD pair with REG_SEQUENCE. llvm-svn: 103790	2010-05-14 18:54:59 +00:00
Evan Cheng	d1f60c198d	Fix comments. llvm-svn: 103749	2010-05-14 00:21:45 +00:00
Evan Cheng	5311f25b0b	Model some vst3 and vst4 with reg_sequence. llvm-svn: 103453	2010-05-11 01:19:40 +00:00
Evan Cheng	91747dc5dc	Model some vld3 instructions with REG_SEQUENCE. llvm-svn: 103437	2010-05-10 21:26:24 +00:00
Evan Cheng	7f0d8f1ab0	Model vld2 / vst2 with reg_sequence. llvm-svn: 103411	2010-05-10 17:34:18 +00:00
Bob Wilson	e5f295c76d	Add a missing break statement to fix unintentional fall-through (replacing the previous patch for the same issue). llvm-svn: 103183	2010-05-06 16:05:26 +00:00
Jim Grosbach	bffae8af60	Fix unintentional fallthrough. Patch by Edmund Grimley-Evans <Edmund.Grimley-Evans@arm.com> llvm-svn: 103181	2010-05-06 15:32:49 +00:00
Evan Cheng	6a76e7d9ae	Model CONCAT_VECTORS of two 64-bit values as a REG_SEQUENCE. llvm-svn: 103104	2010-05-05 18:28:36 +00:00
Evan Cheng	1440201528	With -neon-reg-sequence, models forming a Q register from a pair of consecutive D registers as a REG_SEQUENCE. llvm-svn: 103047	2010-05-04 20:39:49 +00:00
Jim Grosbach	b9dccb6103	Update ARM DAGtoDAG for matching UBFX instruction for unsigned bitfield extraction. This fixes PR5998. llvm-svn: 102144	2010-04-22 23:24:18 +00:00
Dan Gohman	a0f855157e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Evan Cheng	428f7f39a8	Use getAL() rather than a major constant. llvm-svn: 101446	2010-04-16 05:46:06 +00:00
Evan Cheng	c843326d60	Use default lowering of DYNAMIC_STACKALLOC. As far as I can tell, ARM isle is doing the right thing and codegen looks correct for both Thumb and Thumb2. llvm-svn: 101410	2010-04-15 22:20:34 +00:00
Evan Cheng	2f6d7ecd1b	ARM SelectDYN_ALLOC should emit a copy from SP rather than referencing SP directly. In cases where there are two dyn_alloc in the same BB it would have caused the old SP value to be reused and badness ensues. rdar://7493908 llvm is generating poor code for dynamic alloca, I'll fix that later. llvm-svn: 101383	2010-04-15 18:42:28 +00:00
Bob Wilson	195c570fd3	Fix VLDMQ and VSTMQ instructions to use the correct encoding and address modes. These instructions are only needed for codegen, so I've removed all the explicit encoding bits for now; they should be set in the same way as the for VLDMD and VSTMD whenever we add encodings for VFP. The use of addrmode5 requires that the instructions be custom-selected so that the number of registers can be set in the AM5Opc value. llvm-svn: 99309	2010-03-23 18:54:46 +00:00
Bob Wilson	58c4740582	Change VST1 instructions for loading Q register values to operate on pairs of D registers. Add a separate VST1q instruction with a Q register source operand for use by storeRegToStackSlot. llvm-svn: 99265	2010-03-23 06:20:33 +00:00
Bob Wilson	2764399dd8	Change VLD1 instructions for loading Q register values to operate on pairs of D registers. Add a separate VLD1q instruction with a Q register destination operand for use by loadRegFromStackSlot. llvm-svn: 99261	2010-03-23 05:25:43 +00:00
Bob Wilson	f23a45e151	Rename some VLD1/VST1 instructions to match the implementation, i.e., the corresponding NEON instructions, instead of operation they are currently used for. llvm-svn: 99189	2010-03-22 18:13:18 +00:00
Bob Wilson	73d7323c91	Re-commit r98683 ("remove redundant writeback flag from ARM address mode 6") with changes to add a separate optional register update argument. Change all the NEON instructions with address register writeback to use it. llvm-svn: 99095	2010-03-20 22:13:40 +00:00
Bob Wilson	a98f30a3a2	Rename some instructions for consistency and sanity: use "_UPD" suffix for load/stores with address register writeback, and use "odd" suffix to distinguish instructions to access odd numbered registers (instead of "a" and "b"). No functional changes. llvm-svn: 99066	2010-03-20 18:35:24 +00:00
Bob Wilson	3778e7f389	Revert 98683. It is breaking something in the disassembler. llvm-svn: 98692	2010-03-16 23:01:13 +00:00
Bob Wilson	79f10e6233	Remove redundant writeback flag from ARM address mode 6. Also remove the optional register update argument, which is currently unused -- when we add support for that, it can just be a separate operand. llvm-svn: 98683	2010-03-16 21:44:40 +00:00
Chris Lattner	1707a88a2c	Sink InstructionSelect() out of each target into SDISel, and rename it DoInstructionSelection. Inline "SelectRoot" into it from DAGISelHeader. Sink some other stuff out of DAGISelHeader into SDISel. Eliminate the various 'Indent' stuff from various targets, which dates to when isel was recursive. 17 files changed, 114 insertions(+), 430 deletions(-) llvm-svn: 97555	2010-03-02 06:34:30 +00:00

1 2 3 4 5 ...

413 Commits