llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 12:02:58 +02:00

Author	SHA1	Message	Date
Bob Wilson	0ec96428ba	Check for extractelement with a variable operand for the element number. For NEON we had been assuming this was always an immediate constant. llvm-svn: 118175	2010-11-03 16:24:50 +00:00
Duncan Sands	41edf30895	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
Evan Cheng	eab7251695	Fix preload instruction isel. Only v7 supports pli, and only v7 with mp extension supports pldw. Add subtarget attribute to denote mp extension support and legalize illegal ones to nothing. llvm-svn: 118160	2010-11-03 06:34:55 +00:00
Evan Cheng	b41703bc2f	Add support to match @llvm.prefetch to pld / pldw / pli. rdar://8601536. llvm-svn: 118152	2010-11-03 05:14:24 +00:00
Bob Wilson	a9c593e696	NEON does not support truncating vector stores. Radar 8598391. llvm-svn: 117940	2010-11-01 18:31:39 +00:00
Bob Wilson	183c466006	Overhaul memory barriers in the ARM backend. Radar 8601999. There were a number of issues to fix up here: * The "device" argument of the llvm.memory.barrier intrinsic should be used to distinguish the "Full System" domain from the "Inner Shareable" domain. It has nothing to do with using DMB vs. DSB instructions. * The compiler should never need to emit DSB instructions. Remove the ARMISD::SYNCBARRIER node and also remove the instruction patterns for DSB. * Merge the separate DMB/DSB instructions for options only used for the disassembler with the default DMB/DSB instructions. Add the default "full system" option ARM_MB::SY to the ARM_MB::MemBOpt enum. * Add a separate ARMISD::MEMBARRIER_MCR node for subtargets that implement a data memory barrier using the MCR instruction. * Fix up encodings for these instructions (except MCR). I also updated the tests and added a few new ones to check for DMB options that were not currently being exercised. llvm-svn: 117756	2010-10-30 00:54:37 +00:00
Evan Cheng	92293993bd	- Don't schedule nodes with only MVT::Flag and MVT::Other values for latency. - Compute CopyToReg use operand latency correctly. llvm-svn: 117674	2010-10-29 18:07:31 +00:00
John Thompson	6115a7f1d4	Inline asm multiple alternative constraints development phase 2 - improved basic logic, added initial platform support. llvm-svn: 117667	2010-10-29 17:29:13 +00:00
Bob Wilson	2f8b69b196	Fix compiler warnings about signed/unsigned comparisons. llvm-svn: 117511	2010-10-27 23:49:00 +00:00
Bob Wilson	cdc8dff3ac	SelectionDAG shuffle nodes do not allow operands with different numbers of elements than the result vector type. So, when an instruction like: %8 = shufflevector <2 x float> %4, <2 x float> %7, <4 x i32> <i32 1, i32 0, i32 3, i32 2> is translated to a DAG, each operand is changed to a concat_vectors node that appends 2 undef elements. That is: shuffle [a,b], [c,d] is changed to: shuffle [a,b,u,u], [c,d,u,u] That's probably the right thing for x86 but for NEON, we'd much rather have: shuffle [a,b,c,d], undef Teach the DAG combiner how to do that transformation for ARM. Radar 8597007. llvm-svn: 117482	2010-10-27 20:38:28 +00:00
Evan Cheng	71b2f935db	Enable ARM fastcc. llvm-svn: 117194	2010-10-23 02:19:37 +00:00
Evan Cheng	efac5b5f8d	Add fastcc cc: pass and return VFP / NEON values in registers. Controlled by -arm-fastcc for now. llvm-svn: 117119	2010-10-22 18:23:05 +00:00
Dale Johannesen	a324c8c6bd	Fix crash introduced in 116852. 8573915. llvm-svn: 116955	2010-10-20 22:03:37 +00:00
Jim Grosbach	a8c0be5343	Add a pre-dispatch SjLj EH hook on the unwind edge for targets to do any setup they require. Use this for ARM/Darwin to rematerialize the base pointer from the frame pointer when required. rdar://8564268 llvm-svn: 116879	2010-10-19 23:27:08 +00:00
Dale Johannesen	ee87cbe4e9	Enable using vdup for vector constants which are splat of integers by default, and remove the controlling flag, now that LICM will hoist such vdup's. 8003375. llvm-svn: 116852	2010-10-19 20:00:17 +00:00
Jim Grosbach	440b0e6b34	Don't mark argument value stores as immutable, as otherwise the post-RA scheduler may reorder loads from them before the stores and other such badness. PR8347. Patch by David Meyer llvm-svn: 116602	2010-10-15 18:34:47 +00:00
Bob Wilson	6b6b53ad6f	Remove unused ARMISD::AND selection DAG node. llvm-svn: 116566	2010-10-15 04:34:40 +00:00
Anton Korobeynikov	f1be021755	User proper libcall names & condcodes while compiling for ARM EABI. Patch by Evzen Muller! llvm-svn: 114991	2010-09-28 21:39:26 +00:00
Bob Wilson	dc396388cb	Add a command line option "-arm-strict-align" to disallow unaligned memory accesses for ARM targets that would otherwise allow it. Radar 8465431. llvm-svn: 114941	2010-09-28 04:09:35 +00:00
Evan Cheng	1d50dccdc5	Enable code placement optimization pass for ARM. llvm-svn: 114746	2010-09-24 19:07:23 +00:00
Jim Grosbach	d8735f1db1	Add support for ELF PLT references for ARM MC asm printing. Adding a new VariantKind to the MCSymbolExpr seems like overkill, but I'm not sure there's a more straightforward way to get the printing difference captured. (i.e., x86 uses @PLT, ARM uses (PLT)). llvm-svn: 114613	2010-09-22 23:27:36 +00:00
Bob Wilson	0f341d4792	Change VDUPLANE DAG combiner to just return the result instead of calling CombineTo to avoid putting the result on the worklist. I don't think it makes much difference for now, but it might help someday as we add more DAG combine optimizations. llvm-svn: 114595	2010-09-22 22:27:30 +00:00
Bob Wilson	11b219e461	Combine both VMOVDRR(VMOVRRD) and VMOVRRD(VMOVDRR), instead of just doing one of those. Refactor to share code for handling BUILD_VECTOR(VMOVRRD). I don't have a testcase that exercises this, but it seems like an obvious good thing to do. llvm-svn: 114589	2010-09-22 22:09:21 +00:00
Owen Anderson	d9fd152c3a	Enable target-specific mul-lowering on ARM, even at -Os. Remove a test that this makes irrelevant, but add a new test for the new, improved functionality. llvm-svn: 114494	2010-09-21 22:51:46 +00:00
Chris Lattner	3dde58c15a	convert a couple more places to use the new getStore() llvm-svn: 114463	2010-09-21 18:51:21 +00:00
Bob Wilson	c4345abcc0	Define the TargetLowering::getTgtMemIntrinsic hook for ARM so that NEON load and store intrinsics are represented with MemIntrinsicSDNodes. llvm-svn: 114454	2010-09-21 17:56:22 +00:00
Chris Lattner	4320dda4fb	convert the targets off the non-MachinePointerInfo of getLoad. llvm-svn: 114410	2010-09-21 06:44:06 +00:00
Chris Lattner	f94de5bf46	reimplement memcpy/memmove/memset lowering to use MachinePointerInfo instead of srcvalue/offset pairs. This corrects SV info for mem operations whose size is > 32-bits. llvm-svn: 114401	2010-09-21 05:40:29 +00:00
Bob Wilson	670e1915c0	Add target-specific DAG combiner for BUILD_VECTOR and VMOVRRD. An i64 value should be in GPRs when it's going to be used as a scalar, and we use VMOVRRD to make that happen, but if the value is converted back to a vector we need to fold to a simple bit_convert. Radar 8407927. llvm-svn: 114233	2010-09-17 22:59:05 +00:00
Eric Christopher	ae179e4cbd	Split out some of the calling convention bits so that they can be used for fast-isel. llvm-svn: 113652	2010-09-10 22:42:06 +00:00
Evan Cheng	c9cb37516d	Teach if-converter to be more careful with predicating instructions that would take multiple cycles to decode. For the current if-converter clients (actually only ARM), the instructions that are predicated on false are not nops. They would still take machine cycles to decode. Micro-coded instructions such as LDM / STM can potentially take multiple cycles to decode. If-converter should take treat them as non-micro-coded simple instructions. llvm-svn: 113570	2010-09-10 01:29:16 +00:00
Jim Grosbach	6aba2dc40a	remove trailing whitespace llvm-svn: 113338	2010-09-08 03:54:02 +00:00
Bob Wilson	24fa0b33b1	Replace NEON vabdl, vaba, and vabal intrinsics with combinations of the vabd intrinsic and add and/or zext operations. In the case of vaba, this also avoids the need for a DAG combine pattern to combine vabd with add. Update tests. Auto-upgrade the old intrinsics. llvm-svn: 112941	2010-09-03 01:35:08 +00:00
Bob Wilson	3348d2eb50	Remove NEON vmull, vmlal, and vmlsl intrinsics, replacing them with multiply, add, and subtract operations with zero-extended or sign-extended vectors. Update tests. Add auto-upgrade support for the old intrinsics. llvm-svn: 112773	2010-09-01 23:50:19 +00:00
Bill Wendling	385ad1516f	Create an ARMISD::AND node. This node is exactly like the "ARM::AND" node, but it sets the CPSR register. llvm-svn: 112393	2010-08-29 03:02:11 +00:00
Daniel Dunbar	9b7c2ce591	ARM/Thumb2: Fix a misselect in getARMCmp, when attempting to adjust a signed comparison that would overflow. - The other under/overflow cases can't actually happen because the immediates which would trigger them are legal (so we don't enter this code), but adjusted the style to make it clear the transform is always valid. llvm-svn: 112053	2010-08-25 16:58:05 +00:00
Bob Wilson	0039bc228b	Replace the arm.neon.vmovls and vmovlu intrinsics with vector sign-extend and zero-extend operations. llvm-svn: 111614	2010-08-20 04:54:02 +00:00
Bob Wilson	412be3eea6	Expand ZERO_EXTEND operations for NEON vector types. Testcase from Nick Lewycky. llvm-svn: 111341	2010-08-18 01:45:52 +00:00
Bob Wilson	6239dc42c6	Allow more cases of undef shuffle indices and add tests for them. llvm-svn: 111226	2010-08-17 05:54:34 +00:00
Bob Wilson	1e40f2351c	Ignore undef shuffle indices when checking for a VTRN shuffle. Radar 8290937. llvm-svn: 111208	2010-08-16 23:37:17 +00:00
Bob Wilson	ca672ee828	Temporarily disable tail calls on ARM to work around some linker problems. llvm-svn: 111050	2010-08-13 22:43:33 +00:00
Jim Grosbach	1128a47289	cortex m4 has floating point support, but only single precision. llvm-svn: 110810	2010-08-11 15:44:15 +00:00
Bill Wendling	f10d5c00fc	Consider this code snippet: float t1(int argc) { return (argc == 1123) ? 1.234f : 2.38213f; } We would generate truly awful code on ARM (those with a weak stomach should look away): _t1: movw r1, #1123 movs r2, #1 movs r3, #0 cmp r0, r1 mov.w r0, #0 it eq moveq r0, r2 movs r1, #4 cmp r0, #0 it ne movne r3, r1 adr r0, #LCPI1_0 ldr r0, [r0, r3] bx lr The problem was that legalization was creating a cascade of SELECT_CC nodes, for for the comparison of "argc == 1123" which was fed into a SELECT node for the ?: statement which was itself converted to a SELECT_CC node. This is because the ARM back-end doesn't have custom lowering for SELECT nodes, so it used the default "Expand". I added a fairly simple "LowerSELECT" to the ARM back-end. It takes care of this testcase, but can obviously be expanded to include more cases. Now we generate this, which looks optimal to me: _t1: movw r1, #1123 movs r2, #0 cmp r0, r1 adr r0, #LCPI0_0 it eq moveq r2, #4 ldr r0, [r0, r2] bx lr .align 2 LCPI0_0: .long 1075344593 @ float 2.382130e+00 .long 1067316150 @ float 1.234000e+00 llvm-svn: 110799	2010-08-11 08:43:16 +00:00
Evan Cheng	5fca4ca5f9	- Add subtarget feature -mattr=+db which determine whether an ARM cpu has the memory and synchronization barrier dmb and dsb instructions. - Change instruction names to something more sensible (matching name of actual instructions). - Added tests for memory barrier codegen. llvm-svn: 110785	2010-08-11 06:22:01 +00:00
Evan Cheng	784a286b92	Delete some unused instructions. llvm-svn: 110710	2010-08-10 19:36:22 +00:00
Evan Cheng	d9a1b0d046	Re-apply r110655 with fixes. Epilogue must restore sp from fp if the function stack frame has a var-sized object. Also added a test case to check for the added benefit of this patch: it's optimizing away the unnecessary restore of sp from fp for some non-leaf functions. llvm-svn: 110707	2010-08-10 19:30:19 +00:00
Daniel Dunbar	872e84afb5	Revert r110655, "Fix ARM hasFP() semantics. It should return true whenever FP register is", it breaks a couple test-suite tests. llvm-svn: 110701	2010-08-10 18:32:02 +00:00
Evan Cheng	3d47dbe761	Fix ARM hasFP() semantics. It should return true whenever FP register is reserved, not available for general allocation. This eliminates all the extra checks for Darwin. This change also fixes the use of FP to access frame indices in leaf functions and cleaned up some confusing code in epilogue emission. llvm-svn: 110655	2010-08-10 06:26:49 +00:00
Dale Johannesen	53bc276b33	Remove switch for disabling ARM tail calls. They seem to be working correctly. No functional change. llvm-svn: 110226	2010-08-04 18:07:17 +00:00
Bob Wilson	6a2437480a	Combine NEON VABD (absolute difference) intrinsics with ADDs to make VABA (absolute difference with accumulate) intrinsics. Radar 8228576. llvm-svn: 110170	2010-08-04 00:12:08 +00:00

1 2 3 4 5 ...

502 Commits