llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 22:42:46 +02:00

Author	SHA1	Message	Date
Jim Grosbach	657ab4a8ee	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should explicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Devang Patel	7ab104353b	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Dan Gohman	808f334f79	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Dan Gohman	4d264f7e51	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	6a73079aba	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Evan Cheng	47f3a2db40	Remove isSS argument from CreateFixedObject. Fixed objects cannot be spill slots so it's always false. llvm-svn: 107550	2010-07-03 00:40:23 +00:00
Bob Wilson	17dc7d716b	ARM function alignments were off by a power of two. svn 83242 changed getFunctionAlignment and the corresponding use of that value in the ARM asm printer, but now we're using the standard asm printer. The result of this was that function alignments were dropped completely for Thumb functions. Radar 8143571. llvm-svn: 107435	2010-07-01 22:26:26 +00:00
Duncan Sands	b955b3bf92	Remove initialized but otherwise unused variables. llvm-svn: 107127	2010-06-29 11:22:26 +00:00
Eli Friedman	893ce468be	Followup to r106770: actually generate SXTB and SXTH for sign-extensions. llvm-svn: 106940	2010-06-26 04:36:50 +00:00
Evan Cheng	f222c09b61	It's now possible to run code placement pass for ARM. llvm-svn: 106935	2010-06-26 01:52:05 +00:00
Evan Cheng	346aecdb8b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Dale Johannesen	b1fc776fca	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Bob Wilson	0a84b9b677	Reduce indentation. llvm-svn: 106819	2010-06-25 04:12:31 +00:00
Dale Johannesen	e618e80a13	Do not do tail calls to external symbols. If the branch turns out to be ARM-to-Thumb or vice versa the linker cannot resolve this. 8120438. If this optimization is going to be useful we probably need a compiler flag "assume callees are same architecture" or something like that. llvm-svn: 106662	2010-06-23 18:52:34 +00:00
Jim Grosbach	414eb48a14	When using libcall expansions for the atomic intrinsics, the explicit MEMBARRIER fences aren't necessary for ARM. Tell the combiner to fold them away. llvm-svn: 106631	2010-06-23 16:08:49 +00:00
Bob Wilson	ef565a2ffd	sign_extend_inreg needs to be expanded for pre-v6 Thumb as well as ARM. Radar 8104310. llvm-svn: 106484	2010-06-21 21:27:34 +00:00
Bob Wilson	44afe2065d	Fix error message to match function name. llvm-svn: 106381	2010-06-19 05:32:09 +00:00
Evan Cheng	f40b8f0e32	Disable sibcall optimization for Thumb1 for now since Thumb1RegisterInfo::emitEpilogue is not expecting them. llvm-svn: 106368	2010-06-19 01:01:32 +00:00
Jim Grosbach	b8c94667a8	back-end libcall handling for ATOMIC_SWAP (__sync_lock_test_and_set) llvm-svn: 106342	2010-06-18 23:03:10 +00:00
Jim Grosbach	c599143b45	Enable Expand handling of atomics for subtargets that can't do them inline. llvm-svn: 106336	2010-06-18 22:35:32 +00:00
Dale Johannesen	a441c8fd45	Enable tail calls on ARM by default, with some basic tests. This has been well tested on Darwin but not elsewhere. It should work provided the linker correctly resolves B.W <label in other function> which it has not seen before, at least from llvm-based compilers. I'm leaving the arm-tail-calls switch in until I see if there's any problems because of that; it might need to be disabled for some environments. llvm-svn: 106299	2010-06-18 19:00:18 +00:00
Dale Johannesen	9f18fc3fa2	Last round of changes for ARM tail calls. Not turning them on yet. llvm-svn: 106295	2010-06-18 18:13:11 +00:00
Jakob Stoklund Olesen	6c387d99ca	Treat the ARM inline asm {cc} constraint as a physreg (%CPSR), just like X86 does for {flags}. If we create virtual registers of the CCR class, RegAllocFast may try to spill them, and we can't do that. llvm-svn: 106289	2010-06-18 16:49:33 +00:00
Jim Grosbach	f3f401f911	Thumb1 and any pre-v6 ARM target should use the libcall expansion of ISD::MEMBARRIER. v7 and v7 ARM mode continue to use the custom lowering. llvm-svn: 106204	2010-06-17 02:02:03 +00:00
Jim Grosbach	8d77e0298c	simplify code a bit and add a more explanatory assert for cases that previously would result in 'cannot yet select' errors. llvm-svn: 106199	2010-06-17 01:37:00 +00:00
Jim Grosbach	b5bea8fdba	format and 80-column cleanup llvm-svn: 106173	2010-06-16 23:45:49 +00:00
Bob Wilson	d81a716d59	Remove the hidden "neon-reg-sequence" option. The reg sequences are working now, so there's no need to disable them. llvm-svn: 106155	2010-06-16 21:34:01 +00:00
Evan Cheng	46b89e05fd	Make post-ra scheduling, anti-dep breaking, and register scavenger (conservatively) aware of predicated instructions. This enables ARM to move if-conversion before post-ra scheduler. llvm-svn: 106091	2010-06-16 07:35:02 +00:00
Dale Johannesen	e60351e83a	Next round of tail call changes. Register used in a tail call must not be callee-saved; following x86, add a new regclass to represent this. Also fixes a couple of bugs. Still disabled by default; Thumb doesn't work yet. llvm-svn: 106053	2010-06-15 22:08:33 +00:00
Bob Wilson	56db632295	Add basic support for NEON modified immediates besides VMOV. llvm-svn: 106030	2010-06-15 19:05:35 +00:00
Bob Wilson	32016c38ee	Rename functions referring to VMOV immediates to refer to NEON "modified immediate" operands. These functions have so far only been used for VMOV but they also apply to other NEON instructions with modified immediate operands. No functional changes. llvm-svn: 105969	2010-06-14 22:19:57 +00:00
Bob Wilson	470551e0ef	Add a missing bitcast. This code used to only handle conversions between i64 and f64 types, but now it also handles Neon vector types, so the f64 result of VMOVDRR may need to be converted to a Neon type. Radar 8084742. llvm-svn: 105845	2010-06-11 22:45:25 +00:00
Bob Wilson	5e3c60fb63	Add instruction encoding for the Neon VMOV immediate instruction. This changes the machine instruction representation of the immediate value to be encoded into an integer with similar fields as the actual VMOV instruction. This makes things easier for the disassembler, since it can just stuff the bits into the immediate operand, but harder for the asm printer since it has to decode the value to be printed. Testcase for the encoding will follow later when MC has more support for ARM. llvm-svn: 105836	2010-06-11 21:34:50 +00:00
Bob Wilson	9cf6656d4b	Further changes for Neon vector shuffles: - change isShuffleMaskLegal to show that all shuffles with 32-bit and 64-bit elements are legal - the Neon shuffle instructions do not support 64-bit elements, but we were not checking for that before lowering shuffles to use them - remove some 64-bit element vduplane patterns that are no longer needed llvm-svn: 105586	2010-06-07 23:53:38 +00:00
Dale Johannesen	df4dc9ed33	Improvements to tail call code. No functional effect unless using -arm-tail-calls. llvm-svn: 105515	2010-06-05 00:51:39 +00:00
Dale Johannesen	f47a852290	More thoroughly disable tail calls by default. 8060143, although this doesn't fix the real problem with tail call. llvm-svn: 105472	2010-06-04 18:04:24 +00:00
Bob Wilson	2945a0ac66	For NEON vectors with 32- or 64-bit elements, select BUILD_VECTORs and VECTOR_SHUFFLEs to REG_SEQUENCE instructions. The standard ISD::BUILD_VECTOR node corresponds closely to REG_SEQUENCE but I couldn't use it here because its operands do not get legalized. That is pretty awful, but I guess it makes sense for other targets. Instead, I have added an ARM-specific version of BUILD_VECTOR that will have its operands properly legalized. This fixes the rest of Radar 7872877. llvm-svn: 105439	2010-06-04 00:04:02 +00:00
Dale Johannesen	891a19d5ae	Early implementation of tail call for ARM. A temporary flag -arm-tail-calls defaults to off, so there is no functional change by default. Intrepid users may try this; simple cases work but there are bugs. llvm-svn: 105413	2010-06-03 21:09:53 +00:00
Jim Grosbach	f3bd81ce11	Clean up 80 column violations. No functional change. llvm-svn: 105350	2010-06-02 21:53:11 +00:00
Evan Cheng	01ab4e1d5a	Schedule high latency instructions for latency reduction even if they are not vfp / NEON instructions. llvm-svn: 105060	2010-05-28 23:25:23 +00:00
Jim Grosbach	b004e2cf0f	Update the saved stack pointer in the sjlj function context following either an alloca() or an llvm.stackrestore(). rdar://8031573 llvm-svn: 104900	2010-05-27 23:49:24 +00:00
Jim Grosbach	d788f9b580	back out 104862/104869. Can reuse stacksave after all. Very cool. llvm-svn: 104897	2010-05-27 23:11:57 +00:00
Jim Grosbach	c2c7753f15	add ISD::STACKADDR to get the current stack pointer. Will be used by sjlj EH to update the jmpbuf in the presence of VLAs. llvm-svn: 104862	2010-05-27 18:23:48 +00:00
Jim Grosbach	bb4860d2a2	Adjust eh.sjlj.setjmp to properly have a chain and to have an opcode entry in ISD::. No functional change. llvm-svn: 104734	2010-05-26 20:22:18 +00:00
Bob Wilson	49df2d928d	Clean up indentation. llvm-svn: 104580	2010-05-25 03:36:52 +00:00
Evan Cheng	9b011e343c	LR is in GPR, not tGPR even in Thumb1 mode. llvm-svn: 104518	2010-05-24 18:00:18 +00:00
Bob Wilson	c7efe760d6	VDUP doesn't support vectors with 64-bit elements. llvm-svn: 104455	2010-05-23 05:42:31 +00:00
Evan Cheng	241d2c434e	Implement @llvm.returnaddress. rdar://8015977. llvm-svn: 104421	2010-05-22 01:47:14 +00:00
Jim Grosbach	b6cc69c655	Implement eh.sjlj.longjmp for ARM. Clean up the intrinsic a bit. Followups: docs patch for the builtin and eh.sjlj.setjmp cleanup to match longjmp. llvm-svn: 104419	2010-05-22 01:06:18 +00:00
Bob Wilson	b8ebb375b6	Recognize more BUILD_VECTORs and VECTOR_SHUFFLEs that can be implemented by copying VFP subregs. This exposed a bunch of dead code in the *spill-q.ll tests, so I tweaked those tests to keep that code from being optimized away. Radar 7872877. llvm-svn: 104415	2010-05-22 00:23:12 +00:00
Evan Cheng	6397a77e16	Change ARM scheduling default to list-hybrid if the target supports floating point instructions (and is not using soft float). llvm-svn: 104307	2010-05-21 00:43:17 +00:00
Evan Cheng	b5de7de4ce	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Bob Wilson	11aebf39f1	Handle Neon v2f64 and v2i64 vector shuffles as register copies. This fixes the remaining issue with pr7167. llvm-svn: 104257	2010-05-20 18:39:53 +00:00
Evan Cheng	46e08acfa5	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Evan Cheng	e2980af336	Sink dag combine's post index load / store code that swap base ptr and index into the target hook. Only the target knows whether the swap is safe. In Thumb2 mode, the offset must be an immediate. rdar://7998649 llvm-svn: 104060	2010-05-18 21:31:17 +00:00
Anton Korobeynikov	a80267a946	Generalize the ARM DAG combiner of mul with constants to all power-of-two cases. llvm-svn: 103901	2010-05-16 08:54:20 +00:00
Anton Korobeynikov	314ccc5501	Some cheap DAG combine goodness for multiplication with a particular constant. This can be extended later on to handle more "complex" constants. llvm-svn: 103881	2010-05-15 18:16:59 +00:00
Evan Cheng	16f27a70ef	v4i64 and v8i64 are only synthesizable when NEON is available. llvm-svn: 103855	2010-05-15 02:20:21 +00:00
Evan Cheng	85497bd415	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Evan Cheng	2af2c9fa14	Added a QQQQ register file to model 4-consecutive Q registers. llvm-svn: 103760	2010-05-14 02:13:41 +00:00
Dan Gohman	fb6f4da0e0	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Evan Cheng	11130a0a22	Select @llvm.trap to the special B with 1111 condition (i.e. trap) instruction. llvm-svn: 103459	2010-05-11 07:26:32 +00:00
Evan Cheng	7f0d8f1ab0	Model vld2 / vst2 with reg_sequence. llvm-svn: 103411	2010-05-10 17:34:18 +00:00
Jim Grosbach	2db1618b44	Clean up the conditional for handling of sign_extend_inreg based on whether the extract instructions are available. rdar://7956878 llvm-svn: 103277	2010-05-07 18:34:55 +00:00
Jim Grosbach	e04cc6cb43	Cleanup of ARMv7M support. Move hardware divide and Thumb2 extract/pack instructions to subtarget features and update tests to reflect. PR5717. llvm-svn: 103136	2010-05-05 23:44:43 +00:00
Jim Grosbach	3630aff780	Add initial support for ARMv7M subtarget and cortex-m3 cpu. Patch by Jordy <snhjordy@gmail.com>. Followup patches will add some tests and adjust to use Subtarget features for the instructions. llvm-svn: 103119	2010-05-05 20:44:35 +00:00
Evan Cheng	6a76e7d9ae	Model CONCAT_VECTORS of two 64-bit values as a REG_SEQUENCE. llvm-svn: 103104	2010-05-05 18:28:36 +00:00
Dan Gohman	68f04d06c8	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Dan Gohman	a0f855157e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Dan Gohman	5c8db5ab3f	Move per-function state out of TargetLowering subclasses and into MachineFunctionInfo subclasses. llvm-svn: 101634	2010-04-17 14:41:14 +00:00
Bob Wilson	7e53f886d2	Revise my previous change to ExpandBIT_CONVERT. I hadn't realized that this may be called when either the source or destination type is i64, and my change also hadn't fixed the most obvious problem -- assuming that i64 will only be bitconverted to f64, ignoring the various vector types. Radar 7873160. llvm-svn: 101615	2010-04-17 05:30:19 +00:00
Evan Cheng	c843326d60	Use default lowering of DYNAMIC_STACKALLOC. As far as I can tell, ARM isel is doing the right thing and codegen looks correct for both Thumb and Thumb2. llvm-svn: 101410	2010-04-15 22:20:34 +00:00
Anders Carlsson	32747b3841	Fix build. llvm-svn: 101335	2010-04-15 03:11:28 +00:00
Dan Gohman	0e0b8cf9fd	Add const qualifiers to CodeGen's use of LLVM IR constructs. llvm-svn: 101334	2010-04-15 01:51:59 +00:00
Jim Grosbach	92490b077a	Add -arm-long-calls option to force calls to be indirect. This makes the kernel linker happier when dealing with kexts. Radar 7805069 llvm-svn: 101303	2010-04-14 22:28:31 +00:00
Bob Wilson	7b19d89e3a	Don't custom lower bit converts to ARM VMOVDRRD or VMOVDRR when the operand does not have a legal type. The legalizer does not know how to handle those nodes. Radar 7854640. llvm-svn: 101282	2010-04-14 20:45:23 +00:00
Bob Wilson	526e615ff9	Handle a v2f64 formal parameter that is split between registers and memory such that the entire second half is in memory. Radar 7855014. llvm-svn: 101181	2010-04-13 22:03:22 +00:00
Bob Wilson	6bc6581ca7	Expand SELECT and SELECT_CC for NEON vector types. Radar 7770501. llvm-svn: 100568	2010-04-06 22:02:24 +00:00
Mon P Wang	484bbe6aa9	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Mon P Wang	0ccf050ca3	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a01350755e	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Bob Wilson	aae933cc81	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	9351ea594a	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) An update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Jim Grosbach	7dc50db8fa	tweak the arm if conversion heuristic llvm-svn: 99402	2010-03-24 16:15:14 +00:00
Jim Grosbach	b19d22fcae	try being more permissive for if-conversion on ARM V7. see what the nightly test run performance numbers say as to whether it helps. llvm-svn: 99355	2010-03-24 00:03:13 +00:00
Bob Wilson	9501c478f7	Revert this change, since it was causing ARM performance regressions. --- Reverse-merging r98889 into '.': U lib/Target/ARM/ARMInstrNEON.td U lib/Target/ARM/ARMISelLowering.h U lib/Target/ARM/ARMInstrInfo.td U lib/Target/ARM/ARMInstrVFP.td U lib/Target/ARM/ARMISelLowering.cpp U lib/Target/ARM/ARMInstrFormats.td llvm-svn: 99010	2010-03-19 22:51:32 +00:00
Anton Korobeynikov	eeae840ed7	Get rid of target-specific fp <-> int nodes while I'm still here. llvm-svn: 98889	2010-03-18 22:35:45 +00:00
Anton Korobeynikov	23c07f492e	Get rid of target-specific nodes for fp16 <-> fp32 conversion. llvm-svn: 98888	2010-03-18 22:35:37 +00:00
Bob Wilson	ac5881b842	Translate "cc" clobber in ARM inline assembly to ARM::CCRRegisterClass. Radar 7459078. llvm-svn: 98586	2010-03-15 23:09:18 +00:00
Bill Wendling	5493cf6715	Now that the default for Darwin platforms is to place the LSDA into the TEXT section, remove the target-specific code that performs this. llvm-svn: 98580	2010-03-15 21:09:38 +00:00
Anton Korobeynikov	90fcfccc91	Add subtarget feature for FP16 llvm-svn: 98503	2010-03-14 18:42:38 +00:00
Anton Korobeynikov	48357cdc62	Add codegen support for FP16 on ARM llvm-svn: 98502	2010-03-14 18:42:31 +00:00
Bill Wendling	c9cfc16363	The ARM EH experiment worked! Place the LSDA into the TEXT section for ARM platforms. This involves making the encoding indirect, pcrel, and sdata4 instead of an absolute pointer. The references to the type infos are then non-lazy pointers. Revision 98019 changed the encoding of non-lazy pointers to add the symbol to the non-lazy pointer definition if it's a local symbol (otherwise, it's external and set to '0' so that the loader can adjust it to the real value). This paved the way for this change to work on ARM. llvm-svn: 98068	2010-03-09 18:31:07 +00:00
Bill Wendling	344fec6285	This is part of an LLC-beta test used to test <rdar://problem/6804645>. Please bear with the awful code. It won't last in its current state beyond tonight. llvm-svn: 98040	2010-03-09 02:46:12 +00:00
Bill Wendling	5990930d72	Remove dead parameter passing. llvm-svn: 97536	2010-03-02 01:55:18 +00:00
Bob Wilson	4ffb88d388	Check for comparisons of +/- zero when optimizing less-than-or-equal and greater-than-or-equal SELECT_CCs to NEON vmin/vmax instructions. This is only allowed when UnsafeFPMath is set or when at least one of the operands is known to be nonzero. llvm-svn: 97065	2010-02-24 22:15:53 +00:00
Jim Grosbach	6f72657d6e	LowerCall() should always do getCopyFromReg() to reference the stack pointer. Machine instruction selection is much happier when operands are in virtual registers. llvm-svn: 97012	2010-02-24 01:43:03 +00:00
Bob Wilson	84fc0200bd	Use NEON vmin/vmax instructions for floating-point selects. Radar 7461718. llvm-svn: 96572	2010-02-18 06:05:53 +00:00
David Greene	1efa05ab91	Remove an assumption of default arguments. This is in anticipation of a change to SelectionDAG build APIs. llvm-svn: 96230	2010-02-15 16:55:24 +00:00
Jim Grosbach	a7e098af3b	tighten up eh.setjmp sequence a bit. llvm-svn: 95603	2010-02-08 23:22:00 +00:00
Evan Cheng	9057fea7ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	48375fbf4f	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Anton Korobeynikov	f7651ec593	Fix a gross typo: ARMv6+ may or may not support unaligned memory operations. Even if they are supported by the core, they can be disabled (this is just a configuration bit inside some register). Allow unaligned memops on darwin and conservatively disallow them otherwise. llvm-svn: 94889	2010-01-30 14:08:12 +00:00
Evan Cheng	237629e476	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a reference. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Bob Wilson	5daaa4b21c	Wrap some comments to 80 columns. llvm-svn: 93940	2010-01-19 22:56:26 +00:00
Jim Grosbach	70af2216fd	Patch by David Conrad: "On ARMv6T2 this turns cttz into rbit, clz instead of the 4 instruction sequence it is now." llvm-svn: 93758	2010-01-18 19:58:49 +00:00
Jim Grosbach	36c7973e76	Name change for consistency. No functional change. llvm-svn: 93480	2010-01-15 00:22:18 +00:00
Jim Grosbach	ca3006c171	EmitAtomicCmpSwap() custom inserter needs to delete the MI passed in. EmitAtomicBinary() already does this. llvm-svn: 93479	2010-01-15 00:18:34 +00:00
Jakob Stoklund Olesen	97a8d154fb	ARM "l" constraint for inline asm means R0-R7, also for Thumb2. This is consistent with llvm-gcc's arm/constraints.md. Certain instructions (e.g. CBZ, CBNZ) require a low register, even in Thumb2 mode. llvm-svn: 93436	2010-01-14 18:19:56 +00:00
Jakob Stoklund Olesen	27e36e52f5	Fix pasto llvm-svn: 93342	2010-01-13 19:54:39 +00:00
Bill Wendling	fc4c238bd5	Add more plumbing. This time in the LowerArguments and "get" functions which return partial registers. This affected the back-end lowering code some. Also patch up some places I missed before in the "get" functions. llvm-svn: 91880	2009-12-22 02:10:19 +00:00
Evan Cheng	c46a0ba3fc	Delete the instruction just before the function terminates for consistency sake. llvm-svn: 91836	2009-12-21 19:53:39 +00:00
Rafael Espindola	4f903d4548	Fix libstdc++ build on ARM linux and part of PR5770. MI was not being used but it was also not being deleted, so it was kept in the garbage list. The memory itself was freed once the function code gen was done. Once in a while the codegen of another function would create an instruction on the same address. Adding it to the garbage group would work once, but when another pointer was added it would cause an assert as "Cache" was about to be pushed to Ts. For a patch that makes us detect problems like this earlier, take a look at http://lists.cs.uiuc.edu/pipermail/llvm-commits/Week-of-Mon-20091214/092758.html With that patch we assert as soon as the new instruction is added to the garbage set. llvm-svn: 91691	2009-12-18 16:59:39 +00:00
Bob Wilson	a9f20f9f6e	Handle ARM inline asm "w" constraints with 64-bit ("d") registers. The change in SelectionDAGBuilder is needed to allow using bitcasts to convert between f64 (the default type for ARM "d" registers) and 64-bit Neon vector types. Radar 7457110. llvm-svn: 91649	2009-12-18 01:03:29 +00:00
Jim Grosbach	76d722dd6c	nand atomic requires opposite operand ordering llvm-svn: 91371	2009-12-15 00:12:35 +00:00
Jim Grosbach	09167e5bbb	Add ARMv6 memory and sync barrier instructions llvm-svn: 91329	2009-12-14 21:24:16 +00:00
Jim Grosbach	266c2d59e6	Thumb2 atomic operations llvm-svn: 91321	2009-12-14 20:14:59 +00:00
Jim Grosbach	87975f6229	atomic binary operations up to 32-bits wide. llvm-svn: 91260	2009-12-14 04:22:04 +00:00
Jim Grosbach	187ad02a4f	Framework for atomic binary operations. The emitter for the pseudo instructions just issues an error for the moment. The front end won't yet generate these intrinsics for ARM, so this is behind the scenes until complete. llvm-svn: 91200	2009-12-12 01:40:06 +00:00
Jim Grosbach	5a1c16e5bb	Rough first pass at compare_and_swap atomic builtins for ARM mode. Work in progress. llvm-svn: 91090	2009-12-11 01:42:04 +00:00
Jim Grosbach	be89da9845	Add memory barrier intrinsic support for ARM. Moving towards adding the atomic operations intrinsics. llvm-svn: 91003	2009-12-10 00:11:09 +00:00
Evan Cheng	edcc21919f	- Support inline asm 'w' constraint for 128-bit vector types. - Also support the 'q' NEON registers asm code. llvm-svn: 90894	2009-12-08 23:06:22 +00:00
Bob Wilson	b53c801366	Recognize canonical forms of vector shuffles where the same vector is used for both source operands. In the canonical form, the 2nd operand is changed to an undef and the shuffle mask is adjusted to only reference elements from the 1st operand. Radar 7434842. llvm-svn: 90417	2009-12-03 06:40:55 +00:00
Anton Korobeynikov	0f885eb7fd	Materialize global addresses via movt/movw pair, this is always better than doing the same via constpool: 1. Load from constpool costs 3 cycles on A9, movt/movw pair - just 2. 2. Load from constpool might stall up to 300 cycles due to cache miss. 3. Movt/movw does not use load/store unit. 4. Less constpool entries => better compiler performance. This is only enabled on ELF systems, since darwin does not have needed relocations (yet). llvm-svn: 89720	2009-11-24 00:44:37 +00:00
Dan Gohman	b5ec39e2dc	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Devang Patel	327919890c	We are not using DBG_STOPPOINT anymore. llvm-svn: 89536	2009-11-21 02:46:55 +00:00
David Greene	58e7c6145b	Add a bool flag to StackObjects telling whether they reference spill slots. The AsmPrinter will use this information to determine whether to print a spill/reload comment. Remove default argument values. It's too easy to pass a wrong argument value when multiple arguments have default values. Make everything explicit to trap bugs early. Update all targets to adhere to the new interfaces. llvm-svn: 87022	2009-11-12 20:49:22 +00:00
Evan Cheng	af90768b3c	isLegalICmpImmediate should take a signed integer; code clean up. llvm-svn: 86964	2009-11-12 07:13:11 +00:00
Evan Cheng	a11308742c	Add TargetLowering::isLegalICmpImmediate. It tells LSR what immediate can be folded into target icmp instructions. llvm-svn: 86858	2009-11-11 19:05:52 +00:00
Jim Grosbach	ea6c9c17f5	Use Unified Assembly Syntax for the ARM backend. llvm-svn: 86494	2009-11-09 00:11:35 +00:00
Evan Cheng	aaf30ce699	Remove ARMPCLabelIndex from ARMISelLowering. Use ARMFunctionInfo::createConstPoolEntryUId() instead. llvm-svn: 86294	2009-11-06 22:24:13 +00:00
Bob Wilson	7e071e14eb	Revert previous change to a comment. The BlockAddresses go in the constant pool so they don't get wrapped separately. llvm-svn: 85844	2009-11-03 00:02:05 +00:00
Bob Wilson	3144715b53	Put BlockAddresses into ARM constant pools. llvm-svn: 85824	2009-11-02 20:59:23 +00:00
Anton Korobeynikov	09147da530	Handle splats of undefs properly. This includes the testcase for PR5364 as well. llvm-svn: 85767	2009-11-02 00:12:06 +00:00
Jim Grosbach	ace75c4288	Expand 64-bit logical shift right inline llvm-svn: 85687	2009-10-31 21:42:19 +00:00
Jim Grosbach	16ae289667	Expand 64-bit arithmetic shift right inline llvm-svn: 85685	2009-10-31 21:00:56 +00:00
Jim Grosbach	534d2cb249	Expand 64 bit left shift inline rather than using the libcall. For now, this is unconditional. Making it still use the libcall when optimizing for size would be a good adjustment. llvm-svn: 85675	2009-10-31 19:38:01 +00:00
Evan Cheng	9178904e56	It's safe to remat t2LDRpci; Add PseudoSourceValue to load / store's to enable more machine licm. More changes coming. llvm-svn: 85643	2009-10-31 03:39:36 +00:00
Bob Wilson	94d79c1f43	Fix a comment. llvm-svn: 85610	2009-10-30 20:13:25 +00:00
Rafael Espindola	d4fadd76da	This fixes functions like void f (int a1, int a2, int a3, int a4, int a5,...) In ARMTargetLowering::LowerFormalArguments if the function has 4 or more regular arguments we used to set VarArgsFrameIndex using an offset of 0, which is only correct if the function has exactly 4 regular arguments. llvm-svn: 85590	2009-10-30 14:33:14 +00:00
Bob Wilson	95064e348a	Add ARM codegen for indirect branches. clang/test/CodeGen/indirect-goto.c runs! (unoptimized) llvm-svn: 85577	2009-10-30 05:45:42 +00:00
Evan Cheng	16ed5ac7ff	Give ARMISD::EH_SJLJ_LONGJMP and EH_SJLJ_SETJMP names. llvm-svn: 85381	2009-10-28 06:55:03 +00:00
Evan Cheng	1babe43881	Use fconsts and fconstd to materialize small fp constants. llvm-svn: 85362	2009-10-28 01:44:26 +00:00
Bob Wilson	26a4580439	Most of the NEON shuffle instructions do not support 64-bit element types. llvm-svn: 84785	2009-10-21 21:36:27 +00:00
Evan Cheng	275a09e55d	Match more patterns to movt. llvm-svn: 84751	2009-10-21 08:15:52 +00:00
Benjamin Kramer	dee347a8e8	Random #include pruning. llvm-svn: 84632	2009-10-20 11:44:38 +00:00
Bob Wilson	60ffc7b6b9	Revert svn r80498 and replace it with a different solution. The only problem I can see with the original code was that I forgot that this runs after type legalization and hence the result type will always be i32. (Custom legalization of EXTRACT_VECTOR_ELT is only enabled for vector types with 8- and 16-bit elements.) Regarding the FIXME comment: any information about sign and zero-extension should be captured by separate extension operations. The DAG combiner should handle those to produce either VGETLANEu or VGETLANEs, and that seems to be working now. If there are cases that we're missing, let me know. llvm-svn: 84218	2009-10-15 23:12:05 +00:00
Bob Wilson	a98883deaa	More Neon clean-up: avoid the need for custom-lowering vld/st-lane intrinsics by creating TargetConstants during instruction selection instead of during legalization. llvm-svn: 84042	2009-10-13 22:29:24 +00:00
Bob Wilson	88df19e49a	NEON VLD/VST are now fully implemented. For operations that expand to multiple instructions, the expansion is done during selection so there is no need to do anything special during legalization. llvm-svn: 84036	2009-10-13 21:55:24 +00:00
Anton Korobeynikov	aba66ae89b	Add PseudoSourceValues for constpool stuff on ELF (Darwin should use something similar) and register spills. llvm-svn: 83435	2009-10-07 00:06:35 +00:00
Evan Cheng	8cf9f56cca	getFunctionAlignment should return log2 alignment. llvm-svn: 83242	2009-10-02 06:57:25 +00:00
Anton Korobeynikov	829a3a18d2	ARM does not support offset folding (yet). Disable it for now. This fixes PR5031. Unfortunately, there is no small testcase :( llvm-svn: 82643	2009-09-23 19:04:09 +00:00
Evan Cheng	7714c8412d	Fix PR4926. When target hook EmitInstrWithCustomInserter() inserts new basic blocks and updates CFG, it should also inform sdisel of the changes so the phi source operands will come from the right basic blocks. llvm-svn: 82311	2009-09-19 09:51:03 +00:00
Evan Cheng	7cb9c456e5	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. No functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
Bob Wilson	989568d935	Expand vector floating-point conversions not supported by NEON. llvm-svn: 82074	2009-09-16 20:20:44 +00:00
Bob Wilson	8770c809c2	Expand some more vector operations not supported by Neon. llvm-svn: 81969	2009-09-16 00:32:15 +00:00
Bob Wilson	c01a94dad0	Neon does not support vector divide or remainder. Expand them. llvm-svn: 81966	2009-09-16 00:17:28 +00:00
Bob Wilson	f091792f40	Expand all v2f64 arithmetic operations for Neon. Radar 7200803. (This should also fix the SingleSource/UnitTests/Vector/sumarray-dbl test.) llvm-svn: 81959	2009-09-15 23:55:57 +00:00
Bob Wilson	877a857b4b	Fix pr4939: Change FPCCToARMCC to translate SETOLE to ARMCC::LS. See the bug report for details. llvm-svn: 81397	2009-09-09 23:14:54 +00:00
Anton Korobeynikov	2b6ef7724e	Unbreak getOnesVector() / getZeroVector() to use valid ARM extended imm's. llvm-svn: 81262	2009-09-08 22:51:43 +00:00
Evan Cheng	41e87f2f13	Reference to hidden symbols do not have to go through non-lazy pointer in non-pic mode. rdar://7187172. llvm-svn: 80904	2009-09-03 07:04:02 +00:00
Sandeep Patel	9c4e094e2a	Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. llvm-svn: 80773	2009-09-02 08:44:58 +00:00
Bob Wilson	6972a16bbc	Add support for generating code for vst{234}lane intrinsics. llvm-svn: 80707	2009-09-01 18:51:56 +00:00
Bob Wilson	bebadd11e4	Generate code for vld{234}_lane intrinsics. llvm-svn: 80656	2009-09-01 04:26:28 +00:00
Jim Grosbach	9a220088ac	Clean up LSDA name generation and use for SJLJ exception handling. This makes an egregious hack somewhat more palatable. Bringing the LSDA forward and making it a GV available for reference would be even better, but is beyond the scope of what I'm looking to solve at this point. Objective C++ code could generate function names that broke the previous scheme. This fixes that. llvm-svn: 80649	2009-09-01 01:57:56 +00:00
Anton Korobeynikov	a261afbf14	EXTRACT_VECTOR_ELEMENT can have result type different from element type. Remove the assertion and generalize the code for ARM NEON stuff. llvm-svn: 80498	2009-08-30 17:14:54 +00:00
Anton Korobeynikov	b2e6f5eed4	Do not assert on too wide splats we don't support. llvm-svn: 80409	2009-08-29 00:08:18 +00:00
Evan Cheng	d7a07ab112	Let Darwin linker auto-synthesize stubs and lazy-pointers. This deletes a bunch of nasty code in ARM asm printer. llvm-svn: 80404	2009-08-28 23:18:09 +00:00
Anton Korobeynikov	c1e6083cb8	Hopefully the final missing part :( scalar_to_vector is fully legal now llvm-svn: 80251	2009-08-27 16:25:49 +00:00
Anton Korobeynikov	33d151e85e	Transform float scalar_to_vector into subreg accesses. No idea whether this is profitable or not. llvm-svn: 80245	2009-08-27 14:38:44 +00:00
Bob Wilson	5240e9de02	Remove unneeded ARM-specific DAG nodes for VLD* and VST* Neon operations. The instructions can be selected directly from the intrinsics. We will need to add some ARM-specific nodes for VLD/VST of 3 and 4 128-bit vectors, but those are not yet implemented. llvm-svn: 80117	2009-08-26 17:39:53 +00:00
Anton Korobeynikov	1c904039ce	Expand scalar_to_vector - we don't have any isel logic for it now llvm-svn: 80107	2009-08-26 16:26:09 +00:00
Eli Friedman	79615641f1	Make x86 test actually test x86 code generation. Fix the construct on ARM, which was breaking by coincidence, and add a similar testcase for ARM. llvm-svn: 79719	2009-08-22 03:13:10 +00:00
Bob Wilson	6d4400e852	Match VTRN, VZIP, and VUZP shuffles. Restore the tests for these operations, now using shuffles instead of intrinsics. llvm-svn: 79673	2009-08-21 20:54:19 +00:00
Anton Korobeynikov	20d832fa1b	Fix some typos and use type-based isel for VZIP/VUZP/VTRN llvm-svn: 79625	2009-08-21 12:41:42 +00:00
Anton Korobeynikov	218db4a01c	Add lowering of ARM 4-element shuffles to multiple instructions via perfectshuffle-generated table. llvm-svn: 79624	2009-08-21 12:41:24 +00:00
Anton Korobeynikov	220512160d	Add nodes & dummy matchers for some v{zip,uzp,trn} instructions llvm-svn: 79622	2009-08-21 12:40:50 +00:00
Anton Korobeynikov	dccf7cb911	Expand EXTRACT_SUBVECTOR llvm-svn: 79621	2009-08-21 12:40:35 +00:00
Anton Korobeynikov	f6657d5e02	Provide vext.{16,32} llvm-svn: 79620	2009-08-21 12:40:21 +00:00
Anton Korobeynikov	a2e4bc2312	Use masks not nodes for vector shuffle predicates. Provide set of 'legal' masks, so legalizer won't infinite cycle llvm-svn: 79619	2009-08-21 12:40:07 +00:00
Bob Wilson	fae9057bf0	Add support for Neon VEXT (vector extract) shuffles. This is derived from a patch by Anton Korzh. I modified it to recognize the VEXT shuffles during legalization and lower them to a target-specific DAG node. llvm-svn: 79428	2009-08-19 17:03:43 +00:00
Bill Wendling	962adec4ee	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	bfebbb6477	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	5d841097a9	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Evan Cheng	9d351a7246	Turn on if-conversion for thumb2. llvm-svn: 79084	2009-08-15 07:59:10 +00:00
Anton Korobeynikov	3a0cde8c91	Allow targets to specify their choice of calling conventions per libcall. Take advantage of this in the ARM backend to rectify broken choice of CC when hard float is in effect. PIC16 may want to see if it could be of use in MakePIC16Libcall, which works unchanged. Patch by Sandeep! llvm-svn: 79033	2009-08-14 20:10:52 +00:00
Evan Cheng	67fd47b38b	Add Thumb2 lsr hooks. llvm-svn: 79032	2009-08-14 20:09:37 +00:00
Evan Cheng	ebbcd00c17	80 col violation. llvm-svn: 79026	2009-08-14 19:11:20 +00:00
Bob Wilson	80db08baec	Now that all the legal Neon shuffles (or at least the ones that have been implemented so far) are recognized during legalization, it is easy to fall back to the default expansion for other shuffles. llvm-svn: 78995	2009-08-14 05:16:33 +00:00
Bob Wilson	d337cde6e5	Create a new ARM-specific DAG node, VDUP, to represent a splat from a scalar_to_vector. Generate these VDUP nodes during legalization instead of trying to recognize the pattern during selection. llvm-svn: 78994	2009-08-14 05:13:08 +00:00
Bob Wilson	7a311914ab	During legalization, change Neon vdup_lane operations from shuffles to target-specific VDUPLANE nodes. This allows the subreg handling for the quad-register version to be done easily with Pats in the .td file, instead of with custom code in ARMISelDAGToDAG.cpp. llvm-svn: 78993	2009-08-14 05:08:32 +00:00
Owen Anderson	9df206d02d	Push LLVMContexts through the IntegerType APIs. llvm-svn: 78948	2009-08-13 21:58:54 +00:00
Bob Wilson	e3eedf3cd2	Add a fixme message about canonicalizing floating-point vector types. llvm-svn: 78897	2009-08-13 06:01:30 +00:00
Bob Wilson	8cb7da85e3	Revert r78852 for now. I want to do this differently, but I don't have time to fix it tonight. llvm-svn: 78896	2009-08-13 05:58:56 +00:00
Bob Wilson	2940b8e9a5	Add a comment to describe why vector shuffles are legalized to custom DAG nodes. llvm-svn: 78884	2009-08-13 02:13:04 +00:00
Bob Wilson	11ee30bdc8	Use cast<> instead of dyn_cast<> in places where the type is known. llvm-svn: 78881	2009-08-13 01:57:47 +00:00
Bob Wilson	b089d07a1f	Recognize Neon VDUP shuffles during legalization instead of selection. llvm-svn: 78852	2009-08-12 22:54:19 +00:00
Bob Wilson	d8b7ca4c28	Recognize Neon VREV shuffles during legalization instead of selection. llvm-svn: 78850	2009-08-12 22:31:50 +00:00
Jim Grosbach	74c682dde4	Add catch block handling to SjLj exception handling. llvm-svn: 78817	2009-08-12 17:38:44 +00:00
Evan Cheng	c369ccbe83	Shrink Thumb2 movcc instructions. llvm-svn: 78790	2009-08-12 05:17:19 +00:00
Owen Anderson	48f2f0ae72	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Jim Grosbach	3c898a99bd	Whitespace cleanup. Remove trailing whitespace. llvm-svn: 78666	2009-08-11 15:33:49 +00:00
Bob Wilson	d64e304671	Use vAny type to get rid of Neon intrinsics that differed only in whether the overloaded vector types allowed floating-point or integer vector elements. Most of these operations actually depend on the element type, so bitcasting was not an option. If you include the vpadd intrinsics that I updated earlier, this gets rid of 20 intrinsics. llvm-svn: 78646	2009-08-11 05:39:44 +00:00
Jim Grosbach	c9a1dd9291	SjLj based exception handling unwinding support. This patch is nasty, brutish and short. Well, it's kinda short. Definitely nasty and brutish. The front-end generates the register/unregister calls into the SjLj runtime, call-site indices and landing pad dispatch. The back end fills in the LSDA with the call-site information provided by the front end. Catch blocks are not yet implemented. Built on Darwin and verified no llvm-core "make check" regressions. llvm-svn: 78625	2009-08-11 00:09:57 +00:00
Owen Anderson	b4bce99769	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Owen Anderson	30bf6c8dab	SimpleValueType-ify a few more methods on TargetLowering. llvm-svn: 78595	2009-08-10 20:46:15 +00:00
Owen Anderson	cf56d576eb	Continue the SimpleValueType-ification. llvm-svn: 78593	2009-08-10 20:18:46 +00:00
Evan Cheng	48b49cf5b9	It turns out most of the thumb2 instructions are not allowed to touch SP. The semantics of such instructions are unpredictable. We have just been lucky that tests have been passing. This patch takes pain to ensure all the PEI lowering code does the right thing when lowering frame indices, insert code to manipulate stack pointers, etc. It's also custom lowering dynamic stack alloc into pseudo instructions so we can insert the right instructions at scheduling time. This fixes PR4659 and PR4682. llvm-svn: 78361	2009-08-07 00:34:42 +00:00
Bob Wilson	bd7627b23e	Implement Neon VST[234] operations. llvm-svn: 78330	2009-08-06 18:47:44 +00:00
Anton Korobeynikov	7a0835dec5	Remove redundant checks: the only way to have, e.g. f32 RegVT is exactly hardfloat case. llvm-svn: 78237	2009-08-05 20:15:19 +00:00
Anton Korobeynikov	7f9b6ff4a3	Unbreak the stuff, this is ugly, but we cannot do better for now with 'plain' C calling conv. llvm-svn: 78232	2009-08-05 19:40:16 +00:00
Anton Korobeynikov	07ce0611d9	Missed pieces for ARM HardFP ABI. Patch by Sandeep Patel! llvm-svn: 78225	2009-08-05 19:04:42 +00:00
Dan Gohman	5d566d918b	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Bob Wilson	1fe51064ba	Change DAG nodes for Neon VLD2/3/4 operations to return multiple results. Get rid of yesterday's code to fix the register usage during isel. Select the new DAG nodes to machine instructions. The new pre-alloc pass to choose adjacent registers for these results is not done, so the results of this will generally not assemble yet. llvm-svn: 78136	2009-08-05 00:49:09 +00:00
Bob Wilson	fe37bdfdd8	Lower Neon VLD* intrinsics to custom DAG nodes, and manually allocate the results to fixed registers. llvm-svn: 78025	2009-08-04 00:36:16 +00:00
Bob Wilson	154afab758	Minor cleanup. No functional changes intended. llvm-svn: 78024	2009-08-04 00:25:01 +00:00
Bob Wilson	eb3b616a7e	Lower CONCAT_VECTOR during legalization instead of matching it during isel. Add a testcase. llvm-svn: 77992	2009-08-03 20:36:38 +00:00
Chris Lattner	06d8ca1f56	convert ctors/dtors section to be in TLOF instead of TAI. llvm-svn: 77842	2009-08-02 00:34:36 +00:00
Evan Cheng	5ef6928dff	Fix Thumb2 function call isel. Thumb1 and Thumb2 should share the same instructions for calls since BL and BLX are always 32-bit long and BX is always 16-bit long. Also, we should be using BLX to call external function stubs. llvm-svn: 77756	2009-08-01 00:16:10 +00:00
Chris Lattner	c156a00641	refactor section construction in TLOF to be through an explicit initialize method, which can be called when an MCContext is available. llvm-svn: 77687	2009-07-31 17:42:42 +00:00
Bob Wilson	8624e45518	Lower a 128-bit BUILD_VECTOR with 2 elements to a pair of INSERT_VECTOR_ELTs. llvm-svn: 77557	2009-07-30 00:31:25 +00:00
Evan Cheng	fc846dd401	Optimize Thumb2 jumptable to use tbb / tbh when all the offsets fit in byte / halfword. llvm-svn: 77422	2009-07-29 02:18:14 +00:00
Evan Cheng	cf483eb0c0	In thumb2 mode, add pc is unpredictable. Use add + mov pc instead (that is until more optimization goes in). llvm-svn: 77364	2009-07-28 20:53:24 +00:00
Chris Lattner	c74586940a	the apple "ld_classic" linker doesn't support .literal16 in 32-bit mode, and "ld64" (the default linker) falls back to it in -static mode. llvm-svn: 77334	2009-07-28 17:50:28 +00:00
Chris Lattner	55461787cc	Rip all of the global variable lowering logic out of TargetAsmInfo. Since it is highly specific to the object file that will be generated in the end, this introduces a new TargetLoweringObjectFile interface that is implemented for each of ELF/MachO/COFF/Alpha/PIC16 and XCore. Though this is still a brutal and ugly refactoring, this is a major step towards goodness. This patch also: 1. fixes a bunch of dangling pointer problems in the PIC16 backend. 2. disables the TargetLowering copy ctor which PIC16 was accidentally using. 3. gets us closer to xcore having its own crazy target section flags and pic16 not having to shadow sections with its own objects. 4. fixes weirdness where ELF targets would set CStringSection but not CStringSection_. Factor the code better. 5. fixes some bugs in string lowering on ELF targets. llvm-svn: 77294	2009-07-28 03:13:23 +00:00
Bob Wilson	ec256c8938	Add support for ARM Neon VREV instructions. Patch by Anton Korzh, with some modifications from me. llvm-svn: 77101	2009-07-26 00:39:34 +00:00
Evan Cheng	d615e606c4	Change Thumb2 jumptable codegen to one that uses two level jumps: Before: adr r12, #LJTI3_0_0 ldr pc, [r12, +r0, lsl #2] LJTI3_0_0: .long LBB3_24 .long LBB3_30 .long LBB3_31 .long LBB3_32 After: adr r12, #LJTI3_0_0 add pc, r12, +r0, lsl #2 LJTI3_0_0: b.w LBB3_24 b.w LBB3_30 b.w LBB3_31 b.w LBB3_32 This has several advantages. 1. This will make it easier to optimize this to a TBB / TBH instruction + (smaller) table. 2. This eliminate the need for ugly asm printer hack to force the address into thumb addresses (bit 0 is one). 3. Same codegen for pic and non-pic. 4. This eliminate the need to align the table so constantpool island pass won't have to over-estimate the size. Based on my calculation, the later is probably slightly faster as well since ldr pc with shifter address is very slow. That is, it should be a win as long as the HW implementation can do a reasonable job of branch predict the second branch. llvm-svn: 77024	2009-07-25 00:33:29 +00:00
Owen Anderson	cc287b28c9	Get rid of the Pass+Context magic. llvm-svn: 76702	2009-07-22 00:24:57 +00:00
Chris Lattner	499fe29f12	fix an arm codegen bug (the same as PR4482 on ppc) where available_externally symbols were not getting stubs. While I'm at it, add a big testcase for stub generation to make sure I don't break anything. llvm-svn: 75737	2009-07-15 04:12:33 +00:00
Bob Wilson	8682c6607e	Remove an extra space. llvm-svn: 75658	2009-07-14 18:44:34 +00:00
Torok Edwin	f955a6ef49	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message is off (it only prints "UNREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Bob Wilson	e0478ff8e5	Fix comment typos. llvm-svn: 75479	2009-07-13 18:11:36 +00:00
Torok Edwin	ae8a3ff177	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Owen Anderson	8970999512	Thread LLVMContext through MVT and related parts of SDISel. llvm-svn: 75153	2009-07-09 17:57:24 +00:00
David Goodwin	49fbd8d6b7	Use common code for both ARM and Thumb-2 instruction and register info. llvm-svn: 75067	2009-07-08 23:10:31 +00:00
Torok Edwin	ad3be984b7	Start converting to new error handling API. cerr+abort -> llvm_report_error assert(0)+abort -> LLVM_UNREACHABLE (assert(0)+llvm_unreachable-> abort() included) llvm-svn: 75018	2009-07-08 18:01:40 +00:00
Nick Lewycky	d46a7b2d22	Remove the vicmp and vfcmp instructions. Because we never had a release with these instructions, no autoupgrade or backwards compatibility support is provided. llvm-svn: 74991	2009-07-08 03:04:38 +00:00
Evan Cheng	46b98516f6	Add some more Thumb2 multiplication instructions. llvm-svn: 74889	2009-07-07 01:17:28 +00:00
Tilmann Scheller	cea3c16aa5	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
Evan Cheng	f20e4fba49	Add thumb2 sign / zero extend with rotate instructions. llvm-svn: 74755	2009-07-03 01:43:10 +00:00
Evan Cheng	dad6a41d14	Thumb2 pre/post indexed loads. llvm-svn: 74696	2009-07-02 07:28:31 +00:00
Evan Cheng	7249bab8a5	80 col violation. llvm-svn: 74693	2009-07-02 06:44:30 +00:00
Bill Wendling	fdd5badace	Update comments to make it clear that the function alignment is the Log2 of the bytes and not bytes. llvm-svn: 74624	2009-07-01 18:50:55 +00:00
Bill Wendling	c0fb316bd3	Add an "alignment" field to the MachineFunction object. It makes more sense to have the alignment be calculated up front, and have the back-ends obey whatever alignment is decided upon. This allows for future work that would allow for precise no-op placement and the like. llvm-svn: 74564	2009-06-30 22:38:32 +00:00
David Goodwin	9e1280adf3	Rename ARMcmpNZ to ARMcmpZ and use it to represent comparisons that set only the Z flag (i.e. eq and ne). Make ARMcmpZ commutative. llvm-svn: 74423	2009-06-29 15:33:01 +00:00
David Goodwin	921faa64cd	Thumb-2 has CLZ. llvm-svn: 74322	2009-06-26 20:47:43 +00:00
Bob Wilson	6db76aaf10	Add support for ARM's Advanced SIMD (NEON) instruction set. This is still a work in progress but most of the NEON instruction set is supported. llvm-svn: 73919	2009-06-22 23:27:02 +00:00
Evan Cheng	706927c96a	Add comments. llvm-svn: 73761	2009-06-19 07:06:07 +00:00
Evan Cheng	8f613095de	Should be using Bcc (average) latency to determine if-conversion threshold, not BL. llvm-svn: 73759	2009-06-19 06:56:26 +00:00
Evan Cheng	f671ce4eba	Latency information for ARM v6. It's rough and not yet hooked up. Right now we are only using branch latency to determine if-conversion limits. llvm-svn: 73747	2009-06-19 01:51:50 +00:00

618 Commits