llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 04:52:54 +02:00

Author	SHA1	Message	Date
Duncan Sands	a18fd5ecb4	Comment out or remove unused parameter names so as to avoid a slew of compiler warnings. llvm-svn: 142574	2011-10-20 08:57:54 +00:00
Hal Finkel	e8091aad04	Add comments to TargetLowering.h indicating that the set*Alignment functions take arguments in log2(bytes) llvm-svn: 142213	2011-10-17 18:23:13 +00:00
Evan Cheng	246086d6f6	Cosmetic change. llvm-svn: 141269	2011-10-06 02:47:18 +00:00
Duncan Sands	d1311488fe	Add codegen support for vector select (in the IR this means a select with a vector condition); such selects become VSELECT codegen nodes. This patch also removes VSETCC codegen nodes, unifying them with SETCC nodes (codegen was actually often using SETCC for vector SETCC already). This ensures that various DAG combiner optimizations kick in for vector comparisons. Passes dragonegg bootstrap with no testsuite regressions (nightly testsuite as well as "make check-all"). Patch mostly by Nadav Rotem. llvm-svn: 139159	2011-09-06 19:07:46 +00:00
Evan Cheng	91aa81acaa	Follow up to r138791. Add a instruction flag: hasPostISelHook which tells the pre-RA scheduler to call a target hook to adjust the instruction. For ARM, this is used to adjust instructions which may be setting the 's' flag. ADC, SBC, RSB, and RSC instructions have implicit def of CPSR (required since it now uses CPSR physical register dependency rather than "glue"). If the carry flag is used, then the target hook will fill in the optional operand with CPSR. Otherwise, the hook will remove the CPSR implicit def from the MachineInstr. llvm-svn: 138810	2011-08-30 19:09:48 +00:00
Duncan Sands	b034709948	Fix a place that was clearly forgotten when the type legalization logic moved over to its own enum. Noticed by Andrey Karpov with the PVS-studio tool. llvm-svn: 136881	2011-08-04 16:01:54 +00:00
Eli Friedman	afd08dcc2c	New approach to r136737: insert the necessary fences for atomic ops in platform-independent code, since a bunch of platforms (ARM, Mips, PPC, Alpha are the relevant targets here) need to do essentially the same thing. I think this completes the basic CodeGen for atomicrmw and cmpxchg. llvm-svn: 136813	2011-08-03 21:06:02 +00:00
Eli Friedman	99b95e531f	Make isLoadExtLegal and isTruncStoreLegal check what the name says. :) This might have some minor effect on CellSPU, but all other targets should be unaffected. Fixing per report from Damien Vincent on llvmdev. llvm-svn: 135462	2011-07-19 02:24:07 +00:00
Chris Lattner	e1fe7061ce	land David Blaikie's patch to de-constify Type, with a few tweaks. llvm-svn: 135375	2011-07-18 04:54:35 +00:00
Benjamin Kramer	44c76d239a	Emit a more efficient magic number multiplication for exact sdivs. We have to do this in DAGBuilder instead of DAGCombiner, because the exact bit is lost after building. struct foo { char x[24]; }; long bar(struct foo a, struct foo b) { return a-b; } is now compiled into movl 4(%esp), %eax subl 8(%esp), %eax sarl $3, %eax imull $-1431655765, %eax, %eax instead of movl 4(%esp), %eax subl 8(%esp), %eax movl $715827883, %ecx imull %ecx movl %edx, %eax shrl $31, %eax sarl $2, %edx addl %eax, %edx movl %edx, %eax llvm-svn: 134695	2011-07-08 10:31:30 +00:00
Eric Christopher	40578e7885	Remove getRegClassForInlineAsmConstraint and all dependencies. Fixes rdar://9643582 llvm-svn: 134123	2011-06-30 01:20:03 +00:00
Nadav Rotem	0cec5ab356	Fix a bug in the type-lowering of integer-promoted elements. Add a check that the newly created simple type is valid before checking its legality. Re-commit the test file. llvm-svn: 133291	2011-06-17 20:54:12 +00:00
Nadav Rotem	1086d5acda	Bugfix: When looking for a legal vector type, stop looking when a non-simple element type is found. This fix addresses some of the tests in Duncan's testcase (forthcoming). llvm-svn: 132891	2011-06-12 14:40:30 +00:00
Eric Christopher	1ae9ec6124	Add a parameter to CCState so that it can access the MachineFunction. No functional change. Part of PR6965 llvm-svn: 132763	2011-06-08 23:55:35 +00:00
Eric Christopher	d68494ffdd	Have LowerOperandForConstraint handle multiple character constraints. Part of rdar://9119939 llvm-svn: 132510	2011-06-02 23:16:42 +00:00
Nadav Rotem	111ad2f6ce	This patch is another step in the direction of adding vector select. In this patch we add a flag to enable a new type legalization decision - to promote integer elements in vectors. Currently, the rest of the codegen does not support this kind of legalization. This flag will be removed when the transition is complete. llvm-svn: 132394	2011-06-01 12:51:46 +00:00
Nadav Rotem	63c8747adf	Refactor the type legalizer. Switch TargetLowering to a new enum - LegalizeTypeAction. This patch does not change the behavior of the type legalizer. The codegen produces the same code. This infrastructural change is needed in order to enable complex decisions for vector types (needed by the vector-select patch). llvm-svn: 132263	2011-05-28 17:57:14 +00:00
Nadav Rotem	531aa71d22	Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. Re-apply 131534 and fix the multi-step promotion of integers. llvm-svn: 132217	2011-05-27 21:03:13 +00:00
Duncan Sands	d3292b9f1e	Revert commit 131534 since it seems to have broken several buildbots. Original log entry: Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. llvm-svn: 131536	2011-05-18 14:57:56 +00:00
Nadav Rotem	b7d689c706	Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. llvm-svn: 131534	2011-05-18 12:26:38 +00:00
Eli Friedman	f608aeaefa	Fix comments per Duncan's review. llvm-svn: 131055	2011-05-07 19:23:14 +00:00
Eli Friedman	12e590e760	Make the logic for determining function alignment more explicit. No functionality change. llvm-svn: 131012	2011-05-06 20:34:06 +00:00
Dan Gohman	7beb845bab	Add an unfolded offset field to LSR's Formula record. This is used to model constants which can be added to base registers via add-immediate instructions which don't require an additional register to materialize the immediate. llvm-svn: 130743	2011-05-03 00:46:49 +00:00
Stuart Hastings	a552942e02	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Chris Lattner	0304b82f80	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
Cameron Zwarich	2748634089	Add a RemoveFromWorklist method to DCI. This is needed to do some complicated transformations in target-specific DAG combines without causing DAGCombiner to delete the same node twice. If you know of a better way to avoid this (see my next patch for an example), please let me know. llvm-svn: 128758	2011-04-02 02:40:26 +00:00
Evan Cheng	dd99a0a548	Re-apply r127953 with fixes: eliminate empty return block if it has no predecessors; update dominator tree if cfg is modified. llvm-svn: 127981	2011-03-21 01:19:09 +00:00
Daniel Dunbar	34c65737c3	Revert r127953, "SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR", it broke a lot of things. llvm-svn: 127954	2011-03-19 21:47:14 +00:00
Evan Cheng	c5f50f7322	SimplifyCFG has stopped duplicating returns into predecessors to canonicalize IR to have single return block (at least getting there) for optimizations. This is general goodness but it would prevent some tailcall optimizations. One specific case is code like this: int f1(void); int f2(void); int f3(void); int f4(void); int f5(void); int f6(void); int foo(int x) { switch(x) { case 1: return f1(); case 2: return f2(); case 3: return f3(); case 4: return f4(); case 5: return f5(); case 6: return f6(); } } => LBB0_2: ## %sw.bb callq _f1 popq %rbp ret LBB0_3: ## %sw.bb1 callq _f2 popq %rbp ret LBB0_4: ## %sw.bb3 callq _f3 popq %rbp ret This patch teaches codegenprep to duplicate returns when the return value is a phi and where the phi operands are produced by tail calls followed by an unconditional branch: sw.bb7: ; preds = %entry %call8 = tail call i32 @f5() nounwind br label %return sw.bb9: ; preds = %entry %call10 = tail call i32 @f6() nounwind br label %return return: %retval.0 = phi i32 [ %call10, %sw.bb9 ], [ %call8, %sw.bb7 ], ... [ 0, %entry ] ret i32 %retval.0 This allows codegen to generate better code like this: LBB0_2: ## %sw.bb jmp _f1 ## TAILCALL LBB0_3: ## %sw.bb1 jmp _f2 ## TAILCALL LBB0_4: ## %sw.bb3 jmp _f3 ## TAILCALL rdar://9147433 llvm-svn: 127953	2011-03-19 17:17:39 +00:00
Cameron Zwarich	cea63dc052	Move more logic into getTypeForExtArgOrReturn. llvm-svn: 127809	2011-03-17 14:53:37 +00:00
Cameron Zwarich	a5746339cc	Rename getTypeForExtendedInteger() to getTypeForExtArgOrReturn(). llvm-svn: 127807	2011-03-17 14:21:56 +00:00
Cameron Zwarich	2bb1e45ea3	The x86-64 ABI says that a bool is only guaranteed to be sign-extended to a byte rather than an int. Thankfully, this only causes LLVM to miss optimizations, not generate incorrect code. This just fixes the zext at the return. We still insert an i32 ZextAssert when reading a function's arguments, but it is followed by a truncate and another i8 ZextAssert so it is not optimized. llvm-svn: 127766	2011-03-16 22:20:18 +00:00
Cameron Zwarich	a1920d7f51	Move getRegPressureLimit() from TargetLoweringInfo to TargetRegisterInfo. llvm-svn: 127175	2011-03-07 21:56:36 +00:00
Owen Anderson	9d0944d6bd	Fix warning when building with clang++. llvm-svn: 126679	2011-02-28 19:58:06 +00:00
Stuart Hastings	539d4e1460	Support for byval parameters on ARM. Will be enabled by a forthcoming patch to the front-end. Radar 7662569. llvm-svn: 126655	2011-02-28 17:17:53 +00:00
Owen Anderson	bd26993873	Allow targets to specify a the type of the RHS of a shift parameterized on the type of the LHS. llvm-svn: 126518	2011-02-25 21:41:48 +00:00
Duncan Sands	760d1a03ba	Rewrite the vector part of getExtendedTypeAction to make it more understandable (at least I find it easier to understand like this). No intended functionality change. llvm-svn: 126382	2011-02-24 11:54:18 +00:00
Evan Cheng	c7ce7e2ac3	Given a pair of floating point load and store, if there are no other uses of the load, then it may be legal to transform the load and store to integer load and store of the same width. This is done if the target specified the transformation as profitable. e.g. On arm, this can transform: vldr.32 s0, [] vstr.32 s0, [] to ldr r12, [] str r12, [] rdar://8944252 llvm-svn: 124708	2011-02-02 01:06:55 +00:00
Evan Cheng	cb39cc2164	Re-implement r122936 with proper target hooks. Now getMaxStoresPerMemcpy etc. takes an option OptSize. If OptSize is true, it would return the inline limit for functions with attribute OptSize. llvm-svn: 122952	2011-01-06 06:52:41 +00:00
Evan Cheng	f7e586d749	Enable sibling call optimization of libcalls which are expanded during legalization time. Since at legalization time there is no mapping from SDNode back to the corresponding LLVM instruction and the return SDNode is target specific, this requires a target hook to check for eligibility. Only x86 and ARM support this form of sibcall optimization right now. rdar://8707777 llvm-svn: 120501	2010-11-30 23:55:39 +00:00
Chris Lattner	f541629573	add TLI support indicating that jumps are more expensive than logical operations and use this to disable a specific optimization. Patch by Micah Villmow! llvm-svn: 120435	2010-11-30 18:12:52 +00:00
Benjamin Kramer	96ac873014	Prune includes. llvm-svn: 118342	2010-11-06 11:45:59 +00:00
Duncan Sands	41edf30895	Simplify uses of MVT and EVT. An MVT can be compared directly with a SimpleValueType, while an EVT supports equality and inequality comparisons with SimpleValueType. llvm-svn: 118169	2010-11-03 12:17:33 +00:00
John Thompson	771289ca4a	Mult-alt constraint incremental development step 3. llvm-svn: 117746	2010-10-29 23:37:38 +00:00
John Thompson	6115a7f1d4	Inline asm multiple alternative constraints development phase 2 - improved basic logic, added initial platform support. llvm-svn: 117667	2010-10-29 17:29:13 +00:00
Dale Johannesen	58fe3193a6	Remove Synthesizable from the Type system; as MMX vector types are no longer Legal on X86, we don't need it. No functional change. 8499854. llvm-svn: 116947	2010-10-20 21:32:10 +00:00
John Thompson	ae3a86d6de	Added skeleton for inline asm multiple alternative constraint support. llvm-svn: 113766	2010-09-13 18:15:37 +00:00
Chris Lattner	36e90f42f3	add a specialization for the MVT form of getTypeAction, since it is trivial. llvm-svn: 112105	2010-08-25 23:05:45 +00:00
Chris Lattner	ef3055ca05	remove some llvmcontext arguments that are now dead post-refactoring. llvm-svn: 112104	2010-08-25 23:00:45 +00:00
Chris Lattner	fe7c4ec039	Change handling of illegal vector types to widen when possible instead of expanding: e.g. <2 x float> -> <4 x float> instead of -> 2 floats. This affects two places in the code: handling cross block values and handling function return and arguments. Since vectors are already widened by legalizetypes, this gives us much better code and unblocks x86-64 abi and SPU abi work. For example, this (which is a silly example of a cross-block value): define <4 x float> @test2(<4 x float> %A) nounwind { %B = shufflevector <4 x float> %A, <4 x float> undef, <2 x i32> <i32 0, i32 1> %C = fadd <2 x float> %B, %B br label %BB BB: %D = fadd <2 x float> %C, %C %E = shufflevector <2 x float> %D, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> ret <4 x float> %E } Now compiles into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 addps %xmm0, %xmm0 ret previously it compiled into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 pshufd $1, %xmm0, %xmm1 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm1, %xmm0 addps %xmm0, %xmm0 ret This implements rdar://8230384 llvm-svn: 112101	2010-08-25 22:49:25 +00:00
Anton Korobeynikov	7ae895e007	Hook in GlobalMerge pass llvm-svn: 109359	2010-07-24 21:52:08 +00:00
Evan Cheng	f215e55d5f	- Allow target to specify when is register pressure "too high". In most cases, it's too late to start backing off aggressive latency scheduling when most of the registers are in use so the threshold should be a bit tighter. - Correctly handle live out's and extract_subreg etc. - Enable register pressure aware scheduling by default for hybrid scheduler. For ARM, this is almost always a win on # of instructions. It's runtime neutral for most of the tests. But for some kernels with high register pressure it can be a huge win. e.g. 464.h264ref reduced number of spills by 54 and sped up by 20%. llvm-svn: 109279	2010-07-23 22:39:59 +00:00
Evan Cheng	df725c25dd	Teach bottom up pre-ra scheduler to track register pressure. Work in progress. llvm-svn: 108991	2010-07-21 06:09:07 +00:00
Evan Cheng	b2ad0066f5	ARM has to provide its own TargetLowering::findRepresentativeClass because its scalar floating point registers alias its vector registers. llvm-svn: 108761	2010-07-19 22:15:08 +00:00
Evan Cheng	cecae7f56c	Teach computeRegisterProperties() to compute "representative" register class for legal value types. A "representative" register class is the largest legal super-reg register class for a value type. e.g. On i386, GR32 is the rep register class for i8 / i16 / i32; on x86_64 it would be GR64. This property will be used by the register pressure tracking instruction scheduler. llvm-svn: 108735	2010-07-19 18:47:01 +00:00
Rafael Espindola	4964bf75a4	Fix a typo and fit in 80 columns. Found by Bob Wilson. llvm-svn: 108164	2010-07-12 18:11:17 +00:00
Rafael Espindola	84716579d4	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Dan Gohman	fef30fcd5e	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Bob Wilson	9e8c9204ef	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Dan Gohman	7e6e4dd058	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Dan Gohman	4dcc56a102	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Dan Gohman	424cc6b616	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Dan Gohman	b87c534168	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	c768525273	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Dan Gohman	f60a8be1d0	Move ArgFlagsTy, OutputArg, and InputArg out of SelectionDAGNodes.h and into a new header, TargetCallingConv.h. llvm-svn: 107782	2010-07-07 15:28:42 +00:00
Dan Gohman	d409104054	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Eric Christopher	e873e9978c	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Evan Cheng	346aecdb8b	Change if-conversion block size limit checks to add some flexibility. llvm-svn: 106901	2010-06-25 22:42:03 +00:00
Dale Johannesen	b1fc776fca	The hasMemory argument is irrelevant to how the argument for an "i" constraint should get lowered; PR 6309. While this argument was passed around a lot, this is the only place it was used, so it goes away from a lot of other places. llvm-svn: 106893	2010-06-25 21:55:36 +00:00
Dan Gohman	a08a9b8a0b	Reapply r106634, now that the bug it exposed is fixed. llvm-svn: 106746	2010-06-24 14:30:44 +00:00
Daniel Dunbar	be50ef88bd	Revert r106263, "Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass,"... it was causing both 'file' (with clang) and 176.gcc (with llvm-gcc) to be miscompiled. llvm-svn: 106634	2010-06-23 17:09:26 +00:00
Jim Grosbach	12afda541e	Some targets don't require the fencing MEMBARRIER instructions surrounding atomic intrinsics, either because the use locking instructions for the atomics, or because they perform the locking directly. Add support in the DAG combiner to fold away the fences. llvm-svn: 106630	2010-06-23 16:07:42 +00:00
Jim Grosbach	395b0a671c	remove trailing whitespace llvm-svn: 106628	2010-06-23 15:49:18 +00:00
Dan Gohman	8185674354	Fold the ShrinkDemandedOps pass into the regular DAGCombiner pass, which is faster, simpler, and less surprising. llvm-svn: 106263	2010-06-18 01:05:21 +00:00
Evan Cheng	b5de7de4ce	Allow targets more controls on what nodes are scheduled by reg pressure, what for latency in hybrid mode. llvm-svn: 104293	2010-05-20 23:26:43 +00:00
Evan Cheng	46e08acfa5	Code refactoring: pull SchedPreference enum from TargetLowering.h to TargetMachine.h and put it in its own namespace. llvm-svn: 104147	2010-05-19 20:19:50 +00:00
Evan Cheng	85497bd415	Allow TargetLowering::getRegClassFor() to be called on illegal types. Also allow target to override it in order to map register classes to illegal but synthesizable types. e.g. v4i64, v8i64 for ARM / NEON. llvm-svn: 103854	2010-05-15 02:18:07 +00:00
Evan Cheng	ec0d7e6cf2	Get rid of the bit twiddling to read / set OpActions and ValueTypeActions. The small saving in memory isn't worth the increase in runtime and code complexity in my opinion. llvm-svn: 103768	2010-05-14 07:37:40 +00:00
Evan Cheng	5e115a09ec	Eliminate use of magic numbers to access OpActions. It also has the effect of allowing more than 31 scalar value types. MAX_ALLOWED_VALUETYPE had already been updated to 64 a while back. llvm-svn: 103743	2010-05-13 23:25:21 +00:00
Evan Cheng	f2572ea73b	Fix up LoadExtActions, TruncStoreActions, and IndexedModeActions representation and setter and getter's so they will continue to work if the number of scalar ValueType's exceeds 31. llvm-svn: 103742	2010-05-13 23:01:26 +00:00
Evan Cheng	fd47525512	80 col violation. llvm-svn: 103733	2010-05-13 20:14:58 +00:00
Dan Gohman	ca2df906ae	Trim #includes and forward declarations. llvm-svn: 103489	2010-05-11 19:11:43 +00:00
Dan Gohman	fb6f4da0e0	Implement a bunch more TargetSelectionDAGInfo infrastructure. Move EmitTargetCodeForMemcpy, EmitTargetCodeForMemset, and EmitTargetCodeForMemmove out of TargetLowering and into SelectionDAGInfo to exercise this. llvm-svn: 103481	2010-05-11 17:31:57 +00:00
Dan Gohman	eaacb8cb1f	Remove the TargetLowering::getSubtarget() virtual function, which was unused. TargetMachine::getSubtarget() is used instead. llvm-svn: 103474	2010-05-11 16:21:03 +00:00
Dan Gohman	68f04d06c8	Get rid of the EdgeMapping map. Instead, just check for BasicBlock changes before doing phi lowering for switches. llvm-svn: 102809	2010-05-01 00:01:06 +00:00
Evan Cheng	dc0ce1eae8	- Move TargetLowering::EmitTargetCodeForFrameDebugValue to TargetInstrInfo and rename it to emitFrameIndexDebugValue. - Teach spiller to modify DBG_VALUE instructions to reference spill slots. llvm-svn: 102323	2010-04-26 07:38:55 +00:00
Dale Johannesen	d27eedab6d	Stop abusing EmitInstrWithCustomInserter for target-dependent form of DEBUG_VALUE, as it doesn't have reasonable default behavior for unsupported targets. Add a new hook instead. No functional change. llvm-svn: 102320	2010-04-25 21:33:54 +00:00
Dan Gohman	d819e7fc95	Move HandlePHINodesInSuccessorBlocks functions out of SelectionDAGISel and into SelectionDAGBuilder and FastISel. llvm-svn: 102123	2010-04-22 20:46:50 +00:00
Dan Gohman	4ff2d817a3	Add more const qualifiers on TargetMachine and friends. llvm-svn: 101977	2010-04-21 01:34:56 +00:00
Dale Johannesen	510282d54b	Because of the EMMS problem, right now we have to support user-defined operations that use MMX register types, but the compiler shouldn't generate them on its own. This adds a Synthesizable abstraction to represent this, and changes the vector widening computation so it won't produce MMX types. (The motivation is to remove noise from the ABI compatibility part of the gcc test suite, which has some breakage right now.) llvm-svn: 101951	2010-04-20 22:34:09 +00:00
Dan Gohman	a0f855157e	Use const qualifiers with TargetLowering. This eliminates several const_casts, and it reinforces the design of the Target classes being immutable. SelectionDAGISel::IsLegalToFold is now a static member function, because PIC16 uses it in an unconventional way. There is more room for API cleanup here. And PIC16's AsmPrinter no longer uses TargetLowering. llvm-svn: 101635	2010-04-17 15:26:15 +00:00
Evan Cheng	6442d111dd	More work to allow dag combiner to promote 16-bit ops to 32-bit. llvm-svn: 101621	2010-04-17 06:13:15 +00:00
Dan Gohman	1119e6060b	Eliminate an unnecessary SelectionDAG dependency in getOptimalMemOpType. llvm-svn: 101531	2010-04-16 20:11:05 +00:00
Dan Gohman	56bc819d8e	Fix this code to avoid implicit assumptions about the length of the array. llvm-svn: 101530	2010-04-16 20:08:45 +00:00
Evan Cheng	d143bfe0a4	Adding support for dag combiner to promote operations for profit. This requires target specific queries. For example, x86 should promote i16 to i32 when it does not impact load folding. x86 support is off by default. It can be enabled with -promote-16bit. Work in progress. llvm-svn: 101448	2010-04-16 06:14:10 +00:00
Dan Gohman	0e0b8cf9fd	Add const qualifiers to CodeGen's use of LLVM IR constructs. llvm-svn: 101334	2010-04-15 01:51:59 +00:00
Dan Gohman	c64d1d02a3	Factor out EH landing pad code into a separate function, and constify a bunch of stuff to support it. llvm-svn: 101273	2010-04-14 19:53:31 +00:00
Evan Cheng	3fa0b6fb03	Avoid using f64 to lower memcpy from constant string. It's cheaper to use i32 store of immediates. llvm-svn: 100751	2010-04-08 07:37:57 +00:00
Chris Lattner	6f3c3f60ce	unthread MMI from FastISel llvm-svn: 100416	2010-04-05 06:05:26 +00:00
Chris Lattner	1fda8b3a8b	fastisel doesn't need DwarfWriter, remove some tendricles. llvm-svn: 100381	2010-04-05 02:19:28 +00:00
Mon P Wang	484bbe6aa9	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100304	2010-04-04 03:10:48 +00:00
Evan Cheng	5d825988d0	Correctly lower memset / memcpy of undef. It should be a nop. PR6767. llvm-svn: 100208	2010-04-02 19:36:14 +00:00
Mon P Wang	0ccf050ca3	Revert r100191 since it breaks objc in clang llvm-svn: 100199	2010-04-02 18:43:02 +00:00
Mon P Wang	a01350755e	Reapply address space patch after fixing an issue in MemCopyOptimizer. Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) llvm-svn: 100191	2010-04-02 18:04:15 +00:00
Evan Cheng	dc26010cc1	Add comments about DstAlign and SrcAlign. llvm-svn: 100132	2010-04-01 20:10:42 +00:00
Evan Cheng	8728924812	- Avoid using floating point stores to implement memset unless the value is zero. - Do not try to infer GV alignment unless its type is sized. It's not possible to infer alignment if it has opaque type. llvm-svn: 100118	2010-04-01 18:19:11 +00:00
Evan Cheng	562bb43207	Fix sdisel memcpy, memset, memmove lowering: 1. Makes it possible to lower with floating point loads and stores. 2. Avoid unaligned loads / stores unless it's fast. 3. Fix some memcpy lowering logic bug related to when to optimize a load from constant string into a constant. 4. Adjust x86 memcpy lowering threshold to make it more sane. 5. Fix x86 target hook so it uses vector and floating point memory ops more effectively. rdar://7774704 llvm-svn: 100090	2010-04-01 06:04:33 +00:00
Bob Wilson	aae933cc81	Revert Mon Ping's change 99928, since it broke all the llvm-gcc buildbots. llvm-svn: 99948	2010-03-30 22:27:04 +00:00
Mon P Wang	9351ea594a	Added support for address spaces and added a isVolatile field to memcpy, memmove, and memset, e.g., llvm.memcpy.i32(i8, i8, i32, i32) -> llvm.memcpy.p0i8.p0i8.i32(i8, i8, i32, i32, i1) A update of langref will occur in a subsequent checkin. llvm-svn: 99928	2010-03-30 20:55:56 +00:00
Dan Gohman	8493042e4d	Docuemntation corrections from John Myers. llvm-svn: 99454	2010-03-25 00:03:04 +00:00
Dan Gohman	56a90ff2a1	Remove the ConvertActions table and associated code, which is unused. llvm-svn: 99372	2010-03-24 00:53:38 +00:00
Dan Gohman	ce6d4394a8	Revert 99335. getTypeToExpandTo's iterative behavior is actually needed here. llvm-svn: 99339	2010-03-23 22:44:42 +00:00
Dan Gohman	7814ddee81	Remove getTypeToExpandTo, since it isn't adding much value beyond just calling getTypeToTransformTo. llvm-svn: 99335	2010-03-23 22:15:31 +00:00
Bob Wilson	950964e4d0	Remove an unnecessary (and misspelled) typedef. Tweak whitespace. llvm-svn: 98753	2010-03-17 18:43:25 +00:00
Dan Gohman	6b1b9e37d7	Remove getWidenVectorType, which is no longer used. llvm-svn: 98289	2010-03-11 21:39:57 +00:00
Chris Lattner	f3ee582f23	Use Other as a sentinel instead of iAny. llvm-svn: 97914	2010-03-07 07:45:08 +00:00
Bill Wendling	5990930d72	Remove dead parameter passing. llvm-svn: 97536	2010-03-02 01:55:18 +00:00
Mon P Wang	c17e781f35	The previous fix of widening divides that trap was too fragile as it depends on custom lowering and requires that certain types exist in ValueTypes.h. Modified widening to check if an op can trap and if so, the widening algorithm will apply only the op on the defined elements. It is safer to do this in widening because the optimizer can't guarantee removing unused ops in some cases. llvm-svn: 95823	2010-02-10 23:37:45 +00:00
Evan Cheng	9057fea7ef	Revert 95130. llvm-svn: 95160	2010-02-02 23:55:14 +00:00
Evan Cheng	48375fbf4f	Pass callsite return type to TargetLowering::LowerCall and use that to check sibcall eligibility. llvm-svn: 95130	2010-02-02 21:29:10 +00:00
Evan Cheng	a5cb4213ac	Remove a dead target hook. llvm-svn: 94646	2010-01-27 01:44:40 +00:00
Evan Cheng	237629e476	Eliminate target hook IsEligibleForTailCallOptimization. Target independent isel should always pass along the "tail call" property. Change target hook LowerCall's parameter "isTailCall" into a refernce. If the target decides it's impossible to honor the tail call request, it should set isTailCall to false to make target independent isel happy. llvm-svn: 94626	2010-01-27 00:07:07 +00:00
Bill Wendling	d4a402c2aa	Remove warning about non return on a non-void function. llvm-svn: 94532	2010-01-26 08:27:24 +00:00
Chris Lattner	967a37fb26	eliminate the TargetLowering::UsesGlobalOffsetTable bool, which is subsumed by TargetLowering::getJumpTableEncoding(). Change uses of it to be more specific. llvm-svn: 94529	2010-01-26 06:53:37 +00:00
Chris Lattner	d87c50833a	Move getJTISymbol from MachineJumpTableInfo to MachineFunction, which is more convenient, and change getPICJumpTableRelocBaseExpr to take a MachineFunction to match. Next, move the X86 code that create a PICBase symbol to X86TargetLowering::getPICBaseSymbol from X86MCInstLower::GetPICBaseSymbol, which was an asmprinter specific library. This eliminates a 'gross hack', and allows us to implement X86ISelLowering::getPICJumpTableRelocBaseExpr which now calls it. This in turn allows us to eliminate the X86AsmPrinter::printPICJumpTableSetLabel method, which was the only overload of printPICJumpTableSetLabel. llvm-svn: 94526	2010-01-26 06:28:43 +00:00
Chris Lattner	3b2bf7ab66	stub out a new target hook, need some refactoring before I can implement it. llvm-svn: 94521	2010-01-26 05:30:30 +00:00
Chris Lattner	41f661d2c2	this hook should be const. llvm-svn: 94508	2010-01-26 04:19:00 +00:00
Chris Lattner	5ee5b62838	Add support for target-specific 32-bit custom-lowered jump table entries. llvm-svn: 94505	2010-01-26 04:05:28 +00:00
Chris Lattner	efdc572e44	Rearrange handling of jump tables. Highlights: 1. MachineJumpTableInfo is now created lazily for a function the first time it actually makes a jump table instead of for every function. 2. The encoding of jump table entries is now described by the MachineJumpTableInfo::JTEntryKind enum. This enum is determined by the TLI::getJumpTableEncoding() hook, instead of by lots of code scattered throughout the compiler that "knows" that jump table entries are always 32-bits in pic mode (for example). 3. The size and alignment of jump table entries is now calculated based on their kind, instead of at machinefunction creation time. Future work includes using the EntryKind in more places in the compiler, eliminating other logic that "knows" the layout of jump tables in various situations. llvm-svn: 94470	2010-01-25 23:26:13 +00:00
Evan Cheng	25dcf9b830	Teach dag combine to fold the following transformation more aggressively: (OP (trunc x), (trunc y)) -> (trunc (OP x, y)) Unfortunately this simple change causes dag combine to infinite looping. The problem is the shrink demanded ops optimization tend to canonicalize expressions in the opposite manner. That is badness. This patch disable those optimizations in dag combine but instead it is done as a late pass in sdisel. This also exposes some deficiencies in dag combine and x86 setcc / brcond lowering. Teach them to look pass ISD::TRUNCATE in various places. llvm-svn: 92849	2010-01-06 19:38:29 +00:00
Sanjiv Gupta	d17915f6f0	Allow targets to specify the return type of libcalls that are generated for floating point comparisons, rather than hard-coding them as i32. llvm-svn: 92199	2009-12-28 02:40:33 +00:00
Bill Wendling	fc4c238bd5	Add more plumbing. This time in the LowerArguments and "get" functions which return partial registers. This affected the back-end lowering code some. Also patch up some places I missed before in the "get" functions. llvm-svn: 91880	2009-12-22 02:10:19 +00:00
Dan Gohman	b65e2548c8	Delete unused code. llvm-svn: 91743	2009-12-19 01:47:13 +00:00
Dan Gohman	ec4e132814	Fix these asserts to check the invariant that the code actually depends on. llvm-svn: 91360	2009-12-14 23:34:36 +00:00
Evan Cheng	7941695285	Move isConsecutiveLoad to SelectionDAG. It's not target dependent and it's primary used by selectdag passes. llvm-svn: 90922	2009-12-09 01:36:00 +00:00
Evan Cheng	af90768b3c	isLegalICmpImmediate should take a signed integer; code clean up. llvm-svn: 86964	2009-11-12 07:13:11 +00:00
Evan Cheng	a11308742c	Add TargetLowering::isLegalICmpImmediate. It tells LSR what immediate can be folded into target icmp instructions. llvm-svn: 86858	2009-11-11 19:05:52 +00:00
Chris Lattner	a8847d8409	fix some bogus asserts, PR5049 llvm-svn: 86514	2009-11-09 04:47:27 +00:00
Kenneth Uildriks	91ceaa85a2	Add code to check at SelectionDAGISel::LowerArguments time to see if return values can be lowered to registers. Coming soon, code to perform sret-demotion if return values cannot be lowered to registers llvm-svn: 86324	2009-11-07 02:11:54 +00:00
Dan Gohman	3393a4c997	Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a bunch of associated comments, because it doesn't have anything to do with DAGs or scheduling. This is another step in decoupling MachineInstr emitting from scheduling. llvm-svn: 85517	2009-10-29 18:10:34 +00:00
Evan Cheng	4e3bc5efb7	Add a second ValueType argument to isFPImmLegal. llvm-svn: 85361	2009-10-28 01:43:28 +00:00
Evan Cheng	151726ad30	Do away with addLegalFPImmediate. Add a target hook isFPImmLegal which returns true if the fp immediate can be natively codegened by target. llvm-svn: 85281	2009-10-27 19:56:55 +00:00
Evan Cheng	4244113f89	Update comments. llvm-svn: 82313	2009-09-19 10:08:51 +00:00
Evan Cheng	7cb9c456e5	Enhance EmitInstrWithCustomInserter() so target can specify CFG changes that sdisel will use to properly complete phi nodes. Not functionality change yet. llvm-svn: 82273	2009-09-18 21:02:19 +00:00
Sandeep Patel	9c4e094e2a	Retype from unsigned to CallingConv::ID accordingly. Approved by Bob Wilson. llvm-svn: 80773	2009-09-02 08:44:58 +00:00
Duncan Sands	66fce0befb	Revert commit 80428. It completely broke exception handling on x86-32 linux. llvm-svn: 80592	2009-08-31 16:45:16 +00:00
Bill Wendling	9dddea2ec3	- Add target lowering methods to get the preferred format for the FDE and LSDA encodings. - Make some of the values emitted by the FDEs dependent upon the pointer size. This is in line with how GCC does things. And it has the benefit of working for Darwin in 64-bit mode now. llvm-svn: 80428	2009-08-29 12:20:54 +00:00
Bill Wendling	962adec4ee	Reapply r79127. It was fixed by d0k. llvm-svn: 79136	2009-08-15 21:21:19 +00:00
Bill Wendling	bfebbb6477	Revert r79127. It was causing compilation errors. llvm-svn: 79135	2009-08-15 21:14:01 +00:00
Evan Cheng	5d841097a9	Change allowsUnalignedMemoryAccesses to take type argument since some targets support unaligned mem access only for certain types. (Should it be size instead?) ARM v7 supports unaligned access for i16 and i32, some v6 variants support it as well. llvm-svn: 79127	2009-08-15 19:23:44 +00:00
Anton Korobeynikov	3a0cde8c91	Allow targets to specify their choice of calling conventions per libcall. Take advantage of this in the ARM backend to rectify broken choice of CC when hard float is in effect. PIC16 may want to see if it could be of use in MakePIC16Libcall, which works unchanged. Patch by Sandeep! llvm-svn: 79033	2009-08-14 20:10:52 +00:00
Owen Anderson	dbf9a912fb	Add contexts to some of the MVT APIs. No functionality change yet, just the infrastructure work needed to get the contexts to where they need to be first. llvm-svn: 78759	2009-08-12 00:36:31 +00:00
Owen Anderson	48f2f0ae72	Split EVT into MVT and EVT, the former representing _just_ a primitive type, while the latter is capable of representing either a primitive or an extended type. llvm-svn: 78713	2009-08-11 20:47:22 +00:00
Owen Anderson	b4bce99769	Rename MVT to EVT, in preparation for splitting SimpleValueType out into its own struct type. llvm-svn: 78610	2009-08-10 22:56:29 +00:00
Owen Anderson	30bf6c8dab	SimpleValueType-ify a few more methods on TargetLowering. llvm-svn: 78595	2009-08-10 20:46:15 +00:00
Owen Anderson	cf56d576eb	Continue the SimpleValueType-ification. llvm-svn: 78593	2009-08-10 20:18:46 +00:00
Owen Anderson	dcb47bda67	Start moving TargetLowering away from using full MVTs and towards SimpleValueType, which will simplify the privatization of IntegerType in the future. llvm-svn: 78584	2009-08-10 18:56:59 +00:00
Dan Gohman	5d566d918b	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Chris Lattner	a5a1e7b4ed	make getObjFileLowering() return a non-const reference. llvm-svn: 77984	2009-08-03 19:12:26 +00:00
Dan Gohman	3b05aaef0b	Minor whitespace tidiness. llvm-svn: 77807	2009-08-01 21:25:46 +00:00
Chris Lattner	55461787cc	Rip all of the global variable lowering logic out of TargetAsmInfo. Since it is highly specific to the object file that will be generated in the end, this introduces a new TargetLoweringObjectFile interface that is implemented for each of ELF/MachO/COFF/Alpha/PIC16 and XCore. Though still is still a brutal and ugly refactoring, this is a major step towards goodness. This patch also: 1. fixes a bunch of dangling pointer problems in the PIC16 backend. 2. disables the TargetLowering copy ctor which PIC16 was accidentally using. 3. gets us closer to xcore having its own crazy target section flags and pic16 not having to shadow sections with its own objects. 4. fixes wierdness where ELF targets would set CStringSection but not CStringSection_. Factor the code better. 5. fixes some bugs in string lowering on ELF targets. llvm-svn: 77294	2009-07-28 03:13:23 +00:00
Jakob Stoklund Olesen	5ca1bdc613	Add support for promoting SETCC operations. llvm-svn: 76987	2009-07-24 18:22:59 +00:00
Eli Friedman	45160af6bd	Remove shift amount flavor. It isn't actually complete enough to be useful, and it's currently unused. (Some issues: it isn't actually rich enough to capture the semantics on many architectures, and semantics can vary depending on the type being shifted.) llvm-svn: 76633	2009-07-21 20:12:16 +00:00
Eli Friedman	2f54456ce5	Remove FIXME that was already fixed. llvm-svn: 76457	2009-07-20 19:45:16 +00:00
Chris Lattner	72b24cbbf6	Copy ExpandInlineAsm to TargetLowering from TargetAsmInfo. llvm-svn: 76441	2009-07-20 17:51:36 +00:00
Duncan Sands	e431120ddc	In this unreachable code, return an initialized value. This stops gcc warning about possible uses of an uninitialized value when compiling with assertions turned off. llvm-svn: 74775	2009-07-03 16:00:23 +00:00
Tilmann Scheller	cea3c16aa5	Add NumFixedArgs attribute to CallSDNode which indicates the number of fixed arguments in a vararg call. With the SVR4 ABI on PowerPC, vector arguments for vararg calls are passed differently depending on whether they are a fixed or a variable argument. Variable vector arguments always go into memory, fixed vector arguments are put into vector registers. If there are no free vector registers available, fixed vector arguments are put on the stack. The NumFixedArgs attribute allows to decide for an argument in a vararg call whether it belongs to the fixed or variable portion of the parameter list. llvm-svn: 74764	2009-07-03 06:44:53 +00:00
Bill Wendling	fdd5badace	Update comments to make it clear that the function alignment is the Log2 of the bytes and not bytes. llvm-svn: 74624	2009-07-01 18:50:55 +00:00
Bill Wendling	c0fb316bd3	Add an "alignment" field to the MachineFunction object. It makes more sense to have the alignment be calculated up front, and have the back-ends obey whatever alignment is decided upon. This allows for future work that would allow for precise no-op placement and the like. llvm-svn: 74564	2009-06-30 22:38:32 +00:00
David Greene	1733775020	Increase limit for OpActions array The OpActions array had a limit of 32 value types, so change it to use MVT::MAX_ALLOWED_VALUETYPE in its declaration and change the accesses to this array to work with a VT.getSimpleVT() that is larger than 32. Also, add a comment to the place where MVT::MAX_ALLOWED_VALUETYPE is defined indicating that it must be a multiple of 32. This is part of the work allow MVT::LAST_VALUETYPE be greater than 32. llvm-svn: 74130	2009-06-24 22:22:02 +00:00
David Greene	08b3d807e8	This increases the maximum for MVT::LAST_VALUETYPE This change doubles the allowable value for MVT::LAST_VALUETYPE. It does this by doing several things. 1. Introduces MVT::MAX_ALLOWED_LAST_VALUETYPE which in this change has a value of 64. This value contains the current maximum for the MVT::LAST_VALUETYPE. 2. Instead of checking "MVT::LAST_VALUETYPE <= 32", all of those uses now become "MVT::LAST_VALUETYPE <= MVT::MAX_ALLOWED_LAST_VALUETYPE" 3. Changes the dimension of the ValueTypeActions from 2 elements to four elements and adds comments ahead of the declaration indicating the it is "(MVT::MAX_ALLOWED_LAST_VALUETYPE/32) * 2". This at least lets us find what is affected if and when MVT::MAX_ALLOWED_LAST_VALUETYPE gets changed. 4. Adds initializers for the new elements of ValueTypeActions. This does NOT add any types in MVT. That would be done separately. This doubles the size of ValueTypeActions from 64 bits to 128 bits and gives us the freedom to add more types for AVX. llvm-svn: 74110	2009-06-24 19:41:55 +00:00
Devang Patel	e2b1c9a530	mv CodeGen/DebugLoc.h Support/DebugLoc.h llvm-svn: 73786	2009-06-19 22:08:58 +00:00
David Greene	3f7cc686d9	Change IndexedModeAction representation. This changes the IndexedModeAction representation to remove the limitation on the number of value types in MVT. This limitation prevents us from specifying AVX types. Prior to this change IndexedModActions was represented as follows... uint64_t IndexedModeActions[2][ISD::LAST_INDEXED_MODE]; the first dimension was used to represent loads, then stores. This imposed a limitation of 32 on the number of value types that could be handled with this method. The value type was used to shift the two bits into and out of the approprate bits in the uint64_t. With this change the array is now represented as ... uint8_t IndexedModeActions[MVT::LAST_VALUETYPE][2][ISD::LAST_INDEXED_MODE]; Takes more space but removes the limitation on MVT::LAST_VALUETYPE. The first dimension is now the value_type for the reference. The second dimension is the load [0] vs. store[1]. The third dimension represents the various modes for load store. Accesses are now direct, no shifting or masking. There are other limitations that need to be removed, so that MVT::LAST_VALUETYPE can be greater than 32. This is merely the first step towards that goal. llvm-svn: 73104	2009-06-09 00:19:10 +00:00
David Greene	00c5ffe4d4	Oops, didn't mean to commit 73102 yet. Revert it. llvm-svn: 73103	2009-06-08 23:54:42 +00:00
David Greene	901b8d38f4	Change IndexedModeAction representation. This changes the IndexedModeAction representation to remove the limitation on the number of value types in MVT. This limitation prevents us from specifying AVX types. Prior to this change IndexedModActions was represented as follows... uint64_t IndexedModeActions[2][ISD::LAST_INDEXED_MODE]; the first dimension was used to represent loads, then stores. This imposed a limitation of 32 on the number of value types that could be handled with this method. The value type was used to shift the two bits into and out of the approprate bits in the uint64_t. With this change the array is now represented as ... uint8_t IndexedModeActions[MVT::LAST_VALUETYPE][2][ISD::LAST_INDEXED_MODE]; Takes more space but removes the limitation on MVT::LAST_VALUETYPE. The first dimension is now the value_type for the reference. The second dimension is the load [0] vs. store[1]. The third dimension represents the various modes for load store. Accesses are now direct, no shifting or masking. There are other limitations that need to be removed, so that MVT::LAST_VALUETYPE can be greater than 32. This is merely the first step towards that goal. llvm-svn: 73102	2009-06-08 23:53:19 +00:00
Devang Patel	8d170194e8	Add new function attribute - noimplicitfloat Update code generator to use this attribute and remove NoImplicitFloat target option. Update llc to set this attribute when -no-implicit-float command line option is used. llvm-svn: 72959	2009-06-05 21:57:13 +00:00
Nate Begeman	058d4eeccf	Adapt the x86 build_vector dagcombine to the current state of the legalizer. build vectors with i64 elements will only appear on 32b x86 before legalize. Since vector widening occurs during legalize, and produces i64 build_vector elements, the dag combiner is never run on these before legalize splits them into 32b elements. Teach the build_vector dag combine in x86 back end to recognize consecutive loads producing the low part of the vector. Convert the two uses of TLI's consecutive load recognizer to pass LoadSDNodes since that was required implicitly. Add a testcase for the transform. Old: subl $28, %esp movl 32(%esp), %eax movl 4(%eax), %ecx movl %ecx, 4(%esp) movl (%eax), %eax movl %eax, (%esp) movaps (%esp), %xmm0 pmovzxwd %xmm0, %xmm0 movl 36(%esp), %eax movaps %xmm0, (%eax) addl $28, %esp ret New: movl 4(%esp), %eax pmovzxwd (%eax), %xmm0 movl 8(%esp), %eax movaps %xmm0, (%eax) ret llvm-svn: 72957	2009-06-05 21:37:30 +00:00
Evan Cheng	f911a0825b	Default isNarrowingProfitable to false. llvm-svn: 72561	2009-05-29 05:31:40 +00:00
Evan Cheng	40810c4d1b	Added optimization that narrow load / op / store and the 'op' is a bit twiddling instruction and its second operand is an immediate. If bits that are touched by 'op' can be done with a narrower instruction, reduce the width of the load and store as well. This happens a lot with bitfield manipulation code. e.g. orl $65536, 8(%rax) => orb $1, 10(%rax) Since narrowing is not always a win, e.g. i32 -> i16 is a loss on x86, dag combiner consults with the target before performing the optimization. llvm-svn: 72507	2009-05-28 00:35:15 +00:00
Evan Cheng	9bd08f0cde	Run code placement optimization for targets that want it (arm and x86 for now). llvm-svn: 71726	2009-05-13 21:42:09 +00:00
Dan Gohman	e00e346c08	Add a comment about the special meaning of VoidTy in this context. llvm-svn: 71466	2009-05-11 19:11:53 +00:00
Nate Begeman	414534b3eb	Implement review feedback for vector shuffle work. llvm-svn: 70372	2009-04-29 05:20:52 +00:00
Nate Begeman	9d121924fd	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Rafael Espindola	0b1037ad26	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	c1a09c7dfa	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Dan Gohman	c9ce27d6b7	Implement support for using modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurrs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Arnold Schwaighofer	636127325b	Enable tail call optimization for functions that return a struct (bug 3664) and for functions that return types that need extending (e.g i1). llvm-svn: 67934	2009-03-28 08:33:27 +00:00
Evan Cheng	a15fdaa292	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Dan Gohman	51d4e8db6a	Fix a bunch of Doxygen syntax issues. Escape special characters, and put @file directives on their own comment line. llvm-svn: 65920	2009-03-03 02:55:14 +00:00
Rafael Espindola	880e63bf01	Refactor TLS code and add some tests. The tests and expected results are: pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632	2009-02-27 13:37:18 +00:00
Dan Gohman	4105a38248	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	b7f2857776	Add some DL propagation to places that didn't have it yet. More coming. llvm-svn: 63673	2009-02-03 22:26:09 +00:00
Dale Johannesen	84498c427e	Propagation in TargetLowering. Includes passing a DL into SimplifySetCC which gets called elsewhere. llvm-svn: 63583	2009-02-03 00:47:48 +00:00
Dale Johannesen	7eb7de0393	Fix build on case-sensitive filesystems (i.e. everybody else) llvm-svn: 63448	2009-01-30 23:20:42 +00:00
Dale Johannesen	39738b1ff8	Make LowerCallTo and LowerArguments take a DebugLoc argument. Adjust all callers and overloaded versions. llvm-svn: 63444	2009-01-30 23:10:59 +00:00
Dan Gohman	9d120d6d8f	Make x86's BT instruction matching more thorough, and add some dagcombines that help it match in several more cases. Add several more cases to test/CodeGen/X86/bt.ll. This doesn't yet include matching for BT with an immediate operand, it just covers more register+register cases. llvm-svn: 63266	2009-01-29 01:59:02 +00:00
Dan Gohman	bc43253a91	Make isOperationLegal do what its name suggests, and introduce a new isOperationLegalOrCustom, which does what isOperationLegal previously did. Update a bunch of callers to use isOperationLegalOrCustom instead of isOperationLegal. In some case it wasn't obvious which behavior is desired; when in doubt I changed then to isOperationLegalOrCustom as that preserves their previous behavior. This is for the second half of PR3376. llvm-svn: 63212	2009-01-28 17:46:25 +00:00
Duncan Sands	392dc77fc6	Cleanup whitespace and comments, and tweak some prototypes, in operand type legalization. No functionality change. llvm-svn: 62680	2009-01-21 09:00:29 +00:00
Sanjiv Gupta	074553c4fb	Allow targets to legalize operations (with illegal operands) that produces multiple values. For example, a load with an illegal operand (a load produces two values, a value and chain). llvm-svn: 62663	2009-01-21 04:48:39 +00:00
Dan Gohman	37d7b5be33	Make getWidenVectorType const. llvm-svn: 62265	2009-01-15 17:34:08 +00:00

... 2 3 4 5 6 ...

591 Commits