llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Jim Grosbach	30f1b06af3	Using BIC for immediates needs an extra bump for its complexity to get instruction selection to prefer it when possible. rdar://7903972 llvm-svn: 108844	2010-07-20 16:07:04 +00:00
Jim Grosbach	fa61724ac3	Removed un-used code. llvm-svn: 108841	2010-07-20 14:51:32 +00:00
Bruno Cardoso Lopes	0fa595f073	Fix PR7174, a couple o Mips fixes: - Fix a typo for PIC check during jmp table lowering - Also fix the "first jump table basic block is not considered only reachable by fall through" problem, use this ad-hoc solution until I come up with something better. Patch by stetorvs@gmail.com llvm-svn: 108820	2010-07-20 08:37:04 +00:00
Bruno Cardoso Lopes	b127fa9a01	Fix Mips PR7473. Patch by stetorvs@gmail.com llvm-svn: 108816	2010-07-20 07:58:51 +00:00
Eric Christopher	ff47f8d94f	Constify some arguments. llvm-svn: 108812	2010-07-20 06:52:21 +00:00
Bruno Cardoso Lopes	88869cb4db	Add AVX vbroadcast new instruction llvm-svn: 108788	2010-07-20 00:11:13 +00:00
Daniel Dunbar	15f94c52e6	Update CMake files. llvm-svn: 108787	2010-07-20 00:08:13 +00:00
Chris Lattner	9ae74337ef	sink the arm implementations of ASmPrinter and MCInstLower out of the AsmPrinter directory into libarm. Now the ARM InstPrinters depend jsut on the MC stuff, not on vmcore or codegen. llvm-svn: 108783	2010-07-19 23:44:46 +00:00
Chris Lattner	4d232b674d	fix a layering problem by moving the x86 implementation of AsmPrinter and InstLowering into libx86 and out of the asmprinter subdirectory. Now X86/AsmPrinter just depends on MC stuff, not all of codegen and LLVM IR. llvm-svn: 108782	2010-07-19 23:41:57 +00:00
Bruno Cardoso Lopes	4ca44dda21	Add 256-bit vaddsub, vhadd, vhsub, vblend and vdpp instructions! llvm-svn: 108769	2010-07-19 23:32:44 +00:00
Evan Cheng	b2ad0066f5	ARM has to provide its own TargetLowering::findRepresentativeClass because its scalar floating point registers alias its vector registers. llvm-svn: 108761	2010-07-19 22:15:08 +00:00
Daniel Dunbar	1dd74c37c5	X86: Mark JMP{32,64}[mr] as requires 32-bit/64-bit mode. They are the same instruction, we only want to allow the one for the current subtarget. - This also fixes suffix matching for jmp instructions, because it eliminates the ambiguity between 'jmpl' and 'jmpq'. llvm-svn: 108746	2010-07-19 20:44:16 +00:00
Jim Grosbach	dc21ac2e0a	Since ARM emits inline jump tables as part of the ConstantIsland pass, it should set the jump table encloding the EK_Inline. This prevents a second, unused, copy of the table from being emitted after the function body. PR6581. llvm-svn: 108730	2010-07-19 17:20:38 +00:00
Jim Grosbach	5b8c14ce8a	revert so I can get the right PR# in the log message. llvm-svn: 108727	2010-07-19 17:19:40 +00:00
Jim Grosbach	42f3134738	Since ARM emits inline jump tables as part of the ConstantIsland pass, it should set the jump table encloding the EK_Inline. This prevents a second, unused, copy of the table from being emitted after the function body. PR7499. llvm-svn: 108722	2010-07-19 17:18:28 +00:00
Daniel Dunbar	220bd809bf	X86-64: Mark WINCALL and more tail call instructions as code gen only. llvm-svn: 108685	2010-07-19 07:21:07 +00:00
Daniel Dunbar	fa2847103d	X86: Mark some tail call pseduo instruction as code gen only. llvm-svn: 108684	2010-07-19 07:21:04 +00:00
Daniel Dunbar	f228215d4f	X86: Mark In32/64BitMode on LEAVE[64] and SYSEXIT[64]. llvm-svn: 108683	2010-07-19 07:21:01 +00:00
Daniel Dunbar	3b0ff3bac3	MC/X86: We now match instructions like "incl %eax" correctly for the arch we are assembling; remove crufty custom cleanup code. llvm-svn: 108681	2010-07-19 06:14:54 +00:00
Daniel Dunbar	7a3565367a	X86: Mark MOV.*_{TC,NOREX} instruction as code gen only, they aren't real. llvm-svn: 108680	2010-07-19 06:14:49 +00:00
Daniel Dunbar	9409c3fbb2	X86: MOV8o8a, MOV8ao8, etc. are only valid in 32-bit mode. llvm-svn: 108679	2010-07-19 06:14:44 +00:00
Daniel Dunbar	f58b5d7ad0	TblGen/AsmMatcher: Add support for honoring instruction Requires<[]> attributes as part of the matcher. - Currently includes a hack to limit ourselves to "In32BitMode" and "In64BitMode", because we don't have the other infrastructure to properly deal with setting SSE, etc. features on X86. llvm-svn: 108677	2010-07-19 05:44:09 +00:00
Daniel Dunbar	150021561c	Target: Give the TargetAsmParser access to the TargetMachine. - Unfortunate, but necessary for now to handle subtarget instruction matching. Eventually we should factor out the lower level target machine information so we don't need to do this. llvm-svn: 108664	2010-07-19 00:33:49 +00:00
Chris Lattner	be480fb7dc	the stackifier is global! llvm-svn: 108626	2010-07-17 17:42:04 +00:00
Chris Lattner	dac9788e6b	doxygenify some comments. llvm-svn: 108625	2010-07-17 17:40:51 +00:00
Jim Grosbach	270540da7b	Add combiner patterns to more effectively utilize the BFI (bitfield insert) instruction for non-constant operands. This includes the case referenced in the README.txt regarding a bitfield copy. llvm-svn: 108608	2010-07-17 03:30:54 +00:00
Jim Grosbach	e52a4aff12	add BFI to getTargetNodeName() llvm-svn: 108603	2010-07-17 01:50:57 +00:00
Jim Grosbach	5e095020ae	Fix logic think-o llvm-svn: 108601	2010-07-17 01:22:19 +00:00
Eric Christopher	00b8fa89c8	Remove unnecessary check that was subsumed into canRealignStack. llvm-svn: 108588	2010-07-17 00:33:04 +00:00
Eric Christopher	033201e862	Make more explicit and add some currently disabled error messages for stack realignment on ARM. Also check for function attributes as we do on X86 as well as make explicit that we're checking can as well as needs in this function. llvm-svn: 108582	2010-07-17 00:27:24 +00:00
Eric Christopher	cfd5cd156c	Make comment a bit more clear as well as return statement since needsStackRealignment is currently checking the can conditions as well. llvm-svn: 108581	2010-07-17 00:25:41 +00:00
Jim Grosbach	749f4fca0a	Add basic support to code-gen the ARM/Thumb2 bit-field insert (BFI) instruction and a combine pattern to use it for setting a bit-field to a constant value. More to come for non-constant stores. llvm-svn: 108570	2010-07-16 23:05:05 +00:00
Jakob Stoklund Olesen	44949b2e1b	Remove the isMoveInstr() hook. llvm-svn: 108567	2010-07-16 22:35:46 +00:00
Jakob Stoklund Olesen	24994a5d4c	Avoid isMoveInstr when printing XCore pseudo-moves. llvm-svn: 108566	2010-07-16 22:35:37 +00:00
Jakob Stoklund Olesen	c73aa71e90	Use MI.isCopy. llvm-svn: 108565	2010-07-16 22:35:34 +00:00
Jakob Stoklund Olesen	d073973e61	Use a small local function for a single remaining late isMoveInstr call in Thumb2ITBlockPass. llvm-svn: 108564	2010-07-16 22:35:32 +00:00
Bill Wendling	e2833a21c2	Rename DBG_LABEL PROLOG_LABEL, because it's only used during prolog emission and thus is a much more meaningful name. llvm-svn: 108563	2010-07-16 22:20:36 +00:00
Jakob Stoklund Olesen	41b1ea4fc9	Keep valgrind quiet. The isLive() method can read uninitialized memory, but it still gives correct results. llvm-svn: 108561	2010-07-16 22:00:33 +00:00
Jakob Stoklund Olesen	9521e574f8	Emit COPY instead of FMR/FMSD instructions for floating point conversion on PowerPC. llvm-svn: 108555	2010-07-16 21:03:52 +00:00
Eli Friedman	616313e9c6	Add missing attributes to cpp backend. llvm-svn: 108547	2010-07-16 18:47:20 +00:00
Dale Johannesen	80b46398ab	Accept registers with P modifier. PR 5314. llvm-svn: 108545	2010-07-16 18:35:46 +00:00
Jakob Stoklund Olesen	701cbc5c89	Teach PPCInstrInfo::storeRegToStackSlot and loadRegFromStackSlot to add memory operands. Hopefully this fixes the llvm-gcc-powerpc-darwin9 buildbot. It really shouldn't since missing memoperands should not affect correctness. llvm-svn: 108540	2010-07-16 18:22:00 +00:00
Jakob Stoklund Olesen	858d6bb512	Remove the X86::FP_REG_KILL pseudo-instruction and the X86FloatingPointRegKill pass that inserted it. It is no longer necessary to limit the live ranges of FP registers to a single basic block. llvm-svn: 108536	2010-07-16 17:41:44 +00:00
Jakob Stoklund Olesen	5fbe7d869c	Search for a free FP register instead of just assuming FP7 is not in use. llvm-svn: 108535	2010-07-16 17:41:40 +00:00
Jakob Stoklund Olesen	d578c5af7e	Allow x87 FP registers to be alive globally in a function. FP_REG_KILL instructions are still inserted, but can be disabled by passing -live-x87 to llc. The X87FPRegKillInserterPass is going to be removed shortly. CFG edges are partioned into bundles where the x87 stack must be allocated identically. Code is insertad at the end of each basic block that shuffles the live FP registers to match the outgoing bundles expectations. This fix is in preparation for some upcoming register allocator improvements that may extend the live range of registers beyond a basic block, similar to LICM. It also provides a nice runtime speedup if you are building with -mfpmath=387. llvm-svn: 108529	2010-07-16 16:38:12 +00:00
Evan Cheng	ffbae6ad52	Split -enable-finite-only-fp-math to two options: -enable-no-nans-fp-math and -enable-no-infs-fp-math. All of the current codegen fp math optimizations only care whether the fp arithmetics arguments and results can never be NaN. llvm-svn: 108465	2010-07-15 22:07:12 +00:00
Chris Lattner	5e03b135cb	fix the encoding of MMX_MOVFR642Qrr, it starts with 0xF2 not 0xF3, this fixes rdar://8192860. Unfortunately it can only be triggered with llc because llvm-mc matches another (correctly encoded) version of this, so no testcase. llvm-svn: 108454	2010-07-15 20:13:34 +00:00
Eli Friedman	fc1680a9af	Random note about bswap. llvm-svn: 108396	2010-07-15 02:20:38 +00:00
Jakob Stoklund Olesen	0a565bde90	Last COPY conversion. llvm-svn: 108387	2010-07-14 23:58:21 +00:00
Bob Wilson	27e348cfa5	Remove restriction on NEON alignment values. Some of the NEON ld/st instructions use different values (e.g., 2-byte or 4-byte alignment). Also fix ARMInstPrinter to print these alignments as bits instead of bytes. llvm-svn: 108386	2010-07-14 23:54:43 +00:00
Jakob Stoklund Olesen	e3aafe4988	Use TargetOpcode::COPY instead of X86-native register copy instructions when lowering atomics. This will allow those copies to still be coalesced after TII::isMoveInstr is removed. llvm-svn: 108385	2010-07-14 23:50:27 +00:00
Chris Lattner	fa93b779db	fix indentation llvm-svn: 108368	2010-07-14 23:04:59 +00:00
Benjamin Kramer	da3e6cdb26	Don't pass StringRef by reference. llvm-svn: 108366	2010-07-14 22:38:02 +00:00
Chris Lattner	2793cb1bd6	Merge lib/Target/X86/X86COFF.h into include/llvm/Support/COFF.h, patch by Michael Spencer! llvm-svn: 108342	2010-07-14 18:14:33 +00:00
Jim Grosbach	e2d1ecbe70	Improve 64-subtraction of immediates when parts of the immediate can fit in the literal field of an instruction. E.g., long long foo(long long a) { return a - 734439407618LL; } rdar://7038284 llvm-svn: 108339	2010-07-14 17:45:16 +00:00
Bob Wilson	f60d34bfad	Add missing address register update to t2LDM_RET instruction. Patch by Brian Lucas. PR7636. llvm-svn: 108332	2010-07-14 16:02:13 +00:00
Eli Friedman	7175d7558d	A couple potential optimizations inspired by comment 4 in PR6773. llvm-svn: 108328	2010-07-14 06:58:26 +00:00
Evan Cheng	f6478f489d	Fix for PR7193 was overly conservative. The only case where sibcall callee address cannot be allocated a register is in 32-bit mode where the first three arguments are marked inreg. In that case EAX, EDX, and ECX will be used for argument passing. This fixes PR7610. llvm-svn: 108327	2010-07-14 06:44:01 +00:00
Bob Wilson	34f481e895	Add support for NEON VMVN immediate instructions. llvm-svn: 108324	2010-07-14 06:31:50 +00:00
Bob Wilson	298c5c46c1	The bits in the cmode field of 32-bit VMOV immediate instructions all depend of the value of the immediate. llvm-svn: 108323	2010-07-14 06:30:44 +00:00
Chris Lattner	25b9b8f2fc	fix a bug found by a warning I added to clang this morning. llvm-svn: 108309	2010-07-14 01:57:17 +00:00
Bob Wilson	0f581a998c	Add an ARM-specific DAG combining to avoid redundant VDUPLANE nodes. Radar 7373643. llvm-svn: 108303	2010-07-14 01:22:12 +00:00
Dan Gohman	18711b19c9	Don't propagate debug locations to instructions for materializing constants, since they may not be emited near the other instructions which get the same line, and this confuses debug info. llvm-svn: 108302	2010-07-14 01:07:44 +00:00
Bruno Cardoso Lopes	0616a418b6	Add AVX 256-bit compare instructions and a bunch of testcases llvm-svn: 108286	2010-07-13 22:06:38 +00:00
Bob Wilson	7feb850d36	Use a target-specific VMOVIMM DAG node instead of BUILD_VECTOR to represent NEON VMOV-immediate instructions. This simplifies some things. llvm-svn: 108275	2010-07-13 21:16:48 +00:00
Bruno Cardoso Lopes	7bc71d2d0a	AVX 256-bit conversion instructions Add the x86 VEX_L form to handle special cases where VEX_L must be set. llvm-svn: 108274	2010-07-13 21:07:28 +00:00
Kevin Enderby	c26ac60ca8	Added a check that pusha cannot be encoded in 64-bit mode. llvm-svn: 108265	2010-07-13 20:05:41 +00:00
Evan Cheng	069f1f7c9a	Extend the r107852 optimization which turns some fp compare to code sequence using only i32 operations. It now optimize some f64 compares when fp compare is exceptionally slow (e.g. cortex-a8). It also catches comparison against 0.0. llvm-svn: 108258	2010-07-13 19:27:42 +00:00
Evan Cheng	67743f2057	Add an ARM "feature". Cortex-a8 fp comparison is very slow (> 20 cycles). llvm-svn: 108256	2010-07-13 19:21:50 +00:00
Evan Cheng	8cce7c7351	-enable-unsafe-fp-math should not imply -enable-finite-only-fp-math. llvm-svn: 108254	2010-07-13 18:46:14 +00:00
Gabor Greif	9772b3e74f	rotate CallInst operands with this commit the callee moves to the end of the operand array (from the start) and the call arguments now start at index 0 (formerly 1) this ordering is now consistent with InvokeInst this commit only flips the switch, functionally it is equivalent to r101465 I intend to commit several cleanups after a few days of soak period llvm-svn: 108240	2010-07-13 15:31:36 +00:00
Bob Wilson	8c1f6adf81	Move NEON "modified immediate" encode/decode into ARMAddressingModes.h to avoid replicated code. llvm-svn: 108227	2010-07-13 04:44:34 +00:00
Chris Lattner	ddb09ea6ad	my work on adding segment registers to LEA missed the disassembler. Remove some code from the disassembler to compensate, unbreaking disassembly of lea's. llvm-svn: 108226	2010-07-13 04:23:55 +00:00
Bruno Cardoso Lopes	ae37153b05	Add AVX 256-bit packed logical forms llvm-svn: 108224	2010-07-13 02:38:35 +00:00
Bruno Cardoso Lopes	495ae629bb	Add AVX 256-bit unop arithmetic instructions llvm-svn: 108223	2010-07-13 01:53:31 +00:00
Bruno Cardoso Lopes	185483638b	Since AVX is a superset of all SSE versions, only use HasAVX for AVX instructions llvm-svn: 108222	2010-07-13 00:38:47 +00:00
David Greene	d81591ee09	Move some SIMD fragment code into X86InstrFragmentsSIMD so that the utility classes can be used from multiple files. This will aid transitioning to a new refactored x86 SIMD specification. llvm-svn: 108213	2010-07-12 23:41:28 +00:00
Bruno Cardoso Lopes	852e3bf472	Add AVX 256 binary arithmetic instructions llvm-svn: 108207	2010-07-12 23:04:15 +00:00
Bruno Cardoso Lopes	b021506033	More refactoring of basic SSE arith instructions. Open room for 256-bit instructions llvm-svn: 108204	2010-07-12 22:41:32 +00:00
Dan Gohman	e9c4426bb0	Apply the SSE dependence idiom for SSE unary operations to SD instructions too, in addition to SS instructions. And add a comment about it. llvm-svn: 108191	2010-07-12 20:46:04 +00:00
Bob Wilson	33acb6130e	Remove some code that doesn't appear to do anything. All the ARM call instructions already have implicit defs of LR. The comment suggests that this is intended to fix something like pr6111, but it doesn't really do that either. llvm-svn: 108186	2010-07-12 20:22:45 +00:00
Bruno Cardoso Lopes	a4889e6f93	Add AVX 256-bit MOVMSK forms llvm-svn: 108184	2010-07-12 20:06:32 +00:00
Dan Gohman	5a42173004	Check begin!=end, rather than !begin. llvm-svn: 108167	2010-07-12 18:12:35 +00:00
Dan Gohman	a383dfd81f	Don't fast-isel an x87 comparison opcode, as fast-isel doesn't support branching on x87 comparisons yet. This fixes PR7624. llvm-svn: 108149	2010-07-12 15:46:30 +00:00
Duncan Sands	f7b98e2b1e	Convert some tab stops into spaces. llvm-svn: 108130	2010-07-12 08:16:59 +00:00
Rafael Espindola	16319e45c6	Convert getLoadStoreRegOpcode to use a switch. llvm-svn: 108123	2010-07-12 03:43:04 +00:00
Rafael Espindola	4c16632cdf	Convert the last use of getPhysicalRegisterRegClass and remove it. AggressiveAntiDepBreaker should not be using getPhysicalRegisterRegClass. An instruction might be using a register that can only be replaced with one from a subclass of getPhysicalRegisterRegClass. With this patch we use getMinimalPhysRegClass. This is correct, but conservative. We should check the uses of the register and select the largest register class that can be used in all of them. llvm-svn: 108122	2010-07-12 02:55:34 +00:00
Jakob Stoklund Olesen	cc60305c22	A basic block that only uses RFP registers still needs the FP_REG_KILL marker. This fixes PR7375. llvm-svn: 108120	2010-07-12 02:12:47 +00:00
Rafael Espindola	0c1a9aa248	Convert the last getPhysicalRegisterRegClass in VirtRegRewriter.cpp to getMinimalPhysRegClass. It was used to produce spills, and it is better to use the most specific class if possible. Update getLoadStoreRegOpcode to handle GR32_AD. llvm-svn: 108115	2010-07-12 00:52:33 +00:00
Jakob Stoklund Olesen	7af3eff94d	RISC architectures get their memory operand folding for free. The only folding these load/store architectures can do is converting COPY into a load or store, and the target independent part of foldMemoryOperand already knows how to do that. llvm-svn: 108099	2010-07-11 19:19:13 +00:00
Jakob Stoklund Olesen	73e71c4703	Use target independent COPY instructions for the fake fextend and fround operations in x87 code. llvm-svn: 108098	2010-07-11 18:19:39 +00:00
Jakob Stoklund Olesen	c48892383f	Remove redundant branch. Thanks, Anton! llvm-svn: 108097	2010-07-11 17:17:35 +00:00
Jakob Stoklund Olesen	eeabe43059	Remove obsolete README_SSE note. We are generating movaps for all XMM register copies, including scalar floating point values. This is known to be at least as good as movss and movsd for all known architectures up to and including Nehalem because it avoids a partial register stall. The SSEDomainFix pass will switch movaps to movdqa when appropriate (i.e., when operands come from the integer unit). We don't now that switching movaps to movapd has any benefit. The same applies to andps -> pand. llvm-svn: 108096	2010-07-11 17:13:42 +00:00
Rafael Espindola	68bbc41d5e	Make getPhysicalRegisterRegClass non-virtual. Should be able to remove it soon. llvm-svn: 108094	2010-07-11 16:49:10 +00:00
Jakob Stoklund Olesen	ecdef6c130	Replace copyRegToReg with copyPhysReg for SystemZ. llvm-svn: 108092	2010-07-11 16:40:46 +00:00
Jakob Stoklund Olesen	040d64f18b	Avoid SSE instructions in FastIsel when it is not available. llvm-svn: 108091	2010-07-11 16:22:13 +00:00
Chandler Carruth	8425bffa25	Remove two other uses of ATTRIBUTE_UNUSED for variables only used within assert()s, switching to void-casts. Removed an unneeded Compiler.h include as a result. There are two other uses in LLVM, but they're not due to assert()s, so I've left them alone. llvm-svn: 108088	2010-07-11 08:18:12 +00:00
Jakob Stoklund Olesen	8b636d6456	Replace copyRegToReg with copyPhysReg for XCore. llvm-svn: 108087	2010-07-11 07:56:13 +00:00
Jakob Stoklund Olesen	b8af51cebf	Replace copyRegToReg with copyPhysReg for Sparc. llvm-svn: 108086	2010-07-11 07:56:09 +00:00
Jakob Stoklund Olesen	8a62d7e134	Replace copyRegToReg with copyPhysReg for CellSPU. llvm-svn: 108084	2010-07-11 07:31:03 +00:00
Jakob Stoklund Olesen	0b1e64c1d4	Replace copyRegToReg with copyPhysReg for PowerPC. llvm-svn: 108083	2010-07-11 07:31:00 +00:00
Jakob Stoklund Olesen	84ac13069a	Fix PIC16 comments referencing copyRegToReg. llvm-svn: 108082	2010-07-11 07:30:57 +00:00
Jakob Stoklund Olesen	b15ffc7e90	Replace copyRegToReg with copyPhysReg for PIC16. llvm-svn: 108081	2010-07-11 06:53:33 +00:00
Jakob Stoklund Olesen	fb3525531b	Replace copyRegToReg with copyPhysReg for MSP430. llvm-svn: 108080	2010-07-11 06:53:30 +00:00
Jakob Stoklund Olesen	beb86cfa27	Replace copyRegToReg with copyPhysReg for MBlaze. llvm-svn: 108079	2010-07-11 06:53:27 +00:00
Jakob Stoklund Olesen	938e41c1fa	Replace copyRegToReg with copyPhysReg for ARM. llvm-svn: 108078	2010-07-11 06:33:54 +00:00
Jakob Stoklund Olesen	18e465659f	Replace copyRegToReg with copyPhysReg for Blackfin. llvm-svn: 108077	2010-07-11 05:44:34 +00:00
Jakob Stoklund Olesen	821d058fd2	X86InstrInfo::copyRegToReg is dead. Long live copyPhysReg! llvm-svn: 108076	2010-07-11 05:44:30 +00:00
Jakob Stoklund Olesen	08fc7eaaa2	Use COPY in X86FastISel::X86SelectRet. Don't try a cross-class copy. That is very unlikely anywy since return value registers are usually register class friendly. (%EAX, %XMM0, etc). llvm-svn: 108074	2010-07-11 05:17:02 +00:00
Rafael Espindola	84716579d4	Fix va_arg for doubles. With this patch VAARG nodes always contain the correct alignment information, which simplifies ExpandRes_VAARG a bit. The patch introduces a new alignment information to TargetLoweringInfo. This is needed since the two natural candidates cannot be used: * The 's' in target data: If this is set to the minimal alignment of any argument, getCallFrameTypeAlignment would return 4 for doubles on ARM for example. * The getTransientStackAlignment method. It is possible for an architecture to have argument less aligned than what we maintain the stack pointer. llvm-svn: 108072	2010-07-11 04:01:49 +00:00
Jakob Stoklund Olesen	57bbaf37c0	Use COPY in FastISel everywhere it is safe and trivial. The remaining copyRegToReg calls actually check the return value (shock!), so we cannot trivially replace them with COPY instructions. llvm-svn: 108069	2010-07-11 03:31:00 +00:00
Jakob Stoklund Olesen	c1aca7464d	Replace copyRegToReg with copyPhysReg for Mips. llvm-svn: 108066	2010-07-11 01:08:31 +00:00
Jakob Stoklund Olesen	0fc69a96b7	Replace copyRegToReg with copyPhysReg for Alpha. llvm-svn: 108065	2010-07-11 01:08:23 +00:00
Jakob Stoklund Olesen	b1c6191d3b	Use COPY in targets llvm-svn: 108063	2010-07-10 22:43:03 +00:00
Jakob Stoklund Olesen	b1e88a2725	Don't emit st(0)/st(1) copies as FpMOV instructions. Use FpSET_ST? instead. Based on a patch by Rafael Espíndola. Attempt to make the FpSET_ST1 hack more robust, but we are still relying on FpSET_ST0 preceeding it. This is only for supporting really weird x87 inline asm. We support: FpSET_ST0 INLINEASM FpSET_ST0 FpSET_ST1 INLINEASM with and without kills on the arguments. We don't support: FpSET_ST1 FpSET_ST0 INLINEASM nor FpSET_ST1 INLINEASM Just Don't Do It! llvm-svn: 108047	2010-07-10 17:42:34 +00:00
Chandler Carruth	1efbf423c5	Add parentheses yet again to satisfy GCC's warnings. llvm-svn: 108043	2010-07-10 12:06:22 +00:00
Dan Gohman	fef30fcd5e	Reapply bottom-up fast-isel, with several fixes for x86-32: - Check getBytesToPopOnReturn(). - Eschew ST0 and ST1 for return values. - Fix the PIC base register initialization so that it doesn't ever fail to end up the top of the entry block. llvm-svn: 108039	2010-07-10 09:00:22 +00:00
Jakob Stoklund Olesen	bf7dddd5b7	An x86 function returns a floating point value in st(0), and we must make sure it is popped, even if it is ununsed. A CopyFromReg node is too weak to represent the required sideeffect, so insert an FpGET_ST0 instruction directly instead. This will matter when CopyFromReg gets lowered to a generic COPY instruction. llvm-svn: 108037	2010-07-10 04:04:25 +00:00
Bruno Cardoso Lopes	3b9d36bde7	Declare YMM subregisters in the right way! Thanks Jakob llvm-svn: 108022	2010-07-09 21:46:19 +00:00
Bruno Cardoso Lopes	f4180a9a7b	Add AVX 256-bit packed MOVNT variants llvm-svn: 108021	2010-07-09 21:42:42 +00:00
Jakob Stoklund Olesen	ef941722c5	Remember the *_TC opcodes for load/store llvm-svn: 108020	2010-07-09 21:27:55 +00:00
Bruno Cardoso Lopes	6ca8dc935c	Add AVX 256-bit unpack and interleave llvm-svn: 108017	2010-07-09 21:20:35 +00:00
Jakob Stoklund Olesen	d7c882a505	Automatically fold COPY instructions into stack load/store. llvm-svn: 108012	2010-07-09 20:43:13 +00:00
Jakob Stoklund Olesen	53d777f3bd	Fix a few tests llvm-svn: 108011	2010-07-09 20:43:09 +00:00
Jim Grosbach	b591b3b48d	In the presence of variable sized objects, allocate an emergency spill slot. rdar://8131327 llvm-svn: 108008	2010-07-09 20:27:06 +00:00
Bruno Cardoso Lopes	3676e24b67	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Bob Wilson	9e8c9204ef	--- Reverse-merging r107947 into '.': U utils/TableGen/FastISelEmitter.cpp --- Reverse-merging r107943 into '.': U test/CodeGen/X86/fast-isel.ll U test/CodeGen/X86/fast-isel-loads.ll U include/llvm/Target/TargetLowering.h U include/llvm/Support/PassNameParser.h U include/llvm/CodeGen/FunctionLoweringInfo.h U include/llvm/CodeGen/CallingConvLower.h U include/llvm/CodeGen/FastISel.h U include/llvm/CodeGen/SelectionDAGISel.h U lib/CodeGen/LLVMTargetMachine.cpp U lib/CodeGen/CallingConvLower.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuilder.cpp U lib/CodeGen/SelectionDAG/FunctionLoweringInfo.cpp U lib/CodeGen/SelectionDAG/FastISel.cpp U lib/CodeGen/SelectionDAG/SelectionDAGISel.cpp U lib/CodeGen/SelectionDAG/ScheduleDAGSDNodes.cpp U lib/CodeGen/SelectionDAG/InstrEmitter.cpp U lib/CodeGen/SelectionDAG/TargetLowering.cpp U lib/Target/XCore/XCoreISelLowering.cpp U lib/Target/XCore/XCoreISelLowering.h U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86ISelLowering.h llvm-svn: 107987	2010-07-09 16:37:18 +00:00
Bruno Cardoso Lopes	144923dccf	Merge VEX enums with other x86 enum forms. Also fix all checks of which VEX fields to use. llvm-svn: 107952	2010-07-09 01:56:45 +00:00
Dan Gohman	dad9d461c3	Fix the memoperand offsets in code generated for va_start. llvm-svn: 107948	2010-07-09 01:06:48 +00:00
Chris Lattner	a5c1c795a2	have the mc lowering process handle a few tail call forms, lowering them to jumps where possible and turning the TAILCALL marker in the instruction asm string into a proper comment. This eliminates a FIXME and is on the path to finishing: rdar://7639610 - eliminate encoding and asm info for TAILJMPd TAILJMPr TAILJMPn, etc. However, I can't eliminate the encodings for these instructions because the JIT still exists and has its own copy of the encoder, sigh. llvm-svn: 107946	2010-07-09 00:49:41 +00:00
Bob Wilson	f15e542bdc	Print "dregpair" NEON operands with a space between them, for readability and consistency with other instructions that have lists of register operands. llvm-svn: 107944	2010-07-09 00:47:20 +00:00
Dan Gohman	7e6e4dd058	Re-apply bottom-up fast-isel, with fixes. Be very careful to avoid emitting a DBG_VALUE after a terminator, or emitting any instructions before an EH_LABEL. llvm-svn: 107943	2010-07-09 00:39:23 +00:00
Bruno Cardoso Lopes	a6bfda61b9	Factor out x86 segment override prefix encoding, and also use it for VEX llvm-svn: 107942	2010-07-09 00:38:14 +00:00
Chris Lattner	fe434abafa	reject pseudo instructions early in the encoder. llvm-svn: 107939	2010-07-09 00:17:50 +00:00
Bruno Cardoso Lopes	f00a155876	Remove trailing whitespaces from file llvm-svn: 107937	2010-07-09 00:07:19 +00:00
Chris Lattner	49ac65543c	Change LEA to have 5 operands for its memory operand, just like all other instructions, even though a segment is not allowed. This resolves a bunch of gross hacks in the encoder and makes LEA more consistent with the rest of the instruction set. No functionality change. llvm-svn: 107934	2010-07-08 23:46:44 +00:00
Chris Lattner	18802e1a55	add some long-overdue enums to refer to the parts of the 5-operand X86 memory operand. llvm-svn: 107925	2010-07-08 22:41:28 +00:00
Jakob Stoklund Olesen	1ae7342eaf	Remember the VR64 register class llvm-svn: 107920	2010-07-08 22:30:35 +00:00
Chris Lattner	012d7537ee	Rework segment prefix emission code to handle segments in memory operands at the same type as hard coded segments. This fixes problems where we'd emit the segment override after the REX prefix on instructions like: mov %gs:(%rdi), %rax This fixes rdar://8127102. I have several cleanup patches coming next. llvm-svn: 107917	2010-07-08 22:28:12 +00:00
Chris Lattner	660851a040	introduce a new X86II::getMemoryOperandNo method, which returns the start of the memory operand for an instruction. Introduce a new "X86AddrSegment" enum to reduce # magic numbers referring to X86 memory operand layout. llvm-svn: 107916	2010-07-08 22:27:06 +00:00
Kalle Raiskila	725a1a4ad2	Switch SPU calling convention (function arguments) to a Tablegen implementation. llvm-svn: 107913	2010-07-08 21:15:22 +00:00
Evan Cheng	5307ec12d7	Check for FiniteOnlyFPMath as well. llvm-svn: 107904	2010-07-08 20:12:24 +00:00
Jakob Stoklund Olesen	f9441b5025	Teach the x86 floating point stackifier to handle COPY instructions. This pass runs before COPY instructions are passed to copyPhysReg, so we simply translate COPY to the proper pseudo instruction. Note that copyPhysReg does not handle floating point stack copies. Once COPY is used everywhere, this can be cleaned up a bit, and most of the pseudo instructions can be removed. llvm-svn: 107899	2010-07-08 19:46:30 +00:00
Jakob Stoklund Olesen	aed86b1af7	Implement X86InstrInfo::copyPhysReg llvm-svn: 107898	2010-07-08 19:46:25 +00:00
Bob Wilson	12922e6bec	The NEONPreAllocPass should never have to assign fixed registers anymore. This pass can go away entirely soon. llvm-svn: 107892	2010-07-08 17:45:26 +00:00
Bob Wilson	fca7a252fb	For big-endian systems, VLD2/VST2 with 32-bit vector elements will swap the words within the 64-bit D registers. Use VLD1/VST1 with 64-bit elements instead. llvm-svn: 107890	2010-07-08 17:44:00 +00:00
Bob Wilson	b07d97d333	Clean up a comment. llvm-svn: 107882	2010-07-08 16:54:45 +00:00
Jakob Stoklund Olesen	30aacf68b9	Convert EXTRACT_SUBREG to COPY when emitting machine instrs. EXTRACT_SUBREG no longer appears as a machine instruction. Use COPY instead. Add isCopy() checks in many places using isMoveInstr() and isExtractSubreg(). The isMoveInstr hook will be removed later. llvm-svn: 107879	2010-07-08 16:40:22 +00:00
Jakob Stoklund Olesen	8983ea915c	Remove references to INSERT_SUBREG after de-SSA. Fix X86InstrInfo::convertToThreeAddressWithLEA to generate COPY instead of INSERT_SUBREG. llvm-svn: 107878	2010-07-08 16:40:15 +00:00
Benjamin Kramer	27eb255a70	Teach instcombine to transform (X >s -1) ? C1 : C2 and (X <s 0) ? C2 : C1 into ((X >>s 31) & (C2 - C1)) + C1, avoiding the conditional. This optimization could be extended to take non-const C1 and C2 but we better stay conservative to avoid code size bloat for now. for int sel(int n) { return n >= 0 ? 60 : 100; } we now generate sarl $31, %edi andl $40, %edi leal 60(%rdi), %eax instead of testl %edi, %edi movl $60, %ecx movl $100, %eax cmovnsl %ecx, %eax llvm-svn: 107866	2010-07-08 11:39:10 +00:00
Eric Christopher	091bf69467	A slight reworking of the custom patterns for x86-64 tpoff codegen and correct the testcase for valid assembly. Needs more tests. llvm-svn: 107860	2010-07-08 07:36:46 +00:00
Evan Cheng	3e8530bf14	r107852 is only safe with -enable-unsafe-fp-math to account for +0.0 == -0.0. llvm-svn: 107856	2010-07-08 06:01:49 +00:00
Evan Cheng	ed3f224f04	Optimize some vfp comparisons to integer ones. This patch implements the simplest case when the following conditions are met: 1. The arguments are f32. 2. The arguments are loads and they have no uses other than the comparison. 3. The comparison code is EQ or NE. e.g. vldr.32 s0, [r1] vldr.32 s1, [r0] vcmpe.f32 s1, s0 vmrs apsr_nzcv, fpscr beq LBB0_2 => ldr r1, [r1] ldr r0, [r0] cmp r0, r1 beq LBB0_2 More complicated cases will be implemented in subsequent patches. llvm-svn: 107852	2010-07-08 02:08:50 +00:00
Dale Johannesen	2df647f882	Changes to ARM tail calls, mostly cosmetic. Add explicit testcases for tail calls within the same module. Duplicate some code to humor those who think .w doesn't apply on ARM. Leave this disabled on Thumb1, and add some comments explaining why it's hard and won't gain much. llvm-svn: 107851	2010-07-08 01:18:23 +00:00
Dan Gohman	4dcc56a102	Revert 107840 107839 107813 107804 107800 107797 107791. Debug info intrinsics win for now. llvm-svn: 107850	2010-07-08 01:00:56 +00:00
Jakob Stoklund Olesen	6afcd69bee	fix copies to/from GR8_ABCD_H even more llvm-svn: 107832	2010-07-07 23:04:56 +00:00
Jim Grosbach	46d94f1c1e	grammar llvm-svn: 107831	2010-07-07 22:53:35 +00:00
Jim Grosbach	8f27ad0d9d	Handle cases where the post-RA scheduler may move instructions between the address calculation instructions leading up to a jump table when we're trying to convert them into a TB[H] instruction in Thumb2. This realistically shouldn't happen much, if at all, for well formed inputs, but it's more correct to handle it. rdar://7387682 llvm-svn: 107830	2010-07-07 22:51:22 +00:00
Chris Lattner	155420f59f	finish up support for callw: PR7195 llvm-svn: 107826	2010-07-07 22:35:13 +00:00
Chris Lattner	6a5db9c9c9	Implement the major chunk of PR7195: support for 'callw' in the integrated assembler. Still some discussion to be done. llvm-svn: 107825	2010-07-07 22:27:31 +00:00
Bruno Cardoso Lopes	b92b51191e	Add more assembly opcodes for SSE compare instructions llvm-svn: 107823	2010-07-07 22:24:03 +00:00
Evan Cheng	22b3e8f3b1	Move getExtLoad() and (some) getLoad() DebugLoc argument after EVT argument for consistency sake. llvm-svn: 107820	2010-07-07 22:15:37 +00:00
Devang Patel	82ccfed750	Print undefined/unknown debug value as "undef". llvm-svn: 107818	2010-07-07 21:52:21 +00:00
Jim Grosbach	d13cc7716e	grammar and trailing whitespace llvm-svn: 107811	2010-07-07 21:06:51 +00:00
Jakob Stoklund Olesen	34ec644313	Allow copies between GR8_ABCD_L and GR8_ABCD_H. This fixes PR7540. llvm-svn: 107809	2010-07-07 20:33:27 +00:00
Dan Gohman	d0caefa601	Implement bottom-up fast-isel. This has the advantage of not requiring a separate DCE pass over MachineInstrs. llvm-svn: 107804	2010-07-07 19:20:32 +00:00
Dan Gohman	424cc6b616	Add X86FastISel support for return statements. This entails refactoring a bunch of stuff, to allow the target-independent calling convention logic to be employed. llvm-svn: 107800	2010-07-07 18:32:53 +00:00
Bruno Cardoso Lopes	8d350872d4	Add AVX AES instructions llvm-svn: 107798	2010-07-07 18:24:20 +00:00
Dan Gohman	b2d5b47efb	Give FunctionLoweringInfo an MBB member, avoiding the need to pass it around everywhere, and also give it an InsertPt member, to enable isel to operate at an arbitrary position within a block, rather than just appending to a block. llvm-svn: 107791	2010-07-07 16:47:08 +00:00
Dan Gohman	b87c534168	Simplify FastISel's constructor by giving it a FunctionLoweringInfo instance, rather than pointers to all of FunctionLoweringInfo's members. This eliminates an NDEBUG ABI sensitivity. llvm-svn: 107789	2010-07-07 16:29:44 +00:00
Dan Gohman	c768525273	Split the SDValue out of OutputArg so that SelectionDAG-independent code can do calling-convention queries. This obviates OutputArgReg. llvm-svn: 107786	2010-07-07 15:54:55 +00:00
Bruno Cardoso Lopes	6222076cd1	Add AVX SSE4.2 instructions llvm-svn: 107752	2010-07-07 03:39:29 +00:00
Bruno Cardoso Lopes	931471d7e8	Use only one multiclass to pinsrq instructions llvm-svn: 107750	2010-07-07 01:43:01 +00:00
Bruno Cardoso Lopes	65fbd0530f	Now that almost all SSE4.1 AVX instructions are added, move code around to more appropriate sections. No functionality changes llvm-svn: 107749	2010-07-07 01:33:38 +00:00
Bruno Cardoso Lopes	675ebe2dc0	Add AVX SSE4.1 insertps, ptest and movntdqa instructions llvm-svn: 107747	2010-07-07 01:14:56 +00:00
Bruno Cardoso Lopes	fa10461265	Add AVX SSE4.1 extractps and pinsr instructions llvm-svn: 107746	2010-07-07 01:01:13 +00:00
Bob Wilson	822b21f0de	Also use REG_SEQUENCE for VTBX instructions. llvm-svn: 107743	2010-07-07 00:08:54 +00:00
Jim Grosbach	71b7efe8ad	Mark eh.sjlj.set/longjmp custom lowerings as Darwin-only since that's where they've been tested to work. llvm-svn: 107742	2010-07-07 00:07:57 +00:00
Bruno Cardoso Lopes	54c2f858b3	Add AVX SSE4.1 Extract Integer instructions llvm-svn: 107740	2010-07-07 00:07:24 +00:00
Jim Grosbach	657ab4a8ee	By default, the eh.sjlj.setjmp/longjmp intrinsics should just do nothing rather than assuming a target will custom lower them. Targets which do so should exlicitly mark them as having custom lowerings. PR7454. llvm-svn: 107734	2010-07-06 23:44:52 +00:00
Bob Wilson	ce80768ebf	Use REG_SEQUENCE nodes to make the table registers for VTBL instructions be allocated to consecutive registers. llvm-svn: 107730	2010-07-06 23:36:25 +00:00
Dale Johannesen	81ea05c193	Accept RIP-relative symbols with 'i' constraint, and print the (%rip) only if the 'a' modifier is present. PR 7528. llvm-svn: 107727	2010-07-06 23:27:00 +00:00
Jakob Stoklund Olesen	44c333e87c	Track defs for all aliases in NEONMoveFix. This means that an instruction defining an S register will affect the domain of the parent D register. llvm-svn: 107725	2010-07-06 23:26:23 +00:00
Bruno Cardoso Lopes	b9e1c33054	Add the rest of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107723	2010-07-06 23:15:17 +00:00
Bruno Cardoso Lopes	0c6ec0b068	Add part of AVX SSE4.1 packed move with sign/zero extend instructions llvm-svn: 107720	2010-07-06 23:01:41 +00:00
Bruno Cardoso Lopes	af8968696a	Fix comment from previous patch llvm-svn: 107717	2010-07-06 22:38:32 +00:00
Bruno Cardoso Lopes	a0b37e839c	Add AVX vblendvpd, vblendvps and vpblendvb instructions Update VEX encoding to support those new instructions llvm-svn: 107715	2010-07-06 22:36:24 +00:00
Dan Gohman	d409104054	CanLowerReturn doesn't need a SelectionDAG; it just needs an LLVMContext. SelectBasicBlock doesn't needs its BasicBlock argument. llvm-svn: 107712	2010-07-06 22:19:37 +00:00
Devang Patel	7ab104353b	Propagate debug loc. llvm-svn: 107710	2010-07-06 22:08:15 +00:00
Bob Wilson	084a11cb59	Represent NEON load/store alignments in bytes, not bits. llvm-svn: 107701	2010-07-06 21:26:18 +00:00
Dan Gohman	808f334f79	Reapply r107655 with fixes; insert the pseudo instruction into the block before calling the expansion hook. And don't put EFLAGS in a mbb's live-in list twice. llvm-svn: 107691	2010-07-06 20:24:04 +00:00
Devang Patel	ffc54b23fe	Fix PR7545 crash. llvm-svn: 107678	2010-07-06 18:18:32 +00:00
Rafael Espindola	e5689571a1	Don't create neon moves in CopyRegToReg. NEONMoveFixPass will do the conversion if profitable. llvm-svn: 107673	2010-07-06 16:24:34 +00:00
Dan Gohman	4d264f7e51	Revert r107655. llvm-svn: 107668	2010-07-06 15:49:48 +00:00
Dan Gohman	c88c36181f	Make getMinimalPhysRegClass' comment mention what makes it different from getPhysicalRegisterRegClass. llvm-svn: 107660	2010-07-06 15:31:55 +00:00
Dan Gohman	6a73079aba	Fix a bunch of custom-inserter functions to handle the case where the pseudo instruction is not at the end of the block. llvm-svn: 107655	2010-07-06 15:18:19 +00:00
Eric Christopher	e873e9978c	Fix up -fstack-protector on linux to use the segment registers. Split out testcases per architecture and os now. Patch from Nelson Elhage. llvm-svn: 107640	2010-07-06 05:18:56 +00:00
Eric Christopher	f1bb5da020	Have the X86 backend use Triple instead of a string and some enums. llvm-svn: 107625	2010-07-05 19:26:33 +00:00
Kalle Raiskila	59cf410bf5	Remove some unused/redundant code. llvm-svn: 107622	2010-07-05 18:40:09 +00:00
Chris Lattner	252f82acc6	more tidying. llvm-svn: 107615	2010-07-05 05:53:14 +00:00

... 2 3 4 5 6 ...

14862 Commits