llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Evan Cheng	87def37f67	A few more isAsCheapAsAMove. llvm-svn: 63852	2009-02-05 08:42:55 +00:00
Evan Cheng	a05436f739	Implement multiple with overflow by 2 with an add instruction. llvm-svn: 63090	2009-01-27 03:30:42 +00:00
Nate Begeman	92efc4f0ce	Map address space 256 to gs; similar mappings could be supported for the other x86 segments. address space 0 is stack/default, 1-255 are reserved for client use. llvm-svn: 62980	2009-01-26 01:24:32 +00:00
Evan Cheng	0ed6a9d7e0	Favors generating "not" over "xor -1". For example. unsigned test(unsigned a) { return ~a; } llvm used to generate: movl $4294967295, %eax xorl 4(%esp), %eax Now it generates: movl 4(%esp), %eax notl %eax It's 3 bytes shorter. llvm-svn: 62661	2009-01-21 02:09:05 +00:00
Dan Gohman	8c835f6285	Disable the register+memory forms of the bt instructions for now. Thanks to Eli for pointing out that these forms don't ignore the high bits of their index operands, and as such are not immediately suitable for use by isel. llvm-svn: 62194	2009-01-13 23:23:30 +00:00
Dan Gohman	15e69a394a	Add bt instructions that take immediate operands. llvm-svn: 62180	2009-01-13 20:33:23 +00:00
Dan Gohman	e84cfeac5f	Fix a few more JIT encoding issues in the BT instructions. llvm-svn: 62179	2009-01-13 20:32:45 +00:00
Dan Gohman	ca4475dd7b	Add patterns to match conditional moves with loads folded into their left operand, rather than their right. Do this by commuting the operands and inverting the condition. llvm-svn: 61842	2009-01-07 01:00:24 +00:00
Dan Gohman	e78fdaec67	Define instructions for cmovo and cmovno. llvm-svn: 61836	2009-01-07 00:35:10 +00:00
Dan Gohman	2682e8745c	X86_COND_C and X86_COND_NC are alternate mnemonics for X86_COND_B and X86_COND_AE, respectively. llvm-svn: 61835	2009-01-07 00:15:08 +00:00
Evan Cheng	c52f942d67	Do not isel load folding bt instructions for pentium m, core, core2, and AMD processors. These are significantly slower than a load followed by a bt of a register. llvm-svn: 61557	2009-01-02 05:35:45 +00:00
Chris Lattner	062ed6e3dd	Fix some JIT encodings. llvm-svn: 61425	2008-12-25 01:32:49 +00:00
Chris Lattner	f34b843728	BT memory operands load from their address operand. llvm-svn: 61424	2008-12-25 01:27:10 +00:00
Dan Gohman	1ba93ac6be	Add instruction patterns and encodings for the x86 bt instructions. llvm-svn: 61400	2008-12-23 22:45:23 +00:00
Bill Wendling	13e4a3d0b0	- Use patterns instead of creating completely new instruction matching patterns, which are identical to the original patterns. - Change the multiply with overflow so that we distinguish between signed and unsigned multiplication. Currently, unsigned multiplication with overflow isn't working! llvm-svn: 60963	2008-12-12 21:15:41 +00:00
Bill Wendling	5d026e47c1	Redo the arithmetic with overflow architecture. I was changing the semantics of ISD::ADD to emit an implicit EFLAGS. This was horribly broken. Instead, replace the intrinsic with an ISD::SADDO node. Then custom lower that into an X86ISD::ADD node with a associated SETCC that checks the correct condition code (overflow or carry). Then that gets lowered into the correct X86::ADDOvf instruction. Similar for SUB and MUL instructions. llvm-svn: 60915	2008-12-12 00:56:36 +00:00
Bill Wendling	4c8fb3a0cc	Add sub/mul overflow intrinsics. This currently doesn't have a target-independent way of determining overflow on multiplication. It's very tricky. Patch by Zoltan Varga! llvm-svn: 60800	2008-12-09 22:08:41 +00:00
Nick Lewycky	e277f75880	Fix typo, psuedo -> pseudo. llvm-svn: 60651	2008-12-07 03:49:52 +00:00
Dan Gohman	5dad0993a9	Rename isSimpleLoad to canFoldAsLoad, to better reflect its meaning. llvm-svn: 60487	2008-12-03 18:15:48 +00:00
Bill Wendling	039240b301	Reapply r60382. This time, don't mark "ADC" nodes with "implicit EFLAGS". llvm-svn: 60385	2008-12-02 00:07:05 +00:00
Bill Wendling	16840cba04	Temporarily revert r60382. It caused CodeGen/X86/i2k.ll and others to fail. llvm-svn: 60383	2008-12-01 23:44:08 +00:00
Bill Wendling	628848b540	- Have "ADD" instructions return an implicit EFLAGS. - Add support for seto, setno, setc, and setnc instructions. llvm-svn: 60382	2008-12-01 23:30:42 +00:00
Bill Wendling	c60a07dbf2	Generate something sensible for an [SU]ADDO op when the overflow/carry flag is the conditional for the BRCOND statement. For instance, it will generate: addl %eax, %ecx jo LOF instead of addl %eax, %ecx ; About 10 instructions to compare the signs of LHS, RHS, and sum. jl LOF llvm-svn: 60123	2008-11-26 22:37:40 +00:00
Dan Gohman	e5420a0ae9	Don't set neverHasSideEffects on x86's divide instructions, since they trap on divide-by-zero, and this side effect is otherwise unmodeled. llvm-svn: 59551	2008-11-18 21:29:14 +00:00
Nicolas Geoffray	323dc44a69	Generate code for TLS instructions. llvm-svn: 58141	2008-10-25 15:22:06 +00:00
Evan Cheng	08d0796cf5	Add implicit defs of XMM8 to XMM15 on 32-bit call instructions. While this is not technically true, it tells tblgen that these instructions "clobber" the entire XMM register file. llvm-svn: 57723	2008-10-17 21:02:22 +00:00
Dan Gohman	268cfea6bc	Fun x86 encoding tricks: when adding an immediate value of 128, use a SUB instruction instead of an ADD, because -128 can be encoded in an 8-bit signed immediate field, while +128 can't be. This avoids the need for a 32-bit immediate field in this case. A similar optimization applies to 64-bit adds with 0x80000000, with the 32-bit signed immediate field. To support this, teach tablegen how to handle 64-bit constants. llvm-svn: 57663	2008-10-17 01:33:43 +00:00
Dan Gohman	5d83bd89a5	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Dan Gohman	65702b2eb8	Now that predicates can be composed, simplify several of the predicates by extending simple predicates to create more complex predicates instead of duplicating the logic for the simple predicates. This doesn't reduce much redundancy in DAGISelEmitter.cpp's generated source yet; that will require improvements to DAGISelEmitter.cpp's instruction sorting, to make it more effectively group nodes with similar predicates together. llvm-svn: 57565	2008-10-15 06:50:19 +00:00
Chris Lattner	7910d59d44	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Dale Johannesen	3d2e449078	Model hardwired inputs & outputs of x86 8-bit divides correctly. Fixes local RA miscompilation of gcc.c-torture/execute/20020904-1.c -O0. llvm-svn: 57257	2008-10-07 18:54:28 +00:00
Dale Johannesen	dc83b95ba5	Make atomic Swap work, 64-bit on x86-32. Make it all work in non-pic mode. llvm-svn: 57034	2008-10-03 22:25:52 +00:00
Dale Johannesen	27d8955b8f	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dale Johannesen	dbd7b1bd33	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	4aacc3ab83	Split x86's ADJCALLSTACK instructions into 32-bit and 64-bit forms. This allows the 64-bit forms to use+def RSP instead of ESP. This doesn't fix any real bugs today, but it is more precise and it makes the debug dumps on x86-64 look more consistent. Also, add some comments describing the CALL instructions' physreg operand uses and defs. llvm-svn: 56925	2008-10-01 18:28:06 +00:00
Dan Gohman	d6e96e9888	Mark CALL instructions as having a Use of ESP/RSP. llvm-svn: 56911	2008-10-01 04:14:30 +00:00
Evan Cheng	b749199c34	Fix PR2835. Do not change the width of a volatile load. llvm-svn: 56792	2008-09-29 17:26:18 +00:00
Evan Cheng	d63fc80c1e	Implement "punpckldq %xmm0, $xmm0" as "pshufd $0x50, %xmm0, %xmm" unless optimizing for code size. llvm-svn: 56711	2008-09-26 23:41:32 +00:00
Evan Cheng	efd1f614ff	Fix patterns for SSE4.1 move and sign extend instructions. Also add instructions which fold VZEXT_MOVL and VZEXT_LOAD. llvm-svn: 56594	2008-09-24 23:27:55 +00:00
Bill Wendling	932818c75a	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Bill Wendling	1a240c8033	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	89660301e3	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Evan Cheng	4bc8c9652e	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Dale Johannesen	490c016734	Split the ATOMIC NodeType's to include the size, e.g. ATOMIC_LOAD_ADD_{8,16,32,64} instead of ATOMIC_LOAD_ADD. Increased the Hardcoded Constant OpActionsCapacity to match. Large but boring; no functional change. This is to support partial-word atomics on ppc; i8 is not a valid type there, so by the time we get to lowering, the ATOMIC_LOAD nodes looks the same whether the type was i8 or i32. The information can be added to the AtomicSDNode, but that is the largest SDNode; I don't fully understand the SDNode allocation, but it is sensitive to the largest node size, so increasing that must be bad. This is the alternative. llvm-svn: 55457	2008-08-28 02:44:49 +00:00
Bill Wendling	60e176391d	Reverting r55190, r55191, and r55192. They broke the build with this error message: {standard input}:17:bad register name `%sil' make[4]: * [libgcc/./_addvsi3.o] Error 1 make[4]: * Waiting for unfinished jobs.... {standard input}:23:bad register name `%dil' {standard input}:28:bad register name `%dil' make[4]: * [libgcc/./_addvdi3.o] Error 1 {standard input}:18:bad register name `%sil' make[4]: * [libgcc/./_subvsi3.o] Error 1 llvm-svn: 55200	2008-08-22 20:51:05 +00:00
Dan Gohman	897aa30d7c	Anyext tweaks for x86. When extloading a value to i32 or i64, choose instructions that define the full 32 or 64-bit value. When anyexting from i8 to i16 or i32, it's not necessary to zero out the high portion of the register. llvm-svn: 55190	2008-08-22 19:19:31 +00:00
Dan Gohman	411cc551cb	Move the handling of ANY_EXTEND, SIGN_EXTEND_INREG, and TRUNCATE out of X86ISelDAGToDAG.cpp C++ code and into tablegen code. Among other things, using tablegen for these things makes them friendlier to FastISel. Tablegen can handle the case of i8 subregs on x86-32, but currently the C++ code for that case uses MVT::Flag in a tricky way, and it happens to schedule better in some cases. So for now, leave the C++ code in place to handle the i8 case on x86-32. llvm-svn: 55078	2008-08-20 21:27:32 +00:00
Dan Gohman	ebba07cccf	Tablegen generated code already tests the opcode value, so it's not necessary to use dyn_cast in these predicates. llvm-svn: 55055	2008-08-20 15:24:22 +00:00
Bill Wendling	ab390189dc	Revert r55018 and apply the correct "fix" for the 64-bit sub_and_fetch atomic. Just expand it like the other X-bit sub_and_fetches. llvm-svn: 55023	2008-08-20 00:28:16 +00:00
Bill Wendling	ab7c8c091e	Add support for the __sync_sub_and_fetch atomics and friends for X86. The code was already present, but not hooked up to anything. llvm-svn: 55018	2008-08-19 23:09:18 +00:00
Dale Johannesen	15b76de064	Add support for 8 and 16 bit forms of __sync builtins on X86. Change "lock" instructions to be on a separate line. This is needed to work around a bug in the Darwin assembler. llvm-svn: 54999	2008-08-19 18:47:28 +00:00
Dan Gohman	cc784f1662	Re-introduce the 8-bit subreg zext-inreg patterns for x86-32, this time using MOV32to32_ and MOV16to16_. Thanks to Evan for suggesting this. llvm-svn: 54418	2008-08-06 18:27:21 +00:00
Dan Gohman	99d70043f9	xchg does not modify FLAGS. llvm-svn: 54411	2008-08-06 15:52:50 +00:00
Dan Gohman	efb5d2ce6e	Reapply r54147 with a constraint to only use the 8-bit subreg form on x86-64, to avoid the problem with x86-32 having GPRs that don't have 8-bit subregs. Also, change several 16-bit instructions to use equivalent 32-bit instructions. These have a smaller encoding and avoid partial-register updates. llvm-svn: 54223	2008-07-30 18:09:17 +00:00
Dan Gohman	ebe629a4b2	Revert 54147. llvm-svn: 54148	2008-07-29 01:02:18 +00:00
Dan Gohman	1816900fd1	Add x86 isel patterns to match what would be a ZERO_EXTEND_INREG operation, which is represented in codegen as an 'and' operation. This matches them with movz instructions, instead of leaving them to be matched by and instructions with an immediate field. llvm-svn: 54147	2008-07-28 22:18:25 +00:00
Anton Korobeynikov	f13fbd6879	Fix encoding of atomic compare and swap for i64 llvm-svn: 53911	2008-07-22 16:22:48 +00:00
Mon P Wang	7d89d61387	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	0570953e28	XOR32rr, etc. are not AsCheapAsMove, but MOV32ri, etc. are. llvm-svn: 52454	2008-06-18 08:13:07 +00:00
Andrew Lenharth	327c3e7559	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Dan Gohman	00823cb0d4	Teach the DAGISelEmitter to not compute the variable_ops operand index for the input pattern in terms of the output pattern. Instead keep track of how many fixed operands the input pattern actually has, and have the input matching code pass the output-emitting function that index value. This simplifies the code, disentangles variables_ops from the support for predication operations, and makes variable_ops more robust. llvm-svn: 51808	2008-05-31 02:11:25 +00:00
Dan Gohman	aa8fcd5657	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Dan Gohman	4e87d82476	Fix a tblgen problem handling variable_ops in tblgen instruction definitions. This adds a new construct, "discard", for indicating that a named node in the input matching pattern is to be discarded, instead of corresponding to a node in the output pattern. This allows tblgen to know where the arguments for the varaible_ops are supposed to begin. This fixes "rdar://5791600", whatever that is ;-). llvm-svn: 51699	2008-05-29 19:57:41 +00:00
Bill Wendling	81199f0cc8	XOR?RI instructions aren't as cheap as moves. llvm-svn: 51664	2008-05-29 03:46:36 +00:00
Bill Wendling	edb38e9410	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Evan Cheng	95987c2586	Doh. Alignment is in bytes, not in bits. llvm-svn: 51092	2008-05-14 02:49:43 +00:00
Evan Cheng	cb56638548	- Fix the pasto in the fix for a previous pasto. - Incorporate Chris' comment suggestion. llvm-svn: 51061	2008-05-13 18:59:59 +00:00
Evan Cheng	cf6928983b	- Don't treat anyext 16-bit load as a 32-bit load if it's volatile. - Correct a pasto. llvm-svn: 51054	2008-05-13 16:45:56 +00:00
Evan Cheng	e4ee4c2870	On x86, it's safe to treat i32 load anyext as a normal i32 load. Ditto for i8 anyext load to i16. llvm-svn: 51019	2008-05-13 00:54:02 +00:00
Dan Gohman	efa0925915	Fix a copy+paste bug; pseudo-instructions shouldn't have encoding information. llvm-svn: 50997	2008-05-12 20:22:45 +00:00
Mon P Wang	84a269e023	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Anton Korobeynikov	04c974b1b2	Add General Dynamic TLS model for X86-64. Some parts looks really ugly (look for tlsaddr pattern), but should work. Work is in progress, more models will follow llvm-svn: 50630	2008-05-04 21:36:32 +00:00
Arnold Schwaighofer	f58a35e2ec	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Evan Cheng	0fe99f024d	Fix MMX_MOVQ2DQrr pattern. It's illegal to do a bitconvert from a smaller type to a larger one. llvm-svn: 50278	2008-04-25 18:19:54 +00:00
Evan Cheng	b1d240f973	xchg which references a memory operand does not need to lock prefix. Atomicity is guaranteed. llvm-svn: 49946	2008-04-19 01:20:30 +00:00
Evan Cheng	a626e13995	- Fix atomic operation JIT encoding. - Remove unused instructions. llvm-svn: 49921	2008-04-18 20:55:36 +00:00
Evan Cheng	2b03674feb	Also support Intel asm syntax. llvm-svn: 49878	2008-04-17 23:35:10 +00:00
Evan Cheng	0b36ca5023	Fix assembly code for atomic operations. llvm-svn: 49869	2008-04-17 21:26:35 +00:00
Nate Begeman	81586b24d6	80 col fix llvm-svn: 49569	2008-04-12 00:47:57 +00:00
Evan Cheng	aca67f0b29	Allow certain lea instructions to be rematerialized. llvm-svn: 48855	2008-03-27 01:41:09 +00:00
Arnold Schwaighofer	19a78545d9	Don't loose incoming argument registers. Fix documentation style. llvm-svn: 48545	2008-03-19 16:39:45 +00:00
Evan Cheng	11d2c09adc	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Christopher Lamb	0f1c32eb63	Get rid of a pseudo instruction and replace it with subreg based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side affects. Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes. llvm-svn: 48329	2008-03-13 05:47:01 +00:00
Christopher Lamb	74f4d837df	Recommitting parts of r48130. These do not appear to cause the observed failures. llvm-svn: 48223	2008-03-11 10:09:17 +00:00
Chris Lattner	9826c9365e	Change the model for FP Stack return to use fp operands on the RET instruction instead of using FpSET_ST0_32. This also generalizes the code to handling returning of multiple FP results. llvm-svn: 48209	2008-03-11 03:23:40 +00:00
Evan Cheng	067ecbc341	Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests. llvm-svn: 48167	2008-03-10 19:31:26 +00:00
Christopher Lamb	32e5ce3d96	Allow insert_subreg into implicit, target-specific values. Change insert/extract subreg instructions to be able to be used in TableGen patterns. Use the above features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130	2008-03-10 06:12:08 +00:00
Andrew Lenharth	95c88272c6	64bit CAS on 32bit x86. llvm-svn: 47929	2008-03-05 01:15:49 +00:00
Evan Cheng	ae414db8d2	80 column violations. llvm-svn: 47878	2008-03-04 03:20:06 +00:00
Evan Cheng	139517b682	Remove -always-fold-and-in-test. llvm-svn: 47871	2008-03-04 00:40:35 +00:00
Andrew Lenharth	ba7f925582	good catch anton llvm-svn: 47800	2008-03-01 23:18:21 +00:00
Andrew Lenharth	f6c220738c	make CAS work llvm-svn: 47799	2008-03-01 22:27:48 +00:00
Andrew Lenharth	b91c664226	all but CAS working on x86 llvm-svn: 47798	2008-03-01 21:52:34 +00:00
Andrew Lenharth	ad29a49169	Add lock prefix support to x86. Also add the instructions necessary for the atomic ops. They are still marked pseudo, since I cannot figure out what format to use, but they are the correct opcode. llvm-svn: 47795	2008-03-01 13:37:02 +00:00
Andrew Lenharth	db9cd46f5d	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Evan Cheng	f3a7cd1c62	Poorly named option. llvm-svn: 47400	2008-02-20 20:57:32 +00:00
Evan Cheng	35253f2c22	Add hidden option -x86-fold-and-in-test to test the effect the test / and folding change. llvm-svn: 47351	2008-02-19 23:36:51 +00:00
Chris Lattner	3a4ac3a69e	Don't fold and's into test instructions if they have multiple uses. This compiles test-nofold.ll into: _test: movl $15, %ecx andl 4(%esp), %ecx testl %ecx, %ecx movl $42, %eax cmove %ecx, %eax ret instead of: _test: movl 4(%esp), %eax movl %eax, %ecx andl $15, %ecx testl $15, %eax movl $42, %eax cmove %ecx, %eax ret llvm-svn: 47330	2008-02-19 17:37:35 +00:00
Evan Cheng	a377b2bbd1	Fix a x86-64 codegen deficiency. Allow gv + offset when using rip addressing mode. Before: _main: subq $8, %rsp leaq _X(%rip), %rax movsd 8(%rax), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Now: _main: subq $8, %rsp movsd _X+8(%rip), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Notice there is another idiotic codegen issue that needs to be fixed asap: xorl %ecx, %ecx movl %ecx, %eax llvm-svn: 46850	2008-02-07 08:53:49 +00:00
Nate Begeman	ead8dfeef2	SSE 4.1 Intrinsics and detection llvm-svn: 46681	2008-02-03 07:18:54 +00:00
Duncan Sands	aff4eef6df	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	6846e346a8	rename SDTRet -> SDTNone. Move definition of 'trap' sdnode up from x86 instrinfo to targetselectiondag.td. llvm-svn: 46017	2008-01-15 22:02:54 +00:00
Chris Lattner	0072ce1ff4	no need to expand ISD::TRAP to X86ISD::TRAP, just match ISD::TRAP. llvm-svn: 46015	2008-01-15 21:58:22 +00:00
Anton Korobeynikov	44893ee93d	Fix JIT encoding of trap/ud2 instruction llvm-svn: 46012	2008-01-15 21:40:02 +00:00
Anton Korobeynikov	08ea121968	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Chris Lattner	a83f66d1bb	remove xchg and shift-reg-by-1 instructions, which are dead. llvm-svn: 45870	2008-01-11 18:00:50 +00:00
Chris Lattner	20d1e419b3	more flags set right llvm-svn: 45860	2008-01-11 07:18:17 +00:00
Chris Lattner	48c54909dc	IMPLICIT_USE and IMPLICIT_DEF are dead, remove them. llvm-svn: 45838	2008-01-10 19:27:54 +00:00
Chris Lattner	9d7971791b	Start inferring side effect information more aggressively, and fix many bugs in the x86 backend where instructions were not marked maystore/mayload, and perf issues where instructions were not marked neverHasSideEffects. It would be really nice if we could write patterns for copy instructions. I have audited all the x86 instructions down to MOVDQAmr. The flags on others and on other targets are probably not right in all cases, but no clients currently use this info that are enabled by default. llvm-svn: 45829	2008-01-10 07:59:24 +00:00
Chris Lattner	7ceba534ba	rename X86InstrX86-64.td -> X86Instr64bit.td llvm-svn: 45826	2008-01-10 05:50:42 +00:00
Chris Lattner	6ad01a9965	remove explicit sets of 'neverHasSideEffects' that can now be inferred from the instr patterns. llvm-svn: 45824	2008-01-10 05:45:39 +00:00
Chris Lattner	9b4f2b2316	get def use info more correct. llvm-svn: 45821	2008-01-10 05:12:37 +00:00
Chris Lattner	6b6e6f1f6e	The pic base can't be duplicated. llvm-svn: 45668	2008-01-06 23:49:32 +00:00
Chris Lattner	14310afe42	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	6ad8e32f6b	getting the pic base has no side effects. llvm-svn: 45618	2008-01-05 03:54:32 +00:00
Evan Cheng	5b9282f3b5	Combine MovePCtoStack + POP32r into one instruction MOVPC32r so it can be moved if needed. llvm-svn: 45605	2008-01-05 00:41:47 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Evan Cheng	8f4ec948d3	Fix JIT code emission of X86::MovePCtoStack. llvm-svn: 45307	2007-12-22 02:26:46 +00:00
Bill Wendling	e5af8b6e5c	Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I based what flag to set on whether it was already marked as "isRematerializable". If there was a further check to determine if it's "really" rematerializable, then I marked it as "mayHaveSideEffects" and created a check in the X86 back-end similar to the remat one. llvm-svn: 45132	2007-12-17 23:07:56 +00:00
Evan Cheng	42f27a28a4	Fix bsf / bsr jit encoding. llvm-svn: 45037	2007-12-14 18:49:43 +00:00
Dan Gohman	0efc49e9b8	Fix Intel asm syntax for the bsr and bsf instructions. llvm-svn: 45030	2007-12-14 15:10:00 +00:00
Evan Cheng	6909ff8c4b	Fix ctlz and cttz. llvm definition requires them to return number of bits in of the src type when value is zero. llvm-svn: 45029	2007-12-14 08:30:15 +00:00
Evan Cheng	51cf86ded0	Implement ctlz and cttz with bsr and bsf. llvm-svn: 45024	2007-12-14 02:13:44 +00:00
Evan Cheng	343929c773	Fold some and + shift in x86 addressing mode. llvm-svn: 44970	2007-12-13 00:43:27 +00:00
Evan Cheng	64a1febf9a	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Bill Wendling	934fcd87e7	Unifacalize the CALLSEQ{START,END} stuff. llvm-svn: 44045	2007-11-13 09:19:02 +00:00
Bill Wendling	cc75435ebf	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Owen Anderson	aba398a5ce	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Evan Cheng	9af50ee6ef	Commute x86 cmove instructions by swapping the operands and change the condition to its inverse. Testing this as llcbeta llvm-svn: 42661	2007-10-05 23:13:21 +00:00
Evan Cheng	f658191412	ADC and SBB uses EFLAGS. llvm-svn: 42640	2007-10-05 17:59:57 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Evan Cheng	c2acb6f2e5	Stop inventing new words. :-) llvm-svn: 42429	2007-09-28 01:35:02 +00:00
Evan Cheng	d3ff9d3ff7	Pessimisively assume ADJCALLSTACKDOWN / ADJCALLSTACKUP (which becomes sub / add) clobbers EFLAGS. llvm-svn: 42426	2007-09-28 01:19:48 +00:00
Evan Cheng	66eeb8440c	Some assemblers do not recognize aliases pushfd, pushfq, popfd, and popfq. Just emit them as pushf and popf. llvm-svn: 42371	2007-09-26 21:28:00 +00:00
Evan Cheng	37ee6eba29	Typos: POPQ -> POPFQ, POPD -> POPFD. llvm-svn: 42348	2007-09-26 06:38:29 +00:00
Evan Cheng	5cb9dbaaa1	Add pushf{d\|q}, popf{d\|q} to push and pop EFLAGS register. llvm-svn: 42335	2007-09-26 01:29:06 +00:00
Evan Cheng	36b3babfde	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. llvm-svn: 42285	2007-09-25 01:57:46 +00:00
Dan Gohman	a264777dc1	Fix the syntax for the .loc directive in preparation for using it. llvm-svn: 42268	2007-09-24 19:25:06 +00:00
Dale Johannesen	ea6ffa0b36	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Evan Cheng	13797e4a74	Add implicit def of EFLAGS on those instructions that may modify flags. llvm-svn: 41962	2007-09-14 21:48:26 +00:00
Evan Cheng	b43255bc68	Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. llvm-svn: 41863	2007-09-11 19:55:27 +00:00
Evan Cheng	65df926ced	TableGen no longer emit CopyFromReg nodes for implicit results in physical registers. The scheduler is now responsible for emitting them. llvm-svn: 41781	2007-09-07 23:59:02 +00:00
Dan Gohman	3bc1bc2590	Avoid storing and reloading zeros and other constants from stack slots by flagging the associated instructions as being trivially rematerializable. llvm-svn: 41775	2007-09-07 21:32:51 +00:00
Evan Cheng	527fe7ab57	Mark load instructions with isLoad = 1. llvm-svn: 41595	2007-08-30 05:49:43 +00:00
Dale Johannesen	a85f11d870	Long double patch 4 of N: initial x87 implementation. Lots of problems yet but some simple things work. llvm-svn: 40847	2007-08-05 18:49:15 +00:00
Evan Cheng	3163814591	Switch some multiplication instructions over to the new scheme for testing. llvm-svn: 40723	2007-08-02 05:48:35 +00:00
Evan Cheng	0fa6cdbff5	Mac OS X X86-64 low 4G address not available. llvm-svn: 40701	2007-08-01 23:45:51 +00:00
Evan Cheng	fb587a3851	Be more precise. llvm-svn: 40689	2007-08-01 20:22:37 +00:00
Dan Gohman	e3464e6bec	Change the x86 assembly output to use tab characters to separate the mnemonics from their operands instead of single spaces. This makes the assembly output a little more consistent with various other compilers (f.e. GCC), and slightly easier to read. Also, update the regression tests accordingly. llvm-svn: 40648	2007-07-31 20:11:57 +00:00
Evan Cheng	3493ec0ce1	Redo and generalize previously removed opt for pinsrw: (vextract (v4i32 bc (v4f32 s2v (f32 load ))), 0) -> (i32 load ) llvm-svn: 40628	2007-07-31 08:04:03 +00:00
Christopher Lamb	919ce03da6	Change the x86 backend to use extract_subreg for truncation operations. Passes DejaGnu, SingleSource and MultiSource. llvm-svn: 40578	2007-07-29 01:24:57 +00:00
Dan Gohman	d3a062f01b	In the .loc directive, print the fields as "debug" fields, so they don't get decorated as if for immediate fields for instructions. llvm-svn: 40529	2007-07-26 15:24:15 +00:00
Evan Cheng	53cb03b583	No more noResults. llvm-svn: 40132	2007-07-21 00:34:19 +00:00
Evan Cheng	8312ed6f77	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; llvm-svn: 40033	2007-07-19 01:14:50 +00:00
Anton Korobeynikov	5635277c36	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dan Gohman	9cbc3fb1ab	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). llvm-svn: 37728	2007-06-26 00:48:07 +00:00
Dan Gohman	b60d8a92c9	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. llvm-svn: 37644	2007-06-19 01:48:05 +00:00
Nate Begeman	f496eb7607	Reference correct header llvm-svn: 36834	2007-05-06 04:00:55 +00:00
Bill Wendling	552e4ff1be	Add SSSE3 as a feature of Core2. Add MMX registers to the list of registers clobbered by a call. llvm-svn: 36448	2007-04-25 21:31:48 +00:00
Lauro Ramos Venancio	b75c6c5cbc	X86 TLS: optimize the implementation of "local exec" model. llvm-svn: 36359	2007-04-23 01:28:10 +00:00
Lauro Ramos Venancio	b1a101f0e7	X86 TLS: fix and optimize the implementation of "initial exec" model. llvm-svn: 36355	2007-04-22 22:50:52 +00:00
Lauro Ramos Venancio	bc32d90b46	Implement "general dynamic", "initial exec" and "local exec" TLS models for X86 32 bits. llvm-svn: 36283	2007-04-20 21:38:10 +00:00
Anton Korobeynikov	9bc4b792bf	Implemented correct stack probing on mingw/cygwin for dynamic alloca's. Also, fixed static case in presence of eax livin. This fixes PR331 PS: Why don't we still have push/pop instructions? :) llvm-svn: 36195	2007-04-17 09:20:00 +00:00
Bill Wendling	3b1189afbf	Add support for our first SSSE3 instruction "pmulhrsw". llvm-svn: 35869	2007-04-10 22:10:25 +00:00
Evan Cheng	00a5cbf9e7	Mark re-materializable instructions. llvm-svn: 35230	2007-03-21 00:16:56 +00:00
Jim Laskey	23ed7d2625	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Chris Lattner	784201d1b9	Fix a misencoding of CBW and CWD. This fixes PR1030. llvm-svn: 33486	2007-01-24 18:31:00 +00:00
Evan Cheng	adeea85f7d	- Switch X86-64 JIT to large code size model. - Re-enable some codegen niceties for X86-64 static relocation model codegen. - Clean ups, etc. llvm-svn: 32238	2006-12-05 19:50:18 +00:00
Evan Cheng	456101ebb9	- Use a different wrapper node for RIP-relative GV, etc. - Proper support for both small static and PIC modes under X86-64 - Some (non-optimal) support for medium modes. llvm-svn: 32046	2006-11-30 21:55:46 +00:00
Evan Cheng	f968824cdb	Custom lower READCYCLECOUNTER for x86-64. llvm-svn: 32017	2006-11-29 08:28:13 +00:00
Chris Lattner	7c265ad682	remove dead/redundant vars llvm-svn: 31435	2006-11-03 23:48:56 +00:00
Reid Spencer	db06ed9156	Add debug support for X86/ELF targets (Linux). This allows llvm-gcc4 generated object modules to be debugged with gdb. Hopefully this helps pre-release debugging. llvm-svn: 31299	2006-10-30 22:32:30 +00:00
Evan Cheng	fe5bb5dbe6	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Chris Lattner	b25677f5ca	Move the Imp tblgen class from the X86 backend to common code. llvm-svn: 30907	2006-10-12 17:49:27 +00:00
Chris Lattner	a678c5bac1	Mark ADJCALLSTACKUP/DOWN as clobbering ESP so that virtregmap will notice that it can't assume ESP is unmodified across the instrs. llvm-svn: 30905	2006-10-12 17:42:56 +00:00
Evan Cheng	ca66f49574	Add properties to ComplexPattern. llvm-svn: 30891	2006-10-11 21:03:53 +00:00
Evan Cheng	d22f3dd3ed	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Evan Cheng	02e193e2ff	Delete dead code; fix 80 col violations. llvm-svn: 30583	2006-09-22 21:43:59 +00:00
Evan Cheng	cfd7b147cf	X86ISD::CMP now produces a chain as well as a flag. Make that the chain operand of a conditional branch to allow load folding into CMP / TEST instructions. llvm-svn: 30241	2006-09-11 02:19:56 +00:00
Evan Cheng	15dd42884e	Committing X86-64 support. llvm-svn: 30177	2006-09-08 06:48:29 +00:00
Chris Lattner	9c7673ffca	Eliminate X86ISD::TEST, using X86ISD::CMP instead. Match X86ISD::CMP patterns using test, which provides nice simplifications like: - movl %edi, %ecx - andl $2, %ecx - cmpl $0, %ecx + testl $2, %edi je LBB1_11 #cond_next90 There are a couple of dagiselemitter deficiencies that this exposes, they will be handled later. llvm-svn: 30156	2006-09-07 20:33:45 +00:00
Evan Cheng	798aa508a3	Consistency. llvm-svn: 30152	2006-09-07 19:03:48 +00:00
Evan Cheng	34a49551f5	CALLSEQ_* produces chain even if that's not needed. llvm-svn: 29603	2006-08-11 09:03:33 +00:00
Evan Cheng	100096b2bb	Clean up. llvm-svn: 29228	2006-07-20 21:37:39 +00:00
Evan Cheng	a2eaed93a0	INC / DEC instructions have shorter code size than ADD32ri8, etc. llvm-svn: 29194	2006-07-19 00:27:29 +00:00
Evan Cheng	db529debec	Emit inc / dec of registers as one byte instruction. llvm-svn: 29110	2006-07-11 19:49:49 +00:00
Evan Cheng	1d5fa40da3	Add shift and rotate by 1 instructions / patterns. llvm-svn: 28980	2006-06-29 00:36:51 +00:00
Evan Cheng	a37a2f781e	Remove dead code. llvm-svn: 28938	2006-06-27 20:34:14 +00:00
Evan Cheng	0e2235b803	X86 call instructions can take variable number of operands. Parameters of vector types are passed via XMM registers. llvm-svn: 28789	2006-06-14 22:24:55 +00:00
Evan Cheng	ed96100b00	Incorrect AT&T opcode. llvm-svn: 28666	2006-06-02 21:09:10 +00:00
Evan Cheng	4488266c46	Rename ASM modifier trunc8, trunc16 to subreg8, subreg16. llvm-svn: 28606	2006-05-31 22:34:26 +00:00
Evan Cheng	f90443c471	Sign extender llvm-svn: 28603	2006-05-31 22:05:11 +00:00
Evan Cheng	f7637e403f	A addressing mode folding enhancement: Fold c2 in (x << c1) \| c2 where (c2 < c1) e.g. int test(int x) { return (x << 3) + 7; } This can be codegen'd as: leal 7(,%eax,8), %eax llvm-svn: 28550	2006-05-30 06:59:36 +00:00
Evan Cheng	550e73a900	Remove unused patterns. llvm-svn: 28417	2006-05-20 01:40:16 +00:00
Evan Cheng	3a90665f14	- Use exact-width integer types, e.g. int32_t, to avoid confusion. - Fix a couple of minor bugs in i16immSExt8 and i16immZExt8. - Added loadiPTR fragment used for indirect jumps and calls. llvm-svn: 28392	2006-05-19 18:40:54 +00:00
Evan Cheng	ff19d6478e	Explicitly specify MOV32mi can only be used store 32-bit GV, etc. llvm-svn: 28390	2006-05-19 07:30:36 +00:00
Evan Cheng	070813257a	Use generic iPTR instead i32 to represent pointer type. llvm-svn: 28371	2006-05-17 21:21:41 +00:00
Evan Cheng	dc9b5f5fc0	X86 integer register classes naming changes. Make them consistent with FP, vector classes. llvm-svn: 28324	2006-05-16 07:21:53 +00:00
Evan Cheng	0fb3fc3626	Fixing truncate. Previously we were emitting truncate from r16 to r8 as movw. That is we promote the destination operand to r16. So %CH = TRUNC_R16_R8 %BP is emitted as movw %bp, %cx. This is incorrect. If %cl is live, it would be clobbered. Ideally we want to do the opposite, that is emitted it as movb ??, %ch But this is not possible since %bp does not have a r8 sub-register. We are now defining a new register class R16_ which is a subclass of R16 containing only those 16-bit registers that have r8 sub-registers (i.e. AX - DX). We isel the truncate to two instructions, a MOV16to16_ to copy the value to the R16_ class, followed by a TRUNC_R16_R8. Due to bug 770, the register colaescer is not going to coalesce between R16 and R16_. That will be fixed later so we can eliminate the MOV16to16_. Right now, it can only be eliminated if we are lucky that source and destination registers are the same. llvm-svn: 28164	2006-05-08 08:01:26 +00:00
Evan Cheng	0e9ec8d566	Need extload patterns after Chris' DAG combiner changes llvm-svn: 28127	2006-05-05 08:23:07 +00:00
Evan Cheng	84612a59c2	Better implementation of truncate. ISel matches it to a pseudo instruction that gets emitted as movl (for r32 to i16, i8) or a movw (for r16 to i8). And if the destination gets allocated a subregister of the source operand, then the instruction will not be emitted at all. llvm-svn: 28119	2006-05-05 05:40:20 +00:00
Evan Cheng	11e3cec8bd	Make x86 isel lowering produce tailcall nodes. They are match to normal calls for now. Patch contributed by Alexander Friedman. llvm-svn: 27994	2006-04-27 08:40:39 +00:00
Nate Begeman	0d74cbcb6b	Optimized stores to the constant pool, while cool, are unnecessary. llvm-svn: 27948	2006-04-22 22:31:45 +00:00
Nate Begeman	7ed816f900	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Evan Cheng	169240beb7	- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc. - Some bug fixes and naming inconsistency fixes. llvm-svn: 27377	2006-04-03 20:53:28 +00:00
Evan Cheng	875c895b0f	Added missing (any_extend (load ...)) patterns. llvm-svn: 27120	2006-03-25 09:45:48 +00:00
Chris Lattner	ec3f1b5cd1	Fix the encodings of these new instructions, hopefully fixing the JIT failures from last night llvm-svn: 26981	2006-03-23 16:13:50 +00:00
Nate Begeman	0ec15cd042	Add support for 8 bit immediates with 16/32 bit cmp instructions llvm-svn: 26966	2006-03-23 01:29:48 +00:00
Evan Cheng	8dd794ea70	Use the generic vector register classes VR64 / VR128 rather than V4F32, V8I16, etc. llvm-svn: 26838	2006-03-18 01:23:20 +00:00
Evan Cheng	ee1a44d5d8	Move some pattern fragments to the right files. llvm-svn: 26831	2006-03-17 19:55:52 +00:00
Evan Cheng	d16fa97974	- Nuke 16-bit SBB instructions. We'll never use them. - Nuke a bogus comment. llvm-svn: 26815	2006-03-17 02:24:04 +00:00
Evan Cheng	d73d06f052	X86ISD::REP_STOS and X86ISD::REP_MOVS now produces a flag. llvm-svn: 26604	2006-03-07 23:34:23 +00:00
Evan Cheng	2327759419	Enable Dwarf debugging info. llvm-svn: 26581	2006-03-07 02:02:57 +00:00
Chris Lattner	999aa36a04	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Evan Cheng	995c9806ba	* Allow mul, shl nodes to be codegen'd as LEA (if appropriate). * Add patterns to handle GlobalAddress, ConstantPool, etc. MOV32ri to materialize these nodes in registers. ADD32ri to handle %reg + GA, etc. MOV32mi to handle store GA, etc. to memory. llvm-svn: 26374	2006-02-25 10:02:21 +00:00
Evan Cheng	cb9fb051a5	- Clean up the lowering and selection code of ConstantPool, GlobalAddress, and ExternalSymbol. - Use C++ code (rather than tblgen'd selection code) to match the above mentioned leaf nodes. Do not mutate and nodes and do not record the selection in CodeGenMap. These nodes should be safe to duplicate. This is a performance win. llvm-svn: 26335	2006-02-23 20:41:18 +00:00
Evan Cheng	2977507828	PIC related bug fixes. 1. Various asm printer bug. 2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper. llvm-svn: 26324	2006-02-23 02:43:52 +00:00
Evan Cheng	005de9e2bb	Added MMX, SSE1, and SSE2 vector instructions and some simple patterns. Fixed some existing bugs (wrong predicates, prefixes) at the same time. llvm-svn: 26310	2006-02-22 02:26:30 +00:00
Evan Cheng	8ab6294f94	One more round of reorg so sabre doesn't freak out. :-) llvm-svn: 26303	2006-02-21 20:00:20 +00:00
Evan Cheng	f7296d8fa5	A big more cleaning up. llvm-svn: 26302	2006-02-21 19:30:30 +00:00
Evan Cheng	223d3a073a	Moving things to their proper places. llvm-svn: 26301	2006-02-21 19:26:52 +00:00
Evan Cheng	fee17dfff8	Split instruction info into multiple files, one for each of x87, MMX, and SSE. llvm-svn: 26300	2006-02-21 19:13:53 +00:00
Evan Cheng	89553f1e86	Added separate alias instructions for SSE logical ops that operate on non-packed types. llvm-svn: 26297	2006-02-21 02:24:38 +00:00
Evan Cheng	e68325d8aa	Added MMX and XMM packed integer move instructions, movd and movq. llvm-svn: 26296	2006-02-21 01:39:57 +00:00
Evan Cheng	6a9422ce1c	Added x86 integer vector types: 64-bit packed byte integer (v16i8), 64-bit packed word integer (v8i16), and 64-bit packed doubleword integer (v2i32). llvm-svn: 26294	2006-02-20 22:34:53 +00:00
Evan Cheng	b3d9ee74ad	Added fisttp for fp to int conversion. llvm-svn: 26283	2006-02-18 02:36:28 +00:00
Evan Cheng	bf3558a375	x86 / Darwin PIC support. llvm-svn: 26273	2006-02-18 00:15:05 +00:00
Nate Begeman	9c0ab71f4a	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Evan Cheng	9de0ad321a	pxor (for FLD0SS) encoding was missing the OpSize prefix. llvm-svn: 26244	2006-02-16 23:59:30 +00:00
Evan Cheng	bf4008c701	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Evan Cheng	af9730a217	MOVAPSrr and MOVAPDrr instruction format should be MRMSrcReg. llvm-svn: 26234	2006-02-16 19:34:41 +00:00
Evan Cheng	da095e61c1	cvtsd2ss / cvtss2sd encoding bug. llvm-svn: 26193	2006-02-15 00:31:03 +00:00
Evan Cheng	198d9447d6	movaps, movapd encoding bug. llvm-svn: 26192	2006-02-15 00:11:37 +00:00
Chris Lattner	da4bd5c64e	Eliminate the printCallOperand method, using a 'call' modifier on printOperand instead. llvm-svn: 26025	2006-02-06 23:41:19 +00:00
Evan Cheng	8fd9fd7866	Remove an unnecessary predicate. llvm-svn: 25954	2006-02-04 02:23:01 +00:00
Evan Cheng	078962656b	Separate FILD and FILD_FLAG, the later is only used for SSE2. It produces a flag so it can be flagged to a FST. llvm-svn: 25953	2006-02-04 02:20:30 +00:00
Evan Cheng	b78bb375c2	Rearrange code to my liking. :) llvm-svn: 25887	2006-02-01 23:01:57 +00:00
Evan Cheng	ef93bdc0cf	- Use xor to clear integer registers (set R, 0). - Added a new format for instructions where the source register is implied and it is same as the destination register. Used for pseudo instructions that clear the destination register. llvm-svn: 25872	2006-02-01 06:13:50 +00:00
Evan Cheng	45ebd632f2	- Allow XMM load (for scalar use) to be folded into ANDP* and XORP. - Use XORP to implement fneg. llvm-svn: 25857	2006-01-31 22:28:30 +00:00
Chris Lattner	5587b270e4	* Fix 80-column violations * Rename hasSSE -> hasSSE1 to avoid my continual confusion with 'has any SSE'. * Add inline asm constraint specification. llvm-svn: 25854	2006-01-31 19:43:35 +00:00
Evan Cheng	49467b6b5b	Added custom lowering of fabs llvm-svn: 25831	2006-01-31 03:14:29 +00:00
Evan Cheng	d2d96373dc	Always use FP stack instructions to perform i64 to f64 as well as f64 to i64 conversions. SSE does not have instructions to handle these tasks. llvm-svn: 25817	2006-01-30 08:02:57 +00:00
Chris Lattner	b66484069a	The FP stack doesn't support UNDEF, ask the legalizer to legalize it instead of lying and saying we have it. llvm-svn: 25775	2006-01-29 06:44:22 +00:00
Evan Cheng	03aaa82992	AT&T assembly convention: registers are in lower case. llvm-svn: 25714	2006-01-27 22:53:29 +00:00
Evan Cheng	5891f49c47	x86 CPU detection and proper subtarget support llvm-svn: 25679	2006-01-27 08:10:46 +00:00
Chris Lattner	20d4194a0d	PHI and INLINEASM are now built-in instructions provided by Target.td llvm-svn: 25674	2006-01-27 01:46:15 +00:00

... 3 4 5 6 7 ...

668 Commits