llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Dale Johannesen	15b76de064	Add support for 8 and 16 bit forms of __sync builtins on X86. Change "lock" instructions to be on a separate line. This is needed to work around a bug in the Darwin assembler. llvm-svn: 54999	2008-08-19 18:47:28 +00:00
Dan Gohman	cc784f1662	Re-introduce the 8-bit subreg zext-inreg patterns for x86-32, this time using MOV32to32_ and MOV16to16_. Thanks to Evan for suggesting this. llvm-svn: 54418	2008-08-06 18:27:21 +00:00
Dan Gohman	99d70043f9	xchg does not modify FLAGS. llvm-svn: 54411	2008-08-06 15:52:50 +00:00
Dan Gohman	efb5d2ce6e	Reapply r54147 with a constraint to only use the 8-bit subreg form on x86-64, to avoid the problem with x86-32 having GPRs that don't have 8-bit subregs. Also, change several 16-bit instructions to use equivalent 32-bit instructions. These have a smaller encoding and avoid partial-register updates. llvm-svn: 54223	2008-07-30 18:09:17 +00:00
Dan Gohman	ebe629a4b2	Revert 54147. llvm-svn: 54148	2008-07-29 01:02:18 +00:00
Dan Gohman	1816900fd1	Add x86 isel patterns to match what would be a ZERO_EXTEND_INREG operation, which is represented in codegen as an 'and' operation. This matches them with movz instructions, instead of leaving them to be matched by and instructions with an immediate field. llvm-svn: 54147	2008-07-28 22:18:25 +00:00
Anton Korobeynikov	f13fbd6879	Fix encoding of atomic compare and swap for i64 llvm-svn: 53911	2008-07-22 16:22:48 +00:00
Mon P Wang	7d89d61387	Added MemOperands to Atomic operations since Atomics touches memory. Added abstract class MemSDNode for any Node that have an associated MemOperand Changed atomic.lcs => atomic.cmp.swap, atomic.las => atomic.load.add, and atomic.lss => atomic.load.sub llvm-svn: 52706	2008-06-25 08:15:39 +00:00
Evan Cheng	0570953e28	XOR32rr, etc. are not AsCheapAsMove, but MOV32ri, etc. are. llvm-svn: 52454	2008-06-18 08:13:07 +00:00
Andrew Lenharth	327c3e7559	add missing atomic intrinsic from gcc llvm-svn: 52270	2008-06-14 05:48:15 +00:00
Dan Gohman	00823cb0d4	Teach the DAGISelEmitter to not compute the variable_ops operand index for the input pattern in terms of the output pattern. Instead keep track of how many fixed operands the input pattern actually has, and have the input matching code pass the output-emitting function that index value. This simplifies the code, disentangles variables_ops from the support for predication operations, and makes variable_ops more robust. llvm-svn: 51808	2008-05-31 02:11:25 +00:00
Dan Gohman	aa8fcd5657	Add patterns for CALL32m and CALL64m. They aren't matched in most cases due to an isel deficiency already noted in lib/Target/X86/README.txt, but they can be matched in this fold-call.ll testcase, for example. This is interesting mainly because it exposes a tricky tblgen bug; tblgen was incorrectly computing the starting index for variable_ops in the case of a complex pattern. llvm-svn: 51706	2008-05-29 21:50:34 +00:00
Dan Gohman	4e87d82476	Fix a tblgen problem handling variable_ops in tblgen instruction definitions. This adds a new construct, "discard", for indicating that a named node in the input matching pattern is to be discarded, instead of corresponding to a node in the output pattern. This allows tblgen to know where the arguments for the varaible_ops are supposed to begin. This fixes "rdar://5791600", whatever that is ;-). llvm-svn: 51699	2008-05-29 19:57:41 +00:00
Bill Wendling	81199f0cc8	XOR?RI instructions aren't as cheap as moves. llvm-svn: 51664	2008-05-29 03:46:36 +00:00
Bill Wendling	edb38e9410	Implement "AsCheapAsAMove" for some obviously cheap instructions: xor and the like. llvm-svn: 51662	2008-05-29 01:02:09 +00:00
Evan Cheng	95987c2586	Doh. Alignment is in bytes, not in bits. llvm-svn: 51092	2008-05-14 02:49:43 +00:00
Evan Cheng	cb56638548	- Fix the pasto in the fix for a previous pasto. - Incorporate Chris' comment suggestion. llvm-svn: 51061	2008-05-13 18:59:59 +00:00
Evan Cheng	cf6928983b	- Don't treat anyext 16-bit load as a 32-bit load if it's volatile. - Correct a pasto. llvm-svn: 51054	2008-05-13 16:45:56 +00:00
Evan Cheng	e4ee4c2870	On x86, it's safe to treat i32 load anyext as a normal i32 load. Ditto for i8 anyext load to i16. llvm-svn: 51019	2008-05-13 00:54:02 +00:00
Dan Gohman	efa0925915	Fix a copy+paste bug; pseudo-instructions shouldn't have encoding information. llvm-svn: 50997	2008-05-12 20:22:45 +00:00
Mon P Wang	84a269e023	Added addition atomic instrinsics and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Anton Korobeynikov	04c974b1b2	Add General Dynamic TLS model for X86-64. Some parts looks really ugly (look for tlsaddr pattern), but should work. Work is in progress, more models will follow llvm-svn: 50630	2008-05-04 21:36:32 +00:00
Arnold Schwaighofer	f58a35e2ec	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Evan Cheng	0fe99f024d	Fix MMX_MOVQ2DQrr pattern. It's illegal to do a bitconvert from a smaller type to a larger one. llvm-svn: 50278	2008-04-25 18:19:54 +00:00
Evan Cheng	b1d240f973	xchg which references a memory operand does not need to lock prefix. Atomicity is guaranteed. llvm-svn: 49946	2008-04-19 01:20:30 +00:00
Evan Cheng	a626e13995	- Fix atomic operation JIT encoding. - Remove unused instructions. llvm-svn: 49921	2008-04-18 20:55:36 +00:00
Evan Cheng	2b03674feb	Also support Intel asm syntax. llvm-svn: 49878	2008-04-17 23:35:10 +00:00
Evan Cheng	0b36ca5023	Fix assembly code for atomic operations. llvm-svn: 49869	2008-04-17 21:26:35 +00:00
Nate Begeman	81586b24d6	80 col fix llvm-svn: 49569	2008-04-12 00:47:57 +00:00
Evan Cheng	aca67f0b29	Allow certain lea instructions to be rematerialized. llvm-svn: 48855	2008-03-27 01:41:09 +00:00
Arnold Schwaighofer	19a78545d9	Don't loose incoming argument registers. Fix documentation style. llvm-svn: 48545	2008-03-19 16:39:45 +00:00
Evan Cheng	11d2c09adc	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Christopher Lamb	0f1c32eb63	Get rid of a pseudo instruction and replace it with subreg based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side affects. Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes. llvm-svn: 48329	2008-03-13 05:47:01 +00:00
Christopher Lamb	74f4d837df	Recommitting parts of r48130. These do not appear to cause the observed failures. llvm-svn: 48223	2008-03-11 10:09:17 +00:00
Chris Lattner	9826c9365e	Change the model for FP Stack return to use fp operands on the RET instruction instead of using FpSET_ST0_32. This also generalizes the code to handling returning of multiple FP results. llvm-svn: 48209	2008-03-11 03:23:40 +00:00
Evan Cheng	067ecbc341	Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests. llvm-svn: 48167	2008-03-10 19:31:26 +00:00
Christopher Lamb	32e5ce3d96	Allow insert_subreg into implicit, target-specific values. Change insert/extract subreg instructions to be able to be used in TableGen patterns. Use the above features to reimplement an x86-64 pseudo instruction as a pattern. llvm-svn: 48130	2008-03-10 06:12:08 +00:00
Andrew Lenharth	95c88272c6	64bit CAS on 32bit x86. llvm-svn: 47929	2008-03-05 01:15:49 +00:00
Evan Cheng	ae414db8d2	80 column violations. llvm-svn: 47878	2008-03-04 03:20:06 +00:00
Evan Cheng	139517b682	Remove -always-fold-and-in-test. llvm-svn: 47871	2008-03-04 00:40:35 +00:00
Andrew Lenharth	ba7f925582	good catch anton llvm-svn: 47800	2008-03-01 23:18:21 +00:00
Andrew Lenharth	f6c220738c	make CAS work llvm-svn: 47799	2008-03-01 22:27:48 +00:00
Andrew Lenharth	b91c664226	all but CAS working on x86 llvm-svn: 47798	2008-03-01 21:52:34 +00:00
Andrew Lenharth	ad29a49169	Add lock prefix support to x86. Also add the instructions necessary for the atomic ops. They are still marked pseudo, since I cannot figure out what format to use, but they are the correct opcode. llvm-svn: 47795	2008-03-01 13:37:02 +00:00
Andrew Lenharth	db9cd46f5d	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Evan Cheng	f3a7cd1c62	Poorly named option. llvm-svn: 47400	2008-02-20 20:57:32 +00:00
Evan Cheng	35253f2c22	Add hidden option -x86-fold-and-in-test to test the effect the test / and folding change. llvm-svn: 47351	2008-02-19 23:36:51 +00:00
Chris Lattner	3a4ac3a69e	Don't fold and's into test instructions if they have multiple uses. This compiles test-nofold.ll into: _test: movl $15, %ecx andl 4(%esp), %ecx testl %ecx, %ecx movl $42, %eax cmove %ecx, %eax ret instead of: _test: movl 4(%esp), %eax movl %eax, %ecx andl $15, %ecx testl $15, %eax movl $42, %eax cmove %ecx, %eax ret llvm-svn: 47330	2008-02-19 17:37:35 +00:00
Evan Cheng	a377b2bbd1	Fix a x86-64 codegen deficiency. Allow gv + offset when using rip addressing mode. Before: _main: subq $8, %rsp leaq _X(%rip), %rax movsd 8(%rax), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Now: _main: subq $8, %rsp movsd _X+8(%rip), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Notice there is another idiotic codegen issue that needs to be fixed asap: xorl %ecx, %ecx movl %ecx, %eax llvm-svn: 46850	2008-02-07 08:53:49 +00:00
Nate Begeman	ead8dfeef2	SSE 4.1 Intrinsics and detection llvm-svn: 46681	2008-02-03 07:18:54 +00:00
Duncan Sands	aff4eef6df	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	6846e346a8	rename SDTRet -> SDTNone. Move definition of 'trap' sdnode up from x86 instrinfo to targetselectiondag.td. llvm-svn: 46017	2008-01-15 22:02:54 +00:00
Chris Lattner	0072ce1ff4	no need to expand ISD::TRAP to X86ISD::TRAP, just match ISD::TRAP. llvm-svn: 46015	2008-01-15 21:58:22 +00:00
Anton Korobeynikov	44893ee93d	Fix JIT encoding of trap/ud2 instruction llvm-svn: 46012	2008-01-15 21:40:02 +00:00
Anton Korobeynikov	08ea121968	For PR1839: add initial support for __builtin_trap. llvm-gcc part is missed as well as PPC codegen llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Chris Lattner	a83f66d1bb	remove xchg and shift-reg-by-1 instructions, which are dead. llvm-svn: 45870	2008-01-11 18:00:50 +00:00
Chris Lattner	20d1e419b3	more flags set right llvm-svn: 45860	2008-01-11 07:18:17 +00:00
Chris Lattner	48c54909dc	IMPLICIT_USE and IMPLICIT_DEF are dead, remove them. llvm-svn: 45838	2008-01-10 19:27:54 +00:00
Chris Lattner	9d7971791b	Start inferring side effect information more aggressively, and fix many bugs in the x86 backend where instructions were not marked maystore/mayload, and perf issues where instructions were not marked neverHasSideEffects. It would be really nice if we could write patterns for copy instructions. I have audited all the x86 instructions down to MOVDQAmr. The flags on others and on other targets are probably not right in all cases, but no clients currently use this info that are enabled by default. llvm-svn: 45829	2008-01-10 07:59:24 +00:00
Chris Lattner	7ceba534ba	rename X86InstrX86-64.td -> X86Instr64bit.td llvm-svn: 45826	2008-01-10 05:50:42 +00:00
Chris Lattner	6ad01a9965	remove explicit sets of 'neverHasSideEffects' that can now be inferred from the instr patterns. llvm-svn: 45824	2008-01-10 05:45:39 +00:00
Chris Lattner	9b4f2b2316	get def use info more correct. llvm-svn: 45821	2008-01-10 05:12:37 +00:00
Chris Lattner	6b6e6f1f6e	The pic base can't be duplicated. llvm-svn: 45668	2008-01-06 23:49:32 +00:00
Chris Lattner	14310afe42	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	6ad8e32f6b	getting the pic base has no side effects. llvm-svn: 45618	2008-01-05 03:54:32 +00:00
Evan Cheng	5b9282f3b5	Combine MovePCtoStack + POP32r into one instruction MOVPC32r so it can be moved if needed. llvm-svn: 45605	2008-01-05 00:41:47 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Evan Cheng	8f4ec948d3	Fix JIT code emission of X86::MovePCtoStack. llvm-svn: 45307	2007-12-22 02:26:46 +00:00
Bill Wendling	e5af8b6e5c	Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I based what flag to set on whether it was already marked as "isRematerializable". If there was a further check to determine if it's "really" rematerializable, then I marked it as "mayHaveSideEffects" and created a check in the X86 back-end similar to the remat one. llvm-svn: 45132	2007-12-17 23:07:56 +00:00
Evan Cheng	42f27a28a4	Fix bsf / bsr jit encoding. llvm-svn: 45037	2007-12-14 18:49:43 +00:00
Dan Gohman	0efc49e9b8	Fix Intel asm syntax for the bsr and bsf instructions. llvm-svn: 45030	2007-12-14 15:10:00 +00:00
Evan Cheng	6909ff8c4b	Fix ctlz and cttz. llvm definition requires them to return number of bits in of the src type when value is zero. llvm-svn: 45029	2007-12-14 08:30:15 +00:00
Evan Cheng	51cf86ded0	Implement ctlz and cttz with bsr and bsf. llvm-svn: 45024	2007-12-14 02:13:44 +00:00
Evan Cheng	343929c773	Fold some and + shift in x86 addressing mode. llvm-svn: 44970	2007-12-13 00:43:27 +00:00
Evan Cheng	64a1febf9a	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Bill Wendling	934fcd87e7	Unifacalize the CALLSEQ{START,END} stuff. llvm-svn: 44045	2007-11-13 09:19:02 +00:00
Bill Wendling	cc75435ebf	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Owen Anderson	aba398a5ce	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Evan Cheng	9af50ee6ef	Commute x86 cmove instructions by swapping the operands and change the condition to its inverse. Testing this as llcbeta llvm-svn: 42661	2007-10-05 23:13:21 +00:00
Evan Cheng	f658191412	ADC and SBB uses EFLAGS. llvm-svn: 42640	2007-10-05 17:59:57 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Evan Cheng	c2acb6f2e5	Stop inventing new words. :-) llvm-svn: 42429	2007-09-28 01:35:02 +00:00
Evan Cheng	d3ff9d3ff7	Pessimisively assume ADJCALLSTACKDOWN / ADJCALLSTACKUP (which becomes sub / add) clobbers EFLAGS. llvm-svn: 42426	2007-09-28 01:19:48 +00:00
Evan Cheng	66eeb8440c	Some assemblers do not recognize aliases pushfd, pushfq, popfd, and popfq. Just emit them as pushf and popf. llvm-svn: 42371	2007-09-26 21:28:00 +00:00
Evan Cheng	37ee6eba29	Typos: POPQ -> POPFQ, POPD -> POPFD. llvm-svn: 42348	2007-09-26 06:38:29 +00:00
Evan Cheng	5cb9dbaaa1	Add pushf{d\|q}, popf{d\|q} to push and pop EFLAGS register. llvm-svn: 42335	2007-09-26 01:29:06 +00:00
Evan Cheng	36b3babfde	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. llvm-svn: 42285	2007-09-25 01:57:46 +00:00
Dan Gohman	a264777dc1	Fix the syntax for the .loc directive in preparation for using it. llvm-svn: 42268	2007-09-24 19:25:06 +00:00
Dale Johannesen	ea6ffa0b36	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Evan Cheng	13797e4a74	Add implicit def of EFLAGS on those instructions that may modify flags. llvm-svn: 41962	2007-09-14 21:48:26 +00:00
Evan Cheng	b43255bc68	Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. llvm-svn: 41863	2007-09-11 19:55:27 +00:00
Evan Cheng	65df926ced	TableGen no longer emit CopyFromReg nodes for implicit results in physical registers. The scheduler is now responsible for emitting them. llvm-svn: 41781	2007-09-07 23:59:02 +00:00
Dan Gohman	3bc1bc2590	Avoid storing and reloading zeros and other constants from stack slots by flagging the associated instructions as being trivially rematerializable. llvm-svn: 41775	2007-09-07 21:32:51 +00:00
Evan Cheng	527fe7ab57	Mark load instructions with isLoad = 1. llvm-svn: 41595	2007-08-30 05:49:43 +00:00
Dale Johannesen	a85f11d870	Long double patch 4 of N: initial x87 implementation. Lots of problems yet but some simple things work. llvm-svn: 40847	2007-08-05 18:49:15 +00:00
Evan Cheng	3163814591	Switch some multiplication instructions over to the new scheme for testing. llvm-svn: 40723	2007-08-02 05:48:35 +00:00
Evan Cheng	0fa6cdbff5	Mac OS X X86-64 low 4G address not available. llvm-svn: 40701	2007-08-01 23:45:51 +00:00
Evan Cheng	fb587a3851	Be more precise. llvm-svn: 40689	2007-08-01 20:22:37 +00:00
Dan Gohman	e3464e6bec	Change the x86 assembly output to use tab characters to separate the mnemonics from their operands instead of single spaces. This makes the assembly output a little more consistent with various other compilers (f.e. GCC), and slightly easier to read. Also, update the regression tests accordingly. llvm-svn: 40648	2007-07-31 20:11:57 +00:00
Evan Cheng	3493ec0ce1	Redo and generalize previously removed opt for pinsrw: (vextract (v4i32 bc (v4f32 s2v (f32 load ))), 0) -> (i32 load ) llvm-svn: 40628	2007-07-31 08:04:03 +00:00
Christopher Lamb	919ce03da6	Change the x86 backend to use extract_subreg for truncation operations. Passes DejaGnu, SingleSource and MultiSource. llvm-svn: 40578	2007-07-29 01:24:57 +00:00
Dan Gohman	d3a062f01b	In the .loc directive, print the fields as "debug" fields, so they don't get decorated as if for immediate fields for instructions. llvm-svn: 40529	2007-07-26 15:24:15 +00:00
Evan Cheng	53cb03b583	No more noResults. llvm-svn: 40132	2007-07-21 00:34:19 +00:00
Evan Cheng	8312ed6f77	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; llvm-svn: 40033	2007-07-19 01:14:50 +00:00
Anton Korobeynikov	5635277c36	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dan Gohman	9cbc3fb1ab	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). llvm-svn: 37728	2007-06-26 00:48:07 +00:00
Dan Gohman	b60d8a92c9	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. llvm-svn: 37644	2007-06-19 01:48:05 +00:00
Nate Begeman	f496eb7607	Reference correct header llvm-svn: 36834	2007-05-06 04:00:55 +00:00
Bill Wendling	552e4ff1be	Add SSSE3 as a feature of Core2. Add MMX registers to the list of registers clobbered by a call. llvm-svn: 36448	2007-04-25 21:31:48 +00:00
Lauro Ramos Venancio	b75c6c5cbc	X86 TLS: optimize the implementation of "local exec" model. llvm-svn: 36359	2007-04-23 01:28:10 +00:00
Lauro Ramos Venancio	b1a101f0e7	X86 TLS: fix and optimize the implementation of "initial exec" model. llvm-svn: 36355	2007-04-22 22:50:52 +00:00
Lauro Ramos Venancio	bc32d90b46	Implement "general dynamic", "initial exec" and "local exec" TLS models for X86 32 bits. llvm-svn: 36283	2007-04-20 21:38:10 +00:00
Anton Korobeynikov	9bc4b792bf	Implemented correct stack probing on mingw/cygwin for dynamic alloca's. Also, fixed static case in presence of eax livin. This fixes PR331 PS: Why don't we still have push/pop instructions? :) llvm-svn: 36195	2007-04-17 09:20:00 +00:00
Bill Wendling	3b1189afbf	Add support for our first SSSE3 instruction "pmulhrsw". llvm-svn: 35869	2007-04-10 22:10:25 +00:00
Evan Cheng	00a5cbf9e7	Mark re-materializable instructions. llvm-svn: 35230	2007-03-21 00:16:56 +00:00
Jim Laskey	23ed7d2625	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Chris Lattner	784201d1b9	Fix a misencoding of CBW and CWD. This fixes PR1030. llvm-svn: 33486	2007-01-24 18:31:00 +00:00
Evan Cheng	adeea85f7d	- Switch X86-64 JIT to large code size model. - Re-enable some codegen niceties for X86-64 static relocation model codegen. - Clean ups, etc. llvm-svn: 32238	2006-12-05 19:50:18 +00:00
Evan Cheng	456101ebb9	- Use a different wrapper node for RIP-relative GV, etc. - Proper support for both small static and PIC modes under X86-64 - Some (non-optimal) support for medium modes. llvm-svn: 32046	2006-11-30 21:55:46 +00:00
Evan Cheng	f968824cdb	Custom lower READCYCLECOUNTER for x86-64. llvm-svn: 32017	2006-11-29 08:28:13 +00:00
Chris Lattner	7c265ad682	remove dead/redundant vars llvm-svn: 31435	2006-11-03 23:48:56 +00:00
Reid Spencer	db06ed9156	Add debug support for X86/ELF targets (Linux). This allows llvm-gcc4 generated object modules to be debugged with gdb. Hopefully this helps pre-release debugging. llvm-svn: 31299	2006-10-30 22:32:30 +00:00
Evan Cheng	fe5bb5dbe6	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Chris Lattner	b25677f5ca	Move the Imp tblgen class from the X86 backend to common code. llvm-svn: 30907	2006-10-12 17:49:27 +00:00
Chris Lattner	a678c5bac1	Mark ADJCALLSTACKUP/DOWN as clobbering ESP so that virtregmap will notice that it can't assume ESP is unmodified across the instrs. llvm-svn: 30905	2006-10-12 17:42:56 +00:00
Evan Cheng	ca66f49574	Add properties to ComplexPattern. llvm-svn: 30891	2006-10-11 21:03:53 +00:00
Evan Cheng	d22f3dd3ed	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Evan Cheng	02e193e2ff	Delete dead code; fix 80 col violations. llvm-svn: 30583	2006-09-22 21:43:59 +00:00
Evan Cheng	cfd7b147cf	X86ISD::CMP now produces a chain as well as a flag. Make that the chain operand of a conditional branch to allow load folding into CMP / TEST instructions. llvm-svn: 30241	2006-09-11 02:19:56 +00:00
Evan Cheng	15dd42884e	Committing X86-64 support. llvm-svn: 30177	2006-09-08 06:48:29 +00:00
Chris Lattner	9c7673ffca	Eliminate X86ISD::TEST, using X86ISD::CMP instead. Match X86ISD::CMP patterns using test, which provides nice simplifications like: - movl %edi, %ecx - andl $2, %ecx - cmpl $0, %ecx + testl $2, %edi je LBB1_11 #cond_next90 There are a couple of dagiselemitter deficiencies that this exposes, they will be handled later. llvm-svn: 30156	2006-09-07 20:33:45 +00:00
Evan Cheng	798aa508a3	Consistency. llvm-svn: 30152	2006-09-07 19:03:48 +00:00
Evan Cheng	34a49551f5	CALLSEQ_* produces chain even if that's not needed. llvm-svn: 29603	2006-08-11 09:03:33 +00:00
Evan Cheng	100096b2bb	Clean up. llvm-svn: 29228	2006-07-20 21:37:39 +00:00
Evan Cheng	a2eaed93a0	INC / DEC instructions have shorter code size than ADD32ri8, etc. llvm-svn: 29194	2006-07-19 00:27:29 +00:00
Evan Cheng	db529debec	Emit inc / dec of registers as one byte instruction. llvm-svn: 29110	2006-07-11 19:49:49 +00:00
Evan Cheng	1d5fa40da3	Add shift and rotate by 1 instructions / patterns. llvm-svn: 28980	2006-06-29 00:36:51 +00:00
Evan Cheng	a37a2f781e	Remove dead code. llvm-svn: 28938	2006-06-27 20:34:14 +00:00
Evan Cheng	0e2235b803	X86 call instructions can take variable number of operands. Parameters of vector types are passed via XMM registers. llvm-svn: 28789	2006-06-14 22:24:55 +00:00
Evan Cheng	ed96100b00	Incorrect AT&T opcode. llvm-svn: 28666	2006-06-02 21:09:10 +00:00
Evan Cheng	4488266c46	Rename ASM modifier trunc8, trunc16 to subreg8, subreg16. llvm-svn: 28606	2006-05-31 22:34:26 +00:00
Evan Cheng	f90443c471	Sign extender llvm-svn: 28603	2006-05-31 22:05:11 +00:00
Evan Cheng	f7637e403f	A addressing mode folding enhancement: Fold c2 in (x << c1) \| c2 where (c2 < c1) e.g. int test(int x) { return (x << 3) + 7; } This can be codegen'd as: leal 7(,%eax,8), %eax llvm-svn: 28550	2006-05-30 06:59:36 +00:00
Evan Cheng	550e73a900	Remove unused patterns. llvm-svn: 28417	2006-05-20 01:40:16 +00:00
Evan Cheng	3a90665f14	- Use exact-width integer types, e.g. int32_t, to avoid confusion. - Fix a couple of minor bugs in i16immSExt8 and i16immZExt8. - Added loadiPTR fragment used for indirect jumps and calls. llvm-svn: 28392	2006-05-19 18:40:54 +00:00
Evan Cheng	ff19d6478e	Explicitly specify MOV32mi can only be used store 32-bit GV, etc. llvm-svn: 28390	2006-05-19 07:30:36 +00:00
Evan Cheng	070813257a	Use generic iPTR instead i32 to represent pointer type. llvm-svn: 28371	2006-05-17 21:21:41 +00:00
Evan Cheng	dc9b5f5fc0	X86 integer register classes naming changes. Make them consistent with FP, vector classes. llvm-svn: 28324	2006-05-16 07:21:53 +00:00
Evan Cheng	0fb3fc3626	Fixing truncate. Previously we were emitting truncate from r16 to r8 as movw. That is we promote the destination operand to r16. So %CH = TRUNC_R16_R8 %BP is emitted as movw %bp, %cx. This is incorrect. If %cl is live, it would be clobbered. Ideally we want to do the opposite, that is emitted it as movb ??, %ch But this is not possible since %bp does not have a r8 sub-register. We are now defining a new register class R16_ which is a subclass of R16 containing only those 16-bit registers that have r8 sub-registers (i.e. AX - DX). We isel the truncate to two instructions, a MOV16to16_ to copy the value to the R16_ class, followed by a TRUNC_R16_R8. Due to bug 770, the register colaescer is not going to coalesce between R16 and R16_. That will be fixed later so we can eliminate the MOV16to16_. Right now, it can only be eliminated if we are lucky that source and destination registers are the same. llvm-svn: 28164	2006-05-08 08:01:26 +00:00
Evan Cheng	0e9ec8d566	Need extload patterns after Chris' DAG combiner changes llvm-svn: 28127	2006-05-05 08:23:07 +00:00
Evan Cheng	84612a59c2	Better implementation of truncate. ISel matches it to a pseudo instruction that gets emitted as movl (for r32 to i16, i8) or a movw (for r16 to i8). And if the destination gets allocated a subregister of the source operand, then the instruction will not be emitted at all. llvm-svn: 28119	2006-05-05 05:40:20 +00:00
Evan Cheng	11e3cec8bd	Make x86 isel lowering produce tailcall nodes. They are match to normal calls for now. Patch contributed by Alexander Friedman. llvm-svn: 27994	2006-04-27 08:40:39 +00:00
Nate Begeman	0d74cbcb6b	Optimized stores to the constant pool, while cool, are unnecessary. llvm-svn: 27948	2006-04-22 22:31:45 +00:00
Nate Begeman	7ed816f900	JumpTable support! What this represents is working asm and jit support for x86 and ppc for 100% dense switch statements when relocations are non-PIC. This support will be extended and enhanced in the coming days to support PIC, and less dense forms of jump tables. llvm-svn: 27947	2006-04-22 18:53:45 +00:00
Evan Cheng	169240beb7	- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc. - Some bug fixes and naming inconsistency fixes. llvm-svn: 27377	2006-04-03 20:53:28 +00:00
Evan Cheng	875c895b0f	Added missing (any_extend (load ...)) patterns. llvm-svn: 27120	2006-03-25 09:45:48 +00:00
Chris Lattner	ec3f1b5cd1	Fix the encodings of these new instructions, hopefully fixing the JIT failures from last night llvm-svn: 26981	2006-03-23 16:13:50 +00:00
Nate Begeman	0ec15cd042	Add support for 8 bit immediates with 16/32 bit cmp instructions llvm-svn: 26966	2006-03-23 01:29:48 +00:00
Evan Cheng	8dd794ea70	Use the generic vector register classes VR64 / VR128 rather than V4F32, V8I16, etc. llvm-svn: 26838	2006-03-18 01:23:20 +00:00
Evan Cheng	ee1a44d5d8	Move some pattern fragments to the right files. llvm-svn: 26831	2006-03-17 19:55:52 +00:00
Evan Cheng	d16fa97974	- Nuke 16-bit SBB instructions. We'll never use them. - Nuke a bogus comment. llvm-svn: 26815	2006-03-17 02:24:04 +00:00
Evan Cheng	d73d06f052	X86ISD::REP_STOS and X86ISD::REP_MOVS now produces a flag. llvm-svn: 26604	2006-03-07 23:34:23 +00:00
Evan Cheng	2327759419	Enable Dwarf debugging info. llvm-svn: 26581	2006-03-07 02:02:57 +00:00
Chris Lattner	999aa36a04	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Evan Cheng	995c9806ba	* Allow mul, shl nodes to be codegen'd as LEA (if appropriate). * Add patterns to handle GlobalAddress, ConstantPool, etc. MOV32ri to materialize these nodes in registers. ADD32ri to handle %reg + GA, etc. MOV32mi to handle store GA, etc. to memory. llvm-svn: 26374	2006-02-25 10:02:21 +00:00
Evan Cheng	cb9fb051a5	- Clean up the lowering and selection code of ConstantPool, GlobalAddress, and ExternalSymbol. - Use C++ code (rather than tblgen'd selection code) to match the above mentioned leaf nodes. Do not mutate and nodes and do not record the selection in CodeGenMap. These nodes should be safe to duplicate. This is a performance win. llvm-svn: 26335	2006-02-23 20:41:18 +00:00
Evan Cheng	2977507828	PIC related bug fixes. 1. Various asm printer bug. 2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper. llvm-svn: 26324	2006-02-23 02:43:52 +00:00
Evan Cheng	005de9e2bb	Added MMX, SSE1, and SSE2 vector instructions and some simple patterns. Fixed some existing bugs (wrong predicates, prefixes) at the same time. llvm-svn: 26310	2006-02-22 02:26:30 +00:00
Evan Cheng	8ab6294f94	One more round of reorg so sabre doesn't freak out. :-) llvm-svn: 26303	2006-02-21 20:00:20 +00:00
Evan Cheng	f7296d8fa5	A big more cleaning up. llvm-svn: 26302	2006-02-21 19:30:30 +00:00
Evan Cheng	223d3a073a	Moving things to their proper places. llvm-svn: 26301	2006-02-21 19:26:52 +00:00
Evan Cheng	fee17dfff8	Split instruction info into multiple files, one for each of x87, MMX, and SSE. llvm-svn: 26300	2006-02-21 19:13:53 +00:00
Evan Cheng	89553f1e86	Added separate alias instructions for SSE logical ops that operate on non-packed types. llvm-svn: 26297	2006-02-21 02:24:38 +00:00
Evan Cheng	e68325d8aa	Added MMX and XMM packed integer move instructions, movd and movq. llvm-svn: 26296	2006-02-21 01:39:57 +00:00
Evan Cheng	6a9422ce1c	Added x86 integer vector types: 64-bit packed byte integer (v16i8), 64-bit packed word integer (v8i16), and 64-bit packed doubleword integer (v2i32). llvm-svn: 26294	2006-02-20 22:34:53 +00:00
Evan Cheng	b3d9ee74ad	Added fisttp for fp to int conversion. llvm-svn: 26283	2006-02-18 02:36:28 +00:00
Evan Cheng	bf3558a375	x86 / Darwin PIC support. llvm-svn: 26273	2006-02-18 00:15:05 +00:00
Nate Begeman	9c0ab71f4a	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Evan Cheng	9de0ad321a	pxor (for FLD0SS) encoding was missing the OpSize prefix. llvm-svn: 26244	2006-02-16 23:59:30 +00:00
Evan Cheng	bf4008c701	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Evan Cheng	af9730a217	MOVAPSrr and MOVAPDrr instruction format should be MRMSrcReg. llvm-svn: 26234	2006-02-16 19:34:41 +00:00
Evan Cheng	da095e61c1	cvtsd2ss / cvtss2sd encoding bug. llvm-svn: 26193	2006-02-15 00:31:03 +00:00
Evan Cheng	198d9447d6	movaps, movapd encoding bug. llvm-svn: 26192	2006-02-15 00:11:37 +00:00
Chris Lattner	da4bd5c64e	Eliminate the printCallOperand method, using a 'call' modifier on printOperand instead. llvm-svn: 26025	2006-02-06 23:41:19 +00:00
Evan Cheng	8fd9fd7866	Remove an unnecessary predicate. llvm-svn: 25954	2006-02-04 02:23:01 +00:00
Evan Cheng	078962656b	Separate FILD and FILD_FLAG, the later is only used for SSE2. It produces a flag so it can be flagged to a FST. llvm-svn: 25953	2006-02-04 02:20:30 +00:00
Evan Cheng	b78bb375c2	Rearrange code to my liking. :) llvm-svn: 25887	2006-02-01 23:01:57 +00:00
Evan Cheng	ef93bdc0cf	- Use xor to clear integer registers (set R, 0). - Added a new format for instructions where the source register is implied and it is same as the destination register. Used for pseudo instructions that clear the destination register. llvm-svn: 25872	2006-02-01 06:13:50 +00:00
Evan Cheng	45ebd632f2	- Allow XMM load (for scalar use) to be folded into ANDP* and XORP. - Use XORP to implement fneg. llvm-svn: 25857	2006-01-31 22:28:30 +00:00
Chris Lattner	5587b270e4	* Fix 80-column violations * Rename hasSSE -> hasSSE1 to avoid my continual confusion with 'has any SSE'. * Add inline asm constraint specification. llvm-svn: 25854	2006-01-31 19:43:35 +00:00
Evan Cheng	49467b6b5b	Added custom lowering of fabs llvm-svn: 25831	2006-01-31 03:14:29 +00:00
Evan Cheng	d2d96373dc	Always use FP stack instructions to perform i64 to f64 as well as f64 to i64 conversions. SSE does not have instructions to handle these tasks. llvm-svn: 25817	2006-01-30 08:02:57 +00:00
Chris Lattner	b66484069a	The FP stack doesn't support UNDEF, ask the legalizer to legalize it instead of lying and saying we have it. llvm-svn: 25775	2006-01-29 06:44:22 +00:00
Evan Cheng	03aaa82992	AT&T assembly convention: registers are in lower case. llvm-svn: 25714	2006-01-27 22:53:29 +00:00
Evan Cheng	5891f49c47	x86 CPU detection and proper subtarget support llvm-svn: 25679	2006-01-27 08:10:46 +00:00
Chris Lattner	20d4194a0d	PHI and INLINEASM are now built-in instructions provided by Target.td llvm-svn: 25674	2006-01-27 01:46:15 +00:00
Evan Cheng	224e14daa1	Remove the uses of STATUS flag register. Rely on node property SDNPInFlag, SDNPOutFlag, and SDNPOptInFlag instead. llvm-svn: 25629	2006-01-26 00:29:36 +00:00
Chris Lattner	c078165ea6	Emit the copies out of call return registers after the ISD::CALLSEQ_END node, fixing fastcc and the case where a function has a frame pointer due to dynamic allocas. llvm-svn: 25580	2006-01-24 05:17:12 +00:00
Evan Cheng	2e4479d4a8	Rename fcmovae to fcmovnb and fcmova to fcmovnbe (following Intel manual). Some assemblers can't recognize the aliases. llvm-svn: 25494	2006-01-21 02:55:41 +00:00
Evan Cheng	2b24240110	A few more SH{L\|R}D peepholes. llvm-svn: 25473	2006-01-20 01:13:30 +00:00
Evan Cheng	e0f6454c5e	Added i16 SH{L\|R}D patterns. llvm-svn: 25468	2006-01-19 23:26:24 +00:00
Evan Cheng	cfe9d47379	adc and sbb need an incoming flag to ensure it reads the carry flag from add / sub. llvm-svn: 25444	2006-01-19 06:53:20 +00:00
Evan Cheng	be2cf5f948	Two peepholes: (or (x >> c) \| (y << (32 - c))) ==> (shrd x, y, c) (or (x << c) \| (y >> (32 - c))) ==> (shld x, y, c) llvm-svn: 25438	2006-01-19 01:56:29 +00:00
Evan Cheng	1e112de44a	Zero extending load from i1 to i8. llvm-svn: 25391	2006-01-17 07:02:46 +00:00
Evan Cheng	3ce28c990a	Bug fixes: fpGETRESULT should produces a flag result and X86ISD::FST should read a flag. llvm-svn: 25378	2006-01-17 00:19:47 +00:00
Evan Cheng	e678b39014	More typo's llvm-svn: 25375	2006-01-16 23:26:53 +00:00
Evan Cheng	f7c719fe10	Some typo's llvm-svn: 25374	2006-01-16 22:48:46 +00:00
Evan Cheng	de33ca2831	Fix FP_TO_INT**_IN_MEM lowering. llvm-svn: 25368	2006-01-16 21:21:29 +00:00
Evan Cheng	fe83dd85c8	Added patterns for 8-bit multiply llvm-svn: 25338	2006-01-15 10:05:20 +00:00
Nate Begeman	85b2dc0c4e	bswap implementation llvm-svn: 25312	2006-01-14 03:14:10 +00:00
Evan Cheng	d92410b194	A typo. llvm-svn: 25307	2006-01-14 01:18:49 +00:00
Evan Cheng	4d9bab5789	Add truncstore i1 patterns. llvm-svn: 25296	2006-01-13 21:45:19 +00:00
Evan Cheng	794a7cf6fe	Fix sint_to_fp (fild*) support. llvm-svn: 25257	2006-01-12 22:54:21 +00:00
Evan Cheng	169206c67f	Specify transformation from GlobalAddress to TargetGlobalAddress and ExternalSymbol to TargetExternalSymbol. llvm-svn: 25253	2006-01-12 19:36:31 +00:00
Evan Cheng	978f5581c4	X86ISD::SETCC (e.g. SETEr) produces a flag (so multiple SETCC can be linked together). llvm-svn: 25247	2006-01-12 08:27:59 +00:00
Evan Cheng	dd45d29b56	* Materialize GlobalAddress and ExternalSym with MOV32ri rather than LEA32r. * Do not lower GlobalAddress to TargetGlobalAddress. Let isel does it. llvm-svn: 25246	2006-01-12 07:56:47 +00:00
Evan Cheng	5841005bdf	Added ROTL and ROTR. llvm-svn: 25232	2006-01-11 23:20:05 +00:00
Evan Cheng	66540aa32c	Support for MEMCPY and MEMSET. llvm-svn: 25226	2006-01-11 22:15:48 +00:00
Evan Cheng	e42281bcba	* Add special entry code main() (to set x87 to 64-bit precision). * Allow a register node as SelectAddr() base. * ExternalSymbol -> TargetExternalSymbol as direct function callee. * Use X86::ESP register rather than CopyFromReg(X86::ESP) as stack ptr for call parmater passing. llvm-svn: 25207	2006-01-11 06:09:51 +00:00
Evan Cheng	9adc8e5a3d	SSE cmov support. llvm-svn: 25190	2006-01-11 00:33:36 +00:00
Evan Cheng	0cb5e32cda	* fp to sint patterns. * fiadd, fisub, etc. llvm-svn: 25189	2006-01-10 22:22:02 +00:00
Evan Cheng	8504673bb2	FP_TO_INT*_IN_MEM and x87 FP Select support. llvm-svn: 25188	2006-01-10 20:26:56 +00:00
Evan Cheng	154cef5ccb	* Added undef patterns. * Some reorg. llvm-svn: 25163	2006-01-09 23:10:28 +00:00
Evan Cheng	d3babfe458	Support for ADD_PARTS, SUB_PARTS, SHL_PARTS, SHR_PARTS, and SRA_PARTS. llvm-svn: 25158	2006-01-09 18:33:28 +00:00
Evan Cheng	66355df170	Addd (shl x, 1) ==> (shl x, x) peepholes. llvm-svn: 25123	2006-01-06 02:31:59 +00:00
Evan Cheng	1e0d7b98f3	* Fast call support. * FP cmp, setcc, etc. llvm-svn: 25117	2006-01-06 00:43:03 +00:00
Evan Cheng	6c86cf3a5f	Added ConstantFP patterns. llvm-svn: 25108	2006-01-05 02:08:37 +00:00
Evan Cheng	2329411038	DAG based isel call support. llvm-svn: 25103	2006-01-05 00:27:02 +00:00
Evan Cheng	231b11ba87	Added field noResults to Instruction. Currently tblgen cannot tell which operands in the operand list are results so it assumes the first one is a result. This is bad. Ideally we would fix this by separating results from inputs, e.g. (res R32:$dst), (ops R32:$src1, R32:$src2). But that's a more distruptive change. Adding 'let noResults = 1' is the workaround to tell tblgen that the instruction does not produces a result. It works for now since tblgen does not support instructions which produce multiple results. llvm-svn: 25017	2005-12-26 09:11:45 +00:00
Evan Cheng	d87688fe72	* Removed the use of FLAG. Now use hasFlagIn and hasFlagOut instead. * Added a pseudo instruction (for each target) that represent "return void". This is a workaround for lack of optional flag operand (return void is not lowered so it does not have a flag operand.) llvm-svn: 24997	2005-12-23 22:14:32 +00:00
Evan Cheng	995503fc91	More X86 floating point patterns. llvm-svn: 24990	2005-12-23 07:31:11 +00:00
Evan Cheng	e458553c73	Bye bye HACKTROCITY. llvm-svn: 24935	2005-12-22 02:26:21 +00:00
Evan Cheng	fb6413e05a	* Fix a GlobalAddress lowering bug. * Teach DAG combiner about X86ISD::SETCC by adding a TargetLowering hook. llvm-svn: 24921	2005-12-21 23:05:39 +00:00
Evan Cheng	add305de26	Oops. Accidentally deleted RET pattern. It's still needed for return void; llvm-svn: 24920	2005-12-21 22:22:16 +00:00
Evan Cheng	6f15189a77	* Added support for X86 RET with an additional operand to specify number of bytes to pop off stack. * Added support for X86 SETCC. llvm-svn: 24917	2005-12-21 20:21:51 +00:00
Chris Lattner	347c6eedae	This was meant to go in llvm-svn: 24900	2005-12-21 07:50:26 +00:00
Chris Lattner	884def40f4	Rewrite FP stackifier support in the X86InstrInfo.td file, splitting patterns that were overloaded to work before and after the stackifier runs. With the new clean world, it is possible to write patterns for these instructions: woo! This also adds a few simple patterns here and there, though there are a lot still missing. These should be easy to add though. :) See the comments under "Floating Point Stack Support" for more details on the new world order. This patch as absolutely no effect on the generated code, woo! llvm-svn: 24899	2005-12-21 07:47:04 +00:00
Chris Lattner	ee15b5393f	Wrap some long lines: no functionality change llvm-svn: 24898	2005-12-21 05:34:58 +00:00
Evan Cheng	0226113ed5	* Added lowering hook for external weak global address. It inserts a load for Darwin. * Added lowering hook for ISD::RET. It inserts CopyToRegs for the return value (or store / fld / copy to ST(0) for floating point value). This eliminate the need to write C++ code to handle RET with variable number of operands. llvm-svn: 24888	2005-12-21 02:39:21 +00:00
Evan Cheng	ace8f1fafa	SSE2 floating point load / store patterns. SSE2 fp to int conversion patterns. llvm-svn: 24886	2005-12-20 22:59:51 +00:00
Evan Cheng	1c3ea75ffc	Added X86 readport patterns. llvm-svn: 24879	2005-12-20 07:38:38 +00:00
Evan Cheng	bb34a50cb0	X86 conditional branch support. llvm-svn: 24870	2005-12-19 23:12:38 +00:00
Chris Lattner	399dfec939	eliminate some redundancy llvm-svn: 24781	2005-12-17 19:47:05 +00:00
Evan Cheng	6a94c77c55	Added anyext, modelled as zext on X86. llvm-svn: 24759	2005-12-17 01:47:57 +00:00
Evan Cheng	5d90b26707	Added support for cmp, test, and conditional move instructions. llvm-svn: 24756	2005-12-17 01:24:02 +00:00
Evan Cheng	43152cb8b6	* Promote all 1 bit entities to 8 bit. * Handling extload (1 bit -> 8 bit) and remove C++ code that handle 1 bit zextload. llvm-svn: 24726	2005-12-15 19:49:23 +00:00
Evan Cheng	f72e7055c0	Added frameindex, constpool, globaladdr, and externalsym as root nodes of leaaddr. llvm-svn: 24724	2005-12-15 08:31:04 +00:00
Evan Cheng	576b826f71	Use MOV8rm to load 1 bit value. llvm-svn: 24721	2005-12-15 00:59:17 +00:00
Evan Cheng	3b094e89fb	Added sext and zext patterns. llvm-svn: 24705	2005-12-14 02:22:27 +00:00
Evan Cheng	ad1e2fd14a	Add load + store folding srl and sra patterns. llvm-svn: 24696	2005-12-13 07:24:22 +00:00
Evan Cheng	63f60d3edb	Beautify a few patterns. llvm-svn: 24690	2005-12-13 02:40:18 +00:00
Evan Cheng	95d46be9e6	Some shl patterns which do load + store folding. llvm-svn: 24689	2005-12-13 02:34:51 +00:00
Evan Cheng	6beadf1c29	A few helper fragments for loads. e.g. (i8 (load addr:$src)) -> (loadi8 addr:$src). Only to improve readibility. llvm-svn: 24688	2005-12-13 01:57:51 +00:00
Evan Cheng	d233c28d29	Add and, or, and xor patterns which fold load + stores. llvm-svn: 24687	2005-12-13 01:41:36 +00:00
Evan Cheng	62999d6c5d	Add inc + dec patterns which fold load + stores. llvm-svn: 24686	2005-12-13 01:02:47 +00:00
Evan Cheng	7f9fb7b095	Add neg and not patterns which fold load + stores. llvm-svn: 24685	2005-12-13 00:54:44 +00:00
Evan Cheng	240071c011	Missed a couple redundant explicit type casts. llvm-svn: 24684	2005-12-13 00:25:07 +00:00
Evan Cheng	e80ec06aaf	Fix some bad choice of names: i16SExt8 ->i16immSExt8, etc. llvm-svn: 24683	2005-12-13 00:14:11 +00:00
Evan Cheng	ea7f208813	* Split immSExt8 to i16SExt8 and i32SExt8 for i16 and i32 immediate operands. This enables the removal of some explicit type casts. * Rename immZExt8 to i16ZExt8 as well. llvm-svn: 24682	2005-12-13 00:01:09 +00:00
Evan Cheng	0ee9dc460a	Add some integer mul patterns. llvm-svn: 24681	2005-12-12 23:47:46 +00:00
Evan Cheng	6c9f9ea7ec	Add some sub patterns. llvm-svn: 24675	2005-12-12 21:54:05 +00:00
Evan Cheng	145318aefb	Add a few more add / store patterns. e.g. ADD32mi8. llvm-svn: 24670	2005-12-12 19:45:23 +00:00
Evan Cheng	56f62789d7	* Added X86 store patterns. * Added X86 dec patterns. llvm-svn: 24654	2005-12-10 00:48:20 +00:00
Evan Cheng	6610545b7e	Added patterns for ADD8rm, etc. These fold load operands. e.g. addb 4(%esp), %al llvm-svn: 24648	2005-12-09 22:48:48 +00:00
Evan Cheng	6eb25df63a	Added explicit type field to ComplexPattern. llvm-svn: 24637	2005-12-08 02:15:07 +00:00
Evan Cheng	1712ee5ab9	* Added intelligence to X86 LEA addressing mode matching routine so it returns false if the match is not profitable. e.g. leal 1(%eax), %eax. * Added patterns for X86 integer loads and LEA32. llvm-svn: 24635	2005-12-08 02:01:35 +00:00
Evan Cheng	60cc8da341	Remove unnecessary let hasCtrlDep=1 now it can be inferred. llvm-svn: 24611	2005-12-05 23:09:43 +00:00
Chris Lattner	3583f5337b	Several things: 1. Remove redundant type casts now that PR673 is implemented. 2. Implement the OUTir instructions correctly. The port number really is* a 16-bit value, but the patterns should only match if the number is 0-255. Update the patterns so they now match. 3. Fix patterns for shifts to reflect that the shift amount is always an i8, not an i16 as they were believed to be before. This previous fib stopped working when we started knowing that CL has type i8. 4. Change use of i16i8imm in SH*ri patterns to all be imm. llvm-svn: 24599	2005-12-05 02:40:25 +00:00
Evan Cheng	1ce02890ce	Added isel patterns for RET, JMP, and WRITEPORT. llvm-svn: 24588	2005-12-04 08:19:43 +00:00
Evan Cheng	f1352fa7d6	Proper support for shifts with register shift value. llvm-svn: 24559	2005-12-01 00:43:55 +00:00
Nate Begeman	84be54b731	No longer track value types for asm printer operands, and remove them as an argument to every operand printing function. Requires some slight tweaks to x86, the only user. llvm-svn: 24541	2005-11-30 18:54:35 +00:00
Chris Lattner	fdc786b18f	Fix a bug in a recent patch that broke shifts llvm-svn: 24526	2005-11-30 05:11:18 +00:00
Evan Cheng	f412b7ba0c	Add more X86 ISel patterns. llvm-svn: 24520	2005-11-29 19:38:52 +00:00
Chris Lattner	5d9ecff961	encode rdtsc correctly llvm-svn: 24435	2005-11-20 22:13:18 +00:00
Andrew Lenharth	a369904fc5	The second patch of X86 support for read cycle counter. llvm-svn: 24430	2005-11-20 21:41:10 +00:00
Chris Lattner	af79013023	Teach the x86 backend about the register constraints of its addressing mode. Patch by Evan Cheng llvm-svn: 24423	2005-11-19 07:01:30 +00:00
Chris Lattner	72fa26a85b	add more patterns, patch by Evan Cheng. llvm-svn: 24406	2005-11-18 01:04:42 +00:00
Chris Lattner	f829636c6b	Add patterns for some 16-bit immediate instructions, patch contributed by Evan Cheng. llvm-svn: 24384	2005-11-17 02:01:55 +00:00
Chris Lattner	fec54e57a0	Add patterns for several simple instructions that take i32 immediates. Patch contributed by Evan Cheng! llvm-svn: 24382	2005-11-16 22:59:19 +00:00
Nate Begeman	3b6c2df603	Properly split f32 and f64 into separate register classes for scalar sse fp fixing a bunch of nasty hackery llvm-svn: 23735	2005-10-14 22:06:00 +00:00
Chris Lattner	54139f0b83	give all operands names llvm-svn: 23356	2005-09-14 21:10:24 +00:00
Chris Lattner	d7bd59d77e	add a few missing cases llvm-svn: 22891	2005-08-19 00:41:29 +00:00
Chris Lattner	f62a66a21c	Give ADJCALLSTACKDOWN/UP the correct operands. Give a whole bunch of other stuff variable operands, particularly FP. The FP stackifier is playing fast and loose with operands here, so we have to mark them all as variable. This will have to be fixed before we can dag->dag the X86 backend. The solution is for the pre-stackifier and post-stackifier instructions to all be disjoint. llvm-svn: 22890	2005-08-19 00:38:22 +00:00
Nate Begeman	6cd034da8e	Scalar SSE: load +0.0 -> xorps/xorpd Scalar SSE: a < b ? c : 0.0 -> cmpss, andps Scalar SSE: float -> i16 needs to be promoted llvm-svn: 22637	2005-08-03 23:26:28 +00:00
Nate Begeman	957e0e7c9e	Get closer to fully working scalar FP in SSE regs. This gets singlesource working, and Olden/power. llvm-svn: 22441	2005-07-15 00:38:55 +00:00
Nate Begeman	e5314eb2c2	First round of support for doing scalar FP using the SSE2 ISA extension and XMM registers. There are many known deficiencies and fixmes, which will be addressed ASAP. The major benefit of this work is that it will allow the LLVM register allocator to allocate FP registers across basic blocks. The x86 backend will still default to x87 style FP. To enable this work, you must pass -enable-sse-scalar-fp and either -sse2 or -sse3 to llc. An example before and after would be for: double foo(double *P) { double Sum = 0; int i; for (i = 0; i < 1000; ++i) Sum += P[i]; return Sum; } The inner loop looks like the following: x87: .LBB_foo_1: # no_exit fldl (%esp) faddl (%eax,%ecx,8) fstpl (%esp) incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit SSE2: addsd (%eax,%ecx,8), %xmm0 incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit llvm-svn: 22340	2005-07-06 18:59:04 +00:00
Nate Begeman	032a94775d	Initial set of .td file changes necessary to get scalar fp in xmm registers working. The instruction selector changes will hopefully be coming later this week once they are debugged. This is necessary to support the darwin x86 FP model, and is recommended by intel as the replacement for x87. As a bonus, the register allocator knows how to deal with these registers across basic blocks, unliky the FP stackifier. This leads to significantly better codegen in several cases. llvm-svn: 22300	2005-06-27 21:20:31 +00:00
Chris Lattner	7327c042b4	Add markers in the asm file for tail calls, add a new ADJSTACKPTRri sorta-pseudo-instruction llvm-svn: 22042	2005-05-15 03:10:37 +00:00
Chris Lattner	64232a8480	Yes, calltarget is the operand of the day. llvm-svn: 22040	2005-05-15 01:10:30 +00:00
Chris Lattner	37e226fa9b	Add some new instructions llvm-svn: 22036	2005-05-14 23:35:21 +00:00
Chris Lattner	83d7e55471	add 'ret imm' instruction llvm-svn: 21945	2005-05-13 17:56:48 +00:00
Chris Lattner	7ba0699b05	Fix the syntax of the i/o instructions, these are obviously unused. llvm-svn: 21829	2005-05-09 20:49:20 +00:00
Chris Lattner	236cef3563	Add some new X86 instrs, patch contributed by Morten Ofstad llvm-svn: 21608	2005-04-28 21:50:05 +00:00
Chris Lattner	ba7cdbebb1	add signed versions of the extra precision multiplies llvm-svn: 21106	2005-04-06 04:19:22 +00:00
Chris Lattner	71434aa2dd	add an fabs instr llvm-svn: 21006	2005-04-02 04:31:56 +00:00
Chris Lattner	a024984017	Fix spelling, patch contributed by Gabor Greif! llvm-svn: 20343	2005-02-27 06:18:25 +00:00

... 4 5 6 7 8 ...

668 Commits