llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Eric Christopher	4924d5fb93	Custom lower the memory barrier instructions and add support for lowering without sse2. Add a couple of new testcases. Fixes a few libgomp tests and latent bugs. Remove a few todos. llvm-svn: 109078	2010-07-22 02:48:34 +00:00
Eric Christopher	959481ec87	Pulling out previous patch, must've run the tests in the wrong directory. llvm-svn: 109005	2010-07-21 09:23:56 +00:00
Eric Christopher	5c12ad2a4b	Lower MEMBARRIER on x86 and support processors without SSE2. Fixes a pile of libgomp failures in the llvm-gcc testsuite due to the libcall not existing. llvm-svn: 109004	2010-07-21 09:05:23 +00:00
Bruno Cardoso Lopes	4ca44dda21	Add 256-bit vaddsub, vhadd, vhsub, vblend and vdpp instructions! llvm-svn: 108769	2010-07-19 23:32:44 +00:00
Daniel Dunbar	1dd74c37c5	X86: Mark JMP{32,64}[mr] as requires 32-bit/64-bit mode. They are the same instruction, we only want to allow the one for the current subtarget. - This also fixes suffix matching for jmp instructions, because it eliminates the ambiguity between 'jmpl' and 'jmpq'. llvm-svn: 108746	2010-07-19 20:44:16 +00:00
Daniel Dunbar	fa2847103d	X86: Mark some tail call pseduo instruction as code gen only. llvm-svn: 108684	2010-07-19 07:21:04 +00:00
Daniel Dunbar	f228215d4f	X86: Mark In32/64BitMode on LEAVE[64] and SYSEXIT[64]. llvm-svn: 108683	2010-07-19 07:21:01 +00:00
Daniel Dunbar	7a3565367a	X86: Mark MOV.*_{TC,NOREX} instruction as code gen only, they aren't real. llvm-svn: 108680	2010-07-19 06:14:49 +00:00
Daniel Dunbar	9409c3fbb2	X86: MOV8o8a, MOV8ao8, etc. are only valid in 32-bit mode. llvm-svn: 108679	2010-07-19 06:14:44 +00:00
Bruno Cardoso Lopes	3676e24b67	Start the support for AVX instructions with 256-bit %ymm registers. A couple of notes: - The instructions are being added with dummy placeholder patterns using some 256 specifiers, this is not meant to work now, but since there are some multiclasses generic enough to accept them, when we go for codegen, the stuff will be already there. - Add VEX encoding bits to support YMM - Add MOVUPS and MOVAPS in the first round - Use "Y" as suffix for those Instructions: MOVUPSYrr, ... - All AVX instructions in X86InstrSSE.td will move soon to a new X86InstrAVX file. llvm-svn: 107996	2010-07-09 18:27:43 +00:00
Chris Lattner	a5c1c795a2	have the mc lowering process handle a few tail call forms, lowering them to jumps where possible and turning the TAILCALL marker in the instruction asm string into a proper comment. This eliminates a FIXME and is on the path to finishing: rdar://7639610 - eliminate encoding and asm info for TAILJMPd TAILJMPr TAILJMPn, etc. However, I can't eliminate the encodings for these instructions because the JIT still exists and has its own copy of the encoder, sigh. llvm-svn: 107946	2010-07-09 00:49:41 +00:00
Chris Lattner	49ac65543c	Change LEA to have 5 operands for its memory operand, just like all other instructions, even though a segment is not allowed. This resolves a bunch of gross hacks in the encoder and makes LEA more consistent with the rest of the instruction set. No functionality change. llvm-svn: 107934	2010-07-08 23:46:44 +00:00
Chris Lattner	6a5db9c9c9	Implement the major chunk of PR7195: support for 'callw' in the integrated assembler. Still some discussion to be done. llvm-svn: 107825	2010-07-07 22:27:31 +00:00
Eric Christopher	657b8b040e	Add a couple more quick comments. llvm-svn: 106717	2010-06-24 02:07:57 +00:00
Eric Christopher	436ff8863f	Update according to feedback. llvm-svn: 106677	2010-06-23 20:49:35 +00:00
Nico Weber	04606293a5	Add support for the x86 instructions "pusha" and "popa". llvm-svn: 106671	2010-06-23 20:00:58 +00:00
Eric Christopher	c6382036ef	Update uses, defs, and comments for darwin tls patterns. llvm-svn: 106621	2010-06-23 08:01:49 +00:00
Eric Christopher	b6dfc01862	Finish ripping isTwoAddress out of X86. Some mindless formatting and operand renaming to help. The giant turn the constraints on and selectively turn it off should probably be inverted at some point since it's just largely 50/50. llvm-svn: 106367	2010-06-19 00:37:40 +00:00
Eric Christopher	5f10974e92	Ensure that mov and not lea are used to stick the address into the register. While we're at it, make sure it's in the right one. llvm-svn: 105645	2010-06-08 22:04:25 +00:00
Eric Christopher	30010cae3a	Add first pass at darwin tls compiler support. llvm-svn: 105381	2010-06-03 04:07:48 +00:00
Daniel Dunbar	918b0f7bd9	AsmMatcher/X86: Mark _REV instructions as "code gen only", they aren't expected to be matched. llvm-svn: 104757	2010-05-26 22:21:28 +00:00
Kevin Enderby	7eae1aeb51	Fix the x86 move to/from segment register instructions. llvm-svn: 104731	2010-05-26 20:10:45 +00:00
Jakob Stoklund Olesen	f40bb16b94	Rename X86 subregister indices to something shorter. Use the tablegen-produced enums. llvm-svn: 104493	2010-05-24 14:48:17 +00:00
Daniel Dunbar	50265dbaf0	MC/X86: Subdivide immediates a bit more, so that we properly recognize immediates based on the width of the target instruction. For example: addw $0xFFFF, %ax should match the same as addw $-1, %ax but we used to match it to the longer encoding. llvm-svn: 104453	2010-05-22 21:02:33 +00:00
Daniel Dunbar	ee525943d8	tblgen/AsmMatcher: Change AsmOperandClass to allow a list of superclasses instead of just one. llvm-svn: 104452	2010-05-22 21:02:29 +00:00
Daniel Dunbar	030b1001c0	X86: Model i64i32imm properly, as a subclass of all immediates. llvm-svn: 104272	2010-05-20 20:20:39 +00:00
Dan Gohman	c8b4555a94	Fix assembly parsing and encoding of the pushf and popf family of instructions. llvm-svn: 104231	2010-05-20 16:16:00 +00:00
Daniel Dunbar	9646c49298	MC/X86: Lower TAILCALLd[64] to JMP_1, to allow relaxation and to avoid same prefix byte problem as in r104062. - As a total hack to keep the TAILCALL markers in the output, which some tests depend on, this invents a new TAILJMP_1 instruction. llvm-svn: 104120	2010-05-19 15:26:43 +00:00
Kevin Enderby	dc13d89540	Fix so "int3" is correctly accepted, added "into" and fixed "int" with an argument, like "int $4", to not get an Assertion error. llvm-svn: 103791	2010-05-14 19:16:02 +00:00
Dan Gohman	dc05cdd475	Set isTerminator on TRAP instructions. llvm-svn: 103778	2010-05-14 16:46:02 +00:00
Dan Gohman	b0f18b9c6c	Add mayLoad and mayStore flags to instructions which missed them. llvm-svn: 103776	2010-05-14 16:34:55 +00:00
Chris Lattner	887e8f9f53	reapply r103668 with a fix. Never make "minor syntax changes" after testing before committing. llvm-svn: 103681	2010-05-13 00:02:47 +00:00
Chris Lattner	361c115f23	revert r103668 for now, it is apparently breaking things. llvm-svn: 103677	2010-05-12 23:40:59 +00:00
Chris Lattner	91a836a9c7	moffset forms of moves are x86-32 only, make the parser lower them to the correct x86-64 instructions since we don't have a clean way to handle this in td files yet. rdar://7947184 llvm-svn: 103668	2010-05-12 23:13:36 +00:00
Chris Lattner	1960255123	fix the encoding of the obscure "moffset" forms of moves, i386 part first. rdar://7947184 llvm-svn: 103660	2010-05-12 22:48:24 +00:00
Daniel Dunbar	45589cd853	MC/X86: X86AbsMemAsmOperand is subclass of X86NoSegMemAsmOperand. - This fixes "leal 0, %eax", for example. llvm-svn: 103205	2010-05-06 22:39:14 +00:00
Sean Callanan	4331428e24	Eliminated the classification of control registers into %ecr_ and %rcr_, leaving just %cr_ which is what people expect. Updated the disassembler to support this unified register set. Added a testcase to verify that the registers continue to be decoded correctly. llvm-svn: 103196	2010-05-06 20:59:00 +00:00
Kevin Enderby	c1eeb061e7	Fixed the encoding of the x86 push instructions. Using a 32-bit immediate value caused the a pushl instruction to be incorrectly encoding using only two bytes of immediate, causing the following 2 instruction bytes to be part of the 32-bit immediate value. Also fixed the one byte form of push to be used when the immediate would fit in a signed extended byte. Lastly changed the names to not include the 32 of PUSH32 since they actually push the size of the stack pointer. llvm-svn: 102951	2010-05-03 20:45:05 +00:00
Dan Gohman	c283eda8ab	Remove the -disable-16bit command-line option, which is now obsolete. llvm-svn: 102730	2010-04-30 18:30:26 +00:00
Kevin Enderby	58bed5a913	Fixed the word sized Bit Scan Forward/Reverse instructions, they needed the Operand size override prefix to be part of their records. llvm-svn: 102556	2010-04-28 23:20:40 +00:00
Evan Cheng	d4fe387eb8	Enable i16 to i32 promotion by default. llvm-svn: 102493	2010-04-28 08:30:49 +00:00
Evan Cheng	b7bb090d5d	Rather than having a ton of patterns for double shift instructions, e.g. SHLD16rrCL, just perform custom dag combine to form x86 specific dag so they match to the same pattern. This also makes sure later dag combine do not cause isel to miss them (e.g. promoting i16 to i32). llvm-svn: 102485	2010-04-28 01:18:01 +00:00
Evan Cheng	65a95091cf	Fix obvious typos. llvm-svn: 102467	2010-04-27 21:46:03 +00:00
Evan Cheng	0f4671b0dd	isel (i32 anyext i16) as insert_subreg when 16-bit ops are being promoted. llvm-svn: 101979	2010-04-21 01:47:12 +00:00
Evan Cheng	6442d111dd	More work to allow dag combiner to promote 16-bit ops to 32-bit. llvm-svn: 101621	2010-04-17 06:13:15 +00:00
Evan Cheng	d72090a658	Fix ADD32rr_alt instruction encoding bug. Patch by Marius Wachtler. llvm-svn: 100480	2010-04-05 22:21:09 +00:00
Eric Christopher	bbf4e35cf6	Separate out the AES-NI instructions from the SSE4.2 instructions. Add a new subtarget option for AES and check for the support. Add "westmere" line of processors and add AES-NI support to the core i7. Add a couple of TODOs for information I couldn't verify. llvm-svn: 100231	2010-04-02 21:54:27 +00:00
Chris Lattner	22c84d79fa	revert r99743, this is saying that the repmovs instructinos have an input of other type, which is the VT. llvm-svn: 99749	2010-03-28 07:38:39 +00:00
Chris Lattner	941ab0b2d5	claiming to return other is pointless. llvm-svn: 99743	2010-03-28 05:57:36 +00:00
Chris Lattner	c5499723d5	fix some modelling problems exposed by a patch I'm working on. bsr/bsf/ptest nodes all have an EFLAGS result when made by isel lowering. llvm-svn: 99736	2010-03-28 05:07:17 +00:00
Chris Lattner	154641e2ff	eliminate the last of the parallel's! llvm-svn: 99700	2010-03-27 02:47:14 +00:00
Chris Lattner	22dceb8eb0	eliminate almost all the rest of the x86-32 parallels. llvm-svn: 99686	2010-03-27 00:45:04 +00:00
Jakob Stoklund Olesen	5a6e614de9	Teach TableGen to understand X.Y notation in the TSFlagsFields strings. Remove much horribleness from X86InstrFormats as a result. Similar simplifications are probably possible for other targets. llvm-svn: 99539	2010-03-25 18:52:01 +00:00
Chris Lattner	cda90fafdd	eliminate a bunch more parallels now that scheduling handles dead implicit results more aggressively. More to come, I think this is now just a data entry problem. llvm-svn: 99486	2010-03-25 05:44:01 +00:00
Evan Cheng	d663ac8306	Disable folding loads into tail call in 32-bit PIC mode. It can introduce illegal code like this: addl $12, %esp popl %esi popl %edi popl %ebx popl %ebp jmpl __Block_deallocator-L1$pb(%esi) # TAILCALL The problem is the global base register is assigned GR32 register class. TCRETURNmi needs the registers making up the address mode to have the GR32_TC register class. The proper* fix is for X86DAGToDAGISel::getGlobalBaseReg() to return a copy from the global base register of the machine function rather than returning the register itself. But that has the potential of causing it to be coalesced to a more restrictive register class: GR32_TC. It can introduce additional copies and spills. For something as important the PIC base, it's not worth it especially since this is not an issue on 64-bit. llvm-svn: 99455	2010-03-25 00:10:31 +00:00
Chris Lattner	a44c023751	Switch INC8r to defining its pattern in terms of X86inc_flag and defining the add pattern with Pat<>, eliminating a use of parallel. llvm-svn: 99375	2010-03-24 01:02:12 +00:00
Chris Lattner	36f990dc18	switch SDTBinaryArithWithFlags to be a multiple-result node as well. llvm-svn: 99370	2010-03-24 00:49:29 +00:00
Chris Lattner	0d53d0a634	Switch SDTUnaryArithWithFlags to being modeled as a two-result ISD node. The only change in the generated isel code are comments like: < // Src: (X86dec_flag:i16 GR16:i16:$src) --- > // Src: (X86dec_flag:i16:i32 GR16:i16:$src) because now it knows that X86dec_flag returns both an i16 (for the result) and an i32 (for EFLAGS) in this case. Wewt. llvm-svn: 99369	2010-03-24 00:47:47 +00:00
Chris Lattner	6a8da47891	remove useless or_is_add parallel's. llvm-svn: 99359	2010-03-24 00:15:23 +00:00
Chris Lattner	f468453934	reduce nesting. llvm-svn: 99358	2010-03-24 00:12:57 +00:00
Chris Lattner	3634b075ee	remove the patterns that I commented out in r98930, Dan verified that they are dead. llvm-svn: 99000	2010-03-19 21:43:36 +00:00
Chris Lattner	6b395fca87	add a new SDNPVariadic SDNP node flag, and use it in dag isel gen instead of instruction properties. This allows the oh-so-useful behavior of matching a variadic non-root node. llvm-svn: 98934	2010-03-19 05:07:09 +00:00
Chris Lattner	9ae31faad2	comment out a bunch of parallel store patterns that apparently can't match or just have no testcases. Will remove after confirmation from dan that they really are dead. llvm-svn: 98930	2010-03-19 04:14:21 +00:00
Chris Lattner	938747fc7f	Now that tblgen can handle matching implicit defs of instructions to input patterns, we can fix X86ISD::CMP and X86ISD::BT as taking two inputs (which have to be the same type) and returning an i32. This is how the SDNodes get made in the graph, but we weren't able to model it this way due to deficiencies in the pattern language. Now we can change things like this: def UCOM_FpIr80: FpI_<(outs), (ins RFP80:$lhs, RFP80:$rhs), CompareFP, - [(X86cmp RFP80:$lhs, RFP80:$rhs), - (implicit EFLAGS)]>; // CC = ST(0) cmp ST(i) + [(set EFLAGS, (X86cmp RFP80:$lhs, RFP80:$rhs))]>; and fix terrible crimes like this: -def : Pat<(parallel (X86cmp GR8:$src1, 0), (implicit EFLAGS)), +def : Pat<(X86cmp GR8:$src1, 0), (TEST8rr GR8:$src1, GR8:$src1)>; This relies on matching the result of TEST8rr (which is EFLAGS, which is an implicit def) to the result of X86cmp, an i32. llvm-svn: 98903	2010-03-19 00:01:11 +00:00
Chris Lattner	2560ce9976	outs come before ins. llvm-svn: 98864	2010-03-18 20:50:06 +00:00
Chris Lattner	53210a1f20	fix the encoding of TAILJMPd. This fixes Benchmarks/Olden/bisort with the integrated assembler! llvm-svn: 98615	2010-03-16 06:30:18 +00:00
Chris Lattner	c50b8b27f5	fix PR6605, X86ISD::CMP always returns i32 (EFLAGS), not the operand type. llvm-svn: 98507	2010-03-14 18:44:35 +00:00
Chris Lattner	1469c1b01e	add support for pentium class CPUs which do not have cmov, PR4841. Patch by Craig Smith! llvm-svn: 98496	2010-03-14 18:31:44 +00:00
Evan Cheng	7d8c39bb1c	Do not force indirect tailcall through fixed registers: eax, r11. Add support to allow loads to be folded to tail call instructions. llvm-svn: 98465	2010-03-14 03:48:46 +00:00
Daniel Dunbar	0bc3059b94	MC/X86: Rename alternate spellings of ADD{8,16,32} and mark as "code gen only" so they don't get selected by the asm matcher. llvm-svn: 98098	2010-03-09 22:50:46 +00:00
Daniel Dunbar	d92e9bc7c1	MC/X86: Rename alternate spellings of CMP{8,16,32} and mark as "code gen only" so they don't get selected by the asm matcher. llvm-svn: 98097	2010-03-09 22:50:40 +00:00
Kevin Enderby	23e37f3e39	Fix the vmxon entry in the X86InstrInfo.td so it has the correct prefix bytes for the encoding and is not the same as vmptrld. llvm-svn: 97992	2010-03-08 22:17:26 +00:00
Daniel Dunbar	66c79cf44d	X86: Fix encoding for TEST{8,16,32}rr. llvm-svn: 97982	2010-03-08 21:10:36 +00:00
Chris Lattner	6ba2e68770	Correct immediate sizes. llvm-svn: 97957	2010-03-08 18:55:15 +00:00
Anton Korobeynikov	f6f8265c1c	Describe what's going on with mingw alloca and why do we need separate instruction. llvm-svn: 97888	2010-03-06 20:07:32 +00:00
Anton Korobeynikov	74dea2d5cb	Lower dynamic stack allocation on mingw32 to separate instruction. We cannot use a normal call here since it has extra unmodelled side effects (it changes stack pointer). This should fix PR5292. llvm-svn: 97884	2010-03-06 19:32:29 +00:00
Jakob Stoklund Olesen	3408cd6de1	Fix the remaining MUL8 and DIV8 to define AX instead of AL,AH. These instructions technically define AL,AH, but a trick in X86ISelDAGToDAG reads AX in order to avoid reading AH with a REX instruction. Fix PR6489. llvm-svn: 97742	2010-03-04 20:42:07 +00:00
Chris Lattner	2e934b978c	remove nvload and two patterns that use it which are better done by dag combine. llvm-svn: 97633	2010-03-03 02:14:54 +00:00
Chris Lattner	88f948aec4	factor the 'in the default address space' check out to a single 'dsload' pattern. tblgen doesn't check patterns to see if they're textually identical. This allows better factoring. llvm-svn: 97630	2010-03-03 01:52:59 +00:00
Chris Lattner	b178d16c23	factor the 'sign extended from 8 bit' patterns better so that they are not destination type specific. This allows tblgen to factor them and the type check is redundant with what the isel does anyway. llvm-svn: 97629	2010-03-03 01:45:01 +00:00
Dan Gohman	99c6bcbc13	The mayHaveSideEffects flag is no longer used. llvm-svn: 97348	2010-02-27 23:47:46 +00:00
Jakob Stoklund Olesen	a946f9eb7d	DIV8r must define %AX since X86DAGToDAGISel::Select() sometimes uses it instead of %AL/%AH. llvm-svn: 97006	2010-02-24 00:39:35 +00:00
Chris Lattner	03b5f3e853	remove a bunch of dead named arguments in input patterns, though some look dubious afaict, these are all ok. llvm-svn: 96899	2010-02-23 06:54:29 +00:00
Sean Callanan	0bc10793b0	Added the rdtscp instruction to the x86 instruction tables. llvm-svn: 96073	2010-02-13 02:06:11 +00:00
Sean Callanan	9806eade6e	Fixed encodings for invlpg, invept, and invvpid. llvm-svn: 96065	2010-02-13 01:48:34 +00:00
Chris Lattner	5b01ab848c	remove special cases for vmlaunch, vmresume, vmxoff, and swapgs fix swapgs to be spelled right. llvm-svn: 96058	2010-02-13 00:41:14 +00:00
Chris Lattner	f1f926247f	enhance the immediate field encoding to know whether the immediate is pc relative or not, mark call and branches as pcrel. llvm-svn: 96026	2010-02-12 22:27:07 +00:00
Chris Lattner	2265d6280b	Add support for a union type in LLVM IR. Patch by Talin! llvm-svn: 96011	2010-02-12 20:49:41 +00:00
Daniel Dunbar	c82069746b	X86: Fix definition for RCL/RCR.*m? operations -- they were getting represented with "tied memory operands", which is wrong. llvm-svn: 95950	2010-02-12 01:22:03 +00:00
Chris Lattner	a10c41c6df	improve encoding information for branches. We now know they have 8 or 32-bit immediates, which allows the new encoder to handle them. llvm-svn: 95927	2010-02-11 21:45:31 +00:00
Chris Lattner	4718a3f4ea	unbreak the build. llvm-svn: 95915	2010-02-11 19:52:11 +00:00
Chris Lattner	d69e1c1bb2	refactor the conditional jump instructions in the .td file to use a multipattern that generates both the 1-byte and 4-byte versions from the same defm llvm-svn: 95901	2010-02-11 19:25:55 +00:00
Dan Gohman	92b6122204	Fix "the the" and similar typos. llvm-svn: 95781	2010-02-10 16:03:48 +00:00
David Greene	bb211816b6	TableGen fragment refactoring. Move some utility TableGen defs, classes, etc. into a common file so they may be used my multiple pattern files. We will use this for the AVX specification to help with the transition from the current SSE specification. llvm-svn: 95727	2010-02-09 23:52:19 +00:00
Chris Lattner	e1b8c529a4	fix incorrect encoding of SBB8mi that Kevin noticed. llvm-svn: 95448	2010-02-05 22:56:11 +00:00
Chris Lattner	9b3cf069dc	teach X86MCInstLower to lower the MOV32r0 and MOV8r0 pseudo instructions. llvm-svn: 95433	2010-02-05 21:21:06 +00:00
Chris Lattner	32f09202d6	factor code better in X86MCInstLower::Lower, teach it to lower the SETB* instructions. llvm-svn: 95431	2010-02-05 21:13:48 +00:00
Kevin Enderby	57859abd72	Added support for X86 instruction prefixes so llvm-mc can assemble them. The Lock prefix, Repeat string operation prefixes and the Segment override prefixes. Also added versions of the move string and store string instructions without the repeat prefixes to X86InstrInfo.td. And finally marked the rep versions of move/store string records in X86InstrInfo.td as isCodeGenOnly = 1 so tblgen is happy building the disassembler files. llvm-svn: 95252	2010-02-03 21:04:42 +00:00
Evan Cheng	e86a191862	Change TAILJMP's to be varargs and transfer implicit uses over from TCRETURN's. Otherwise the missing uses can make post-regalloc scheduling do bad things. This fixes 403.gcc. llvm-svn: 94950	2010-01-31 07:28:44 +00:00
Daniel Dunbar	23e8bc782c	MC/X86 AsmParser: Handle absolute memory operands correctly. We were doing something totally broken and parsing them as immediates, but the .td file also had the wrong match class so things sortof worked. Except, that is, that we would parse movl $0, %eax as movl 0, %eax Feel free to guess how well that worked. llvm-svn: 94869	2010-01-30 01:02:48 +00:00
Daniel Dunbar	ee85d3388b	X86.td: Refactor to bring operands that use print_pcrel_imm together. llvm-svn: 94861	2010-01-30 00:24:12 +00:00
Daniel Dunbar	e92f9cffdb	AsmMatcher/X86: Separate out sublass for memory operands that have no segment register, and use to cleanup a FIXME in X86AsmParser.cpp. llvm-svn: 94859	2010-01-30 00:24:00 +00:00
Evan Cheng	bc0b06fb16	Eliminate or_not_add and just use AddedComplexity so isel tries or_is_add patterns first. llvm-svn: 93245	2010-01-12 18:31:19 +00:00
Dan Gohman	51b3e804dc	Reapply the MOV64r0 patch, with a fix: MOV64r0 clobbers EFLAGS. llvm-svn: 93229	2010-01-12 04:42:54 +00:00
Evan Cheng	bd938ebc90	Extend r93152 to work on OR r, r. If the source set bits are known not to overlap, then select as an ADD instead. llvm-svn: 93191	2010-01-11 22:03:29 +00:00
Evan Cheng	bc84a42d7b	Revert 93158. It's breaking quite a few x86_64 tests. llvm-svn: 93185	2010-01-11 21:13:41 +00:00
Evan Cheng	4548543b0b	Do not turn 8-bit OR to ADD since ADD8ri is not 3-addressfiable. llvm-svn: 93182	2010-01-11 20:18:04 +00:00
Dan Gohman	5b79391087	Re-instate MOV64r0 and MOV16r0, with adjustments to work with the new AsmPrinter. This is perhaps less elegant than describing them in terms of MOV32r0 and subreg operations, but it allows the current register to rematerialize them. llvm-svn: 93158	2010-01-11 17:37:57 +00:00
Dan Gohman	a83443605d	Pattern top-level operators don't need to be restricted to a single user. The _su forms are intended for non-top-level nodes. llvm-svn: 93155	2010-01-11 17:21:05 +00:00
Evan Cheng	ee806a0db5	Select an OR with immediate as an ADD if the input bits are known zero. This allow the instruction to be 3address-fied if needed. llvm-svn: 93152	2010-01-11 17:03:47 +00:00
Evan Cheng	4f25f87baa	Fix what looks to me obvious instruction definition bugs. 1. CMPXCHG8B and CMPXCHG16B did not specify implicit physical register defs and uses. 2. LCMPXCHG8B is loading 64 bit memory, not 32 bit. llvm-svn: 92985	2010-01-08 01:29:19 +00:00
Dan Gohman	d3383baab0	Remove the SDNPAssociative properties for the flags-producing operators. Eli pointed out that it's not obvious what that would mean. llvm-svn: 92555	2010-01-05 00:44:20 +00:00
Dan Gohman	29583f656d	Add SDNPCommutative and SDNPAssociative to several X86 target nodes. This lets isel fold loads into them in more cases. llvm-svn: 92506	2010-01-04 20:51:05 +00:00
Eli Friedman	3a53d1cb1a	PR5886: Make sure IMUL32m is marked as setting EFLAGS, so scheduling doesn't do illegal stuff around it. No testcase because the issue is very fragile. llvm-svn: 92167	2009-12-26 20:08:30 +00:00
Chris Lattner	f77ca5f9f5	really remove the instruction, don't just comment it out llvm-svn: 91976	2009-12-23 01:46:40 +00:00
Chris Lattner	d7e8bd73fe	completely eliminate the MOV16r0 'instruction'. The only interesting part of this is the divrem changes, which are already tested by CodeGen/X86/divrem.ll. llvm-svn: 91975	2009-12-23 01:45:04 +00:00
Chris Lattner	dbcf2725aa	stop pattern matching 16-bit zero's of a register to MOV16r0, instead use the appropriate subreggy thing. This generates identical code on some large apps (thanks to Evan's cross class coalescing stuff he did back in july). This means that MOV16r0 can go away completely in the future soon. llvm-svn: 91972	2009-12-23 01:30:26 +00:00
Evan Cheng	7cd6bfe549	Remove target attribute break-sse-dep. Instead, do not fold load into sse partial update instructions unless optimizing for size. llvm-svn: 91910	2009-12-22 17:47:23 +00:00
Evan Cheng	d97d025eba	On recent Intel u-arch's, folding loads into some unary SSE instructions can be non-optimal. To be precise, we should avoid folding loads if the instructions only update part of the destination register, and the non-updated part is not needed. e.g. cvtss2sd, sqrtss. Unfolding the load from these instructions breaks the partial register dependency and it can improve performance. e.g. movss (%rdi), %xmm0 cvtss2sd %xmm0, %xmm0 instead of cvtss2sd (%rdi), %xmm0 An alternative method to break dependency is to clear the register first. e.g. xorps %xmm0, %xmm0 cvtss2sd (%rdi), %xmm0 llvm-svn: 91672	2009-12-18 07:40:29 +00:00
Sean Callanan	06b6feb2e1	Instruction fixes, added instructions, and AsmString changes in the X86 instruction tables. Also (while I was at it) cleaned up the X86 tables, removing tabs and 80-line violations. This patch was reviewed by Chris Lattner, but please let me know if there are any problems. * X86.td Removed tabs and fixed 80-line violations X86Instr64bit.td (IRET, POPCNT, BT_, LSL, SWPGS, PUSH_S, POP_S, L_S, SMSW) Added (CALL, CMOV) Added qualifiers (JMP) Added PC-relative jump instruction (POPFQ/PUSHFQ) Added qualifiers; renamed PUSHFQ to indicate that it is 64-bit only (ambiguous since it has no REX prefix) (MOV) Added rr form going the other way, which is encoded differently (MOV) Changed immediates to offsets, which is more correct; also fixed MOV64o64a to have to a 64-bit offset (MOV) Fixed qualifiers (MOV) Added debug-register and condition-register moves (MOVZX) Added more forms (ADC, SUB, SBB, AND, OR, XOR) Added reverse forms, which (as with MOV) are encoded differently (ROL) Made REX.W required (BT) Uncommented mr form for disassembly only (CVT__2__) Added several missing non-intrinsic forms (LXADD, XCHG) Reordered operands to make more sense for MRMSrcMem (XCHG) Added register-to-register forms (XADD, CMPXCHG, XCHG) Added non-locked forms * X86InstrSSE.td (CVTSS2SI, COMISS, CVTTPS2DQ, CVTPS2PD, CVTPD2PS, MOVQ) Added * X86InstrFPStack.td (COM_FST0, COMP_FST0, COM_FI, COM_FIP, FFREE, FNCLEX, FNOP, FXAM, FLDL2T, FLDL2E, FLDPI, FLDLG2, FLDLN2, F2XM1, FYL2X, FPTAN, FPATAN, FXTRACT, FPREM1, FDECSTP, FINCSTP, FPREM, FYL2XP1, FSINCOS, FRNDINT, FSCALE, FCOMPP, FXSAVE, FXRSTOR) Added (FCOM, FCOMP) Added qualifiers (FSTENV, FSAVE, FSTSW) Fixed opcode names (FNSTSW) Added implicit register operand * X86InstrInfo.td (opaque512mem) Added for FXSAVE/FXRSTOR (offset8, offset16, offset32, offset64) Added for MOV (NOOPW, IRET, POPCNT, IN, BTC, BTR, BTS, LSL, INVLPG, STR, LTR, PUSHFS, PUSHGS, POPFS, POPGS, LDS, LSS, LES, LFS, LGS, VERR, VERW, SGDT, SIDT, SLDT, LGDT, LIDT, LLDT, LODSD, OUTSB, OUTSW, OUTSD, HLT, RSM, FNINIT, CLC, STC, CLI, STI, CLD, STD, CMC, CLTS, XLAT, WRMSR, RDMSR, RDPMC, SMSW, LMSW, CPUID, INVD, WBINVD, INVEPT, INVVPID, VMCALL, VMCLEAR, VMLAUNCH, VMRESUME, VMPTRLD, VMPTRST, VMREAD, VMWRITE, VMXOFF, VMXON) Added (NOOPL, POPF, POPFD, PUSHF, PUSHFD) Added qualifier (JO, JNO, JB, JAE, JE, JNE, JBE, JA, JS, JNS, JP, JNP, JL, JGE, JLE, JG, JCXZ) Added 32-bit forms (MOV) Changed some immediate forms to offset forms (MOV) Added reversed reg-reg forms, which are encoded differently (MOV) Added debug-register and condition-register moves (CMOV) Added qualifiers (AND, OR, XOR, ADC, SUB, SBB) Added reverse forms, like MOV (BT) Uncommented memory-register forms for disassembler (MOVSX, MOVZX) Added forms (XCHG, LXADD) Made operand order make sense for MRMSrcMem (XCHG) Added register-register forms (XADD, CMPXCHG) Added unlocked forms * X86InstrMMX.td (MMX_MOVD, MMV_MOVQ) Added forms * X86InstrInfo.cpp: Changed PUSHFQ to PUSHFQ64 to reflect table change * X86RegisterInfo.td: Added debug and condition register sets * x86-64-pic-3.ll: Fixed testcase to reflect call qualifier * peep-test-3.ll: Fixed testcase to reflect test qualifier * cmov.ll: Fixed testcase to reflect cmov qualifier * loop-blocks.ll: Fixed testcase to reflect call qualifier * x86-64-pic-11.ll: Fixed testcase to reflect call qualifier * 2009-11-04-SubregCoalescingBug.ll: Fixed testcase to reflect call qualifier * x86-64-pic-2.ll: Fixed testcase to reflect call qualifier * live-out-reg-info.ll: Fixed testcase to reflect test qualifier * tail-opts.ll: Fixed testcase to reflect call qualifiers * x86-64-pic-10.ll: Fixed testcase to reflect call qualifier * bss-pagealigned.ll: Fixed testcase to reflect call qualifier * x86-64-pic-1.ll: Fixed testcase to reflect call qualifier * widen_load-1.ll: Fixed testcase to reflect call qualifier llvm-svn: 91638	2009-12-18 00:01:26 +00:00
Evan Cheng	aaf2f58a04	Re-enable 91381 with fixes. llvm-svn: 91489	2009-12-16 00:53:11 +00:00
Evan Cheng	cd8f0de016	Use sbb x, x to materialize carry bit in a GPR. The result is all one's or all zero's. llvm-svn: 91381	2009-12-15 00:53:42 +00:00
Evan Cheng	53e863f152	Fix an obvious bug. No test case since LEA16r is not being used. llvm-svn: 91219	2009-12-12 18:51:56 +00:00
Dan Gohman	e573c59c90	Minor whitespace fixes. llvm-svn: 90166	2009-11-30 23:33:53 +00:00
Dan Gohman	b5ec39e2dc	Remove ISD::DEBUG_LOC and ISD::DBG_LABEL, which are no longer used. Note that "hasDotLocAndDotFile"-style debug info was already broken; people wanting this functionality should implement it in the AsmPrinter/DwarfWriter code. llvm-svn: 89711	2009-11-23 23:20:51 +00:00
Dan Gohman	3045fc7514	Use a tab in INT3's asm string, for consistency. llvm-svn: 86850	2009-11-11 18:07:16 +00:00
Anton Korobeynikov	9737bfedeb	Do not infer the target type for COPY_TO_REGCLASS from dest regclass, this won't work if it can contain several types. Require explicit result type for the node for now. This fixes PR5364. PS: It seems that blackfin usage of copy_to_regclass is completely bogus! llvm-svn: 85766	2009-11-02 00:11:39 +00:00
Dan Gohman	2767aa065e	Initial x86 support for BlockAddresses. llvm-svn: 85557	2009-10-30 01:28:02 +00:00
Dan Gohman	3393a4c997	Rename usesCustomDAGSchedInserter to usesCustomInserter, and update a bunch of associated comments, because it doesn't have anything to do with DAGs or scheduling. This is another step in decoupling MachineInstr emitting from scheduling. llvm-svn: 85517	2009-10-29 18:10:34 +00:00
Dan Gohman	6b54c70e78	Following r84485, add Defs = [EFLAGS] to the 32-bit lock instructions too. llvm-svn: 84652	2009-10-20 18:14:49 +00:00
Chris Lattner	cf8c23d554	remove strings from instructions who are never asmprinted. All of these "subreg32" modifier instructions are handled explicitly by the MCInst lowering phase. If they got to the asmprinter, they would explode. They should eventually be replace with correct use of subregs. llvm-svn: 84526	2009-10-19 19:51:42 +00:00
Chris Lattner	f34b09b112	remove the asmstring, it is now dead. Improve comment. llvm-svn: 82390	2009-09-20 07:32:00 +00:00
Chris Lattner	d56be552eb	kill off printPICLabel now, it's specialness is handled by the MachineInstr ->MCInst lowering process, not in the asmprinter. llvm-svn: 82388	2009-09-20 07:28:26 +00:00
Chris Lattner	f411f53f9c	Add an intel syntax MCInstPrinter implementation. You can now transcode from AT&T to intel syntax with "llvm-mc foo.s -output-asm-variant=1" llvm-svn: 82385	2009-09-20 07:17:49 +00:00
Dan Gohman	0dcc5f9922	Add support for using the FLAGS result of or, xor, and and instructions on x86, to avoid explicit test instructions. A few existing tests changed due to arbitrary register allocation differences. llvm-svn: 82263	2009-09-18 19:59:53 +00:00
Sean Callanan	498be752e0	Added RCL and RCR (rotate left and right with a carry bit) instructions to the Intel instruction tables. llvm-svn: 82260	2009-09-18 19:35:23 +00:00
Sean Callanan	a025a7f352	Added the LODS (load byte into register, usually as part string parsing) instructions to the Intel instruction tables. llvm-svn: 82089	2009-09-16 22:59:28 +00:00
Sean Callanan	cb5724f556	Added the LAR (load segment access rights) instructions to the Intel instruction tables. llvm-svn: 82084	2009-09-16 21:55:34 +00:00
Sean Callanan	a0ec1cbaa9	Added the LOOP family of instructions to the Intel instruction tables. llvm-svn: 82083	2009-09-16 21:50:07 +00:00
Sean Callanan	38313b9f78	Added an alternate form of register-register CMP to the Intel instruction tables. llvm-svn: 82081	2009-09-16 21:11:23 +00:00
Sean Callanan	0fb60155bd	Added the ENTER instruction, which sets up a stack frame, to the Intel instruction tables. llvm-svn: 81995	2009-09-16 02:57:13 +00:00
Sean Callanan	a3e93882f3	Added the definitions for one-bit left shifts to the Intel instruction tables. The patterns will stay blank because ADD reg, reg is faster, but having the encoding available is useful for the disassembler. llvm-svn: 81994	2009-09-16 02:28:43 +00:00
Sean Callanan	a68b2a56bb	Added far return instructions (that is, returns to code in other segments) to the Intel instruction tables. llvm-svn: 81953	2009-09-15 23:37:51 +00:00
Sean Callanan	4dc743b7ff	Updated comments per Eli's suggestion. llvm-svn: 81923	2009-09-15 21:43:27 +00:00
Sean Callanan	e62a9a60c7	Added register-to-register ADD instructions to the Intel tables, where the source operand is specified by the R/M field and the destination operand by the Reg field. llvm-svn: 81914	2009-09-15 20:53:57 +00:00
Sean Callanan	3386a02b81	Added a new register class for segment registers to the Intel register table. Added 16- and 64-bit MOVs to and from the segment registers to the Intel instruction tables. llvm-svn: 81895	2009-09-15 18:47:29 +00:00
Sean Callanan	f6e983b998	Modified the Intel instruction tables to include versions of CALL and JMP with segmented addresses provided in-line, as pairs of immediates. llvm-svn: 81818	2009-09-15 00:35:17 +00:00
Sean Callanan	b608bb1128	Added the WAIT instruction to the Intel tables, for the purposes of the disassembler. llvm-svn: 81603	2009-09-12 02:52:41 +00:00
Sean Callanan	1da8919600	Added CMPS (string comparison) instructions for all operand widths to the Intel instruction tables, for the purposes of the disassembler. llvm-svn: 81601	2009-09-12 02:25:20 +00:00
Sean Callanan	e2f2aa65c9	Added SCAS instructions in their 8, 16, 32, and 64-bit variants for the disassembler. llvm-svn: 81591	2009-09-12 00:37:19 +00:00
Sean Callanan	26ea351ab4	Added ADC, SUB, SBB, and OR instructions that operate on rAX and an immediate. llvm-svn: 81551	2009-09-11 19:01:56 +00:00
Sean Callanan	9bf1cfc585	Added XOR instructions for rAX and immediates of various widths. llvm-svn: 81458	2009-09-10 19:52:26 +00:00
Sean Callanan	5e1568e95e	Added MOV instructions between rAX and memory offsets, including segment offsets and (for 8-bit operands) absolute offsets. llvm-svn: 81457	2009-09-10 18:33:42 +00:00
Sean Callanan	ce27a0feb7	Added a variety of PUSH and POP instructions, including ones capable of accessing R/M operands instead of just registers. llvm-svn: 81456	2009-09-10 18:29:13 +00:00
Dan Gohman	c50ad41cc5	Add a -disable-16bit flag and associated support for experimenting with disabling the use of 16-bit operations on x86. This doesn't yet work for inline asms with 16-bit constraints, vectors with 16-bit elements, trampoline code, and perhaps other obscurities, but it's enough to try some experiments. llvm-svn: 80930	2009-09-03 17:18:51 +00:00
Sean Callanan	1c6706b750	Added opaque 32-, 48-, and 80-bit memory operand types to the X86 instruction tables to support segmented addressing (and other objects of obscure type). Modified the X86 assembly printers to handle these new operand types. Added JMP and CALL instructions that use segmented addresses. llvm-svn: 80857	2009-09-03 00:04:47 +00:00
Sean Callanan	8dfa4a30bf	Fixed the asmstrings for 8-bit, 16-bit, and 32-bit ADD %rAX, imm instructions. Added a 64-bit ADD %RAX, imm32 instruction. Added all 4 forms for AND %rAX, imm and CMP %rAX, imm. llvm-svn: 80746	2009-09-02 00:55:49 +00:00
Sean Callanan	18ae1d3c8d	Added TEST %rAX, $imm instructions to the Intel tables. These are required for the X86 disassembler. llvm-svn: 80696	2009-09-01 18:14:18 +00:00
Dan Gohman	f7b76078bb	CMOV_GR8 clobbers EFLAGS when its expansion involves an xor to set a register to 0. This fixes PR4814. llvm-svn: 80445	2009-08-29 22:19:15 +00:00
Dan Gohman	457e656c16	Don't mark CMOV_GR8 as two-address, or commutable, since it's a pseudo. llvm-svn: 80271	2009-08-27 18:16:24 +00:00
Daniel Dunbar	87eb328bcf	X86: Mark EH_RETURN as code-gen-only. llvm-svn: 80232	2009-08-27 07:58:05 +00:00
Dan Gohman	613d152216	Expand i8 selects into control flow instead of 16-bit conditional moves. This avoids the need to promote the operands (or implicitly extend them, a partial register update condition), and can reduce i8 register pressure. This substantially speeds up code such as write_hex in lib/Support/raw_ostream.cpp. subclass-coalesce.ll is too trivial and no longer tests what it was originally intended to test. llvm-svn: 80184	2009-08-27 00:14:12 +00:00
Dan Gohman	6bd4a58365	Don't use INSERT_SUBREG to model anyext operations on x86-64, as it leads to partial-register definitions. To help avoid redundant zero-extensions, also teach the h-register matching patterns that use movzbl to match anyext as well as zext. llvm-svn: 80099	2009-08-26 14:59:13 +00:00
Dan Gohman	d69323d37a	On x86-64, for a varargs function, don't store the xmm registers to the register save area if %al is 0. This avoids touching xmm regsiters when they aren't actually used. llvm-svn: 79061	2009-08-15 01:38:56 +00:00
Daniel Dunbar	514498ccec	X86/AsmParser: Mark MOV64GSrm, MOV64FSrm, GS_MOV32rm, FS_MOV32rm as codegen only. llvm-svn: 78733	2009-08-11 22:24:40 +00:00
Daniel Dunbar	63f93255ae	Add 'isCodeGenOnly' bit to Instruction .td records. - Used to mark fake instructions which don't correspond to an actual machine instruction (or are duplicates of a real instruction). This is to be used for "special cases" in the .td files, which should be ignored by things like the assembler and disassembler. We still need a good solution to handle pervasive duplication, like with the Int_ instructions. - Set the bit on fake "mov 0" style instructions, which allows turning an assembler matcher warning into a hard error. - -2 FIXMEs. llvm-svn: 78731	2009-08-11 22:17:52 +00:00
Sean Callanan	b2288f269b	Added ADD instructions with rAX as one parameter to the Intel instruction tables. llvm-svn: 78721	2009-08-11 21:26:06 +00:00
Chris Lattner	edb3daa5e9	move some 32-bit instrs to x86instrinfo.td llvm-svn: 78680	2009-08-11 16:58:39 +00:00
Sean Callanan	b6295e7143	Added the x86 INT instructions; both the special-case INT 3 and the general-case INT i8. These instructions are only for interpretation by disassemblers, not for emission, so they do not as yet have patterns. llvm-svn: 78630	2009-08-11 01:09:06 +00:00
Daniel Dunbar	20829b121a	llvm-mc/AsmMatcher: Fix thinko, Mem isn't a subclass of Imm. llvm-svn: 78587	2009-08-10 19:08:02 +00:00
Daniel Dunbar	749ff1de5a	llvm-mc/AsmMatcher: Change assembler parser match classes to their own record structure. llvm-svn: 78581	2009-08-10 18:41:10 +00:00
Daniel Dunbar	15e6a41728	llvm-mc/AsmParser: Implement user defined super classes. - We can now discriminate SUB32ri8 from SUB32ri, for example. llvm-svn: 78530	2009-08-09 07:20:21 +00:00
Daniel Dunbar	dff8502076	llvm-mc/AsmParser: Define match classes in the .td file. -2 FIXMEs. llvm-svn: 78523	2009-08-09 05:18:30 +00:00
Anton Korobeynikov	0c6314a3e2	We need to sext global addresses in kernel code model, not zext llvm-svn: 78299	2009-08-06 11:23:24 +00:00
Anton Korobeynikov	9232ddb6f2	Missed part of recent kernel codemodel tweaks llvm-svn: 78293	2009-08-06 09:11:19 +00:00
Dan Gohman	ac47a4b9ed	Enable the new no-SP register classes by default. This is to address PR4572. A few tests have some minor code regressions due to different coalescing. llvm-svn: 78217	2009-08-05 17:40:24 +00:00
Dan Gohman	5d566d918b	Major calling convention code refactoring. Instead of awkwardly encoding calling-convention information with ISD::CALL, ISD::FORMAL_ARGUMENTS, ISD::RET, and ISD::ARG_FLAGS nodes, TargetLowering provides three virtual functions for targets to override: LowerFormalArguments, LowerCall, and LowerRet, which replace the custom lowering done on the special nodes. They provide the same information, but in a more immediately usable format. This also reworks much of the target-independent tail call logic. The decision of whether or not to perform a tail call is now cleanly split between target-independent portions, and the target dependent portion in IsEligibleForTailCallOptimization. This also synchronizes all in-tree targets, to help enable future refactoring and feature work. llvm-svn: 78142	2009-08-05 01:29:28 +00:00
Anton Korobeynikov	0bac80c138	Unbreak Win64 CC. Step one: honour register save area, fix some alignment and provide a different set of call-clobberred registers. llvm-svn: 77962	2009-08-03 08:12:53 +00:00
Dan Gohman	751baf25e9	Add a comment. llvm-svn: 77894	2009-08-02 16:10:01 +00:00
Dan Gohman	d36cbd0574	Resync lea32addr and lea64addr. llvm-svn: 77893	2009-08-02 16:09:17 +00:00
Evan Cheng	148032a1a2	Optimize some common usage patterns of atomic built-ins __sync_add_and_fetch() and __sync_sub_and_fetch. When the return value is not used (i.e. only care about the value in the memory), x86 does not have to use add to implement these. Instead, it can use add, sub, inc, dec instructions with the "lock" prefix. This is currently implemented using a bit of instruction selection trick. The issue is the target independent pattern produces one output and a chain and we want to map it into one that just output a chain. The current trick is to select it into a merge_values with the first definition being an implicit_def. The proper solution is to add new ISD opcodes for the no-output variant. DAG combiner can then transform the node before it gets to target node selection. Problem #2 is we are adding a whole bunch of x86 atomic instructions when in fact these instructions are identical to the non-lock versions. We need a way to add target specific information to target nodes and have this information carried over to machine instructions. Asm printer (or JIT) can use this information to add the "lock" prefix. llvm-svn: 77582	2009-07-30 08:33:02 +00:00
Dan Gohman	3c7e8160f6	Add a new register class to describe operands that can't be SP, due to x86 encoding restrictions. This is currently off by default because it may cause code quality regressions. This is for PR4572. llvm-svn: 77565	2009-07-30 01:56:29 +00:00
Sean Callanan	73b927334f	Added a 2+-byte NOP instruction to the Intel tables, for the assembler/disassembler to use. llvm-svn: 76914	2009-07-23 23:39:34 +00:00
Sean Callanan	1e9eb16a45	Added the unconditional JMP with an 8-bit relocation for the assembler / disassembler. llvm-svn: 76712	2009-07-22 01:05:20 +00:00
Evan Cheng	d76397061a	Add jumps with 8-bit relocation for assembler / disassembler. Patch by Sean Callanan. llvm-svn: 76536	2009-07-21 06:00:18 +00:00
Chris Lattner	a403645785	remove the "debug" modifier, it is only used by one instruction which can never be generated. llvm-svn: 75305	2009-07-10 22:34:11 +00:00
David Greene	c702ce1ab3	Add 256-bit memory operand support. llvm-svn: 74548	2009-06-30 19:24:59 +00:00
David Greene	d33e8e7d83	Add feature flags for AVX and FMA and fix some SSE4A feature flag initialization problems. llvm-svn: 74350	2009-06-26 22:46:54 +00:00
Sean Callanan	a227b42ac0	Test commit: fixed spacing. llvm-svn: 74022	2009-06-23 23:25:37 +00:00
Chris Lattner	580eecebbd	change TLS_ADDR lowering to lower to a real mem operand, instead of matching as a global with that gets printed with the :mem modifier. All operands to lea's should be handled with the lea32mem operand kind, and this allows the TLS stuff to do this. There are several better ways to do this, but I went for the minimal change since I can't really test this (beyond make check). This also makes the use of EBX explicit in the operand list in the 32-bit, instead of implicit in the instruction. llvm-svn: 73834	2009-06-20 20:38:48 +00:00
Chris Lattner	12ba79a2b7	eliminate the "call" operand modifier from the asm descriptions, modeling it as a pcrel immediate instead. This gets pc-rel weirdness out of the main printoperand codepath. llvm-svn: 73829	2009-06-20 19:34:09 +00:00
Eli Friedman	b2688e9b73	Misc tweaks to Intel asm printing to make it more compatible with MASM. Patch by Benedict Gaster. llvm-svn: 73753	2009-06-19 04:48:38 +00:00
Bill Wendling	43f2a61c26	The Ls and Qs were mixed up. Patch by Sean. llvm-svn: 73417	2009-06-15 20:59:31 +00:00
Bill Wendling	8b64cfd877	"The Intel instruction tables should include the 64-bit and 32-bit instructions that push immediate operands of 1, 2, and 4 bytes (extended to the native register size in each case). The assembly mnemonics are "pushl" and "pushq." One such instruction appears at the beginning of the "start" function , so this is essential for accurate disassembly when unwinding." Patch by Sean Callanan! llvm-svn: 73407	2009-06-15 19:39:04 +00:00
Dan Gohman	609f627ed7	Revert r72734. The Darwin assembler doesn't support the static relocation model on x86-64. Higher level logic should override the relocation model to PIC on x86_64-apple-darwin. llvm-svn: 72746	2009-06-03 00:37:20 +00:00
Evan Cheng	7e66d61bec	On Darwin x86_64 small code model doesn't guarantee code address fits in 32-bit. llvm-svn: 72734	2009-06-02 20:09:31 +00:00
Dale Johannesen	8b6ee9e312	Revert 72707 and 72709, for the moment. llvm-svn: 72712	2009-06-02 03:12:52 +00:00
Dale Johannesen	c08669561e	Make the implicit inputs and outputs of target-independent ADDC/ADDE use MVT::i1 (later, whatever it gets legalized to) instead of MVT::Flag. Remove CARRY_FALSE in favor of 0; adjust all target-independent code to use this format. Most targets will still produce a Flag-setting target-dependent version when selection is done. X86 is converted to use i32 instead, which means TableGen needs to produce different code in xxxGenDAGISel.inc. This keys off the new supportsHasI1 bit in xxxInstrInfo, currently set only for X86; in principle this is temporary and should go away when all other targets have been converted. All relevant X86 instruction patterns are modified to represent setting and using EFLAGS explicitly. The same can be done on other targets. The immediate behavior change is that an ADC/ADD pair are no longer tightly coupled in the X86 scheduler; they can be separated by instructions that don't clobber the flags (MOV). I will soon add some peephole optimizations based on using other instructions that set the flags to feed into ADC. llvm-svn: 72707	2009-06-01 23:27:20 +00:00
Evan Cheng	550fc9ba9f	More h-registers tricks: folding zext nodes. llvm-svn: 72558	2009-05-29 01:44:43 +00:00
Evan Cheng	e17c02e328	Try again. Allow call to immediate address for ELF or when in static relocation mode. llvm-svn: 72160	2009-05-20 04:53:57 +00:00
Evan Cheng	8a4887572e	Cannot use immediate as call absolute target in PIC mode. llvm-svn: 72154	2009-05-20 01:11:00 +00:00
Dale Johannesen	a0756109d8	Add OpSize to 16-bit ADC and SBB. llvm-svn: 72045	2009-05-18 21:41:59 +00:00
Dale Johannesen	6efc155312	Fill in the missing patterns for ADC and SBB. Some comment cleanup. llvm-svn: 72022	2009-05-18 17:44:15 +00:00
Dan Gohman	0edabc8a6f	Convert a subtract into a negate and an add when it helps x86 address folding. llvm-svn: 71446	2009-05-11 18:02:53 +00:00
Chris Lattner	5cc9a36d1c	Add basic support for code generation of addrspace(257) -> FS relative on x86. Patch by Zoltan Varga! llvm-svn: 70992	2009-05-05 18:52:19 +00:00
Dan Gohman	8a0e27efb2	Set mayLoad on MOVZX32_NOREXrm8 too. llvm-svn: 70466	2009-04-30 03:11:48 +00:00
Evan Cheng	b7d41a6680	Mark MOV8mr_NOREX and MOV8rm_NOREX as mayStore / mayLoad respectively. llvm-svn: 70461	2009-04-30 00:58:57 +00:00
Nate Begeman	9d121924fd	2nd attempt, fixing SSE4.1 issues and implementing feedback from duncan. PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. llvm-svn: 70225	2009-04-27 18:41:29 +00:00
Dan Gohman	a241dec2fc	Rename GR8_ABCD to GR8_ABCD_L and create GR8_ABCD_H, and use these to precisely describe the h-register subreg register classes. Thanks to Jakob Stoklund Olesen for spotting this and for the initial patch! Also, make getStoreRegOpcode and getLoadRegOpcode aware of the needs of h registers. llvm-svn: 70211	2009-04-27 16:41:36 +00:00
Dan Gohman	180fa04e35	Rename GR8_, GR16_, GR32_, and GR64_ to GR8_ABCD, GR16_ABCD, GR32_ABCD, and GR64_ABCD, respectively, to help describe them. llvm-svn: 70210	2009-04-27 16:33:14 +00:00
Dan Gohman	885b9c3688	Break up long multi-mnemonic strings into separate lines for readability. llvm-svn: 70209	2009-04-27 15:13:28 +00:00
Mon P Wang	904d654436	Revised 68749 to allow matching of load/stores for address spaces < 256. llvm-svn: 70197	2009-04-27 07:22:10 +00:00
Rafael Espindola	4e7a0bf1f1	Fix PR 4004 by including the call to __tls_get_addr in X86tlsaddr. This is not very elegant, but neither is the tls specification :-( llvm-svn: 69968	2009-04-24 12:59:40 +00:00
Rafael Espindola	0b1037ad26	Revert 69952. Causes testsuite failures on linux x86-64. llvm-svn: 69967	2009-04-24 12:40:33 +00:00
Nate Begeman	c1a09c7dfa	PR2957 ISD::VECTOR_SHUFFLE now stores an array of integers representing the shuffle mask internal to the node, rather than taking a BUILD_VECTOR of ConstantSDNodes as the shuffle mask. A value of -1 represents UNDEF. In addition to eliminating the creation of illegal BUILD_VECTORS just to represent shuffle masks, we are better about canonicalizing the shuffle mask, resulting in substantially better code for some classes of shuffles. A clean up of x86 shuffle code, and some canonicalizing in DAGCombiner is next. llvm-svn: 69952	2009-04-24 03:42:54 +00:00
Rafael Espindola	5adc7ad39e	TLS_addr64 and TLS_addr32 define RDI and EAX. They don't use them. This fixes PR4002. llvm-svn: 69672	2009-04-21 08:22:09 +00:00
Rafael Espindola	d74132e2c5	For general dynamic TLS access we must use leaq foo@TLSGD(%rip), %rdi as part of the instruction sequence. Using a register other than %rdi and then copying it to %rdi is not valid. llvm-svn: 69350	2009-04-17 14:35:58 +00:00
Dan Gohman	38bc0faa22	Fix 80-column violations. llvm-svn: 69204	2009-04-15 19:48:57 +00:00
Dan Gohman	a1fe2a3741	Add a new MOV8rr_NOREX, and make X86's copyRegToReg use it when either the source or destination is a physical h register. This fixes sqlite3 with the post-RA scheduler enabled. llvm-svn: 69111	2009-04-15 00:04:23 +00:00
Dan Gohman	8393d29bc8	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Dan Gohman	be7227005f	Implement x86 h-register extract support. - Add patterns for h-register extract, which avoids a shift and mask, and in some cases a temporary register. - Add address-mode matching for turning (X>>(8-n))&(255<<n), where n is a valid address-mode scale value, into an h-register extract and a scaled-offset address. - Replace X86's MOV32to32_ and related instructions with the new target-independent COPY_TO_SUBREG instruction. On x86-64 there are complicated constraints on h registers, and CodeGen doesn't currently provide a high-level way to express all of them, so they are handled with a bunch of special code. This code currently only supports extracts where the result is used by a zero-extend or a store, though these are fairly common. These transformations are not always beneficial; since there are only 4 h registers, they sometimes require extra move instructions, and this sometimes increases register pressure because it can force out values that would otherwise be in one of those registers. However, this appears to be relatively uncommon. llvm-svn: 68962	2009-04-13 16:09:41 +00:00
Chris Lattner	26aee059ba	a few fixes to "addrspace(256) is reference offset of GS segment register". It turns out that there are still several problems with this, will file a bugzilla. llvm-svn: 68749	2009-04-10 00:16:23 +00:00
Rafael Espindola	7eb72dc5f2	Re-apply 68552. Tested by bootstrapping llvm-gcc and using that to build llvm. llvm-svn: 68645	2009-04-08 21:14:34 +00:00
Bill Wendling	6e702cf68c	Temporarily revert r68552. This was causing a failure in the self-hosting LLVM builds. --- Reverse-merging (from foreign repository) r68552 into '.': U test/CodeGen/X86/tls8.ll U test/CodeGen/X86/tls10.ll U test/CodeGen/X86/tls2.ll U test/CodeGen/X86/tls6.ll U lib/Target/X86/X86Instr64bit.td U lib/Target/X86/X86InstrSSE.td U lib/Target/X86/X86InstrInfo.td U lib/Target/X86/X86RegisterInfo.cpp U lib/Target/X86/X86ISelLowering.cpp U lib/Target/X86/X86CodeEmitter.cpp U lib/Target/X86/X86FastISel.cpp U lib/Target/X86/X86InstrInfo.h U lib/Target/X86/X86ISelDAGToDAG.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp U lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.h U lib/Target/X86/AsmPrinter/X86IntelAsmPrinter.h U lib/Target/X86/X86ISelLowering.h U lib/Target/X86/X86InstrInfo.cpp U lib/Target/X86/X86InstrBuilder.h U lib/Target/X86/X86RegisterInfo.td llvm-svn: 68560	2009-04-07 22:35:25 +00:00
Rafael Espindola	0324937229	Reduce code duplication on the TLS implementation. This introduces a small regression on the generated code quality in the case we are just computing addresses, not loading values. Will work on it and on X86-64 support. llvm-svn: 68552	2009-04-07 21:37:46 +00:00
Evan Cheng	3e30bcbd69	When optimzing a mul by immediate into two, the resulting mul's should get a x86 specific node to avoid dag combiner from hacking on them further. llvm-svn: 68066	2009-03-30 21:36:47 +00:00
Rafael Espindola	aadb9af093	add 8 and 16 bit TLS moves. add a fixme note on how to remove code duplication. llvm-svn: 66932	2009-03-13 19:39:55 +00:00
Rafael Espindola	ff17d02271	Improve sext and zext of TLS variables. llvm-svn: 66922	2009-03-13 18:37:06 +00:00
Evan Cheng	d112c41d95	Re-apply 66024 with fixes: 1. Fixed indirect call to immediate address assembly. 2. Fixed JIT encoding by making the address pc-relative. llvm-svn: 66803	2009-03-12 18:15:39 +00:00
Dan Gohman	d30e108f0e	Revert r66024. The JIT encoding for CALLpcrel32 is wrong -- see PR3773, and the assembly text output uses an indirect call ("call *") instead of a direct call. llvm-svn: 66735	2009-03-11 23:01:47 +00:00
Rafael Espindola	a8fe373200	optimize i8 and i16 tls values. llvm-svn: 66725	2009-03-11 22:40:04 +00:00
Dan Gohman	f9599e6c5f	Don't use plain INC32 and DEC32 on x86-64; it needs INC64_32r and INC64_16r, because these instructions are encoded differently on x86-64. This fixes JIT regressions on x86-64 in kimwitu++ and others. llvm-svn: 66207	2009-03-05 21:32:23 +00:00
Dan Gohman	31fb085c2e	Re-apply 66008, now that the unfoldMemoryOperand bug is fixed. llvm-svn: 66058	2009-03-04 19:44:21 +00:00
Evan Cheng	7d9019d0f3	Fix PR3666: isel calls to constant addresses. llvm-svn: 66024	2009-03-04 06:48:53 +00:00
Dan Gohman	6831e2c2a6	Revert r66004 for now; it's causing a variety of test failures. llvm-svn: 66008	2009-03-04 03:54:19 +00:00
Dan Gohman	c6c669cc1e	Teach the x86 backend to eliminate "test" instructions by using the EFLAGS result from add, sub, inc, and dec instructions in simple cases. llvm-svn: 66004	2009-03-04 02:33:24 +00:00
Dan Gohman	3c6c7754b2	Add '(implicit EFLAGS)' for AND, OR, XOR, NEG, INC, and DEC instructions. These aren't used yet. llvm-svn: 65965	2009-03-03 19:53:46 +00:00
Evan Cheng	87def37f67	A few more isAsCheapAsAMove. llvm-svn: 63852	2009-02-05 08:42:55 +00:00
Evan Cheng	a05436f739	Implement multiple with overflow by 2 with an add instruction. llvm-svn: 63090	2009-01-27 03:30:42 +00:00
Nate Begeman	92efc4f0ce	Map address space 256 to gs; similar mappings could be supported for the other x86 segments. address space 0 is stack/default, 1-255 are reserved for client use. llvm-svn: 62980	2009-01-26 01:24:32 +00:00
Evan Cheng	0ed6a9d7e0	Favors generating "not" over "xor -1". For example. unsigned test(unsigned a) { return ~a; } llvm used to generate: movl $4294967295, %eax xorl 4(%esp), %eax Now it generates: movl 4(%esp), %eax notl %eax It's 3 bytes shorter. llvm-svn: 62661	2009-01-21 02:09:05 +00:00
Dan Gohman	8c835f6285	Disable the register+memory forms of the bt instructions for now. Thanks to Eli for pointing out that these forms don't ignore the high bits of their index operands, and as such are not immediately suitable for use by isel. llvm-svn: 62194	2009-01-13 23:23:30 +00:00
Dan Gohman	15e69a394a	Add bt instructions that take immediate operands. llvm-svn: 62180	2009-01-13 20:33:23 +00:00
Dan Gohman	e84cfeac5f	Fix a few more JIT encoding issues in the BT instructions. llvm-svn: 62179	2009-01-13 20:32:45 +00:00
Dan Gohman	ca4475dd7b	Add patterns to match conditional moves with loads folded into their left operand, rather than their right. Do this by commuting the operands and inverting the condition. llvm-svn: 61842	2009-01-07 01:00:24 +00:00
Dan Gohman	e78fdaec67	Define instructions for cmovo and cmovno. llvm-svn: 61836	2009-01-07 00:35:10 +00:00
Dan Gohman	2682e8745c	X86_COND_C and X86_COND_NC are alternate mnemonics for X86_COND_B and X86_COND_AE, respectively. llvm-svn: 61835	2009-01-07 00:15:08 +00:00
Evan Cheng	c52f942d67	Do not isel load folding bt instructions for pentium m, core, core2, and AMD processors. These are significantly slower than a load followed by a bt of a register. llvm-svn: 61557	2009-01-02 05:35:45 +00:00
Chris Lattner	062ed6e3dd	Fix some JIT encodings. llvm-svn: 61425	2008-12-25 01:32:49 +00:00
Chris Lattner	f34b843728	BT memory operands load from their address operand. llvm-svn: 61424	2008-12-25 01:27:10 +00:00

... 3 4 5 6 7 ...

905 Commits