llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Tim Northover	60e9ada729	X86: expand atomics in IR instead of as MachineInstrs. The logic for expanding atomics that aren't natively supported in terms of cmpxchg loops is much simpler to express at the IR level. It also allows the normal optimisations and CodeGen improvements to help out with atomics, instead of using a limited set of possible instructions.. rdar://problem/13496295 llvm-svn: 212119	2014-07-01 18:53:31 +00:00
Andrea Di Biagio	2bc066b1b4	[X86] Add support for builtin to read performance monitoring counters. This patch adds support for a new builtin instruction called __builtin_ia32_rdpmc. Builtin '__builtin_ia32_rdpmc' is defined as a 'GCC builtin'; on X86, it can be used to read performance monitoring counters. It takes as input the index of the performance counter to read, and returns the value of the specified performance counter as a 64-bit number. Calls to this new builtin will map to instruction RDPMC. The index in input to the builtin call is moved to register %ECX. The result of the builtin call is the value of the specified performance counter (RDPMC would return that quantity in registers RDX:RAX). This patch: - Adds builtin int_x86_rdpmc as a GCCBuiltin; - Adds a new x86 DAG node called 'RDPMC_DAG'; - Teaches how to lower this new builtin; - Adds an ISel pattern to select instruction RDPMC; - Fixes the definition of instruction RDPMC adding %RAX and %RDX as implicit definitions, and adding %ECX as implicit use; - Adds a LLVM test to verify that the new builtin is correctly selected. llvm-svn: 212049	2014-06-30 17:14:21 +00:00
Alexey Volkov	a3a5a1d7f1	[X86] Use ADD/SUB instead of INC/DEC for Silvermont According to Intel Software Optimization Manual on Silvermont INC or DEC instructions require an additional uop to merge the flags. As a result, a branch instruction depending on an INC or a DEC instruction incurs a 1 cycle penalty. Differential Revision: http://reviews.llvm.org/D3990 llvm-svn: 210466	2014-06-09 11:40:41 +00:00
Andrea Di Biagio	41bcee5bc3	[X86] Add ISel patterns to improve the selection of TZCNT and LZCNT. Instructions TZCNT (requires BMI1) and LZCNT (requires LZCNT), always provide the operand size as output if the input operand is zero. We can take advantage of this knowledge during instruction selection stage in order to simplify a few corner case. llvm-svn: 209159	2014-05-19 20:38:59 +00:00
Tim Northover	25b5918f36	X86: disable printing of bare "mov" aliases In AT&T syntax, we should probably print the full "movl" or "movw". TableGen used to ignore these aliases because it was miscounting the number of operands. This fixes the issue. This will be tested when the TableGen "should I print this Alias" heuristic is fixed (very soon). llvm-svn: 208963	2014-05-16 09:41:26 +00:00
Tim Northover	ac5dac4c75	TableGen: use correct MIOperand when printing aliases Previously, TableGen assumed that every aliased operand consumed precisely 1 MachineInstr slot (this was reasonable because until a couple of days ago, nothing more complicated was eligible for printing). This allows a couple more ARM64 aliases to print so we can remove the special code. On the X86 side, I've gone for explicit AT&T size specifiers as the default, so turned off a few of the aliases that would have just started printing. llvm-svn: 208880	2014-05-15 13:36:01 +00:00
Tim Northover	4ba95d4483	TableGen/ARM64: print aliases even if they have syntax variants. To get at least one use of the change (and some actual tests) in with its commit, I've enabled the AArch64 & ARM64 NEON mov aliases. llvm-svn: 208867	2014-05-15 11:16:32 +00:00
Andrea Di Biagio	dae3a5b91a	[X86] Add support for Read Time Stamp Counter x86 builtin intrinsics. This patch: - Adds two new X86 builtin intrinsics ('int_x86_rdtsc' and 'int_x86_rdtscp') as GCCBuiltin intrinsics; - Teaches the backend how to lower the two new builtins; - Introduces a common function to lower READCYCLECOUNTER dag nodes and the two new rdtsc/rdtscp intrinsics; - Improves (and extends) the existing x86 test 'rdtsc.ll'; now test 'rdtsc.ll' correctly verifies that both READCYCLECOUNTER and the two new intrinsics work fine for both 64bit and 32bit Subtargets. llvm-svn: 207127	2014-04-24 17:18:27 +00:00
Lang Hames	883a05a1d6	[X86] Require HasBMI2 for the new BZHI tablegen patterns. Evidently tablegen doesn't infer this from the HasBMI2 predicate on the BZHI instructions. This should fix the recent bot failures. llvm-svn: 206885	2014-04-22 12:04:53 +00:00
Lang Hames	067ce6ce24	[X86] Remove Tablegen def of X86bzhi SDNode: It's not needed as of r206879. llvm-svn: 206880	2014-04-22 10:50:46 +00:00
Lang Hames	c13d48ed4c	[X86] Use tablegen instead of DAG combines to match BZHI instructions, as suggested by Ben Kramer in review of r206738. Thanks again Ben! llvm-svn: 206879	2014-04-22 10:41:56 +00:00
Craig Topper	6d9ad3a694	[X86] Fix Uses/Defs lists for INS, OUTS, SCAS, CMPS, LODS llvm-svn: 202348	2014-02-27 05:08:25 +00:00
Craig Topper	e406fab3df	[X86] Add RAX/EAX/AX Uses/Defs to XCHG RAX/EAX/AX instructions. llvm-svn: 202347	2014-02-27 04:27:00 +00:00
Craig Topper	4fcab63947	[X86] Add RAX/EAX/AX/AL Uses/Defs to the absolute memory location move instructions. Patch by Florian Lukas with some additional instructions fixed by me. Fixes PR18975. llvm-svn: 202345	2014-02-27 04:07:57 +00:00
Craig Topper	b5b81fb98b	Add a bunch of OpSize32 tags to 64-bit mode only instructions to match their 32-bit mode counterparts for cases where there is also a OpSize16 instruction. llvm-svn: 201550	2014-02-18 08:18:29 +00:00
Craig Topper	de78f4304d	Add an x86 prefix encoding for instructions that would decode to a different instruction with 0xf2/f3/66 were in front of them, but don't themselves have a prefix. For now this doesn't change any bbehavior, but plan to use it to fix some bugs in the disassembler. llvm-svn: 201538	2014-02-18 00:21:49 +00:00
Craig Topper	17b586d7c5	Add opcode extension forms of MOV8ri/MOV16ri/MOV32ri. llvm-svn: 201463	2014-02-15 07:29:18 +00:00
Juergen Ributzka	40016f4730	[X86] Don't mark movabsq as cheap-as-move - it isn't that cheap. A simple register copy on X86 is just 3 bytes, whereas movabsq is a 10 byte instruction. Marking movabsq as not beeing cheap will allow LICM to move it out of the loop and it also prevents unnecessary rematerializations if the value is needed in more than one register. llvm-svn: 201377	2014-02-14 00:51:13 +00:00
Craig Topper	5f3cd9c3a9	Recommit r201059 and r201060 with hopefully a fix for its original failure. Original commits messages: Add MRMXr/MRMXm form to X86 for use by instructions which treat the 'reg' field of modrm byte as a don't care value. Will allow for simplification of disassembler code. Simplify a bunch of code by removing the need for the x86 disassembler table builder to know about extended opcodes. The modrm forms are sufficient to convey the information. llvm-svn: 201065	2014-02-10 06:55:41 +00:00
Bob Wilson	3e54e44d03	Revert r201059 and r201060. r201059 appears to cause a crash in a bootstrapped build of clang. Craig isn't available to look at it right now, so I'm reverting it while he investigates. llvm-svn: 201064	2014-02-10 05:28:30 +00:00
Craig Topper	c4ecc4bda5	Add MRMXr/MRMXm form to X86 for use by instructions which treat the 'reg' field of modrm byte as a don't care value. Will allow for simplification of disassembler code. llvm-svn: 201059	2014-02-10 00:50:34 +00:00
Craig Topper	4c6c325efa	Move matching for x86 BMI BLSI/BLSMSK/BLSR instructions to isel patterns instead of DAG combine. This weakens the ability to fold loads with them because we aren't able to match patterns that load the same thing twice. But maybe we should fix that if we care. The peephole optimizer will be able to fold some loads in its absense. llvm-svn: 200824	2014-02-05 07:09:40 +00:00
Craig Topper	e916566881	Merge x86 HasOpSizePrefix/HasOpSize16Prefix into a 2-bit OpSize field with 0 meaning no 0x66 prefix in any mode. Rename Opsize16->OpSize32 and OpSize->OpSize16. The classes now refer to their operand size rather than the mode in which they need a 0x66 prefix. Hopefully can merge REX_W into this as OpSize64. llvm-svn: 200626	2014-02-02 09:25:09 +00:00
Craig Topper	75deac3bc2	Merge HasVEXPrefix/HasEVEXPrefix/HasXOPPrefix into a 2-bit 'encoding' field in TSFlags. llvm-svn: 200624	2014-02-02 07:08:01 +00:00
David Woodhouse	10eb2a8985	[x86] Fix signed relocations for i64i32imm operands These should end up (in ELF) as R_X86_64_32S relocs, not R_X86_64_32. Kill the horrid and incomplete special case and FIXME in EncodeInstruction() and set things up so it can infer the signedness from the ImmType just like it can the size and whether it's PC-relative. llvm-svn: 200495	2014-01-30 22:20:41 +00:00
David Woodhouse	b7c155c55a	[x86] Allow segment and address-size overrides for OUTS[BWLQ] (PR9385) llvm-svn: 199808	2014-01-22 15:08:49 +00:00
David Woodhouse	39833d37a3	[x86] Allow segment and address-size overrides for MOVS[BWLQ] (PR9385) llvm-svn: 199807	2014-01-22 15:08:42 +00:00
David Woodhouse	4515fd303f	]x86] Allow segment and address-size overrides for CMPS[BWLQ] (PR9385) llvm-svn: 199806	2014-01-22 15:08:36 +00:00
David Woodhouse	02c50a95e8	[x86] Allow address-size overrides for SCAS{8,16,32,64} (PR9385) llvm-svn: 199805	2014-01-22 15:08:27 +00:00
David Woodhouse	59ef208820	[x86] Allow address-size overrides for STOS[BWLQ] (PR9385) llvm-svn: 199804	2014-01-22 15:08:21 +00:00
David Woodhouse	e01fc03be8	[x86] Allow segment and address-size overrides for LODS[BWLQ] (PR9385) llvm-svn: 199803	2014-01-22 15:08:08 +00:00
David Woodhouse	316c7ec362	[x86] Rename MOVSD/STOSD/LODSD/OUTSD to MOVSL/STOSL/LODSL/OUTSL The disassembler has a special case for 'L' vs. 'W' in its heuristic for checking for 32-bit and 16-bit equivalents. We could expand the heuristic, but better just to be consistent in using the 'L' suffix. llvm-svn: 199652	2014-01-20 12:02:44 +00:00
Craig Topper	f63b7bc430	Allow x86 mov instructions to/from memory with absolute address to be encoded and disassembled with a segment override prefix. Fixes PR16962. llvm-svn: 199364	2014-01-16 07:36:58 +00:00
David Woodhouse	a7b8d3d331	[x86] Fix retq/retl handling in 64-bit mode This finishes the job started in r198756, and creates separate opcodes for 64-bit vs. 32-bit versions of the rest of the RET instructions too. LRETL/LRETQ are interesting... I can't see any justification for their existence in the SDM. There should be no 'LRETL' in 64-bit mode, and no need for a REX.W prefix for LRETQ. But this is what GAS does, and my Sandybridge CPU and an Opteron 6376 concur when tested as follows: asm __volatile__("pushq $0x1234\nmovq $0x33,%rax\nsalq $32,%rax\norq $1f,%rax\npushq %rax\nlretl $8\n1:"); asm __volatile__("pushq $1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); asm __volatile__("pushq $0x33\npushq $1f\nlretq\n1:"); asm __volatile__("pushq $0x1234\npushq $0x33\npushq $1f\nlretq $8\n1:"); cf. PR8592 and commit r118903, which added LRETQ. I only added LRETIQ to match it. I don't quite understand how the Intel syntax parsing for ret instructions is working, despite r154468 allegedly fixing it. Aren't the explicitly sized 'retw', 'retd' and 'retq' supposed to work? I have at least made the 'lretq' work with (and indeed require) the 'q'. llvm-svn: 199106	2014-01-13 14:05:59 +00:00
David Woodhouse	5c8e725a8c	[x86] Do not relax PUSHi16 to PUSHi32 (PR18414) They do different things to %esp, so they are not equivalent. Rename PUSHi8 to PUSH32i8 and add the missing PUSH16i8. llvm-svn: 198761	2014-01-08 12:58:32 +00:00
David Woodhouse	399c563e5e	[x86] Fix MOV8ao8 et al for 16-bit mode, fix up disassembler to understand It seems there is no separate instruction class for having AdSize and OpSize bits set, which is required in order to disambiguate between all these instructions. So add that to the disassembler. Hm, perhaps we do need an AdSize16 bit after all? llvm-svn: 198759	2014-01-08 12:58:24 +00:00
David Woodhouse	e757b998ec	[x86] Disambiguate RET[QL] and fix aliases for 16-bit mode I couldn't see how to do this sanely without splitting RETQ from RETL. Eric says: "sad about the inability to roundtrip them now, but...". I have no idea what that means, but perhaps it wants preserving in the commit comment. llvm-svn: 198756	2014-01-08 12:58:07 +00:00
David Woodhouse	7d0744dadf	[x86] Disambiguate [LS][IG]DT{32,64}m and add 16-bit versions, fix aliases llvm-svn: 198755	2014-01-08 12:57:55 +00:00
David Woodhouse	38e7bff9f3	[x86] Add JMP16[rm],CALL16[rm] instructions, and fix up aliases llvm-svn: 198754	2014-01-08 12:57:49 +00:00
David Woodhouse	9c25ab3643	[x86] Add PUSHA16,POPA16 instructions, and fix aliases for 16-bit mode llvm-svn: 198753	2014-01-08 12:57:45 +00:00
David Woodhouse	8bc826fd14	[x86] Add OpSize16 to instructions that need it This fixes the bulk of 16-bit output, and the corresponding test case x86-16.s now looks mostly like the x86-32.s test case that it was originally based on. A few irrelevant instructions have been dropped, and there are still some corner cases to be fixed in subsequent patches. llvm-svn: 198752	2014-01-08 12:57:40 +00:00
Craig Topper	201bd5add3	[x86] Add basic support for .code16 This is not really expected to work right yet. Mostly because we will still emit the OpSize (0x66) prefix in all the wrong places, along with a number of other corner cases. Those will all be fixed in the subsequent commits. Patch from David Woodhouse. llvm-svn: 198584	2014-01-06 04:55:54 +00:00
Craig Topper	3ff26e1332	Fix encoding for PUSH64i16. Add In64BitMode Predicate. Remove disassembler hack. llvm-svn: 198547	2014-01-05 05:46:38 +00:00
Craig Topper	4a48c26e38	Add a new x86 specific instruction flag to force some isCodeGenOnly instructions to go through to the disassembler tables without resorting to string matches. Apply flag to all _REV instructions. llvm-svn: 198543	2014-01-05 04:17:28 +00:00
Craig Topper	83505ed565	Mark the 64-bit x86 push/pop instructions as In64BitMode. Mark the corresponding 32-bit versions with the same encodings Not64BitMode. Remove hack from tablegen disassembler table emitter. Fix bad test. llvm-svn: 198530	2014-01-05 01:35:51 +00:00
Craig Topper	839231f528	Mark REX64_PREFIX as In64BitMode, remove hack from X86RecognizableInstr. llvm-svn: 198336	2014-01-02 19:12:10 +00:00
Elena Demikhovsky	7174584583	AVX-512: Added intrinsics for vcvt, vcvtt, vrndscale, vcmp Printing rounding control. Enncoding for EVEX_RC (rounding control). llvm-svn: 198277	2014-01-01 15:12:34 +00:00
Eric Christopher	24d8bb6edd	[x86] Rename In32BitMode predicate to Not64BitMode That's what it actually means, and with 16-bit support it's going to be a little more relevant since in a few corner cases we may actually want to distinguish between 16-bit and 32-bit mode (for example the bare 'push' aliases to pushw/pushl etc.) Patch by David Woodhouse llvm-svn: 197768	2013-12-20 02:04:49 +00:00
Elena Demikhovsky	b43ccbc3f7	AVX-512: Added legal type MVT::i1 and VK1 register for it. Added scalar compare VCMPSS, VCMPSD. Implemented LowerSELECT for scalar FP operations. I replaced FSETCCss, FSETCCsd with one node type FSETCCs. Node extract_vector_elt(v16i1/v8i1, idx) returns an element of type i1. llvm-svn: 197384	2013-12-16 13:52:35 +00:00
Elena Demikhovsky	154413adc2	AVX-512: Removed "z" suffix from AVX-512 instructions, since it is incompatible with GCC. I moved a test from avx512-vbroadcast-crash.ll to avx512-vbroadcast.ll I defined HasAVX512 predicate as AssemblerPredicate. It means that you should invoke llvm-mc with "-mcpu=knl" to get encoding for AVX-512 instructions. I need this to let AsmMatcher to set different encoding for AVX and AVX-512 instructions that have the same mnemonic and operands (all scalar instructions). llvm-svn: 197041	2013-12-11 14:31:04 +00:00
Yunzhong Gao	23e948dd2f	Enabling 3DNow! prefetch instruction for a few AMD processors: bobcat, jaguar, bulldozer and piledriver. Support for the instruction itself seems to have already been added in r178040. Differential Revision: http://llvm-reviews.chandlerc.com/D1933 llvm-svn: 192828	2013-10-16 19:04:11 +00:00
Craig Topper	1548551887	Allow pinsrw/pinsrb/pextrb/pextrw/movmskps/movmskpd/pmovmskb/extractps instructions to parse either GR32 or GR64 without resorting to duplicating instructions. llvm-svn: 192567	2013-10-14 04:55:01 +00:00
Elena Demikhovsky	f24ecf7862	AVX-512: Added VRCP28 and VRSQRT28 instructions and intrinsics. llvm-svn: 192283	2013-10-09 08:16:14 +00:00
Craig Topper	94a706d015	Remove underscores from TBM instruction names for consistency with other instruction naming. llvm-svn: 192040	2013-10-05 19:27:26 +00:00
Craig Topper	0a8f3fc996	Remove unneeded TBM intrinsics. The arithmetic/logical operation patterns are sufficient. llvm-svn: 192039	2013-10-05 19:22:59 +00:00
Craig Topper	d0a63f6722	Add an additional pattern for BLCI since opt can turn (not (add x, 1)) into (sub -2, x). llvm-svn: 192037	2013-10-05 17:17:53 +00:00
Craig Topper	6fb0648c41	Add XOP disassembler support. Fixes PR13933. llvm-svn: 191874	2013-10-03 05:17:48 +00:00
Craig Topper	5ac188d0f2	Add patterns for selecting TBM instructions from logical operations. Patch from Yunzhong Gao. llvm-svn: 191871	2013-10-03 04:16:45 +00:00
Craig Topper	401688a9b1	BEXTR should be defined to take same type for bother operands. llvm-svn: 191728	2013-10-01 03:48:26 +00:00
Yunzhong Gao	e51da27a74	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Preston Gurd	0411803c14	Adds support for Atom Silvermont (SLM) - -march=slm Implements Instruction scheduler latencies for Silvermont, using latencies from the Intel Silvermont Optimization Guide. Auto detects SLM. Turns on post RA scheduler when generating code for SLM. llvm-svn: 190717	2013-09-13 19:23:28 +00:00
Ben Langmuir	9981cd7cfe	Partial support for Intel SHA Extensions (sha1rnds4) Add basic assembly/disassembly support for the first Intel SHA instruction 'sha1rnds4'. Also includes feature flag, and test cases. Support for the remaining instructions will follow in a separate patch. llvm-svn: 190611	2013-09-12 15:51:31 +00:00
Craig Topper	ac623f26d9	Add neverHasSideEffects=1 on a couple move instructions. llvm-svn: 190259	2013-09-08 00:50:45 +00:00
Craig Topper	6009a9c268	Create BEXTR instructions for (and ((sra or srl) x, imm), (2**size - 1)). Fixes PR17028. llvm-svn: 189742	2013-09-02 07:53:17 +00:00
Craig Topper	cc66aa8db1	Fixup BZHI selection to remove an unneeded zero extension. llvm-svn: 189656	2013-08-30 07:16:16 +00:00
Craig Topper	d9ee224519	Remove unused X86andn_flag node. llvm-svn: 189654	2013-08-30 07:06:26 +00:00
Craig Topper	dad5a27c09	Teach X86 backend to create BMI2 BZHI instructions from (and X, (add (shl 1, Y), -1)). Fixes PR17038. llvm-svn: 189653	2013-08-30 06:52:21 +00:00
Craig Topper	9e78242753	Remove some unnecessary PredicateMethod overrides. Add RenderMethod overrides to remove forwarding in the X86AsmParser code itself. No functional change. llvm-svn: 189205	2013-08-26 00:13:09 +00:00
Craig Topper	3e34d36eda	Put some of the AVX-512 parsing stuff in a more consistent place with the existing functions. llvm-svn: 189204	2013-08-25 23:18:05 +00:00
Craig Topper	3f15756092	First round of fixes for the x86 fixes for the x86 move accumulator from/to memory offset instructions. -Assembly parser now properly check the size of the memory operation specified in intel syntax. So 'mov word ptr [5], al' is no longer accepted. -x86-32 disassembly of these instructions no longer sign extends the 32-bit address immediate based on size. -Intel syntax printing prints the ptr size and places brackets around the address immediate. Known remaining issues with these instructions: -Segment override prefix is not supported. PR16962 and PR16961. -Immediate size should be changed by address size prefix. llvm-svn: 189201	2013-08-25 22:23:38 +00:00
Craig Topper	285493561b	Add hasSideEffects/mayLoad/mayStore flags to the X86 moffs8/moffs16/moffs32/moffs64 versions of move. llvm-svn: 189182	2013-08-24 20:31:14 +00:00
Craig Topper	b786cca090	Remove trailing whitespace. llvm-svn: 189178	2013-08-24 19:50:11 +00:00
Elena Demikhovsky	175a2e60dd	Added INSERT and EXTRACT intructions from AVX-512 ISA. All insertf/extractf functions replaced with insert/extract since we have insertf and inserti forms. Added lowering for INSERT_VECTOR_ELT / EXTRACT_VECTOR_ELT for 512-bit vectors. Added lowering for EXTRACT/INSERT subvector for 512-bit vectors. Added a test. llvm-svn: 187491	2013-07-31 11:35:14 +00:00
Craig Topper	45e8fdfc7f	Changed register names (and pointer keywords) to be lower case when using Intel X86 assembler syntax. Patch by Richard Mitton. llvm-svn: 187476	2013-07-31 02:47:52 +00:00
Craig Topper	191a24ae80	Fixed incorrect disassembly for MOV16o16a when using Intel syntax. Patch by Richard Mitton. llvm-svn: 187471	2013-07-31 01:50:26 +00:00
Elena Demikhovsky	505373db43	Added encoding prefixes for KNL instructions (EVEX). Added 512-bit operands printing. Added instruction formats for KNL instructions. llvm-svn: 187324	2013-07-28 08:28:38 +00:00
Craig Topper	de69f10e0c	Fix more Intel syntax issues with FP instruction aliases. Test cases coming in a subsequent patch. llvm-svn: 187187	2013-07-26 05:37:46 +00:00
Craig Topper	0b10bffb5b	Fix aliases for shrd/shld to handle Intel syntax properly. Also suppress them from being used by the asm printer. llvm-svn: 187020	2013-07-24 04:38:13 +00:00
Craig Topper	c7bcbfdf42	Don't let x86 asm printer use the no operand movsd alias. It should use the normal movsl instead. llvm-svn: 186924	2013-07-23 01:50:47 +00:00
Craig Topper	693606b026	Revert r186907 to fix bots. llvm-svn: 186910	2013-07-23 01:29:37 +00:00
Craig Topper	5337e87594	Don't let x86 asm printer use the no operand movsd alias. It should use the normal movsl instead. llvm-svn: 186907	2013-07-23 01:21:36 +00:00
Craig Topper	5399a0520e	Add aliases to map 'imm, mem' form of x86 bts/btr/btc without a size suffix to their 32-bit forms. This makes them consistent with 'bt' which already had this handling. gas has the same behavior. There have been discussions on the mailing list about determining size based on the immediate, but my goal here was just to remove the inconsistency. llvm-svn: 186904	2013-07-23 00:56:15 +00:00
Craig Topper	f89dce68af	Explicitly don't let the asm printer use the clrb/w/l aliases for xor %reg, %reg. It only didn't use it before because it seems InstAlias handling in the asm printer fails to count tied operands so it tried to find an xor with 2 operands instead of the 3 it wfails to count tied. llvm-svn: 186900	2013-07-23 00:15:19 +00:00
Craig Topper	22b26f24f4	Suppress argumentless aliases for some x86 FP operations from being used by the asm writer. Prefer to use the explicit %st(1) form. llvm-svn: 186897	2013-07-23 00:03:33 +00:00
Kevin Enderby	4f4de8ba04	Fix the move to/from accumulator register instructions that use a full 64-bit absolute address encoded in the instruction. rdar://8612627 and rdar://14299221 llvm-svn: 186878	2013-07-22 21:25:31 +00:00
Craig Topper	86b83be14d	Recommit r186813: More Intel syntax alias fixes. With the addition of suppressing some of the aliases from being emitted by the asm printer. llvm-svn: 186869	2013-07-22 20:46:37 +00:00
Tim Northover	3eb0204626	Revert "More Intel syntax alias fixes." This reverts commit r186813, which broke the bots. llvm-svn: 186818	2013-07-22 11:02:32 +00:00
Craig Topper	9f41c21050	More Intel syntax alias fixes. llvm-svn: 186814	2013-07-22 09:58:07 +00:00
Craig Topper	7f84e2a099	More Intel syntax alias fixes. llvm-svn: 186813	2013-07-22 09:42:31 +00:00
Craig Topper	91ad9abf00	Add Intel variants to aliases for some FP instructions. llvm-svn: 186811	2013-07-22 09:18:43 +00:00
Craig Topper	30d02185df	Reverse operands for Intel syntax form of 'bt' alias. llvm-svn: 186809	2013-07-22 07:47:51 +00:00
Ahmed Bougacha	1560b2eb19	X86: POP*rmm: move address operand to (ins) from (outs). llvm-svn: 185292	2013-06-30 20:44:50 +00:00
Benjamin Kramer	f458149c7d	X86: Make the cmov aliases work with intel syntax too. llvm-svn: 183907	2013-06-13 15:45:24 +00:00
Tim Northover	4ba890d132	X86: Stop LEA64_32r doing unspeakable things to its arguments. Previously LEA64_32r went through virtually the entire backend thinking it was using 32-bit registers until its blissful illusions were cruelly snatched away by MCInstLower and 64-bit equivalents were substituted at the last minute. This patch makes it behave normally, and take 64-bit registers as sources all the way through. Previous uses (for 32-bit arithmetic) are accommodated via SUBREG_TO_REG instructions which make the types and classes agree properly. llvm-svn: 183693	2013-06-10 20:43:49 +00:00
Tim Northover	e84e621d63	Revert r183069: "TMP: LEA64_32r fixing" Very sorry, it was committed from the wrong branch by mistake. llvm-svn: 183070	2013-06-01 10:23:46 +00:00
Tim Northover	93287c3991	TMP: LEA64_32r fixing llvm-svn: 183069	2013-06-01 10:21:54 +00:00
Chad Rosier	7da7292b4e	[ms-inline asm] Fix a crasher when we fail on a direct match. The issue was that the MatchingInlineAsm and VariantID args to the MatchInstructionImpl function weren't being set properly. Specifically, when parsing intel syntax, the parser thought it was parsing inline assembly in the at&t dialect; that will never be the case. The crash was caused when the emitter tried to emit the instruction, but the operands weren't set. When parsing inline assembly we only set the opcode, not the operands, which is used to lookup the instruction descriptor. rdar://13854391 and PR15945 Also, this commit reverts r176036. Now that we're correctly parsing the intel syntax the pushad/popad don't match properly. I've reimplemented that fix using a MnemonicAlias. llvm-svn: 181620	2013-05-10 18:24:17 +00:00
Chad Rosier	9eb6febf54	[ms-inline asm] Apply the condition code mnemonic aliases to both the Intel and AT&T dialect. Test case for r179804 as well. rdar://13674398 and PR13340. llvm-svn: 179813	2013-04-18 23:16:12 +00:00
Chad Rosier	645b701422	[asm parser] Add support for predicating MnemonicAlias based on the assembler variant/dialect. Addresses a FIXME in the emitMnemonicAliases function. Use and test case to come shortly. rdar://13688439 and part of PR13340. llvm-svn: 179804	2013-04-18 22:35:36 +00:00
Michael Liao	427149cbcf	Add support of RDSEED defined in AVX2 extension llvm-svn: 178314	2013-03-28 23:41:26 +00:00
Michael Liao	d961d7a7b3	Skip moving call address loading into callseq when targets prefer register indirect call. To enable a load of a call address to be folded with that call, this load is moved from outside of callseq into callseq. Such a moving adds a non-glued node (that load) into a glued sequence. This non-glue load is only removed when DAG selection folds them into a memory form call instruction. When such instruction selection is disabled, it breaks DAG schedule. To prevent that, such moving is disabled when target favors register indirect call. Previous workaround disabling CALL32m/CALL64m insn selection is removed. llvm-svn: 178308	2013-03-28 23:13:21 +00:00
Michael Liao	bd3f6b0eea	Add XTEST codegen support llvm-svn: 178083	2013-03-26 22:47:01 +00:00
Michael Liao	3515920fbd	Add HLE target feature llvm-svn: 178082	2013-03-26 22:46:02 +00:00
Jakob Stoklund Olesen	9e6a9659f1	Annotate the rest of X86InstrInfo.td with SchedRW lists. llvm-svn: 178048	2013-03-26 18:24:15 +00:00
Michael Liao	969ef73c31	Add PREFETCHW codegen support - Add 'PRFCHW' feature defined in AVX2 ISA extension llvm-svn: 178040	2013-03-26 17:47:11 +00:00
Jakob Stoklund Olesen	6ffb4136aa	Add a WriteMicrocoded for ancient microcoded instructions. llvm-svn: 177611	2013-03-21 00:07:17 +00:00
Jakob Stoklund Olesen	a56394519c	Annotate a lot of X86InstrInfo.td with SchedRW lists. llvm-svn: 177417	2013-03-19 18:03:55 +00:00
Chad Rosier	77e46d6eb6	[ms-inline asm] Add support for the pushad/popad mnemonics. rdar://13254235 llvm-svn: 176036	2013-02-25 19:06:27 +00:00
Kay Tiong Khoo	45b3d90921	added basic support for Intel ADX instructions -feature flag, instructions definitions, test cases llvm-svn: 175196	2013-02-14 19:08:21 +00:00
David Sehr	59597001bc	Two changes relevant to LEA and x32: 1) allows the use of RIP-relative addressing in 32-bit LEA instructions under x86-64 (ILP32 and LP64) 2) separates the size of address registers in 64-bit LEA instructions from control by ILP32/LP64. llvm-svn: 174208	2013-02-01 19:28:09 +00:00
Craig Topper	8884832622	Remove # from the beginning and end of def names. llvm-svn: 171696	2013-01-07 05:26:58 +00:00
Kevin Enderby	255beae3d9	Adds missing aliases for fcom and fcomp instructions without arguments. Patch by Michael M Kuperstein! llvm-svn: 171414	2013-01-02 21:20:15 +00:00
Craig Topper	5650ea25b7	Add mayLoad, mayStore, and hasSideEffects tags to BT/BTS/BTR/BTC instructions. Shouldn't change any functionality since they don't have patterns to select them. llvm-svn: 171128	2012-12-27 02:01:33 +00:00
Craig Topper	9e3cb122f1	Fix operands and encoding form for ARPL instruction. Register form had and reversed. Memory form writes memory, but was marked as MRMSrcMem. llvm-svn: 171123	2012-12-26 23:27:57 +00:00
Craig Topper	152bee45fa	Mark all the _REV instructions as not having side effects. They aren't really emitted by the backend, but it reduces the number of instructions in the output files with unmodelled side effects to make auditing easier. llvm-svn: 171118	2012-12-26 21:30:22 +00:00
Craig Topper	3949d5f59c	Remove EFLAGS from the BLSI/BLSMSK/BLSR patterns. The nodes created by DAG combine don't contain an EFLAGS def. llvm-svn: 170308	2012-12-17 06:13:48 +00:00
Jim Grosbach	2742e92ea2	X86: Better diagnostics for 32-bit vs. 64-bit mode mismatches. When an instruction as written requires 32-bit mode and we're assembling in 64-bit mode, or vice-versa, issue a more specific diagnostic about what's wrong. rdar://12700702 llvm-svn: 167937	2012-11-14 18:04:47 +00:00
Michael Liao	59114df23b	Add support of RTM from TSX extension - Add RTM code generation support throught 3 X86 intrinsics: xbegin()/xend() to start/end a transaction region, and xabort() to abort a tranaction region llvm-svn: 167573	2012-11-08 07:28:54 +00:00
Michael Liao	a7e5913fde	Add __builtin_setjmp/_longjmp supprt in X86 backend - Besides used in SjLj exception handling, __builtin_setjmp/__longjmp is also used as a light-weight replacement of setjmp/longjmp which are used to implementation continuation, user-level threading, and etc. The support added in this patch ONLY addresses this usage and is NOT intended to support SjLj exception handling as zero-cost DWARF exception handling is used by default in X86. llvm-svn: 165989	2012-10-15 22:39:43 +00:00
Craig Topper	14aba4d92c	Separate AVXCC and SSECC printing for cmpps/pd/ss/sd and add masking before the switch statement. This keeps the unreachable default case from being hit if the instruction was created with an intrinsic with too large of an immediate. llvm-svn: 165483	2012-10-09 05:26:13 +00:00
Craig Topper	e6dc3ad641	Remove hasNoAVX method. Can just invert hasAVX instead. llvm-svn: 164664	2012-09-26 06:29:37 +00:00
Michael Liao	439a9cea68	Revise td of X86 atomic instructions - Rewirte most atomic instructions in templates for both better maintenance and future extensions, such as HLE in TSX. llvm-svn: 164357	2012-09-21 03:00:17 +00:00
Jakob Stoklund Olesen	eae8fc91cf	Revert r163761 "Don't fold indexed loads into TCRETURNmi64." The patch caused "Wrong topological sorting" assertions. llvm-svn: 163810	2012-09-13 16:52:17 +00:00
Jakob Stoklund Olesen	b15912aafd	Don't fold indexed loads into TCRETURNmi64. We don't have enough GR64_TC registers when calling a varargs function with 6 arguments. Since %al holds the number of vector registers used, only %r11 is available as a scratch register. This means that addressing modes using both base and index registers can't be folded into TCRETURNmi64. <rdar://problem/12282281> llvm-svn: 163761	2012-09-13 00:25:00 +00:00
Chad Rosier	419fa9e0b0	Update function names to conform to guidelines. No functional change intended. llvm-svn: 163561	2012-09-10 22:50:57 +00:00
Michael Liao	b6735b87b0	Introduce 'UseSSEx' to force SSE legacy encoding - Add 'UseSSEx' to force SSE legacy insn not being selected when AVX is enabled. As the penalty of inter-mixing SSE and AVX instructions, we need prevent SSE legacy insn from being generated except explicitly specified through some intrinsics. For patterns supported by both SSE and AVX, so far, we force AVX insn will be tried first relying on AddedComplexity or position in td file. It's error-prone and introduces bugs accidentally. 'UseSSEx' is disabled when AVX is turned on. For SSE insns inherited by AVX, we need this predicate to force VEX encoding or SSE legacy encoding only. For insns not inherited by AVX, we still use the previous predicates, i.e. 'HasSSEx'. So far, these insns fall into the following categories: * SSE insns with MMX operands * SSE insns with GPR/MEM operands only (xFENCE, PREFETCH, CLFLUSH, CRC, and etc.) * SSE4A insns. * MMX insns. * x87 insns added by SSE. 2 test cases are modified: - test/CodeGen/X86/fast-isel-x86-64.ll AVX code generation is different from SSE one. 'vcvtsi2sdq' cannot be selected by fast-isel due to complicated pattern and fast-isel fallback to materialize it from constant pool. - test/CodeGen/X86/widen_load-1.ll AVX code generation is different from SSE one after fixing SSE/AVX inter-mixing. Exec-domain fixing prefers 'vmovapd' instead of 'vmovaps'. llvm-svn: 162919	2012-08-30 16:54:46 +00:00
Craig Topper	b524d2e36d	Add HasAVX1Only predicate and use it for patterns that have an AVX1 instruction and an AVX2 instruction rather than relying on AddedComplexity. llvm-svn: 162654	2012-08-27 06:08:57 +00:00
Jakob Stoklund Olesen	2f512d8eba	X86MemBarrier has unmodeled side effects. llvm-svn: 162514	2012-08-24 00:31:10 +00:00
Craig Topper	b144f3b6db	Make x86 asm parser to check for xmm vs ymm for index register in gather instructions. Also fix Intel syntax for gather instructions to use 'DWORD PTR' or 'QWORD PTR' to match gas. llvm-svn: 160420	2012-07-18 04:11:12 +00:00
Benjamin Kramer	558c56f216	Give the rdrand instructions a SideEffect flag and a chain so MachineCSE and MachineLICM don't touch it. I already had the necessary things in place for IR-level passes but missed the machine passes. llvm-svn: 160137	2012-07-12 18:14:57 +00:00
Benjamin Kramer	f8e67a04f4	Add intrinsics for Ivy Bridge's rdrand instruction. The rdrand/cmov sequence is the same that is emitted by both GCC and ICC. Fixes PR13284. llvm-svn: 160117	2012-07-12 09:31:43 +00:00
Manman Ren	63bf58865a	X86: add more GATHER intrinsics in LLVM Corrected type for index of llvm.x86.avx2.gather.d.pd.256 from 256-bit to 128-bit. Corrected types for src\|dst\|mask of llvm.x86.avx2.gather.q.ps.256 from 256-bit to 128-bit. Support the following intrinsics: llvm.x86.avx2.gather.d.q, llvm.x86.avx2.gather.q.q llvm.x86.avx2.gather.d.q.256, llvm.x86.avx2.gather.q.q.256 llvm.x86.avx2.gather.d.d, llvm.x86.avx2.gather.q.d llvm.x86.avx2.gather.d.d.256, llvm.x86.avx2.gather.q.d.256 llvm-svn: 159402	2012-06-29 00:54:20 +00:00
Manman Ren	6be46b7b4c	X86: add GATHER intrinsics (AVX2) in LLVM Support the following intrinsics: llvm.x86.avx2.gather.d.pd, llvm.x86.avx2.gather.q.pd llvm.x86.avx2.gather.d.pd.256, llvm.x86.avx2.gather.q.pd.256 llvm.x86.avx2.gather.d.ps, llvm.x86.avx2.gather.q.ps llvm.x86.avx2.gather.d.ps.256, llvm.x86.avx2.gather.q.ps.256 Modified Disassembler to handle VSIB addressing mode. llvm-svn: 159221	2012-06-26 19:47:59 +00:00
Craig Topper	5837bcfc02	Rename FMA3 feature flag to just FMA to match gcc so it can be added to clang. llvm-svn: 157903	2012-06-03 18:58:46 +00:00
Hans Wennborg	4344ad4a86	Implement the local-dynamic TLS model for x86 (PR3985) This implements codegen support for accesses to thread-local variables using the local-dynamic model, and adds a clean-up pass so that the base address for the TLS block can be re-used between local-dynamic access on an execution path. llvm-svn: 157818	2012-06-01 16:27:21 +00:00
Benjamin Kramer	cb686400fb	X86: Rename the CLMUL target feature to PCLMUL. It was renamed in gcc/gas a while ago and causes all kinds of confusion because it was named differently in llvm and clang. llvm-svn: 157745	2012-05-31 14:34:17 +00:00
Preston Gurd	236873fb5d	Added X86 Atom latencies for instructions in X86InstrInfo.td. llvm-svn: 156579	2012-05-10 21:58:35 +00:00
Jakob Stoklund Olesen	88cf278739	Use ptr_rc_tailcall instead of GR32_TC. The getPointerRegClass() hook will return GR32_TC, or whatever is appropriate for the current function. Patch by Yiannis Tsiouris! llvm-svn: 156459	2012-05-09 01:50:09 +00:00
Benjamin Kramer	1380494168	X86: Don't emit conditional floating point moves on when targeting pre-pentiumpro architectures. * Model FPSW (the FPU status word) as a register. * Add ISel patterns for the FUCOM, FNSTSW and SAHF instructions. During Legalize/Lowering, build a node sequence to transfer the comparison result from FPSW into EFLAGS. If you're wondering about the right-shift: That's an implicit sub-register extraction (%ax -> %ah) which is handled later on by the instruction selector. Fixes PR6679. Patch by Christoph Erhardt! llvm-svn: 155704	2012-04-27 12:07:43 +00:00
Craig Topper	ce6c05e0df	Add support for AVX enhanced comparison predicates. Patch from Kay Tiong Khoo. llvm-svn: 153935	2012-04-03 05:20:24 +00:00
Eli Friedman	c397259ea6	Fix the operand ordering on aliases for shld and shrd. PR12173, part 2. llvm-svn: 152136	2012-03-06 19:58:46 +00:00
Eli Friedman	4a049305a9	Make aliases for shld and shrd match gas. PR12173. llvm-svn: 152014	2012-03-05 04:31:54 +00:00
Chad Rosier	ff495392fa	Add q suffix aliases for the fistp and fisttp mnemonics. rdar://10921670 PR11935 llvm-svn: 151543	2012-02-27 19:43:12 +00:00
Michael J. Spencer	d2f0ce2674	Add WIN_FTOL_* psudo-instructions to model the unique calling convention used by the Win32 _ftol2 runtime function. Patch by Joe Groff! llvm-svn: 151382	2012-02-24 19:01:22 +00:00
Jia Liu	b077b6085d	Emacs-tag and some comment fix for all ARM, CellSPU, Hexagon, MBlaze, MSP430, PPC, PTX, Sparc, X86, XCore. llvm-svn: 150878	2012-02-18 12:03:15 +00:00
Craig Topper	ecf21d8132	Add X86 assembler and disassembler support for AMD SVM instructions. Original patch by Kay Tiong Khoo. Few tweaks by me for code density and to reduce replication. llvm-svn: 150873	2012-02-18 08:19:49 +00:00
Jakob Stoklund Olesen	b498ebe5b7	Use the same CALL instructions for Windows as for everything else. The different calling conventions and call-preserved registers are represented with regmask operands that are added dynamically. llvm-svn: 150708	2012-02-16 17:56:02 +00:00
Devang Patel	e03a4f051f	Intel syntax: Fix parser match class to check memory operand size. llvm-svn: 148338	2012-01-17 21:48:03 +00:00
Eli Friedman	a2b480b010	Get rid of unused codegen-only instruction. llvm-svn: 148239	2012-01-16 16:29:35 +00:00
Devang Patel	2096c1a697	Add predicate method check match memory operand size, if available. In att style asm syntax memory operand size is derived from suffix attached with mnemonic. In intel style asm syntax it is part of memory operand hence predicate method check is required to select appropriate instruction. llvm-svn: 148006	2012-01-12 01:51:42 +00:00

1 2 3 4 5 ...

1115 Commits