llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Derek Schuff	c55a3d43a9	Allow misaligned stores in x86 fast-isel. In X86FastISel::X86SelectStore(), improperly aligned stores are rejected and handled by the DAG-based ISel. However, X86FastISel::X86SelectLoad() makes no such requirement. There doesn't appear to be an x86 architectural correctness issue with allowing potentially unaligned store instructions. This patch removes this restriction. Patch by Jim Stichnot. llvm-svn: 179774	2013-04-18 17:41:08 +00:00
Chad Rosier	da17fa9b38	[ms-inline asm] Simplify some logic and add a FIXME for unhandled unary minus. llvm-svn: 179765	2013-04-18 16:28:19 +00:00
Chad Rosier	5dffee4c99	Make this private method. llvm-svn: 179764	2013-04-18 16:13:18 +00:00
Chad Rosier	1cb3175415	[ms-inline asm] These should be int64_t, not uint64_t. llvm-svn: 179724	2013-04-17 21:14:38 +00:00
Chad Rosier	1efbeb717f	[ms-inline asm] Add support for the minus unary operator. Previously, we were unable to handle cases such as __asm mov eax, 8*-8. This patch also attempts to simplify the state machine. Further, the error reporting has been improved. Test cases included, but more will be added to the clang side shortly. rdar://13668445 llvm-svn: 179719	2013-04-17 21:01:45 +00:00
Eli Bendersky	802610971f	This patch teaches x86 fast-isel to generate the native div/idiv instructions for the sdiv/srem/udiv/urem bitcode instructions. This is done for the i8, i16, and i32 types, as well as i64 for the x86_64 target. Patch by Jim Stichnoth llvm-svn: 179715	2013-04-17 20:10:13 +00:00
Arnold Schwaighofer	e1dc8ae8c8	X86 cost model: Exit before calling getSimpleVT on non-simple VTs getSimpleVT can only handle simple value types. radar://13676022 llvm-svn: 179714	2013-04-17 20:04:53 +00:00
Chad Rosier	441bf36faa	[ms-inline asm] Add support for parsing complex immediate expressions. Test cases to be submitted on clang side shortly. rdar://13663768 and PR15760 llvm-svn: 179655	2013-04-17 00:11:46 +00:00
Chad Rosier	9a757bb4ea	Remove unused variable from previous refactor. llvm-svn: 179611	2013-04-16 18:20:10 +00:00
Chad Rosier	128e5ae5af	[ms-inline asm] Refactor. No functional change intended. llvm-svn: 179610	2013-04-16 18:15:40 +00:00
Chad Rosier	0aff0eaab6	[ms-inline asm] Remove some dead code. llvm-svn: 179607	2013-04-16 17:27:40 +00:00
Andrew Trick	835ac00f78	X86 machine model: reduce SandyBridge and Haswell ILPWindow. The initial values were arbitrary. I want them to be more conservative. This represents the number of latency cycles hidden by OOO execution. In practice, I think it should be within a small factor of the complex floating point operation latency so the scheduler can make some attempt to hide latency even for smallish blocks. These are by no means the best values, just a starting point for tuning heuristics. Some benchmarks such as TSVC run faster with this lower value for SandyBridge. I haven't run anything on Haswell, but it's shouldn't be 2x SB. llvm-svn: 179450	2013-04-13 06:07:43 +00:00
Andrew Trick	d9efdff16f	Catch another case where SD fails to propagate node order. I need to handle this for the test case in my following scheduler commit. Work is already under way to redesign the mechanism for node order propagation because this case by case approach is unmaintainable. llvm-svn: 179448	2013-04-13 06:07:36 +00:00
Chad Rosier	3d83c7a3e0	[ms-inline asm] Simplify the logic by using parsePrimaryExpr. No functional change intended. Test case previously added in r178568. Part of rdar://13611297 llvm-svn: 179425	2013-04-12 23:03:20 +00:00
Chad Rosier	81c2f41261	[ms-inline asm] Move this logic into a static function as it's only applicable when parsing MS-style inline assembly. No functional change intended. llvm-svn: 179407	2013-04-12 20:20:54 +00:00
Chad Rosier	4e3d67d4c5	[ms-inline asm] Address the FIXME for ImmDisp before brackets. This is a follow on to r179393 and r179399. Test case to be added on the clang side. Part of rdar://13453209 llvm-svn: 179403	2013-04-12 19:51:49 +00:00
Chad Rosier	443d79152d	[ms-inline asm] Have the [ Symbol ] case fall into the more general logic. This is a follow on to r179393. Test case to be added on the clang side. Part of rdar://13453209 llvm-svn: 179399	2013-04-12 18:54:20 +00:00
Chad Rosier	b3960a88c7	[ms-inline asm] Add support for operands that include both a symbol and an immediate displacement. Specifically, add support for generating the proper IR. We've been able to parse this for some time now. Test case to be added on the clang side. Part of rdar://13453209 llvm-svn: 179393	2013-04-12 18:21:18 +00:00
Chad Rosier	765ec71e8d	[ms-inline asm] Add support for using the LENGTH, TYPE, and SIZE operators with variables that use namespace alias qualifiers. Test case coming on clang side shortly. Part of rdar://13499009 llvm-svn: 179343	2013-04-11 23:57:04 +00:00
Chad Rosier	2aa88ce036	[ms-inline asm] Add support for using offsetof operator with variables that use namespace alias qualifiers. Test case coming on clang side shortly. Part of rdar://13499009 llvm-svn: 179339	2013-04-11 23:37:34 +00:00
Chad Rosier	9323db9524	[ms-inline asm] Pass a StringRef reference to ParseIntelVarWithQualifier so we can build up the identifier string. No test case as support for looking up these type of identifiers hasn't been implemented on the clang side. Part of rdar://13499009 llvm-svn: 179336	2013-04-11 23:24:15 +00:00
Chad Rosier	b3012065df	[ms-inline asm] Remove brackets from around a symbol reference in the target specific logic. This makes the code much less fragile. Test case coming on the clang side in a moment. rdar://13634327 llvm-svn: 179323	2013-04-11 21:49:30 +00:00
Michael Liao	877d1576e6	Optimize vector select from all 0s or all 1s As packed comparisons in AVX/SSE produce all 0s or all 1s in each SIMD lane, vector select could be simplified to AND/OR or removed if one or both values being selected is all 0s or all 1s. llvm-svn: 179267	2013-04-11 05:15:54 +00:00
Michael Liao	75c886a312	Add CLAC/STAC instruction encoding/decoding support As these two instructions in AVX extension are privileged instructions for special purpose, it's only expected to be used in inlined assembly. llvm-svn: 179266	2013-04-11 04:52:28 +00:00
Michael Liao	87125582e9	Enhance bool simplifcation in X86 to handle more cases This patch is revised based on patch from Victor Umansky <victor.umansky@intel.com>. More cases are handled in X86's bool simplification, i.e. - SETCC_CARRY - value is truncated to i1 with AND As a by-product, PR5443 is also fixed. llvm-svn: 179265	2013-04-11 04:43:09 +00:00
Nico Rieck	8e22855ea6	MC: Support COFF image-relative MCSymbolRefs Add support for the COFF relocation types IMAGE_REL_I386_DIR32NB and IMAGE_REL_AMD64_ADDR32NB for 32- and 64-bit respectively. These are similar to normal 4-byte relocations except that they do not include the base address of the image. Image-relative relocations are used for debug information (32-bit) and SEH unwind tables (64-bit). A new MCSymbolRef variant called 'VK_COFF_IMGREL32' is introduced to specify such relocations. For AT&T assembly, this variant can be accessed using the symbol suffix '@imgrel'. llvm-svn: 179240	2013-04-10 23:28:17 +00:00
Kay Tiong Khoo	5f12d15d44	fixed xsave, xsaveopt, xrstor mnemonics with intel syntax; added test cases llvm-svn: 179223	2013-04-10 21:52:25 +00:00
Kay Tiong Khoo	ba75929324	fixed to disassemble with tab after mnemonic rather than space llvm-svn: 179215	2013-04-10 21:17:58 +00:00
Preston Gurd	de5cf7a23b	In the X86 back end, getMemoryOperandNo() returns the offset into the operand array of the start of the memory reference descriptor. Additional code in EncodeInstruction provides an additional adjustment. This patch places that additional code in a separate function, called getOperandBias, so that any caller of getMemoryOperandNo can also call getOperandBias. llvm-svn: 179211	2013-04-10 20:11:59 +00:00
Chad Rosier	b0156236cb	Tidy up, fix and simplify a few of the SMLocs. Prior to r179109 the Start SMLoc wasn't always the start of the operand. If there was a symbol reference, then Start pointed to that token. It's very likely there are other places that need to be updated. llvm-svn: 179210	2013-04-10 20:07:47 +00:00
Chad Rosier	cc61ca2355	Remove unused variable. llvm-svn: 179205	2013-04-10 18:46:58 +00:00
Chad Rosier	411fdc3a74	Reapply r179115, but use parsePrimaryExpression a little more judiciously. Test cases that regressed due to r179115, plus a few more, were added in r179182. Original commit message below: [ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to parse an identifier. Otherwise, parseExpression may parse multiple tokens, which makes it impossible to properly compute an immediate displacement. An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in the below example: __asm mov eax, [Symbol + ImmDisp] Part of rdar://13611297 llvm-svn: 179187	2013-04-10 17:35:30 +00:00
Evan Cheng	9f82233851	__sincosf_stret returns sinf / cosf in bits 0:31 and 32:63 of xmm0, not in xmm0 / xmm1. rdar://13599493 llvm-svn: 179141	2013-04-10 01:26:07 +00:00
Chad Rosier	aa67701688	Cleanup. No functional change intended. llvm-svn: 179129	2013-04-09 20:58:48 +00:00
Chad Rosier	85e2894bd6	Cleanup. No functional change intended. llvm-svn: 179125	2013-04-09 20:44:09 +00:00
Chad Rosier	e040ffba05	Revert r179115 as it looks to have killed the ASan tests. llvm-svn: 179120	2013-04-09 19:59:12 +00:00
Chad Rosier	4ef6c35911	[ms-inline asm] Use parsePrimaryExpr in lieu of parseExpression if we need to parse an identifier. Otherwise, parseExpression may parse multiple tokens, which makes it impossible to properly compute an immediate displacement. An example of such a case is the source operand (i.e., [Symbol + ImmDisp]) in the below example: __asm mov eax, [Symbol + ImmDisp] The existing test cases exercise this patch. rdar://13611297 llvm-svn: 179115	2013-04-09 19:34:59 +00:00
Chad Rosier	5ec822982c	[ms-inline asm] Maintain a StringRef to reference a symbol in a parsed operand, rather than deriving the StringRef from the Start and End SMLocs. Using the Start and End SMLocs works fine for operands such as [Symbol], but not for operands such as [Symbol + ImmDisp]. All existing test cases that reference a variable exercise this patch. rdar://13602265 llvm-svn: 179109	2013-04-09 17:53:49 +00:00
Arnold Schwaighofer	3218da2403	X86 cost model: Model cost for uitofp and sitofp on SSE2 The costs are overfitted so that I can still use the legalization factor. For example the following kernel has about half the throughput vectorized than unvectorized when compiled with SSE2. Before this patch we would vectorize it. unsigned short A[1024]; double B[1024]; void f() { int i; for (i = 0; i < 1024; ++i) { B[i] = (double) A[i]; } } radar://13599001 llvm-svn: 179033	2013-04-08 18:05:48 +00:00
Chad Rosier	7583b0a3c3	[ms-inline asm] Add support for ImmDisp [ Symbol ] memory operands. rdar://13521249 llvm-svn: 179030	2013-04-08 17:43:47 +00:00
Bill Wendling	f2bb7aa5f8	Use the target options specified on a function to reset the back-end. During LTO, the target options on functions within the same Module may change. This would necessitate resetting some of the back-end. Do this for X86, because it's a Friday afternoon. llvm-svn: 178917	2013-04-05 21:52:40 +00:00
Chad Rosier	59dbb08c9b	[ms-inline asm] Add support for numeric displacement expressions in bracketed memory operands. Essentially, this layers an infix calculator on top of the parsing state machine. The scale on the index register is still expected to be an immediate __asm mov eax, [eax + ebx4] and will not work with more complex expressions. For example, __asm mov eax, [eax + ebx(22)] The plus and minus binary operators assume the numeric value of a register is zero so as to not change the displacement. Register operands should never be an operand for a multiply or divide operation; the scaleindexreg expression is always replaced with a zero on the operand stack to prevent such a case. rdar://13521380 llvm-svn: 178881	2013-04-05 16:28:55 +00:00
Arnold Schwaighofer	52871434dd	X86 cost model: Differentiate cost for vector shifts of constants SSE2 has efficient support for shifts by a scalar. My previous change of making shifts expensive did not take this into account marking all shifts as expensive. This would prevent vectorization from happening where it is actually beneficial. With this change we differentiate between shifts of constants and other shifts. radar://13576547 llvm-svn: 178808	2013-04-04 23:26:24 +00:00
Arnold Schwaighofer	861251004b	CostModel: Add parameter to instruction cost to further classify operand values On certain architectures we can support efficient vectorized version of instructions if the operand value is uniform (splat) or a constant scalar. An example of this is a vector shift on x86. We can efficiently support for (i = 0 ; i < ; i += 4) w[0:3] = v[0:3] << <2, 2, 2, 2> but not for (i = 0; i < ; i += 4) w[0:3] = v[0:3] << x[0:3] This patch adds a parameter to getArithmeticInstrCost to further qualify operand values as uniform or uniform constant. Targets can then choose to return a different cost for instructions with such operand values. A follow-up commit will test this feature on x86. radar://13576547 llvm-svn: 178807	2013-04-04 23:26:21 +00:00
Arnold Schwaighofer	329430aeac	X86 cost model: Vector shifts are expensive in most cases The default logic does not correctly identify costs of casts because they are marked as custom on x86. For some cases, where the shift amount is a scalar we would be able to generate better code. Unfortunately, when this is the case the value (the splat) will get hoisted out of the loop, thereby making it invisible to ISel. radar://13130673 radar://13537826 llvm-svn: 178703	2013-04-03 21:46:05 +00:00
Timur Iskhodzhanov	ecd533f0ec	Fix SRet for thiscall in i686-pc-win32 llvm-svn: 178634	2013-04-03 11:27:54 +00:00
Eric Christopher	81bacb7670	Formatting. llvm-svn: 178589	2013-04-02 23:06:40 +00:00
Chad Rosier	1fe97eda48	[ms-inline asm] Add support for parsing variables with namespace alias qualifiers. This patch only adds support for parsing these identifiers in the X86AsmParser. The front-end interface isn't capable of looking up these identifiers at this point in time. The end result is the compiler now errors during object file emission, rather than at parse time. Test case coming shortly. Part of rdar://13499009 and PR13340 llvm-svn: 178566	2013-04-02 20:02:33 +00:00
Chad Rosier	908153170e	[fast-isel] Use the correct API to disable FastLowerArguments for Win64. llvm-svn: 178549	2013-04-02 16:31:41 +00:00
Andrew Trick	b6ac50177f	The divide unit is not pipeline, but it is still buffered. Buffered means a later divide may be executed out-of-order while a prior divide is sitting (buffered) in a reservation station. You can tell it's not pipelined, because operations that use it reserve it for more than one cycle: def : WriteRes<WriteIDiv, [HWPort0, HWDivider]> { let Latency = 25; let ResourceCycles = [1, 10]; } We don't currently distinguish between an unpipeline operation and one that is split into multiple micro-ops requiring the same unit. Except that the later may have NumMicroOps > 1 if they also consume issue/dispatch resources. llvm-svn: 178519	2013-04-02 01:58:47 +00:00
Benjamin Kramer	7634eefc37	X86TTI: Add accurate costs for itofp operations, based on the actual instruction counts. llvm-svn: 178459	2013-04-01 10:23:49 +00:00
Benjamin Kramer	790bd5fb50	X86: Promote sitofp <8 x i16> to <8 x i32> when AVX is available. A vector sext + sitofp is a lot cheaper than 8 scalar conversions. llvm-svn: 178448	2013-03-31 12:49:15 +00:00
Benjamin Kramer	50725426cb	Change '@SECREL' suffix to GAS-compatible '@SECREL32'. '@SECREL' is what is used by the Microsoft assembler, but GNU as expects '@SECREL32'. With the patch, the MC-generated code works fine in combination with a recent GNU as (2.23.51.20120920 here). Patch by David Nadlinger! Differential Revision: http://llvm-reviews.chandlerc.com/D429 llvm-svn: 178427	2013-03-30 16:21:50 +00:00
Benjamin Kramer	279e5cfa9a	Remove the old CodePlacementOpt pass. It was superseded by MachineBlockPlacement and disabled by default since LLVM 3.1. llvm-svn: 178349	2013-03-29 17:14:24 +00:00
Michael Liao	427149cbcf	Add support of RDSEED defined in AVX2 extension llvm-svn: 178314	2013-03-28 23:41:26 +00:00
Michael Liao	aec693ab31	Enhance boolean simplification to handle 16-/64-bit RDRAND - RDRAND always clears the destination value when a random value is not available (i.e. CF == 0). This value is truncated or zero-extended as the false boolean value to be returned. Boolean simplification needs to skip this 'zext' or 'trunc' node. llvm-svn: 178312	2013-03-28 23:38:52 +00:00
Michael Liao	d961d7a7b3	Skip moving call address loading into callseq when targets prefer register indirect call. To enable a load of a call address to be folded with that call, this load is moved from outside of callseq into callseq. Such a moving adds a non-glued node (that load) into a glued sequence. This non-glue load is only removed when DAG selection folds them into a memory form call instruction. When such instruction selection is disabled, it breaks DAG schedule. To prevent that, such moving is disabled when target favors register indirect call. Previous workaround disabling CALL32m/CALL64m insn selection is removed. llvm-svn: 178308	2013-03-28 23:13:21 +00:00
Nadav Rotem	705ec0e7e3	Add the X86 FMAs to the scheduling model. llvm-svn: 178303	2013-03-28 22:54:45 +00:00
Nadav Rotem	401bba05fe	Add the Haswell machine model. llvm-svn: 178301	2013-03-28 22:34:46 +00:00
Nadav Rotem	e5f49b65f6	Remove the unused port from the SandyBridge machine model llvm-svn: 178300	2013-03-28 22:32:41 +00:00
Michael Liao	30577169a4	Add ADX CPUID detection llvm-svn: 178299	2013-03-28 22:29:53 +00:00
Timur Iskhodzhanov	d7de83d51c	Make Win32 put the SRet address into EAX, fixes PR15556 llvm-svn: 178291	2013-03-28 21:30:04 +00:00
Preston Gurd	787c145b5f	This patch follows is a follow up to r178171, which uses the register form of call in preference to memory indirect on Atom. In this case, the patch applies the optimization to the code for reloading spilled registers. The patch also includes changes to sibcall.ll and movgs.ll, which were failing on the Atom buildbot after the first patch was applied. This patch by Sriram Murali. llvm-svn: 178193	2013-03-27 23:16:18 +00:00
Chad Rosier	09bc7a9c8d	[ms-inline asm] Add support of imm displacement before bracketed memory expression. Specifically, this syntax: ImmDisp [ BaseReg + Scale*IndexReg + Disp ] We don't currently support: ImmDisp [ Symbol ] rdar://13518671 llvm-svn: 178186	2013-03-27 21:49:56 +00:00
Preston Gurd	b6ed645cb6	For the current Atom processor, the fastest way to handle a call indirect through a memory address is to load the memory address into a register and then call indirect through the register. This patch implements this improvement by modifying SelectionDAG to force a function address which is a memory reference to be loaded into a virtual register. Patch by Sriram Murali. llvm-svn: 178171	2013-03-27 19:14:02 +00:00
Hal Finkel	ba08d9519e	Fix typo (common to both X86 and PPC) Thanks to Bill Schmidt for pointing this out during code review! llvm-svn: 178170	2013-03-27 19:10:42 +00:00
Michael Liao	bd3f6b0eea	Add XTEST codegen support llvm-svn: 178083	2013-03-26 22:47:01 +00:00
Michael Liao	3515920fbd	Add HLE target feature llvm-svn: 178082	2013-03-26 22:46:02 +00:00
Jakob Stoklund Olesen	43b68b7eb9	Enable SandyBridgeModel for all modern Intel P6 descendants. All Intel CPUs since Yonah look a lot alike, at least at the granularity of the scheduling models. We can add more accurate models for processors that aren't Sandy Bridge if required. Haswell will probably need its own. The Atom processor and anything based on NetBurst is completely different. So are the non-Intel chips. llvm-svn: 178080	2013-03-26 22:19:12 +00:00
Jakob Stoklund Olesen	b728975493	Annotate the remaining x86 instructions with SchedRW lists. Now all x86 instructions that have itinerary classes also have SchedRW lists. This is required before the new scheduling models can be used. There are still unannotated instructions remaining, but they don't have itinerary classes either. llvm-svn: 178051	2013-03-26 18:24:22 +00:00
Jakob Stoklund Olesen	b7896f4f21	Annotate x87 and mmx instructions with SchedRW lists. This only covers the instructions that were given itinerary classes for the Atom model. llvm-svn: 178050	2013-03-26 18:24:20 +00:00
Jakob Stoklund Olesen	68af421d67	Annotate control instructions with SchedRW lists. This could definitely be more granular. I am not sure if it makes a difference. llvm-svn: 178049	2013-03-26 18:24:17 +00:00
Jakob Stoklund Olesen	9e6a9659f1	Annotate the rest of X86InstrInfo.td with SchedRW lists. llvm-svn: 178048	2013-03-26 18:24:15 +00:00
Michael Liao	969ef73c31	Add PREFETCHW codegen support - Add 'PRFCHW' feature defined in AVX2 ISA extension llvm-svn: 178040	2013-03-26 17:47:11 +00:00
Michael Liao	a0a4d0c6f7	Revise alignment checking/calculation on 256-bit unaligned memory access - It's still considered aligned when the specified alignment is larger than the natural alignment; - The new alignment for the high 128-bit vector should be min(16, alignment) as the pointer is advanced by 16, a power-of-2 offset. llvm-svn: 177947	2013-03-25 23:50:10 +00:00
Jakob Stoklund Olesen	d323c87f9a	Add a scheduling model for Intel Sandy Bridge microarchitecture. The model isn't hooked up by this patch because the instruction set isn't fully annotated yet. llvm-svn: 177942	2013-03-25 23:37:17 +00:00
Jakob Stoklund Olesen	0ac9ff5688	Remove IIC_DEFAULT from X86Schedule.td All the instructions tagged with IIC_DEFAULT had nothing in common, and we already have a NoItineraries class to represent untagged instructions. llvm-svn: 177937	2013-03-25 23:12:41 +00:00
Jakob Stoklund Olesen	19c4788c8a	Annotate X86InstrCompiler.td with SchedRW lists. llvm-svn: 177936	2013-03-25 23:07:35 +00:00
Jakob Stoklund Olesen	b81c63e1a2	Annotate shifts and rotates with SchedRW lists. llvm-svn: 177935	2013-03-25 23:07:32 +00:00
NAKAMURA Takumi	eeb13ad532	X86DisassemblerDecoder.c: Make this C89-compliant. llvm-svn: 177910	2013-03-25 20:55:49 +00:00
NAKAMURA Takumi	ce709c8119	Whitespace. llvm-svn: 177909	2013-03-25 20:55:43 +00:00
Dave Zarzycki	2949721b7d	x86 -- add the XTEST instruction llvm-svn: 177888	2013-03-25 18:59:43 +00:00
Dave Zarzycki	b443618b88	x86 -- disassemble the REP/REPNE prefix when needed This fixes Apple bug: 13493622 llvm-svn: 177887	2013-03-25 18:59:38 +00:00
Jakob Stoklund Olesen	6ffb4136aa	Add a WriteMicrocoded for ancient microcoded instructions. llvm-svn: 177611	2013-03-21 00:07:17 +00:00
Jakob Stoklund Olesen	b3af273625	Model prefetches and barriers as loads. It's not yet clear if these instructions need a more careful model. llvm-svn: 177599	2013-03-20 23:09:53 +00:00
Jakob Stoklund Olesen	042f102514	Add a catch-all WriteSystem SchedWrite type. This is used for all the expensive system instructions. llvm-svn: 177598	2013-03-20 23:09:50 +00:00
Jakob Stoklund Olesen	305e22bdab	Annotate the remaining SSE MOV instructions. llvm-svn: 177592	2013-03-20 22:37:16 +00:00
Jakob Stoklund Olesen	96a403dd67	Annotate SSE horizontal and integer instructions. llvm-svn: 177591	2013-03-20 22:37:13 +00:00
Michael Liao	fe785c9579	Correct cost model for vector shift on AVX2 - After moving logic recognizing vector shift with scalar amount from DAG combining into DAG lowering, we declare to customize all vector shifts even vector shift on AVX is legal. As a result, the cost model needs special tuning to identify these legal cases. llvm-svn: 177586	2013-03-20 22:01:10 +00:00
Jakob Stoklund Olesen	d202f35d06	Add some missing SSE annotations. llvm-svn: 177540	2013-03-20 16:56:39 +00:00
Jakob Stoklund Olesen	327db7c83a	Annotate remaining IIC_BIN_* instructions. llvm-svn: 177539	2013-03-20 16:56:36 +00:00
Michael Liao	d0e167edfb	Fix PR15296 - Move SRA/SRL/SHL lowering support from DAG combination to DAG lowering to support extended 256-bit integer in AVX but not AVX2. llvm-svn: 177478	2013-03-20 02:33:21 +00:00
Michael Liao	8be4fbefe3	Mark all variable shifts needing customizing - Prepare moving logic from DAG combining into DAG lowering. There's no functionality change. llvm-svn: 177477	2013-03-20 02:28:20 +00:00
Michael Liao	3b72fc2823	Move scalar immediate shift lowering into a dedicated func - no functionality change llvm-svn: 177476	2013-03-20 02:20:36 +00:00
Jakob Stoklund Olesen	3b039fa614	Annotate various null idioms with SchedRW lists. llvm-svn: 177461	2013-03-19 23:23:31 +00:00
Jakob Stoklund Olesen	a8c3f3d12c	Annotate SSE float conversions with SchedRW lists. llvm-svn: 177460	2013-03-19 23:23:29 +00:00
Jakob Stoklund Olesen	c4c0f667dc	Annotate X86InstrCMovSetCC.td with SchedRW lists. llvm-svn: 177459	2013-03-19 23:23:26 +00:00
Chad Rosier	3019e30cac	[ms-inline asm] Move the immediate asm rewrite into the target specific logic as a QOI cleanup. No functional change. Tests already in place. rdar://13456414 llvm-svn: 177446	2013-03-19 21:58:18 +00:00
Jakob Stoklund Olesen	71393fdd98	Annotate X86InstrCompiler.td with SchedRW lists. Add a new WriteZero SchedWrite type for the common dependency-breaking instructions that clear a register. llvm-svn: 177442	2013-03-19 21:16:56 +00:00
Chad Rosier	d2b8daa7f4	[ms-inline asm] Create a helper function, CreateMemForInlineAsm, that creates an X86Operand, but also performs a Sema lookup and adds the sizing directive when appropriate. Use this when parsing a bracketed statement. This is necessary to get the instruction matching correct as well. Test case coming on clang side. rdar://13455408 llvm-svn: 177439	2013-03-19 21:11:56 +00:00

1 2 3 4 5 ...

9246 Commits