llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Ulrich Weigand	535942804d	[TableGen] Support multi-alternative pattern fragments A TableGen instruction record usually contains a DAG pattern that will describe the SelectionDAG operation that can be implemented by this instruction. However, there will be cases where several different DAG patterns can all be implemented by the same instruction. The way to represent this today is to write additional patterns in the Pattern (or usually Pat) class that map those extra DAG patterns to the instruction. This usually also works fine. However, I've noticed cases where the current setup seems to require quite a bit of extra (and duplicated) text in the target .td files. For example, in the SystemZ back-end, there are quite a number of instructions that can implement an "add-with-overflow" operation. The same instructions also need to be used to implement just plain addition (simply ignoring the extra overflow output). The current solution requires creating extra Pat pattern for every instruction, duplicating the information about which particular add operands map best to which particular instruction. This patch enhances TableGen to support a new PatFrags class, which can be used to encapsulate multiple alternative patterns that may all match to the same instruction. It operates the same way as the existing PatFrag class, except that it accepts a list of DAG patterns to match instead of just a single one. As an example, we can now define a PatFrags to match either an "add-with-overflow" or a regular add operation: def z_sadd : PatFrags<(ops node:$src1, node:$src2), [(z_saddo node:$src1, node:$src2), (add node:$src1, node:$src2)]>; and then use this in the add instruction pattern: defm AR : BinaryRRAndK<"ar", 0x1A, 0xB9F8, z_sadd, GR32, GR32>; These SystemZ target changes are implemented here as well. Note that PatFrag is now defined as a subclass of PatFrags, which means that some users of internals of PatFrag need to be updated. (E.g. instead of using PatFrag.Fragment you now need to use !head(PatFrag.Fragments).) The implementation is based on the following main ideas: - InlinePatternFragments may now replace each original pattern with several result patterns, not just one. - parseInstructionPattern delays calling InlinePatternFragments and InferAllTypes. Instead, it extracts a single DAG match pattern from the main instruction pattern. - Processing of the DAG match pattern part of the main instruction pattern now shares most code with processing match patterns from the Pattern class. - Direct use of main instruction patterns in InferFromPattern and EmitResultInstructionAsOperand is removed; everything now operates solely on DAG match patterns. Reviewed by: hfinkel Differential Revision: https://reviews.llvm.org/D48545 llvm-svn: 336999	2018-07-13 13:18:00 +00:00
Chandler Carruth	02eae91c4a	[SLH] Introduce a new pass to do Speculative Load Hardening to mitigate Spectre variant #1 for x86. There is a lengthy, detailed RFC thread on llvm-dev which discusses the high level issues. High level discussion is probably best there. I've split the design document out of this patch and will land it separately once I update it to reflect the latest edits and updates to the Google doc used in the RFC thread. This patch is really just an initial step. It isn't quite ready for prime time and is only exposed via debugging flags. It has two major limitations currently: 1) It only supports x86-64, and only certain ABIs. Many assumptions are currently hard-coded and need to be factored out of the code here. 2) It doesn't include any options for more fine-grained control, either of which control flow edges are significant or which loads are important to be hardened. 3) The code is still quite rough and the testing lighter than I'd like. However, this is enough for people to begin using. I have had numerous requests from people to be able to experiment with this patch to understand the trade-offs it presents and how to use it. We would also like to encourage work to similar effect in other toolchains. The ARM folks are actively developing a system based on this for AArch64. We hope to merge this with their efforts when both are far enough along. But we also don't want to block making this available on that effort. Many thanks to the numerous people who helped along the way here. For this patch in particular, both Eric and Craig did a ton of review to even have confidence in it as an early, rough cut at this functionality. Differential Revision: https://reviews.llvm.org/D44824 llvm-svn: 336990	2018-07-13 11:13:58 +00:00
Chandler Carruth	42614f22f4	[x86] Teach the EFLAGS copy lowering to handle much more complex control flow patterns including forks, merges, and even cyles. This tries to cover a reasonably comprehensive set of patterns that still don't require PHIs or PHI placement. The coverage was inspired by the amazing variety of patterns produced when copy EFLAGS and restoring it to implement Speculative Load Hardening. Without this patch, we simply cannot make such complex and invasive changes to x86 instruction sequences due to EFLAGS. I've added "just" one test, but this test covers many different complexities and corner cases of this approach. It is actually more comprehensive, as far as I can tell, than anything that I have encountered in the wild on SLH. Because the test is so complex, I've tried to give somewhat thorough comments and an ASCII-art diagram of the control flows to make it a bit easier to read and maintain long-term. Differential Revision: https://reviews.llvm.org/D49220 llvm-svn: 336985	2018-07-13 09:39:10 +00:00
Sander de Smalen	70386770c5	[AArch64][SVE] Asm: Vector Unpack Low/High instructions. This patch adds support for the following unpack instructions: - PUNPKLO, PUNPKHI Unpack elements from low/high half and place into elements of twice their size. e.g. punpklo p0.h, p0.b - UUNPKLO, UUNPKHI Unpack elements from low/high half and SUNPKLO, SUNPKHI place into elements of twice their size after zero- or sign-extending the values. e.g. uunpklo z0.h, z0.b llvm-svn: 336982	2018-07-13 09:25:43 +00:00
Sander de Smalen	52edf6c6bd	[AArch64][SVE] Asm: Support for insert element (INSR) instructions. Insert general purpose register into shifted vector, e.g. insr z0.s, w0 insr z0.d, x0 Insert SIMD&FP scalar register into shifted vector, e.g. insr z0.b, b0 insr z0.h, h0 insr z0.s, s0 insr z0.d, d0 llvm-svn: 336979	2018-07-13 08:51:57 +00:00
Craig Topper	6d90906e48	[X86] Prefer MOVSS/SD over BLEND under optsize in isel. Previously we iseled to blend, commuted to another blend, and then commuted back to movss/movsd or blend depending on optsize. Now we do it directly. llvm-svn: 336976	2018-07-13 06:25:31 +00:00
Craig Topper	64e43783ed	[X86] Remove isel patterns that turns packed add/sub/mul/div+movss/sd into scalar intrinsic instructions. This is not an optimization we should be doing in isel. This is more suitable for a DAG combine. My main concern is a future time when we support more FPENV. Changing a packed op to a scalar op could cause us to miss some exceptions that should have occured if we had done a packed op. A DAG combine would be better able to manage this. llvm-svn: 336971	2018-07-13 04:50:39 +00:00
Matthias Braun	5579dfa88e	CodeGen: Remove pipeline dependencies on StackProtector; NFC This re-applies r336929 with a fix to accomodate for the Mips target scheduling multiple SelectionDAG instances into the pass pipeline. PrologEpilogInserter and StackColoring depend on the StackProtector analysis being alive from the point it is run until PEI, which requires that they are all scheduled in the same FunctionPassManager. Inserting a (machine) ModulePass between StackProtector and PEI results in these passes being in separate FunctionPassManagers and the StackProtector is not available for PEI. PEI and StackColoring don't use much information from the StackProtector pass, so transfering the required information to MachineFrameInfo is cleaner than keeping the StackProtector pass around. This commit moves the SSP layout information to MFI instead of keeping it in the pass. This patch set (D37580, D37581, D37582, D37583, D37584, D37585, D37586, D37587) is a first draft of the pagerando implementation described in http://lists.llvm.org/pipermail/llvm-dev/2017-June/113794.html. Patch by Stephen Crane <sjc@immunant.com> Differential Revision: https://reviews.llvm.org/D49256 llvm-svn: 336964	2018-07-13 00:08:38 +00:00
Craig Topper	96707c7d66	[X86] Add AVX512 equivalents of some isel patterns so we get EVEX instructions. These are the patterns for matching fceil, ffloor, and sqrt to intrinsic instructions if they have a MOVSS/SD. llvm-svn: 336954	2018-07-12 22:14:10 +00:00
Craig Topper	6b6c56db23	Revert r336950 and r336951 "[X86] Add AVX512 equivalents of some isel patterns so we get EVEX instructions." and "foo" One of them had a bad title and they should have been squashed. llvm-svn: 336953	2018-07-12 21:58:03 +00:00
Craig Topper	8de0a2e377	[X86] Add AVX512 equivalents of some isel patterns so we get EVEX instructions. These are the patterns for matching fceil, ffloor, and sqrt to intrinsic instructions if they have a MOVSS/SD. llvm-svn: 336951	2018-07-12 21:53:23 +00:00
Craig Topper	7b54eaa179	foo llvm-svn: 336950	2018-07-12 21:53:07 +00:00
Craig Topper	ca6b8898ee	[X86][FastISel] Support EVEX version of sqrt. llvm-svn: 336939	2018-07-12 19:58:06 +00:00
Matt Arsenault	337112c824	AMDGPU: Fix assert in truncate combine with vectors The piece above probably has the same problem, but I need to try to come up with a test for it. llvm-svn: 336935	2018-07-12 19:40:16 +00:00
Craig Topper	f8f62d55ce	[X86] Connect the flags user from PCMPISTR instructions to the correct node from the instruction. We were accidentally connecting it to result 0 instead of result 1. This was caught by the machine verifier that noticed the flags were dead, but we were using them somehow. I'm still not clear what actually happened downstream. llvm-svn: 336925	2018-07-12 18:04:05 +00:00
Craig Topper	eb5a2435ed	[X86][FastISel] Choose EVEX instructions when possible when lowering x86_sse_cvttss2si and similar intrinsics. This should fix a machine verifier error. llvm-svn: 336924	2018-07-12 18:03:56 +00:00
Sjoerd Meijer	f1cde2c9d6	[AArch64] Armv8.4-A: LDAPR & STLR with immediate offset instructions These instructions are added to AArch64 only. llvm-svn: 336913	2018-07-12 14:57:59 +00:00
Simon Pilgrim	4f5705463c	[X86][SSE] Utilize ZeroableElements for canWidenShuffleElements canWidenShuffleElements can do a better job if given a mask with ZeroableElements info. Apparently, ZeroableElements was being only used to identify AllZero candidates, but possibly we could plug it into more shuffle matchers. Original Patch by Zvi Rackover @zvi Differential Revision: https://reviews.llvm.org/D42044 llvm-svn: 336903	2018-07-12 13:29:41 +00:00
Simon Pilgrim	e7838cec64	[X86][AVX] Use Zeroable mask to improve shuffle mask widening Noticed while updating D42044, lowerV2X128VectorShuffle can improve the shuffle mask with the zeroable data to create a target shuffle mask to recognise more 'zero upper 128' patterns. NOTE: lowerV4X128VectorShuffle could benefit as well but the code needs refactoring first to discriminate between SM_SentinelUndef and SM_SentinelZero for negative shuffle indices. Differential Revision: https://reviews.llvm.org/D49092 llvm-svn: 336900	2018-07-12 13:03:58 +00:00
Simon Atanasyan	2d278225de	[mips] Mark standard encoded instructions as not being in MIPS16e Mark standard encoded instructions and pseudo "standard encoded" as not being in MIPS16e by default. Patch by Simon Dardis. Differential revision: https://reviews.llvm.org/D48379 llvm-svn: 336893	2018-07-12 08:50:11 +00:00
Craig Topper	1b33d48ffb	[X86] Remove i128 type from FR128 regclass. i128 isn't a legal type in our x86 implementation today. So remove this and the few patterns that used it until it becomes necessary. llvm-svn: 336889	2018-07-12 07:30:01 +00:00
Craig Topper	7be1d5dd11	[X86] Remove patterns and ISD nodes for the old scalar FMA intrinsic lowering. We now use llvm.fma.f32/f64 or llvm.x86.fmadd.f32/f64 intrinsics that use scalar types rather than vector types. So we don't these special ISD nodes that operate on the lowest element of a vector. llvm-svn: 336883	2018-07-12 03:42:41 +00:00
Chandler Carruth	8b8b43148d	[x86] Fix another trivial bug in x86 flags copy lowering that has been there for a long time. The boolean tracking whether we saw a kill of the flags was supposed to be per-block we are scanning and instead was outside that loop and never cleared. It requires a quite contrived test case to hit this as you have to have multiple levels of successors and interleave them with kills. I've included such a test case here. This is another bug found testing SLH and extracted to its own focused patch. llvm-svn: 336876	2018-07-12 01:43:21 +00:00
Craig Topper	89788bf942	[X86] Add patterns to use VMOVSS/SD zero masking for scalar f32/f64 select with zero. These showed up in some of the upgraded FMA code. We really need to improve these test cases more, but this helps for now. llvm-svn: 336875	2018-07-12 00:54:40 +00:00
Chandler Carruth	40027cf33a	[x86] Fix EFLAGS copy lowering to correctly handle walking past uses in multiple successors where some of the uses end up killing the EFLAGS register. There was a bug where rather than skipping to the next basic block queued up with uses once we saw a kill, we stopped processing the blocks entirely. =/ Test case produces completely nonsensical code w/o this tiny fix. This was found testing Speculative Load Hardening and split out of that work. Differential Revision: https://reviews.llvm.org/D49211 llvm-svn: 336874	2018-07-12 00:52:50 +00:00
Craig Topper	723bd78241	[X86] Remove and autoupgrade the scalar fma intrinsics with masking. This converts them to what clang is now using for codegen. Unfortunately, there seem to be a few kinks to work out still. I'll try to address with follow up patches. llvm-svn: 336871	2018-07-12 00:29:56 +00:00
Eli Friedman	a5ace10b01	[CodeGen] Emit more precise AssertZext/AssertSext nodes. This is marginally helpful for removing redundant extensions, and the code is easier to read, so it seems like an all-around win. In the new test i8-phi-ext.ll, we used to emit an AssertSext i8; now we emit an AssertZext i2, which allows the extension of the return value to be eliminated. Differential Revision: https://reviews.llvm.org/D49004 llvm-svn: 336868	2018-07-11 23:26:35 +00:00
Tom Stellard	c7b02c4b43	AMDGPU/SI: Initialize InstrInfo before TargetLoweringInfo in GCNSubtarget SITargetLowering queries SIInstrInfo in its constructor, so SIInstrInfo must be initialized first. This fixes msan buildbot failures and was introduced by r336851. llvm-svn: 336861	2018-07-11 22:15:15 +00:00
Tom Stellard	f5c828efa9	AMDGPU: Remove duplicate call to initializeSubtargetDependencies() This was added in r336851. llvm-svn: 336853	2018-07-11 21:12:03 +00:00
Tom Stellard	e236141513	AMDGPU: Refactor Subtarget classes Summary: This is a follow-up to r335942. - Merge SISubtarget into AMDGPUSubtarget and rename to GCNSubtarget - Rename AMDGPUCommonSubtarget to AMDGPUSubtarget - Merge R600Subtarget::Generation and GCNSubtarget::Generation into AMDGPUSubtarget::Generation. Reviewers: arsenm, jvesely Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D49037 llvm-svn: 336851	2018-07-11 20:59:01 +00:00
Craig Topper	72f55d6b1c	[X86] Remove patterns for inserting a load into a zero vector. We can instead block the load folding isProfitableToFold. Then isel will emit a register->register move for the zeroing part and a separate load. The PostProcessISelDAG should be able to remove the register->register move. This saves us patterns and fixes the fact that we only had unaligned load patterns. The test changes show places where we should have been using an aligned load. llvm-svn: 336828	2018-07-11 18:09:04 +00:00
Konstantin Zhuravlyov	3d91e96995	AMDGPU/NFC: Use already available explicit kernarg size instead of calculating it again when filling out the metadata. llvm-svn: 336825	2018-07-11 17:27:17 +00:00
Andrea Di Biagio	6b6463f348	[X86] Fix MayLoad/HasSideEffect flag for (V)MOVLPSrm instructions. Before revision 336728, the "mayLoad" flag for instruction (V)MOVLPSrm was inferred directly from the "default" pattern associated with the instruction definition. r336728 removed special node X86Movlps, and all the patterns associated to it. Now instruction (V)MOVLPSrm doesn't have a pattern associated to it, and the 'mayLoad/hasSideEffects' flags are left unset. When the instruction info is emitted by tablegen, method CodeGenDAGPatterns::InferInstructionFlags() sees that (V)MOVLPSrm doesn't have a pattern, and flags are undefined. So, it conservatively sets the "hasSideEffects" flag for it. As a consequence, we were losing the 'mayLoad' flag, and we were gaining a 'hasSideEffect' flag in its place. This patch fixes the issue (originally reported by Michael Holmen). The mca tests show the differences in the instruction info flags. Instructions that were affected by this problem were: MOVLPSrm/VMOVLPSrm/VMOVLPSZ128rm. Differential Revision: https://reviews.llvm.org/D49182 llvm-svn: 336818	2018-07-11 15:27:50 +00:00
Simon Atanasyan	441bff546e	[mips] Update the P5600 scheduler model not to use instruction itineraries. This mostly brings the P5600 scheduler model to a mostly complete status. There are a number of instructions which trigger the `error:'MipsP5600Model' lacks information for` error. These are certain codegen only instructions relating to MIPS64 which can be addressed by using the correct predicates for them. That will be done in a full-up patch. Patch by Simon Dardis. Differential revision: https://reviews.llvm.org/D45245 llvm-svn: 336802	2018-07-11 13:21:10 +00:00
Sjoerd Meijer	2dddfdaff5	[ARM] ParallelDSP: multiple reduction stmts in loop This fixes an issue that we were not properly supporting multiple reduction stmts in a loop, and not generating SMLADs for these cases. The alias analysis checks were done too early, making it too conservative. Differential revision: https://reviews.llvm.org/D49125 llvm-svn: 336795	2018-07-11 12:36:25 +00:00
Sander de Smalen	9a4aa02652	[AArch64][SVE] Asm: Support for COMPACT instruction. The compact instruction shuffles active elements of vector into lowest numbered elements and sets remaining elements to zero. e.g. compact z0.s, p0, z1.s llvm-svn: 336789	2018-07-11 11:22:26 +00:00
Sander de Smalen	e8509a8efa	[AArch64][SVE] Asm: Support for LAST(A\|B) and CLAST(A\|B) instructions. The LASTB and LASTA instructions extract the last active element, or element after the last active, from the source vector. The added variants are: Scalar: last(a\|b) w0, p0, z0.b last(a\|b) w0, p0, z0.h last(a\|b) w0, p0, z0.s last(a\|b) x0, p0, z0.d SIMD & FP Scalar: last(a\|b) b0, p0, z0.b last(a\|b) h0, p0, z0.h last(a\|b) s0, p0, z0.s last(a\|b) d0, p0, z0.d The CLASTB and CLASTA conditionally extract the last or element after the last active element from the source vector. The added variants are: Scalar: clast(a\|b) w0, p0, w0, z0.b clast(a\|b) w0, p0, w0, z0.h clast(a\|b) w0, p0, w0, z0.s clast(a\|b) x0, p0, x0, z0.d SIMD & FP Scalar: clast(a\|b) b0, p0, b0, z0.b clast(a\|b) h0, p0, h0, z0.h clast(a\|b) s0, p0, s0, z0.s clast(a\|b) d0, p0, d0, z0.d Vector: clast(a\|b) z0.b, p0, z0.b, z1.b clast(a\|b) z0.h, p0, z0.h, z1.h clast(a\|b) z0.s, p0, z0.s, z1.s clast(a\|b) z0.d, p0, z0.d, z1.d Please refer to the architecture specification for more details on the semantics of the added instructions. llvm-svn: 336783	2018-07-11 10:08:00 +00:00
Simon Atanasyan	7590639f3f	[mips] Remove dead code. NFC llvm-svn: 336777	2018-07-11 09:41:28 +00:00
Eric Liu	e94b6561f4	[WebAssembly] Only call llvm::value::dump() in debug build. This fixes compile error in r336759. llvm::value::dump is not available in released build. llvm-svn: 336770	2018-07-11 08:16:47 +00:00
Craig Topper	0b5293684d	[X86] The TEST instruction is eliminated when BSF/TZCNT is used Summary: These changes cover the PR#31399. Now the ffs(x) function is lowered to (x != 0) ? llvm.cttz(x) + 1 : 0 and it corresponds to the following llvm code: %cnt = tail call i32 @llvm.cttz.i32(i32 %v, i1 true) %tobool = icmp eq i32 %v, 0 %.op = add nuw nsw i32 %cnt, 1 %add = select i1 %tobool, i32 0, i32 %.op and x86 asm code: bsfl %edi, %ecx addl $1, %ecx testl %edi, %edi movl $0, %eax cmovnel %ecx, %eax In this case the 'test' instruction can't be eliminated because the 'add' instruction modifies the EFLAGS, namely, ZF flag that is set by the 'bsf' instruction when 'x' is zero. We now produce the following code: bsfl %edi, %ecx movl $-1, %eax cmovnel %ecx, %eax addl $1, %eax Patch by Ivan Kulagin Reviewers: davide, craig.topper, spatel, RKSimon Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48765 llvm-svn: 336768	2018-07-11 06:57:42 +00:00
Craig Topper	152eba3166	[X86] Remove some composite MOVSS/MOVSD isel patterns. These patterns looked for a MOVSS/SD followed by a scalar_to_vector. Or a scalar_to_vector followed by a load. In both cases we emitted a MOVSS/SD for the MOVSS/SD part, a REG_CLASS for the scalar_to_vector, and a MOVSS/SD for the load. But we have patterns that do each of those 3 things individually so there's no reason to build large patterns. Most of the test changes are just reorderings. The one test that had a meaningful change is pr30430.ll and it appears to be a regression. But its doing -O0 so I think it missed a lot of opportunities and was just getting lucky before. llvm-svn: 336762	2018-07-11 04:51:40 +00:00
Sam Clegg	eb883aa8d1	[WebAssembly] Add pass to infer prototypes for prototype-less functions See https://bugs.llvm.org/show_bug.cgi?id=35385 Differential Revision: https://reviews.llvm.org/D48471 llvm-svn: 336759	2018-07-11 04:29:36 +00:00
Stefan Pintilie	23881494ba	[Power9] Add remaining __flaot128 builtin support for FMA round to odd Implement this as it is done on GCC: __float128 a, b, c, d; a = __builtin_fmaf128_round_to_odd (b, c, d); // generates xsmaddqpo a = __builtin_fmaf128_round_to_odd (b, c, -d); // generates xsmsubqpo a = - __builtin_fmaf128_round_to_odd (b, c, d); // generates xsnmaddqpo a = - __builtin_fmaf128_round_to_odd (b, c, -d); // generates xsnmsubpqp Differential Revision: https://reviews.llvm.org/D48218 llvm-svn: 336754	2018-07-11 01:42:22 +00:00
Eli Friedman	9ae1cdf437	[ARM] Treat cmn immediates as legal in isLegalICmpImmediate. The original code attempted to do this, but the std::abs() call didn't actually do anything due to implicit type conversions. Fix the type conversions, and perform the correct check for negative immediates. This probably has very little practical impact, but it's worth fixing just to avoid confusion in the future, I think. Differential Revision: https://reviews.llvm.org/D48907 llvm-svn: 336742	2018-07-10 23:44:37 +00:00
Craig Topper	7e83b9f6a2	[X86] Remove AddedComplexity from all patterns that use X86vzmovl as their root. Some added 20 and some added 15. Its unclear when to use which value and whether they are required at all. This patch removes them all. If we start finding real world issues we may need to add them back with proper tests. llvm-svn: 336735	2018-07-10 22:23:54 +00:00
Richard Trieu	db75907caf	Fix -Wmismatched-tags warning class -> struct in forward declaration. llvm-svn: 336733	2018-07-10 22:09:33 +00:00
Craig Topper	b9d34784e3	[X86] Teach X86InstrInfo::commuteInstructionImpl to use MOVSD/MOVSS for BLEND under optsize when the immediate allows it. Isel currently emits movss/movsd a lot of the time and an accidental double commute turns it into a blend. Ideally we'd select blend directly in isel under optspeed and not rely on the double commute to create blend. llvm-svn: 336731	2018-07-10 22:02:23 +00:00
Craig Topper	5c94bad0fe	[X86] Remove X86ISD::MOVLPS and X86ISD::MOVLPD. NFCI These ISD nodes try to select the MOVLPS and MOVLPD instructions which are special load only instructions. They load data and merge it into the lower 64-bits of an XMM register. They are logically equivalent to our MOVSD node plus a load. There was only one place in X86ISelLowering that used MOVLPD and no places that selected MOVLPS. The one place that selected MOVLPD had to choose between it and MOVSD based on whether there was a load. But lowering is too early to tell if the load can really be folded. So in isel we have patterns that use MOVSD for MOVLPD if we can't find a load. We also had patterns that select the MOVLPD instruction for a MOVSD if we can find a load, but didn't choose the MOVLPD ISD opcode for some reason. So it seems better to just standardize on MOVSD ISD opcode and manage MOVSD vs MOVLPD instruction with isel patterns. llvm-svn: 336728	2018-07-10 21:00:22 +00:00
Scott Linder	ed927a89c9	[AMDGPU] Fix layering issue with AMDGPUHSAMetadataStreamer (NFC) llvm-svn: 336722	2018-07-10 20:07:22 +00:00
Craig Topper	76bf466495	[X86] Remove dead SDNode object from X86InstrFragmentsSIMD.td. NFC It points to an opcode that doesn't exist. llvm-svn: 336720	2018-07-10 20:03:51 +00:00

1 2 3 4 5 ...

48319 Commits