llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	964dad51f4	[SLP][AMDGPU] Regenerate packed-math tests and remove unused check prefix	2020-11-06 17:27:13 +00:00
Simon Pilgrim	1274be49b8	[VectorCombine][X86] Removed unused check prefixes	2020-11-06 17:27:12 +00:00
Kevin P. Neal	941987a165	[FPEnv] Use strictfp metadata in casting nodes The strictfp metadata was added to the casting AST nodes in D85960, but we aren't using that metadata yet. This patch adds that support. In order to avoid lots of ad-hoc passing around of the strictfp bits I updated the IRBuilder when moving from a function that has the Expr* to a function that lacks it. I believe we should switch to this pattern to keep the strictfp support from being overly invasive. For the purpose of testing that we're picking up the right metadata, I also made my tests use a pragma to make the AST's strictfp metadata not match the global strictfp metadata. This exposes issues that we need to deal with in subsequent patches, and I believe this is the right method for most all of our clang strictfp tests. Differential Revision: https://reviews.llvm.org/D88913	2020-11-06 11:56:12 -05:00
Jay Foad	0cb73d61a1	[TableGen] Indentation and whitespace fixes in generated code. NFC. Some of these were found by running clang-format over the generated code, although that complains about far more issues than I have fixed here. Differential Revision: https://reviews.llvm.org/D90937	2020-11-06 16:10:57 +00:00
Jay Foad	10e740a528	[AMDGPU] Simplify exp target parsing Treat any identifier as a potential exp target and diagnose them all the same way as "invalid exp target"s. Differential Revision: https://reviews.llvm.org/D90947	2020-11-06 16:09:34 +00:00
David Spickett	4187c2aea8	[Arm][MC] Remove unused prefixes in .arch_extension fp tests idiv: There is no difference between Armv7m and Thumbv7M behaviour so the specific CHECKs are not needed. The errors for Armv7-a and Thumbv7-a will always include "ARM" or "THUMB" respectively so they need their own CHECK prefix, making CHECK-V7 redundant. mp: Behaviour is dependent on whether the triple is v6/v7/v7M regardless of being Arm or Thumb. So we don't need the more specific CHECK-ARMv7M etc. simd: Errors are either v7 only, or v7 and v8 so CHECK-V8 is not needed. fp: Same as simd Reviewed By: ostannard Differential Revision: https://reviews.llvm.org/D90918	2020-11-06 15:13:07 +00:00
Jay Foad	d74fd5721f	[AMDGPU] Run exp tests on GFX9 and GFX10 too. NFC.	2020-11-06 15:03:05 +00:00
Roman Lebedev	bba9249b3e	[NFC][InstCombine] Update a few comments I missed in 0ac56e8eaaeb As pointed out in post-commit review of that commit	2020-11-06 17:38:00 +03:00
Arnold Schwaighofer	3fe61868a9	llvm.coro.id.async lowering: Parameterize how to restore the current continuation's context and restart the pipeline after splitting The `llvm.coro.suspend.async` intrinsic takes a function pointer as its argument that describes how to restore the current continuation's context from the context argument of the continuation function. Before we assumed that the current context can be restored by loading from the context argument's first pointer field (`first_arg->caller_context`). This allows for defining suspension points that reuse the current context for example. Also: llvm.coro.id.async lowering: Add llvm.coro.prepare.async intrinsic Blocks inlining until after the async coroutine was split. Also, change the async function pointer's context size position struct async_function_pointer { uint32_t relative_function_pointer_to_async_impl; uint32_t context_size; } And make the position of the `async context` argument configurable. The position is specified by the `llvm.coro.id.async` intrinsic. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90783	2020-11-06 06:22:46 -08:00
Paul C. Anagnostopoulos	72e317cd3c	[NVPTX] [TableGen] Use new features of TableGen to simplify and clarify. Differential Revision: https://reviews.llvm.org/D90861	2020-11-06 09:20:19 -05:00
Simon Moll	6dd86e401d	[VE] Add v(m)regs to preserve_all reg mask V(m)regs were defined before CSR_preserve_all was, add them now. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90912	2020-11-06 15:16:11 +01:00
Than McIntosh	b387ac6f07	[NFC] Fix typo in comment. Differential Revision: https://reviews.llvm.org/D90846	2020-11-06 09:03:07 -05:00
Paul C. Anagnostopoulos	af8abf220b	[TableGen] Clarify text and fix errors in the Programmer's Reference Differential Revision: https://reviews.llvm.org/D90881	2020-11-06 08:56:29 -05:00
Simon Moll	3379eef9b9	[VE][NFC] Refactor to support more than one calling conv Prepare for supporting different calling conventions by factoring out things into CC-dependent selection functions (getParamCC, getReturnCC). Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90911	2020-11-06 14:25:25 +01:00
Florian Hahn	c9d167829a	[SLP] Also try to vectorize incoming values of PHIs. Currently we do not consider incoming values of PHIs as roots for SLP vectorization. This means we miss scenarios like the one in the test case and PR47670. It appears quite straight-forward to consider incoming values of PHIs as roots for vectorization, but I might be missing something that makes this problematic. In terms of vectorized instructions, this applies to quite a few benchmarks across MultiSource/SPEC2000/SPEC2006 on X86 with -O3 -flto Same hash: 185 (filtered out) Remaining: 52 Metric: SLP.NumVectorInstructions Program base patch diff test-suite...ProxyApps-C++/HPCCG/HPCCG.test 9.00 27.00 200.0% test-suite...C/CFP2000/179.art/179.art.test 8.00 22.00 175.0% test-suite...T2006/458.sjeng/458.sjeng.test 14.00 30.00 114.3% test-suite...ce/Benchmarks/PAQ8p/paq8p.test 11.00 18.00 63.6% test-suite...s/FreeBench/neural/neural.test 12.00 18.00 50.0% test-suite...rimaran/enc-3des/enc-3des.test 65.00 95.00 46.2% test-suite...006/450.soplex/450.soplex.test 63.00 89.00 41.3% test-suite...ProxyApps-C++/CLAMR/CLAMR.test 177.00 250.00 41.2% test-suite...nchmarks/McCat/18-imp/imp.test 13.00 18.00 38.5% test-suite.../Applications/sgefa/sgefa.test 26.00 35.00 34.6% test-suite...pplications/oggenc/oggenc.test 100.00 133.00 33.0% test-suite...6/482.sphinx3/482.sphinx3.test 103.00 134.00 30.1% test-suite...oxyApps-C++/miniFE/miniFE.test 169.00 213.00 26.0% test-suite.../Benchmarks/Olden/tsp/tsp.test 59.00 73.00 23.7% test-suite...TimberWolfMC/timberwolfmc.test 503.00 622.00 23.7% test-suite...T2006/456.hmmer/456.hmmer.test 65.00 79.00 21.5% test-suite...libquantum/462.libquantum.test 58.00 68.00 17.2% test-suite...ternal/HMMER/hmmcalibrate.test 84.00 98.00 16.7% test-suite...ications/JM/ldecod/ldecod.test 351.00 401.00 14.2% test-suite...arks/VersaBench/dbms/dbms.test 52.00 57.00 9.6% test-suite...ce/Benchmarks/Olden/bh/bh.test 118.00 128.00 8.5% test-suite.../Benchmarks/Bullet/bullet.test 6355.00 6880.00 8.3% test-suite...nsumer-lame/consumer-lame.test 480.00 519.00 8.1% test-suite...000/183.equake/183.equake.test 226.00 244.00 8.0% test-suite...chmarks/Olden/power/power.test 105.00 113.00 7.6% test-suite...6/471.omnetpp/471.omnetpp.test 92.00 99.00 7.6% test-suite...ications/JM/lencod/lencod.test 1173.00 1261.00 7.5% test-suite...0/253.perlbmk/253.perlbmk.test 55.00 59.00 7.3% test-suite...oxyApps-C/miniAMR/miniAMR.test 92.00 98.00 6.5% test-suite...chmarks/MallocBench/gs/gs.test 446.00 473.00 6.1% test-suite.../CINT2006/403.gcc/403.gcc.test 464.00 491.00 5.8% test-suite...6/464.h264ref/464.h264ref.test 998.00 1055.00 5.7% test-suite...006/453.povray/453.povray.test 5711.00 6007.00 5.2% test-suite...FreeBench/distray/distray.test 102.00 107.00 4.9% test-suite...:: External/Povray/povray.test 4184.00 4378.00 4.6% test-suite...DOE-ProxyApps-C/CoMD/CoMD.test 112.00 117.00 4.5% test-suite...T2006/445.gobmk/445.gobmk.test 104.00 108.00 3.8% test-suite...CI_Purple/SMG2000/smg2000.test 789.00 819.00 3.8% test-suite...yApps-C++/PENNANT/PENNANT.test 233.00 241.00 3.4% test-suite...marks/7zip/7zip-benchmark.test 417.00 428.00 2.6% test-suite...arks/mafft/pairlocalalign.test 627.00 643.00 2.6% test-suite.../Benchmarks/nbench/nbench.test 259.00 265.00 2.3% test-suite...006/447.dealII/447.dealII.test 4641.00 4732.00 2.0% test-suite...lications/ClamAV/clamscan.test 106.00 108.00 1.9% test-suite...CFP2000/177.mesa/177.mesa.test 1639.00 1664.00 1.5% test-suite...oxyApps-C/RSBench/rsbench.test 66.00 65.00 -1.5% test-suite.../CINT2000/252.eon/252.eon.test 3416.00 3444.00 0.8% test-suite...CFP2000/188.ammp/188.ammp.test 1846.00 1861.00 0.8% test-suite.../CINT2000/176.gcc/176.gcc.test 152.00 153.00 0.7% test-suite...CFP2006/444.namd/444.namd.test 3528.00 3544.00 0.5% test-suite...T2006/473.astar/473.astar.test 98.00 98.00 0.0% test-suite...frame_layout/frame_layout.test NaN 39.00 nan% On ARM64, there appears to be a slight regression on SPEC2006, which might be interesting to investigate: test-suite...T2006/473.astar/473.astar.test 0.9% Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D88735	2020-11-06 12:50:32 +00:00
Simon Pilgrim	fbc5698619	[InstCombine] Regenerate narrow-math.ll tests	2020-11-06 11:35:54 +00:00
David Spickett	2f2845389a	[AArch64][MC] Remove unused CHECK-ERROR in SVE test file This file is only ever looking for errors so we can just use the default CHECK. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D90915	2020-11-06 11:21:12 +00:00
David Spickett	f05bd58db1	[AArch64][MC] Remove unused prefix in v8.4-a trace test It was unused when added and the CHECK-ERROR lines cover the possible outputs. Reviewed By: ostannard Differential Revision: https://reviews.llvm.org/D90913	2020-11-06 11:17:18 +00:00
Kazushi (Jam) Marukawa	93a65c706c	[VE] Optimize address calculation Optimize address calculations using LEA/LEASL instructions. Update comments in VEISelLowering.cpp also. Update an existing regression test optimized by this modification. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90878	2020-11-06 19:46:59 +09:00
Simon Moll	9bff67230d	[VE][TTI] don't advertise vregs/vops Claim to not have any vector support to dissuade SLP, LV and friends from generating SIMD IR for the VE target. We will take this back once vector isel is stable. Reviewed By: kaz7, fhahn Differential Revision: https://reviews.llvm.org/D90462	2020-11-06 11:12:10 +01:00
Simon Pilgrim	4abaee6284	[X86] Regenerate zext-load tests and add 32-bit test coverage.	2020-11-06 09:54:08 +00:00
Sander de Smalen	25ea9c3b36	[VPlan] NFC: Change VFRange to take ElementCount This patch changes the type of Start, End in VFRange to be an ElementCount instead of `unsigned`. This is done as preparation to make VPlans for scalable vectors, but is otherwise NFC. Reviewed By: dmgreen, fhahn, vkmr Differential Revision: https://reviews.llvm.org/D90715	2020-11-06 09:50:20 +00:00
Sander de Smalen	2112163f1d	[TypeSize] Extend UnivariateLinearPolyBase with getWithIncrement/Decrement methods This patch adds getWithIncrement/getWithDecrement methods to ElementCount and TypeSize to allow: TypeSize::getFixed(8).getWithIncrement(8) <=> TypeSize::getFixed(16) TypeSize::getFixed(16).getWithDecrement(8) <=> TypeSize::getFixed(8) TypeSize::getScalable(8).getWithIncrement(8) <=> TypeSize::getScalable(16) TypeSize::getScalable(16).getWithDecrement(8) <=> TypeSize::getScalable(8) This patch implements parts of the POC in D90342. Reviewed By: ctetreau, dmgreen Differential Revision: https://reviews.llvm.org/D90713	2020-11-06 09:01:19 +00:00
Roman Lebedev	57330778e3	[IR] CmpInst: Add getFlippedSignednessPredicate() And refactor a few places to use it	2020-11-06 11:31:09 +03:00
Roman Lebedev	c06706283c	[IR] CmpInst: add isRelational() Since there's CmpInst::isEquality(), it only makes sense to have its inverse for consistency.	2020-11-06 11:31:09 +03:00
Roman Lebedev	05a5fb18a8	[IR] CmpInst: add isEquality(Pred) Currently there is only a member version of isEquality(), which requires an actual [IF]CmpInst to be available, which isn't always possible, and is inconsistent with the general pattern here. I wanted to use it in a new patch, but it wasn't there.	2020-11-06 11:31:09 +03:00
Roman Lebedev	a6b210b265	[IR] CmpInst: add getUnsignedPredicate() There's already getSignedPredicate(); it is not symmetrical to not have its opposite. I wanted to use it in new code, but it wasn't there.	2020-11-06 11:31:08 +03:00
Max Kazantsev	d6f4f83809	[Test] One more test on IndVars with negative step	2020-11-06 14:55:50 +07:00
Yevgeny Rouban	5ddd491d11	[BranchProbabilityInfo] Introduce method copyEdgeProbabilities(). NFC A new method is introduced to allow bulk copy of outgoing edge probabilities from one block to another. This can be useful when a block is cloned from another one and we do not know if there are edge probabilities set for the original block or not. Copying outside of the BranchProbabilityInfo class makes the user unconditionally set the cloned block's edge probabilities even if they are unset for the original block. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D90839	2020-11-06 14:52:35 +07:00
Max Kazantsev	3151371b49	[Test] Run test with expensive SE inference. NFC The planned changes require expensive inference to kick in	2020-11-06 14:23:44 +07:00
Yevgeny Rouban	9995489bcb	[BranchProbabilityInfo] Remove block handles in eraseBlock() BranchProbabilityInfo::eraseBlock() is a public method and can be called without deleting the block itself. This method is made to remove the corresponding tracking handle from BranchProbabilityInfo::Handles along with the probabilities of the block. Handles.erase() call is moved to eraseBlock(). In setEdgeProbability() we need to add the block handle only once. Reviewed By: kazu Differential Revision: https://reviews.llvm.org/D90838	2020-11-06 13:13:58 +07:00
Yevgeny Rouban	a0a835256d	[BranchProbabilityInfo] Get rid of MaxSuccIdx. NFC This refactoring allows to eliminate the MaxSuccIdx map proposed in the commit a7b662d0. The idea is to remove probabilities for a block BB for all its successors one by one from first, second, ... till N-th until they are defined in Probs. This works because probabilities for the block are set at once for all its successors from number 0 to N-1 and the rest are removed if there were stale probs. The protected method setEdgeProbability(), which set probabilities for individual successor, is removed. This makes it clear that the probabilities are set in bulk by the public method with the same name. Reviewed By: kazu, MaskRay Differential Revision: https://reviews.llvm.org/D90837	2020-11-06 12:21:24 +07:00
Valentin Clement	8c87b66158	[flang][openacc] Add parsing tests and semantic check for set directive This patch adds some parsing and clause validity tests for the set directive. It makes use of the possibility introduced in patch D90770 to check the restriction where one of the default_async, device_num and device_type clauses is required but also not more than once on the set directive. Reviewed By: sameeranjoshi Differential Revision: https://reviews.llvm.org/D90771	2020-11-05 22:57:58 -05:00
Kazushi (Jam) Marukawa	851ffecdd0	[VE][NFC] Update rem.ll regression test `Replace ISD::SREM handling with KnownBits::srem to reduce code duplication` (bf04e34383b06f1b71819de7f34a1a1de2cdb6a4) changed the result of rem.ll regression test. So, updating it.	2020-11-06 10:44:29 +09:00
Luo, Yuanke	d910fc5e02	[X86] check the k pair register in ipra-reg-usage.ll. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D90810	2020-11-06 09:30:58 +08:00
Giorgis Georgakoudis	c6275bc0ce	[CodeExtractor] Replace uses of extracted bitcasts in out-of-region lifetime markers CodeExtractor handles bitcasts in the extracted region that have lifetime markers users in the outer region as outputs. That creates unnecessary alloca/reload instructions and extra lifetime markers. The patch identifies those cases, and replaces uses in out-of-region lifetime markers with new bitcasts in the outer region. Example ``` define void @foo() { entry: %0 = alloca i32 br label %extract extract: %1 = bitcast i32* %0 to i8* call void @llvm.lifetime.start.p0i8(i64 4, i8* %1) call void @use(i32* %0) br label %exit exit: call void @use(i32* %0) call void @llvm.lifetime.end.p0i8(i64 4, i8* %1) ret void } ``` Current extraction ``` define void @foo() { entry: %.loc = alloca i8, align 8 %0 = alloca i32, align 4 br label %codeRepl codeRepl: ; preds = %entry %lt.cast = bitcast i8* %.loc to i8* call void @llvm.lifetime.start.p0i8(i64 -1, i8* %lt.cast) %lt.cast1 = bitcast i32* %0 to i8* call void @llvm.lifetime.start.p0i8(i64 -1, i8* %lt.cast1) call void @foo.extract(i32* %0, i8** %.loc) %.reload = load i8, i8* %.loc, align 8 call void @llvm.lifetime.end.p0i8(i64 -1, i8* %lt.cast) br label %exit exit: ; preds = %codeRepl call void @use(i32* %0) call void @llvm.lifetime.end.p0i8(i64 4, i8* %.reload) ret void } define internal void @foo.extract(i32* %0, i8** %.out) { newFuncRoot: br label %extract exit.exitStub: ; preds = %extract ret void extract: ; preds = %newFuncRoot %1 = bitcast i32* %0 to i8* store i8* %1, i8** %.out, align 8 call void @use(i32* %0) br label %exit.exitStub } ``` Extraction with patch ``` define void @foo() { entry: %0 = alloca i32, align 4 br label %codeRepl codeRepl: ; preds = %entry %lt.cast1 = bitcast i32* %0 to i8* call void @llvm.lifetime.start.p0i8(i64 -1, i8* %lt.cast1) call void @foo.extract(i32* %0) br label %exit exit: ; preds = %codeRepl call void @use(i32* %0) %lt.cast = bitcast i32* %0 to i8* call void @llvm.lifetime.end.p0i8(i64 4, i8* %lt.cast) ret void } define internal void @foo.extract(i32* %0) { newFuncRoot: br label %extract exit.exitStub: ; preds = %extract ret void extract: ; preds = %newFuncRoot %1 = bitcast i32* %0 to i8* call void @use(i32* %0) br label %exit.exitStub } ``` Reviewed By: vsk Differential Revision: https://reviews.llvm.org/D90689	2020-11-05 17:01:08 -08:00
Sean Silva	7b66e6757c	[STLExtras] Add append_range helper. This is convenient in a lot of cases, such as when the thing you want to append is `someReallyLongFunctionName()` that you'd rather not write twice or assign to a variable for the paired begin/end calls. Differential Revision: https://reviews.llvm.org/D90894	2020-11-05 16:20:02 -08:00
Craig Topper	752bbf7d27	[RISCV] Only enable GPR<->FPR32 bitconvert isel patterns on RV32. NFCI Bitconvert requires the bitwidth to match on both sides. On RV64 the GPR size is i64 so bitconvert between f32 isn't possible. The node should never be generated so the pattern won't ever match, but moving the patterns under IsRV32 makes it more obviously impossible. It also moves it to a similar location to the patterns for the custom nodes we use for RV64.	2020-11-05 16:15:25 -08:00
Konstantin Pyzhov	8ae1180dde	[AMDGPU] Corrected declaration of VOPC instructions with SDWA addressing mode. Removed "implicit def VCC" from declarations of AMDGPU VOPC instructions since they do not implicitly write to VCC in SDWA mode. Differential Revision: https://reviews.llvm.org/D89168	2020-11-05 11:15:50 -05:00
Michael Liao	a374f1fd9a	[amdgpu] Add `llvm.amdgcn.endpgm` support. - `llvm.amdgcn.endpgm` is added to enable "abort" support. Differential Revision: https://reviews.llvm.org/D90809	2020-11-05 19:06:50 -05:00
Yuriy Chernyshov	db5411a244	Do not construct std::string from nullptr While I am trying to forbid such usages systematically in https://reviews.llvm.org/D79427 / P2166R0 to C++ standard, this PR fixes this (definitely incorrect) usage in llvm. This code is unreachable, so it could not cause any harm. Reviewed By: nikic, dblaikie Differential Revision: https://reviews.llvm.org/D87697	2020-11-05 15:23:26 -08:00
Craig Topper	699db15672	[RISCV] Add isel patterns for fnmadd/fnmsub with an fneg on the second operand instead of the first. The multiply part of FMA is commutable, but TargetSelectionDAG.td doesn't have it marked as commutable so tablegen won't automatically create the additional patterns. So manually add commuted patterns.	2020-11-05 14:00:25 -08:00
Craig Topper	69802f9ab7	[RISCV] Add test cases to show missed opportunities to use fnmadd/fnmsub if the second operand to the fma is negated rather than the first. NFC We need to add more isel patterns to handle this.	2020-11-05 14:00:25 -08:00
Valentin Clement	021bbc7b82	[openacc][openmp] Allow duplicate between required and allowed once/exclusive The validity checks introduced in D90241 are a bit too restrictive, and this patch proposes to loosen them a bit. Duplicate clauses are now checked only between the three allowed lists and between the requiredClauses and allowedClauses lists. This allows enabling some checks where a clause can be required but also appear only once on the directive. We found this kind of restriction useful on the set directive in OpenACC, for example. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D90770	2020-11-05 16:21:26 -05:00
Paul C. Anagnostopoulos	a49fe8891e	[TableGen] Clean up documentation toctrees; clarify two paragraphs. Differential Revision: https://reviews.llvm.org/D90804	2020-11-05 16:19:18 -05:00
Kazushi (Jam) Marukawa	81d1ab5e65	[VE] Add isReMaterializable and isAsCheapAsAMove flags Add isReMaterializable and isAsCheapAsAMove flags to integer instructions which are cheap. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90833	2020-11-06 06:09:10 +09:00
Reid Kleckner	3570f7b817	Fix bugs in EOL marking in command line tokenizers Add unit tests for this behavior, since the integration test for clang-cl did not catch these bugs. Fixes PR47604 Differential Revision: https://reviews.llvm.org/D90866	2020-11-05 13:01:32 -08:00
Sanjay Patel	a8868babff	[ARM] remove cost-kind predicate for cmp/sel costs This is the cmp/sel sibling to D90692. Again, the reasoning is: the throughput cost is number of instructions/uops, so size/blended costs are identical except in special cases (for example, fdiv or other known-expensive machine instructions or things like MVE that may require cracking into >1 uops). We need to check for a valid (non-null) condition type parameter because SimplifyCFG may pass nullptr for that (and so we will crash multiple regression tests without that check). I'm not sure if passing nullptr makes sense, but other code in the cost model does appear to check if that param is set or not. Differential Revision: https://reviews.llvm.org/D90781	2020-11-05 14:52:25 -05:00
Momchil Velikov	d83bfcba72	[MachineOutliner] Do not outline debug instructions The debug location is removed from any outlined instruction. This causes the MachineVerifier to crash on outlined DBG_VALUE instructions. Then, debug instructions are "invisible" to the outliner, that is, two ranges of instructions from different functions are considered identical if the only difference is debug instructions. Since a debug instruction from one function is unlikely to provide sensible debug information about all functions, sharing an outlined sequence, this patch just removes debug instructions from the outlined functions. Differential Revision: https://reviews.llvm.org/D89485	2020-11-05 19:26:51 +00:00
Amara Emerson	d203a1a538	[AArch64][GlobalISel] Add AArch64::G_DUPLANE[X] opcodes for lane duplicates. These were previously handled by pattern matching shuffles in the selector, but adding a new opcode and making it equivalent to the AArch64duplane SDAG node allows us to select more patterns, like lane indexed FMLAs (patch adding a test for that will be committed later). The pattern matching code has been simply moved to postlegalize lowering. Differential Revision: https://reviews.llvm.org/D90820	2020-11-05 11:18:11 -08:00
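
The `append_range` helper described in commit 7b66e6757c can be illustrated with a minimal stand-alone sketch. This is not LLVM's STLExtras code: the template below is a hypothetical re-implementation using only the standard library, and `someReallyLongFunctionName()` and `appendExample()` are made-up names for illustration. It shows the point made in the commit message: the range expression is written once, rather than twice for paired begin/end calls.

```cpp
#include <cassert>
#include <iterator>
#include <vector>

// Hypothetical stand-alone sketch of an append_range-style helper.
// Assumption: this is for illustration only and is not copied from LLVM.
template <typename Container, typename Range>
void append_range(Container &C, Range &&R) {
  // R is bound once, so begin and end come from the same object even when
  // the caller passes a temporary.
  C.insert(C.end(), std::begin(R), std::end(R));
}

// Hypothetical producer whose result we would rather not call twice
// or store in a named variable.
std::vector<int> someReallyLongFunctionName() { return {3, 4}; }

// Appends the producer's result to an existing vector in one call.
std::vector<int> appendExample() {
  std::vector<int> Values = {1, 2};
  append_range(Values, someReallyLongFunctionName());
  assert(Values.size() == 4);
  return Values;
}
```

Without such a helper, the caller would either invoke the producer twice (once for `begin()`, once for `end()`, which would be incorrect for a function returning by value) or introduce a temporary variable just to name the range.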

206494 Commits