llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Kevin P. Neal	c61aa5bf2a	[FPEnv] Teach the IRBuilder about invoke's correct use of the strictfp attribute. Similar to D69312, and documented in D69839, the IRBuilder needs to add the strictfp attribute to invoke instructions when constrained floating point is enabled. Differential Revision: https://reviews.llvm.org/D93134	2020-12-15 12:38:10 -05:00
Simon Pilgrim	e270201f09	[X86] Regenerate avxvnni.ll tests - remove unused prefix. NFCI.	2020-12-15 17:35:32 +00:00
Simon Pilgrim	1fa0a52f0b	[X86] Explicitly use SDValue instead of auto. NFCI. Fix static analyzer warning about not using a SDValue&	2020-12-15 17:27:25 +00:00
Simon Pilgrim	76b4fd1b08	SeparateConstOffsetFromGEP::lowerToSingleIndexGEPs - don't use dyn_cast_or_null. NFCI. ResultPtr is guaranteed to be non-null - and using dyn_cast_or_null causes unnecessary static analyzer warnings. We can't say the same for FirstResult AFAICT, so keep dyn_cast_or_null for that.	2020-12-15 17:27:25 +00:00
Amy Huang	eedfec2ca8	[llvm-symbolizer] Add missing include for config.h The cmake variable LLVM_ENABLE_DIA_SDK was being used here but was undefined because config.h wasn't included. Differential Revision: https://reviews.llvm.org/D93309	2020-12-15 09:20:31 -08:00
Raul Tambre	d6a1ea07f9	[PGO] Allow overriding -vp-counters-per-site In some build configurations more than 1.5 might be required. Parameterize so it can be changed by the user. Reviewed By: yamauchi Differential Revision: https://reviews.llvm.org/D93281	2020-12-15 19:03:27 +02:00
Simon Pilgrim	878c787836	[X86] Remove unnecessary SUBV_BROADCAST combines. NFCI. Noticed while dealing with D92645 - these are now handled by getFauxShuffleMask + shuffle combining code.	2020-12-15 16:54:34 +00:00
Mircea Trofin	a3554dd52a	[utils] The func_dict for a prefix may just be empty Follow up from D92965 - since we try to find failed prefixes after each RUN line, it's possible for the whole list of functions for a prefix to be non-existent, which is fine - this happens when none of the RUN lines seen so far used the prefix.	2020-12-15 08:48:37 -08:00
Paul Walker	13de76cd3b	[NFC] Fix a few SVEInstrInfo related stylistic issues.	2020-12-15 16:10:38 +00:00
David Green	5d15bc64e1	[ARM] Match dual lane vmovs from insert_vector_elt MVE has a dual lane vector move instruction, capable of moving two general purpose registers into lanes of a vector register. They look like one of: vmov q0[2], q0[0], r2, r0 vmov q0[3], q0[1], r3, r1 They only accept these lane indices though (and only insert into an i32), either moving lanes 1 and 3, or 0 and 2. This patch adds some tablegen patterns for them, selecting from vector insert elements. Because the insert_elements are known to be canonicalized to ascending order there are several patterns that we need to select. These lane indices are: 3 2 1 0 -> vmovqrr 31; vmovqrr 20 3 2 1 -> vmovqrr 31; vmov 2 3 1 -> vmovqrr 31 2 1 0 -> vmovqrr 20; vmov 1 2 0 -> vmovqrr 20 With the top one being the most common. All other potential patterns of lane indices will be matched by a combination of these and the individual vmov pattern already present. This does mean that we are selecting several machine instructions at once due to the need to re-arrange the inserts, but in this case there is nothing else that will attempt to match an insert_vector_elt node. Differential Revision: https://reviews.llvm.org/D92553	2020-12-15 15:58:52 +00:00
Ulrich Weigand	4f172e6801	[SystemZ] Remove most hard-coded R1D instances for sibcalls Indirect sibling calls need to use %r1 to hold the target address. This is currently hard-coded in many places. This is not only unnecessary, but makes future changes in this area difficult. This patch now encodes the target address as operand without hard coding a register in most places throughout the MI back-end. Code generation still always uses %r1, but this is now decided solely in one place in SystemZTargetLowering::LowerCall. NFC intended.	2020-12-15 16:31:18 +01:00
Mircea Trofin	03a62f3073	[utils] Fix UpdateTestChecks case where 2 runs differ for last label Two RUN lines produce outputs that, each, have some common parts and some different parts. The common parts are checked under label A. The differing parts are associated to a function and checked under labels B and C, respectively. When build_function_body_dictionary is called for the first RUN line, it will attribute the function body to labels A and C. When the second RUN is passed to build_function_body_dictionary, it sees that the function body under A is different from what it has. If in this second RUN line, A were at the end of the prefixes list, A's body is still kept associated with the first run's function. When we output the function body (i.e. add_checks), we stop after emitting for the first prefix matching that function. So we end up with the wrong function body (first RUN's A-association). There is no reason to special-case the last label in the prefixes list, and the fix is to always clear a label association if we find a RUN line where the body is different. Differential Revision: https://reviews.llvm.org/D93078	2020-12-15 07:16:54 -08:00
Florian Hahn	43edfa5357	[AnnotationRemarks] Also generate annotation remarks when using -O0. The AnnotationRemarks pass is already run at the end of the module pipeline. This patch also adds it before bailing out for -O0, so remarks are also generated with -O0.	2020-12-15 14:46:52 +00:00
Kazushi (Jam) Marukawa	c2546bc894	[VE] Support FRAMEADDR Implement FRAMEADDR for VE. Add a regression test also. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93295	2020-12-15 23:31:19 +09:00
Kazushi (Jam) Marukawa	a7f0cf45f8	[VE][NFC] Sort VEISD operations Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93294	2020-12-15 23:29:16 +09:00
Florian Hahn	2d177b0966	[VPlan] Use VPDef for VPWidenSelectRecipe. This patch updates VPWidenSelectRecipe to manage the value it defines using VPDef. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D90560	2020-12-15 14:15:01 +00:00
Sebastian Neubauer	4b0b1e7b26	[AMDGPU] Unify flat offset logic Move getNumFlatOffsetBits from AMDGPUAsmParser and SIInstrInfo into AMDGPUBaseInfo. Differential Revision: https://reviews.llvm.org/D93287	2020-12-15 14:59:59 +01:00
Sebastian Neubauer	fe286555f3	[AMDGPU][NFC] Add more global_atomic_cmpswap tests	2020-12-15 14:47:33 +01:00
Jun Ma	4df13c29e3	[InstCombine] Remove scalable vector restriction in foldVectorBinop Differential Revision: https://reviews.llvm.org/D93289	2020-12-15 21:14:59 +08:00
Florian Hahn	834860df92	[LV] Pass explicit vector width to not require a X86 target.	2020-12-15 12:52:22 +00:00
Jun Ma	de7fb17148	[InstCombine][NFC] Change cast of FixedVectorType to dyn_cast.	2020-12-15 20:36:57 +08:00
Jun Ma	6eda9a4795	[InstCombine] Remove scalable vector restriction in InstCombineCompares Differential Revision: https://reviews.llvm.org/D93269	2020-12-15 20:36:57 +08:00
Jun Ma	8cea8298aa	[InstCombine] Remove scalable vector restriction when fold SelectInst Differential Revision: https://reviews.llvm.org/D93083	2020-12-15 20:36:57 +08:00
Hsiangkai Wang	34e308363d	[RISCV] Define vwadd/vwaddu/vwsub/vwsubu intrinsics. Define vwadd/vwaddu/vwsub/vwsubu intrinsics and lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Hsiangkai Wang <kai.wang@sifive.com> Differential Revision: https://reviews.llvm.org/D93108	2020-12-15 20:15:06 +08:00
Paul Walker	4013487a3f	[SVE] Move INT_TO_FP i1 promotion into custom lowering. AddPromotedToType is being used to legalise INT_TO_FP operations when the source is a predicate. The point where this introduces vector extends might cause problems in the future so this patch falls back to manual promotion within custom lowering. Differential Revision: https://reviews.llvm.org/D90093	2020-12-15 11:57:07 +00:00
Paul Walker	e454fcc49d	[CodeGenPrepare] Update optimizeGatherScatterInst for scalable vectors. optimizeGatherScatterInst does nothing specific to fixed length vectors but uses FixedVectorType to extract the number of elements. This patch simply updates the code to use VectorType and getElementCount instead. For testing I just copied Transforms/CodeGenPrepare/X86/gather-scatter-opt.ll replacing `<4 x ` with `<vscale x 4`. Differential Revision: https://reviews.llvm.org/D92572	2020-12-15 10:57:51 +00:00
Simon Pilgrim	bff9741e5f	[X86] Remove trailing whitespace. NFC.	2020-12-15 10:11:38 +00:00
Simon Pilgrim	975ba80e31	[X86][AVX] LowerBUILD_VECTOR - reduce 256/512-bit build vectors with zero/undef upper elements + pad. As discussed on D92645, we don't do a good job of recognising when we don't require the full width of a ymm/zmm build vector because the upper elements are undef/zero. This commit allows us to make use of implicit zeroing of upper elements with AVX instructions, which we emulate in DAG with an INSERT_SUBVECTOR into the bottom of an undef/zero vector of the original type. This exposed a limitation in getTargetConstantBitsFromNode which didn't extract bits from INSERT_SUBVECTORs of different element widths which I've included as well to prevent a couple of regressions.	2020-12-15 10:11:38 +00:00
Florian Hahn	a10db87eb9	[LV] Add reduction test, which exposed a crash in a pending patch.	2020-12-15 09:42:00 +00:00
Florian Hahn	fec0553490	[VPlan] Use VPDef for VPWidenGEPRecipe. This patch updates VPWidenGEPRecipe to manage the value it defines using VPDef. The VPValue is used during VPlan construction and code generation instead of the plain IR reference where possible. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D90561	2020-12-15 09:30:14 +00:00
Florian Hahn	ffe40cb25f	[VPlan] Use VPdef for VPWidenCall. This patch updates VPWidenCallRecipe to manage the value it defines using VPDef. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D90559	2020-12-15 09:20:07 +00:00
Kazushi (Jam) Marukawa	5bcab24086	[VE] Support atomic exchange instructions Support atomic exchange and atomic compare and exchange instructions. Change CAS and TS1AM instructions for ISel patterns. Add selectADDRzi pattern for them. Add TS1AM pseudo instruction also for better ISel. Add shouldExpandAtomicRMWInIR() function to expand all atomicrmw instructions except atomicrmw xchg. Add custom lower for i8/i16 atomicrmw xchg. Modify replaceFI to support CAS/TS1AM instructions which use "reg+disp" operands instead of "reg+imm+disp" operands. And, add several regression tests to check the correctness. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D93161	2020-12-15 17:43:11 +09:00
Hsiangkai Wang	148cf7e90e	[RISCV][NFC] Define scalable vectors for half types. This is preparation work for vfadd intrinsics. Differential Revision: https://reviews.llvm.org/D93275	2020-12-15 16:23:22 +08:00
Georgii Rymar	0f9894fe39	[llvm-readelf] - Don't print OS/Processor specific prefix for known ELF file types. This is a change suggested in post commit comments for D93096 (https://reviews.llvm.org/D93096#2451796). Imagine we want to add a custom OS specific ELF file type. For that we can update the `ElfObjectFileType` array: ``` static const EnumEntry<unsigned> ElfObjectFileType[] = { ... {"Core", "CORE (Core file)", ELF::ET_CORE}, {"MyType", "MyType (my description)", 0xfe01}, }; ``` The current code then might print: ``` OS Specific: (MyType (my description)) ``` Though instead we probably would like to see a nicer output, e.g: ``` Type: MyType (my description) ``` To achieve that we can reorder the code slightly. It is impossible to add a test I think, because we have no custom values in the `ElfObjectFileType` array in LLVM. Differential revision: https://reviews.llvm.org/D93217	2020-12-15 10:56:25 +03:00
Max Kazantsev	cd2f340f02	[SCEV] Add missing type check into getRangeForAffineNoSelfWrappingAR We make type widening without checking if it's needed. Bail if the max iteration count is wider than AR's type.	2020-12-15 14:50:32 +07:00
Amara Emerson	f7216ac2fb	[GlobalISel][IRTranslator] Ensure branch probabilities are added when translating invoke edges. This uses a straightforward port of findUnwindDestinations() from SelectionDAG. Differential Revision: https://reviews.llvm.org/D93256	2020-12-14 23:36:54 -08:00
Kazu Hirata	49434d3cd6	[IR] Remove isPowerOf2ByteWidth The predicate used to be used with the C backend, which was removed on Mar 23, 2012 in commit 64a232343aa649fdacf78698da3e4d5737dee56a. It seems to be unused since then.	2020-12-14 23:00:17 -08:00
Max Kazantsev	81955ada9c	[Test] Test on assertion failure with expensive SCEV range inference	2020-12-15 13:47:19 +07:00
Kazu Hirata	c82f0756af	[Analysis] Use llvm::erase_value (NFC)	2020-12-14 22:40:13 -08:00
Hsiangkai Wang	f039b9862f	[RISCV] Define vadd/vsub/vrsub intrinsics and lower to V instructions. This patch is based on the proposal from Roger Ferrer Ibanez. http://lists.llvm.org/pipermail/llvm-dev/2020-October/145850.html Differential Revision: https://reviews.llvm.org/D93013	2020-12-15 12:56:49 +08:00
LLVM GN Syncbot	851d64b940	[gn build] Port d2ed9d6b7ec	2020-12-15 03:35:00 +00:00
Nico Weber	42a753fa7c	Reland "[MachineDebugify] Insert synthetic DBG_VALUE instructions" This reverts commit 841f9c937f6e593c926a26aedf054436eb807fe6. The change landed many months ago; something else broke those tests.	2020-12-14 22:34:23 -05:00
Nico Weber	e3a964236e	Revert "[MachineDebugify] Insert synthetic DBG_VALUE instructions" This reverts commit 2a5675f11d3bc803a245c0e2a3b47491c8f8a065. The tests it adds fail: https://reviews.llvm.org/D78135#2453736	2020-12-14 22:14:48 -05:00
Nico Weber	f3e89af5f2	Revert "[Debugify] Support checking Machine IR debug info" This reverts commit c4d2d4337d50bed3cafd564daece1a197005b22b. Necessary to revert 2a5675f11d3bc803a245c0e.	2020-12-14 22:14:48 -05:00
Luo, Yuanke	fec37b307b	[X86] Add test case for commit e52bc1d2bba794b. Differential Revision: https://reviews.llvm.org/D93173	2020-12-15 11:14:16 +08:00
Nico Weber	d107b74361	Revert "[amdgpu] Default to code object v3" This reverts commit 4b2e7d0215021d0d1df1a6319884b21d33936265. Breaks check-clang, see https://reviews.llvm.org/D93258#2453600	2020-12-14 22:01:26 -05:00
Qiu Chaofan	8f2e4bf3b0	[NFC] [Legalizer] Use common method for expanding fp-to-int operands Reviewed By: RKSimon, steven.zhang Differential Revision: https://reviews.llvm.org/D92481	2020-12-15 10:45:40 +08:00
River Riddle	28caccdfb8	[mlir][Inliner] Refactor the inliner to use nested pass pipelines instead of just canonicalization Now that passes have support for running nested pipelines, the inliner can now allow for users to provide proper nested pipelines to use for optimization during inlining. This revision also changes the behavior of optimization during inlining to optimize before attempting to inline, which should lead to a more accurate cost model and prevents the need for users to schedule additional duplicate cleanup passes before/after the inliner that would already be run during inlining. Differential Revision: https://reviews.llvm.org/D91211	2020-12-14 18:09:47 -08:00
Xiang1 Zhang	6d8bb495f3	[Debugify] Support checking Machine IR debug info Add a mir-check-debug pass to check MIR-level debug info. At the IR level, LLVM currently has debugify + check-debugify to generate and check debug IR. Much like the IR-level debugify pass, mir-debugify inserts sequentially increasing line locations into each MachineInstr in a Module, but there is no equivalent MIR-level check-debugify pass, so this patch adds one as "mir-check-debug". Reviewed By: djtodoro Differential Revision: https://reviews.llvm.org/D91595	2020-12-14 17:53:46 -08:00
Xiang1 Zhang	4721afcdaa	Revert "[Debugify] Support checking Machine IR debug info" This reverts commit 57a3d9ec4a8c1422f07264bed9f12a4ea416707e.	2020-12-14 17:48:49 -08:00

208285 Commits