The constant trunc/ext may not be the optimal pre-condition,
but I think it handles the common cases.
Example of an Alive2 proof:
https://alive2.llvm.org/ce/z/sREeLC
This is another step towards canonicalizing to the intrinsics.
Narrowing was identified as a source of potential regression for
abs(), so we need to handle this for min/max as well - see:
https://llvm.org/PR48816
If this is not enough, we could process intrinsics in
the trunc-driven matching in canEvaluateTruncated().
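For illustration, a minimal IR sketch of the kind of narrowing this
enables (the operand names and the constant 42 are made up for the
example; the Alive2 link above has a verified version):

declare i32 @llvm.smax.i32(i32, i32)
declare i8 @llvm.smax.i8(i8, i8)

; Before: the max is computed in the wide type.
define i32 @src(i8 %x) {
  %ext = sext i8 %x to i32
  %max = call i32 @llvm.smax.i32(i32 %ext, i32 42)
  ret i32 %max
}

; After: the constant fits in i8, so the max can be narrowed.
define i32 @tgt(i8 %x) {
  %max = call i8 @llvm.smax.i8(i8 %x, i8 42)
  %ext = sext i8 %max to i32
  ret i32 %ext
}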
This patch is a follow-up to D94821 to ensure the correct behavior of the
general directive structure checker.
This patch adds the generation of the Enter function declarations for clauses in
the TableGen backend.
This helps ensure that each clause declared in the TableGen file has at least
a basic check.
Reviewed By: kiranchandramohan
Differential Revision: https://reviews.llvm.org/D95108
When we have a zeroext parameter, emit G_ASSERT_ZEXT.
Add a check that we actually emit it.
This is a 0.1% code size win on CTMark/7zip and CTMark/consumer-typeset at -Os.
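For example (a hand-written sketch, not the actual test), a function
like the following would now get the assertion on its incoming
argument register:

; The zeroext i8 argument arrives with its upper bits zeroed, so call
; lowering can mark the incoming vreg with G_ASSERT_ZEXT instead of
; dropping that knowledge.
define i32 @pass_zeroext(i8 zeroext %x) {
  %ext = zext i8 %x to i32
  ret i32 %ext
}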
Differential Revision: https://reviews.llvm.org/D95567
Part of the gold test added in 1487747e990ce9f8851f3d92c3006a74134d7518
relies on more recent gold fixes to the plugin behavior with
--export-dynamic-symbol and --dynamic-list. Extract those parts of the
new test into a v1.16 test.
A follow-up patch will add support for commuting operands or
changing opcode to vfmacc and friends.
Reviewed By: frasercrmck
Differential Revision: https://reviews.llvm.org/D95662
When replacing the dst reg with the src reg, we need to make sure that we
propagate the dst reg's register class through to the src.
Otherwise, we aren't meeting the requirements for G_ASSERT_ZEXT, and so the
verifier will fail.
Differential Revision: https://reviews.llvm.org/D95708
Rather than materializing the 0xffff immediate for the AND, use
a shift left to remove the upper bits and then shift in zeros
from the right.
This pattern occurs when type legalizing an i16 right shift.
I've implemented this with custom selection code for a number of
reasons: I've limited it to the AND having a single use, we need
to compensate for SimplifyDemandedBits altering the AND mask, and I'm
using *W opcodes on RV64. We may want to generalize this in the
future. For all these reasons it seemed easiest to do it this way.
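The equivalence, written as IR for clarity (the actual change is in
instruction selection; the i32 type and the shift amount 3 are
illustrative):

; Before: requires materializing the 0xffff mask.
define i32 @src(i32 %x) {
  %and = and i32 %x, 65535
  %shr = lshr i32 %and, 3
  ret i32 %shr
}

; After: shift the upper 16 bits out to the left, then shift zeros
; back in from the right (16 + 3 positions).
define i32 @tgt(i32 %x) {
  %shl = shl i32 %x, 16
  %shr = lshr i32 %shl, 19
  ret i32 %shr
}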
Reviewed By: luismarques
Differential Revision: https://reviews.llvm.org/D95774
Instead of using ConstraintSystem::negate when adding new constraints,
flip the condition in IR.
The main advantage is that EQ predicates can be represented by 2
constraints, which makes negating based on the constraint tricky. The IR
condition can easily be negated.
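A sketch of the difference (hypothetical IR; the constraint forms are
illustrative):

define i1 @cmp(i32 %a, i32 %b) {
  ; icmp eq is modeled by two constraints: a - b <= 0 and b - a <= 0.
  ; Negating that pair inside the ConstraintSystem is awkward; flipping
  ; the predicate to icmp ne before adding constraints is trivial.
  %c = icmp eq i32 %a, %b
  ret i1 %c
}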
This would assert with amdgpu-spill-sgpr-to-vgpr disabled when trying to
spill the FP.
Fixes: SWDEV-262704
Reviewed By: RamNalamothu
Differential Revision: https://reviews.llvm.org/D95768
Under the softfp calling convention, we are often left with
VMOVRRD(extract(bitcast(build_vector(a, b, c, d)))) for the return value
of the function. These can be simplified to a,b or c,d directly,
depending on the value of the extract.
Big endian is a little different because the bitcast switches the lanes
around, meaning we end up with b,a or d,c.
Differential Revision: https://reviews.llvm.org/D94989
If we instantiate self-referenced anonymous records in foreach and
multiclass, the NAME value will point to an incorrect record. This is
because the anonymous name is resolved too early.
This patch adds AnonymousNameInit to represent an anonymous record name.
When instantiating an anonymous record, it will update the referred name.
Differential Revision: https://reviews.llvm.org/D95309
The AArch64 DAG combine added by D90945 & D91433 extends the index
of a scalable masked gather or scatter to i32 if necessary.
This patch removes the combine and instead adds shouldExtendGSIndex, which
is used by visitMaskedGather/Scatter in SelectionDAGBuilder to query whether
the index should be extended before calling getMaskedGather/Scatter.
Reviewed By: david-arm
Differential Revision: https://reviews.llvm.org/D94525
D90687 introduced a crash:
llvm::LoopVectorizationCostModel::computeMaxVF(llvm::ElementCount, unsigned int):
Assertion `WideningDecisions.empty() && Uniforms.empty() && Scalars.empty() &&
"No decisions should have been taken at this point"' failed.
when compiling the following C code:
typedef struct {
  char a;
} b;
b *c;
int d, e;
int f() {
  int g = 0;
  for (; d; d++) {
    e = 0;
    for (; e < c[d].a; e++)
      g++;
  }
  return g;
}
with:
clang -Os -target hexagon -mhvx -fvectorize -mv67 testcase.c -S -o -
This occurred because, prior to D90687, computeFeasibleMaxVF would only be
called in computeMaxVF when a scalar epilogue was allowed, but now it's
always called. This triggers the assert above, since computeFeasibleMaxVF
collects all viable VFs larger than the default MaxVF and, for each VF,
calculates the register usage, which performs exactly the analysis the
assert guards against. This path is only taken in computeFeasibleMaxVF when
TTI.shouldMaximizeVectorBandwidth returns true, and the hexagon backend
implements this target hook to always return true.
Reported by @iajbar.
Reviewed By: fhahn
Differential Revision: https://reviews.llvm.org/D94869
This adds a DAG combine for converting sext_inreg of VGetLaneu into
VGetLanes, provided the types match correctly.
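The combine is at the DAG level, but the scalar identity behind it can
be written in IR (a hand-written analogue, not the actual test):

; Sign-extending the low 16 bits of a zero-extended lane read is the
; same as sign-extending the lane directly.
define i32 @src(<8 x i16> %v) {
  %lane = extractelement <8 x i16> %v, i32 0
  %zext = zext i16 %lane to i32
  %trunc = trunc i32 %zext to i16   ; sext_inreg written as trunc+sext
  %sext = sext i16 %trunc to i32
  ret i32 %sext
}

define i32 @tgt(<8 x i16> %v) {
  %lane = extractelement <8 x i16> %v, i32 0
  %sext = sext i16 %lane to i32
  ret i32 %sext
}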
Differential Revision: https://reviews.llvm.org/D95073
Now that Loop Peeling has been fixed (80cdd30eb90c3509bf315f1fa1369483e2448bbd),
enable the dominance check by default.
This reverts commit 3b5d36ece21f9baf96d82944b0165cb352443bee.
We can only legally extract from the lowest 128-bit subvector, so extract the correct subvector to allow us to handle 256/512-bit vector element extracts.
Under SoftFP calling conventions, we can be left with
extract(bitcast(BUILD_VECTOR(VMOVDRR(a, b), ..))) patterns that can
simplify to a or b, depending on the extract lane.
Differential Revision: https://reviews.llvm.org/D94990
Correct integer constants like `1UL << 63` to `UINT64_C(1) << 63` in
order to make them work on 32-bit machines. Tested on both i386
and x86_64 machines.
Reviewed By: mgorny
Differential Revision: https://reviews.llvm.org/D95724
If we determine that the invariant path through the loop has no effects,
we can directly branch to the exit block, instead of unswitching first.
Besides avoiding some extra work (unswitching first, then deleting the
loop again), this allows us to be more aggressive than regular unswitching
with respect to cost-modeling. This approach should always be
desirable.
This is similar in spirit to D93734, except that it uses the previously
added checks for loop unswitching.
I tried to add the required no-op checks from scratch, as we only check
a subset of the loop. There is potential to unify the checks with
LoopDeletion, at the cost of adding a predicate for whether a block should
be considered.
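A sketch of the situation (hypothetical IR): when %inv is true, the
loop below has no observable effects, so the runtime check can branch
straight to the exit instead of first unswitching into a separate
no-op copy of the loop:

declare void @effect()

define void @f(i1 %inv, i32 %n) mustprogress {
entry:
  br label %loop
loop:
  %iv = phi i32 [ 0, %entry ], [ %iv.next, %latch ]
  br i1 %inv, label %latch, label %work   ; loop-invariant condition
work:
  call void @effect()                     ; the only effect in the loop
  br label %latch
latch:
  %iv.next = add nsw i32 %iv, 1
  %cmp = icmp slt i32 %iv.next, %n
  br i1 %cmp, label %loop, label %exit
exit:
  ret void
}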
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D95468
The reduction of a sanitizer build failure when enabling the dominance check (D95335) showed that loop peeling also needs to take care of scope duplication, just like loop unrolling (D92887).
Reviewed By: nikic
Differential Revision: https://reviews.llvm.org/D95544
This primarily occurs with isel patterns using vnot. This reduces
the number of variants in the isel tables.
We generally canonicalize build_vectors of constants to the RHS. I think
we might fail if there is a bitcast on the build_vector, but that
should be easy to fix if we can find a case. Usually the
bitcast is introduced by type legalization or lowering. It's
likely canonicalization would have already occurred.
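For reference, vnot is just an xor with an all-ones vector; with the
constant canonicalized to the RHS, a single pattern suffices
(illustrative IR):

define <4 x i32> @vnot(<4 x i32> %x) {
  ; Canonical form: the all-ones build_vector is the second operand.
  %not = xor <4 x i32> %x, <i32 -1, i32 -1, i32 -1, i32 -1>
  ret <4 x i32> %not
}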
To set a non-default rounding mode, the user usually calls the function
'fesetround' from the standard C library. This approach has some
disadvantages:
* It creates an unnecessary dependency on libc. On the other hand, setting
the rounding mode requires only a few instructions and could be done by the
compiler itself. Sometimes the standard C library is not even available,
as in the case of GPUs or AI cores that execute small kernels.
* The compiler could generate more efficient code if it knows that a
particular call just sets the rounding mode.
This change introduces a new IR intrinsic, 'llvm.set.rounding', which
sets the current rounding mode, similar to 'fesetround'. It differs
from the latter, however, because it is a lower-level facility:
* 'llvm.set.rounding' does not return any value, whereas 'fesetround'
returns a non-zero value in case of failure. In glibc, 'fesetround'
reports failure if its argument is invalid or unsupported, or if floating
point operations are unavailable on the hardware. The compiler usually knows
what core it generates code for and can validate arguments in many
cases.
* The rounding mode is specified for 'fesetround' using constants like
'FE_TONEAREST', which are target-dependent. It is inconvenient to work
with such constants at the IR level.
The C standard provides a target-independent way to specify the rounding
mode; it is used in FLT_ROUNDS. However, it does not define a standard way
to set the rounding mode using this encoding.
This change implements only the IR intrinsic. Lowering it to machine code is
target-specific and will be implemented later. Mapping 'fesetround'
to 'llvm.set.rounding' is also not implemented here.
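A sketch of the intended use, assuming the FLT_ROUNDS encoding
(0 = toward zero, 1 = to nearest, 2 = upward, 3 = downward):

declare void @llvm.set.rounding(i32)

define void @round_upward() {
  ; Comparable in intent to fesetround(FE_UPWARD), but returns nothing
  ; and uses a target-independent mode encoding.
  call void @llvm.set.rounding(i32 2)
  ret void
}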
Differential Revision: https://reviews.llvm.org/D74729
A couple of patterns used bitconvert on the immAllOnesV, but
the isel matching uses ISD::isBuildVectorAllOnes, which
is able to look through bitcasts, so the isel patterns don't need
to do it explicitly.
This reverts commit 6e58539659aea0ee621c7e267d825aa82d4e7e96.
This failed in http://lab.llvm.org:8011/#/builders/123/builds/2676. I guess
we're still missing some symbols, but unfortunately the specific error is
masked by a bug in python/lit that hides stderr. This test will have to remain
disabled on Windows until I can get help to debug it further.
We need to add a mask to the shift amount for these operations
to use the FSR/FSL instructions. We were previously doing this
in isel patterns, but custom lowering will make the mask
visible to optimizations earlier.
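These correspond to the IR funnel-shift intrinsics; the point is to
make the modulo-width semantics of the shift amount visible
(illustrative IR; the exact mask the backend uses is its own detail):

declare i32 @llvm.fshl.i32(i32, i32, i32)

define i32 @fsl(i32 %a, i32 %b, i32 %c) {
  ; fshl only uses the shift amount modulo the bit width (%c mod 32
  ; here); custom lowering inserts that mask explicitly so earlier
  ; optimizations can see it.
  %r = call i32 @llvm.fshl.i32(i32 %a, i32 %b, i32 %c)
  ret i32 %r
}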
This testcase was failing on Windows due to missing definitions. This commit
adds definitions of the missing symbols (as absolute symbols) to eliminate the
errors.
If we're going to end up expanding anyway, we should do it early
so we don't create extra operations to handle the bytes added by
promotion.
This is helpful on RISCV where we might have to promote i16 all
the way to i64.
Differential Revision: https://reviews.llvm.org/D95756
This patch removes some options that have been duplicated in
LTOCodeGenerator and instead uses lto::Config directly to manage the
options.
This is a cleanup after 6a59f0560648.
Reviewed By: tejohnson
Differential Revision: https://reviews.llvm.org/D95738