llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 18:54:02 +01:00

Author	SHA1	Message	Date
Petr Hosek	62cf2a8725	[InstrProfiling] Use !associated metadata for counters, data and values C identifier name input sections such as __llvm_prf_* are GC roots so they cannot be discarded. In LLD, the SHF_LINK_ORDER flag overrides the C identifier name semantics. The !associated metadata may be attached to a global object declaration with a single argument that references another global object, and it gets lowered to the SHF_LINK_ORDER flag. When a function symbol is discarded by the linker, setting up !associated metadata allows the linker to discard counters, data and values associated with that function symbol. Note that !associated metadata is only supported by ELF; it does not have any effect on non-ELF targets. Differential Revision: https://reviews.llvm.org/D76802	2021-02-01 15:01:43 -08:00
Patrick Oppenlander	019e9907bd	[llvm-objcopy] -O binary: consider SHT_NOBITS sections to be empty This is consistent with BFD objcopy. Previously llvm objcopy would allocate space for SHT_NOBITS sections often resulting in enormous binary files. New test case (binary-paddr.test %t6). Reviewed By: jhenderson, MaskRay Differential Revision: https://reviews.llvm.org/D95569	2021-02-01 15:01:25 -08:00
Hongtao Yu	7761552c73	[CSSPGO] Tweaking inlining with pseudo probes. Fixing up a couple places where `getCallSiteIdentifier` is needed to support pseudo-probe-based callsites. Also fixing an issue in the extbinary profile reader where the metadata section is not fully scanned based on the number of profiles loaded only for the current module. Reviewed By: wmi, wenlei Differential Revision: https://reviews.llvm.org/D95791	2021-02-01 13:56:40 -08:00
Philip Reames	95ddf0834f	[tests] highlight cornercase w/deref hoisting from D95815 The main point of committing this early is to have a negative test in tree. Nothing fails in the current tests if we implement this (currently unsound) optimization.	2021-02-01 13:32:39 -08:00
Sanjay Patel	a3cb545d77	[LoopVectorize] improve IR fast-math-flags propagation in reductions This is another step (see D95452) towards correcting fast-math-flags bugs in vector reductions. There are multiple bugs visible in the test diffs, and this is still not working as it should. We still use function attributes (rather than FMF) to drive part of the logic, but we are not checking for the correct FP function attributes. Note that FMF may not be propagated optimally on selects (example in https://llvm.org/PR35607 ). That's why I'm proposing to union the FMF of a fcmp+select pair and avoid regressions on existing vectorizer tests. Differential Revision: https://reviews.llvm.org/D95690	2021-02-01 16:21:36 -05:00
Florian Hahn	59b7da73aa	[ConstraintElimination] Add support for EQ predicates. A == B maps to A >= B && A <= B (https://alive2.llvm.org/ce/z/_dwxKn). This extends the constraint construction to return a list of constraints, which can be used to properly decompose nested AND & OR.	2021-02-01 20:48:31 +00:00
Philip Reames	fc9b7d7c42	[Loads] Plumb through TLI argument [NFC] This is a (rather delayed) follow up to commit 0129cd5. This commit is entirely NFC, the semantic change to leverage the new information will be submitted separately with a test case.	2021-02-01 11:45:30 -08:00
Wouter van Oortmerssen	c4485ed951	[WebAssembly] fixed wasm64 data segment init exp not 64-bit As defined in the spec: https://github.com/WebAssembly/memory64/blob/master/proposals/memory64/Overview.md Differential Revision: https://reviews.llvm.org/D95651	2021-02-01 11:32:50 -08:00
Michael Holman	b4e8df9113	[ConstantHoisting] Fix bug where constant materialization could insert into EH pad If the incoming block to a phi node is an EH pad, then we will materialize into an EH pad, which is not supposed to happen. To fix this, I added a check to see if incoming block of a phi node is an EH pad before using it as the insertion point. Differential Revision: https://reviews.llvm.org/D95019	2021-02-01 11:23:56 -08:00
David Green	b21519cd48	[ARM] Flatten identity shuffles through vqdmulh nodes Given a shuffle(vqdmulh(shuffle, shuffle)), we can flatten the shuffles out if they become an identity mask. This can come up during lane interleaving, when we do that better. Differential Revision: https://reviews.llvm.org/D94034	2021-02-01 19:14:20 +00:00
Arthur Eubanks	47246e5e03	[NewPM][Unswitch] Add option to disable -O3 non-trivial unswitching Some benchmarks regress with non-trivial unswitching, so add an option to opt-out of performing non-trivial unswitching while investigating. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D95796	2021-02-01 11:11:59 -08:00
Craig Topper	41a3080ffd	[X86] Accept 64-bit GPRs for vextractps when using a register that requires EVEX. This is consistent with the VEX version. It also fixes a sorting issue in the matching table that caused the EVEX version to be prioritized over VEX in intel syntax. Fixes issue [2] from PR48991.	2021-02-01 11:01:32 -08:00
Haowei Wu	efc42d62e8	[elfabi] Fix tests which failed on different timezones This patch fixes elfabi tests on machines using GMT+X timezone settings. Differential Revision: https://reviews.llvm.org/D95641	2021-02-01 10:58:55 -08:00
Sanjay Patel	6d8d6b38d5	[InstCombine] try to narrow min/max intrinsics with constant operand The constant trunc/ext may not be the optimal pre-condition, but I think that handles the common cases. Example of Alive2 proof: https://alive2.llvm.org/ce/z/sREeLC This is another step towards canonicalizing to the intrinsics. Narrowing was identified as source of potential regression for abs(), so we need to handle this for min/max - see: https://llvm.org/PR48816 If this is not enough, we could process intrinsics in the trunc-driven matching in canEvaluateTruncated().	2021-02-01 13:44:13 -05:00
Sanjay Patel	ce64161bee	[InstCombine] add tests for min/max with extend and constant operand; NFC	2021-02-01 13:44:13 -05:00
Valentin Clement	c1dd747b25	[flang][directive] Enforce basic semantic check for all clauses This patch is a follow up to D94821 to ensure the correct behavior of the general directive structure checker. This patch adds the generation of the Enter function declaration for clauses in the TableGen backend. This helps to ensure each clause declared in the TableGen file has at least a basic check. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D95108	2021-02-01 13:33:30 -05:00
Simon Pilgrim	8985ce20e1	[X86][AVX] Add 'OK' test cases for PR48877	2021-02-01 18:17:41 +00:00
Simon Pilgrim	446a01964c	[X86][SSE] LowerScalarImmediateShift - use APInt::getLowBitsSet for vXi8 ISD::SRL mask generation. NFCI. Match what we do for ISD::SHL	2021-02-01 18:17:40 +00:00
Jessica Paquette	08fd0a40ed	[AArch64][GlobalISel] Emit G_ASSERT_ZEXT in assignValueToReg When we have a zeroext parameter, emit G_ASSERT_ZEXT. Add a check that we actually emit it. This is a 0.1% code size win on CTMark/7zip and CTMark/consumer-typeset at -Os. Differential Revision: https://reviews.llvm.org/D95567	2021-02-01 10:01:52 -08:00
Teresa Johnson	3315dc9bcf	[LTO] Move part of gold devirt test to v1.16 directory Part of the gold test added in 1487747e990ce9f8851f3d92c3006a74134d7518 relies on more recent fixes to gold that fix the plugin behavior with --export-dynamic-symbol and --dynamic-list. Extract those parts of the new test into a v1.16 test.	2021-02-01 09:53:11 -08:00
Craig Topper	6ab0fefa1b	[RISCV] Add scalable vector support for floating point FMA instructions A follow up patch will add support for commuting operands or changing opcode to vfmacc and friends. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D95662	2021-02-01 09:52:43 -08:00
Craig Topper	c8be45ab39	[RISCV] Update comment text from D95774. NFC	2021-02-01 09:52:43 -08:00
Jessica Paquette	36e5b06d50	[GlobalISel] Make sure G_ASSERT_ZEXT's src ends up with the same rc as dst When replacing the dst reg with the src reg, we need to make sure that we propagate the dst reg's register class through to the src. Otherwise, we aren't meeting the requirements for G_ASSERT_ZEXT, and so the verifier will fail. Differential Revision: https://reviews.llvm.org/D95708	2021-02-01 09:46:35 -08:00
Craig Topper	136420859e	[RISCV] Optimize (srl (and X, 0xffff), C) -> (srli (slli X, 16), 16 + C). Rather than materializing the 0xffff immediate for the AND, use a shift left to remove the upper bits and then shift in zeros from the right. This pattern occurs when type legalizing an i16 right shift. I've implemented this with custom selection code for a number of reasons. I've limited this to the AND having a single use. We need to compensate for SimplifyDemandedBits altering the AND mask. I'm using *W opcodes on RV64. We may want to generalize this in the future. For all these reasons it seemed easiest to do it this way. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D95774	2021-02-01 09:37:55 -08:00
Florian Hahn	28df98bafb	[ConstraintElimination] Negate IR condition directly. Instead of using ConstraintSystem::negate when adding new constraints, flip the condition in IR. The main advantage is that EQ predicates can be represented by 2 constraints, which makes negating based on the constraint tricky. The IR condition can easily be negated.	2021-02-01 17:21:40 +00:00
Austin Kerbow	b099a1ce22	[AMDGPU] Fix release build after 0397dca0.	2021-02-01 08:55:14 -08:00
Austin Kerbow	411730e6c6	[AMDGPU] Fix crash with sgpr spills to vgpr disabled This would assert with amdgpu-spill-sgpr-to-vgpr disabled when trying to spill the FP. Fixes: SWDEV-262704 Reviewed By: RamNalamothu Differential Revision: https://reviews.llvm.org/D95768	2021-02-01 08:35:25 -08:00
Simon Pilgrim	9ee9a3c639	Revert rGce587529ad8b5 - "[APFloat] multiplySignificand - pass IEEEFloat as const reference. NFCI." Breaks on some buildbots	2021-02-01 16:15:23 +00:00
Sander de Smalen	124d498e95	NFC: Migrate SimplifyCFG to work on InstructionCost This patch migrates cost values and arithmetic to work on InstructionCost. When the interfaces to TargetTransformInfo are changed, any InstructionCost state will propagate naturally. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D95351	2021-02-01 16:14:05 +00:00
Sander de Smalen	6c8092b00f	[SimplifyCFG] NFC: Rename static methods to clang-tidy standards. This patch is a precursor to D95351, which changes the signature of these methods.	2021-02-01 16:14:05 +00:00
David Green	dbdd240132	[ARM] Simplify VMOVRRD from extracts of buildvectors Under the softfp calling convention, we are often left with VMOVRRD(extract(bitcast(build_vector(a, b, c, d)))) for the return value of the function. These can be simplified to a,b or c,d directly, depending on the value of the extract. Big endian is a little different because the bitcast switches the lanes around, meaning we end up with b,a or d,c. Differential Revision: https://reviews.llvm.org/D94989	2021-02-01 16:09:25 +00:00
J-Y You	c233f2a4f7	[TableGen] Fix anonymous record self-reference in foreach and multiclass If we instantiate self-referenced anonymous records in foreach and multiclass, the NAME value will point to an incorrect record. This is because the anonymous name is resolved too early. This patch adds AnonymousNameInit to represent an anonymous record name. When instantiating an anonymous record, it will update the referred name. Differential Revision: https://reviews.llvm.org/D95309	2021-02-01 10:59:07 -05:00
Simon Pilgrim	c54bcb01a6	[APFloat] multiplySignificand - pass IEEEFloat as const reference. NFCI. Avoids unnecessary IEEEFloat copies.	2021-02-01 15:41:50 +00:00
LLVM GN Syncbot	cbac188e3a	[gn build] Port b63cd4db915c	2021-02-01 14:24:45 +00:00
Kerry McLaughlin	267255edd9	[SVE][CodeGen] Remove performMaskedGatherScatterCombine The AArch64 DAG combine added by D90945 & D91433 extends the index of a scalable masked gather or scatter to i32 if necessary. This patch removes the combine and instead adds shouldExtendGSIndex, which is used by visitMaskedGather/Scatter in SelectionDAGBuilder to query whether the index should be extended before calling getMaskedGather/Scatter. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D94525	2021-02-01 14:10:00 +00:00
Florian Hahn	5ff3260e34	[SCEV] Bail out if URem operand cannot be zero-extended. In some cases, LHS is larger than the target expression type. Bail out in that case for now, to avoid crashing	2021-02-01 13:50:54 +00:00
Jeroen Dobbelaere	2a8989c820	Revert "[Verifier] enable llvm.experimental.noalias.scope.decl dominance check." the 'clang-with-lto-ubuntu' buildbot triggers the assertion. This reverts commit b43c395e60d2636ab5afc9b60a2046978c71e366.	2021-02-01 14:38:33 +01:00
Florian Hahn	adbb86ac12	[ConstraintElimination] Add tests for signed predicates. Add test coverage for conditions with signed predicates.	2021-02-01 13:23:05 +00:00
Tim Northover	e5a10e4821	GlobalISel: check type size before getZExtValue()ing it. Otherwise getZExtValue() asserts.	2021-02-01 12:43:33 +00:00
Cullen Rhodes	a70388496a	[LV] Fix crash when computing max VF too early D90687 introduced a crash: llvm::LoopVectorizationCostModel::computeMaxVF(llvm::ElementCount, unsigned int): Assertion `WideningDecisions.empty() && Uniforms.empty() && Scalars.empty() && "No decisions should have been taken at this point"' failed. when compiling the following C code: typedef struct { char a; } b; b *c; int d, e; int f() { int g = 0; for (; d; d++) { e = 0; for (; e < c[d].a; e++) g++; } return g; } with: clang -Os -target hexagon -mhvx -fvectorize -mv67 testcase.c -S -o - This occurred since prior to D90687 computeFeasibleMaxVF would only be called in computeMaxVF when a scalar epilogue was allowed, but now it's always called. This causes the assert above since computeFeasibleMaxVF collects all viable VFs larger than the default MaxVF, and for each VF calculates the register usage which results in analysis being done the assert above guards against. This can occur in computeFeasibleMaxVF if TTI.shouldMaximizeVectorBandwidth and this target hook is implemented in the hexagon backend to always return true. Reported by @iajbar. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D94869	2021-02-01 12:14:59 +00:00
Sander de Smalen	77ed25e06f	NFC: Migrate SpeculativeExecution to work on InstructionCost This patch migrates cost values and arithmetic to work on InstructionCost. When the interfaces to TargetTransformInfo are changed, any InstructionCost state will propagate naturally. See this patch for the introduction of the type: https://reviews.llvm.org/D91174 See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2020-November/146408.html Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D95356	2021-02-01 12:13:23 +00:00
Dmitry Preobrazhensky	d2dbf6a661	[AMDGPU][MC] Corrected error position for invalid operands Generic parser may report an incorrect error position when an offending operand is followed by a comma. See bug 48884 for details: https://bugs.llvm.org/show_bug.cgi?id=48884. Differential Revision: https://reviews.llvm.org/D95674	2021-02-01 14:31:08 +03:00
xgupta	ddd3adad6f	[Branch-Rename] Fix some links According to the status of branch rename (https://foundation.llvm.org/docs/branch-rename/), the master branch of the LLVM repository was removed on 28 Jan 2021. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D95766	2021-02-01 16:43:21 +05:30
David Green	9248cc33ab	[ARM] Turn sext_inreg(VGetLaneu) into VGetLanes This adds a DAG combine for converting sext_inreg of VGetLaneu into VGetLanes, providing the types match correctly. Differential Revision: https://reviews.llvm.org/D95073	2021-02-01 11:10:35 +00:00
Jeroen Dobbelaere	7618d9f77d	[Verifier] enable llvm.experimental.noalias.scope.decl dominance check. Now that Loop Peeling has been fixed (80cdd30eb90c3509bf315f1fa1369483e2448bbd), enable the dominance check by default. This reverts commit 3b5d36ece21f9baf96d82944b0165cb352443bee.	2021-02-01 11:53:01 +01:00
Simon Pilgrim	4720508a81	[X86][AVX] combineExtractWithShuffle - combine extracts from 256/512-bit vector shuffles. We can only legally extract from the lowest 128-bit subvector, so extract the correct subvector to allow us to handle 256/512-bit vector element extracts.	2021-02-01 10:31:43 +00:00
David Green	96fb4c7ba1	[ARM] Simplify extract of VMOVDRR Under SoftFP calling conventions, we can be left with extract(bitcast(BUILD_VECTOR(VMOVDRR(a, b), ..))) patterns that can simplify to a or b, depending on the extract lane. Differential Revision: https://reviews.llvm.org/D94990	2021-02-01 10:24:57 +00:00
Kazushi (Jam) Marukawa	1b359f6379	[VE] Change integer constants to be 32-bit friendly Correct integer constants like `1UL << 63` to `UINT64_C(1) << 63` in order to make them work on 32-bit machines. Tested on both i386 and x86_64 machines. Reviewed By: mgorny Differential Revision: https://reviews.llvm.org/D95724	2021-02-01 19:00:47 +09:00
Florian Hahn	7b52facd8c	[LoopUnswitch] Pacify compiler warnings. Attempt to fix some compiler warnings on some bots after b8c81fa5c7f77a7a1267e42ddbbc9bffb10b0817.	2021-02-01 09:13:06 +00:00
Florian Hahn	a6f31d20e3	[LoopUnswitch] Add shortcut if unswitched path is a no-op. If we determine that the invariant path through the loop has no effects, we can directly branch to the exit block, instead of unswitching first. Besides avoiding some extra work (unswitching first, then deleting the loop again) this allows us to be more aggressive than regular unswitching with respect to cost-modeling. This approach should always be desirable. This is similar in spirit to D93734, just that it uses the previously added checks for loop-unswitching. I tried to add the required no-op checks from scratch, as we only check a subset of the loop. There is potential to unify the checks with LoopDeletion, at the cost of adding a predicate whether a block should be considered. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D95468	2021-02-01 09:03:30 +00:00
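
The rewrite described in commit 136420859e above, (srl (and X, 0xffff), C) -> (srli (slli X, 16), 16 + C), can be illustrated with a minimal sketch in plain C++. This is not the LLVM selection code. It assumes a 32-bit register width and a shift amount C in the range 0..15, and the function names (maskThenShift, shiftPair, formsAgree) are hypothetical, chosen only for this example.

```cpp
#include <cstdint>

// Original form: mask the low 16 bits, then shift right logically by C.
// Assumption for this sketch: 32-bit values and 0 <= C <= 15.
std::uint32_t maskThenShift(std::uint32_t X, unsigned C) {
  return (X & 0xffffu) >> C;
}

// Rewritten form: shift left by 16 to discard the upper bits, then shift
// right logically by 16 + C so zeros come in from the top.
// With C <= 15 the total shift stays below 32, so it is well defined.
std::uint32_t shiftPair(std::uint32_t X, unsigned C) {
  return (X << 16) >> (16 + C);
}

// Compares both forms on a few sample values for every C in 0..15.
bool formsAgree() {
  const std::uint32_t Samples[] = {0u,       1u,          0xffffu,
                                   0x10000u, 0xDEADBEEFu, 0xffffffffu};
  for (std::uint32_t X : Samples)
    for (unsigned C = 0; C < 16; ++C)
      if (maskThenShift(X, C) != shiftPair(X, C))
        return false;
  return true;
}
```

Both forms give the same result on these inputs. The shift pair needs no 0xffff constant, which is the saving the commit message describes.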


210567 Commits