llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
David Green	fd282c40e4	[ARM] Guard against loop variant gather ptr operands This ensures that the operands of any gather/scatter instructions that we attempt to push out of the loop are invariant, preventing invalid IR from being generated.	2021-05-30 18:02:14 +01:00
Ben Shi	f7319a117e	[AVR] Improve inline assembly Reviewed By: dylanmckay Differential Revision: https://reviews.llvm.org/D96394	2021-05-30 23:44:43 +08:00
Florian Hahn	14d190e2ce	[LoopDeletion] Add more tests with infinite sub-loops & mustprogress. A couple of additional tests inspired by PR50511.	2021-05-30 16:41:57 +01:00
Florian Hahn	24266d4661	[VectorCombine] Add tests with noundef index for load scalarization.	2021-05-30 12:15:41 +01:00
Sanjay Patel	c5aaaaa9b9	[InstCombine] fix miscompile from vector select substitution This is similar to the fix in c590a9880d7a ( PR49832 ), but we missed handling the pattern for select of bools (no compare inst). We can't substitute a vector value because the equality condition replacement that we are attempting requires that the condition is true/false for the entire value. Vector select can be partly true/false. I added an assert for vector types, so we shouldn't hit this again. Fixed formatting while auditing the callers. https://llvm.org/PR50500	2021-05-30 07:11:58 -04:00
Florian Hahn	0fab9e3072	[DAGCombine] Poison-prove scalarizeExtractedVectorLoad. extractelement is poison if the index is out-of-bounds, so just scalarizing the load may introduce an out-of-bounds load, which is UB. To avoid introducing new UB, we can mask the index so it only contains valid indices. Fixes PR50382. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D103077	2021-05-30 11:40:55 +01:00
Mindong Chen	fee80286d7	[NFCI] Move DEBUG_TYPE definition below #includes When you try to define a new DEBUG_TYPE in a header file, a DEBUG_TYPE definition placed around the #includes in files that include it could result in redefinition warnings or even compile errors. Reviewed By: tejohnson Differential Revision: https://reviews.llvm.org/D102594	2021-05-30 17:31:01 +08:00
Pengxuan Zheng	e434a8e756	[SafeStack] Use proper API to get stack guard Using the proper API automatically sets `__stack_chk_guard` to `dso_local` if `Reloc::Static`. This wasn't strictly necessary until recently when dso_local was no longer implied by `TargetMachine::shouldAssumeDSOLocal` for `__stack_chk_guard`. By using the proper API, we can avoid generating unnecessary GOT relocations. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D102646	2021-05-30 00:52:48 -07:00
Arthur Eubanks	f9c1930dea	Revert "[TargetLowering] Only inspect attributes in the arguments for ArgListEntry" This reverts commit 1c7f32334d4becc725b9025fd32291a0e5729acd. Some code still needs to properly set parameter ABI attributes, see D101806.	2021-05-29 23:08:15 -07:00
Arthur Eubanks	85767d0682	Revert "[NFC] Use ArgListEntry indirect types more in ISel lowering" This reverts commit bc7d15c61da78864b35e3c114294d6e4db645611. Dependent change is to be reverted.	2021-05-29 22:40:33 -07:00
Fangrui Song	63eb5ce7f4	[InstrProfiling][test] Improve tests	2021-05-29 14:30:44 -07:00
David Green	bb36d4c212	[ARM] Guard against WhileLoopStart kill flags If the operand of the WhileLoopStart is flagged as killed, that currently gets propagated to both the t2CMPri as the instruction is reverted, and the newly created t2DoLoopStart. Only the second should remain as killing the operand, the first dropping the flags.	2021-05-29 21:04:26 +01:00
Jessica Clarke	246602f2b6	Revert "[RISCV] Remove -riscv-no-aliases in favour of new -M no-aliases" The replacement doesn't work for llc, but it is needed by patchable-function-entry.ll. This reverts commit aa9a30b83a06e3e5e68e32ea645ec2d9edc27efc.	2021-05-29 15:11:37 +01:00
Jessica Clarke	d9e3f6fdb8	[Support] Fix getMainExecutable on FreeBSD when called via an absolute path On FreeBSD, absolute paths are passed unmodified in AT_EXECPATH, but relative paths are resolved to absolute paths, and any symlinks will be followed in the process. This means that the resource dir calculation will be wrong if Clang is invoked as an absolute path to a symlink, and this currently causes clang/test/Driver/rocm-detect.hip to fail on FreeBSD. Thus, make sure to call realpath on the result, just like is done on macOS. Whilst here, clean up the old fallback auxargs loop to use the actual type for auxargs rather than using lots of hacky casts that rely on addresses and pointers being the same (which is not the case on CHERI, and thus Arm's prototype Morello, although for little-endian systems it happens to work still as the word-sized integer will be padded to a full pointer, and it's somewhat academic given dereferencing past the end of environ will give a bounds fault, but CheriBSD is new enough that the elf_aux_info path will be used). This also makes the code easier to follow, and removes the confusing double-increment of p. Reviewed By: dim, arichardson Differential Revision: https://reviews.llvm.org/D103346	2021-05-29 14:59:46 +01:00
Jessica Clarke	f8419889e5	[RISCV] Remove -riscv-no-aliases in favour of new -M no-aliases Whilst here, also remove a couple of unnecessary -o - instances. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D103201	2021-05-29 14:58:28 +01:00
Sanjay Patel	7de8b3d369	[InstCombine] fold zext of masked bit set/clear This does not solve PR17101, but it is one of the underlying diffs noted here: https://bugs.llvm.org/show_bug.cgi?id=17101#c8 We could ease the one-use checks for the 'clear' (no 'not' op) half of the transform, but I do not know if that asymmetry would make things better or worse. Proofs: https://rise4fun.com/Alive/uVB Name: masked bit set %sh1 = shl i32 1, %y %and = and i32 %sh1, %x %cmp = icmp ne i32 %and, 0 %r = zext i1 %cmp to i32 => %s = lshr i32 %x, %y %r = and i32 %s, 1 Name: masked bit clear %sh1 = shl i32 1, %y %and = and i32 %sh1, %x %cmp = icmp eq i32 %and, 0 %r = zext i1 %cmp to i32 => %xn = xor i32 %x, -1 %s = lshr i32 %xn, %y %r = and i32 %s, 1 Note: this is a re-post of a patch that I committed at: rGa041c4ec6f7a The commit was reverted because it exposed another bug: rGb212eb7159b40 But that has since been corrected with: rG8a156d1c2795189 ( D101191 ) Differential Revision: https://reviews.llvm.org/D72396	2021-05-29 08:52:26 -04:00
Sanjay Patel	d6808ae8f5	[InstCombine] reduce code duplication; NFC	2021-05-29 08:33:25 -04:00
Ulrich Weigand	bf93bb1269	[SystemZ] Set getExtendForAtomicOps to ISD::ANY_EXTEND The implementation of subword atomics does not actually guarantee the result is zero-extended, which now caused build bot failures after https://reviews.llvm.org/D101342 was landed.	2021-05-29 12:15:18 +02:00
LLVM GN Syncbot	3dc711e872	[gn build] Port b13edf6e907b	2021-05-29 07:51:43 +00:00
Nikita Popov	f403b1e9b2	[LoopUnroll] Make DomTree explicitly required (NFC) Some of the code was already assuming that DT is non-null, so make that requirement more explicit and remove unnecessary null checks.	2021-05-29 09:37:32 +02:00
LemonBoy	f2c842c94a	[AtomicExpandPass][AArch64] Promote xchg with floating-point types to integer ones Follow the same strategy used for atomic loads/stores by converting the operands to equally-sized integer types. This change prevents the atomic expansion pass from generating illegal LL/SC pairs when targeting AArch64: `expand-atomicrmw-xchg-fp.ll` would previously instantiate intrinsics such as `llvm.aarch64.ldaxr.p0f32` that cannot be lowered. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D103232	2021-05-29 08:57:27 +02:00
Fangrui Song	f8cdbd49ed	[InstrProfiling][test] Fix stale linkage.ll	2021-05-28 21:33:33 -07:00
Fangrui Song	4e9598d950	[InstrProfiling][test] Fix stale tests * Change linkage/visibility of __profn_ variables to match the reality * alwaysinline.ll: Add "EnableValueProfiling", otherwise it doesn't test available_externally alwaysinline. * Delete PR23499.ll - covered by other comdat tests.	2021-05-28 21:14:03 -07:00
Luke	7105889760	[RISCV] Enable interleaved vectorization for RVV Enable interleaved vectorization for RVV. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D101469	2021-05-29 11:03:27 +08:00
Fangrui Song	df99c4fbee	[Internalize] Simplify comdat renaming with noduplicates after D103043 I realized that we can use `comdat noduplicates` which is available on ELF. Add a special case for wasm which doesn't support the feature.	2021-05-28 16:58:38 -07:00
Amara Emerson	32418dbc82	[AArch64][GlobalISel] Fix a crash during selection of a G_ZEXT(s8 = G_LOAD) We have special handling for a zext of a load <32b because the load does a zext for free. In that case, we just select the G_ZEXT as if it were a copy but this triggered the copy checking code to balk at the mismatched size. This was being hidden because normally these get combined into G_ZEXTLOAD but for atomics this doesn't happen. The test case here just uses a normal load because the particular atomic isn't supported yet anyway.	2021-05-28 16:35:24 -07:00
Nikita Popov	3398903f57	[LoopUnroll] Use changeToUnreachable() (NFC) When fully unrolling with a non-latch exit, the latch block is folded to unreachable. Replace this folding with the existing changeToUnreachable() helper, rather than performing it manually. This also moves the fold to happen after the manual DT update for exit blocks. I believe this is correct in that the conversion of an unconditional backedge into unreachable should not affect the DT at all. Differential Revision: https://reviews.llvm.org/D103340	2021-05-29 00:11:21 +02:00
Craig Topper	3771a3d0c5	[RISCV] Add separate MxList tablegen classes for widening/narrowing and sext.zext.vf2/4/8. NFC This is cleaner than slicing the MxList to remove elements from the beginning or end since that requires hardcoding the size. I don't expect the size of the list to change, but we shouldn't repeat it in multiple places.	2021-05-28 14:06:19 -07:00
Nikita Popov	bd151f910c	[LoopUnroll] Add store to unreachable latch test (NFC) This is to show that we currently only convert the terminator to unreachable, but don't clean up instructions before it (unless trivial DCE removes them). Also clean up excessive whitespace in this test.	2021-05-28 22:49:23 +02:00
Nikita Popov	7e30b2046c	[LoopUnroll] Clean up exit folding (NFC) This does some non-functional cleanup of exit folding during unrolling. The two main changes are: * First rewrite latch->header edges, which is unrelated to exit folding. * Combine folding for latch and non-latch exits. After the previous change, the only difference in their logic is that for non-latch exits we currently only fold "known non-exit" cases, but not "known exit" cases. I think this helps a lot to clarify this code and prepare it for future changes. Differential Revision: https://reviews.llvm.org/D103333	2021-05-28 22:31:13 +02:00
Craig Topper	c8c9da08e9	[RISCV] Pre-commit test cases for D103211. NFC	2021-05-28 13:21:58 -07:00
Bardia Mahjour	78f57f88c4	[NFC] Remove confusing info about MainLoop VF/UF from debug message	2021-05-28 16:10:04 -04:00
Nico Weber	f1c70bb27d	[dsymutil tests] Try to make eh_frames.test run on other platforms We now have llvm-otool :)	2021-05-28 15:49:31 -04:00
Eli Friedman	1638fc9086	[AArch64][RISCV] Make sure isel correctly honors failure orderings. If a cmpxchg specifies acquire or seq_cst on failure, make sure we generate code consistent with that ordering even if the success ordering is not acquire/seq_cst. At one point, it was ambiguous whether this sort of construct was valid, but the C++ standard and LLVM now accept arbitrary combinations of success/failure orderings. This doesn't address the corresponding issue in AtomicExpand. (This was reported as https://bugs.llvm.org/show_bug.cgi?id=33332 .) Fixes https://bugs.llvm.org/show_bug.cgi?id=50512. Differential Revision: https://reviews.llvm.org/D103284	2021-05-28 12:47:40 -07:00
Nico Weber	311aa386aa	[gn build] manually port 982e3c05108b6 (check-lld needs dsymutil)	2021-05-28 15:39:12 -04:00
LLVM GN Syncbot	995b39f2fa	[gn build] Port 9968896cd62a	2021-05-28 18:57:30 +00:00
Craig Topper	a9af2ebabe	[RISCV] Add octuple to LMULInfo tablegen class, remove octuple_from_str. NFCI octuple_from_str was always used with the MX field from an LMULInfo. Might as well just precompute it and put it in the class.	2021-05-28 11:53:05 -07:00
Craig Topper	22fc6f8fbe	[VP] Make getMaskParamPos/getVectorLengthParamPos return unsigned. Lowercase function names. Parameter positions seem like they should be unsigned. While there, make function names lowercase per coding standards. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D103224	2021-05-28 11:28:47 -07:00
Craig Topper	7d56e782b7	[SelectionDAG] Fix typo in assert. NFC	2021-05-28 10:37:11 -07:00
Florian Hahn	9f507ddd21	[VectorCombine] Check indices for all extracts we scalarize. We need to make sure that the indices of all extracts we scalarize are valid.	2021-05-28 18:35:29 +01:00
Florian Hahn	7011f51b3d	[VectorCombine] Add variants of multi-extract tests with assumes.	2021-05-28 18:35:24 +01:00
Stefan Pintilie	70b417a023	Revert "Return "[LoopDeletion] Break backedge if we can prove that the loop is exited on 1st iteration" (try 2)" This reverts commit be1a23203b1de655b8c7dac7549818d975a0cbbf.	2021-05-28 12:21:22 -05:00
Stefan Pintilie	d108ba73ef	Revert "[NFCI][LoopDeletion] Only query SCEV about loop successor if another successor is also in loop" This reverts commit b0b2bf3b5da950679db1431aae431a6dedea2245.	2021-05-28 12:21:22 -05:00
Stefan Pintilie	5950d60ff6	Revert "[NFC] Formatting fix" This reverts commit 59d938e649e62db0cef4903d495e838fbc6a6eb8.	2021-05-28 12:21:22 -05:00
Stefan Pintilie	baca97070f	Revert "[NFC] Reuse existing variables instead of re-requesting successors" This reverts commit c467585682dcdda75e645ef3ab47c8b48440db12.	2021-05-28 12:21:22 -05:00
Stefan Pintilie	32cd617a76	Revert "[NFCI][LoopDeletion] Do not call complex analysis for known non-zero BTC" This reverts commit 7d418dadf6b1e6fd9bcccf7c5b5e1db74992ee70.	2021-05-28 12:21:21 -05:00
Sanjay Patel	c3c1a4a935	[PassManager] unify late simplifycfg options between regular and LTO pipelines This is split off from D102002, and I think it is clear that the difference in behavior was not intended. Options were added to SimplifyCFG over time, but different chunks of the pass pipelines were not kept in sync.	2021-05-28 13:06:49 -04:00
Sanjay Patel	87b311dfb8	[PhaseOrdering] add test for late simplifycfg with LTO; NFC Part of D102002	2021-05-28 13:06:48 -04:00
Florian Hahn	215ae5f8ab	[LoopDeletion] Add test with potentially infinite sub-loop. Tests for PR50511.	2021-05-28 17:45:44 +01:00
Nemanja Ivanovic	a9a8ac3fc8	Revert "Fix "enumerator 'llvm::TargetStackID::WasmLocal' in switch of enum 'llvm::TargetStackID::Value' is not handled" MSVC warnings. NFCI." Since ca5f07f8c4bc96d16ed1992b810aa3897df157f2 already reverted the cause for this warning, this commit now causes warnings about a default label in a switch that covers the enum. This reverts commit cf2eeb114c59cfc3a80133e96c585188fa16cc98.	2021-05-28 10:53:49 -05:00
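
The entry for 7de8b3d369 ([InstCombine] fold zext of masked bit set/clear) lists two IR rewrites on i32 values. As a rough illustration only, the sketch below models both sides of each rewrite with plain Python integers and compares them for shift amounts 0 to 31 (larger shifts are poison in LLVM IR and are skipped). The function names and sample values are hypothetical and are not taken from the commit. This is a spot check on a few inputs, not a replacement for the Alive proof linked in the message.

```python
MASK32 = 0xFFFFFFFF  # model i32 values as Python ints in [0, 2**32)

def bit_set_before(x, y):
    # %sh1 = shl 1, %y ; %and = and %sh1, %x ; %cmp = icmp ne %and, 0 ; zext
    sh1 = (1 << y) & MASK32
    return int((sh1 & x) != 0)

def bit_set_after(x, y):
    # %s = lshr %x, %y ; %r = and %s, 1
    return (x >> y) & 1

def bit_clear_before(x, y):
    # Same as bit_set_before, but with icmp eq instead of icmp ne
    sh1 = (1 << y) & MASK32
    return int((sh1 & x) == 0)

def bit_clear_after(x, y):
    # %xn = xor %x, -1 ; %s = lshr %xn, %y ; %r = and %s, 1
    xn = (x ^ MASK32) & MASK32
    return (xn >> y) & 1

def check_folds(samples):
    """Return True if both rewrites agree for every sample and every in-range shift."""
    for x in samples:
        for y in range(32):  # shifts >= 32 are poison for i32, so they are not modelled
            if bit_set_before(x, y) != bit_set_after(x, y):
                return False
            if bit_clear_before(x, y) != bit_clear_after(x, y):
                return False
    return True

if __name__ == "__main__":
    # Hypothetical sample inputs chosen to cover the edge bits
    print(check_folds([0, 1, 0x80000000, 0xFFFFFFFF, 0x12345678]))
```
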


216553 Commits