llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 02:33:06 +01:00

Author	SHA1	Message	Date
Fraser Cormack	5e5115bbd0	[MemCpyOpt] Fix a variety of scalable-type crashes This patch fixes a variety of crashes resulting from the `MemCpyOptPass` casting `TypeSize` to a constant integer, whether implicitly or explicitly. Since the `MemsetRanges` requires a constant size to work, all but one of the fixes in this patch simply involve skipping the various optimizations for scalable types as cleanly as possible. The optimization of `byval` parameters, however, has been updated to work on scalable types in theory. In practice, this optimization is only valid when the length of the `memcpy` is known to be larger than the scalable type size, which is currently never the case. This could perhaps be done in the future using the `vscale_range` attribute. Some implicit casts have been left as they were, under the knowledge they are only called on aggregate types. These should never be scalably-sized. Reviewed By: nikic, tra Differential Revision: https://reviews.llvm.org/D109329 (cherry-picked from commit 7fb66d4)	2021-09-09 16:21:27 -07:00
Bjorn Pettersson	b37f5f2114	Inform pass manager when child loops are deleted As part of the nontrivial unswitching we could end up removing child loops. This patch adds a notification to the pass manager when that happens (using the markLoopAsDeleted callback). Without this there could be stale LoopAccessAnalysis results cached in the analysis manager. Those analysis results are cached based on a Loop* as key. Since the BumpPtrAllocator used to allocate Loop objects could be reset between different runs of, for example, the loop-distribute pass (running on different functions), a new Loop object could be created using the same Loop pointer. And then when requiring the LoopAccessAnalysis for the loop we got the stale (corrupt) result from the destroyed loop. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D109257 (fixes PR51754) (cherry-picked from commit 0f0344dd1e3b53387bb396070916e67f4c426da6)	2021-09-09 09:04:59 -07:00
Philip Reames	72d2352801	[AlignFromAssume] Bailout w/non-constant alignments (pr51680) This is a bailout for pr51680. This pass appears to assume that the alignment operand to an align tag on an assume bundle is constant. This doesn't appear to be required anywhere, and clang happily generates non-constant alignments for cases such as this case taken from the bug report: // clang -cc1 -triple powerpc64-- -S -O1 opal_pci-min.c extern int a[]; long *b; long c; void *d(long, int *, int, long, long, long) __attribute__((__alloc_align__(6))); void e() { b = d(c, a, 0, 0, 5, c); b[0] = 0; } This was exposed by a SCEV change which allowed a non-constant alignment to reach further into the pass' code. We could generalize the pass, but for now, let's fix the crash. (cherry picked from commit 9b45fd909ffa754acbb4e927bc2d55c7ab0d4e3f)	2021-09-01 17:36:37 -07:00
Dawid Jurczak	79ff6e4b2d	[LoopIdiom] Don't transform loop into memmove when load from body has more than one use This change fixes an issue found by Markus: https://reviews.llvm.org/rG11338e998df1 Before this patch the following code was transformed to memmove: for (int i = 15; i >= 1; i--) { p[i] = p[i-1]; sum += p[i-1]; } However, the load from p[i-1] is used not only by the store to p[i] but also by the sum computation. Therefore we cannot emit memmove in the loop header. Differential Revision: https://reviews.llvm.org/D107964 (cherry picked from commit bdcf04246c401aec9bdddf32fabc99fa4834a477)	2021-08-25 16:19:10 +02:00
Chris Jackson	910de616a4	[DebugInfo][LSR] Avoid crashes on large integer inputs SCEV-based salvaging in LSR translates SCEVs to DIExpressions. SCEVs may contain very large integers but the translation does not support integers greater than 64 bits. This patch adds checks to ensure conversions of these large integers is not attempted. A regression test is added to ensure no such translation is attempted. Reviewed by: StephenTozer PR: https://bugs.llvm.org/show_bug.cgi?id=51329 Differential Revision: https://reviews.llvm.org/D107438 (cherry picked from commit 21ee38e24f9801a567306b2a88defacf6e589a8b)	2021-08-05 10:38:19 +01:00
Jeremy Morse	46ad88f625	Follow-up to D105207, only salvage affine SCEVs to avoid a crash SCEVToIterCountExpr only expects to be fed affine expressions, but DbgRewriteSalvageableDVIs is feeding it non-affine induction variables. Following this up with an obvious fix, will add test coverage too if this avoids D105207 being reverted. (cherry picked from commit 2537120c870c04893636f171f553024f378c2de8)	2021-08-05 10:35:08 +01:00
Chris Jackson	ff86d9e5f0	[DebugInfo][LoopStrengthReduction] SCEV-based salvaging for LSR Reapply commit d675b594f4f1e1f6a195fb9a4fd02cf3de92292d that was reverted due to buildbot failures. A simple fix has been applied to remove an assertion. Differential Revision: https://reviews.llvm.org/D105207 (cherry picked from commit 0ba8595287ea2203ef2250e2b0b41f284a055518)	2021-08-05 10:34:33 +01:00
Sanjay Patel	2b94ecbbe0	[SROA] prevent crash on large memset length (PR50910) I don't know much about this pass, but we need a stronger check on the memset length arg to avoid an assert. The current code was added with D59000. The test is reduced from: https://llvm.org/PR50910 Differential Revision: https://reviews.llvm.org/D106462 (cherry picked from commit f2a322bfcfbc62b5523f32c4eded6faf2cad2e24)	2021-08-04 16:51:23 -07:00
Sanjay Patel	b92c9f9565	[DivRemPairs] make sure we have a valid CFG for hoisting division This transform was added with e38b7e894808ec2 and as shown in: https://llvm.org/PR51241 ...it could crash without an extra check of the blocks. There might be a more compact way to write this constraint, but we can't just count the successors/predecessors without affecting a test that includes a switch instruction. (cherry picked from commit 5b83261c1518a39636abe094123f1704bbfd972f)	2021-08-02 13:52:37 -07:00
Chris Jackson	9a10dd5b1c	Revert "[DebugInfo][LoopStrengthReduction] SCEV-based salvaging for LSR" This was reverted due to a reported crash. This reverts commit 796b84d26f4d461fb50e7b4e84e15a10eaca88fc.	2021-07-29 00:04:50 +01:00
Benjamin Kramer	c0d40a1ca3	Remove unused include that's also a layering violation. NFC.	2021-07-27 21:21:55 +02:00
Adam Nemet	d9613eb43c	[Matrix] Fix shape for factored transpose The shape of the input is C x R. Differential Revision: https://reviews.llvm.org/D106722	2021-07-27 11:36:13 -07:00
Adam Nemet	add64be20a	[Matrix] RAUW should only replace an instruction in ShapeMap if supportsShapeInfo As an instruction is replaced in optimizeTransposes RAUW will replace it in the ShapeMap (ShapeMap is ValueMap so that uses are updated). In finalizeLowering however we skip updating uses if they are in the ShapeMap since they will be lowered separately at which point we pick up the lowered operands. In the testcase what happened was that since we replaced the doubled-transpose with the shuffle, it ended up in the ShapeMap. As we lowered the columnwise-load the use in the shuffle was not updated. Then as we removed the original columnwise-load we changed that to an undef. I.e. we ended up with: ``` %shuf = shufflevector <8 x double> undef, <8 x double> poison, <6 x i32> ^^^^^ <i32 0, i32 1, i32 2, i32 4, i32 5, i32 6> ``` Besides the fix itself, I have fortified this last bit. As we change uses to undef when removing instruction we track the undefed instruction to make sure we eventually remove those too. This would have caught the issue at compile time. Differential Revision: https://reviews.llvm.org/D106714	2021-07-27 11:36:13 -07:00
Alexey Zhikhartsev	6516543c4b	Add jump-threading optimization for deterministic finite automata The current JumpThreading pass does not jump thread loops since it can result in irreducible control flow that harms other optimizations. This prevents switch statements inside a loop from being optimized to use unconditional branches. This code pattern occurs in the core_state_transition function of Coremark. The state machine can be implemented manually with goto statements resulting in a large runtime improvement, and this transform makes the switch implementation match the goto version in performance. This patch specifically targets switch statements inside a loop that have the opportunity to be threaded. Once it identifies an opportunity, it creates new paths that branch directly to the correct code block. For example, the left CFG could be transformed to the right CFG: ``` sw.bb sw.bb / \| \ / \| \ case1 case2 case3 case1 case2 case3 \ \| / / \| \ latch.bb latch.2 latch.3 latch.1 br sw.bb / \| \ sw.bb.2 sw.bb.3 sw.bb.1 br case2 br case3 br case1 ``` Co-author: Justin Kreiner @jkreiner Co-author: Ehsan Amiri @amehsan Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D99205	2021-07-27 14:34:04 -04:00
Anna Thomas	23f3f90569	Strip undef implying attributes when moving calls When hoisting/moving calls to locations, we strip unknown metadata. Such calls are usually marked `speculatable`, i.e. they are guaranteed to not cause undefined behaviour when run anywhere. So, we should strip attributes that can cause immediate undefined behaviour if those attributes are not valid in the context where the call is moved to. This patch introduces such an API and uses it in relevant passes. See updated tests. Fix for PR50744. Reviewed By: nikic, jdoerfert, lebedev.ri Differential Revision: https://reviews.llvm.org/D104641	2021-07-27 10:57:05 -04:00
Tres Popp	64d9d45951	Handle unused variable when assertions are disabled	2021-07-27 15:43:06 +02:00
Chris Jackson	fe743a5b25	[DebugInfo][LoopStrengthReduction] SCEV-based salvaging for LSR This reapplies commit 76f3ffb2b285998f02639db8fd42fb0de8a540d0 that was reverted due to buildbot failures. - Update lit tests with REQUIRES condition. - Abandon salvage attempt if SCEVUnknown::getValue() returns nullptr. Differential Revision: https://reviews.llvm.org/D105207	2021-07-27 14:22:09 +01:00
Chris Jackson	e13e00f7b3	[DebugInfo][LoopStrengthReduction] SCEV-based salvaging for LSR This reverts commit 76f3ffb2b285998f02639db8fd42fb0de8a540d0 because of a failure on sanitizer-X86-64-linux-autoconf.	2021-07-27 13:36:56 +01:00
Chris Jackson	2e863e863d	[DebugInfo][LoopStrengthReduction] SCEV-based salvaging for LSR This patch extends salvaging of debuginfo in the Loop Strength Reduction (LSR) pass by translating Scalar Evolution (SCEV) expressions into DIExpressions. The method is as follows: - Cache dbg.value intrinsics that are salvageable. - Obtain a loop Induction Variable (IV) from ScalarExpressionExpander or the loop header. - Translate the IV SCEV into an expression that recovers the current loop iteration count. Combine this with the dbg.value's location op SCEV to create a DIExpression that salvages the value. Review by: jmorse Differential Revision: https://reviews.llvm.org/D105207	2021-07-27 13:00:36 +01:00
Rosie Sumpter	565fcd6a48	[LoopFlatten] Use SCEV and Loop APIs to identify increment and trip count Replace pattern-matching with existing SCEV and Loop APIs as a more robust way of identifying the loop increment and trip count. Also rename 'Limit' as 'TripCount' to be consistent with terminology. Differential Revision: https://reviews.llvm.org/D106580	2021-07-27 08:42:59 +01:00
Johannes Doerfert	4ce16022d3	[Local] Do not introduce a new `llvm.trap` before `unreachable` This is the second attempt to remove the `llvm.trap` insertion after https://reviews.llvm.org/rGe14e7bc4b889dfaffb7180d176a03311df2d4ae6 reverted the first one. It is not clear what the exact issue was back then and it might already be gone by now, it has been >5 years after all. Replaces D106299. Differential Revision: https://reviews.llvm.org/D106308	2021-07-26 23:33:36 -05:00
Nikita Popov	c0e218377f	[MergeICmps] Collect block instructions once (NFC) Collect the relevant instructions for a given BCECmpBlock once on construction, rather than repeating this logic in multiple places.	2021-07-26 18:07:20 +02:00
Nikita Popov	fa9a368e39	[MergeICmps] Try to fix MSVC build failure Apparently this fails to line up the types -- try to sidestep the issue entirely by writing the code in a more reasonable way: Walk over the operands and perform a set lookup, rather than walking over the set and performing an operand scan.	2021-07-26 17:31:27 +02:00
Nikita Popov	6ebcde8cc9	[MergeICmps] Separate out BCECmp and use Optional (NFC) Separate out the BCECmp part from BCECmpBlock, which just stores the comparison atoms without the branch instruction. At the same time switch the code to return Optional<> rather than objects in invalid state and partially constructed objects.	2021-07-26 17:06:43 +02:00
Nikita Popov	bab200ac44	[IR] Consider non-willreturn as side effect (PR50511) This adjusts mayHaveSideEffect() to return true for !willReturn() instructions. Just like other side-effects, non-willreturn calls (aka "divergence") cannot be removed and cannot be reordered relative to other side effects. This fixes a number of bugs where non-willreturn calls are either incorrectly dropped or moved. In particular, it also fixes the last open problem in https://bugs.llvm.org/show_bug.cgi?id=50511. I performed a cursory review of all current mayHaveSideEffect() uses, which convinced me that these are indeed the desired default semantics. Places that do not want to consider non-willreturn as a sideeffect generally do not want mayHaveSideEffect() semantics at all. I identified two such cases, which are addressed by D106591 and D106742. Finally, there is a use in SCEV for which we don't really have an appropriate API right now -- what it wants is basically "would this be considered forward progress". I've just spelled out the previous semantics there. Differential Revision: https://reviews.llvm.org/D106749	2021-07-26 16:35:14 +02:00
Nikita Popov	2a84a8fcc2	[MergeICmps] Relax sinking check The check for sinking instructions past the load + cmp sequence currently checks for side-effects, which includes writing to memory and unwinding. However, I don't believe we care about sinking the instructions past an unwind (as they don't have any side-effects themselves). Differential Revision: https://reviews.llvm.org/D106591	2021-07-23 22:16:11 +02:00
Dawid Jurczak	c98a4cb73f	Revert "[DSE] Transform memset + malloc --> calloc (PR25892)" This reverts commit 43234b1595125ba2b5c23e7b28f5a67041c77673. Reason: We should detect that we are implementing 'calloc' and bail out.	2021-07-23 11:51:59 +02:00
Fangrui Song	fe2cf61945	[Matrix] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off build after D106457. NFC	2021-07-22 11:33:02 -07:00
Adam Nemet	cf091349e0	[Matrix] Fix miscompile for NT matmul if the transpose has other use We should only add the fake lowering entry for the matrix remark if the transpose is not lowered on its own. `MapVector::insert` is used to insert the entry during proper lowering which does not overwrite the fake entry in the map. We actually had test coverage for this but the reference output code was wrong; it was storing undef rather than the transposed column. Also add an assert that would have caught this. Differential Revision: https://reviews.llvm.org/D106457	2021-07-22 10:45:56 -07:00
Dawid Jurczak	60d27bc367	[LoopIdiom] Transform memmove-like loop into memmove (PR46179) The purpose of this patch is to teach the loop idiom recognition pass to recognize simple memmove patterns in a similar way to GCC: https://godbolt.org/z/fh95e83od LoopIdiomRecognize already has machinery for memset and memcpy recognition; this patch tries to extend the existing capabilities with minimal effort. Differential Revision: https://reviews.llvm.org/D104464	2021-07-22 13:05:43 +02:00
Sanjay Patel	ffb5e7ee28	[SROA] avoid crash on memset with constant expression length https://llvm.org/PR50888	2021-07-21 15:20:28 -04:00
Rosie Sumpter	b11b07e0b8	[LoopFlatten][LoopInfo] Use Loop to identify latch compare instruction Make getLatchCmpInst non-static and use it in LoopFlatten as a more robust way of identifying the compare. Differential Revision: https://reviews.llvm.org/D106256	2021-07-21 10:14:18 +01:00
Dawid Jurczak	a09883bcb0	[DSE] Transform memset + malloc --> calloc (PR25892) After this change DSE can eliminate malloc + memset and emit calloc. It's https://reviews.llvm.org/D101440 follow-up. Differential Revision: https://reviews.llvm.org/D103009	2021-07-20 11:39:05 +02:00
Artem Belevich	33f436abed	Revert "[MemCpyOpt] Enable memcpy optimizations unconditionally." This reverts commit 2c98298a7559dfe4a264ef1adaad0921526768cc which breaks sanitizers.	2021-07-19 14:27:41 -07:00
Artem Belevich	1e6cc7e040	[infer-address-spaces] Handle complex non-pointer constexpr arguments. Fixes https://bugs.llvm.org/show_bug.cgi?id=51099 Differential Revision: https://reviews.llvm.org/D106098	2021-07-19 12:15:52 -07:00
Artem Belevich	d635785a03	[MemCpyOpt] Enable memcpy optimizations unconditionally. The patch does not depend on the availability of the library functions for memcpy/memset as it operates on LLVM intrinsics. The optimizations are useful on the targets that have these functions disabled (e.g. NVPTX & AMDGPU). Differential Revision: https://reviews.llvm.org/D104801	2021-07-19 11:58:02 -07:00
maekawatoshiki	10a3d40457	[LICM] Create LoopNest Invariant Code Motion (LNICM) pass This patch adds a new pass called LNICM which is a LoopNest version of LICM and a test case to show how LNICM works. Basically, LNICM only hoists invariants out of loop nest (not a loop) to keep/make perfect loop nest. This enables later optimizations that require perfect loop nest. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D104180	2021-07-20 00:31:18 +09:00
Rosie Sumpter	839ef97cbf	[LoopFlatten] Use Loop to identify loop induction phi. NFC Replace code which identifies induction phi with helper function getInductionVariable to improve robustness. Differential Revision: https://reviews.llvm.org/D106045	2021-07-19 09:06:57 +01:00
Congzhe Cao	8f6ef387e2	[LoopInterchange] Check lcssa phis in the inner latch in scenarios of multi-level nested loops We already know that we need to check whether lcssa phis are supported in inner loop exit block or in outer loop exit block, and we have logic to check them already. Presumably the inner loop latch does not have lcssa phis and there is no code that deals with lcssa phis in the inner loop latch. However, that assumption is not true, when we have loops with more than two-level nesting. This patch adds checks for lcssa phis in the inner latch. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D102300	2021-07-16 11:59:20 -04:00
Max Kazantsev	e4a04110a4	[LSR] Handle case 1reg => reg. PR50918 This patch addresses assertion failure in case when the only found formula for LSR is `1reg => reg` which was supposed to be an impossible situation, however there is a test that shows it is possible. In this case, we can use scale register with scale of 1 as the missing base register. Reviewed By: huihuiz, reames Differential Revision: https://reviews.llvm.org/D105009	2021-07-16 11:33:59 +07:00
Arthur Eubanks	9d99a13a85	[SimpleLoopUnswitch] Don't non-trivially unswitch loops with catchswitch exits SplitBlock() can't handle catchswitch. Fixes PR50973. Reviewed By: aheejin Differential Revision: https://reviews.llvm.org/D105672	2021-07-14 14:07:28 -07:00
Arthur Eubanks	78dcef41e1	[NewPM][SimpleLoopUnswitch] Add option to not trivially unswitch To help with debugging non-trivial unswitching issues. Don't care about the legacy pass, nobody is using it. If a pass's string params are empty (e.g. "simple-loop-unswitch"), don't default to the empty constructor for the pass params. We should still let the parser take care of it in case the parser has its own defaults. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D105933	2021-07-13 16:09:42 -07:00
Eli Friedman	309b52e532	[LoopReroll] Add an extra defensive check to avoid SCEV assertion. Make sure getMinusSCEV() didn't return a pointer. The following check would never succeed if it was a pointer, anyway, but calling getMulExpr() on a pointer SCEV now asserts.	2021-07-13 12:17:09 -07:00
Arthur Eubanks	ea2a4162d1	[NFC] Inline variable to prevent unused variable warning	2021-07-13 09:57:59 -07:00
Arthur Eubanks	c0c934f6d5	[OpaquePtr] Use byval type more	2021-07-13 09:34:34 -07:00
Arthur Eubanks	a6bb782422	[OpaquePtr] Get load/store type without PointerType::getElementType()	2021-07-13 09:34:34 -07:00
Yevgeny Rouban	fe6c66d36f	[RS4GC] Use one DVCache for both inlineGetBaseAndOffset() and insertParsePoints() This new test demonstrates a case where a base ptr is generated twice for the same value: the first one is generated while the gc.get.pointer.base() is inlined, the second is generated for the statepoint. This happens because the methods inlineGetBaseAndOffset() and insertParsePoints() do not share their defining value cache used by the findBasePointer() method. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D103240	2021-07-12 18:13:00 +07:00
Craig Topper	34eef4175e	[DivRemPairs] Add an initial case for hoisting to a common predecessor. This patch adds support for hoisting the division and maybe the remainder for control flow graphs like this. ``` PredBB \| \ \| Rem \| / Div ``` If we have DivRem we'll hoist both to PredBB. If not we'll just hoist Div and expand Rem using the Div. This improves our codegen for something like this ``` __uint128_t udivmodti4(__uint128_t dividend, __uint128_t divisor, __uint128_t remainder) { if (remainder != 0) remainder = dividend % divisor; return dividend / divisor; } ``` Reviewed By: spatel, lebedev.ri Differential Revision: https://reviews.llvm.org/D87555	2021-07-11 10:03:07 -07:00
Eli Friedman	5e023f461c	[ScalarEvolution] Strictly enforce pointer/int type rules. Rules: 1. SCEVUnknown is a pointer if and only if the LLVM IR value is a pointer. 2. SCEVPtrToInt is never a pointer. 3. If any other SCEV expression has no pointer operands, the result is an integer. 4. If a SCEVAddExpr has exactly one pointer operand, the result is a pointer. 5. If a SCEVAddRecExpr's first operand is a pointer, and it has no other pointer operands, the result is a pointer. 6. If every operand of a SCEVMinMaxExpr is a pointer, the result is a pointer. 7. Otherwise, the SCEV expression is invalid. I'm not sure how useful rule 6 is in practice. If we exclude it, we can guarantee that ScalarEvolution::getPointerBase always returns a SCEVUnknown, which might be a helpful property. Anyway, I'll leave that for a followup. This is basically mop-up at this point; all the changes with significant functional effects have landed. Some of the remaining changes could be split off, but I don't see much point. Differential Revision: https://reviews.llvm.org/D105510	2021-07-09 17:29:26 -07:00
Arthur Eubanks	e6114424c3	[OpaquePtr][InferAddrSpace] Use PointerType::getWithSamePointeeType()	2021-07-09 10:29:08 -07:00
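
The two [LoopIdiom] entries above (60d27bc367 and 79ff6e4b2d) describe a source-level pattern that can be sketched in plain C. This is only an illustration of the loop shape and of why a second use of the load matters, not the pass's implementation. The function names `shift_up_loop`, `shift_up_memmove` and `shift_up_and_sum` are hypothetical and do not appear in the commits.

```c
#include <stddef.h>
#include <string.h>

/* Hypothetical helper: the backward-copy loop from the commit message,
   generalized to n elements. Each p[i-1] is loaded once and only stored. */
void shift_up_loop(int *p, size_t n) {
    if (n < 2)
        return;
    for (size_t i = n - 1; i >= 1; i--)
        p[i] = p[i - 1];
}

/* Hypothetical helper: the overlapping copy that the loop above amounts to.
   memmove is required (not memcpy) because source and destination overlap. */
void shift_up_memmove(int *p, size_t n) {
    if (n < 2)
        return;
    memmove(p + 1, p, (n - 1) * sizeof *p);
}

/* Hypothetical helper: the variant from the follow-up fix. The loaded value
   has a second use (the sum), so the loop cannot be replaced by a memmove
   alone; the loop must stay to compute the sum. */
int shift_up_and_sum(int *p, size_t n) {
    int sum = 0;
    if (n < 2)
        return sum;
    for (size_t i = n - 1; i >= 1; i--) {
        int v = p[i - 1]; /* one load, two uses */
        p[i] = v;
        sum += v;
    }
    return sum;
}
```

For any array, `shift_up_loop` and `shift_up_memmove` should leave identical contents, while `shift_up_and_sum` additionally returns a value that a bare `memmove` call would not produce.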

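The [DivRemPairs] entry (34eef4175e) mentions hoisting the division to a common predecessor and, when no combined div/rem is available, expanding the remainder from the quotient. A rough source-level sketch of that rewrite follows. It assumes 64-bit operands instead of the `__uint128_t` in the commit message, passes the remainder by pointer so the effect is observable, and assumes a nonzero divisor. The function names are hypothetical.

```c
#include <stdint.h>

/* Hypothetical "before" shape: the remainder is computed only on one path,
   the division on the join path. */
uint64_t udivmod_before(uint64_t dividend, uint64_t divisor,
                        uint64_t *remainder) {
    if (*remainder != 0)
        *remainder = dividend % divisor;
    return dividend / divisor;
}

/* Hypothetical "after" shape: the division is hoisted above the branch and
   the remainder is recomputed from the quotient as x - (x / y) * y. */
uint64_t udivmod_after(uint64_t dividend, uint64_t divisor,
                       uint64_t *remainder) {
    uint64_t quotient = dividend / divisor; /* hoisted to the predecessor */
    if (*remainder != 0)
        *remainder = dividend - quotient * divisor; /* rem expanded via div */
    return quotient;
}
```

Both functions should return the same quotient and leave the same value in `*remainder` for any nonzero divisor. The second form performs a single division, which is the point of the hoist.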

11104 Commits