llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 02:33:06 +01:00

Author	SHA1	Message	Date
Sanjay Patel	5a14ea148e	[InstSimplify] fold min/max with limit constant This is already done within InstCombine: https://alive2.llvm.org/ce/z/MiGE22 ...but leaving it out of analysis makes it harder to avoid infinite loops there. (cherry picked from commit e260e10c4a21784c146c94a2a14b7e78b09a9cf7)	2021-08-16 11:35:29 -07:00
Johannes Doerfert	86cba7bf5e	[InstSimplify] Expose generic interface for replaced operand simplification Users, especially the Attributor, might replace multiple operands at once. The actual implementation of simplifyWithOpReplaced is able to handle that just fine; the interface simply did not allow replacing more than one operand at a time. This exposes a more generic interface without intended changes for existing code. Differential Revision: https://reviews.llvm.org/D106189	2021-07-27 00:56:12 -05:00
Kevin P. Neal	1696270de0	[FPEnv][InstSimplify] Constrained FP support for NaN Currently InstructionSimplify.cpp knows how to simplify floating point instructions that have a NaN operand. It does not know how to handle the matching constrained FP intrinsic. This patch teaches it how to simplify so long as the exception handling is not "fpexcept.strict". Differential Revision: https://reviews.llvm.org/D103169	2021-07-09 11:26:28 -04:00
Sanjay Patel	b4665b3453	[InstSimplify] fix bug in poison propagation for FP ops If any operand of a math op is poison, that takes precedence over general undef/NaN. This should not be visible with binary ops because it requires 2 constant operands to trigger (and if both operands of a binop are constant, that should get handled first in ConstantFolding).	2021-07-06 14:06:50 -04:00
Sanjay Patel	ab96742bc5	[InstSimplify] fold extractelement of splat with variable extract index We already have a fold for variable index with constant vector, but if we can determine a scalar splat value, then it does not matter whether that value is constant or not. We overlooked this fold in D102404 and earlier patches, but the fixed vector variant is shown in: https://llvm.org/PR50817 Alive2 agrees on that: https://alive2.llvm.org/ce/z/HpijPC The same logic applies to scalable vectors. Differential Revision: https://reviews.llvm.org/D104867	2021-07-05 08:19:40 -04:00
Sanjay Patel	1cbf22d059	[InstSimplify] do not propagate poison from select arm to icmp user This is the cause of the miscompile in: https://llvm.org/PR50944 The problem has likely existed for some time, but it was made visible with: 5af8bacc94024 ( D104661 ) handleOtherCmpSelSimplifications() assumed it can convert select of constants to bool logic ops, but that does not work with poison. We had a very similar construct in InstCombine, so the fix here mimics the fix there. The bug is in instsimplify, but I'm not sure how to reproduce it outside of instcombine. The reason this is visible in instcombine is because we have a hack (FIXME) to bypass simplification of a select when it has an icmp user: `955f125899/llvm/lib/Transforms/InstCombine/InstCombineSelect.cpp (L2632)` So we get to an unusual case where we are trying to simplify an instruction that has an operand that would have already simplified if we had processed it in normal order. Differential Revision: https://reviews.llvm.org/D105298	2021-07-01 17:40:07 -04:00
Sanjay Patel	a6d45cd7cf	[InstSimplify] move extract with undef index fold; NFC This puts it closer to the other undef query check and will avoid a potential ordering problem if we allow folding non-constant-int indexes.	2021-06-24 13:22:10 -04:00
Juneyoung Lee	bc3eadbb4a	[InstSimplify] Add more poison folding optimizations This adds more poison folding optimizations to InstSimplify. Since all binary operators propagate poison, these are fine. Also, the precondition of `select cond, undef, x` -> `x` is relaxed to allow the case when `x` is undef. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104661	2021-06-23 20:25:24 +09:00
Juneyoung Lee	b719c3f4a5	[InstSimplify] icmp poison, X -> poison This adds a simple transformation from icmp with poison constant to poison. Comparing poison with something else is poison, so this is okay. https://alive2.llvm.org/ce/z/e8iReb https://alive2.llvm.org/ce/z/q4MurY	2021-06-20 15:39:07 +09:00
Sanjay Patel	31ec72a21f	[InstSimplify] propagate poison through FP ops We already have this fold: fadd float poison, 1.0 --> poison ...via ConstantFolding, so this makes the behavior consistent if the other operand(s) are non-constant. The fold for undef was added before poison existed as a value/type in IR. This came up in D102673 / D103169 because we're trying to sort out the more complicated handling for constrained math ops. We should have the handling for the regular instructions done first, so we can build on that (or diverge as needed). Differential Revision: https://reviews.llvm.org/D104383	2021-06-16 11:31:58 -04:00
Arthur Eubanks	3103303429	[InstSimplify] Treat invariant group insts as bitcasts for load operands We can look through invariant group intrinsics for the purposes of simplifying the result of a load. Since intrinsics can't be constants, but we also don't want to completely rewrite load constant folding, we convert the load operand to a constant. For GEPs and bitcasts we just treat them as constants. For invariant group intrinsics, we treat them as a bitcast. Relanding with a check for self-referential values. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D101103	2021-06-15 12:59:43 -07:00
Arthur Eubanks	1ea77f3a1a	Revert "[InstSimplify] Treat invariant group insts as bitcasts for load operands" This reverts commit 26044c6a54de3e03c73c5515702b95acdb0b7f22. Breaks on invalid IR (see D101103).	2021-06-09 11:46:10 -07:00
Caroline Concatto	dcc4b87d6f	[InstCombine] Add instcombine fold for extractelement + splat for scalable vectors This patch allows scalable vectors to use the fold that already exists for fixed vectors, but only when the lane index is lower than the minimum number of elements of the vector. Differential Revision: https://reviews.llvm.org/D102404	2021-06-08 10:43:38 +01:00
Arthur Eubanks	3b6a5ff6b4	[InstSimplify] Treat invariant group insts as bitcasts for load operands We can look through invariant group intrinsics for the purposes of simplifying the result of a load. Since intrinsics can't be constants, but we also don't want to completely rewrite load constant folding, we convert the load operand to a constant. For GEPs and bitcasts we just treat them as constants. For invariant group intrinsics, we treat them as a bitcast. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D101103	2021-06-01 16:33:06 -07:00
Sanjay Patel	c5aaaaa9b9	[InstCombine] fix miscompile from vector select substitution This is similar to the fix in c590a9880d7a ( PR49832 ), but we missed handling the pattern for select of bools (no compare inst). We can't substitute a vector value because the equality condition replacement that we are attempting requires that the condition is true/false for the entire value. Vector select can be partly true/false. I added an assert for vector types, so we shouldn't hit this again. Fixed formatting while auditing the callers. https://llvm.org/PR50500	2021-05-30 07:11:58 -04:00
Sanjay Patel	831c97ae39	[InstSimplify] allow undef element match in vector select condition value The semantics of select with undefined/poison condition are not explicitly stated in the LangRef, but this matches comments in the code and Alive2 appears to concur: https://alive2.llvm.org/ce/z/KXytmd We can find this pattern after demanded elements transforms. As noted in D101191, fuzzers are finding infinite loops because we may not account for this pattern in other passes.	2021-05-25 14:25:34 -04:00
David Goldblatt	ce067f4056	[InstSimplify] Transform X * Y % Y --> 0 simplifyDiv already handles the case X * Y / Y --> X (barring overflow). This adds the equivalent handling to simplifyRem. Correctness: https://alive2.llvm.org/ce/z/J2cUbS https://alive2.llvm.org/ce/z/us9NUM https://alive2.llvm.org/ce/z/AvaDGJ https://alive2.llvm.org/ce/z/kq9ige Extending the situations in which we apply this transform would not be correct: https://alive2.llvm.org/ce/z/Lf9V63 https://alive2.llvm.org/ce/z/6RPQK3 https://alive2.llvm.org/ce/z/p9UdxC https://alive2.llvm.org/ce/z/A2zlhE https://alive2.llvm.org/ce/z/vHTtLw https://alive2.llvm.org/ce/z/lvpH42 Differential Revision: https://reviews.llvm.org/D102864	2021-05-25 10:16:04 -04:00
Joe Ellis	bbfb092906	[InstSimplify] Properly constrain {insert,extract}_subvector intrinsic fold The previous rule: (insert_vector _, (extract_vector X, 0), 0) -> X is not quite correct. The correct fold should be: (insert_vector Y, (extract_vector X, 0), 0) -> X where: Y is X, or Y is undef This commit updates the pattern. Reviewed By: peterwaller-arm, paulwalker-arm Differential Revision: https://reviews.llvm.org/D102699	2021-05-21 10:05:03 +00:00
Nikita Popov	d8b6f240a9	[IR] Add BasicBlock::isEntryBlock() (NFC) This is a recurring and somewhat awkward pattern. Add a helper method for it.	2021-05-15 12:41:58 +02:00
Joe Ellis	3dc0c1e7a7	[InstSimplify] Remove redundant {insert,extract}_vector intrinsic chains This commit removes some redundant {insert,extract}_vector intrinsic chains by implementing the following patterns as instsimplifies: (insert_vector _, (extract_vector X, 0), 0) -> X (extract_vector (insert_vector _, X, 0), 0) -> X Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D101986	2021-05-13 16:09:50 +00:00
Juneyoung Lee	c112624b0b	[InstCombine] Fold overflow bit of [u|s]mul.with.overflow in a poison-safe way As discussed in D101191, this patch adds a poison-safe folding of overflow bit check: ``` %Op0 = icmp ne i4 %X, 0 %Agg = call { i4, i1 } @llvm.[us]mul.with.overflow.i4(i4 %X, i4 %Y) %Op1 = extractvalue { i4, i1 } %Agg, 1 %ret = select i1 %Op0, i1 %Op1, i1 false => %Y.fr = freeze %Y %Agg = call { i4, i1 } @llvm.[us]mul.with.overflow.i4(i4 %X, i4 %Y.fr) %Op1 = extractvalue { i4, i1 } %Agg, 1 %ret = %Op1 ``` https://alive2.llvm.org/ce/z/zgPUGT https://alive2.llvm.org/ce/z/h2gZ_6 Note that there are cases where inserting freeze is not necessary: e.g. %Y is `noundef`. In this case, LLVM is already good because `%ret` is already successfully folded into `and`, triggering the pre-existing optimization in InstSimplify: https://godbolt.org/z/v6qena15K Differential Revision: https://reviews.llvm.org/D101423	2021-05-02 11:54:12 +09:00
Sanjay Patel	b3bc645e79	[InstSimplify] generalize ctlz-of-shifted-constant https://alive2.llvm.org/ce/z/zWL_VQ	2021-04-21 14:23:55 -04:00
Nikita Popov	f8a60b9733	Revert "[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543)" This reverts commit ea1a0d7c9ae3e5232a4163fc67efad4aabd51f2b. While this is strictly more powerful, it is also strictly slower. InstSimplify intentionally does not perform many folds that it is allowed to perform, if doing so requires a KnownBits calculation that will be repeated in InstCombine. Maybe it's worthwhile to do this here, but that needs a more explicitly stated motivation, evaluated in a review.	2021-04-21 09:55:25 +02:00
Roman Lebedev	b7b50a3bdd	[InstSimplify] Bypass no-op `and`-mask, using known bits (PR49543) We already special-cased a few interesting patterns, but that is strictly less powerful than using KnownBits. So instead get the known bits for the operand of `and`, and iff all the unset bits of the `and`-mask are known to be zeros in the operand, we can omit said `and`.	2021-04-21 00:31:46 +03:00
Sanjay Patel	f26eaa7622	[InstSimplify] improve efficiency for detecting non-zero value Stepping through callstacks in the example from D99759 reveals this potential compile-time improvement. The savings come from avoiding ValueTracking's computing known bits if we have already dealt with special-case patterns. Further improvements in this direction seem possible. This makes a degenerate test based on PR49785 about 40x faster (25 sec -> 0.6 sec), but it does not address the larger question of how to limit computeKnownBitsFromAssume(). Ie, the original test there is still infinite-time for all practical purposes. Differential Revision: https://reviews.llvm.org/D100408	2021-04-14 09:04:15 -04:00
Roman Lebedev	d9b2a4b5e7	[NFC][ConstantRange] Add 'icmp' helper method "Does the predicate hold between two ranges?" Not very surprisingly, some places were already doing this check without explicitly naming the algorithm; clean them all up.	2021-04-10 19:38:55 +03:00
Roman Lebedev	4fdceaffc0	Revert "[NFC][ConstantRange] Add 'icmp' helper method" This reverts commit 17cf2c94230bc107e7294ef84fad3b47f4cd1b73.	2021-04-10 19:37:53 +03:00
Roman Lebedev	c6f9ab66c8	[NFC][ConstantRange] Add 'icmp' helper method "Does the predicate hold between two ranges?" Not very surprisingly, some places were already doing this check without explicitly naming the algorithm; clean them all up.	2021-04-10 19:09:52 +03:00
Florian Hahn	5c0788f272	[SimplifyInst] Use correct type for GEPs with vector indices. The current code does not properly handle vector indices unless they are the first index. At the moment LangRef gives the impression that the vector index must be the one and only index (https://llvm.org/docs/LangRef.html#getelementptr-instruction). But vector indices can appear at any position and according to the verifier there may be multiple vector indices. If that's the case, the number of elements must match. This patch updates SimplifyGEPInst to properly handle those additional cases. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99961	2021-04-06 17:56:10 +01:00
Sanjay Patel	032ed4beee	[InstSimplify] fix potential miscompile in select value equivalence This is the sibling fix to c590a9880d7a - as there, we can't substitute a vector value because the equality compare replacement that we are trying requires that the comparison is true for the entire value. Vector select can be partly true/false.	2021-04-05 16:52:34 -04:00
Sander de Smalen	1ba98aa252	Always emit error for wrong interfaces to scalable vectors, unless cmdline flag is passed. In order to bring up scalable vector support in LLVM incrementally, we introduced behaviour to emit a warning, instead of an error, when asking the wrong question of a scalable vector, like asking for the fixed number of elements. This patch puts that behaviour under a flag. The default behaviour is that the compiler will always error, which means that all LLVM unit tests and regression tests will now fail when a code-path is taken that still uses the wrong interface. The behaviour to demote an error to a warning can be individually enabled for tools that want to support experimental use of scalable vectors. This patch enables that behaviour when driving compilation from Clang. This means that for users who want to try out scalable-vector support, fixed-width codegen support, or build user-code with scalable vector intrinsics, Clang will not crash and burn when the compiler encounters such a case. This allows us to do away with the following pattern in many of the SVE tests: RUN: .... 2>%t RUN: cat %t | FileCheck --check-prefix=WARN WARN-NOT: warning: ... The behaviour to emit warnings is only temporary and we expect this flag to be removed in the future when scalable vector support is more stable. This patch also fixes the following tests: unittests: ScalableVectorMVTsTest.SizeQueries SelectionDAGAddressAnalysisTest.unknownSizeFrameObjects AArch64SelectionDAGTest.computeKnownBitsSVE_ZERO_EXTEND_VECTOR_INREG regression tests: Transforms/InstCombine/vscale_gep.ll Reviewed By: paulwalker-arm, ctetreau Differential Revision: https://reviews.llvm.org/D98856	2021-04-02 10:55:22 +01:00
Yang Fan	404db5968d	[InstSimplify] Fix unused variable warning (NFC) GCC warning: ``` /llvm-project/llvm/lib/Analysis/InstructionSimplify.cpp: In function ‘llvm::Value* SimplifyWithOpReplaced(llvm::Value, llvm::Value, llvm::Value, const llvm::SimplifyQuery&, bool, unsigned int)’: /llvm-project/llvm/lib/Analysis/InstructionSimplify.cpp:3993:15: warning: unused variable ‘SI’ [-Wunused-variable] 3993 | if (auto SI = dyn_cast<SelectInst>(I)) | ^~ ```	2021-03-24 09:56:36 +08:00
Juneyoung Lee	ba5c93c7e6	Reland "[InstCombine] Add simplification of two logical and/ors" This relands 07c3b97e184d5bd828b8a680cdce46e73f3db9fc (D96945) which was reverted by commit f49354838e526671e616d16199ebdee653b9f6fa. The two-stage compilation tests pass successfully on my machine.	2021-03-23 16:24:50 +09:00
Nikita Popov	c053a6554f	[InstCombine] Whitelist non-refining folds in SimplifyWithOpReplaced This is an alternative to D98391/D98585, playing things more conservatively. If AllowRefinement == false, then we don't use InstSimplify methods at all, and instead explicitly implement a small number of non-refining folds. Most cases are handled by constant folding, and I only had to add three folds to cover our unit tests / test-suite. While this may lose some optimization power, I think it is safer to approach from this direction, given how many issues this code has already caused. Differential Revision: https://reviews.llvm.org/D99027	2021-03-22 22:12:56 +01:00
Nikita Popov	befc0b53a8	[InstSimplify] Clean up SimplifyReplacedWithOp implementation (NFCI) Replace Op with RepOp up-front, and then always work with the new operands, rather than checking for replacement in various places.	2021-03-21 15:30:30 +01:00
Simonas Kazlauskas	59b63b74d5	[InstSimplify] Restrict a GEP transform to avoid provenance changes This is a follow-up to D98588, and fixes the inline `FIXME` about a GEP-related simplification not preserving the provenance. https://alive2.llvm.org/ce/z/qbQoAY Additional tests were added in {rGf125f28afdb59eba29d2491dac0dfc0a7bf1b60b} Depends on D98672 Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D98611	2021-03-16 18:53:05 +02:00
Simonas Kazlauskas	533dfa60d2	[InstSimplify] Match PtrToInt more directly in a GEP transform (NFC) In preparation for D98611, the upcoming change will need to apply additional checks to `P` and `V`, and so this refactor paves the way for adding additional checks in a less awkward way. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D98672	2021-03-16 15:45:19 +02:00
Sanjay Patel	69f00a58ef	[InstSimplify] ctlz({signbit} >>u x) --> x The motivating pattern was handled in 0a2d69480d , but we should have this for symmetry. But this really highlights that we could generalize for any shifted constant if we match this in instcombine. https://alive2.llvm.org/ce/z/MrmVNt	2021-03-15 12:03:35 -04:00
Bjorn Pettersson	bcace0dff8	[InstSimplify] Simplify smul.fix and smul.fix.sat Add simplification of smul.fix and smul.fix.sat according to X * 0 -> 0 X * undef -> 0 X * (1 << scale) -> X This includes the commuted patterns and splatted vectors. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D98299	2021-03-12 09:09:58 +01:00
Juneyoung Lee	f828df253e	Resolve unused variable warning (NFC)	2021-03-11 12:03:03 +09:00
Juneyoung Lee	2a52d0ee68	[InstSimplify] Pass SimplifyQuery to computePointerICmp (NFC)	2021-03-11 11:13:46 +09:00
Juneyoung Lee	0ac1e53c7e	Revert "[InstCombine] Add simplification of two logical and/ors" This reverts commit 07c3b97e184d5bd828b8a680cdce46e73f3db9fc due to a reported failure in two-stage build.	2021-03-10 05:48:31 +09:00
Sanjay Patel	cc5fb7e4a2	[ValueTracking] move/add helper to get inverse min/max; NFC We will need this functionality to improve min/max folds in instcombine when we canonicalize to intrinsics.	2021-03-08 17:38:22 -05:00
Sanjay Patel	7358b98bc7	[InstSimplify] cttz(1<<x) --> x https://alive2.llvm.org/ce/z/TDacYu https://alive2.llvm.org/ce/z/KF84S3	2021-03-08 16:30:14 -05:00
Juneyoung Lee	ad32346391	[InstCombine] Add simplification of two logical and/ors This is a patch that adds folding of two logical and/ors that share one variable: a && (a && b) -> a && b a && (a & b) -> a && b ... This is towards removing the poison-unsafe select optimization (D93065 has more context). Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D96945	2021-03-08 02:38:43 +09:00
Simon Pilgrim	a7e8ccf5c2	[InstSimplify] Handle nsw shl -> poison patterns Pulled out from D90479 - this recognises invalid nsw shl patterns with signbit changes that result in poison. Differential Revision: https://reviews.llvm.org/D97305	2021-02-23 18:26:56 +00:00
Simon Pilgrim	3934877b60	[InstructionSimplify] SimplifyShift - rename shift amount KnownBits. NFCI. As suggested on D97305.	2021-02-23 18:12:59 +00:00
Simon Pilgrim	9b05a382fc	[InstSimplify] Cleanup out-of-range shift amount handling. Use APInt::uge() directly instead of getLimitedValue(). Use KnownBits::getMinValue() to make the bounds check more obvious.	2021-02-22 17:00:49 +00:00
Caroline Concatto	9ea32f75fa	[CodeGen][SelectionDAG]Add new intrinsic experimental.vector.reverse This patch adds a new intrinsic experimental.vector.reverse that takes a single vector and returns a vector of matching type but with the original lane order reversed. For example: ``` vector.reverse(<A,B,C,D>) ==> <D,C,B,A> ``` The new intrinsic supports fixed and scalable vector types. The fixed-width vector relies on shufflevector to maintain existing behaviour. Scalable vector uses the new ISD node - VECTOR_REVERSE. This new intrinsic is one of the named shufflevector intrinsics proposed on the mailing-list in the RFC at [1]. Patch by Paul Walker (@paulwalker-arm). [1] https://lists.llvm.org/pipermail/llvm-dev/2020-November/146864.html Differential Revision: https://reviews.llvm.org/D94883	2021-02-15 13:39:43 +00:00
Juneyoung Lee	2da6d2b52c	[InstCombine,InstSimplify] Optimize select followed by and/or/xor This patch adds `A & (A && B)` -> `A && B` (similarly for or + logical or) Also, this patch adds `~(select C, (icmp pred X, Y), const)` -> `select C, (icmp pred' X, Y), ~const`. Alive2 proof: merge_and: https://alive2.llvm.org/ce/z/teMR97 merge_or: https://alive2.llvm.org/ce/z/b4yZUp xor_and: https://alive2.llvm.org/ce/z/_-TXHi xor_or: https://alive2.llvm.org/ce/z/2uYx_a Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D94861	2021-01-19 09:14:17 +09:00
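
Several of the commits listed above describe small integer folds: cttz(1<<x) --> x, ctlz({signbit} >>u x) --> x, X * Y % Y --> 0, and smul.fix with X * (1 << scale) -> X. The following is a minimal sketch, not LLVM code, that checks those identities exhaustively on a 4-bit model. The helper names (`cttz`, `ctlz`, `smul_fix`, and the `check_*` functions) are hypothetical, the no-overflow precondition on the remainder fold is an assumption based on the "barring overflow" wording in the simplifyDiv description, and `smul_fix` ignores bit-width wrapping and rounding.

```python
BITS = 4                      # small width so exhaustive checking is cheap
MASK = (1 << BITS) - 1        # all-ones value for the modelled width


def cttz(v):
    """Count trailing zeros of a BITS-wide value; returns BITS for 0."""
    if v == 0:
        return BITS
    n = 0
    while v & 1 == 0:
        v >>= 1
        n += 1
    return n


def ctlz(v):
    """Count leading zeros of a BITS-wide value; returns BITS for 0."""
    return BITS - v.bit_length()


def smul_fix(a, b, scale):
    """Fixed-point multiply modelled on unbounded ints (assumption: no wrapping)."""
    return (a * b) >> scale


def check_cttz_of_shifted_one():
    # cttz(1 << x) --> x for every in-range shift amount
    return all(cttz((1 << x) & MASK) == x for x in range(BITS))


def check_ctlz_of_shifted_signbit():
    # ctlz(signbit >>u x) --> x for every in-range shift amount
    signbit = 1 << (BITS - 1)
    return all(ctlz(signbit >> x) == x for x in range(BITS))


def check_urem_of_mul():
    # (X * Y) urem Y --> 0, assuming the multiply does not overflow and Y != 0
    return all((x * y) % y == 0
               for x in range(MASK + 1)
               for y in range(1, MASK + 1)
               if x * y <= MASK)


def wrapped_mul_counterexample():
    # With a wrapping multiply the remainder need not be 0, which is why
    # the fold cannot be applied unconditionally.
    for x in range(MASK + 1):
        for y in range(1, MASK + 1):
            if x * y > MASK and ((x * y) & MASK) % y != 0:
                return (x, y)
    return None


def check_smul_fix_identity():
    # X * (1 << scale) --> X and X * 0 --> 0 in the fixed-point model
    return all(smul_fix(x, 1 << s, s) == x and smul_fix(x, 0, s) == 0
               for x in range(-8, 8)
               for s in range(3))
```

A brute-force check like this only covers one narrow width; the Alive2 links in the commit messages remain the authoritative correctness arguments.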

786 Commits