llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Nikita Popov	7f56d08fc8	[OpaquePtr] Return opaque pointer from opaque pointer GEP For a GEP on an opaque pointer, also return an opaque pointer (or vector of opaque pointer) result. This requires explicitly enumerating the GEP source element type, because it is now no longer implicitly enumerated as part of either the source or result pointer types. Differential Revision: https://reviews.llvm.org/D104652	2021-06-21 18:36:32 +02:00
Hendrik Greving	6abba09064	RegisterCoalescer: Fix iterating through use operands. Fixes a minor bug when trying to iterate through use operands when updating debug use operands. Extends a test to include above. Differential Revision: https://reviews.llvm.org/D104576	2021-06-21 09:17:54 -07:00
Sanjay Patel	d8dba3ce64	[InstCombine] move bitmanipulation-of-select folds This is no outwardly-visible-difference-intended, but it is obviously better to have all transforms for an intrinsic housed together since we already have helper functions in place. It is also potentially more efficient to zap a simple pattern match before trying to do expensive computeKnownBits() calls.	2021-06-21 11:32:16 -04:00
Rosie Sumpter	844287b90f	[SLP][AArch64] Add SLP vectorizer regression test. NFC This test is for a missed SLP vectorizer opportunity, reported here https://bugs.llvm.org/show_bug.cgi?id=44593. This is due to a cost modelling issue with vector reduction intrinsics which will be fixed in a future commit (see https://reviews.llvm.org/D104538).	2021-06-21 16:31:00 +01:00
Sanjay Patel	ed82d06775	[InstCombine] fold ctlz/cttz-of-select with 1 or more constant arms Building on: 4c44b02d87 ...and adding handling for the extra operand in these intrinsics. This pattern is discussed in: https://llvm.org/PR50140	2021-06-21 11:04:12 -04:00
Matt Arsenault	ca489b9942	AMDGPU: Add missing tests for v_fma_mixlo	2021-06-21 10:58:53 -04:00
Sjoerd Meijer	dcef6b166c	[FuncSpec] Add minsize test. NFC.	2021-06-21 15:21:09 +01:00
Sam Tebbs	a7c1a9b580	[ARM] Transform a fixed-point to floating-point conversion into a VCVT_fix Conversion from a fixed-point number to a floating-point number is done by multiplying the fixed-point number by 2^(-n) where n is the number of fractional bits. Currently this is lowered to a vcvt (integer to floating-point) then a vmul, but it can instead be lowered directly to a vcvt (fixed-point to floating-point). This patch enables such transformations as long as the multiplication factor is a power of 2. Differential Revision: https://reviews.llvm.org/D103903	2021-06-21 14:14:09 +01:00
Sebastian Neubauer	a7a80ebf9c	[NFC] Fix typo	2021-06-21 14:59:30 +02:00
Bradley Smith	d2336f2398	[AArch64][SVE] Wire up vscale_range attribute to SVE min/max vector queries Differential Revision: https://reviews.llvm.org/D103702	2021-06-21 13:00:36 +01:00
Florian Hahn	1cebcabcfe	[LoopIdiom] Add test case that involves adds with flags and zero exts. Test coverage to ensure D104319 does not introduce a regression here.	2021-06-21 12:10:58 +01:00
Jordan Rupprecht	79bdbc37ef	[NFC] Wrap entire assert-only block in LLVM_DEBUG	2021-06-21 04:01:27 -07:00
Fraser Cormack	18c509d4ea	[VP][NFCI] Address various clang-tidy warnings Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D104288	2021-06-21 10:57:42 +01:00
Sebastian Neubauer	95f90b846b	[AMDGPU] Fix linking with shared libraries AMDGPULDSUtils depends on llvm::CallGraph.	2021-06-21 11:11:13 +02:00
Nikita Popov	64dbdb829e	[Mem2Reg] Regenerate test checks (NFC)	2021-06-21 11:06:28 +02:00
Nikita Popov	7f3872cdf6	[Mem2Reg] Use poison for unreachable cases Use poison instead of undef for cases dealing with unreachable code. This still leaves the more interesting case of "load from uninitialized memory" as undef.	2021-06-21 10:54:13 +02:00
Nikita Popov	192f74a255	[Mem2Reg] Regenerate test checks (NFC)	2021-06-21 10:47:59 +02:00
Juneyoung Lee	f69a2df9b9	[InstCombine] Fold icmp (select c,const,arg), null if icmp arg, null can be simplified This patch folds icmp (select c,const,arg), null if icmp arg, null can be simplified. Resolves llvm.org/pr48975. Reviewed By: nikic, xbolva00 Differential Revision: https://reviews.llvm.org/D96663	2021-06-21 17:39:05 +09:00
Sjoerd Meijer	a6fd3be6d5	[FuncSpec] Don't specialise functions with NoDuplicate instructions. getSpecializationCost was returning INT_MAX for a case when specialisation shouldn't happen, but this wasn't properly checked if specialisation was forced. Differential Revision: https://reviews.llvm.org/D104461	2021-06-21 09:02:11 +01:00
LLVM GN Syncbot	150a090f84	[gn build] Port 208332de8abf	2021-06-21 07:27:34 +00:00
Ruiling Song	02847853b4	[AMDGPU] Add Optimize VGPR LiveRange Pass. This pass aims to optimize VGPR live-range in a typical divergent if-else control flow. For example: def(a) if(cond) use(a) ... // A else use(a) As AMDGPU access vgpr with respect to active-mask, we can mark `a` as dead in region A. For details, please refer to the comments in implementation file. The pass is enabled by default, the frontend can disable it through "-amdgpu-opt-vgpr-liverange=false". Differential Revision: https://reviews.llvm.org/D102212	2021-06-21 15:25:55 +08:00
LLVM GN Syncbot	75f1abb9b8	[gn build] Port 80fd5fa5269c	2021-06-21 06:23:08 +00:00
hsmahesha	37c462f96a	[AMDGPU] Replace non-kernel function uses of LDS globals by pointers. The main motivation behind pointer replacement of LDS use within non-kernel functions is - to avoid subsequent LDS lowering pass from directly packing LDS (assume large LDS) into a struct type which would otherwise cause allocating huge memory for struct instance within every kernel. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D103225	2021-06-21 11:51:49 +05:30
Max Kazantsev	0317b20934	[Test] Add some tests showing room for optimization exploiting undef and UB	2021-06-21 13:11:46 +07:00
Esme-Yi	d5a215bffc	[yaml2obj] Add support for writing the long symbol name. Summary: This patch, as a follow-up of D95505, adds support for writing the long symbol name by implementing the StringTable. Only XCOFF32 is suppoted now. Reviewed By: jhenderson, shchenz Differential Revision: https://reviews.llvm.org/D103455	2021-06-21 05:09:56 +00:00
Max Kazantsev	633abb5259	[LoopDeletion] Handle Phis with similar inputs from different blocks This patch lifts the requirement to have the only incoming live block for Phis. There can be multiple live blocks if the same value comes to phi from all of them. Differential Revision: https://reviews.llvm.org/D103959 Reviewed By: nikic, lebedev.ri	2021-06-21 11:37:06 +07:00
Juneyoung Lee	1652bab657	[InstCombine] Use poison constant to represent the result of unreachable instrs This patch updates InstCombine to use poison constant to represent the resulting value of (either semantically or syntactically) unreachable instrs, or a don't-care value of an unreachable store instruction. This allows more aggressive folding of unused results, as shown in llvm/test/Transforms/InstCombine/getelementptr.ll . Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104602	2021-06-21 09:58:44 +09:00
Eli Friedman	d624c63575	[NFC][ScalarEvolution] Clean up ExitLimit constructors. Make all the constructors forward to one constructor. Remove redundant assertions.	2021-06-20 17:40:30 -07:00
Jim Lin	191d405aea	[IVDescriptors] Fix comment that getUnsafeAlgebraInst has been renamed to getExactFPMathInst https://reviews.llvm.org/rG36a489d194750dc888f214240e9dec9122ca1f0e renamed the function call in the test from getUnsafeAlgebraInst to getExactFPMathInst. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D104441	2021-06-21 07:56:22 +08:00
Dmitri Gribenko	71570a2380	[GCOVProfiling][test] Ensure that 'opt' drops any files in a temp directory	2021-06-20 22:48:35 +02:00
Craig Topper	0dab191ced	[TypePromotion] Prune Intrinsic includes. NFC TypePromotion is meant to be a generic pass and doesn't reference any ARM intrinsics so it shouldn't include IntrinsicsARM.h. The other Intrinsic related headers appear to be unneeded as well.	2021-06-20 13:04:02 -07:00
Nikita Popov	45f41f7ab1	[LoopUnroll] Use smallest exact trip count from any exit This is a more general alternative/extension to D102635. Rather than handling the special case of "header exit with non-exiting latch", this unrolls against the smallest exact trip count from any exit. The latch exit is no longer treated as priviledged when it comes to full unrolling. The motivating case is in full-unroll-one-unpredictable-exit.ll. Here the header exit is an IV-based exit, while the latch exit is a data comparison. This kind of loop does not get rotated, because the latch is already exiting, and loop rotation doesn't try to distinguish IV-based/analyzable latches. Differential Revision: https://reviews.llvm.org/D102982	2021-06-20 20:58:26 +02:00
Fangrui Song	74f848b886	Fix -Wunused-variable and -Wunused-but-set-variable in -DLLVM_ENABLE_ASSERTIONS=off build. NFC	2021-06-20 11:09:07 -07:00
David Green	51d7c2c19b	[DSE] Remove stores in the same loop iteration DSE will currently only remove stores in the same block unless they can be guaranteed to be loop invariant. This expands that to any stores that are in the same Loop, at the same loop level. This should still account for where AA/MSSA will not handle aliasing between loops, but allow the dead stores to be removed where they overlap in the same loop iteration. It requires adding loop info to DSE, but that looks fairly harmless. The test case this helps is from code like this, which can come up in certain matrix operations: for(i=..) dst[i] = 0; for(j=..) dst[i] += src[in+j]; After LICM, this becomes: for(i=..) dst[i] = 0; sum = 0; for(j=..) sum += src[in+j]; dst[i] = sum; The first store is dead, and with this patch is now removed. Differntial Revision: https://reviews.llvm.org/D100464	2021-06-20 17:03:30 +01:00
Sanjay Patel	245d1ca508	[InstCombine] fold ctpop-of-select with 1 or more constant arms The general pattern is mentioned in: https://llvm.org/PR50140 ...but we need to do a bit more to handle intrinsics with extra operands like ctlz/cttz.	2021-06-20 11:28:45 -04:00
Sanjay Patel	f90ab103b5	[InstCombine] avoid infinite loops with select folds of constant expressions This pair of transforms was added recently with: 8591640379ac9175a And could lead to conflicting folds: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=35399	2021-06-20 09:46:25 -04:00
Roman Lebedev	48ca532a9e	[NFC][AArch64][ARM][Thumb][Hexagon] Autogenerate some tests These all (and some others) are being affected by D104597, but they are manually-written, which rather complicates checking the effect that change has on them.	2021-06-20 14:12:45 +03:00
Roman Lebedev	9acde4873f	[UpdateTestUtils] Print test filename when complaining about conflicting prefix Now that FileCheck eagerly complains when prefixes are unused, the update script does the same, and is becoming very common to need to drop some prefixes, yet figuring out the file it complains about isn't obvious unless it actually tells us.	2021-06-20 14:12:39 +03:00
Roman Lebedev	21a466ca4d	[SimplifyCFG] FoldTwoEntryPHINode(): don't fold if either block has it's address taken Same as with HoistThenElseCodeToIf() (ad87761925c2790aab272138b5bbbde4a93e0383).	2021-06-20 12:37:14 +03:00
Roman Lebedev	9120098319	[SimplifyCFG] HoistThenElseCodeToIf(): don't hoist if either block has it's address taken This problem is exposed by D104598, after it tail-merges `ret` in `@test_inline_constraint_S_label`, the verifier would start complaining `invalid operand for inline asm constraint 'S'`. Essentially, taking address of a block is mismodelled in IR. It should probably be an explicit instruction, a first one in block, that isn't identical to any other instruction of the same type, so that it can't be hoisted.	2021-06-20 12:18:15 +03:00
Juneyoung Lee	b719c3f4a5	[InstSimplify] icmp poison, X -> poison This adds a simple transformation from icmp with poison constant to poison. Comparing poison with something else is poison, so this is okay. https://alive2.llvm.org/ce/z/e8iReb https://alive2.llvm.org/ce/z/q4MurY	2021-06-20 15:39:07 +09:00
Fangrui Song	9e8233e08c	[llvm-cov gcov] Support GCC 12 format GCC 12 will change the length field to represent the number of bytes instead of 32-bit words. This avoids padding for strings.	2021-06-19 22:51:20 -07:00
Fangrui Song	f02bea7812	[llvm-cov gcov] Change case to match the prevailing style && replace getString with readString	2021-06-19 22:50:52 -07:00
Fangrui Song	8a7045847f	[test] Fix nocompress.test	2021-06-19 16:27:53 -07:00
Fangrui Song	2fe891f533	[llvm-profdata] Make diagnostics consistent with the (no capitalization, no period) style The format is currently inconsistent. Use the https://llvm.org/docs/CodingStandards.html#error-and-warning-messages style. And add `error:` or `warning:` to CHECK lines wherever appropriate.	2021-06-19 14:54:25 -07:00
Fangrui Song	e2a2115bff	[llvm-profdata] Delete unneeded empty output filename check	2021-06-19 12:20:45 -07:00
Craig Topper	61a08a19b8	[RISCV] Prevent formation of shXadd(.uw) and add.uw if it prevents the use of addi. If the outer add has an simm12 immediate operand we should prefer it instead of materializing it in a register. This would guarantee and extra instruction and temporary register. Since we don't check one use on the shl or zext we might generate more instructions if there is an additional user.	2021-06-19 12:10:42 -07:00
Roman Lebedev	a8e6eca719	[NFC] AMD Zen 3: fix typo in a comment	2021-06-19 22:05:17 +03:00
Fangrui Song	2beef4520d	Simplify some typedef struct	2021-06-19 11:36:44 -07:00
Nico Weber	a20aff32da	[gn build] (manually) port b9c05aff205b (MIRTests)	2021-06-19 13:04:09 -04:00

... 4 5 6 7 8 ...

217678 Commits