llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Lang Hames	792e37b227	[ORC] Fix missing std::move.	2021-06-15 21:42:58 +10:00
Lang Hames	0c8cd2ec6f	[ORC] Fix narrowing-in-initializer-list warnings.	2021-06-15 21:39:16 +10:00
Lang Hames	8b99b0a62e	[ORC] Fix missing function in unit test.	2021-06-15 21:39:00 +10:00
Lang Hames	8d05185ee9	[ORC] Make WrapperFunctionResult's ValuePtr member non-const. The const qualifier was a hangover from an earlier iteration that allowed wrapper functions to return pointers to const memory. This feature has been removed, so there's no reason for this to be const any more, and removing it eliminates const-cast warnings.	2021-06-15 21:24:12 +10:00
Lang Hames	e11b1aca83	[ORC] Port WrapperFunctionUtils and SimplePackedSerialization from ORC runtime. Replace the existing WrapperFunctionResult type in llvm/include/ExecutionEngine/Orc/Shared/TargetProcessControlTypes.h with a version adapted from the ORC runtime's implementation. Also introduce the SimplePackedSerialization scheme (also adapted from the ORC runtime's implementation) for wrapper functions to avoid manual serialization and deserialization for calls to runtime functions involving common types.	2021-06-15 21:13:57 +10:00
Neil Henning	8521aa2a65	ABI breaking changes fixes. This commit mostly just replaces bad uses of `NDEBUG` with uses of `LLVM_ENABLE_ABI_BREAKING_CHANGES` - the safe way to include ABI breaking changes (normally extra struct elements in headers). Differential Revision: https://reviews.llvm.org/D104216	2021-06-15 11:08:13 +01:00
Roman Lebedev	586aaeabf1	[X86] Schedule-model second (mask) output of GATHER instruction Much like `mulx`'s `WriteIMulH`, there are two outputs of AVX2 GATHER instructions. This was changed back in rL160110, but the sched model change wasn't present. So right now, for sched models that are marked as complete (`znver3` only now), codegen'ning `GATHER` results in a crash: ``` DefIdx 1 exceeds machine model writes for early-clobber renamable $ymm3, dead early-clobber renamable $ymm2 = VPGATHERDDYrm killed renamable $ymm3(tied-def 0), undef renamable $rax, 4, renamable $ymm0, 0, $noreg, killed renamable $ymm2(tied-def 1) :: (load 32, align 1) ``` https://godbolt.org/z/Ks7zW7WGh I'm guessing we need to deal with this like we deal with `WriteIMulH`. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D104205	2021-06-15 12:04:33 +03:00
Andrea Di Biagio	ad80113e58	[MCA][InstrBuilder] Check for the presence of flag VariadicOpsAreDefs. This patch fixes the logic that checks for variadic register definitions, Before llvm-svn 348114 (commit 4cf35b4ab0b), it was not possible to explicitly mark variadic operands as definitions. By default, variadic operands of an MCInst were always assumed to be uses. A number of had-hoc checks were introduced in the InstrBuilder to fix the processing of variadic register operands of ARM ldm/stm variants. This patch simply replaces those old (and buggy) checks with a much simpler (and correct) check for MCID::Flag::VariadicOpsAreDefs.	2021-06-15 09:52:38 +01:00
Jay Foad	c3a38401e0	[IR] Remove forward declaration of GraphTraits from Type.h This has been unnecessary since r352353 removed GraphTraits specializations for Type, except that a couple of other headers were accidentally relying on this declaration. Differential Revision: https://reviews.llvm.org/D104119	2021-06-15 09:23:45 +01:00
LLVM GN Syncbot	d90014c622	[gn build] Port d0a5d8611935	2021-06-15 05:56:32 +00:00
CarlosAlbertoEnciso	8355a030c0	[Debug-Info][CodeView] Fix GUID string generation for MSVC generated objects. This patch is to address https://bugs.llvm.org/show_bug.cgi?id=50459. YAML:455:28: error: GUID strings are 38 characters long The valid format for a GUID is {XXXXXXXX-XXXX-XXXX-XXXX-XXXXXXXXXXXX} where X is a hex digit (0,1,2,3,4,5,6,7,8,9,A,B,C,D,E,F). The length of the individual components must be: 8, 4, 4, 4, 12. For some cases, the converted string generated by obj2yaml, does not comply with those lengths. yaml2obj checks that the GUID string must be 38 characters including the dashes and braces. Reviewed By: amccarth Differential Revision: https://reviews.llvm.org/D103089	2021-06-15 06:53:21 +01:00
CarlosAlbertoEnciso	75ed4fb122	Revert "[NFC] This is a test commit to check commit access." This reverts commit b4d40e19def8c2e1a77ae30b5ac16751d1c461f7.	2021-06-15 06:25:22 +01:00
Carlos Alberto Enciso	fc72a61cfd	[NFC] This is a test commit to check commit access. Add full stop at the end of comment.	2021-06-15 06:20:31 +01:00
Craig Topper	e3c98d1dad	[X86] Use EVT::getVectorVT instead of changeVectorElementType in reduceVMULWidth. Changing vector element type doesn't work for v6i32->v6i16 now that v6i32 is an MVT and v6i16 is not. I would like to fix this in changeVectorElementType, but you need a LLVMContext to call getVectorVT which we can't get from an MVT. Fixes PR50709.	2021-06-14 22:07:04 -07:00
Kai Luo	adf785206e	[PowerPC] Export 16 byte load-store instructions Export `lq`, `stq`, `lqarx` and `stqcx.` in preparation for implementing 16-byte lock free atomic operations on AIX. Add a new register class `g8prc` for these instructions, since these instructions require even-odd register pair. Reviewed By: nemanjai, jsji, #powerpc Differential Revision: https://reviews.llvm.org/D103010	2021-06-15 01:56:10 +00:00
Vitaly Buka	a3de872dd6	[NFC][sanitizer] clang-format some code	2021-06-14 18:05:22 -07:00
Jacob Hegna	aeb9af0756	Remove redundant environment variable XLA_FLAGS. If the flag is not set, the script saved_model_aot_compile.py in tensorflow will default it to the correct value. However, in TF 2.5, the way the value is set in TensorFlowCompile.cmake file triggers a build error. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D103972	2021-06-14 23:58:22 +00:00
Adrian Prantl	dfb1691713	Allow signposts to take advantage of deferred string substitution One nice feature of the os_signpost API is that format string substitutions happen in the consumer, not the logging application. LLVM's current Signpost class doesn't take advantage of this though and instead always uses a static "Begin/End %s" format string. This patch uses variadic macros to allow the API to be used as intended. Unfortunately, the primary use-case I had in mind (the LLDB_SCOPED_TIMER() macro) does not get much better from this, because __PRETTY_FUNCTION__ is not a macro, but a static string, so signposts created by LLDB_SCOPED_TIMER() still use a static "%s" format string. At least LLDB_SCOPED_TIMERF() works as intended. This reapplies the previously reverted patch with additional include order fixes for non-modular builds of LLDB. Differential Revision: https://reviews.llvm.org/D103575	2021-06-14 16:53:41 -07:00
Huihui Zhang	6dc3e5ee9a	[SVE][LSR] Teach LSR to enable simple scaled-index addressing mode generation for SVE. Currently, Loop strengh reduce is not handling loops with scalable stride very well. Take loop vectorized with scalable vector type <vscale x 8 x i16> for instance, (refer to test/CodeGen/AArch64/sve-lsr-scaled-index-addressing-mode.ll added). Memory accesses are incremented by "16vscale", while induction variable is incremented by "8vscale". The scaling factor "2" needs to be extracted to build candidate formula i.e., "reg(%in) + 2reg({0,+,(8 %vscale)}". So that addrec register reg({0,+,(8vscale)}) can be reused among Address and ICmpZero LSRUses to enable optimal solution selection. This patch allow LSR getExactSDiv to recognize special cases like "C1XY /s C2X*Y", and pull out "C1 /s C2" as scaling factor whenever possible. Without this change, LSR is missing candidate formula with proper scaled factor to leverage target scaled-index addressing mode. Note: This patch doesn't fully fix AArch64 isLegalAddressingMode for scalable vector. But allow simple valid scale to pass through. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D103939	2021-06-14 16:42:34 -07:00
Adrian Prantl	299552b38b	Revert "Allow signposts to take advantage of deferred string substitution" This reverts commit 03841edde7eee21d1d450041ab9a113a7e1be869. Unfortunately this still breaks the LLDB standalone bot.	2021-06-14 16:09:04 -07:00
Matt Morehouse	3601822b05	[HWASan] Enable globals support for LAM. Reviewed By: vitalybuka Differential Revision: https://reviews.llvm.org/D104265	2021-06-14 14:20:44 -07:00
Adrian Prantl	770268a3e9	Allow signposts to take advantage of deferred string substitution One nice feature of the os_signpost API is that format string substitutions happen in the consumer, not the logging application. LLVM's current Signpost class doesn't take advantage of this though and instead always uses a static "Begin/End %s" format string. This patch uses variadic macros to allow the API to be used as intended. Unfortunately, the primary use-case I had in mind (the LLDB_SCOPED_TIMER() macro) does not get much better from this, because __PRETTY_FUNCTION__ is not a macro, but a static string, so signposts created by LLDB_SCOPED_TIMER() still use a static "%s" format string. At least LLDB_SCOPED_TIMERF() works as intended. This reapplies the previsously reverted patch with additional MachO.h macro #undefs. Differential Revision: https://reviews.llvm.org/D103575	2021-06-14 14:19:41 -07:00
Roman Lebedev	8401a57c05	[TLI] SimplifyDemandedVectorElts(): handle SCALAR_TO_VECTOR(EXTRACT_VECTOR_ELT(?, 0)) Iff we have `SCALAR_TO_VECTOR` (and we demand it's only defined 0'th element), and said scalar was produced by `EXTRACT_VECTOR_ELT` from the 0'th element of some vector, then we can just continue traversal into said source vector. This comes up in X86 vector uniform shift lowering. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D104250	2021-06-14 23:52:53 +03:00
Piotr Sobczak	c63f0c139e	[AMDGPU] Limit runs of fixLdsBranchVmemWARHazard The code in fixLdsBranchVmemWARHazard looks for patterns of a vmem/lds access followed by a branch, followed by an lds/vmem access. The handling of the hazard requires an arbitrary number of instructions to process. In the worst case where a function has a vmem access, but no lds accesses, all instructions are examined only to conclude that the hazard cannot occur. Add the pre-processing stage which detects if there is both lds and vmem present in the function and only then does the more costly search. This patch significantly improves compilation time in the cases the hazard cannot happen. In one pathological case I looked at IsHazardInst is needlesly called 88.6 milions times. The numbers could also be improved by introducing a map around the inner calls to ::getWaitStatesSince in fixLdsBranchVmemWARHazard, but nothing will beat not running fixLdsBranchVmemWARHazard at all in the cases detected by shouldRunLdsBranchVmemWARHazardFixup(). Differential Revision: https://reviews.llvm.org/D104219	2021-06-14 22:30:23 +02:00
Arthur Eubanks	675b987cd3	Move some code under NDEBUG from D103135	2021-06-14 11:39:12 -07:00
Arthur Eubanks	db00e9aaf8	Remove accidentally added debugging code from D103135	2021-06-14 11:11:40 -07:00
Saleem Abdulrasool	a4fccf0a10	X86: pass swift_async context in R14 on Win64 Pass swift_async context in a callee-saved register rather than as a regular parameter. This is similar to the Swift `self` and `error` parameters.	2021-06-14 11:02:21 -07:00
Arthur Eubanks	25efb3e5da	[docs][OpaquePtr] Shuffle around the transition plan section Emphasize that this is basically an attempt to remove ``PointerType::getElementType`` and ``Type::getPointerElementType()``. Add a couple more subtasks. Differential Revision: https://reviews.llvm.org/D104151	2021-06-14 10:59:41 -07:00
Arthur Eubanks	a32daab226	[OpaquePtr] Remove existing support for forward compatibility It assumes that PointerType will keep having an optional pointee type, but we'd like to remove the pointee type in PointerType at some point. I feel like the current implementation could be simplified anyway, although perhaps I'm underestimating the amount of work needed throughout BitcodeReader. We will still need a side table to keep track of pointee types. This will be reimplemented at some point. This is essentially a revert of a4771e9d (which doesn't look like it was reviewed anyway). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D103135	2021-06-14 10:52:56 -07:00
wlei	c4ed78c10b	[CSSPGO] Aggregation by the last K context frames for cold profiles This change provides the option to merge and aggregate cold context by the last k frames instead of context-less name. By default K = 1 means the context-less one. This is for better perf tuning. The more selective merging and trimming will rely on llvm-profgen's preinliner. Reviewed By: wenlei, hoy Differential Revision: https://reviews.llvm.org/D104131	2021-06-14 10:33:43 -07:00
Fraser Cormack	94d12e2de0	[RISCV] Transform unaligned RVV vector loads/stores to aligned ones This patch adds support for loading and storing unaligned vectors via an equivalently-sized i8 vector type, which has support in the RVV specification for byte-aligned access. This offers a more optimal path for handling of unaligned fixed-length vector accesses, which are currently scalarized. It also prevents crashing when `LegalizeDAG` sees an unaligned scalable-vector load/store operation. Future work could be to investigate loading/storing via the largest vector element type for the given alignment, in case that would be more optimal on hardware. For instance, a 4-byte-aligned nxv2i64 vector load could loaded as nxv4i32 instead of as nxv16i8. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D104032	2021-06-14 18:12:18 +01:00
Sanjay Patel	224787dbb2	[InstCombine] add DeMorgan folds for logical ops in select form We canonicalized to these select patterns (poison-safe logic) with D101191, so we need to reduce 'not' ops when possible as we would with 'and'/'or' instructions. This is shown in a secondary example in: https://llvm.org/PR50389 https://alive2.llvm.org/ce/z/BvsESh	2021-06-14 12:54:35 -04:00
Sanjay Patel	b4782a5249	[InstCombine] add tests for logical and/or with not ops; NFC	2021-06-14 12:54:35 -04:00
Florian Hahn	b893da34b6	[LoopDeletion] Add test with irreducible control flow in loop. Currently the irreducible cycles in the loops are ignored. The irreducible cycle may loop infinitely in irreducible_subloop_no_mustprogress, which is allowed and the loop should not be removed. Discussed in D103382.	2021-06-14 17:42:32 +01:00
Florian Hahn	5af9313d31	[VectorCombine] Limit scalarization to non-poison indices for now. As Eli mentioned post-commit in D103378, the result of the freeze may still be out-of-range according to Alive2. So for now, just limit the transform to indices that are non-poison.	2021-06-14 16:40:14 +01:00
Saleem Abdulrasool	c92efb7d33	SelectionDAG: repair the Windows build 6e5628354e22f3ca40b04295bac540843b8e6482 regressed the Windows build as the return type no longer matched in both branches for the return value type deduction. This uses a bit more compiler magic to deal with that.	2021-06-14 08:25:36 -07:00
zhijian	ad7e1ecf68	[AIX][XCOFF] emit vector info of traceback table. Summary: emit vector info of traceback table. Reviewers: Jason Liu,Hubert Tong Differential Revision: https://reviews.llvm.org/D93659	2021-06-14 11:15:22 -04:00
Florian Hahn	bc6a656349	[ADT] Use unnamed argument for unused arg in StringMapEntryStorage. This silences an 'unsused argument' warning. Similar to c2006f857d80f54b90ed7d911d3e7acf4f46001b.	2021-06-14 15:54:57 +01:00
Jingu Kang	d8d1189bdb	[AArch64] Improve SAD pattern Given a vecreduce_add node, detect the below pattern and convert it to the node sequence with UABDL, [S\|U]ADB and UADDLP. i32 vecreduce_add( v16i32 abs( v16i32 sub( v16i32 [sign\|zero]_extend(v16i8 a), v16i32 [sign\|zero]_extend(v16i8 b)))) =================> i32 vecreduce_add( v4i32 UADDLP( v8i16 add( v8i16 zext( v8i8 [S\|U]ABD low8:v16i8 a, low8:v16i8 b v8i16 zext( v8i8 [S\|U]ABD high8:v16i8 a, high8:v16i8 b Differential Revision: https://reviews.llvm.org/D104042	2021-06-14 15:48:51 +01:00
LLVM GN Syncbot	4f6c145c6a	[gn build] Port c820b494d6e1	2021-06-14 14:41:33 +00:00
Roman Lebedev	7a71822528	[NFC][DAGCombine] Extract getFirstIndexOf() lambda back into a function Not all supported compilers like such lambdas, at least one buildbot is unhappy.	2021-06-14 16:25:59 +03:00
Roman Lebedev	9f4eaf3945	[DAGCombine] reduceBuildVecToShuffle(): sort input vectors by decreasing size The sorting, obviously, must be stable, else we will have random assembly fluctuations. Apparently there was no test coverage that would benefit from that, so i've added one test. The sorting consists of two parts - just sort the input vectors, and recompute the shuffle mask -> input vector mapping. I don't believe we need to do anything else. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D104187	2021-06-14 16:18:37 +03:00
Jeroen Dobbelaere	c08eaddde6	Intrinsic::getName: require a Module argument Ensure that we provide a `Module` when checking if a rename of an intrinsic is necessary. This fixes the issue that was detected by https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=32288 (as mentioned by @fhahn), after committing D91250. Note that the `LLVMIntrinsicCopyOverloadedName` is being deprecated in favor of `LLVMIntrinsicCopyOverloadedName2`. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D99173	2021-06-14 14:52:29 +02:00
Florian Hahn	68369fae88	[VPlan] Add additional tests for region merging. Add additional tests suggested in D100260. Also drop the unneeded `indvars.` prefix from induction phi name.	2021-06-14 11:25:06 +01:00
Guillaume Chatelet	9aa4a5f77d	[llvm] remove Sequence::asSmallVector() There's no need for `toSmallVector()` as `SmallVector.h` already provides a `to_vector` free function that takes a range. Reviewed By: Quuxplusone Differential Revision: https://reviews.llvm.org/D104024	2021-06-14 08:28:05 +00:00
Simon Moll	91d4645488	[VP] Binary floating-point intrinsics. This patch implements vector-predicated intrinsics on IR level for fadd, fsub, fmul, fdiv and frem. There operate in the default floating-point environment. We will use constrained fp operand bundles for constrained vector-predicated fp math (D93455). Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93470	2021-06-14 08:51:41 +02:00
Mindong Chen	65590a4d7a	[LoopVectorize] precommit pr50686.ll for D104148	2021-06-14 13:58:25 +08:00
Xuanda Yang	a66f237758	[LLParser] Remove outdated deplibs The comment mentions deplibs should be removed in 4.0. Removing it in this patch. Reviewed By: compnerd, dexonsmith, lattner Differential Revision: https://reviews.llvm.org/D102763	2021-06-14 12:46:12 +08:00
RamNalamothu	a2306da6e0	Implement DW_CFA_LLVM_* for Heterogeneous Debugging Add support in MC/MIR for writing/parsing, and DebugInfo. This is part of the Extensions for Heterogeneous Debugging defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html Specifically the CFI instructions implemented here are defined at https://llvm.org/docs/AMDGPUDwarfExtensionsForHeterogeneousDebugging.html#cfa-definition-instructions Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D76877	2021-06-14 08:51:50 +05:30
Aditya Kumar	d436515539	Calculate getTerminator only when necessary Differential Revision: https://reviews.llvm.org/D104202	2021-06-13 20:16:07 -07:00

1 2 3 4 5 ...

217195 Commits