llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Fangrui Song	3698f6f3f3	[llvm-objcopy] Fix section group flag read/write when operating on a cross-endian object file	2021-07-26 15:09:15 -07:00
Amara Emerson	5e3045ed0c	[GlobalISel] Add a constant folding combine. Use it AArch64 post-legal combiner. These don't always get folded because when the instructions are created the constants are obscured by artifacts. Differential Revision: https://reviews.llvm.org/D106776	2021-07-26 14:53:33 -07:00
Heejin Ahn	5b24282a50	[WebAssembly] Remove dominator dependency in WasmEHPrepare (NFC) Dominator trees were previously used for an optimization related to `wasm.lsda` but the optimization was removed in D97309. Currently dominators are not doing anything in this pass. Also removes some `include` lines without which it compiles. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D106811	2021-07-26 14:45:13 -07:00
Heejin Ahn	9b48b7e7be	[WebAssembly] Make Emscripten EH work with Emscripten SjLj When Emscripten EH mixes with Emscripten SjLj, we are not currently handling some of them correctly. There are three cases: 1. The current function calls `setjmp` and there is an `invoke` to a function that can either throw or longjmp. In this case, we have to check both for exception and longjmp. We are currently handling this case correctly: `0c0eb76782/llvm/lib/Target/WebAssembly/WebAssemblyLowerEmscriptenEHSjLj.cpp (L1058-L1090)` When inserting routines for functions that can longjmp, which we do only for setjmp-calling functions, we check if the function was previously an `invoke` and handle it correctly. 2. The current function does NOT call `setjmp` and there is an `invoke` to a function that can either throw or longjmp. Because there is no `setjmp` call, we haven't been doing any check for functions that can longjmp. But in that case, for `invoke`, we only check for an exception and if it is not an exception we reset `__THREW__` to 0, which can silently swallow the longjmp: `0c0eb76782/llvm/lib/Target/WebAssembly/WebAssemblyLowerEmscriptenEHSjLj.cpp (L70-L80)` This CL fixes this. 3. The current function calls `setjmp` and there is no `invoke`. Because it is not an `invoke`, we haven't been doing any check for functions that can throw, and only insert longjmp-checking routines for functions that can longjmp. But in that case, if a longjmpable function throws, we only check for a longjmp so if it is not a longjmp we reset `__THREW__` to 0, which can silently swallow the exception: `0c0eb76782/llvm/lib/Target/WebAssembly/WebAssemblyLowerEmscriptenEHSjLj.cpp (L156-L169)` This CL fixes this. To do that, this moves around some code, so we register necessary functions for both EH and SjLj and precompute some data (the set of functions that contains `setjmp`) before doing actual EH or SjLj transformation. This CL makes 2nd and 3rd tests in https://github.com/emscripten-core/emscripten/pull/14732 work. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D106525	2021-07-26 13:48:31 -07:00
Roman Lebedev	cc27d61007	[SimplifyCFG] SwitchToLookupTable(): don't increase ret count The very next SimplifyCFG pass invocation will tail-merge these two ret's anyways, there is not much point in creating more work for ourselves.	2021-07-26 23:29:55 +03:00
Roman Lebedev	5b8d8bfd1b	[SimplifyCFG] Drop support for simplifying cond branch to two (different) ret's Nowadays, simplifycfg pass already tail-merges all the ret blocks together before doing anything, and it should not increase the count of ret's, so this is dead code.	2021-07-26 23:29:52 +03:00
Roman Lebedev	c0c15a814e	[SimplifyCFG] Drop support for duplicating ret's into uncond predecessors This functionality existed only under a default-off flag, and simplifycfg nowadays prefers to not increase the count of ret's.	2021-07-26 23:29:21 +03:00
Matheus Izvekov	33d35b0a79	[CodeView] Saturate values bigger than supported by APInt. This fixes an assert firing when compiling code which involves 128 bit integrals. This would trigger runtime checks similar to this: ``` Assertion failed: getMinSignedBits() <= 64 && "Too many bits for int64_t", file llvm/include/llvm/ADT/APInt.h, line 1646 ``` To get around this, we just saturate those big values. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D105320	2021-07-26 22:15:26 +02:00
David Green	e4830a62b3	[ARM] Fixup vst4 test. NFC	2021-07-26 20:56:22 +01:00
Lei Huang	bbc51b9f17	[PowerPC]Add addex instruction definition and MC tests Add td definitions and asm/disasm tests for the addex instruction introduced in ISA 3.0. Reviewed By: nemanjai, amyk, NeHuang Differential Revision: https://reviews.llvm.org/D106666	2021-07-26 14:55:38 -05:00
Sander de Smalen	4f1b8e266e	[LV] Don't let ForceTargetInstructionCost override Invalid cost. Invalid costs can be used to avoid vectorization with a given VF, which is used for scalable vectors to avoid things that the code-generator cannot handle. If we override the cost using the -force-target-instruction-cost option of the LV, we would override this mechanism, rendering the flag useless. This change ensures the cost is only overriden when the original cost that was calculated is valid. That allows the flag to be used in combination with the -scalable-vectorization option. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D106677	2021-07-26 20:27:49 +01:00
Sander de Smalen	bb39cd345a	[AArch64] NFC: Make some AArch64-SVE LoopVectorize tests generic. This change moves most of `sve-inductions.ll` to non-AArch64 specific LV tests using the `-target-supports-scalable-vectors` flag, because they're not explicitly AArch64-specific. One test builds on AArch64-specific knowledge regarding masked loads/stores, and remains in sve-inductions.ll.	2021-07-26 20:27:48 +01:00
Reid Kleckner	2e5bfee63f	[SimplifyCFG] Remove stale comment after d7378259aa, NFC	2021-07-26 12:25:29 -07:00
Reid Kleckner	a85a7951e1	Fix clang debug info irgen of i128 enums DIEnumerator stores an APInt as of April 2020, so now we don't need to truncate the enumerator value to 64 bits. Fixes assertions during IRGen. Split from D105320, thanks to Matheus Izvekov for the test case and report. Differential Revision: https://reviews.llvm.org/D106585	2021-07-26 12:25:29 -07:00
Lei Huang	c4acdbbb3c	[PowerPC] Add implicit-def RM to instructions mtfsb[01] This is a followup patch for D105930 to add implicit-def of RM for mtfsb[01] instructions as per review comments. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D106603	2021-07-26 14:07:08 -05:00
Joseph Huber	71556de73f	[OpenMP][NFC] Remove unncessary capture in RAII struct Summary: There was an unnecessary variable assigned to the information cache when we only need it in the constructor to extract the function declaration.	2021-07-26 15:05:55 -04:00
Michael Liao	b2d24acf01	[amdgpu] Add 64-bit PC support when expanding unconditional branches. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D106445	2021-07-26 14:50:30 -04:00
Craig Topper	837c44ce91	[TypePromotion] Remove redundant if. NFC The same condition was checked in the previous if. Maybe this was a bad merge resolution?	2021-07-26 11:47:25 -07:00
Kevin P. Neal	800de3f8d5	[FPEnv][InstSimplify] Enable more folds for constrained fadd Precommit tests, try 2. My tree is up-to-date as of this morning so this should go better than my first try.	2021-07-26 14:06:21 -04:00
Amara Emerson	6a4c66376a	[AArch4][GlobalISel] Post-legalize combine s64 = G_MERGE s32, 0 -> G_ZEXT. These are generated as a byproduce of legalization. Differential Revision: https://reviews.llvm.org/D106768	2021-07-26 10:58:04 -07:00
Eli Friedman	2211738092	[LLVM IR] Allow volatile stores to trap. Proposed alternative to D105338. This is ugly, but short-term I think it's the best way forward: first, let's formalize the hacks into a coherent model. Then we can consider extensions of that model (we could have different flavors of volatile with different rules). Differential Revision: https://reviews.llvm.org/D106309	2021-07-26 10:51:00 -07:00
Amara Emerson	d0d4c1578a	[AArch64][GlobalISel] Enable some select combines after legalization. The legalizer generates selects for some operations, which can have constant condition values, resulting in lots of dead code if it's not folded away. Differential Revision: https://reviews.llvm.org/D106762	2021-07-26 10:40:32 -07:00
Amara Emerson	b09f2e63d9	[GlobalISel] Add combine for merge(unmerge) and use AArch64 postlegal-combiner. Differential Revision: https://reviews.llvm.org/D106761	2021-07-26 10:37:31 -07:00
Heejin Ahn	cd521f45d9	[WebAssembly] Improve pseudocode in LowerEmscriptenEHSjLj Both `__THREW__` and `__threwValue` are global variables, and we have been distinguishing the global variable `__THREW__` and the loaded value `%__THREW__.val` in comments but not doing it for `__threwValue`. Made the pseudocode comments consistent for both variables. Reviewed By: dschuff Differential Revision: https://reviews.llvm.org/D106524	2021-07-26 10:13:28 -07:00
Simon Pilgrim	e79fa78700	[X86][AVX] Add PR50053 test case	2021-07-26 17:57:38 +01:00
Florian Hahn	3ef6cc6cad	[LAA] Remove RuntimeCheckingPtrGroup::RtCheck member (NFC). This patch removes RtCheck from RuntimeCheckingPtrGroup to make it possible to construct RuntimeCheckingPtrGroup objects without a RuntimePointerChecking object. This should make it easier to re-use the code to generate runtime checks, e.g. in D102834. RtCheck was only used to access the pointer info for a given index. Instead, the start and end expressions can be passed directly. For code-gen, we also need to know the address space to use. This can also be explicitly passed at construction. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D105481	2021-07-26 17:38:10 +01:00
Stephen Tozer	27b2ac2965	[DebugInfo] Correctly update debug users of SSA values in tail duplication During tail duplication, SSA values may be updated and have their uses replaced with a virtual register, and any debug instructions that use that value are deleted. This patch fixes the implementation of the debug instruction deletion to work correctly for debug instructions that use the SSA value multiple times, by batching deletions so that we don't attempt to delete the same instruction twice. Differential Revision: https://reviews.llvm.org/D106557	2021-07-26 17:27:57 +01:00
Simon Pilgrim	e67bfc0f97	[Analysis] Fix getOrderedReductionCost to call target's getArithmeticInstrCost implementation The getOrderedReductionCost implementation introduced in D105432 calls the CRTP base version getArithmeticInstrCost instead of the redirecting to the target version. Differential Revision: https://reviews.llvm.org/D106795	2021-07-26 17:15:43 +01:00
Sander de Smalen	2b39df3750	[LV] Remove assert that VF cannot be scalable in setCostBasedWideningDecision. Scalarization for scalable vectors is not (yet) supported, so the LV discards a VF when scalarization is chosen as the widening decision. It should therefore not assert that the VF is not scalable when it computes the decision to scalarize. The code can get here when both the interleave-cost, gather/scatter cost and scalarization-cost are all illegal. This may e.g. happen for SVE when the VF=1, to avoid generating `<vscale x 1 x eltty>` types that the code-generator cannot yet handle. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D106656	2021-07-26 17:11:45 +01:00
Nikita Popov	c0e218377f	[MergeICmps] Collect block instructions once (NFC) Collect the relevant instructions for a given BCECmpBlock once on construction, rather than repeating this logic in multiple places.	2021-07-26 18:07:20 +02:00
Fangrui Song	834bd6611c	[llvm-objcopy] Drop GRP_COMDAT if the group signature is localized See [GRP_COMDAT group with STB_LOCAL signature](https://groups.google.com/g/generic-abi/c/2X6mR-s2zoc) objcopy PR: https://sourceware.org/bugzilla/show_bug.cgi?id=27931 GRP_COMDAT deduplication is purely based on the signature symbol name in ld.lld/GNU ld/gold. The local/global status is not part of the equation. If the signature symbol is localized by --localize-hidden or --keep-global-symbol, the intention is likely to make the group fully localized. Drop GRP_COMDAT to suppress deduplication. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D106782	2021-07-26 09:05:18 -07:00
Fangrui Song	f0aeef47e5	[yaml2obj][MachO] Rename PayloadString to Content The new name is conciser and matches yaml2obj ELF & DWARF. Reviewed By: #lld-macho, thakis Differential Revision: https://reviews.llvm.org/D106759	2021-07-26 09:04:51 -07:00
Nikita Popov	fa9a368e39	[MergeICmps] Try to fix MSVC build failure Apparently this fails to line up the types -- try to sidestep the issue entirely by writing the code in a more reasonable way: Walk over the operands and perform a set lookup, rather than walking over the set and performing an operand scan.	2021-07-26 17:31:27 +02:00
Kazu Hirata	24ba96f75f	[AsmParser] Remove MDRef (NFC) The last use was removed on Jan 12, 2015 in commit ab617d597708fcf3c4b829bf595e9d990ca66c07.	2021-07-26 08:29:33 -07:00
Paul Walker	6d505ef4dd	[SVE] Use reg+reg addressing mode for immediate offsets. For reg+imm SVE addressing mode imm is implictly scaled by VL, making them impractical for truely immediate offsets. However, if the offset can be unscaled based on the storage element type we can use the reg+reg SVE addressing mode and thus either reduce the number of generate add instructions or replace them with a mov instruction that can be hoisted from the hot code path. Differential Revision: https://reviews.llvm.org/D106744	2021-07-26 16:24:16 +01:00
Sanjay Patel	2b04c07ca1	[SimplifyLibCalls] avoid crash on pointer math We could try harder to screen out libcalls by function signature (and that would be a much larger change than for sprintf alone), but that might make the transition to type-less pointers more difficult. https://llvm.org/PR51200	2021-07-26 11:08:45 -04:00
Sanjay Patel	a1170d1e3b	[SimplifyLibCalls] reduce code duplication; NFC	2021-07-26 11:08:45 -04:00
Nikita Popov	6ebcde8cc9	[MergeICmps] Separate out BCECmp and use Optional (NFC) Separate out the BCECmp part from BCECmpBlock, which just stores the comparison atoms without the branch instruction. At the same time switch the code to return Optional<> rather than objects in invalid state and partially constructed objects.	2021-07-26 17:06:43 +02:00
Sander de Smalen	6651df41b7	[LV] Don't assume isScalarAfterVectorization if one of the uses needs widening. This fixes an issue that was found in D105199, where a GEP instruction is used both as the address of a store, as well as the value of a store. For the former, the value is scalar after vectorization, but the latter (as value) requires widening. Other code in that function seems to prevent similar cases from happening, but it seems this case was missed. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D106164	2021-07-26 16:01:55 +01:00
Bradley Smith	5e51e7ed64	[AArch64][SVE] Break false dependencies for inactive lanes of unary operations Differential Revision: https://reviews.llvm.org/D105889	2021-07-26 15:01:21 +00:00
Ulrich Weigand	81afdbc83c	[SystemZ] Add support for new cpu architecture - arch14 This patch adds support for the next-generation arch14 CPU architecture to the SystemZ backend. This includes: - Basic support for the new processor and its features. - Detection of arch14 as host processor. - Assembler/disassembler support for new instructions. - New LLVM intrinsics for certain new instructions. - Support for low-level builtins mapped to new LLVM intrinsics. - New high-level intrinsics in vecintrin.h. - Indicate support by defining __VEC__ == 10304. Note: No currently available Z system supports the arch14 architecture. Once new systems become available, the official system name will be added as supported -march name.	2021-07-26 16:57:28 +02:00
Florian Hahn	ca0aa2b075	Recommit "[VPlan] Add recipe for first-order rec phis, make splicing explicit." This reverts the revert commit b1777b04dc4b1a9fee0e7effa7e177892ab32ef0. The patch originally got reverted due to a crash: https://bugs.chromium.org/p/chromium/issues/detail?id=1232798#c2 The underlying issue was that we were not using the stored values from the modified memory recipes, but the out-of-date values directly from the IR (accessed via the VPlan). This should be fixed in d995d6376. A reduced version of the reproducer has been added in 93664503be6b.	2021-07-26 15:50:30 +01:00
Nikita Popov	bab200ac44	[IR] Consider non-willreturn as side effect (PR50511) This adjusts mayHaveSideEffect() to return true for !willReturn() instructions. Just like other side-effects, non-willreturn calls (aka "divergence") cannot be removed and cannot be reordered relative to other side effects. This fixes a number of bugs where non-willreturn calls are either incorrectly dropped or moved. In particular, it also fixes the last open problem in https://bugs.llvm.org/show_bug.cgi?id=50511. I performed a cursory review of all current mayHaveSideEffect() uses, which convinced me that these are indeed the desired default semantics. Places that do not want to consider non-willreturn as a sideeffect generally do not want mayHaveSideEffect() semantics at all. I identified two such cases, which are addressed by D106591 and D106742. Finally, there is a use in SCEV for which we don't really have an appropriate API right now -- what it wants is basically "would this be considered forward progress". I've just spelled out the previous semantics there. Differential Revision: https://reviews.llvm.org/D106749	2021-07-26 16:35:14 +02:00
Benjamin Kramer	9ae7d5aa56	Simplify away some SmallVector copies. NFCI. The lifetime of the initializer list is the full expression, so we can skip storing it in a temporary vector.	2021-07-26 16:33:38 +02:00
Jeremy Morse	9b82c651ff	[InstrRef][AArch64][1/4] Accept constant physreg variable locations Late in SelectionDAG we join up instruction numbers with their defining instructions, if it couldn't be done during the main part of SelectionDAG. One exception is function arguments, where we have to point a DBG_PHI instruction at the incoming live register, as they don't have a defining instruction. This patch adds another exception, for constant physregs, like aarch64 has. It may seem wasteful to use two instructions where we could use a single DBG_VALUE, however the whole point of instruction referencing is to decouple the identification of values from the specification of where variable location ranges start. (Part of my aarch64 work to ease adoption of instruction referencing, as in the meta comment on D104520) Differential Revision: https://reviews.llvm.org/D104520	2021-07-26 15:26:15 +01:00
Florian Hahn	d00a7edec9	[LV] Add test to store a first-order rec via interleave group. This is a reduced version of the reproducer from https://bugs.chromium.org/p/chromium/issues/detail?id=1232798#c2	2021-07-26 15:20:04 +01:00
Alexey Bataev	37023003ea	[SLP]Fix costs calculations. Need to fix several cost-related problems. The final type may be defined incorrectly because of to early definition (we may end up with the wider type), the CommonCost should not be redefined in ExtractElements cost related calculations and the shuffle of the final insertelements vectors should be calculated as a cost of single vector permutations + costs of two vector permutations for other n-1 incoming vectors. Differential Revision: https://reviews.llvm.org/D106578	2021-07-26 07:14:03 -07:00
gbreynoo	40ac5ec4fa	[llvm-readobj] Display multiple function names for stack size entries The current implementation of displaying .stack_size information presumes that each entry represents a single function but this is not always the case. For example with the use of ICF multiple functions can be represented with the same code, meaning that the address found in a .stack_size entry corresponds to multiple function symbols. This change allows multiple function names to be displayed when appropriate. Differential Revision: https://reviews.llvm.org/D105884	2021-07-26 14:49:53 +01:00
Jay Foad	453349344b	[AMDGPU][GISel] Fix MMO for raw/struct buffer access with non-constant offset Codegen for the raw/struct buffer access intrinsics would update the offset in the MMO to reflect the combined offset, if it was known to be constant. If the combined offset was not known to be constant, or if there was an index, it would set the offset in the MMO to 0. This is unsafe because it makes it look like the access does not alias with another access with a fixed non-zero offset. Fix these cases by setting the pointer in the MMO to null, to reflect the fact that we do not have any known IR value pointer + constant offset for the access. D106284 did this for SelectionDAG. This is the corresponding fix for GlobalISel. Differential Revision: https://reviews.llvm.org/D106451	2021-07-26 14:27:30 +01:00
Jay Foad	eaeb8d4f70	[AMDGPU] Pre-commit global-isel test case for D106451 This test case shows the scheduler wrongly reordering two buffer accesses that might alias.	2021-07-26 14:27:30 +01:00

1 2 3 4 5 ...

219215 Commits