llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 10:42:39 +01:00

Author	SHA1	Message	Date
Thomas Lively	a913c9bb30	[WebAssembly] Codegen for pmin and pmax Replace the clang builtins and LLVM intrinsics for {f32x4,f64x2}.{pmin,pmax} with standard codegen patterns. Since wasm_simd128.h uses an integer vector as the standard single vector type, the IR for the pmin and pmax intrinsic functions contains bitcasts that would not be there otherwise. Add extra codegen patterns that can still select the pmin and pmax instructions in the presence of these bitcasts. Differential Revision: https://reviews.llvm.org/D106612	2021-07-23 14:49:21 -07:00
Roman Lebedev	fdb7d69784	[NFC][BasicBlockUtils] Refactor GetIfCondition() to return the branch, not it's condition Otherwise e.g. the FoldTwoEntryPHINode() has to do a lot of legwork to re-deduce what is the dominant block (i.e. for which block is this branch the terminator).	2021-07-24 00:18:26 +03:00
Cyndy Ishida	cd241d2fc0	[llvm][NFC] Fix typos in Errc.h description	2021-07-23 11:54:49 -07:00
Mircea Trofin	4559a48614	[NFC][MLGO] Just use the underlying protobuf object for logging Avoid buffering just to copy the buffered data, in 'development mode', when logging. Instead, just populate the underlying protobuf. Differential Revision: https://reviews.llvm.org/D106592	2021-07-23 10:56:48 -07:00
Fangrui Song	c3bb156e90	Revert "[clang] -falign-loops=" This reverts commit 42896eeed9e3d12e7e38217a0d7e35b9736451ac. Unfinished. Accidentally pushed when reverting a clangd commit.	2021-07-23 09:58:35 -07:00
Fangrui Song	05f5a9a949	[clang] -falign-loops=	2021-07-23 09:50:43 -07:00
luxufan	85def5bf4e	[JITLink][RISCV] Initial Support RISCV64 in JITLink This patch is the initial support, it implements translation from object file to JIT link graph, and very few relocations were supported. Currently, the test file ELF_pc_indirect.s is passed, the HelloWorld program(compiled with mno-relax flag) can be linked correctly and run on instruction emulator correctly. In the downstream implementation, I have implemented the GOT, PLT function, and EHFrame and some optimization will be implement soon. I will organize the code in to patches, then gradually send it to upstream. Differential Revision: https://reviews.llvm.org/D105429	2021-07-23 23:47:30 +08:00
Kazu Hirata	73fafa5526	[ARM] Remove getHWDivName (NFC) This function seems to be unused for at least 5 years.	2021-07-23 07:44:23 -07:00
David Truby	130948388d	[llvm][sve] Lowering for VLS truncating stores This adds custom lowering for truncating stores when operating on fixed length vectors in SVE. It also includes a DAG combine to fold extends followed by truncating stores into non-truncating stores in order to prevent this pattern appearing once truncating stores are supported. Currently truncating stores are not used in certain cases where the size of the vector is larger than the target vector width. Differential Revision: https://reviews.llvm.org/D104471	2021-07-23 14:04:55 +01:00
Dylan Fleming	8a94b4239a	[SVE][IR] Fix Binary op matching in PatternMatch::m_VScale Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D105978	2021-07-23 11:39:13 +01:00
Fraser Cormack	2db8b2b6fc	[NFC] Fix early line-break in doxygen comment	2021-07-23 07:16:05 +01:00
Giorgis Georgakoudis	d1dd1d3743	[OpenMP] Use AAHeapToStack/AAHeapToShared analysis in SPMDization SPMDization D102307 detects incompatible OpenMP runtime calls to abort converting a target region to SPMD mode. Calls to memory allocation/de-allocation routines kmpc_alloc_shared, kmpc_free_shared are incompatible unless they are removed by AAHeapToStack/AAHeapToShared analysis. This patch extends SPMDization detection to include AAHeapToStack/AAHeapToShared analysis results for enlarging the scope of possible SPMDized regions detected. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105634	2021-07-22 18:08:37 -07:00
Vitaly Buka	381f03cdf9	[NFC][asan] Always pass Dominator Trees into forAllReachableExits	2021-07-22 18:01:38 -07:00
Gulfem Savrun Yeniceri	4e540995b1	[profile] Add binary id into profiles This patch adds binary id into profiles to easily associate binaries with the corresponding profiles. There is an RFC that discusses the motivation, design and implementation in more detail: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151154.html Differential Revision: https://reviews.llvm.org/D102039	2021-07-23 00:19:12 +00:00
Florian Mayer	b276efa2ab	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-22 16:20:27 -07:00
Alexander Yermolovich	a827bbdfea	[DWP] Refactoring llvm-dwp in to a library part 2 This is follow up to https://reviews.llvm.org/D106198 where llvm-dwp was refactored in to multiple files. In this patch moving them in to lib/include directories. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106493	2021-07-22 14:23:29 -07:00
Nick Fitzgerald	f7deab5277	Reland: "[WebAssembly] Deduplicate imports of the same module name, field name, and type" When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reverted in: 16aac493e59519377071e900d119ba2e7e5b525d. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-22 14:16:05 -07:00
Paulo Matos	e8be0ee828	[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR Reland of 31859f896. This change implements new DAG notes GLOBAL_GET/GLOBAL_SET, and lowering methods for load and stores of reference types from IR globals. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104797	2021-07-22 22:07:24 +02:00
Jon Chesterfield	0493801fb2	[nfc] Fix typo in comment, s/node/note	2021-07-22 20:16:53 +01:00
Victor Huang	017f21fed1	[PowerPC] Add PowerPC "__stbcx" builtin and intrinsic for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtin and intrinsic for "__stbcx". Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D106484	2021-07-22 10:48:46 -05:00
Alexey Bataev	fd1d10a20f	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 08:44:37 -07:00
Alexey Bataev	6351ecd4dc	Revert "[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments." This reverts commit b455f7f22564a096c043b02fa159ab16669c121c to fix buildbots.	2021-07-22 08:06:29 -07:00
Alexey Bataev	0261373c6d	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 07:53:37 -07:00
Kazu Hirata	19374d4da0	[Transforms] Remove getOrCreateInitFunction (NFC) The last use was removed on Jan 16, 2019 in commit 81101de5853b4ed64640220a086a67b16f36f153.	2021-07-22 06:30:39 -07:00
Paulo Matos	ceddd7eb41	Add support for zero-sized Scalars as a LowLevelType Opaque values (of zero size) can be stored in memory with the implemention of reference types in the WebAssembly backend. Since MachineMemOperand uses LLTs we need to be able to support zero-sized scalars types in LLTs. Differential Revision: https://reviews.llvm.org/D105423	2021-07-22 13:47:19 +02:00
Florian Mayer	152a339cb1	Revert "[hwasan] Use stack safety analysis." This reverts commit bde9415fef25e9ff6e10595a2f4f5004dd62f10a.	2021-07-22 12:16:16 +01:00
Florian Mayer	fa5973a54d	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-22 12:04:54 +01:00
Simon Tatham	bc23ee33a0	[clang] Use i64 for the !srcloc metadata on asm IR nodes. This is part of a patch series working towards the ability to make SourceLocation into a 64-bit type to handle larger translation units. !srcloc is generated in clang codegen, and pulled back out by llvm functions like AsmPrinter::emitInlineAsm that need to report errors in the inline asm. From there it goes to LLVMContext::emitError, is stored in DiagnosticInfoInlineAsm, and ends up back in clang, at BackendConsumer::InlineAsmDiagHandler(), which reconstitutes a true clang::SourceLocation from the integer cookie. Throughout this code path, it's now 64-bit rather than 32, which means that if SourceLocation is expanded to a 64-bit type, this error report won't lose half of the data. The compiler will tolerate both of i32 and i64 !srcloc metadata in input IR without faulting. Test added in llvm/MC. (The semantic accuracy of the metadata is another matter, but I don't know of any situation where that matters: if you're reading an IR file written by a previous run of clang, you don't have the SourceManager that can relate those source locations back to the original source files.) Original version of the patch by Mikhail Maltsev. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D105491	2021-07-22 10:24:52 +01:00
Johannes Doerfert	54c73c71f7	[Attributor][FIX] Do not introduce multiple instances of SSA values If we have a recursive function we could create multiple instantiations of an SSA value, one per recursive invocation of the function. This is a problem as we use SSA value equality in various places. The basic idea follows from this test: ``` static int r(int c, int *a) { int X; return c ? r(false, &X) : a == &X; } int test(int c) { return r(c, undef); } ``` If we look through the argument `a` we will end up with `X`. Using SSA value equality we will fold `a == &X` to true and return true even though it should have been false because `a` and `&X` are from different instantiations of the function. Various tests for this have been placed in value-simplify-instances.ll and this commit fixes them all by avoiding to produce simplified values that could be non-unique at runtime. Thus, the result of a simplify value call will always be unique at runtime or the original value, both do not allow to accidentally compare two instances of a value with each other and conclude they are equal statically (pointer equivalence) while they are unequal at runtime.	2021-07-22 00:07:55 -05:00
Johannes Doerfert	b93c50b86a	[Attributor] Improve the Attributor::getAssumedConstant interface Similar to Attributor::getAssumedSimplified we need to allow IRPs directly to get the right simplification callback (and context).	2021-07-22 00:07:55 -05:00
Johannes Doerfert	59bc220605	[Attributor][NFC] Clang format	2021-07-21 22:51:05 -05:00
Joseph Huber	9d542314e4	[Libomptarget] Introduce new main thread ID runtime function This patch introduces `__kmpc_is_generic_main_thread_id` which splits the old comparison into its own runtime function. The purpose of this is so we can fold this part independently, so when both this and `is_spmd_mode` are folded the final function will be folded as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106437	2021-07-21 21:18:14 -04:00
Joseph Huber	472a223072	[OpenMP] Change `__kmpc_free_shared` to include the paired allocation size This patch changes `__kmpc_free_shared` to take an additional argument corresponding to the associated allocation's size. This makes it easier to implement the allocator in the runtime. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106496	2021-07-21 20:56:21 -04:00
Lang Hames	549c960a94	Re-re-revert "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." This reverts commit 6b2a96285b9bbe92d2c5e21830f21458f8be976d. The ccache builders are still failing. Looks like they need to be updated to get the llvm-zorg config change in 490633945677656ba75d42ff1ca9d4a400b7b243. I'll re-apply this as soon as the builders are updated.	2021-07-22 10:45:24 +10:00
Lang Hames	6f51759135	Re-re-apply "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." This reapplies commit a7733e9556b5a6334c910f88bcd037e84e17e3fc ("Re-apply [ORC][ORC-RT] Add initial native-TLV support to MachOPlatform."), and d4abdefc998a1ee19d5edc79ec233774cbf64f6a ("[ORC-RT] Rename macho_tlv.x86-64.s to macho_tlv.x86-64.S (uppercase suffix)"). These patches were reverted in 48aa82cacbff10e1c5395a03f86488bf449ba4da while I investigated bot failures (e.g. https://lab.llvm.org/buildbot/#/builders/109/builds/18981). The fix was to disable building of the ORC runtime on buliders using ccache (which is the same fix used for other compiler-rt projects containing assembly code). This fix was commited to llvm-zorg in 490633945677656ba75d42ff1ca9d4a400b7b243.	2021-07-22 09:46:52 +10:00
Thomas Lively	8403ff42b3	[WebAssembly] Replace @llvm.wasm.popcnt with @llvm.ctpop.v16i8 Use the standard target-independent intrinsic to take advantage of standard optimizations. Differential Revision: https://reviews.llvm.org/D106506	2021-07-21 16:45:54 -07:00
Stanislav Mekhanoshin	2ef5dd8386	Prevent dead uses in register coalescer after rematerialization The coalescer does not check if register uses are available at the point of rematerialization. If it attempts to rematerialize an instruction with such uses it can end up with use without a def. LiveRangeEdit does such check during rematerialization, so just call LiveRangeEdit::allUsesAvailableAt() to avoid the problem. Differential Revision: https://reviews.llvm.org/D106396	2021-07-21 15:19:55 -07:00
Gulfem Savrun Yeniceri	8179b3101d	Revert "[profile] Add binary id into profiles" Revert "[profile] Change linkage type of a compiler-rt func" This reverts commits f984ac2715f71c38a7872fa2c2ad535b3d4fa285 and 467c7191249b76abff33853b1692a77f327c2422 because it broke some builds.	2021-07-21 19:15:18 +00:00
Giorgis Georgakoudis	7e612fb3a1	[Attributor] Preserve BBs and instructions added in AA manifests Manifesting AbstractAttributes may add new BBs in the IR. This patch provides an interface to register those BBs in the Attributor so that those BBs and containing instructions are not deleted as dead. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106383	2021-07-21 11:27:00 -07:00
Eli Friedman	4384bac220	[SelectionDAG] Fix the representation of ISD::STEP_VECTOR. The existing rule about the operand type is strange. Instead, just say the operand is a TargetConstant with the right width. (Legalization ignores TargetConstants, so it doesn't matter if that width is legal.) Highlights: 1. I had to substantially rewrite the AArch64 isel patterns to expect a TargetConstant. Nothing too exotic, but maybe a little hairy. Maybe worth considering a target-specific node with some dagcombines instead of this complicated nest of isel patterns. 2. Our behavior on RV32 for vectors of i64 has changed slightly. In particular, we correctly preserve the width of the arithmetic through legalization. This changes the DAG a bit. Maybe room for improvement here. 3. I explicitly defined the behavior around overflow. This is necessary to make the DAGCombine transforms legal, and I don't think it causes any practical issues. Differential Revision: https://reviews.llvm.org/D105673	2021-07-21 10:58:40 -07:00
Gulfem Savrun Yeniceri	5edc17d32b	[profile] Add binary id into profiles This patch adds binary id into profiles to easily associate binaries with the corresponding profiles. There is an RFC that discusses the motivation, design and implementation in more detail: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151154.html Differential Revision: https://reviews.llvm.org/D102039	2021-07-21 17:55:43 +00:00
Giorgis Georgakoudis	2b07724890	[Attributor][NFC] Modify isAssumedHeapToStack for const argument There is no need for a non-const argument interface and the const argument modification covers existing and upcoming use cases. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106418	2021-07-21 10:28:21 -07:00
Giorgis Georgakoudis	ca49ab772d	[OpenMP] Expose libomptarget function to get HW thread id The patch exposes the libomptarget runtime function that gets the hardware thread id through the kmpc API. This is to be used in SPMDization for checking the thread id to execute regions by a single thread in a block. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106323	2021-07-21 10:26:04 -07:00
Thomas Lively	d9c07b1710	[WebAssembly] Codegen for v128.load{32,64}_zero Replace the experimental clang builtins and LLVM intrinsics for these instructions with normal instruction selection patterns. The wasm_simd128.h intrinsics header was already using portable code for the corresponding intrinsics, so now it produces the correct instructions. Differential Revision: https://reviews.llvm.org/D106400	2021-07-21 09:02:12 -07:00
Eric Astor	2b759c7f74	[ms] [llvm-ml] Support built-in text macros Add support for all built-in text macros supported by ML64: @Date, @Time, @FileName, @FileCur, and @CurSeg. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D104965	2021-07-21 11:44:09 -04:00
Quinn Pham	2b9fe667ec	[PowerPC] Floating Point Builtins for XL Compat. This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds builtins related to floating point operations Reviewed By: #powerpc, nemanjai, amyk, NeHuang Differential Revision: https://reviews.llvm.org/D103986	2021-07-21 08:33:39 -05:00
Jakub Kuderski	fe1c7a103a	[ADT] Add initializer_list constructor to SmallDenseMap Make it easier to initialize small maps inline. Note that DenseMap already has an initializer_list constructor. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106363	2021-07-21 09:32:16 -04:00
Guillaume Chatelet	a29bc1a45f	[llvm] Add enum iteration to Sequence This patch allows iterating typed enum via the ADT/Sequence utility. It also changes the original design to better separate concerns: - `StrongInt` only deals with safe `intmax_t` operations, - `SafeIntIterator` presents the iterator and reverse iterator interface but only deals with safe `StrongInt` internally. - `iota_range` only deals with `SafeIntIterator` internally. This design ensures that operations are always valid. In particular, "Out of bounds" assertions fire when: - the `value_type` is not representable as an `intmax_t` - iterator operations make internal computation underflow/overflow - the internal representation cannot be converted back to `value_type` Differential Revision: https://reviews.llvm.org/D106279	2021-07-21 12:48:53 +00:00
Rosie Sumpter	b11b07e0b8	[LoopFlatten][LoopInfo] Use Loop to identify latch compare instruction Make getLatchCmpInst non-static and use it in LoopFlatten as a more robust way of identifying the compare. Differential Revision: https://reviews.llvm.org/D106256	2021-07-21 10:14:18 +01:00
Lang Hames	864fe23f53	[ORC][ORC-RT] Revert MachO TLV patches while I investigate more bot failures. This reverts commit d4abdefc998a1ee19d5edc79ec233774cbf64f6a ("[ORC-RT] Rename macho_tlv.x86-64.s to macho_tlv.x86-64.S (uppercase suffix)", and a7733e9556b5a6334c910f88bcd037e84e17e3fc ("Re-apply "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform."), while I investigate failures on ccache builders (e.g. https://lab.llvm.org/buildbot/#/builders/109/builds/18981)	2021-07-21 15:52:33 +10:00
Lang Hames	248727a066	Re-apply "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." Reapplies fe1fa43f16beac1506a2e73a9f7b3c81179744eb, which was reverted in 6d8c63946cc259c0af02584b7cc690dde11dea35, with fixes: 1. Remove .subsections_via_symbols directive from macho_tlv.x86-64.s (it's not needed here anyway). 2. Return error from pthread_key_create to the MachOPlatform to silence unused variable warning.	2021-07-21 15:11:22 +10:00
Lang Hames	53129328a6	Revert "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." Reverts commit fe1fa43f16beac1506a2e73a9f7b3c81179744eb while I investigate failures on Linux.	2021-07-21 09:22:55 +10:00
Lang Hames	ad6b1171f3	[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform. Adds code to LLVM (MachOPlatform) and the ORC runtime to support native MachO thread local variables. Adding new TLVs to a JITDylib at runtime is supported. On the LLVM side MachOPlatform is updated to: 1. Identify thread local variables in the LinkGraph and lower them to GOT accesses to data in the __thread_data or __thread_bss sections. 2. Merge and report the address range of __thread_data and thread_bss sections to the runtime. On the ORC runtime a MachOTLVManager class introduced which records the address range of thread data/bss sections, and creates thread-local instances from the initial data on demand. An orc-runtime specific tlv_get_addr implementation is included which saves all register state then calls the MachOTLVManager to get the address of the requested variable for the current thread.	2021-07-21 09:10:10 +10:00
Lang Hames	255fc69d3a	[JITLink] Add support for moving blocks and symbols between sections. LinkGraph::transferBlock can be used to move a block and all associated symbols from one section to another. LinkGraph::mergeSections moves all blocks and sections from a source section to a destination section.	2021-07-21 09:10:09 +10:00
Albion Fung	d88c540901	[PowerPC] Implemented mtmsr, mfspr, mtspr Builtins Implemented builtins for mtmsr, mfspr, mtspr on PowerPC; the patch is intended for XL Compatibility. Differential revision: https://reviews.llvm.org/D106130	2021-07-20 17:51:00 -05:00
Alex Lorenz	e6cc4def5d	[clang][darwin] Add support for macOS -> Mac Catalyst version remapping to the Darwin SDK Info Differential Revision: https://reviews.llvm.org/D105958	2021-07-20 14:25:33 -07:00
Albion Fung	63ce4846c7	[PowerPC] Store, load, move from and to registers related builtins This patch implements store, load, move from and to registers related builtins, as well as the builtin for stfiw. The patch aims to provide feature parady with xlC on AIX. Differential revision: https://reviews.llvm.org/D105946	2021-07-20 15:46:14 -05:00
Sterling Augustine	c4088202b5	Consolidate string types into ptr and length representations. After rGbbbc4f110e35ac709b943efaa1c4c99ec073da30, we can move any string type that has convenient pointer and length fields into the PtrAndLengthKind, reducing the amount of code. Differential Revision: https://reviews.llvm.org/D106381	2021-07-20 13:29:57 -07:00
Fangrui Song	2174d3b961	[LTO] Add SelectionKind to IRSymtab and use it in ld.lld/LLVMgold In PGO, a C++ external linkage function `foo` has a private counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. A `__attribute__((weak))` function `foo` has a weak hidden counter `__profc_foo` and a private `__profd_foo` in a `comdat nodeduplicate`. In `ld.lld a.o b.o`, say a.o defines an external linkage `foo` and b.o defines a weak `foo`. Currently we treat `comdat nodeduplicate` as `comdat any`, ld.lld will incorrectly consider `b.o:__profc_foo` non-prevailing. In the worst case when `b.o:__profd_foo` is retained and `b.o:__profc_foo` isn't, there will be dangling reference causing an `undefined hidden symbol` error. Add SelectionKind to `Comdat` in IRSymtab and let linkers ignore nodeduplicate comdat. Differential Revision: https://reviews.llvm.org/D106228	2021-07-20 13:22:00 -07:00
Fangrui Song	dd6e19a41c	[IR] Rename `comdat noduplicates` to `comdat nodeduplicate` In the textual format, `noduplicates` means no COMDAT/section group deduplication is performed. Therefore, if both sets of sections are retained, and they happen to define strong external symbols with the same names, there will be a duplicate definition linker error. In PE/COFF, the selection kind lowers to `IMAGE_COMDAT_SELECT_NODUPLICATES`. The name describes the corollary instead of the immediate semantics. The name can cause confusion to other binary formats (ELF, wasm) which have implemented/ want to implement the "no deduplication" selection kind. Rename it to be clearer. Reviewed By: rnk Differential Revision: https://reviews.llvm.org/D106319	2021-07-20 12:47:10 -07:00
Shilei Tian	0d5ca9b780	[OpenMP][deviceRTLs] Update return type of function __kmpc_parallel_level In `deviceRTLs`, the parallel level is stored in a shared variable of type `uint8_t`. `__kmpc_parallel_level` currently returns a 16-bit interger. This patch first changes the return type of the function to `uint8_t`, same as the shared variable, and then corrects function type which was updated in D105955. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106384	2021-07-20 15:45:43 -04:00
Nikita Popov	8c4fc78279	[AttrBuilder] Assert correct attribute kind Make sure that addAttribute() is only used with simple enum attributes. Integer and type attributes need to provide an additional value/type.	2021-07-20 21:16:23 +02:00
Victor Huang	f281101580	[PowerPC] Add PowerPC cmpb builtin and emit target indepedent code for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch add the builtin and emit target independent code for __cmpb. Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D105194	2021-07-20 13:06:22 -05:00
Nikita Popov	8eac42b1cf	[Inline] Fix noalias addition on simplified instructions (PR50589) When adding noalias/alias.scope metadata, we analyze the instructions of the original callee, and then place metadata on the corresponding inlined instructions in the caller as provided by VMap. However, this assumes that this actually a clone of the instruction, rather than the result of simplification. If simplification occurred, the instruction that VMap points to may not have any relationship as far as ModRef behavior is concerned. Fix this by tracking simplified instructions during cloning and then only processing instructions that have not been simplified. This is done with an additional map form original to cloned instruction, into which we only insert if no simplification is performed. The mapping in VMap can then be compared to this map. If they're the same, the instruction hasn't been simplified. (I originally wanted to only track a set of simplified instructions, but that wouldn't work if the instruction only gets simplified afterwards, e.g. based on rewritten phis.) Fixes https://bugs.llvm.org/show_bug.cgi?id=50589. Differential Revision: https://reviews.llvm.org/D106242	2021-07-20 19:52:41 +02:00
Arthur Eubanks	d577e2748f	[NewPM] Print pre-transformation IR name in --print-after-all Sometimes a transformation can change the name of some IR (e.g. an SCC with functions added/removed). This can be confusing when debug logging doesn't match the post-transformation name. The specific example I came across was that --print-after-all said the inliner was working on an SCC that only contained one function, but calls in multiple functions were getting inlined. After all inlining, the current SCC only contained one function. Piggyback off of the existing logic to handle invalidated IR + --print-module-scope. Simply always store the IR description and use that. Reviewed By: jamieschmeiser Differential Revision: https://reviews.llvm.org/D106290	2021-07-20 10:20:10 -07:00
Quinn Pham	b0ba22b69f	[PowerPC] Semachecking for XL compat builtin icbt This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds semachecking for an already implemented builtin, `__icbt`. `__icbt` is only valid for Power8 and up. Reviewed By: #powerpc, nemanjai Differential Revision: https://reviews.llvm.org/D105834	2021-07-20 11:05:22 -05:00
Stefan Pintilie	dbd743acfc	[PowerPC] Inefficient register allocation of ACC registers results in many copies. ACC registers are a combination of four consecutive vector registers. If the vector registers are assigned first this often forces a number of copies to appear just before the ACC register is created. If the ACC register is assigned first then fewer copies are generated when the vector registers are assigned. This patch tries to force the register allocator to assign the ACC registers first and then the UACC registers and then the vector pair registers. It does this by changing the priority of the register classes. This patch also adds hints to help the register allocator assign UACC registers from known ACC registers and vector pair registers from known UACC registers. Reviewed By: nemanjai Differential Revision: https://reviews.llvm.org/D105854	2021-07-20 10:53:40 -05:00
Sterling Augustine	4a2706ecc3	Avoid keeping internal string_views in Twine. This is a follow-up to https://reviews.llvm.org/D103935 A Twine's internal layout should not depend on which version of the C++ standard is in use. Dynamically linking binaries compiled with two different layouts (eg, --std=c++14 vs --std=c++17) ends up problematic. This change avoids that issue by immediately converting a string_view to a pointer-and-length at the cost of an extra eight-bytes in Twine. Differential Revision: https://reviews.llvm.org/D106186	2021-07-20 08:46:53 -07:00
Anirudh Prasad	0dda2de3a7	[SystemZ][z/OS] Add GOFF support to file magic identification - This patch adds in the GOFF format to the file magic identification logic in LLVM - Currently, for the object file support, GOFF is marked as having as an error - However, this is only temporary until https://reviews.llvm.org/D98437 is merged in Reviewed By: abhina.sreeskantharajan Differential Revision: https://reviews.llvm.org/D105993	2021-07-20 10:50:47 -04:00
Simon Pilgrim	41db4b6343	[CostModel] Templatize EntryCost::Cost to allow custom cost metrics We currently use an unsigned value for our CostTblEntry and TypeConversionCostTblEntry cost tables which is limiting depending on how the target wishes to handle various CostKinds etc. For instance, targets might wish to store separate instruction count, latency or throughput values etc. On D46276 we have been investigating storing a code snippet to improve latency/throughput cost calculations. There is a slight problem in that template argument deduction was struggling to match the now templatized Costs[] tables in a ArrayRef constructor - I've added helper wrappers for CostTableLookup/ConvertCostTableLookup which avoids us having to update all existing calls with a template hint. Differential Revision: https://reviews.llvm.org/D106351	2021-07-20 15:31:39 +01:00
Florian Mayer	be3736ea6f	Revert "[hwasan] Use stack safety analysis." This reverts commit e9c63ed10b3bdf6eb3fa76d1a3eb403d6fc6a118.	2021-07-20 10:36:46 +01:00
Florian Mayer	1c1f625528	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-20 10:06:35 +01:00
Johannes Doerfert	8c94163113	[Attributor] Gracefully handle interprocedural reachability queries This does ensure `InformationCache::getPotentiallyReachable` will not crash/assert on instructions from different functions but simply return that one is reachable, which is conservatively correct.	2021-07-20 00:35:14 -05:00
Johannes Doerfert	28e9e9d011	[Attributor][NFCI] Expose `getAssumedUnderlyingObjects` API	2021-07-20 00:35:13 -05:00
Johannes Doerfert	6f8d0fac4a	[Attributor][NFC] Fix function name spelling	2021-07-20 00:35:13 -05:00
Johannes Doerfert	895c3ad5ac	[Attributor] Introduce AAPointerInfo This patch introduces AAPointerInfo which tracks the uses of a pointer and places them in "bins" based on their offset from the base and access size. As with other AAs, any pointer can be tracked but it is up to the user to make sense of the results. The user in this patch is AAValueSimplify and AAPotentialValues which both utilize AAPointerInfo to determine the value of a load. For now, this is restricted to loads of allocas and internal globals. Through the use of AAPointerInfo and the "bins" we can track struct members separately. The users also know that storing only zeros (at unknown indices) will result in loading only 0 (from unknown indices). Other than that, the users are flow and context insensitive (for now). To deal with the "bins" more easily, AAPointerInfo provides a forallInterfearingAccesses that applies a callback on all accesses that might interfere with a given load or store. Differential Revision: https://reviews.llvm.org/D104432	2021-07-19 22:48:35 -05:00
Johannes Doerfert	273b9adf8c	[Attributor] Simplify loads As a first step to simplify loads we only handle `null` and `undef` underlying objects, as well as objects that have the load as a single user. Loads of those values can be replaced by the initializer, if any. Proper reasoning is introduced in a follow up patch Differential Revision: https://reviews.llvm.org/D103862	2021-07-19 22:47:29 -05:00
Fangrui Song	9f9a9f472c	Revert D105519 "[WebAssembly] Deduplicate imports of the same module name, field name, and type" and its followup This reverts commit 4ae575b9997e0903d1c2ec01a43e3f3f2db5df16 and 9b965b37c75d626c01951184088314590e38d299. There is an use-of-uninitialized-value bug in the `else` branch in ImportSection::addImport.	2021-07-19 17:09:01 -07:00
Artem Belevich	33f436abed	Revert "[MemCpyOpt] Enable memcpy optimizations unconditionally." This reverts commit 2c98298a7559dfe4a264ef1adaad0921526768cc which breaks sanitizers.	2021-07-19 14:27:41 -07:00
Derek Schuff	0c7127a873	[WebAssembly] Generate R_WASM_FUNCTION_OFFSET relocs in debuginfo sections Debug info sections need R_WASM_FUNCTION_OFFSET_I32 relocs (with FK_Data_4 fixup kinds) to refer to functions (instead of R_WASM_TABLE_INDEX as is used in data sections). Usually this is done in a convoluted way, with unnamed temp data symbols which target the start of the function, in which case WasmObjectWriter::recordRelocation converts it to use the section symbol instead. However in some cases the function can actually be undefined; in this case the dwarf generator uses the function symbol (a named undefined function symbol) instead. In that case the section-symbol transform doesn't work and we need to generate the correct reloc type a different way. In this change WebAssemblyWasmObjectWriter::getRelocType takes the fixup section type into account to choose the correct reloc type. Fixes PR50408 Differential Revision: https://reviews.llvm.org/D103557	2021-07-19 14:02:33 -07:00
Nick Fitzgerald	b3c6e87b2d	[WebAssembly] Deduplicate imports of the same module name, field name, and type When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-19 13:59:02 -07:00
Stuart Brady	1df98c77f3	[demangler] Fix demangling of 'half' Demangle 'Dh' as 'half' (as per GCC), and not 'decimal16' (which doesn't make sense, as there is no IEEE 754 decimal16 format). The Itanium C++ ABI specification describes 'Dh' as: > IEEE 754r half-precision floating point (16 bits) (https://itanium-cxx-abi.github.io/cxx-abi/abi.html#mangling-builtin) Reviewed By: ldionne, jyknight Differential Revision: https://reviews.llvm.org/D103833	2021-07-19 21:21:34 +01:00
Tony Tye	e4ba84ff98	[AMDGPU] Reserve AMDGPU ELF e_flags machine 0x45 Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D106249	2021-07-19 20:17:35 +00:00
Petr Hosek	1aff1e2660	[InstrProfiling] Use weak alias for bias variable We need the compiler generated variable to override the weak symbol of the same name inside the profile runtime, but using LinkOnceODRLinkage results in weak symbol being emitted in which case the symbol selected by the linker is going to depend on the order of inputs which can be fragile. This change replaces the use of weak definition inside the runtime with a weak alias. We place the compiler generated symbol inside a COMDAT group so dead definition can be garbage collected by the linker. We also disable the use of runtime counter relocation on Darwin since Mach-O doesn't support weak external references, but Darwin already uses a different continous mode that relies on overmapping so runtime counter relocation isn't needed there. Differential Revision: https://reviews.llvm.org/D105176	2021-07-19 12:23:51 -07:00
Artem Belevich	d635785a03	[MemCpyOpt] Enable memcpy optimizations unconditionally. The patch does not depend on the availability of the library functions for memcpy/memset as it operates on LLVM intrinsics. The optimizations are useful on the targets that have these functions disabled (e.g. NVPTX & AMDGPU). Differential Revision: https://reviews.llvm.org/D104801	2021-07-19 11:58:02 -07:00
Haowei Wu	5403a31116	[ifs][elfabi] Merge llvm-ifs/elfabi tools This change merges llvm-elfabi and llvm-ifs tools. Differential Revision: https://reviews.llvm.org/D100139	2021-07-19 11:23:19 -07:00
Haowei Wu	bcce63dd25	[ifs] Prepare llvm-ifs for elfabi/ifs merging. This diff changes llvm-ifs to use unified IFS file format and perform other renaming changes in preparation for the merging between elfabi/ifs. Differential Revision: https://reviews.llvm.org/D99810	2021-07-19 11:23:00 -07:00
Haowei Wu	db5e2f303b	[elfabi] Prepare elfabi/ifs merging. This change implements unified text stub format and command line interface proposed in the elfabi/ifs merge plan. Differential Revision: https://reviews.llvm.org/D99399	2021-07-19 11:22:43 -07:00
Amy Huang	f2ec69fb59	Revert "[llvm][sve] Lowering for VLS truncating stores" because it causes a seg fault (see https://reviews.llvm.org/D104471). This reverts commit c305557acdaad453e32309d575fe9c6c7090c099.	2021-07-19 11:03:33 -07:00
Amara Emerson	49ad61c372	[GlobalISel] Fix load-or combine moving loads across potential aliasing stores. Although this combine checks that there's no load folding barriers between the loads that it's trying to merge, it was inserting the load at the MIRBuilder's default insertion point, which is the G_OR use inst. This was causing a miscompile in the test suite's SingleSource/Regression/C/gcc-c-torture/execute/GCC-C-execute-bswap-2 Differential Revision: https://reviews.llvm.org/D106251	2021-07-19 10:23:23 -07:00
Wouter van Oortmerssen	8a42745952	[WebAssembly] Support R_WASM_MEMORY_ADDR_TLS_SLEB64 for wasm64 Also fixed TLS tests swapping addr & value in store op Differential Revision: https://reviews.llvm.org/D106096	2021-07-19 10:22:43 -07:00
Victor Campos	33327c4fc5	[NewPM] Fix wrong perfect forwardings Some template functions were missing '&&' in function arguments, therefore these were always taken by value after template instantiation. This patch adds the double ampersand to introduce proper perfect forwarding. Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D106148	2021-07-19 17:21:32 +01:00
Simon Pilgrim	bcad7af28b	[ISD] Add disclaimer comments to AssertSext/Zext/Align opcodes about poison values As encountered on D106053, we need to be very explicit that the Assertion nodes don't hold true for a poison value (or for specific poisoned vector elements). Differential Revision: https://reviews.llvm.org/D106257	2021-07-19 17:15:28 +01:00
maekawatoshiki	10a3d40457	[LICM] Create LoopNest Invariant Code Motion (LNICM) pass This patch adds a new pass called LNICM which is a LoopNest version of LICM and a test case to show how LNICM works. Basically, LNICM only hoists invariants out of loop nest (not a loop) to keep/make perfect loop nest. This enables later optimizations that require perfect loop nest. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D104180	2021-07-20 00:31:18 +09:00
Matt Arsenault	9fc3416fcc	GlobalISel: Preserve LLT when bitcasting loads and stores This also avoids improperly legalizing some truncating vector stores.	2021-07-19 11:30:14 -04:00
Jeremy Morse	6df259fb43	[InstrRef][X86] Drop debug instruction numbers from x87 instructions Avoid a crash when using instruction referencing if x87 floating point instructions are used. These instructions are significantly mutated when they're rewritten from referring to registers, to referring to floating-point-stack positions. As a result, their operands are re-ordered, and (InstrRef) LiveDebugValues asserts when it sees a DBG_INSTR_REF referring to a non-reg non-def register operand. To fix this, drop the instruction numbers, and thus variable locations. This patch adds a helper utility do do that. Dropping the variable locations is sub-optimal, but applying DBG_VALUEs to the $fp0 and similar registers is dropped on emission too. It seems we've never done well at describing variables that live in x87 registers, at all. Differential Revision: https://reviews.llvm.org/D105657	2021-07-19 15:08:27 +01:00
Kazu Hirata	297f95d1cb	[CodeGen] Remove isNON_TRUNCStore and isTRUNCStore (NFC) The last use of isNON_TRUNCStore was removed on Oct 10, 2018 in commit 07acc992dc39edfccc5a4b773c3dcf8a5bf6d893. isTRUNCStore seems to be unused for at least 10 years.	2021-07-19 06:56:04 -07:00
Florian Mayer	c24e3effdd	Revert "[hwasan] Use stack safety analysis." This reverts commit 12268fe14a1a65d4b62f0b6e5beab46ba8501ae7.	2021-07-19 12:08:32 +01:00
Florian Mayer	b61613dac7	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-19 11:54:44 +01:00
Florian Mayer	070bb38c2e	[NFC] [MTE] helper for stack tagging lifetimes. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D106135	2021-07-19 11:09:16 +01:00

1 2 3 4 5 ...

45641 Commits