llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
David Sherwood	2d2e4a1b17	[Analysis] Add simple cost model for strict (in-order) reductions I have added a new FastMathFlags parameter to getArithmeticReductionCost to indicate what type of reduction we are performing: 1. Tree-wise. This is the typical fast-math reduction that involves continually splitting a vector up into halves and adding each half together until we get a scalar result. This is the default behaviour for integers, whereas for floating point we only do this if reassociation is allowed. 2. Ordered. This now allows us to estimate the cost of performing a strict vector reduction by treating it as a series of scalar operations in lane order. This is the case when FP reassociation is not permitted. For scalable vectors this is more difficult because at compile time we do not know how many lanes there are, and so we use the worst case maximum vscale value. I have also fixed getTypeBasedIntrinsicInstrCost to pass in the FastMathFlags, which meant fixing up some X86 tests where we always assumed the vector.reduce.fadd/mul intrinsics were 'fast'. New tests have been added here: Analysis/CostModel/AArch64/reduce-fadd.ll Analysis/CostModel/AArch64/sve-intrinsics.ll Transforms/LoopVectorize/AArch64/strict-fadd-cost.ll Transforms/LoopVectorize/AArch64/sve-strict-fadd-cost.ll Differential Revision: https://reviews.llvm.org/D105432	2021-07-26 10:26:06 +01:00
Lang Hames	480ddba43e	[ORC][ORC-RT] Add initial Objective-C and Swift support to MachOPlatform. This allows ORC to execute code containing Objective-C and Swift classes and methods (provided that the language runtime is loaded into the executor).	2021-07-26 18:02:01 +10:00
Nikita Popov	ed5a269ff1	[Attributes] Clean up handling of UB implying attributes (NFC) Rather than adding methods for dropping these attributes in various places, add a function that returns an AttrBuilder with these attributes, which can then be used with existing methods for dropping attributes. This is with an eye on D104641, which also needs to drop them from returns, not just parameters. Also be more explicit about the semantics of the method in the documentation. Refer to UB rather than Undef, which is what this is actually about.	2021-07-25 18:21:13 +02:00
Kazu Hirata	697f5408f6	[GlobalISel] Remove FlagsOp (NFC) The class was introduced without a use on Dec 11, 2018 in commit cef44a234219e38e1c28c902ff24586150eef682.	2021-07-25 07:05:07 -07:00
Kazu Hirata	3e06079a92	[Inline] Fix a warning by removing an explicit copy constructor This patch fixes the warning: llvm/include/llvm/Analysis/InlineCost.h:62:3: error: definition of implicit copy assignment operator for 'CostBenefitPair' is deprecated because it has a user-declared copy constructor [-Werror,-Wdeprecated-copy] by removing the explicit copy constructor.	2021-07-25 06:56:47 -07:00
Liqiang Tao	4952863892	[llvm][Inline] Add interface to return cost-benefit stuff Return cost-benefit stuff which is computed by cost-benefit analysis. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D105349	2021-07-25 20:18:19 +08:00
Kazu Hirata	45ba93f745	[ADT] Remove WrappedPairNodeDataIterator (NFC) The last use was removed on Jul 16, 2020 in commit f1d4db4f0cdcbfeaee0840bf8a4fb5dc1b9b56fd.	2021-07-24 08:02:57 -07:00
Sander de Smalen	1f523effc3	[BasicTTI] Set scalarization cost of scalable vector casts to Invalid. When BasicTTIImpl::getCastInstrCost can't determine the cost of a vector cast operation when the types need legalization, it falls back to calculating scalarization costs. Instead of crashing on `cast<FixedVectorType>(DstVTy)` when the type is a scalable vector, return an Invalid cost. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D106655	2021-07-24 14:13:21 +01:00
Simon Pilgrim	cb0d04c29e	[DAG] Add initial SelectionDAG::isGuaranteedNotToBeUndefOrPoison framework (PR51129) I've setup the basic framework for the isGuaranteedNotToBeUndefOrPoison call and updated DAGCombiner::visitFREEZE to use it, further Opcodes can be handled when we have test coverage. I'm not aware of any vector test freeze coverage so the DemandedElts (and the Depth) args are not being used yet - but they are in place. SelectionDAG::isGuaranteedNotToBePoison wrappers have also been added. Differential Revision: https://reviews.llvm.org/D106668	2021-07-24 11:36:35 +01:00
Amara Emerson	c79055fae4	[GlobalISel] Add GUnmerge, GMerge, GConcatVectors, GBuildVector abstractions. NFC. Use these to slightly simplify some code in the artifact combiner.	2021-07-23 22:32:26 -07:00
Lang Hames	500a10cb5e	Re-re-re-apply "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." The ccache builders have received a config update that should eliminate the build issues seen previously.	2021-07-24 13:16:12 +10:00
Kuter Dinel	a502a514d2	[AMDGPU] Deduce attributes with the Attributor This patch introduces a pass that uses the Attributor to deduce AMDGPU specific attributes. Reviewed By: jdoerfert, arsenm Differential Revision: https://reviews.llvm.org/D104997	2021-07-24 06:07:15 +03:00
Thomas Lively	a913c9bb30	[WebAssembly] Codegen for pmin and pmax Replace the clang builtins and LLVM intrinsics for {f32x4,f64x2}.{pmin,pmax} with standard codegen patterns. Since wasm_simd128.h uses an integer vector as the standard single vector type, the IR for the pmin and pmax intrinsic functions contains bitcasts that would not be there otherwise. Add extra codegen patterns that can still select the pmin and pmax instructions in the presence of these bitcasts. Differential Revision: https://reviews.llvm.org/D106612	2021-07-23 14:49:21 -07:00
Roman Lebedev	fdb7d69784	[NFC][BasicBlockUtils] Refactor GetIfCondition() to return the branch, not its condition Otherwise e.g. the FoldTwoEntryPHINode() has to do a lot of legwork to re-deduce what is the dominant block (i.e. for which block is this branch the terminator).	2021-07-24 00:18:26 +03:00
Cyndy Ishida	cd241d2fc0	[llvm][NFC] Fix typos in Errc.h description	2021-07-23 11:54:49 -07:00
Mircea Trofin	4559a48614	[NFC][MLGO] Just use the underlying protobuf object for logging Avoid buffering just to copy the buffered data, in 'development mode', when logging. Instead, just populate the underlying protobuf. Differential Revision: https://reviews.llvm.org/D106592	2021-07-23 10:56:48 -07:00
Fangrui Song	c3bb156e90	Revert "[clang] -falign-loops=" This reverts commit 42896eeed9e3d12e7e38217a0d7e35b9736451ac. Unfinished. Accidentally pushed when reverting a clangd commit.	2021-07-23 09:58:35 -07:00
Fangrui Song	05f5a9a949	[clang] -falign-loops=	2021-07-23 09:50:43 -07:00
luxufan	85def5bf4e	[JITLink][RISCV] Initial Support RISCV64 in JITLink This patch adds initial support: it implements translation from object file to JIT link graph, and supports only a few relocations. Currently, the test file ELF_pc_indirect.s passes, and the HelloWorld program (compiled with mno-relax flag) can be linked correctly and run correctly on an instruction emulator. In the downstream implementation, I have implemented the GOT, PLT function, and EHFrame, and some optimization will be implemented soon. I will organize the code into patches, then gradually send it upstream. Differential Revision: https://reviews.llvm.org/D105429	2021-07-23 23:47:30 +08:00
Kazu Hirata	73fafa5526	[ARM] Remove getHWDivName (NFC) This function seems to be unused for at least 5 years.	2021-07-23 07:44:23 -07:00
David Truby	130948388d	[llvm][sve] Lowering for VLS truncating stores This adds custom lowering for truncating stores when operating on fixed length vectors in SVE. It also includes a DAG combine to fold extends followed by truncating stores into non-truncating stores in order to prevent this pattern appearing once truncating stores are supported. Currently truncating stores are not used in certain cases where the size of the vector is larger than the target vector width. Differential Revision: https://reviews.llvm.org/D104471	2021-07-23 14:04:55 +01:00
Dylan Fleming	8a94b4239a	[SVE][IR] Fix Binary op matching in PatternMatch::m_VScale Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D105978	2021-07-23 11:39:13 +01:00
Fraser Cormack	2db8b2b6fc	[NFC] Fix early line-break in doxygen comment	2021-07-23 07:16:05 +01:00
Giorgis Georgakoudis	d1dd1d3743	[OpenMP] Use AAHeapToStack/AAHeapToShared analysis in SPMDization SPMDization D102307 detects incompatible OpenMP runtime calls to abort converting a target region to SPMD mode. Calls to memory allocation/de-allocation routines kmpc_alloc_shared, kmpc_free_shared are incompatible unless they are removed by AAHeapToStack/AAHeapToShared analysis. This patch extends SPMDization detection to include AAHeapToStack/AAHeapToShared analysis results for enlarging the scope of possible SPMDized regions detected. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D105634	2021-07-22 18:08:37 -07:00
Vitaly Buka	381f03cdf9	[NFC][asan] Always pass Dominator Trees into forAllReachableExits	2021-07-22 18:01:38 -07:00
Gulfem Savrun Yeniceri	4e540995b1	[profile] Add binary id into profiles This patch adds binary id into profiles to easily associate binaries with the corresponding profiles. There is an RFC that discusses the motivation, design and implementation in more detail: https://lists.llvm.org/pipermail/llvm-dev/2021-June/151154.html Differential Revision: https://reviews.llvm.org/D102039	2021-07-23 00:19:12 +00:00
Florian Mayer	b276efa2ab	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-22 16:20:27 -07:00
Alexander Yermolovich	a827bbdfea	[DWP] Refactoring llvm-dwp in to a library part 2 This is follow up to https://reviews.llvm.org/D106198 where llvm-dwp was refactored in to multiple files. In this patch moving them in to lib/include directories. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D106493	2021-07-22 14:23:29 -07:00
Nick Fitzgerald	f7deab5277	Reland: "[WebAssembly] Deduplicate imports of the same module name, field name, and type" When two symbols import the same thing, only one import should be emitted in the Wasm file. Fixes https://bugs.llvm.org/show_bug.cgi?id=50938 Reverted in: 16aac493e59519377071e900d119ba2e7e5b525d. Reviewed By: sbc100 Differential Revision: https://reviews.llvm.org/D105519	2021-07-22 14:16:05 -07:00
Paulo Matos	e8be0ee828	[WebAssembly] Implementation of global.get/set for reftypes in LLVM IR Reland of 31859f896. This change implements new DAG nodes GLOBAL_GET/GLOBAL_SET, and lowering methods for loads and stores of reference types from IR globals. Once the lowering creates the new nodes, tablegen pattern matches those and converts them to Wasm global.get/set. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104797	2021-07-22 22:07:24 +02:00
Jon Chesterfield	0493801fb2	[nfc] Fix typo in comment, s/node/note	2021-07-22 20:16:53 +01:00
Victor Huang	017f21fed1	[PowerPC] Add PowerPC "__stbcx" builtin and intrinsic for XL compatibility This patch is in a series of patches to provide builtins for compatibility with the XL compiler. This patch adds the builtin and intrinsic for "__stbcx". Reviewed By: nemanjai, #powerpc Differential revision: https://reviews.llvm.org/D106484	2021-07-22 10:48:46 -05:00
Alexey Bataev	fd1d10a20f	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 08:44:37 -07:00
Alexey Bataev	6351ecd4dc	Revert "[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments." This reverts commit b455f7f22564a096c043b02fa159ab16669c121c to fix buildbots.	2021-07-22 08:06:29 -07:00
Alexey Bataev	0261373c6d	[OPENMP]Fix PR49787: Codegen for calling __tgt_target_teams_nowait_mapper has too few arguments. Added missed arguments in __tgt_target_teams_nowait_mapper/__tgt_target_nowait_mapper runtime functions calls. Differential Revision: https://reviews.llvm.org/D106542	2021-07-22 07:53:37 -07:00
Kazu Hirata	19374d4da0	[Transforms] Remove getOrCreateInitFunction (NFC) The last use was removed on Jan 16, 2019 in commit 81101de5853b4ed64640220a086a67b16f36f153.	2021-07-22 06:30:39 -07:00
Paulo Matos	ceddd7eb41	Add support for zero-sized Scalars as a LowLevelType Opaque values (of zero size) can be stored in memory with the implementation of reference types in the WebAssembly backend. Since MachineMemOperand uses LLTs we need to be able to support zero-sized scalar types in LLTs. Differential Revision: https://reviews.llvm.org/D105423	2021-07-22 13:47:19 +02:00
Florian Mayer	152a339cb1	Revert "[hwasan] Use stack safety analysis." This reverts commit bde9415fef25e9ff6e10595a2f4f5004dd62f10a.	2021-07-22 12:16:16 +01:00
Florian Mayer	fa5973a54d	[hwasan] Use stack safety analysis. This avoids unnecessary instrumentation. Reviewed By: eugenis, vitalybuka Differential Revision: https://reviews.llvm.org/D105703	2021-07-22 12:04:54 +01:00
Simon Tatham	bc23ee33a0	[clang] Use i64 for the !srcloc metadata on asm IR nodes. This is part of a patch series working towards the ability to make SourceLocation into a 64-bit type to handle larger translation units. !srcloc is generated in clang codegen, and pulled back out by llvm functions like AsmPrinter::emitInlineAsm that need to report errors in the inline asm. From there it goes to LLVMContext::emitError, is stored in DiagnosticInfoInlineAsm, and ends up back in clang, at BackendConsumer::InlineAsmDiagHandler(), which reconstitutes a true clang::SourceLocation from the integer cookie. Throughout this code path, it's now 64-bit rather than 32, which means that if SourceLocation is expanded to a 64-bit type, this error report won't lose half of the data. The compiler will tolerate both of i32 and i64 !srcloc metadata in input IR without faulting. Test added in llvm/MC. (The semantic accuracy of the metadata is another matter, but I don't know of any situation where that matters: if you're reading an IR file written by a previous run of clang, you don't have the SourceManager that can relate those source locations back to the original source files.) Original version of the patch by Mikhail Maltsev. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D105491	2021-07-22 10:24:52 +01:00
Johannes Doerfert	54c73c71f7	[Attributor][FIX] Do not introduce multiple instances of SSA values If we have a recursive function we could create multiple instantiations of an SSA value, one per recursive invocation of the function. This is a problem as we use SSA value equality in various places. The basic idea follows from this test: ``` static int r(int c, int *a) { int X; return c ? r(false, &X) : a == &X; } int test(int c) { return r(c, undef); } ``` If we look through the argument `a` we will end up with `X`. Using SSA value equality we will fold `a == &X` to true and return true even though it should have been false because `a` and `&X` are from different instantiations of the function. Various tests for this have been placed in value-simplify-instances.ll and this commit fixes them all by avoiding to produce simplified values that could be non-unique at runtime. Thus, the result of a simplify value call will always be unique at runtime or the original value, both do not allow to accidentally compare two instances of a value with each other and conclude they are equal statically (pointer equivalence) while they are unequal at runtime.	2021-07-22 00:07:55 -05:00
Johannes Doerfert	b93c50b86a	[Attributor] Improve the Attributor::getAssumedConstant interface Similar to Attributor::getAssumedSimplified we need to allow IRPs directly to get the right simplification callback (and context).	2021-07-22 00:07:55 -05:00
Johannes Doerfert	59bc220605	[Attributor][NFC] Clang format	2021-07-21 22:51:05 -05:00
Joseph Huber	9d542314e4	[Libomptarget] Introduce new main thread ID runtime function This patch introduces `__kmpc_is_generic_main_thread_id` which splits the old comparison into its own runtime function. The purpose of this is so we can fold this part independently, so when both this and `is_spmd_mode` are folded the final function will be folded as well. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106437	2021-07-21 21:18:14 -04:00
Joseph Huber	472a223072	[OpenMP] Change `__kmpc_free_shared` to include the paired allocation size This patch changes `__kmpc_free_shared` to take an additional argument corresponding to the associated allocation's size. This makes it easier to implement the allocator in the runtime. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106496	2021-07-21 20:56:21 -04:00
Lang Hames	549c960a94	Re-re-revert "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." This reverts commit 6b2a96285b9bbe92d2c5e21830f21458f8be976d. The ccache builders are still failing. Looks like they need to be updated to get the llvm-zorg config change in 490633945677656ba75d42ff1ca9d4a400b7b243. I'll re-apply this as soon as the builders are updated.	2021-07-22 10:45:24 +10:00
Lang Hames	6f51759135	Re-re-apply "[ORC][ORC-RT] Add initial native-TLV support to MachOPlatform." This reapplies commit a7733e9556b5a6334c910f88bcd037e84e17e3fc ("Re-apply [ORC][ORC-RT] Add initial native-TLV support to MachOPlatform."), and d4abdefc998a1ee19d5edc79ec233774cbf64f6a ("[ORC-RT] Rename macho_tlv.x86-64.s to macho_tlv.x86-64.S (uppercase suffix)"). These patches were reverted in 48aa82cacbff10e1c5395a03f86488bf449ba4da while I investigated bot failures (e.g. https://lab.llvm.org/buildbot/#/builders/109/builds/18981). The fix was to disable building of the ORC runtime on builders using ccache (which is the same fix used for other compiler-rt projects containing assembly code). This fix was committed to llvm-zorg in 490633945677656ba75d42ff1ca9d4a400b7b243.	2021-07-22 09:46:52 +10:00
Thomas Lively	8403ff42b3	[WebAssembly] Replace @llvm.wasm.popcnt with @llvm.ctpop.v16i8 Use the standard target-independent intrinsic to take advantage of standard optimizations. Differential Revision: https://reviews.llvm.org/D106506	2021-07-21 16:45:54 -07:00
Stanislav Mekhanoshin	2ef5dd8386	Prevent dead uses in register coalescer after rematerialization The coalescer does not check if register uses are available at the point of rematerialization. If it attempts to rematerialize an instruction with such uses it can end up with use without a def. LiveRangeEdit does such check during rematerialization, so just call LiveRangeEdit::allUsesAvailableAt() to avoid the problem. Differential Revision: https://reviews.llvm.org/D106396	2021-07-21 15:19:55 -07:00
Gulfem Savrun Yeniceri	8179b3101d	Revert "[profile] Add binary id into profiles" Revert "[profile] Change linkage type of a compiler-rt func" This reverts commits f984ac2715f71c38a7872fa2c2ad535b3d4fa285 and 467c7191249b76abff33853b1692a77f327c2422 because it broke some builds.	2021-07-21 19:15:18 +00:00
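
Note on commit 2d2e4a1b17 ([Analysis] strict reductions): the message distinguishes tree-wise reductions (repeatedly halving the vector and adding the halves) from ordered reductions (a series of scalar operations in lane order). The following is a minimal Python sketch of why the two are not interchangeable for floating point unless reassociation is allowed. It is not LLVM code; the function names `ordered_reduce` and `tree_reduce` and the sample lane values are assumptions made up for illustration, and the tree version assumes a power-of-two lane count.

```python
def ordered_reduce(lanes, start=0.0):
    """Strict (in-order) reduction: one scalar add per lane, in lane order."""
    acc = start
    for value in lanes:
        acc = acc + value
    return acc


def tree_reduce(lanes):
    """Tree-wise reduction: split into halves and add them until one value is left.

    Assumes len(lanes) is a power of two (an illustrative simplification).
    """
    vals = list(lanes)
    while len(vals) > 1:
        half = len(vals) // 2
        # Add the low half to the high half, lane by lane.
        vals = [vals[i] + vals[i + half] for i in range(half)]
    return vals[0]


# Hypothetical lane values chosen so that rounding depends on the order of adds.
lanes = [1e17, 1.0, -1e17, 1.0]

ordered = ordered_reduce(lanes)  # ((((0 + 1e17) + 1) - 1e17) + 1)
tree = tree_reduce(lanes)        # (1e17 - 1e17) + (1 + 1)

print("ordered:", ordered)
print("tree:   ", tree)
```

With these inputs the ordered version loses the first `1.0` when it is added to `1e17`, while the tree version pairs the large values so they cancel first, so the two results differ. The ordered version also needs one dependent add per lane, whereas the tree version needs only log2(n) halving steps, which is the cost difference the commit message describes modelling.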

45603 Commits