llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Craig Topper	8ad0bb8601	[RISCV] Remove RISCVMergeBaseOffsetOpt from the -O0 pass pipeline. Internally the pass skips any function with the optnone attribute. But that still requires checking each function. If the opt level is set to None we might as well just skip putting in the pipeline at all. This what is already done for many of the passes added by TargetPassConfig. Differential Revision: https://reviews.llvm.org/D92511	2020-12-03 09:58:25 -08:00
modimo	6b81c5b99d	[MemCpyOpt] Correctly merge alias scopes during call slot optimization When MemCpyOpt performs call slot optimization it will concatenate the `alias.scope` metadata between the function call and the memcpy. However, scoped AA relies on the domains in metadata to be maintained in a caller-callee relationship. Naive concatenation breaks this assumption leading to bad AA results. The fix is to take the intersection of domains then union the scopes within those domains. The original bug came from a case of rust bad codegen which uses this bad aliasing to perform additional memcpy optimizations. As show in the added test case `%src` got forwarded past its lifetime leading to a dereference of garbage data. Testing ninja check-llvm Reviewed By: jeroen.dobbelaere Differential Revision: https://reviews.llvm.org/D91576	2020-12-03 09:23:37 -08:00
Fangrui Song	f6492fea1c	Switch to std::is_trivially_move_constructible and std::is_trivially_copy_constructible Differential Revision: https://reviews.llvm.org/D92543	2020-12-03 09:15:40 -08:00
Kazu Hirata	b51993db92	[X86] Remove DecodeVPERMVMask and DecodeVPERMV3Mask This patch removes the variants of DecodeVPERMVMask and DecodeVPERMV3Mask that take "const Constant C" as they are not used anymore. They were introduced on Sep 8, 2015 in commit e88038f23517ffc741acfd307ff92e2b1af136d8. The last use of DecodeVPERMVMask(const Constant C, ...) was removed on Feb 7, 2016 in commit 73fc26b44a8591b15f13eaffef17e67161c69388. The last use of DecodeVPERMV3Mask(const Constant *C, ...) was removed on May 28, 2018 in commit dcfcfdb0d166fff8388bdd2edc5a2948054c9da1. Differential Revision: https://reviews.llvm.org/D91926	2020-12-03 09:12:02 -08:00
Jameson Nash	1ad7228623	repair cygwin build This is needed for cross-compiling LLVM from Cygwin, but it had gotten deleted in rG2724d9e12960cc1d93eeabbfc9aa1bffffa041cc Reviewed By: compnerd Differential Revision: https://reviews.llvm.org/D92336	2020-12-03 11:49:16 -05:00
Anna Thomas	b963f5b680	[ScalarizeMaskedMemIntrin] NFC: Convert member functions to static This will make it easier to add new PM support once the pass is moved into transforms (D92407).	2020-12-03 11:46:38 -05:00
Valentin Clement	907f409972	[LLVMFrontend][openacc] Add basic unit tests for functions in LLVMFrontendOpenACC Add unit tests for functions in LLVMFrontendOpenACC. As notice in D91470 these functions were not tested as well as the ones for OpenMP (D91643). This patch add tests for the OpenACC part. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D91653	2020-12-03 11:27:18 -05:00
Ahmed Bougacha	fe6a3c2668	[Triple][MachO] Define "arm64e", an AArch64 subarch for Pointer Auth. This also teaches MachO writers/readers about the MachO cpu subtype, beyond the minimal subtype reader support present at the moment. This also defines a preprocessor macro to allow users to distinguish __arm64__ from __arm64e__. arm64e defaults to an "apple-a12" CPU, which supports v8.3a, allowing pointer-authentication codegen. It also currently defaults to ios14 and macos11. Differential Revision: https://reviews.llvm.org/D87095	2020-12-03 07:53:59 -08:00
Baptiste Saleil	e737a423ce	[PowerPC] Fix for excessive ACC copies due to PHI nodes When using accumulators in loops, they are passed around in PHI nodes of unprimed accumulators, causing the generation of additional prime/unprime instructions. This patch detects these cases and changes these PHI nodes to primed accumulator PHI nodes. We also add IR and MIR test cases for several PHI node cases. Differential Revision: https://reviews.llvm.org/D91391	2020-12-03 09:51:23 -06:00
Yonghong Song	2b2723c653	[BPF] support atomic instructions Implement fetch_<op>/fetch_and_<op>/exchange/compare-and-exchange instructions for BPF. Specially, the following gcc intrinsics are implemented. __sync_fetch_and_add (32, 64) __sync_fetch_and_sub (32, 64) __sync_fetch_and_and (32, 64) __sync_fetch_and_or (32, 64) __sync_fetch_and_xor (32, 64) __sync_lock_test_and_set (32, 64) __sync_val_compare_and_swap (32, 64) For __sync_fetch_and_sub, internally, it is implemented as a negation followed by __sync_fetch_and_add. For __sync_lock_test_and_set, despite its name, it actually does an atomic exchange and return the old content. https://gcc.gnu.org/onlinedocs/gcc-4.1.1/gcc/Atomic-Builtins.html For intrinsics like __sync_{add,sub}_and_fetch and __sync_bool_compare_and_swap, the compiler is able to generate codes using __sync_fetch_and_{add,sub} and __sync_val_compare_and_swap. Similar to xadd, atomic xadd, xor and xxor (atomic_<op>) instructions are added for atomic operations which do not have return values. LLVM will check the return value for __sync_fetch_and_{add,and,or,xor}. If the return value is used, instructions atomic_fetch_<op> will be used. Otherwise, atomic_<op> instructions will be used. All new instructions only support 64bit and 32bit with alu32 mode. old xadd instruction still supports 32bit without alu32 mode. For encoding, please take a look at test atomics_2.ll. Differential Revision: https://reviews.llvm.org/D72184	2020-12-03 07:38:00 -08:00
dfukalov	b944ac9e0a	[NFC] Reduce include files dependency. 1. Removed #include "...AliasAnalysis.h" in other headers and modules. 2. Cleaned up includes in AliasAnalysis.h. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D92489	2020-12-03 18:25:05 +03:00
Paul C. Anagnostopoulos	15379f2764	[TableGen] Eliminate the 'code' type Update the documentation. Rework various backends that relied on the code type. Differential Revision: https://reviews.llvm.org/D92269	2020-12-03 10:19:11 -05:00
Kazushi (Jam) Marukawa	6ffbb84f85	[VE] Add vsll, vsrl, vsla, vsra, and vsfa intrinsic instructions Add vsll, vsrl, vsla, vsra, and vsfa intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92550	2020-12-03 23:19:58 +09:00
Jay Foad	a902c6802a	[TableGen] Remove unused class RecordValResolver. NFC. Differential Revision: https://reviews.llvm.org/D92477	2020-12-03 13:36:58 +00:00
Joe Ellis	3e31000cc2	[DAGCombine] Fix TypeSize warning in DAGCombine::visitLIFETIME_END Bail out early if we encounter a scalable store. Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D92392	2020-12-03 12:12:41 +00:00
Evgeniy Brevnov	6a03c6a3e7	[NFC][Tests] Added one additional test case for NaryRessociation pass. New tes cases added. Change var names to avoid the following warning from update_test_checks.py: WARNING: Change IR value name 'tmp5' to prevent possible conflict with scripted FileCheck name. Reviewed By: ebrevnov Differential Revision: https://reviews.llvm.org/D92566	2020-12-03 19:11:08 +07:00
Evgeniy Brevnov	65d2b5aa5a	[NFC][Tests] Auto generate checks for llvm/test/Transforms/NaryReassociate/pr24301.ll using update_test_checks.py Generate checks with update_test_checks.py in order to simplify upcoming updates. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D92561	2020-12-03 18:22:14 +07:00
Georgii Rymar	4ad44d85e5	[llvm-readelf/obj] - Report unique warnings in getSymbolForReloc() helper. Use `reportUniqueWarning` instead of `reportWarning` and refine the interface of the helper. Differential revision: https://reviews.llvm.org/D92556	2020-12-03 14:13:26 +03:00
Tim Northover	a8373fee73	arm64: count Triple::aarch64_32 as an aarch64 target and enable leaf frame pointers	2020-12-03 11:09:44 +00:00
Georgii Rymar	924cd26a7e	[llvm-readelf] - Report unique warnings when dumping hash symbols/histogram. This converts 2 more places to use `reportUniqueWarning` and adds tests. Differential revision: https://reviews.llvm.org/D92551	2020-12-03 14:05:04 +03:00
Max Kazantsev	78b51b6b79	Revert "[IndVars] ICmpInst should not prevent IV widening" This reverts commit 0c9c6ddf17bb01ae350a899b3395bb078aa0c62e. We are seeing some failures with this patch locally. Not clear if it's causing them or just triggering a problem in another place. Reverting while investigating.	2020-12-03 18:01:41 +07:00
Kazushi (Jam) Marukawa	4516e4161b	[VE] Add veqv and vseq intrinsic instructions Add veqv and vseq intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D92527	2020-12-03 17:39:24 +09:00
Yonghong Song	e7918ccc90	BPF: add a test for selectiondag alias analysis w.r.t. lifetime This adds a test for the bug https://bugs.llvm.org/show_bug.cgi?id=47591 Previously, selection dag has a bug which may incorrectly assume no alias when crossing a lifetime boundary and this may generate incorrect code as demonstrated in the above bug. It looks the bug is fixed by https://reviews.llvm.org/D91833. Basically, when comparing two potential memory access dag nodes, a store and a lifetime.start, with the same frame index. Previously, it may be decided no alias. With the above fix, these two will be considered aliasing which will prevent incorrect code scheduling. Differential Revision: https://reviews.llvm.org/D92451	2020-12-02 22:27:17 -08:00
modimo	5b1e62daa4	[NFC] Fix typo	2020-12-02 22:23:57 -08:00
Fangrui Song	649f05aa24	Switch from llvm::is_trivially_copyable to std::is_trivially_copyable GCC<5 did not support std::is_trivially_copyable. Now LLVM builds require 5.1 we can migrate to std::is_trivially_copyable. The Optional.h change made MSVC choke (https://buildkite.com/llvm-project/premerge-checks/builds/18587#cd1bb616-ffdc-4581-9795-b42c284196de) so I leave it out for now. Differential Revision: https://reviews.llvm.org/D92514	2020-12-02 22:02:48 -08:00
Jianzhou Zhao	cb92e3d61f	[dfsan] Rename ShadowTy/ZeroShadow with prefix Primitive This is a child diff of D92261. After supporting field/index-level shadow, the existing shadow with type i16 works for only primitive types. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D92459	2020-12-03 05:31:01 +00:00
Craig Topper	71bed70099	[RISCV] Add additional half precision fnmadd/fnmsub tests with an fneg on the second operand instead of the first. This matches the float/double tests added in defe11866a326491ee9767f84bb3f70cfc4f4bcb	2020-12-02 21:13:42 -08:00
Craig Topper	a584f7033c	[RISCV] Add f16 to isFMAFasterThanFMulAndFAdd now that the Zfh extension is supported	2020-12-02 20:31:43 -08:00
QingShan Zhang	563c9f0908	[PowerPC] Add the hw sqrt test for vector type v4f32/v2f64 PowerPC ISA support the input test for vector type v4f32 and v2f64. Replace the software compare with hw test will improve the perf. Reviewed By: ChenZheng Differential Revision: https://reviews.llvm.org/D90914	2020-12-03 03:19:18 +00:00
Kazu Hirata	4dd4adf3d5	[SelectionDAG] Use is_contained (NFC)	2020-12-02 19:09:45 -08:00
Craig Topper	5b5b0951c9	[RISCV] Initialize MergeBaseOffsetOptPass so it will work with print-before/after-all. If its not in the PassRegistry it's not recognized as a pass when we print before/after. Happened to notice while I was working on a new pass.	2020-12-02 18:04:22 -08:00
Kazu Hirata	d349da566f	[MemorySSA] Remove unused declaration findDominatingDef (NFC) The function definition was removed on Feb 22, 2017 in commit 17e8d0eae24ffa41cf7641d984c05e00d59b93a4. The declaration has remained since.	2020-12-02 17:40:20 -08:00
Sergey Dmitriev	7d3d7f8d53	[llvm-link] use file magic when deciding if input should be loaded as archive llvm-link should not rely on the '.a' file extension when deciding if input file should be loaded as archive. Archives may have other extensions (f.e. .lib) or no extensions at all. This patch changes llvm-link to use llvm::file_magic to check if input file is an archive. Reviewed By: RaviNarayanaswamy Differential Revision: https://reviews.llvm.org/D92376	2020-12-02 17:21:34 -08:00
Hsiangkai Wang	39e5617bc0	[RISCV] Support Zfh half-precision floating-point extension. Support "Zfh" extension according to https://github.com/riscv/riscv-isa-manual/blob/zfh/src/zfh.tex Differential Revision: https://reviews.llvm.org/D90738	2020-12-03 09:16:33 +08:00
Sergey Dmitriev	0e229fe66b	Revert "[llvm-link] use file magic when deciding if input should be loaded as archive" This reverts commit 55f8c2fdfbc5eda1be946e97ecffa2dea44a883e.	2020-12-02 16:53:57 -08:00
Xun Li	5243362b90	Small improvements to Intrinsic::getName While I was adding a new intrinsic instruction (not overloaded), I accidentally used CreateUnaryIntrinsic to create the intrinsics, which turns out to be passing the type list to getName, and ended up naming the intrinsics function with type suffix, which leads to wierd bugs latter on. It took me a long time to debug. It seems a good idea to add an assertion in getName so that it fails if types are passed but it's not a overloaded function. Also, the overloade version of getName is less efficient because it creates an std::string. We should avoid calling it if we know that there are no types provided. Differential Revision: https://reviews.llvm.org/D92523	2020-12-02 16:49:12 -08:00
Sergey Dmitriev	f1419e8887	[llvm-link] use file magic when deciding if input should be loaded as archive llvm-link should not rely on the '.a' file extension when deciding if input file should be loaded as archive. Archives may have other extensions (f.e. .lib) or no extensions at all. This patch changes llvm-link to use llvm::file_magic to check if input file is an archive. Reviewed By: RaviNarayanaswamy Differential Revision: https://reviews.llvm.org/D92376	2020-12-02 16:29:41 -08:00
Duncan P. N. Exon Smith	9f6a612ccc	ADT: Rely on std::aligned_union_t for math in AlignedCharArrayUnion, NFC Instead of computing the alignment and size of the `char` buffer in `AlignedCharArrayUnion`, rely on the math in `std::aligned_union_t`. Because some users of this rely on the `buffer` field existing with a type convertible to `char *`, we can't change the field type, but we can still avoid duplicating the logic. A potential follow up would be to delete `AlignedCharArrayUnion` after updating its users to use `std::aligned_union_t` directly; or if we like our template parameters better, could update users to stop peeking inside and then replace the definition with: ``` template <class T, class... Ts> using AlignedCharArrayUnion = std::aligned_union_t<1, T, Ts...>; ``` Differential Revision: https://reviews.llvm.org/D92500	2020-12-02 15:56:12 -08:00
Mircea Trofin	611de20466	[NFC][MC] TargetRegisterInfo::getSubReg is a MCRegister. Typing the API appropriately. Differential Revision: https://reviews.llvm.org/D92341	2020-12-02 15:46:38 -08:00
Duncan P. N. Exon Smith	402de92d98	ADT: Remove redundant `alignas` from IntervalMap, NFC `AlignedArrayCharUnion` is now using `alignas`, which is properly supported now by all the host toolchains we support. As a result, the extra `alignas` on `IntervalMap` isn't needed anymore. This is effectively a revert of 379daa29744cd96b0a87ed0d4a010fa4bc47ce73. Differential Revision: https://reviews.llvm.org/D92509	2020-12-02 14:33:20 -08:00
Reid Kleckner	7c87aeebfe	Revert "Use std::is_trivially_copyable", breaks MSVC build Revert "Delete llvm::is_trivially_copyable and CMake variable HAVE_STD_IS_TRIVIALLY_COPYABLE" This reverts commit 4d4bd40b578d77b8c5bc349ded405fb58c333c78. This reverts commit 557b00e0afb2dc1776f50948094ca8cc62d97be4.	2020-12-02 14:30:46 -08:00
Florian Hahn	fad4c5768d	[ConstraintElimination] Make sure arguments of std:pow match. This should fix a build failure on some systems, e.g. solaris11-sparcv9 http://lab.llvm.org:8014/#/builders/22	2020-12-02 22:23:26 +00:00
Harald van Dijk	cca089bd44	[X86] Add TLS_(base_)addrX32 for X32 mode LLVM has TLS_(base_)addr32 for 32-bit TLS addresses in 32-bit mode, and TLS_(base_)addr64 for 64-bit TLS addresses in 64-bit mode. x32 mode wants 32-bit TLS addresses in 64-bit mode, which were not yet handled. This adds TLS_(base_)addrX32 as copies of TLS_(base_)addr64, except that they use tls32(base)addr rather than tls64(base)addr, and then restricts TLS_(base_)addr64 to 64-bit LP64 mode, TLS_(base_)addrX32 to 64-bit ILP32 mode. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D92346	2020-12-02 22:20:36 +00:00
H.J. Lu	ced4c140a0	Use PC-relative address for x32 TLS address Since x32 supports PC-relative address, it shouldn't use EBX for TLS address. Instead of checking N.getValueType(), we should check Subtarget->is32Bit(). This fixes PR 22676. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D16474	2020-12-02 22:20:36 +00:00
LLVM GN Syncbot	eec32dd187	[gn build] Port 24d4291ca70	2020-12-02 21:52:41 +00:00
Hongtao Yu	4cefe8e200	[CSSPGO] Pseudo probes for function calls. An indirect call site needs to be probed for its potential call targets. With CSSPGO a direct call also needs a probe so that a calling context can be represented by a stack of callsite probes. Unlike pseudo probes for basic blocks that are in form of standalone intrinsic call instructions, pseudo probes for callsites have to be attached to the call instruction, thus a separate instruction would not work. One possible way of attaching a probe to a call instruction is to use a special metadata that carries information about the probe. The special metadata will have to make its way through the optimization pipeline down to object emission. This requires additional efforts to maintain the metadata in various places. Given that the `!dbg` metadata is a first-class metadata and has all essential support in place , leveraging the `!dbg` metadata as a channel to encode pseudo probe information is probably the easiest solution. With the requirement of not inflating `!dbg` metadata that is allocated for almost every instruction, we found that the 32-bit DWARF discriminator field which mainly serves AutoFDO can be reused for pseudo probes. DWARF discriminators distinguish identical source locations between instructions and with pseudo probes such support is not required. In this change we are using the discriminator field to encode the ID and type of a callsite probe and the encoded value will be unpacked and consumed right before object emission. When a callsite is inlined, the callsite discriminator field will go with the inlined instructions. The `!dbg` metadata of an inlined instruction is in form of a scope stack. The top of the stack is the instruction's original `!dbg` metadata and the bottom of the stack is for the original callsite of the top-level inliner. Except for the top of the stack, all other elements of the stack actually refer to the nested inlined callsites whose discriminator field (which actually represents a calliste probe) can be used together to represent the inline context of an inlined PseudoProbeInst or CallInst. To avoid collision with the baseline AutoFDO in various places that handles dwarf discriminators where a check against the `-pseudo-probe-for-profiling` switch is not available, a special encoding scheme is used to tell apart a pseudo probe discriminator from a regular discriminator. For the regular discriminator, if all lowest 3 bits are non-zero, it means the discriminator is basically empty and all higher 29 bits can be reversed for pseudo probe use. Callsite pseudo probes are inserted in `SampleProfileProbePass` and a target-independent MIR pass `PseudoProbeInserter` is added to unpack the probe ID/type from `!dbg`. Note that with this work the switch -debug-info-for-profiling will not work with -pseudo-probe-for-profiling anymore. They cannot be used at the same time. Reviewed By: wmi Differential Revision: https://reviews.llvm.org/D91756	2020-12-02 13:45:20 -08:00
Jianzhou Zhao	1191fc6062	[dfsan] Rename CachedCombinedShadow to be CachedShadow At D92261, this type will be used to cache both combined shadow and converted shadow values. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D92458	2020-12-02 21:39:16 +00:00
Jianzhou Zhao	38f4d125f9	[dfsan] Test loading global ptrs This covers a branch in the loadShadow method. Reviewed-by: morehouse Differential Revision: https://reviews.llvm.org/D92460	2020-12-02 21:35:41 +00:00
Jianzhou Zhao	7ce5dd535d	[dfsan] Add a test case for phi	2020-12-02 21:29:44 +00:00
Fangrui Song	355a5fca7d	[ThinLTO][test] Fix X86/nossp.ll after D91816	2020-12-02 13:13:58 -08:00

... 2 3 4 5 6 ...

207833 Commits