The problem is the following. With fast8, we broke an important
invariant when loading shadows. A wide shadow of 64 bits used to
correspond to 4 application bytes with fast16; so, generating a single
load was okay since those 4 application bytes would share a single
origin. Now, using fast8, a wide shadow of 64 bits corresponds to 8
application bytes that should be backed by 2 origins (but we kept
generating just one).
Let’s say our wide shadow is 64-bit and consists of the following:
0xABCDEFGH. To check whether we need the second origin value, we could do
the following (in the 64-bit wide shadow case):
- bitwise shift the wide shadow left by 32 bits (yielding 0xEFGH0000)
- push the result along with the first origin load to the shadow/origin vectors
- load the second 32-bit origin of the 64-bit wide shadow
- push the wide shadow along with the second origin to the shadow/origin vectors.
combineOrigins would then select the second origin if the wide
shadow is of the form 0xABCD0000. The tests illustrate how this
change affects the generated bitcode.
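A minimal sketch of the combining logic in plain C++ (illustrative names,
not the actual DFSan instrumentation), assuming combineOrigins keeps the
origin of the first pair whose shadow is non-zero:

  #include <cstdint>

  // WideShadow is the 64-bit wide shadow (0xABCDEFGH above, one shadow byte
  // per application byte); FirstOrigin/SecondOrigin are the two 32-bit
  // origins loaded for it.
  uint32_t combineWideShadowOrigins(uint64_t WideShadow, uint32_t FirstOrigin,
                                    uint32_t SecondOrigin) {
    // Pair 1: the shadow shifted left by 32 bits (0xEFGH0000), first origin.
    // Pair 2: the full wide shadow, second origin.
    if ((WideShadow << 32) != 0)
      return FirstOrigin;
    if (WideShadow != 0) // e.g. 0xABCD0000: only the second origin applies
      return SecondOrigin;
    return 0; // fully clean shadow, no origin needed
  }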
Reviewed By: stephan.yichao.zhao
Differential Revision: https://reviews.llvm.org/D101584
This extends the early-ifcvt pass to avoid a few more cases where the resulting
select instructions would have matching operands. Additionally, we now use TII
to determine "sameness" of the operands so that as TII gets smarter, so too
will ifcvt.
The attached test case was bugpoint-reduced down from CINT2000/252.eon in the
test-suite. See: https://clang.godbolt.org/z/WvnrcrGEn
Differential Revision: https://reviews.llvm.org/D101508
Related to PR50172.
Protects us against regressions once we start doing the cttz(zext(x)) -> zext(cttz(x)) transformation in the middle-end.
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D101662
This reverts commit 3d27b5d28aabf8516aa1fefc78a6878b89a992f0.
Broke one of the PPC tests, which I didn't see because I usually build with
only the x86/AArch64 targets enabled... oops.
https://lab.llvm.org/buildbot#builders/109/builds/13834
llvm/test/CodeGen/PowerPC/expand-foldable-isel.ll
This is a long overdue cleanup. Not every use is eliminated; I stuck to uses
that were being called directly from select(), not the render functions.
Differential Revision: https://reviews.llvm.org/D101590
This extends the early-ifcvt pass to avoid a few more cases where the resulting
select instructions would have matching operands. Additionally, we now use TII
to determine "sameness" of the operands so that as TII gets smarter, so too
will ifcvt.
The attached test case was bugpoint-reduced down from CINT2000/252.eon in the
test-suite. See: https://clang.godbolt.org/z/WvnrcrGEn
Differential Revision: https://reviews.llvm.org/D101508
The right symbol flag mask is ~0x7, not ~0xf.
Also emit string names for the other flags (we were missing some).
Reviewed By: #lld-macho, gkm
Differential Revision: https://reviews.llvm.org/D101548
The relative lookup table converter pass caused an issue when full LTO
is enabled (reported in https://reviews.llvm.org/D94355).
This patch disables that pass in the full LTO pre-link optimization
pipeline until the issue is fixed.
Differential Revision: https://reviews.llvm.org/D101664
SIPreEmitPeephole did not try to remove redundant s_set_gpr_idx_*
instructions in blocks that end with a conditional branch instruction.
This seems like a simple oversight.
Differential Revision: https://reviews.llvm.org/D101629
The current code can scan an unlimited number of instructions,
if the containing basic block is very large. The test case from
PR50155 contains a basic block with approximately 100k instructions.
To avoid this, limit the number of instructions we inspect. At
the same time, drop the limit on the number of basic blocks, as
this will be implicitly limited by the number of instructions as
well.
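A minimal sketch of the idea in plain C++ (hypothetical names, not the
actual pass code): a single instruction budget shared across all visited
blocks replaces the old cap on the number of blocks:

  #include <vector>

  struct Instruction { /* ... */ };
  using Block = std::vector<Instruction>;

  // Walk instructions across blocks, but give up once the shared budget of
  // inspected instructions is exhausted, no matter how few blocks were seen.
  bool scanWithBudget(const std::vector<const Block *> &Blocks,
                      unsigned InstructionBudget,
                      bool (*Visit)(const Instruction &)) {
    for (const Block *B : Blocks)
      for (const Instruction &I : *B) {
        if (InstructionBudget-- == 0)
          return false; // bail out on pathologically large blocks
        if (Visit(I))
          return true;
      }
    return false;
  }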
This introduces a flag that aborts if we ever reduce to IR that fails
the verifier.
Reviewed By: swamulism, arichardson
Differential Revision: https://reviews.llvm.org/D101279
Fixes the following warnings observed when building the experimental
m68k backend (-DLLVM_EXPERIMENTAL_TARGETS_TO_BUILD="M68k"):
../lib/Target/M68k/M68kMachineFunction.h:71:3: warning: explicitly
defaulted default constructor is implicitly deleted
[-Wdefaulted-function-deleted]
M68kMachineFunctionInfo() = default;
^
../lib/Target/M68k/M68kMachineFunction.h:24:20: note: default
constructor of 'M68kMachineFunctionInfo' is implicitly deleted because
field 'MF' of reference type 'llvm::MachineFunction &' would not be
initialized
MachineFunction &MF;
^
In file included from ../lib/Target/M68k/M68kISelLowering.cpp:18:
In file included from ../lib/Target/M68k/M68kSubtarget.h:17:
../lib/Target/M68k/M68kFrameLowering.h:60:8: warning:
'llvm::M68kFrameLowering::emitCalleeSavedFrameMoves' hides overloaded
virtual functions [-Woverloaded-virtual]
void emitCalleeSavedFrameMoves(MachineBasicBlock &MBB,
^
../include/llvm/CodeGen/TargetFrameLowering.h:215:3: note: hidden
overloaded virtual function
'llvm::TargetFrameLowering::emitCalleeSavedFrameMoves' declared here:
different number of parameters (2 vs 3)
emitCalleeSavedFrameMoves(MachineBasicBlock &MBB,
^
../include/llvm/CodeGen/TargetFrameLowering.h:218:16: note: hidden
overloaded virtual function
'llvm::TargetFrameLowering::emitCalleeSavedFrameMoves' declared here:
different number of parameters (4 vs 3)
virtual void emitCalleeSavedFrameMoves(MachineBasicBlock &MBB,
^
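For illustration, a hedged sketch of common ways to address these two
warnings (standalone stand-in types; not necessarily the exact change made
by this patch):

  class MachineFunction;      // stand-ins for the real LLVM types
  class MachineBasicBlock;

  struct MachineFunctionInfoSketch {
    MachineFunction &MF;
    // A reference member deletes the defaulted default constructor, so
    // construct from the MachineFunction instead of using "= default".
    explicit MachineFunctionInfoSketch(MachineFunction &MF) : MF(MF) {}
  };

  struct TargetFrameLoweringSketch {
    virtual ~TargetFrameLoweringSketch() = default;
    virtual void emitCalleeSavedFrameMoves(MachineBasicBlock &MBB) {}
  };

  struct M68kFrameLoweringSketch : TargetFrameLoweringSketch {
    // Re-expose the hidden base-class overloads so the extra overload below
    // no longer triggers -Woverloaded-virtual.
    using TargetFrameLoweringSketch::emitCalleeSavedFrameMoves;
    void emitCalleeSavedFrameMoves(MachineBasicBlock &MBB, bool ForceFP) {}
  };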
pr/50071
Reviewed By: myhsu
Differential Revision: https://reviews.llvm.org/D101588
Stop using the compatibility spellings of `OF_{None,Text,Append}`
left behind by 1f67a3cba9b09636c56e2109d8a35ae96dc15782. A follow-up
will remove them.
Differential Revision: https://reviews.llvm.org/D101650
These operations don't exist natively, so just let the
target-independent code expand to plain shifts.
The generated sequences could probably be optimized a bit more, but
they seem good enough for now.
Differential Revision: https://reviews.llvm.org/D101574
Move some types in STLExtras.h which are named and behave identically to
STL types from future standards into a dedicated header. This keeps them
organized (they are not "extras" in the same sense as most types in
STLExtras.h are) and fixes circular dependencies in future patches.
Reviewed By: dblaikie
Differential Revision: https://reviews.llvm.org/D100668
If the extracts from the non-power-of-2 vectors are recognized as shuffles,
we need some extra checks to avoid crashing the cost calculations when trying
to get the cost of subvector extracts. In this case we need to check carefully
that we do not step out of bounds of the original vector; otherwise the
TTI cost model will crash on an assert.
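A tiny sketch of the guard being described (hypothetical helper, not the
vectorizer's actual code):

  // A subvector extract of NumSubElts elements starting at Index is only a
  // valid cost-model query when it lies entirely within the NumSrcElts
  // elements of the source vector; otherwise fall back to per-element costs.
  bool subvectorExtractInBounds(unsigned Index, unsigned NumSubElts,
                                unsigned NumSrcElts) {
    return Index + NumSubElts <= NumSrcElts;
  }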
Differential Revision: https://reviews.llvm.org/D101477
This patch merges STR<S,D,Q,W,X>pre-STR<S,D,Q,W,X>ui and
LDR<S,D,Q,W,X>pre-LDR<S,D,Q,W,X>ui instruction pairs into a single
STP<S,D,Q,W,X>pre and LDP<S,D,Q,W,X>pre instruction, respectively.
For each pair, there is a MIR test that verifies this optimization.
Differential Revision: https://reviews.llvm.org/D99272
The functionality in SVEIntrinsicOpts::isReinterpretToSVBool was moved in
D101302; however, the original, now-unused function was not removed (NFC).
Differential Revision: https://reviews.llvm.org/D101642
atomicrmw instructions are expanded by AtomicExpandPass before register allocation
into cmpxchg loops. Register allocation can insert spills between the exclusive loads
and stores, which invalidates the exclusive monitor and can lead to infinite loops.
To avoid this, reimplement atomicrmw operations as pseudo-instructions and expand them
after register allocation.
Floating point legalisation:
f16 ATOMIC_LOAD_FADD(*f16, f16) is legalised to
f32 ATOMIC_LOAD_FADD(*i16, f32) and then eventually
f32 ATOMIC_LOAD_FADD_16(*i16, f32)
Differential Revision: https://reviews.llvm.org/D101164
This patch fixes two bugs that arise when a 'defm' inherits from a multiclass
and also from a class with assertions.
Differential Revision: https://reviews.llvm.org/D101626
This patch introduces a new infrastructure that is used to select the load and
store instructions in the PPC backend.
The primary motivation is that the current implementation of selecting load/stores
depends on the ordering of patterns in TableGen. Given this limitation, we
are not able to easily and reliably generate the P10 prefixed load and store
instructions (such as when the immediates fit within 34 bits). This
refactoring is meant to provide us with more control over the patterns/different
forms to exploit, as well as eliminating the dependency on pattern
declaration ordering in TableGen.
The idea of this refactoring is that it introduces a set of addressing modes that
correspond to different instruction formats of a particular load and store
instruction, along with a set of common flags that describes a load/store.
Whenever a load/store instruction is being selected, we analyze the instruction
and compute a set of flags for it. The computed flags are then used to
select the most optimal load/store addressing mode.
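A hedged sketch of that flow in plain C++ (the flag and mode names are
illustrative, not the actual backend identifiers):

  #include <cstdint>

  enum class AddrMode { DForm, DSForm, XForm, PrefixedDForm };

  enum MemFlags : uint32_t {
    MF_ImmFitsSigned16  = 1u << 0, // classic 16-bit displacement
    MF_ImmFitsSigned34  = 1u << 1, // P10 prefixed 34-bit displacement
    MF_NeedsDSAlignment = 1u << 2, // displacement must be a multiple of 4
    MF_HasIndexReg      = 1u << 3, // base register + index register
  };

  // Map the flags computed for a load/store to an addressing mode, preferring
  // the most compact encoding that the flags allow.
  AddrMode selectAddrMode(uint32_t Flags) {
    if (Flags & MF_ImmFitsSigned16)
      return (Flags & MF_NeedsDSAlignment) ? AddrMode::DSForm : AddrMode::DForm;
    if (Flags & MF_ImmFitsSigned34)
      return AddrMode::PrefixedDForm;
    return AddrMode::XForm; // register + register form as the fallback
  }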
This patch is the first of a series of patches to be committed - it contains the
initial implementation of the refactored load/store selection infrastructure and
also updates P8/P9 patterns to adopt this infrastructure. The idea is that
incremental patches will add more implementation and support, and eventually
the old implementation will be removed.
Differential Revision: https://reviews.llvm.org/D93370
Summary:
This patch implements the backend support for adding global variables
directly to the table of contents (TOC), rather than adding the address of the
variable to the TOC.
Currently, this patch will look for the "toc-data" attribute on symbols in the
IR, and then add those symbols to the TOC.
At the moment, this is implemented for 32-bit AIX.
Reviewers: sfertile
Differential Revision: https://reviews.llvm.org/D101178
This patch implements expansion of llvm.vp.* intrinsics
(https://llvm.org/docs/LangRef.html#vector-predication-intrinsics).
VP expansion is required for targets that do not implement VP code
generation. Since expansion is controllable with TTI, targets can switch
on the VP intrinsics they do support in their backend, offering a smooth
transition strategy for VP code generation (VE, RISC-V V, ARM SVE,
AVX512, ...).
Reviewed By: rogfer01
Differential Revision: https://reviews.llvm.org/D78203
The only effect of the optimization is to remove s_set_gpr_idx_*
instructions, and update_mir_test_checks.py always inserts CHECK: rather
than CHECK-NEXT: checks, so without this implicit negative check, the
tests would always pass even if the optimization did nothing.
Differential Revision: https://reviews.llvm.org/D101622
Early exit from method DispatchStage::isAvailable() if the dispatch group is
already full. Not all instructions declare at least one uOP.
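A hedged sketch of the problem (illustrative code, not the actual llvm-mca
implementation): without the early exit, an instruction declaring zero
micro-ops would look dispatchable even when the group is full:

  bool isAvailableSketch(unsigned UsedSlots, unsigned GroupSize,
                         unsigned NumMicroOps) {
    if (UsedSlots >= GroupSize)
      return false; // early exit: the dispatch group is already full
    // With NumMicroOps == 0 this check is vacuously true, which is why the
    // early exit above is needed.
    return GroupSize - UsedSlots >= NumMicroOps;
  }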
Fixes PR50174.
The previous attempt to fix infinite recursion in min/max reassociation was not fully successful (D100170). The newly discovered failing case is due to the single-use case not being handled properly; it should be processed separately from the two-use case.
Reviewed By: mkazantsev
Differential Revision: https://reviews.llvm.org/D101359
Hoisting and sinking instructions out of conditional blocks enables
additional vectorization by:
1. Executing memory accesses unconditionally.
2. Reducing the number of instructions that need predication.
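As a hedged illustration of both points (example source, not taken from the
benchmark): once the two stores, which differ only in the stored value, are
sunk into a single unconditional store of a selected value, the loop body is
straight-line code the vectorizer can handle without predication:

  void clamp(int *out, const int *in, int n, int limit) {
    for (int i = 0; i < n; ++i) {
      if (in[i] > limit)
        out[i] = limit; // both stores target the same address...
      else
        out[i] = in[i]; // ...so sinking turns them into one select + store
    }
  }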
After disabling early hoisting / sinking, we miss out on a few
vectorization opportunities. One of those is causing a ~10% performance
regression in one of the Geekbench benchmarks on AArch64.
This patch tries to recover the regression by running hoisting/sinking
as part of a SimplifyCFG run after LoopRotate and before LoopVectorize.
Note that in the legacy pass-manager, we run LoopRotate just before
vectorization again and there's no SimplifyCFG run in between, so the
sinking/hoisting may impact the later run of LoopRotate. But the impact
should be limited and the benefit of hoisting/sinking at this stage
should outweigh the risk of not rotating.
Compile-time impact looks slightly positive for most cases.
http://llvm-compile-time-tracker.com/compare.php?from=2ea7fb7b1c045a7d60fcccf3df3ebb26aa3699e5&to=e58b4a763c691da651f25996aad619cb3d946faf&stat=instructions
NewPM-O3: geomean -0.19%
NewPM-ReleaseThinLTO: geomean -0.54%
NewPM-ReleaseLTO-g: geomean -0.03%
With a few benchmarks seeing a notable increase, but also some
improvements.
Alternative to D101290.
Reviewed By: lebedev.ri
Differential Revision: https://reviews.llvm.org/D101468