llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 18:54:02 +01:00

Author	SHA1	Message	Date
Fangrui Song	a174f79c66	[docs] Update llvm-readelf supported options after D105532	2021-07-16 10:40:30 -07:00
Philip Reames	1c5ed99606	[test] Extend negative stride backedge tests to cover signed comparisons	2021-07-16 10:29:22 -07:00
Guozhi Wei	7d6ba24baf	[X86FixupLEAs] Try again to transform the sequence LEA/SUB to SUB/SUB This patch transforms the sequence lea (reg1, reg2), reg3 sub reg3, reg4 to two sub instructions sub reg1, reg4 sub reg2, reg4 Similar optimization can also be applied to LEA/ADD sequence. The modifications to TwoAddressInstructionPass is to ensure the operands of ADD instruction has expected order (the dest register of LEA should be src register of ADD). Differential Revision: https://reviews.llvm.org/D104684	2021-07-16 10:16:03 -07:00
Philip Reames	055a12795f	[SCEV] Add tests for known negative strides in trip count logic	2021-07-16 10:08:31 -07:00
Jon Roelofs	04c73eae43	Revert "[MachineVerifier] Diagnose invalid INSERT_SUBREGs" This reverts commit dd57ba1a17b93dbe211d04cb2d4de5f6dc898d60. It broke some tests: http://45.33.8.238/linux/51314/step_12.txt	2021-07-16 09:53:55 -07:00
Simon Pilgrim	4e9ef1e02b	[X86] Regenerate twoaddr-lea.ll test checks.	2021-07-16 17:43:36 +01:00
Simon Pilgrim	e00330583b	[DAG] SelectionDAG::MaskedElementsAreZero - assert we're calling with a vector. NFCI. Add an assertion that we've calling MaskedElementsAreZero with a vector op and that the DemandedElts arg is a matching width. Makes the error a lot easier to grok when something else accidentally gets used.	2021-07-16 17:43:35 +01:00
Jon Roelofs	ac7796917a	[MachineVerifier] Diagnose invalid INSERT_SUBREGs Differential revision: https://reviews.llvm.org/D105953	2021-07-16 09:43:12 -07:00
Craig Topper	84de2a93a3	[RISCV] Teach constant materialization that it can use zext.w at the end with Zba to reduce number of instructions. If the upper 32 bits are zero and bit 31 is set, we might be able to use zext.w to fill in the zeros after using an lui and/or addi. Most of this patch is plumbing the subtarget features into the constant materialization. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D105509	2021-07-16 09:35:56 -07:00
Craig Topper	ad9e84d420	[RISCV] Add curly braces around a case body that declares variables. NFC This is at the end of the switch so doesn't cause any issues now, but if a new case is added it will break.	2021-07-16 09:35:56 -07:00
Nikita Popov	7f88bf6ac9	[Verifier] Require same signature for intrinsic calls As suggested on D105733, this adds a verifier rule that calls to intrinsics must match the signature of the intrinsic. Without opaque pointers this is automatically enforced for all calls, because the pointer types need to match. If the signatures don't match, a pointer bitcast has to be inserted. For intrinsics in particular, such bitcasts are not legal, because the address of intrinsics cannot be taken. With opaque pointers, there are no more pointer bitcasts, so it's generally possible for the call and the callee signature to differ. However, for intrinsics we still want to enforce that the signatures must match, the same as was done before through the address taken check. We can't enforce this more generally for non-intrinsics, because calls with mismatched signatures at the very least can legally occur in unreachable code, and might also be valid in some other cases, depending on how exactly the signatures differ. Differential Revision: https://reviews.llvm.org/D106013	2021-07-16 18:33:16 +02:00
madhur13490	b3e3f87671	[NFC] Fix typo intrinisic Differential Revision: https://reviews.llvm.org/D106161	2021-07-16 21:45:11 +05:30
Congzhe Cao	8f6ef387e2	[LoopInterchange] Check lcssa phis in the inner latch in scenarios of multi-level nested loops We already know that we need to check whether lcssa phis are supported in inner loop exit block or in outer loop exit block, and we have logic to check them already. Presumably the inner loop latch does not have lcssa phis and there is no code that deals with lcssa phis in the inner loop latch. However, that assumption is not true, when we have loops with more than two-level nesting. This patch adds checks for lcssa phis in the inner latch. Reviewed By: Whitney Differential Revision: https://reviews.llvm.org/D102300	2021-07-16 11:59:20 -04:00
Matt Arsenault	7e3f114325	Mips/GlobalISel: Use LLT form of getMachineMemOperand NFC here since it's just using a scalar anyway.	2021-07-16 11:41:32 -04:00
Matt Arsenault	614185bb3c	GlobalISel: Preserve memory type for memset expansion	2021-07-16 11:41:32 -04:00
Matt Arsenault	184c6f78cf	AArch64/GlobalISel: Update tests to use correct memory types	2021-07-16 11:41:32 -04:00
Masoud Ataei	33e29ff74a	[PowerPC] Updated the error message of MASSV pass to mention vectorization is needed be enable on P8 and later targets. Differential Revision: https://reviews.llvm.org/D106091	2021-07-16 14:45:09 +00:00
Amy Kwan	9f4ca39c11	[PowerPC] Update Refactored Load/Store Implementation, XForm VSX Patterns, and Tests This patch includes the following updates to the load/store refactoring effort introduced in D93370: - Update various VSX patterns that use to "force" an XForm, to instead just XForm. This allows the ability for the patterns to compute the most optimal addressing mode (and to produce a DForm instruction when possible) - Update pattern and test case for the LXVD2X/STXVD2X intrinsics - Update LIT test cases that use to use the XForm instruction to use the DForm instruction Differential Revision: https://reviews.llvm.org/D95115	2021-07-16 09:28:48 -05:00
Fraser Cormack	b8f7435c4c	Revert "[RISCV] Lower more BUILD_VECTOR sequences to RVV's VID" This reverts commit a6ca88e908b5befcd9b0f8c8cb40f53095cc17bc. More caution is required to avoid overflow/underflow. Thanks to the santizers for catching this.	2021-07-16 15:00:20 +01:00
Matt Arsenault	5a8526607f	GlobalISel: Remove dead function	2021-07-16 08:59:25 -04:00
Matt Arsenault	f27b93051e	AMDGPU/GlobalISel: Preserve more memory types	2021-07-16 08:57:26 -04:00
Matt Arsenault	dc97583234	AMDGPU/GlobalISel: Redo kernel argument load handling This avoids relying on G_EXTRACT on unusual types, and also properly decomposes structs into multiple registers. This also preserves the LLTs in the memory operands.	2021-07-16 08:56:54 -04:00
Jeremy Morse	20a3fe6622	[InstrRef][FastISel] Support emitting DBG_INSTR_REF from fast-isel If you attach __attribute__((optnone)) to a function when using optimisations, that function will use fast-isel instead of the usual SelectionDAG method. This is a problem for instruction referencing, because it means DBG_VALUEs of virtual registers will be created, triggering some safety assertions in LiveDebugVariables. Those assertions exist to detect exactly this scenario, where an unexpected piece of code is generating virtual register references in instruction referencing mode. Fix this by transforming the DBG_VALUEs created by fast-isel into half-formed DBG_INSTR_REFs, after which they get patched up in finalizeDebugInstrRefs. The test modified adds a fast-isel mode to the instruction referencing isel test. Differential Revision: https://reviews.llvm.org/D105694	2021-07-16 13:56:15 +01:00
Sanjay Patel	ab8a0b5f26	[SLP] add tests for poison-safe bool logic reductions; NFC More coverage for D105730	2021-07-16 08:50:58 -04:00
Dmitry Preobrazhensky	aa2ec7ce25	[AMDGPU][MC] Added missing isCall/isBranch flags Added isCall for S_CALL_B64; added isBranch for S_SUBVECTOR_LOOP_*. Differential Revision: https://reviews.llvm.org/D106072	2021-07-16 14:59:10 +03:00
Serge Pavlov	15d6dcdf87	Use update_test_checks.py to auto-generate check lines	2021-07-16 18:20:08 +07:00
Nicholas Guy	340bf152dd	[AArch64] Update Cortex-A55 SchedModel to improve LDP scheduling Specifying the latencies of specific LDP variants appears to improve performance almost universally. Differential Revision: https://reviews.llvm.org/D105882	2021-07-16 12:00:57 +01:00
Kerry McLaughlin	a9d31ba311	[LV] Avoid scalable vectorization for loops containing alloca This patch returns an Invalid cost from getInstructionCost() for alloca instructions if the VF is scalable, as otherwise loops which contain these instructions will crash when attempting to scalarize the alloca. Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D105824	2021-07-16 11:47:13 +01:00
Cullen Rhodes	3f31269929	[AArch64][SME] Add load and store instructions This patch adds support for following contiguous load and store instructions: * LD1B, LD1H, LD1W, LD1D, LD1Q * ST1B, ST1H, ST1W, ST1D, ST1Q A new register class and operand is added for the 32-bit vector select register W12-W15. The differences in the following tests which have been re-generated are caused by the introduction of this register class: * llvm/test/CodeGen/AArch64/GlobalISel/irtranslator-inline-asm.ll * llvm/test/CodeGen/AArch64/GlobalISel/regbank-inlineasm.mir * llvm/test/CodeGen/AArch64/stp-opt-with-renaming-reserved-regs.mir * llvm/test/CodeGen/AArch64/stp-opt-with-renaming.mir D88663 attempts to resolve the issue with the store pair test differences in the AArch64 load/store optimizer. The GlobalISel differences are caused by changes in the enum values of register classes, tests have been updated with the new values. The reference can be found here: https://developer.arm.com/documentation/ddi0602/2021-06 Reviewed By: CarolineConcatto Differential Revision: https://reviews.llvm.org/D105572	2021-07-16 10:11:10 +00:00
Sander de Smalen	d49dc1ccda	Reland "[LV] Print remark when loop cannot be vectorized due to invalid costs." The original patch was: https://reviews.llvm.org/D105806 There were some issues with undeterministic behaviour of the sorting function, which led to scalable-call.ll passing and/or failing. This patch fixes the issue by numbering all instructions in the array first, and using that number as the order, which should provide a consistent ordering. This reverts commit a607f64118240f70bf1b14ec121b65f49d63800d.	2021-07-16 10:52:01 +01:00
Fraser Cormack	28d982df90	[RISCV] Lower more BUILD_VECTOR sequences to RVV's VID This patch teaches the compiler to identify a wider variety of `BUILD_VECTOR`s which form integer arithmetic sequences, and to lower them to `vid.v` with modifications for non-unit steps and non-zero addends. The sequences handled by this optimization must either be monotonically increasing or decreasing. Consecutive elements holding the same value indicate a fractional step which, while simple mathematically, becomes more complex to handle both in the realm of lossy integer division and in the presence of `undef`s. For example, a common "interleaving" shuffle index will be lowered by LLVM to both `<0,u,1,u,2,...>` and `<u,0,u,1,u,...>` `BUILD_VECTOR` nodes. Either of these would ideally be lowered to `vid.v` shifted right by 1. Detection of this sequence in presence of general `undef` values is more complicated, however: `<0,u,u,1,>` could match either `<0,0,0,1,>` or `<0,0,1,1,>` depending on later values in the sequence. Both are possible, so backtracking or multiple passes is inevitable. Sticking to monotonic sequences keeps the logic simpler as it can be done in one pass. Fractional steps will likely be a separate optimization in a future patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D104921	2021-07-16 10:35:13 +01:00
Timm Bäder	b97b48f662	[llvm][tools] Hide unrelated llvm-cfi-verify options Differential Revision: https://reviews.llvm.org/D106055	2021-07-16 10:43:52 +02:00
Simon Giesecke	d4960d7c98	Reformat files. Differential Revision: https://reviews.llvm.org/D105982	2021-07-16 07:39:21 +00:00
Mehdi Amini	7d809bb14e	Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer We can build it with -Werror=global-constructors now. This helps in situation where libSupport is embedded as a shared library, potential with dlopen/dlclose scenario, and when command-line parsing or other facilities may not be involved. Avoiding the implicit construction of these cl::opt can avoid double-registration issues and other kind of behavior. Reviewed By: lattner, jpienaar Differential Revision: https://reviews.llvm.org/D105959	2021-07-16 07:38:16 +00:00
Mehdi Amini	b708f244c7	Revert "Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer" This reverts commit af9321739b20becf170e6bb5060b8d780e1dc8dd. Still some specific config broken in some way that requires more investigation.	2021-07-16 07:35:13 +00:00
Timm Bäder	5adb9eb982	Revert "[llvm][tools] Hide unrelated llvm-cfi-verify options" This reverts commit 7c63726072005cc331bb21694c9022e6d18a3b93.	2021-07-16 09:30:57 +02:00
Timm Bäder	da6027db65	[llvm][tools] Hide unrelated llvm-cfi-verify options Differential Revision: https://reviews.llvm.org/D106055	2021-07-16 09:25:54 +02:00
Marcos Horro	44d94bc849	[llvm-mca][JSON] Store extra information about driver flags used for the simulation Added information stored in PipelineOptions and the MCSubtargetInfo. Bug: https://bugs.llvm.org/show_bug.cgi?id=51041 Reviewed By: andreadb Differential Revision: https://reviews.llvm.org/D106077	2021-07-16 09:18:40 +02:00
Mehdi Amini	64ec18abb6	Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer We can build it with -Werror=global-constructors now. This helps in situation where libSupport is embedded as a shared library, potential with dlopen/dlclose scenario, and when command-line parsing or other facilities may not be involved. Avoiding the implicit construction of these cl::opt can avoid double-registration issues and other kind of behavior. Reviewed By: lattner, jpienaar Differential Revision: https://reviews.llvm.org/D105959	2021-07-16 06:54:26 +00:00
Serge Pavlov	8c4ac92d65	Fix typo in test	2021-07-16 12:43:57 +07:00
Max Kazantsev	e4a04110a4	[LSR] Handle case 1reg => reg. PR50918 This patch addresses assertion failure in case when the only found formula for LSR is `1reg => reg` which was supposed to be an impossible situation, however there is a test that shows it is possible. In this case, we can use scale register with scale of 1 as the missing base register. Reviewed By: huihuiz, reames Differential Revision: https://reviews.llvm.org/D105009	2021-07-16 11:33:59 +07:00
Carl Ritson	539761ef24	[TableGen] Allow isAllocatable inheritence from any superclass When setting Allocatable on a generated register class check all superclasses and set Allocatable true if any superclass is allocatable. Without this change generated register classes based on an allocatable class may end up unallocatable due to the topological inheritance order. This change primarily effects AMDGPU backend; however, there are a few changes in MIPs GlobalISel register constraints as a result. Reviewed By: kparzysz Differential Revision: https://reviews.llvm.org/D105967	2021-07-16 13:02:24 +09:00
Shilei Tian	08c004d674	[Attributor] Add support for compound assignment for ChangeStatus A common use of `ChangeStatus` is as follows: ``` ChangeStatus Changed = ChangeStatus::UNCHANGED; Changed \|= foo(); ``` where `foo` returns `ChangeStatus` as well. Currently `ChangeStatus` doesn't support compound assignment, we have to write as ``` Changed = Changed \| foo(); ``` which is not that convenient. This patch add the support for compound assignment for `ChangeStatus`. Compound assignment is usually implemented as a member function, and binary arithmetic operator is therefore implemented using compound assignment. However, unlike regular C++ class, enum class doesn't support member functions. As a result, they can only be implemented in the way shown in the patch. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D106109	2021-07-15 23:51:46 -04:00
Mehdi Amini	0fd38b8415	Revert "Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer" This reverts commit 42f588f39c5ce6f521e3709b8871d1fdd076292f. Broke some buildbots	2021-07-16 03:46:53 +00:00
Mehdi Amini	a9a8a9a361	Use ManagedStatic and lazy initialization of cl::opt in libSupport to make it free of global initializer We can build it with -Werror=global-constructors now. This helps in situation where libSupport is embedded as a shared library, potential with dlopen/dlclose scenario, and when command-line parsing or other facilities may not be involved. Avoiding the implicit construction of these cl::opt can avoid double-registration issues and other kind of behavior. Reviewed By: lattner, jpienaar Differential Revision: https://reviews.llvm.org/D105959	2021-07-16 03:33:20 +00:00
LLVM GN Syncbot	79801693ba	[gn build] Port 766a08df12c1	2021-07-16 02:23:45 +00:00
Nico Weber	1cc643bb34	[gn build] port 766a08df12c1	2021-07-15 22:23:14 -04:00
Daniel Rodríguez Troitiño	13cc0cf34a	[test] Use double pound to denote comments. Use double pound at the start of the line to differentiate comments from statements for Lit or FileCheck. I will also use this small commit to check my commit access. Differential Revision: https://reviews.llvm.org/D106103	2021-07-15 17:39:34 -07:00
Matt Arsenault	ef17052770	GlobalISel: Surface offsets parameter from ComputeValueVTs	2021-07-15 19:11:40 -04:00
Matt Arsenault	79609410ad	AMDGPU/GlobalISel: Fix incorrect memory types in test	2021-07-15 19:11:40 -04:00

... 2 3 4 5 6 ...

218777 Commits