llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Benjamin Kramer	f3d58580a1	[X86] Unbreak the build after 22fa6b20d92e	2020-09-07 12:24:30 +02:00
Simon Pilgrim	1db7a4fa9e	[X86] getFauxShuffleMask - handle insert_subvector(zero, sub, C) Directly use SM_SentinelZero elements if we're (widening)inserting into a zero vector.	2020-09-07 11:10:40 +01:00
Simon Pilgrim	437ebe246f	[X86][AVX] Add extra vperm2f128+vpermilvar combine coverage The existing test /should/ reduce to a vmovaps (concat xmm with zero upper).	2020-09-07 10:58:53 +01:00
Simon Pilgrim	4fee922db4	[X86] Use Register instead of unsigned. NFCI. Fixes llvm-prefer-register-over-unsigned clang-tidy warnings.	2020-09-07 10:49:29 +01:00
Esme-Yi	67a9609ddc	[NFC][PowerPC] Add tests for `mul` with big constants.	2020-09-07 09:45:47 +00:00
Simon Pilgrim	bd029710f2	[X86] Use Register instead of unsigned. NFCI. Fixes llvm-prefer-register-over-unsigned clang-tidy warnings.	2020-09-07 10:38:09 +01:00
Simon Pilgrim	c1e4cc6249	[X86] Use Register instead of unsigned. NFCI. Fixes llvm-prefer-register-over-unsigned clang-tidy warning.	2020-09-07 10:38:08 +01:00
Sam Parker	7c4a7cb063	[SimplifyCFG] Consider cost of combining predicates. Modify FoldBranchToCommonDest to consider the cost of inserting instructions when attempting to combine predicates to fold blocks. The threshold can be controlled via a new option: -simplifycfg-branch-fold-threshold which defaults to '2' to allow the insertion of a not and another logical operator. Differential Revision: https://reviews.llvm.org/D86526	2020-09-07 10:04:50 +01:00
Jay Foad	a04922a28f	[GlobalISel] Extend not_cmp_fold to work on conditional expressions Differential Revision: https://reviews.llvm.org/D86709	2020-09-07 09:31:08 +01:00
Sam Parker	c37b434c46	[ARM][CostModel] CodeSize costs for i1 arith ops When optimising for size, make the cost of i1 logical operations relatively expensive so that optimisations don't try to combine predicates. Differential Revision: https://reviews.llvm.org/D86525	2020-09-07 09:27:18 +01:00
Xing GUO	5f500e98a4	[DWARFYAML] Make the debug_addr section optional. This patch makes the debug_addr section optional. When an empty debug_addr section is specified, yaml2obj only emits a section header for it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D87205	2020-09-07 16:17:18 +08:00
Raphael Isemann	daa93e6992	Add BinaryFormat/ELFRelocs/CSKY.def to LLVM modulemap	2020-09-07 10:14:22 +02:00
Jay Foad	3381408aad	[KnownBits] Implement accurate unsigned and signed max and min Use the new implementation in ValueTracking, SelectionDAG and GlobalISel. Differential Revision: https://reviews.llvm.org/D87034	2020-09-07 09:09:01 +01:00
Raul Tambre	834639de3e	[CMake][TableGen] Remove dead CMake version checks LLVM requires CMake 3.13.4, so remove version checks that are dead code. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D87190	2020-09-07 10:59:07 +03:00
Raul Tambre	4a39e08b2a	[CMake][TableGen] Simplify code by using list(TRANSFORM) LLVM requires CMake 3.13.4 so now we can simplify the code. Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D87193	2020-09-07 10:53:20 +03:00
dongAxis	f58ab2a818	When dumping results of StackLifetime, it will print the following log: BB [7, 8): begin {}, end {}, livein {}, liveout {} BB [1, 2): begin {}, end {}, livein {}, liveout {} ... But it is not convenient to know what the basic block is. So I add the basic block name to it. Reviewed By: vitalybuka TestPlan: check-llvm Differential Revision: https://reviews.llvm.org/D87152	2020-09-07 11:43:16 +08:00
Zi Xuan Wu	a51dedaf25	[ELF] Add a new e_machine value EM_CSKY and add some CSKY relocation types This is the split part of D86269, which add a new ELF machine flag called EM_CSKY and related relocations. Some target-specific flags and tests for csky can be added in follow-up patches later. Differential Revision: https://reviews.llvm.org/D86610	2020-09-07 10:42:28 +08:00
Chen Zheng	a3ac48f849	[machinesink] add testcase for more sinking - NFC	2020-09-06 21:14:14 -04:00
Thomas Lively	08861e0346	[WebAssembly] Fix incorrect assumption of simple value types Fixes PR47375, in which an assertion was triggering because WebAssemblyTargetLowering::isVectorLoadExtDesirable was improperly assuming the use of simple value types. Differential Revision: https://reviews.llvm.org/D87110	2020-09-06 15:42:21 -07:00
Amy Kwan	2477050bd8	[PowerPC] Implement Vector Expand Mask builtins in LLVM/Clang This patch implements the vec_expandm function prototypes in altivec.h in order to utilize the vector expand with mask instructions introduced in Power10. Differential Revision: https://reviews.llvm.org/D82727	2020-09-06 17:13:21 -05:00
Nikita Popov	99a542ef5d	[ValueTracking] Avoid known bits fallback for non-zero get check (NFCI) The known bits fall back will never be able to infer a non-null value here, so don't bother.	2020-09-06 23:16:38 +02:00
Florian Hahn	18bc223e14	[DSE,MemorySSA] Add a few additional debug messages.	2020-09-06 20:31:00 +01:00
Benjamin Kramer	c3337b0542	[SmallVector] Move error handling out of line This reduces duplication and avoids emitting ice cold code into every instance of grow().	2020-09-06 18:06:44 +02:00
Simon Pilgrim	ea26a20118	[X86][AVX] lowerShuffleWithPERMV - adjust binary shuffle masks to account for widening on non-VLX targets rGabd33bf5eff2 enabled us to pad 128/256-bit shuffles to 512-bit on non-VLX targets, but wasn't updating binary shuffles to account for the new vector width.	2020-09-06 14:52:25 +01:00
David Green	61bcb0cad0	[ARM] Regenerate tests. NFC	2020-09-06 12:51:43 +01:00
Nikita Popov	587a99fa86	[InstSimplify] Fold degenerate abs of abs form This addresses the remaining issue from D87188. Due to a series of folds, we may end up with abs-of-abs represented as x == 0 ? -abs(x) : abs(x). Rather than recognizing this as a special abs pattern and doing an abs-of-abs fold on it afterwards, I'm directly folding this to one of the select operands in InstSimplify. The general pattern falls into the "select with operand replaced" category, but that fold is not powerful enough to recognize that both hands of the select are the same for value zero. Differential Revision: https://reviews.llvm.org/D87197	2020-09-06 09:43:08 +02:00
Amara Emerson	eb6cc475b7	[GlobalISel] Disable the indexed loads combine completely unless forced. NFC. The post-index matcher, before it queries the target legality, walks uses of some instructions which in pathological cases can be massive. Since no targets actually support indexed loads yet, disable this to stop wasting compile time on something which is going to fail anyway.	2020-09-05 21:04:03 -07:00
vnalamot	62e3b4669e	[AMDGPU] Remove the dead spill slots while spilling FP/BP to memory During the PEI pass, the dead TargetStackID::SGPRSpill spill slots are not being removed while spilling the FP/BP to memory. Fixes: SWDEV-250393 Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D87032	2020-09-06 07:04:25 +05:30
Krzysztof Parzyszek	cb324edefe	[Hexagon] Add assertions about V6_pred_scalar2	2020-09-05 18:20:23 -05:00
Krzysztof Parzyszek	34a99d5a26	[Hexagon] When widening truncate result, also widen operand if necessary	2020-09-05 18:19:32 -05:00
Krzysztof Parzyszek	c8b94c00f4	[Hexagon] Resize the mem operand when widening loads and stores	2020-09-05 18:17:48 -05:00
Krzysztof Parzyszek	00ccf90843	[Hexagon] Handle widening of vector truncate	2020-09-05 15:07:38 -05:00
Nikita Popov	4af70f1516	[InstSimplify] Add tests for a peculiar abs of abs form (NFC) This pattern shows up when canonicalizing to spf abs form to intrinsic abs form.	2020-09-05 21:42:22 +02:00
Florian Hahn	6276196b86	[LangRef] Adjust guarantee for llvm.memcpy to also allow equal arguments. This adjusts the description of `llvm.memcpy` to also allow operands to be equal. This is in line with what Clang currently expects. This change is intended to be temporary and followed by re-introduce a variant with the non-overlapping guarantee for cases where we can actually ensure that property in the front-end. See the links below for more details: http://lists.llvm.org/pipermail/cfe-dev/2020-August/066614.html and PR11763. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D86815	2020-09-05 19:18:23 +01:00
Nikita Popov	39d0c7029b	[InstCombine] Add tests for known negative abs intrinsic (NFC) And duplicate tests for known non-negative from InstSimplify.	2020-09-05 17:31:04 +02:00
Nikita Popov	50631fa23a	[SCEV] Recognize min/max intrinsics Recognize umin/umax/smin/smax intrinsics and convert them to the already existing SCEV nodes of the same name. In the future we'll want SCEVExpander to also produce the intrinsics, but we're not ready for that yet. Differential Revision: https://reviews.llvm.org/D87160	2020-09-05 16:30:11 +02:00
Nikita Popov	8436150b08	[InstCombine] Fold abs with dominating condition Similar to D87168, but for abs. If we have a dominating x >= 0 condition, then we know that abs(x) is x. This fold is in InstCombine, because we need to create a sub instruction for the x < 0 case. Differential Revision: https://reviews.llvm.org/D87184	2020-09-05 16:18:35 +02:00
Nikita Popov	1eb2ecc5c6	[InstSimplify] Fold min/max based on dominating condition If we have a dominating condition that x >= y, then umax(x, y) is x, etc. I'm doing this in InstSimplify as the corresponding transform for the select form is also done there. Differential Revision: https://reviews.llvm.org/D87168	2020-09-05 16:16:40 +02:00
Nikita Popov	ecd979f4bb	[InstCombine] Fold abs intrinsic eq zero Following the same transform for the select version of abs.	2020-09-05 15:11:38 +02:00
Nikita Popov	0df23d0dc6	[InstCombine] Add tests for abs intrinsic eq zero (NFC)	2020-09-05 15:11:38 +02:00
Nikita Popov	fa01287458	[InstCombine] Fold mul of abs intrinsic Same as the existing SPF_ABS fold. We don't need to explicitly handle NABS, as the negs will get folded away first.	2020-09-05 12:37:45 +02:00
Nikita Popov	11983c3cbd	[InstCombine] Add tests for mul of abs intrinsic (NFC)	2020-09-05 12:36:27 +02:00
Nikita Popov	5750a482e4	[InstCombine] Fold cttz of abs intrinsic Same as the existing fold for SPF_ABS. We don't need to explicitly handle the NABS variant, as we'll first fold away the neg in that case.	2020-09-05 12:25:41 +02:00
Nikita Popov	78708a857e	[InstCombine] Add tests for cttz of abs intrinsic (NFC)	2020-09-05 12:22:42 +02:00
Nikita Popov	12cbb20dca	[InstCombine] Test abs with dominating condition (NFC)	2020-09-05 11:10:01 +02:00
Jonas Paulsson	7897525197	[SelectionDAG] Always intersect SDNode flags during getNode() node memoization. Previously SDNodeFlags::instersectWith(Flags) would do nothing if Flags was in an undefined state, which is very bad given that this is the default when getNode() is called without passing an explicit SDNodeFlags argument. This meant that if an already existing and reused node had a flag which the second caller to getNode() did not set, that flag would remain uncleared. This was exposed by https://bugs.llvm.org/show_bug.cgi?id=47092, where an NSW flag was incorrectly set on an add instruction (which did in fact overflow in one of the two original contexts), so when SystemZElimCompare removed the compare with 0 trusting that flag, wrong-code resulted. There is more that needs to be done in this area as discussed here: Differential Revision: https://reviews.llvm.org/D86871 Review: Ulrich Weigand, Sanjay Patel	2020-09-05 10:30:38 +02:00
Nikita Popov	ed666b599d	[SCCP] Add tests for intrinsic ranges (NFC)	2020-09-05 10:28:13 +02:00
serge-sans-paille	32b636840a	Fix return status of SimplifyCFG When a switch case is folded into default's case, that's an IR change that should be reported, update ConstantFoldTerminator accordingly. Differential Revision: https://reviews.llvm.org/D87142	2020-09-05 07:54:15 +02:00
Qiu Chaofan	6da3508c40	[PowerPC] Expand constrained ppc_fp128 to i32 conversion Libcall __gcc_qtou is not available, which breaks some tests needing it. On PowerPC, we have code to manually expand the operation, this patch applies it to constrained conversion. To keep it strict-safe, it's using the algorithm similar to expandFP_TO_UINT. For constrained operations marking FP exception behavior as 'ignore', we should set the NoFPExcept flag. However, in some custom lowering the flag is missed. This should be fixed by future patches. Reviewed By: uweigand Differential Revision: https://reviews.llvm.org/D86605	2020-09-05 13:16:20 +08:00
Krzysztof Parzyszek	af4c0a6b56	[Hexagon] Unindent everything in HexagonISelLowering.h, NFC Just a shift, no other formatting changes.	2020-09-04 17:25:29 -05:00

1 2 3 4 5 ...

203036 Commits