llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 12:43:36 +01:00

Author	SHA1	Message	Date
Craig Topper	38a1f47551	[RISCV] Add support for integer fixed vector setcc I believe I've covered all orderings of splat operands here. Better canonicalization in lowering might help reduce this. I did not handle the immediate adjustments needed for set(u)gt/set(u)lt. Testing here is limited to byte types because the scalable vector type used for masks for the store is calculated assuming 8 byte elements. But for the setcc its based on the element count of the container type for the setcc input. So they don't agree. We'll need to enhanced D96352 to handle this I think. Differential Revision: https://reviews.llvm.org/D96443	2021-02-12 09:29:41 -08:00
Craig Topper	72741c7878	[RISCV] Add support for matching .vx and .vi forms of binary instructions for fixed vectors. Unlike scalable vectors, I'm only using a ComplexPattern for the immediate itself. The vmv_v_x is matched explicitly. We igore the VL argument when matching a binary operator, but we do check it when matching splat directly. I left out tests for vXi64 as they fail on rv32 right now. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D96365	2021-02-12 09:18:10 -08:00
LLVM GN Syncbot	adf90c61ff	[gn build] Port ba3ea9c60f0f	2021-02-12 16:56:34 +00:00
Florian Hahn	f7e18b634e	[AArch64] Increase outlined sequence in test added in a3f6233fa4b4.	2021-02-12 16:20:51 +00:00
Florian Hahn	cdf4bcbe89	[AArch64] Add test case where machine outliner breaks up a bundle. This is a backend test for PR49082.	2021-02-12 16:16:03 +00:00
Petar Avramovic	08a3498f71	AMDGPU/GlobalISel: Calculate isKnownNeverNaN for fminnum and fmaxnum Implements same logis as in SelectionDAG. G_FMINNUM_IEEE and G_FMAXNUM_IEEE are never SNaN by definition and never NaN when one operand is known non-NaN and other known non-SNaN. G_FMINNUM and G_FMAXNUM are never NaN/SNaN when one of the operands is known non-NaN/SNaN. Differential Revision: https://reviews.llvm.org/D91716	2021-02-12 17:14:34 +01:00
Petar Avramovic	2314218c2f	AMDGPU/GlobalISel: Check values of constants in isKnownNeverNaN Differential Revision: https://reviews.llvm.org/D91714	2021-02-12 17:14:34 +01:00
Petar Avramovic	03b0215c29	AMDGPU/GlobalISel: Precommit globalisel tests for isKnownNeverNaN	2021-02-12 17:14:34 +01:00
Jay Foad	3fbdec87d1	[TableGen][GlobalISel] Allow duplicate RendererFns Allow different GICustomOperandRenderers to use the same RendererFn. This avoids the need for targets to define a bunch of identical C++ renderer functions with different names. Without this fix TableGen would have emitted code that tried to define the GICR enumeration with duplicate enumerators. Differential Revision: https://reviews.llvm.org/D96587	2021-02-12 15:05:32 +00:00
David Green	22b5fa9203	[ARM] Single source VMOVNT Our current lowering of VMOVNT goes via a shuffle vector of the form <0, N, 2, N+2, 4, N+4, ..>. That can of course also be a single input shuffle of the form <0, 0, 2, 2, 4, 4, ..>, where we use a VMOVNT to insert a vector into the top lanes of itself. This adds lowering of that case, re-using the existing isVMOVNMask. Differential Revision: https://reviews.llvm.org/D96065	2021-02-12 14:28:57 +00:00
Kerry McLaughlin	807c8f5596	[SVE][LoopVectorize] Support for vectorization of loops with function calls Changes `getScalarizationOverhead` to return an invalid cost for scalable VFs and adds some simple tests for loops containing a function for which there is a vectorized variant available. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D96356	2021-02-12 13:47:43 +00:00
Sanjay Patel	d1a8bb697a	[Vectorizers][TTI] remove option to bypass creation of vector reduction intrinsics The vector reduction intrinsics started life as experimental ops, so backend support was lacking. As part of promoting them to 1st-class intrinsics, however, codegen support was added/improved: D58015 D90247 So I think it is safe to now remove this complication from IR. Note that we still have an IR-level codegen expansion pass for these as discussed in D95690. Removing that is another step in simplifying the logic. Also note that x86 was already unconditionally forming reductions in IR, so there should be no difference for x86. I spot checked a couple of the tests here by running them through opt+llc and did not see any asm diffs. If we do find functional differences for other targets, it should be possible to (at least temporarily) restore the shuffle IR with the ExpandReductions IR pass. Differential Revision: https://reviews.llvm.org/D96552	2021-02-12 08:13:50 -05:00
Florian Hahn	b4a7561ae7	[VPlan] Make VPRecipeBase inherit from VPUser directly (NFC). The individual recipes have been updated to manage their operands using VPUser a while back. Now that the transition is done, we can instead make VPRecipeBase a VPUser and get rid of the toVPUser helper.	2021-02-12 13:06:58 +00:00
Abhina Sreeskantharajan	027bda6786	Mark output as text if it is really text This is a continuation of https://reviews.llvm.org/D67696. The following places need to set the OF_Text flag correctly. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D96363	2021-02-12 07:14:21 -05:00
luxufan	7124b02d55	[RISCV] Change parseVTypeI function Change parseVTypeI function to Make the added vset instruction test cases report more concrete error message. Differential Revision: https://reviews.llvm.org/D96218	2021-02-12 19:38:34 +08:00
David Sherwood	553c02be6d	[NFC][Analysis] Change struct VecDesc to use ElementCount This patch changes the VecDesc struct to use ElementCount instead of an unsigned VF value, in preparation for future work that adds support for vectorized versions of math functions using scalable vectors. Since all I'm doing in this patch is switching the type I believe it's a non-functional change. I changed getWidestVF to now return both the widest fixed-width and scalable VF values, but currently the widest scalable value will be zero. Differential Revision: https://reviews.llvm.org/D96011	2021-02-12 11:07:58 +00:00
Vitaly Buka	27081e2bf5	[NFC][ProfileData] Improve language	2021-02-12 02:55:58 -08:00
Max Kazantsev	277bd73c42	[Test] Add test with uadd intrinsic with missing opt opportunity	2021-02-12 17:46:32 +07:00
Simon Pilgrim	f2ccf7edab	[DAG] DAGTypeLegalizer::PromoteIntRes_ADDSUBSHLSAT - break if-else chain. NFCI. Style fixup - the if() block always returns so we can pull out the contents of the else() block.	2021-02-12 10:33:12 +00:00
Sjoerd Meijer	dbbe3c7ed5	[MIRLangRef] Document MachineOperand comments Late follow-up of D74306 to document MachineOperand comments in MIRLangRef. Differential Revision: https://reviews.llvm.org/D96518	2021-02-12 10:15:47 +00:00
Florian Hahn	b33d843a96	[AArch64] Add tests with sign cmps patterns that can be improved. Some of the sign patterns can be optimized to or & asr, which requires fewer instructions.	2021-02-12 10:09:10 +00:00
Maxim Kuvyrkov	64fcbd05ef	Fix exegesis build on aarch64-windows-msvc host Include x86 intrinsics only when compiling for x86_64 or i386. _MSC_VER no longer implies x86. Reviewed By: gchatelet Differential Revision: https://reviews.llvm.org/D96498	2021-02-12 09:50:22 +00:00
David Sherwood	fa38810646	[Analysis] Change VFABI::mangleTLIVectorName to use ElementCount Adds support for mangling TLI vector names for scalable vectors. Differential Revision: https://reviews.llvm.org/D96338	2021-02-12 09:38:12 +00:00
Fraser Cormack	765390acdd	[RISCV] Add support for integer fixed min/max This patch extends the initial fixed-length vector support to include smin, smax, umin, and umax. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D96491	2021-02-12 09:19:45 +00:00
Kristof Beyls	ceb25a320b	[DeveloperPolicy] Specify LLVM's license more clearly. Before, the first mention of LLVM's license on the developer policy page stated that LLVM's license is Apache 2. This patch makes that more accurate by mentioning the LLVM exception this first time the LLVM license is discussed on that page, i.e. Apache-2.0 with LLVM-exception. Technically, the correct SPDX identifier for LLVM's license is 'Apache-2.0 WITH LLVM-exception', but I thought that writing the 'WITH' in lower case made the paragraph easier to read without reducing clarity. Differential Revision: https://reviews.llvm.org/D96482	2021-02-12 09:16:43 +00:00
wlei	fafe9e7911	[CSSPGO][llvm-profgen] Add brackets for context id to support extended binary format To align with https://reviews.llvm.org/D95547, we need to add brackets for context id before initializing the `SampleContext`. Also added test cases for extended binary format from llvm-profgen side. Differential Revision: https://reviews.llvm.org/D95929	2021-02-12 01:14:53 -08:00
Sander de Smalen	4b43882cd0	[BasicTTIImpl] Fix getCastInstrCost for scalable vectors by querying for ElementCount. This fixes an overly restrictive assumption that the vector is a FixedVectorType, in code that tries to calculate the cost of a cast operation when splitting a too-wide vector. The algorithm works the same for scalable vectors, so this patch removes the cast<FixedVectorType>. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D96253	2021-02-12 08:28:52 +00:00
Sander de Smalen	0d847dd905	[CostModel] An extending load to illegal type is not free. COST(zext (<4 x i32> load(...) to <4 x i64>)) != 0 when <4 x i64> is an illegal result type that requires splitting of the operation. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D96250	2021-02-12 07:59:21 +00:00
Kazu Hirata	7e9ec71ad7	[CodeGen] Use range-based for loops (NFC)	2021-02-11 23:31:31 -08:00
Kazu Hirata	c14adbd811	[PGO] Drop unnecessary const from return types (NFC)	2021-02-11 23:31:29 -08:00
Kazu Hirata	01ea295bd7	[TableGen] Use ListSeparator (NFC)	2021-02-11 23:31:27 -08:00
Fangrui Song	7fde0f34c6	DebugInfo/Symbolize: Use stable_sort This fixes coff-dwarf.test on some build bots. The test relies on the sort order and prefers main (StorageClass: External) to .text (StorageClass: Static).	2021-02-11 22:53:56 -08:00
Max Kazantsev	7dc3f286c2	[Test] Add a potentially hanging test to prevent merging patches that hang it	2021-02-12 13:48:40 +07:00
Heejin Ahn	3c77486de7	[WebAssembly] Fix delegate's argument computation I previously assumed `delegate`'s immediate argument computation followed a different rule than that of branches, but we agreed to make it the same (https://github.com/WebAssembly/exception-handling/issues/146). This removes the need for a separate `DelegateStack` in both CFGStackify and InstPrinter. When computing the immediate argument, we use a different function for `delegate` computation because in MIR `DELEGATE`'s instruction's destination is the destination catch BB or delegate BB, and when it is a catch BB, we need an additional step of getting its corresponding `end` marker. Reviewed By: tlively, dschuff Differential Revision: https://reviews.llvm.org/D96525	2021-02-11 21:57:28 -08:00
Peter Collingbourne	f547c482fb	gn build: Support cross-compiling libunwind for Android. - Usual cross-compilation fix: s/target_/current_/g - Define _LIBUNWIND_IS_NATIVE_ONLY to enable unwinding past functions with return pointer authentication. - Android needs two libunwind static libraries: one with symbols exported and one without. These both need to be in the same build tree so the libunwind_hermetic_static_library configuration option doesn't help here. Replace it with build rules that build both libraries. - Install the libraries in the location that Android expects them to be. Differential Revision: https://reviews.llvm.org/D96563	2021-02-11 21:47:33 -08:00
Craig Topper	47244a42c7	[TableGen] Make the map in InfoByHwMode protected. NFCI Switch some for loops to just use the begin()/end() implementations in the InfoByHwMode struct. Add a method to insert into the map for the one case that was modifying the map directly.	2021-02-11 21:16:10 -08:00
James Y Knight	3a1ce2f6df	Fix bitcode decoder error in "Encode alignment attribute for `atomicrmw`" The wrong record field number was being used in bitcode decoding, which broke a self-hosted LTO build. (Yet, somehow, this _doesn't_ seem to have broken simple bitcode encode/decode roundtrip tests, and I'm not sure why...) Fixes commit d06ab79816785fa362e7d96d7a398bea8064cba7	2021-02-11 22:29:03 -05:00
Amara Emerson	b519f0f5db	[GlobalISel] Fix sext_inreg(load) combine to not move the originating load. The builder was using the extend user as the insertion point, which meant that we were incorrectly "moving" the load from its original position, and therefore could violate memory operation ordering.	2021-02-11 19:27:09 -08:00
Fangrui Song	abb3df8ce4	DebugInfo/Symbolize: Don't differentiate function/data symbolization Before d08bd13ac8a560c4645e17e192ca07e1bdcd2895, only `SymbolRef::ST_Function` symbols were used for .symtab symbolization. That commit added a `"DATA"` mode to llvm-symbolizer which used `SymbolRef::ST_Data` symbols for symbolization. Since function and data symbols have different addresses, we don't need to differentiate the two modes. This patches unifies the two modes to simplify code. `"DATA"` is used by `compiler-rt/lib/sanitizer_common/sanitizer_symbolizer_libcdep.cpp`. `check-hwasan` and `check-tsan` have runtime tests. Differential Revision: https://reviews.llvm.org/D96322	2021-02-11 19:22:44 -08:00
Philip Reames	2bc932f596	[knownbits] Preserve known bits for small shift recurrences The motivation for this is that I'm looking at an example that uses shifts as induction variables. There's lots of other omissions, but one of the first I noticed is that we can't compute tight known bits. (This indirectly causes SCEV's range analysis to produce very poor results as well.) Differential Revision: https://reviews.llvm.org/D96440	2021-02-11 17:56:36 -08:00
Philip Reames	5c03a3baf7	[tests] Autogen update test to remove whitespace diffs	2021-02-11 17:06:49 -08:00
Philip Reames	65435fd32d	[tests] precommit a tests for D96534 (and other range quality items)	2021-02-11 17:02:59 -08:00
Philip Reames	c8d230ef7d	[tests] Autogen a few tests for ease of update	2021-02-11 16:54:06 -08:00
Craig Topper	ef5423cf23	[RISCV] Add a pattern for a scalable vector mask vnot. We can use a vnand.mm with the same register for both inputs. This avoids materializing an alls ones constant with vmset.mm.	2021-02-11 15:34:58 -08:00
Hongtao Yu	01baf99f4a	Remove test code that cause MSAN failure. Summary: The negative test (with the feature being added disabled) caused MSAN failure and that's the added feature is supposed to fix. Therefore the negative test code is being removed.	2021-02-11 14:51:55 -08:00
ShihPo Hung	934bc977ad	[RISCV] Initial support for insert/extract subvector This patch handles cast-like insert_subvector & extract_subvector in which case: 1. index starts from 0. 2. inserting a fixed-width vector into a scalable vector, or extracting a fixed-width vector from a scalable vector. Reviewed By: craig.topper, frasercrmck Differential Revision: https://reviews.llvm.org/D96352	2021-02-11 14:35:49 -08:00
Pengxuan Zheng	cd15616305	[AArch64] Adding Neon Sm3 & Sm4 Intrinsics This adds SM3 and SM4 Intrinsics support for AArch64, specifically: vsm3ss1q_u32 vsm3tt1aq_u32 vsm3tt1bq_u32 vsm3tt2aq_u32 vsm3tt2bq_u32 vsm3partw1q_u32 vsm3partw2q_u32 vsm4eq_u32 vsm4ekeyq_u32 Reviewed By: labrinea Differential Revision: https://reviews.llvm.org/D95655	2021-02-11 14:20:20 -08:00
Guillaume Chatelet	92238fe870	Fix incorrect indentation in LangRef.rst	2021-02-11 20:47:43 +00:00
Hongtao Yu	a570cb0e5b	[CSSPGO] Process functions in a top-down order on a dynamic call graph. Functions are currently processed by the sample profiler loader in a top-down order defined by the static call graph. The order is being adjusted to be a top-down order based on the input context-sensitive profile. One benefit is that the processing order of caller and callee in one SCC would follow the context order in the profile to favor more inlining. Another benefit is that the processing order of caller and callee through an indirect call (which is not on the static call graph) can be honored which in turn allows for more inlining. The profile top-down order for SCC is also extended to support non-CS profiles. Two switches `-mllvm -use-profile-indirect-call-edges` and `-mllvm -use-profile-top-down-order` are being introduced. Reviewed By: wmi Differential Revision: https://reviews.llvm.org/D95988	2021-02-11 12:36:59 -08:00
Guillaume Chatelet	d7077079f8	Fix incorrect indentation in LangRef.rst	2021-02-11 20:34:19 +00:00

... 3 4 5 6 7 ...

211313 Commits