llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Georgii Rymar	0d030a773a	[obj2yaml][yaml2obj] - Add note-section.yaml tests. They were a part of D68983, but were lost in the last diff and were not committed for unknown reason. I've renamed (from elf-sht-note.yaml) them and fixed broken formating a few places. Everything else remained untouched.	2020-07-24 14:27:54 +03:00
Roman Lebedev	8acafe5379	[NFC][GVN] Clean loadpre-missed-opportunity.ll test some more	2020-07-24 12:44:22 +03:00
Florian Hahn	55c3c49e06	[IPSCCP] Add another test case with argmemonly callsite attributes.	2020-07-24 10:13:51 +01:00
Xing GUO	9f9286af5c	[DWARFYAML] Replace 'Format', 'Version', etc with 'FormParams'. NFC. This patch replaces 'Format', 'Version' fields, etc with 'FormParams' to simplify codes. Reviewed By: labath Differential Revision: https://reviews.llvm.org/D84496	2020-07-24 16:54:51 +08:00
Petar Avramovic	8a0a38f0c4	AMDGPU/GlobalISel: Select set.inactive intrinsic Differential Revision: https://reviews.llvm.org/D84407	2020-07-24 10:14:14 +02:00
Craig Topper	c93c5e72a0	[X86] Make the X86ProcFamilyEnum private to X86Subtarget. Removed unneeded 'protected' from X86Subtarget. NFC	2020-07-23 23:42:11 -07:00
Petr Hosek	300de51f92	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-07-23 23:05:36 -07:00
Mircea Trofin	c07fcb7a55	[llvm][NFC] Don't use llvm/Config/config.h in .h files config.h is excluded from installs, llvm-config.h isn't Differential Revision: https://reviews.llvm.org/D84459	2020-07-23 22:27:38 -07:00
Xing GUO	8195cce4cd	[DWARFYAML] Use writeDWARFOffset() to simplify emitting offsets. NFC. This patch uses writeDWARFOffset() to simplify some codes. NFC.	2020-07-24 12:15:18 +08:00
Nico Weber	2add9f8975	[gn build] (manually) merge d054c7ee2e9	2020-07-23 22:28:23 -04:00
Fangrui Song	4f0382f382	Add test utility 'extract' See https://lists.llvm.org/pipermail/llvm-dev/2020-July/143373.html "[llvm-dev] Multiple documents in one test file" for some discussions. `extract part filename` splits the input file into multiple parts separated by regex `^(.\|//)--- ` and extract the specified part to stdout or the output file (if specified). Use case A (organizing input of different formats (e.g. linker script+assembly) in one file). ``` // RUN: extract lds %s -o %t.lds // RUN: extract asm %s -o %t.s // RUN: llvm-mc %t.s -o %t.o // RUN: ld.lld -T %t.lds %t.o -o %t This is sometimes better than the %S/Inputs/ approach because the user can see the auxiliary files immediately and don't have to open another file. ``` Use case B (for utilities which don't have built-in input splitting feature): ``` // RUN: extract case1 %s \| llc \| FileCheck %s --check-prefix=CASE1 // RUN: extract case2 %s \| llc \| FileCheck %s --check-prefix=CASE2 Combing tests prudently can improve readability. This is sometimes better than having multiple test files. ``` Since this is a new utility, there is no git history concerns for UpperCase variable names. I use lowerCase variable names like mlir/lld. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D83834	2020-07-23 19:15:35 -07:00
Craig Topper	a70e8e80f8	[LegalizeTypes] Teach DAGTypeLegalizer::GenWidenVectorLoads to pad with undef if needed when concatenating small or loads to match a larger load In the included test case the align 16 allowed the v23f32 load to handled as load v16f32, load v4f32, and load v4f32(one element not used). These loads all need to be concatenated together into a final vector. In this case we tried to concatenate the two v4f32 loads to match the type of the v16f32 load so we could do a second concat_vectors, but those loads alone only add up to v8f32. So we need to two v4f32 undefs to pad it. It appears we've tried to hack around a similar issue in this code before by adding undef padding to loads in one of the earlier loops in this function. Originally in r147964 by padding all loads narrower than previous loads to the same size. Later modifed to only the last load in r293088. This patch removes that earlier code and just handles it on demand where we know we need it. Fixes PR46820 Differential Revision: https://reviews.llvm.org/D84463	2020-07-23 19:02:03 -07:00
Matt Arsenault	929709ee06	GlobalISel: Add scalarSameSizeAs LegalizeRule Widen or narrow a type to a type with the same scalar size as another. This can be used to force G_PTR_ADD/G_PTRMASK's scalar operand to match the bitwidth of the pointer type. Use this to disallow narrower types for G_PTRMASK.	2020-07-23 21:17:31 -04:00
Matt Arsenault	4fa5e95608	GlobalISel: Drop original type pointeriness in minScalarSameAs It is not useful to report WidenScalar for a pointer value, so always report a scalar value with the target size. This allows using this to clamp the scalar operand to the pointer size in operations like G_PTR_ADD or G_PTRMASK.	2020-07-23 21:17:18 -04:00
Matt Arsenault	b410d6597f	GlobalISel: Fix maxScalarIf predicate check The std::function itself was tested, not the result of the actual query. It seems like there should be a warning for this. Inline the check to avoid this.	2020-07-23 21:17:10 -04:00
Fangrui Song	ff71233589	Revert D84264 "[SCCP] Directly remove non-feasible edges" & 5db5b4bc4394ca247c9eb665e03b851848aa2fbf It breaks stage-2 build. Clang crashed when compiling llvm/lib/Target/Hexagon/HexagonFrameLowering.cpp llvm/Support/GenericDomTree.h eraseNode: Node is not a leaf node	2020-07-23 17:51:48 -07:00
Eli Friedman	f7eaa43249	[AArch64][SVE] Teach copyPhysReg to copy ZPR2/3/4. It's sort of tricky to hit this in practice, but not impossible. I have a synthetic C testcase if anyone is interested. The implementation is identical to the equivalent NEON register copies. Differential Revision: https://reviews.llvm.org/D84373	2020-07-23 16:41:37 -07:00
Lang Hames	85d946b864	[ORC] Enable use of TargetProcessControl::getMemMgr with ObjectLinkingLayer. This patch makes ownership of the JITLinkMemoryManager by ObjectLinkingLayer optional: the layer can still own the memory manager but no longer has to. Evevntually we want to move to a state where ObjectLinkingLayer never owns its memory manager. For now allowing optional ownership makes it easier to develop classes that can dynamically use either RTDyldObjectLinkingLayer, which owns its memory managers, or ObjectLinkingLayer (e.g. LLJIT).	2020-07-23 16:18:57 -07:00
Craig Topper	cd59b87053	[bugpoint] Fix typo in assertion message. NFC	2020-07-23 16:06:43 -07:00
Thomas Lively	7d8901e025	[WebAssembly] Fix store_unfolded_offset tests in simd-offset.ll These tests were previously duplicates of the unfolded_gep_negative_offset tests, and this change updates them to test what they were meant to test. Differential Revision: https://reviews.llvm.org/D84365	2020-07-23 16:05:20 -07:00
Roman Lebedev	fe2f572177	[NFC][GVN] Add a (horrible) test for D84181 demonstrating non-NFC'ness	2020-07-24 01:28:23 +03:00
Amy Kwan	57e04066c4	[PowerPC] Implement low-order Vector Multiply, Modulus and Divide Instructions This patch aims to implement the low order vector multiply, divide and modulo instructions available on Power10. The patch involves legalizing the ISD nodes MUL, UDIV, SDIV, UREM and SREM for v2i64 and v4i32 vector types in order to utilize the following instructions: - Vector Multiply Low Doubleword: vmulld - Vector Modulus Word/Doubleword: vmodsw, vmoduw, vmodsd, vmodud - Vector Divide Word/Doubleword: vdivsw, vdivsd, vdivuw, vdivud Differential Revision: https://reviews.llvm.org/D82510	2020-07-23 17:18:36 -05:00
Petr Hosek	cb35cea843	Revert "[CMake] Simplify CMake handling for zlib" This reverts commit 1d09ecf36175f7910ffedd6d497c07b5c74c22fb since it breaks sanitizer bots.	2020-07-23 15:12:42 -07:00
Eric Christopher	d8256761e7	Use llvm::size rather than an empty loop to get the number of top level loops.	2020-07-23 14:55:50 -07:00
Petr Hosek	683670a251	[CMake] Simplify CMake handling for zlib Rather than handling zlib handling manually, use find_package from CMake to find zlib properly. Use this to normalize the LLVM_ENABLE_ZLIB, HAVE_ZLIB, HAVE_ZLIB_H. Furthermore, require zlib if LLVM_ENABLE_ZLIB is set to YES, which requires the distributor to explicitly select whether zlib is enabled or not. This simplifies the CMake handling and usage in the rest of the tooling. This is a reland of abb0075 with all followup changes and fixes that should address issues that were reported in PR44780. Differential Revision: https://reviews.llvm.org/D79219	2020-07-23 14:47:25 -07:00
Amara Emerson	ab3de496a6	[GlobalISel] Use existing MIR builder instead of creating one in combiner.	2020-07-23 14:16:45 -07:00
Sidharth Baveja	c3dd7f4b12	[Loop Fusion] Integrate Loop Peeling into Loop Fusion (re-land after fixing ASAN build failures) This patch adds the ability to peel off iterations of the first loop in loop fusion. This can allow for both loops to have the same trip count, making it legal for them to be fused together. Here is a simple scenario peeling can be used in loop fusion: for (i = 0; i < 10; ++i) a[i] = a[i] + 3; for (j = 1; j < 10; ++j) b[j] = b[j] + 5; Here is we can make use of peeling, and then fuse the two loops together. We can peel off the 0th iteration of the loop i, and then combine loop i and j for i = 1 to 10. a[0] = a[0] +3; for (i = 1; i < 10; ++i) { a[i] = a[i] + 3; b[i] = b[i] + 5; } Currently peeling with loop fusion is only supported for loops with constant trip counts and a single exit point. Both unguarded and guarded loops are supported. Reviewed By: bmahjour (Bardia Mahjour), MaskRay (Fangrui Song) Differential Revision: https://reviews.llvm.org/D82927	2020-07-23 21:02:04 +00:00
David Green	ab0cc8c927	[ARM] Add predicated mla reduction patterns Similar to 8fa824d7a3 but this time for MLA patterns, this selects predicated vmlav/vmlava/vmlalv/vmlava instructions from vecreduce.add(select(p, mul(x, y), 0)) nodes. Differential Revision: https://reviews.llvm.org/D84102	2020-07-23 21:47:59 +01:00
Steven Wu	b1abd6ff52	[Bitcode] Avoid duplicating linker option when upgrading Summary: The upgrading path from old ModuleFlag based linker options to the new NamedMetadata based linker option in in materializeMetadata() which gets called once for the module and once for every GV. The linker options are getting dup'ed every time and it can create massive amount of the linker options in the object file that gets created from old bitcode. Fix the problem by checking if the new option exists or not before upgrade again. rdar://64543389 Reviewers: pcc, t.p.northover, dexonsmith, arphaman Reviewed By: arphaman Subscribers: hiraditya, jkorous, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83688	2020-07-23 13:07:28 -07:00
Nikita Popov	113464a305	[SCCP] Add another switch+phi test (NFC)	2020-07-23 21:51:09 +02:00
Tarindu Jayatilaka	ab5e6db94c	Add new function properties to FunctionPropertiesAnalysis Added LoadInstCount, StoreInstCount, MaxLoopDepth, LoopCount Reviewed By: jdoerfert, mtrofin Differential Revision: https://reviews.llvm.org/D82283	2020-07-23 12:46:47 -07:00
Matt Arsenault	9c29bf3fb7	AMDGPU: Fix failures from overflowing uint8_t number of operands If the operand index exceeded the limit of unsigned char, it wrapped and would point to the wrong operand. Increase the size of the operand index field to avoid this, and also don't bother trying to fold into implicit operands.	2020-07-23 15:39:33 -04:00
Amara Emerson	b33e55da04	[AArch64][GlobalISel] Add post-legalize combine for sext(trunc(sextload)) -> trunc/copy On AArch64 we generate redundant G_SEXTs or G_SEXT_INREGs because of this. Differential Revision: https://reviews.llvm.org/D81993	2020-07-23 12:06:35 -07:00
Nikita Popov	a35546a2b5	[SCCP] Add missing change reporting Forgot to actually use the return value of the function.	2020-07-23 20:58:29 +02:00
Tarindu Jayatilaka	d725c86409	Add a Printer to the FunctionPropertiesAnalysis A printer pass and a lit test case was added. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D82523	2020-07-23 11:57:11 -07:00
Nikita Popov	62e22a5a7e	[IR] Add min/max/abs intrinsics This adds the llvm.abs(), llvm.umin(), llvm.umax(), llvm.smin(), and llvm.smax() intrinsics specified in D81829. For SelectionDAG, the ISD opcodes and all the legalization and lowering already exist, so this just wires them up to the intrinsic in the SDAG builder and adds rudimentary tests. For GlobalISel only the min/max intrinsics are wired up, as llvm.abs() will require the addition of a G_ABS op, and corresponding legalization support. Differential Revision: https://reviews.llvm.org/D84125	2020-07-23 20:56:19 +02:00
Roman Lebedev	39bcc4be1d	[LangRef] Add integer min/max/abs intrinsics Add LangRef specification for the llvm.abs, llvm.umin, llvm.umax, llvm.smin, and llvm.smax integer intrinsics. Link to RFC: https://lists.llvm.org/pipermail/llvm-dev/2020-June/142257.html Proposed alive2 implementation: https://github.com/AliveToolkit/alive2/pull/353 Differential Revision: https://reviews.llvm.org/D81829	2020-07-23 20:56:18 +02:00
Tarindu Jayatilaka	904b2334bd	Refactor FunctionPropertiesAnalysis this separates `analyze` logic from `FunctionPropertiesAnalysis` Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D82521	2020-07-23 11:49:10 -07:00
Matt Arsenault	ec425b15a5	AMDGPU/GlobalISel: Add some tests for stack passed pointers	2020-07-23 14:38:31 -04:00
Nikita Popov	a7f6e63f5c	[SCCP] Directly remove non-feasible edges Non-feasible control-flow edges are currently removed by replacing the branch condition with a constant and then calling ConstantFoldTerminator. This happens in a rather roundabout manner, by inspecting the users (effectively: predecessors) of unreachable blocks, and further complicated by the need to explicitly materialize the condition for "forced" edges. I would like to extend SCCP to discard switch conditions that are non-feasible based on range information, but this is incompatible with the current approach (as there is no single constant we could use.) Instead, this patch explicitly removes non-feasible edges. It currently only needs to handle the case where there is a single feasible edge. The llvm_unreachable() branch will need to be implemented for the aforementioned switch improvement. Differential Revision: https://reviews.llvm.org/D84264	2020-07-23 20:32:57 +02:00
Matt Arsenault	9f712b1dc5	AMDGPU/GlobalISel: Handle call return values The only case that I know doesn't work is the implicit sret case when the return type doesn't fit in the return registers.	2020-07-23 14:29:35 -04:00
Nikita Popov	067f635646	[PredicateInfo][SCCP] Remove assertion (PR46814) As long as RenamedOp is not guaranteed to be accurate, we cannot assert here and should just return false. This was already done for the other conditions in this function. Fixes https://bugs.llvm.org/show_bug.cgi?id=46814.	2020-07-23 19:36:51 +02:00
Simon Pilgrim	b67f20fe61	[X86][SSE] Add additional (f)add(shuffle(x,y),shuffle(x,y)) tests for D83789	2020-07-23 18:30:34 +01:00
Gui Andrade	cf4816b501	[MSAN] Allow emitting checks for struct types Differential Revision: https://reviews.llvm.org/D82680	2020-07-23 16:50:59 +00:00
Simon Pilgrim	15ef46e64d	RegionInfo.cpp - remove duplicate includes that already exist in RegionInfo.h. NFC. Also remove some unnecessary forward declarations in RegionInfo.h.	2020-07-23 17:50:22 +01:00
Gui Andrade	8c1506c928	[MSAN] Never allow checking calls to __sanitizer_unaligned_{load,store} These functions expect the caller to always pass shadows over TLS. Differential Revision: https://reviews.llvm.org/D84351	2020-07-23 16:42:59 +00:00
Florian Hahn	b85e3d578d	[IPSCCP] Add test case for PR46717 for argmemonly handling.	2020-07-23 17:25:26 +01:00
Simon Pilgrim	78cccc12b3	CSEInfo.h - remove unused includes. NFC.	2020-07-23 17:17:21 +01:00
Craig Topper	90ac13b6c2	[X86] Add Feature64Bit to the 'generic' CPU and remove feature string hacking in X86Subtarget constructor Feature64Bit is only used by a check in the X86Subtarget constructor to ensure that the CPU selected supports 64-bit mode when the triple is for 64-bit mode. 'generic' is the default CPU in llc and so needs to be able to pass this check. Previously we did this by detecting the name and adding the feature to the feature string. But there doesn't seem to be any reason we can't just add the feature to the CPU directly.	2020-07-23 09:16:18 -07:00
Steven Wu	7e7fa937e3	[Bitcode] Drop invalid branch_weight in BitcodeReader Summary: If bitcode reader gets an invalid branch weight, drop that from the inputs. This allows us to read the broken modules we generated before the verifier was able to catch this. rdar://64870641 Reviewers: yrouban, t.p.northover, dexonsmith, arphaman, aprantl Reviewed By: aprantl Subscribers: aprantl, hiraditya, jkorous, ributzka, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83699	2020-07-23 09:07:15 -07:00

1 2 3 4 5 ...

200686 Commits