llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 13:11:39 +01:00

Author	SHA1	Message	Date
David Sherwood	009f73b707	Revert "[SVE] Add checks for no warnings in SVE tests" This reverts commit fb495c31701b2cd697cf4282f63764c9cefd49c0. Was causing test failures and broke buildbot.	2020-06-18 08:03:31 +01:00
David Sherwood	59b1a7011c	[CodeGen] Fix warnings in getVectorElementCount() In EVT::getVectorElementCount() when the type is not simple we should return getExtendedVectorElementCount() from the function instead of constructing the ElementCount object manually. I discovered this warning in an existing test: test/CodeGen/AArch64/sve-intrinsics-loads.ll Differential Revision: https://reviews.llvm.org/D81927	2020-06-18 07:51:11 +01:00
David Sherwood	4399d793fa	[SVE] Add checks for no warnings in SVE tests There are now quite a few SVE tests in LLVM and Clang that do not emit warnings related to invalid use of EVT::getVectorNumElements() and VectorType::getNumElements(). For these tests I have added additional checks that there are no warnings in order to prevent any future regressions. Differential Revision: https://reviews.llvm.org/D80712	2020-06-18 07:49:48 +01:00
Kristof Beyls	d4b7fc9e9e	[IndirectThunks] Tiny comment fix	2020-06-18 06:50:30 +01:00
Kristof Beyls	5fc1ee97f5	[IndirectThunks] Make generated MF structure as expected by all instruction selectors. This also enables running the AArch64 SLSHardening pass with GlobalISel, so add a test for that. Differential Revision: https://reviews.llvm.org/D81403	2020-06-18 06:44:53 +01:00
Kristof Beyls	43c0a7f78b	[AArch64] SLSHardening: compute correct thunk name for X29. The enum values for AArch64 registers are not all consecutive. Therefore, the computation "__llvm_slsblr_thunk_x" + utostr(Reg - AArch64::X0) is not always correct. utostr(Reg - AArch64::X0) will not generate the expected string for the registers that do not have consecutive values in the enum. This happened to work for most registers, but does not for AArch64::FP (i.e. register X29). This can get triggered when the X29 is not used as a frame pointer. Differential Revision: https://reviews.llvm.org/D81997	2020-06-18 06:36:49 +01:00
Max Kazantsev	3557563d55	[Test] Add more tests showing missing opportunities in Select instcombine	2020-06-18 12:32:55 +07:00
Xing GUO	bf7dda020d	[DWARFYAML][debug_abbrev] Make the abbreviation code optional. This patch helps make the `Code` optional in abbreviations table. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D81826	2020-06-18 13:02:54 +08:00
Mehdi Amini	d3e71cd05e	Remove "unused" member ModuleSlice from `struct OpenMPOpt` This is fixing warning from clang: warning: private field 'ModuleSlice' is not used [-Wunused-private-field] SmallPtrSetImpl<Function *> &ModuleSlice; ^ Differential Revision: https://reviews.llvm.org/D82027	2020-06-18 03:02:26 +00:00
Kang Zhang	aa0efac1ba	[PowerPC] Don't convert Loop to CTR Loop for fp128 BinaryOperator Summary: For PPC BinaryOperator of fp128 will become libcall, we shouldn't convert loop to CTR loop if the loop contain libCall. But currently, in the PPCTTIImpl::mightUseCTR() function, we only deal with BinaryOperator for ppc_fp128, don't deal with the fp128. Reviewed By: shchenz Differential Revision: https://reviews.llvm.org/D81353	2020-06-18 02:54:19 +00:00
Xing GUO	82c4755d8e	[ObjectYAML][ELF] Add support for emitting the .debug_abbrev section. This patch enables yaml2elf emit the .debug_abbrev section. The generated .debug_abbrev is verified using `llvm-dwarfdump`. Known issues that will be addressed later: - Current implementation doesn't support generating multiple abbreviation tables in one .debug_abbrev section. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D81820	2020-06-18 10:50:38 +08:00
Esme-Yi	3a5edae7bf	[PowerPC] Custom lower rotl v1i128 to vector_shuffle. Summary: A bug is reported in bugzilla-45628, where the swap_with_shift case can’t be matched to a single HW instruction xxswapd as expected. In fact the case matches the idiom of rotate. We have MatchRotate to handle an ‘or’ of two operands and generate a rot[lr] if the case matches the idiom of rotate. While PPC doesn’t support ROTL v1i128. We can custom lower ROTL v1i128 to the vector_shuffle. The vector_shuffle will be matched to a single HW instruction during the phase of instruction selection. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D81076	2020-06-18 01:32:23 +00:00
Sam Clegg	7eacd92326	[WebAssembly] MC: Fix for data aliases with offsets (getelementptr) For some reason we hadn't seen such cases in the wild which makes me think that clang and rustc don't generate these. In the bug which reproduces it only occurs with LTO so my guess is that some LTO pass is creating this alias + gep. See: https://github.com/emscripten-core/emscripten/issues/8731 Differential Revision: https://reviews.llvm.org/D79462	2020-06-17 16:25:50 -07:00
Matt Arsenault	cf1fb3e07a	AMDGPU: Don't use 16-bit FP inline constants in integer operands It seems to be a hardware defect that the half inline constants do not work as expected for the 16-bit integer operations (the inverse does work correctly). Experimentation seems to show these are really reading the 32-bit inline constants, which can be observed by writing inline asm using op_sel to see what's in the high half of the constant. Theoretically we could fold the high halves of the 32-bit constants using op_sel. The *_asm_all.s MC tests are broken, and I don't know where the script to autogenerate these are. I started manually fixing it, but there's just too many cases to fix. This also does break the assembler/disassembler support for these values, and I'm not sure what to do about it. These are still valid encodings, so it seems like you should be able to use them in some way. If you wrote assembly using them, you could have really meant it (perhaps to read the high bits with op_sel?). The disassembler will print the invalid literal constant which will fail to re-assemble. The behavior is also different depending on the use context. Consider this example, which was previously accepted and encoded using the inline constant: v_mad_i16 v5, v1, -4.0, v3 ; encoding: [0x05,0x00,0xec,0xd1,0x01,0xef,0x0d,0x04] In contexts where an inline immediate is required (such as on gfx8/9), this will now be rejected. For gfx10, this will produce the literal encoding and change the printed format: v_mad_i16 v5, v1, 0xc400, v3 ; encoding: [0x05,0x00,0x5e,0xd7,0x01,0xff,0x0d,0x04,0x00,0xc4,0x00,0x00] This is just another variation of the issue that we don't perfectly handle round trip assembly/disassembly due to not tracking how immediates were encoded. This doesn't matter much in practice, since compilers don't emit the suboptimal encoding. I doubt any users are relying on this behavior (although I did make use of the old behavior to figure out what was wrong). Fixes bug 46302.	2020-06-17 19:14:10 -04:00
Yonghong Song	fdc9c93fe2	[BPF] fix a bug for BTF pointee type pruning In BTF, pointee type pruning is used to reduce cluttering too many unused types into prog BTF. For example, struct task_struct { ... struct mm_struct mm; ... } If bpf program does not access members of "struct mm_struct", there is no need to bring types for "struct mm_struct" to BTF. This patch fixed a bug where an incorrect pruning happened. The test case like below: struct t; typedef struct t _t; struct s1 { _t c; }; int test1(struct s1 arg) { ... } struct t { int a; int b; }; struct s2 { _t c; } int test2(struct s2 arg) { ... } After processing test1(), among others, BPF backend generates BTF types for "struct s1", "_t" and a placeholder for "struct t". Note that "struct t" is not really generated. If later a direct access to "struct t" member happened, "struct t" BTF type will be generated properly. During processing test2(), when processing member type "_t c", BPF backend sees type "_t" already generated, so returned. This caused the problem that "struct t" BTF type is never generated and eventually causing incorrect type definition for "struct s2". To fix the issue, during DebugInfo type traversal, even if a typedef/const/volatile/restrict derived type has been recorded in BTF, if it is not a type pruning candidate, type traversal of its base type continues. Differential Revision: https://reviews.llvm.org/D82041	2020-06-17 15:13:46 -07:00
Eric Christopher	d2da144d3a	Revert "Remove unused class variable ModuleSlice." as it was used in debug only code. This reverts commit 07a1749081bac07515d357fd96165662031de527.	2020-06-17 14:45:17 -07:00
Eric Christopher	155c9de0ea	Remove unused class variable ModuleSlice.	2020-06-17 14:33:29 -07:00
Christopher Tetreault	faa78a0019	[SVE] Eliminate bad VectorType::getNumElements() calls from ConstantFold Summary: Assume all usages of this function are explicitly fixed-width operations and cast to FixedVectorType Reviewers: efriedma, sdesmalen, c-rhodes, majnemer, dblaikie Reviewed By: sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80262	2020-06-17 14:19:56 -07:00
Christopher Tetreault	a01428816d	[SVE] Fix invalid usages of getNumElements in ShuffleVectorInstruction Summary: Fix invalid usages of getNumElements identified by test case LLVM.Transforms/InstCombine::vscale_extractelement.ll. changesLength: Since the length of the llvm::SmallVector shufflemask is related to the minimum number of elements in a scalable vector, it is fine to just get the Min field of the ElementCount isIdentityWithExtract: Since it is not possible to express the mask needed for this pattern for scalable vectors, we can just bail before calling getNumElements() Reviewers: efriedma, sdesmalen, fpetrogalli, gchatelet, yrouban, craig.topper Reviewed By: sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81969	2020-06-17 13:45:34 -07:00
Roman Lebedev	f11df3ab21	[InstCombine] Negator: while there, add detection for cycles during negation I don't have any testcases showing it happening, and i haven't succeeded in creating one, but i'm also not positive it can't ever happen, and i recall having something that looked like that in the very beginning of Negator creation. But since we now already have a negation cache, we can now detect such cases practically for free. Let's do so instead of "relying" on stack overflow :D	2020-06-17 22:47:20 +03:00
Roman Lebedev	9a68cef13a	[InstCombine] Negator: cache negation results (PR46362) It is possible that we can try to negate the same value multiple times. For example, PHI nodes may happen to have multiple incoming values (all of which must be the same value) for the same incoming basic block. It may happen that we try to negate such a PHI node, and succeed, and that might result in having now-different incoming values.. To avoid that, and in general to reduce the amount of duplicated work we might be doing, let's introduce a cache where we'll track results of negating each value. The added test was previously failing -verify after -instcombine. Fixes https://bugs.llvm.org/show_bug.cgi?id=46362	2020-06-17 22:47:20 +03:00
Roman Lebedev	7ee109a70f	[NFC][InstCombine] Negator: add thin negate() wrapped before visit()	2020-06-17 22:47:20 +03:00
Roman Lebedev	3f4b036382	[NFC][InstCombine] Negator: do not include unneeded "llvm/IR/DerivedTypes.h" header	2020-06-17 22:47:19 +03:00
Thomas Lively	447b230f85	[WebAssembly] Fix bug in FixBrTables and use branch analysis utils Summary: This commit fixes a bug in the FixBrTables pass in which an unconditional branch from the switch header block to the jump table block was not removed before the blocks were combined. The result was an invalid CFG in the MachineFunction. This commit also switches from using bespoke branch analysis and deletion code to using the standard utilities for the same. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81909	2020-06-17 12:34:45 -07:00
Nick Desaulniers	9bf0fe84c8	[InlineSpiller] add assert about spills post terminators Summary: This invariant is being violated in the test case https://reviews.llvm.org/D77849, related to the use of the relatively new ability for callbr to have return values, and MachineBasicBlocks with INLINEASM_BR terminators to emit live out register defs. As noted in the comment, this triggers invariant violations in MachineVerifier via `llc -verify-machineinstrs` or `llc -verify-regalloc`, since only MachineInstrs that are terminators are allowed to follow the first terminator. https://reviews.llvm.org/D75098 may rework this very assertion if we're spilling via a (proposed) TCOPY MachineInstr. Reviewers: void, efriedma, arsenm Reviewed By: efriedma Subscribers: qcolombet, wdng, hiraditya, llvm-commits, srhines Tags: #llvm Differential Revision: https://reviews.llvm.org/D78166	2020-06-17 11:51:58 -07:00
Nick Desaulniers	60914a2419	BreakCriticalEdges for callbr indirect dests Summary: llvm::SplitEdge was failing an assertion that the BasicBlock only had one successor (for BasicBlocks terminated by CallBrInst, we typically have multiple successors). It was surprising that the earlier call to SplitCriticalEdge did not handle the critical edge (there was an early return). Removing that triggered another assertion relating to creating a BlockAddress for a BasicBlock that did not (yet) have a parent, which is a simple order of operations issue in llvm::SplitCriticalEdge (a freshly constructed BasicBlock must be inserted into a Function's basic block list to have a parent). Thanks to @nathanchance for the report. Fixes: https://github.com/ClangBuiltLinux/linux/issues/1018 Reviewers: craig.topper, jyknight, void, fhahn, efriedma Reviewed By: efriedma Subscribers: eli.friedman, rnk, efriedma, fhahn, hiraditya, llvm-commits, nathanchance, srhines Tags: #llvm Differential Revision: https://reviews.llvm.org/D81607	2020-06-17 11:45:06 -07:00
Julian Lettner	01a5d2a598	[lit] Avoid re-iterating result codes in different places Let's have one canonical place to define ResultCode instances and their labels. Also make ResultCode's `__init__` function self-registering to better support custom ResultCodes.	2020-06-17 11:40:19 -07:00
Zequan Wu	7549421f49	[llvm-readobj] set --elf-cg-profile as alias of --cg-profile Summary: Rename --elf-cg-profile to --cg-profile and keep --elf-cg-profile as an alias of --cg-profile. Reviewers: jhenderson, MaskRay, espindola, hans Reviewed By: jhenderson, MaskRay Subscribers: emaste, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81855	2020-06-17 11:24:45 -07:00
Davide Italiano	b26165d0dc	[CGP] Reset the debug location when promoting zext(s). When the zext gets promoted, it used to retain the original location, which pessimizes the debugging experience causing an unexpected jump in stepping at -Og. Fixes https://bugs.llvm.org/show_bug.cgi?id=46120 (which also contains a full C repro). Differential Revision: https://reviews.llvm.org/D81437	2020-06-17 11:13:13 -07:00
Ian Levesque	b5f0be1e22	[xray] Option to omit the function index Summary: Add a flag to omit the xray_fn_idx to cut size overhead and relocations roughly in half at the cost of reduced performance for single function patching. Minor additions to compiler-rt support per-function patching without the index. Reviewers: dberris, MaskRay, johnislarry Subscribers: hiraditya, arphaman, cfe-commits, #sanitizers, llvm-commits Tags: #clang, #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D81995	2020-06-17 13:49:01 -04:00
Alexandre Ganea	39f366394d	[X86] For 32-bit targets, emit two-byte NOP when possible In order to support hot-patching, we need to make sure the first emitted instruction in a function is a two-byte+ op. This is already the case on x86_64, which seems to always emit two-byte+ ops. However on 32-bit targets this wasn't the case. PATCHABLE_OP now lowers to a XCHG AX, AX, (66 90) like MSVC does. However when targetting pentium3 (/arch:SSE) or i386 (/arch:IA32) targets, we generate MOV EDI,EDI (8B FF) like MSVC does. This is for compatiblity reasons with older tools that rely on this two byte pattern. Differential Revision: https://reviews.llvm.org/D81301	2020-06-17 13:44:38 -04:00
Alexandre Ganea	f74b1eb181	[X86] Change signature of EmitNops. NFC. This is to support https://reviews.llvm.org/D81301.	2020-06-17 13:44:37 -04:00
Fangrui Song	78ddb2f901	[llvm-cov gcov] Support clang<11 fake 4.2 format Test cases are restored from a3bed4bd3743b5fee1e66116a63089df742bcae1	2020-06-17 10:17:15 -07:00
Michał Górny	e37aaf4758	[llvm] Disable linking llvm-exegesis to dylib Force linking llvm-exegesis to static LLVM libraries instead of dylib to prevent duplicate symbols due to linking both. Ideally, we'd want to link to the dylib only here but the target sub-libraries use hidden symbols from LLVM target libraries and therefore linking the dylib fails. Differential Revision: https://reviews.llvm.org/D81922	2020-06-17 19:00:26 +02:00
Michał Górny	1655e30934	[llvm] Avoid linking llvm-cfi-verify to duplicate libs Fix the CMake rules for LLVMCFIVerify library not to pull duplicate LLVM .a libraries when linking to the dylib. This prevents problems due to duplicate symbols and apparently fixes mingw32. This is an alternative approach to D44650 that just forces .a libraries instead. However, there doesn't seem to be any reason to do that. Differential Revision: https://reviews.llvm.org/D81921	2020-06-17 19:00:26 +02:00
Michał Górny	6b982af8c2	[llvm] [CommandLine] Do not suggest really hidden opts in nearest lookup Skip 'really hidden' options when performing lookup of the nearest option when invalid option was passed. Since these options aren't even documented in --help-hidden, it seems inconsistent to suggest them to users. This fixes clang-tools-extra test failures due to unexpected suggestions when linking the tools to LLVM dylib (that provides more options than the subset of LLVM libraries linked directly). Differential Revision: https://reviews.llvm.org/D82001	2020-06-17 19:00:26 +02:00
Scott Linder	b04677bc50	[AMDGPU] Skip CFIInstructions in SIInsertWaitcnts Summary: CFI emitted during PEI at the beginning of the prologue needs to apply to any inserted waitcnts on function entry. Reviewers: arsenm, t-tye, RamNalamothu Reviewed By: arsenm Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm, #debug-info Differential Revision: https://reviews.llvm.org/D76881	2020-06-17 12:41:03 -04:00
vnalamot	a597b94680	[NFC] Move getAll{S,V}GPR{32,128} methods to SIFrameLowering Summary: Future patch needs some of these in multiple places. The definitions of these can't be in the header and be eligible for inlining without making the full declaration of GCNSubtarget visible. I'm not sure what the right trade-off is, but I opted to not bloat SIRegisterInfo.h Reviewers: arsenm, cdevadas Reviewed By: arsenm Subscribers: RamNalamothu, qcolombet, jvesely, wdng, nhaehnle, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79878	2020-06-17 12:08:09 -04:00
sstefan1	ca69beb4d9	[OpenMPOPT][NFC] Introducing OMPInformationCache. Summary: Introduction of OpenMP-specific information cache based on Attributor's `InformationCache`. This should make it easier to share information between them. Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6, uenoku Subscribers: yaxunl, hiraditya, guansong, uenoku, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81798	2020-06-17 16:56:45 +02:00
Jay Foad	2b67d7fb66	[AMDGPU] Simplify GCNPassConfig::addOptimizedRegAlloc. NFC.	2020-06-17 15:56:15 +01:00
Simon Pilgrim	96c16f069b	[X86] Regenerate mixed-ptr tests checks Help reduce the diffs in D81517	2020-06-17 15:48:24 +01:00
Simon Pilgrim	30d12a5b04	ScalarEvolution.h - reduce LoopInfo.h include to forward declarations. NFC. Move ScalarEvolution::forgetLoopDispositions implementation to ScalarEvolution.cpp to remove the dependency. Add implicit header dependency to source files where necessary.	2020-06-17 15:48:23 +01:00
Sjoerd Meijer	0d40769e87	[ARM] Reimplement MVE Tail-Predication pass using @llvm.get.active.lane.mask To set up a tail-predicated loop, we need to to calculate the number of elements processed by the loop. We can now use intrinsic @llvm.get.active.lane.mask() to do this, which is emitted by the vectoriser in D79100. This intrinsic generates a predicate for the masked loads/stores, and consumes the Backedge Taken Count (BTC) as its second argument. We can now use that to reconstruct the loop tripcount, instead of the IR pattern match approach we were using before. Many thanks to Eli Friedman and Sam Parker for all their help with this work. This also adds overflow checks for the different, new expressions that we create: the loop tripcount, and the sub expression that calculates the remaining elements to be processed. For the latter, SCEV is not able to calculate precise enough bounds, so we work around that at the moment, but is not entirely correct yet, it's conservative. The overflow checks can be overruled with a force flag, which is thus potentially unsafe (but not really because the vectoriser is the only place where this intrinsic is emitted at the moment). It's also good to mention that the tail-predication pass is not yet enabled by default. We will follow up to see if we can implement these overflow checks better, either by a change in SCEV or we may want revise the definition of llvm.get.active.lane.mask. Differential Revision: https://reviews.llvm.org/D79175	2020-06-17 15:17:42 +01:00
Kirill Naumov	17cf0dfd15	Revert "[InlineCost] InlineCostAnnotationWriterPass introduced" This reverts commit 37e06e8f5c6ee39a1d7cbaf7d5f5a3ebfa1b4e15.	2020-06-17 14:02:34 +00:00
Kirill Naumov	466ce2232e	Revert "[InlineCost] PrinterPass prints constants to which instructions are simplified" This reverts commit 52b0db22f8cfb594c32389224570681d2d2c2f21.	2020-06-17 14:02:29 +00:00
Kirill Naumov	81e7237c3b	Revert "[InlineCost] GetElementPtr with constant operands" This reverts commit 34fba68d80051e3c53e7843157c036f6d511ae03.	2020-06-17 14:02:18 +00:00
Kirill Naumov	3f766b30ab	[InlineCost] GetElementPtr with constant operands If the GEP instruction contanins only constants as its arguments, then it should be recognized as a constant. For now, there was also added a flag to turn off this simplification if it causes any regressions ("disable-gep-const-evaluation") which is off by default. Once I gather needed data of the effectiveness of this simplification, the flag will be deleted. Reviewers: apilipenko, davidxl, mtrofin Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D81026	2020-06-17 13:40:19 +00:00
Kirill Naumov	d123d281c8	[InlineCost] PrinterPass prints constants to which instructions are simplified This patch enables printing of constants to see which instructions were constant-folded. Needed for tests and better visiual analysis of inliner's work. Reviewers: apilipenko, mtrofin, davidxl, fedor.sergeev Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D81024	2020-06-17 13:40:18 +00:00
Kirill Naumov	dc9522e0eb	[InlineCost] InlineCostAnnotationWriterPass introduced This class allows to see the inliner's decisions for better optimization verifications and tests. To use, use flag "-passes="print<inline-cost>"". Reviewers: apilipenko, mtrofin, davidxl, fedor.sergeev Reviewed By: mtrofin Differential revision: https://reviews.llvm.org/D81743	2020-06-17 13:40:17 +00:00
Hans Wennborg	aca27c6ef0	Revert "GlobalISel: Make LLT constructors constexpr" This reverts commit 5a95be22d248be654b992dfb25e3850dbb182a14. It causes GCC 5.3 to segfault: In file included from /work/llvm.monorepo/llvm/lib/Target/AArch64/GISel/AArch64InstructionSelector.cpp:357:0: lib/Target/AArch64/AArch64GenGlobalISel.inc:189:17: in constexpr expansion of ‘llvm::LLT::scalar(16u)’ lib/Target/AArch64/AArch64GenGlobalISel.inc:205:1: internal compiler error: Segmentation fault	2020-06-17 15:31:12 +02:00

1 2 3 4 5 ...

198599 Commits