llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Amara Emerson	d942a7b79b	[AArch64][GlobalISel] Fallback if the +strict-align target feature is given. Works around PR44246.	2020-01-23 14:45:29 -08:00
Florian Hahn	ebce5bf21c	[IPSCCP] Use ParamState for arguments at call sites. We currently use integer ranges to merge concrete function arguments. We use the ParamState range for those, but we only look up concrete values in the regular state. For concrete function arguments that are themselves arguments of the containing function, we can use the param state directly and improve the precision in some cases. Besides improving the results in some cases, this is also a small step towards switching to ValueLatticeElement, by allowing D60582 to be a NFC. Reviewers: efriedma, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D71836	2020-01-23 13:55:42 -08:00
Matt Arsenault	c5e9e39558	GlobalISel: Add MIPatternMatch for G_ICMP/G_FCMP	2020-01-23 13:30:47 -08:00
Matt Arsenault	95d5499a92	AMDGPU/GlobalISel: Fix RegBanKSelect for llvm.amdgcn.exp.compr This wasn't updated for the immarg handling change. We really need a verifier for this.	2020-01-23 13:30:46 -08:00
Teresa Johnson	dae138eca8	[ThinLTO] Summarize vcall_visibility metadata Summary: Second patch in series to support Safe Whole Program Devirtualization Enablement, see RFC here: http://lists.llvm.org/pipermail/llvm-dev/2019-December/137543.html Summarize vcall_visibility metadata in ThinLTO global variable summary. Depends on D71907. Reviewers: pcc, evgeny777, steven_wu Subscribers: mehdi_amini, Prazek, inglorion, hiraditya, dexonsmith, arphaman, ostannard, llvm-commits, cfe-commits, davidxl Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71911	2020-01-23 13:19:56 -08:00
Reid Kleckner	af435f4dd3	[PDB] Simplify API for making section map, NFC Prevents API misuse described in PR44495	2020-01-23 12:15:21 -08:00
Matt Arsenault	37207e405d	AMDGPU: Fix ubsan error Since register classes go up to 1024, 32 elements, all masks bits are needed and a 32-bit shift by 32 is illegal. We didn't have any instructions theoretically using a 32 element VGPR before d1dbb5e4718a8f845abf0783513a33a55429470b	2020-01-23 15:05:47 -05:00
Roman Lebedev	f72d7af9b1	[IR] Attribute/AttrBuilder: use Value::MaximumAlignment magic constant Summary: I initially encountered those assertions when trying to create this IR `alignment` attribute from clang's `__attribute__((assume_aligned(imm)))`, because until D72994 there is no sanity checking for the value of `imm`. But even then, we have `llvm::Value::MaximumAlignment` constant (which is `536870912`), which is enforced for clang attributes, and then there are some other magical constant (`0x40000000` i.e. `1073741824` i.e. `2 * 536870912`) in `Attribute::getWithAlignment()`/`AttrBuilder::addAlignmentAttr()`. I strongly suspect that `0x40000000` is incorrect, and that also should be `llvm::Value::MaximumAlignment`. Reviewers: erichkeane, hfinkel, jdoerfert, gchatelet, courbet Reviewed By: erichkeane Subscribers: hiraditya, cfe-commits, llvm-commits Tags: #llvm, #clang Differential Revision: https://reviews.llvm.org/D72998	2020-01-23 22:50:49 +03:00
Teresa Johnson	8920d6a40a	[WPD/VFE] Always emit vcall_visibility metadata for -fwhole-program-vtables Summary: First patch to support Safe Whole Program Devirtualization Enablement, see RFC here: http://lists.llvm.org/pipermail/llvm-dev/2019-December/137543.html Always emit !vcall_visibility metadata under -fwhole-program-vtables, and not just for -fvirtual-function-elimination. The vcall visibility metadata will (in a subsequent patch) be used to communicate to WPD which vtables are safe to devirtualize, and we will optionally convert the metadata to hidden visibility at link time. Subsequent follow on patches will help enable this by adding vcall_visibility metadata to the ThinLTO summaries, and always emit type test intrinsics under -fwhole-program-vtables (and not just for vtables with hidden visibility). In order to do this safely with VFE, since for VFE all vtable loads must be type checked loads which will no longer be the case, this patch adds a new "Virtual Function Elim" module flag to communicate to GlobalDCE whether to perform VFE using the vcall_visibility metadata. One additional advantage of using the vcall_visibility metadata to drive more WPD at LTO link time is that we can use the same mechanism to enable more aggressive VFE at LTO link time as well. The link time option proposed in the RFC will convert vcall_visibility metadata to hidden (aka linkage unit visibility), which combined with -fvirtual-function-elimination will allow it to be done more aggressively at LTO link time under the same conditions. Reviewers: pcc, ostannard, evgeny777, steven_wu Subscribers: mehdi_amini, Prazek, hiraditya, dexonsmith, davidxl, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D71907	2020-01-23 11:36:01 -08:00
Alina Sbirlea	8cfea297a5	[LoopIdiomRecognize] Teach LoopIdiomRecognize to preserve MemorySSA.	2020-01-23 11:31:12 -08:00
Alina Sbirlea	22d4f4cbf9	[IndVarSimplify] Fix for MemorySSA preserve.	2020-01-23 11:06:16 -08:00
Fangrui Song	0b827d4aa9	[AArch64][test] Fix MC/AArch64 tests after D72799	2020-01-23 10:47:50 -08:00
Fangrui Song	6e3b58e0b9	[AArch64][test] Fix tests after D72799	2020-01-23 10:45:15 -08:00
Justin Bogner	626b423640	[LoopUnroll] Avoid UB when converting from WeakVH to `Value ` Calling `operator` on a WeakVH with a null value yields a null reference, which is UB. Avoid this by implicitly converting the WeakVH to a `Value *` rather than dereferencing and then taking the address for the type conversion. Differential Revision: https://reviews.llvm.org/D73280	2020-01-23 10:36:39 -08:00
Danilo Carvalho Grael	698ac54e59	[SVE] Add SVE2 patterns for unpredicated multiply instructions Summary: Add patterns for SVE2 unpredicated multiply instructions: - mul, smulh, umulh, pmul, sqdmulh, sqrdmulh Reviewers: sdesmalen, huntergr, efriedma, c-rhodes, kmclaughlin, rengolin Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits, amehsan Tags: #llvm Differential Revision: https://reviews.llvm.org/D72799	2020-01-23 13:20:53 -05:00
Simon Pilgrim	7a10fa3e59	[X86] LowerRotate - early out for vector rotates by zero	2020-01-23 17:48:09 +00:00
Simon Pilgrim	4c18f5d376	[X86] Add test showing failure to remove vector rotate by zero	2020-01-23 17:48:08 +00:00
Simon Pilgrim	3a410151bf	[X86] Add AVX512 tests for vector rotations	2020-01-23 17:48:08 +00:00
Simon Pilgrim	9d10b6a6c7	[SelectionDAG] ComputeNumSignBits - add ISD::ADD demanded elts support	2020-01-23 17:48:07 +00:00
Sam Parker	acdd8b62e2	[RDA] Skip debug values Skip debug instructions when iterating through a block to find uses. Differential Revision: https://reviews.llvm.org/D73273	2020-01-23 17:04:54 +00:00
Matt Arsenault	3170227425	AMDGPU/GlobalISel: Select V_ADD3_U32/V_XOR3_B32 The other 3-op patterns should also be theoretically handled, but currently there's a bug in the inferred pattern complexity. I'm not sure what the error handling strategy should be for potential constant bus violations. I think the correct strategy is to never produce mixed SGPR and VGPR operands in a typical VOP instruction, which will trivially avoid them. However, it's possible to still have hand written MIR (or erroneously transformed code) with these operands. When these fold, the restriction will be violated. We currently don't have any verifiers for reg bank legality. For now, just ignore the restriction. It might be worth triggering a DAG fallback on verifier error.	2020-01-23 12:04:20 -05:00
Matt Arsenault	9a7dbde109	GlobalISel: Use Register	2020-01-23 12:04:20 -05:00
Simon Pilgrim	f1b4b4c1bb	[SelectionDAG] ComputeNumSignBits - add ISD::ADD vector support Add missing handling for (ADD (AND X, 1), -1) uniform vectors	2020-01-23 16:42:12 +00:00
Simon Pilgrim	94ace0b9c4	[X86][SSE] Add ComputeNumSignBits tests for (ADD (AND X, 1), -1) vectors	2020-01-23 16:42:11 +00:00
Guillaume Chatelet	2efa9bb646	[Alignment][NFC] Use Align with CreateAlignedStore Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet, bollu Subscribers: arsenm, jvesely, nhaehnle, hiraditya, kerbowa, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D73274	2020-01-23 17:34:32 +01:00
Matt Arsenault	7f79f10d3e	AMDGPU: Check for other uses when looking through casted select Fixes mesa regression on ext_transform_feedback-max-varyings	2020-01-23 11:31:24 -05:00
Sam Parker	ee505c7a11	[NFC][ARM] Add test	2020-01-23 16:21:52 +00:00
Simon Pilgrim	10568c1873	[SelectionDAG] ComputeNumSignBits - add ISD::SUB demanded elts support	2020-01-23 16:20:48 +00:00
Simon Pilgrim	aa8536e4c8	[X86][AVX] Add AVX1/AVX2 ashr vector tests	2020-01-23 16:20:48 +00:00
Michael Liao	2c5a63a728	Fix GCC warning/error '-fpermission'. NFC.	2020-01-23 10:45:02 -05:00
Krzysztof Parzyszek	0d83f9aecc	[Hexagon] Remove unused operand definitions: s10_0Imm and s10_6Imm	2020-01-23 09:38:54 -06:00
Sergej Jaskiewicz	22b60a6c62	Revert "[tablegen] Emit string literals instead of char arrays" This reverts commit ce23515f5ab01161c98449d833b3ae013b553aa8. That commit broke some builds on Windows: http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/13870	2020-01-23 18:22:22 +03:00
Alexey Lapshin	0d5d418268	[Dsymutil][Debuginfo][NFC] #4 Refactor dsymutil to separate DWARF optimizing part. Summary: The primary goal of this refactoring is to separate DWARF optimizing part. So that it could be reused by linker or by any other client. There was a thread on llvm-dev discussing the necessity of such a refactoring: http://lists.llvm.org/pipermail/llvm-dev/2019-September/135068.html. This is a final part from series of patches for dsymutil. Previous patches : D71068, D71839, D72476. This patch: 1. Creates lib/DWARFLinker interface : void addObjectFile(DwarfLinkerObjFile &ObjFile); bool link(); void setOptions; 1. Moves all linking logic from tools/dsymutil/DwarfLinkerForBinary into lib/DWARFLinker. 2. Renames RelocationManager into AddressesManager. 3. Remarks creation logic moved from separate parallel execution into object file loading routine. Testing: it passes "check-all" lit testing. MD5 checksum for clang .dSYM bundle matches for the dsymutil with/without that patch. Reviewers: JDevlieghere, friss, dblaikie, aprantl, jdoerfert Reviewed By: JDevlieghere Subscribers: merge_guards_bot, hiraditya, jfb, llvm-commits, probinson, thegameg Tags: #llvm, #debug-info Differential Revision: https://reviews.llvm.org/D72915	2020-01-23 18:16:32 +03:00
Kazushi (Jam) Marukawa	2c679e1d4c	[VE] add, sub, left/right shift isel patterns Summary: Add, sub, left/right shift isel patterns and tests for i32/i64 and fp32/fp64. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73207	2020-01-23 16:00:37 +01:00
Kazu Hirata	72ebb168c0	Revert "Resubmit: [JumpThreading] Thread jumps through two basic blocks" This reverts commit 53b68e676faf208b4a8f817e9bd4ddd522cc6006. Our internal tests are showing breakage with this patch.	2020-01-23 06:34:03 -08:00
Simon Moll	bcbd2887a3	[VE][NFC] re-write RR* isel class using null_frag Summary: Re-write RR* using null_frag to avoid duplication in upcoming patches. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D73259	2020-01-23 15:17:45 +01:00
Luke Drummond	93befba229	[tablegen] Emit string literals instead of char arrays This changes the generated (Instr\|Asm\|Reg\|Regclass)Name tables from this form: extern const char HexagonInstrNameData[] = { /* 0 / 'G', '_', 'F', 'L', 'O', 'G', '1', '0', 0, / 9 / 'E', 'N', 'D', 'L', 'O', 'O', 'P', '0', 0, / 18 / 'V', '6', '_', 'v', 'd', 'd', '0', 0, / 26 / 'P', 'S', '_', 'v', 'd', 'd', '0', 0, [...] }; ...to this: extern const char HexagonInstrNameData[] = { / 0 / "G_FLOG10\0" / 9 / "ENDLOOP0\0" / 18 / "V6_vdd0\0" / 26 */ "PS_vdd0\0" [...] }; This should make debugging and exploration a lot easier for mortals, while providing a significant compile-time reduction for common compilers. To avoid issues with low implementation limits, this is disabled by default for visual studio or when cross-compiling. To force output one way or the other, pass `--long-string-literals=<bool>` to `tablegen` Reviewers: mstorsjo, rnk Subscribers: llvm-commit Differential Revision: https://reviews.llvm.org/D73044	2020-01-23 13:57:20 +00:00
Jay Foad	d6a7ab5e54	[CodeGen] Make use of MachineInstrBuilder::getReg Reviewers: arsenm Subscribers: wdng, hiraditya, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73262	2020-01-23 13:38:13 +00:00
LLVM GN Syncbot	41ea30c256	[gn build] Port 2f6987ba61c	2020-01-23 13:34:12 +00:00
Sam Parker	003b950720	[NFC][RDA] Make the interface const Make all the public query methods const.	2020-01-23 13:32:11 +00:00
Simon Pilgrim	f164be40ec	[SelectionDAG] Compute Known + Sign Bits - merge INSERT_VECTOR_ELT known/unknown index paths Match the approach in SimplifyDemandedBits where we calculate the demanded elts and then have a common path for the ComputeKnownBits/ComputeNumSignBits call.	2020-01-23 13:31:37 +00:00
Fedor Sergeev	af7baa2a66	[LoopRotate] add ability to repeat loop rotation until non-deoptimizing exit is found In case of loops with multiple exit where all-but-one exit are deoptimizing it might happen that the first rotation will end up with latch having a deoptimizing exit. This makes the loop unsuitable for trip-count analysis (say, getLoopEstimatedTripCount) as well as for loop transformations that know how to handle multple deoptimizing exits. It pretty much means that canonical form in multple-deoptimizing-exits case should be with non-deoptimizing exit at latch. Teach loop-rotation to reach this canonical form by repeating rotation. -loop-rotate-multi option introduced to control this behavior, currently disabled by default. Reviewers: skatkov, asbirlea, reames, fhahn Reviewed By: skatkov Tags: #llvm Differential Revision: https://reviews.llvm.org/D73058	2020-01-23 15:56:24 +03:00
Guillaume Chatelet	084ea94702	[Alignement][NFC] Deprecate untyped CreateAlignedLoad Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: arsenm, jvesely, nhaehnle, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73260	2020-01-23 13:34:32 +01:00
Kerry McLaughlin	21172e27d1	[AArch64][SVE] Add first-faulting load intrinsic Summary: Implements the llvm.aarch64.sve.ldff1 intrinsic and DAG combine rules for first-faulting loads with sign & zero extends Reviewers: sdesmalen, efriedma, andwar, dancgr, rengolin Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cameron.mcinally, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73025	2020-01-23 11:57:16 +00:00
Simon Tatham	7180efac21	[ARM,MVE] Support immediate vbicq,vorrq,vmvnq intrinsics. Summary: Immediate vmvnq is code-generated as a simple vector constant in IR, and left to the backend to recognize that it can be created with an MVE VMVN instruction. The predicated version is represented as a select between the input and the same constant, and I've added a Tablegen isel rule to turn that into a predicated VMVN. (That should be better than the previous VMVN + VPSEL: it's the same number of instructions but now it can fold into an adjacent VPT block.) The unpredicated forms of VBIC and VORR are done by enabling the same isel lowering as for NEON, recognizing appropriate immediates and rewriting them as ARMISD::VBICIMM / ARMISD::VORRIMM SDNodes, which I then instruction-select into the right MVE instructions (now that I've also reworked those instructions to use the same MC operand encoding). In order to do that, I had to promote the Tablegen SDNode instance `NEONvorrImm` to a general `ARMvorrImm` available in MVE as well, and similarly for `NEONvbicImm`. The predicated forms of VBIC and VORR are represented as a vector select between the original input vector and the output of the unpredicated operation. The main convenience of this is that it still lets me use the existing isel lowering for VBICIMM/VORRIMM, and not have to write another copy of the operand encoding translation code. This intrinsic family is the first to use the `imm_simd` system I put into the MveEmitter tablegen backend. So, naturally, it showed up a bug or two (emitting bogus range checks and the like). Fixed those, and added a full set of tests for the permissible immediates in the existing Sema test. Also adjusted the isel pattern for `vmovlb.u8`, which stopped matching because lowering started turning its input into a VBICIMM. Now it recognizes the VBICIMM instead. Reviewers: dmgreen, MarkMurrayARM, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72934	2020-01-23 11:53:52 +00:00
Simon Tatham	c48cf33847	[ARM,MVE] Revise immediate VBIC/VORR to look more like NEON. Summary: In NEON, the immediate forms of VBIC and VORR are each represented as a single MC instruction, which takes its immediate operand already encoded in a NEON-friendly format: 8 data bits, plus some control bits indicating how to expand them into a full vector. In MVE, we represented immediate VBIC and VORR as four separate MC instructions each, for an 8-bit immediate shifted left by 0, 8, 16 or 24 bits. For each one, the value of the immediate operand is in the 'natural' form, i.e. the numerical value that would actually be BICed or ORRed into each vector lane (and also the same value shown in assembly). For example, MVE_VBICIZ16v4i32 takes an operand such as 0xab0000, which NEON would represent as 0xab \| (control bits << 8). The MVE approach is superficially nice (it makes assembly input and output easy, and it's also nice if you're manually constructing immediate VBICs). But it turns out that it's better for isel if we make the NEON and MVE instructions work the same, because the ARMISD::VBICIMM and VORRIMM node types already encode their immediate into the NEON format, so it's easier if we can just use it. Also, this commit reduces the total amount of code rather than increasing it, which is surely an indication that it really is simpler to do it this way! Reviewers: dmgreen, ostannard, miyuki, MarkMurrayARM Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73205	2020-01-23 11:53:52 +00:00
Miloš Stojanović	02c4274a22	[llvm-exegesis] Don't use unsupported aliasing instructions Since some instruction types aren't allowed as the main instruction also don't allow them for aliasing instructions. Differential Revision: https://reviews.llvm.org/D73220	2020-01-23 12:42:42 +01:00
Thomas Preud'homme	0c1c818a0d	[FileCheck] Strengthen error checks in unit tests Summary: This commit adds error checking beyond UndefVarError and fix a number of Error/Expected related idioms: - use (EXPECT\|ASSERT)_THAT_(ERROR\|EXPECTED) instead of errorToBool or boolean operator - ASSERT when a further check require the check to be successful to give a correct result Reviewers: jhenderson, jdenny, probinson, grimar, arichardson, rnk Reviewed By: jhenderson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72914	2020-01-23 11:31:53 +00:00
Simon Pilgrim	2c3b4faf6d	[SelectionDAG] ComputeKnownBits - merge EXTRACT_VECTOR_ELT known/unknown index paths Match the approach in SimplifyDemandedBits/ComputeNumSignBits where we calculate the demanded elts and then have a common path for the ComputeKnownBits call.	2020-01-23 11:29:16 +00:00
Simon Pilgrim	1b80c25ad9	[SelectionDAG] Compute Known + Sign Bits - merge INSERT_SUBVECTOR known/unknown index paths Match the approach in SimplifyDemandedBits where we calculate the demanded elts and then have a common path for the ComputeKnownBits/ComputeNumSignBits call, additionally we only ever need original demanded elts of the base vector even if the index is unknown.	2020-01-23 11:29:15 +00:00

1 2 3 4 5 ...

190580 Commits