llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Sjoerd Meijer	8b8ce6b228	[AArch64] More @llvm.fma.f16 tests Follow up of rL371321 that added FMA FP16 patterns. This adds more tests for @llvm.fma.f16. This probably shows we miss one fmsub optimisation opportunity, which I will look into. llvm-svn: 371833	2019-09-13 09:44:13 +00:00
Sam Tebbs	bc425164d2	[ARM] Add support for MVE vmaxv and vminv This patch adds vecreduce_smax, vecreduce_umax, vecreduce_smin, vecreduce_umin and selection for vmaxv and vminv. Differential Revision: https://reviews.llvm.org/D66413 llvm-svn: 371827	2019-09-13 09:11:46 +00:00
George Rimar	d0e41dee2b	[llvm-objdump] Fix llvm-objdump --all-headers output order Patch by Justice Adams! Made llvm-objdump --all-headers output match the order of GNU objdump for compatibility reasons. Old order of the headers output: * file header * section header table * symbol table * program header table * dynamic section New order of the headers output (GNU compatible): * file header information * program header table * dynamic section * section header table * symbol table (Relevant BugZilla Bug: https://bugs.llvm.org/show_bug.cgi?id=41830) Differential revision: https://reviews.llvm.org/D67357 llvm-svn: 371826	2019-09-13 08:56:28 +00:00
Dmitri Gribenko	c6af2e2ae7	Revert "Fix test failures after r371640" This reverts commit r371645, because r371640 was reverted. llvm-svn: 371824	2019-09-13 08:26:59 +00:00
Matt Arsenault	5ed6e85ff3	AMDGPU/GlobalISel: Legalize s32->s16 G_SITOFP/G_UITOFP llvm-svn: 371811	2019-09-13 04:04:55 +00:00
Shiva Chen	41a7c547de	[RISCV] Support stack offset exceed 32-bit for RV64 Differential Revision: https://reviews.llvm.org/D61884 llvm-svn: 371810	2019-09-13 04:03:32 +00:00
Shiva Chen	f88372996a	Revert "[RISCV] Support stack offset exceed 32-bit for RV64" This reverts commit 1c340c62058d4115d21e5fa1ce3a0d094d28c792. llvm-svn: 371809	2019-09-13 04:03:24 +00:00
Matt Arsenault	d221bbcf9c	AMDGPU/GlobalISel: Fix RegBankSelect for amdgcn.else llvm-svn: 371808	2019-09-13 03:55:49 +00:00
Matt Arsenault	a963b653dc	AMDGPU/GlobalISel: Select 16-bit VALU bit ops llvm-svn: 371807	2019-09-13 03:55:43 +00:00
Shiva Chen	388575ab79	[RISCV] Support stack offset exceed 32-bit for RV64 Differential Revision: https://reviews.llvm.org/D61884 llvm-svn: 371806	2019-09-13 02:50:13 +00:00
Matt Arsenault	c7d2e6ca93	AMDGPU/GlobalISel: Legalize G_FFLOOR llvm-svn: 371803	2019-09-13 01:48:15 +00:00
Tim Shen	846797bc3a	Temporarily revert r371640 "LiveIntervals: Split live intervals on multiple dead defs". It reveals a miscompile on Hexagon. See PR43302 for details. llvm-svn: 371802	2019-09-13 01:34:25 +00:00
Matt Arsenault	48ccbbfecd	AMDGPU/GlobalISel: Legalize G_FMAD Unlike SelectionDAG, treat this as a normally legalizable operation. In SelectionDAG this is supposed to only ever be formed if it's legal, but I've found that to be restricting. For AMDGPU this is contextually legal depending on whether denormal flushing is allowed in the use function. Technically we currently treat the denormal mode as a subtarget feature, so custom lowering could be avoided. However I consider this to be a defect, and this should be contextually dependent on the controllable rounding mode of the parent function. llvm-svn: 371800	2019-09-13 00:44:35 +00:00
Matt Arsenault	bd2bbeaa29	AMDGPU/GlobalISel: Select G_CTPOP llvm-svn: 371798	2019-09-13 00:11:20 +00:00
Matt Arsenault	13e6fc349a	LiveIntervals: Remove assertion This testcase is invalid, and caught by the verifier. For the verifier to catch it, the live interval computation needs to complete. Remove the assert so the verifier catches this, which is less confusing. In this testcase there is an undefined use of a subregister, and lanes which aren't used or defined. An equivalent testcase with the super-register shrunk to have no untouched lanes already hit this verifier error. llvm-svn: 371792	2019-09-12 23:46:51 +00:00
Matt Arsenault	13e7d19a0d	AMDGPU: Inline constant when materializing FI with add on gfx9 This was relying on the SGPR usable for the carry out clobber to also be used for the input. There was no carry out on gfx9. With no carry out clobber to worry about, the literal can just be directly used with a VOP2 add. llvm-svn: 371791	2019-09-12 23:46:46 +00:00
Philip Reames	448b18aca0	[Test] Restructure check lines to show differences between modes more clearly With the landing of the previous patch (in particular D66318) there are a lot fewer diffs now. I added an experimental O0 line, and updated all the tests to group experimental and non-experimental O0/O3 together. Skimming the remaining diffs, there's only a few which are obviously incorrect. There's a large number which are questionable, so more todo. llvm-svn: 371790	2019-09-12 23:22:37 +00:00
Jessica Paquette	8a8cc5c189	[AArch64][GlobalISel] Support tail calling with swiftself parameters Swiftself uses a callee-saved register. We can tail call when the register used in the caller and callee is the same. This behaviour is equivalent to that in `TargetLowering::parametersInCSRMatch`. Update call-translator-tail-call.ll to verify that we can do this. When we support inline assembly, we can write a check similar to the one in the general swiftself.ll. For now, we need to verify that we get the correct COPY instruction after call lowering. Differential Revision: https://reviews.llvm.org/D67511 llvm-svn: 371788	2019-09-12 23:00:59 +00:00
Philip Reames	ba1f39ccae	[SDAG] Update generic code to conservatively check for isAtomic in addition to isVolatile This is the first sweep of generic code to add isAtomic bailouts where appropriate. The intention here is to have the switch from AtomicSDNode to LoadSDNode/StoreSDNode be close to NFC; that is, I'm not looking to allow additional optimizations at this time. That will come later. See D66309 for context. Differential Revision: https://reviews.llvm.org/D66318 llvm-svn: 371786	2019-09-12 22:49:17 +00:00
Jessica Paquette	d84c7b0582	[AArch64][GlobalISel] Support sibling calls with outgoing arguments This adds support for lowering sibling calls with outgoing arguments. e.g ``` define void @foo(i32 %a) ``` Support is ported from AArch64ISelLowering's `isEligibleForTailCallOptimization`. The only thing that is missing is a full port of `TargetLowering::parametersInCSRMatch`. So, if we're using swiftself, we'll never tail call. - Rename `analyzeCallResult` to `analyzeArgInfo`, since the function is now used for both outgoing and incoming arguments - Teach `OutgoingArgHandler` about tail calls. Tail calls use frame indices for stack arguments. - Teach `lowerFormalArguments` to set the bytes in the caller's stack argument area. This is used later to check if the tail call's parameters will fit on the caller's stack. - Add `areCalleeOutgoingArgsTailCallable` to perform the eligibility check on the callee's outgoing arguments. For testing: - Update call-translator-tail-call to verify that we can now tail call with outgoing arguments, use G_FRAME_INDEX for stack arguments, and respect the size of the caller's stack - Remove GISel-specific check lines from speculation-hardening.ll, since GISel now tail calls like the other selectors - Add a GISel test line to tailcall-string-rvo.ll since we can tail call in that test now - Add a GISel test line to tailcall_misched_graph.ll since we tail call there now. Add specific check lines for GISel, since the debug output from the machine-scheduler differs with GlobalISel. The dependency still holds, but the output comes out in a different order. Differential Revision: https://reviews.llvm.org/D67471 llvm-svn: 371780	2019-09-12 22:10:36 +00:00
Craig Topper	862ec62f6f	[PowerPC] Remove the SPE4RC register class and instead add f32 to the GPRC register class. Summary: Since the SPE4RC register class contains an identical set of registers and an identical spill size to the GPRC class it's slightly confusing the tablegen emitter. It's preventing the GPRC_and_GPRC_NOR0 synthesized register class from inheriting VTs and AltOrders from GPRC or GPRC_NOR0. This is because SPE4RC is found first in the super register class list when inheriting these properties and it doesn't set the VTs or AltOrders the same way as GPRC or GPRC_NOR0. This patch replaces all uses of SPE4RC with GPRC and allows GPRC and GPRC_NOR0 to contain f32. The test changes here are because the AltOrders are being inherited to GPRC_NOR0 now. Found while trying to determine if getCommonSubClass needs to take a VT argument. It was originally added to support fp128 on x86-64, I've changed some things about that so that it might not be needed anymore. But a PowerPC test crashed without it and I think it's due to this subclass issue. Reviewers: jhibbits, nemanjai, kbarton, hfinkel Subscribers: wuzish, nemanjai, mehdi_amini, hiraditya, kbarton, MaskRay, dexonsmith, jsji, shchenz, steven.zhang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67513 llvm-svn: 371779	2019-09-12 22:07:35 +00:00
Philip Reames	f7b66a883e	Remove a duplicate test Turns out I'd already added exactly the same test under the name non_unit_stride. llvm-svn: 371777	2019-09-12 21:40:15 +00:00
Philip Reames	3b71e989fe	[SCEV] Add smin support to getRangeRef We were failing to compute trip counts (both exact and maximum) for any loop which involved a comparison against either an umin or smin. It looks like this simply got missed when we added smin/umin to SCEV. (Note: umin was submitted separately earlier today. Turned out two folks hit this at the same time.) Differential Revision: https://reviews.llvm.org/D67514 llvm-svn: 371776	2019-09-12 21:32:27 +00:00
Craig Topper	be6798af61	[DAGCombiner][X86] Pass the CmpOpVT to reduceSelectOfFPConstantLoads so X86 can exclude fp128 compares. The X86 decision assumes the compare will produce a result in an XMM register, but that can't happen for an fp128 compare since those go to a libcall that returns an i32. Pass the VT so X86 can check the type. llvm-svn: 371775	2019-09-12 21:30:18 +00:00
Evandro Menezes	3b3c47e0cc	[ConstantFolding] Expand folding of some library functions Expanding the folding of `nearbyint()`, `rint()` and `trunc()` to library functions, in addition to the current support for intrinsics. Differential revision: https://reviews.llvm.org/D67468 llvm-svn: 371774	2019-09-12 21:23:22 +00:00
Tim Shen	3de03a4c3d	Fix llvm-reduce tests so that they don't assume the source code is writable. Instead of copying over the original file permissions, just create a new file and add the executable bit. llvm-svn: 371772	2019-09-12 21:03:49 +00:00
Florian Hahn	9748747e28	[LV] Update test case after r371768. llvm-svn: 371769	2019-09-12 20:07:17 +00:00
Florian Hahn	b59499608a	[SCEV] Support SCEVUMinExpr in getRangeRef. This patch adds support for SCEVUMinExpr to getRangeRef, similar to the support for SCEVUMaxExpr. Reviewers: sanjoy.google, efriedma, reames, nikic Reviewed By: sanjoy.google Differential Revision: https://reviews.llvm.org/D67177 llvm-svn: 371768	2019-09-12 20:03:32 +00:00
David Blaikie	a2200faeaf	llvm-reduce: For now, mark these tests as requiring a shell (since they execute shell scripts/that's the only entry point at the moment) llvm-svn: 371764	2019-09-12 19:50:54 +00:00
Philip Reames	d7acb3e35e	Precommit tests for D67514 llvm-svn: 371762	2019-09-12 19:34:27 +00:00
David Blaikie	70d10faac3	llvm-reduce: Remove unused plugin support/requirements llvm-svn: 371755	2019-09-12 18:52:31 +00:00
Alina Sbirlea	ba51045595	[LICM/AST] Check if the AliasAny set is removed from the tracker. Summary: Resolves PR38513. Credit to @bjope for debugging this. Reviewers: hfinkel, uabelho, bjope Subscribers: sanjoy.google, bjope, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67417 llvm-svn: 371752	2019-09-12 18:09:47 +00:00
Sanjay Patel	dd417415f3	[InstCombine] add tests for fptrunc; NFC llvm-svn: 371750	2019-09-12 18:00:11 +00:00
Alina Sbirlea	6d1edda49a	[MemorySSA] Pass (for update) MSSAU when hoisting instructions. Summary: Pass MSSAU to makeLoopInvariant in order to properly update MSSA. Reviewers: george.burgess.iv Subscribers: Prazek, sanjoy.google, uabelho, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67470 llvm-svn: 371748	2019-09-12 17:12:51 +00:00
Philip Reames	1bc2c14283	Precommit tests for generalization of load dereferenceability in loop llvm-svn: 371747	2019-09-12 17:09:01 +00:00
Sanjay Patel	901e3163db	[InstCombine] reduce test noise and regenerate CHECK lines; NFC llvm-svn: 371746	2019-09-12 17:07:01 +00:00
Philip Reames	caf2d0f40c	[LV] Support invariant addresses in speculation logic Implement a TODO from rL371452, and handle loop invariant addresses in predicated blocks. If we can prove that the load is safe to speculate into the header, then we can avoid using a masked.load in favour of a normal load. This is mostly about vectorization robustness. In the common case, it's generally expected that LICM/LoadStorePromotion would have eliminated such loads entirely. Differential Revision: https://reviews.llvm.org/D67372 llvm-svn: 371745	2019-09-12 16:49:10 +00:00
David Green	afc4123d6c	[CGP] Ensure sinking multiple instructions does not invalidate dominance checks In MVE, as of rL371218, we are attempting to sink chains of instructions such as: %l1 = insertelement <8 x i8> undef, i8 %l0, i32 0 %broadcast.splat26 = shufflevector <8 x i8> %l1, <8 x i8> undef, <8 x i32> zeroinitializer In certain situations though, we can end up breaking the dominance relations of instructions. This happens when we sink the instruction into a loop, but cannot remove the originals. The Use is updated, which might in fact be a Use from the second instruction to the first. This attempts to fix that by reversing the order of instructions that are sunk, and ensuring that we update the uses on new instructions if they have already been sunk, not the old ones. Differential Revision: https://reviews.llvm.org/D67366 llvm-svn: 371743	2019-09-12 16:00:07 +00:00
Roman Lebedev	ae40d1323c	[InstCombine][InstSimplify] Move constant-folding tests in result-of-usub-is-non-zero-and-no-overflow.ll llvm-svn: 371737	2019-09-12 14:12:31 +00:00
Roman Lebedev	256ad0deef	[NFC][InstCombine][InstSimplify] Add test for "add-of-negative is non-zero and no overflow" (PR43259) https://rise4fun.com/Alive/ska https://rise4fun.com/Alive/9iX https://bugs.llvm.org/show_bug.cgi?id=43259 llvm-svn: 371736	2019-09-12 14:12:20 +00:00
Sanjay Patel	e5665905a2	[ConstProp] allow folding for fma that produces NaN Folding for fma/fmuladd was added here: rL202914 ...and as seen in existing/unchanged tests, that works to propagate NaN if it's already an input, but we should fold an fma() that creates NaN too. From IEEE-754-2008 7.2 "Invalid Operation", there are 2 clauses that apply to fma, so I added tests for those patterns: c) fusedMultiplyAdd: fusedMultiplyAdd(0, ∞, c) or fusedMultiplyAdd(∞, 0, c) unless c is a quiet NaN; if c is a quiet NaN then it is implementation defined whether the invalid operation exception is signaled d) addition or subtraction or fusedMultiplyAdd: magnitude subtraction of infinities, such as: addition(+∞, −∞) Differential Revision: https://reviews.llvm.org/D67446 llvm-svn: 371735	2019-09-12 14:10:50 +00:00
Petar Avramovic	db34bdc442	[MIPS GlobalISel] Select indirect branch Select G_BRINDIRECT for MIPS32. Differential Revision: https://reviews.llvm.org/D67441 llvm-svn: 371730	2019-09-12 11:44:36 +00:00
Petar Avramovic	9b40572197	[MIPS GlobalISel] Lower G_DYN_STACKALLOC IRTranslator creates G_DYN_STACKALLOC instruction during expansion of alloca when argument that tells number of elements to allocate on stack is a virtual register. Use default lowering for MIPS32. Differential Revision: https://reviews.llvm.org/D67440 llvm-svn: 371728	2019-09-12 11:39:50 +00:00
Petar Avramovic	8ff3e07274	[MIPS GlobalISel] Select G_IMPLICIT_DEF G_IMPLICIT_DEF is used for both integer and floating point implicit-def. Handle G_IMPLICIT_DEF as ambiguous opcode in MipsRegisterBankInfo. Select G_IMPLICIT_DEF for MIPS32. Differential Revision: https://reviews.llvm.org/D67439 llvm-svn: 371727	2019-09-12 11:32:38 +00:00
Tim Northover	1bb14916f2	AArch64: support arm64_32, an ILP32 slice for watchOS. This is the main CodeGen patch to support the arm64_32 watchOS ABI in LLVM. FastISel is mostly disabled for now since it would generate incorrect code for ILP32. llvm-svn: 371722	2019-09-12 10:22:23 +00:00
Roman Lebedev	5fe74069b5	[InstSimplify] simplifyUnsignedRangeCheck(): handle more cases (PR43251) Summary: I don't have a direct motivational case for this, but it would be good to have this for completeness/symmetry. This pattern is basically the motivational pattern from https://bugs.llvm.org/show_bug.cgi?id=43251 but with a different predicate that requires that the offset is non-zero. The completeness bit comes from the fact that a similar pattern (offset != zero) will be needed for https://bugs.llvm.org/show_bug.cgi?id=43259, so it'd seem to be good to not overlook very similar patterns. Proofs: https://rise4fun.com/Alive/21b Also, there is something odd with `isKnownNonZero()`: if the non-zero knowledge was specified as an assumption, it didn't pick it up (PR43267) Reviewers: spatel, nikic, xbolva00 Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67411 llvm-svn: 371718	2019-09-12 09:26:17 +00:00
Kai Luo	d9647b46e6	[PowerPC][MCP][NFC] Pre-commit test cases for https://reviews.llvm.org/D65267 llvm-svn: 371717	2019-09-12 09:00:44 +00:00
Qiu Chaofan	3d8ed5a845	[DAGCombiner] Improve division estimation of floating points. Current implementation of estimating divisions loses precision since it estimates reciprocal first and does multiplication. This patch is to re-order arithmetic operations in the last iteration in DAGCombiner to improve the accuracy. Reviewed By: Sanjay Patel, Jinsong Ji Differential Revision: https://reviews.llvm.org/D66050 llvm-svn: 371713	2019-09-12 07:51:24 +00:00
David Blaikie	11d923a343	Reapply "llvm-reduce: Add pass to reduce parameters" Fixing a couple of asan-identified bugs * use of an invalid "Use" iterator after the element was removed * use of StringRef to Function name after the Function was erased This reapplies r371567, which was reverted in r371580. llvm-svn: 371700	2019-09-12 01:20:48 +00:00
David Blaikie	bcb0f91086	PR43278: llvm-reduce: Use temporary file names (and ToolOutputFile) rather than unique ones - to ensure they're cleaned up This modifies the tool somewhat to only create files when about to run the "interestingness" test, and delete them immediately after - this means some more files will be created sometimes (when "double checking" work - which should probably be fixed/avoided anyway). This now creates temporary files, rather than only unique ones, and also uses ToolOutputFile (without ever calling "keep") to ensure the files are deleted as soon as the interestingness test is run. llvm-svn: 371696	2019-09-12 00:31:57 +00:00
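
The [ConstProp] fma commit in the list above (e5665905a2) quotes two IEEE-754-2008 7.2 "Invalid Operation" clauses under which an fma creates a NaN from non-NaN inputs. The following is a minimal Python sketch of those two input patterns, not the LLVM implementation. It assumes Python floats are IEEE-754 binary64, and `unfused_fma` is a hypothetical stand-in that uses a separate multiply and add; it rounds twice, so it is not a true fused operation, but it reaches the same invalid cases.

```python
import math

INF = math.inf


def unfused_fma(a: float, b: float, c: float) -> float:
    """Hypothetical stand-in for fma(a, b, c): separate multiply, then add.

    Not a fused operation (it rounds twice); used here only to show which
    inputs produce NaN.
    """
    return a * b + c


# Clause (c): 0 * inf (in either operand order) is an invalid operation.
case_c = unfused_fma(0.0, INF, 1.0)
case_c_swapped = unfused_fma(INF, 0.0, 1.0)

# Clause (d): the product is +inf and the addend is -inf,
# a magnitude subtraction of infinities.
case_d = unfused_fma(INF, 1.0, -INF)

# A valid case for contrast: infinity simply propagates.
valid = unfused_fma(INF, 2.0, 1.0)

print(math.isnan(case_c), math.isnan(case_c_swapped), math.isnan(case_d), valid)
```

All three invalid cases yield NaN even though no input is NaN, which is the situation the commit message says should also be constant-folded.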


64980 Commits