llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Matt Arsenault	f76556c503	AMDGPU/GlobalISel: Fix assert on multi-return side effect intrinsics llvm.amdgcn.else hits this. llvm-svn: 371812	2019-09-13 04:12:12 +00:00
Matt Arsenault	5ed6e85ff3	AMDGPU/GlobalISel: Legalize s32->s16 G_SITOFP/G_UITOFP llvm-svn: 371811	2019-09-13 04:04:55 +00:00
Shiva Chen	41a7c547de	[RISCV] Support stack offset exceed 32-bit for RV64 Differential Revision: https://reviews.llvm.org/D61884 llvm-svn: 371810	2019-09-13 04:03:32 +00:00
Shiva Chen	f88372996a	Revert "[RISCV] Support stack offset exceed 32-bit for RV64" This reverts commit 1c340c62058d4115d21e5fa1ce3a0d094d28c792. llvm-svn: 371809	2019-09-13 04:03:24 +00:00
Matt Arsenault	d221bbcf9c	AMDGPU/GlobalISel: Fix RegBankSelect for amdgcn.else llvm-svn: 371808	2019-09-13 03:55:49 +00:00
Matt Arsenault	a963b653dc	AMDGPU/GlobalISel: Select 16-bit VALU bit ops llvm-svn: 371807	2019-09-13 03:55:43 +00:00
Shiva Chen	388575ab79	[RISCV] Support stack offset exceed 32-bit for RV64 Differential Revision: https://reviews.llvm.org/D61884 llvm-svn: 371806	2019-09-13 02:50:13 +00:00
Matt Arsenault	c7d2e6ca93	AMDGPU/GlobalISel: Legalize G_FFLOOR llvm-svn: 371803	2019-09-13 01:48:15 +00:00
Tim Shen	846797bc3a	Temporarily revert r371640 "LiveIntervals: Split live intervals on multiple dead defs". It reveals a miscompile on Hexagon. See PR43302 for details. llvm-svn: 371802	2019-09-13 01:34:25 +00:00
Matt Arsenault	48ccbbfecd	AMDGPU/GlobalISel: Legalize G_FMAD Unlike SelectionDAG, treat this as a normally legalizable operation. In SelectionDAG this is supposed to only ever be formed if it's legal, but I've found that to be restricting. For AMDGPU this is contextually legal depending on whether denormal flushing is allowed in the use function. Technically we currently treat the denormal mode as a subtarget feature, so custom lowering could be avoided. However I consider this to be a defect, and this should be contextually dependent on the controllable rounding mode of the parent function. llvm-svn: 371800	2019-09-13 00:44:35 +00:00
Matt Arsenault	bd2bbeaa29	AMDGPU/GlobalISel: Select G_CTPOP llvm-svn: 371798	2019-09-13 00:11:20 +00:00
Matt Arsenault	62a482c739	DAG/GlobalISel: Correct type profile of bitcount ops The result integer does not need to be the same width as the input. AMDGPU, NVPTX, and Hexagon all have patterns working around the types matching. GlobalISel defines these as being different type indexes. llvm-svn: 371797	2019-09-13 00:11:14 +00:00
Matt Arsenault	7413fb9a27	AMDGPU: Add immarg to llvm.amdgcn.init.exec.from.input As far as I can tell this has to be a constant. llvm-svn: 371793	2019-09-12 23:46:54 +00:00
Matt Arsenault	13e6fc349a	LiveIntervals: Remove assertion This testcase is invalid, and caught by the verifier. For the verifier to catch it, the live interval computation needs to complete. Remove the assert so the verifier catches this, which is less confusing. In this testcase there is an undefined use of a subregister, and lanes which aren't used or defined. An equivalent testcase with the super-register shrunk to have no untouched lanes already hit this verifier error. llvm-svn: 371792	2019-09-12 23:46:51 +00:00
Matt Arsenault	13e7d19a0d	AMDGPU: Inline constant when materializing FI with add on gfx9 This was relying on the SGPR usable for the carry out clobber to also be used for the input. There was no carry out on gfx9. With no carry out clobber to worry about, the literal can just be directly used with a VOP2 add. llvm-svn: 371791	2019-09-12 23:46:46 +00:00
Philip Reames	448b18aca0	[Test] Restructure check lines to show differences between modes more clearly With the landing of the previous patch (in particular D66318) there are a lot fewer diffs now. I added an experimental O0 line, and updated all the tests to group experimental and non-experimental O0/O3 together. Skimming the remaining diffs, there's only a few which are obviously incorrect. There's a large number which are questionable, so more todo. llvm-svn: 371790	2019-09-12 23:22:37 +00:00
Philip Reames	f9198adb61	Rename nonvolatile_load/store to simple_load/store [NFC] Implement the TODO from D66318. llvm-svn: 371789	2019-09-12 23:03:39 +00:00
Jessica Paquette	8a8cc5c189	[AArch64][GlobalISel] Support tail calling with swiftself parameters Swiftself uses a callee-saved register. We can tail call when the register used in the caller and callee is the same. This behaviour is equivalent to that in `TargetLowering::parametersInCSRMatch`. Update call-translator-tail-call.ll to verify that we can do this. When we support inline assembly, we can write a check similar to the one in the general swiftself.ll. For now, we need to verify that we get the correct COPY instruction after call lowering. Differential Revision: https://reviews.llvm.org/D67511 llvm-svn: 371788	2019-09-12 23:00:59 +00:00
Philip Reames	ba1f39ccae	[SDAG] Update generic code to conservatively check for isAtomic in addition to isVolatile This is the first sweep of generic code to add isAtomic bailouts where appropriate. The intention here is to have the switch from AtomicSDNode to LoadSDNode/StoreSDNode be close to NFC; that is, I'm not looking to allow additional optimizations at this time. That will come later. See D66309 for context. Differential Revision: https://reviews.llvm.org/D66318 llvm-svn: 371786	2019-09-12 22:49:17 +00:00
Greg Clayton	00330f5bfd	[NFC] Fix file header filename to be Range.h llvm-svn: 371783	2019-09-12 22:23:03 +00:00
DeForest Richards	ebd06b083a	[Docs] Adds page for reference docs Adds a Reference Documentation page for LLVM and API reference documentation. llvm-svn: 371782	2019-09-12 22:17:04 +00:00
Jessica Paquette	d84c7b0582	[AArch64][GlobalISel] Support sibling calls with outgoing arguments This adds support for lowering sibling calls with outgoing arguments. e.g ``` define void @foo(i32 %a) ``` Support is ported from AArch64ISelLowering's `isEligibleForTailCallOptimization`. The only thing that is missing is a full port of `TargetLowering::parametersInCSRMatch`. So, if we're using swiftself, we'll never tail call. - Rename `analyzeCallResult` to `analyzeArgInfo`, since the function is now used for both outgoing and incoming arguments - Teach `OutgoingArgHandler` about tail calls. Tail calls use frame indices for stack arguments. - Teach `lowerFormalArguments` to set the bytes in the caller's stack argument area. This is used later to check if the tail call's parameters will fit on the caller's stack. - Add `areCalleeOutgoingArgsTailCallable` to perform the eligibility check on the callee's outgoing arguments. For testing: - Update call-translator-tail-call to verify that we can now tail call with outgoing arguments, use G_FRAME_INDEX for stack arguments, and respect the size of the caller's stack - Remove GISel-specific check lines from speculation-hardening.ll, since GISel now tail calls like the other selectors - Add a GISel test line to tailcall-string-rvo.ll since we can tail call in that test now - Add a GISel test line to tailcall_misched_graph.ll since we tail call there now. Add specific check lines for GISel, since the debug output from the machine-scheduler differs with GlobalISel. The dependency still holds, but the output comes out in a different order. Differential Revision: https://reviews.llvm.org/D67471 llvm-svn: 371780	2019-09-12 22:10:36 +00:00
Craig Topper	862ec62f6f	[PowerPC] Remove the SPE4RC register class and instead add f32 to the GPRC register class. Summary: Since the SPE4RC register class contains an identical set of registers and an identical spill size to the GPRC class, it's slightly confusing the tablegen emitter. It's preventing the GPRC_and_GPRC_NOR0 synthesized register class from inheriting VTs and AltOrders from GPRC or GPRC_NOR0. This is because SPE4RC is found first in the super register class list when inheriting these properties and it doesn't set the VTs or AltOrders the same way as GPRC or GPRC_NOR0. This patch replaces all uses of SPE4RC with GPRC and allows GPRC and GPRC_NOR0 to contain f32. The test changes here are because the AltOrders are being inherited to GPRC_NOR0 now. Found while trying to determine if getCommonSubClass needs to take a VT argument. It was originally added to support fp128 on x86-64, I've changed some things about that so that it might not be needed anymore. But a PowerPC test crashed without it and I think it's due to this subclass issue. Reviewers: jhibbits, nemanjai, kbarton, hfinkel Subscribers: wuzish, nemanjai, mehdi_amini, hiraditya, kbarton, MaskRay, dexonsmith, jsji, shchenz, steven.zhang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67513 llvm-svn: 371779	2019-09-12 22:07:35 +00:00
Philip Reames	f7b66a883e	Remove a duplicate test Turns out I'd already added exactly the same test under the name non_unit_stride. llvm-svn: 371777	2019-09-12 21:40:15 +00:00
Philip Reames	3b71e989fe	[SCEV] Add smin support to getRangeRef We were failing to compute trip counts (both exact and maximum) for any loop which involved a comparison against either an umin or smin. It looks like this simply got missed when we added smin/umin to SCEV. (Note: umin was submitted separately earlier today. Turned out two folks hit this at the same time.) Differential Revision: https://reviews.llvm.org/D67514 llvm-svn: 371776	2019-09-12 21:32:27 +00:00
Craig Topper	be6798af61	[DAGCombiner][X86] Pass the CmpOpVT to reduceSelectOfFPConstantLoads so X86 can exclude fp128 compares. The X86 decision assumes the compare will produce a result in an XMM register, but that can't happen for an fp128 compare since those go to a libcall that returns an i32. Pass the VT so X86 can check the type. llvm-svn: 371775	2019-09-12 21:30:18 +00:00
Evandro Menezes	3b3c47e0cc	[ConstantFolding] Expand folding of some library functions Expanding the folding of `nearbyint()`, `rint()` and `trunc()` to library functions, in addition to the current support for intrinsics. Differential revision: https://reviews.llvm.org/D67468 llvm-svn: 371774	2019-09-12 21:23:22 +00:00
Tim Shen	3de03a4c3d	Fix llvm-reduce tests so that they don't assume the source code is writable. Instead of copying over the original file permissions, just create a new file and add the executable bit. llvm-svn: 371772	2019-09-12 21:03:49 +00:00
Craig Topper	8b02dd7a44	[SelectionDAGBuilder] Simplify loop in visitSelect back to how it was before r255558. This code was changed to accommodate fp128 being softened to itself during type legalization on x86-64. This was done in order to create libcalls while having fp128 as a legal type. We're now doing the libcall creation during LegalizeDAG and the type legalization changes to enable the old behavior have been removed. So this change to SelectionDAGBuilder is no longer needed. llvm-svn: 371771	2019-09-12 21:00:32 +00:00
Simon Pilgrim	6273367347	[X86] Move negateFMAOpcode helper earlier to help future patch. NFCI. llvm-svn: 371770	2019-09-12 20:39:56 +00:00
Florian Hahn	9748747e28	[LV] Update test case after r371768. llvm-svn: 371769	2019-09-12 20:07:17 +00:00
Florian Hahn	b59499608a	[SCEV] Support SCEVUMinExpr in getRangeRef. This patch adds support for SCEVUMinExpr to getRangeRef, similar to the support for SCEVUMaxExpr. Reviewers: sanjoy.google, efriedma, reames, nikic Reviewed By: sanjoy.google Differential Revision: https://reviews.llvm.org/D67177 llvm-svn: 371768	2019-09-12 20:03:32 +00:00
David Blaikie	a2200faeaf	llvm-reduce: For now, mark these tests as requiring a shell (since they execute shell scripts/that's the only entry point at the moment) llvm-svn: 371764	2019-09-12 19:50:54 +00:00
Philip Reames	d7acb3e35e	Precommit tests for D67514 llvm-svn: 371762	2019-09-12 19:34:27 +00:00
Austin Kerbow	f6c3175fcc	AMDGPU: Fix bug in r371671 on some builds. llvm-svn: 371761	2019-09-12 19:12:21 +00:00
David Blaikie	70d10faac3	llvm-reduce: Remove unused plugin support/requirements llvm-svn: 371755	2019-09-12 18:52:31 +00:00
Alina Sbirlea	ba51045595	[LICM/AST] Check if the AliasAny set is removed from the tracker. Summary: Resolves PR38513. Credit to @bjope for debugging this. Reviewers: hfinkel, uabelho, bjope Subscribers: sanjoy.google, bjope, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67417 llvm-svn: 371752	2019-09-12 18:09:47 +00:00
Sanjay Patel	dd417415f3	[InstCombine] add tests for fptrunc; NFC llvm-svn: 371750	2019-09-12 18:00:11 +00:00
Alina Sbirlea	6d1edda49a	[MemorySSA] Pass (for update) MSSAU when hoisting instructions. Summary: Pass MSSAU to makeLoopInvariant in order to properly update MSSA. Reviewers: george.burgess.iv Subscribers: Prazek, sanjoy.google, uabelho, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67470 llvm-svn: 371748	2019-09-12 17:12:51 +00:00
Philip Reames	1bc2c14283	Precommit tests for generalization of load dereferenceability in loop llvm-svn: 371747	2019-09-12 17:09:01 +00:00
Sanjay Patel	901e3163db	[InstCombine] reduce test noise and regenerate CHECK lines; NFC llvm-svn: 371746	2019-09-12 17:07:01 +00:00
Philip Reames	caf2d0f40c	[LV] Support invariant addresses in speculation logic Implement a TODO from rL371452, and handle loop invariant addresses in predicated blocks. If we can prove that the load is safe to speculate into the header, then we can avoid using a masked.load in favour of a normal load. This is mostly about vectorization robustness. In the common case, it's generally expected that LICM/LoadStorePromotion would have eliminated such loads entirely. Differential Revision: https://reviews.llvm.org/D67372 llvm-svn: 371745	2019-09-12 16:49:10 +00:00
David Green	afc4123d6c	[CGP] Ensure sinking multiple instructions does not invalidate dominance checks In MVE, as of rL371218, we are attempting to sink chains of instructions such as: %l1 = insertelement <8 x i8> undef, i8 %l0, i32 0 %broadcast.splat26 = shufflevector <8 x i8> %l1, <8 x i8> undef, <8 x i32> zeroinitializer In certain situations though, we can end up breaking the dominance relations of instructions. This happens when we sink the instruction into a loop, but cannot remove the originals. The Use is updated, which might in fact be a Use from the second instruction to the first. This attempts to fix that by reversing the order of instructions that are sunk, and ensuring that we update the uses on new instructions if they have already been sunk, not the old ones. Differential Revision: https://reviews.llvm.org/D67366 llvm-svn: 371743	2019-09-12 16:00:07 +00:00
Guillaume Chatelet	961213111f	[Alignment] Move OffsetToAlignment to Alignment.h Summary: This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet, JDevlieghere, alexshap, rupprecht, jhenderson Subscribers: sdardis, nemanjai, hiraditya, kbarton, jakehehrlich, jrtc27, MaskRay, atanasyan, jsji, seiya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D67499 llvm-svn: 371742	2019-09-12 15:20:36 +00:00
Rainer Orth	7d740f416b	test-release.sh: Don't use chrpath on Solaris When trying to run test-release.sh on Solaris 11.4 for 9.0.0 rc4, I failed initially because Solaris lacks chrpath. This patch accounts for that and allowed the run to continue. Tested on amd64-pc-solaris2.11 and sparcv9-sun-solaris2.11. Differential Revision: https://reviews.llvm.org/D67484 llvm-svn: 371741	2019-09-12 14:50:32 +00:00
James Henderson	65ef8f90c0	[docs][llvm-strip] Remove unnecessary whitespace for consistency llvm-svn: 371739	2019-09-12 14:24:04 +00:00
Roman Lebedev	ae40d1323c	[InstCombine][InstSimplify] Move constant-folding tests in result-of-usub-is-non-zero-and-no-overflow.ll llvm-svn: 371737	2019-09-12 14:12:31 +00:00
Roman Lebedev	256ad0deef	[NFC][InstCombine][InstSimplify] Add test for "add-of-negative is non-zero and no overflow" (PR43259) https://rise4fun.com/Alive/ska https://rise4fun.com/Alive/9iX https://bugs.llvm.org/show_bug.cgi?id=43259 llvm-svn: 371736	2019-09-12 14:12:20 +00:00
Sanjay Patel	e5665905a2	[ConstProp] allow folding for fma that produces NaN Folding for fma/fmuladd was added here: rL202914 ...and as seen in existing/unchanged tests, that works to propagate NaN if it's already an input, but we should fold an fma() that creates NaN too. From IEEE-754-2008 7.2 "Invalid Operation", there are 2 clauses that apply to fma, so I added tests for those patterns: c) fusedMultiplyAdd: fusedMultiplyAdd(0, ∞, c) or fusedMultiplyAdd(∞, 0, c) unless c is a quiet NaN; if c is a quiet NaN then it is implementation defined whether the invalid operation exception is signaled d) addition or subtraction or fusedMultiplyAdd: magnitude subtraction of infinities, such as: addition(+∞, −∞) Differential Revision: https://reviews.llvm.org/D67446 llvm-svn: 371735	2019-09-12 14:10:50 +00:00
Petar Avramovic	db34bdc442	[MIPS GlobalISel] Select indirect branch Select G_BRINDIRECT for MIPS32. Differential Revision: https://reviews.llvm.org/D67441 llvm-svn: 371730	2019-09-12 11:44:36 +00:00
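
The two IEEE-754-2008 7.2 clauses quoted in commit e5665905a2 ([ConstProp] allow folding for fma that produces NaN) can be illustrated with a small Python sketch. It is not LLVM's constant folder: the helper names are hypothetical, and the arithmetic is an unfused `a * b + c`, which for these particular inputs also yields NaN.

```python
import math

INF = float("inf")


def unfused_multiply_add(a: float, b: float, c: float) -> float:
    """Compute a * b + c with two roundings (an assumption: not a true fused op)."""
    return a * b + c


def creates_nan(a: float, b: float, c: float) -> bool:
    """Return True when the result is NaN although none of the inputs is NaN."""
    inputs_clean = not any(math.isnan(x) for x in (a, b, c))
    return inputs_clean and math.isnan(unfused_multiply_add(a, b, c))


# Clause (c): zero times infinity, in either operand order.
print(creates_nan(0.0, INF, 1.0))
print(creates_nan(INF, 0.0, 1.0))

# Clause (d): magnitude subtraction of infinities, (+inf) + (-inf).
print(creates_nan(INF, 1.0, -INF))

# Ordinary finite operands do not create a NaN.
print(creates_nan(2.0, 3.0, 4.0))
```

The check separates "NaN was already an input" from "NaN was created by the operation", which is the distinction the commit message draws between the earlier folding and the new cases.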

184875 Commits