llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Matt Arsenault	077968a109	AMDGPU: Update more tests to use modern buffer intrinsics	2020-01-16 14:29:38 -05:00
Matt Arsenault	408e513c5f	AMDGPU/GlobalISel: Improve lowering of G_SEXT_INREG Clamping the scalar is much better than lowering with superwide shifts for types > s64.	2020-01-16 14:29:37 -05:00
Matt Arsenault	552fe4c9ba	GlobalISel: Don't ignore requested ext narrowing type This was assuming the narrow target was the source type. Respect the requested type when these don't match by using intermediate merges. This avoids producing very wide, illegal shift expansions.	2020-01-16 14:29:37 -05:00
Matt Arsenault	205916c405	GlobalISel: Move extension scalar narrowing to separate function Also rename a few things. Handling a different requested type will require this to become much more complex.	2020-01-16 14:29:37 -05:00
Krzysztof Parzyszek	447f535a6a	[Hexagon] Update autogeneated intrinsic information in LLVM	2020-01-16 13:11:18 -06:00
Craig Topper	ee6566c72b	[LegalizeDAG][Mips] Add an assert to protect a uint_to_fp implementation from double rounding. Add a i32->f32 uint_to_fp implementation that avoids this code. The algorithm here only works if the sint_to_fp doesn't do any rounding. Otherwise it can round before the offset fixup is applied. Add an assert to protect this. To avoid breaking the one test in tree that tested this code with a set of types that fail the assert, I've enabled i32->f32 to use the i64->f32 algorithm. This only occurs when f64 isn't a legal type. If f64 is legal then we do i32->f64->f32 instead. Differential Revision: https://reviews.llvm.org/D72794	2020-01-16 11:08:16 -08:00
Matt Arsenault	353036eab7	AMDGPU: Remove IR section from MIR test Also generate check lines so this isn't just testing the meaningless block name.	2020-01-16 13:49:44 -05:00
Matt Arsenault	69451d9bc3	GlobalISel: Apply target MMO flags to atomics Unify MMO flag handling with SelectionDAG like with loads and stores.	2020-01-16 13:49:43 -05:00
Matt Arsenault	ae7ab4d57e	GlobalISel: Preserve load/store metadata in IRTranslator This was dropping the invariant metadata on dead argument loads, so they weren't deleted. Atomics still need to be fixed the same way. Also, apparently store was never preserving dereferencable which should also be fixed.	2020-01-16 13:49:43 -05:00
Matt Arsenault	0e42d09951	TableGen/GlobalISel: Fix srcvalue inputs Allow using srcvalue for discarding pattern inputs.	2020-01-16 13:49:43 -05:00
Matt Arsenault	534a1fba52	TableGen: Remove dead code	2020-01-16 13:49:43 -05:00
Matt Arsenault	104f9f96d4	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
Krzysztof Parzyszek	f9eef6e26a	[Hexagon] Add a target feature to disable compound instructions This affects the following instructions: Tag: M4_mpyrr_addr Syntax: Ry32 = add(Ru32,mpyi(Ry32,Rs32)) Tag: M4_mpyri_addr_u2 Syntax: Rd32 = add(Ru32,mpyi(#u6:2,Rs32)) Tag: M4_mpyri_addr Syntax: Rd32 = add(Ru32,mpyi(Rs32,#u6)) Tag: M4_mpyri_addi Syntax: Rd32 = add(#u6,mpyi(Rs32,#U6)) Tag: M4_mpyrr_addi Syntax: Rd32 = add(#u6,mpyi(Rs32,Rt32)) Tag: S4_addaddi Syntax: Rd32 = add(Rs32,add(Ru32,#s6)) Tag: S4_subaddi Syntax: Rd32 = add(Rs32,sub(#s6,Ru32)) Tag: S4_or_andix Syntax: Rx32 = or(Ru32,and(Rx32,#s10)) Tag: S4_andi_asl_ri Syntax: Rx32 = and(#u8,asl(Rx32,#U5)) Tag: S4_ori_asl_ri Syntax: Rx32 = or(#u8,asl(Rx32,#U5)) Tag: S4_addi_asl_ri Syntax: Rx32 = add(#u8,asl(Rx32,#U5)) Tag: S4_subi_asl_ri Syntax: Rx32 = sub(#u8,asl(Rx32,#U5)) Tag: S4_andi_lsr_ri Syntax: Rx32 = and(#u8,lsr(Rx32,#U5)) Tag: S4_ori_lsr_ri Syntax: Rx32 = or(#u8,lsr(Rx32,#U5)) Tag: S4_addi_lsr_ri Syntax: Rx32 = add(#u8,lsr(Rx32,#U5)) Tag: S4_subi_lsr_ri Syntax: Rx32 = sub(#u8,lsr(Rx32,#U5))	2020-01-16 12:37:30 -06:00
Arkady Shlykov	ae9dada9fd	Revert "[Loop Peeling] Add possibility to enable peeling on loop nests." This reverts commit 3f3017e because there's a failure on peel-loop-nests.ll with LLVM_ENABLE_EXPENSIVE_CHECKS on. Differential Revision: https://reviews.llvm.org/D70304	2020-01-16 10:33:38 -08:00
Nico Weber	851ce920c9	[gn build] (manually) port bed7626f04f7	2020-01-16 13:19:39 -05:00
Nico Weber	9a71e6cdaf	[gn build] include revision information in lld --version output	2020-01-16 13:10:41 -05:00
stevewan	84c463c970	[PowerPC][AIX] Make PIC the default relocation model for AIX Summary: The `llc` tool currently defaults to Static relocation model and generates non-relocatable code for 32-bit Power. This is not desirable on AIX where we always generate Position Independent Code (PIC). This patch makes PIC the default relocation model for AIX. Reviewers: daltenty, hubert.reinterpretcast, DiggerLin, Xiangling_L, sfertile Reviewed By: hubert.reinterpretcast Subscribers: mgorny, wuzish, nemanjai, hiraditya, kbarton, jsji, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72479	2020-01-16 13:07:36 -05:00
Nico Weber	2f7698b8c3	remove an include that's unused after r347592	2020-01-16 12:49:54 -05:00
Fedor Sergeev	d6b3c3a8da	[GVN] introduce GVNOptions to control GVN pass behavior There are a few global (cl::opt) controls that enable optional behavior in GVN. Introduce GVNOptions that provide corresponding per-pass instance controls. That will allow to use GVN multiple times in pipeline each time with different settings. Reviewers: asbirlea, rnk, reames, skatkov, fhahn Reviewed By: fhahn Tags: #llvm Differential Revision: https://reviews.llvm.org/D72732	2020-01-16 20:21:08 +03:00
Mircea Trofin	e90406ee2a	[llvm] Make new pass manager's OptimizationLevel a class Summary: The old pass manager separated speed optimization and size optimization levels into two unsigned values. Coallescing both in an enum in the new pass manager may lead to unintentional casts and comparisons. In particular, taking a look at how the loop unroll passes were constructed previously, the Os/Oz are now (==new pass manager) treated just like O3, likely unintentionally. This change disallows raw comparisons between optimization levels, to avoid such unintended effects. As an effect, the O{s\|z} behavior changes for loop unrolling and loop unroll and jam, matching O2 rather than O3. The change also parameterizes the threshold values used for loop unrolling, primarily to aid testing. Reviewers: tejohnson, davidxl Reviewed By: tejohnson Subscribers: zzheng, ychen, mehdi_amini, hiraditya, steven_wu, dexonsmith, dang, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72547	2020-01-16 09:00:56 -08:00
LLVM GN Syncbot	bbf89c711e	[gn build] Port 6b357866496	2020-01-16 16:56:26 +00:00
Miloš Stojanović	b22e99bc49	[llvm-exegesis][mips] Add RegisterAliasingTest unit test Differential Revision: https://reviews.llvm.org/D72004	2020-01-16 17:50:45 +01:00
Miloš Stojanović	48ecc855b2	[llvm-exegesis][NFC] Refactor Mips tests fixtures into a base class. Differential Revision: https://reviews.llvm.org/D72003	2020-01-16 17:50:44 +01:00
Matt Arsenault	c41df580c2	AMDGPU/GlobalISel: Don't handle legacy buffer intrinsic	2020-01-16 11:31:12 -05:00
Hubert Tong	dc883a9c61	[MC][test] Fix non-portable GNU diff option Summary: This patch replaces the non-portable GNU diff option `--strip-trailing-cr` with the POSIX `-b` option in two test files. Reviewers: daltenty, jasonliu Reviewed By: daltenty Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72745	2020-01-16 11:29:24 -05:00
Matt Arsenault	28876d6a15	AMDGPU/GlobalISel: Select DS GWS intrinsics	2020-01-16 11:25:10 -05:00
Jay Foad	cfc365cf6e	[GlobalISel] Don't arbitrarily limit a mask to 64 bits Reviewers: arsenm Subscribers: wdng, rovka, hiraditya, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72853	2020-01-16 16:13:20 +00:00
Jay Foad	43278bacb4	[GlobalISel] Pass MachineOperands into MachineIRBuilder helper methods Reviewers: arsenm, aditya_nandakumar, aemerson Subscribers: wdng, rovka, hiraditya, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72849	2020-01-16 16:04:21 +00:00
Sam Parker	be9f6d7d6e	[ARM][LowOverheadLoops] Update liveness info Recommitting e93e0d413f3a after reverting due to test failures, which will hopefully now be fixed. Original commit message: After expanding the pseudo instructions, update the liveness info. We do this in a post-order traversal of the loop, including its exit blocks and preheader(s). Differential Revision: https://reviews.llvm.org/D72131	2020-01-16 15:44:25 +00:00
Jay Foad	9918b43039	[GlobalISel] Use more MachineIRBuilder helper methods Reviewers: arsenm, nhaehnle Subscribers: wdng, rovka, hiraditya, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72833	2020-01-16 15:34:51 +00:00
Anna Welker	8d31f5ac65	[ARM][MVE] Enable extending gathers Enables the masked gather pass to create extending masked gathers. Differential Revision: https://reviews.llvm.org/D72451	2020-01-16 15:24:54 +00:00
Francesco Petrogalli	f6e39fe1d1	[VectorUtils] Rework the Vector Function Database (VFDatabase). Summary: This commits is a rework of the patch in https://reviews.llvm.org/D67572. The rework was requested to prevent out-of-tree performance regression when vectorizing out-of-tree IR intrinsics. The vectorization of such intrinsics is enquired via the static function `isTLIScalarize`. For detail see the discussion in https://reviews.llvm.org/D67572. Reviewers: uabelho, fhahn, sdesmalen Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72734	2020-01-16 15:08:26 +00:00
Nico Weber	6bab3f4983	Make lld cmake not compute commit revision twice r354605 moved LLD to the unified revision handling introduced in rL353268 / r352729 and removed uses of LLD_REPOSITORY_STRING and LLD_REVISION_STRING. After this change, we no longer compute the (now-unused) values of these two variables. Since this removes the only use of llvm/utils/GetRepositoryPath, remove that too (it's redundant with the system added in r354605). While here, also remove LLD_VERSION_MAJOR and LLD_VERSION_MINOR. Their uses were removed in r285163. Also remove LLD_VERSION from Version.inc which as far as I can tell has been unused since the file was added in r219277. No behavior change. Differential Revision: https://reviews.llvm.org/D72803	2020-01-16 09:55:36 -05:00
Jeremy Morse	6d4a603c87	Revert "[PHIEliminate] Move dbg values after phi and label" Testing compiler-rt, a new assertion failure occurs when building the GwpAsanTestObjects object. I'm uploading a reproducer to D70597. This reverts commit 75188b01e9af3a89639d84be912f84610d6885ba.	2020-01-16 14:01:27 +00:00
Simon Pilgrim	07620d7416	Fix unused variable warning. NFCI.	2020-01-16 13:02:40 +00:00
Chris Ye	e6e666bec8	[PHIEliminate] Move dbg values after phi and label If there are DBG_VALUEs between phi and label (after phi and before label), DBG_VALUE will block PHI lowering after the LABEL. Moving all DBG_VALUEs after Labels in the function ScheduleDAGSDNodes::EmitSchedule to avoid impacting PHI lowering. before: PHI DBG_VALUE LABEL after: (move DBG_VALUE after label) PHI LABEL DBG_VALUE then: (phi lowering after label) LABEL COPY DBG_VALUE Fixes the issue: https://bugs.llvm.org/show_bug.cgi?id=43859 Differential Revision: https://reviews.llvm.org/D70597	2020-01-16 11:58:09 +00:00
Florian Hahn	7b29a90656	[IR] Mark memset.* intrinsics as IntrWriteMem. llvm.memset intrinsics do only write memory, but are missing IntrWriteMem, so they doesNotReadMemory() returns false for them. The test change is due to the test checking the fn attribute ids at the call sites, which got bumped up due to a new combination with writeonly appearing in the test file. Reviewers: jdoerfert, reames, efriedma, nlopes, lebedev.ri Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D72789	2020-01-16 10:35:46 +00:00
Florian Hahn	1eeb9c02e5	[LV] Allow assume calls in predicated blocks. The assume intrinsic is intentionally marked as may reading/writing memory, to avoid passes moving them around. When flattening the CFG for predicated blocks, we have to drop the assume calls, as they are control-flow dependent. There are some cases where we can do better (when control flow is preserved), but that is follow-up work. Fixes PR43620. Reviewers: hsaito, rengolin, dcaballe, Ayal Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D68814	2020-01-16 10:11:35 +00:00
Florian Hahn	6c45b4f8b1	[LV] Make X86/assume.ll X86 independent (NFC). The test does not check anything X86 specific. This is a preparation for the D68814.	2020-01-16 10:01:35 +00:00
LLVM GN Syncbot	24c5598549	[gn build] Port ed181efa175	2020-01-16 09:55:55 +00:00
Sameer Sahasrabuddhe	6a0b5d46f8	[HIP][AMDGPU] expand printf when compiling HIP to AMDGPU Summary: This change implements the expansion in two parts: - Add a utility function emitAMDGPUPrintfCall() in LLVM. - Invoke the above function from Clang CodeGen, when processing a HIP program for the AMDGPU target. The printf expansion has undefined behaviour if the format string is not a compile-time constant. As a sufficient condition, the HIP ToolChain now emits -Werror=format-nonliteral. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D71365	2020-01-16 15:15:38 +05:30
Kazushi (Jam) Marukawa	a4acca0462	[VE] i64 arguments, return values and constants Summary: Support for i64 arguments (in register), return values and constants along with tests. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D72776	2020-01-16 10:09:50 +01:00
Igor Kudrin	f87be52260	[DebugInfo] Simplify the constructor of DWARFDebugAranges::Range. NFC. This removes the default values of the arguments. The only caller, DWARFDebugAranges::construct(), provides all three parameters. Differential Revision: https://reviews.llvm.org/D72757	2020-01-16 13:08:30 +07:00
Craig Topper	da61486b44	[LegalizeDAG][TargetLowering] Move vXi64/i64->vXf32/f32 uint_to_fp legalizing code from TargetLowering::expandUINT_TO_FP back to LegalizeDAG. This was moved in October 2018, but we don't appear to be using this for vectors on any in tree target. Moving it back simplifies D72794 so we can share the code for i32->f32.	2020-01-15 22:04:50 -08:00
LLVM GN Syncbot	e4adbf252b	[gn build] Port 8fdafb7dced	2020-01-16 04:13:31 +00:00
Liu, Chen3	9217c910b5	Insert wait instruction after X87 instructions which could raise float-point exception. This patch also modify some mayRaiseFPException flag which set in D68854. Differential Revision: https://reviews.llvm.org/D72750	2020-01-16 12:12:51 +08:00
Matt Arsenault	61c8d1a930	Set some fast math attributes in setFunctionAttributes This will provide a more consistent view to codegen for these attributes. The current system is somewhat awkward, and the fields in TargetOptions are reset based on the command line flag if the attribute isn't set. By forcing these attributes with the flag, there can never be an inconsistency in the behavior if code directly inspects the attribute on the function without considering the command line flags.	2020-01-15 22:23:18 -05:00
Wei Mi	866992fbb3	[SampleFDO] Fix invalid branch profile generated by indirect call promotion. Suppose an inline instance has hot total sample count but 0 entry count, and it is an indirect call target. If the indirect call has no other call target and inline instance associated with it and it is promoted, currently the conditional branch generated by indirect call promotion will have invalid branch profile which is !{!"branch_weights", i32 0, i32 0} -- because the entry count of the promoted target is 0 and the total entry count of all targets is also 0. This caused a SEGV in Control Height Reduction and may cause problem in other passes. Function entry count of an inline instance is computed by a heuristic -- using either the sample of the starting line or starting inner inline instance. The patch changes the heuristic a little bit so that when total sample count is larger than 0, the computed entry count will be at least 1. Then the new branch profile will be !{!"branch_weights", i32 1, i32 0}. Differential Revision: https://reviews.llvm.org/D72790	2020-01-15 18:36:06 -08:00
Craig Topper	1990a5b605	[X86] When handling i64->f32 sint_to_fp on 32-bit targets only bitcast to f64 if sse2 is enabled. The code is trying to copy the i64 value to an xmm register to use a 64-bit store so that the 64-bit fild can benefit from store forwarding. But this trick only works if f64 is going to be stored in an XMM register. If we only have SSE1 then only float is in xmm register. So this trick just causes 2 stores i32 stores, an f64 load into the x87, an f64 from x87, and a 64-bit fild. So we end up with an extra stack temporary and still didn't get store forwarding. We might be able to use v2f32 here instead, but I didn't check. I just wanted the code to make sense. Found by inspection as I continue to stare too hard at our int_to_fp conversions.	2020-01-15 18:26:28 -08:00
Craig Topper	d2622ad06a	[X86] Add 32-bit mode sse1 command line to scalar-int-to-fp.ll. NFC	2020-01-15 18:26:27 -08:00

1 2 3 4 5 ...

190205 Commits