llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
David Blaikie	6bae49b565	PointerLikeTypeTraits: Standardize NumLowBitsAvailable on static constexpr rather than anonymous enum This is (more?) usable by GDB pretty printers and seems nicer to write. There's one tricky caveat that in C++14 (LLVM's codebase today) the static constexpr member declaration is not a definition - so odr use of this constant requires an out of line definition, which won't be provided (that'd make all these trait classes more annoying/expensive to maintain). But the use of this constant in the library implementation is/should always be in a non-odr context - only two unit tests needed to be touched to cope with this/avoid odr using these constants. Based on/expanded from D72590 by Christian Sigg.	2020-01-16 15:30:50 -08:00
Eric Christopher	40942db2ce	[NFC] Fold isHugeExpression into hasHugeExpression and update callers accordingly.	2020-01-16 15:28:54 -08:00
Jessica Paquette	5ae7bf2aca	[AArch64][GlobalISel] Change G_FCONSTANTs feeding into stores into G_CONSTANTS Given the following situation: x = G_FCONSTANT (something that can't be materialized) G_STORE x, some_addr We know that x must be materialized as at least a single mov. However, at the time of selection, the G_STORE will have been regbankselected to a FPR store. So, as a result, you'll get an unnecessary fmov into the G_STORE. Storing a constant value in a GPR and a constant value in a FPR are the same. So, whenever you see a G_FCONSTANT that feeds into only G_STOREs, you might as well make it a G_CONSTANT. This adds a target-specific combine which changes G_FCONSTANTs feeding into G_STOREs into G_CONSTANTs. Differential Revision: https://reviews.llvm.org/D72814	2020-01-16 15:18:44 -08:00
Derek Schuff	50c21552f7	Revert "[WebAssembly] Track frame registers through VReg and local allocation" This reverts commit 3a05c3969c18b5520e360b78fc63cda39a6be98f. It breaks under expensive-checks and on Windows	2020-01-16 14:38:00 -08:00
Matt Arsenault	02f8021821	AMDGPU: Do permlane16 vdst_in discard optimization in InstCombine There's more potential value to discarding the source value earlier, since we always know the value of the fi/bc bits.	2020-01-16 17:27:53 -05:00
Matt Arsenault	ec9629b24b	AMDGPU: Move permlane discard vdst_in optimization This case can be handled as a regular selection pattern, so move it out of the weird post-isel folding code which doesn't have an exactly equivalent place in GlobalISel. I think it doesn't make much sense to do this optimization here though, and it would be more useful in instcombine. There's not really any new information that will be gained during lowering since these inputs were known from the beginning.	2020-01-16 17:27:53 -05:00
Sam Clegg	641286a21a	[llvm-nm] Use `StringRef` over `const std::string &` params Differential Revision: https://reviews.llvm.org/D72718	2020-01-16 14:02:58 -08:00
Derek Schuff	ce9124de54	[WebAssembly] Track frame registers through VReg and local allocation This change has 2 components: Target-independent: add a method getDwarfFrameBase to TargetFrameLowering. It describes how the Dwarf frame base will be encoded. That can be a register (the default), the CFA (which replaces NVPTX-specific logic in DwarfCompileUnit), or a DW_OP_WASM_location descriptor. WebAssembly: Allow WebAssemblyFunctionInfo::getFrameRegister to return the correct virtual register instead of FP32/SP32 after WebAssemblyReplacePhysRegs has run. Make WebAssemblyExplicitLocals store the local it allocates for the frame register. Use this local information to implement getDwarfFrameBase The result is that the DW_AT_frame_base attribute is correctly encoded for each subprogram, and each param and local variable has a correct DW_AT_location that uses DW_OP_fbreg to refer to the frame base. Differential Revision: https://reviews.llvm.org/D71681	2020-01-16 13:51:17 -08:00
Sanjay Patel	bc125ae78b	[IR] fix crash in Constant::isElementWiseEqual() with FP types We lifted this code from InstCombine for general usage in: rL369842 ...but it's not safe as-is. There are no existing users that can trigger this bug, but I discovered it via crashing several regression tests when trying to use it for select folding in InstSimplify. ICmp requires (vector) integer types, so give up on anything that's not integer or FP (pointers and ?) then bitcast the constants before trying the match. That matches the definition of "equal or undef" that I was looking for. If someone wants an FP-aware version of equality (deal with NaN, -0.0), that could be a different mode or different function. Differential Revision: https://reviews.llvm.org/D72784	2020-01-16 16:49:16 -05:00
LLVM GN Syncbot	3c615417b5	[gn build] Port d5c6b8407c1	2020-01-16 21:35:08 +00:00
StevenWanYu	17feeab184	[NFC] Remove unnecessary link components. Remove unused link components in unittest according to post commit comments.	2020-01-16 16:08:07 -05:00
Krzysztof Parzyszek	3a2fdd005e	[Hexagon] Add ELF flags for Hexagon v66 to ELFYAML.cpp	2020-01-16 15:01:00 -06:00
Fedor Sergeev	bf9f1ee39d	[GVN] add GVN parameters parsing to new pass manager Introduce parsing, add a few instances of parameter use into GVN-PRE tests. Reviewers: skatkov, asbirlea Reviewed By: skatkov Tags: #llvm Differential Revision: https://reviews.llvm.org/D72752	2020-01-16 23:53:46 +03:00
Kazu Hirata	bed83094ca	Resubmit: [JumpThreading] Thread jumps through two basic blocks This reverts commit 2d258ed931cdf47a7d1dcf08ad963b5452a8670f. This revision fixes the Windows build and adds a testcase for it, namely thread-two-bbs3.ll. My original patch improperly copied EH pads on Windows. This patch disregards jump threading opportunities having to do with EH pads. [JumpThreading] Thread jumps through two basic blocks Summary: This patch teaches JumpThreading.cpp to thread through two basic blocks like: bb3: %var = phi i32* [ null, %bb1 ], [ @a, %bb2 ] %tobool = icmp eq i32 %cond, 0 br i1 %tobool, label %bb4, label ... bb4: %cmp = icmp eq i32* %var, null br i1 %cmp, label bb5, label bb6 by duplicating basic blocks like bb3 above. Once we duplicate bb3 as bb3.dup and redirect edge bb2->bb3 to bb2->bb3.dup, we have: bb3: %var = phi i32* [ @a, %bb2 ] %tobool = icmp eq i32 %cond, 0 br i1 %tobool, label %bb4, label ... bb3.dup: %var = phi i32* [ null, %bb1 ] %tobool = icmp eq i32 %cond, 0 br i1 %tobool, label %bb4, label ... bb4: %cmp = icmp eq i32* %var, null br i1 %cmp, label bb5, label bb6 Then the existing code in JumpThreading.cpp can thread edge bb3.dup->bb4 through bb4 and eventually create bb3.dup->bb5. Reviewers: wmi Subscribers: hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70247	2020-01-16 12:33:37 -08:00
StevenWanYu	863a3802fa	Address redirect issue on Windows.	2020-01-16 15:07:50 -05:00
Matt Arsenault	2373a4fd8d	AMDGPU: Remove outdated comment	2020-01-16 14:54:27 -05:00
StevenWanYu	8e3115ca8a	Don't run powerpc lit test case on other platforms. Only run this test on powerpc targets, because other platforms might not have powerpc registered.	2020-01-16 14:37:25 -05:00
Matt Arsenault	077968a109	AMDGPU: Update more tests to use modern buffer intrinsics	2020-01-16 14:29:38 -05:00
Matt Arsenault	408e513c5f	AMDGPU/GlobalISel: Improve lowering of G_SEXT_INREG Clamping the scalar is much better than lowering with superwide shifts for types > s64.	2020-01-16 14:29:37 -05:00
Matt Arsenault	552fe4c9ba	GlobalISel: Don't ignore requested ext narrowing type This was assuming the narrow target was the source type. Respect the requested type when these don't match by using intermediate merges. This avoids producing very wide, illegal shift expansions.	2020-01-16 14:29:37 -05:00
Matt Arsenault	205916c405	GlobalISel: Move extension scalar narrowing to separate function Also rename a few things. Handling a different requested type will require this to become much more complex.	2020-01-16 14:29:37 -05:00
Krzysztof Parzyszek	447f535a6a	[Hexagon] Update autogenerated intrinsic information in LLVM	2020-01-16 13:11:18 -06:00
Craig Topper	ee6566c72b	[LegalizeDAG][Mips] Add an assert to protect a uint_to_fp implementation from double rounding. Add a i32->f32 uint_to_fp implementation that avoids this code. The algorithm here only works if the sint_to_fp doesn't do any rounding. Otherwise it can round before the offset fixup is applied. Add an assert to protect this. To avoid breaking the one test in tree that tested this code with a set of types that fail the assert, I've enabled i32->f32 to use the i64->f32 algorithm. This only occurs when f64 isn't a legal type. If f64 is legal then we do i32->f64->f32 instead. Differential Revision: https://reviews.llvm.org/D72794	2020-01-16 11:08:16 -08:00
Matt Arsenault	353036eab7	AMDGPU: Remove IR section from MIR test Also generate check lines so this isn't just testing the meaningless block name.	2020-01-16 13:49:44 -05:00
Matt Arsenault	69451d9bc3	GlobalISel: Apply target MMO flags to atomics Unify MMO flag handling with SelectionDAG like with loads and stores.	2020-01-16 13:49:43 -05:00
Matt Arsenault	ae7ab4d57e	GlobalISel: Preserve load/store metadata in IRTranslator This was dropping the invariant metadata on dead argument loads, so they weren't deleted. Atomics still need to be fixed the same way. Also, apparently store was never preserving dereferenceable which should also be fixed.	2020-01-16 13:49:43 -05:00
Matt Arsenault	0e42d09951	TableGen/GlobalISel: Fix srcvalue inputs Allow using srcvalue for discarding pattern inputs.	2020-01-16 13:49:43 -05:00
Matt Arsenault	534a1fba52	TableGen: Remove dead code	2020-01-16 13:49:43 -05:00
Matt Arsenault	104f9f96d4	AMDGPU: Update tests to use modern buffer intrinsics	2020-01-16 13:49:43 -05:00
Krzysztof Parzyszek	f9eef6e26a	[Hexagon] Add a target feature to disable compound instructions This affects the following instructions: Tag: M4_mpyrr_addr Syntax: Ry32 = add(Ru32,mpyi(Ry32,Rs32)) Tag: M4_mpyri_addr_u2 Syntax: Rd32 = add(Ru32,mpyi(#u6:2,Rs32)) Tag: M4_mpyri_addr Syntax: Rd32 = add(Ru32,mpyi(Rs32,#u6)) Tag: M4_mpyri_addi Syntax: Rd32 = add(#u6,mpyi(Rs32,#U6)) Tag: M4_mpyrr_addi Syntax: Rd32 = add(#u6,mpyi(Rs32,Rt32)) Tag: S4_addaddi Syntax: Rd32 = add(Rs32,add(Ru32,#s6)) Tag: S4_subaddi Syntax: Rd32 = add(Rs32,sub(#s6,Ru32)) Tag: S4_or_andix Syntax: Rx32 = or(Ru32,and(Rx32,#s10)) Tag: S4_andi_asl_ri Syntax: Rx32 = and(#u8,asl(Rx32,#U5)) Tag: S4_ori_asl_ri Syntax: Rx32 = or(#u8,asl(Rx32,#U5)) Tag: S4_addi_asl_ri Syntax: Rx32 = add(#u8,asl(Rx32,#U5)) Tag: S4_subi_asl_ri Syntax: Rx32 = sub(#u8,asl(Rx32,#U5)) Tag: S4_andi_lsr_ri Syntax: Rx32 = and(#u8,lsr(Rx32,#U5)) Tag: S4_ori_lsr_ri Syntax: Rx32 = or(#u8,lsr(Rx32,#U5)) Tag: S4_addi_lsr_ri Syntax: Rx32 = add(#u8,lsr(Rx32,#U5)) Tag: S4_subi_lsr_ri Syntax: Rx32 = sub(#u8,lsr(Rx32,#U5))	2020-01-16 12:37:30 -06:00
Arkady Shlykov	ae9dada9fd	Revert "[Loop Peeling] Add possibility to enable peeling on loop nests." This reverts commit 3f3017e because there's a failure on peel-loop-nests.ll with LLVM_ENABLE_EXPENSIVE_CHECKS on. Differential Revision: https://reviews.llvm.org/D70304	2020-01-16 10:33:38 -08:00
Nico Weber	851ce920c9	[gn build] (manually) port bed7626f04f7	2020-01-16 13:19:39 -05:00
Nico Weber	9a71e6cdaf	[gn build] include revision information in lld --version output	2020-01-16 13:10:41 -05:00
stevewan	84c463c970	[PowerPC][AIX] Make PIC the default relocation model for AIX Summary: The `llc` tool currently defaults to Static relocation model and generates non-relocatable code for 32-bit Power. This is not desirable on AIX where we always generate Position Independent Code (PIC). This patch makes PIC the default relocation model for AIX. Reviewers: daltenty, hubert.reinterpretcast, DiggerLin, Xiangling_L, sfertile Reviewed By: hubert.reinterpretcast Subscribers: mgorny, wuzish, nemanjai, hiraditya, kbarton, jsji, shchenz, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72479	2020-01-16 13:07:36 -05:00
Nico Weber	2f7698b8c3	remove an include that's unused after r347592	2020-01-16 12:49:54 -05:00
Fedor Sergeev	d6b3c3a8da	[GVN] introduce GVNOptions to control GVN pass behavior There are a few global (cl::opt) controls that enable optional behavior in GVN. Introduce GVNOptions that provide corresponding per-pass instance controls. That will allow to use GVN multiple times in pipeline each time with different settings. Reviewers: asbirlea, rnk, reames, skatkov, fhahn Reviewed By: fhahn Tags: #llvm Differential Revision: https://reviews.llvm.org/D72732	2020-01-16 20:21:08 +03:00
Mircea Trofin	e90406ee2a	[llvm] Make new pass manager's OptimizationLevel a class Summary: The old pass manager separated speed optimization and size optimization levels into two unsigned values. Coalescing both in an enum in the new pass manager may lead to unintentional casts and comparisons. In particular, taking a look at how the loop unroll passes were constructed previously, the Os/Oz are now (==new pass manager) treated just like O3, likely unintentionally. This change disallows raw comparisons between optimization levels, to avoid such unintended effects. As an effect, the O{s|z} behavior changes for loop unrolling and loop unroll and jam, matching O2 rather than O3. The change also parameterizes the threshold values used for loop unrolling, primarily to aid testing. Reviewers: tejohnson, davidxl Reviewed By: tejohnson Subscribers: zzheng, ychen, mehdi_amini, hiraditya, steven_wu, dexonsmith, dang, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72547	2020-01-16 09:00:56 -08:00
LLVM GN Syncbot	bbf89c711e	[gn build] Port 6b357866496	2020-01-16 16:56:26 +00:00
Miloš Stojanović	b22e99bc49	[llvm-exegesis][mips] Add RegisterAliasingTest unit test Differential Revision: https://reviews.llvm.org/D72004	2020-01-16 17:50:45 +01:00
Miloš Stojanović	48ecc855b2	[llvm-exegesis][NFC] Refactor Mips tests fixtures into a base class. Differential Revision: https://reviews.llvm.org/D72003	2020-01-16 17:50:44 +01:00
Matt Arsenault	c41df580c2	AMDGPU/GlobalISel: Don't handle legacy buffer intrinsic	2020-01-16 11:31:12 -05:00
Hubert Tong	dc883a9c61	[MC][test] Fix non-portable GNU diff option Summary: This patch replaces the non-portable GNU diff option `--strip-trailing-cr` with the POSIX `-b` option in two test files. Reviewers: daltenty, jasonliu Reviewed By: daltenty Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72745	2020-01-16 11:29:24 -05:00
Matt Arsenault	28876d6a15	AMDGPU/GlobalISel: Select DS GWS intrinsics	2020-01-16 11:25:10 -05:00
Jay Foad	cfc365cf6e	[GlobalISel] Don't arbitrarily limit a mask to 64 bits Reviewers: arsenm Subscribers: wdng, rovka, hiraditya, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72853	2020-01-16 16:13:20 +00:00
Jay Foad	43278bacb4	[GlobalISel] Pass MachineOperands into MachineIRBuilder helper methods Reviewers: arsenm, aditya_nandakumar, aemerson Subscribers: wdng, rovka, hiraditya, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72849	2020-01-16 16:04:21 +00:00
Sam Parker	be9f6d7d6e	[ARM][LowOverheadLoops] Update liveness info Recommitting e93e0d413f3a after reverting due to test failures, which will hopefully now be fixed. Original commit message: After expanding the pseudo instructions, update the liveness info. We do this in a post-order traversal of the loop, including its exit blocks and preheader(s). Differential Revision: https://reviews.llvm.org/D72131	2020-01-16 15:44:25 +00:00
Jay Foad	9918b43039	[GlobalISel] Use more MachineIRBuilder helper methods Reviewers: arsenm, nhaehnle Subscribers: wdng, rovka, hiraditya, volkan, Petar.Avramovic, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72833	2020-01-16 15:34:51 +00:00
Anna Welker	8d31f5ac65	[ARM][MVE] Enable extending gathers Enables the masked gather pass to create extending masked gathers. Differential Revision: https://reviews.llvm.org/D72451	2020-01-16 15:24:54 +00:00
Francesco Petrogalli	f6e39fe1d1	[VectorUtils] Rework the Vector Function Database (VFDatabase). Summary: This commit is a rework of the patch in https://reviews.llvm.org/D67572. The rework was requested to prevent out-of-tree performance regression when vectorizing out-of-tree IR intrinsics. The vectorization of such intrinsics is enquired via the static function `isTLIScalarize`. For details, see the discussion in https://reviews.llvm.org/D67572. Reviewers: uabelho, fhahn, sdesmalen Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72734	2020-01-16 15:08:26 +00:00
Nico Weber	6bab3f4983	Make lld cmake not compute commit revision twice r354605 moved LLD to the unified revision handling introduced in rL353268 / r352729 and removed uses of LLD_REPOSITORY_STRING and LLD_REVISION_STRING. After this change, we no longer compute the (now-unused) values of these two variables. Since this removes the only use of llvm/utils/GetRepositoryPath, remove that too (it's redundant with the system added in r354605). While here, also remove LLD_VERSION_MAJOR and LLD_VERSION_MINOR. Their uses were removed in r285163. Also remove LLD_VERSION from Version.inc which as far as I can tell has been unused since the file was added in r219277. No behavior change. Differential Revision: https://reviews.llvm.org/D72803	2020-01-16 09:55:36 -05:00
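
The C++14 caveat described in commit 6bae49b565 above (a `static constexpr` data member is only a declaration, so odr-using it needs an out-of-line definition) can be illustrated with a minimal sketch. `ExampleTraits`, `lowBitMask`, and `clampedBits` are hypothetical names invented for this illustration; this is not LLVM's actual `PointerLikeTypeTraits` code, only an assumption about the general shape of such a trait class.

```cpp
#include <algorithm>
#include <cstdint>

// Hypothetical trait class for illustration only (not LLVM's real traits).
template <typename T> struct ExampleTraits;

template <> struct ExampleTraits<std::uint32_t *> {
  // In C++14 this is a declaration, not a definition. No out-of-line
  // definition is provided, mirroring the situation the commit describes.
  static constexpr int NumLowBitsAvailable = 2;
};

// Not an odr-use: the value is read inside a constant expression.
static_assert(ExampleTraits<std::uint32_t *>::NumLowBitsAvailable == 2,
              "expected two free low bits");

// Not an odr-use either: the constant is only read by value.
constexpr std::uintptr_t lowBitMask() {
  return (std::uintptr_t(1)
          << ExampleTraits<std::uint32_t *>::NumLowBitsAvailable) - 1;
}

// Passing the member directly to std::min would bind it to a const
// reference, which is an odr-use and may fail to link under C++14.
// Copying it into a local first reads the value without odr-using it.
inline int clampedBits(int limit) {
  const int bits = ExampleTraits<std::uint32_t *>::NumLowBitsAvailable;
  return std::min(bits, limit);
}
```

Under C++17 and later, `static constexpr` data members are implicitly inline, so the copy in `clampedBits` is no longer needed there; the sketch only matters for a C++14 build such as the one the commit message refers to.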


190222 Commits