While we can do that without increasing the instruction count,
if the old `sub` sticks around then the transform is not only
unlikely to be a win, but a likely regression, since we have now
extended the live range and use count of both of the `sub`'s operands,
as opposed to just the result of the `sub`.
As Kostya Serebryany notes in post-commit review in
https://reviews.llvm.org/D68408#1998112,
this can indeed degrade the final assembly,
increasing register pressure and spilling.
This isn't what we want here,
so at least for now let's guard it with a use check.
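A minimal sketch of the kind of guard this means, assuming an
InstCombine-style fold where `Sub` is the matched `sub` instruction
(the surrounding names are illustrative):
```
// Only rewrite in terms of the sub's operands when the sub itself will go
// away; otherwise we keep the sub and also extend its operands' live ranges.
if (!Sub->hasOneUse())
  return nullptr;
```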
This was backwards from what was intended and was missing a test. We should
perhaps just ignore the FP mode here, since it shouldn't be legal to mix code
with different default modes in the absence of strictfp.
If f32 denormals were enabled pre-gfx9, we would still try to
implement this with v_max_f32. Pre-gfx9, these instructions ignored
the denormal mode and did not flush. Switch to the multiply form for
f32 as a workaround which should always work in any case.
This fixes conformance failures when the library implementation of
fmin/fmax was accidentally not inlined, forcing the assumption of no
flushing on targets where denormals are not enabled by default. This
is a workaround, since really we should not be mixing code with
different FP mode expectations, but we should prefer the lowering
that will work in any mode.
Ideally this would always use max to implement canonicalize on gfx9+. That
is only really beneficial for f64. For f32/f16 it's a neutral choice
(and worse in terms of code size in one case), but possibly worse for
the compiler since it does add an extra register use operand. Leave
that change for later.
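For reference, a minimal sketch of the multiply-form idea mentioned above,
written as a generic SelectionDAG-style lowering (DAG, SL, VT, and Src are
assumed from surrounding code; this is illustrative, not the exact patch):
```
// canonicalize(x) implemented as x * 1.0: unlike pre-gfx9 v_max_f32, the
// multiply respects the denormal mode, so it works on all generations.
SDValue One = DAG.getConstantFP(1.0, SL, VT);
return DAG.getNode(ISD::FMUL, SL, VT, Src, One);
```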
LC_LOAD_WEAK_DYLIB is analogous to LC_LOAD_DYLIB and doesn't require any special handling.
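Illustratively (the dispatch below is a hypothetical sketch, not the actual
code), the weak variant simply takes the same path as the regular load
command:
```
// Cmd is a MachO::LoadCommandType; handleDylibCommand is hypothetical.
switch (Cmd) {
case MachO::LC_LOAD_DYLIB:
case MachO::LC_LOAD_WEAK_DYLIB: // analogous to LC_LOAD_DYLIB
  handleDylibCommand(LC);
  break;
default:
  break;
}
```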
Test plan: make check-all
Differential revision: https://reviews.llvm.org/D78602
Summary:
llvm::getInlineCost starts off by determining whether inlining should
or shouldn't happen because of user directives or easily determinable
unviability. This CL refactors that functionality into a reusable API.
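As a sketch of the intended usage (getEarlyInliningDecision and
computeFullInlineCost are placeholder names for illustration, not the exact
API introduced here):
```
// Ask the cheap directive/viability check first; only run the full cost
// analysis when it gives no definitive answer.
if (auto EarlyDecision = getEarlyInliningDecision(CB, Callee, CalleeTTI))
  return *EarlyDecision; // always-inline or never-inline, decided up front
return computeFullInlineCost(CB, Callee, Params);
```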
Reviewers: davidxl, eraman
Reviewed By: davidxl, eraman
Subscribers: hiraditya, haicheng, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D73825
Summary:
Piggy-back off of TypeSize's STRICT_FIXED_SIZE_VECTORS flag and:
- if it is defined, assert that the vector is not scalable
- if it is not defined, complain if the vector is scalable
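In code terms, roughly (a simplified sketch; the assertion text and the
diagnostic helper are assumptions):
```
#ifdef STRICT_FIXED_SIZE_VECTORS
  assert(!VT.isScalableVector() &&
         "fixed-size request on a scalable vector");
#else
  if (VT.isScalableVector())
    reportScalableVectorMisuse(); // hypothetical diagnostic/complaint helper
#endif
```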
Reviewers: efriedma, sdesmalen, c-rhodes
Reviewed By: sdesmalen
Subscribers: hiraditya, mgorny, tschuett, rkruppe, psnobl, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D78576
1. Use Subtarget.isUsingPCRelativeCalls() in LowerConstantPool to
check whether PC-relative addressing is being used.
2. Change MO_GOT_FLAG = 32 to MO_GOT_FLAG = 8 in PPC.h to use
consecutive bits.
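For illustration only (the values other than MO_GOT_FLAG are assumptions and
may not match PPC.h exactly), the point is that these operand target flags
are single-bit masks that should occupy consecutive bits:
```
// Hypothetical layout: each target flag is its own bit, with no gaps.
enum {
  MO_PLT        = 1 << 0,
  MO_PIC_FLAG   = 1 << 1,
  MO_PCREL_FLAG = 1 << 2,
  MO_GOT_FLAG   = 1 << 3, // 8, rather than the previous 32
};
```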
Differential Revision: https://reviews.llvm.org/D78406
We now use the argparse Action objects to determine the names of the flags.
This fixes cases where the key for the stored result ('dest') is not the
same as the command-line flag (e.g. --enable/--disable).
Also add a test that --disabled can be part of the initial UTC_ARGS.
This is split out from D78478
Reviewed By: jdoerfert
Differential Revision: https://reviews.llvm.org/D78617
Using getValueType() is not correct for architectures extended with CHERI,
since we need a pointer type and not the type of the value that is loaded.
While the stack protector is useless when you have CHERI (since CHERI
provides much stronger security guarantees), we still have a test to check
that we can generate correct code for the checks. Merging
b281138a1b67ca4405b77d774adc3de72742e7a2 into our tree broke this test.
Fix it by using TLI.getFrameIndexTy().
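A minimal sketch of the shape of the fix in a SelectionDAG lowering context
(FIN, DAG, and TLI are assumed from the surrounding stack-protector code):
```
// Type the frame-index node with the target's frame-index/pointer type,
// not with the type of the value being loaded; on CHERI these differ.
MVT FIVT = TLI.getFrameIndexTy(DAG.getDataLayout());
SDValue FI = DAG.getFrameIndex(FIN->getIndex(), FIVT);
```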
Reviewed By: arsenm
Differential Revision: https://reviews.llvm.org/D77785
This change improves the `dsymutil` documentation.
- Add missing options
- Re-arrange options in alphabetical order
- Wrap inline options in double backquotes
- `-v` is for `--version`, not `--verbose`
Reviewed By: JDevlieghere
Differential Revision: https://reviews.llvm.org/D78479
It currently requires that the condition match the shape of the selected value, but this is only really useful for things like masks. This revision allows for the use of i1 to mean that all of the vector/tensor is selected. This also matches the behavior of LLVM select. A benefit of this change is that transformations that want to generate selects, like those on the CFG, don't have to special case vector/tensor. Previously the only way to generate a select from an i1 was to use a splat, but that doesn't support dynamically shaped/unranked tensors.
Differential Revision: https://reviews.llvm.org/D78690
This revision adds support for canonicalizing the following:
```
cond_br %cond, ^bb1(A, ..., N), ^bb1(A, ..., N)
br ^bb1(A, ..., N)
```
If the operands to the successor are different and the cond_br is the only predecessor, we emit selects for the branch operands.
```
cond_br %cond, ^bb1(A), ^bb1(B)
%select = select %cond, A, B
br ^bb1(%select)
```
Differential Revision: https://reviews.llvm.org/D78682
This avoids more long lists of register classes that have to be updated
every time we add a new one. NFC.
Differential Revision: https://reviews.llvm.org/D78570
The motivation is to be able to play with the option and change it if required.
Reviewers: fedor.sergeev, apilipenko, rnk, jdoerfert
Reviewed By: fedor.sergeev
Subscribers: hiraditya, dantrushin, llvm-commits
Differential Revision: https://reviews.llvm.org/D78624
This patch adds VPValue versions of the instruction operands to
VPWidenRecipe and uses them during code generation.
Similar to D76373, this reduces ingredient def-use usage by the ILV as
a step towards full VPlan-based def-use relations.
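Roughly the direction of the change, as a heavily simplified sketch (the
recipe and state names are assumptions and may not match the patch exactly):
code generation reads the recipe's own VPValue operands instead of reaching
back to the underlying IR instruction's operands.
```
// Inside the recipe's code generation: build the widened operand list from
// the recipe's VPValue operands rather than from the original Instruction.
SmallVector<Value *, 4> WidenedOps;
for (VPValue *Op : Recipe.operands())
  WidenedOps.push_back(State.get(Op, Part)); // per-part value per VPValue
```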
Reviewers: rengolin, Ayal, gilr
Reviewed By: gilr
Differential Revision: https://reviews.llvm.org/D76992
12994a70cf7 did this for 128-bit classes:
SGPR_128 only includes the real allocatable SGPRs, and SReg_128 adds
the additional non-allocatable TTMP registers. There's no point in
allocating SReg_128 vregs. This shrinks the size of the classes
regalloc needs to consider, which is usually good.
This patch extends it to all classes > 64 bits, for consistency.
Differential Revision: https://reviews.llvm.org/D78622
This patch changes the FP conversion intrinsics to take a predicate
that matches the number of lanes for the vector with the widest element
type as opposed to using <vscale x 16 x i1>.
For example:
```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f16(<vscale x 4 x float>, <vscale x 4 x i1>, <vscale x 8 x half>)```
now uses <vscale x 4 x i1> instead of <vscale x 16 x i1>
And similarly for:
```<vscale x 4 x float> @llvm.aarch64.sve.fcvt.f32f64(<vscale x 4 x float>, <vscale x 2 x i1>, <vscale x 2 x double>)```
where the predicate now matches the wider type, so <vscale x 2 x i1>.
Reviewers: efriedma, SjoerdMeijer, paulwalker-arm, rengolin
Reviewed By: efriedma
Tags: #clang
Differential Revision: https://reviews.llvm.org/D78402
It allows us to simplify the current code and
might also help the surrounding code.
It is also consistent with what we do for other headers,
e.g. section headers, the ELF file header, etc.
Differential revision: https://reviews.llvm.org/D78627