llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 13:11:39 +01:00

Author	SHA1	Message	Date
Mircea Trofin	724ebfc377	[NFC] Use Register/MCRegister Differential Revision: https://reviews.llvm.org/D90724	2020-11-04 12:20:17 -08:00
Craig Topper	3631995968	[RISCV] Remove assertsexti32 from fslw/fsrw isel patterns. The operations in these patterns shouldn't be affected by sign bits. And the pattern is starting from a sign_extend_inreg so we aren't expecting sign bits to be passed through either. Differential Revision: https://reviews.llvm.org/D90739	2020-11-04 11:37:58 -08:00
Nikita Popov	41412f444d	[MemorySSA] Use provided memory location even if instruction is call If getClobberingMemoryAccess() is called with an explicit MemoryLocation, but the starting access happens to be a call, the provided location is currently ignored, and alias analysis queries will be performed against the call instruction instead. Something similar happens if the starting access is a load with a MemoryDef. Change the implementation to not set Q.Inst in the first place if we want to perform a MemoryLocation-based query, to make sure it can't be turned into an Instruction-based query along the way... Additionally, remove the special handling that lifetime.start intrinsics currently get. They simply report NoAlias for clobbers between lifetime.start and other calls, but that's obviously not right if the other call is something like a memset or memcpy. The default behavior we get from getModRefInfo() will already do the right thing here. Differential Revision: https://reviews.llvm.org/D88782	2020-11-04 20:30:22 +01:00
Steven Wan	8a4871ee91	Add info about the cherry-picked commit and contributor	2020-11-04 14:23:27 -05:00
Steven Wan	54d3d00751	[PowerPC] Rename mftbl to mftb `mftb` and `mftbl` are equivalent, there is no need to have two names for doing the same thing, rename `mftbl` to only have `mftb`. Differential Revision: https://reviews.llvm.org/D89506	2020-11-04 14:23:27 -05:00
Craig Topper	1d80c19eed	[RISCV] Correct the operand order for fshl/fshr to fsl/fsr instructions. fsl/fsr take their shift amount in $rs2 or an immediate. The sources are $rs1 and $rs3. fshl/fshr ISD opcodes both concatenate operand 0 in the high bits and operand 1 in the lower bits. fshl returns the high bits after shifting and fshr returns the low bits. So a shift amount of 0 returns operand 0 for fshl and operand 1 for fshr. fsl/fsr concatenate their operands in different orders such that $rs1 will be returned for a shift amount of 0. So $rs1 needs to come from operand 0 of fshl and operand 1 of fshr. Differential Revision: https://reviews.llvm.org/D90735	2020-11-04 11:13:25 -08:00
Fraser Cormack	5f7fe12cc6	[DAGCombine] Fix bug in load scalarization Summary: For vector element types which are not byte-sized, we would generate incorrect scalar offsets and produce incorrect codegen. This optimization could potentially be supported in the future, e.g. by loading in bytes, then shifting and masking out the remaining bits of the vector element. However, without an upstream target to test against it's best to avoid the bad codegen in the simplest possible way. Related to this bug: https://bugs.llvm.org/show_bug.cgi?id=27600 Reviewed by: foad Differential Revision: https://reviews.llvm.org/D78568	2020-11-04 19:02:40 +00:00
Craig Topper	4bde11ee29	[RISCV] Remove assertsexti32 from inputs to riscv_sllw/srlw nodes in B extension isel patterns. riscv_sllw/srlw only read the lower 32 bits of the first operand and the lower 5 bits of the second operand. Whether the upper 32 bits of the input are sign bits or not doesn't matter. Also use ineg and not to shorten the patterns. Differential Revision: https://reviews.llvm.org/D90668	2020-11-04 10:35:05 -08:00
Arnold Schwaighofer	d90984c1dd	Start of an llvm.coro.async implementation This patch adds the `async` lowering of coroutines. This will be used by the Swift frontend to lower async functions. In contrast to the `retcon` lowering the frontend needs to be in control over control-flow at suspend points as execution might be suspended at these points. This is very much work in progress and the implementation will change as it evolves with the frontend. As such the documentation is lacking detail as some of it might change. rdar://70097093 Reapply with fix for memory sanitizer failure and sphinx failure. Differential Revision: https://reviews.llvm.org/D90612	2020-11-04 10:29:21 -08:00
Craig Topper	f850868dc3	[RISCV] Check all 64 bits of the mask in SelectRORIW. We need to ensure the upper 32 bits of the mask are zero so that the srl shifts zeroes into the lower 32 bits. Differential Revision: https://reviews.llvm.org/D90585	2020-11-04 10:15:30 -08:00
Christopher Tetreault	78607d1352	[UBSan] Cannot negate smallest negative signed integer Silence Undefined Behavior Sanitizer warning: runtime error: negation of -9223372036854775808 cannot be represented in type 'int64_t' (aka 'long'); cast to an unsigned type to negate this value to itself Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D90710	2020-11-04 10:07:52 -08:00
Craig Topper	7eeba8eadb	[RISCV] Remove custom isel for (srl (shl val, 32), imm). Use pattern instead. NFCI We don't need custom matching, we just need a predicate to check the immediate is greater than 32. We can use the existing ImmSub32 to adjust the immediate. I've also used the new predicate in the other location that used ImmSub32. I tried to create a test case where we would break without the greater than 32 check on that pattern, but DAG combine defeated me. Still seemed safer to have it. Differential Revision: https://reviews.llvm.org/D90546	2020-11-04 09:59:14 -08:00
Joe Nash	3dea5f368c	[AMDGPU] Resolve pseudo registers at encoding uses Pseudo-registers allow different register encodings between gpu generations. Make sure we resolve the pseudo regs to real regs whenever we get their hardware encoding. Using the correct encodings revealed a register bank conflict and an unnecessary write dependency. Tests have been updated to match. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D90721 Change-Id: I73c154cd24aecc820993b50bebaf4df97a5710ca	2020-11-04 12:52:32 -05:00
Fangrui Song	032dc68e5b	Revert "[GlobalISel] GISelKnownBits::computeKnownBitsImpl - Replace TargetOpcode::G_MUL handling with the common KnownBits::computeForMul implementation" This reverts commit 0b8711e1af97d6c82dc9d25c12c5a06af060cc56 which broke GlobalISelTests AArch64GISelMITest.TestKnownBits	2020-11-04 09:54:04 -08:00
Arthur Eubanks	2e4e41af20	[NewPM] Don't run before pass instrumentation on required passes This allows those instrumentation to log when they decide to skip a pass. This provides extra helpful info for optnone functions and also will help with opt-bisect. Have OptNoneInstrumentation print when it skips due to seeing optnone. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D90545	2020-11-04 09:45:10 -08:00
Sebastian Neubauer	0003aeadad	[AMDGPU] Fix iterating in SIFixSGPRCopies The insertion of waterfall loops splits the current basic block into three blocks. So the basic block that we iterate over must be updated. This failed assert(!NodePtr->isKnownSentinel()) in ilist_iterator for divergent calls in branches before. Differential Revision: https://reviews.llvm.org/D90596	2020-11-04 18:43:19 +01:00
Fangrui Song	c248aad5ed	[llvm-objcopy] Make --set-section-flags work with --add-section This matches the behavior of GNU objcopy and can simplify clang-offload-bundler (which currently works around the issue by invoking llvm-objcopy twice). Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D90438	2020-11-04 09:39:14 -08:00
Alexander Shaposhnikov	57690ddfcb	[llvm-objcopy][MachO] Make isValidMachOCannonicalName static This diff makes the function isValidMachOCannonicalName static. NFC. Test plan: make check-all	2020-11-04 09:37:29 -08:00
Simon Pilgrim	aff03bd34f	[KnownBits] KnownBits::computeForMul - avoid unnecessary APInt copies. NFCI. Use const references instead.	2020-11-04 17:25:25 +00:00
Simon Pilgrim	0aa6d48258	[GlobalISel] GISelKnownBits::computeKnownBitsImpl - Replace TargetOpcode::G_MUL handling with the common KnownBits::computeForMul implementation Avoid code duplication	2020-11-04 17:25:24 +00:00
Arnold Schwaighofer	c8e9566a32	Revert "Start of an llvm.coro.async implementation" This reverts commit ea606cced0583d1dbd4c44680601d1d4e9a56e58. This patch causes memory sanitizer failures sanitizer-x86_64-linux-fast.	2020-11-04 08:26:20 -08:00
LLVM GN Syncbot	d609931b08	[gn build] Port d1b2a523191	2020-11-04 15:36:49 +00:00
Arnold Schwaighofer	3e8facdd39	Start of an llvm.coro.async implementation This patch adds the `async` lowering of coroutines. This will be used by the Swift frontend to lower async functions. In contrast to the `retcon` lowering the frontend needs to be in control over control-flow at suspend points as execution might be suspended at these points. This is very much work in progress and the implementation will change as it evolves with the frontend. As such the documentation is lacking detail as some of it might change. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90612	2020-11-04 07:32:29 -08:00
Eric Astor	5e9623a87c	[ms] [llvm-ml] Enable support for MASM-style macro procedures Allows the MACRO directive to define macro procedures with parameters and macro-local symbols. Supports required and optional parameters (including default values), and matches ml64.exe for its macro-local symbol handling (up to 65536 macro-local symbols in any translation unit). Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D89729	2020-11-04 10:29:57 -05:00
Simon Pilgrim	fab109bb67	Use isa<> instead of dyn_cast<> to avoid unused variable warning. NFCI.	2020-11-04 15:26:32 +00:00
Simon Pilgrim	afe668e17d	Fix gcc braces warning. NFCI. gcc warns that the EXPECT_TRUE macro isn't surrounded by if() {} - we already do this in other cases in the file.	2020-11-04 15:26:32 +00:00
Paul C. Anagnostopoulos	9295b21984	[TableGen] Add !interleave operator to concatenate a list of values with delimiters Add a test. Use it in some TableGen files. Differential Revision: https://reviews.llvm.org/D90469	2020-11-04 09:23:54 -05:00
Paul C. Anagnostopoulos	115a197e56	[TableGen] [IR] Eliminate unnecessary recursive help class. Differential Revision: https://reviews.llvm.org/D90532	2020-11-04 09:18:09 -05:00
Sanjay Patel	75c976d15a	[InstSimplify] allow vector folds for icmp Pred (1 << X), 0x80	2020-11-04 08:12:48 -05:00
Sanjay Patel	9993666887	[InstSimplify] add vector cmp tests; NFC	2020-11-04 08:12:47 -05:00
Roman Lebedev	a30754006b	[Reassociate] Guard `add`-like `or` conversion into an `add` with profitability check This is slightly better compile-time wise, since we avoid potentially-costly knownbits analysis that will ultimately not allow us to actually do anything with said `add`.	2020-11-04 16:10:34 +03:00
Clement Courbet	672cb989aa	[llvm-exegesis] Fix rGaf658d920e2b Add missing header. ``` ../../llvm/tools/llvm-exegesis/lib/X86/Target.cpp(606,14): error: use of undeclared identifier '__readeflags' Eflags = __readeflags(); ```	2020-11-04 13:23:34 +01:00
LLVM GN Syncbot	05c2fcc211	[gn build] Port 73b6cb67dcd	2020-11-04 12:00:24 +00:00
LLVM GN Syncbot	d19e12083d	[gn build] Port 1124bf4ab77	2020-11-04 12:00:24 +00:00
Nico Weber	7fc5c0fdb0	[gn build] try to port 707d69ff32309b	2020-11-04 07:00:05 -05:00
Simon Moll	1caff81b49	[VE] Add +vpu attribute `+vpu` controls whether VEISelLowering adds any vregs. This defaults to `-vpu` to have scalar code generation out of the box. We bring up vector isel under the `+vpu` flag. Once vector isel is stable we switch to `+vpu` and advertise vregs and vops in TTI. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90465	2020-11-04 12:42:00 +01:00
Kerry McLaughlin	1de78af5c4	[SVE][CodeGen] Lower scalable integer vector reductions This patch uses the existing LowerFixedLengthReductionToSVE function to also lower scalable vector reductions. A separate function has been added to lower VECREDUCE_AND & VECREDUCE_OR operations with predicate types using ptest. Lowering scalable floating-point reductions will be addressed in a follow up patch, for now these will hit the assertion added to expandVecReduce() in TargetLowering. Reviewed By: paulwalker-arm Differential Revision: https://reviews.llvm.org/D89382	2020-11-04 11:38:49 +00:00
Simon Pilgrim	ad61fbdaba	[DAG] computeKnownBits - Replace ISD::MUL handling with the common KnownBits::computeForMul implementation	2020-11-04 11:32:08 +00:00
Sebastian Neubauer	7ea56efeb5	[AMDGPU] Set rsrc1 flags for graphics shaders Before they were only set for compute kernels and compute shaders but not for other shaders. Differential Revision: https://reviews.llvm.org/D89399	2020-11-04 12:25:41 +01:00
Sebastian Neubauer	537305eda7	[AMDGPU] Fix ieee mode default value Previously, the default value for ieee mode was - on for compute kernels and compute shaders, - off for all shaders except compute shaders. This commit changes the default to be - on for compute kernels, - off for shaders. This aligns the default value with the settings that are actually in use. To my knowledge, all users of shader calling conventions (mesa and llpc) disable the ieee mode by default. Differential Revision: https://reviews.llvm.org/D89388	2020-11-04 12:25:38 +01:00
Stefan Gränitz	0fe835476e	[JITLink][ELF] Omit temporary labels in tests Oneshot temporary labels for declaring function size can be omitted. Follow-up from D90331. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D90676	2020-11-04 10:03:15 +00:00
Clement Courbet	fab7d28218	[llvm-exegesis][X86] Save and restore eflags. This is needed to benchmark instructions that touch EFLAGS (e.g. STD: set direction flag). Differential Revision: https://reviews.llvm.org/D90742	2020-11-04 10:44:15 +01:00
Vitaly Buka	7d04bf155f	[sanitizer] Remove ANDROID_NDK_VERSION	2020-11-04 01:15:25 -08:00
Clement Courbet	d7751be085	[llvm-exegesis] Fix unused variable warning.	2020-11-04 10:09:50 +01:00
David Green	2f8823c9c5	[ARM] Remove unused variable. NFC	2020-11-04 09:00:03 +00:00
Sander de Smalen	ca12e64408	[NFCI] Replace AArch64StackOffset by StackOffset. This patch replaces the AArch64StackOffset class by the generic one defined in TypeSize.h. Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D88983	2020-11-04 08:49:00 +00:00
Clement Courbet	3104bbde01	Re-land "[llvm-exegesis] Save target state before running the benchmark." The X86 exegesis target is never run on non-X86 hosts; disable X86 intrinsic code on non-X86 targets. This reverts commit 8cfc872129a99782ab07a19171bf8eace85589ae.	2020-11-04 09:46:55 +01:00
Fangrui Song	878101132b	[DebugInfo] Delete unused DwarfUnit::addConstantFPValue & addConstantValue overloads. NFC These functions appear to have been unused for many years.	2020-11-04 00:05:57 -08:00
Clement Courbet	932b6b828d	Revert "Re-land "[llvm-exegesis] Save target state before running the benchmark." Still issues on some architectures. This reverts commit fd13d7ce09af2bcad6976b8f5207874992bdd908.	2020-11-04 08:48:44 +01:00
Clement Courbet	b8e79488b4	Re-land "[llvm-exegesis] Save target state before running the benchmark." Use `__builtin_ia32_fxsave64` under __GNUC__, (_fxsave64) does not exist in old versions of gcc (pre-9.1). This reverts commit e128f9cafca4e72b089fcd1381af5a1ec656d987.	2020-11-04 08:34:33 +01:00
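
The fshl/fshr semantics described in commit 1d80c19eed (operand 0 concatenated in the high bits, operand 1 in the low bits, with fshl returning the high half and fshr the low half) can be illustrated with a minimal Python sketch. This is not LLVM code: the function names, the 32-bit default width, and the sample values are assumptions chosen for illustration, and the shift amount is taken modulo the width, as for LLVM's funnel-shift operations.

```python
def fshl(op0, op1, amt, width=32):
    """Funnel shift left: concatenate op0 (high) and op1 (low),
    shift left by amt, and return the high `width` bits."""
    mask = (1 << width) - 1
    amt %= width  # shift amount is interpreted modulo the bit width
    concat = ((op0 & mask) << width) | (op1 & mask)
    return ((concat << amt) >> width) & mask


def fshr(op0, op1, amt, width=32):
    """Funnel shift right: concatenate op0 (high) and op1 (low),
    shift right by amt, and return the low `width` bits."""
    mask = (1 << width) - 1
    amt %= width
    concat = ((op0 & mask) << width) | (op1 & mask)
    return (concat >> amt) & mask


if __name__ == "__main__":
    a, b = 0x12345678, 0x9ABCDEF0
    # A shift amount of 0 returns operand 0 for fshl and operand 1 for fshr.
    print(hex(fshl(a, b, 0)), hex(fshr(a, b, 0)))
    # A non-zero shift pulls bits across the boundary between the operands.
    print(hex(fshl(a, b, 8)), hex(fshr(a, b, 8)))
```

The zero-shift case is the asymmetry the commit message relies on: since fsl/fsr both return $rs1 for a shift amount of 0, $rs1 has to be fed from operand 0 of fshl but from operand 1 of fshr.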

206303 Commits