llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Douglas Yung	5f54866a8c	Add requirement for aarch64-registered-target to test change added in 42e3f97a9dd3a439f63a733c4ee909cba6b77e49.	2021-03-09 12:26:15 -08:00
George Balatsouras	96bb976c88	[dfsan] Update store.ll test This removes hard-coded shadow width references and adds more RUN lines to increase test coverage under different options (fast16 labels mode). Also, shortens the test by unifying common lines under both combine- and no-combine-ptr-label options. Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D98227	2021-03-09 12:21:29 -08:00
Philip Reames	fedce4ad20	[cgp] improve robustness of uadd/usub transforms LSR prefers to schedule iv increments just before the latch. The recent 80511565 broadened this to moving increments in the original IR. This pointed out a robustness problem with the CGP transform. When we have a use of an induction increment outside of the loop (we canonicalize away from this form, but it happens e.g. unanalyzeable loops) we'd avoid performing the uadd/usub transform. Interestingly, all of these involve moving the increment closer to it's operands, so there's no concern about dominating all uses. We can handle that case cheaply, resulting in a more robust transform.	2021-03-09 11:52:08 -08:00
Philip Reames	72e71e056e	[tests] precommit test for an upcoming change	2021-03-09 11:52:07 -08:00
Amara Emerson	bbe082f248	[AArch64][GlobalISel] Form G_DUPLANE32 for <2 x s32> shufflevectors in lowering. For <2 x s32>, we can use G_DUPLANE32, but with a <4 x s32> source. To make it work, we can just widen the original source with a concat_vectors. Doing this allows <2 x float> indexed fmul instruction selection patterns to fire, which gives a nice 0.3% code size saving on Bullet with -Os. Differential Revision: https://reviews.llvm.org/D98059	2021-03-09 11:36:26 -08:00
Amara Emerson	ef0ba5e19d	[GlobalISel] Fold away G_BUILD_VECTOR with all elements extracted. If every element is extracted from a G_BUILD_VECTOR, pass through the source registers. This is different to the extract(build_vector) combine because this one tolerates multiple users as long as they're exhaustive. Differential Revision: https://reviews.llvm.org/D97890	2021-03-09 11:34:26 -08:00
Philip Reames	794f535aaa	[cgp] group related code together [nfc]	2021-03-09 11:23:15 -08:00
Amara Emerson	a3470733b1	[AArch64][GlobalISel] Add combine for extract_vector_elt(build_vector, cst) Differential Revision: https://reviews.llvm.org/D97835	2021-03-09 11:08:02 -08:00
Jay Foad	75fd560edb	[AMDGPU] Refactor AMDGPUTargetStreamer::EmitCodeEnd Refactor and add comments to explain where the magic numbers come from in terms of the instruction cache line size. NFC. Differential Revision: https://reviews.llvm.org/D98266	2021-03-09 19:02:18 +00:00
gbtozers	0b8acbe1f0	[DebugInfo] Process DBG_VALUE_LIST in LiveDebugValues This patch implements DBG_VALUE_LIST handling to the LiveDebugValues pass. This is a substantial change, and makes a few fundamental changes to the existing logic. We still use the basic model of a VarLocMap that is indexed by a LocIndex, with a VarLocSet (a CoalescingBitVector underneath) giving us efficient lookups of existing variable locations for a given location type. The main change is that the VarLocMap may contain a given VarLoc multiple times (once for each unique location operand), so that a VarLoc can be looked up from any of the registers that it uses. This means that each VarLoc has multiple corresponding LocIndexes; to allow us to iterate through the set of VarLocs (previously we would iterate through the VarLocSet), we now also maintain a single entry in the VarLocMap that contains every VarLoc exactly once. The VarLoc class itself is also changed; this change is much simpler, refactoring out location-specific members into a MachineLocation class and adding a vector of these locations. Differential Revision: https://reviews.llvm.org/D83890	2021-03-09 18:58:26 +00:00
Markus Böck	5ca102f600	[Support][test] Unconditionally use setenv macro when compiling on Windows This test currently fails to compile when using a MinGW toolchain as setenv is not defined. This function is a POSIX function Windows does not implement. This patch enables the setenv macro used in the unit test for all of Windows, making the test compile and run successfully. Differential Revision: https://reviews.llvm.org/D98271	2021-03-09 19:53:22 +01:00
Nikita Popov	d8092ee676	[cmake] Link socket/nsl on SunOS in llvm-jitlink llvm-jitlink and llvm-jitlink-executor make use of APIs that are part of the socket and nsl libraries on SunOS systems (Solaris and Illumos). Make sure they get linked. Ran into this in Rust CI when cross-compiling LLVM 12 to these targets. Differential Revision: https://reviews.llvm.org/D97633	2021-03-09 19:04:59 +01:00
Craig Topper	48f016a681	[RISCV] Add support for VECTOR_REVERSE for scalable vector types. I've left mask registers to a future patch as we'll need to convert them to full vectors, shuffle, and then truncate. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D97609	2021-03-09 10:03:45 -08:00
Amara Emerson	f9107e0902	[AArch64][GlobalISel] Lower scalar G_{SMIN, SMAX, UMIN, UMAX}.	2021-03-09 10:03:16 -08:00
Fangrui Song	9bcdc90dd9	[MC] Change ELFOSABI_NONE to ELFOSABI_GNU for SHF_GNU_RETAIN GNU ld does not give SHF_GNU_RETAIN GC root semantics for ELFOSABI_NONE. (https://sourceware.org/pipermail/binutils/2021-March/115581.html) This allows GNU ld to interpret SHF_GNU_RETAIN and avoids a gold quirk https://sourceware.org/bugzilla/show_bug.cgi?id=27490 Because ELFObjectWriter is in an anonymous namespace, I have to place `markGnuAbi` in the parent MCObjectWriter. Differential Revision: https://reviews.llvm.org/D97976	2021-03-09 09:59:47 -08:00
Christudasan Devadasan	adea7a264e	[AMDGPU] Fix the dead frame indices during custom spill lowering. AMDGPU target tries to handle the SGPR and VGPR spills in a custom pass before the actual frame lowering pass. Once they are handled and the respective frames are eliminated in the custom pass, certain uses of them still remain. For instance, the DBG_VALUE instructions inserted by the allocator alongside the spill instruction will use the corresponding frame index. They become dead later during PEI and causes a crash while trying to replace the frame indices. We should possibly avoid this custom pass. For now, replacing such dead references with null register value. Reviewed By: arsenm, scott.linder Differential Revision: https://reviews.llvm.org/D98038	2021-03-09 23:22:49 +05:30
Nikita Popov	5db4fbc8f1	[FastISel] Don't trivially kill extractvalues (PR49467) All extractvalues of the same value at the same index will map to the same register, so even if one specific extractvalue only has one use, we should not mark it as a trivial kill, as there may be more extractvalues later. Fixes https://bugs.llvm.org/show_bug.cgi?id=49467. Differential Revision: https://reviews.llvm.org/D98145	2021-03-09 18:46:38 +01:00
gbtozers	3089bda8b6	[DebugInfo] Add replaceArg function to simplify DBG_VALUE_LIST expressions The LiveDebugValues and LiveDebugVariables implementations for handling DBG_VALUE_LIST instructions can be simplified significantly if they do not have to deal with any duplicated operands, such as a DBG_VALUE_LIST that uses the same register multiple times in its expression. This patch adds a function, replaceArg, that can be used to simplify a DIExpression in the case of duplicated operands. Differential Revision: https://reviews.llvm.org/D83896	2021-03-09 17:41:04 +00:00
Craig Topper	17152b7023	[RISCV] Add support for fixed vector reductions. I've included tests that require type legalization to split the vector. The i64 version of these scalarizes on RV32 due to type legalization visiting the result before the vector type. So we have to abort our custom expansion to avoid creating target specific nodes with an illegal type. Then type legalization ends up scalarizing. We might be able to fix this by doing custom splitting for large vectors in our handler to get down to a legal type. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D98102	2021-03-09 09:39:59 -08:00
Dave Lee	57a42af2d7	Revert "[build][modules] Fix ObjCARCUtil.h modularization" This reverts commit f1b690598eeca0568e39134f28e0a59c1c1f3598.	2021-03-09 09:36:47 -08:00
Craig Topper	0a037e9e43	[RISCV] Don't modify the SEW immediate on the V extension pseudo instructions after inserting VSETVLI. Previously we set the value to -1, but the SEW information could be useful for scheduling. Reviewed By: frasercrmck, rogfer01 Differential Revision: https://reviews.llvm.org/D98062	2021-03-09 09:02:19 -08:00
Craig Topper	771f76cbf1	[RISCV] Optimize fixed vector ABS. Fix crash on scalable vector ABS for SEW=64 with RV32. The default fixed vector expansion uses sra+xor+add since it can't see that smax is legal due to our custom handling. So we select smax(X, sub(0, X)) manually. Scalable vectors are able to use the smax expansion automatically for most cases. It crashes in one case because getConstant can't build a SPLAT_VECTOR for nxvXi64 when i64 scalars aren't legal. So we manually emit a SPLAT_VECTOR_I64 for that case. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D97991	2021-03-09 08:51:03 -08:00
Craig Topper	233eb621e7	[RISCV] Make the hasStdExtM() check in RISCVInstrInfo::getVLENFactoredAmount emit a diagnostic rather than an assert. As far as I know we're not enforcing the StdExtM must be enabled to use the V extension. If we use an assert here and hit this code in a release build we'll silently emit an invalid instruction. By using a diagnostic we report the error to the user in release builds. I think there may still be a later fatal error from the code emitter though. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D97970	2021-03-09 08:50:02 -08:00
gbtozers	9b334d3086	[DebugInfo] Handle multiple variable location operands in IR This patch updates the various IR passes to correctly handle dbg.values with a DIArgList location. This patch does not actually allow DIArgLists to be produced by salvageDebugInfo, and it does not affect any pass after codegen-prepare. Other than that, it should cover every IR pass. Most of the changes simply extend code that operated on a single debug value to operate on the list of debug values in the style of any_of, all_of, for_each, etc. Instances of setOperand(0, ...) have been replaced with with replaceVariableLocationOp, which takes the value that is being replaced as an additional argument. In places where this value isn't readily available, we have to track the old value through to the point where it gets replaced. Differential Revision: https://reviews.llvm.org/D88232	2021-03-09 16:44:38 +00:00
Sanjay Patel	3977619fb1	[InstCombine] canonicalize 'not' op after min/max intrinsic This is another step towards parity between existing select transforms and min/max intrinsics (D98152).. The existing 'not' folds around select are complicated, so it's likely that we will need to enhance this, but this should be a safe step.	2021-03-09 11:33:28 -05:00
Sanjay Patel	4250b8d238	[InstCombine] add tests for min/max intrinsics with not+constant; NFC	2021-03-09 11:33:28 -05:00
Oliver Stannard	0ebf19c414	[ARM] Add comment explaining stack frame layout Add a comment explaining how we lay out stack frames for ARM targets, based on the existing one for AArch64. Also expand the comment to explain reserved call frames for both architectures. Differential revision: https://reviews.llvm.org/D98258	2021-03-09 15:20:32 +00:00
Simon Pilgrim	f790b7438f	[X86] canonicalizeShuffleWithBinOps - add binary shuffle handling	2021-03-09 13:57:03 +00:00
Sanjay Patel	ba30a3aa71	[InstCombine] fold min/max intrinsics with not ops This is a partial translation of the existing select-based folds. We need to recreate several different transforms to avoid regressions as noted in D98152. https://alive2.llvm.org/ce/z/teuZ_J	2021-03-09 08:55:48 -05:00
Liu, Chen3	d394018ed5	[X86][NFC] Move instruction selection of the x86_tdpb[s,u]d_internal and x86_tilezero_internal to X86InstrAMX.td Differential Revision: https://reviews.llvm.org/D97997	2021-03-09 21:27:39 +08:00
Stefan Gränitz	d33516d79a	[Orc] Always check mapped sections for ELFDebugObject are in bounds of working memory buffer As stated in the JITLink user guide: Do not assume that the input object is well formed. https://llvm.org/docs/JITLink.html#tips-for-jitlink-backend-developers	2021-03-09 14:01:50 +01:00
Stefan Gränitz	b4f49cdcc9	[Orc] Test DebugObjectManagerPlugin fills in load-address for .text section of in-memory ELF debug object	2021-03-09 14:01:50 +01:00
Stefan Gränitz	9a8ecaf741	[Orc] Fix working memory buffer for section mapping in ELFDebugObject	2021-03-09 14:01:49 +01:00
gbtozers	c2c05d8f17	[DebugInfo] Add tests for ISel handling of variadic debug values This patch adds a set of lit tests for ISel support of DIArgList/DBG_VALUE_LIST. Differential Revision: https://reviews.llvm.org/D88593	2021-03-09 12:42:15 +00:00
gbtozers	832251b284	[DebugInfo] Emit DBG_VALUE_LIST from ISel This patch completes ISel support for DIArgList dbg.values by allowing SDDbgValues with multiple location operands to be emitted as DBG_VALUE_LIST instructions. The primary change of this patch is refactoring EmitDbgValue by pulling location operand emission out to the new function AddDbgValueLocationOps, which is used for both DIArgList and single value dbg.values. Outside of that, the only behaviour change is that the scheduler has a lambda added, HasUnknownVReg, to prevent us from attempting to emit a DBG_VALUE_LIST before all of its used VRegs have become available. Differential Revision: https://reviews.llvm.org/D88592	2021-03-09 12:17:39 +00:00
Liu, Chen3	ce1573aa8c	[X86][NFC] Adding one flag to imply whether the instruction should check the predicate when compress EVEX instructions to VEX encoding. Some EVEX instructions should check the predicates when compress to VEX encoding. For example, avx512vnni instructions. This is because avx512vnni doesn't mean that avxvnni is supported on the target. This patch moving the manually added check to .inc that generated by tablegen. Differential Revision: https://reviews.llvm.org/D98011	2021-03-09 19:58:01 +08:00
Luo, Yuanke	d4f790f4bb	[X86][AMX] Add test case for combining AMX bitcast.	2021-03-09 19:48:01 +08:00
Florian Hahn	c05fa040af	[InstCombine] Simplify phis with incoming pointer-casts. If the incoming values of a phi are pointer casts of the same original value, replace the phi with a single cast. Such redundant phis are somewhat common after loop-rotate and removing them can avoid some unnecessary code bloat, e.g. because an iteration of a loop is peeled off to make the phi invariant. It should also simplify further analysis on its own. InstCombine already uses stripPointerCasts in a couple of places and also simplifies phis based on the incoming values, so the patch should fit in the existing scope. The patch causes binary changes in 47 out of 237 benchmarks in MultiSource/SPEC2000/SPEC2006 with -O3 -flto on X86. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D98058	2021-03-09 11:40:18 +00:00
Simon Pilgrim	7c45a0c6e6	M68kInstrInfo::AnalyzeBranchImpl - fix MSVC build. NFCI. MSVC couldn't resolve the decltype of a capture of a capture - but is happy with an auto.	2021-03-09 11:27:26 +00:00
John Brawn	87e909a8b2	[CodeGen] Report a normal instead of fatal error for label redefinition A symbol being redefined as a label is something that can happen as a result of ordinary input, so it shouldn't cause a fatal error. Also adjust the error message to match the one you get when a symbol is redefined as a variable. Differential Revision: https://reviews.llvm.org/D98181	2021-03-09 10:54:41 +00:00
Cullen Rhodes	6682076a17	[IR] Introduce llvm.experimental.vector.splice intrinsic This patch introduces a new intrinsic @llvm.experimental.vector.splice that constructs a vector of the same type as the two input vectors, based on a immediate where the sign of the immediate distinguishes two variants. A positive immediate specifies an index into the first vector and a negative immediate specifies the number of trailing elements to extract from the first vector. For example: @llvm.experimental.vector.splice(<A,B,C,D>, <E,F,G,H>, 1) ==> <B, C, D, E> ; index @llvm.experimental.vector.splice(<A,B,C,D>, <E,F,G,H>, -3) ==> <B, C, D, E> ; trailing element count These intrinsics support both fixed and scalable vectors, where the former is lowered to a shufflevector to maintain existing behaviour, although while marked as experimental the recommended way to express this operation for fixed-width vectors is to use shufflevector. For scalable vectors where it is not possible to express a shufflevector mask for this operation, a new ISD node has been implemented. This is one of the named shufflevector intrinsics proposed on the mailing-list in the RFC at [1]. Patch by Paul Walker and Cullen Rhodes. [1] https://lists.llvm.org/pipermail/llvm-dev/2020-November/146864.html Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D94708	2021-03-09 10:44:22 +00:00
Alex Orlov	ee05374523	Fix a crash in DWARFUnit::getInlinedChainForAddress in case of unexpected DWARF information. In some cases a broken or invalid debug info could cause a crash in DWARFUnit::getInlinedChainForAddress during parsing a chain of in-lined functions. This patch fixes this issue. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D98119	2021-03-09 14:20:27 +04:00
gbtozers	3c23fc350b	[DebugInfo] Handle dbg.values with multiple variable location operands in ISel This patch adds partial support in Instruction Selection for dbg.values that use a DIArgList. This patch does not add support for producing DBG_VALUE_LIST, but adds the logic for processing DIArgLists within the ISel pass. This change is largely focused on handleDebugValue and some of the functions that it calls. Outside of this, salvageDebugInfo and transferDbgValues have been modified to replace individual operands instead of the entire value; dangling debug info for variadic debug values is not currently supported (but may be added later). Differential Revision: https://reviews.llvm.org/D88589	2021-03-09 09:48:03 +00:00
Jan Kratochvil	00cd257373	llvm-dwarfdump: Fix DWARF-5 DW_FORM_implicit_const (used by GCC) Differential Revision: https://reviews.llvm.org/D98195	2021-03-09 09:26:58 +01:00
Jan Kratochvil	c751f84a2f	[nfc] llvm-dwarfdump: DWARFAbbreviationDeclaration::AttributeSpec -> DWARFAttribute `AttributeSpec` does not contain values while `DWARFAttribute` already does. Therefore one no longer needs to pass `uint64_t *OffsetPtr`. Differential Revision: https://reviews.llvm.org/D98194	2021-03-09 08:31:23 +01:00
Hongtao Yu	e6b2d9f770	[CSSPGO] Always use callsite samples as callsite probe counts. For CS profile, the callsite count of previously inlined callees is populated with the entry count of the callees. Therefore when trying to get a weight for calliste probe after inlinining, the callsite count should always be used. The same fix has already been made for non-probe case. Reviewed By: wenlei Differential Revision: https://reviews.llvm.org/D98094	2021-03-08 22:52:36 -08:00
Ta-Wei Tu	199be9ec1a	[CodeGenPrepare] Fix isIVIncrement (PR49466) In the NFC commit 8d835f42a57f15c0b9053bd7c41ea95821a40e5f, the check for `!L` is moved to a separate function `getIVIncrement` which, instead of using `BO->getParent()`, uses `PN->getParent()`. However, these two basic blocks are not necessarily the same. https://bugs.llvm.org/show_bug.cgi?id=49466 demonstrates a case where `PN` is contained in a loop while `BO` is not, causing the null-pointer dereference in `L->getLoopLatch()`. This patch checks whether both `BO` and `PN` belong to the same loop before entering `getIVIncrement`. Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D98144	2021-03-09 13:32:34 +08:00
Rafael Auler	2b6009cd7d	[DebugInfo] Do not error on unsupported CIE version D81469 introduced a check to error on CIE version different than 1 for eh_frame, but older compilers mistakenly create binaries with this version set to 3 for DWARF4 or 4 to DWARF5. Move the check to dump time instead of eh_frame parse time, so we can be tolerant with older binaries. Reviewed By: aprantl Differential Revision: https://reviews.llvm.org/D97830	2021-03-08 19:39:08 -08:00
ShihPo Hung	50d76ad4d1	[RISCV][MC] Fix nf encoding for vector ld/st whole register The three bit nf is one less than the number of NFIELDS, so we manually decrement 1 for VS1/2/4/8R & VL1/2/4/8R. Reviewed By: craig.topper Differential revision: https://reviews.llvm.org/D98185	2021-03-08 19:30:24 -08:00
Alexander Shaposhnikov	4904bec77b	[docs] Fix llvm-objcopy.rst Adjust the title underline, NFC.	2021-03-08 19:06:32 -08:00

1 2 3 4 5 ...

212386 Commits