llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-18 10:32:48 +02:00

Author	SHA1	Message	Date
Simon Giesecke	f35a5b956e	Add --quiet option to llvm-gsymutil to suppress output of warnings. Differential Revision: https://reviews.llvm.org/D102829	2021-05-27 12:36:34 +00:00
Mats Petersson	ffafbe5131	Revert "[OpenMP]Add support for workshare loop modifier in lowering" This reverts commit ea4c5fb04c6d9618d451fb2d2c360dc95c6d9131.	2021-05-27 13:09:47 +01:00
Mats Petersson	ae07366301	[OpenMP]Add support for workshare loop modifier in lowering When lowering the dynamic, guided, auto and runtime types of scheduling, there is an optional monotonic or non-monotonic modifier. This patch adds support in the OMP IR Builder to pass this down to the runtime functions. Also implements tests for the variants. Differential Revision: https://reviews.llvm.org/D102008	2021-05-27 12:28:27 +01:00
David Green	fd59827ea3	[ARM] Extra test for reverted WLS memset. NFC	2021-05-27 12:20:19 +01:00
Benjamin Kramer	bcc28eb207	Add triples to a bunch of x86-specific tests that currently fail on PPC	2021-05-27 12:32:04 +02:00
James Henderson	5c474e1dad	[lit][test] Improve testing of use_llvm_tool Reviewed by: MaskRay Differential Revision: https://reviews.llvm.org/D103154	2021-05-27 11:25:43 +01:00
Florian Hahn	20c74f6c60	[Matrix] Include matrix pipeline for new PM in new-pm-defaults.ll. -enable-matrix just adds a single pass, so it's easier to just check in new-pm-default.ll rather than duplicating the full checks for -O3 with the new pass manager. Suggested post-commit by @aeubanks.	2021-05-27 10:57:39 +01:00
Fraser Cormack	ab12d83795	[SelectionDAG][RISCV] Don't unroll 0/1-type bool VSELECTs This patch extends the cases in which the legalizer is able to express VSELECT in terms of XOR/AND/OR. When dealing with a VSELECT between boolean vector types, the mask itself is an all-ones or all-ones value of the operand type, so a 0/1 boolean type behaves identically to a 0/-1 type. This greatly helps RISC-V which relies on expansion for these nodes. It also allows scalable-vector bool VSELECTs to use the default expansion, where before it would crash in SelectionDAG::UnrollVectorOp. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D103147	2021-05-27 10:08:57 +01:00
Sebastian Neubauer	79ec3080a2	[AMDGPU][GlobalISel] Allow amdgpu_gfx calling conv Calling functions from shaders already works with the SelectionDAG. Differential Revision: https://reviews.llvm.org/D103183	2021-05-27 10:41:40 +02:00
Max Kazantsev	fb6b06d635	[NFCI][LoopDeletion] Do not call complex analysis for known non-zero BTC	2021-05-27 15:29:37 +07:00
Max Kazantsev	8529f1644f	[NFC] Reuse existing variables instead of re-requesting successors	2021-05-27 15:29:37 +07:00
Amara Emerson	d5383816bc	[GlobalISel] Implement splitting of G_SHUFFLE_VECTOR. Thhis is a port from the DAG legalization. We're still missing some of the canonicalizations of shuffles but it's a start. Differential Revision: https://reviews.llvm.org/D102828	2021-05-27 00:28:38 -07:00
Fangrui Song	06613cf382	[docs] llvm-objdump: Mention -M no-aliases is supported on AArch64	2021-05-26 23:57:32 -07:00
Max Kazantsev	b201cc808f	[NFCI] Lazily evaluate SCEVs of PHIs Eager evaluation has cost of compile time. Only query them if they are required for proving predicates.	2021-05-27 13:35:31 +07:00
Max Kazantsev	fe711d6dd1	[NFC] Formatting fix	2021-05-27 12:50:54 +07:00
Max Kazantsev	bb1a1653ea	[NFCI][LoopDeletion] Only query SCEV about loop successor if another successor is also in loop	2021-05-27 12:44:22 +07:00
Esme-Yi	54468e15fa	[llvm-objdump] Print the DEBUG type under `--section-headers`. Summary: Under the option --section-headers, we can only print the section types of TEXT, DATA, and BSS for now. This patch adds the DEBUG type. Reviewed By: jhenderson, Higuoxing Differential Revision: https://reviews.llvm.org/D102603	2021-05-27 04:53:14 +00:00
LLVM GN Syncbot	75be5c89d2	[gn build] Port 857fa7b7b187	2021-05-27 04:42:56 +00:00
LLVM GN Syncbot	878593c3e2	[gn build] Port 0dc7fd1bc167	2021-05-27 04:42:55 +00:00
Hasyimi Bahrudin	aa98e6ea8a	Fix non-global-value-max-name-size not considered by LLParser `non-global-value-max-name-size` is used by `Value` to cap the length of local value name. However, this flag is not considered by `LLParser`, which leads to unexpected `use of undefined value error`. The fix is to move the responsibility of capping the length to `ValueSymbolTable`. The test is the one provided by [[ https://bugs.llvm.org/show_bug.cgi?id=45899 \| Mikael in the bug report ]]. Reviewed By: mehdi_amini Differential Revision: https://reviews.llvm.org/D102707	2021-05-27 04:20:03 +00:00
Yevgeny Rouban	0a4cf978a7	[RS4GC] Introduce intrinsics to get base ptr and offset There can be a need for some optimizations to get (base, offset) for any GC pointer. The base can be calculated by generating needed instructions as it is done by the RewriteStatepointsForGC::findBasePointer() function. The offset can be calculated in the same way. Though to not expose the base calculation and to make the offset calculation as simple as ptrtoint(derived_ptr) - ptrtoint(base_ptr), which is illegal outside RS4GC, this patch introduces 2 intrinsics: @llvm.experimental.gc.get.pointer.base(%derived_ptr) @llvm.experimental.gc.get.pointer.offset(%derived_ptr) These intrinsics are inlined by RS4GC along with generation of statepoint sequences. With these new intrinsics the GC parseable lowering for atomic memcpy intrinsics (6ec2c5e402a724ba99bce82a9cac7a3006d660f4) could be implemented as a separate pass. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D100445	2021-05-27 09:14:14 +07:00
Jessica Paquette	044ed9b7c9	Fix unit test after 324af79dbc6066 Needed to add in an extra parameter to calls to `libcall`.	2021-05-26 17:50:53 -07:00
Jessica Paquette	d821abe3ce	[GlobalISel] Don't emit lost debug location remarks when legalizing tail calls There were a bunch of lost debug location remarks that show up when legalizing tail calls on AArch64. This would happen because we drop the return in the block where we emit the tail call. So, we end up dropping the debug location, which makes the LostDebugLocObserver report a missing debug location. Although it's true that we lose these debug locations, this isn't a particularly useful remark. We expect to drop these debug locations when emitting tail calls. Suppressing remarks in this case is preferable, since the amount of noise could hide actual debug location related bugs. To do this, I just plumbed the LostDebugLocObserver through the relevant LegalizerHelper functions. This is the only case I can think of where we need the LostDebugLocObserver in the LegalizerHelper. So, rather than storing it in the LegalizerHelper proper and mucking around with the constructors, I figured it'd be cleanest to take the simplest path for now. This clears up ~20 noisy lost debug location remarks on CTMark in AArch64 at -Os. Differential Revision: https://reviews.llvm.org/D103128	2021-05-26 17:16:11 -07:00
Sriraman Tallam	40b2a440c5	Emit correct location lists with basic block sections. This patch addresses multiple things: 1) It ensures that const_value is emitted when possible with basic block sections. 2) It emits location lists such that the labels are always within the section boundary. 3) It fixes a bug when the parameter is first used in a non-entry block which is in a different section from the entry block. Differential Revision: https://reviews.llvm.org/D85085	2021-05-26 17:12:31 -07:00
Amara Emerson	5e0b929619	[AArch64][GlobalISel] Legalize non-power-of-2 vector elements for G_STORE. The rules were already there, it just needed re-ordering so the odd case didn't bail out too early.	2021-05-26 17:01:02 -07:00
Krzysztof Parzyszek	6b5a937ccd	[Hexagon] Restore handling of expanding shuffles Fixed bugs, added testcases. The byte-unpack is actually recognized by the DAG combiner, but the halfword-unpack it not.	2021-05-26 18:04:15 -05:00
Philip Reames	5149991497	[tests] Add some basic coverage of multiple exit unrolling	2021-05-26 15:51:26 -07:00
naromero77	863822a0e1	[flang][docs] Initial documentation for the Fortran LLVM Test Suite. Describes how to run the Fortran LLVM Test Suite, specifically the external SPEC CPU 2017 Fortran tests. Reviewed By: rovka Differential Revision: https://reviews.llvm.org/D102877	2021-05-26 15:59:55 -05:00
Fangrui Song	df086a1bfa	[AArch64] Support llvm-mc/llvm-objdump -M no-aliases This enables the no-aliases forms of many instructions. Depends on D103004 Reviewed By: tmatheson Differential Revision: https://reviews.llvm.org/D103005	2021-05-26 13:35:31 -07:00
Rahman Lavaee	b75ea85ad5	[llvm-readobj] Optimize printing stack sizes to linear time. Currently, each function name lookup is a linear iteration over all symbols defined in the object file which makes the total running time quadratic. This patch optimizes the function name lookup by populating an address to index map upon the first function name lookup which is used to lookup each function name in O(1). impact: For the clang binary built with `-fstack-size-section`, this improves the running time of `llvm-readobj --stack-size` from 7 minutes to 0.25 seconds. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D103072	2021-05-26 13:14:33 -07:00
Craig Topper	fae63c4e1a	[RISCV] Use X0 as destination of inserted vsetvli when possible. We aren't going to connect the result to anything so we might as well avoid allocating a register. Reviewed By: frasercrmck, HsiangKai Differential Revision: https://reviews.llvm.org/D102031	2021-05-26 13:08:51 -07:00
Jessica Clarke	15661d2c61	[RISCV][NFC] Fix some whitespace nits in MC test RUN lines	2021-05-26 21:03:18 +01:00
Jacob Hegna	324ccfabc2	Update documentation for InlineModel features. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D103193	2021-05-26 12:52:28 -07:00
Sjoerd Meijer	dd5fcc39b3	[CostModel][AArch64] Add floating point arithmetic tests. NFC.	2021-05-26 20:26:20 +01:00
Jeremy Morse	2a2a79e361	[DebugInstrRef][1/3] Track PHI values through register allocation This patch introduces "DBG_PHI" instructions, a marker of where a PHI instruction used to be, before PHI elimination. Under the instruction referencing model, we want to know where every value in the function is defined -- and a PHI, even if implicit, is such a place. Just like instruction numbers, we can use this to identify a value to be used as a variable value, but we don't need to know what instruction defines that value, for example: bb1: DBG_PHI $rax, 1 [... more insts ... ] bb2: DBG_INSTR_REF 1, 0, !1234, !DIExpression() This specifies that on entry to bb1, whatever value is in $rax is known as value number one -- and the later DBG_INSTR_REF marks the position where variable !1234 should take on value number one. PHI locations are stored in MachineFunction for the duration of the regalloc phase in the DebugPHIPositions map. The map is populated by PHIElimination, and then flushed back into the instruction stream by virtregrewriter. A small amount of maintenence is needed in LiveDebugVariables to account for registers being split, but only for individual positions, not for entire ranges of blocks. Differential Revision: https://reviews.llvm.org/D86812	2021-05-26 20:24:00 +01:00
Roman Lebedev	07003c78e4	[NFC][Codegen][X86] Add a few more interleaved load/store patterns w/ i16 element type Matching the costmodel coverage. We want them both because they simplify coming up with the patterns to check their cost, and to track their codegen. Tests for loads can be fully autogenerated: https://godbolt.org/z/o1fncqo9n For stores, however, i have done that semi-manually: https://godbolt.org/z/KPzTnvsh1	2021-05-26 21:55:38 +03:00
Roman Lebedev	aded4a78a5	[NFC][X86][Costmodel] Add some more interleaved load/store test with i16 element type Not sure if even larger interleaving factors are needed, but these are what i have seen being queried in the wild.	2021-05-26 21:55:37 +03:00
Philip Reames	9d48c2760c	[SCEV] Compute trip multiple for multiple exit loops This patch implements getSmallConstantTripMultiple(L) correctly for multiple exit loops. The previous implementation was both imprecise, and violated the specified behavior of the method. This was fine in practice, because it turns out the function was both dead in real code, and not tested for the multiple exit case. Differential Revision: https://reviews.llvm.org/D103189	2021-05-26 11:52:25 -07:00
Heejin Ahn	3f66a78716	[WebAssembly] Add TargetInstrInfo::getCalleeOperand DwarfDebug unconditionally assumes for all call instructions the 0th operand is the callee operand, which seems to be true for other targets, but not for WebAssembly. This adds `TargetInstrInfo::getCallOperand` method whose default implementation returns `getOperand(0)` and makes WebAssembly overrides it to use its own utility method to get the callee operand. This also fixes an existing bug in `WebAssembly::getCalleeOp`, which was uncovered by this CL. Reviewed By: dschuff, djtodoro Differential Revision: https://reviews.llvm.org/D102978	2021-05-26 11:43:59 -07:00
Heejin Ahn	9027ee31ca	[SimplifyCFG] Use make_early_inc_range() while deleting instructions We are deleting `phi` nodes within the for loop, so this makes sure we increment the iterator before we delete the instruction pointed by the iterator. This started to break in `a0be081646`. Reviewed By: dschuff, lebedev.ri Differential Revision: https://reviews.llvm.org/D103181	2021-05-26 11:43:11 -07:00
Stanislav Mekhanoshin	e9a316e231	[AMDGPU] Fix kernel LDS lowering for constants There is a trivial but severe bug in the recent code collecting LDS globals used by kernel. It aborts scan on the first constant without scanning further uses. That leads to LDS overallocation with multiple kernels in certain cases. Differential Revision: https://reviews.llvm.org/D103190	2021-05-26 11:34:50 -07:00
Dmitry Preobrazhensky	f584ef9bf9	[AMDGPU][MC][GFX90A] Corrected DS_GWS opcodes Corrected DS_GWS opcodes to use even aligned registers. Differential Revision: https://reviews.llvm.org/D103185	2021-05-26 21:31:50 +03:00
Philip Reames	948729e467	[SCEV] Generalize getSmallConstantTripCount(L) for multiple exit loops This came up in review for another patch, see https://reviews.llvm.org/D102982#2782407 for full context. I've reviewed the callers to make sure they can handle multiple exit loops w/non-zero returns. There's two cases in target cost models where results might change (Hexagon and PowerPC), but the results looked legal and reasonable. If a target maintainer wishes to back out the effect of the costing change, they should explicitly check for multiple exit loops and handle them as desired. Differential Revision: https://reviews.llvm.org/D103182	2021-05-26 11:18:25 -07:00
Fangrui Song	c2d4999f01	[llvm-mc] Add -M to replace -riscv-no-aliases and -riscv-arch-reg-names In objdump, many targets support `-M no-aliases`. Instead of having a `-*-no-aliases` for each target when LLVM adds the support, it makes more sense to introduce objdump style `-M`. -riscv-arch-reg-names is removed. -riscv-no-aliases has too many uses and thus is retained for now. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D103004	2021-05-26 10:43:32 -07:00
Philip Reames	98697f161f	[SCEV] Add a utility for converting from "exit count" to "trip count" (Mostly as a logical place to put a comment since this is a reoccuring confusion.)	2021-05-26 10:41:49 -07:00
Craig Topper	6982e1d1c2	[RISCV] Optimize SEW=64 shifts by splat on RV32. SEW=64 shifts only uses the log2(64) bits of shift amount. If we're splatting a 64 bit value in 2 parts, we can avoid splatting the upper bits and just let the low bits be sign extended. They won't be read anyway. For the purposes of SelectionDAG semantics of the generic ISD opcodes, if hi was non-zero or bit 31 of the low is 1, the shift was already undefined so it should be ok to replace high with sign extend of low. In order do be able to find the split i64 value before it becomes a stack operation, I added a new ISD opcode that will be expanded to the stack spill in PreprocessISelDAG. This new node is conceptually similar to BuildPairF64, but I expanded earlier so that we could go through regular isel to get the right VLSE opcode for the LMUL. BuildPairF64 is expanded in a CustomInserter. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D102521	2021-05-26 10:23:32 -07:00
Philip Reames	14a02691cf	[SCEV] Extract out a helper for computing trip multiples	2021-05-26 10:15:03 -07:00
Jessica Clarke	647345a9f3	[RISCV] Remove --riscv-no-aliases from RVV tests This serves no useful purpose other than to clutter things up. Diff summary as the real diff is extremely unwieldy: 24844 -; CHECK-NEXT: jalr zero, 0(ra) 24844 +; CHECK-NEXT: ret 8 -; CHECK-NEXT: vl4re8.v v28, (a0) 8 +; CHECK-NEXT: vl4r.v v28, (a0) 64 -; CHECK-NEXT: vl8re8.v v24, (a0) 64 +; CHECK-NEXT: vl8r.v v24, (a0) 392 -; RUN: --riscv-no-aliases < %s \| FileCheck %s 392 +; RUN: < %s \| FileCheck %s 1 -; RUN: -verify-machineinstrs --riscv-no-aliases < %s \ 1 +; RUN: -verify-machineinstrs < %s \ As discussed in D103004.	2021-05-26 17:59:38 +01:00
Craig Topper	763ff65b77	[RISCV] Don't propagate VL/VTYPE across inline assembly in the Insert VSETVLI pass. It's conceivable someone could put a vsetvli in inline assembly so its safer to consider them as barriers. The alternative would be to trust that the user marks VL and VTYPE registers as clobbers of the inline assembly if they do that, but hat seems error prone. I'm assuming inline assembly in vector code is going to be rare. Reviewed By: frasercrmck, HsiangKai Differential Revision: https://reviews.llvm.org/D103126	2021-05-26 09:56:20 -07:00
Alexey Bataev	63d0ad489a	[SLP]Fix vectorization of insertelements with multiple uses. SLP vectorizer should not consider in sertelements with multiple uses as a part of high level build vector, it must be considered as a terminating insertelement in the vector build, otherwise it may produce incorrect code. Differential Revision: https://reviews.llvm.org/D103164	2021-05-26 09:42:18 -07:00

1 2 3 4 5 ...

216439 Commits