llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Sam Elliott	83b5e8407b	[RISCV] Implement Hooks to avoid chaining SELECT Summary: This implements two hooks that attempt to avoid control flow for RISC-V. RISC-V will lower SELECTs into control flow, which is not a great idea. The hook `hasMultipleConditionRegisters()` turns off the following DAGCombiner folds: select(C0\|C1, x, y) <=> select(C0, x, select(C1, x, y)) select(C0&C1, x, y) <=> select(C0, select(C1, x, y), y) The second hook `setJumpIsExpensive` controls a flag that has a similar purpose and is used in CodeGenPrepare and the SelectionDAGBuilder. Both of these have the effect of ensuring more logic is done before fewer jumps. Note: with the `B` extension, we may be able to lower select into a conditional move instruction, so at some point these hooks will need to be guarded based on enabled extensions. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D79268	2020-07-01 11:56:31 +01:00
Sam Elliott	6259181599	[RISCV][NFC] Add Test for (select (or B1, B2), X, Y) Summary: As shown, LLVM is keen to avoid logic and introduce selects (in DAGCombiner, and other places). This leads to control flow on RISC-V which we should attempt to avoid. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D79267	2020-07-01 11:56:11 +01:00
Vitaly Buka	8595706ad4	[StackSafety,NFC] Remove expensive assert Differential Revision: https://reviews.llvm.org/D80908	2020-07-01 02:54:27 -07:00
Vitaly Buka	fbc02282a0	[StackSafety,NFC] Remove unneded constexpr Differential Revision: https://reviews.llvm.org/D80908	2020-07-01 02:54:27 -07:00
Petar Avramovic	06dda0986c	AMDGPU/GlobalISel: Select init_exec intrinsic Change imm with timm in pattern for SI_INIT_EXEC_LO and remove regbank mappings for non register operands. Differential Revision: https://reviews.llvm.org/D82885	2020-07-01 11:50:59 +02:00
Kerry McLaughlin	11891a6394	[AArch64][SVE] Add reg+imm addressing mode for unpredicated loads Reviewers: efriedma, sdesmalen, david-arm Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82893	2020-07-01 10:33:56 +01:00
Georgii Rymar	b18d8b7481	[llvm-readobj] - Don't crash when checking the number of dynamic symbols. When we deriving the number of symbols from the DT_HASH table, we can crash when calculate the number of symbols in the symbol table when SHT_DYNSYM has sh_entsize == 0. The patch fixes the issue. Differential revision: https://reviews.llvm.org/D82877	2020-07-01 12:14:10 +03:00
Georgii Rymar	aa2b4ebf4e	[llvm-readobj] - Simplify and refine hash table tests Now we are able to have default values for macros in YAML descriptions. I've applied it for hash table tests and also fixed few copy-paste issues in their comments. Differential revision: https://reviews.llvm.org/D82870	2020-07-01 12:09:08 +03:00
David Stenberg	a98c027f78	[DebugInfo] Do not emit entry values for composite locations Summary: This is a fix for PR45009. When working on D67492 I made DwarfExpression emit a single DW_OP_entry_value operation covering the whole composite location description that is produced if a register does not have a valid DWARF number, and is instead composed of multiple register pieces. Looking closer at the standard, this appears to not be valid DWARF. A DW_OP_entry_value operation's block can only be a DWARF expression or a register location description, so it appears to not be valid for it to hold a composite location description like that. See DWARFv5 sec. 2.5.1.7: "The DW_OP_entry_value operation pushes the value that the described location held upon entering the current subprogram. It has two operands: an unsigned LEB128 length, followed by a block containing a DWARF expression or a register location description (see Section 2.6.1.1.3 on page 39)." Here is a dwarf-discuss mail thread regarding this: http://lists.dwarfstd.org/pipermail/dwarf-discuss-dwarfstd.org/2020-March/004610.html There was not a strong consensus reached there, but people seem to lean towards that operations specified under 2.6 (e.g. DW_OP_piece) may not be part of a DWARF expression, and thus the DW_OP_entry_value operation can't contain those. Perhaps we instead want to emit a entry value operation per each DW_OP_reg* operation, e.g.: - DW_OP_entry_value(DW_OP_regx sub_reg0), DW_OP_stack_value, DW_OP_piece 8, - DW_OP_entry_value(DW_OP_regx sub_reg1), DW_OP_stack_value, DW_OP_piece 8, [...] The question then becomes how the call site should look; should a composite location description be emitted there, and we then leave it up to the debugger to match those two composite location descriptions? Another alternative could be to emit a call site parameter entry for each sub-register, but firstly I'm unsure if that is even valid DWARF, and secondly it seems like that would complicate the collection of call site values quite a bit. As far as I can tell GCC does not emit any entry values / call sites in these cases, so we do not have something to compare with, but the former seems like the more reasonable approach. Currently when trying to emit a call site entry for a parameter composed of multiple DWARF registers a (DwarfRegs.size() == 1) assert is triggered in addMachineRegExpression(). Until the call site representation is figured out, and until there is use for these entry values in practice, this commit simply stops the invalid DWARF from being emitted. Reviewers: djtodoro, vsk, aprantl Reviewed By: djtodoro, vsk Subscribers: jyknight, hiraditya, fedor.sergeev, jrtc27, llvm-commits Tags: #debug-info, #llvm Differential Revision: https://reviews.llvm.org/D75270	2020-07-01 10:50:55 +02:00
Guillaume Chatelet	930fdcd491	[Alignment][NFC] Migrate MachineFrameInfo::CreateSpillStackObject to Align iThis patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Differential Revision: https://reviews.llvm.org/D82934	2020-07-01 08:49:28 +00:00
Sam Parker	dd087b6361	[NFC][ARM] Add test.	2020-07-01 09:28:56 +01:00
Tony	5d6f19018d	[AMDGPU] Correct AMDGPUUsage.rst DW_AT_LLVM_lane_pc example - Correct typo of DW_OP_xaddr to DW_OP_addrx in AMDGPUUsage.rst for DW_AT_LLVM_lane_pc example. Change-Id: I1b0ee2b24362a0240388e4c2f044c1d4883509b9	2020-07-01 08:23:15 +00:00
Paul Walker	411d35313e	[SVE] Relax merge requirement for IR based divides. We currently lower SDIV to SDIV_MERGE_OP1. This forces the value for inactive lanes in a way that can hamper register allocation, however, the lowering has no requirement for inactive lanes. Instead this patch replaces SDIV_MERGE_OP1 with SDIV_PRED thus freeing the register allocator. Once done the only user of SDIV_MERGE_OP1 is intrinsic lowering so I've removed the node and perform ISel on the intrinsic directly. This also allows us to implement MOVPRFX based zeroing in the same manner as SUB. This patch also renames UDIV_MERGE_OP1 and [F]ADD_MERGE_OP1 for the same reason but in the ADD cases the ISel code is already as required. Differential Revision: https://reviews.llvm.org/D82783	2020-07-01 08:18:42 +00:00
LLVM GN Syncbot	d958e10e27	[gn build] Port f12cd99c440	2020-07-01 08:09:43 +00:00
Hans Wennborg	d489f3e1d7	[ThinLTO] Always parse module level inline asm with At&t dialect (PR46503) clang-cl passes -x86-asm-syntax=intel to the cc1 invocation so that assembly listings produced by the /FA flag are printed in Intel dialect. That flag however should not affect the parsing of inline assembly in the program. (See r322652) When compiling normally, AsmPrinter::emitInlineAsm is used for assembling and defaults to At&t dialect. However, when compiling for ThinLTO, the code which parses module level inline asm to find symbols for the symbol table was failing to set the dialect. This patch fixes that. (See the bug for more details.) Differential revision: https://reviews.llvm.org/D82862	2020-07-01 09:43:45 +02:00
Saiyedul Islam	00ecd50fa3	[AMDGPU] Spill more than wavesize CSR SGPRs In case of more than wavesize CSR SGPR spills, lanes of reserved VGPR were getting overwritten due to wrap around. Reserve a VGPR (when NumVGPRSpillLanes = 0, WaveSize, 2*WaveSize, ..) and when one of the two conditions is true: 1. One reserved VGPR being tracked by VGPRReservedForSGPRSpill is not yet reserved. 2. All spill lanes of reserved VGPR(s) are full and another spill lane is required. Reviewed By: arsenm, kerbowa Differential Revision: https://reviews.llvm.org/D82463	2020-07-01 07:40:47 +00:00
Sam Parker	e85afea7aa	[ARM][LowOverheadLoops] Handle reductions While validating live-out values, record instructions that look like a reduction. This will comprise of a vector op (for now only vadd), a vorr (vmov) which store the previous value of vadd and then a vpsel in the exit block which is predicated upon a vctp. This vctp will combine the last two iterations using the vmov and vadd into a vector which can then be consumed by a vaddv. Once we have determined that it's safe to perform tail-predication, we need to change this sequence of instructions so that the predication doesn't produce incorrect code. This involves changing the register allocation of the vadd so it updates itself and the predication on the final iteration will not update the falsely predicated lanes. This mimics what the vmov, vctp and vpsel do and so we then don't need any of those instructions. Differential Revision: https://reviews.llvm.org/D75533	2020-07-01 08:31:49 +01:00
serge-sans-paille	da98c44f41	Correctly track GCOVProfiling IR update Differential Revision: https://reviews.llvm.org/D82742	2020-07-01 09:30:59 +02:00
Guillaume Chatelet	c69f9aa1aa	[Alignment][NFC] Migrate MachineFrameInfo::CreateStackObject to Align This patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Differential Revision: https://reviews.llvm.org/D82894	2020-07-01 07:28:11 +00:00
LLVM GN Syncbot	ad0d6e5982	[gn build] Port 4da65c2920b	2020-07-01 07:02:27 +00:00
Adam Balogh	360c750d4b	[Hexagon][NFC] Remove redundant condition Condition `secondReg` is checked both in an outer and in an inner `if` statement in static function `canCompareBeNewValueJump()` in file `HexagonNewValueJump.cpp`. This patch removes the redundant inner check. The issue was found using `clang-tidy` check under review `misc-redundant-condition`. See https://reviews.llvm.org/D81272. Differential Revision: https://reviews.llvm.org/D82556	2020-07-01 09:04:26 +02:00
Adam Balogh	5155bd8dbb	[AMDGPU][NFC] Remove redundant condition Condition `LiteralCount` is checked both in an outer and in an inner `if` statement in `SIInstrInfo::verifyInstruction()`. This patch removes the redundant inner check. The issue was found using `clang-tidy` check under review `misc-redundant-condition`. See https://reviews.llvm.org/D81272. Differential Revision: https://reviews.llvm.org/D82555	2020-07-01 09:04:25 +02:00
Yonghong Song	446d919c93	[BPF] Fix a bug for __builtin_preserve_field_info() processing Andrii discovered a problem where a simple case similar to below will generate wrong relocation kind: enum { FIELD_EXISTENCE = 2, }; struct s1 { int a1; }; int test() { struct s1 *v = 0; return __builtin_preserve_field_info(v[0], FIELD_EXISTENCE); } The expected relocation kind should be FIELD_EXISTENCE, but recorded reloc kind in the final object file is FIELD_BYTE_OFFSET, which is incorrect. This exposed a bug in generating access strings from intrinsics. The current access string generation has two steps: step 1: find the base struct/union type, step 2: traverse members in the base type. The current implementation relies on at lease one member access in step 2 to get the correct relocation kind, which is true in typical cases. But if there is no member accesses, the current implementation falls to the default info kind FIELD_BYTE_OFFSET. This is incorrect, we should still record the reloc kind based on the user input. This patch fixed this issue by properly recording the reloc kind in such cases. Differential Revision: https://reviews.llvm.org/D82932	2020-06-30 23:45:37 -07:00
JF Bastien	e8c89cc968	[NFC] fix diagnostic It's pretty silly to diagnose on a scalar copy but the build does that: loop variable 'SibReg' of type 'const llvm::Register' creates a copy from type 'const llvm::Register' [-Wrange-loop-analysis]	2020-06-30 21:49:01 -07:00
JF Bastien	775ce4c84c	Fix diagnostic for missing virtual dtor Introduced in D82673.	2020-06-30 21:32:05 -07:00
Arthur Eubanks	5addf7faf9	[NewPM] Add explicit init value to -enable-new-pm So it's easier to test with it on by default. Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D82922	2020-06-30 18:40:01 -07:00
Xing GUO	25e66a23af	[DWARFYAML][test] Make the checker stricter. NFC. Currently, DWARFYAML doesn't emit the 0 byte for terminating the abbrev table for the given compilation unit. Before addressing this issue, we have to make the test stricter.	2020-07-01 09:41:29 +08:00
Douglas Yung	be86a0eaf9	Fixup BDVER1 and ZNVER1 definitions that were accidentally changed in recent refactor. - BDVER1 - Duplicate FeatureLZCNT removed - ZNVER1 - Duplicate FeatureLZCNT removed - Removed unsupported FeatureLWP - Swapped FeatureMMX and FeatureMOVBE to be in alphabetical order	2020-06-30 18:15:33 -07:00
Shoaib Meenai	7634d66acc	[llvm-install-name-tool] Tighten some path checks Just having --implicit-check-not=/usr breaks when the LLVM checkout path contains '/usr', since llvm-objdump prints out the path to the input file in the first line. Tighten the checks by adding the 'name' prefix that's used when printing load command payloads. An alternative would be to redirect the input file into llvm-objdump, in which case it prints out 'a.out' as the file name, but I'm not sure how reliable that behavior is.	2020-06-30 17:20:00 -07:00
Arthur Eubanks	8856ff1234	[Docs][BasicAA] Rename some more basicaa -> basic-aa Follow up to https://reviews.llvm.org/D82607.	2020-06-30 17:03:45 -07:00
Nico Weber	f40f77469b	[gn build] (semi-manually) port ce6153a5282	2020-06-30 19:30:13 -04:00
Matt Arsenault	58a9482211	GlobalISel: Disallow undef generic virtual register uses With an undef operand, it's possible for getVRegDef to fail and return null. This is an edge case very little code bothered to consider. Proper gMIR should use G_IMPLICIT_DEF instead. I initially tried to apply this restriction to all SSA MIR, so then getVRegDef would never fail anywhere. However, ProcessImplicitDefs does technically run while the function is in SSA. ProcessImplicitDefs and DetectDeadLanes would need to either move, or a new pseudo-SSA type of function property would need to be introduced.	2020-06-30 19:18:01 -04:00
Matt Arsenault	f7dc4642fe	AMDGPU/GlobalISel: Remove some selection tests which should be invalid These use undef generic virtual register operands, which should be rejected by the verifier.	2020-06-30 19:18:01 -04:00
Arthur Eubanks	67bdd9bf52	Fix wrong title underline length	2020-06-30 16:02:45 -07:00
Reid Kleckner	4c5eb8f368	[gn build] Update build for new OpenMP tablegen logic Ports 1a70077b5a64189d9c04d1a2d7ea6ff0e49744d6 to gn from cmake.	2020-06-30 16:02:05 -07:00
Hendrik Greving	2be3dfa59d	[ModuloSchedule] Make PeelingModuloScheduleExpander inheritable. Basically a NFC, but allows subclasses access to the entire PeelingModuloScheduleExpander class. We are doing this to allow backends, particularly one that are not necessarily upstreamed, to inherit from PeelingModuloScheduleExpander and access its basic structures. Renames Info into LoopInfo for consistency in PeelingModuloScheduleExpander. Differential Revision: https://reviews.llvm.org/D82673	2020-06-30 15:56:13 -07:00
Arthur Eubanks	097a2b8949	[Docs][BasicAA] Rename -basicaa to -basic-aa in docs Follow up to https://reviews.llvm.org/D82607.	2020-06-30 15:54:28 -07:00
Kit Barton	ff26590410	[PPC][NFC] Replace TM with Subtarget->getTargetMachine() in preparation for GlobalISel. There are two uses of TM (instance of TargetMachine) when checking options. These will not work once we enable GlobalISel. This patch replaces those uses of TM with Subtarget->getTargetMachine().	2020-06-30 17:19:24 -05:00
Alex Lorenz	c22c46f4ce	split darwin-version-min-load-command.s into Arm64 subtest to avoid failures Some buildbot configurations don't build the arm64 backend, so the test-cases that need arm64 should go into the aarch64 subdirectory.	2020-06-30 14:49:04 -07:00
Craig Topper	3205bd7427	[X86] Use some preprocessor macros to reduce the very similar repeated code in getVPTESTMOpc. NFCI This function picks X86 opcode name based on type, masking, and whether not a load or broadcast has been folded using multiple switch statements. The contents of the switches mostly just vary in a few characters in the instruction name. So use some macros to build the instruction names to reduce the repetiveness.	2020-06-30 14:38:22 -07:00
Amy Kwan	d10d816f3a	[PowerPC][Power10] Add Vector Splat Imm/Permute/Blend/Shift Double Bit Imm Definitions and MC Tests This patch adds the td definitions and asm/disasm tests for the following instructions: XXSPLTIW XXSPLTIDP XXSPLTI32DX XXPERMX XXBLENDVB XXBLENDVH XXBLENDVW XXBLENDVD VSLDBI VSRDBI Differential Revision: https://reviews.llvm.org/D82896	2020-06-30 16:07:21 -05:00
Eli Friedman	24a885401b	[SVE] Reject vector struct indexes for scalable vectors. It's messy to pattern-match, and completely unnecessary: scalar indexes work equally well. See also discussion on D81620 and D82061. Differential Revision: https://reviews.llvm.org/D82430	2020-06-30 13:52:38 -07:00
Eli Friedman	ccb803369c	[BitcodeReader] Fix DelayedShuffle handling for ConstantExpr shuffles. The indexing was messed up, so the result was completely broken. Shuffle constant exprs are rare in practice; without vscale types, constant folding generally elminates them. So sort of hard to trip over. Fixes regression from D72467. (Recommitting after fix for memory leak.) Differential Revision: https://reviews.llvm.org/D80330	2020-06-30 13:23:07 -07:00
Matt Arsenault	d907e0ced3	Sparc: Use Register	2020-06-30 16:14:23 -04:00
Matt Arsenault	6494b099e2	RISCV: Don't store function in RISCVMachineFunctionInfo Targets should not depend on the MachineFunction state during the MachineFunctionInfo construction.	2020-06-30 16:08:51 -04:00
Matt Arsenault	8215de3621	PPC: Don't store function in PPCFunctionInfo Continue migrating targets from depending on the MachineFunction during the initial construction.	2020-06-30 16:08:51 -04:00
Matt Arsenault	1d8040150a	Mips: Don't store MachineFunction in MipsFunctionInfo It will soon be disallowed to depend on MachineFunction state on construction.	2020-06-30 16:08:51 -04:00
Eli Friedman	7b42c225db	[IR] Delete llvm::Constants using the correct type. In most cases, this doesn't have much impact: the destructors just call the base class destructor anyway. A few subclasses of ConstantExpr actually store non-trivial data, though. Make sure we clean up appropriately. This is sort of ugly, but I don't see a good alternative given the constraints. Issue found by asan buildbots running the testcase for D80330. Differential Revision: https://reviews.llvm.org/D82509	2020-06-30 12:37:53 -07:00
Florian Hahn	ef10a35a6b	[AArch64] Add getCFInstrCost, treat branches as free for throughput. D79164/2596da31740f changed getCFInstrCost to return 1 per default. AArch64 did not have its own implementation, hence the throughput cost of CFI instructions is overestimated. On most cores, most branches should be predicated and essentially free throughput wise. This restores a 9% performance regression on a SPEC2006 benchmark on AArch64 with -O3 LTO & PGO. This patch effectively restores pre 2596da31740f behavior for AArch64 and undoes the AArch64 test changes of the patch. Reviewers: samparker, dmgreen, anemet Reviewed By: samparker Differential Revision: https://reviews.llvm.org/D82755	2020-06-30 20:34:04 +01:00
Craig Topper	9600883c24	[X86] Move frontend CPU feature initialization to a look up table based implementation. NFCI This replaces the switch statement implementation in the clang's X86.cpp with a lookup table in X86TargetParser.cpp. I've used constexpr and copy of the FeatureBitset from SubtargetFeature.h to store the features in a lookup table. After the lookup the bitset is translated into strings for use by the rest of the frontend code. I had to modify the implementation of the FeatureBitset to avoid bugs in gcc 5.5 constexpr handling. It seems to not like the same array entry to be used on the left side and right hand side of an assignment or &= or \|=. I've also used uint32_t instead of uint64_t and sized based on the X86::CPU_FEATURE_MAX. I've initialized the features for different CPUs outside of the table so that we can express inheritance in an adhoc way. This was one of the big limitations of the switch and we had resorted to labels and gotos. Differential Revision: https://reviews.llvm.org/D82731	2020-06-30 12:04:58 -07:00

1 2 3 4 5 ...

199454 Commits