llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Duncan P. N. Exon Smith	a126d2972b	ADT: Add SFINAE to the generic IntrusiveRefCntPtr constructors Add an `enable_if` to the generic `IntrusiveRefCntPtr` constructors so that std::is_convertible gives an honest answer when the underlying pointers cannot be converted. Added `static_assert`s to the test suite to verify. Also combine generic constructors from `IntrusiveRefCntPtr<X>&&` and `const IntrusiveRefCntPtr<X>&`. At first glance this appears to be an infinite loop, but the real copy/move constructors are spelled out separately above. Added a unit test to verify. Differential Revision: https://reviews.llvm.org/D95498	2021-01-28 15:07:27 -08:00
Jessica Paquette	c1671731c6	Recommit "[GlobalISel] Walk through hints in getDefIgnoringCopies et al" Recommit of 4580acf6752ea3cc884657b5aa3e174bed86fc8c `Opc = DefMI->getOpcode()` was in the wrong place.	2021-01-28 14:43:00 -08:00
Jessica Paquette	2033979607	Revert "[GlobalISel] Walk through hints in getDefIgnoringCopies et al" This reverts commit 4580acf6752ea3cc884657b5aa3e174bed86fc8c. Reverting while looking into some test failures.	2021-01-28 14:37:57 -08:00
Jessica Paquette	fe6b2e148e	[GlobalISel] Walk through hints in getDefIgnoringCopies et al Treat hint instructions like G_ASSERT_ZEXT like COPY instructions in helpers which walk through copies. This ensures that instructions like G_ASSERT_ZEXT won't impact any optimizations that rely on these helpers. Differential Revision: https://reviews.llvm.org/D95577	2021-01-28 14:27:00 -08:00
Tony Tye	43c2e84b50	[NFC][AMDGPU] Correct name of DWARF CFA extensions Add LLVM to the DW_CFA_LLVM_def_aspace_cfa and DW_CFA_LLVM_def_aspace_cfa_sf DWARF extensions. Reviewed By: scott.linder Differential Revision: https://reviews.llvm.org/D95640	2021-01-28 22:25:33 +00:00
Roman Lebedev	9609a21100	[ScalarizeMaskedMemIntrin] Preserve Dominator Tree, if avaliable This de-pessimizes the arguably more usual case of no masked mem intrinsics, and gets rid of one more Dominator Tree recalculation. As per llvm/test/CodeGen/X86/opt-pipeline.ll, there's one more Dominator Tree recalculation left, we could get rid of.	2021-01-29 01:11:36 +03:00
Roman Lebedev	dbe994fc45	[PartiallyInlineLibCalls] Preserve Dominator Tree, if avaliable This doesn't get rid of any Dominator Tree recalculations just yet, there is one more pass to update..	2021-01-29 01:11:36 +03:00
Roman Lebedev	7af8fcd8c9	[NFC][ScalarizeMaskedMemIntrin] scalarizeMaskedCompressStore(): port to SplitBlockAndInsertIfThen() Makes Dominator Tree preservation in a followup patch somewhat easier.	2021-01-29 01:11:35 +03:00
Roman Lebedev	3898756b60	[NFC][ScalarizeMaskedMemIntrin] scalarizeMaskedExpandLoad(): port to SplitBlockAndInsertIfThen() Makes Dominator Tree preservation in a followup patch somewhat easier.	2021-01-29 01:11:35 +03:00
Roman Lebedev	97fcb6947f	[NFC][ScalarizeMaskedMemIntrin] scalarizeMaskedScatter(): port to SplitBlockAndInsertIfThen() Makes Dominator Tree preservation in a followup patch somewhat easier.	2021-01-29 01:11:35 +03:00
Roman Lebedev	32fe5cdb7c	[NFC][ScalarizeMaskedMemIntrin] scalarizeMaskedGather(): port to SplitBlockAndInsertIfThen() Makes Dominator Tree preservation in a followup patch somewhat easier.	2021-01-29 01:11:34 +03:00
Roman Lebedev	67f2dad29d	[NFC][ScalarizeMaskedMemIntrin] scalarizeMaskedStore(): port to SplitBlockAndInsertIfThen() Makes Dominator Tree preservation in a followup patch somewhat easier.	2021-01-29 01:11:34 +03:00
Roman Lebedev	f5faa0e4fe	[NFC][ScalarizeMaskedMemIntrin] scalarizeMaskedLoad(): port to SplitBlockAndInsertIfThen() Makes Dominator Tree preservation in a followup patch somewhat easier.	2021-01-29 01:11:34 +03:00
Roman Lebedev	27c02cf875	[NFC][PartiallyInlineLibCalls] Port to SplitBlockAndInsertIfThen() This makes follow-up patch for Dominator Tree preservation somewhat more straight-forward.	2021-01-29 01:11:33 +03:00
Roman Lebedev	2121393e6e	[NFC][EntryExitInstrumenter] Mark Dominator Tree as preserved in legacy-PM too This is correctly handled in new-PM wrappers, but not in old-PM.	2021-01-29 01:11:33 +03:00
Cassie Jones	e11c57fcf5	[GlobalISel] Implement widenScalar for carry-in add/sub These are widened to a wider UADDE/USUBE, with the overflow value unused, and with the same synthesis of a new overflow value as for the O operations. Reviewed By: paquette Differential Revision: https://reviews.llvm.org/D95326	2021-01-28 17:06:24 -05:00
Jessica Paquette	3fcc23c823	[GlobalISel] Add G_ASSERT_ZEXT This adds a generic opcode which communicates that a type has already been zero-extended from a narrower type. This is intended to be similar to AssertZext in SelectionDAG. For example, ``` %x_was_extended:_(s64) = G_ASSERT_ZEXT %x, 16 ``` Signifies that the top 48 bits of %x are known to be 0. This is useful in cases like this: ``` define i1 @zeroext_param(i8 zeroext %x) { %cmp = icmp ult i8 %x, -20 ret i1 %cmp } ``` In AArch64, `%x` must use a 32-bit register, which is then truncated to a 8-bit value. If we know that `%x` is already zero-ed out in the relevant high bits, we can avoid the truncate. Currently, in GISel, this looks like this: ``` _zeroext_param: and w8, w0, #0xff ; We don't actually need this! cmp w8, #236 cset w0, lo ret ``` While SDAG does not produce the truncation, since it knows that it's unnecessary: ``` _zeroext_param: cmp w0, #236 cset w0, lo ret ``` This patch - Adds G_ASSERT_ZEXT - Adds MIRBuilder support for it - Adds MachineVerifier support for it - Documents it It also puts G_ASSERT_ZEXT into its own class of "hint instruction." (There should be a G_ASSERT_SEXT in the future, maybe a G_ASSERT_ALIGN as well.) This allows us to skip over hints in the legalizer etc. These can then later be selected like COPY instructions or removed. Differential Revision: https://reviews.llvm.org/D95564	2021-01-28 13:58:37 -08:00
Greg Clayton	4dc036b075	Add the ability to extract the unwind rows from DWARF Call Frame Information. This patch adds the ability to evaluate the state machine for CIE and FDE unwind objects and produce a UnwindTable with all UnwindRow objects needed to unwind registers. It will also dump the UnwindTable for each CIE and FDE when dumping DWARF .debug_frame or .eh_frame sections in llvm-dwarfdump or llvm-objdump. This allows users to see what the unwind rows actually look like for a given CIE or FDE instead of just seeing a list of opcodes. This patch adds new classes: UnwindLocation, RegisterLocations, UnwindRow, and UnwindTable. UnwindLocation is a class that describes how to unwind a register or Call Frame Address (CFA). RegisterLocations is a class that tracks registers and their UnwindLocations. It gets populated when parsing the DWARF call frame instruction opcodes for a unwind row. The registers are mapped from their register numbers to the UnwindLocation in a map. UnwindRow contains the result of evaluating a row of DWARF call frame instructions for the CIE, or a row from a FDE. The CIE can produce a set of initial instructions that each FDE that points to that CIE will use as the seed for the state machine when parsing FDE opcodes. A UnwindRow for a CIE will not have a valid address, whille a UnwindRow for a FDE will have a valid address. The UnwindTable is a class that contains a sorted (by address) vector of UnwindRow objects and is the result of parsing all opcodes in a CIE, or FDE. Parsing a CIE should produce a UnwindTable with a single row. Parsing a FDE will produce a UnwindTable with one or more UnwindRow objects where all UnwindRow objects have valid addresses. The rows in the UnwindTable will be sorted from lowest Address to highest after parsing the state machine, or an error will be returned if the table isn't sorted. To parse a UnwindTable clients can use the following methods: static Expected<UnwindTable> UnwindTable::create(const CIE Cie); static Expected<UnwindTable> UnwindTable::create(const FDE Fde); A valid table will be returned if the DWARF call frame instruction opcodes have no encoding errors. There are a few things that can go wrong during the evaluation of the state machine and these create functions will catch and return them. Differential Revision: https://reviews.llvm.org/D89845	2021-01-28 13:39:17 -08:00
Reid Kleckner	365f97bc0d	Revert "[PDB] Defer relocating .debug$S until commit time and parallelize it" This reverts commit 1a9bd5b81328adf0dd5a8b4f3ad5949463e66da3. I suspect that this patch may have caused https://crbug.com/1171438.	2021-01-28 13:17:27 -08:00
Duncan P. N. Exon Smith	25a2ac921b	Support: Simplify __HAIKU__ #ifdef in llvm::sys::Wait, NFC This just reduces the amount of code in the `#ifndef` block as a follow-up to 5c1cea6f406366b85f3c200a1c48f713da4450ba.	2021-01-28 12:28:12 -08:00
Albion Fung	d6b6754937	[PowerPC][Power10] Fix XXSPLI32DX not correctly exploiting specific cases Some cases may be transformed into 32 bit splats before hitting the boolean statement, which may cause incorrect behaviour and provide XXSPLTI32DX with the incorrect values of splat. The condition was reversed so that the shortcut prevents this problem. Differential Revision: https://reviews.llvm.org/D95634	2021-01-28 15:17:32 -05:00
David Blaikie	85f9252ff2	Fix memory leak in 4318028cd2d7633a0cdeb0b5d4d2ed81fab87864	2021-01-28 12:08:23 -08:00
Thomas Lively	30175dd5d9	[WebAssembly] Prototype i8x16 to i32x4 widening instructions As proposed in https://github.com/WebAssembly/simd/pull/395 and matching the opcodes used in V8: https://chromium-review.googlesource.com/c/v8/v8/+/2617385/4/src/wasm/wasm-opcodes.h Differential Revision: https://reviews.llvm.org/D95557	2021-01-28 10:59:32 -08:00
Nico Weber	73c25457d6	[gn build] (manually) port 081c1db02dd2 more	2021-01-28 13:32:49 -05:00
Nico Weber	05a869f16d	[gn build] (manually) port 3b625060fc915	2021-01-28 13:26:37 -05:00
David Blaikie	266d73d7f0	DebugInfo: Add a DWARF FORM extension for addrx+offset references to reduce relocations This is an alternative to the use of complex DWARF expressions for addresses - shaving off a few extra bytes of expression overhead.	2021-01-28 10:20:02 -08:00
Wouter van Oortmerssen	026d2d3257	[WebAssembly] Fix Fast ISEL not lowering 64-bit function pointers Differential Revision: https://reviews.llvm.org/D95410	2021-01-28 10:05:29 -08:00
Nico Weber	35efce7942	[gn build] (semi-manually) port 081c1db02dd2	2021-01-28 13:05:10 -05:00
Jay Foad	02d55f0877	[AMDGPU] Simplify some RUN lines. NFC.	2021-01-28 17:57:55 +00:00
Adrian Prantl	432a842de0	Better document the limitations of coro::salvageDebugInfo() and fix a few edge cases that show up in the Swift compiler but weren't caught by the existing tests. Most notably the old code wasn't salvaging load operations correctly. The patch also gets rid of the LoadFromFramePtr argument and replaces it with a more generalized mechanism.	2021-01-28 09:53:19 -08:00
Fangrui Song	6e141a22c9	[llvm-nm] Display defined weak STT_GNU_IFUNC symbols as 'i' This patch makes the behavior match GNU nm. Note: undefined STT_GNU_IFUNC symbols use 'U'. Differential Revision: https://reviews.llvm.org/D95461	2021-01-28 09:46:05 -08:00
Craig Topper	1d3e28399b	[RISCV] Remove isel patterns for Zbs *W instructions. These instructions have been removed from the 0.94 bitmanip spec. We should focus on optimizing the codegen without using them. Reviewed By: asb Differential Revision: https://reviews.llvm.org/D95302	2021-01-28 09:33:56 -08:00
Craig Topper	dd352986b6	[RISCV] Add support for scalable vector fneg using vfsgnjn.vv Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D95568	2021-01-28 09:11:49 -08:00
Shaurya Gupta	e387e1d835	Revert "[DWARF] Create subprogram's DIE in DISubprogram's unit" This reverts commit ef0dcb506300dc9644e8000c6028d14214be9d97. This change is causing a lot of compiler crashes inside, sorry I don't have a small repro/stacktrace with symbols to share right now. Differential Revision: https://reviews.llvm.org/D95622	2021-01-28 16:39:01 +00:00
Simon Pilgrim	2fbc33f30f	[X86][AVX] combineHorizOpWithShuffle - fix valuetype comparison typo. Ensure we check the valuetypes of all the HOP(SHUFFLE(X,Y),SHUFFLE(X,Y)) shuffle input ops - there was a copy+paste typo (noticed by MSVC analyzer) that meant we were checking the same input from one of the shuffles twice. I haven't been able to create a test case for this yet - I don't think its currently possible to create a target/faux binary shuffle that scales to a 2x128 shuffle mask from two different value types.	2021-01-28 16:36:23 +00:00
Simon Pilgrim	0cbd016ac6	[APFloat] Remove orphan ilogb(DoubleAPFloat) declaration. NFCI.	2021-01-28 15:18:25 +00:00
Simon Pilgrim	984ce3745a	[APFloat] scalbn - pass DoubleAPFloat arg as const-ref. NFCI. Avoid unnecessary copy and fix clang-tidy warning.	2021-01-28 15:18:24 +00:00
Tobias Burnus	bbff278e57	[MC][ELF] Fix accepting abbreviated form with sh_flags and sh_entsize Followup to D92052 as I missed an issue as shown via GCC bug https://gcc.gnu.org/PR97827, namely: (e.g.) ".rodata." implies ELF::SHF_ALLOC. Crossref: - D73999 / commit 75af9da755721123e62b45cd0bc0c5e688a9722a added for LLVM 11 a check that sh_flags and sh_entsize (and sh_type) changes are an error, in line with GNU assembler. - D92052 / commit 1deff4009e0ae661b03682901bf6932297ce7ea1 permitted the abbreviated form which many assemblers accept and GCC generates: while the first .section contains the flags and entsize, subsequent sections simply contain the name without repeating entsize or flags. However, the latter patch missed in the check that some flags are automatically set, e.g. '.rodata." implies ELF::SHF_ALLOC. Related https://bugs.llvm.org/show_bug.cgi?id=48201 Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D94072	2021-01-28 14:54:43 +00:00
Simon Pilgrim	c21eeac3de	[X86] Add extload test cases from D95086 I've also added vselect variants of the vector cases	2021-01-28 13:29:01 +00:00
Stefan Gränitz	2f98bedc10	[Orc] Remove unused header from TPC server The header would include OrcJIT headers in OrcTargetProcess, which is not desired. All common declarations should be in OrcShared. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D95606	2021-01-28 14:16:49 +01:00
David Green	78cb69d190	[ARM] Add alignment checks for MVE VLDn The MVE VLD2/4 and VST2/4 instructions require the pointer to be aligned to at least the size of the element type. This adds a check for that into the ARM lowerInterleavedStore and lowerInterleavedLoad functions, not creating the intrinsics if they are invalid for the alignment of the load/store. Unfortunately this is one of those bug fixes that does effect some useful codegen, as we were able to sometimes do some nice lowering of q15 types. But they can cause problem with low aligned pointers. Differential Revision: https://reviews.llvm.org/D95319	2021-01-28 13:10:08 +00:00
Bradley Smith	2fb0bfaa8e	[AArch64][SVE] Allow accesses to SVE stack objects to use frame pointer The layout of the stack frame for SVE means that using the frame pointer rather than the stack pointer for an access to an SVE stack object removes the need for an additional add to jump over the non-SVE objects. Likewise the opposite is true for non-SVE stack objects. This patch allows for the former to be done by having HasFP return true in the presence of both SVE and non-SVE stack objects, and also fixes a minor issue whereby the later would not be done for certain offsets.	2021-01-28 12:39:57 +00:00
Simon Pilgrim	5fc2b65dfd	AMDGPUPrintfRuntimeBinding - don't dereference a dyn_cast<> pointer. NFCI. We dereference the dyn_cast<> in all paths - use cast<> to silence the clang static analyzer warning.	2021-01-28 12:38:44 +00:00
Simon Pilgrim	3d244dcde9	[X86][AVX] canonicalizeLaneShuffleWithRepeatedOps - don't merge VPERMILPD ops with different low/high masks. Unlike VPERMILPS, VPERMILPD can have non-repeating masks in each 128-bit subvector, we weren't accounting for this when folding vperm2f128(vpermilpd(x,c),vpermilpd(y,c)) -> vpermilpd(vperm2f128(x,y),c). I'm intending to add support for this but wanted to get a minimal fix in first for merging into 12.xx. Fixes PR48908	2021-01-28 12:11:31 +00:00
Simon Pilgrim	80a9852618	[X86][AVX] Add PR48908 shuffle test case	2021-01-28 11:21:36 +00:00
Simon Pilgrim	a8f6482d0c	Fix "32-bit shift result used in 64-bit comparison" MSVC warning. NFCI.	2021-01-28 11:21:36 +00:00
Simon Pilgrim	32b4d25b48	[Support] Add some missing namespace closure comments. NFCI. Fixes some clang-tidy warnings.	2021-01-28 11:21:35 +00:00
Simon Pilgrim	5bf02200da	[DebugInfo] Remove some unused includes. NFCI. Mainly removing a lot of <vector> includes from files that don't explicitly use std::vector	2021-01-28 11:21:35 +00:00
Roman Lebedev	5cad690af5	[CodeGen][DwarfEHPrepare] Preserve Dominator Tree Now that D94827 has flipped the switch, and SimplifyCFG is officially marked as production-ready regarding Dominator Tree preservation, we can update this user pass to also preserve Dominator Tree. This is a geomean compile-time win of `-0.05%`..`-0.08%`. https://llvm-compile-time-tracker.com/compare.php?from=51a25846c198cff00abad0936f975167357afa6f&to=082499aac236a5c141e50a9e77870d5be2de5f0b&stat=instructions Differential Revision: https://reviews.llvm.org/D95548	2021-01-28 14:11:34 +03:00
Roman Lebedev	9051eaeeb1	[SimplifyCFG] If provided, preserve Dominator Tree SimplifyCFG is an utility pass, and the fact that it does not preserve DomTree's, forces it's users to somehow workaround that, likely by not preserving DomTrees's themselves. Indeed, simplifycfg pass didn't know how to preserve dominator tree, it took me just under a month (starting with e1133179587dd895962a2fe4d6eb0cb1e63b5ee2) do rectify that, now it fully knows how to, there's likely some problems with that still, but i've dealt with everything i can spot so far. I think we now can flip the switch. Note that this is functionally an NFC change, since this doesn't change the users to pass in the DomTree, that is a separate question. Reviewed By: kuhar, nikic Differential Revision: https://reviews.llvm.org/D94827	2021-01-28 14:11:34 +03:00

1 2 3 4 5 ...

210434 Commits