llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	e8c62e4bcc	[X86][Atom] Fix vector fadd/fcmp/fmul resource/throughputs Match what's documented in the Intel AOM - fadd/fcmp all use Port1 and fmul uses Port0, but in many cases BOTH ports are required - this was being incorrectly modelled as EITHER port. Discovered while investigating the correct fptoui costs to fix the regressions in D101555. Now that we can use in-order models in llvm-mca, the atom model is a good "worst case scenario" analysis for x86.	2021-05-20 18:56:58 +01:00
Alex Orlov	5487f701b4	Add support for DWARF embedded source to llvm-symbolizer. This patch adds DWARF embedded source printout to llvm-symbolizer. Reviewed By: jhenderson, dblaikie Differential Revision: https://reviews.llvm.org/D102355	2021-05-20 21:40:28 +04:00
Stefan Pintilie	7cef6d55d4	[PowerPC] Add fix to partword atomic operations Partword atomic binaries are not zero extended as they should be. This patch fixes them to ensure that they are zero extended. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D102819	2021-05-20 12:36:37 -05:00
Fraser Cormack	a3089bc5eb	[RISCV] Ensure shuffle splat operands are type-legal The use of `SelectionDAG::getSplatValue` isn't guaranteed to return a type-legal splat value as it may implicitly extract a vector element from another shuffle. It is not permitted to introduce an illegal type when lowering shuffles. This patch addresses the crash by adding a boolean flag to `getSplatValue`, defaulting to false, which when set will ensure a type-legal return value. If it is unable to do that it will fail to return a splat value. I've been through the existing uses of `getSplatValue` in other targets and was unable to find a need or test cases showing a need to update their uses. In some cases, the call is made during `LegalizeVectorOps` which may still produce illegal scalar types. In other situations, the illegally-typed splat value may be quickly patched up to a legal type (such as any-extending the returned `extract_vector_elt` up to a legal type) before `LegalizeDAG` notices. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D102687	2021-05-20 18:00:03 +01:00
Wouter van Oortmerssen	1184fb03b6	[WebAssembly] Fix PIC/GOT codegen for wasm64 __table_base is now 64-bit, since in LLVM it represents a function pointer offset. __table_base32 is a copy in wasm32 for use in elem init expr, since no truncation may be used there. New reloc R_WASM_TABLE_INDEX_REL_SLEB64 added. Differential Revision: https://reviews.llvm.org/D101784	2021-05-20 09:59:31 -07:00
Steven Wu	0da7fd24df	[IR][AutoUpgrade] Drop alignment from non-pointer parameters and returns This is a follow-up of D102201. After some discussion, it is a better idea to upgrade all invalid uses of alignment attributes on function return values and parameters, not just limited to void function return types. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102726	2021-05-20 09:54:38 -07:00
Stephen Tozer	90c1db3c3c	[DebugInfo] Handle DIArgList in FastISel or GlobalISel Currently, variadic dbg.values (i.e. those using a DIArgList as part of their location) are not handled properly by FastISel or GlobalISel, and will produce invalid DBG_VALUE instructions if they encounter them. This patch fixes this issue by emitting undef DBG_VALUE instructions for variadic dbg.values, so that no incorrect instruction is produced and any prior variable location is terminated. This is simply a quick-fix to prevent errors; a correct implementation should come later for these ISel pipelines to ensure that we do not drop debug information unnecessarily. Differential Revision: https://reviews.llvm.org/D102500	2021-05-20 17:37:28 +01:00
Peter Waller	be488ff93c	[CodeGen][AArch64][SVE] Canonicalize intrinsic rdffr{ => _z} Follow up to D101357 / 3fa6510f6. Supersedes D102330. Goal: Use flags setting rdffrs instead of rdffr + ptest. Problem: RDFFR_P doesn't have a flags setting equivalent. Solution: in instcombine, canonicalize to RDFFR_PP at the IR level, and rely on RDFFR_PP+PTEST => RDFFRS_PP optimization in AArch64InstrInfo::optimizePTestInstr. While here: * Test that rdffr.z+ptest generates a rdffrs. * Use update_{test,llc}_checks.py on the tests. * Use sve attribute on functions. Differential Revision: https://reviews.llvm.org/D102623	2021-05-20 16:22:50 +00:00
Sanjay Patel	94e851fedf	[GlobalOpt] recompute alignments for loads and stores of updated globals GlobalOpt can slice structs/arrays and change GEPs in the process, but it was not updating alignments for load/store users. This eventually causes the crashing seen in: https://llvm.org/PR49661 https://llvm.org/PR50253 On x86, this required SLP+codegen to create an aligned vector store on an invalid address. The bugs would be easier to demonstrate on a target with stricter alignment requirements. I'm not sure if this is a complete solution. The alignment updating code is adapted from InstCombine, so I assume that part is tested and good. Differential Revision: https://reviews.llvm.org/D102552	2021-05-20 12:12:21 -04:00
Sanjay Patel	eeebdacf1b	[GlobalOpt] adjust test to show load problems; NFC Goes with D102552	2021-05-20 12:12:21 -04:00
Alexey Bataev	2a555e9e81	[SLP]Try to vectorize tiny trees with shuffled gathers of extractelements. If we gather extract elements and they actually are just shuffles, it might be profitable to vectorize them even if the tree is tiny. Differential Revision: https://reviews.llvm.org/D101460	2021-05-20 08:36:16 -07:00
Daniel Kiss	00169a8661	[ARM][AArch64] SLSHardening: make non-comdat thunks possible Linker scripts might not handle COMDAT sections. SLSHardening adds a new section for each __llvm_slsblr_thunk_xN. This new option allows the generation of the thunks into the normal text section to handle these exceptional cases. ,comdat or ,noncomdat can be added to harden-sls to control the codegen. -mharden-sls=[all|retbr|blr],nocomdat. Reviewed By: kristof.beyls Differential Revision: https://reviews.llvm.org/D100546	2021-05-20 17:07:05 +02:00
Djordje Todorovic	b69d892627	Revert "[Debugify][Original DI] Test dbg var loc preservation" This reverts commit 76f375f3d9d6902820ffc21200e454926748c678. This will be pushed again, after investigating a test failure: https://lab.llvm.org/buildbot/#/builders/16/builds/11254	2021-05-20 07:11:35 -07:00
Djordje Todorovic	8ece18da90	[Debugify][Original DI] Test dbg var loc preservation This is an improvement of [0]. This adds checking of original llvm.dbg.values()/declares() instructions in optimizations. We have picked a real issue that has been found with this (actually, picked one variable location missing from [1] and resolved the issue), and the result is the fix for that -- D100844. Before applying the D100844, using the options from [0] (but with this patch applied) on the compilation of GDB 7.11, the final HTML report for the debug-info issues can be found at [1] (please scroll down, and look for "Summary of Variable Location Bugs"). After applying the D100844, the numbers have improved a bit -- please take a look into [2]. [0] https://llvm.org/docs/HowToUpdateDebugInfo.html [1] https://djolertrk.github.io/di-check-before-adce-fix/ [2] https://djolertrk.github.io/di-check-after-adce-fix/ Differential Revision: https://reviews.llvm.org/D100845	2021-05-20 06:42:02 -07:00
Joerg Sonnenberger	10acf9ba43	[SPARCv9] allow stw as alias for st Strictly speaking, the architecture manual no longer uses the st mnemonic, but that's a much more intrusive change for little gain. Differential Revision: https://reviews.llvm.org/D96313	2021-05-20 15:27:36 +02:00
David Green	0d22518864	[ARM] Extra tests for MVE vhadd and vmulh. NFC	2021-05-20 14:13:39 +01:00
David Sherwood	f86b85d85e	[CodeGen] Add support for widening the result of EXTRACT_SUBVECTOR When trying to return a type such as <vscale x 1 x i32> from a function we crash in DAGTypeLegalizer::WidenVecRes_EXTRACT_SUBVECTOR when attempting to get the fixed number of elements in the vector. For the simple case we are dealing with, i.e. extracting <vscale x 1 x i32> from index 0 of input vector <vscale x 4 x i32> we can simply rely upon existing code that just returns the input. Differential Revision: https://reviews.llvm.org/D102605	2021-05-20 12:27:08 +01:00
Simon Pilgrim	145caddc0a	[CostModel][X86][AVX2] Improve 256-bit vector non-uniform shifts costs Haswell, Excavator and early Ryzen all have slower 256-bit non-uniform vector shifts (confirmed on AMDSoG/Agner/instlatx64 and llvm models) - so bump the worst case costs accordingly. Noticed while investigating PR50364	2021-05-20 12:16:16 +01:00
David Truby	80b1235577	[llvm][sve] Lowering for VLS MLOAD/MSTORE This adds custom lowering for the MLOAD and MSTORE ISD nodes when passed fixed length vectors in SVE. This is done by converting the vectors to VLA vectors and using the VLA code generation. Fixed length extending loads and truncating stores currently produce correct code, but do not use the built in extend/truncate in the load and store instructions. This will be fixed in a future patch. Differential Revision: https://reviews.llvm.org/D101834	2021-05-20 10:50:59 +00:00
Roman Lebedev	07d20f9eb2	[NFC][Coroutines] Autogenerate a few tests for ease of further updates	2021-05-20 13:37:44 +03:00
Sergey Dmitriev	7095312ff1	[llvm-strip] Add support for '--' for delimiting options from input files This allows llvm-strip to be used with file names that begin with dashes. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D102825	2021-05-20 03:33:51 -07:00
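The `--` convention described in the entry above can be sketched as follows. This is a minimal illustration of the general idea, not llvm-strip's actual option parser; `ParsedArgs` and `splitArgs` are hypothetical names introduced here.

```cpp
#include <string>
#include <vector>

// Hypothetical result type: arguments classified as options or input files.
struct ParsedArgs {
  std::vector<std::string> options;
  std::vector<std::string> inputs;
};

// Hypothetical helper: everything after the first "--" is treated as an
// input file, even if it begins with a dash.
ParsedArgs splitArgs(const std::vector<std::string> &args) {
  ParsedArgs out;
  bool onlyInputs = false;
  for (const std::string &arg : args) {
    if (!onlyInputs && arg == "--") {
      onlyInputs = true;  // the delimiter itself is consumed
      continue;
    }
    if (!onlyInputs && arg.size() > 1 && arg[0] == '-')
      out.options.push_back(arg);
    else
      out.inputs.push_back(arg);
  }
  return out;
}
```

With the arguments `--strip-debug -- -weird.o`, this sketch classifies `--strip-debug` as an option and `-weird.o` as an input file.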
David Green	eb198d6426	[AArch64] Add extra codegen tests. NFC This adds some extra codegen tests for abs and hadd, regenerating some of the existing tests with updated check lines.	2021-05-20 11:32:51 +01:00
LLVM GN Syncbot	75ce07ec7f	[gn build] Port 081c62501e4f	2021-05-20 10:17:56 +00:00
Alexey Lapshin	2bc33a6598	[llvm-objcopy] Refactor CopyConfig structure. This patch prepares llvm-objcopy to move its implementation into a separate library. To make it possible it is necessary to minimize internal dependencies. Differential Revision: https://reviews.llvm.org/D99055	2021-05-20 13:14:51 +03:00
Roman Lebedev	029935da14	[NFC][CHR] Autogenerate checklines in a few tests for ease of updates	2021-05-20 13:12:45 +03:00
Roman Lebedev	3605ab1b0c	[NFC][PruneEH] Autogenerate checklines in a few tests for ease of updates	2021-05-20 13:12:45 +03:00
Roman Lebedev	e8395509b1	[NFC][SimplifyCFG] Autogenerate checklines in a few tests for ease of updates	2021-05-20 13:12:44 +03:00
Simon Pilgrim	0e2845d424	[X86][AVX] Don't scrub pointer math in avx-vperm2x128.ll This will make it easier to track address offsets in folded loads/broadcasts of subvectors	2021-05-20 10:53:20 +01:00
Luke	7de9fb29f4	[RISCV] Add legality check for vectorizing reduction Check if it is legal to vectorize reduction. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D99509	2021-05-20 17:45:46 +08:00
David Sherwood	3bda651940	[CodeGen] Add support for widening INSERT_SUBVECTOR operands When attempting to return something like a <vscale x 1 x i32> type from a function we end up trying to widen the vector by inserting a <vscale x 1 x i32> subvector into an undefined <vscale x 4 x i32> vector. However, during legalisation we then attempt to widen the INSERT_SUBVECTOR operands and hit an error in WidenVectorOperand. This patch adds a new WidenVecOp_INSERT_SUBVECTOR function that currently only supports inserting subvectors into undefined vectors. Differential Revision: https://reviews.llvm.org/D102501	2021-05-20 10:37:03 +01:00
Heejin Ahn	05d69ddc38	[WebAssembly] Ignore filters in Emscripten EH landingpads We have been handling filters and landingpads incorrectly all along. We pass clauses' (catches') types to `__cxa_find_matching_catch` in JS glue code, which returns the thrown pointer and sets the selector using `setTempRet0()`. We apparently have been doing the same for filters' (exception specs') types; we pass them to `__cxa_find_matching_catch` just the same way as clauses. And `__cxa_find_matching_catch` treats all given types as clauses. So it is a little surprising; maybe we intended to do something from the JS side and didn't end up doing it? So anyway, I don't think supporting exception specs in Emscripten EH is a priority, but this can actually cause incorrect results for normal catches when functions are inlined and the inlined spec type has a parent-child relationship with the catch's type. --- The below is an example of a bug that can happen when inlining and class hierarchy are mixed. If you are busy you can skip this part: ``` struct A {}; struct B : A {}; void bar() throw (B) { throw B(); } void foo() { try { bar(); } catch (A &) { fputs ("Expected result\n", stdout); } } ``` In the unoptimized code, `bar`'s landingpad will have a filter for `B` and `foo`'s landingpad will have a clause for `A`. But when `bar` is inlined into `foo`, `foo`'s landingpad has both a filter for `B` and a clause for `A`, and it passes both types to `__cxa_find_matching_catch`: ``` __cxa_find_matching_catch(typeinfo for B, typeinfo for A) ``` `__cxa_find_matching_catch` thinks both are clauses, and looks at the first type `B`, which belongs to a filter. And the thrown type is `B`, so it thinks the first type `B` is caught. But this makes it return an incorrect selector, because it is supposed to catch the exception using the second type `A`, which is a parent of `B`. As a result, the `foo` in the example program above does not print "Expected result" but just throws the exception to the caller. 
(This wouldn't have happened if `A` and `B` were completely disjoint types, such as `float` and `int`) Fixes https://bugs.llvm.org/show_bug.cgi?id=50357. Reviewed By: dschuff, kripken Differential Revision: https://reviews.llvm.org/D102795	2021-05-20 01:28:16 -07:00
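The mismatch described in the entry above can be modelled with a small toy, assuming a two-class hierarchy where `B` derives from `A`. `canCatch` and `findMatchingCatch` are hypothetical stand-ins written for illustration; they are not the Emscripten JS glue or its real signature.

```cpp
#include <cstddef>
#include <string>
#include <vector>

// Toy hierarchy (assumption): B derives from A, so a thrown B matches A too.
bool canCatch(const std::string &thrown, const std::string &target) {
  if (thrown == target)
    return true;
  return thrown == "B" && target == "A";
}

// Hypothetical model of the lookup: every type passed in is treated as a
// catch clause, and the first type that matches is reported as the selector.
// Returns an empty string when nothing matches.
std::string findMatchingCatch(const std::string &thrown,
                              const std::vector<std::string> &types) {
  for (std::size_t i = 0; i < types.size(); ++i)
    if (canCatch(thrown, types[i]))
      return types[i];
  return "";
}
```

In this model, passing the filter type first, as in `findMatchingCatch("B", {"B", "A"})`, selects `B`, which belongs to the filter rather than to the `catch (A &)` clause. Passing only the clause type, `{"A"}`, selects `A`, which corresponds to the behaviour the commit title describes (ignoring filters).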
Caroline Concatto	85e935efe6	[CostModel][AArch64] Add missing costs for getShuffleCost with scalable vectors Differential Revision: https://reviews.llvm.org/D102490	2021-05-20 09:08:31 +01:00
serge-sans-paille	26806aa0f7	Force visibility of llvm::Any to external llvm::Any::TypeId::Id relies on the uniqueness of the address of a static variable defined in a template function. hidden visibility implies vague linkage for that variable, which does not guarantee the uniqueness of the address across a binary and a shared library. This totally breaks the implementation of llvm::Any. Ideally, setting visibility to llvm::Any::TypeId::Id should be enough, unfortunately this doesn't work as expected and we lack time (before 12.0.1 release) to understand why setting the visibility to llvm::Any does work. See https://gcc.gnu.org/wiki/Visibility and https://gcc.gnu.org/onlinedocs/gcc/Vague-Linkage.html for more information on that topic. Differential Revision: https://reviews.llvm.org/D101972	2021-05-20 10:06:00 +02:00
Andrew Savonichev	607f7c19c2	[AArch64] Combine vector shift instructions in SelectionDAG bswap.v2i16 + sitofp in LLVM IR generate a sequence of: - REV32 + USHR for bswap.v2i16 - SHL + SSHR + SCVTF for sext to v2i32 and scvt The shift instructions are excessive as noted in PR24820, and they can be optimized to just SSHR. Differential Revision: https://reviews.llvm.org/D102333	2021-05-20 10:50:13 +03:00
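The redundancy noted in the entry above can be checked with plain integer arithmetic. The sketch below assumes 32-bit lanes and a shift amount of 16 for all three shifts; these values are assumptions made for illustration and are not stated in the commit message. The helper names mirror the instruction mnemonics but are ordinary functions.

```cpp
#include <cstdint>

// Logical right shift and left shift on a 32-bit lane (n must be 1..31).
std::uint32_t ushr(std::uint32_t x, unsigned n) { return x >> n; }
std::uint32_t shl(std::uint32_t x, unsigned n) { return x << n; }

// Arithmetic right shift, returned as a bit pattern so the code does not
// depend on how >> treats negative signed values (n must be 1..31).
std::uint32_t sshr(std::uint32_t x, unsigned n) {
  std::uint32_t shifted = x >> n;
  if (x & 0x80000000u)  // sign bit set: fill the vacated high bits with ones
    shifted |= static_cast<std::uint32_t>(0xFFFFFFFFu << (32 - n));
  return shifted;
}

// The three-shift sequence and the single shift it can be reduced to.
std::uint32_t threeShifts(std::uint32_t x) {
  return sshr(shl(ushr(x, 16), 16), 16);
}
std::uint32_t singleShift(std::uint32_t x) { return sshr(x, 16); }
```

The first two shifts only clear the low 16 bits, which the final arithmetic shift discards anyway, so both functions return the same bit pattern for every input.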
Amara Emerson	e9e3784d95	[GlobalISel] Fix div+rem -> divrem combine causing use-def violation.	2021-05-19 23:13:41 -07:00
Simon Giesecke	aa649d36a0	Add option to llvm-gsymutil to read addresses from stdin. Differential Revision: https://reviews.llvm.org/D102224	2021-05-20 06:10:35 +00:00
Xiang1 Zhang	652257ec88	Revert "[HWASAN] Update the tag info for X86_64." This reverts commit 81c18ce03cd8199cc4f2c817e31b42a191a0fe7d.	2021-05-20 13:12:59 +08:00
Xiang1 Zhang	de91b3f2fe	[HWASAN] Update the tag info for X86_64. In LAM model X86_64 will use bits 57-62 (of 0-63) as HWASAN tag. So here we make sure the tag shift position and tag mask is correct for x86-64. Differential Revision: https://reviews.llvm.org/D102472	2021-05-20 11:22:12 +08:00
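A tag occupying bits 57-62 of a 64-bit pointer, as described in the entry above, implies a shift of 57 and a 6-bit mask. The following is a minimal sketch of that arithmetic only; `kTagShift`, `kTagMask`, `getTag` and `setTag` are hypothetical names and not the HWASAN runtime's API.

```cpp
#include <cstdint>

// Assumed layout from the commit message: the tag lives in bits 57..62.
constexpr unsigned kTagShift = 57;
constexpr std::uint64_t kTagMask = 0x3F;  // six bits: 57, 58, 59, 60, 61, 62

// Extract the tag from a pointer value.
std::uint64_t getTag(std::uint64_t ptr) {
  return (ptr >> kTagShift) & kTagMask;
}

// Replace the tag bits of a pointer value, leaving all other bits
// (including bit 63) untouched.
std::uint64_t setTag(std::uint64_t ptr, std::uint64_t tag) {
  return (ptr & ~(kTagMask << kTagShift)) | ((tag & kTagMask) << kTagShift);
}
```

For example, `setTag(0x1000, 0x2A)` yields a value whose tag reads back as `0x2A`, and setting the tag to 0 again restores `0x1000`.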
Sergey Dmitriev	5281788ba1	[llvm-objcopy] Update LIT test to resolve bot failure [NFC] Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D102823	2021-05-19 19:56:35 -07:00
Zhiwei Chen	8ecb4a2780	[sanitizer] Reduce redzone size for small size global objects Currently a 1-byte global object has a ridiculous 63-byte redzone. This patch reduces the redzone size to be less than 32 if the size of the global object is less than or equal to half of 32 (the minimal size of redzone). A 12-byte object has a 20-byte redzone; a 20-byte object has a 44-byte redzone. Reviewed By: MaskRay, #sanitizers, vitalybuka Differential Revision: https://reviews.llvm.org/D102469	2021-05-19 19:18:50 -07:00
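The sizes quoted in the entry above are consistent with the rule sketched below. This is a reconstruction from the numbers in the commit message, not the sanitizer pass's code: `redzoneSizeForGlobal` is a hypothetical name, and the branch for larger objects (one minimal redzone plus padding up to a multiple of 32) is an assumption chosen to reproduce the 20-byte and old 1-byte examples. The real pass may scale redzones further for large globals.

```cpp
#include <cstdint>

// Hypothetical sketch: redzone size in bytes for a global of `size` bytes,
// with a minimal redzone of `minRZ` bytes (32 in the commit message).
std::uint64_t redzoneSizeForGlobal(std::uint64_t size,
                                   std::uint64_t minRZ = 32) {
  // Small objects (at most half of minRZ): object plus redzone fill
  // exactly one minRZ-sized slot.
  if (size <= minRZ / 2)
    return minRZ - size;

  // Assumed behaviour otherwise: one minimal redzone, plus padding so that
  // object plus redzone is a multiple of minRZ.
  std::uint64_t rz = minRZ;
  if (size % minRZ != 0)
    rz += minRZ - (size % minRZ);
  return rz;
}
```

Under these assumptions a 1-byte object gets 31 bytes, a 12-byte object gets 20 bytes, and a 20-byte object gets 44 bytes, so object plus redzone is always a multiple of 32.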
Jon Roelofs	94fbaca3d2	Fix warnings in windows bots. NFC	2021-05-19 17:42:34 -07:00
LLVM GN Syncbot	077e051b22	[gn build] Port 4bf69fb52b3c	2021-05-19 22:27:27 +00:00
Ahmed Bougacha	50227f56b6	[docs] Describe reporting security issues on the chromium tracker. To track security issues, we're starting with the chromium bug tracker (using the llvm project there). We considered using Github Security Advisories. However, they are currently intended as a way for project owners to publicize their security advisories, and aren't well-suited to reporting issues. This also moves the issue-reporting paragraph to the beginning of the document, in part to make it more discoverable, in part to allow the anchor-linking to actually display the paragraph at the top of the page. Note that this doesn't update the concrete list of security-sensitive areas, which is still an open item. When we do, we may want to move the list of security-sensitive areas next to the issue-reporting paragraph as well, as it seems like relevant information needed in the reporting process. Finally, when describing the discussion medium, this splits the topics discussed into two: the concrete security issues, discussed in the issue tracker, and the logistics of the group, in our mailing list, as patches on public lists, and in the monthly sync-up call. While there, add a SECURITY.md page linking to the relevant paragraph. Differential Revision: https://reviews.llvm.org/D100873	2021-05-19 15:21:50 -07:00
Jon Roelofs	ad30e385ed	[Remarks] Add analysis remarks for memset/memcpy/memmove lengths Differential revision: https://reviews.llvm.org/D102452	2021-05-19 15:09:18 -07:00
Ryan Prichard	dc2b1a16fe	[MC][ARM] Reject Thumb "ror rX, #0 " The ROR instruction can only handle immediates between 1 and 31. The would-be encoding for ROR #0 is actually the RRX instruction. Reviewed By: nickdesaulniers Differential Revision: https://reviews.llvm.org/D102455	2021-05-19 15:05:39 -07:00
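The range rule stated in the entry above amounts to a simple bounds check. A minimal sketch follows; `isValidThumbRorImm` is a hypothetical name and not the function used by the ARM assembler.

```cpp
// Hypothetical check: a Thumb ROR immediate must lie in 1..31, because the
// would-be encoding for #0 is the RRX instruction.
bool isValidThumbRorImm(int imm) {
  return imm >= 1 && imm <= 31;
}
```

Under this rule, `ror rX, #0` is rejected, while `#1` and `#31` are accepted.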
Petr Hosek	2054565e98	[CMake] Don't LTO optimize targets that aren't part of any distribution When using distributions, targets that aren't included in any distribution don't need to be as optimized as targets that are included since those targets are typically only used for tests. We might consider avoiding LTO for these targets altogether, see https://lists.llvm.org/pipermail/llvm-dev/2021-April/149843.html Differential Revision: https://reviews.llvm.org/D102732	2021-05-19 15:02:11 -07:00
wlei	aa88690f46	[CSSPGO] Avoid deleting probe instruction in FoldValueComparisonIntoPredecessors This change fixes a place that was missing `moveAndDanglePseudoProbes`. FoldValueComparisonIntoPredecessors folds the BB into its predecessors and then marks the BB unreachable. However, the original logic from the BB is still alive, so deleting the probe would mislead the SampleLoader into marking it as a zero-count sample. Reviewed By: hoy, wenlei Differential Revision: https://reviews.llvm.org/D102721	2021-05-19 13:39:05 -07:00
Lang Hames	0e09eb0cf2	[ORC] Add a CPU getter to JITTargetMachineBuilder.	2021-05-19 13:31:25 -07:00
Arthur Eubanks	63f5e603f7	[OpaquePtr] Make atomicrmw work with opaque pointers FullTy is only necessary when we need to figure out what type an instruction works with given a pointer's pointee type. However, we just end up using the value operand's type, so FullTy isn't necessary. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102788	2021-05-19 12:49:28 -07:00
Arthur Eubanks	208107dd2c	[OpaquePtr] Make cmpxchg work with opaque pointers Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102745	2021-05-19 12:44:10 -07:00


216192 Commits