llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Simon Tatham	acb5034d24	[ARM,MVE] Add ACLE intrinsics for VCVT[ANPM] family. Summary: These instructions convert a vector of floats to a vector of integers of the same size, with assorted non-default rounding modes. Implemented in IR as target-specific intrinsics, because as far as I can see there are no matches for that functionality in the standard IR intrinsics list. Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75255	2020-03-02 10:33:30 +00:00
Simon Tatham	86da1af72a	[ARM,MVE] Add ACLE intrinsics for VCVT.F32.F16 family. Summary: These instructions make a vector of `<4 x float>` by widening every other lane of a vector of `<8 x half>`. I wondered about representing these using standard IR, along the lines of a shufflevector to extract elements of the input into a `<4 x half>` followed by an `fpext` to turn that into `<4 x float>`. But it looks as if that would take a lot of work in isel lowering to make it match any pattern I could sensibly write in Tablegen, and also I haven't been able to think of any other case where that pattern might be generated in IR, so there wouldn't be any extra code generation win from doing it that way. Therefore, I've just used another target-specific intrinsic. We can always change it to the other way later if anyone thinks of a good reason. (In order to put the intrinsic definition near similar things in `IntrinsicsARM.td`, I've also lifted the definition of the `MVEMXPredicated` multiclass higher up the file, without changing it.) Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75254	2020-03-02 10:33:30 +00:00
Simon Tatham	37f1154506	[ARM,MVE] Correct MC operands in VCVT.F32.F16. (NFC) Summary: The two MVE instructions that convert between v4f32 and v8f16 were implemented as instances of the same class, with the same MC operand list. But that's not really appropriate, because the narrowing conversion only partially overwrites its output register (it only has 4 f16 values to write into a vector of 8), so even when unpredicated, it needs a $Qd_src input, a constraint tying that to the $Qd output, and a vpred_n. The widening conversion is better represented like any other instruction that completely replaces its output when unpredicated: it should have no $Qd_src operand, and instead, a vpred_r containing a $inactive parameter. That's a better match to other similar instructions, such as its integer analogue, the VMOVL instruction that makes a v4i32 by sign- or zero-extending every other lane of a v8i16. This commit brings the widening VCVT.F32.F16 into line with the other instructions that behave like it. That means you can write isel patterns that use it unpredicated, without having to add a pointless undefined $QdSrc operand. No existing code generation uses that instruction yet, so there should be no functional change from this fix. Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75253	2020-03-02 10:33:30 +00:00
Simon Tatham	605803fc34	[ARM,MVE] Add ACLE intrinsics for VQMOV[U]N family. Summary: These instructions work like VMOVN (narrowing a vector of wide values to half size, and overwriting every other lane of an output register with the result), except that the narrowing conversion is saturating. They come in three signedness flavours: signed to signed, unsigned to unsigned, and signed to unsigned. All are represented in IR by a target-specific intrinsic that takes two separate 'unsigned' flags. Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75252	2020-03-02 10:33:30 +00:00
Pavel Labath	b424382a42	[DWARF] Use DWARFDataExtractor::getInitialLength to parse debug_names Summary: In this patch I've done a slightly bigger rewrite to also remove the hardcoded header lengths. Reviewers: jhenderson, dblaikie, ikudrin Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75119	2020-03-02 11:21:23 +01:00
Pavel Labath	16b078b185	[DWARF] Use getInitialLength in range list parsing Summary: This could be considered obvious, but I am putting it up to illustrate the usefulness/impact of the getInitialLength change. Reviewers: dblaikie, jhenderson, ikudrin Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75117	2020-03-02 11:18:12 +01:00
Pavel Labath	b7a812f072	[DWARFDebugLine] Use new DWARFDataExtractor::getInitialLength Summary: The error messages change somewhat, but I believe the overall informational value remains unchanged. Reviewers: jhenderson, dblaikie, ikudrin Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75116	2020-03-02 11:14:29 +01:00
serge-sans-paille	cbaf8eb24e	Fix Base64Test - for StringRef size Original failures: http://lab.llvm.org:8011/builders/clang-with-lto-ubuntu/builds/15975/steps/test-stage1-compiler/logs/stdio	2020-03-02 11:13:30 +01:00
Anna Welker	4327ec59c8	[ARM][MVE] Restrict allowed types of gather/scatter offsets The MVE gather instructions smaller than 32bits zext extend the values in the offset register, as opposed to sign extending them. We need to make sure that the code that we select from is suitably extended, which this patch attempts to fix by tightening up the offset checks. Differential Revision: https://reviews.llvm.org/D75361	2020-03-02 10:04:12 +00:00
Kang Zhang	8f356f681e	[NFC][PowerPC] Move some alias definition from PPCInstrInfo.td to PPCInstr64Bit.td Summary: Some 64-bit instructions alias definition is in PPCInstrInfo.td, it should be moved to PPCInstr64Bit.td.	2020-03-02 09:54:15 +00:00
LLVM GN Syncbot	135a140d05	[gn build] Port 5a1958f2673	2020-03-02 09:02:51 +00:00
serge-sans-paille	9d5f964841	Syndicate, test and fix base64 implementation Move Base64 implementation from clangd/SemanticHighlighting to llvm/Support/Base64, fix its implementation and provide a decent test suite. Previous implementation code was using + operator instead of \| to combine some results, which is a problem when shifting signed values. (0xFF << 16) is implicitly converted to a (signed) int, and thus results in 0xffff0000, which is negative. Combining negative numbers with a + in that context is not what we want to do. This fixes https://github.com/llvm/llvm-project/issues/149. Differential Revision: https://reviews.llvm.org/D75057	2020-03-02 10:02:25 +01:00
Hans Wennborg	e0fb9eaf01	Revert "[DebugInfo][clang][DWARF5]: Added support for debuginfo generation for defaulted parameters" The Bitcode/DITemplateParameter-5.0.ll test is failing: FAIL: LLVM :: Bitcode/DITemplateParameter-5.0.ll (5894 of 36324) ****************** TEST 'LLVM :: Bitcode/DITemplateParameter-5.0.ll' FAILED ****************** Script: -- : 'RUN: at line 1'; /usr/local/google/home/thakis/src/llvm-project/out/gn/bin/llvm-dis -o - /usr/local/google/home/thakis/src/llvm-project/llvm/test/Bitcode/DITemplateParameter-5.0.ll.bc \| /usr/local/google/home/thakis/src/llvm-project/out/gn/bin/FileCheck /usr/local/google/home/thakis/src/llvm-project/llvm/test/Bitcode/DITemplateParameter-5.0.ll -- Exit Code: 2 Command Output (stderr): -- It looks like the Bitcode/DITemplateParameter-5.0.ll.bc file was never checked in. This reverts commit c2b437d53d40b6dc5603c97f527398f477d9c5f1.	2020-03-02 09:30:52 +01:00
Awanish Pandey	0c03c0d36d	[DebugInfo][clang][DWARF5]: Added support for debuginfo generation for defaulted parameters in C++ templates. Summary: This patch adds support for debuginfo generation for defaulted parameters in clang and also extends corresponding DebugMetadata/IR to support this feature. Reviewers: probinson, aprantl, dblaikie Reviewed By: aprantl, dblaikie Differential Revision: https://reviews.llvm.org/D73462	2020-03-02 12:33:05 +05:30
Fangrui Song	346697cf46	[PowerPC][test] Improve .got2 and .toc tests There is no .got2 test for powerpc32. There is no comdat variable test for powerpc{32,64}.	2020-03-01 22:41:25 -08:00
Serguei Katkov	27ac9a2711	[InlineSpiller] Relax re-materialization restriction for statepoint We should be careful to allow count of re-materialization of operands to be less then number of physical registers. STATEPOINT instruction has a variable number of operands and potentially very big. So re-materialization for all operands is disabled at the moment if restrict-statepoint-remat is true. The patch relaxes the re-materialization restriction for STATEPOINT instruction allowing it for fixed operands. Specifically it is about call target. Reviewers: reames Reviewed By: reames Subscribers: llvm-commits, qcolombet, hiraditya Differential Revision: https://reviews.llvm.org/D75335	2020-03-02 11:25:44 +07:00
Jim Lin	3d6c422d4a	[Sparc] Fix incorrect operand for matching CMPri pattern Summary: It should be normal constant instead of target constant. Pattern CMPri can be matched if the constant can be fitted into immediate field. Otherwise, pattern CMPrr will be matched. This fixed bug https://bugs.llvm.org/show_bug.cgi?id=44091. Reviewers: dcederman, jyknight Reviewed By: jyknight Subscribers: jonpa, hiraditya, fedor.sergeev, jrtc27, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75227	2020-03-02 11:36:32 +08:00
Craig Topper	43dd0cfe6e	[DAGCombiner][X86] Disable narrowExtractedVectorLoad if the element type size isn't byte sized The address calculation for the offset assumes that you can calculate the offset by multiplying the index by the store size of the element. But that only works if the element's store size is exactly its real size since we store vectors tightly packed in memory. There are improvements we could make to this like special casing extracting element 0. I think we could also handle cases where the extracted VT is byte sized and the index is aligned with the extract element count. Differential Revision: https://reviews.llvm.org/D75377	2020-03-01 18:13:25 -08:00
Shengchen Kan	0a44228bff	[X86] Not track size of the boudaryalign fragment during the layout Summary: Currently the boundaryalign fragment caches its size during the process of layout and then it is relaxed and update the size in each iteration. This behaviour is unnecessary and ugly. Reviewers: annita.zhang, reames, MaskRay, craig.topper, LuoYuanke, jyknight Reviewed By: MaskRay Subscribers: hiraditya, dexonsmith, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75404	2020-03-02 09:32:30 +08:00
Craig Topper	e5f11d246e	[X86][TwoAddressInstructionPass] Teach tryInstructionCommute to continue checking for commutable FMA operands in more cases. Previously we would only check for another commutable operand if the first commute was an aggressive commute. But if we have two kill operands and neither is tied to the def at the start, we should consider both operands as the one to use as the new def. This improves the loop in the fma-commute-loop.ll test. This test is derived from a post from discourse here https://llvm.discourse.group/t/unnecessary-vmovapd-instructions-generated-can-you-hint-in-favor-of-vfmadd231pd/582 Differential Revision: https://reviews.llvm.org/D75016	2020-03-01 16:38:08 -08:00
Craig Topper	af775d56a6	[DAGCombiner] Don't emit select_cc from visitSINT_TO_FP/visitUINT_TO_FP. Use plain select instead. Select_cc isn't used by all targets. X86 doesn't have optimizations for it. Since we already know the input to the sint_to_fp/uint_to_fp is a setcc we can just emit a plain select using that setcc as the condition. Other DAG combines can turn that into a select_cc on targets that support it. Differential Revision: https://reviews.llvm.org/D75415	2020-03-01 10:52:17 -08:00
Lang Hames	e46ccc96a0	[JITLink] Update DEBUG_TYPE string for llvm-jitlink. Apparently LLVM_DEBUG doesn't like dashes in strings.	2020-03-01 09:38:06 -08:00
Stefanos Baziotis	d11bc38c04	Fix [ADT][NFC] SCCIterator: Change hasLoop() to hasCycle()	2020-03-01 19:35:58 +02:00
Stefanos Baziotis	9b680a0a2c	[ADT][NFC] SCCIterator: Change hasLoop() to hasCycle()	2020-03-01 19:17:21 +02:00
Reid Kleckner	bbb26b638d	Attempt to fix ZLIB CMake logic on Windows CMake doesn't seem to like it when you regex search for "^".	2020-03-01 08:45:24 -08:00
Reid Kleckner	20236045b1	[WinEH] Fix inttoptr+phi optimization in presence of catchswitch getFirstInsertionPt's return value must be checked for validity before casting it to Instruction*. Don't attempt to insert casts after a phi in a catchswitch block. Fixes PR45033, introduced in D37832. Reviewed By: davidxl, hfinkel Differential Revision: https://reviews.llvm.org/D75381	2020-03-01 07:49:28 -08:00
Sanjay Patel	ecdd92802f	[DAGCombiner] recognize shuffle (shuffle X, Mask0), Mask --> splat X We get the simple cases of this via demanded elements and other folds, but that doesn't work if the values have >1 use, so add a dedicated match for the pattern. We already have this transform in IR, but it doesn't help the motivating x86 tests (based on PR42024) because the shuffles don't exist until after legalization and other combines have happened. The AArch64 test shows a minimal IR example of the problem. Differential Revision: https://reviews.llvm.org/D75348	2020-03-01 09:10:25 -05:00
Jun Ma	f81f85226a	[Coroutines][New pass manager] Move CoroElide pass to right position Differential Revision: https://reviews.llvm.org/D75345	2020-03-01 21:48:24 +08:00
Jun Ma	6998c94fae	Revert "[Coroutines][new pass manager] Move CoroElide pass to right position" This reverts commit 4c0a133a412cd85381469e20f88ee7bf5d2ded8e.	2020-03-01 21:37:41 +08:00
Jun Ma	123d381c17	[Coroutines][new pass manager] Move CoroElide pass to right position Differential Revision: https://reviews.llvm.org/D75345	2020-03-01 20:55:38 +08:00
Craig Topper	b8eb0ad623	[X86] Don't add DELETED_NODES to DAG combine worklist after calling SimplifyDemandedBits/SimplifyDemandedVectorElts. These AddToWorklist calls were added in 84cd968f75bbd6e0fbabecc29d2c1090263adec7. It's possible the SimplifyDemandedBits/SimplifyDemandedVectorElts triggered CSE that deleted N. Detect that and avoid adding N to the worklist. Fixes PR45067.	2020-03-01 00:06:32 -08:00
Fangrui Song	ed889a7401	[PowerPC] Move .got2/.toc logic from PPCLinuxAsmPrinter::doFinalization() to emitEndOfAsmFile() Delete redundant .p2align 2 and improve tests.	2020-02-29 17:12:36 -08:00
Juneyoung Lee	d770247e9f	[ValueTracking] Let getGuaranteedNonFullPoisonOp consider assume, remove mentioning about br Summary: This patch helps getGuaranteedNonFullPoisonOp handle llvm.assume call. Also, a comment about the semantics of branch is removed to prevent confusion. As llvm.assume does, branching on poison directly raises UB (as LangRef says), and this allows transformations such as introduction of llvm.assume on branch condition at each successor, or freely replacing values after conditional branch (such as at loop exit). Handling br is not addressed in this patch. It makes SCEV more accurate, causing existing LoopVectorize/IndVar/etc tests to fail. Reviewers: spatel, lebedev.ri, nlopes Reviewed By: nlopes Subscribers: hiraditya, javed.absar, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75397	2020-03-01 07:45:44 +09:00
Juneyoung Lee	41d800ff06	[ValueTracking] A value is never undef or poison if it must raise UB Summary: This patch helps isGuaranteedNotToBeUndefOrPoison return true if the value makes the program always undefined. According to value tracking functions' comments, it is not still in consensus whether a poison value can be bitwise or not, so conservatively only the case with i1 is considered. Reviewers: spatel, lebedev.ri, reames, nlopes, regehr Reviewed By: nlopes Subscribers: uenoku, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75396	2020-03-01 07:35:58 +09:00
Juneyoung Lee	d709c278b6	[GVN] Fold equivalent freeze instructions Summary: This patch defines two freeze instructions to have the same value number if they are equivalent. This is allowed because GVN replaces all uses of a duplicated instruction with another. If it partially rewrites use, it is not allowed. e.g) ``` a = freeze(x) b = freeze(x) use(a) use(a) use(b) => use(a) use(b) // This is not allowed! use(b) ``` Reviewers: fhahn, reames, spatel, efriedma Reviewed By: fhahn Subscribers: lebedev.ri, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75398	2020-03-01 07:32:05 +09:00
Craig Topper	fcfdcbcfd4	[X86] Tighten up the SDTypeProfile for X86ISD::CVTNE2PS2BF16. NFCI	2020-02-29 13:22:13 -08:00
Reid Kleckner	80428fb35f	Avoid including FileSystem.h from MemoryBuffer.h Lots of headers pass around MemoryBuffer objects, but very few open them. Let those that do include FileSystem.h. Saves ~250 includes of Chrono.h & FileSystem.h: $ diff -u thedeps-before.txt thedeps-after.txt \| grep '^[-+] ' \| sort \| uniq -c \| sort -nr 254 - ../llvm/include/llvm/Support/FileSystem.h 253 - ../llvm/include/llvm/Support/Chrono.h 237 - ../llvm/include/llvm/Support/NativeFormatting.h 237 - ../llvm/include/llvm/Support/FormatProviders.h 192 - ../llvm/include/llvm/ADT/StringSwitch.h 190 - ../llvm/include/llvm/Support/FormatVariadicDetails.h ... This requires duplicating the file_t typedef, which is unfortunate. I sunk the choice of mapping mode down into the cpp file using variable template specializations instead of class members in headers.	2020-02-29 12:30:23 -08:00
Simon Pilgrim	3cec35fc47	Fix Wdocumentation warning - use tparam for template parameters. NFC.	2020-02-29 19:25:02 +00:00
Simon Pilgrim	e5707f0783	[MachineInst] Remove dead code. NFCI. The MachineFunction MF value is not used any more and is always null.	2020-02-29 19:25:02 +00:00
Simon Pilgrim	fc4498117c	Make argument const to silence cppcheck warning. NFCI.	2020-02-29 19:25:01 +00:00
Petr Hosek	1d64104097	[CMake] Link against ZLIB::ZLIB This is the imported target that find_package(ZLIB) defines. Differential Revision: https://reviews.llvm.org/D74176	2020-02-29 11:07:25 -08:00
Petr Hosek	352df9bc18	[CMake] Use PUBLIC link mode for static libraries Using INTERFACE prevents the use of imported libraries as we've done in 00b3d49 because these aren't linked against the target, they're only made part of the interface. This doesn't affect the output since static libraries aren't being linked into, but it enables the use of imported libraries. Differential Revision: https://reviews.llvm.org/D74106	2020-02-29 11:05:59 -08:00
Simon Pilgrim	6019f1db6d	[X86][F16C] Remove cvtph2ps intrinsics and use generic half2float conversion (PR37554) This removes everything but int_x86_avx512_mask_vcvtph2ps_512 which provides the SAE variant, but even this can use the fpext generic if the rounding control is the default. Differential Revision: https://reviews.llvm.org/D75162	2020-02-29 18:57:35 +00:00
Fangrui Song	e70edfb8de	[MC] Add MCStreamer::emitInt{8,16,32,64} Similar to AsmPrinter::emitInt{8,16,32,64}.	2020-02-29 09:40:21 -08:00
Sanjay Patel	06b5c8075d	[PassManager] add tests for vector pass enabling; NFC	2020-02-29 07:56:41 -05:00
Stefan Gränitz	0becb69ddb	[ExecutionEngine] Add JITSymbolFlags::fromSummary(GlobalValueSummary*) Summary: A function that creates JITSymbolFlags from a GlobalValueSummary. Similar functions exist: fromGlobalValue(), fromObjectSymbol() Reviewers: lhames Reviewed By: lhames Subscribers: hiraditya, steven_wu, dexonsmith, arphaman, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75082	2020-02-29 11:54:03 +00:00
Georgii Rymar	2a39090bf4	[llvm-readobj] - Report warnings instead of errors for broken relocations. This is a follow-up for https://reviews.llvm.org/D74545. It adds test cases for each incorrect case returned in `getRelocationTarget`. Differential revision: https://reviews.llvm.org/D74595	2020-02-29 12:50:32 +03:00
Benjamin Kramer	f478403d71	ArrayRef'ize restoreCalleeSavedRegisters. NFCI. restoreCalleeSavedRegisters can mutate the contents of the CalleeSavedInfos, so use a MutableArrayRef.	2020-02-29 09:50:23 +01:00
Shengchen Kan	56f370d717	[X86] Move the function getOrCreateBoundaryAlignFragment MCObjectStreamer is more suitable to create fragments than X86AsmBackend, for example, the function getOrCreateDataFragment is defined in MCObjectStreamer. Differential Revision: https://reviews.llvm.org/D75351	2020-02-29 15:11:16 +08:00
Shengchen Kan	616dd1c201	[X86] Disable the NOP padding for branches when bundle is enabled When bundle is enabled, data fragment itself has a space to emit NOP to bundle-align instructions. The behaviour makes it impossible for us to determine whether the macro fusion really happen when emitting instructions. In addition, boundary-align fragment is also used to emit NOPs to align instructions, currently using them together sometimes makes code crazy. Differential Revision: https://reviews.llvm.org/D75346	2020-02-29 15:07:06 +08:00

1 2 3 4 5 ...

192754 Commits