llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Aaron Smith	f22e6d55bc	[SelectionDAG] Add MVT::bf16 to getConstantFP() Summary: This was probably overlooked in recent bfloat patches. Needed to handle bf16 constants in SelectionDAG. ConstantFP:bf16<APFloat(0)> Reviewers: stuij Reviewed By: stuij Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81779	2020-06-16 15:10:05 -07:00
Fangrui Song	3ac41a5d21	[llvm-cov gcov] Don't suppress .gcov output if .gcda is corrupted If .gcda is corrupted, gcov continues to produce a .gcov and just assumes execution counts are zeros. This is reasonable, because the program can corrupt its .gcda output. The code path should be similar to the code path without .gcda.	2020-06-16 14:55:38 -07:00
Daniel Sanders	c3ee50d3ed	[gicombiner] Allow generated combiners to store additional members Summary: Adds the ability to add members to a generated combiner via a State base class. In the current AArch64PreLegalizerCombiner this is used to make Helper available without having to provide it to every call. As part of this, split the command line processing into a separate object so that it still only runs once even though the generated combiner is constructed more frequently. Depends on D81862 Reviewers: aditya_nandakumar, bogner, volkan, aemerson, paquette, arsenm Reviewed By: arsenm Subscribers: jvesely, wdng, nhaehnle, kristof.beyls, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81863	2020-06-16 14:47:04 -07:00
Kirill Naumov	f43849e9c7	[CallPrinter] Adding heat coloring to CallPrinter This patch introduces the heat coloring of the Call Printer which is based on the relative "hotness" of each function. The patch is a part of sequence of three patches, related to graphs Heat Coloring. Another feature added is the flag similar to "-cfg-dot-filename-prefix", which allows to write the graph into a named .pdf Reviewers: rcorcs, apilipenko, davidxl, sfertile, fedor.sergeev, eraman, bollu Differential Revision: https://reviews.llvm.org/D77172	2020-06-16 21:15:29 +00:00
Fangrui Song	d1d0909fd1	[gcov] Add -i --intermediate-format Between gcov 4.9~8, `gcov -i $file` prints coverage information to $file.gcov in an intermediate text format (single file, instead of $source.gcov for each source file). lcov newer than 2019-05-24 detects -i support and uses it to increase processing speed. gcov 9 (GCC r265587) removed --intermediate-format and -i was changed to mean --json-format. However, we consider this format still useful and support it. geninfo (part of lcov) supports this format even if we announce that we are compatible with gcov 9.0.0	2020-06-16 14:14:28 -07:00
Fangrui Song	69837fa7b5	[gcov] Refactor llvm-cov gcov and add SourceInfo	2020-06-16 14:14:26 -07:00
Christopher Tetreault	ebc189db72	[SVE] Eliminate calls to default-false VectorType::get() from AArch64 Reviewers: efriedma, c-rhodes, david-arm, samparker, greened Reviewed By: efriedma Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81518	2020-06-16 13:53:25 -07:00
Christopher Tetreault	f0aae37e1d	[NFC] Bail out for scalable vectors before calling getNumElements Summary: Move the bail out logic to before constructing the Result and Lane vectors. This is both potentially faster, and avoids calling getNumElements on a potentially scalable vector Reviewers: efriedma, sunfish, chandlerc, c-rhodes, fpetrogalli Reviewed By: fpetrogalli Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81619	2020-06-16 13:41:29 -07:00
Christopher Tetreault	650da30b37	[SVE] Fix bad FixedVectorType cast in simplifyDivRem Summary: simplifyDivRem attempts to walk a VectorType elementwise. Ensure that it only does so for FixedVectorType Reviewers: efriedma, spatel, lebedev.ri, david-arm, kmclaughlin Reviewed By: spatel, david-arm Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81856	2020-06-16 13:17:05 -07:00
Christopher Tetreault	59f1665fe9	[SVE] Eliminate calls to default-false VectorType::get() from Vectorize Reviewers: efriedma, fhahn, spatel, sdesmalen, kmclaughlin Reviewed By: efriedma Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81521	2020-06-16 12:50:13 -07:00
Matt Arsenault	4fd2c5292f	GlobalISel: Fix not failing on widening G_INSERT_VECTOR_ELT This doesn't actually handled type idx 0, but was reporting Legalized on it. No test changes because nothing was trying to use this.	2020-06-16 15:48:57 -04:00
Ahsan Saghir	1d87baf1c6	[PowerPC] Add -m[no-]power10-vector clang and llvm option Summary: This patch adds command line option for enabling power10-vector support. Reviewers: hfinkel, nemanjai, lei, amyk, #powerpc Reviewed By: lei, amyk, #powerpc Subscribers: wuzish, kbarton, hiraditya, shchenz, cfe-commits, llvm-commits Tags: #llvm, #clang, #powerpc Differential Revision: https://reviews.llvm.org/D80758	2020-06-16 14:47:35 -05:00
Matt Arsenault	f91d7e4c2e	GlobalISel: Use early return and reduce indentation	2020-06-16 14:47:08 -04:00
Stanislav Mekhanoshin	1a6d6ebda9	Fix ubsan error in tblgen with signed left shift UBSAN complains when tblgen performs SHL of a negative value. Differential Revision: https://reviews.llvm.org/D81952	2020-06-16 11:15:09 -07:00
Hiroshi Yamauchi	b2733d94ae	[TLI] Add four C++17 delete variants. Summary: delete(void, unsigned int, align_val_t) delete(void, unsigned long, align_val_t) delete[](void, unsigned int, align_val_t) delete[](void, unsigned long, align_val_t) Differential Revision: https://reviews.llvm.org/D81853	2020-06-16 11:12:02 -07:00
Sanjay Patel	fe400c6e8c	[VectorCombine] scalarize compares with insertelement operand(s) Generalize scalarization (recently enhanced with D80885) to allow compares as well as binops. Similar to binops, we are avoiding scalarization of a loaded value because that could avoid a register transfer in codegen. This requires 1 extra predicate that I am aware of: we do not want to scalarize the condition value of a vector select. That might also invert a transform that we do in instcombine that prefers a vector condition operand for a vector select. I think this is the final step in solving PR37463: https://bugs.llvm.org/show_bug.cgi?id=37463 Differential Revision: https://reviews.llvm.org/D81661	2020-06-16 13:48:10 -04:00
Jessica Paquette	b8d56677b3	[AArch64][GlobalISel] Avoid creating redundant ubfx when selecting G_ZEXT When selecting 32 b -> 64 b G_ZEXTs, we don't have to always emit the extend. If the instruction feeding into the G_ZEXT implicitly zero extends the high half of the register, we can just emit a SUBREG_TO_REG instead. Differential Revision: https://reviews.llvm.org/D81897	2020-06-16 09:50:47 -07:00
Fangrui Song	10f226c28a	[GlobalISel] Delete unused variable after r353432	2020-06-16 08:32:09 -07:00
Leandro Vaz	257581e847	Fix debug line info when line markers are present inside macros. Compiling assembly files when newlines are reduced to line markers within a `.macro` context will generate wrong information in `.debug_line` section. This patch fixes this issue by evaluating line markers within the macro scope but not when they are used and evaluated. Reviewed By: probinson Differential Revision: https://reviews.llvm.org/D80381	2020-06-16 16:13:11 +01:00
Luke Geeson	a099df1a66	[AArch64]: BFloat MatMul Intrinsics&CodeGen This patch upstreams support for BFloat Matrix Multiplication Intrinsics and Code Generation from __bf16 to AArch64. This includes IR intrinsics. Unittests are provided as needed. AArch32 Intrinsics + CodeGen will come after this patch. This patch is part of a series implementing the Bfloat16 extension of the Armv8.6-a architecture, as detailed here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a The bfloat type, and its properties are specified in the Arm Architecture Reference Manual: https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile The following people contributed to this patch: Luke Geeson - Momchil Velikov - Mikhail Maltsev - Luke Cheeseman Reviewers: SjoerdMeijer, t.p.northover, sdesmalen, labrinea, miyuki, stuij Reviewed By: miyuki, stuij Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits, llvm-commits, miyuki, chill, pbarrio, stuij Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80752 Change-Id: I174f0fd0f600d04e3799b06a7da88973c6c0703f	2020-06-16 15:23:30 +01:00
Luke Geeson	15ca470585	[AArch64]: BFloat Load/Store Intrinsics&CodeGen This patch upstreams support for ld / st variants of BFloat intrinsics in from __bf16 to AArch64. This includes IR intrinsics. Unittests are provided as needed. This patch is part of a series implementing the Bfloat16 extension of the Armv8.6-a architecture, as detailed here: https://community.arm.com/developer/ip-products/processors/b/processors-ip-blog/posts/arm-architecture-developments-armv8-6-a The bfloat type, and its properties are specified in the Arm Architecture Reference Manual: https://developer.arm.com/docs/ddi0487/latest/arm-architecture-reference-manual-armv8-for-armv8-a-architecture-profile The following people contributed to this patch: - Luke Geeson - Momchil Velikov - Luke Cheeseman Reviewers: fpetrogalli, SjoerdMeijer, sdesmalen, t.p.northover, stuij Reviewed By: stuij Subscribers: arsenm, pratlucas, simon_tatham, labrinea, kristof.beyls, hiraditya, danielkiss, cfe-commits, llvm-commits, pbarrio, stuij Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D80716 Change-Id: I22e1dca2a8a9ec25d1e4f4b200cb50ea493d2575	2020-06-16 15:23:30 +01:00
Georgii Rymar	764aa568b0	[DebugInfo/DWARF] - Report .eh_frame sections of version != 1. Specification (https://refspecs.linuxbase.org/LSB_5.0.0/LSB-Core-generic/LSB-Core-generic/ehframechpt.html#AEN1349) says that the value of Version field for .eh_frame should be 1. Though we accept other values and might perform an attempt to read it as a .debug_frame because of that, what is wrong. This patch adds a version check. Differential revision: https://reviews.llvm.org/D81469	2020-06-16 15:46:26 +03:00
Tyker	47f1c58669	Revert "[AssumeBundles] add cannonicalisation to the assume builder" This reverts commit 90c50cad1983c5e29107a78382dead0fe2a9562c.	2020-06-16 14:34:55 +02:00
Ayke van Laethem	2d43b2fbd5	[AVR] Remove faulty stack pushing behavior An instruction like this will need to allocate some stack space for the last parameter: %x = call addrspace(1) i16 @bar(i64 undef, i64 undef, i16 undef, i16 0) This worked fine when passing an actual value (in this case 0). However, when passing undef, no value was pushed to the stack and therefore no push instructions were created. This caused an unbalanced stack leading to interesting results. This commit fixes that by replacing the push logic with a regular stack adjustment and stack-relative load/stores. This is less efficient but at least it correctly compiles the code. I can think of a few improvements in the future: * The stack should have been adjusted in the function prologue when there are no allocas in the function. * Many (if not most) stack adjustments can be replaced by pushing/popping the values directly. Exactly like the previous code attempted but didn't do correctly. * Small stack adjustments can be done more efficiently with a few push/pop instructions (pushing/popping bogus values), both for code size and for speed. All in all, as long as there are no allocas in the function I think that it is almost always more efficient to emit regular push/pop instructions. This is however left for future optimizations. Differential Revision: https://reviews.llvm.org/D78581	2020-06-16 13:53:32 +02:00
Ayke van Laethem	684b1f2b77	[AVR] Fix stack size in functions with a frame pointer This patch fixes a bug in stack save/restore code. Because the frame pointer was saved/restored manually (not by marking it as clobbered) the StackSize variable was not updated accordingly. Most code still worked, but code that tried to load a parameter passed on the stack did not. This commit fixes this by marking the frame pointer as a callee-clobbered register. This will let it be saved without any effort in prolog/epilog code and will make sure the correct address is calculated for loading parameters that are passed on the stack. This approach is used by most other targets (such as X86, AArch64 and RISC-V). Differential Revision: https://reviews.llvm.org/D78579	2020-06-16 13:53:32 +02:00
David Green	3175ed5612	[ARM] Fix crash trying to generate i1 immediates These code patterns attempt to call isVMOVModifiedImm on a splat of i1 values, leading to an unreachable being hit. I've guarded the call on a more specific set of sizes, as i1 vectors are legal under MVE. Differential Revision: https://reviews.llvm.org/D81860	2020-06-16 12:27:24 +01:00
Simon Pilgrim	5719b7a57b	Fix comment typo - Uexpected -> Unexpected. NFC.	2020-06-16 12:14:51 +01:00
Tyker	4068658963	[AssumeBundles] add cannonicalisation to the assume builder Summary: this reduces significantly the number of assumes generated without aftecting too much the information that is preserved. this improves the compile-time cost of enable-knowledge-retention significantly. Reviewers: jdoerfert, sstefan1 Reviewed By: jdoerfert Subscribers: hiraditya, asbirlea, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79650	2020-06-16 13:12:35 +02:00
Kristof Beyls	d310eea0df	Silence GCC 7 warning GCC 7 was reporting "enumeral and non-enumeral type in conditional expression" as a warning. The code casts an instruction opcode enum to unsigned implicitly, in line with intentions; so this commit silences the warning by making the cast to unsigned explicit.	2020-06-16 11:42:52 +01:00
sstefan1	73e7ebf693	[NFC][OpenMPOpt] Provide function-specific foreachUse.	2020-06-16 12:33:15 +02:00
Alexandros Lamprineas	facc422c9e	[ARM][NFC] Explicitly specify the fp16 value type in codegen patterns. We are planning to add the bf16 value type in the HPR register class and this will make the codegen patterns ambiguous. Differential Revision: https://reviews.llvm.org/D81505	2020-06-16 11:32:17 +01:00
Jay Foad	32716ad2d8	Revert "[IR] Clean up dead instructions after simplifying a conditional branch" This reverts commit 69bdfb075b293c4b3363f2dc0ac732ca03c3c9ca. Reverting to investigate https://bugs.llvm.org/show_bug.cgi?id=46343	2020-06-16 10:32:15 +01:00
Simon Pilgrim	88a4eeda25	[X86][SSE] combineVectorSizedSetCCEquality - remove unused AVX2 MOVMSK path. NFCI. If PTEST is not available, then we're guaranteed to be performing a 128-bit vector comparison using MOVMSK(PCMPEQB(v16i8)).	2020-06-16 10:07:41 +01:00
Igor Kudrin	cb2c2a149f	[MC] Generate .debug_frame in the 64-bit DWARF format [7/7] Note that .eh_frame sections are generated in the 32-bit format even when debug sections are 64-bit, for compatibility reasons. They use relative references between entries, so they hardly benefit from the 64-bit format. Differential Revision: https://reviews.llvm.org/D81149	2020-06-16 15:50:14 +07:00
Igor Kudrin	9b0111814b	[MC] Fix DWARF forms for 64-bit DWARFv3 files [6/7] DW_FORM_sec_offset was introduced in DWARFv4, so, for 64-bit DWARFv3, DW_FORM_data8 should be used instead. Differential Revision: https://reviews.llvm.org/D81148	2020-06-16 15:50:14 +07:00
Igor Kudrin	8986fbdc27	[MC] Generate .debug_rnglists in the 64-bit DWARF format [5/7] In addition, the patch fixes referencing the section within a compilation unit. Differential Revision: https://reviews.llvm.org/D81147	2020-06-16 15:50:13 +07:00
Igor Kudrin	2619754621	[MC] Generate .debug_aranges in the 64-bit DWARF format [4/7] Differential Revision: https://reviews.llvm.org/D81146	2020-06-16 15:50:13 +07:00
Igor Kudrin	7881958479	[MC] Generate a compilation unit in the 64-bit DWARF format [3/7] The patch enables producing DWARF64 compilation units and fixes generating references to .debug_abbrev and .debug_line sections. A similar change for .debug_ranges/.debug_rnglists will be added in a forthcoming patch. Differential Revision: https://reviews.llvm.org/D81145	2020-06-16 15:50:13 +07:00
Igor Kudrin	853fb0b653	[MC] Generate .debug_line in the 64-bit DWARF format [2/7] Differential Revision: https://reviews.llvm.org/D81144	2020-06-16 15:50:13 +07:00
Igor Kudrin	02872c9d70	[MC] Add --dwarf64 to generate DWARF64 debug info [1/7] The patch adds an option `--dwarf64` to instruct a tool to generate debug information in the 64-bit DWARF format. There is no real implementation yet, only a few compatibility checks. Differential Revision: https://reviews.llvm.org/D81143	2020-06-16 15:50:13 +07:00
Simon Pilgrim	125618570c	[X86][SSE] MatchVectorAllZeroTest - handle OR vector reductions This patch extends MatchVectorAllZeroTest to handle OR vector reduction patterns where the result is compared against zero. Fixes PR45378 Differential Revision: https://reviews.llvm.org/D81547	2020-06-16 09:42:34 +01:00
Simon Pilgrim	0a7c114d32	[X86][SSE] combineVectorSizedSetCCEquality - move single Subtarget.hasAVX() use into condition. NFC. We already have Subtarget.hasSSE2() and Subtarget.useAVX512Regs() in the condition - seems to be a legacy from when we had multiple uses.	2020-06-16 09:42:33 +01:00
Sam Parker	75d1608a84	[CostModel] Unify getCFInstrCost Have TTI::getInstructionThroughput call getUserCost for Br, Ret and PHI. This now means that eveything in getInstructionThroughput is handled by getUserCost. Differential Revision: https://reviews.llvm.org/D79849	2020-06-16 08:40:54 +01:00
Fangrui Song	c4908bd9d4	[AArch64] Print the immediate operand for SPACE pseudo instruction Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D81814	2020-06-15 20:55:53 -07:00
Amara Emerson	188eb39c56	[AArch64][GlobalISel] Emit constant pool loads for 64 bit fp immediates. Note: don't do this for integer 64 bit materialization to match SDAG. Differential Revision: https://reviews.llvm.org/D81893	2020-06-15 20:53:09 -07:00
Qiu Chaofan	71cc6e9751	[LLParser] Delete temp CallInst when error occurs Only functions with floating-point return type accepts fast-math flags. When adding such flags to function returning integer, we'll see a crash, because there's still an undeleted value referencing the argument. This patch manually removes the temporary instruction when error occurs. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D78355	2020-06-16 11:41:25 +08:00
Xing GUO	972e60a052	[ObjectYAML][DWARF] Implement the .debug_addr section. This patch implements the .debug_addr section. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D81541	2020-06-16 10:53:10 +08:00
Mircea Trofin	14315371b9	[llvm][NFC] Fix license on InlineFeaturesAnalysis.{h\|cpp} Summary: Also fixed the InlineAdvisor.cpp license. Reviewers: rriddle Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81896	2020-06-15 19:34:33 -07:00
Craig Topper	39cca89f2e	[X86] Add support for inline assembly 'x' constraint for i128. Limiting to x86-64 since that's when __int128 is legal in clang. Differential Revision: https://reviews.llvm.org/D81817	2020-06-15 19:34:02 -07:00
Gui Andrade	bb38b0a59d	[MSAN] Pass Origin by parameter to __msan_warning functions Summary: Normally, the Origin is passed over TLS, which seems like it introduces unnecessary overhead. It's in the (extremely) cold path though, so the only overhead is in code size. But with eager-checks, calls to __msan_warning functions are extremely common, so this becomes a useful optimization. This can save ~5% code size. Reviewers: eugenis, vitalybuka Reviewed By: eugenis, vitalybuka Subscribers: hiraditya, #sanitizers, llvm-commits Tags: #sanitizers, #llvm Differential Revision: https://reviews.llvm.org/D81700	2020-06-15 17:49:18 -07:00

1 2 3 4 5 ...

135657 Commits