The version of `st1d` that operates with vector plus immediate
addressing mode uses the alias `st1d { <Zn>.d }, <Pg>, [<Za>.d]` for
rendering `st1d { <Zn>.d }, <Pg>, [<Za>.d, #0]`. The disassembler was
generating `<Zn>.s` instead of `<Zn>.d`.
Differential Revision: https://reviews.llvm.org/D86633
We hit the compile-time problem reported in
https://bugs.llvm.org/show_bug.cgi?id=46877, and the cause is the same
as in D77319. So we need to remove the dead node we created to avoid
increasing the problem size for DAGCombiner.
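As a toy self-contained illustration of the principle (illustrative names,
not the actual DAGCombiner code): a worklist-driven combiner revisits every
node that still exists, so leaving replaced nodes behind inflates every
later pass over the graph, while erasing them eagerly keeps the problem
size bounded.

  // Toy worklist "combiner": a combine rewrites "mul" into "shl" and
  // orphans the old node; the fix is to erase dead nodes eagerly.
  #include <cstdio>
  #include <list>
  #include <string>

  struct Node { std::string Op; bool Dead = false; };

  int main() {
    std::list<Node> Nodes{{"mul"}, {"add"}, {"sub"}};
    Nodes.push_back({"shl"});              // replacement node
    for (Node &N : Nodes)
      if (N.Op == "mul") N.Dead = true;    // old node is now dead
    Nodes.remove_if([](const Node &N) { return N.Dead; });
    std::printf("%zu live nodes\n", Nodes.size()); // 3, not 4
  }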
Reviewed By: spatel
Differential Revision: https://reviews.llvm.org/D86183
(cherry picked from commit 960cbc53ca170c8c605bf83fa63b49ab27a56f65)
Replace the `ident_t` handling in Clang with the methods offered by the
OMPIRBuilder. This cuts down on the clang code as well as the
differences between the two, making further transitions easier. Tests
have changed but there should not be a real functional change. The most
interesting difference is probably that we stop generating local ident_t
allocations for now and just use globals. Given that this happens only
with debug info, the location part of the `ident_t` is probably bigger
than the rest anyway. As the location part is already a global, we can
avoid the allocation, memcpy, and store in favor of a constant global
that is slightly bigger. This can be revisited if there are
complications.
Reviewed By: ABataev
Differential Revision: https://reviews.llvm.org/D80735
D77531 has a typo: it uses `mfsprg` where it should be `mtsprg`. This
patch fixes the typo.
(cherry picked from commit 95e18b2d9d5f93c209ea81df79c2e18ef77de506)
This fixes the "Unable to insert indirect branch" fatal error sometimes
seen when generating position-independent code.
Patch by msizanoen1
Reviewed By: jrtc27
Differential Revision: https://reviews.llvm.org/D84833
(cherry picked from commit 5f9ecc5d857fa5d95f6ea36153be19db40576f8a)
When turning on -debug-info-kind=constructor we ran into a "fragment covers
entire variable" error during ThinLTO. The fragment is currently always
emitted if there is no type size, but sometimes the variable has a
forward-declared struct type, which doesn't have a size.
This changes the code to get the type size from the GlobalVariable instead.
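Using public LLVM APIs, the lookup plausibly looks like this (a hedged
sketch, not the patch itself): the size comes from the GlobalVariable's
value type via the DataLayout, which is known even when the corresponding
debug-info type is only forward declared.

  #include "llvm/IR/DataLayout.h"
  #include "llvm/IR/GlobalVariable.h"
  #include "llvm/IR/Module.h"
  using namespace llvm;

  static uint64_t sizeInBits(const GlobalVariable &GV) {
    const Module &M = *GV.getParent();
    // getValueType() is the type of the variable itself, not the pointer.
    return M.getDataLayout().getTypeSizeInBits(GV.getValueType());
  }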
Differential Revision: https://reviews.llvm.org/D85572
(cherry picked from commit 54b6cca0f28484395ae43bcda4c9f929bc51cfe3)
For example, a v4f16 argument is scalarized to 4 i32 values, so
the values are spread out instead of being packed tightly as
in the original vector.
Fixes PR47000.
(cherry picked from commit 08b2d0a963dbbf54317a137d69f430b347d1bfae)
Previously the time complexity was O(|number of paths from the root to an
implied feature| * CPU_FEATURE_MAX), where CPU_FEATURE_MAX is 92.
The number of paths can be large (theoretically exponential).
For an inline asm statement, there is a code path
`clang::Parser::ParseAsmStatement -> clang::Sema::ActOnGCCAsmStmt -> ASTContext::getFunctionFeatureMap`
leading to potentially many calls to getImpliedEnabledFeatures (41 for my -march=native case).
We should improve the performance a bit in case the number of inline asm
statements is large (Linux kernel builds).
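As a rough sketch of the shape of the fix (hypothetical feature graph and
names, not the actual X86 code): a visited set makes the walk touch each
feature once, O(V + E), instead of once per root-to-feature path, which
can be exponential in a dense implication DAG.

  // Compute the transitive closure of implied features with a visited set.
  #include <cstdio>
  #include <map>
  #include <set>
  #include <string>
  #include <vector>

  using Graph = std::map<std::string, std::vector<std::string>>;

  static void collectImplied(const Graph &Implies, const std::string &F,
                             std::set<std::string> &Visited) {
    if (!Visited.insert(F).second)
      return; // already expanded: this check kills the path blow-up
    auto It = Implies.find(F);
    if (It == Implies.end())
      return;
    for (const std::string &Dep : It->second)
      collectImplied(Implies, Dep, Visited);
  }

  int main() {
    Graph Implies{{"avx512f", {"avx2"}}, {"avx2", {"avx"}}, {"avx", {"sse4.2"}}};
    std::set<std::string> Enabled;
    collectImplied(Implies, "avx512f", Enabled);
    std::printf("%zu features enabled\n", Enabled.size()); // 4
  }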
Reviewed By: craig.topper
Differential Revision: https://reviews.llvm.org/D85257
(cherry picked from commit 0c7af8c83bd1acb0ca78f35ddde29b6fde4363a0)
While x*undef is undef, shift-by-undef is poison,
which we must avoid introducing.
Also, log2(iN undef) is *NOT* iN undef, because log2(iN undef) u< N:
for i32, for example, the result is always unsigned-less-than 32, so it
cannot take every possible i32 value.
See https://bugs.llvm.org/show_bug.cgi?id=47133
(cherry picked from commit 12d93a27e7b78d58dd00817cb737f273d2dba8ae)
The code wasn't taking into account that the two operands
passed to ptest could be identical and was trying to erase
them twice.
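The hazard is generic; a minimal self-contained illustration (toy types,
not the actual AArch64 ISel code): deduplicate the operands before
erasing, so a node shared by both operands is erased exactly once.

  #include <cassert>
  #include <set>

  struct Node { int Id; };

  static void eraseOperands(std::set<Node *> &Live, Node *Op1, Node *Op2) {
    std::set<Node *> Unique{Op1, Op2};   // collapses identical operands
    for (Node *N : Unique) {
      bool Erased = Live.erase(N) == 1;  // a second erase would be the bug
      assert(Erased && "node erased twice or never live");
      (void)Erased;
    }
  }

  int main() {
    Node A{0};
    std::set<Node *> Live{&A};
    eraseOperands(Live, &A, &A);         // identical operands: erased once
  }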
Differential Revision: https://reviews.llvm.org/D85892
(cherry picked from commit 6c7957c9901714b7ad0a8d2743a8c431b57fd0c9)
Introduced by fd6584a22043b254a323635c142b28ce80ae5b5b
Following similar use of casts in AsmParser.cpp, for instance. Ideally
this type would use unsigned chars, as they're more representative of
raw data and don't get confused by the implementation-defined choice of
char's signedness; but this is what it is, and the signed/unsigned
conversions are (so far as I understand) safe/bit-preserving in this
usage, which is what's intended given the API design here.
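A small self-contained check of that claim (assuming the usual 8-bit
char; C++20 additionally guarantees two's complement):

  #include <cassert>

  int main() {
    unsigned char Raw = 0xF0;            // raw byte from a data buffer
    char C = static_cast<char>(Raw);     // may be negative if char is signed
    assert(static_cast<unsigned char>(C) == 0xF0); // bit pattern round-trips
  }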
(cherry picked from commit e31cfc4cd3e393300002e9c519787c96e3b67bab)
Fixed an incorrect pattern in lib/Target/AArch64/AArch64SVEInstrInfo.td
for storing out <vscale x 2 x f32> unpacked scalable vectors. Added
a couple of tests to
test/CodeGen/AArch64/sve-st1-addressing-mode-reg-imm.ll
Differential Revision: https://reviews.llvm.org/D85441
(cherry picked from commit 0905d9f31ead399d054c5d2a2c353e690f5c8daa)
This fixes an issue triggered by the following code, where emitEpilogue
got confused when trying to restore the SVE registers after the call,
since the call to non_sve() is implemented as a TCReturn:
int non_sve();
int sve(svint32_t x) { return non_sve(); }
Reviewed By: efriedma
Differential Revision: https://reviews.llvm.org/D84869
(cherry picked from commit f2916636f83dfeb4808a16045db0025783743471)
This patch adds a CFI entry for each SVE callee saved register
that needs unwind info at an offset from the CFA. The offset is
a DWARF expression because the offset is partly scalable.
The CFI entries only cover a subset of the SVE callee-saves and
only encode the lower 64 bits, thus implementing the lowest
common denominator ABI. Existing unwinders may support VG but
only restore the lower 64 bits.
Reviewed By: efriedma
Differential Revision: https://reviews.llvm.org/D84044
(cherry picked from commit bb3344c7d8c2703c910dd481ada43ecaf11536a6)
The CFA is calculated as (SP/FP + offset), but when there are
SVE objects on the stack the SP offset is partly scalable and
should instead be expressed as the DWARF expression:
SP + offset + scalable_offset * VG
where VG is the Vector Granule register, containing the
number of 64-bit 'granules' in a scalable vector.
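For example, on a machine with 256-bit SVE registers, VG = 256 / 64 = 4,
so a scalable_offset of 16 contributes 16 * 4 = 64 bytes on top of the
fixed offset.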
Reviewed By: efriedma
Differential Revision: https://reviews.llvm.org/D84043
(cherry picked from commit fd6584a22043b254a323635c142b28ce80ae5b5b)
Add the given input and mark it as tied.
This doesn't create an additional copy, compared to
matching the input constraint to a virtual register.
Differential Revision: https://reviews.llvm.org/D85122
(cherry picked from commit d893278bba01b0e1209e8b8accbdd5cfa75a0932)
These might occur in seemingly generic assembly. Previously when
targeting COFF, they were silently ignored, which certainly won't
give the right result. Instead clearly error out, to make it clear
that the assembly needs to be adjusted for this target.
Also change a preexisting report_fatal_error into a proper error
message, pointing out the offending source instruction. This isn't
strictly an internal error, as it can be triggered by user input.
Differential Revision: https://reviews.llvm.org/D85242
(cherry picked from commit f5e6fbac24f198d075a7c4bc0879426e79040bcf)
In fixupIsDeadOrKill, we assume StartMI and EndMI exist in the same
basic block, and we assert this in that function. The assumption is
wrong before RA, as before RA the true definition may exist in another
block, reached through copy-like instructions.
Reviewed By: nemanjai
Differential Revision: https://reviews.llvm.org/D83365
(cherry picked from commit 36f9fe2d3493717dbc6866d96b2e989839ce1a4c)
In practice this can easily be the product of combining strings with
macros in resource files.
This fixes https://github.com/mstorsjo/llvm-mingw/issues/140.
As string literals within llvm-rc are handled as StringRefs, each
referencing an uninterpreted slice of the input file, with actual
interpretation of the input string (codepage handling, unescaping, etc.)
done only right before writing them out to disk, it's hard to
concatenate them other than by bundling them up in a vector,
without rearchitecting a large part of llvm-rc.
This matches how the same already is supported in VersionInfoValue,
with a std::vector<IntOrString> Values.
MS rc.exe only supports concatenated string literals in version info
values (already supported), string tables (implemented in this patch)
and user data resources (easily implemented in a separate patch, but
not requested by any end user yet), while GNU windres supports string
immediates split into multiple strings anywhere (e.g.
100 ICON "myicon" ".ico"). I'm not sure whether concatenation in other
statements is actually used in the wild, though, in resource files
normally built by GNU windres.
Differential Revision: https://reviews.llvm.org/D85183
(cherry picked from commit b989fcbae6f179ad887d19ceef83ace1c00b87cc)
This fixes the ExecutionEngine/MCJIT/stubs-sm-pic.ll test in no-asserts
builds which is set to XFAIL on some platforms like 32-bit x86. More
importantly, we probably don't want to silently error in these cases.
Differential revision: https://reviews.llvm.org/D84390
(cherry picked from commit 6a3b07a4bf14be32569550f2e9814d8797d27d31)
Summary: This is in response to the review of
https://reviews.llvm.org/D84873: the expensive check should be reordered
last.
Reviewers: arsenm
Differential Revision: https://reviews.llvm.org/D84890
(cherry picked from commit 243376cdc7b719d443f42c8c4667e5d96af53dcc)
BUG_REPORT_URL is currently used both in LLVM and in Clang but declared
only in the latter. This means that it's missing in standalone clang
builds and the driver ends up outputting:
PLEASE submit a bug report to and include [...]
(note the missing URL)
To fix this, include LLVM_PACKAGE_BUGREPORT in LLVMConfig.cmake
(similarly to how we pass PACKAGE_VERSION) and use it to fill
BUG_REPORT_URL when building clang standalone.
Differential Revision: https://reviews.llvm.org/D84987
(cherry picked from commit 21c165de2a1bcca9dceb452f637d9e8959fba113)
Currently we skip alias sets with only reads or a single write and no
reads, but still add the pointers to the list of pointers in RtCheck.
This can lead to cases where we try to access a pointer that does not
exist when grouping checks. In most cases, the way we access
PositionMap masked that, as the value would default to index 0.
But in the example in PR46854 it causes a crash.
This patch updates the logic to avoid adding pointers for alias sets
that do not need any checks. It makes things slightly more verbose, by
first checking the numbers of reads/writes and bailing out early if we don't
need checks for the alias set.
I think this makes the logic a bit simpler to follow.
Reviewed By: anemet
Differential Revision: https://reviews.llvm.org/D84608
(cherry picked from commit 2062b3707c1ef698deaa9abc571b937fdd077168)
In cases where the alignment of the datatype is smaller than
expected by the instruction, the address is aligned. The aligned
address is used for the load, but wasn't used for the store
conditional, which resulted in a run-time alignment exception.
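As a minimal sketch of the invariant (generic C++, not the target's
actual lowering code): the address is aligned down once, and that same
aligned address must feed both halves of the load/store-conditional pair.

  #include <cassert>
  #include <cstdint>

  static uintptr_t alignDown(uintptr_t Addr, uintptr_t Align) {
    assert((Align & (Align - 1)) == 0 && "alignment must be a power of two");
    return Addr & ~(Align - 1);          // clear the low bits
  }

  int main() {
    uintptr_t Addr = 0x1003;             // under-aligned for a 4-byte access
    uintptr_t Aligned = alignDown(Addr, 4);
    assert(Aligned == 0x1000);           // use this for load *and* SC
  }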
(cherry picked from commit 7b114446c320de542c50c4c02f566e5d18adee33)
It turned out that D78704 included a private LLVM header, which is excluded
from the LLVM install target.
I'm substituting that `#include` with the public one by moving the necessary
`#define` into that. There was a discussion about this at D78704 and on the
cfe-dev mailing list.
I'm also placing a note to remind others of this pitfall.
Reviewed By: mgorny
Differential Revision: https://reviews.llvm.org/D84929
(cherry picked from commit 63d3aeb529a7b0fb95c2092ca38ad21c1f5cfd74)
...with the non-template version, as the template version might
increase the size of the compiler build.
Methods affected:
1.`findAddrModeSVELoadStore`
2. `SelectPredicatedStore`
Also, remove the `const` qualifier from the `unsigned` parameters of
the methods to conform with other similar methods in the class.
(cherry picked from commit dbeb184b7f54db2d3ef20ac153b1c77f81cf0b99)
When building code at -O0, we weren't falling back to DAG ISel correctly
when encountering alloca instructions with scalable vector types. This
is because the alloca has no operands that are scalable. I've fixed this by
adding a check in AArch64ISelLowering::fallBackToDAGISel for alloca
instructions with scalable types.
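The check plausibly has this shape (a sketch from the description above,
not the actual patch; the real code lives in
AArch64ISelLowering::fallBackToDAGISel):

  #include "llvm/IR/DerivedTypes.h"
  #include "llvm/IR/Instructions.h"
  using namespace llvm;

  static bool isScalableAlloca(const Instruction &I) {
    // GlobalISel cannot handle these yet, so request the DAG fallback.
    if (const auto *AI = dyn_cast<AllocaInst>(&I))
      return isa<ScalableVectorType>(AI->getAllocatedType());
    return false;
  }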
Differential Revision: https://reviews.llvm.org/D84746
(cherry picked from commit 23ad660b5d34930b2b5362f1bba63daee78f6aa4)
In vectorizeChainsInBlock we try to collect chains of PHI nodes
that have the same element type, but the code is relying upon
the implicit conversion from TypeSize -> uint64_t. For now, I have
modified the code to ignore PHI nodes with scalable types.
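A self-contained mini-model of the hazard (simplified stand-in; LLVM's
real TypeSize differs in detail): the implicit integer conversion has no
sound answer for scalable types, whose size is MinSize * vscale and only
known at run time, so scalable PHIs must be skipped before converting.

  #include <cassert>
  #include <cstdint>

  struct MiniTypeSize {
    uint64_t MinSize;
    bool Scalable;
    operator uint64_t() const {          // implicit conversion, as in LLVM
      assert(!Scalable && "scalable size is not a plain integer");
      return MinSize;
    }
  };

  int main() {
    MiniTypeSize Fixed{128, false};
    uint64_t Bits = Fixed;               // fine for fixed-width vectors
    (void)Bits;
    MiniTypeSize Sc{128, true};
    if (Sc.Scalable)
      return 0;                          // the patch: bail out before converting
    uint64_t Bad = Sc;                   // would assert for scalable types
    (void)Bad;
  }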
Differential Revision: https://reviews.llvm.org/D83542
(cherry picked from commit 9ad7c980bb47edd7db8f8db828b487cc7dfc9921)
I have added tests to:
CodeGen/AArch64/sve-intrinsics-int-arith.ll
for doing simple integer add operations on tuple types. Since these
tests introduced new warnings due to incorrect use of
getVectorNumElements(), I have also fixed up these warnings in the
same patch. These fixes are:
1. In narrowExtractedVectorBinOp I have changed the code to bail out
early for scalable vector types, since we've not yet hit a case that
proves the optimisations are profitable for scalable vectors.
2. In DAGTypeLegalizer::WidenVecRes_CONCAT_VECTORS I have replaced
calls to getVectorNumElements with getVectorMinNumElements in cases
that work with scalable vectors. For the other cases I have added
asserts that the vector is not scalable because we should not be
using shuffle vectors and build vectors in such cases.
Differential revision: https://reviews.llvm.org/D84016
(cherry picked from commit 207877175944656bd9b52d36f391a092854572be)
Previous patches fixed up all the warnings in this test:
llvm/test/CodeGen/AArch64/sve-sext-zext.ll
and this change simply checks that no new warnings are added in future.
Differential revision: https://reviews.llvm.org/D83205
(cherry picked from commit f43b5c7a76ab83dcc80e6769d41d5c4b761312b1)
In DAGTypeLegalizer::SplitVecOp_EXTRACT_SUBVECTOR I have replaced
calls to getVectorNumElements with getVectorMinNumElements, since
this code path works for both fixed and scalable vector types. For
scalable vectors the index will be multiplied by VSCALE.
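For example, splitting <vscale x 4 x i32> in half yields two
<vscale x 2 x i32> values, and the high half is extracted at minimum
index 2, i.e. at runtime element index 2 * vscale.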
Fixes warnings in this test:
sve-sext-zext.ll
Differential revision: https://reviews.llvm.org/D83198
(cherry picked from commit 5d84eafc6b86a42e261af8d753c3a823e0e7c67e)
I have introduced a new TargetFrameLowering query function:
isStackIdSafeForLocalArea
that queries whether or not it is safe for objects of a given stack
id to be bundled into the local area. The default behaviour is to
always bundle regardless of the stack id; however, for AArch64 this is
overridden so that it's only safe for fixed-size stack objects.
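The hook plausibly has this shape (simplified sketch; the stack-id
constant here is illustrative, not the real TargetStackID value):

  struct FrameLoweringSketch {
    virtual ~FrameLoweringSketch() = default;
    virtual bool isStackIdSafeForLocalArea(unsigned StackId) const {
      return true;                        // default: bundle regardless of id
    }
  };

  struct AArch64FrameLoweringSketch : FrameLoweringSketch {
    static constexpr unsigned SVEVectorID = 1; // illustrative stack id
    bool isStackIdSafeForLocalArea(unsigned StackId) const override {
      return StackId != SVEVectorID;      // only fixed-size objects are safe
    }
  };

  int main() {
    AArch64FrameLoweringSketch TFL;
    return TFL.isStackIdSafeForLocalArea(0) &&
                   !TFL.isStackIdSafeForLocalArea(1)
               ? 0
               : 1;
  }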
There is future work here to extend this algorithm for multiple local
areas so that SVE stack objects can be bundled together and accessed
from their own virtual base-pointer.
Differential Revision: https://reviews.llvm.org/D83859
(cherry picked from commit 14bc85e0ebb6c00c1672158ab6a692bfbb11e1cc)
While deallocating the stackframe, the offset used to reload the
callee-saved registers was not pointing to the SVE callee-saves,
but rather to the whole SVE area.
+--------------+
| GPR callee   |
| saves        |
+--------------+ <- FP
| SVE callee   |
| saves        |
+--------------+ <- Should restore SVE callee saves from here
| SVE Spills   |
| and Locals   |
+--------------+ <- instead of from here.
|              |
:              :
|              |
+--------------+ <- SP
Reviewed By: paulwalker-arm
Differential Revision: https://reviews.llvm.org/D84539
(cherry picked from commit cda2eb3ad2bbe923e74d6eb083af196a0622d800)
Instead of aligning the last callee-saved-register slot to the stack
alignment (16 bytes), just align the SVE callee-saved block. This also
simplifies the code that allocates space for the callee-saves.
This change is needed to make sure that the offset to which the
callee-saved register is spilled corresponds to the offset used for
e.g. unwind call frame instructions.
Reviewers: efriedma, paulwalker-arm, david-arm, rengolin
Reviewed By: efriedma
Differential Revision: https://reviews.llvm.org/D84042
(cherry picked from commit 26b4ef3694973ea2fa656d3d3a7f67f16f135654)
Fixed stack objects are preallocated and defined to be allocated before
any of the regular stack objects. These are normally used to model stack
arguments.
The AAPCS does not support passing SVE registers on the stack by value
(only by reference). The current layout also doesn't place them before
all stack objects, but rather before all SVE objects. Removing this
simplifies the code that emits the allocation/deallocation
around callee-saved registers (D84042).
This patch also removes all uses of fixedStack from
framelayout-sve.mir, where this was used purely for testing purposes.
Reviewers: paulwalker-arm, efriedma, rengolin
Reviewed By: paulwalker-arm
Differential Revision: https://reviews.llvm.org/D84538
(cherry picked from commit 54492a5843a34684ce21ae201dd8ca3e509288fd)
It's sort of tricky to hit this in practice, but not impossible. I have
a synthetic C testcase if anyone is interested.
The implementation is identical to the equivalent NEON register copies.
Differential Revision: https://reviews.llvm.org/D84373
(cherry picked from commit 993c1a3219a8ae69f1d700183bf174d75f3815d4)