llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Cullen Rhodes	5dfa2f5625	[AArch64] Implement .variant_pcs directive A dynamic linker with lazy binding support may need to handle variant PCS function symbols specially, so an ELF symbol table marking STO_AARCH64_VARIANT_PCS [1] was added to address this. Function symbols that follow the vector PCS are marked via the .variant_pcs assembler directive, which takes a single parameter specifying the symbol name and sets the STO_AARCH64_VARIANT_PCS st_other flag in the object file. [1] https://github.com/ARM-software/abi-aa/blob/master/aaelf64/aaelf64.rst#st-other-values Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D89138	2020-10-13 10:06:27 +00:00
Evgeny Leviant	a5e5f602ef	[TableGen][SchedModels] Fix aliasing of SchedWriteVariant Differential revision: https://reviews.llvm.org/D89114	2020-10-13 13:05:24 +03:00
Paul Walker	4bf1aa70a4	[SVE] Add ISel patterns for "insert undef_nxv#f##, f##, 0" Differential Revision: https://reviews.llvm.org/D89235	2020-10-13 10:49:18 +01:00
Roman Lebedev	43aa3d0fc4	[SCEV] BuildConstantFromSCEV(): properly handle SCEVSignExtend from ptr Much similar to the ZExt/Trunc handling. Thanks goes to Alexander Richardson for nudging towards noticing this one proactively. The appropriate (currently crashing) test coverage added.	2020-10-13 12:19:59 +03:00
Jay Foad	1bd21cb488	[AMDGPU] v_mac_legacy_f32 does not support DPP Differential Revision: https://reviews.llvm.org/D89245	2020-10-13 10:03:00 +01:00
Pietro Albini	70207b468c	Add expected response time and escalation path to the security docs Following up on the discussion within the group during the roundtable at the 2020 LLVM Developers Meeting, this commit adds to the security docs: * How long we expect acknowledging security reports will take * The escalation path the reporter can follow if they get no response A temporary line inviting reporters to directly follow the escalation path while the mailing list is being setup is also added. Differential Revision: https://reviews.llvm.org/D89068	2020-10-13 10:57:06 +02:00
Roman Lebedev	df3ab258e6	[SCEV] BuildConstantFromSCEV(): properly handle SCEVZeroExtend from ptr As being reported in https://reviews.llvm.org/D88806#2326944, this is pretty much the sibling problem of https://reviews.llvm.org/D88806#2325340, with root cause being that SCEV now models `ptrtoint` as trunc/zext/self of unknown. The appropriate (currently crashing) test coverage added.	2020-10-13 11:47:44 +03:00
Simon Pilgrim	6df62d9e30	[InstCombine] Support uniform vector splats in ((((X >> C) & CC) + Y) << C) folds. Add support for uniform vector splats (no undefs).	2020-10-13 09:28:39 +01:00
Max Kazantsev	5d36bfd596	[Test] Use generated auto-checks to make further changes more visible	2020-10-13 15:16:32 +07:00
Jay Foad	1a6becd14b	[AMDGPU] Use lowercase for subtarget feature names in RUN lines	2020-10-13 09:02:09 +01:00
Tobias Hieta	34a683def6	[llvm-install-name-tool] Add -delete_all_rpaths option This diff adds an option to remove all rpaths from a Mach-O binary. Test plan: make check-all Differential revision: https://reviews.llvm.org/D88674	2020-10-13 00:45:57 -07:00
Craig Topper	0120cd1285	[X86][SelectionDAG] Add SADDO_CARRY and SSUBO_CARRY to support multipart signed add/sub overflow legalization. This passes existing X86 test but I'm not sure if it handles all type legalization cases it needs to. Alternative to D89200 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D89222	2020-10-12 23:18:29 -07:00
Thomas Lively	2b696bcac5	Reland "[WebAssembly] Emulate v128.const efficiently"" This reverts commit 432e4e56d3d2, which reverted 542523a61a21. Two issues from the original commit have been fixed. First, MSVC does not like when std::array is initialized with only single braces, so this commit switches to using the more portable double braces. Second, there was a subtle endianness bug that prevented the original commit from working correctly on big-endian machines, which has been fixed by switching to using endianness-agnostic bit twiddling instead of type punning. Differential Revision: https://reviews.llvm.org/D88773	2020-10-13 04:36:59 +00:00
Jez Ng	985bef6e98	[llvm-readobj] Don't print out section names for STABS symbols This diff is similar to what D71394 did for `llvm-objdump` -- it avoids trying to look up a section name for STABS symbols, since some STABS symbol types (like `N_OSO`) use the `n_sect` field to store other data instead of a section index. Differential Revision: https://reviews.llvm.org/D88468	2020-10-12 18:55:40 -07:00
LLVM GN Syncbot	9263495686	[gn build] Port 412cdcf2edf	2020-10-13 01:07:55 +00:00
Wang, Pengfei	4ae5349aa4	[X86] Add HRESET instruction. For more details about these instructions, please refer to the latest ISE document: https://software.intel.com/en-us/download/intel-architecture-instruction-set-extensions-programming-reference. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D89102	2020-10-13 08:47:26 +08:00
Fangrui Song	bfb96021e5	[llc] -filetype=null: don't create .null if -o is not specified The new behavior is consistent with llvm-mc -filetype=null.	2020-10-12 17:28:59 -07:00
Ruiling Song	236d926c95	[AMDGPU] Update LiveVariables in convertToThreeAddress() This can fix an asan failure like below. ==15856==ERROR: AddressSanitizer: use-after-poison on address ... READ of size 8 at 0x6210001a3cb0 thread T0 #0 llvm::MachineInstr::getParent() #1 llvm::LiveVariables::VarInfo::findKill() #2 TwoAddressInstructionPass::rescheduleMIBelowKill() #3 TwoAddressInstructionPass::tryInstructionTransform() #4 TwoAddressInstructionPass::runOnMachineFunction() We need to update the Kills if we replace instructions. The Kills may be later accessed within TwoAddressInstruction pass. Differential Revision: https://reviews.llvm.org/D89092	2020-10-13 08:12:20 +08:00
Jonas Devlieghere	1e1e7e732e	[llvm] Export LLVM_USE_SPLIT_DWARF in LLVMConfig.cmake Export LLVM_USE_SPLIT_DWARF in LLVMConfig.cmake so that it can be used from standalone builds of clang and lldb. Currently, there is no way for standalone builds to know whether this option was set which means that it only applies to LLVM. Differential revision: https://reviews.llvm.org/D89282	2020-10-12 16:48:59 -07:00
Craig Topper	2976744ea9	[BitCodeAnalyzer] Add a few missing TYPE_CODES and MODULE_CODE_COMDAT to GetCodeName Happened to notice some of these printing as UnknownCode while running llvm-bcanalyzer on a bc file I had. Differential Revision: https://reviews.llvm.org/D86900	2020-10-12 15:43:12 -07:00
Eli Friedman	f8d23ec53c	[AArch64] Add tests for 128-bit shift variations. It looks like there's still some room for optimization; the funnel shift lowering is more efficient than the lowering of actual shifts.	2020-10-12 14:48:58 -07:00
Sanjay Patel	f787339d35	[CostModel] get type-based cost for vector intrinsics directly This is NFC-intended (the reductions and other intrinsics have tests that should provide coverage) - trying to untangle the mess that has formed here. We were stripping the arguments off of the given "IntrinsicCostAttributes" and then semi-recursively calling back into getIntrinsicInstrCost() only to then call getTypeBasedIntrinsicInstrCost(), so make that call directly instead.	2020-10-12 17:27:34 -04:00
Paul C. Anagnostopoulos	69c806cfe8	[TableGen] Add overload of RecordKeeper::getAllDerivedDefinitions() and use in PseudoLowering backend. Now the two getAllDerivedDefinitions() use StringRef and Arrayref. Use all_of() in getAllDerivedDefinitions().	2020-10-12 16:40:09 -04:00
Craig Topper	0b87fb88ce	[X86] Add more tests for D89178. NFC	2020-10-12 13:36:37 -07:00
Nico Weber	68c9b7691c	[gn build] Add libclang_rt.ios.a, libclang_rt.iossim.a to the build It's built with just-built clang, like all other compiler-rt parts in the GN build. This requires adding some cross build support to the mac toolchain. Also add explicit mmacosx-version-min and miphoneos-version-min flags to the build. ios.a is only built with the arm64 slice, iossim.a only with the x86_64 slice for now. (The latter should maybe become host_cpu when Arm Macs become a common iOS development platform.) With this, it's possible to build chromium/iOS with a GN-built LLVM. Differential Revision: https://reviews.llvm.org/D89260	2020-10-12 16:21:04 -04:00
Tony	c338b755d2	[AMDGPU] Correct processor names for gfx1010 and gfx1011 Change-Id: Ie409f86876b0437d0b0405aff42872963708d926 Differential Revision: https://reviews.llvm.org/D89259	2020-10-12 20:16:12 +00:00
Roman Lebedev	47d13f688a	Reland "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown" This relands commit 1c021c64caef83cccb719c9bf0a2554faa6563af which was reverted in commit 17cec6a11a12f815052d56a17ef738cf246a2d9a because an assertion was being triggered, since `BuildConstantFromSCEV()` wasn't updated to handle the case where the constant we want to truncate is actually a pointer. I was unsuccessful in coming up with a test case where we'd end there with constant zext/sext of a pointer, so i didn't handle those cases there until there is a test case. Original commit message: While we indeed can't treat them as no-ops, i believe we can/should do better than just modelling them as `unknown`. `inttoptr` story is complicated, but for `ptrtoint`, it seems straight-forward to model it just as a zext-or-trunc of unknown. This may be important now that we track towards making inttoptr/ptrtoint casts not no-op, and towards preventing folding them into loads/etc (see D88979/D88789/D88788) Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D88806	2020-10-12 23:02:55 +03:00
Roman Lebedev	7aaed16e98	[NFC][ScalarEvolution] Add tests with ptrtoint in constant context in loop Reduced from the https://reviews.llvm.org/D88806#2325340	2020-10-12 23:02:55 +03:00
Cameron McInally	9c7dd5dbe3	[SVE] Fix VBITS_GE_256 typo in fixed-width tests. This seems to be a typo that propagated to a number of tests. Replace VBITS_GE_256 with CHECK. There is no VBITS_GE_256.	2020-10-12 14:31:34 -05:00
Arthur Eubanks	8e4adad70b	[Inliner][NPM] Fix various tests under NPM alloca-dbgdeclare-merge.ll: alloca-merge-align.ll: array_merge.ll: NPM inliner does not merge allocas delete-call.ll: NPM inliner does not delete readonly calls externally_available.ll: NPM inliner does not delete available_externally functions inline-cold-callee.ll: inline-hot-callee.ll: inline-hot-callee.ll has a comment saying it only applies to legacy PM, I assume same for inline-cold-callee.ll devirtualize-2.ll: inline-hot-callsite: monster_scc.ll: pr22285.ll: already has legacy and new PM RUN lines inline-cold.ll: profile-summary required to see callee as cold prof-update-sample.ll: profile-summary required to update branch_weights Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D89093	2020-10-12 10:41:40 -07:00
Simon Pilgrim	49641626ba	[InstCombine] FoldShiftByConstant - create Scalar/Vector constant with ConstantInt::get(). NFCI. There's no need to create constant vector splats manually - missed this one in rG24dd0cd1edd5	2020-10-12 18:39:45 +01:00
Fangrui Song	7466a25f11	[X86] Support -march=x86-64-v[234] PR47686. These micro-architecture levels are defined in the x86-64 psABI: https://gitlab.com/x86-psABIs/x86-64-ABI/-/commit/77566eb03bc6a326811cb7e9 GCC 11 will support these levels. Note, -mtune=x86-64-v[234] are invalid and __builtin_cpu_is cannot be used on them. Reviewed By: craig.topper, RKSimon Differential Revision: https://reviews.llvm.org/D89197	2020-10-12 10:29:46 -07:00
Simon Pilgrim	eaf55acd73	[InstCombine] FoldShiftByConstant - create Scalar/Vector constant with ConstantInt::get(). NFCI. There's no need to create constant vector splats manually.	2020-10-12 18:17:20 +01:00
Simon Pilgrim	6abf3a54a4	[InstCombine] FoldShiftByConstant - merge equivalent types. NFCI. Consistently use the original shift instruction's Type/BitWidth instead of the operands, casted values etc.	2020-10-12 18:17:19 +01:00
Teresa Johnson	dbca0f9a08	Restore "[ThinLTO] Avoid temporaries when loading global decl attachment metadata" This restores commit ab1b4810b55279bcf6fdd87be74a403440be3991 which was reverted in 01b9deba76a950f04574b656c7c31ae389104f2d, with a fix for the issue it caused. We should use a temporary BitstreamCursor when loading the global decl attachment records so that the abbrev ids held in the lazy loading IndexCursor are not clobbered. Enhanced the test so that the issue is exposed there. Original description: When performing ThinLTO importing, the metadata loader attempts to lazy load, by building an index. However, module level global decl attachment metadata was being parsed early while building the index, since the associated (module level) global values aren't materialized on demand. This results in the creation of forward reference temporary metadatas, which are expensive. Normally, these module level global values don't have much attached metadata. However, in the case of -fwhole-program-vtables (e.g. for whole program devirtualization), the vtables may have many attached type metadatas. This was resulting in very slow performance when performing ThinLTO importing with the default lazy loading. This patch restructures the handling of these global decl attachment records, delaying their parsing until after the lazy loading index has been built. Then the parser can use the interface that loads from the index, which resolves forward references immediately instead of creating expensive temporaries. For one ThinLTO backend that imports from modules containing huge numbers of vtables and associated types, I measured the following compile times for the metadata materialization during function importing, rounded to nearest second: No -fwhole-program-vtables: Lazy loading on (head): 1s Lazy loading off (head): 3s Lazy loading on (patch): 1s With -fwhole-program-vtables: Lazy loading on (head): 440s Lazy loading off (head): 4s Lazy loading on (patch): 2s Differential Revision: https://reviews.llvm.org/D87970	2020-10-12 10:11:56 -07:00
Florian Hahn	daadcb3953	[VPlan] Use VPValue def for VPMemoryInstructionRecipe. This patch turns VPMemoryInstructionRecipe into a VPValue and uses it during VPlan construction and codegeneration instead of the plain IR reference where possible. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D84680	2020-10-12 18:02:33 +01:00
Hans Wennborg	bcad3ef754	Revert 1c021c64c "[SCEV] Model ptrtoint(SCEVUnknown) cast not as unknown, but as zext/trunc/self of SCEVUnknown" > While we indeed can't treat them as no-ops, i believe we can/should > do better than just modelling them as `unknown`. `inttoptr` story > is complicated, but for `ptrtoint`, it seems straight-forward > to model it just as a zext-or-trunc of unknown. > > This may be important now that we track towards > making inttoptr/ptrtoint casts not no-op, > and towards preventing folding them into loads/etc > (see D88979/D88789/D88788) > > Reviewed By: mkazantsev > > Differential Revision: https://reviews.llvm.org/D88806 It caused the following assert during Chromium builds: llvm/lib/IR/Constants.cpp:1868: static llvm::Constant llvm::ConstantExpr::getTrunc(llvm::Constant , llvm::Type *, bool): Assertion `C->getType()->isIntOrIntVectorTy() && "Trunc operand must be integer"' failed. See code review for a link to a reproducer. This reverts commit 1c021c64caef83cccb719c9bf0a2554faa6563af.	2020-10-12 18:39:35 +02:00
Konstantin Schwarz	e455104bb4	[GlobalISel][KnownBits] Early return on out of bound shift amounts If the known shift amount is bigger than or equal to the bitwidth of the type of the value to be shifted, the result is target dependent, so don't try to infer any bits. This fixes a crash we've seen in one of our internal test suites. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D89232	2020-10-12 18:39:19 +02:00
Dávid Bolvanský	6ca1daa81f	[Tests] Regenerate test checks; NFC	2020-10-12 17:55:00 +02:00
Mircea Trofin	e0730e7fa4	[NFC][MC] Use MCRegister in LiveRangeMatrix The change starts from LiveRangeMatrix and also checks the users of the APIs are typed accordingly. Differential Revision: https://reviews.llvm.org/D89145	2020-10-12 08:54:36 -07:00
Florian Hahn	9589bcce9e	[VPlan] Use operands for printing of VPWidenMemoryInstructionRecipe. Now that operands of the recipe are managed through VPUser, we can simplify the printing by just using the operands.	2020-10-12 16:51:54 +01:00
Mircea Trofin	ed7451bf79	[NFC][Regalloc] Pass VirtRegMap by reference. It's never null - the reason it's modeled as a pointer is because the pass can't init it in its ctor. Passing by ref simplifies the code, too, as the null checks were unnecessary complexity. Differential Revision: https://reviews.llvm.org/D89171	2020-10-12 08:32:30 -07:00
Sebastian Neubauer	5b90d915b3	[AMDGPU] Print metadata on error If the metadata is valid yaml, we can print it, even if it failed validation. That makes it easier to debug any wrong metadata. Differential Revision: https://reviews.llvm.org/D89243	2020-10-12 17:22:06 +02:00
Florian Hahn	d48b7dbff0	[LoopDeletion] Remove over-eager SCEV verification. 60b852092c98dbdc6248d60109d90ae6f8ad841c introduced SCEV verification to deleteDeadLoop, but it appears this check is currently a bit over-eager and some users of deleteDeadLoop appear to only patch up SE after calling it (e.g. PR47753). Remove the extra check for now. We can consider adding it back after we tracked down the source of the inconsistency for PR47753.	2020-10-12 16:18:30 +01:00
Sebastian Neubauer	d4dc1f31fa	[AMDGPU] Insert waterfall loops for divergent calls Extend loadSRsrcFromVGPR to allow moving a range of instructions into the loop. The call instruction is surrounded by copies into physical registers which should be part of the waterfall loop. Differential Revision: https://reviews.llvm.org/D88291	2020-10-12 17:16:11 +02:00
Cameron McInally	9f327e8d3a	[SVE] Lower fixed length VECREDUCE_XOR operation Differential Revision: https://reviews.llvm.org/D88974	2020-10-12 10:12:15 -05:00
Jay Foad	f4248c66fc	[AMDGPU] Use @LINE for error checking in gfx10 assembler tests	2020-10-12 16:10:20 +01:00
Simon Pilgrim	371ddf42dc	[InstCombine] matchFunnelShift - fold or(shl(a,x),lshr(b,sub(bw,x))) -> fshl(a,b,x) iff x < bw (REAPPLIED) If value tracking can confirm that a shift value is less than the type bitwidth then we can more confidently fold general or(shl(a,x),lshr(b,sub(bw,x))) patterns to a funnel/rotate intrinsic pattern without causing bad codegen regressions in the backend (see D89139). Reapplied after the shift canonicalization in rG02295e6d1a15 which removed the need to flip the shift values. Differential Revision: https://reviews.llvm.org/D88783	2020-10-12 16:06:41 +01:00
Simon Pilgrim	fbb71b81b0	[InstCombine] matchFunnelShift - remove shift value commutation. NFCI. After rG02295e6d1a15 we no longer need to invert the shift values for fshr - this is just hidden at the moment as funnel shifts only ever match for constant values so never use the fshr "Sub on SHL" path.	2020-10-12 15:55:18 +01:00
Simon Pilgrim	ab2599111f	[InstCombine] matchFunnelShift - canonicalize to OR(SHL,LSHR). NFCI. Simplify the shift amount matching code by canonicalizing the shift ops first.	2020-10-12 15:10:59 +01:00

... 3 4 5 6 7 ...

205207 Commits