llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	628ab5868b	[X86][AVX] combineBitcastvxi1 - improve handling of vectors truncated to vXi1 If we're truncating to vXi1 from a wider type, then prefer the original wider vector as is simplifies folding the separate truncations + extensions. AVX1 this is only worth it for v8i1 cases, not v4i1 where we're always better off truncating down to v4i32 for movmsk. Helps with some regressions encountered in D96609	2021-03-24 14:05:59 +00:00
Stefan Pintilie	34252802f4	[PowerPC] Add mprivileged option Add an option to tell the compiler that it can use privileged instructions. This patch only adds the option. Backend implementation will be added in a future patch. Reviewed By: lei, amyk Differential Revision: https://reviews.llvm.org/D99193	2021-03-24 08:33:22 -05:00
Vinicius Tinti	046d087d1d	[llvm-objdump] Implement --prefix-strip option The option `--prefix-strip` is only used when `--prefix` is not empty. It removes N initial directories from absolute paths before adding the prefix. This matches GNU's objdump behavior. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D96679	2021-03-24 13:22:35 +00:00
Joseph Huber	ccdd23f91f	[OpenMP] Change OMPIRBuilder to append function attributes Summary: Currently the OMPIRBuilder overwrites the function's existing attributes when it assigns the ones defined in OMPKinds.def. This changes the behaviour to append the current function's attributes with them instead. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D98740	2021-03-24 09:08:29 -04:00
Alexey Bataev	1b88ea3100	[LoopAnalysis][NFC]Remove redundant code. Removed redundant code for IsConsecutive variable.	2021-03-24 05:37:19 -07:00
Nico Weber	687ca89b8f	[gn build] port 1d8fc086ae26	2021-03-24 08:29:34 -04:00
Simon Pilgrim	add3426cc4	[X86][AVX] lowerShuffleAsBroadcast - MOVDDUP(SCALAR_TO_VECTOR(X)) -> BROADCAST(X) Prefer broadcast from scalar on AVX targets as this makes it easier for later folds to strip away bitcasts etc. This helps a lot with the AVX1 poor codegen from PR49658. There's a trivial regression in bitcast-int-to-vector-bool-*ext.ll tests due to SimplifyDemandedBits not being able to see a multi-use case, but there's bigger existing codegen issues to be addressed first in those tests (unnecessary NOTs).	2021-03-24 11:31:56 +00:00
Andrea Di Biagio	2d533d6c56	[MCA] Fix for uninitialised member in constructor. NFC	2021-03-24 11:21:59 +00:00
Simon Pilgrim	8136e5e7e1	[X86] Remove unused 'OneUse' option from IsNOT helper. NFCI.	2021-03-24 11:14:38 +00:00
Simon Pilgrim	12a8acb5a9	[X86][AVX] Cleanup gather_v8i32_v8i32 special test case Cleanup the gather_v8i32_v8i32 IR to more closely match how the middle-end will optimise the vector geps (exposing more splats). This helps the gather scalarization case a lot, but shows a missed opportunity for AVX512 gathers to recognise uniform-constant indices. And none of the cases realise that some of the gathers are really blended broadcasts....	2021-03-24 11:14:38 +00:00
alex-t	24e5eb0aab	[AMDGPU] SIOptimizeExecMaskingPreRA should check constant bus constraint when folds EXEC copy Folding EXEC copy into it's single use may lead to constant bus constraint violation as it adds one more SGPR operand. This change makes it validate the user instruction with the new SGPR operand and only fold it if it is legal. Reviewed By: rampitec, arsenm Differential Revision: https://reviews.llvm.org/D98888	2021-03-24 14:14:13 +03:00
Florian Hahn	d57a381a65	[LV] Move exact FP math check out of Requirements. We know if the loop contains FP instructions preventing vectorization after we are done with legality checks. This patch updates the code the check for un-vectorizable FP operations earlier, to avoid unnecessarily running the cost model and picking a vectorization factor. It also makes the code more direct and moves the check to a position where similar checks are done. I might be missing something, but I don't see any reason to handle this check differently to other, similar checks. Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D98633	2021-03-24 11:01:44 +00:00
Andrew Savonichev	182b0cd903	[MCA] Disable RCU for InOrderIssueStage This is a follow-up for: D98604 [MCA] Ensure that writes occur in-order When instructions are aligned by the order of writes, they retire in-order naturally. There is no need for an RCU, so it is disabled. Differential Revision: https://reviews.llvm.org/D98628	2021-03-24 13:54:04 +03:00
Stefan Pintilie	4e38761daa	[PowerPC] Change option to mrop-protect In order to have the same option on power PC LLVM and power PC gcc the option will be changed from -mrop-protection to -mrop-protect. The feature will be off by default and turned on when the option is used. Reviewed By: lei, amyk Differential Revision: https://reviews.llvm.org/D99185	2021-03-24 05:51:35 -05:00
Roman Lebedev	e78f99f1c4	[NFC][PhaseOrdering] Add a testcase for additional LICM before LoopRotate (D99249/D99204)	2021-03-24 13:24:09 +03:00
Ta-Wei Tu	86a2dac39f	[NFC] Improve debug message and test description in 4c1f74a	2021-03-24 18:21:13 +08:00
Ta-Wei Tu	ccd74e3fe8	[LoopFlatten] Fix invalid assertion (PR49571) The `InductionPHI` is not necessarily the increment instruction, as demonstrated in pr49571.ll. This patch removes the assertion and instead bails out from the `LoopFlatten` pass if that happens. This fixes https://bugs.llvm.org/show_bug.cgi?id=49571 Reviewed By: SjoerdMeijer Differential Revision: https://reviews.llvm.org/D99252	2021-03-24 18:08:27 +08:00
Ta-Wei Tu	ed6e3dd8c5	[NFC] Remove redundant `struct` prefix Reviewed By: SjoerdMeijer, fhahn Differential Revision: https://reviews.llvm.org/D99251	2021-03-24 17:58:33 +08:00
Nigel Perks	6a29e5317f	[XCore][Test] XFAIL tests requiring 8-byte stack alignment. XCore default subtarget does not support 8-byte stack alignment. These failures can be seen on builder clang-xcore-ubuntu-20-x64 on staging buildbot. Differential Revision: https://reviews.llvm.org/D99092	2021-03-24 09:12:53 +00:00
Andy Wingo	87799a4429	[WebAssembly][MC] Record limit constraints for table sizes This commit adds a full WasmTableType to MCSymbolWasm, differing from the current situation (just an ElemType) in that it additionally records a WasmLimits. We add support for specifying the limits in .S files also, via the following syntax variations: .tabletype SYM, ELEMTYPE .tabletype SYM, ELEMTYPE, MINSIZE .tabletype SYM, ELEMTYPE, MINSIZE, MAXSIZE Depends on D99186. Differential Revision: https://reviews.llvm.org/D99191	2021-03-24 09:44:22 +01:00
Andy Wingo	0ac187d2cf	[WebAssembly] Rename WasmLimits::Initial to ::Minimum. NFC. This patch renames the "Initial" member of WasmLimits to the name used in the spec, "Minimum". In the core WebAssembly specification, the Limits data type has one required "min" member and one optional "max" member, indicating the minimum required size of the corresponding table or memory, and the maximum size, if any. Although the WebAssembly spec does instantiate locally-defined tables and memories with the initial size being equal to the minimum size, it can't impose such a requirement for imports. It doesn't make sense to require an initial size for a memory import, for example. The compiler can only sensibly express the minimum and maximum sizes. See https://github.com/WebAssembly/js-types/blob/master/proposals/js-types/Overview.md#naming-of-size-limits for a related discussion that agrees that the right name of "initial" is "minimum" when querying the type of a table or memory from JavaScript. (Of course it still makes sense for JS to speak in terms of an initial size when it explicitly instantiates memories and tables.) Differential Revision: https://reviews.llvm.org/D99186	2021-03-24 09:10:11 +01:00
Jim Lin	a18b19b6db	[RISCV] Add HasStdExtD predicate to copysign from double and to double patterns Copysign from double and to double patterns have lack of HasStdExtD predicate. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D99234	2021-03-24 14:29:23 +08:00
Alex Orlov	10514b67ec	* Fix demangling of optional template-args for vendor extended type qualifier. This fixes https://bugs.llvm.org/show_bug.cgi?id=48009 bug. Reviewed By: erik.pilkington, krisb Differential Revision: https://reviews.llvm.org/D98687	2021-03-24 10:21:32 +04:00
Chuanqi Xu	c80fb82d14	[NFC] [Support] Fix unconsistent comment with codes for ExtendSigned	2021-03-24 13:58:54 +08:00
Max Kazantsev	01fde05248	[NFC] Fix comment describing what EdgeBundles is The original comment says the same thing twice, and does not mention that edges entering the block are also in the same bundle (which seems true from what the underlying code is doing). Differential Revision: https://reviews.llvm.org/D99144 Reviewed By: RKSimon	2021-03-24 11:04:05 +07:00
Serguei Katkov	c9a1c9ca92	[RegAlloc] Fix "ran out of regs" with uses in statepoint Statepoint instruction is known to have a variable and big number of operands. It is possible that Register Allocator will split live intervals in the way that all physical registers are occupied by "zero-length" live intervals which are marked as not-spillable. While intervals are marked as not-spillable in the moment of creation when they are really zero-length it is possible that in future as part of re-materialization there will need for physical register between def and use of such tiny interval (the use is not related to this interval at all). As all physical registers are assigned to not-spillable intervals there is not avaialbe registers and RA reports an error. The idea of the fix is avoid marking tiny live intervals where there is a use in statepoint instruction in var args section. Such interval may be perfectly spilled and folded to operand of statepoint. Reviewers: reames, dantrushin, qcolombet, dsanders, dmgreen Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D98766	2021-03-24 10:25:34 +07:00
Craig Topper	7d88bdafb9	[X86] Bale out of X86FastISel::X86SelectCmp for vectors. None of the code in this function was written to handle vectors. Most of the cases already fail for vectors for one reason or another. The exception is an optimization that detects identical operands. This can be triggered by vectors, but the code always creates a 0 or 1 constants in a scalar register which is incorrect for vectors. Fixes PR49706.	2021-03-23 20:16:04 -07:00
Yang Fan	404db5968d	[InstSimplify] Fix unused variable warning (NFC) GCC warning: ``` /llvm-project/llvm/lib/Analysis/InstructionSimplify.cpp: In function ‘llvm::Value* SimplifyWithOpReplaced(llvm::Value, llvm::Value, llvm::Value, const llvm::SimplifyQuery&, bool, unsigned int)’: /llvm-project/llvm/lib/Analysis/InstructionSimplify.cpp:3993:15: warning: unused variable ‘SI’ [-Wunused-variable] 3993 \| if (auto SI = dyn_cast<SelectInst>(I)) \| ^~ ```	2021-03-24 09:56:36 +08:00
Bruno Cardoso Lopes	054affa82b	[CGAtomic] Lift stronger requirements on cmpxch and support acquire failure mode - Fix `emitAtomicCmpXchgFailureSet` to support release/acquire (succ/fail) memory order. - Remove stronger checks for cmpxch. Effectively, this addresses http://wg21.link/p0418 Differential Revision: https://reviews.llvm.org/D98995	2021-03-23 16:45:37 -07:00
Choongwoo Han	955273b426	[Coverage] Load records immediately The current implementation keeps buffers generated for each object file until it completes loading of all files. This approach requires a lot of memory if there are a lot of huge object files. Thus, make it to load coverage records immediately rather than waiting for other binaries to be loaded. This reduces memory usage of llvm-cov from >128GB to 5GB when loading Chromium binaries in Windows. Additional testing: check-profile, check-llvm Differential Revision: https://reviews.llvm.org/D99110	2021-03-23 16:25:20 -07:00
Amara Emerson	947d021f28	[AArch64][GlobalISel] Add test for G_FSHR legalization.	2021-03-23 16:11:45 -07:00
Amara Emerson	23d194db09	[AArch64][GlobalISel] Lower G_FSHL and G_FSHR. Codegen isn't as good as we need it, but that'll be done later.	2021-03-23 16:09:19 -07:00
Jingu Kang	af9f082c95	[ValueTracking] Handle increasing mul recurrence in isKnownNonZero() Differential Revision: https://reviews.llvm.org/D99069	2021-03-23 23:04:41 +00:00
Rafael Auler	3b0888e5fb	Add register size info back to MCRegisterClass This patch addresses the removal of register size information done in commit c8b782c. Without this change, there is no viable option to get register size information outside libTarget. We need this information to run analysis that know the register size from the MC layer, used by BOLT. Discussion D50285 and D47199. Reviewed By: kparzysz Differential Revision: https://reviews.llvm.org/D97891	2021-03-23 15:04:44 -07:00
Matteo Favaro	a4c86712c5	[MSSA] Extending IsGuaranteedLoopInvariant to support an instruction defined in the entry block As mentioned in [[ https://reviews.llvm.org/D96979 \| D96979 ]], I'm extending the IsGuaranteedLoopInvariant check also to the `MemorySSA.cpp` file. @fhahn For now I didn't unify the function into `MemorySSA.h` because, as you mentioned, it's not directly MSSA related. I'm open to suggestions to find a better place so we can improve the unification process. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D97155	2021-03-23 21:50:56 +00:00
Alexey Bataev	147e88fa15	[Analysis]Add getPointersDiff function to improve compile time. Added getPointersDiff function to LoopAccessAnalysis and used it instead direct calculatoin of the distance between pointers and/or isConsecutiveAccess function in SLP vectorizer to improve compile time and detection of stores consecutive chains. Part of D57059 Differential Revision: https://reviews.llvm.org/D98967	2021-03-23 14:25:36 -07:00
Nikita Popov	57bd8da517	[BasicAA] Handle assumes with operand bundles This fixes a regression reported on D99022: If a call has operand bundles, then the inaccessiblememonly attribute on the function will be ignored, as operand bundles can affect modref behavior in the general case. However, for assume operand bundles in particular this is not the case. Adjust getModRefBehavior() to always report inaccessiblememonly for assumes, regardless of presence of operand bundles.	2021-03-23 21:21:19 +01:00
Nikita Popov	7f52840c4e	[BasicAA] Add test for assume with operand bundles (NFC)	2021-03-23 21:21:19 +01:00
Alexey Bataev	2cb8308176	Revert "[Analysis]Add getPointersDiff function to improve compile time." This reverts commit 065a14a12d2694f26f4e894641f5ab8cfc5da8bd to investigate and fix crash in SLP vectorizer.	2021-03-23 13:17:54 -07:00
Alexey Bataev	9601d11568	[Analysis]Add getPointersDiff function to improve compile time. Added getPointersDiff function to LoopAccessAnalysis and used it instead direct calculatoin of the distance between pointers and/or isConsecutiveAccess function in SLP vectorizer to improve compile time and detection of stores consecutive chains. Part of D57059 Differential Revision: https://reviews.llvm.org/D98967	2021-03-23 12:58:42 -07:00
Amara Emerson	e0b640481c	[AArch64][GlobalISel] Lower G_CTLZ_ZERO_UNDEF. This adds some missing legalizer tests, which uncovered a v2s64 selection test that wasn't working since there's no legalization or instruction for that.	2021-03-23 12:49:10 -07:00
Craig Topper	b271af80f3	[ValueTracking] Teach canCreateUndefOrPoison that ctpop does not create undef or poison. This select of ctpop with 0 pattern can get left behind after loop idiom recognize converts a loop to ctpop. LLVM 10 was able to optimize this, but LLVM 11 and later is not. The difference seems to be that some select transforms are now limited based on canCreateUndefOrPoison. Teaching canCreateUndefOrPoison about ctpop restores the LLVM 10 codegen. Differential Revision: https://reviews.llvm.org/D99207	2021-03-23 12:42:18 -07:00
Jonas Devlieghere	78fab9fc9c	[dsymutil] Only look for ThinLTO suffixes if we failed to find symbol. Only look for symbols with the ThinLTO suffix if we fail to find the symbol.	2021-03-23 10:55:48 -07:00
Tony	da027401bd	[NFC][AMDGPU] Reserve AMD GPU ELF machine number 0x41 Reviewed By: foad Differential Revision: https://reviews.llvm.org/D99196	2021-03-23 17:53:02 +00:00
Siva Chandra	1f9792a0bf	[runtimes] Add the libc project to the list of runtimes. This is possible as the default libc build now works under runtimes build. Differential Revision: https://reviews.llvm.org/D99101	2021-03-23 17:33:03 +00:00
Jay Foad	2de727df6f	[AMDGPU] Simplify AMDGPUAnnotateUniformValues::visitBranchInst. NFC. A BranchInst is always the terminator of its containing BasicBlock.	2021-03-23 16:54:43 +00:00
Nathan James	5699686601	[NFC] Enable RVALUE_REFERENCE_THIS on MSVC 2019 In https://reviews.llvm.org/D72948 This was enabled for all MSVC but reverted as it was determined not to work on some 2017 versions. The issue is assumed to be fixed on 2019 so enable for 2019 and newer. Some testing could be done to determine which version of MSVC 2017 support this feature but its safer right now to leave it at 2019. Reviewed By: aaron.ballman Differential Revision: https://reviews.llvm.org/D98809	2021-03-23 16:40:13 +00:00
Joe Nash	af64dca8a0	[AMDGPU] Refactor DPPCombine NFC. Extract IsShrinkable into a helper function, and make Subtarget a member variable. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D99099 Change-Id: If4bc97a88a9ae4eb1df47e717345d46a6ed515bf	2021-03-23 11:53:53 -04:00
Craig Topper	8a90bdff9c	[RISCV] Use selectImm for RV32. NFC Previously we used selectImm for RV64 and isel patterns for RV32. This should be NFC, but will allow RV32 and RV64 to share improvements in the future. For example, it might be useful to use BSETI from Zbs to make single bit constants. Reviewed By: luismarques Differential Revision: https://reviews.llvm.org/D98877	2021-03-23 08:57:15 -07:00
Jay Foad	abd0bfe722	[AMDGPU] Set SchedRW on real instructions Coyp SchedRW from pseudos to real instructions so that llvm-mca has access to it. This is NFC for normal compiler codegen, which schedules pseudos not real instructions. Add an llvm-mca test for some high latency double-precision instructions as a smoke test. Differential Revision: https://reviews.llvm.org/D99187	2021-03-23 15:38:11 +00:00

1 2 3 4 5 ...

213152 Commits