llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	2fae2eeab3	GlobalISel/Utils.h - remove unused includes. NFCI. Twine is unused, and TargetLowering can be reduced to a forward declaration and moved to Utils.cpp	2020-09-03 15:59:12 +01:00
Simon Pilgrim	c930434d89	LowerEmuTLS.cpp - remove unused TargetLowering.h include. NFC. We only needed llvm/IR/Constants.h.	2020-09-03 14:40:09 +01:00
Amara Emerson	0c2c686079	[StackProtector] Fix crash with vararg due to not checking LocationSize validity. Differential Revision: https://reviews.llvm.org/D87074	2020-09-03 00:08:48 -07:00
Craig Topper	639e60808d	[CodeGenPrepare][X86] Teach optimizeGatherScatterInst to turn a splat pointer into GEP with scalar base and 0 index This helps SelectionDAGBuilder recognize the splat can be used as a uniform base. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D86371	2020-09-02 20:44:12 -07:00
Jay Foad	53df1fb508	[APInt] New member function setBitVal Differential Revision: https://reviews.llvm.org/D87033	2020-09-02 21:40:31 +01:00
Anna Thomas	042179924b	[ImplicitNullChecks] NFC: Refactor dependence safety check After computing dependence, we check if it is safe to hoist by identifying if it clobbers any liveIns in the sibling block (NullSucc). This check is moved to its own function which will be used in the soon-to-be modified dependence checking algorithm for implicit null checks pass. Tests-Run: lit tests on X86/implicit-*	2020-09-02 10:29:44 -04:00
Anna Thomas	d6da4d8aad	[ImplicitNullChecks] NFC: Separated out checks and added comments Separated out some checks in isSuitableMemoryOp and added comments explaining why some of those checks are done. Tests-Run:X86 implicit null checks tests.	2020-09-02 10:29:44 -04:00
Paul Walker	aeb7dd335b	[SVE] Don't reorder subvector/binop sequences when the resulting binop is not legal. When lowering fixed length vector operations for SVE the subvector operations are used extensively to marshall data between scalable and fixed-length vectors. This means that sequences like: extract_subvec(binop(insert_subvec(a), insert_subvec(b))) are very common. DAGCombine only checks if the resulting binop is legal or can be custom lowered when undoing such sequences. When it's custom lowering that is introducing them the result is an infinite legalise->combine->legalise loop. This patch extends the isOperationLegalOr... functions to include a "LegalOnly" parameter to restrict the check to legal operations only. Although isOperationLegal could be used it's common for the affected code paths to be visited pre and post legalisation, so the extra parameter keeps the code tidy. Differential Revision: https://reviews.llvm.org/D86450	2020-09-02 11:01:33 +01:00
Sander de Smalen	40471e4fe8	[AArch64][SVE] Preserve full vector regs over EH edge. Unwinders may only preserve the lower 64bits of Neon and SVE registers, as only the registers in the base ABI are guaranteed to be preserved over the exception edge. The caller will need to preserve additional registers for when the call throws an exception and the unwinder has tried to recover state. For e.g. svint32_t bar(svint32_t); svint32_t foo(svint32_t x, bool err) { try { bar(x); } catch (...) { err = true; } return x; } `z0` needs to be spilled before the call to `bar(x)` and reloaded before returning from foo, as the exception handler may have clobbered z0. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D84737	2020-09-02 10:54:18 +01:00
Igor Kudrin	7e0c7ec095	[DebugInfo] Emit a 1-byte value as a terminator of entries list in the name index. As stated in section 6.1.1.2, DWARFv5, p. 142, \| The last entry for each name is followed by a zero byte that \| terminates the list. There may be gaps between the lists. The patch changes emitting a 4-byte zero value to a 1-byte one, which effectively removes the gap between entry lists, and thus saves approximately 3 bytes per name; the calculation is not exact because the total size of the table is aligned to 4. Differential Revision: https://reviews.llvm.org/D86927	2020-09-02 16:12:39 +07:00
Igor Kudrin	ab5e60fa50	[DebugInfo] Remove Dwarf5AccelTableWriter::Header::UnitLength. NFC. The member is not in use; the unit length for the table is emitted as a difference between two labels. Moreover, the type of the member might be misleading, because for DWARF64 the field should be 64 bit long. Differential Revision: https://reviews.llvm.org/D86912	2020-09-02 16:11:45 +07:00
Cameron McInally	7b182e9bab	[SVE] Update INSERT_SUBVECTOR DAGCombine to use getVectorElementCount(). A small piece of the project to replace getVectorNumElements() with getVectorElementCount(). Differential Revision: https://reviews.llvm.org/D86894	2020-09-01 16:51:44 -05:00
Amara Emerson	e10dc0379d	Revert "Revert "[GlobalISel] Fold xor(cmp(pred, _, _), 1) -> cmp(inverse(pred), _, _)" (and dependent patch "Optimize away a Not feeding a brcond by using tbz instead of tbnz.")" This reverts commit 8693ddc74371dedc742c9f3d3e4eda1da72c13ea. Re-committing with the test requiring asserts.	2020-09-01 14:29:04 -07:00
Jordan Rupprecht	deb09864b8	Revert "[GlobalISel] Fold xor(cmp(pred, _, _), 1) -> cmp(inverse(pred), _, _)" (and dependent patch "Optimize away a Not feeding a brcond by using tbz instead of tbnz.") This reverts commit 8ad8f484b63ca507417b58c9016d2761f2b1a1a8. It causes crashes when running `ninja check-llvm-codegen-aarch64-globalisel`, e.g. http://lab.llvm.org:8011/builders/clang-with-thin-lto-ubuntu/builds/24132/steps/test-stage1-compiler/logs/stdio. Note that the crash does not seem to reproduce in debug builds. 5ded4442520d3dbb1aa72e6fe03cddef8828c618 depends on this, so revert that too.	2020-09-01 13:31:57 -07:00
Craig Topper	d3e914ec24	[MachineCopyPropagation] In isNopCopy, check the destination registers match in addition to the source registers. Previously if the source match we asserted that the destination matched. But GPR <-> mask register copies on X86 can violate this since we use the same K-registers for multiple sizes. Fixes this ISPC issue https://github.com/ispc/ispc/issues/1851 Differential Revision: https://reviews.llvm.org/D86507	2020-09-01 12:44:32 -07:00
Amara Emerson	399486642d	[GlobalISel] Fold xor(cmp(pred, _, _), 1) -> cmp(inverse(pred), _, _) This is needed for an upcoming change to how we translate conditional branches which might generate these. Differential Revision: https://reviews.llvm.org/D86383	2020-09-01 10:57:17 -07:00
Matt Arsenault	5e713f621a	GlobalISel: Implement computeNumSignBits for G_SELECT	2020-09-01 12:50:19 -04:00
Matt Arsenault	731e73be24	GlobalISel: Port smarter known bits for umin/umax from DAG	2020-09-01 12:50:15 -04:00
Matt Arsenault	942cf2892e	GlobalISel: Implement computeKnownBits for G_BSWAP and G_BITREVERSE	2020-09-01 12:49:57 -04:00
Sam Tebbs	745bb7deaa	[DAGCombiner] Fold an AND of a masked load into a zext_masked_load This patch folds an AND of a masked load and build vector into a zero extended masked load. Differential Revision: https://reviews.llvm.org/D86789	2020-09-01 17:02:07 +01:00
Volkan Keles	72df60b8c0	GlobalISel: Add combines for extend operations https://reviews.llvm.org/D86516	2020-09-01 08:50:06 -07:00
Matt Arsenault	b0476e1c15	GlobalISel: Implement computeNumSignBits for G_SEXTLOAD/G_ZEXTLOAD	2020-09-01 11:20:02 -04:00
Matt Arsenault	fbbea16c6b	GlobalISel: Implement computeKnownBits for G_UNMERGE_VALUES	2020-09-01 11:19:27 -04:00
Sourabh Singh Tomar	0940e21689	[NFCI] Removed an un-used declaration got accidentally introduced in f91d18eaa946b2d2ea5a9	2020-09-01 15:59:04 +05:30
David Sherwood	c4d572ac0e	[SVE][CodeGen] Fix TypeSize/ElementCount related warnings in sve-split-load.ll I have fixed up a number of warnings resulting from TypeSize -> uint64_t casts and calling getVectorNumElements() on scalable vector types. I think most of the changes are fairly trivial except for those in DAGTypeLegalizer::SplitVecRes_MLOAD I've tried to ensure we create the MachineMemoryOperands in a sensible way for scalable vectors. I have added a CHECK line to the following test: CodeGen/AArch64/sve-split-load.ll that ensures no new warnings are added. Differential Revision: https://reviews.llvm.org/D86697	2020-09-01 07:47:59 +01:00
Qiu Chaofan	55050110d2	[NFC] [DAGCombiner] Refactor bitcast folding within fabs/fneg fabs and fneg share a common transformation: (fneg (bitconvert x)) -> (bitconvert (xor x sign)) (fabs (bitconvert x)) -> (bitconvert (and x ~sign)) This patch separate the code into a single method. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D86862	2020-09-01 00:48:12 +08:00
Qiu Chaofan	9b5b508e71	[NFC] [DAGCombiner] Remove unnecessary negation in visitFNEG In visitFNEG of DAGCombiner, the folding of (fneg (fsub c, x)) is redundant since getNegatedExpression already handles it.	2020-09-01 00:35:01 +08:00
Sanjay Patel	4302187593	[DAGCombiner] skip reciprocal divisor optimization for x/sqrt(x), better I tried to fix this in: rG716e35a0cf53 ...but that patch depends on the order that we encounter the magic "x/sqrt(x)" expression in the combiner's worklist. This patch should improve that by waiting until we walk the user list to decide if there's a use to skip. The AArch64 test reveals another (existing) ordering problem though - we may try to create an estimate for plain sqrt(x) before we see that it is part of a 1/sqrt(x) expression.	2020-08-31 09:35:59 -04:00
Sanjay Patel	5b3abe923f	[DAGCombiner] skip reciprocal divisor optimization for x/sqrt(x) In general, we probably want to try the multi-use reciprocal transform before sqrt transforms, but x/sqrt(x) is a special-case because that will always reduce to plain sqrt(x) or an estimate. The AArch64 tests show that the transform is limited by TLI hook to patterns where there are 3 or more uses of the divisor. So this change can result in an extra division compared to what we had, but that's the intended behvior based on the current setting of that hook.	2020-08-30 10:55:45 -04:00
Nikita Popov	417119145c	[TargetLowering] Strip tailing whitespace (NFC)	2020-08-29 18:09:08 +02:00
Kai Luo	fabcc59830	[DAGCombiner] Enhance (zext(setcc)) Current `v:t = zext(setcc x,y,cc)` will be transformed to `select x, y, 1:t, 0:t, cc`. It misses some opportunities if x's type size is less than `t`'s size. This patch enhances the above transformation. Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D86687	2020-08-29 03:37:41 +00:00
Matt Arsenault	9e320d7adb	GlobalISel: Combine out redundant sext_inreg The scalar tests don't work yet, since computeNumSignBits apparently doesn't handle sextload yet, and sext folds into the load first.	2020-08-28 17:57:31 -04:00
Jon Roelofs	2cc95b4ba6	[early-ifcvt] Add OptRemarks	2020-08-28 15:51:18 -06:00
Craig Topper	09050e4cf7	[Attributes] Add a method to check if an Attribute has AttrKind None. Use instead of hasAttribute(Attribute::None) There's a special case in hasAttribute for None when pImpl is null. If pImpl is not null we dispatch to pImpl->hasAttribute which will always return false for Attribute::None. So if we just want to check for None its sufficient to just check that pImpl is null. Which can even be done inline. This patch adds a helper for that case which I hope will speed up our getSubtargetImpl implementations. Differential Revision: https://reviews.llvm.org/D86744	2020-08-28 13:23:45 -07:00
Benjamin Kramer	1b1f975a90	[CodeGenPrepare] Zap the argument of llvm.assume when deleting it We know that the argument is mostly likely dead, so we can purge it early. Otherwise it would make it to codegen, and can block further optimizations.	2020-08-28 20:52:22 +02:00
Snehasish Kumar	69b963173e	[llvm][CodeGen] Machine Function Splitter We introduce a codegen optimization pass which splits functions into hot and cold parts. This pass leverages the basic block sections feature recently introduced in LLVM from the Propeller project. The pass targets functions with profile coverage, identifies cold blocks and moves them to a separate section. The linker groups all cold blocks across functions together, decreasing fragmentation and improving icache and itlb utilization. We evaluated the Machine Function Splitter pass on clang bootstrap and SPECInt 2017. For clang bootstrap we observe a mean 2.33% runtime improvement with a ~32% reduction in itlb and stlb misses. Additionally, L1 icache misses reduced by 9.5% while L2 instruction misses reduced by 20%. For SPECInt we report the change in IntRate the C/C++ benchmarks. All benchmarks apart from mcf and x264 improve, on average by 0.6% with the max for deepsjeng at 1.6%. Benchmark % Change 500.perlbench_r 0.78 502.gcc_r 0.82 505.mcf_r -0.30 520.omnetpp_r 0.18 523.xalancbmk_r 0.37 525.x264_r -0.46 531.deepsjeng_r 1.61 541.leela_r 0.83 557.xz_r 0.15 Differential Revision: https://reviews.llvm.org/D85368	2020-08-28 11:10:14 -07:00
Denis Antrushin	34d811b345	[Statepoint] Always spill base pointer. There is a subtle problem with new statepoint lowering scheme when base and pointers are the same (see PR46917 for more context): %1 = STATEPOINT ... %0, %0(tied-def 0)... if, for some reason, register allocator desides to put two instances of %0 into two different objects (registers or spill slots), we may end up with $reg3 = STATEPOINT ... $reg2, $reg1(tied-def 0)... and nothing will prevent later passes to sink uses of $reg2 below statepoint, which is incorrect. As a short term solution, always put base pointers on stack during lowering. A longer term solution may be to rework MIR statepoint format to avoid GC pointer duplication in statepoint argument list. Reviewed By: reames Differential Revision: https://reviews.llvm.org/D86712	2020-08-28 23:22:07 +07:00
Yonghong Song	ac35f59b91	[GlobalISel] fix a compilation error with gcc 6.3.0 With gcc 6.3.0, I hit the following compilation error: ../lib/CodeGen/GlobalISel/Combiner.cpp: In member function ‘bool llvm::Combiner::combineMachineInstrs(llvm::MachineFunction&, llvm::GISelCSEInfo*)’: ../lib/CodeGen/GlobalISel/Combiner.cpp:156:54: error: suggest parentheses around ‘&&’ within ‘\|\|’ [-Werror=parentheses] assert(!CSEInfo \|\| !errorToBool(CSEInfo->verify()) && ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~^~ "CSEInfo is not consistent. Likely missing calls to " ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~ "observer on mutations"); Fix the code as suggested by the compiler.	2020-08-28 09:16:52 -07:00
QingShan Zhang	8eac4059bb	[DAGCombine] Don't delete the node if it has uses immediately This is the follow up patch for https://reviews.llvm.org/D86183 as we miss to delete the node if NegX == NegY, which has use after we create the node. ``` if (NegX && (CostX <= CostY)) { Cost = std::min(CostX, CostZ); RemoveDeadNode(NegY); return DAG.getNode(Opcode, DL, VT, NegX, Y, NegZ, Flags); #<-- NegY is used here if NegY == NegX. } ``` Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D86689	2020-08-28 16:13:43 +00:00
David Sherwood	56b8c35591	[SVE] Make ElementCount members private This patch changes ElementCount so that the Min and Scalable members are now private and can only be accessed via the get functions getKnownMinValue() and isScalable(). In addition I've added some other member functions for more commonly used operations. Hopefully this makes the class more useful and will reduce the need for calling getKnownMinValue(). Differential Revision: https://reviews.llvm.org/D86065	2020-08-28 14:43:53 +01:00
Denis Antrushin	2a1fa7b84c	[Statepoint] Turn assert into check in foldPatchpoint. Original D81646 had check for tied regs in foldPatchpoint(). Due to unfortunate miscommunication with review comments and adressing some comments post commit, it turned into assertion. We had an offline talk and agreed that with current implementation this path is possible, so I'm changing it back to check. Note that this is workaround until ussues described in PR46917 are resolved.	2020-08-28 20:00:23 +07:00
Sam Parker	710437b36d	[ARM][LowOverheadLoops] Liveouts and reductions Remove the code that tried to look for reduction patterns, since the vectorizer and isel can now produce predicated arithmetic instructios within the loop body. This has required some reorganisation and fixes around live-out and predication checks, as well as looking for cases where an input/output is initialised to zero. Differential Revision: https://reviews.llvm.org/D86613	2020-08-28 13:56:16 +01:00
Matt Arsenault	f54c1fe9ec	GlobalISel: Implement computeNumSignBits for G_SEXT_INREG	2020-08-27 19:44:37 -04:00
Matt Arsenault	62f7266a72	Correctly revert "GlobalISel: Use & operator on KnownBits" I mis-resolved the revert through moving the code to another function.	2020-08-27 19:08:31 -04:00
Matt Arsenault	bb532337e4	Revert "GlobalISel: Use & operator on KnownBits" This reverts commit e53b799779b079a70f600e5cad2ab7267d66b1b7. Confusingly, this does not simply and the two sets of known bits, but implements known bits for the and operator.	2020-08-27 18:52:34 -04:00
Brad Smith	a1c364cc4b	[SSP] Restore setting the visibility of __guard_local to hidden for better code generation. Patch by: Philip Guenther	2020-08-27 17:17:38 -04:00
Matt Arsenault	2cd41c208e	GlobalISel: Implement known bits for min/max	2020-08-27 16:56:17 -04:00
Matt Arsenault	fcf8b40603	MIR: Infer not-SSA for subregister defs It's possible to have a single virtual register def with a subreg index that would pass the previous check, but it's not possible to have a subregister def in SSA. This is in preparation for adding stricter checks for SSA MIR.	2020-08-27 16:56:16 -04:00
Eli Friedman	4c050f47e9	[RegisterScavenging] Delete dead function unprocess().	2020-08-27 13:19:32 -07:00
Matt Arsenault	9b9f0675a4	GlobalISel: Use & operator on KnownBits Avoid repeating for zero and one	2020-08-27 14:07:18 -04:00

1 2 3 4 5 ...

29301 Commits