llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Xing GUO	33fdabb077	[DWARFYAML] Rename getUsedSectionNames() to getNonEmptySectionNames(). This patch renames getUsedSectionNames() to getNonEmptySectionNames. NFC.	2020-07-26 21:10:38 +08:00
Sanjay Patel	57ae573dba	[InstSimplify] add tests for min/max intrinsics; NFC	2020-07-26 09:04:37 -04:00
Sanjay Patel	c36efeb0c6	[InstSimplify] fold fcmp using isKnownNeverInfinity + isKnownNeverNaN Follow-up to D84035 / rG7393d7574c09. This sidesteps a question of FMF/poison on fcmp raised in PR46077: http://bugs.llvm.org/PR46077 https://alive2.llvm.org/ce/z/TCsyzD define i1 @src(float %x) { %0: %x42 = fadd nnan ninf float %x, 42.000000 %r = fcmp ueq float %x42, inf ret i1 %r } => define i1 @tgt(float %x) { %0: ret i1 0 } Transformation seems to be correct! https://alive2.llvm.org/ce/z/FQaH7a define i1 @src(i8 %x) { %0: %cast = uitofp i8 %x to float %r = fcmp one float inf, %cast ret i1 %r } => define i1 @tgt(i8 %x) { %0: ret i1 1 } Transformation seems to be correct!	2020-07-26 09:04:37 -04:00
Sanjay Patel	ff162bb86a	[InstSimplify] add tests for fcmp with infinity constant; NFC	2020-07-26 09:04:36 -04:00
Juneyoung Lee	5b6495c7fc	[JumpThreading] Add a test for D84598; NFC	2020-07-26 22:00:01 +09:00
Juneyoung Lee	2b46786139	[ConstantFolding] Fold freeze if it is never undef or poison This is a simple patch that adds constant folding for freeze instruction. IIUC, it isn't needed to update ConstantFold.cpp because there is no freeze constexpr. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84597	2020-07-26 21:54:44 +09:00
Juneyoung Lee	e7db975cc1	[ValueTracking] Instruction::isBinaryOp should be used for constexprs This is a simple patch that makes canCreateUndefOrPoison use Instruction::isBinaryOp because BinaryOperator inherits Instruction. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D84596	2020-07-26 21:48:51 +09:00
Juneyoung Lee	e0da85891c	NFC; add a test for freeze's constprop	2020-07-26 21:03:23 +09:00
Juneyoung Lee	33eb5168ac	NFC; add an example that subtracts pointers to two global vars	2020-07-26 20:47:33 +09:00
Roman Lebedev	fa31e249cb	[NFC][XRay] Account: migrate to DenseMap + SmallVector, -16% faster on large (3.8G) input DenseMap is a single allocation underneath, so this is has pretty expected performance impact on large-ish (3.8G) xray log processing time.	2020-07-26 14:08:07 +03:00
Roman Lebedev	2f9de5aa72	[NFC][XRay] Account: decouple getStats() interface from underlying data structure It doesn't really need to know where Timings are stored, it just needs to be able to sort them, so MutableArrayRef is enough. That uncovers an interesting quirk that it relied on implicit double->int conversion for calculating percentiles.	2020-07-26 14:08:06 +03:00
Alex Richardson	ef265d2b4d	[lit] Don't include tests skipped due to sharding in reports When running multiple shards, don't include skipped tests in the xunit output since merging the files will result in duplicates. In our CHERI Jenkins CI, I configured the libc++ tests to run using sharding (since we are testing using a single-CPU QEMU). We then merge the generated XUnit xml files to produce a final result, but if the individual XMLs report tests excluded due to sharding each test is included N times in the final result. This also makes it difficult to find the tests that were skipped due to missing REQUIRES: etc. Reviewed By: yln Differential Revision: https://reviews.llvm.org/D84235	2020-07-26 11:39:22 +01:00
Amara Emerson	8003a99b34	[AArch64][GlobalISel] Make <8 x s16> and <16 x s8> legal types for G_SHUFFLE_VECTOR and G_IMPLICIT_DEF. Trivial change, we're still missing support for rev matching for these types in the combiner.	2020-07-26 00:48:09 -07:00
Craig Topper	98c2717f20	[X86] Merge X86MCInstLowering's maxLongNopLength into emitNop and remove check for FeatureNOPL. The switch in emitNop uses 64-bit registers for nops exceeding 2 bytes. This isn't valid outside 64-bit mode. We could fix this easily enough, but there are no users that ask for more than 2 bytes outside 64-bit mode. Inlining the method to make the coupling between the two methods more explicit.	2020-07-25 22:11:47 -07:00
Craig Topper	da44994d9d	[X86] Remove getProcFamily() method from X86Subtarget. NFC This isn't used and we've decided in the past that a CPU enum for tuning is not a good idea.	2020-07-25 22:11:45 -07:00
Changpeng Fang	c4460abd0f	DADCombiner: Don't simplify the token factor if the node's number of operands already exceeds TokenFactorInlineLimit Summary: In parallelizeChainedStores, a TokenFactor was created with the size greater than 3000. We found that DAGCombiner::visitTokenFactor will consume a huge amount of time on such nodes. Since the number of operands already exceeds TokenFactorInlineLimit, we propose to give up simplification with the consideration of compile time. Reviewers: @spatel, @arsenm Differential Revision: https://reviews.llvm.org/D84204	2020-07-25 21:20:59 -07:00
Craig Topper	bee69185ca	[X86] Replace a use of ProcIntelSLM with FeatureFast7ByteNOP.	2020-07-25 20:46:48 -07:00
Eric Christopher	4b0c03aa04	Fold StatepointBB into checks as it's only used from an NDEBUG or ASSERT context fixing an unused variable warning.	2020-07-25 18:36:53 -07:00
Nemanja Ivanovic	f5f0b31128	[PowerPC][NFC] Fix an assert that cannot trip from 7d076e19e31a I mixed up the precedence of operators in the assert and thought I had it right since there was no compiler warning. This just adds the parentheses in the expression as needed.	2020-07-25 20:28:52 -04:00
Philip Reames	7d5b6ac5c3	[Statepoints] Style cleanup after 3da1a963 [NFC] Just fixing a few minor stylistic issues.	2020-07-25 16:40:39 -07:00
Craig Topper	5541759712	[X86] Add masked versions of the VPTERNLOG test cases added for D83630. NFC We don't handle these yet and D83630 won't improve that, but at least we'll have the tests.	2020-07-25 16:37:17 -07:00
Roman Lebedev	eae3e5a494	[Reduce] Argument reduction: do deal with function declarations We can happily turn function definitions into declarations, thus obscuring their argument from being elided by this pass. I don't believe there is a good reason to just ignore declarations. likely even proper llvm intrinsics ones, at worst the input becomes uninteresting. The other question here is that all these transforms are all-or-nothing. In some cases, should we be treating each use separately? The main blocker here seemed to be that llvm::CloneFunctionInto() does `&OldFunc->front()`, which inserts a nullptr into a densemap, which is not happy about it and asserts.	2020-07-26 01:31:56 +03:00
Roman Lebedev	942c801f82	[Reduce] Argument reduction: do properly handle invoke insts (PR46819) replaceFunctionCalls() is very non-exhaustive, it only handles CallInst's. Which means, by the time we drop old function, there may still be uses of it lurking around. Let's instead whack-a-mole them by all by replacing with undef. I'm not sure this is the best handling, especially for calls, but IMO poorly reduced input is much better than crashing reduction tool. A (previously-crashing!) test added. Fixes https://bugs.llvm.org/show_bug.cgi?id=46819	2020-07-26 01:29:00 +03:00
Roman Lebedev	c62f97f67d	[Reduce] Basic block reduction: do properly handle invoke insts (PR46818) Terminator may have returned value, so we need to replace uses, and in general handle invoke as a branch inst. I'm not sure this is the best handling, but IMO poorly reduced input is much better than crashing reduction tool. A (previously-crashing!) test added. Fixes https://bugs.llvm.org/show_bug.cgi?id=46818	2020-07-26 01:28:59 +03:00
Lang Hames	ef607c6171	[ORC] Rename TargetProcessControl DynamicLibraryHandle and loadLibrary. The new names, DylibHandle and loadDylib, are more concise and make clear that these utilities are for loading dynamic libraries, not static ones.	2020-07-25 15:21:43 -07:00
Lang Hames	8dd38c568c	[ORC] Don't require PageSize or Triple during TargetProcessControl construction Subclasses will commonly gather that information from a remote during construction, in which case they won't have meaningful values to pass to TargetProcessControl's constructor.	2020-07-25 15:21:43 -07:00
Philip Reames	dfcbe0aef2	[Statepoints] Support lowering gc relocations to virtual registers (Disabled under flag for the moment) This is part of a larger project wherein we are finally integrating lowering of gc live operands with the register allocator. Today, we force spill all operands in SelectionDAG. The code to do so is distinctly non-optimal. The approach this patch is working towards is to instead lower the relocations directly into the MI form, and let the register allocator pick which ones get spilled and which stack slots they get spilled to. In terms of performance, the later part is actually more important as it avoids redundant shuffling of values between stack slots. This particular change adds ISEL support to produce the variadic def STATEPOINT form required by the above. In particular, the first N are lowered to variadic tied def/use pairs. So new statepoint looks like this: reloc1,reloc2,... = STATEPOINT ..., base1, derived1<tied-def0>, base2, derived2<tied-def1>, ... N is limited by the maximal number of tied registers machine instruction can have (15 at the moment). The current patch is restricted to handling relocations within a single basic block. Cross block relocations (e.g. invokes) are handled via the legacy mechanism. This restriction will be relaxed in future patches. Patch By: dantrushin Differential Revision: https://reviews.llvm.org/D81648	2020-07-25 14:26:05 -07:00
Craig Topper	a376bad472	[X86] Add llvm.roundeven test cases. Add f80 tests cases for constrained intrinsics that lower to libcalls. NFC	2020-07-25 13:29:47 -07:00
Craig Topper	12c5aa4515	[X86] Fix intrinsic names in strict fp80 tests to use f80 in their names instead of x86_fp80. The type is called x86_fp80, but when it is printed in the intrinsic name it should be f80. The parser doesn't seem to care that the name was wrong.	2020-07-25 13:12:49 -07:00
LLVM GN Syncbot	91051ca099	[gn build] Port 136c8f50e96	2020-07-25 18:51:58 +00:00
Roman Lebedev	f9cbb7144f	[Reduce] Try turning function definitions into declarations first, NFCI-ish ReduceFunctions could do it, but it also replaces all calls with undef, so if any of undef replacements makes reduction uninteresting, it won't work. ReduceBasicBlocks also could do it, but well, it may take many guesses for all the blocks of a function to happen to be out-of-chunk, which is not a very efficient way to go about it. So let's just do this first.	2020-07-25 21:43:36 +03:00
Florian Hahn	2f84723455	[X86] Remove stress-scheduledagrrlist.ll. This test seems to take quite a long time with EXPENSIVE_CHECKS. Remove it.	2020-07-25 15:45:24 +01:00
Nikita Popov	23a372d0f2	[LVI] Don't require operand number for range (NFC) Pass the Value* instead of the operand number, rename I to CxtI. This makes the function a bit more generally useful.	2020-07-25 16:33:45 +02:00
Matt Arsenault	4251a6ded6	AMDGPU/GlobalISel: Don't assert on G_INSERT > 128-bits Just fallback for now. Really tablegen needs to generate all of the subregister index handling we need.	2020-07-25 10:05:44 -04:00
Nikita Popov	004e042624	[SCCP] Add assume non null test (NFC)	2020-07-25 16:02:15 +02:00
Nikita Popov	86610c55e0	[SCCP] Restore the change reporting as well Reapply 5db5b4bc4394ca247c9eb665e03b851848aa2fbf.	2020-07-25 15:11:30 +02:00
Nikita Popov	74624d8649	Reapply [SCCP] Directly remove non-feasible edges Reapply with DTU update moved after CFG update, which is a requirement of the API. ----- Non-feasible control-flow edges are currently removed by replacing the branch condition with a constant and then calling ConstantFoldTerminator. This happens in a rather roundabout manner, by inspecting the users (effectively: predecessors) of unreachable blocks, and further complicated by the need to explicitly materialize the condition for "forced" edges. I would like to extend SCCP to discard switch conditions that are non-feasible based on range information, but this is incompatible with the current approach (as there is no single constant we could use.) Instead, this patch explicitly removes non-feasible edges. It currently only needs to handle the case where there is a single feasible edge. The llvm_unreachable() branch will need to be implemented for the aforementioned switch improvement. Differential Revision: https://reviews.llvm.org/D84264	2020-07-25 14:52:35 +02:00
Simon Pilgrim	b71064b07e	SimplifyLibCalls - remove unnecessary header and forward declaration. NFC. We include TargetLibraryInfo.h so don't need to forward declare it, and we don't need to include TargetLibraryInfo.h in SimplifyLibCalls.cpp as well.	2020-07-25 12:58:39 +01:00
Simon Pilgrim	85fa3b1299	[X86][SSE] combineX86ShufflesRecursively - move all Root node asserts to the same location. NFCI. Minor tidyup for some upcoming shuffle combine improvements.	2020-07-25 12:48:14 +01:00
Simon Pilgrim	a3ff65cfbb	SymbolRemappingReader.h - pass Twine by reference not value. NFCI.	2020-07-25 12:48:14 +01:00
Florian Hahn	c1dc5ad58a	[IPSCCP] Drop argmemonly after replacing pointer argument. This patch updates IPSCCP to drop argmemonly and inaccessiblemem_or_argmemonly if it replaces a pointer argument. Fixes PR46717. Reviewers: efriedma, davide, nikic, jdoerfert Reviewed By: efriedma, jdoerfert Differential Revision: https://reviews.llvm.org/D84432	2020-07-25 11:52:14 +01:00
Nathan James	edd124eebe	Fix C2975 error under MSVC Apparantly a constexpr value isn't a compile time constant under certain versions of MSVC.	2020-07-25 11:03:59 +01:00
Simon Pilgrim	5a6feccd46	[X86][SSE] getFauxShuffle - ignore undemanded sources for PACKSS/PACKUS faux shuffles If we don't care about an entire LHS/RHS of the PACK op, then can just treat it the same as undef (we don't care if it saturates) and is safe to treat as a shuffle. This can happen if we attempt to decode as a faux shuffle before SimplifyDemandedVectorElts has been called on the PACK which should replace the source with UNDEF entirely.	2020-07-25 10:51:14 +01:00
Nathan James	8b5de07c6a	[ADT] Add a range-based version of std::move Adds a range-based version of `std::move`, the version that moves a range, not the one that creates r-value references. Reviewed By: dblaikie, gamesh411 Differential Revision: https://reviews.llvm.org/D83902	2020-07-25 10:37:34 +01:00
Jessica Paquette	0266a8b395	[AArch64][GlobalISel] Look through constants when selection stores of 0 Very minor code size improvements (hits 8 times in Bullet at -O3), but still something. Also very minor NFC change to make sure we only search for a 0 constant when selecting a store. Before, we'd do this for loads as well. Differential Revision: https://reviews.llvm.org/D84573	2020-07-24 22:46:14 -07:00
Amy Kwan	2f09b60413	[PowerPC] Exploit the High Order Vector Multiply Instructions on Power10 This patch aims to exploit the following vector multiply high instructions on Power10. vmulhsw VRT, VRA, VRB vmulhsd VRT, VRA, VRB vmulhuw VRT, VRA, VRB vmulhud VRT, VRA, VRB Differential Revision: https://reviews.llvm.org/D82584	2020-07-24 20:57:57 -05:00
Rong Xu	f48156accc	[PGO] Fix incorrect function entry count Function entry count might be zero after the profile counts reset and before reentry to the function. Zero profile entry count is very bad as the profile count from BFI will be wrong. A simple fix is to set the profile entry count to 1 if there are non-zero profile counts in this function. Differential Revision: https://reviews.llvm.org/D84378	2020-07-24 17:39:55 -07:00
Rong Xu	873a89587d	[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction Skip profile count promotion if any of the ExitBlocks contains a ret instruction. This is to prevent dumping of incomplete profile -- if the the loop is a long running loop and dump is called in the middle of the loop, the result profile is incomplete. ExitBlocks containing a ret instruction is an indication of a long running loop -- early exit to error handling code. Differential Revision: https://reviews.llvm.org/D84379	2020-07-24 17:38:31 -07:00
Rong Xu	ab6b85942c	Revert "[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction" This reverts commit 6fdc6f6c7d34af60c45c405f448370a684ef6f2a.	2020-07-24 17:35:44 -07:00
Rong Xu	84b04d6437	Revert "[PGO][InstrProf] Do not promote count if the exit blocks contains ret instruction" This reverts commit 867ef4472d8e57384c929e4f06b74d1ac8883a99.	2020-07-24 17:33:49 -07:00

1 2 3 4 5 ...

200773 Commits