llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

History

Nikita Popov 31fa5fa3a2 [MemCpyOpt] memset->memcpy forwarding with undef tail Currently memcpyopt optimizes cases like memset(a, byte, N); memcpy(b, a, M); to memset(a, byte, N); memset(b, byte, M); if M <= N. Often this allows further simplifications down the line, which drop the first memset entirely. This patch extends this optimization for the case where M > N, but we know that the bytes a[N..M] are undef due to alloca/lifetime.start. This situation arises relatively often for Rust code, because Rust does not initialize trailing structure padding and loves to insert redundant memcpys. This also fixes https://bugs.llvm.org/show_bug.cgi?id=39844. For the implementation, I'm reusing a bit of code for a similar existing optimization (direct memcpy of undef). I've also added memset support to MemDepAnalysis GetLocation -- Instead, getPointerDependencyFrom could be used, but it seems to make more sense to add this to GetLocation and thus make the computation cachable. Differential Revision: https://reviews.llvm.org/D55120 llvm-svn: 348645		2018-12-07 21:16:58 +00:00
..
AliasAnalysis.cpp	Allow subclassing ExternalAA	2018-11-07 20:26:42 +00:00
AliasAnalysisEvaluator.cpp
AliasAnalysisSummary.cpp
AliasAnalysisSummary.h
AliasSetTracker.cpp	[AliasSetTracker] Misc cleanup (NFCI)	2018-11-01 23:37:51 +00:00
Analysis.cpp	[stack-safety] Empty local passes for Stack Safety Global Analysis	2018-11-26 23:05:48 +00:00
AssumptionCache.cpp
BasicAliasAnalysis.cpp	Replace most users of UnknownSize with LocationSize::unknown(); NFC	2018-10-10 21:28:44 +00:00
BlockFrequencyInfo.cpp
BlockFrequencyInfoImpl.cpp	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...)	2018-09-27 02:13:45 +00:00
BranchProbabilityInfo.cpp	[TI removal] Make variables declared as `TerminatorInst` and initialized	2018-10-15 10:04:59 +00:00
CallGraph.cpp	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...)	2018-09-27 02:13:45 +00:00
CallGraphSCCPass.cpp	Fixing -print-module-scope for legacy SCC passes	2018-12-03 14:48:15 +00:00
CallPrinter.cpp	Revert "Extend CFGPrinter and CallPrinter with Heat Colors"	2018-06-29 17:48:58 +00:00
CaptureTracking.cpp	Introduce MaxUsesToExplore argument to capture tracking	2018-11-29 20:08:12 +00:00
CFG.cpp	[TI removal] Make variables declared as `TerminatorInst` and initialized	2018-10-15 10:04:59 +00:00
CFGPrinter.cpp	[CFG Printer] Add support for writing the dot files with a custom	2018-10-09 04:30:23 +00:00
CFLAndersAliasAnalysis.cpp	Replace most users of UnknownSize with LocationSize::unknown(); NFC	2018-10-10 21:28:44 +00:00
CFLGraph.h	[IR] Replace `isa<TerminatorInst>` with `isTerminator()`.	2018-08-26 09:51:22 +00:00
CFLSteensAliasAnalysis.cpp
CGSCCPassManager.cpp	[New PM] Introducing PassInstrumentation framework	2018-09-20 17:08:45 +00:00
CMakeLists.txt	[stack-safety] Empty local passes for Stack Safety Local Analysis	2018-11-26 21:57:47 +00:00
CmpInstAnalysis.cpp	[CmpInstAnalysis] fix function signature for ICmp code to predicate; NFC	2018-12-04 18:53:27 +00:00
CodeMetrics.cpp
ConstantFolding.cpp	[ConstantFolding] Add support for saturating add/sub	2018-11-20 17:05:55 +00:00
CostModel.cpp
Delinearization.cpp
DemandedBits.cpp	Reapply "[DemandedBits][BDCE] Support vectors of integers"	2018-12-07 15:38:13 +00:00
DependenceAnalysis.cpp	Replace most users of UnknownSize with LocationSize::unknown(); NFC	2018-10-10 21:28:44 +00:00
DivergenceAnalysis.cpp	[DA] GPUDivergenceAnalysis for unstructured GPU kernels	2018-11-30 22:55:20 +00:00
DominanceFrontier.cpp
DomPrinter.cpp	Revert "Extend CFGPrinter and CallPrinter with Heat Colors"	2018-06-29 17:48:58 +00:00
EHPersonalities.cpp	[TI removal] Make variables declared as `TerminatorInst` and initialized	2018-10-15 10:04:59 +00:00
GlobalsModRef.cpp	Remove trailing space	2018-07-30 19:41:25 +00:00
GuardUtils.cpp	Re-enable "[NFC] Unify guards detection"	2018-08-30 03:39:16 +00:00
IndirectCallPromotionAnalysis.cpp
InlineCost.cpp	[Inliner] Penalise inlining of calls with loops at Oz	2018-11-05 14:54:34 +00:00
InstCount.cpp
InstructionPrecedenceTracking.cpp	[LICM] Hoist guards from non-header blocks	2018-11-12 09:29:58 +00:00
InstructionSimplify.cpp	[ValueTracking] add helper function for testing implied condition; NFCI	2018-12-02 13:26:03 +00:00
Interval.cpp
IntervalPartition.cpp
IteratedDominanceFrontier.cpp	[IDF] Teach Iterated Dominance Frontier to use a snapshot CFG based on a GraphDiff.	2018-08-17 17:39:15 +00:00
IVDescriptors.cpp	Fix parenthesis warning in IVDescriptors	2018-11-30 13:54:36 +00:00
IVUsers.cpp
LazyBlockFrequencyInfo.cpp
LazyBranchProbabilityInfo.cpp
LazyCallGraph.cpp	ADT/STLExtras: Introduce llvm::empty; NFC	2018-10-31 00:23:23 +00:00
LazyValueInfo.cpp	[LVI] run transfer function for binary operator even when the RHS isn't a constant	2018-11-21 05:24:12 +00:00
LegacyDivergenceAnalysis.cpp	LegacyDivergenceAnalysis: fix uninitialized value	2018-11-30 23:07:49 +00:00
Lint.cpp
LLVMBuild.txt
Loads.cpp
LoopAccessAnalysis.cpp	[LV] Avoid vectorizing unsafe dependencies in uniform address	2018-11-19 15:39:59 +00:00
LoopAnalysisManager.cpp	[LoopPassManager] MemorySSA should be preserved when enabled.	2018-09-06 20:54:24 +00:00
LoopInfo.cpp	[TI removal] Make variables declared as `TerminatorInst` and initialized	2018-10-15 10:04:59 +00:00
LoopPass.cpp	[LoopPass] fixing 'Modification' messages in -debug-pass=Executions for loop passes	2018-11-19 15:10:59 +00:00
LoopUnrollAnalyzer.cpp
MemDepPrinter.cpp	Remove trailing space	2018-07-30 19:41:25 +00:00
MemDerefPrinter.cpp
MemoryBuiltins.cpp	Reverting r340807.	2018-08-30 18:37:18 +00:00
MemoryDependenceAnalysis.cpp	[MemCpyOpt] memset->memcpy forwarding with undef tail	2018-12-07 21:16:58 +00:00
MemoryLocation.cpp	Replace most users of UnknownSize with LocationSize::unknown(); NFC	2018-10-10 21:28:44 +00:00
MemorySSA.cpp	[MemorySSA] Create query after checking if instruction is a fence.	2018-11-13 21:12:49 +00:00
MemorySSAUpdater.cpp	[IR] Add hasNPredecessors, hasNPredecessorsOrMore to BasicBlock	2018-11-19 19:54:27 +00:00
ModuleDebugInfoPrinter.cpp
ModuleSummaryAnalysis.cpp	[ThinLTO] Allow importing of functions with var args	2018-12-01 05:11:46 +00:00
MustExecute.cpp	[LICM] Hoist guards from non-header blocks	2018-11-12 09:29:58 +00:00
ObjCARCAliasAnalysis.cpp
ObjCARCAnalysisUtils.cpp
ObjCARCInstKind.cpp
OptimizationRemarkEmitter.cpp
OrderedBasicBlock.cpp	[NFC] Sanitizing asserts for OrderedBasicBlock	2018-09-11 08:46:19 +00:00
OrderedInstructions.cpp	[NFC] Move OrderedInstructions and InstructionPrecedenceTracking to Analysis	2018-08-30 04:49:03 +00:00
PHITransAddr.cpp
PhiValues.cpp	[PhiValues] Use callback value handles to invalidate deleted values	2018-08-24 15:48:30 +00:00
PostDominators.cpp
ProfileSummaryInfo.cpp	[ProfileSummary] Standardize methods and fix comment	2018-11-19 05:23:16 +00:00
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp	Test commit, fix a minor typo.	2018-07-22 20:04:42 +00:00
RegionPass.cpp	[NFC][PassTiming] factor out generic PassTimingInfo	2018-08-28 21:06:51 +00:00
RegionPrinter.cpp	Revert "Extend CFGPrinter and CallPrinter with Heat Colors"	2018-06-29 17:48:58 +00:00
ScalarEvolution.cpp	[SCEV][NFC] Verify IR in isLoop[Entry,Backedge]GuardedByCond	2018-11-08 05:07:58 +00:00
ScalarEvolutionAliasAnalysis.cpp	Make LocationSize a proper Optional type; NFC	2018-10-09 03:18:56 +00:00
ScalarEvolutionExpander.cpp	Revert r347934 "[SCEV] Guard movement of insertion point for loop-invariants"	2018-12-05 23:13:50 +00:00
ScalarEvolutionNormalization.cpp
ScopedNoAliasAA.cpp
StackSafetyAnalysis.cpp	[stack-safety] Update comment	2018-11-27 01:56:44 +00:00
StratifiedSets.h
SyncDependenceAnalysis.cpp	[TI removal] Switch some newly added code over to use `Instruction`	2018-10-19 00:22:10 +00:00
SyntheticCountsUtils.cpp
TargetLibraryInfo.cpp	Revert unapproved commit	2018-11-24 07:26:55 +00:00
TargetTransformInfo.cpp	[TTI] getOperandInfo - a broadcast shuffle means the result is OK_UniformValue	2018-11-14 15:04:08 +00:00
Trace.cpp
TypeBasedAliasAnalysis.cpp
TypeMetadataUtils.cpp	[WPD] Fix incorrect devirtualization after indirect call promotion	2018-09-27 14:55:32 +00:00
ValueLattice.cpp
ValueLatticeUtils.cpp
ValueTracking.cpp	[ValueTracking] Support funnel shifts in computeKnownBits()	2018-12-02 14:14:11 +00:00
VectorUtils.cpp	[VectorUtils] Use namespace for InterleaveGroup template specialization.	2018-11-13 16:26:34 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//