llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 21:13:02 +02:00

History

Michael Zolotukhin 20e33533a3 [SCEV] Compute affine range in another way to avoid bitwidth extending. Summary: This approach has two major advantages over the existing one: 1. We don't need to extend bitwidth in our computations. Extending bitwidth is a big issue for compile time as we often end up working with APInts wider than 64bit, which is a slow case for APInt. 2. When we zero extend a wrapped range, we lose some information (we replace the range with [0, 1 << src bit width)). Thus, avoiding such extensions better preserves information. Correctness testing: I ran 'ninja check' with assertions that the new implementation of getRangeForAffineAR gives the same results as the old one (this functionality is not present in this patch). There were several failures - I inspected them manually and found out that they all are caused by the fact that we're returning more accurate results now (see bullet (2) above). Without such assertions 'ninja check' works just fine, as well as SPEC2006. Compile time testing: CTMark/Os: - mafft/pairlocalalign -16.98% - tramp3d-v4/tramp3d-v4 -12.72% - lencod/lencod -11.51% - Bullet/bullet -4.36% - ClamAV/clamscan -3.66% - 7zip/7zip-benchmark -3.19% - sqlite3/sqlite3 -2.95% - SPASS/SPASS -2.74% - Average -5.81% Performance testing: The changes are expected to be neutral for runtime performance. Reviewers: sanjoy, atrick, pete Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30477 llvm-svn: 297992		2017-03-16 21:07:38 +00:00
..
AliasAnalysis.cpp	[AliasAnalysis] Fences do not modify constant memory location	2017-01-20 00:21:33 +00:00
AliasAnalysisEvaluator.cpp
AliasAnalysisSummary.cpp
AliasAnalysisSummary.h
AliasSetTracker.cpp	Use a WeakVH for UnknownInstructions in AliasSetTracker	2017-03-11 01:15:48 +00:00
Analysis.cpp
AssumptionCache.cpp	AssumptionCache: Disable the verifier by default, move it behind a hidden cl::opt and verify from releaseMemory().	2017-02-15 21:10:09 +00:00
BasicAliasAnalysis.cpp	[BasicAA] Take attributes into account when requesting modref info for a call site	2017-03-01 13:19:51 +00:00
BlockFrequencyInfo.cpp	[PGO] internal option cleanups	2017-02-02 21:29:17 +00:00
BlockFrequencyInfoImpl.cpp	Cleanup dump() functions.	2017-01-28 02:02:38 +00:00
BranchProbabilityInfo.cpp	Retry: [BPI] Use a safer constructor to calculate branch probabilities	2016-12-17 01:02:08 +00:00
CallGraph.cpp	Cleanup dump() functions.	2017-01-28 02:02:38 +00:00
CallGraphSCCPass.cpp	Implement getPassName() for IR printing passes.	2017-03-10 07:09:20 +00:00
CallPrinter.cpp
CaptureTracking.cpp
CFG.cpp
CFGPrinter.cpp
CFLAndersAliasAnalysis.cpp	Apply clang-tidy's performance-unnecessary-value-param to LLVM.	2017-01-13 14:39:03 +00:00
CFLGraph.h
CFLSteensAliasAnalysis.cpp
CGSCCPassManager.cpp	[PM/LCG] Teach the LazyCallGraph how to replace a function without	2017-02-09 23:24:13 +00:00
CMakeLists.txt	[PM] Separate the LoopAnalysisManager from the LoopPassManager and move	2017-01-11 09:43:56 +00:00
CodeMetrics.cpp	Revert @llvm.assume with operator bundles (r289755-r289757)	2016-12-19 08:22:17 +00:00
ConstantFolding.cpp	[Analysis] Add LibFunc_ prefix to enums in TargetLibraryInfo. (NFC)	2017-01-23 23:16:46 +00:00
CostModel.cpp	[TargetTransformInfo] getIntrinsicInstrCost() scalarization estimation improved	2017-03-14 06:35:36 +00:00
Delinearization.cpp
DemandedBits.cpp	Revert @llvm.assume with operator bundles (r289755-r289757)	2016-12-19 08:22:17 +00:00
DependenceAnalysis.cpp	Cleanup dump() functions.	2017-01-28 02:02:38 +00:00
DivergenceAnalysis.cpp
DominanceFrontier.cpp	[PM] Introduce an analysis set used to preserve all analyses over	2017-01-15 06:32:49 +00:00
DomPrinter.cpp
EHPersonalities.cpp
GlobalsModRef.cpp
IndirectCallPromotionAnalysis.cpp
InlineCost.cpp	[InlineCost] Move the code in isGEPOffsetConstant to a lambda.	2017-02-25 00:10:22 +00:00
InstCount.cpp
InstructionSimplify.cpp	[InstSimplify] allow folds for bool vector div/rem	2017-03-09 21:56:03 +00:00
Interval.cpp
IntervalPartition.cpp
IteratedDominanceFrontier.cpp	Drop graph_ prefix	2017-02-09 20:37:46 +00:00
IVUsers.cpp	[PM] Separate the LoopAnalysisManager from the LoopPassManager and move	2017-01-11 09:43:56 +00:00
LazyBlockFrequencyInfo.cpp	[LazyBFI] Fix typos	2017-02-14 17:21:12 +00:00
LazyBranchProbabilityInfo.cpp
LazyCallGraph.cpp	[LCG] Fix EXPENSIVE_CHECKS typo. NFC	2017-02-28 18:34:55 +00:00
LazyValueInfo.cpp	[LVI] Add Datalayout to the class LazyValueInfo since all its Impls require it. NFC	2017-03-12 14:06:41 +00:00
Lint.cpp	Revert @llvm.assume with operator bundles (r289755-r289757)	2016-12-19 08:22:17 +00:00
LLVMBuild.txt
Loads.cpp	[JumpThread] Enhance finding partial redundant loads by continuing scanning single predecessor	2017-02-02 15:12:34 +00:00
LoopAccessAnalysis.cpp	[SLP] Revert everything that has to do with memory access sorting.	2017-03-10 18:59:07 +00:00
LoopAnalysisManager.cpp	Revert r293017 and fix the actual underlying issue.	2017-02-07 01:50:48 +00:00
LoopInfo.cpp	Make VerifyDomInfo and VerifyLoopInfo global variables	2017-01-24 05:52:07 +00:00
LoopPass.cpp	Implement getPassName() for IR printing passes.	2017-03-10 07:09:20 +00:00
LoopUnrollAnalyzer.cpp
MemDepPrinter.cpp
MemDerefPrinter.cpp
MemoryBuiltins.cpp	[Analysis] Add LibFunc_ prefix to enums in TargetLibraryInfo. (NFC)	2017-01-23 23:16:46 +00:00
MemoryDependenceAnalysis.cpp	[Devirtualization] MemDep returns non-local !invariant.group dependencies	2017-01-12 11:33:58 +00:00
MemoryLocation.cpp	[Analysis] Add LibFunc_ prefix to enums in TargetLibraryInfo. (NFC)	2017-01-23 23:16:46 +00:00
ModuleDebugInfoPrinter.cpp	[IR] Remove the DIExpression field from DIGlobalVariable.	2016-12-20 02:09:43 +00:00
ModuleSummaryAnalysis.cpp	Perform symbol binding for .symver versioned symbols	2017-03-09 00:19:49 +00:00
ObjCARCAliasAnalysis.cpp
ObjCARCAnalysisUtils.cpp
ObjCARCInstKind.cpp
OptimizationDiagnosticInfo.cpp	[ORE] Remove ORE.emit{{.+}} functions	2017-02-23 21:32:53 +00:00
OrderedBasicBlock.cpp
PHITransAddr.cpp	Revert @llvm.assume with operator bundles (r289755-r289757)	2016-12-19 08:22:17 +00:00
PostDominators.cpp	[PM] Introduce an analysis set used to preserve all analyses over	2017-01-15 06:32:49 +00:00
ProfileSummaryInfo.cpp	[ProfileSummaryInfo] Remove unneeded braces. NFCI.	2017-03-10 20:50:51 +00:00
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp	[PM] Introduce an analysis set used to preserve all analyses over	2017-01-15 06:32:49 +00:00
RegionPass.cpp	Implement getPassName() for IR printing passes.	2017-03-10 07:09:20 +00:00
RegionPrinter.cpp
ScalarEvolution.cpp	[SCEV] Compute affine range in another way to avoid bitwidth extending.	2017-03-16 21:07:38 +00:00
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp	Revert @llvm.assume with operator bundles (r289755-r289757)	2016-12-19 08:22:17 +00:00
ScalarEvolutionNormalization.cpp
ScopedNoAliasAA.cpp
SparsePropagation.cpp
StratifiedSets.h
TargetLibraryInfo.cpp	[Analysis] Add LibFunc_ prefix to enums in TargetLibraryInfo. (NFC)	2017-01-23 23:16:46 +00:00
TargetTransformInfo.cpp	[TargetTransformInfo] getIntrinsicInstrCost() scalarization estimation improved	2017-03-14 06:35:36 +00:00
Trace.cpp
TypeBasedAliasAnalysis.cpp	[TBAA] Don't generate invalid TBAA when merging nodes	2016-12-11 20:07:25 +00:00
TypeMetadataUtils.cpp	Analysis: Add appropriate const qualification to functions in TypeMetadataUtils.cpp. NFC.	2017-01-27 22:55:30 +00:00
ValueTracking.cpp	[ValueTracking] Out of range shifts might be undef	2017-03-14 10:13:17 +00:00
VectorUtils.cpp	[LV] Move interleaved access helper functions to VectorUtils (NFC)	2017-02-01 17:45:46 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n), however
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,

ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))

//===---------------------------------------------------------------------===//