llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Teresa Johnson e53f3c1bec [Inliner] Inlining should honor nobuiltin attributes		2020-02-28 07:34:14 -08:00

Summary: Final patch in series to fix inlining between functions with different nobuiltin attributes/options, which was specifically an issue in LTO. See discussion on D61634 for background. The prior patch in this series (D67923) enabled per-Function TLI construction that identified the nobuiltin attributes. Here I have allowed inlining to proceed if the callee's nobuiltins are a subset of the caller's nobuiltins, but not in the reverse case, which should be conservatively correct. This is controlled by a new option, -inline-caller-superset-nobuiltin, which is enabled by default.

Reviewers: hfinkel, gchatelet, chandlerc, davidxl
Subscribers: arsenm, jvesely, nhaehnle, mehdi_amini, eraman, hiraditya, haicheng, dexonsmith, kerbowa, llvm-commits
Tags: #llvm
Differential Revision: https://reviews.llvm.org/D74162
..
AliasAnalysis.cpp	[IR] Lazily number instructions for local dominance queries	2020-02-18 14:44:24 -08:00
AliasAnalysisEvaluator.cpp
AliasAnalysisSummary.cpp
AliasAnalysisSummary.h
AliasSetTracker.cpp	[NFC] Remove trailing space	2020-02-18 10:49:13 +08:00
Analysis.cpp
AssumptionCache.cpp
BasicAliasAnalysis.cpp	[BasicAA] Make BasicAA a cfg pass.	2020-02-11 11:30:08 -08:00
BlockFrequencyInfo.cpp
BlockFrequencyInfoImpl.cpp	[BFI] Add a debug check for unknown block queries.	2020-02-04 10:05:28 -08:00
BranchProbabilityInfo.cpp	[BrachProbablityInfo] Add invalidate method.	2020-01-17 10:47:51 -08:00
CallGraph.cpp	Introduce a CallGraph updater helper class	2020-02-08 14:16:48 -06:00
CallGraphSCCPass.cpp	Introduce a CallGraph updater helper class	2020-02-08 14:16:48 -06:00
CallPrinter.cpp	Make llvm::StringRef to std::string conversions explicit.	2020-01-28 23:25:25 +01:00
CaptureTracking.cpp	[IR] Lazily number instructions for local dominance queries	2020-02-18 14:44:24 -08:00
CFG.cpp
CFGPrinter.cpp	Flags for displaying only hot nodes in CFGPrinter graph	2020-02-21 17:20:00 -08:00
CFLAndersAliasAnalysis.cpp
CFLGraph.h
CFLSteensAliasAnalysis.cpp
CGSCCPassManager.cpp	Add PassManagerImpl.h to hide implementation details	2020-02-03 11:15:55 -08:00
CMakeLists.txt	[IR] Lazily number instructions for local dominance queries	2020-02-18 14:44:24 -08:00
CmpInstAnalysis.cpp
CodeMetrics.cpp
ConstantFolding.cpp	[AMDGPU][ConstantFolding] Fold llvm.amdgcn.fract intrinsic	2020-02-27 14:37:53 +00:00
CostModel.cpp
DDG.cpp	[DDG] Data Dependence Graph - Graph Simplification	2020-02-19 13:41:51 -05:00
Delinearization.cpp
DemandedBits.cpp
DependenceAnalysis.cpp	[DA] Delinearization of fixed-size multi-dimensional arrays	2020-02-27 10:29:01 -05:00
DependenceGraphBuilder.cpp	[DDG] Data Dependence Graph - Graph Simplification	2020-02-19 13:41:51 -05:00
DivergenceAnalysis.cpp	[DA] Don't propagate from unreachable blocks	2020-01-24 10:28:11 -08:00
DominanceFrontier.cpp
DomPrinter.cpp
DomTreeUpdater.cpp	[NFC] Fixes -Wrange-loop-analysis warnings	2020-01-01 20:01:37 +01:00
EHPersonalities.cpp
GlobalsModRef.cpp	[GlobalsModRef] Add invalidate method	2020-01-17 10:33:54 -08:00
GuardUtils.cpp	[NFC] Remove trailing space	2020-02-18 10:49:13 +08:00
IndirectCallPromotionAnalysis.cpp
InlineCost.cpp	[Inliner] Inlining should honor nobuiltin attributes	2020-02-28 07:34:14 -08:00
InstCount.cpp
InstructionPrecedenceTracking.cpp	[IR] Lazily number instructions for local dominance queries	2020-02-18 14:44:24 -08:00
InstructionSimplify.cpp	Reapply: [SVE] Fix bug in simplification of scalable vector instructions	2020-02-05 10:00:09 -08:00
Interval.cpp
IntervalPartition.cpp
IVDescriptors.cpp	[SCEV] Remove unused ScalarEvolutionExpander.h includes (NFC).	2020-01-04 18:29:35 +00:00
IVUsers.cpp
LazyBlockFrequencyInfo.cpp
LazyBranchProbabilityInfo.cpp
LazyCallGraph.cpp	[LazyCallGraph] Fix ambiguous index value	2020-02-18 23:32:55 -05:00
LazyValueInfo.cpp	Temporarily revert "Reapply [LVI] Normalize pointer behavior" and "[LVI] Restructure caching"	2019-12-20 10:25:57 -08:00
LegacyDivergenceAnalysis.cpp	Resubmit: [DA][TTI][AMDGPU] Add option to select GPUDA with TTI	2020-01-24 10:39:40 -08:00
Lint.cpp	[instrinsics] Add @llvm.memcpy.inline instrinsics	2020-01-28 09:42:01 +01:00
LLVMBuild.txt
Loads.cpp	[NFC] Remove trailing space	2020-02-18 10:49:13 +08:00
LoopAccessAnalysis.cpp	[VectorUtils] Rework the Vector Function Database (VFDatabase).	2020-01-16 15:08:26 +00:00
LoopAnalysisManager.cpp	Add PassManagerImpl.h to hide implementation details	2020-02-03 11:15:55 -08:00
LoopCacheAnalysis.cpp	[LoopCacheAnalysis]: Add support for negative stride	2020-02-10 13:22:35 -05:00
LoopInfo.cpp	Rename LoopInfo::isRotated() to LoopInfo::isRotatedForm().	2019-12-12 14:22:36 -05:00
LoopPass.cpp	NFC. Remove obsolete SimpleAnalysis infrastructure	2020-01-23 13:58:30 +07:00
LoopUnrollAnalyzer.cpp
MemDepPrinter.cpp
MemDerefPrinter.cpp
MemoryBuiltins.cpp	Reland [DataLayout] Fix occurrences that size and range of pointers are assumed to be the same.	2019-12-13 14:30:21 +00:00
MemoryDependenceAnalysis.cpp	[DependenceAnalysis] Memory dependence analysis internal caching mechanism is broken in presence of TBAA (PR42733).	2020-02-21 20:20:36 +07:00
MemoryLocation.cpp	[IR] Split out target specific intrinsic enums into separate headers	2019-12-11 18:02:14 -08:00
MemorySSA.cpp	[MemorySSA] Don't verify MemorySSA unless VerifyMemorySSA enabled	2020-02-13 18:46:58 +01:00
MemorySSAUpdater.cpp	[MemorySSA] Moving at the end often means before terminator.	2019-11-20 17:11:00 -08:00
ModuleDebugInfoPrinter.cpp
ModuleSummaryAnalysis.cpp	[NFC] Remove trailing space	2020-02-18 10:49:13 +08:00
MustExecute.cpp	[MustExecute] Add backward exploration for must-be-executed-context	2020-02-20 14:49:30 +09:00
ObjCARCAliasAnalysis.cpp
ObjCARCAnalysisUtils.cpp
ObjCARCInstKind.cpp
OptimizationRemarkEmitter.cpp	Compute ORE, BPI, BFI in Loop passes.	2020-02-12 09:15:18 -08:00
OrderedInstructions.cpp	[IR] Lazily number instructions for local dominance queries	2020-02-18 14:44:24 -08:00
PHITransAddr.cpp
PhiValues.cpp	[PhiValues] Remove redundant map searches	2019-11-23 10:32:56 +02:00
PostDominators.cpp	[CodeMoverUtils] Added an API to check if an instruction can be safely	2019-11-22 21:29:08 +00:00
ProfileSummaryInfo.cpp
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp
RegionPass.cpp
RegionPrinter.cpp
ScalarEvolution.cpp	[NFC] [DA] Refactoring getIndexExpressionsFromGEP	2020-02-24 17:32:30 -05:00
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp	[SCEV][IndVars] Always provide insertion point to the SCEVExpander::isHighCostExpansion()	2020-02-25 23:05:59 +03:00
ScalarEvolutionNormalization.cpp
ScopedNoAliasAA.cpp
StackSafetyAnalysis.cpp	Support zero size types in StackSafetyAnalysis.	2020-01-27 15:22:59 -08:00
StratifiedSets.h
SyncDependenceAnalysis.cpp	Fix assert that doesn't check anything.	2020-01-23 19:02:00 -08:00
SyntheticCountsUtils.cpp
TargetLibraryInfo.cpp	No longer generate calls to *_finite	2020-02-28 10:07:37 +01:00
TargetTransformInfo.cpp	[NFC] Remove trailing space	2020-02-18 10:49:13 +08:00
Trace.cpp
TypeBasedAliasAnalysis.cpp	[Metadata] Add TBAA struct metadata to `AAMDNode`	2020-01-06 11:05:15 +03:00
TypeMetadataUtils.cpp
ValueLattice.cpp
ValueLatticeUtils.cpp
ValueTracking.cpp	[ValueTracking] Improve isKnownNonNaN() to recognize zero splats.	2020-02-19 09:35:36 -08:00
VectorUtils.cpp	[VectorUtils] Accept IRBuilderBase; NFC	2020-02-18 18:02:04 +01:00
VFABIDemangling.cpp	[llvm][VectorUtils] Tweak VFShape for scalable vector functions.	2020-01-30 05:53:56 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n); however,
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.
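
The equivalence of the two forms can be checked with a small standalone
sketch. It is not LLVM code: the function names are hypothetical, and it uses
a scaled-down model in which uint32_t stands in for i64 and uint64_t stands in
for the wider i65. It also assumes the exit value is taken after %n - 1
back-edges, as the (-1 + %n) and (-2 + %n) factors suggest.

```cpp
#include <cassert>
#include <cstdint>

// Scaled-down model (an assumption of this sketch, not of LLVM):
// uint32_t plays the role of i64, uint64_t the role of the wider i65.

// Value of the recurrence {1,+,3,+,2} after `iters` back-edges,
// obtained by stepping it one iteration at a time.
uint32_t stepChrec(uint32_t iters) {
  uint32_t value = 1, step = 3;
  for (uint32_t i = 0; i < iters; ++i) {
    value += step; // apply the first-order step
    step += 2;     // the step itself grows by the second-order term
  }
  return value;
}

// Mirrors the shape of the expression ScalarEvolution currently forms:
// (-2 + 2 * trunc((zext(n - 2) * zext(n - 1)) /u 2) + 3 * n)
uint32_t expandedExitValue(uint32_t n) {
  uint64_t a = static_cast<uint32_t>(n - 2);          // zext(-2 + %n)
  uint64_t b = static_cast<uint32_t>(n - 1);          // zext(-1 + %n)
  uint32_t half = static_cast<uint32_t>((a * b) / 2); // trunc of wide /u 2
  return static_cast<uint32_t>(-2) + 2u * half + 3u * n;
}

// The simple form suggested above: (%n * %n), wrapping in the narrow type.
uint32_t simpleExitValue(uint32_t n) { return n * n; }
```

Calling expandedExitValue(n) and simpleExitValue(n) with the same n should
give the same wrapped result, and stepChrec(n - 1) should agree for n >= 1.
The wide multiply and divide exist only to halve the product without losing
its top bit, which is the cost the simple form avoids.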

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll,
ScalarEvolution is forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

Truncation commutes with multiplication by -1, so the first two terms
cancel and this could be folded to

(-1 * (trunc i64 undef to i32))
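
The cancellation can be illustrated with a short standalone sketch in plain
C++ (not LLVM code; the helper names are hypothetical). It models i64 as
uint64_t and i32 as uint32_t, and it stands in a fixed value u for undef,
which is a simplification: in LLVM IR each use of undef may take a different
value.

```cpp
#include <cassert>
#include <cstdint>

// trunc i64 ... to i32: keep the low 32 bits.
uint32_t trunc32(uint64_t v) { return static_cast<uint32_t>(v); }

// Shape of the expression ScalarEvolution currently forms:
// trunc(-1 * arg5) + trunc(arg5) + (-1 * trunc(u))
uint32_t unfolded(uint64_t arg5, uint64_t u) {
  return trunc32(0 - arg5) + trunc32(arg5) + (0u - trunc32(u));
}

// Shape of the suggested folded form: -1 * trunc(u)
uint32_t folded(uint64_t u) { return 0u - trunc32(u); }
```

For any arg5, unfolded(arg5, u) should equal folded(u), because the low 32
bits of -arg5 and arg5 always sum to zero modulo 2^32.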

//===---------------------------------------------------------------------===//