llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Chandler Carruth e29471d99d [inliner] Skip debug intrinsics even earlier in computing the inline cost so that they don't impact the vector bonus. Fundamentally, counting unsimplified instructions is just wrong; it will continue to introduce instability as things which do not generate code bizarrely impact inlining. For example, sufficiently nested inlined functions could turn off the vector bonus with lifetime markers just like the debug intrinsics do. =/ This is a short-term tactical fix. Long term, I think we need to remove the vector bonus entirely. That's a separate patch and discussion though. The patch to fix this provided by Dario Domizioli. I've added some comments about the planned direction and used a heavily pruned form of debug info intrinsics for the test case. While this debug info doesn't work or "do" anything useful, it lets us easily test all manner of interference easily, and I suspect this will not be the last time we want to craft a pattern where debug info interferes with the inliner in a problematic way. llvm-svn: 200609		2014-02-01 10:38:17 +00:00
IPA	[inliner] Skip debug intrinsics even earlier in computing the inline	2014-02-01 10:38:17 +00:00
AliasAnalysis.cpp	[cleanup] Move the Dominators.h and Verifier.h headers into the IR	2014-01-13 09:26:24 +00:00
AliasAnalysisCounter.cpp	Put the functionality for printing a value to a raw_ostream as an	2014-01-09 02:29:41 +00:00
AliasAnalysisEvaluator.cpp	Put the functionality for printing a value to a raw_ostream as an	2014-01-09 02:29:41 +00:00
AliasDebugger.cpp
AliasSetTracker.cpp	Put the functionality for printing a value to a raw_ostream as an	2014-01-09 02:29:41 +00:00
Analysis.cpp	[PM] Make the verifier work independently of any pass manager.	2014-01-19 02:22:18 +00:00
BasicAliasAnalysis.cpp	Fix known typos	2014-01-24 17:20:08 +00:00
BlockFrequencyInfo.cpp	BlockFrequencyInfo: Readded getEntryFreq.	2013-12-20 22:11:11 +00:00
BranchProbabilityInfo.cpp	[block-freq] Teach branch probability how to return the edge weight in between a BasicBlock and one of its successors.	2013-12-14 02:24:25 +00:00
CaptureTracking.cpp	Make nocapture analysis work with addrspacecast	2014-01-14 19:11:52 +00:00
CFG.cpp	[cleanup] Move the Dominators.h and Verifier.h headers into the IR	2014-01-13 09:26:24 +00:00
CFGPrinter.cpp
CMakeLists.txt	delinearization of arrays	2013-11-12 22:47:20 +00:00
CodeMetrics.cpp
ConstantFolding.cpp	Add addrspacecast instruction.	2013-11-15 01:34:59 +00:00
CostModel.cpp	Get right cost for addrspacecast in cost model	2014-01-22 20:30:16 +00:00
Delinearization.cpp	Re-sort all of the includes with ./utils/sort_includes.py so that	2014-01-07 11:48:04 +00:00
DependenceAnalysis.cpp	Fix known typos	2014-01-24 17:20:08 +00:00
DominanceFrontier.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
DomPrinter.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
InstCount.cpp
InstructionSimplify.cpp	InstSimplify: Make shift, select and GEP simplifications vector-aware.	2014-01-24 17:09:53 +00:00
Interval.cpp
IntervalPartition.cpp
IVUsers.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
LazyValueInfo.cpp	Use SmallVectorImpl::iterator/const_iterator instead of SmallVector to avoid specifying the vector size.	2013-07-04 01:31:24 +00:00
LibCallAliasAnalysis.cpp
LibCallSemantics.cpp
Lint.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
LLVMBuild.txt
Loads.cpp
LoopInfo.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
LoopPass.cpp	[PM] Rename the IR printing pass header to a more generic and correct	2014-01-12 11:10:32 +00:00
Makefile
MemDepPrinter.cpp	Put the functionality for printing a value to a raw_ostream as an	2014-01-09 02:29:41 +00:00
MemoryBuiltins.cpp	Update optimization passes to handle inalloca arguments	2014-01-28 02:38:36 +00:00
MemoryDependenceAnalysis.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
ModuleDebugInfoPrinter.cpp	Put the functionality for printing a value to a raw_ostream as an	2014-01-09 02:29:41 +00:00
NoAliasAnalysis.cpp
PHITransAddr.cpp	[cleanup] Move the Dominators.h and Verifier.h headers into the IR	2014-01-13 09:26:24 +00:00
PostDominators.cpp	[PM] Pull the generic graph algorithms and data structures for dominator	2014-01-13 10:52:56 +00:00
PtrUseVisitor.cpp
README.txt
RegionInfo.cpp	[PM] Split DominatorTree into a concrete analysis result object which	2014-01-13 13:07:17 +00:00
RegionPass.cpp
RegionPrinter.cpp
ScalarEvolution.cpp	Fix crasher introduced in r200203 and caught by a libc++ buildbot. Don't assume that getMulExpr returns a SCEVMulExpr, it may have simplified it to something else!	2014-01-27 10:47:44 +00:00
ScalarEvolutionAliasAnalysis.cpp
ScalarEvolutionExpander.cpp	[cleanup] Move the Dominators.h and Verifier.h headers into the IR	2014-01-13 09:26:24 +00:00
ScalarEvolutionNormalization.cpp	[cleanup] Move the Dominators.h and Verifier.h headers into the IR	2014-01-13 09:26:24 +00:00
SparsePropagation.cpp
TargetTransformInfo.cpp	Revert "Revert "Add Constant Hoisting Pass" (r200034)"	2014-01-25 02:02:55 +00:00
Trace.cpp	Put the functionality for printing a value to a raw_ostream as an	2014-01-09 02:29:41 +00:00
TypeBasedAliasAnalysis.cpp	TBAA: fix PR17620.	2013-10-22 01:40:25 +00:00
ValueTracking.cpp	Allow speculating llvm.sqrt, fma and fmuladd	2014-01-31 00:09:00 +00:00

README.txt

Analysis Opportunities:

//===---------------------------------------------------------------------===//

In test/Transforms/LoopStrengthReduce/quadradic-exit-value.ll, the
ScalarEvolution expression for %r is this:

  {1,+,3,+,2}<loop>

Outside the loop, this could be evaluated simply as (%n * %n); however,
ScalarEvolution currently evaluates it as

  (-2 + (2 * (trunc i65 (((zext i64 (-2 + %n) to i65) * (zext i64 (-1 + %n) to i65)) /u 2) to i64)) + (3 * %n))

In addition to being much more complicated, it involves i65 arithmetic,
which is very inefficient when expanded into code.
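
As a rough check of the claim above, the following Python sketch models both
forms with wrapping integer arithmetic. It is only an illustration, not how
ScalarEvolution is implemented. The helper names (chrec_at, scev_exit_value,
simple_exit_value) are hypothetical, and the sketch assumes the usual
binomial-coefficient reading of an add recurrence, with the exit value taken
at iteration %n - 1 (which is what the shape of the quoted expression
suggests).

```python
from math import comb

MASK64 = (1 << 64) - 1  # wrap to i64
MASK65 = (1 << 65) - 1  # wrap to i65


def chrec_at(coeffs, i):
    """Value of the add recurrence {c0,+,c1,+,c2,...} at iteration i >= 0.

    Assumes the binomial form: c0*C(i,0) + c1*C(i,1) + c2*C(i,2) + ...
    """
    return sum(c * comb(i, k) for k, c in enumerate(coeffs))


def scev_exit_value(n):
    """Model of the long expression quoted above, for an i64 value n."""
    a = (n - 2) & MASK64       # zext i64 (-2 + %n) to i65
    b = (n - 1) & MASK64       # zext i64 (-1 + %n) to i65
    prod = (a * b) & MASK65    # i65 multiply
    half = prod >> 1           # /u 2
    t = half & MASK64          # trunc i65 ... to i64
    return (-2 + 2 * t + 3 * n) & MASK64


def simple_exit_value(n):
    """The suggested simpler form, (%n * %n), in i64."""
    return (n * n) & MASK64


if __name__ == "__main__":
    # {1,+,3,+,2} at iteration i is (i + 1)**2, so iteration n - 1 gives n*n.
    for n in (1, 2, 3, 10, 1000):
        assert chrec_at([1, 3, 2], n - 1) == n * n
    # Both forms agree modulo 2**64 on the sampled values.
    for n in (0, 1, 2, 3, 10, 2**32, 2**63, 2**64 - 1):
        assert scev_exit_value(n) == simple_exit_value(n)
    print("both forms agree on the sampled values")
```

The extra bit in i65 is there because the product of the two zero-extended
i64 values is halved before being truncated; the simpler form needs no
widening at all.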

//===---------------------------------------------------------------------===//

In formatValue in test/CodeGen/X86/lsr-delayed-fold.ll, ScalarEvolution is
forming this expression:

((trunc i64 (-1 * %arg5) to i32) + (trunc i64 %arg5 to i32) + (-1 * (trunc i64 undef to i32)))

This could be folded to

(-1 * (trunc i64 undef to i32))
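
The fold works because truncation commutes with negation modulo 2^32, so the
first two terms cancel. The Python sketch below checks this numerically. It
is an illustration only: the function names are hypothetical, and undef is
modelled as one arbitrary but fixed i64 value u, which is a simplifying
assumption rather than LLVM's full undef semantics.

```python
MASK32 = (1 << 32) - 1  # wrap to i32
MASK64 = (1 << 64) - 1  # wrap to i64


def trunc_i64_to_i32(x):
    """Keep the low 32 bits of an i64 value."""
    return x & MASK32


def unfolded(arg5, u):
    """Model of the three-term expression ScalarEvolution forms."""
    neg_arg5 = (-1 * arg5) & MASK64              # (-1 * %arg5) in i64
    term1 = trunc_i64_to_i32(neg_arg5)           # trunc i64 (-1 * %arg5) to i32
    term2 = trunc_i64_to_i32(arg5 & MASK64)      # trunc i64 %arg5 to i32
    term3 = (-1 * trunc_i64_to_i32(u)) & MASK32  # -1 * (trunc i64 undef to i32)
    return (term1 + term2 + term3) & MASK32


def folded(u):
    """Model of the suggested folded form."""
    return (-1 * trunc_i64_to_i32(u)) & MASK32


if __name__ == "__main__":
    samples = (0, 1, 7, 2**31, 2**32 - 1, 2**32, 2**63, 2**64 - 1)
    for arg5 in samples:
        for u in samples:
            assert unfolded(arg5, u) == folded(u)
    print("unfolded and folded forms agree on the sampled values")
```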

//===---------------------------------------------------------------------===//