llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-02 00:42:52 +01:00

Author	SHA1	Message	Date
Jakob Stoklund Olesen	7b1480ff12	Add the SpillPlacement analysis pass. This pass precomputes CFG block frequency information that can be used by the register allocator to find optimal spill code placement. Given an interference pattern, placeSpills() will compute which basic blocks should have the current variable enter or exit in a register, and which blocks prefer the stack. The algorithm is ready to consume block frequencies from profiling data, but for now it gets by with the static estimates used for spill weights. This is a work in progress and still not hooked up to RegAllocGreedy. llvm-svn: 122938	2011-01-06 01:21:53 +00:00
Jakob Stoklund Olesen	abf8941a60	Turn the EdgeBundles class into a stand-alone machine CFG analysis pass. The analysis will be needed by both the greedy register allocator and the X86FloatingPoint pass. It only needs to be computed once when the CFG doesn't change. This pass is very fast, usually showing up as 0.0% wall time. llvm-svn: 122832	2011-01-04 21:10:05 +00:00
Jakob Stoklund Olesen	2879da5e13	Pass a Banner argument to the machine code verifier both from createMachineVerifierPass and MachineFunction::verify. The banner is printed before the machine code dump, just like the printer pass. llvm-svn: 122113	2010-12-18 00:06:56 +00:00
Jakob Stoklund Olesen	d40af5ffbd	Add MachineLoopRanges analysis. A MachineLoopRange contains the intervals of slot indexes covered by the blocks in a loop. This representation of the loop blocks is more efficient to compare against interfering registers during register coalescing. llvm-svn: 121917	2010-12-15 23:41:23 +00:00
Jakob Stoklund Olesen	d638b989f2	Stub out RegAllocGreedy. This new register allocator is initially identical to RegAllocBasic, but it will receive all of the tricks that RegAllocBasic won't get. RegAllocGreedy will eventually replace linear scan. llvm-svn: 121234	2010-12-08 03:26:16 +00:00
Dan Gohman	3998a0430f	Rename ExpandPseudos to ExpandISelPseudos to help clarify its role. llvm-svn: 119716	2010-11-18 18:45:06 +00:00
Dan Gohman	52a761760d	Split pseudo-instruction expansion into a separate pass, to make it easier to debug, and to avoid complications when the CFG changes in the middle of the instruction selection process. llvm-svn: 119382	2010-11-16 21:02:37 +00:00
Jakob Stoklund Olesen	3988c3fb55	Make the spiller responsible for updating the LiveStacks analysis. llvm-svn: 117337	2010-10-26 00:11:33 +00:00
Andrew Trick	7a1dadd47d	This is a prototype of an experimental register allocation framework. It's purpose is not to improve register allocation per se, but to make it easier to develop powerful live range splitting. I call it the basic allocator because it is as simple as a global allocator can be but provides the building blocks for sophisticated register allocation with live range splitting. A minimal implementation is provided that trivially spills whenever it runs out of registers. I'm checking in now to get high-level design and style feedback. I've only done minimal testing. The next step is implementing a "greedy" allocation algorithm that does some register reassignment and makes better splitting decisions. llvm-svn: 117174	2010-10-22 23:09:15 +00:00
Lang Hames	f670bff621	Moved the PBQP allocator class out of the header and back in to the cpp file to hide the gory details. Allocator instances can now be created by calling createPBQPRegisterAllocator. Tidied up use of CoalescerPair as per Jakob's suggestions. Made the new PBQPBuilder based construction process the default. The internal construction process remains in-place and available via -pbqp-builder=false for now. It will be removed shortly if the new process doesn't cause any regressions. llvm-svn: 114626	2010-09-23 04:28:54 +00:00
Duncan Sands	2a1c11e104	Stop using the dom frontier in DwarfEHPrepare by not promoting alloca's any more. I plan to reimplement alloca promotion using SSAUpdater later. It looks like Bill's URoR logic really always needs domtree, so the pass now always asks for domtree info. llvm-svn: 112597	2010-08-31 09:05:06 +00:00
Jim Grosbach	a4d3174cba	Add a local stack object block allocation pass. This is still an experimental pass that allocates locals relative to one another before register allocation and then assigns them to actual stack slots as a block later in PEI. This will eventually allow targets with limited index offset range to allocate additional base registers (not just FP and SP) to more efficiently reference locals, as well as handle situations where locals cannot be referenced via SP or FP at all (dynamic stack realignment together with variable sized objects, for example). It's currently incomplete and almost certainly buggy. Work in progress. Disabled by default and gated via the -enable-local-stack-alloc command line option. rdar://8277890 llvm-svn: 111059	2010-08-14 00:15:52 +00:00
Bill Wendling	8a7a43a1cb	Merge the OptimizeExts and OptimizeCmps passes into one PeepholeOptimizer pass. This pass should expand with all of the small, fine-grained optimization passes to reduce compile time and increase happiment. llvm-svn: 110627	2010-08-09 23:59:04 +00:00
Jim Grosbach	e4f646b03f	tidy up llvm-svn: 110476	2010-08-06 21:31:35 +00:00
Owen Anderson	f2fea95f2f	Reapply r110396, with fixes to appease the Linux buildbot gods. llvm-svn: 110460	2010-08-06 18:33:48 +00:00
Bill Wendling	0cd2ae5158	Add the Optimize Compares pass (disabled by default). This pass tries to remove comparison instructions when possible. For instance, if you have this code: sub r1, 1 cmp r1, 0 bz L1 and "sub" either sets the same flag as the "cmp" instruction or could be converted to set the same flag, then we can eliminate the "cmp" instruction all together. This is a important for ARM where the ALU instructions could set the CPSR flag, but need a special suffix ('s') to do so. llvm-svn: 110423	2010-08-06 01:32:48 +00:00
Owen Anderson	aadd8a89ca	Revert r110396 to fix buildbots. llvm-svn: 110410	2010-08-06 00:23:35 +00:00
Owen Anderson	b9762c07cb	Don't use PassInfo* as a type identifier for passes. Instead, use the address of the static ID member as the sole unique type identifier. Clean up APIs related to this change. llvm-svn: 110396	2010-08-05 23:42:04 +00:00
Jakob Stoklund Olesen	21e64c3fae	Remove double-def checking from MachineVerifier, so a register does not have to be killed before being redefined. These checks are usually disabled, and usually fail when enabled. We de facto allow live registers to be redefined without a kill, the corresponding assertions in RegScavenger were removed long ago. llvm-svn: 110362	2010-08-05 18:59:59 +00:00
Jakob Stoklund Olesen	7fe0620525	Remove the local register allocator. Please use the fast allocator instead. llvm-svn: 106051	2010-06-15 21:58:33 +00:00
Jakob Stoklund Olesen	d76041cf58	Add a -regalloc=default option that chooses a register allocator based on the -O optimization level. This only really affects llc for now because both the llvm-gcc and clang front ends override the default register allocator. I intend to remove that code later. llvm-svn: 104904	2010-05-27 23:57:25 +00:00
Jakob Stoklund Olesen	9f9fed5a7c	Remove ancient prototype. llvm-svn: 104903	2010-05-27 23:57:19 +00:00
Jakob Stoklund Olesen	8cf10fe9e4	Add fast register allocator, enabled with -regalloc=fast. So far this is just a clone of -regalloc=local that has been lobotomized to run 25% faster. It drops the least-recently-used calculations, and is just plain stupid when it runs out of registers. The plan is to make this go even faster for -O0 by taking advantage of the short live intervals in unoptimized code. It should not be necessary to calculate liveness when most virtual registers are killed 2-3 instructions after they are born. llvm-svn: 102006	2010-04-21 18:02:42 +00:00
Dan Gohman	f1f21fb3c6	Code that needs a TargetMachine should have access to one directly, rather than just getting one through a TargetLowering. llvm-svn: 101802	2010-04-19 19:05:59 +00:00
Evan Cheng	ac90d47ea3	Post regalloc LICM. Work in progress. llvm-svn: 100592	2010-04-07 00:41:17 +00:00
David Greene	7c81589636	Ok, third time's the charm. No changes from last time except the CMake source addition. Apparently the buildbots were wrong about failures. --- Add some switches helpful for debugging: -print-before=<Pass Name> Dump IR before running pass <Pass Name>. -print-before-all Dump IR before running each pass. -print-after-all Dump IR after running each pass. These are helpful when tracking down a miscompilation. It is easy to get IR dumps and do diffs on them, etc. To make this work well, add a new getPrinterPass API to Pass so that each kind of pass (ModulePass, FunctionPass, etc.) can create a Pass suitable for dumping out the kind of object the Pass works on. llvm-svn: 100249	2010-04-02 23:17:14 +00:00
Evan Cheng	499918dabf	Revert 100204. It broke a bunch of tests and apparently changed what passes are run during codegen. llvm-svn: 100207	2010-04-02 19:29:15 +00:00
David Greene	554373897c	Let's try this again. Re-apply 100143 including an apparent missing <string> include. For some reason the buildbot choked on this while my builds did not. It's probably due to a difference in system headers. --- Add some switches helpful for debugging: -print-before=<Pass Name> Dump IR before running pass <Pass Name>. -print-before-all Dump IR before running each pass. -print-after-all Dump IR after running each pass. These are helpful when tracking down a miscompilation. It is easy to get IR dumps and do diffs on them, etc. To make this work well, add a new getPrinterPass API to Pass so that each kind of pass (ModulePass, FunctionPass, etc.) can create a Pass suitable for dumping out the kind of object the Pass works on. llvm-svn: 100204	2010-04-02 18:46:26 +00:00
Eric Christopher	77e4cc4bbd	Revert r100143. llvm-svn: 100146	2010-04-01 22:54:42 +00:00
David Greene	a358be0ef7	Add some switches helpful for debugging: -print-before=<Pass Name> Dump IR before running pass <Pass Name>. -print-before-all Dump IR before running each pass. -print-after-all Dump IR after running each pass. These are helpful when tracking down a miscompilation. It is easy to get IR dumps and do diffs on them, etc. To make this work well, add a new getPrinterPass API to Pass so that each kind of pass (ModulePass, FunctionPass, etc.) can create a Pass suitable for dumping out the kind of object the Pass works on. llvm-svn: 100143	2010-04-01 22:43:57 +00:00
Evan Cheng	291c815b10	Add skeleton of a machine level cse pass. llvm-svn: 97543	2010-03-02 02:38:24 +00:00
Dan Gohman	835086ef52	Fix various doxygen warnings. llvm-svn: 96779	2010-02-22 04:10:52 +00:00
Bob Wilson	2fd80c3d94	Add a new pass on machine instructions to optimize away PHI cycles that reduce down to a single value. InstCombine already does this transformation but DAG legalization may introduce new opportunities. This has turned out to be important for ARM where 64-bit values are split up during type legalization: InstCombine is not able to remove the PHI cycles on the 64-bit values but the separate 32-bit values can be optimized. I measured the compile time impact of this (running llc on 176.gcc) and it was not significant. llvm-svn: 95951	2010-02-12 01:30:21 +00:00
Jim Grosbach	034f69e0aa	For aligned load/store instructions, it's only required to know whether a function can support dynamic stack realignment. That's a much easier question to answer at instruction selection stage than whether the function actually will have dynamic alignment prologue. This allows the removal of the stack alignment heuristic pass, and improves code quality for cases where the heuristic would result in dynamic alignment code being generated when it was not strictly necessary. llvm-svn: 93885	2010-01-19 18:31:11 +00:00
Evan Cheng	76db3bb18e	Add a quick pass to optimize sign / zero extension instructions. For targets where the pre-extension values are available in the subreg of the result of the extension, replace the uses of the pre-extension value with the result + extract_subreg. For now, this pass is fairly conservative. It only perform the replacement when both the pre- and post- extension values are used in the block. It will miss cases where the post-extension values are live, but not used. llvm-svn: 93278	2010-01-13 00:30:23 +00:00
Evan Cheng	0b005cade5	Add a pre-regalloc tail duplication pass. llvm-svn: 90567	2009-12-04 09:42:45 +00:00
Jim Grosbach	0e1230b23b	Factor the stack alignment calculations out into a target independent pass. No functionality change. llvm-svn: 90336	2009-12-02 19:30:24 +00:00
Bob Wilson	c029183683	Rename new TailDuplicationPass to avoid name conflict with the old one. llvm-svn: 89968	2009-11-26 21:38:41 +00:00
Bob Wilson	de012efdba	Split tail duplication into a separate pass. This is needed to avoid running tail duplication when doing branch folding for if-conversion, and we also want to be able to run tail duplication earlier to fix some reg alloc problems. Move the CanFallThrough function from BranchFolding to MachineBasicBlock so that it can be shared by TailDuplication. llvm-svn: 89904	2009-11-26 00:32:21 +00:00
Devang Patel	9cd7c1ab8e	Remove DebugLabelFolder pass. It is not used by dwarf writer anymore. llvm-svn: 89790	2009-11-24 19:37:07 +00:00
Bill Wendling	58923e365d	Don't put in these EH changes. llvm-svn: 85460	2009-10-29 00:37:35 +00:00
Bill Wendling	784d38511f	Reverting r85338 for now. It's causing a bootstrap failure on PPC darwin9. --- Reverse-merging r85338 into '.': U lib/CodeGen/SimpleRegisterCoalescing.cpp U lib/CodeGen/SimpleRegisterCoalescing.h llvm-svn: 85454	2009-10-29 00:22:16 +00:00
Bob Wilson	fc1194919b	Revert r85346 change to control tail merging by CodeGenOpt::Level. I'm going to redo this using the OptimizeForSize function attribute. llvm-svn: 85426	2009-10-28 20:46:46 +00:00
Bob Wilson	98c9fb94ab	Record CodeGen optimization level in the BranchFolding pass so that we can use it to control tail merging when there is a tradeoff between performance and code size. When there is only 1 instruction in the common tail, we have been merging. That can be good for code size but is a definite loss for performance. Now we will avoid tail merging in that case when the optimization level is "Aggressive", i.e., "-O3". Radar 7338114. Since the IfConversion pass invokes BranchFolding, it too needs to know the optimization level. Note that I removed the RegisterPass instantiation for IfConversion because it required a default constructor. If someone wants to keep that for some reason, we can add a default constructor with a hard-wired optimization level. llvm-svn: 85346	2009-10-27 23:49:38 +00:00
Evan Cheng	e1fbdc5244	Change createPostRAScheduler so it can be turned off at llc -O1. llvm-svn: 84273	2009-10-16 21:06:15 +00:00
Evan Cheng	491d179d97	Remove simple regalloc. It has bit rotted. llvm-svn: 82127	2009-09-17 05:48:07 +00:00
Chris Lattner	9a542aae19	remove std::ostream versions of printing stuff for MBB and MF, upgrading a few things to use raw_ostream llvm-svn: 79811	2009-08-23 03:13:20 +00:00
Jim Grosbach	4643e96d36	Move the sjlj exception handling conversions to a back-end pass where they more properly belong. This allows removing the front-end conditionalized SJLJ code, and cleans up the generated IR considerably. All of the infrastructure code (calling _Unwind_SjLj_Register/Unregister, etc) is added by the SjLjEHPrepare pass. llvm-svn: 79250	2009-08-17 16:41:22 +00:00
Daniel Dunbar	9bbadad09d	Fix some comments referring to std::cerr. llvm-svn: 77931	2009-08-03 01:02:24 +00:00
Dan Gohman	f28b3bb262	Reapply r77654 with a fix: MachineFunctionPass's getAnalysisUsage shouldn't do AU.setPreservesCFG(), because even though CodeGen passes don't modify the LLVM IR CFG, they may modify the MachineFunction CFG, and passes like MachineLoop are registered with isCFGOnly set to true. llvm-svn: 77691	2009-07-31 18:16:33 +00:00

1 2 3

111 Commits