Commit Graph

170 Commits

Author SHA1 Message Date
Jakob Stoklund Olesen
eb77ff7f7d Stop tracking spill slot uses in VirtRegMap.
Nobody cared; StackSlotColoring scans the instructions to find used stack
slots.

llvm-svn: 144485
2011-11-13 01:23:30 +00:00
Jakob Stoklund Olesen
7b1107fe0e Update split candidate correctly when interference cache is full.
No test case, spotted by inspection.

llvm-svn: 143407
2011-11-01 00:02:31 +00:00
Jakob Stoklund Olesen
44b1ea8bae Ignore the cloning of unknown registers.
The LRE_DidCloneVirtReg callback may be called with virtual registers
that RAGreedy doesn't even know about yet.  In that case, there are no
data structures to update.

llvm-svn: 139702
2011-09-14 17:34:37 +00:00
Jakob Stoklund Olesen
a147642106 Remove the -compact-regions flag.
It has been enabled by default for a while; it was only there to allow
performance comparisons.

llvm-svn: 139501
2011-09-12 16:54:42 +00:00
Jakob Stoklund Olesen
e43edf1b4a Add an interface for SplitKit complement spill modes.
SplitKit always computes a complement live range to cover the places
where the original live range was live, but no explicit region has been
allocated.

Currently, the complement live range is created to be as small as
possible - it never overlaps any of the regions.  This minimizes
register pressure, but if the complement is going to be spilled anyway,
that is not very important.  The spiller will eliminate redundant
spills, and hoist others by making the spill slot live range overlap
some of the regions created by splitting.  Stack slots are cheap.

This patch adds the interface to enable spill modes in SplitKit.  In
spill mode, SplitKit will assume that the complement is going to spill,
so it will allow it to overlap regions in order to avoid back-copies.
By doing some of the spiller's work early, the complement live range
becomes simpler.  In some cases, it can become much simpler because no
extra PHI-defs are required.  This will speed up both splitting and
spilling.

This is only the interface to enable spill modes; no implementation yet.
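
For illustration, a minimal standalone C++ sketch of what such an interface
could look like; the enum values and method names here are assumptions, not
the committed SplitKit API.

  // Illustrative sketch only; names are assumptions, not the real interface.
  enum ComplementSpillMode {
    SM_Partition, // complement never overlaps the split regions (current behavior)
    SM_Spill      // complement may overlap regions to avoid back-copies,
                  // assuming it is going to be spilled anyway
  };

  class SplitEditorSketch {
    ComplementSpillMode SpillMode = SM_Partition;

  public:
    // The client (e.g. the greedy allocator) picks the mode before splitting,
    // based on whether it expects the complement to be spilled.
    void reset(ComplementSpillMode SM) { SpillMode = SM; }
    bool complementMayOverlapRegions() const { return SpillMode != SM_Partition; }
  };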

llvm-svn: 139500
2011-09-12 16:49:21 +00:00
Benjamin Kramer
0d60a5573c Make a bunch of symbols private.
llvm-svn: 138025
2011-08-19 01:42:18 +00:00
Jakob Stoklund Olesen
2f58336f2f Refer to the RegisterCoalescer pass by ID.
A public interface is no longer needed since RegisterCoalescer is not an
analysis any more.

llvm-svn: 137082
2011-08-09 00:29:53 +00:00
Jakob Stoklund Olesen
32649cbc65 Fix typo. Thanks, Andy!
llvm-svn: 137023
2011-08-06 18:20:24 +00:00
Jakob Stoklund Olesen
a8eabeb2b3 Reject RS_Spill ranges from local splitting as well.
All new local ranges are marked as RS_New now, so there is no need to
attempt splitting of RS_Spill ranges any more.

llvm-svn: 137002
2011-08-05 23:50:33 +00:00
Jakob Stoklund Olesen
af4cd7e67f Only mark remainder intervals as RS_Spill after per-block splitting.
The local ranges created get to stay in the RS_New stage, just like for
local and region splitting.

This gives tryLocalSplit a bit more freedom the first time it sees one
of these new local ranges.

llvm-svn: 137001
2011-08-05 23:50:31 +00:00
Jakob Stoklund Olesen
3f2f3056e5 Remember to update LiveDebugVariables after per-block splitting.
llvm-svn: 136996
2011-08-05 23:10:40 +00:00
Jakob Stoklund Olesen
6225dc24a6 Extract per-block splitting into its own method.
No functional change.

llvm-svn: 136994
2011-08-05 23:04:18 +00:00
Jakob Stoklund Olesen
d352ade356 Also use shouldSplitSingleBlock() in the fallback splitting mode.
Drop the use of SplitAnalysis::getMultiUseBlocks; there is no need to go
through a SmallPtrSet any more.

llvm-svn: 136992
2011-08-05 22:43:23 +00:00
Jakob Stoklund Olesen
83a365a416 Split around single instructions to enable register class inflation.
Normally, we don't create a live range for a single instruction in a
basic block; the spiller does that anyway. However, when splitting a
live range that belongs to a proper register sub-class, inserting these
extra COPY instructions completely removes the constraints from the
remainder interval, and it may be allocated from the larger super-class.

The spiller will mop up these small live ranges if we end up spilling
anyway. It calls them snippets.

llvm-svn: 136989
2011-08-05 22:20:45 +00:00
Jakob Stoklund Olesen
5c1e079053 Enable compact region splitting by default.
This helps generate better code in functions with high register
pressure.

The previous version of compact region splitting caused regressions
because the regions were a bit too large. A stronger negative bias
applied in r136832 fixed this problem.

llvm-svn: 136836
2011-08-03 23:16:09 +00:00
Jakob Stoklund Olesen
faae878e65 Be more conservative when forming compact regions.
Apply twice the negative bias on transparent blocks when computing the
compact regions. This excludes loop backedges from the region when only
one of the loop blocks uses the register.

Previously, we would include the backedge in the region if the loop
preheader and the loop latch both used the register, but the loop header
didn't.

When both the header and latch blocks use the register, we still keep it
live on the backedge.

llvm-svn: 136832
2011-08-03 23:09:38 +00:00
Chandler Carruth
b2c1fe2ccc Fix some warnings from Clang in release builds:
lib/CodeGen/RegAllocGreedy.cpp:1176:18: warning: unused variable 'B' [-Wunused-variable]
    if (unsigned B = Cand.getBundles(BundleCand, BestCand)) {
                 ^
lib/CodeGen/RegAllocGreedy.cpp:1188:18: warning: unused variable 'B' [-Wunused-variable]
    if (unsigned B = Cand.getBundles(BundleCand, 0)) {
                 ^

llvm-svn: 136831
2011-08-03 23:07:27 +00:00
Jakob Stoklund Olesen
ec971376f9 Use the precomputed def presence in RAGreedy::calcSpillCost.
llvm-svn: 136742
2011-08-02 23:04:08 +00:00
Jakob Stoklund Olesen
ffdbb7cc16 Inform SpillPlacement about blocks with defs.
This information is not used for anything yet.

llvm-svn: 136741
2011-08-02 23:04:06 +00:00
Jakob Stoklund Olesen
4b62c6ea69 Rename {First,Last}Use to {First,Last}Instr.
With a 'FirstDef' field right there, it is very confusing that FirstUse
refers to an instruction that may be a def.

llvm-svn: 136739
2011-08-02 22:54:14 +00:00
Jakob Stoklund Olesen
81247d41e4 Time the emission of debug values.
llvm-svn: 136584
2011-07-31 03:53:42 +00:00
Jakob Stoklund Olesen
0caedc8db8 Revert r136528 "Enable compact region splitting by default."
While this generally helped x86-64, there was some large regressions
for i386.

llvm-svn: 136571
2011-07-30 17:19:14 +00:00
Jakob Stoklund Olesen
a77df82448 Enable compact region splitting by default.
This helps generate better code in functions with high register
pressure.

llvm-svn: 136528
2011-07-29 22:10:27 +00:00
Jakob Stoklund Olesen
55ca982766 Reverse order of RS_Split live ranges under -compact-regions.
There are two conflicting strategies in play:

- Under high register pressure, we want to assign large live ranges
  first. Smaller live ranges are easier to place afterwards.

- Live range splitting is guided by interference, so splitting should be
  deferred until interference is as realistic as possible.

With the recent changes to the live range stages, and with compact
regions enabled, it is less traumatic to split a live range too early.
If some of the split products are too big, they can often be split
again.

By reversing the RS_Split order, we get this queue order:

1. Normal live ranges, large to small.
2. RS_Split live ranges, large to small.

The large-to-small order improves RAGreedy's puzzle solving skills under
high register pressure. It may cause a bit more iterated splitting, but
we handle that better now.
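
A standalone sketch of that queue order follows; the struct and comparator
are illustrative, not RAGreedy's actual priority code.

  #include <cstdint>
  #include <queue>
  #include <vector>

  enum class Stage { Normal, RS_Split };

  struct QueueEntry {
    Stage S;
    uint64_t Size;    // approximate size of the live range
    unsigned VirtReg;
  };

  // Normal ranges are dequeued before RS_Split ranges; within each group,
  // larger ranges come out first.
  struct LowerPriority {
    bool operator()(const QueueEntry &A, const QueueEntry &B) const {
      if (A.S != B.S)
        return A.S == Stage::RS_Split; // RS_Split sorts behind Normal
      return A.Size < B.Size;          // larger size pops first
    }
  };

  using AllocationQueue =
      std::priority_queue<QueueEntry, std::vector<QueueEntry>, LowerPriority>;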

With this change, -compact-regions is mostly an improvement on SPEC.

llvm-svn: 136388
2011-07-28 20:48:23 +00:00
Jakob Stoklund Olesen
61ae16da49 Add support for multi-way live range splitting.
When splitting global live ranges, it is now possible to split for
multiple destination intervals at once. Previously, we only had the main
and stack intervals.

Each edge bundle is assigned to a split candidate, and splitAroundRegion
will insert copies between the candidate intervals and the stack
interval as needed.

The multi-way splitting is used to split around compact regions when
enabled with -compact-regions. The best candidate register still gets
all the bundles it wants, but everything outside the main interval is
first split around compact regions before we create single-block
intervals.
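
A minimal sketch of the bundle-to-candidate bookkeeping described above;
the data structure and method names are illustrative.

  #include <vector>

  // Bundles not assigned to any candidate end up in the stack interval.
  constexpr unsigned NoCand = ~0u;

  struct BundleAssignmentSketch {
    std::vector<unsigned> BundleCand; // indexed by edge bundle number

    explicit BundleAssignmentSketch(unsigned NumBundles)
        : BundleCand(NumBundles, NoCand) {}

    void assign(unsigned Bundle, unsigned CandIdx) { BundleCand[Bundle] = CandIdx; }

    // A copy is needed wherever two adjacent bundles map to different
    // intervals (or one of them maps to the stack interval).
    bool needsCopyBetween(unsigned BundleA, unsigned BundleB) const {
      return BundleCand[BundleA] != BundleCand[BundleB];
    }
  };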

Compact region splitting still causes some regressions, so it is not
enabled by default.

llvm-svn: 136186
2011-07-26 23:41:46 +00:00
Jakob Stoklund Olesen
e6475ac496 Revert to RS_Assign when a virtreg separates into components.
When dead code elimination deletes a PHI value, the virtual register may
split into multiple connected components. In that case, revert each
component to the RS_Assign stage.

The new components are guaranteed to be smaller (the original value
numbers are distributed among the components), so this will always be
making progress. The components are now allowed to evict other live
ranges or be split again.

llvm-svn: 136034
2011-07-26 00:54:56 +00:00
Jakob Stoklund Olesen
1b71204b66 Add an RS_Split2 stage used for loop prevention.
This mechanism already exists, but the RS_Split2 stage makes it clearer.

When live range splitting creates ranges that may not be making
progress, they are marked RS_Split2 instead of RS_New. These ranges may
be split again, but only in a way that can be proven to make progress.

For local ranges, that means they must be split into ranges used by
strictly fewer instructions.

For global ranges, region splitting is bypassed and the RS_Split2
ranges go straight to per-block splitting.
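
A sketch of the local-range progress rule, with illustrative names (not the
actual RAGreedy check):

  #include <cstddef>

  enum class LRStage { RS_New, RS_Split2 };

  // An RS_Split2 range may only be split locally if every new range is used
  // by strictly fewer instructions than the original; RS_New ranges are not
  // restricted this way.
  bool localSplitMakesProgress(LRStage Stage, std::size_t OrigInstrs,
                               std::size_t LargestPieceInstrs) {
    if (Stage != LRStage::RS_Split2)
      return true;
    return LargestPieceInstrs < OrigInstrs;
  }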

llvm-svn: 135912
2011-07-25 15:25:43 +00:00
Jakob Stoklund Olesen
e295d32875 Rename live range stages to better reflect how they are used.
The stage is used to control where a live range is going, not where it
is coming from. Live ranges created by splitting will usually be marked
RS_New, but some are marked RS_Spill to avoid wasting time trying to
split them again.

The old RS_Global and RS_Local stages are merged - they are really the
same thing for local and global live ranges.

llvm-svn: 135911
2011-07-25 15:25:41 +00:00
Jakob Stoklund Olesen
e5b06d7a17 Add RAGreedy::calcCompactRegion.
This method computes the edge bundles that should be live when splitting
around a compact region. This is independent of interference.

The function returns false if the live range was already a compact
region, or the compact region doesn't have any live bundles - it would
be the same as splitting around basic blocks.

Compact regions are computed using the normal spill placement code. We
pretend there is interference in all live-through blocks that don't use
the live range. This removes all edges from the Hopfield network used
for spill placement, so it converges instantly.
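
Conceptually, the constraint set-up looks something like the following
sketch; the names and the BorderPref enum are illustrative, not
SpillPlacement's real interface.

  #include <vector>

  enum class BorderPref { PreferReg, MustSpill };

  struct BlockConstraintSketch {
    unsigned Number;          // machine basic block number
    BorderPref Entry, Exit;
  };

  // Treat every live-through block without a use as if it had interference.
  // That removes its edges from the Hopfield network, so the placement
  // problem converges immediately.
  void addCompactRegionConstraints(const std::vector<unsigned> &LiveThrough,
                                   const std::vector<bool> &BlockHasUse,
                                   std::vector<BlockConstraintSketch> &Out) {
    for (unsigned BB : LiveThrough) {
      BorderPref P = BlockHasUse[BB] ? BorderPref::PreferReg
                                     : BorderPref::MustSpill;
      Out.push_back({BB, P, P});
    }
  }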

llvm-svn: 135847
2011-07-23 03:41:57 +00:00
Jakob Stoklund Olesen
ec1337a6e8 Prepare RAGreedy::growRegion for compact regions.
A split candidate can have a null PhysReg, which means that it doesn't
map to a real interference pattern. Instead, pretend that all through
blocks have interference.

This makes it possible to generate compact regions where the live range
doesn't go through blocks that don't use it. The live range will still
be live between directly connected blocks with uses.

Splitting around a compact region tends to produce a live range with a
high spill weight, so it may evict a less dense live range.

llvm-svn: 135845
2011-07-23 03:22:33 +00:00
Frits van Bommel
6c24f9c277 Migrate LLVM and Clang to use the new makeArrayRef(...) functions where previously explicit non-default constructors were used.
Mostly mechanical with some manual reformatting.

llvm-svn: 135390
2011-07-18 12:00:32 +00:00
Jakub Staszak
be41018cf4 Remove unused LoopRanges from RegAllocGreedy.
llvm-svn: 135354
2011-07-16 20:43:00 +00:00
Jakob Stoklund Olesen
987cd08002 Extract parts of RAGreedy::splitAroundRegion as SplitKit methods.
This gets rid of some of the gory splitting details in RAGreedy and
makes them available to future SplitKit clients.

Slightly generalize the functionality to support multi-way splitting.
Specifically, SplitEditor::splitLiveThroughBlock() supports switching
between different register intervals in a block.

llvm-svn: 135307
2011-07-15 21:47:57 +00:00
Jakob Stoklund Olesen
b0af7bda8d Reapply r135121 with a fixed copy constructor.
Original commit message:

Count references to interference cache entries.

Each InterferenceCache::Cursor instance references a cache entry. A
non-zero reference count guarantees that the entry won't be reused for a
new register.

This makes it possible to have multiple live cursors examining
interference for different physregs.

The total number of live cursors into a cache must be kept below
InterferenceCache::getMaxCursors().

Code generation should be unaffected by this change, and it doesn't seem
to affect the cache replacement strategy either.
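
The idea can be illustrated with a small standalone sketch; the class and
member names are made up for illustration and are not InterferenceCache's
real interface.

  #include <cassert>

  struct CacheEntrySketch {
    unsigned PhysReg = 0;
    unsigned RefCount = 0; // the entry may only be recycled when this is zero
  };

  class CursorSketch {
    CacheEntrySketch *Entry = nullptr;

  public:
    explicit CursorSketch(CacheEntrySketch &E) : Entry(&E) { ++Entry->RefCount; }

    // Copying a cursor must also bump the count, so the entry stays pinned
    // as long as any cursor refers to it.
    CursorSketch(const CursorSketch &Other) : Entry(Other.Entry) {
      if (Entry)
        ++Entry->RefCount;
    }
    CursorSketch &operator=(const CursorSketch &) = delete;

    ~CursorSketch() {
      if (Entry) {
        assert(Entry->RefCount > 0 && "unbalanced reference count");
        --Entry->RefCount;
      }
    }
  };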

llvm-svn: 135130
2011-07-14 05:35:11 +00:00
Jakob Stoklund Olesen
718e76d4dd Revert r135121 which broke a gcc-4.2 builder.
llvm-svn: 135122
2011-07-14 00:58:38 +00:00
Jakob Stoklund Olesen
7d17ec883e Count references to interference cache entries.
Each InterferenceCache::Cursor instance references a cache entry. A
non-zero reference count guarantees that the entry won't be reused for a
new register.

This makes it possible to have multiple live cursors examining
interference for different physregs.

The total number of live cursors into a cache must be kept below
InterferenceCache::getMaxCursors().

Code generation should be unaffected by this change, and it doesn't seem
to affect the cache replacement strategy either.

llvm-svn: 135121
2011-07-14 00:31:14 +00:00
Jakob Stoklund Olesen
b1011cf255 Reapply r135074 and r135080 with a fix.
The cache entry referenced by the best split candidate could become
clobbered by an unsuccessful candidate.

The correct fix here is to use reference counts on the cache entries.
Coming up.

llvm-svn: 135113
2011-07-14 00:17:10 +00:00
Jakob Stoklund Olesen
5102a957a1 Revert r135074 and r135080. They broke clamscan.
llvm-svn: 135096
2011-07-13 22:20:09 +00:00
Jakob Stoklund Olesen
db508ece2d Only keep the global split candidates that work out.
Some physical registers create split solutions that would spill anywhere.
They should not even be considered in future multi-way global splits.

This does not affect code generation (yet).

llvm-svn: 135080
2011-07-13 20:49:46 +00:00
Jakob Stoklund Olesen
a1cd88cd42 Move the InterferenceCache cursor into the GlobalSplitCand struct.
This is in preparation of supporting multiple global split candidates in
a single live range split operation.

llvm-svn: 135074
2011-07-13 20:14:52 +00:00
Jakob Stoklund Olesen
acaf9e9ce1 Be more aggressive about following hints.
RAGreedy::tryAssign will now evict interference from the preferred
register even when another register is free.

To support this, add the EvictionCost struct that counts how many hints
are broken by an eviction. We don't want to break one hint just to
satisfy another.

Rename canEvict to shouldEvict, and add the first bit of eviction policy
that doesn't depend on spill weights: Always make room in the preferred
register as long as the evictees can be split and aren't already
assigned to their preferred register.

Also make the CSR avoidance more accurate. When looking for a cheaper
register it is OK to use a new volatile register. Only CSR aliases that
have never been used before should be avoided.
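
A sketch of the eviction-cost comparison described above; the field names
are illustrative, not necessarily the committed struct.

  struct EvictionCostSketch {
    unsigned BrokenHints = 0; // number of hinted evictees that would be displaced
    float MaxWeight = 0.0f;   // heaviest spill weight among the evictees

    // Breaking fewer hints always wins; spill weight only breaks ties, so we
    // never break one hint just to satisfy another.
    bool isCheaperThan(const EvictionCostSketch &Other) const {
      if (BrokenHints != Other.BrokenHints)
        return BrokenHints < Other.BrokenHints;
      return MaxWeight < Other.MaxWeight;
    }
  };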

llvm-svn: 134735
2011-07-08 20:46:18 +00:00
Jakob Stoklund Olesen
57f59c98ed Break infinite loop when the Hopfield network oscillates.
This is impossible in theory; I can prove it. In practice, our near-zero
threshold can cause the network to oscillate between equally good
solutions.

<rdar://problem/9720596>

llvm-svn: 134428
2011-07-05 18:46:42 +00:00
Jakob Stoklund Olesen
c380e517fd Tweak comment and debug output.
llvm-svn: 134412
2011-07-05 15:38:37 +00:00
Jakob Stoklund Olesen
9950d41b39 Fix PR10244.
A split point inserted in a block with a landing pad successor may be
hoisted above the call to ensure that it dominates all successors. The
code that handles the rest of the basic block must take this into
account.

I am not including a test case, it would be very fragile. PR10244 comes
from building clang with exceptions enabled.

llvm-svn: 134369
2011-07-04 00:05:28 +00:00
Jakob Stoklund Olesen
60871c3ee0 Use a new strategy for preventing eviction loops in RAGreedy.
Every live range is assigned a cascade number the first time it is
involved in an eviction. As the evictor, it gets a new cascade number.
Every evictee is assigned the same cascade number as the evictor.

Eviction is prohibited if the evictor has a lower assigned cascade
number than the evictee.

This means that assigned cascade numbers are monotonically increasing
with every eviction, yet they are bounded by NextCascade which can only
be incremented by new live ranges. Thus, infinite loops cannot happen,
but eviction cascades can still be triggered by new live ranges as we
want.

Thanks to Andy for explaining this to me.
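
As a standalone sketch of the rule (illustrative names, not the actual
implementation):

  struct CascadeInfo {
    unsigned Cascade = 0; // 0 = never involved in an eviction yet
  };

  // Called the first time a live range becomes an evictor; its evictees are
  // then stamped with the same number.
  unsigned assignCascade(CascadeInfo &Evictor) {
    static unsigned NextCascade = 1; // only grows as new live ranges appear
    if (Evictor.Cascade == 0)
      Evictor.Cascade = NextCascade++;
    return Evictor.Cascade;
  }

  // Eviction is prohibited when the evictor's cascade number is lower than
  // the evictee's, so cascade numbers rise monotonically along any chain of
  // evictions and the chain must terminate.
  bool mayEvict(unsigned EvictorCascade, const CascadeInfo &Evictee) {
    return EvictorCascade >= Evictee.Cascade;
  }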

llvm-svn: 134303
2011-07-02 01:37:09 +00:00
Jakob Stoklund Olesen
58a24d9ecc Reapply r134047 now that the world is ready for it.
This patch will sometimes choose live range split points next to
interference instead of always splitting next to a register point. That
means spill code can now appear almost anywhere, and it was necessary
to fix code that didn't expect that.

The difficult places were:

- Between a CALL returning a value on the x87 stack and the
  corresponding FpPOP_RETVAL (was FpGET_ST0). Probably also near x87
  inline assembly, but that didn't actually show up in testing.

- Between a CALL popping arguments off the stack and the corresponding
  ADJCALLSTACKUP.

Both are fixed now. The only place spill code can't appear is after
terminators, see SplitAnalysis::getLastSplitPoint.

Original commit message:

Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art.

This function has to deal with a lot of special cases, and the old
version got it wrong sometimes. In particular, it would sometimes leave
multiple uses in the stack interval in a single block. That causes bad
code with multiple reloads in the same basic block.

The new version handles block entry and exit in a single pass. It first
eliminates all the easy cases, and then goes on to create a local
interval for the blocks with difficult interference. Previously, we
would only create the local interval for completely isolated blocks.

It can happen that the stack interval becomes completely empty because
we could allocate a register in all edge bundles, and the new local
intervals deal with the interference. The empty stack interval is
harmless, but we need to remove a SplitKit assertion that checks for
empty intervals.

llvm-svn: 134125
2011-06-30 01:30:39 +00:00
Jakob Stoklund Olesen
fbf3ec7692 Revert r134047 while investigating an llvm-gcc-i386-linux-selfhost
miscompile.

llvm-svn: 134053
2011-06-29 02:03:36 +00:00
Jakob Stoklund Olesen
db62d3a5d0 Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art.
This function has to deal with a lot of special cases, and the old
version got it wrong sometimes. In particular, it would sometimes leave
multiple uses in the stack interval in a single block. That causes bad
code with multiple reloads in the same basic block.

The new version handles block entry and exit in a single pass. It first
eliminates all the easy cases, and then goes on to create a local
interval for the blocks with difficult interference. Previously, we
would only create the local interval for completely isolated blocks.

It can happen that the stack interval becomes completely empty because
we could allocate a register in all edge bundles, and the new local
intervals deal with the interference. The empty stack interval is
harmless, but we need to remove a SplitKit assertion that checks for
empty intervals.

llvm-svn: 134047
2011-06-29 00:24:24 +00:00
Rafael Espindola
45a2fa5664 There is only one register coalescer. Merge it into the base class and
remove the analysis group.

llvm-svn: 133899
2011-06-26 22:34:10 +00:00
Rafael Espindola
7ad658a832 Move RegisterCoalescer.h to lib/CodeGen.
llvm-svn: 133895
2011-06-26 21:41:06 +00:00