llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-29 23:12:55 +01:00

Author	SHA1	Message	Date
Chris Lattner	59dd979162	Teach the local spiller to turn stack slot loads into register-register copies when possible, avoiding the load (and avoiding the copy if the value is already in the right register). This patch came about when I noticed code like the following being generated: store R17 -> [SS1] ...blah... R4 = load [SS1] This was causing an LSU reject on the G5. This problem was due to the register allocator folding spill code into a reg-reg copy (producing the load), which prevented the spiller from being able to rewrite the load into a copy, despite the fact that the value was already available in a register. In the case above, we now rip out the R4 load and replace it with a R4 = R17 copy. This speeds up several programs on X86 (which spills a lot :) ), e.g. smg2k from 22.39->20.60s, povray from 12.93->12.66s, 168.wupwise from 68.54->53.83s (!), 197.parser from 7.33->6.62s (!), etc. This may have a larger impact in some cases on the G5 (by avoiding LSU rejects), though it probably won't trigger as often (less spilling in general). Targets that implement folding of loads/stores into copies should implement the isLoadFromStackSlot hook to get this. llvm-svn: 23388	2005-09-19 06:56:21 +00:00
Chris Lattner	e7610bc599	Use continue in the use-processing loop to make it clear what the early exits are, simplify logic, and cause things to not be nested as deeply. This also uses MRI->areAliases instead of an explicit loop. No functionality change, just code cleanup. llvm-svn: 23296	2005-09-09 20:29:51 +00:00
Misha Brukman	774e55c446	Remove trailing whitespace llvm-svn: 21420	2005-04-21 22:36:52 +00:00
Chris Lattner	f81edb57b6	Make sure to notice that explicit physregs are used in the function llvm-svn: 21084	2005-04-04 21:35:34 +00:00
Chris Lattner	964297fc32	Update these register allocators to set the PhysRegUsed info in MachineFunction. llvm-svn: 19791	2005-01-23 22:45:13 +00:00
Chris Lattner	2087f3c8e9	Improve compatibility with acc llvm-svn: 19549	2005-01-14 15:54:24 +00:00
Chris Lattner	a361504a90	Clean up the MachineBasicBlock.h file, percolating #includes into this file. Patch contributed by Morten Ofstad llvm-svn: 17251	2004-10-26 15:35:58 +00:00
Chris Lattner	34acee9dbd	This patch fixes the nasty bug that caused 175.vpr to fail for X86 last night. The problem occurred when trying to reload this instruction: MOV32mr %reg2326, 8, %reg2297, 4, %reg2295 The value of reg2326 was available in EBX, so it was reused from there, instead of reloading it into EDX. The value of reg2297 was available in EDX, so it was reused from there, instead of reloading it into EDI. The value of reg2295 was not available, so we tried reloading it into EBX, its assigned register. However, we checked and saw that we already reloaded something into EBX, so we chose what reg2326 was assigned to (EDX) and reloaded into that register instead. Unfortunately EDX had already been used by reg2297, so reloading into EDX clobbered the value used by the reg2326 operand, breaking the program. The fix for this is to check that the newly picked register is ok. In this case we now find that EDX is already used and try using EDI, which succeeds. llvm-svn: 17006	2004-10-15 03:19:31 +00:00
Chris Lattner	2c87b68231	This patch adds and improves debugging output. No functionality changes. llvm-svn: 17005	2004-10-15 03:16:29 +00:00
Chris Lattner	815b635639	Do not repeat the map lookup llvm-svn: 16633	2004-10-01 23:16:43 +00:00
Chris Lattner	c521544a32	When a virtual register is folded into an instruction, keep track of whether it was a use, def, or both. This allows us to be less pessimistic in our analysis of them. In practice, this doesn't make a big difference, but it doesn't hurt either. llvm-svn: 16632	2004-10-01 23:15:36 +00:00
Chris Lattner	38467b8a66	Add a simple little improvement to the local spiller to keep track of stores and delete them if they turn out to be dead. This is a useful little hack that even speeds up some programs. For example, it speeds up Ptrdist/ks from 17.53s to 15.59s, and 188.ammp from 149s to 146s. This also speeds up llc :) llvm-svn: 16630	2004-10-01 19:47:12 +00:00
Chris Lattner	8a5b40154f	Substantially revamp the local spiller, causing it to actually improve the generated code over the simple spiller. The new local spiller generates substantially better code than the simple one in some cases, by reusing values that are loaded out of stack slots and kept available in registers. This primarily helps programs that are spilling a lot, and there is still stuff that can be done to improve it. This patch makes the local spiller the default, as it's only a tiny bit slower than the simple spiller (it increases the runtime of llc by < 1%). Here are some numbers with speedups. Program #reuse old(s) new(s) Speedup Povray: 3452, 16.87 -> 15.93 (5.5%) 177.mesa: 2176, 2.77 -> 2.76 (0%) 179.art: 35, 28.43 -> 28.01 (1.5%) 183.equake: 55, 61.44 -> 61.41 (0%) 188.ammp: 869, 174 -> 149 (15%) 164.gzip: 43, 40.73 -> 40.71 (0%) 175.vpr: 351, 18.54 -> 17.34 (6.5%) 176.gcc: 2471, 5.01 -> 4.92 (1.8%) 181.mcf 42, 79.30 -> 75.20 (5.2%) 186.crafty: 484, 29.73 -> 30.04 (-1%) 197.parser: 251, 10.47 -> 10.67 (-1%) 252.eon: 1501, 1.98 -> 1.75 (12%) 253.perlbm: 1183, 14.83 -> 14.42 (2.8%) 254.gap: 825, 7.46 -> 7.29 (2.3%) 255.vortex: 285, 10.51 -> 10.27 (2.3%) 256.bzip2: 63, 55.70 -> 55.20 (0.9%) 300.twolf: 830, 21.63 -> 22.00 (-1%) PtrDist/ks 14, 32.75 -> 17.53 (46.5%) Olden/tsp 46, 8.71 -> 8.24 (5.4%) Free/distray 70, 1.09 -> 0.99 (9.2%) llvm-svn: 16629	2004-10-01 19:04:51 +00:00
Chris Lattner	db2a0987cc	Use more efficient map operations. Fix a bug that would affect hypothetical targets that supported multiple memory operands. llvm-svn: 16614	2004-09-30 16:35:08 +00:00
Chris Lattner	8d8b8b05bd	There is no need to call MachineInstr::print directly, just send the MI& to an ostream. llvm-svn: 16613	2004-09-30 16:10:45 +00:00
Chris Lattner	168a4380d9	Simplify the logic in the simple spiller and capitalize some variables llvm-svn: 16609	2004-09-30 02:59:33 +00:00
Chris Lattner	1f61bfb971	Switch from defaulting to the 'local' spiller to the 'simple' spiller. The two spillers produce perfectly identical code (at least on povray and eon), but the simple spiller is substantially faster than the local spiller. Once the local spiller is improved, we can switch back. Switching cuts 5.2% off of the llc time for povray (about 1.3s). llvm-svn: 16608	2004-09-30 02:40:06 +00:00
Chris Lattner	d716b5739c	Don't use a densemap for keeping track of which vregs are already loaded, just use a simple vector. This speeds up -spiller=simple from taking 22s to taking .1s on povray (debug build). This change does not modify the generated code. llvm-svn: 16607	2004-09-30 02:33:48 +00:00
Chris Lattner	d27ff8035e	Use longer and more explicit names for instance vars (particularly important data structures). Fix the print method to send to the right ostream, not always cerr. Delete typedefs that are only used once. llvm-svn: 16606	2004-09-30 02:15:18 +00:00
Chris Lattner	418207045d	Reindent code, improve comments, move huge nested methods out of classes, prune #includes, add print/dump methods, etc. No functionality changes. llvm-svn: 16604	2004-09-30 01:54:45 +00:00
Reid Spencer	c4abcbefb1	Changes For Bug 352 Move include/Config and include/Support into include/llvm/Config, include/llvm/ADT and include/llvm/Support. From here on out, all LLVM public header files must be under include/llvm/. llvm-svn: 16137	2004-09-01 22:55:40 +00:00
Chris Lattner	463d46f17a	Stop using CreateStackObject(RegClass*) llvm-svn: 15775	2004-08-15 22:02:22 +00:00
Chris Lattner	e58190f5f6	These methods no longer take a TargetRegisterClass* operand. llvm-svn: 15774	2004-08-15 21:56:44 +00:00
Brian Gaeke	f18cdca667	These files don't need to include <iostream> since they include "Support/Debug.h". llvm-svn: 15089	2004-07-21 20:50:33 +00:00
Chris Lattner	c014005d53	Fix IA64 compatibility llvm-svn: 14866	2004-07-16 00:06:01 +00:00
Tanya Lattner	da38dc5180	Made a fix so that you can print out MachineInstrs that belong to a MachineBasicBlock that is not yet attached to a MachineFunction. This change includes changing the third operand (TargetMachine) to a pointer for the MachineInstr::print function. llvm-svn: 14389	2004-06-25 00:13:11 +00:00
Chris Lattner	0046ea7105	Adjust to new TargetMachine interface llvm-svn: 13956	2004-06-02 05:57:12 +00:00
Alkis Evlogimenos	516e16441b	Add method to assign stack slot to virtual register without creating a new one. llvm-svn: 13895	2004-05-29 20:38:05 +00:00
Alkis Evlogimenos	0e0433a791	Spill explicit physical register defs as well. llvm-svn: 12260	2004-03-09 08:35:13 +00:00
Alkis Evlogimenos	02e8f14c6a	As I wrote in the docs, simple is the default spiller :-) llvm-svn: 12189	2004-03-06 23:08:44 +00:00
Alkis Evlogimenos	467364b706	Add simple spiller. llvm-svn: 12188	2004-03-06 22:38:29 +00:00
Alkis Evlogimenos	c4ad8080f7	Add a spiller option to llc. A simple spiller will come soon. When we get CFG in the machine code represenation a global spiller will also be possible. Also document the linear scan register allocator but mark it as experimental for now. llvm-svn: 12062	2004-03-01 23:18:15 +00:00
Alkis Evlogimenos	64e76a0d43	Add the long awaited memory operand folding support for linear scan llvm-svn: 12058	2004-03-01 20:05:10 +00:00
Alkis Evlogimenos	63fda9c474	Make spiller push stores right after the definition of a register so that they are as far away from the loads as possible. llvm-svn: 11895	2004-02-27 04:51:35 +00:00
Alkis Evlogimenos	cff0fc180c	Clear maps right after basic block is processed. llvm-svn: 11892	2004-02-26 23:22:23 +00:00
Alkis Evlogimenos	af42cbf42f	Fix bugs found with recent addition of assertions in MRegisterInfo::is{Physical,Virtual}Register. llvm-svn: 11849	2004-02-25 23:21:52 +00:00
Alkis Evlogimenos	f1516015af	Add DenseMap template and actually use it for for mapping virtual regs to objects. llvm-svn: 11840	2004-02-25 21:55:45 +00:00
Alkis Evlogimenos	6d7150e9bb	Move machine code rewriter and spiller outside the register allocator. The implementation is completely rewritten and now employs several optimizations not exercised before. For example for 164.gzip we have 997 loads and 699 stores vs the 1221 loads and 880 stores we have before. llvm-svn: 11798	2004-02-24 08:58:30 +00:00
Alkis Evlogimenos	d192266264	Refactor VirtRegMap out of RegAllocLinearScan as the first part of bug 251 (providing a generic machine code rewriter/spiller). llvm-svn: 11780	2004-02-23 23:08:11 +00:00

39 Commits