llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-02 00:42:52 +01:00

Author	SHA1	Message	Date
Evan Cheng	863ed2677b	Fix PR5024. LiveVariables physical register defs should commit only after all of the defs are processed. Also fix an implicit_def propagation bug: an implicit_def of a physical register should be applied to uses of the sub-registers. llvm-svn: 82616	2009-09-23 06:28:31 +00:00
Evan Cheng	3edb8b18a5	Fix PR4986. "r1024 = insert_subreg r1024, undef, 2" cannot be turned into an implicit_def. Instead, it's an identity copy so it should be eliminated. Also make sure to update livevariable kill information. llvm-svn: 82436	2009-09-21 04:32:32 +00:00
Dale Johannesen	f8d05d7ce6	When computing live intervals for earlyclobber operands, we pushed the beginning of the interval back 1, so the interval would overlap with inputs that die. We were also pushing the end of the interval back 1, though, which means the earlyclobber didn't overlap with other output operands. Don't do this. PR 4964. llvm-svn: 82342	2009-09-20 00:36:41 +00:00
Daniel Dunbar	e14446813e	Fix -Asserts warning. llvm-svn: 81909	2009-09-15 20:31:12 +00:00
Evan Cheng	c6aba09119	Another try at early partial coalescing. Identity phi source copies (their sources are defined by phi join def) are coalesced. And the phi join copy is backward copy propagated into the other copies. Still miscompiling some tests. :-( llvm-svn: 81849	2009-09-15 06:45:16 +00:00
Evan Cheng	83bb285c97	Add early coalescing to liveintervals. This is work in progress and is known to miscompute some tests. Read it at your own risk, I have aged 10 years while writing this. The gist of this is if source of some of the copies that feed into a phi join is defined by the phi join, we'd like to eliminate them. However, if any of the non-identity source overlaps the live interval of the phi join then the coalescer won't be able to coalesce them. The early coalescer's job is to eliminate the identity copies by partially-coalescing the two live intervals. llvm-svn: 81796	2009-09-14 21:33:42 +00:00
Lang Hames	fc3582b378	Moved some more index operations over to LiveIntervals. llvm-svn: 81605	2009-09-12 03:34:03 +00:00
Evan Cheng	adfc67b74b	80 col violations. llvm-svn: 81598	2009-09-12 02:01:07 +00:00
Lang Hames	e504e61ab5	Replaces uses of unsigned for indexes in LiveInterval and VNInfo with a new class, MachineInstrIndex, which hides arithmetic details from most clients. This is a step towards allowing the register allocator to update/insert code during allocation. llvm-svn: 81040	2009-09-04 20:41:11 +00:00
Chris Lattner	1c0452caeb	Change Pass::print to take a raw ostream instead of std::ostream, update all code that this affects. llvm-svn: 79830	2009-08-23 06:03:38 +00:00
Chris Lattner	db2965c71f	remove various std::ostream version of printing methods from MachineInstr and MachineOperand. This required eliminating a bunch of stuff that was using DOUT, I hope that bill doesn't mind me stealing his fun. ;-) llvm-svn: 79813	2009-08-23 03:41:05 +00:00
Chris Lattner	38a3eb5739	remove a dead class. llvm-svn: 79795	2009-08-23 00:42:42 +00:00
Bill Wendling	cf51caf09d	Convert DOUT to DEBUG(errs()...). llvm-svn: 79752	2009-08-22 20:18:03 +00:00
Lang Hames	1503988bb5	Modified VNInfo. The "copy" member is now a union which holds the copy for a register interval, or the defining register for a stack interval. Access is via getCopy/setCopy and getReg/setReg. llvm-svn: 78620	2009-08-10 23:43:28 +00:00
Evan Cheng	6fc78f15fe	Turn some insert_subreg, extract_subreg, subreg_to_reg into implicit_defs. llvm-svn: 78151	2009-08-05 03:53:14 +00:00
David Greene	070216fa44	Re-apply LiveInterval index dumping patch, with fixes suggested by Bill and others. llvm-svn: 78003	2009-08-03 21:55:09 +00:00
Dan Gohman	4529d71681	Use setPreservesAll and setPreservesCFG in CodeGen passes. llvm-svn: 77754	2009-07-31 23:37:33 +00:00
Daniel Dunbar	8496064116	More migration to raw_ostream, the water has dried up around the iostream hole. - Some clients which used DOUT have moved to DEBUG. We are deprecating the "magic" DOUT behavior which avoided calling printing functions when the statement was disabled. In addition to being unnecessary magic, it had the downside of leaving code in -Asserts builds, and of hiding potentially unnecessary computations. llvm-svn: 77019	2009-07-25 00:23:56 +00:00
Daniel Dunbar	1df20e31e5	Move to raw_ostream. llvm-svn: 76963	2009-07-24 09:53:24 +00:00
David Greene	fede97eac6	Constify the key in Mi2IndexMap. llvm-svn: 76801	2009-07-22 21:56:14 +00:00
Chris Lattner	135caf1a2b	revert r76602, 76603, and r76615, pending design discussions. llvm-svn: 76646	2009-07-21 21:12:58 +00:00
David Greene	94bfedaf40	Prefix IR dumps with LiveInterval indices when possible. This turns this: %ESI<def> = MOV32rr %EDI<kill> ADJCALLSTACKDOWN64 0, %RSP<imp-def>, %EFLAGS<imp-def,dead>, %RSP<imp-use> %reg1027<def> = MOVZX64rr32 %ESI %reg1027<def> = ADD64ri8 %reg1027, 15, %EFLAGS<imp-def,dead> %reg1027<def> = AND64ri8 %reg1027, -16, %EFLAGS<imp-def,dead> %RDI<def> = MOV64rr %RSP %RDI<def> = SUB64rr %RDI, %reg1027<kill>, %EFLAGS<imp-def,dead> %RSP<def> = MOV64rr %RDI into this: 4 %reg1024<def> = MOV32rr %EDI<kill> 12 ADJCALLSTACKDOWN64 0, %RSP<imp-def>, %EFLAGS<imp-def,dead>, %RSP<imp-use> 20 %reg1025<def> = MOVZX64rr32 %reg1024 28 %reg1026<def> = MOV64rr %reg1025<kill> 36 %reg1026<def> = ADD64ri8 %reg1026, 15, %EFLAGS<imp-def,dead> 44 %reg1027<def> = MOV64rr %reg1026<kill> 52 %reg1027<def> = AND64ri8 %reg1027, -16, %EFLAGS<imp-def,dead> 60 %reg1028<def> = MOV64rr %RSP 68 %reg1029<def> = MOV64rr %reg1028<kill> 76 %reg1029<def> = SUB64rr %reg1029, %reg1027<kill>, %EFLAGS<imp-def,dead> 84 %RSP<def> = MOV64rr %reg1029 This helps greatly when debugging register allocation and coalescing problems. llvm-svn: 76615	2009-07-21 18:56:32 +00:00
Evan Cheng	ba5b67f66d	Simplify the coalescer (finally!) by making LiveIntervals::processImplicitDefs a little more aggressive and teaching liveintervals to make use of isUndef marker on MachineOperands. llvm-svn: 76223	2009-07-17 19:43:40 +00:00
Evan Cheng	981276bb16	Changed my mind. We now allow remat of instructions whose defs have subreg indices. llvm-svn: 76100	2009-07-16 20:15:00 +00:00
Evan Cheng	7a6b20df7f	Let callers decide the sub-register index on the def operand of rematerialized instructions. Avoid remat'ing instructions whose def have sub-register indices for now. It's just really really hard to get all the cases right. llvm-svn: 75900	2009-07-16 09:20:10 +00:00
Torok Edwin	f955a6ef49	llvm_unreachable->llvm_unreachable(0), LLVM_UNREACHABLE->llvm_unreachable. This adds location info for all llvm_unreachable calls (which is a macro now) in !NDEBUG builds. In NDEBUG builds location info and the message are off (it only prints "UNREACHABLE executed"). llvm-svn: 75640	2009-07-14 16:55:14 +00:00
Torok Edwin	ae8a3ff177	assert(0) -> LLVM_UNREACHABLE. Make llvm_unreachable take an optional string, thus moving the cerr<< out of line. LLVM_UNREACHABLE is now a simple wrapper that makes the message go away for NDEBUG builds. llvm-svn: 75379	2009-07-11 20:10:48 +00:00
Torok Edwin	9b41a5faf2	Convert more assert(0)+abort() -> LLVM_UNREACHABLE, and abort()/exit() -> llvm_report_error(). llvm-svn: 75363	2009-07-11 13:10:19 +00:00
Duncan Sands	fff8443450	Avoid compiler warnings if assertions turned off. llvm-svn: 75267	2009-07-10 20:07:07 +00:00
Lang Hames	ceb80b14d3	Improved tracking of value number kills. VN kills are now represented as an (index,bool) pair. The bool flag records whether the kill is a PHI kill or not. This code will be used to enable splitting of live intervals containing PHI-kills. A slight change to live interval weights introduced an extra spill into lsr-code-insertion (outside the critical sections). The test condition has been updated to reflect this. llvm-svn: 75097	2009-07-09 03:57:02 +00:00
Evan Cheng	7d78cb531e	Remove special handling of implicit_def. Fix a couple more bugs in liveintervalanalysis and coalescer handling of implicit_def. Note, isUndef marker must be placed even on implicit_def def operand or else the scavenger will not ignore it. This is necessary because -O0 path does not use liveintervalanalysis, it treats implicit_def just like any other def. llvm-svn: 74601	2009-07-01 08:19:36 +00:00
Evan Cheng	37503e9671	Handle IMPLICIT_DEF with isUndef operand marker, part 2. This patch moves the code to annotate machineoperands to LiveIntervalAnalysis. It also adds markers for implicit_def that define physical registers. The rest is just a lot of details. llvm-svn: 74580	2009-07-01 01:59:31 +00:00
Evan Cheng	c6c942b70f	Add a bit IsUndef to MachineOperand. This indicates the def / use register operand is defined by an implicit_def. That means it can def / use any register and passes (e.g. register scavenger) can feel free to ignore them. The register allocator, when it allocates a register to a virtual register defined by an implicit_def, can allocate any physical register without worrying about overlapping live ranges. It should mark all of operands of the said virtual register so later passes will do the right thing. This is not the best solution. But it should be a lot less fragile than having the scavenger try to track what is defined by implicit_def. llvm-svn: 74518	2009-06-30 08:49:04 +00:00
Chris Lattner	0ce83b0e95	When doing remat, don't consider uses of non-allocatable physregs. Patch by Evan. llvm-svn: 74370	2009-06-27 04:06:41 +00:00
Lang Hames	1bade5ff09	More VNInfo tweaking, plus a little progress on intra-block splitting. llvm-svn: 73750	2009-06-19 02:17:53 +00:00
Lang Hames	7f288c29af	Improved PHI def marking, replaced some gotos with breaks. llvm-svn: 73727	2009-06-18 22:01:47 +00:00
Lang Hames	5c64015a56	VNInfo cleanup. llvm-svn: 73634	2009-06-17 21:01:20 +00:00
Evan Cheng	1607bd1fa9	Move register allocation preference (or hint) from LiveInterval to MachineRegisterInfo. This allows more passes to set them. llvm-svn: 73346	2009-06-14 20:22:55 +00:00
Lang Hames	1a81422fab	Update to in-place spilling framework. Includes live interval scaling and trivial rewriter. llvm-svn: 72729	2009-06-02 16:53:25 +00:00
Jeffrey Yasskin	14f27c22aa	LiveVariables::VarInfo contains an AliveBlocks BitVector, which has as many entries as there are basic blocks in the function. LiveVariables::getVarInfo creates a VarInfo struct for every register in the function, leading to quadratic space use. This patch changes the BitVector to a SparseBitVector, which doesn't help the worst-case memory use but does reduce the actual use in very long functions with short-lived variables. llvm-svn: 72426	2009-05-26 18:27:15 +00:00
Evan Cheng	28aa6c41d1	In some rare cases, the register allocator can spill registers but end up not utilizing registers at all. The fundamental problem is linearscan's backtracking can end up freeing more than one allocated register. However, reloads and restores might be folded into uses / defs and freed registers might not be used at all. VirtRegMap keeps track of allocations so it knows what's not used. As a horrible hack, the stack coloring can color spill slots with free registers. That is, it replaces reloads and spills with copies from and to the free register. It unfolds instructions that load and store the spill slot and replaces them with register-using variants. Not yet enabled. This is part 1. More coming. llvm-svn: 70787	2009-05-03 18:32:42 +00:00
Evan Cheng	c315cf24e3	Fix PR4076. Correctly create live interval of physical register with two-address update. llvm-svn: 70245	2009-04-27 20:42:46 +00:00
Evan Cheng	43fc90ae59	Fix PR4056. It's possible a physical register def is dead if its implicit use is deleted by two-address pass. llvm-svn: 70213	2009-04-27 17:36:47 +00:00
Evan Cheng	a36c6c6819	It has finally happened. Spiller is now using live interval info. This fixes a very subtle bug. vr defined by an implicit_def is allowed to overlap with any register since it doesn't actually modify anything. However, if it's used as a two-address use, its live range can be extended and it can be spilled. The spiller must take care not to emit a reload for the vn number that's defined by the implicit_def. This is both a correctness and performance issue. llvm-svn: 69743	2009-04-21 22:46:52 +00:00
Evan Cheng	c248188b46	Added a linearscan register allocation optimization. When the register allocator spills an interval with multiple uses in the same basic block, it creates a different virtual register for each of the reloads. e.g. %reg1498<def> = MOV32rm %reg1024, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg1024, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg1024, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] => %reg1498<def> = MOV32rm %reg2036, 1, %reg0, 12, %reg0, Mem:LD(4,4) [sunkaddr39 + 0] %reg1506<def> = MOV32rm %reg2037, 1, %reg0, 8, %reg0, Mem:LD(4,4) [sunkaddr42 + 0] %reg1486<def> = MOV32rr %reg1506 %reg1486<def> = XOR32rr %reg1486, %reg1498, %EFLAGS<imp-def,dead> %reg1510<def> = MOV32rm %reg2038, 1, %reg0, 4, %reg0, Mem:LD(4,4) [sunkaddr45 + 0] From linearscan's point of view, each of reg2036, 2037, and 2038 are separate registers, each is "killed" after a single use. The reloaded register is available and it's often clobbered right away. e.g. In this case reg1498 is allocated EAX while reg2036 is allocated RAX. This means we end up with multiple reloads from the same stack slot in the same basic block. Now linearscan recognizes there are other reloads from the same SS in the same BB. So it'll "downgrade" RAX (and its aliases) after reg2036 is allocated until the next reload (reg2037) is done. This greatly increases the likelihood reloads from SS are reused. This speeds up sha1 from OpenSSL by 5.8%. It is also an across the board win for SPEC2000 and 2006. llvm-svn: 69585	2009-04-20 08:01:12 +00:00
Dan Gohman	1070168ae0	Don't abort on an aliasing physical register that does not have a live interval. This is needed for some upcoming subreg changes. llvm-svn: 68956	2009-04-13 15:22:29 +00:00
Bob Wilson	c53238dff1	Fix pr3954. The register scavenger asserts for inline assembly with register destinations that are tied to source operands. The TargetInstrDescr::findTiedToSrcOperand method silently fails for inline assembly. The existing MachineInstr::isRegReDefinedByTwoAddr was very close to doing what is needed, so this revision makes a few changes to that method and also renames it to isRegTiedToUseOperand (for consistency with the very similar isRegTiedToDefOperand and because it handles both two-address instructions and inline assembly with tied registers). llvm-svn: 68714	2009-04-09 17:16:43 +00:00
Dan Gohman	c9ce27d6b7	Implement support for modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Owen Anderson	2312267684	Don't assign a new stack slot if the pre-alloc splitter already assigned one. llvm-svn: 67764	2009-03-26 18:53:38 +00:00
Evan Cheng	7e4a6972d6	Fix PR3391 and PR3864. Reg allocator infinite looping. llvm-svn: 67544	2009-03-23 18:24:37 +00:00
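
The LiveInterval::MergeInClobberRanges fix in commit c9ce27d6b7 above concerns a clobber range that starts before and ends after an existing live range, so that more than one new range has to be added. The following is a minimal standalone sketch of that case only, not the LLVM implementation: the `Range` alias and the `uncoveredPieces` helper are hypothetical names, and ranges are assumed to be half-open, sorted, and non-overlapping.

```cpp
#include <algorithm>
#include <cassert>
#include <utility>
#include <vector>

// Hypothetical type: a half-open range [first, second) of instruction indexes.
using Range = std::pair<unsigned, unsigned>;

// Returns the parts of `clobber` that are not already covered by `live`.
// Assumes `live` is sorted by start index and its ranges do not overlap.
std::vector<Range> uncoveredPieces(const std::vector<Range> &live,
                                   Range clobber) {
  std::vector<Range> pieces;
  unsigned cursor = clobber.first;  // first index not yet accounted for
  for (const Range &r : live) {
    if (r.second <= cursor) continue;      // ends before the uncovered part
    if (r.first >= clobber.second) break;  // starts after the clobber ends
    if (r.first > cursor)
      pieces.push_back({cursor, r.first}); // gap in front of this live range
    cursor = std::max(cursor, r.second);   // skip what the live range covers
    if (cursor >= clobber.second) break;
  }
  if (cursor < clobber.second)
    pieces.push_back({cursor, clobber.second});  // tail after the last range
  return pieces;
}
```

With an existing range [10, 20) and a clobber range [5, 25), the helper returns two pieces, [5, 10) and [20, 25), which is the "multiple live ranges need to be added" situation the commit message describes.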


378 Commits