llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-29 23:12:55 +01:00

Author	SHA1	Message	Date
Chris Lattner	f0ef4b4391	Fix a nasty problem on two-address machines in the following situation: store EAX -> [ss#0] [ss#0] += 1 ... use(EAX) In this case, it is not valid to rewrite this as: store EAX -> [ss#0] EAX += 1 store EAX -> [ss#0] ;;; this would also delete the store above ... use(EAX) ... because EAX is not a dead at that point. Keep track of which registers we are allowed to clobber, and which ones we aren't, and don't clobber the ones we're not supposed to. :) This should resolve the issues on X86 last night. llvm-svn: 25948	2006-02-03 23:28:46 +00:00
Chris Lattner	03b42d7724	significantly simplify the VirtRegMap code by pulling the SpillSlotsAvailable and PhysRegsAvailable maps out into a new AvailableSpills struct. No functionality change. This paves the way for a bugfix, coming up next. llvm-svn: 25947	2006-02-03 23:13:58 +00:00
Nate Begeman	5a58572b9b	Implement some feedback from sabre llvm-svn: 25946	2006-02-03 22:38:07 +00:00
Nate Begeman	2d9838ec9b	Add a framework for eliminating instructions that produces undemanded bits. llvm-svn: 25945	2006-02-03 22:24:05 +00:00
Chris Lattner	393e9d10dd	add a note llvm-svn: 25944	2006-02-03 22:06:45 +00:00
Chris Lattner	2b82e66f7c	another case Nate came up with llvm-svn: 25943	2006-02-03 22:05:41 +00:00
Chris Lattner	6c39bcf741	add a note llvm-svn: 25942	2006-02-03 21:25:23 +00:00
Chris Lattner	47b11a250c	remove some #ifdef'd out code, which should properly be in the dag combiner anyway. llvm-svn: 25941	2006-02-03 20:13:59 +00:00
Chris Lattner	366fa6bb83	remove an old comment llvm-svn: 25940	2006-02-03 18:59:39 +00:00
Chris Lattner	782422567b	Remove the X86PeepholeOptimizerPass, a truly horrible old hack that is now obsolete. yaay :) llvm-svn: 25939	2006-02-03 18:54:24 +00:00
Chris Lattner	70ef21db04	When rewriting frame instructions, emit the appropriate small-immediate instruction when possible. llvm-svn: 25938	2006-02-03 18:20:04 +00:00
Chris Lattner	21f305febf	node predicates add to the complexity of a pattern. This ensures that the X86 backend attempts to match small-immediate versions of instructions before the full size immediate versions. llvm-svn: 25937	2006-02-03 18:06:02 +00:00
Chris Lattner	255ef92e53	Teach sparc to fold loads/stores into copies. Remove the dead getRegClassForType method minor formating changes. llvm-svn: 25936	2006-02-03 07:06:25 +00:00
Chris Lattner	541258d077	remove dead fn llvm-svn: 25935	2006-02-03 06:51:34 +00:00
Nate Begeman	78c9e14249	Add common code for reassociating ops in the dag combiner llvm-svn: 25934	2006-02-03 06:46:56 +00:00
Evan Cheng	79edabf042	Added a (store (op (load ...) ...) ...) folding test case. llvm-svn: 25933	2006-02-03 06:46:41 +00:00
Chris Lattner	590c0d8621	Implement isLoadFromStackSlot and isStoreToStackSlot llvm-svn: 25932	2006-02-03 06:44:54 +00:00
Evan Cheng	7c8bab62e5	(store (op (load ...))) folding problem. In the generated matching code, Chain is initially set to the chain operand of store node, when it reaches load, if it matches the load then Chain is set to the chain operand of the load. However, if the matching code that follows this fails, isel moves on to the next pattern but it does not restore Chain to the chain operand of the store. So when it tries to match the next store / op / load pattern it would fail on the Chain == load.getOperand(0) test. The solution is for each chain operand to get a unique name. e.g. Chain10. llvm-svn: 25931	2006-02-03 06:22:41 +00:00
Chris Lattner	172cf85d48	remove some target-indep and implemented notes llvm-svn: 25930	2006-02-03 06:22:11 +00:00
Chris Lattner	9f8d39543f	target independent notes llvm-svn: 25929	2006-02-03 06:21:43 +00:00
Nate Begeman	85f3c9f566	Flesh out a couple of the items in the README llvm-svn: 25928	2006-02-03 05:17:06 +00:00
Jeff Cohen	e2f56a56f6	Fix VC++ compilation error caused by using a std::map iterator variable to receive a std::multimap iterator value. For some reason, GCC doesn't have a problem with this. llvm-svn: 25927	2006-02-03 03:48:54 +00:00
Chris Lattner	9ca1b98733	Remove move copies and dead stuff by not clobbering the result reg of a noop copy. llvm-svn: 25926	2006-02-03 03:16:14 +00:00
Andrew Lenharth	c879542ab0	isStoreToStackSlot llvm-svn: 25925	2006-02-03 03:07:37 +00:00
Chris Lattner	8e4e3207fe	Simplify some code llvm-svn: 25924	2006-02-03 03:06:49 +00:00
Chris Lattner	6588525e2a	the X86 backend no longer needs to delete its own noop copies llvm-svn: 25923	2006-02-03 02:59:58 +00:00
Chris Lattner	42c3d5124f	Add code that checks for noop copies, which triggers when either: 1. a target doesn't know how to fold load/stores into copies, or 2. the spiller rewrites the input to a copy to the same register as the dest instead of to the reloaded reg. This will be moved/improved in the near future, but allows elimination of some ancient x86 hacks. This eliminates 92 copies from SMG2000 on X86 and 163 copies from 252.eon. llvm-svn: 25922	2006-02-03 02:02:59 +00:00
Chris Lattner	06f54e7cb3	Add a note llvm-svn: 25921	2006-02-03 01:49:49 +00:00
Evan Cheng	c21098e77c	Added case HANDLENODE to getOperationName(). llvm-svn: 25920	2006-02-03 01:33:01 +00:00
Chris Lattner	e5344b1169	Physregs may hold multiple stack slot values at the same time. Keep track of this, and use it to our advantage (bwahahah). This allows us to eliminate another 60 instructions from smg2000 on PPC (probably significantly more on X86). A common old-new diff looks like this: stw r2, 3304(r1) - lwz r2, 3192(r1) stw r2, 3300(r1) - lwz r2, 3192(r1) stw r2, 3296(r1) - lwz r2, 3192(r1) stw r2, 3200(r1) - lwz r2, 3192(r1) stw r2, 3196(r1) - lwz r2, 3192(r1) + or r2, r2, r2 stw r2, 3188(r1) and - lwz r31, 604(r1) - lwz r13, 604(r1) - lwz r14, 604(r1) - lwz r15, 604(r1) - lwz r16, 604(r1) - lwz r30, 604(r1) + or r31, r30, r30 + or r13, r30, r30 + or r14, r30, r30 + or r15, r30, r30 + or r16, r30, r30 + or r30, r30, r30 Removal of the R = R copies is coming next... llvm-svn: 25919	2006-02-03 00:36:31 +00:00
Chris Lattner	66d0341e73	update a note llvm-svn: 25918	2006-02-02 23:50:22 +00:00
Chris Lattner	935255c984	Fix a deficiency in the spiller that Evan noticed. In particular, consider this code: store [stack slot #0], R10 = add R14, [stack slot #0] The spiller didn't know that the store made the value of [stackslot#0] available in R10 IF the store came from a copy instruction with the store folded into it. This patch teaches VirtRegMap to look at these stores and recognize the values they make available. In one case Evan provided, this code: divsd %XMM0, %XMM1 movsd %XMM1, QWORD PTR [%ESP + 40] 1) movsd QWORD PTR [%ESP + 48], %XMM1 2) movsd %XMM1, QWORD PTR [%ESP + 48] addsd %XMM1, %XMM0 3) movsd QWORD PTR [%ESP + 48], %XMM1 movsd QWORD PTR [%ESP + 4], %XMM0 turns into: divsd %XMM0, %XMM1 movsd %XMM1, QWORD PTR [%ESP + 40] addsd %XMM1, %XMM0 3) movsd QWORD PTR [%ESP + 48], %XMM1 movsd QWORD PTR [%ESP + 4], %XMM0 In this case, instruction #2 was removed because of the value made available by #1, and inst #1 was later deleted because it is now never used before the stack slot is redefined by #3. This occurs here and there in a lot of code with high spilling, on PPC most of the removed loads/stores are LSU-reject-causing loads, which is nice. On X86, things are much better (because it spills more), where we nuke about 1% of the instructions from SMG2000 and several hundred from eon. More improvements to come... llvm-svn: 25917	2006-02-02 23:29:36 +00:00
Nate Begeman	09bdfffaa6	add 64b gpr store to the possible list of isStoreToStackSlot opcodes. llvm-svn: 25916	2006-02-02 21:07:50 +00:00
Chris Lattner	452b75a57b	fix operand numbers llvm-svn: 25915	2006-02-02 20:38:12 +00:00
Chris Lattner	8337a1050d	implement isStoreToStackSlot for PPC llvm-svn: 25914	2006-02-02 20:16:12 +00:00
Chris Lattner	15cb732cd7	Move isLoadFrom/StoreToStackSlot from MRegisterInfo to TargetInstrInfo,a far more logical place. Other methods should also be moved if anyoneis interested. :) llvm-svn: 25913	2006-02-02 20:12:32 +00:00
Chris Lattner	6684ba101f	Move isLoadFrom/StoreToStackSlot from MRegisterInfo to TargetInstrInfo, a far more logical place. Other methods should also be moved if anyone is interested. :) llvm-svn: 25912	2006-02-02 20:11:55 +00:00
Chris Lattner	10f9a9daa5	implement isStoreToStackSlot llvm-svn: 25911	2006-02-02 20:00:41 +00:00
Chris Lattner	876bbd4faa	add a method llvm-svn: 25910	2006-02-02 19:57:16 +00:00
Chris Lattner	3cbf670f57	add a new isStoreToStackSlot method llvm-svn: 25909	2006-02-02 19:55:29 +00:00
Chris Lattner	ebf20d47ac	more notes llvm-svn: 25908	2006-02-02 19:43:28 +00:00
Chris Lattner	8a8c101989	add a note, I have no idea how important this is. llvm-svn: 25907	2006-02-02 19:16:34 +00:00
Chris Lattner	95fe9f5df2	%fcc is not an alias for %fcc0 llvm-svn: 25906	2006-02-02 08:02:20 +00:00
Chris Lattner	f5c935f882	correct an opcode llvm-svn: 25905	2006-02-02 07:56:15 +00:00
Chris Lattner	c434ee61a3	new example llvm-svn: 25903	2006-02-02 07:37:11 +00:00
Nate Begeman	dd4acf9710	Update the README llvm-svn: 25902	2006-02-02 07:27:56 +00:00
Chris Lattner	456711ae45	Turn any_extend nodes into zero_extend nodes when it allows us to remove an and instruction. This allows us to compile stuff like this: bool %X(int %X) { %Y = add int %X, 14 %Z = setne int %Y, 12345 ret bool %Z } to this: _X: cmpl $12331, 4(%esp) setne %al movzbl %al, %eax ret instead of this: _X: cmpl $12331, 4(%esp) setne %al movzbl %al, %eax andl $1, %eax ret This occurs quite a bit with the X86 backend. For example, 25 times in lambda, 30 times in 177.mesa, 14 times in galgel, 70 times in fma3d, 25 times in vpr, several hundred times in gcc, ~45 times in crafty, ~60 times in parser, ~140 times in eon, 110 times in perlbmk, 55 on gap, 16 times on bzip2, 14 times on twolf, and 1-2 times in many other SPEC2K programs. llvm-svn: 25901	2006-02-02 07:17:31 +00:00
Chris Lattner	c2ec404142	Implement MaskedValueIsZero for ANY_EXTEND nodes llvm-svn: 25900	2006-02-02 06:43:15 +00:00
Chris Lattner	32762f6c01	implemented, testcase here: test/Regression/CodeGen/X86/compare-add.ll llvm-svn: 25899	2006-02-02 06:36:48 +00:00
Chris Lattner	8f4d73f3da	add two dag combines: (C1-X) == C2 --> X == C1-C2 (X+C1) == C2 --> X == C2-C1 This allows us to compile this: bool %X(int %X) { %Y = add int %X, 14 %Z = setne int %Y, 12345 ret bool %Z } into this: _X: cmpl $12331, 4(%esp) setne %al movzbl %al, %eax andl $1, %eax ret not this: _X: movl $14, %eax addl 4(%esp), %eax cmpl $12345, %eax setne %al movzbl %al, %eax andl $1, %eax ret Testcase here: Regression/CodeGen/X86/compare-add.ll nukage of the and coming up next. llvm-svn: 25898	2006-02-02 06:36:13 +00:00

... 2 3 4 5 6 ...

22599 Commits