llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Chris Lattner	aa6f7cb4e4	Make sure that at least one virtual method is defined in a .cpp file to avoid having the compiler emit RTTI and vtables to EVERY translation unit. llvm-svn: 11871	2004-02-26 07:24:18 +00:00
Chris Lattner	e07d786aa6	turn things like: if (X == 0 \|\| X == 2) ...where the comparisons and branches are in different blocks... into a switch instruction. This comes up a lot in various programs, and works well with the switch/switch merging code I checked earlier. For example, this testcase: int switchtest(int C) { return C == 0 ? f(123) : C == 1 ? f(3123) : C == 4 ? f(312) : C == 5 ? f(1234): f(444); } is converted into this: switch int %C, label %cond_false.3 [ int 0, label %cond_true.0 int 1, label %cond_true.1 int 4, label %cond_true.2 int 5, label %cond_true.3 ] instead of a whole bunch of conditional branches. Admittedly the code is ugly, and incomplete. To be complete, we need to add br -> switch merging and switch -> br merging. For example, this testcase: struct foo { int Q, R, Z; }; #define A (X->Q+X->R * 123) int test(struct foo X) { return A == 123 ? X1() : A == 12321 ? X2(): (A == 111 \|\| A == 222) ? X3() : A == 875 ? X4() : X5(); } Gets compiled to this: switch int %tmp.7, label %cond_false.2 [ int 123, label %cond_true.0 int 12321, label %cond_true.1 int 111, label %cond_true.2 int 222, label %cond_true.2 ] ... cond_false.2: ; preds = %entry %tmp.52 = seteq int %tmp.7, 875 ; <bool> [#uses=1] br bool %tmp.52, label %cond_true.3, label %cond_false.3 where the branch could be folded into the switch. This kind of thing occurs ALL OF THE TIME, especially in programs like 176.gcc, which is a horrible mess of code. It contains stuff like shudder*: #define SWITCH_TAKES_ARG(CHAR) \ ( (CHAR) == 'D' \ \|\| (CHAR) == 'U' \ \|\| (CHAR) == 'o' \ \|\| (CHAR) == 'e' \ \|\| (CHAR) == 'u' \ \|\| (CHAR) == 'I' \ \|\| (CHAR) == 'm' \ \|\| (CHAR) == 'L' \ \|\| (CHAR) == 'A' \ \|\| (CHAR) == 'h' \ \|\| (CHAR) == 'z') and #define CONST_OK_FOR_LETTER_P(VALUE, C) \ ((C) == 'I' ? SMALL_INTVAL (VALUE) \ : (C) == 'J' ? SMALL_INTVAL (-(VALUE)) \ : (C) == 'K' ? (unsigned)(VALUE) < 32 \ : (C) == 'L' ? ((VALUE) & 0xffff) == 0 \ : (C) == 'M' ? integer_ok_for_set (VALUE) \ : (C) == 'N' ? (VALUE) < 0 \ : (C) == 'O' ? (VALUE) == 0 \ : (C) == 'P' ? (VALUE) >= 0 \ : 0) and #define LEGITIMIZE_ADDRESS(X,OLDX,MODE,WIN) \ { \ if (GET_CODE (X) == PLUS && CONSTANT_ADDRESS_P (XEXP (X, 1))) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 0), \ copy_to_mode_reg (SImode, XEXP (X, 1))); \ if (GET_CODE (X) == PLUS && CONSTANT_ADDRESS_P (XEXP (X, 0))) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 1), \ copy_to_mode_reg (SImode, XEXP (X, 0))); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 0)) == MULT) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 1), \ force_operand (XEXP (X, 0), 0)); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 1)) == MULT) \ (X) = gen_rtx (PLUS, SImode, XEXP (X, 0), \ force_operand (XEXP (X, 1), 0)); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 0)) == PLUS) \ (X) = gen_rtx (PLUS, Pmode, force_operand (XEXP (X, 0), NULL_RTX),\ XEXP (X, 1)); \ if (GET_CODE (X) == PLUS && GET_CODE (XEXP (X, 1)) == PLUS) \ (X) = gen_rtx (PLUS, Pmode, XEXP (X, 0), \ force_operand (XEXP (X, 1), NULL_RTX)); \ if (GET_CODE (X) == SYMBOL_REF \|\| GET_CODE (X) == CONST \ \|\| GET_CODE (X) == LABEL_REF) \ (X) = legitimize_address (flag_pic, X, 0, 0); \ if (memory_address_p (MODE, X)) \ goto WIN; } and others. These macros get used multiple times of course. These are such lovely candidates for macros, aren't they? :) This code also nicely handles LLVM constructs that look like this: if (isa<CastInst>(I)) ... else if (isa<BranchInst>(I)) ... else if (isa<SetCondInst>(I)) ... else if (isa<UnwindInst>(I)) ... else if (isa<VAArgInst>(I)) ... where the isa can obviously be a dyn_cast as well. Switch instructions are a good thing. llvm-svn: 11870	2004-02-26 07:13:46 +00:00
Chris Lattner	ac94c441b6	No need to clear the map here, it will always be empty llvm-svn: 11868	2004-02-26 05:21:21 +00:00
Chris Lattner	9e55e31b2d	Fix typo llvm-svn: 11864	2004-02-26 03:45:03 +00:00
Chris Lattner	7990e4dcd0	The node doesn't have to be _no_ node flags, it just has to be complete and not have any globals. llvm-svn: 11863	2004-02-26 03:43:43 +00:00
Chris Lattner	948fffa8a2	Add _more_ functions llvm-svn: 11862	2004-02-26 03:43:08 +00:00
Chris Lattner	6a3796eaf9	Fix some warnings, some of which were spurious, and some of which were real bugs. Thanks Brian! llvm-svn: 11859	2004-02-26 01:20:02 +00:00
Misha Brukman	3d1720cdb9	Instructions to call and return from functions. llvm-svn: 11858	2004-02-26 00:37:12 +00:00
Chris Lattner	9fd0c48f80	Two changes: 1. Functions do not make things incomplete, only variables 2. Constant global variables no longer need to be marked incomplete, because we are guaranteed that the initializer for the global will be in the graph we are hacking on now. This makes resolution of indirect calls happen a lot more in the bu pass, supports things like vtables and the C counterparts (giant constant arrays of function pointers), etc... Testcase here: test/Regression/Analysis/DSGraph/constant_globals.ll llvm-svn: 11852	2004-02-25 23:36:08 +00:00
Chris Lattner	7d273bc532	When building local graphs, clone the initializer for constant globals into each local graph that uses the global. llvm-svn: 11850	2004-02-25 23:31:02 +00:00
Alkis Evlogimenos	af42cbf42f	Fix bugs found with recent addition of assertions in MRegisterInfo::is{Physical,Virtual}Register. llvm-svn: 11849	2004-02-25 23:21:52 +00:00
Chris Lattner	9fe3bf296d	Simplify the dead node elimination stuff Make the incompleteness marker faster by looping directly over the globals instead of over the scalars to find the globals Fix a bug where we didn't mark a global incomplete if it didn't have any outgoing edges. This wouldn't break any current clients but is still wrong. llvm-svn: 11848	2004-02-25 23:08:00 +00:00
Chris Lattner	a9f67b5ab8	Add a bunch more functions llvm-svn: 11847	2004-02-25 23:06:40 +00:00
Chris Lattner	d99d965f8c	Try harder to get symbol info llvm-svn: 11846	2004-02-25 23:06:30 +00:00
Brian Gaeke	aba4159be8	Represent va_list in interpreter as a (ec-stack-depth . var-arg-index) pair, and look up varargs in the execution stack every time, instead of just pushing iterators (which can be invalidated during callFunction()) around. (union GenericValue now has a "pair of uints" member, to support this mechanism.) Fixes Bug 234. llvm-svn: 11845	2004-02-25 23:01:48 +00:00
Brian Gaeke	4f0a829a68	Great sparc renaming fallout IV: Sparc --> SparcV9. llvm-svn: 11844	2004-02-25 22:09:36 +00:00
Alkis Evlogimenos	2caa729f02	Remove asssert since it is breaking cases that it shouldn't. llvm-svn: 11841	2004-02-25 22:01:06 +00:00
Alkis Evlogimenos	f1516015af	Add DenseMap template and actually use it for for mapping virtual regs to objects. llvm-svn: 11840	2004-02-25 21:55:45 +00:00
Chris Lattner	2a13dd5706	My faith in programmers has been found to be totally misplaced. One would assume that if they don't intend to write to a global variable, that they would mark it as constant. However, there are people that don't understand that the compiler can do nice things for them if they give it the information it needs. This pass looks for blatently obvious globals that are only ever read from. Though it uses a trivially simple "alias analysis" of sorts, it is still able to do amazing things to important benchmarks. 253.perlbmk, for example, contains several *GIANT* function pointer tables that are not marked constant and should be. Marking them constant allows the optimizer to turn a whole bunch of indirect calls into direct calls. Note that only a link-time optimizer can do this transformation, but perlbmk does have several strings and other minor globals that can be marked constant by this pass when run from GCCAS. 176.gcc has a ton of strings and large tables that are marked constant, both at compile time (38 of them) and at link time (48 more). Other benchmarks give similar results, though it seems like big ones have disproportionally more than small ones. This pass is extremely quick and does good things. I'm going to enable it in gccas & gccld. Not bad for 50 SLOC. llvm-svn: 11836	2004-02-25 21:34:36 +00:00
Misha Brukman	6a13621948	SparcV8 regs are really 32-bit, not 64! Thanks, Chris. llvm-svn: 11835	2004-02-25 21:03:02 +00:00
Misha Brukman	f12c1e5a55	Clean up the tablegen descriptions for SparcV8. llvm-svn: 11834	2004-02-25 21:02:21 +00:00
Misha Brukman	c8801eb5be	Fix the SparcV8 register definitions that were imported from PPC template. llvm-svn: 11833	2004-02-25 21:00:05 +00:00
Misha Brukman	a4b3e0f01b	SparcV8 has different types of instructions, but F1 is only used for CALL. llvm-svn: 11832	2004-02-25 20:52:20 +00:00
Chris Lattner	ccae3f6d60	Add an assertion llvm-svn: 11830	2004-02-25 19:37:44 +00:00
Chris Lattner	7c05e5d4d8	Fix failures in 099.go due to the cfgsimplify pass creating switch instructions where there did not used to be any before llvm-svn: 11829	2004-02-25 19:30:19 +00:00
Brian Gaeke	5166390fd2	SparcV8 skeleton llvm-svn: 11828	2004-02-25 19:28:19 +00:00
Brian Gaeke	c6de948cd1	Great renaming part II: Sparc --> SparcV9 (also includes command-line options and Makefiles) llvm-svn: 11827	2004-02-25 19:08:12 +00:00
Brian Gaeke	965df0b91b	Great renaming: Sparc --> SparcV9 llvm-svn: 11826	2004-02-25 18:44:15 +00:00
Chris Lattner	4f09004dff	Add a bunch more functions used by perlbmk llvm-svn: 11824	2004-02-25 17:43:20 +00:00
Chris Lattner	04f116953d	Fix incorrect debug code llvm-svn: 11821	2004-02-25 15:15:04 +00:00
Chris Lattner	ab9628ad18	Teach the instruction selector how to transform 'array' GEP computations into X86 scaled indexes. This allows us to compile GEP's like this: int* %test([10 x { int, { int } }]* %X, int %Idx) { %Idx = cast int %Idx to long %X = getelementptr [10 x { int, { int } }]* %X, long 0, long %Idx, ubyte 1, ubyte 0 ret int* %X } Into a single address computation: test: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] lea %EAX, DWORD PTR [%EAX + 8*%ECX + 4] ret Before it generated: test: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] shl %ECX, 3 add %EAX, %ECX lea %EAX, DWORD PTR [%EAX + 4] ret This is useful for things like int/float/double arrays, as the indexing can be folded into the loads&stores, reducing register pressure and decreasing the pressure on the decode unit. With these changes, I expect our performance on 256.bzip2 and gzip to improve a lot. On bzip2 for example, we go from this: 10665 asm-printer - Number of machine instrs printed 40 ra-local - Number of loads/stores folded into instructions 1708 ra-local - Number of loads added 1532 ra-local - Number of stores added 1354 twoaddressinstruction - Number of instructions added 1354 twoaddressinstruction - Number of two-address instructions 2794 x86-peephole - Number of peephole optimization performed to this: 9873 asm-printer - Number of machine instrs printed 41 ra-local - Number of loads/stores folded into instructions 1710 ra-local - Number of loads added 1521 ra-local - Number of stores added 789 twoaddressinstruction - Number of instructions added 789 twoaddressinstruction - Number of two-address instructions 2142 x86-peephole - Number of peephole optimization performed ... and these types of instructions are often in tight loops. Linear scan is also helped, but not as much. It goes from: 8787 asm-printer - Number of machine instrs printed 2389 liveintervals - Number of identity moves eliminated after coalescing 2288 liveintervals - Number of interval joins performed 3522 liveintervals - Number of intervals after coalescing 5810 liveintervals - Number of original intervals 700 spiller - Number of loads added 487 spiller - Number of stores added 303 spiller - Number of register spills 1354 twoaddressinstruction - Number of instructions added 1354 twoaddressinstruction - Number of two-address instructions 363 x86-peephole - Number of peephole optimization performed to: 7982 asm-printer - Number of machine instrs printed 1759 liveintervals - Number of identity moves eliminated after coalescing 1658 liveintervals - Number of interval joins performed 3282 liveintervals - Number of intervals after coalescing 4940 liveintervals - Number of original intervals 635 spiller - Number of loads added 452 spiller - Number of stores added 288 spiller - Number of register spills 789 twoaddressinstruction - Number of instructions added 789 twoaddressinstruction - Number of two-address instructions 258 x86-peephole - Number of peephole optimization performed Though I'm not complaining about the drop in the number of intervals. :) llvm-svn: 11820	2004-02-25 07:00:55 +00:00
Chris Lattner	dccf14825c	* Make the previous patch more efficient by not allocating a temporary MachineInstr to do analysis. * FOLD getelementptr instructions into loads and stores when possible, making use of some of the crazy X86 addressing modes. For example, the following C++ program fragment: struct complex { double re, im; complex(double r, double i) : re(r), im(i) {} }; inline complex operator+(const complex& a, const complex& b) { return complex(a.re+b.re, a.im+b.im); } complex addone(const complex& arg) { return arg + complex(1,0); } Used to be compiled to: _Z6addoneRK7complex: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] * mov %EDX, %ECX fld QWORD PTR [%EDX] fld1 faddp %ST(1) * add %ECX, 8 fld QWORD PTR [%ECX] fldz faddp %ST(1) * mov %ECX, %EAX fxch %ST(1) fstp QWORD PTR [%ECX] *** add %EAX, 8 fstp QWORD PTR [%EAX] ret Now it is compiled to: _Z6addoneRK7complex: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] fld QWORD PTR [%ECX] fld1 faddp %ST(1) fld QWORD PTR [%ECX + 8] fldz faddp %ST(1) fxch %ST(1) fstp QWORD PTR [%EAX] fstp QWORD PTR [%EAX + 8] ret Other programs should see similar improvements, across the board. Note that in addition to reducing instruction count, this also reduces register pressure a lot, always a good thing on X86. :) llvm-svn: 11819	2004-02-25 06:13:04 +00:00
Chris Lattner	10d08a2955	Add a helper to create an addressing mode given all of the pieces. llvm-svn: 11818	2004-02-25 06:01:07 +00:00
Chris Lattner	c0e2bc0250	add an inefficient way of folding structure and constant array indexes together into a single LEA instruction. This should improve the code generated for things like X->A.B.C[12].D. The bigger benefit is still coming though. Note that this uses an LEA instruction instead of an add, giving the register allocator more freedom. We should probably never generate ADDri32's. llvm-svn: 11817	2004-02-25 03:45:50 +00:00
Chris Lattner	969f90db77	Implement special case for storing an immediate into memory so that we don't need an intermediate register. llvm-svn: 11816	2004-02-25 02:56:58 +00:00
Chris Lattner	9036c86b14	Add support for 'rename' llvm-svn: 11813	2004-02-24 22:17:00 +00:00
Chris Lattner	57ee51ae0b	Make the verifier a little more explicit about this problem. llvm-svn: 11811	2004-02-24 22:06:07 +00:00
Chris Lattner	d9652be664	Add support for remove, fwrite, and fread Also fix problem where we didn't check to see if a node pointer was null. Though fclose(null) doesn't make a lot of sense, 300.twolf does it. llvm-svn: 11810	2004-02-24 22:02:48 +00:00
Brian Gaeke	eae0364189	FunctionLiveVarInfo.h moved: include/llvm/CodeGen -> lib/Target/Sparc/LiveVar llvm-svn: 11804	2004-02-24 19:46:00 +00:00
Chris Lattner	9da41150e8	Fix some unexpected fallout from the config.h changes. Because the CBE no longer was getting this #include, it always fell back on the less precise floating point initializer values, causing some testsuite failures. llvm-svn: 11803	2004-02-24 18:34:10 +00:00
Chris Lattner	fc15346b60	Fix a faulty optimization on FP values llvm-svn: 11801	2004-02-24 18:10:14 +00:00
Chris Lattner	7845e4f7f0	If a block is made dead, make sure to promptly remove it. llvm-svn: 11799	2004-02-24 16:09:21 +00:00
Alkis Evlogimenos	6d7150e9bb	Move machine code rewriter and spiller outside the register allocator. The implementation is completely rewritten and now employs several optimizations not exercised before. For example for 164.gzip we have 997 loads and 699 stores vs the 1221 loads and 880 stores we have before. llvm-svn: 11798	2004-02-24 08:58:30 +00:00
Chris Lattner	d678669018	Implement SimplifyCFG/switch_switch_fold.ll This case occurs many times in various benchmarks, especially when combined with the previous patch. This allows it to get stuff like: if (X == 4 \|\| X == 3) if (X == 5 \|\| X == 8) and switch (X) { case 4: case 5: case 6: if (X == 4 \|\| X == 5) llvm-svn: 11797	2004-02-24 07:23:58 +00:00
Alkis Evlogimenos	042f01039b	Add predicates for checking if a virtual register has a physical register mapping or a stack slot mapping. llvm-svn: 11795	2004-02-24 06:30:36 +00:00
Chris Lattner	1293e1d00c	Rearrange code a bit llvm-svn: 11793	2004-02-24 05:54:22 +00:00
Chris Lattner	e5db7dc4c6	Implement: test/Regression/Transforms/SimplifyCFG/switch_create.ll This turns code like this: if (X == 4 \| X == 7) and if (X != 4 & X != 7) into switch instructions. llvm-svn: 11792	2004-02-24 05:38:11 +00:00
Alkis Evlogimenos	0d0db88889	Make enum private as it is an implementation detail. llvm-svn: 11782	2004-02-23 23:49:40 +00:00
Alkis Evlogimenos	9344a740be	Remove '4Virt' from member function names as it is obvious. llvm-svn: 11781	2004-02-23 23:47:10 +00:00
Alkis Evlogimenos	d192266264	Refactor VirtRegMap out of RegAllocLinearScan as the first part of bug 251 (providing a generic machine code rewriter/spiller). llvm-svn: 11780	2004-02-23 23:08:11 +00:00
Chris Lattner	78800ae270	Generate much more efficient code in programs like pifft llvm-svn: 11775	2004-02-23 21:46:58 +00:00
Chris Lattner	7fa6519e07	Fix a small typeo in my checkin last night that broke vortex and other programs :( llvm-svn: 11774	2004-02-23 21:46:42 +00:00
Chris Lattner	253f77f2a7	Fix InstCombine/2004-02-23-ShiftShiftOverflow.ll Also, turn 'shr int %X, 1234' into 'shr int %X, 31' llvm-svn: 11768	2004-02-23 20:30:06 +00:00
Alkis Evlogimenos	34f28e5d3f	Add number of spilled registers statistic. llvm-svn: 11759	2004-02-23 18:45:32 +00:00
Chris Lattner	82e1a3657d	Fix bugs in finegrainification llvm-svn: 11758	2004-02-23 18:40:08 +00:00
Chris Lattner	1bf9dde4a1	Finegrainify namespacification llvm-svn: 11757	2004-02-23 18:38:20 +00:00
Alkis Evlogimenos	82a1d7d30e	Use MachineBasicBlock::getParent(). llvm-svn: 11756	2004-02-23 18:36:38 +00:00
Alkis Evlogimenos	2863fbd178	Remove implementation of default constructor as it is useless now. llvm-svn: 11755	2004-02-23 18:28:35 +00:00
Alkis Evlogimenos	9b103024ef	Refactor rewinding code for finding the first terminator of a basic block into MachineBasicBlock::getFirstTerminator(). This also fixes a bug in the implementation of the above in both RegAllocLocal and InstrSched, where instructions where added after the terminator if the basic block's only instruction was a terminator (it shouldn't matter for RegAllocLocal since this case never occurs in practice). llvm-svn: 11748	2004-02-23 18:14:48 +00:00
Chris Lattner	40e15a6000	Simplify code a bit, don't go off the end of the block, now that the current block we are in might be empty llvm-svn: 11744	2004-02-23 07:42:19 +00:00
Chris Lattner	28e4e925eb	We were forgetting to add FP_REG_KILL instructions to basic blocks which will eventually get an assignment due to elimination of PHIs. llvm-svn: 11743	2004-02-23 07:29:45 +00:00
Chris Lattner	74418a30aa	Implement cast.ll::test14/15 llvm-svn: 11742	2004-02-23 07:16:20 +00:00
Chris Lattner	a65e5e3df1	Refactor some code. In the mul - setcc folding case, we really care about whether this is the sign bit or not, so check unsigned comparisons as well. llvm-svn: 11740	2004-02-23 06:38:22 +00:00
Alkis Evlogimenos	50598d1135	Improved PhysRegTracker interface. RegAlloc lazily allocates the register tracker using a std::auto_ptr llvm-svn: 11738	2004-02-23 06:10:13 +00:00
Chris Lattner	9ecc3fc3c1	Implement mul.ll:test11 llvm-svn: 11737	2004-02-23 06:00:11 +00:00
Chris Lattner	51b37305d9	Implement "strength reduction" of X <= C and X >= C llvm-svn: 11735	2004-02-23 05:47:48 +00:00
Chris Lattner	c31a2e26ab	Implement InstCombine/mul.ll:test10, which is a case that occurs when dealing with "predication" llvm-svn: 11734	2004-02-23 05:39:21 +00:00
Alkis Evlogimenos	99af6ca36b	Simplify iterator usage now that we have next(). Also don't pass iterators by reference now that MachineInstr* are in an ilist llvm-svn: 11732	2004-02-23 04:12:30 +00:00
Chris Lattner	b200638dc4	Work around a gas bug. Print '-9223372036854775808' as unsigned. llvm-svn: 11729	2004-02-23 03:27:05 +00:00
Chris Lattner	85f13fae06	Implement cast fp -> bool llvm-svn: 11728	2004-02-23 03:21:41 +00:00
Chris Lattner	795ca35cde	Stop passing iterators around by reference now that we have ilists! Implement cast Type::ULongTy -> double llvm-svn: 11726	2004-02-23 03:10:10 +00:00
Alkis Evlogimenos	976f485826	Some code cleanups from Chris llvm-svn: 11724	2004-02-23 01:57:39 +00:00
Alkis Evlogimenos	1525e120a6	Fix comments in PhysRegTracker and rename isPhysRegAvail to isRegAvail to be consistent with the other two llvm-svn: 11723	2004-02-23 01:25:05 +00:00
Chris Lattner	f9acb33dfd	Add a new cmove instruction llvm-svn: 11722	2004-02-23 01:16:05 +00:00
Alkis Evlogimenos	ee3ef42726	Move LiveIntervals.h up to be the first included header llvm-svn: 11721	2004-02-23 01:01:21 +00:00
Alkis Evlogimenos	ba2b9aec71	Pull PhysRegTracker out of RegAllocLinearScan as it can be used by other allocators as well llvm-svn: 11720	2004-02-23 00:53:31 +00:00
Alkis Evlogimenos	850bd0819f	Move LiveIntervals.h to lib/CodeGen since it shouldn't be exposed to other parts of the compiler llvm-svn: 11719	2004-02-23 00:50:15 +00:00
Chris Lattner	cf8db3e8aa	Only insert FP_REG_KILL instructions in MachineBasicBlocks that actually use FP instructions. This reduces the number of instructions inserted in 176.gcc (for example) from 58074 to 101 (it doesn't use much FP, which is typical). This reduction speeds up the entire code generator. In the case of 176.gcc, llc went from taking 31.38s to 24.78s. The passes that sped up the most are the register allocator and the 2 live variable analysis passes, which sped up 2.3, 1.3, and 1.5s respectively. The asmprinter pass also sped up because it doesn't print the instructions in comments :) Note that this patch is likely to expose latent bugs in machine code passes, because now basicblock can be empty, where they were never empty before. I cleaned out regalloclocal, but who knows about linscan :) llvm-svn: 11717	2004-02-22 19:47:26 +00:00
Chris Lattner	5485375e5d	Another bug fix for empty MBB's llvm-svn: 11716	2004-02-22 19:37:31 +00:00
Alkis Evlogimenos	7f7d70a53c	Move MOTy::UseType enum into MachineOperand. This eliminates the switch statements in the constructors and simplifies the implementation of the getUseType() member function. You will have to specify defs using MachineOperand::Def instead of MOTy::Def though (similarly for Use and UseAndDef). llvm-svn: 11715	2004-02-22 19:23:26 +00:00
Chris Lattner	56a7886c8a	Fix a bug where we were implicitly assuming that there would be at least one terminator instruction in each basic block. llvm-svn: 11714	2004-02-22 19:08:15 +00:00
Chris Lattner	cc9a188e0a	Reduce the number of pointless copies inserted due to constant pointer refs. Also, make an assertion actually fireable! llvm-svn: 11713	2004-02-22 17:35:42 +00:00
Chris Lattner	ed03319931	Fix bug in previous checkout: leave the iterator at the first instruction AFTER the GEP that was emitted. :( llvm-svn: 11712	2004-02-22 17:05:38 +00:00
Chris Lattner	ade64c9839	Completely rewrite how getelementptr instructions are expanded. This has two (minor) benefits right now: 1. An extra dummy MOVrr32 is gone. This move would often be coallesced by both allocators anyway. 2. The code now uses the gep_type_iterator to walk the gep, which should future proof it a bit. It still assumes that array indexes are Longs though. These don't really justify rewriting the code. The big benefit will come later though. llvm-svn: 11710	2004-02-22 07:04:00 +00:00
Alkis Evlogimenos	6998610eda	When folding memory operands in machine instructions be careful to leave register operands with the same use/def flags as the original instruction. llvm-svn: 11709	2004-02-22 06:54:26 +00:00
Chris Lattner	63b79422f3	Fix a soon-to-be-missing #include llvm-svn: 11707	2004-02-22 06:26:17 +00:00
Chris Lattner	727748b382	Get all instruction definitions llvm-svn: 11706	2004-02-22 06:25:38 +00:00
Chris Lattner	3392d316e9	Wow this is out of date. When we have _real_ code generator documentation, this should be folded into it. llvm-svn: 11705	2004-02-22 05:53:54 +00:00
Alkis Evlogimenos	ba33a0ab9b	Print basic block boundaries in machine instruction debug output. llvm-svn: 11704	2004-02-22 05:46:04 +00:00
Chris Lattner	69bb1545d1	Implement Transforms/InstCombine/cast.ll:test13, a case which occurs in a hot 164.gzip loop. llvm-svn: 11702	2004-02-22 05:25:17 +00:00
Chris Lattner	cf8afa52b8	The two address pass cannot handle two addr instructions where one incoming value is a physreg and one is a virtreg. For this reason, disable copy folding entirely for physregs. Also, use the new isMoveInstr target hook which gives us folding of FP moves as well. llvm-svn: 11700	2004-02-22 04:44:58 +00:00
Alkis Evlogimenos	32d12d31ae	Abstract merging of ranges away from number of slots per instruction. Also make it less aggressive as the current implementation breaks in some cases. llvm-svn: 11696	2004-02-22 04:05:13 +00:00
Chris Lattner	573441bfbd	Use isNull instead of getNode() to test for existence of a node, this is cheaper. FIX MAJOR BUG, whereby we didn't merge null edges correctly. Correcting this fixes poolallocation on 175.vpr, and possibly others. llvm-svn: 11695	2004-02-22 00:53:54 +00:00
Chris Lattner	7bff00313e	Fix an iterator invalidation problem which was causing some nodes to not be correctly merged over! llvm-svn: 11693	2004-02-21 22:28:26 +00:00
Chris Lattner	458704f675	Use handy method llvm-svn: 11692	2004-02-21 22:27:31 +00:00
Misha Brukman	642275dc4e	`cat' is usually in /bin, not /usr/bin, at least on our systems. llvm-svn: 11690	2004-02-21 21:51:41 +00:00
Chris Lattner	7448a9867b	When printing a stack trace, demangle it if possible. Since we are potentially in a signal handler, allocating memory or doing other unsafe things is bad, which means we should do it in a different process. llvm-svn: 11689	2004-02-21 21:06:19 +00:00
Alkis Evlogimenos	e39c21cc93	Make 'fold' statistic's description the same in both allocators. llvm-svn: 11687	2004-02-21 18:07:33 +00:00
Chris Lattner	dacc6a7448	Instead of cloning the globals for main into the globals graph at the end of BU propagation, clone the globals into the GG of EACH FUNCTION that finishes processing! The GlobalsGraph must include all globals and effects from all functions in the program. Fixing this makes pool allocation work better on 175.vpr, but it still ultimately crashes. llvm-svn: 11686	2004-02-21 00:30:28 +00:00
Chris Lattner	0f9200e5a5	There is no need to merge the globals graph into the function graphs at the end of the BU and CBU passes. The globals will be marked incomplete, so it doesn't matter if they are missing some info, and merging isn't guaranteed to bring everything in anyway! llvm-svn: 11684	2004-02-20 23:52:15 +00:00
Chris Lattner	31f8fa66cb	Add two missing returns, which caused us to be very pessimistic about the printf and scanf families! llvm-svn: 11683	2004-02-20 23:27:09 +00:00
Alkis Evlogimenos	43431e117e	Some more statistics improvements. llvm-svn: 11676	2004-02-20 20:53:26 +00:00
Alkis Evlogimenos	6d57ed784f	Disambiguate statistic descriptions. llvm-svn: 11675	2004-02-20 20:46:49 +00:00
Alkis Evlogimenos	70f547d06e	Rename statistic and add another one. llvm-svn: 11674	2004-02-20 20:43:08 +00:00
Chris Lattner	4653c4cb50	Add support for some string functions, the scanf family, and sprintf llvm-svn: 11673	2004-02-20 20:27:11 +00:00
Alkis Evlogimenos	168e4bf455	Fix crash in debug output. llvm-svn: 11659	2004-02-20 06:41:12 +00:00
Brian Gaeke	688940d763	Use backtrace() and include execinfo.h, if they were detected by autoconf. llvm-svn: 11658	2004-02-20 06:40:59 +00:00
Alkis Evlogimenos	16744f2860	Fix instruction numbering in debug output. llvm-svn: 11655	2004-02-20 06:29:51 +00:00
Alkis Evlogimenos	f32239a5c6	Too many changes in one commit: 1. LiveIntervals now implement a 4 slot per instruction model. Load, Use, Def and a Store slot. This is required in order to correctly represent caller saved register clobbering on function calls, register reuse in the same instruction (def resues last use) and also spill code added later by the allocator. The previous representation (2 slots per instruction) was insufficient and as a result was causing subtle bugs. 2. Fixes in spill code generation. This was the major cause of failures in the test suite. 3. Linear scan now has core support for folding memory operands. This is untested and not enabled (the live interval update function does not attempt to fold loads/stores in instructions). 4. Lots of improvements in the debugging output of both live intervals and linear scan. Give it a try... it is beautiful :-) In summary the above fixes all the issues with the recent reserved register elimination changes and get the allocator very close to the next big step: folding memory operands. llvm-svn: 11654	2004-02-20 06:15:40 +00:00
Chris Lattner	b24f30de8d	It is totally unacceptable to print out (literally) millions of zeros when compiling 129.compress... so don't! llvm-svn: 11649	2004-02-20 05:49:22 +00:00
Chris Lattner	0d3df27b4c	Disable the stack trace thing until we can get an autoconf test for it. This call breaks on sparcs llvm-svn: 11635	2004-02-19 21:21:23 +00:00
Chris Lattner	6deffd7154	Implement new function llvm-svn: 11631	2004-02-19 20:03:14 +00:00
Alkis Evlogimenos	eed4727191	Fix RA::verifyAssignment() llvm-svn: 11629	2004-02-19 19:24:17 +00:00
Chris Lattner	c37073f249	Fix problem fusing spill code into instructions: we didn't update the live variable information to take into account the change of instruction address. llvm-svn: 11628	2004-02-19 18:34:02 +00:00
Chris Lattner	266206caed	Fix an iterator invalidation problem. :( llvm-svn: 11627	2004-02-19 18:32:29 +00:00
Chris Lattner	6b0030105e	Add method to update livevar when an instruction moves llvm-svn: 11625	2004-02-19 18:28:02 +00:00
Chris Lattner	8a9be6b652	Fix a __LONG__ term annoyance of mine: symbolic registers weren't being printed by operator<< on MachineInstr's, and looking up what register "24" is all of the time was greatly annoying. llvm-svn: 11623	2004-02-19 16:17:08 +00:00
Chris Lattner	436ab13009	Add a MachineBasicBlock::getParent() method llvm-svn: 11622	2004-02-19 16:13:54 +00:00
Alkis Evlogimenos	59c646da40	Make ToolExecutionError inherit std::exception and implement its interface: getMessage() is gone, use what() instead. llvm-svn: 11621	2004-02-19 07:39:26 +00:00
Alkis Evlogimenos	fbaf7b3944	Print stacktrace in STDERR before dying on a fatal signal. Currently the symbols are not demangled. llvm-svn: 11620	2004-02-19 07:36:35 +00:00
Alkis Evlogimenos	68f40cbfd3	Rename reloads/spills to loads/stores. llvm-svn: 11619	2004-02-19 06:19:09 +00:00
Chris Lattner	2a293313ca	Add support for just running the code generator llvm-svn: 11611	2004-02-18 23:24:41 +00:00
Alkis Evlogimenos	310f641c9c	Implement assignment correctness verification. llvm-svn: 11609	2004-02-18 23:15:23 +00:00
Chris Lattner	5a22f3a7d1	indent correctly llvm-svn: 11601	2004-02-18 20:58:00 +00:00
Chris Lattner	eddc6ab2e7	Don't yell. BUGPOINT should yell, not the tool runner :) llvm-svn: 11600	2004-02-18 20:57:38 +00:00
Chris Lattner	a52c617fd7	If there is an error running a tool, include the error message (e.g. assertion failure) in the exception llvm-svn: 11597	2004-02-18 20:38:00 +00:00
Chris Lattner	72ce97cc95	When an error occurs executing a tool, we now throw an exception instead of calling exit(1). llvm-svn: 11593	2004-02-18 20:21:57 +00:00
Chris Lattner	e672426e5d	Eliminate operator[] is deprecated warnings llvm-svn: 11578	2004-02-18 16:43:51 +00:00
Chris Lattner	8acab89631	Fix deprecated operator[] warnings llvm-svn: 11577	2004-02-18 16:38:18 +00:00
Alkis Evlogimenos	7ec1bad952	Fix argument size for MOVSX and MOVZX instructions. llvm-svn: 11576	2004-02-18 16:20:40 +00:00
Alkis Evlogimenos	c394f7803b	Be more agressive when joining ranges. llvm-svn: 11575	2004-02-18 04:38:37 +00:00
Alkis Evlogimenos	5c90efed55	Fix overly conservative spill interval computation. llvm-svn: 11574	2004-02-18 03:35:38 +00:00
Alkis Evlogimenos	7290ef1f5c	Beautify debug output. llvm-svn: 11573	2004-02-18 00:35:06 +00:00
Chris Lattner	4fa2e7a67f	Fix PR245: Linking weak and strong global variables is dependent on link order llvm-svn: 11565	2004-02-17 21:56:04 +00:00
Chris Lattner	d9e1a49650	When we complete the bottom-up pass, make sure to merge the globals in 'main' into the globals graph. llvm-svn: 11562	2004-02-17 19:06:47 +00:00
Chris Lattner	f58d2dd6cf	Add support for GlobalAddress's for alkis llvm-svn: 11560	2004-02-17 18:23:55 +00:00
Alkis Evlogimenos	c6f0651e5c	These store to memory too. llvm-svn: 11558	2004-02-17 17:53:48 +00:00
Chris Lattner	ea22a3de13	Remove the -disable-kill option. The register allocator is buggy with it, and it was only for debugging in the first place. llvm-svn: 11557	2004-02-17 17:49:10 +00:00
Chris Lattner	88271db3bc	These store to memory, not read from it. llvm-svn: 11556	2004-02-17 17:46:50 +00:00
Alkis Evlogimenos	0528c59353	Instructiosn with 1 memory operand have 4 operands in our representation.. duh! llvm-svn: 11554	2004-02-17 15:58:13 +00:00
Alkis Evlogimenos	b1a61b72f2	Align case statements. llvm-svn: 11552	2004-02-17 15:50:41 +00:00
Alkis Evlogimenos	b815fd46ec	Add TEST and XCHG memory operand support. llvm-svn: 11550	2004-02-17 15:48:42 +00:00
Alkis Evlogimenos	32a5b0fd6c	Add OR and XOR memory operand support. llvm-svn: 11549	2004-02-17 15:33:14 +00:00
Alkis Evlogimenos	1e4b3b3c9b	Peephole optimize SUBmi{16,32} into SUBmi{16,32}b when immediate is 8 bits wide. llvm-svn: 11548	2004-02-17 15:14:29 +00:00
Alkis Evlogimenos	4f22bb4d4b	ADDmi{16,32} should be in the next case statement. llvm-svn: 11547	2004-02-17 15:10:11 +00:00
Alkis Evlogimenos	135c4faa55	Add memory operand folding support for MUL, DIV, IDIV, NEG, NOT, MOVSX, and MOVZX. llvm-svn: 11546	2004-02-17 09:14:23 +00:00
Alkis Evlogimenos	e7bbd1c2fb	Add memory operand folding for CMP{rm,mr,mi}{8,16,32}, INCm{8,16,32} and DECm{8,16,32} instructions. llvm-svn: 11545	2004-02-17 08:49:20 +00:00
Alkis Evlogimenos	d7e3cc8d65	Add CMP{rm,mr,mi}{8,16,32}, INCm{8,16,32} and DECm{8,16,32} instructions. llvm-svn: 11544	2004-02-17 08:49:00 +00:00
Alkis Evlogimenos	638db7b5aa	Add SUB{rm,mr,mi}{8,16,32} instructions. llvm-svn: 11543	2004-02-17 08:17:40 +00:00
Chris Lattner	a4e0020e54	Add support to the local allocator for fusing spill code into the instructions that need them. This is very useful on CISCy targets like the X86 because it reduces the total spill pressure, and makes better use of it's (large) instruction set. Though the X86 backend doesn't know how to rewrite many instructions yet, this already makes a substantial difference on 176.gcc for example: Before: Time: 8.0099 ( 31.2%) 0.0100 ( 12.5%) 8.0199 ( 31.2%) 7.7186 ( 30.0%) Local Register Allocator Code quality: 734559 asm-printer - Number of machine instrs printed 111395 ra-local - Number of registers reloaded 79902 ra-local - Number of registers spilled 231554 x86-peephole - Number of peephole optimization performed After: Time: 7.8700 ( 30.6%) 0.0099 ( 19.9%) 7.8800 ( 30.6%) 7.7892 ( 30.2%) Local Register Allocator Code quality: 733083 asm-printer - Number of machine instrs printed 2379 ra-local - Number of reloads fused into instructions 109046 ra-local - Number of registers reloaded 79881 ra-local - Number of registers spilled 230658 x86-peephole - Number of peephole optimization performed So by fusing 2300 instructions, we reduced the static number of instructions by 1500, and reduces the number of peepholes (and thus the work) by about 900. This also clearly reduces the number of reload/spill instructions that are emitted. llvm-svn: 11542	2004-02-17 08:09:40 +00:00
Alkis Evlogimenos	5aa39e1583	Add support for folding memory operands for ADC, SBB and SUB instructions. llvm-svn: 11541	2004-02-17 08:08:51 +00:00
Alkis Evlogimenos	28691e063b	Add support for ADC{rm.mr}32 and SBB{rm,mr}32. llvm-svn: 11540	2004-02-17 08:06:31 +00:00
Chris Lattner	eb1428d581	Add a (hidden) option to print instructions that fail to fuse. It's looking like compares and test's would be the next huge win... llvm-svn: 11539	2004-02-17 08:03:47 +00:00
Alkis Evlogimenos	19248dd757	Add support for folding memory operands in MOVri{8,16,32} instructions. llvm-svn: 11538	2004-02-17 07:47:20 +00:00
Chris Lattner	c4ea4d12bf	Expand the repertoire of the forms we can print and encode. llvm-svn: 11537	2004-02-17 07:40:44 +00:00
Chris Lattner	029dec8f3e	Disable this peephole for now. We can't keep track of the fact that the immediate is 8 bits, but the memory reference is full sized. llvm-svn: 11536	2004-02-17 07:36:32 +00:00
Chris Lattner	f93308ec17	Fix a bug in my previous refactoring change... arg! llvm-svn: 11535	2004-02-17 07:02:17 +00:00
Chris Lattner	62f67310f1	The C backend is no longer in llvm-dis, it's in llc llvm-svn: 11533	2004-02-17 06:40:06 +00:00
Chris Lattner	a9493ad718	Add an option to disable spill fusing in the X86 backend llvm-svn: 11531	2004-02-17 06:30:34 +00:00
Chris Lattner	d4b2f4ef32	Fix the mneumonics for the mov instructions to have the source and destination order in the correct sense!! Arg! llvm-svn: 11530	2004-02-17 06:28:19 +00:00
Chris Lattner	5757579731	Fix the last crimes against nature that used the 'ir' ordering to use the 'ri' ordering instead... no it's not possible to store a register into an immediate! llvm-svn: 11529	2004-02-17 06:24:02 +00:00
Chris Lattner	4682990fa5	GRRR. Move instructions have swapped the order of the r/m operands. llvm-svn: 11528	2004-02-17 06:20:20 +00:00
Chris Lattner	16666f8bd2	Rename MOVi[mr] instructions to MOV[rm]i llvm-svn: 11527	2004-02-17 06:16:44 +00:00
Chris Lattner	1db99b1949	Whoops, got my cases swapped. llvm-svn: 11526	2004-02-17 06:02:15 +00:00
Chris Lattner	e227ae6b88	Change to match the newer, simpler, interface llvm-svn: 11525	2004-02-17 05:54:57 +00:00
Chris Lattner	b82bb37952	Add support for folding memory operands into AND and IMUL's llvm-svn: 11523	2004-02-17 05:46:06 +00:00
Chris Lattner	48e19d8b8e	Scrunchify code, by adding helpers. No functionality changes. llvm-svn: 11522	2004-02-17 05:35:13 +00:00
Chris Lattner	9751eb8ab9	Add mem forms of AND instructions llvm-svn: 11521	2004-02-17 05:25:50 +00:00
Alkis Evlogimenos	c4ec9111bb	Add API to check and fold memory operands into instructions. llvm-svn: 11519	2004-02-17 04:33:18 +00:00
Chris Lattner	3c514e8a54	Rename the IMULri* instructions to IMULrri, as they are actually three address instructions. Add forms of these instructions that read from memory llvm-svn: 11518	2004-02-17 04:26:43 +00:00
Chris Lattner	bb0cfb0429	Once we have a way to fold spill code reloads into instructions, we have a way to use it. :) llvm-svn: 11517	2004-02-17 04:08:37 +00:00
Alkis Evlogimenos	c6ea9a6b65	Fix spilled interval update. It was too conservative. llvm-svn: 11516	2004-02-17 04:04:20 +00:00
Chris Lattner	7b3342d814	Refactor code a bit. No functionality changes, though the comment hints at things to come. llvm-svn: 11515	2004-02-17 03:57:19 +00:00
Chris Lattner	28ab65caf2	Adjust to recent changes llvm-svn: 11514	2004-02-17 03:03:47 +00:00
Alkis Evlogimenos	501e24b28a	Add peephole optimizations for ADD [MEM], IMM8 instructions. llvm-svn: 11511	2004-02-16 23:50:18 +00:00
Alkis Evlogimenos	657876c656	Add two more variants of add. Update comments. llvm-svn: 11510	2004-02-16 23:48:42 +00:00
Chris Lattner	2c0736be99	Only spit out warning for functions that take pointers, not for sin and the like Add more special case handling for stdio functions. I feel dirty, how about you? llvm-svn: 11506	2004-02-16 22:57:19 +00:00
Chris Lattner	11c0e5b684	Move the folding of gep null, 0, 0, 0 to a place where it can be shared and enjoyed by all, fixing a fixme. Add an assert llvm-svn: 11505	2004-02-16 20:46:13 +00:00
Chris Lattner	513e4c7bd9	memset and bcopy and now unified by the llvm.memset intrinsic llvm-svn: 11503	2004-02-16 18:37:40 +00:00
Chris Lattner	d8cc48da34	Add some ADD instructions that take memory operands for Alkis llvm-svn: 11502	2004-02-16 18:19:31 +00:00
Alkis Evlogimenos	790b000aa7	Add LeakDetection to MachineInstr. Move out of line member functions of MachineBasicBlock to MachineBasicBlock.cpp. llvm-svn: 11497	2004-02-16 07:17:43 +00:00
Chris Lattner	f51bbb7eec	Implement test/Regression/Transforms/SimplifyCFG/UncondBranchToReturn.ll, see the testcase for the reasoning. llvm-svn: 11496	2004-02-16 06:35:48 +00:00
Chris Lattner	9affa63dc6	Fold PHI nodes of constants which are only used by a single cast. This implements phi.ll:test4 llvm-svn: 11494	2004-02-16 05:07:08 +00:00
Chris Lattner	011b98cec4	Teach LLVM to unravel the "swap idiom". This implements: Regression/Transforms/InstCombine/xor.ll:test20 llvm-svn: 11492	2004-02-16 03:54:20 +00:00
Chris Lattner	71154f0931	Implement Transforms/InstCombine/xor.ll:test19 llvm-svn: 11490	2004-02-16 01:20:27 +00:00
Chris Lattner	b4b7a985fc	Fix a bug in the recent rewrite of the leakdetector that caused all of the nightly tests to be really messed up. The problem was that the new leakdetector was depending on undefined behavior: the order of destruction of static objects. llvm-svn: 11488	2004-02-15 23:33:48 +00:00
Chris Lattner	336c99d138	Now that the lowerinvoke pass inserts calls to llvm.setjmp/llvm.longjmp, some hacks can be banished. Also, this gives us the opportunity to emit special code for the setjmp/longjmps which alows the elimination of one GCC warning for every setjmp/longjmp site (which is often THOUSANDS in C++ programs). Yaay! llvm-svn: 11484	2004-02-15 22:51:47 +00:00
Chris Lattner	b5914223b8	By default, llvm.setjmp/llvm.longjmp intrinsics get lowered to their libc counterparts llvm-svn: 11483	2004-02-15 22:24:51 +00:00
Chris Lattner	c224c03911	Instead of producing calls to setjmp/longjmp, produce uses of the llvm.setjmp/llvm.longjmp intrinsics. llvm-svn: 11482	2004-02-15 22:24:27 +00:00
Chris Lattner	ded1ad2846	Refactor code. Now the intrinsic lowering pass tries to recycle preexisting prototypes, even if they don't precisely match what it would prefer to use. This fixes: CBackend/2004-02-15-PreexistingExternals.llx compiling it into: ltmp_0_30 = memcpy(l14_C, 4u, 17); ltmp_1_30 = memcpy(((int *)l27_A), ((unsigned )(long)l27_B), ((int )123u)); instead of: ltmp_0_30 = memcpy(l14_C, 4u, 17); ltmp_1_27 = l43_memcpy(l27_A, l27_B, 123u); Which does the wrong thing as you could imagine. llvm-svn: 11481	2004-02-15 22:16:39 +00:00
Alkis Evlogimenos	4a2e2b54f6	This pass should not require phi elimination or live variable analysis. It should only preserve them and update LiveVariables if it already ran. llvm-svn: 11479	2004-02-15 21:50:32 +00:00
Chris Lattner	787d6b86e3	Finegrainify namespacification Remove one of the operands of a two operand instruction llvm-svn: 11478	2004-02-15 21:38:28 +00:00
Alkis Evlogimenos	84318d7bb6	Make dense maps keyed on physical registers smallerusing MRegisterInfo::getNumRegs() instead of MRegisterInfo::FirstVirtualRegister. Also use MRegisterInfo::is{Physical,Virtual}Register where appropriate. llvm-svn: 11477	2004-02-15 21:37:17 +00:00
Alkis Evlogimenos	1026a01d5a	Eliminate the use of spill (reserved) registers. llvm-svn: 11476	2004-02-15 10:24:21 +00:00
Chris Lattner	28131460da	Adjustments to support the new ConstantAggregateZero class llvm-svn: 11474	2004-02-15 05:55:15 +00:00
Chris Lattner	99ed412516	Add support for the new ConstantAggregateZero class llvm-svn: 11473	2004-02-15 05:54:27 +00:00
Chris Lattner	2679a58a61	Make the JIT zero out globals with memset instead of an element at a time. This should speed it up a bit on a lot of programs llvm-svn: 11472	2004-02-15 05:54:06 +00:00
Chris Lattner	cd6a595db5	No need to scan zero initializers. This should make DSA a bit faster. llvm-svn: 11471	2004-02-15 05:53:42 +00:00
Chris Lattner	363cd9e4b8	Add a new ConstantAggregateZero class, to fix PR239. This makes zero initializers for constant structs and arrays take constant space, instead of space proportinal to the number of elements. This reduces the memory usage of the LLVM compiler by hundreds of megabytes when compiling some nasty SPEC95 benchmarks. llvm-svn: 11470	2004-02-15 05:53:04 +00:00
Chris Lattner	0f819b9523	ConstantArray::get and ConstantStruct::get now just return pointers to 'Constant', instead of specific subclass pointers. In the future, these will return an instance of ConstantAggregateZero if all of the inputs are zeros. llvm-svn: 11467	2004-02-15 04:14:47 +00:00

... 2 3 4 5 6 ...

5725 Commits