llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 23:42:52 +01:00

Author	SHA1	Message	Date
Alkis Evlogimenos	94cab18bdc	Change interface so that we can add to the end of a basic block without getting an assertion from ilist that we are dereferencing ilist<T>::end(). llvm-svn: 11345	2004-02-12 08:11:04 +00:00
Brian Gaeke	7d1652ca5c	Remove this MachineOpCodeFlags assertion - its test can never be false. llvm-svn: 11342	2004-02-12 04:01:07 +00:00
Alkis Evlogimenos	b755d35fd2	Change MachineBasicBlock's vector of MachineInstr pointers into an ilist of MachineInstr objects. This allows constant time removal and insertion of MachineInstr instances from anywhere in each MachineBasicBlock. It also allows for constant time splicing of MachineInstrs into or out of MachineBasicBlocks. llvm-svn: 11340	2004-02-12 02:27:10 +00:00
Brian Gaeke	4d67c9770d	getOpCode() --> getOpcode(). llvm-svn: 11339	2004-02-12 01:34:05 +00:00
Brian Gaeke	116a667130	MachineInstr::getOpCode() --> getOpcode() in SPARC back-end. llvm-svn: 11335	2004-02-11 20:47:34 +00:00
Chris Lattner	2a50f8218a	s/getOpCode/getOpcode llvm-svn: 11332	2004-02-11 19:26:28 +00:00
Chris Lattner	ffd9c8a00b	Cosmetic improvements to this option. llvm-svn: 11331	2004-02-11 19:14:04 +00:00
Chris Lattner	66e7209785	Actually load profiling information now! Block layout can use real, live, actual profile info, and works! :) llvm-svn: 11324	2004-02-11 18:21:05 +00:00
Chris Lattner	16ad30203c	Fix a typeo llvm-svn: 11323	2004-02-11 18:20:41 +00:00
Alkis Evlogimenos	fb5edb0b49	Increase constness. llvm-svn: 11322	2004-02-11 17:55:09 +00:00
Chris Lattner	36f91c80f4	Fix copy-and-pastos llvm-svn: 11319	2004-02-11 06:10:18 +00:00
Chris Lattner	c456d0cbc7	Add skeleton profileinfoloader pass. This will be enhanced to actually LOAD a profile tommorow. :) llvm-svn: 11318	2004-02-11 06:10:05 +00:00
Alkis Evlogimenos	4b4342e865	Remove assert as the only integer registers on the sparc are physical. llvm-svn: 11317	2004-02-11 06:04:51 +00:00
Alkis Evlogimenos	30c37082ae	Fix previous broken commit. A MachineOperand may have opType == MO_VirtualRegister but if the register number is one of a physical register is it considered as a physical register. llvm-svn: 11315	2004-02-11 05:55:00 +00:00
Chris Lattner	427c8ee657	Factor this code out of llvm-prof llvm-svn: 11314	2004-02-11 05:54:25 +00:00
Chris Lattner	5730220037	Remove obsolete comment. Unreachable blocks will automatically be left at the end of the function. llvm-svn: 11313	2004-02-11 05:20:50 +00:00
Chris Lattner	ca52c22356	Add an _embarassingly simple_ implementation of basic block layout. This is more of a testcase for profiling information than anything that should reasonably be used, but it's a starting point. When I have more time I will whip this into better shape. llvm-svn: 11311	2004-02-11 04:53:20 +00:00
Alkis Evlogimenos	d2edafbc32	Remove assert as it is meaningless. MachineOperands can be tagged as MO_VirtualRegister but actually be representing a physical register. llvm-svn: 11310	2004-02-11 04:52:30 +00:00
Chris Lattner	c9900e2eb3	Make sure to register the 'no profile' implementation as the default for ProfileInfo llvm-svn: 11309	2004-02-11 04:47:54 +00:00
Chris Lattner	f8077b7801	Simplify implementation, and probably speed things up too. llvm-svn: 11308	2004-02-11 03:57:16 +00:00
Chris Lattner	4418633216	Implement SimplifyCFG/PhiEliminate.ll Having a proper 'select' instruction would allow the elimination of a lot of the special case cruft in this patch, but we don't have one yet. llvm-svn: 11307	2004-02-11 03:36:04 +00:00
Chris Lattner	ceb4bfebcc	Initialize the count instance variable. llvm-svn: 11305	2004-02-11 03:29:16 +00:00
Chris Lattner	ea85b486f1	Expose the "Other" value type to tablegen targets llvm-svn: 11304	2004-02-11 03:08:45 +00:00
Chris Lattner	43553da3a0	Remove obsolete method llvm-svn: 11302	2004-02-11 01:17:33 +00:00
Chris Lattner	754914b142	The hasConstantReferences predicate always returns false. llvm-svn: 11301	2004-02-11 01:17:07 +00:00
Chris Lattner	a81258a786	An initial implementation of an LLVM ProfileInfo class which is designed to eventually allow Passes to use profiling information to direct them. llvm-svn: 11294	2004-02-10 22:11:42 +00:00
Chris Lattner	db2c8941a5	Add #include llvm-svn: 11285	2004-02-10 21:18:55 +00:00
Chris Lattner	37e040ee03	Do not use MachineOperand::isVirtualRegister either! llvm-svn: 11283	2004-02-10 21:12:22 +00:00
Chris Lattner	d64519766e	Stop using this method llvm-svn: 11282	2004-02-10 21:12:06 +00:00
Chris Lattner	c9f1da7374	Remove uses of MachineOperand::isVirtualRegister llvm-svn: 11281	2004-02-10 20:55:47 +00:00
Chris Lattner	0d6d4075e3	Remvoe use of MO.isVirtualRegister(), turn an assertion into an assert() llvm-svn: 11280	2004-02-10 20:47:24 +00:00
Chris Lattner	9d0a4e63b1	Eliminate users of MachineOperand::isPhysicalRegister llvm-svn: 11278	2004-02-10 20:41:10 +00:00
Chris Lattner	6fb75586ba	Remove use of isPhysicalRegister llvm-svn: 11277	2004-02-10 20:35:42 +00:00
Chris Lattner	d58b38eeca	Don't use MachineOperator::is(Phys\|Virt)Register llvm-svn: 11276	2004-02-10 20:31:28 +00:00
Chris Lattner	421302bc54	Tighten up checks llvm-svn: 11274	2004-02-10 20:25:13 +00:00
Chris Lattner	d2c679e5f6	initialization calls now return argc. If the program uses the argc value passed into main, make sure they use the return value of the init call instead of the one passed in. llvm-svn: 11262	2004-02-10 17:41:01 +00:00
Chris Lattner	a62e6a4952	Fix PR228: [sparc] Boolean constants are emitted as true and false I will observe that the concept of using WriteAsOperand is completely broken, but then we all knew that, didn't we? llvm-svn: 11255	2004-02-10 05:16:44 +00:00
Misha Brukman	ac167832e7	Doxygenify comments. llvm-svn: 11252	2004-02-09 23:18:42 +00:00
Chris Lattner	e961c1189f	Only add the global variable with the abort message if an unwind actually occurs in the program. llvm-svn: 11249	2004-02-09 22:48:47 +00:00
John Criswell	c1fa74809a	Fix PR#226: When emitting padding, always emit it as bytes. Bytes can be placed into any alignment situation. llvm-svn: 11247	2004-02-09 22:15:33 +00:00
Chris Lattner	034a264a99	It turns out that the two dimensional vectors were causing big slowdowns in this for programs with lots of types (like the testcase in PR224). The problem was that the type ID that the outer vector was using was not very dense (as many types are getting resolved), so the vector is large and gets reallocated a lot. Since there are a lot of values in the program (the .ll file is 10M), each reallocation has to copy the subvectors, which is also quite slow (this wouldn't be a problem if C++ supported move semantics, but it doesn't, at least not yet :( Changing the outer data structure to a map speeds a release build of llvm-as up from 11.21s to 5.13s on the testcase in PR224. llvm-svn: 11244	2004-02-09 21:03:38 +00:00
Chris Lattner	ef4a05f780	Remove the statistics llvm-svn: 11243	2004-02-09 21:01:23 +00:00
Chris Lattner	6deb7ad03b	Speed up type resolution some more. On the testcase in PR224, for example, this speeds up a release llvm-as from 21.95s to 11.21s, because before it would do an expensive traversal of the type-graph of every type resolved. llvm-svn: 11242	2004-02-09 20:23:44 +00:00
Chris Lattner	7eb118e4f7	When resolving upreferences, if multiple uprefs will be resolved to the same type at the same time, resolve the upreferences to each other before resolving it to the outer type. This shaves off some time from the testcase in PR224, from 25.41s -> 21.72s. llvm-svn: 11241	2004-02-09 18:53:54 +00:00
Brian Gaeke	c86ce8b134	Move InstrSchedule's iterator begin/end methods inline. llvm-svn: 11239	2004-02-09 18:42:46 +00:00
Brian Gaeke	60268b265f	Make SchedGraph::dump() use SchedGraphNodeCommon's const_iterator instead of randomly groping about inside its outEdges array. Make SchedGraph::addDummyEdges() use getNumOutEdges() instead of outEdges.size(). Get rid of ifdefed-out code in SchedGraph::buildGraph(). llvm-svn: 11238	2004-02-09 18:42:05 +00:00
Chris Lattner	f7f7d11eac	Implement the hashing scheme in an attempt to speed up the "slow" case in type resolution. Unfortunately it doesn't help. Also delete some dead debugging code. llvm-svn: 11237	2004-02-09 18:32:40 +00:00
Chris Lattner	ada23acba4	This debugging hook is no longer needed. llvm-svn: 11233	2004-02-09 17:20:52 +00:00
Chris Lattner	57a6b877ba	Code cleanup in preparation for later changes. Now that ContainedTy's are consistent across the various type classes, we can factor out a LOT more almost-identical code. Also, add a couple of temporary statistics. llvm-svn: 11232	2004-02-09 16:35:14 +00:00
Chris Lattner	f231cb60f4	Now that all of the derived types have disciplined interfaces, we can eliminate all of the ad-hoc storage of contained types. This allows getContainedType to not be virtual, and allows us to entirely delete the TypeIterator class. llvm-svn: 11230	2004-02-09 05:40:24 +00:00
Chris Lattner	255269e677	Don't depend on auto data conversion llvm-svn: 11229	2004-02-09 05:16:30 +00:00
Chris Lattner	16690fad3d	Adjust to the changed StructType interface. In particular, getElementTypes() is gone. llvm-svn: 11228	2004-02-09 04:37:31 +00:00
Chris Lattner	a1757d1d91	Start using the new and improve interface to FunctionType arguments llvm-svn: 11224	2004-02-09 04:14:01 +00:00
Chris Lattner	2b3eaf045b	This #include is not needed, it should have been removed with the last patch llvm-svn: 11222	2004-02-09 03:22:32 +00:00
Chris Lattner	76292e7fcc	Instead of searching the entire type graph for a type to determine if it contains the type we are looking for, just search the immediately used types. We can only do this because we keep the "current" type in the nesting level as we decrement upreferences. This change speeds up the testcase in PR224 from 50.4s to 22.08s, not too shabby. llvm-svn: 11221	2004-02-09 03:19:29 +00:00
Chris Lattner	645f8fb4e5	Upreferences are always OpaqueTypes, meaning that it is impossible for a non-abstract type from containing one. This speeds up the asmparser on the testcase in PR224 from 61->50s. llvm-svn: 11220	2004-02-09 03:03:10 +00:00
Chris Lattner	95c21b04da	Another nice speedup for the register allocator. This time, we replace the Virt2PhysRegMap std::map with an std::vector. This speeds up the register allocator another (almost) 40%, from .72->.45s in a release build of LLC on 253.perlbmk. llvm-svn: 11219	2004-02-09 02:12:04 +00:00
Chris Lattner	8e4aa43710	Add a new (hidden) option that is useful for profiling. llvm-svn: 11218	2004-02-09 01:47:10 +00:00
Chris Lattner	9fb6056f1f	Ugh, perform an optimization that GCC should be able to do itself. This speeds up livevar from .48/.32s -> .45/.31s in LLC on perlbmk llvm-svn: 11217	2004-02-09 01:43:23 +00:00
Chris Lattner	b665065e6d	Only do stuff for the REAL number of physical registers we have, not 1024. This speeds up live variables a lot, from .60/.39s -> .47/.26s in LLC, for the first/second pass respectively. llvm-svn: 11216	2004-02-09 01:35:21 +00:00
Chris Lattner	c49e77cecd	Change the PhysRegsUsed map into a dense array. Seeing that this is a mapping from physical registers, and they are always dense, it makes sense to not have a ton of RBtree overhead. This change speeds up regalloclocal about ~30% on 253.perlbmk, from .35s -> .27s in the JIT (in LLC, it goes from .74 -> .55). Now live variable analysis is the slowest codegen pass. Of course it doesn't help that we have to run it twice, because regalloclocal doesn't update it, but even if it did it would be the slowest pass (now it's just the 2x slowest pass :( llvm-svn: 11215	2004-02-09 01:26:13 +00:00
Chris Lattner	b771939ef3	Two problems with these lines of code: 1. The "work" was not in the assert, so it was punishing the optimized release 2. getNamedFunction is _very_ expensive in large programs. It is not designed to be used like this, and was taking 7% of the execution time of the code generator on perlbmk. Since the assert "can never fail", I'm just killing it. llvm-svn: 11214	2004-02-09 00:59:07 +00:00
Chris Lattner	6ef23e7e64	The ConstantExpr::getCast call can cause a CPR to be generated. If so, strip it off. llvm-svn: 11213	2004-02-09 00:20:55 +00:00
Chris Lattner	9523c4de75	Fix PR215: [bcwriter] Problem compactifying ConstantPointerRefs Have I ever mentioned how much I _hate_ constantpointerrefs? llvm-svn: 11212	2004-02-09 00:15:41 +00:00
Misha Brukman	58ca173834	Fix grammar-o. llvm-svn: 11210	2004-02-08 22:27:33 +00:00
Chris Lattner	a9887d33e8	Improve compatibility with programs that already have a prototype for 'write', even if it is wierd in some way. llvm-svn: 11207	2004-02-08 22:14:44 +00:00
Chris Lattner	a9a34f9d82	vi failed me again. :) llvm-svn: 11206	2004-02-08 21:52:30 +00:00
Chris Lattner	2878b11cfc	Rename the invoke 'except' destination to the 'unwind' destination llvm-svn: 11205	2004-02-08 21:52:04 +00:00
Chris Lattner	2e51b50de1	Change the 'exception' destination to the 'unwind' destination. We will always allow 'except' instead of 'unwind' here though. llvm-svn: 11203	2004-02-08 21:48:25 +00:00
Chris Lattner	68fdb35576	rename the "exceptional" destination of an invoke instruction to the 'unwind' dest llvm-svn: 11202	2004-02-08 21:44:31 +00:00
Chris Lattner	70d893a160	Fix PR225: [pruneeh] -pruneeh pass removes invoke instructions it shouldn't llvm-svn: 11200	2004-02-08 21:15:59 +00:00
Chris Lattner	46c84561b2	splitBasicBlock "does the right thing" now, no reason to reposition it. llvm-svn: 11199	2004-02-08 20:49:07 +00:00
Chris Lattner	1c646349df	Implement proper invoke/unwind lowering. This fixed PR16 "[lowerinvoke] The -lowerinvoke pass does not insert calls to setjmp/longjmp" llvm-svn: 11195	2004-02-08 19:53:56 +00:00
Chris Lattner	ecaa4c58e2	Print out all globals as they are emitted, not just those emitted from emitGlobals llvm-svn: 11191	2004-02-08 19:33:23 +00:00
Chris Lattner	ffd16b0190	There is no reason to #define fd llvm-svn: 11190	2004-02-08 19:33:07 +00:00
Chris Lattner	f549f9473e	Add a call to 'write' right before the call to abort() in the unwind path. This causes the JIT, or LLC'd program to print out a nice message, explaining WHY the program aborted. llvm-svn: 11184	2004-02-08 07:30:29 +00:00
Chris Lattner	838c2dd5af	Add one that I missed llvm-svn: 11179	2004-02-08 01:53:10 +00:00
Chris Lattner	cc1376078c	Instead of callign removeTriviallyDeadNodes on the global graph every time removeDeadNodes is called, only call it at the end of the pass being run. This saves 1.3 seconds running DSA on 177.mesa (5.3->4.0s), which is pretty big. This is only possible because of the automatic garbage collection done on forwarding nodes. llvm-svn: 11178	2004-02-08 01:51:48 +00:00
Chris Lattner	e2ee216e1f	Remove another unneeded call. llvm-svn: 11177	2004-02-08 01:40:40 +00:00
Chris Lattner	85ba7bbf78	This call is no longer needed now that merging does not produce garbage llvm-svn: 11176	2004-02-08 01:38:34 +00:00
Chris Lattner	32a3eb0b88	Substantially improve the DSA code by removing 'forwarding' nodes from DSGraphs while they are forwarding. When the last reference to the forwarding node is dropped, the forwarding node is autodeleted. This should simplify removeTriviallyDead nodes, and is only (efficiently) possible because we are using an ilist of dsnodes now. llvm-svn: 11175	2004-02-08 01:27:18 +00:00
Chris Lattner	84ff796305	Bugfix for ilist conversion. The ilist wants to make an 'end' node which has G == 0 llvm-svn: 11174	2004-02-08 01:05:37 +00:00
Chris Lattner	29067016a4	Switch the Nodes list from being an std::vector<DSNode*> to an ilist<DSNode> llvm-svn: 11173	2004-02-08 00:53:26 +00:00
Chris Lattner	2c836fc933	Change to use node_iterators instead of direct access to Nodes llvm-svn: 11171	2004-02-08 00:23:16 +00:00
Chris Lattner	6af67c7eb4	getNodes() is gone, use node_begin/end instead Rename stats from dsnode -> dsa Add a new stat llvm-svn: 11167	2004-02-07 23:58:05 +00:00
Chris Lattner	ed36ca5f13	getNodes() is gone llvm-svn: 11166	2004-02-07 23:57:26 +00:00
Chris Lattner	ce838bfae6	There is no need to clone over nodes that are going to be dead anyway llvm-svn: 11157	2004-02-07 22:00:03 +00:00
Alkis Evlogimenos	59bb9d69c7	Increase code clarity. llvm-svn: 11151	2004-02-06 18:08:18 +00:00
Alkis Evlogimenos	2aa7703205	Eliminate uneeded lookups by passing a Virt2PhysMap::iterator instead of the virtual register to certain functions. llvm-svn: 11143	2004-02-06 03:15:40 +00:00
Chris Lattner	3f57a7faab	Fix another dominator update bug. These bugs keep getting exposed because GCSE keeps finding more code motion opportunities now that the dominators are correct! llvm-svn: 11142	2004-02-05 23:20:59 +00:00
Alkis Evlogimenos	f01a26ceaa	Change live interval representation. Machine instructions now have two slots each. As a concequence they get numbered as 0, 2, 4 and so on. The first slot is used for operand uses and the second for defs. Here's an example: 0: A = ... 2: B = ... 4: C = A + B ;; last use of A The live intervals should look like: A = [1, 5) B = [3, x) C = [5, y) llvm-svn: 11141	2004-02-05 22:55:25 +00:00
Chris Lattner	f2a8b9e75b	Fix bug updating dominators llvm-svn: 11140	2004-02-05 22:33:26 +00:00
Chris Lattner	3846a304eb	Add debug output llvm-svn: 11139	2004-02-05 22:33:19 +00:00
Chris Lattner	6875c14234	Fix PR223: Loopsimplify incorrectly updates dominator information The problem is that the dominator update code didn't "realize" that it's possible for the newly inserted basic block to dominate anything. Because it IS possible, stuff was getting updated wrong. llvm-svn: 11137	2004-02-05 21:12:24 +00:00
Alkis Evlogimenos	3dd0f57349	We don't need to scan the blocks that we are live-in on every access. Rather we only have to do it on the creation of the interval. llvm-svn: 11135	2004-02-05 20:45:40 +00:00
Chris Lattner	ffc5eee17a	In a "seeing the forest through the trees" kinda situation, I realized that a complete rewrite of load-vn will make it a bit faster. This changes speeds up the gcse pass (which uses load-vn) from 25.45s to 0.42s on the testcase in PR209. I've also verified that this gives the exact same results as the old one. llvm-svn: 11132	2004-02-05 17:20:00 +00:00
Chris Lattner	1a04f2a635	This is a big diff with no functionality change. We just reorder some code, which causes big reindentation. While I'm at it, I fix the fixme by removing some dead code. llvm-svn: 11131	2004-02-05 05:56:23 +00:00
Chris Lattner	7d5e3febb7	finegrainify namespacification llvm-svn: 11130	2004-02-05 05:51:40 +00:00
Tanya Lattner	0221566368	Added missing include. llvm-svn: 11129	2004-02-05 05:04:39 +00:00
Tanya Lattner	d7b137d9fb	Fixed Chris' typo. llvm-svn: 11128	2004-02-05 04:45:21 +00:00
Chris Lattner	b721589bc0	Implement optimizations for handling large basic blocks. llvm-svn: 11126	2004-02-05 00:36:43 +00:00
Alkis Evlogimenos	676e5b8997	Modify the two address instruction pass to remove the duplicate operand of the instruction and thus simplify the register allocation. llvm-svn: 11124	2004-02-04 22:17:40 +00:00
Chris Lattner	342b7276d6	Minor speedup, don't query ValueMap each time through the loop llvm-svn: 11123	2004-02-04 21:44:26 +00:00
Brian Gaeke	dcab84ecf1	Take away the default iostream argument of createMachineFunctionPrinterPass(), at Chris's request. llvm-svn: 11120	2004-02-04 21:41:01 +00:00
Chris Lattner	cbe1dd55f4	Two changes: 1. Don't scan to the end of alloca instructions in the caller function to insert inlined allocas, just insert at the top. This saves a lot of time inlining into functions with a lot of allocas. 2. Use splice to move the alloca instructions over, instead of remove/insert. This allows us to transfer a block at a time, and eliminates a bunch of silly symbol table manipulations. This speeds up the inliner on the testcase in PR209 from 1.73s -> 1.04s (67%) llvm-svn: 11118	2004-02-04 21:33:42 +00:00
Alkis Evlogimenos	a5458ae146	IMULri* instructions do not require their first two registers operands to be the same (IOW they are not two address instructions). llvm-svn: 11117	2004-02-04 17:21:04 +00:00
Chris Lattner	790d7321b4	Optimize the case where we are inlining a function that contains only one basic block, and that basic block ends with a return instruction. In this case, we can just splice the cloned "body" of the function directly into the source basic block, avoiding a lot of rearrangement and splitBasicBlock's linear scan over the split block. This speeds up the inliner on the testcase in PR209 from 2.3s to 1.7s, a 35% reduction. llvm-svn: 11116	2004-02-04 04:17:06 +00:00
Chris Lattner	68aef33986	Adjust to the new BasicBlock ctor, which requires a function parameter llvm-svn: 11114	2004-02-04 03:58:28 +00:00
Chris Lattner	223ffefd1f	Adjust to the new BB ctor llvm-svn: 11113	2004-02-04 03:57:50 +00:00
Chris Lattner	d655075e9d	Remove unneeded code now that splitBasicBlock does the "right thing" llvm-svn: 11111	2004-02-04 03:21:51 +00:00
Chris Lattner	840109cd9c	When splitting a basic block, insert the new half immediately after the first half. llvm-svn: 11110	2004-02-04 03:21:31 +00:00
Chris Lattner	8f0a362cd4	More refactoring. Move alloca instructions and handle invoke instructions before we delete the original call site, allowing slight simplifications of code, but nothing exciting. llvm-svn: 11109	2004-02-04 02:51:48 +00:00
Chris Lattner	ab01a0e982	Move the cloning of the function body much earlier in the inlinefunction process. The only optimization we did so far is to avoid creating a PHI node, then immediately destroying it in the common case where the callee has one return statement. Instead, we just don't create the return value. This has no noticable performance impact, but paves the way for future improvements. llvm-svn: 11108	2004-02-04 01:41:09 +00:00
Chris Lattner	aa44b4de3e	Give CloneBasicBlock an optional function argument to specify which function to add the cloned block to. This allows the block to be added to the function immediately, and all of the instructions to be immediately added to the function symbol table, which speeds up the inliner from 3.7 -> 3.38s on the PR209. llvm-svn: 11107	2004-02-04 01:19:43 +00:00
Chris Lattner	7f1e6f16c5	If changing a parent, don't add then remove the object from the leak detector llvm-svn: 11106	2004-02-04 01:06:38 +00:00
Chris Lattner	1404b61b62	In BasicBlock::splitBasicBlock, just use islist::splice to move the instructions, instead of a loop that is really inefficient with large basic blocks. This speeds up the inliner pass on the testcase in PR209 from 13.8s to 2.24s which still isn't exactly speedy, but is a lot better. :) llvm-svn: 11105	2004-02-03 23:11:21 +00:00
Chris Lattner	2998d508b0	Bunch up all locally used allocas by the block they are allocated in, and process them all as a group. This speeds up SRoA/mem2reg from 28.46s to 0.62s on the testcase from PR209. llvm-svn: 11100	2004-02-03 22:34:12 +00:00
Chris Lattner	eee884cc98	Handle extremely trivial cases extremely efficiently. This speeds up SRoA/mem2reg from 41.2s to 27.5s on the testcase in PR209. llvm-svn: 11099	2004-02-03 22:00:33 +00:00
Chris Lattner	92211eb77d	Generate ftst instructions for comparison against zero llvm-svn: 11098	2004-02-03 18:54:04 +00:00
Chris Lattner	7b6e8f1f70	Add the ftst instruction llvm-svn: 11095	2004-02-03 07:27:50 +00:00
Chris Lattner	8b623358ff	Add support for one argument OneArgFP instructions llvm-svn: 11094	2004-02-03 07:27:34 +00:00
Alkis Evlogimenos	5e78d4bd75	When an instruction like: A += B had both A and B virtual registers spilled, A was loaded from its stack location twice. This fixes the bug. llvm-svn: 11093	2004-02-03 01:13:07 +00:00
Alkis Evlogimenos	e128cb3295	Revert changes. Will implement this using a different set of primitives llvm-svn: 11091	2004-02-02 23:08:58 +00:00
Alkis Evlogimenos	a4fab12df8	Fix debugging output. llvm-svn: 11088	2004-02-02 22:00:32 +00:00
Alkis Evlogimenos	3f268a9996	Correctly update def/use information for modified machine operands. llvm-svn: 11087	2004-02-02 21:56:40 +00:00
Alkis Evlogimenos	dfef24298e	Should be more careful. The previously applied change made all counts outside of loops = 0. llvm-svn: 11085	2004-02-02 20:29:57 +00:00
Chris Lattner	0e438dc219	Floating point negates are -0.0 - X, not 0.0 - X llvm-svn: 11084	2004-02-02 20:21:29 +00:00
Chris Lattner	f2a481f7f8	Disable (x - (y - z)) => (x + (z - y)) optimization for floating point. llvm-svn: 11083	2004-02-02 20:09:56 +00:00
Chris Lattner	343ab20a2d	Update comment llvm-svn: 11082	2004-02-02 20:09:22 +00:00
Chris Lattner	06f08a26ac	No need to declare implicit uses/defs of ST0 llvm-svn: 11081	2004-02-02 19:57:45 +00:00
Brian Gaeke	1aa59f5107	Make deadarghaX0r warning louder. (I just love typing haX0r. haX0r haX0r haX0r.) llvm-svn: 11079	2004-02-02 19:32:27 +00:00
Chris Lattner	71c12c8e0d	Generate the fchs instruction to negate a floating point number llvm-svn: 11078	2004-02-02 19:31:38 +00:00
Chris Lattner	36367e8c90	Add support for OneArgFPRW instructions, fix a couple of typeos llvm-svn: 11077	2004-02-02 19:23:15 +00:00
Brian Gaeke	355bc299b7	Include PowerPC.h. Flesh out the stub versions of addPassesToEmitAssembly() and addPassesToJITCompile() slightly. llvm-svn: 11076	2004-02-02 19:06:36 +00:00
Brian Gaeke	24f70fa16d	Add comments describing how you would add prototypes for factory methods for PowerPC-specific passes here. llvm-svn: 11073	2004-02-02 19:05:28 +00:00
Chris Lattner	963082346c	Codegen -0.0 correctly. Do not use fldz! This is another -0.0 == +0.0 problem, arg. llvm-svn: 11070	2004-02-02 18:56:30 +00:00
Alkis Evlogimenos	1825f30a3a	Create an object for tracking physical register usage. This will look much better when I get rid of the reserved registers. llvm-svn: 11066	2004-02-02 07:30:36 +00:00
Chris Lattner	a61cda01c2	Second half of a fix for PR218 & test/Regression/Assembler/2004-02-01-NegativeZero.llx. Basically we store floating point values as their integral components, instead of relying on the semantics of floating point < to differentiate between values. This is likely to make the map search be faster anyway. llvm-svn: 11064	2004-02-01 22:49:04 +00:00
Alkis Evlogimenos	586dc51262	Change weight array into a vector and make it as big as the number of registers (not as the max number of registers). Change toSpill from a std::set into a std::vector<bool>. Use the reverse iterator adapter to do a reverse scan of allocatable registers. llvm-svn: 11061	2004-02-01 20:13:26 +00:00
Alkis Evlogimenos	b2ea4389ca	Use std::map::count() instead of std::map::find() != std::map::end() where appropriate. llvm-svn: 11060	2004-02-01 18:39:53 +00:00
Alkis Evlogimenos	eabb1b1bfa	Change string for joined intervals. llvm-svn: 11057	2004-02-01 16:13:05 +00:00
Alkis Evlogimenos	3c94357980	FpMOV is also a move instruction. llvm-svn: 11055	2004-02-01 08:22:16 +00:00
Alkis Evlogimenos	e9d1aa31fc	Change xor to ^. llvm-svn: 11054	2004-02-01 08:19:25 +00:00
Chris Lattner	824015593c	Disable tail duplication in any "hard" cases, where it might break SSA form. llvm-svn: 11052	2004-02-01 06:32:28 +00:00
Chris Lattner	2e497b5e42	Fix a bug in a recent checkin llvm-svn: 11050	2004-02-01 05:25:07 +00:00
Chris Lattner	3575c84933	Fix the count of the number of instructions removed llvm-svn: 11049	2004-02-01 05:15:07 +00:00
Alkis Evlogimenos	107cf3892c	Missed one silly assert :-) llvm-svn: 11048	2004-02-01 02:21:31 +00:00
Alkis Evlogimenos	a7459c76aa	Simplify joinIntervals() code. llvm-svn: 11047	2004-02-01 02:18:31 +00:00
Chris Lattner	3818f475de	Remove all of the annoying statistics now that I'm finished (for the near term) working on bytecode size stuff. llvm-svn: 11046	2004-02-01 01:50:31 +00:00
Alkis Evlogimenos	612a35f2a4	Use MRegisterInfo::isPhysicalRegister and MRegisterInfo::isVirtualRegister. llvm-svn: 11045	2004-02-01 01:27:01 +00:00
Chris Lattner	831561989c	Fix a crasher bug in my constant folding rewrite llvm-svn: 11044	2004-02-01 01:23:19 +00:00
Chris Lattner	becd58c6cb	Print an error message if we can't materialize the bytecode file llvm-svn: 11043	2004-02-01 01:07:25 +00:00
Chris Lattner	c76f82c957	Print an error message if there is an error materialize the bc file. llvm-svn: 11041	2004-02-01 00:32:35 +00:00
Alkis Evlogimenos	9559cebebf	Apply final part of Chris' patch. llvm-svn: 11040	2004-01-31 23:48:47 +00:00
Alkis Evlogimenos	656870703f	Be a little smarter on the way we handle physical register defs. llvm-svn: 11038	2004-01-31 23:13:30 +00:00
Chris Lattner	0ea56952d4	Finegrainify namespacification, use new MRegisterInfo::isVirtualRegister method llvm-svn: 11037	2004-01-31 21:27:19 +00:00
Chris Lattner	bff56dc518	Fix, correctly this time, the computation of the return value Fix a spello Tighten up the assertion checking No functionality changes. llvm-svn: 11036	2004-01-31 21:21:43 +00:00
Chris Lattner	05b7fef02f	* Fix incorrect computation of the runOnMachineFunction return value * Turn a bunch of instance variables into automatic variables llvm-svn: 11035	2004-01-31 21:14:04 +00:00
Chris Lattner	b0addcc1b0	Remove unneeded #includes Move Passes.h (which defines the interface to this file) to the top. Move statistics to the top of the file. Add a comment llvm-svn: 11034	2004-01-31 21:07:15 +00:00
Chris Lattner	89f03df701	Avoid referencing deleted DSgraphs when merging an SCC into a larger SCC. This fixes the crash in 176.gcc. llvm-svn: 11033	2004-01-31 21:02:18 +00:00
Alkis Evlogimenos	d1ec5f36ee	Merge safe parts from last night's buggy commit. These do not break any test cases :-) llvm-svn: 11032	2004-01-31 19:59:32 +00:00
Alkis Evlogimenos	ef63827238	Optimize liveAt() and overlaps(). We now use a binary search instead of a linear search to find the first range for comparisons. This cuts down the linear scan register allocator running time by a factor of 3 in 254.perlbmk and by a factor of 2.2 in 176.gcc. llvm-svn: 11030	2004-01-31 16:54:54 +00:00
Alkis Evlogimenos	b9df133d67	Revert last night's changes as they broke some tests. Will remerge parts of the patch. llvm-svn: 11029	2004-01-31 14:37:41 +00:00
Alkis Evlogimenos	c0db519832	Several performance enhancements and cleanups from Chris. Simplification of LiveIntervals::Interval::overlaps() and addition of examples to overlaps() and liveAt() to make them clearer. llvm-svn: 11028	2004-01-31 04:56:07 +00:00
Chris Lattner	a497e3d6e1	Fix thinko llvm-svn: 11027	2004-01-30 22:48:02 +00:00
Chris Lattner	f5b5d1f2f8	Add some comments sketching out how this is to work eventually. llvm-svn: 11026	2004-01-30 22:25:18 +00:00
Chris Lattner	543fbaa973	Add a new flag, which is only used for symmetry. llvm-svn: 11025	2004-01-30 22:24:18 +00:00
Chris Lattner	0127841d4e	Forward method request to chained aa implementation llvm-svn: 11024	2004-01-30 22:20:55 +00:00
Chris Lattner	8bcdaf9b5d	Implement the pointsToConstantMemory() method. llvm-svn: 11022	2004-01-30 22:17:24 +00:00
Chris Lattner	71d05cef5e	Improve mod/ref information based on the pointsToConstantMemory method. llvm-svn: 11021	2004-01-30 22:16:42 +00:00
Chris Lattner	6f4945b965	Add (currently disabled) support to the instruction selector to only insert FP_REG_KILL instructions at the end of blocks involved with critical edges. Fix a bug where FP_REG_KILL instructions weren't inserted in fall through unconditional branches. Perhaps this will fix some linscan problems? llvm-svn: 11019	2004-01-30 22:13:44 +00:00
Chris Lattner	32ec75c4ef	Finegrainify namespacification Implement LiveVariables::getIndexMachineBasicBlock llvm-svn: 11018	2004-01-30 22:08:53 +00:00
Brian Gaeke	9cf840250b	Give clients of MachineFunctionPrinter the ability to specify a banner and choose an ostream. llvm-svn: 11016	2004-01-30 21:53:46 +00:00
Misha Brukman	440d62207c	Order #includes alphabetically, per style guide. llvm-svn: 11015	2004-01-30 17:26:24 +00:00
Chris Lattner	dfa4f14e34	Fix a bug aflicting 265.gap llvm-svn: 11006	2004-01-29 08:36:22 +00:00
Chris Lattner	caecd769c4	Minor bugfixes llvm-svn: 11005	2004-01-29 03:32:15 +00:00
Misha Brukman	6d62b1ee69	Hyphenate `target-dependent' llvm-svn: 11003	2004-01-28 20:43:01 +00:00
Brian Gaeke	923ff046c0	Add a new (static inline) std::ostream& << AllocInfo& method. Use it. llvm-svn: 11002	2004-01-28 19:05:43 +00:00
Chris Lattner	7ab2349541	Rename DSGraph::ScalarMapTy -> DSScalarMap llvm-svn: 11001	2004-01-28 09:15:42 +00:00
Chris Lattner	9a79b1690c	Fix a bug llvm-svn: 11000	2004-01-28 03:31:34 +00:00
Chris Lattner	8324e48b84	Eliminate the call to removeTriviallyDeadNodes from updateFromGlobals graph, moving it to the start of removeDeadNodes. This speeds up DSA by 2s on perlbmk from 41s llvm-svn: 10999	2004-01-28 03:24:41 +00:00
Chris Lattner	92eb91ed70	In the TD pass, iterate over globals directly instead of through the whole scalar map. This saves 5s in the TD pass, from 22->17s on perlbmk llvm-svn: 10998	2004-01-28 03:12:48 +00:00
Chris Lattner	c25a7dad97	In the TD pass, don't iterate over the scalar map to find the globals, iterate over the globals directly. This doesn't save any substantial time, however, because the globals graph only contains globals! llvm-svn: 10997	2004-01-28 03:07:30 +00:00
Chris Lattner	394c9343b4	In updateFromGlobalsGraph, instead of iterating over all of the scalars in the function to find the globals, iterate over all of the globals directly. This speeds the function up from 14s to 6.3s on perlbmk, reducing DSA time from 53->46s. llvm-svn: 10996	2004-01-28 03:03:06 +00:00
Chris Lattner	528f5842dd	Minor tweaks, eliminate useless integer pruning optimziation, turn on timers by default llvm-svn: 10993	2004-01-28 02:41:32 +00:00
Chris Lattner	18e0615a69	Further reduce the number of nodes cloned with getClonedNH, using merge instead. This reduces the number of nodes allocated, then immediately merged and DNE'd from 2193852 to 1298049. unfortunately this only speeds DSA up by ~1.5s (of 53s), because it's spending most of its time waddling through the scalar map :( llvm-svn: 10992	2004-01-28 02:11:49 +00:00
Chris Lattner	6d38674f7b	Add a timer, fix a minor bug. Also, use RC::merge when possible, reducing the number of nodes allocated, then immediately merged away from 2985444 to 2193852 on perlbmk. llvm-svn: 10991	2004-01-28 02:05:05 +00:00
Chris Lattner	10881df6e2	Another bugfix, disable "spurious" output. You gotta love spurious llvm-svn: 10990	2004-01-28 01:19:52 +00:00
Chris Lattner	b0d7b564dd	fix bug in previous checkin llvm-svn: 10989	2004-01-27 22:54:56 +00:00
Chris Lattner	3bd20c5114	* Add a new commandline argument to control the "global roots hack". Default it to be off. If it looks like it's completely unnecessary after testing, I will remove it completely (which is the hope). * Callers of the DSNode "copy ctor" can not choose to not copy links. * Make node collapsing not create a garbage node in some cases, avoiding a memory allocation, and a subsequent DNE. * When merging types, allow two functions of different types to be merged without collapsing. * Use DSNodeHandle::isNull more often instead of DSNodeHandle::getNode() == 0, as it is much more efficient. *** Implement the new, more efficient reachability cloner class In addition to only cloning nodes that are reachable from interesting roots, this also fixes the huge inefficiency we had where we cloned lots of nodes, only to merge them away immediately after they were cloned. Now we only actually allocate a node if there isn't one to merge it into. * Eliminate the now-obsolete cloneReachable* and clonePartiallyInto methods * Rewrite updateFromGlobalsGraph to use the reachability cloner * Rewrite mergeInGraph to use the reachability cloner * Disable the scalar map scanning code in removeTriviallyDeadNodes. In large SCC's, this is extremely expensive. We need a better data structure for the scalar map, because we really want to scan the unique node handles, not ALL of the scalars. * Remove the incorrect SANER_CODE_FOR_CHECKING_IF_ALL_REFERRERS_ARE_FROM_SCALARMAP code. * Move the code for eliminating integer nodes from the trivially dead eliminator to the dead node eliminator. * removeDeadNodes no longer uses removeTriviallyDeadNodes, as it contains a superset of the node removal power. * Only futz around with the globals graph in removeDeadNodes if it is modified llvm-svn: 10987	2004-01-27 22:03:40 +00:00
Chris Lattner	727e11ac33	Rewrite to use the reachability cloner interface. Also, make this much more efficient in the case where a function calls into the same graph multiple times (ie, it either contains multiple calls to the same function, or multiple calls to functions in the same SCC graph) llvm-svn: 10986	2004-01-27 21:53:14 +00:00
Chris Lattner	25f95d2b7c	minor cleanups llvm-svn: 10985	2004-01-27 21:51:19 +00:00
Chris Lattner	9117f221b1	Get clone flags right, so we don't build InlinedGlobals only to clear them llvm-svn: 10984	2004-01-27 21:50:41 +00:00
John Criswell	3a994220ad	Fixes for PR214. Use the SHLIBEXT variable instead of hardcoding .so into every file. llvm-svn: 10976	2004-01-26 20:59:41 +00:00
Alkis Evlogimenos	2a8df1739b	Fix failing test cases with joined live intervals. It turns out that when joining we need to check if we overlap with the second interval or any of its aliases. Also make joining intervals the default. llvm-svn: 10973	2004-01-23 13:37:51 +00:00
Brian Gaeke	bc72431950	Add the JITInfo object, accessor & initializer. llvm-svn: 10972	2004-01-23 06:39:30 +00:00
Brian Gaeke	2757ec5b94	Add CodeEmitter and JITInfo stubs. Dump the old PowerPCTargetMachine::addPassesToJITCompile() method, in favor of the TargetJITInfo interface. llvm-svn: 10971	2004-01-23 06:35:43 +00:00
Chris Lattner	6d2f67adb3	Initial support for implementing clonePartiallyInto in terms of cloneReachableSubgraph, though this support is currently disabled. llvm-svn: 10970	2004-01-23 01:44:53 +00:00
Chris Lattner	317a6fec82	Fix a problem brian ran into with the bytecode reader asserting. It turns out that the problem was actually the writer writing out a 'null' value because it didn't normalize it. This fixes: test/Regression/Assembler/2004-01-22-FloatNormalization.ll llvm-svn: 10967	2004-01-23 00:55:21 +00:00
Alkis Evlogimenos	b4745bda05	Add option to join live intervals. Two intervals are joined if there is a move between two registers, at least one of the registers is virtual and the two live intervals do not overlap. This results in about 40% reduction in intervals, 30% decrease in the register allocators running time and a 20% increase in peephole optimizations (mainly move eliminations). The option can be enabled by passing -join-liveintervals where appropriate. llvm-svn: 10965	2004-01-22 23:08:45 +00:00

... 2 3 4 5 6 ...

5474 Commits