llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Alkis Evlogimenos	4b4342e865	Remove assert as the only integer registers on the sparc are physical. llvm-svn: 11317	2004-02-11 06:04:51 +00:00
Alkis Evlogimenos	30c37082ae	Fix previous broken commit. A MachineOperand may have opType == MO_VirtualRegister but if the register number is one of a physical register is it considered as a physical register. llvm-svn: 11315	2004-02-11 05:55:00 +00:00
Chris Lattner	427c8ee657	Factor this code out of llvm-prof llvm-svn: 11314	2004-02-11 05:54:25 +00:00
Chris Lattner	5730220037	Remove obsolete comment. Unreachable blocks will automatically be left at the end of the function. llvm-svn: 11313	2004-02-11 05:20:50 +00:00
Chris Lattner	ca52c22356	Add an _embarassingly simple_ implementation of basic block layout. This is more of a testcase for profiling information than anything that should reasonably be used, but it's a starting point. When I have more time I will whip this into better shape. llvm-svn: 11311	2004-02-11 04:53:20 +00:00
Alkis Evlogimenos	d2edafbc32	Remove assert as it is meaningless. MachineOperands can be tagged as MO_VirtualRegister but actually be representing a physical register. llvm-svn: 11310	2004-02-11 04:52:30 +00:00
Chris Lattner	c9900e2eb3	Make sure to register the 'no profile' implementation as the default for ProfileInfo llvm-svn: 11309	2004-02-11 04:47:54 +00:00
Chris Lattner	f8077b7801	Simplify implementation, and probably speed things up too. llvm-svn: 11308	2004-02-11 03:57:16 +00:00
Chris Lattner	4418633216	Implement SimplifyCFG/PhiEliminate.ll Having a proper 'select' instruction would allow the elimination of a lot of the special case cruft in this patch, but we don't have one yet. llvm-svn: 11307	2004-02-11 03:36:04 +00:00
Chris Lattner	ceb4bfebcc	Initialize the count instance variable. llvm-svn: 11305	2004-02-11 03:29:16 +00:00
Chris Lattner	ea85b486f1	Expose the "Other" value type to tablegen targets llvm-svn: 11304	2004-02-11 03:08:45 +00:00
Chris Lattner	43553da3a0	Remove obsolete method llvm-svn: 11302	2004-02-11 01:17:33 +00:00
Chris Lattner	754914b142	The hasConstantReferences predicate always returns false. llvm-svn: 11301	2004-02-11 01:17:07 +00:00
Chris Lattner	a81258a786	An initial implementation of an LLVM ProfileInfo class which is designed to eventually allow Passes to use profiling information to direct them. llvm-svn: 11294	2004-02-10 22:11:42 +00:00
Chris Lattner	db2c8941a5	Add #include llvm-svn: 11285	2004-02-10 21:18:55 +00:00
Chris Lattner	37e040ee03	Do not use MachineOperand::isVirtualRegister either! llvm-svn: 11283	2004-02-10 21:12:22 +00:00
Chris Lattner	d64519766e	Stop using this method llvm-svn: 11282	2004-02-10 21:12:06 +00:00
Chris Lattner	c9f1da7374	Remove uses of MachineOperand::isVirtualRegister llvm-svn: 11281	2004-02-10 20:55:47 +00:00
Chris Lattner	0d6d4075e3	Remvoe use of MO.isVirtualRegister(), turn an assertion into an assert() llvm-svn: 11280	2004-02-10 20:47:24 +00:00
Chris Lattner	9d0a4e63b1	Eliminate users of MachineOperand::isPhysicalRegister llvm-svn: 11278	2004-02-10 20:41:10 +00:00
Chris Lattner	6fb75586ba	Remove use of isPhysicalRegister llvm-svn: 11277	2004-02-10 20:35:42 +00:00
Chris Lattner	d58b38eeca	Don't use MachineOperator::is(Phys\|Virt)Register llvm-svn: 11276	2004-02-10 20:31:28 +00:00
Chris Lattner	421302bc54	Tighten up checks llvm-svn: 11274	2004-02-10 20:25:13 +00:00
Chris Lattner	d2c679e5f6	initialization calls now return argc. If the program uses the argc value passed into main, make sure they use the return value of the init call instead of the one passed in. llvm-svn: 11262	2004-02-10 17:41:01 +00:00
Chris Lattner	a62e6a4952	Fix PR228: [sparc] Boolean constants are emitted as true and false I will observe that the concept of using WriteAsOperand is completely broken, but then we all knew that, didn't we? llvm-svn: 11255	2004-02-10 05:16:44 +00:00
Misha Brukman	ac167832e7	Doxygenify comments. llvm-svn: 11252	2004-02-09 23:18:42 +00:00
Chris Lattner	e961c1189f	Only add the global variable with the abort message if an unwind actually occurs in the program. llvm-svn: 11249	2004-02-09 22:48:47 +00:00
John Criswell	c1fa74809a	Fix PR#226: When emitting padding, always emit it as bytes. Bytes can be placed into any alignment situation. llvm-svn: 11247	2004-02-09 22:15:33 +00:00
Chris Lattner	034a264a99	It turns out that the two dimensional vectors were causing big slowdowns in this for programs with lots of types (like the testcase in PR224). The problem was that the type ID that the outer vector was using was not very dense (as many types are getting resolved), so the vector is large and gets reallocated a lot. Since there are a lot of values in the program (the .ll file is 10M), each reallocation has to copy the subvectors, which is also quite slow (this wouldn't be a problem if C++ supported move semantics, but it doesn't, at least not yet :( Changing the outer data structure to a map speeds a release build of llvm-as up from 11.21s to 5.13s on the testcase in PR224. llvm-svn: 11244	2004-02-09 21:03:38 +00:00
Chris Lattner	ef4a05f780	Remove the statistics llvm-svn: 11243	2004-02-09 21:01:23 +00:00
Chris Lattner	6deb7ad03b	Speed up type resolution some more. On the testcase in PR224, for example, this speeds up a release llvm-as from 21.95s to 11.21s, because before it would do an expensive traversal of the type-graph of every type resolved. llvm-svn: 11242	2004-02-09 20:23:44 +00:00
Chris Lattner	7eb118e4f7	When resolving upreferences, if multiple uprefs will be resolved to the same type at the same time, resolve the upreferences to each other before resolving it to the outer type. This shaves off some time from the testcase in PR224, from 25.41s -> 21.72s. llvm-svn: 11241	2004-02-09 18:53:54 +00:00
Brian Gaeke	c86ce8b134	Move InstrSchedule's iterator begin/end methods inline. llvm-svn: 11239	2004-02-09 18:42:46 +00:00
Brian Gaeke	60268b265f	Make SchedGraph::dump() use SchedGraphNodeCommon's const_iterator instead of randomly groping about inside its outEdges array. Make SchedGraph::addDummyEdges() use getNumOutEdges() instead of outEdges.size(). Get rid of ifdefed-out code in SchedGraph::buildGraph(). llvm-svn: 11238	2004-02-09 18:42:05 +00:00
Chris Lattner	f7f7d11eac	Implement the hashing scheme in an attempt to speed up the "slow" case in type resolution. Unfortunately it doesn't help. Also delete some dead debugging code. llvm-svn: 11237	2004-02-09 18:32:40 +00:00
Chris Lattner	ada23acba4	This debugging hook is no longer needed. llvm-svn: 11233	2004-02-09 17:20:52 +00:00
Chris Lattner	57a6b877ba	Code cleanup in preparation for later changes. Now that ContainedTy's are consistent across the various type classes, we can factor out a LOT more almost-identical code. Also, add a couple of temporary statistics. llvm-svn: 11232	2004-02-09 16:35:14 +00:00
Chris Lattner	f231cb60f4	Now that all of the derived types have disciplined interfaces, we can eliminate all of the ad-hoc storage of contained types. This allows getContainedType to not be virtual, and allows us to entirely delete the TypeIterator class. llvm-svn: 11230	2004-02-09 05:40:24 +00:00
Chris Lattner	255269e677	Don't depend on auto data conversion llvm-svn: 11229	2004-02-09 05:16:30 +00:00
Chris Lattner	16690fad3d	Adjust to the changed StructType interface. In particular, getElementTypes() is gone. llvm-svn: 11228	2004-02-09 04:37:31 +00:00
Chris Lattner	a1757d1d91	Start using the new and improve interface to FunctionType arguments llvm-svn: 11224	2004-02-09 04:14:01 +00:00
Chris Lattner	2b3eaf045b	This #include is not needed, it should have been removed with the last patch llvm-svn: 11222	2004-02-09 03:22:32 +00:00
Chris Lattner	76292e7fcc	Instead of searching the entire type graph for a type to determine if it contains the type we are looking for, just search the immediately used types. We can only do this because we keep the "current" type in the nesting level as we decrement upreferences. This change speeds up the testcase in PR224 from 50.4s to 22.08s, not too shabby. llvm-svn: 11221	2004-02-09 03:19:29 +00:00
Chris Lattner	645f8fb4e5	Upreferences are always OpaqueTypes, meaning that it is impossible for a non-abstract type from containing one. This speeds up the asmparser on the testcase in PR224 from 61->50s. llvm-svn: 11220	2004-02-09 03:03:10 +00:00
Chris Lattner	95c21b04da	Another nice speedup for the register allocator. This time, we replace the Virt2PhysRegMap std::map with an std::vector. This speeds up the register allocator another (almost) 40%, from .72->.45s in a release build of LLC on 253.perlbmk. llvm-svn: 11219	2004-02-09 02:12:04 +00:00
Chris Lattner	8e4aa43710	Add a new (hidden) option that is useful for profiling. llvm-svn: 11218	2004-02-09 01:47:10 +00:00
Chris Lattner	9fb6056f1f	Ugh, perform an optimization that GCC should be able to do itself. This speeds up livevar from .48/.32s -> .45/.31s in LLC on perlbmk llvm-svn: 11217	2004-02-09 01:43:23 +00:00
Chris Lattner	b665065e6d	Only do stuff for the REAL number of physical registers we have, not 1024. This speeds up live variables a lot, from .60/.39s -> .47/.26s in LLC, for the first/second pass respectively. llvm-svn: 11216	2004-02-09 01:35:21 +00:00
Chris Lattner	c49e77cecd	Change the PhysRegsUsed map into a dense array. Seeing that this is a mapping from physical registers, and they are always dense, it makes sense to not have a ton of RBtree overhead. This change speeds up regalloclocal about ~30% on 253.perlbmk, from .35s -> .27s in the JIT (in LLC, it goes from .74 -> .55). Now live variable analysis is the slowest codegen pass. Of course it doesn't help that we have to run it twice, because regalloclocal doesn't update it, but even if it did it would be the slowest pass (now it's just the 2x slowest pass :( llvm-svn: 11215	2004-02-09 01:26:13 +00:00
Chris Lattner	b771939ef3	Two problems with these lines of code: 1. The "work" was not in the assert, so it was punishing the optimized release 2. getNamedFunction is _very_ expensive in large programs. It is not designed to be used like this, and was taking 7% of the execution time of the code generator on perlbmk. Since the assert "can never fail", I'm just killing it. llvm-svn: 11214	2004-02-09 00:59:07 +00:00
Chris Lattner	6ef23e7e64	The ConstantExpr::getCast call can cause a CPR to be generated. If so, strip it off. llvm-svn: 11213	2004-02-09 00:20:55 +00:00
Chris Lattner	9523c4de75	Fix PR215: [bcwriter] Problem compactifying ConstantPointerRefs Have I ever mentioned how much I _hate_ constantpointerrefs? llvm-svn: 11212	2004-02-09 00:15:41 +00:00
Misha Brukman	58ca173834	Fix grammar-o. llvm-svn: 11210	2004-02-08 22:27:33 +00:00
Chris Lattner	a9887d33e8	Improve compatibility with programs that already have a prototype for 'write', even if it is wierd in some way. llvm-svn: 11207	2004-02-08 22:14:44 +00:00
Chris Lattner	a9a34f9d82	vi failed me again. :) llvm-svn: 11206	2004-02-08 21:52:30 +00:00
Chris Lattner	2878b11cfc	Rename the invoke 'except' destination to the 'unwind' destination llvm-svn: 11205	2004-02-08 21:52:04 +00:00
Chris Lattner	2e51b50de1	Change the 'exception' destination to the 'unwind' destination. We will always allow 'except' instead of 'unwind' here though. llvm-svn: 11203	2004-02-08 21:48:25 +00:00
Chris Lattner	68fdb35576	rename the "exceptional" destination of an invoke instruction to the 'unwind' dest llvm-svn: 11202	2004-02-08 21:44:31 +00:00
Chris Lattner	70d893a160	Fix PR225: [pruneeh] -pruneeh pass removes invoke instructions it shouldn't llvm-svn: 11200	2004-02-08 21:15:59 +00:00
Chris Lattner	46c84561b2	splitBasicBlock "does the right thing" now, no reason to reposition it. llvm-svn: 11199	2004-02-08 20:49:07 +00:00
Chris Lattner	1c646349df	Implement proper invoke/unwind lowering. This fixed PR16 "[lowerinvoke] The -lowerinvoke pass does not insert calls to setjmp/longjmp" llvm-svn: 11195	2004-02-08 19:53:56 +00:00
Chris Lattner	ecaa4c58e2	Print out all globals as they are emitted, not just those emitted from emitGlobals llvm-svn: 11191	2004-02-08 19:33:23 +00:00
Chris Lattner	ffd16b0190	There is no reason to #define fd llvm-svn: 11190	2004-02-08 19:33:07 +00:00
Chris Lattner	f549f9473e	Add a call to 'write' right before the call to abort() in the unwind path. This causes the JIT, or LLC'd program to print out a nice message, explaining WHY the program aborted. llvm-svn: 11184	2004-02-08 07:30:29 +00:00
Chris Lattner	838c2dd5af	Add one that I missed llvm-svn: 11179	2004-02-08 01:53:10 +00:00
Chris Lattner	cc1376078c	Instead of callign removeTriviallyDeadNodes on the global graph every time removeDeadNodes is called, only call it at the end of the pass being run. This saves 1.3 seconds running DSA on 177.mesa (5.3->4.0s), which is pretty big. This is only possible because of the automatic garbage collection done on forwarding nodes. llvm-svn: 11178	2004-02-08 01:51:48 +00:00
Chris Lattner	e2ee216e1f	Remove another unneeded call. llvm-svn: 11177	2004-02-08 01:40:40 +00:00
Chris Lattner	85ba7bbf78	This call is no longer needed now that merging does not produce garbage llvm-svn: 11176	2004-02-08 01:38:34 +00:00
Chris Lattner	32a3eb0b88	Substantially improve the DSA code by removing 'forwarding' nodes from DSGraphs while they are forwarding. When the last reference to the forwarding node is dropped, the forwarding node is autodeleted. This should simplify removeTriviallyDead nodes, and is only (efficiently) possible because we are using an ilist of dsnodes now. llvm-svn: 11175	2004-02-08 01:27:18 +00:00
Chris Lattner	84ff796305	Bugfix for ilist conversion. The ilist wants to make an 'end' node which has G == 0 llvm-svn: 11174	2004-02-08 01:05:37 +00:00
Chris Lattner	29067016a4	Switch the Nodes list from being an std::vector<DSNode*> to an ilist<DSNode> llvm-svn: 11173	2004-02-08 00:53:26 +00:00
Chris Lattner	2c836fc933	Change to use node_iterators instead of direct access to Nodes llvm-svn: 11171	2004-02-08 00:23:16 +00:00
Chris Lattner	6af67c7eb4	getNodes() is gone, use node_begin/end instead Rename stats from dsnode -> dsa Add a new stat llvm-svn: 11167	2004-02-07 23:58:05 +00:00
Chris Lattner	ed36ca5f13	getNodes() is gone llvm-svn: 11166	2004-02-07 23:57:26 +00:00
Chris Lattner	ce838bfae6	There is no need to clone over nodes that are going to be dead anyway llvm-svn: 11157	2004-02-07 22:00:03 +00:00
Alkis Evlogimenos	59bb9d69c7	Increase code clarity. llvm-svn: 11151	2004-02-06 18:08:18 +00:00
Alkis Evlogimenos	2aa7703205	Eliminate uneeded lookups by passing a Virt2PhysMap::iterator instead of the virtual register to certain functions. llvm-svn: 11143	2004-02-06 03:15:40 +00:00
Chris Lattner	3f57a7faab	Fix another dominator update bug. These bugs keep getting exposed because GCSE keeps finding more code motion opportunities now that the dominators are correct! llvm-svn: 11142	2004-02-05 23:20:59 +00:00
Alkis Evlogimenos	f01a26ceaa	Change live interval representation. Machine instructions now have two slots each. As a concequence they get numbered as 0, 2, 4 and so on. The first slot is used for operand uses and the second for defs. Here's an example: 0: A = ... 2: B = ... 4: C = A + B ;; last use of A The live intervals should look like: A = [1, 5) B = [3, x) C = [5, y) llvm-svn: 11141	2004-02-05 22:55:25 +00:00
Chris Lattner	f2a8b9e75b	Fix bug updating dominators llvm-svn: 11140	2004-02-05 22:33:26 +00:00
Chris Lattner	3846a304eb	Add debug output llvm-svn: 11139	2004-02-05 22:33:19 +00:00
Chris Lattner	6875c14234	Fix PR223: Loopsimplify incorrectly updates dominator information The problem is that the dominator update code didn't "realize" that it's possible for the newly inserted basic block to dominate anything. Because it IS possible, stuff was getting updated wrong. llvm-svn: 11137	2004-02-05 21:12:24 +00:00
Alkis Evlogimenos	3dd0f57349	We don't need to scan the blocks that we are live-in on every access. Rather we only have to do it on the creation of the interval. llvm-svn: 11135	2004-02-05 20:45:40 +00:00
Chris Lattner	ffc5eee17a	In a "seeing the forest through the trees" kinda situation, I realized that a complete rewrite of load-vn will make it a bit faster. This changes speeds up the gcse pass (which uses load-vn) from 25.45s to 0.42s on the testcase in PR209. I've also verified that this gives the exact same results as the old one. llvm-svn: 11132	2004-02-05 17:20:00 +00:00
Chris Lattner	1a04f2a635	This is a big diff with no functionality change. We just reorder some code, which causes big reindentation. While I'm at it, I fix the fixme by removing some dead code. llvm-svn: 11131	2004-02-05 05:56:23 +00:00
Chris Lattner	7d5e3febb7	finegrainify namespacification llvm-svn: 11130	2004-02-05 05:51:40 +00:00
Tanya Lattner	0221566368	Added missing include. llvm-svn: 11129	2004-02-05 05:04:39 +00:00
Tanya Lattner	d7b137d9fb	Fixed Chris' typo. llvm-svn: 11128	2004-02-05 04:45:21 +00:00
Chris Lattner	b721589bc0	Implement optimizations for handling large basic blocks. llvm-svn: 11126	2004-02-05 00:36:43 +00:00
Alkis Evlogimenos	676e5b8997	Modify the two address instruction pass to remove the duplicate operand of the instruction and thus simplify the register allocation. llvm-svn: 11124	2004-02-04 22:17:40 +00:00
Chris Lattner	342b7276d6	Minor speedup, don't query ValueMap each time through the loop llvm-svn: 11123	2004-02-04 21:44:26 +00:00
Brian Gaeke	dcab84ecf1	Take away the default iostream argument of createMachineFunctionPrinterPass(), at Chris's request. llvm-svn: 11120	2004-02-04 21:41:01 +00:00
Chris Lattner	cbe1dd55f4	Two changes: 1. Don't scan to the end of alloca instructions in the caller function to insert inlined allocas, just insert at the top. This saves a lot of time inlining into functions with a lot of allocas. 2. Use splice to move the alloca instructions over, instead of remove/insert. This allows us to transfer a block at a time, and eliminates a bunch of silly symbol table manipulations. This speeds up the inliner on the testcase in PR209 from 1.73s -> 1.04s (67%) llvm-svn: 11118	2004-02-04 21:33:42 +00:00
Alkis Evlogimenos	a5458ae146	IMULri* instructions do not require their first two registers operands to be the same (IOW they are not two address instructions). llvm-svn: 11117	2004-02-04 17:21:04 +00:00
Chris Lattner	790d7321b4	Optimize the case where we are inlining a function that contains only one basic block, and that basic block ends with a return instruction. In this case, we can just splice the cloned "body" of the function directly into the source basic block, avoiding a lot of rearrangement and splitBasicBlock's linear scan over the split block. This speeds up the inliner on the testcase in PR209 from 2.3s to 1.7s, a 35% reduction. llvm-svn: 11116	2004-02-04 04:17:06 +00:00
Chris Lattner	68aef33986	Adjust to the new BasicBlock ctor, which requires a function parameter llvm-svn: 11114	2004-02-04 03:58:28 +00:00
Chris Lattner	223ffefd1f	Adjust to the new BB ctor llvm-svn: 11113	2004-02-04 03:57:50 +00:00
Chris Lattner	d655075e9d	Remove unneeded code now that splitBasicBlock does the "right thing" llvm-svn: 11111	2004-02-04 03:21:51 +00:00
Chris Lattner	840109cd9c	When splitting a basic block, insert the new half immediately after the first half. llvm-svn: 11110	2004-02-04 03:21:31 +00:00
Chris Lattner	8f0a362cd4	More refactoring. Move alloca instructions and handle invoke instructions before we delete the original call site, allowing slight simplifications of code, but nothing exciting. llvm-svn: 11109	2004-02-04 02:51:48 +00:00

1 2 3 4 5 ...

5362 Commits