llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Chris Lattner	b665065e6d	Only do stuff for the REAL number of physical registers we have, not 1024. This speeds up live variables a lot, from .60/.39s -> .47/.26s in LLC, for the first/second pass respectively. llvm-svn: 11216	2004-02-09 01:35:21 +00:00
Chris Lattner	c49e77cecd	Change the PhysRegsUsed map into a dense array. Seeing that this is a mapping from physical registers, and they are always dense, it makes sense to not have a ton of RBtree overhead. This change speeds up regalloclocal about ~30% on 253.perlbmk, from .35s -> .27s in the JIT (in LLC, it goes from .74 -> .55). Now live variable analysis is the slowest codegen pass. Of course it doesn't help that we have to run it twice, because regalloclocal doesn't update it, but even if it did it would be the slowest pass (now it's just the 2x slowest pass :( llvm-svn: 11215	2004-02-09 01:26:13 +00:00
Chris Lattner	b771939ef3	Two problems with these lines of code: 1. The "work" was not in the assert, so it was punishing the optimized release 2. getNamedFunction is _very_ expensive in large programs. It is not designed to be used like this, and was taking 7% of the execution time of the code generator on perlbmk. Since the assert "can never fail", I'm just killing it. llvm-svn: 11214	2004-02-09 00:59:07 +00:00
Chris Lattner	6ef23e7e64	The ConstantExpr::getCast call can cause a CPR to be generated. If so, strip it off. llvm-svn: 11213	2004-02-09 00:20:55 +00:00
Chris Lattner	9523c4de75	Fix PR215: [bcwriter] Problem compactifying ConstantPointerRefs Have I ever mentioned how much I _hate_ constantpointerrefs? llvm-svn: 11212	2004-02-09 00:15:41 +00:00
Misha Brukman	58ca173834	Fix grammar-o. llvm-svn: 11210	2004-02-08 22:27:33 +00:00
Chris Lattner	a9887d33e8	Improve compatibility with programs that already have a prototype for 'write', even if it is weird in some way. llvm-svn: 11207	2004-02-08 22:14:44 +00:00
Chris Lattner	a9a34f9d82	vi failed me again. :) llvm-svn: 11206	2004-02-08 21:52:30 +00:00
Chris Lattner	2878b11cfc	Rename the invoke 'except' destination to the 'unwind' destination llvm-svn: 11205	2004-02-08 21:52:04 +00:00
Chris Lattner	2e51b50de1	Change the 'exception' destination to the 'unwind' destination. We will always allow 'except' instead of 'unwind' here though. llvm-svn: 11203	2004-02-08 21:48:25 +00:00
Chris Lattner	68fdb35576	rename the "exceptional" destination of an invoke instruction to the 'unwind' dest llvm-svn: 11202	2004-02-08 21:44:31 +00:00
Chris Lattner	70d893a160	Fix PR225: [pruneeh] -pruneeh pass removes invoke instructions it shouldn't llvm-svn: 11200	2004-02-08 21:15:59 +00:00
Chris Lattner	46c84561b2	splitBasicBlock "does the right thing" now, no reason to reposition it. llvm-svn: 11199	2004-02-08 20:49:07 +00:00
Chris Lattner	1c646349df	Implement proper invoke/unwind lowering. This fixed PR16 "[lowerinvoke] The -lowerinvoke pass does not insert calls to setjmp/longjmp" llvm-svn: 11195	2004-02-08 19:53:56 +00:00
Chris Lattner	ecaa4c58e2	Print out all globals as they are emitted, not just those emitted from emitGlobals llvm-svn: 11191	2004-02-08 19:33:23 +00:00
Chris Lattner	ffd16b0190	There is no reason to #define fd llvm-svn: 11190	2004-02-08 19:33:07 +00:00
Chris Lattner	f549f9473e	Add a call to 'write' right before the call to abort() in the unwind path. This causes the JIT, or LLC'd program to print out a nice message, explaining WHY the program aborted. llvm-svn: 11184	2004-02-08 07:30:29 +00:00
Chris Lattner	838c2dd5af	Add one that I missed llvm-svn: 11179	2004-02-08 01:53:10 +00:00
Chris Lattner	cc1376078c	Instead of calling removeTriviallyDeadNodes on the global graph every time removeDeadNodes is called, only call it at the end of the pass being run. This saves 1.3 seconds running DSA on 177.mesa (5.3->4.0s), which is pretty big. This is only possible because of the automatic garbage collection done on forwarding nodes. llvm-svn: 11178	2004-02-08 01:51:48 +00:00
Chris Lattner	e2ee216e1f	Remove another unneeded call. llvm-svn: 11177	2004-02-08 01:40:40 +00:00
Chris Lattner	85ba7bbf78	This call is no longer needed now that merging does not produce garbage llvm-svn: 11176	2004-02-08 01:38:34 +00:00
Chris Lattner	32a3eb0b88	Substantially improve the DSA code by removing 'forwarding' nodes from DSGraphs while they are forwarding. When the last reference to the forwarding node is dropped, the forwarding node is autodeleted. This should simplify removeTriviallyDead nodes, and is only (efficiently) possible because we are using an ilist of dsnodes now. llvm-svn: 11175	2004-02-08 01:27:18 +00:00
Chris Lattner	84ff796305	Bugfix for ilist conversion. The ilist wants to make an 'end' node which has G == 0 llvm-svn: 11174	2004-02-08 01:05:37 +00:00
Chris Lattner	29067016a4	Switch the Nodes list from being an std::vector<DSNode*> to an ilist<DSNode> llvm-svn: 11173	2004-02-08 00:53:26 +00:00
Chris Lattner	2c836fc933	Change to use node_iterators instead of direct access to Nodes llvm-svn: 11171	2004-02-08 00:23:16 +00:00
Chris Lattner	6af67c7eb4	getNodes() is gone, use node_begin/end instead Rename stats from dsnode -> dsa Add a new stat llvm-svn: 11167	2004-02-07 23:58:05 +00:00
Chris Lattner	ed36ca5f13	getNodes() is gone llvm-svn: 11166	2004-02-07 23:57:26 +00:00
Chris Lattner	ce838bfae6	There is no need to clone over nodes that are going to be dead anyway llvm-svn: 11157	2004-02-07 22:00:03 +00:00
Alkis Evlogimenos	59bb9d69c7	Increase code clarity. llvm-svn: 11151	2004-02-06 18:08:18 +00:00
Alkis Evlogimenos	2aa7703205	Eliminate unneeded lookups by passing a Virt2PhysMap::iterator instead of the virtual register to certain functions. llvm-svn: 11143	2004-02-06 03:15:40 +00:00
Chris Lattner	3f57a7faab	Fix another dominator update bug. These bugs keep getting exposed because GCSE keeps finding more code motion opportunities now that the dominators are correct! llvm-svn: 11142	2004-02-05 23:20:59 +00:00
Alkis Evlogimenos	f01a26ceaa	Change live interval representation. Machine instructions now have two slots each. As a consequence they get numbered as 0, 2, 4 and so on. The first slot is used for operand uses and the second for defs. Here's an example: 0: A = ... 2: B = ... 4: C = A + B ;; last use of A The live intervals should look like: A = [1, 5) B = [3, x) C = [5, y) llvm-svn: 11141	2004-02-05 22:55:25 +00:00
Chris Lattner	f2a8b9e75b	Fix bug updating dominators llvm-svn: 11140	2004-02-05 22:33:26 +00:00
Chris Lattner	3846a304eb	Add debug output llvm-svn: 11139	2004-02-05 22:33:19 +00:00
Chris Lattner	6875c14234	Fix PR223: Loopsimplify incorrectly updates dominator information The problem is that the dominator update code didn't "realize" that it's possible for the newly inserted basic block to dominate anything. Because it IS possible, stuff was getting updated wrong. llvm-svn: 11137	2004-02-05 21:12:24 +00:00
Alkis Evlogimenos	3dd0f57349	We don't need to scan the blocks that we are live-in on every access. Rather we only have to do it on the creation of the interval. llvm-svn: 11135	2004-02-05 20:45:40 +00:00
Chris Lattner	ffc5eee17a	In a "seeing the forest through the trees" kinda situation, I realized that a complete rewrite of load-vn will make it a bit faster. This change speeds up the gcse pass (which uses load-vn) from 25.45s to 0.42s on the testcase in PR209. I've also verified that this gives the exact same results as the old one. llvm-svn: 11132	2004-02-05 17:20:00 +00:00
Chris Lattner	1a04f2a635	This is a big diff with no functionality change. We just reorder some code, which causes big reindentation. While I'm at it, I fix the fixme by removing some dead code. llvm-svn: 11131	2004-02-05 05:56:23 +00:00
Chris Lattner	7d5e3febb7	finegrainify namespacification llvm-svn: 11130	2004-02-05 05:51:40 +00:00
Tanya Lattner	0221566368	Added missing include. llvm-svn: 11129	2004-02-05 05:04:39 +00:00
Tanya Lattner	d7b137d9fb	Fixed Chris' typo. llvm-svn: 11128	2004-02-05 04:45:21 +00:00
Chris Lattner	b721589bc0	Implement optimizations for handling large basic blocks. llvm-svn: 11126	2004-02-05 00:36:43 +00:00
Alkis Evlogimenos	676e5b8997	Modify the two address instruction pass to remove the duplicate operand of the instruction and thus simplify the register allocation. llvm-svn: 11124	2004-02-04 22:17:40 +00:00
Chris Lattner	342b7276d6	Minor speedup, don't query ValueMap each time through the loop llvm-svn: 11123	2004-02-04 21:44:26 +00:00
Brian Gaeke	dcab84ecf1	Take away the default iostream argument of createMachineFunctionPrinterPass(), at Chris's request. llvm-svn: 11120	2004-02-04 21:41:01 +00:00
Chris Lattner	cbe1dd55f4	Two changes: 1. Don't scan to the end of alloca instructions in the caller function to insert inlined allocas, just insert at the top. This saves a lot of time inlining into functions with a lot of allocas. 2. Use splice to move the alloca instructions over, instead of remove/insert. This allows us to transfer a block at a time, and eliminates a bunch of silly symbol table manipulations. This speeds up the inliner on the testcase in PR209 from 1.73s -> 1.04s (67%) llvm-svn: 11118	2004-02-04 21:33:42 +00:00
Alkis Evlogimenos	a5458ae146	IMULri* instructions do not require their first two register operands to be the same (IOW they are not two address instructions). llvm-svn: 11117	2004-02-04 17:21:04 +00:00
Chris Lattner	790d7321b4	Optimize the case where we are inlining a function that contains only one basic block, and that basic block ends with a return instruction. In this case, we can just splice the cloned "body" of the function directly into the source basic block, avoiding a lot of rearrangement and splitBasicBlock's linear scan over the split block. This speeds up the inliner on the testcase in PR209 from 2.3s to 1.7s, a 35% reduction. llvm-svn: 11116	2004-02-04 04:17:06 +00:00
Chris Lattner	68aef33986	Adjust to the new BasicBlock ctor, which requires a function parameter llvm-svn: 11114	2004-02-04 03:58:28 +00:00
Chris Lattner	223ffefd1f	Adjust to the new BB ctor llvm-svn: 11113	2004-02-04 03:57:50 +00:00
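
The slot numbering described in the live-interval commit above (llvm-svn: 11141) can be illustrated with a small sketch. This is not the LLVM implementation: the function names, the `(defined_name, used_names)` tuple format, and the choice of Python are all assumptions made for illustration. It only mirrors what the commit message states: instruction `i` gets base number `2*i`, uses occupy the first slot, defs occupy the second, and intervals are half-open.

```python
def slot_numbers(index):
    """Return (use_slot, def_slot) for the instruction at position `index`.

    Assumption taken from the commit message: each instruction owns two
    slots, so instructions are numbered 0, 2, 4, ...; the first slot is
    for operand uses and the second for defs.
    """
    base = 2 * index
    return base, base + 1


def live_intervals(instructions):
    """Compute half-open [start, end) intervals for a straight-line program.

    `instructions` is a list of (defined_name, [used_names]) tuples, a
    hypothetical format chosen for this sketch. A value with no use in
    the program gets an end of None (its end is unknown here).
    """
    start = {}
    end = {}
    for i, (dst, uses) in enumerate(instructions):
        use_slot, def_slot = slot_numbers(i)
        for name in uses:
            # Half-open interval: ends one past the slot of the last use.
            end[name] = use_slot + 1
        if dst not in start:
            # The interval begins at the def slot of the defining instruction.
            start[dst] = def_slot
    return {name: (start[name], end.get(name)) for name in start}


# The example from the commit message:
#   0: A = ...
#   2: B = ...
#   4: C = A + B   ;; last use of A
program = [("A", []), ("B", []), ("C", ["A", "B"])]
intervals = live_intervals(program)
print(intervals)
```

Under these assumptions, `A` comes out as `[1, 5)`, `B` starts at 3, and `C` starts at 5, matching the commit message. Because the sketch treats the three instructions as the whole program, `B` ends at 5 here and `C` has no known end, whereas the message leaves both ends open as `x` and `y`.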

5265 Commits