llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-30 07:22:55 +01:00

Author	SHA1	Message	Date
Nate Begeman	9c6b86dcb4	fold sext_in_reg, sext_in_reg where both have the same VT. This was popping up in Fourinarow. llvm-svn: 23722	2005-10-14 01:29:07 +00:00
Nate Begeman	c206a2a32a	Relax the checking on zextload generation a bit, since as sabre pointed out you could be AND'ing with the result of a shift that shifts out all the bits you care about, in addition to a constant. Also, move over an add/sub_parts fold from legalize to the dag combiner, where it works for things other than constants. Woot! llvm-svn: 23720	2005-10-14 01:12:21 +00:00
Chris Lattner	b2db3f50f2	Fix the trunc(load) case, finally allowing crafty and povray to pass llvm-svn: 23718	2005-10-13 22:10:05 +00:00
Chris Lattner	2563394c27	Fix some bugs in (sext (load x)) llvm-svn: 23717	2005-10-13 21:52:31 +00:00
Chris Lattner	3fd6d9358e	When ExpandOp'ing a [SZ]EXTLOAD, make sure to remember that the chain is also legal. Add support for ExpandOp'ing raw EXTLOADs too. llvm-svn: 23716	2005-10-13 21:44:47 +00:00
Chris Lattner	dc6e47231b	Implement PromoteOp for *EXTLOAD, allowing MallocBench/gs to Legalize llvm-svn: 23715	2005-10-13 20:07:41 +00:00
Nate Begeman	d155f7f1c2	Fix the remaining DAGCombiner issues pointed out by sabre. This should fix the remainder of the failures introduced by my patch last night. llvm-svn: 23714	2005-10-13 18:34:58 +00:00
Chris Lattner	75447d2f8e	Fix a minor bug in the dag combiner that broke pcompress2 and some other tests. llvm-svn: 23713	2005-10-13 18:16:34 +00:00
Nate Begeman	9a08d6fb43	Add support to Legalize for expanding i64 sextload/zextload into hi and lo parts. This should fix the crafty and signed long long unit test failure on x86 last night. llvm-svn: 23711	2005-10-13 17:15:37 +00:00
Jim Laskey	2d23e75ac5	Inhibit instructions from being pushed before function calls. This will minimize unnecessary spilling. llvm-svn: 23710	2005-10-13 16:44:00 +00:00
Nate Begeman	c7e7c94db5	Move some Legalize functionality over to the DAGCombiner where it belongs. Kill some dead code. llvm-svn: 23706	2005-10-13 03:11:28 +00:00
Nate Begeman	b1d64c386b	Fix a potential bug with two combine-to's back to back that chris pointed out, where after the first CombineTo() call, the node the second CombineTo wishes to replace may no longer exist. Fix a very real bug with the truncated load optimization on little endian targets, which do not need a byte offset added to the load. llvm-svn: 23704	2005-10-12 23:18:53 +00:00
Nate Begeman	d97bb9d084	More cool stuff for the dag combiner. We can now finally handle things like turning: _foo: fctiwz f0, f1 stfd f0, -8(r1) lwz r2, -4(r1) rlwinm r3, r2, 0, 16, 31 blr into _foo: fctiwz f0,f1 stfd f0,-8(r1) lhz r3,-2(r1) blr Also removed an unncessary constraint from sra -> srl conversion, which should take care of hte only reason we would ever need to handle sra in MaskedValueIsZero, AFAIK. llvm-svn: 23703	2005-10-12 20:40:40 +00:00
Jim Laskey	2fe279f783	Finally committing to the new scheduler. Still -sched=none by default. llvm-svn: 23702	2005-10-12 18:29:35 +00:00
Jim Laskey	84b96c93e4	Added graphviz/gv support for MF. llvm-svn: 23700	2005-10-12 12:09:05 +00:00
Chris Lattner	21c3cf756f	Fix a powerpc crash on CodeGen/Generic/llvm-ct-intrinsics.ll llvm-svn: 23694	2005-10-11 17:56:34 +00:00
Chris Lattner	e6fcb88ad1	Add a canonicalization that got lost, fixing PowerPC/fold-li.ll:SUB llvm-svn: 23693	2005-10-11 06:07:15 +00:00
Chris Lattner	1441e42e56	clean up some corner cases llvm-svn: 23692	2005-10-10 23:00:08 +00:00
Chris Lattner	5308a566ba	Implement trivial DSE. If two stores are neighbors and store to the same location, replace them with a new store of the last value. This occurs in the same neighborhood in 197.parser, speeding it up about 1.5% llvm-svn: 23691	2005-10-10 22:31:19 +00:00
Chris Lattner	1c60abe065	Add support for CombineTo, allowing the dag combiner to replace nodes with multiple results. Use this support to implement trivial store->load forwarding, implementing CodeGen/PowerPC/store-load-fwd.ll. Though this is the most simple case and can be extended in the future, it is still useful. For example, it speeds up 197.parser by 6.2% by avoiding an LSU reject in xalloc: stw r6, lo16(l5_end_of_array)(r2) addi r2, r5, -4 stwx r5, r4, r2 - lwzx r5, r4, r2 - rlwinm r5, r5, 0, 0, 30 stwx r5, r4, r2 lwz r2, -4(r4) ori r2, r2, 1 llvm-svn: 23690	2005-10-10 22:04:48 +00:00
Nate Begeman	703c8f57d6	Teach the DAGCombiner several new tricks, teaching it how to turn sext_inreg into zext_inreg based on the signbit (fires a lot), srem into urem, etc. llvm-svn: 23688	2005-10-10 21:26:48 +00:00
Chris Lattner	caed5bc79b	Fix comment llvm-svn: 23686	2005-10-10 16:52:03 +00:00
Chris Lattner	58f0f21ed1	Add ISD::ADD to MaskedValueIsZero llvm-svn: 23685	2005-10-10 16:51:40 +00:00
Chris Lattner	5b6e18d6fd	This function is now dead llvm-svn: 23684	2005-10-10 16:49:22 +00:00
Chris Lattner	29613bce04	Enable Nate's excellent DAG combiner work by default. This allows the removal of a bunch of ad-hoc and crufty code from SelectionDAG.cpp. llvm-svn: 23682	2005-10-10 16:47:10 +00:00
Chris Lattner	097b306215	add a todo for something I noticed llvm-svn: 23679	2005-10-09 22:59:08 +00:00
Chris Lattner	d0eecf4e64	(X & Y) & C == 0 if either X&C or Y&C are zero llvm-svn: 23678	2005-10-09 22:12:36 +00:00
Chris Lattner	d5ac294abd	When emiting a CopyFromReg and the source is already a vreg, do not bother creating a new vreg and inserting a copy: just use the input vreg directly. This speeds up the compile (e.g. about 5% on mesa with a debug build of llc) by not adding a bunch of copies and vregs to be coallesced away. On mesa, for example, this reduces the number of intervals from 168601 to 129040 going into the coallescer. llvm-svn: 23671	2005-10-09 05:58:56 +00:00
Nate Begeman	8feae2fcc9	Lo and behold, the last bits of SelectionDAG.cpp have been moved over. llvm-svn: 23665	2005-10-08 00:29:44 +00:00
Chris Lattner	f8b0332dfc	remove debugging code llvm-svn: 23663	2005-10-07 15:31:26 +00:00
Chris Lattner	dff6183cd7	implement CodeGen/PowerPC/div-2.ll:test2-4 by propagating zero bits through C-X's llvm-svn: 23662	2005-10-07 15:30:32 +00:00
Chris Lattner	5e0581c32b	fix indentation llvm-svn: 23660	2005-10-07 06:37:02 +00:00
Chris Lattner	36b58a015b	Turn sdivs into udivs when we can prove the sign bits are clear. This implements CodeGen/PowerPC/div-2.ll llvm-svn: 23659	2005-10-07 06:10:46 +00:00
Chris Lattner	7709ee0085	silence a bogus GCC warning llvm-svn: 23646	2005-10-06 17:39:10 +00:00
Chris Lattner	3b848038ab	Fix the LLC regressions on X86 last night. In particular, when undoing previous copy elisions and we discover we need to reload a register, make sure to use the regclass of the original register for the reload, not the class of the current register. This avoid using 16-bit loads to reload 32-bit values. llvm-svn: 23645	2005-10-06 17:19:06 +00:00
Chris Lattner	0f04d333d5	Make the legalizer completely non-recursive llvm-svn: 23642	2005-10-06 01:20:27 +00:00
Nate Begeman	85d4334da0	Let the combiner handle more cases llvm-svn: 23641	2005-10-05 21:44:43 +00:00
Nate Begeman	cf23a9a328	Remove some bad code from Legalize llvm-svn: 23640	2005-10-05 21:44:10 +00:00
Nate Begeman	a34475adfc	Check in some more DAGCombiner pieces llvm-svn: 23639	2005-10-05 21:43:42 +00:00
Chris Lattner	5afc88fe07	Fix a bug in the local spiller, where we could take code like this: store r12 -> [ss#2] R3 = load [ss#1] use R3 R3 = load [ss#2] R4 = load [ss#1] and turn it into this code: store R12 -> [ss#2] R3 = load [ss#1] use R3 R3 = R12 R4 = R3 <- oops! The problem was that promoting R3 = load[ss#2] to a copy missed the fact that the instruction invalidated R3 at that point. llvm-svn: 23638	2005-10-05 18:30:19 +00:00
Chris Lattner	7f1bde4996	implement visitBR_CC so that PowerPC/inverted-bool-compares.ll passes with the dag combiner. This speeds up espresso by 8%, reaching performance parity with the dag-combiner-disabled llc. llvm-svn: 23636	2005-10-05 06:47:48 +00:00
Chris Lattner	27adcf1b0f	fix some pastos llvm-svn: 23635	2005-10-05 06:37:22 +00:00
Chris Lattner	697fdaba58	Add a new HandleNode class, which is used to handle (haha) cases in the dead node elim and dag combiner passes where the root is potentially updated. This fixes a fixme in the dag combiner. llvm-svn: 23634	2005-10-05 06:35:28 +00:00
Chris Lattner	75ce53eefd	Implement the code for PowerPC/inverted-bool-compares.ll, even though it that testcase still does not pass with the dag combiner. This is because not all forms of br* are folded yet. Also, when we combine a node into another one, delete the node immediately instead of waiting for the node to potentially come up in the future. llvm-svn: 23632	2005-10-05 06:11:08 +00:00
Chris Lattner	cf12d7b556	make sure that -view-isel-dags is the input to the isel, not the input to the second phase of dag combining llvm-svn: 23631	2005-10-05 06:09:10 +00:00
Chris Lattner	86ccb0efb4	Fix a crash compiling Olden/tsp llvm-svn: 23630	2005-10-05 04:45:43 +00:00
Jim Laskey	9a2a3d4aab	Reverting to version - until problem isolated. llvm-svn: 23622	2005-10-04 16:41:51 +00:00
Nate Begeman	9740be11f7	Fix some faulty logic in the libcall inserter. Since calls return more than one value, don't bail if one of their uses happens to be a node that's not an MVT::Other when following the chain from CALLSEQ_START to CALLSEQ_END. Once we've found a CALLSEQ_START, we can just return; there's no need to tail-recurse further up the graph. Most importantly, just because something only has one use doesn't mean we should use it's one use to follow from start to end. This faulty logic caused us to follow a chain of one-use FP operations back to a much earlier call, putting a cycle in the graph from a later start to an earlier end. This is a better fix that reverting to the workaround committed earlier today. llvm-svn: 23620	2005-10-04 02:10:55 +00:00
Nate Begeman	2dcd06e46c	Add back a workaround that fixes some breakages from chris's last change. Neither of us have yet figured out why this code is necessary, but stuff breaks if its not there. Still tracking this down... llvm-svn: 23617	2005-10-04 00:37:37 +00:00
Jim Laskey	22633f7a41	Refactor gathering node info and emission. llvm-svn: 23610	2005-10-03 12:30:32 +00:00

1 2 3 4 5 ...

1840 Commits