llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Evan Cheng	1771f6da9c	There are times when the coalescer would not coalesce away a copy but the copy can be eliminated by the allocator is the destination and source targets the same register. The most common case is when the source and destination registers are in different class. For example, on x86 mov32to32_ targets GR32_ which contains a subset of the registers in GR32. The allocator can do 2 things: 1. Set the preferred allocation for the destination of a copy to that of its source. 2. After allocation is done, change the allocation of a copy destination (if legal) so the copy can be eliminated. This eliminates 443 extra moves from 403.gcc. llvm-svn: 43662	2007-11-03 07:20:12 +00:00
Evan Cheng	8d473f667d	Add run line. llvm-svn: 43645	2007-11-02 17:36:58 +00:00
Evan Cheng	65a07e73e2	One more extract_subreg coalescing bug. llvm-svn: 43644	2007-11-02 17:35:08 +00:00
Evan Cheng	b50cc64eb0	Missing a getNumOperands check. llvm-svn: 43630	2007-11-02 01:26:22 +00:00
Dale Johannesen	c125d9b4e8	Test that expand_vector_elt(v2i64) works in 32-bit mode. llvm-svn: 43598	2007-11-01 02:38:24 +00:00
Evan Cheng	5e058e94b5	It's not safe to tell SplitCriticalEdge to merge identical edges. It may delete the phi instruction that's being processed. llvm-svn: 43524	2007-10-30 22:27:26 +00:00
Evan Cheng	633cd3e84d	- Bug fixes. - Allow icmp rewrite using an iv / stride of a smaller integer type. llvm-svn: 43480	2007-10-29 22:07:18 +00:00
Dan Gohman	02b8beff5f	Fix a DAGCombiner abort on a bitcast from a scalar to a vector. llvm-svn: 43470	2007-10-29 20:44:42 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Chris Lattner	1503362624	Add support for the x86-64 'q' regigster modifier, and add support for the b/h/w/k/q inline asm memory modifiers, which are just ignored. This fixes PR1748 and CodeGen/X86/2007-10-28-inlineasm-q-modifier.ll llvm-svn: 43430	2007-10-29 03:09:07 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Evan Cheng	66cbf54030	If a loop termination compare instruction is the only use of its stride, and the compaison is against a constant value, try eliminate the stride by moving the compare instruction to another stride and change its constant operand accordingly. e.g. loop: ... v1 = v1 + 3 v2 = v2 + 1 if (v2 < 10) goto loop => loop: ... v1 = v1 + 3 if (v1 < 30) goto loop llvm-svn: 43336	2007-10-25 09:11:16 +00:00
Dale Johannesen	402c11966a	This was failing on Darwin, which defaults to PIC; no lea was generated. I think this follows the intent. llvm-svn: 43312	2007-10-24 20:58:14 +00:00
Dan Gohman	df1f166e4a	Strength reduction improvements. - Avoid attempting stride-reuse in the case that there are users that aren't addresses. In that case, there will be places where the multiplications won't be folded away, so it's better to try to strength-reduce them. - Several SSE intrinsics have operands that strength-reduction can treat as addresses. The previous item makes this more visible, as any non-address use of an IV can inhibit stride-reuse. - Make ValidStride aware of whether there's likely to be a base register in the address computation. This prevents it from thinking that things like stride 9 are valid on x86 when the base register is already occupied. Also, XFAIL the 2007-08-10-LEA16Use32.ll test; the new logic to avoid stride-reuse elimintes the LEA in the loop, so the test is no longer testing what it was intended to test. llvm-svn: 43231	2007-10-22 20:40:42 +00:00
Dan Gohman	76e104c8ad	Fix the folding of multiplication into addresses on x86, which was broken by the recent {U,S}MUL_LOHI changes. llvm-svn: 43230	2007-10-22 20:22:24 +00:00
Evan Cheng	2d53c3f15e	New test case. llvm-svn: 43193	2007-10-19 22:05:00 +00:00
Rafael Espindola	2b5b200b9f	Test byval with a 8 bit aligned struct llvm-svn: 43173	2007-10-19 11:29:21 +00:00
Rafael Espindola	d8d4372845	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Evan Cheng	f6d1c7be14	Really fix PR1734. Carefully track which register uses are sub-register uses by traversing inverse register coalescing map. llvm-svn: 43118	2007-10-18 07:49:59 +00:00
Dan Gohman	2903f7fc26	Add support for ISD::SELECT in SplitVectorOp. llvm-svn: 43072	2007-10-17 14:48:28 +00:00
Evan Cheng	524c0e6c3d	Yet another test case for extract_subreg coalescing crash. llvm-svn: 43063	2007-10-17 02:15:06 +00:00
Evan Cheng	09fa6ed483	Fix PR1734. llvm-svn: 43035	2007-10-16 19:29:47 +00:00
Dale Johannesen	dd254c4efa	New test for svn rev 43033, radar 5538745. llvm-svn: 43034	2007-10-16 18:10:14 +00:00
Evan Cheng	f5bcd3d737	LowerFP_TO_SINT must not create a stack object if it's not needed. llvm-svn: 43004	2007-10-15 20:11:21 +00:00
Dan Gohman	2dc6099def	Reapply the fix in 42908 for this file. This changes the function names from "test" to "foo" so that they don't match the grep -i ST. llvm-svn: 43001	2007-10-15 19:22:17 +00:00
Evan Cheng	43887d3714	Fix PR1729: watch out for val# with no def. llvm-svn: 42996	2007-10-15 18:33:50 +00:00
Tanya Lattner	3a64752342	Fix run line. llvm-svn: 42990	2007-10-15 16:35:13 +00:00
Evan Cheng	850c8739dc	New test case. llvm-svn: 42963	2007-10-14 10:15:03 +00:00
Evan Cheng	33df6a6bed	Revert 42908 for now. llvm-svn: 42960	2007-10-14 05:57:21 +00:00
Evan Cheng	f5ed18f7d3	Fix test case. llvm-svn: 42949	2007-10-13 03:14:06 +00:00
Evan Cheng	6101e4ffdf	New tests. llvm-svn: 42948	2007-10-13 03:10:54 +00:00
Dan Gohman	c96f2809ca	Fix this test to not depend on the assembly output containing something that includes the string "st". This probably fixes the regression on Darwin. llvm-svn: 42932	2007-10-12 20:42:14 +00:00
Dan Gohman	a75e4a62e6	Change the names used for internal labels to use the current function symbol name instead of a codegen-assigned function number. Thanks Evan! :-) llvm-svn: 42908	2007-10-12 14:53:36 +00:00
Evan Cheng	51791564b0	Doh. llvm-svn: 42901	2007-10-12 09:10:27 +00:00
Evan Cheng	947b4a6c3d	EXTRACT_SUBREG test case. llvm-svn: 42900	2007-10-12 09:03:31 +00:00
Arnold Schwaighofer	f1e49dd41d	Added missing -march=x86 flag. llvm-svn: 42893	2007-10-12 07:49:48 +00:00
Dan Gohman	ab5c3ed0d1	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Dan Gohman	9a70be46f1	Add an explicit target triple to make this test behave as expected on non-Apple hosts. And use the count script instead of wc + grep. llvm-svn: 42878	2007-10-11 23:04:36 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dan Gohman	708e76e663	These two tests now require only two multiply instructions, instead of four. llvm-svn: 42784	2007-10-09 15:39:37 +00:00
Evan Cheng	25b65542d9	Update test. llvm-svn: 42775	2007-10-08 22:20:32 +00:00
Dan Gohman	9da3bddf43	These two tests now require only three multiply instructions, instead of four. llvm-svn: 42765	2007-10-08 20:48:12 +00:00
Dale Johannesen	b600202c68	Make test work on non-x86 hosts. llvm-svn: 42671	2007-10-06 01:22:39 +00:00
Evan Cheng	0a642eaa62	Test case for 3-address conversion. llvm-svn: 42664	2007-10-05 23:33:09 +00:00
Evan Cheng	dc467c6323	Enable convertToThreeAddress for X86 by default. llvm-svn: 42655	2007-10-05 22:31:10 +00:00
Evan Cheng	6fd2606ff5	New test case. llvm-svn: 42628	2007-10-05 01:44:22 +00:00
Evan Cheng	1d3c836933	-pre-RA-sched=none, simple, simple-noitin are gone. llvm-svn: 42505	2007-10-01 22:17:20 +00:00
Dan Gohman	02f80006f8	Teach SplitVectorOp how to split INSERT_VECTOR_ELT. llvm-svn: 42457	2007-09-28 23:53:40 +00:00
Rafael Espindola	01b306e575	Refactor the memcpy lowering for the x86 target. The only generated code difference is that now we call memcpy when the size of the array is unknown. This matches GCC behavior and is better since the run time value can be arbitrarily large. llvm-svn: 42433	2007-09-28 12:53:01 +00:00
Dale Johannesen	e61886cee4	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00

1 2 3 4 5 ...

254 Commits