llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Chris Lattner	b628208161	Finish implementing a readme entry: when inserting an i64 variable into a vector of zeros or undef, and when the top part is obviously zero, we can just use movd + shuffle. This allows us to compile vec_set-B.ll into: _test3: movl $1234567, %eax andl 4(%esp), %eax movd %eax, %xmm0 ret instead of: _test3: subl $28, %esp movl $1234567, %eax andl 32(%esp), %eax movl %eax, (%esp) movl $0, 4(%esp) movq (%esp), %xmm0 addl $28, %esp ret llvm-svn: 48090	2008-03-09 05:42:06 +00:00
Chris Lattner	17f68a3075	Implement a readme entry, compiling #include <xmmintrin.h> __m128i doload64(short x) {return _mm_set_epi16(0,0,0,0,0,0,0,1);} into: movl $1, %eax movd %eax, %xmm0 ret instead of a constant pool load. llvm-svn: 48063	2008-03-09 01:05:04 +00:00
Chris Lattner	1b16dc4a6e	upgrade this test llvm-svn: 48062	2008-03-09 00:32:10 +00:00
Chris Lattner	24031c9426	make this test harder llvm-svn: 48061	2008-03-09 00:30:06 +00:00
Chris Lattner	7173d3bd70	Teach SD some vector identities, allowing us to compile vec_set-9 into: _test3: movd %rdi, %xmm1 #IMPLICIT_DEF %xmm0 punpcklqdq %xmm1, %xmm0 ret instead of: _test3: #IMPLICIT_DEF %rax movd %rax, %xmm0 movd %rdi, %xmm1 punpcklqdq %xmm1, %xmm0 ret This is still not ideal. There is no reason to use two xmm regs. llvm-svn: 48058	2008-03-08 23:43:36 +00:00
Evan Cheng	dba1dfe962	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0|1|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Dan Gohman	afeac8050d	Add support for calls with i128 return values on ppc64. llvm-svn: 48041	2008-03-08 00:19:12 +00:00
Dan Gohman	5bf464e1e6	Add support for lowering 128-bit shifts on ppc64. llvm-svn: 48029	2008-03-07 20:36:53 +00:00
Evan Cheng	5f74d1a8dc	Fixed a register scavenger bug. If a def is re-defining part of a super register, there must be an implicit def of the super-register on the MI. llvm-svn: 48024	2008-03-07 20:12:54 +00:00
Chris Lattner	aa81dc7d21	mark frem as expand for all legal fp types on x86, regardless of whether we're using SSE or not. This fixes PR2122. llvm-svn: 48006	2008-03-07 06:36:32 +00:00
Bill Wendling	f0615bf090	Add testcase. llvm-svn: 48005	2008-03-06 23:34:22 +00:00
Gabor Greif	92e00c1e50	some more spelling changes llvm-svn: 47996	2008-03-06 10:51:21 +00:00
Evan Cheng	9c77211639	Constant fold SIGN_EXTEND_INREG with ashr not lshr. llvm-svn: 47992	2008-03-06 08:20:51 +00:00
Scott Michel	8dd2324903	Refine Cell's i64 constant generation code to cover more constants where the upper and lower 32-bits are the same (in addition to 0 and -1 previously.) llvm-svn: 47985	2008-03-06 04:02:54 +00:00
Scott Michel	3c19a3950c	- Expand tabs to spaces. - select_bits.ll now fully functional now that PR1993 is closed. It was previously broken by refactoring in SPUInstrInfo.td and using multiclasses. - Same for eqv.ll llvm-svn: 47972	2008-03-05 23:00:19 +00:00
Evan Cheng	8f5092bea8	Fix a coalescer bug wrt how dead copy interval is shortened. llvm-svn: 47966	2008-03-05 22:09:42 +00:00
Chris Lattner	a9fcb187af	Generalize FP constant shrinking optimization to apply to any vt except ppc long double. This allows us to shrink constant pool entries for x86 long double constants, which in turn allows us to use flds/fldl instead of fldt. llvm-svn: 47938	2008-03-05 06:48:13 +00:00
Evan Cheng	e0b3c221ab	Add a target lowering hook to control whether it's worthwhile to compress fp constant. For x86, if sse2 is available, it's not a good idea since cvtss2sd is slower than a movsd load and it prevents load folding. On x87, it's important to shrink fp constant since fldt is very expensive. llvm-svn: 47931	2008-03-05 01:30:59 +00:00
Bill Wendling	cf70f2c2a0	Use -enable-ppc-regscavenger flag for these checks llvm-svn: 47916	2008-03-04 23:16:26 +00:00
Evan Cheng	14f556a6d7	Really fix the test. llvm-svn: 47882	2008-03-04 08:01:56 +00:00
Evan Cheng	7a67175fcc	Fix broken test. llvm-svn: 47881	2008-03-04 07:59:13 +00:00
Evan Cheng	3123d6ced3	Add PR1501 test case. llvm-svn: 47874	2008-03-04 00:47:45 +00:00
Bill Wendling	8d64999daf	This is the initial check-in for adding register scavenging to PPC. (Currently, PPC-64 doesn't work.) This also lowers the spilling of the CR registers so that it uses a register other than the default R0 register (the scavenger scrounges for one). A significant part of this patch fixes how kill information is handled. llvm-svn: 47863	2008-03-03 22:19:16 +00:00
Chris Lattner	299977b5ca	Evan implemented these. llvm-svn: 47828	2008-03-02 18:05:14 +00:00
Chris Lattner	4c67781884	Evan implemented this. llvm-svn: 47827	2008-03-02 17:56:29 +00:00
Chris Lattner	7f3c8ea9bf	add a testcase for misc vector stuff llvm-svn: 47826	2008-03-02 08:57:59 +00:00
Bill Wendling	246237f14f	DCE'ed this testcase. llvm-svn: 47760	2008-02-29 19:28:11 +00:00
Bill Wendling	d0f85a5ca3	If we reload a virtual register that's already been assigned, we want to mark that instruction as its "last use". This fixes PR1925. llvm-svn: 47758	2008-02-29 18:52:01 +00:00
Lauro Ramos Venancio	15e03a8950	Update testcase. llvm-svn: 47735	2008-02-28 23:13:15 +00:00
Evan Cheng	e1d3e0958b	Set to default: x86 no longer folds 'and' into 'test' if it has more than one use. llvm-svn: 47711	2008-02-28 07:46:38 +00:00
Dale Johannesen	7b8dae547d	New test for misaligned Altivec load/store. llvm-svn: 47698	2008-02-27 23:14:50 +00:00
Dan Gohman	5d96a06b0b	Don't hard-code the mask size to be 32, which is incorrect on ppc64 and was causing aborts with the new APInt changes. This may also be fixing an obscure ppc64 bug. llvm-svn: 47692	2008-02-27 21:12:32 +00:00
Evan Cheng	da92e34fe3	Fix a bug in dead spill slot elimination. llvm-svn: 47687	2008-02-27 19:57:11 +00:00
Chris Lattner	e51c23341d	actually run llc, thanks Dan :) llvm-svn: 47677	2008-02-27 17:46:54 +00:00
Evan Cheng	295ae42ede	Don't track max alignment during stack object allocations since they can be deleted later. Let PEI compute it. llvm-svn: 47668	2008-02-27 10:04:56 +00:00
Chris Lattner	1f46cc2345	Make X86TargetLowering::LowerSINT_TO_FP return without creating a dead stack slot and store if the SINT_TO_FP is actually legal. This allows us to compile: double a(double b) {return (unsigned)b;} to: _a: cvttsd2siq %xmm0, %rax movl %eax, %eax cvtsi2sdq %rax, %xmm0 ret instead of: _a: subq $8, %rsp cvttsd2siq %xmm0, %rax movl %eax, %eax cvtsi2sdq %rax, %xmm0 addq $8, %rsp ret crazy. llvm-svn: 47660	2008-02-27 05:57:41 +00:00
Chris Lattner	bc686e546a	Compile x86-64-and-mask.ll into: _test: movl %edi, %eax ret instead of: _test: movl $4294967295, %ecx movq %rdi, %rax andq %rcx, %rax ret It would be great to write this as a Pat pattern that used subregs instead of a 'pseudo' instruction, but I don't know how to do that in td files. llvm-svn: 47658	2008-02-27 05:47:54 +00:00
Evan Cheng	7553230e3a	Spiller now removes unused spill slots. llvm-svn: 47657	2008-02-27 03:04:06 +00:00
Bill Wendling	24932cf415	Upgrade testcase. llvm-svn: 47644	2008-02-26 23:22:40 +00:00
Evan Cheng	701b6a1dc3	Enable -coalescer-commute-instrs by default. llvm-svn: 47623	2008-02-26 20:40:22 +00:00
Dan Gohman	8a8f3fe7e0	Avoid aborting on invalid shift counts. llvm-svn: 47612	2008-02-26 18:50:50 +00:00
Eli Friedman	1f2cabfbcf	Fix for pr2093: direct operands aren't necessarily addresses, so don't try to simplify them. llvm-svn: 47610	2008-02-26 18:37:49 +00:00
Chris Lattner	1a461075ef	Fix PR2096, a regression introduced with my patch last night. This also fixes cfrac, flops, and 175.vpr llvm-svn: 47605	2008-02-26 17:09:59 +00:00
Gabor Greif	a8990a9cac	Really feed llvm-as with the testcase, do not let it read from stdin. This fixes the hangs seen on solaris10. llvm-svn: 47604	2008-02-26 13:37:13 +00:00
Evan Cheng	8e99554e84	This is possible: vr1 = extract_subreg vr2, 3 ... vr3 = extract_subreg vr1, 2 The end result is vr3 is equal to vr2 with subidx 2. llvm-svn: 47592	2008-02-26 08:03:41 +00:00
Chris Lattner	5b4101cf68	Fix isNegatibleForFree to not return true for ConstantFP nodes after legalize. Just because a constant is legal (e.g. 0.0 in SSE) doesn't mean that its negated value is legal (-0.0). We could make this stronger by checking to see if the negated constant is actually legal post negation, but it doesn't seem like a big deal. llvm-svn: 47591	2008-02-26 07:04:54 +00:00
Evan Cheng	6366bbf577	Fix PR2076. CodeGenPrepare now sinks address computation for inline asm memory operands into inline asm block. llvm-svn: 47589	2008-02-26 02:42:37 +00:00
Scott Michel	36bbc39272	Merge current work back to tree to minimize diffs and drift. Major highlights for CellSPU modifications: - SPUInstrInfo.td refactoring: "multiclass" really is _your_ friend. - Other improvements based on refactoring effort in SPUISelLowering.cpp, esp. in SPUISelLowering::PerformDAGCombine(), where zero amount shifts and rotates are now eliminated, other scalar-to-vector-to-scalar silliness is also eliminated. - 64-bit operations are being implemented, _muldi3.c gcc runtime now compiles and generates the right code. More work still needs to be done. llvm-svn: 47532	2008-02-23 18:41:37 +00:00
Evan Cheng	d299f09bc5	Rematerialization logic was overly conservative when it comes to loads from fixed stack slots. llvm-svn: 47529	2008-02-23 03:38:34 +00:00
Evan Cheng	6782480bd1	Update test. llvm-svn: 47527	2008-02-23 02:57:25 +00:00
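
Note on 9c77211639 ("Constant fold SIGN_EXTEND_INREG with ashr not lshr"): the sketch below is not LLVM code. It is a minimal Python illustration, assuming a 32-bit register and a hypothetical helper named shift_extend, of why the choice of right shift matters when folding a sign extension of the low bits of a value. Shifting the low bits to the top and back down with an arithmetic shift replicates the sign bit; a logical shift fills with zeros and so produces a zero extension instead.

```python
def shift_extend(value, from_bits, width=32, arithmetic=True):
    """Move the low `from_bits` bits to the top of a `width`-bit register,
    then shift them back down. Hypothetical helper, for illustration only."""
    mask = (1 << width) - 1
    shift = width - from_bits
    top = (value << shift) & mask           # low bits now occupy the high end
    if arithmetic and top & (1 << (width - 1)):
        top -= 1 << width                   # reinterpret as a negative number
    return (top >> shift) & mask            # ashr if negative, lshr otherwise


# Sign-extending the 8-bit value 0xFF (-1) within a 32-bit register:
print(hex(shift_extend(0xFF, 8)))                    # arithmetic shift
print(hex(shift_extend(0xFF, 8, arithmetic=False)))  # logical shift
```

With the arithmetic shift the 8-bit value -1 stays -1 at 32 bits; with the logical shift it becomes 255, which is the wrong constant for a sign extension.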

736 Commits