llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Chris Lattner	ac9ed02670	fix wonky indentation llvm-svn: 31298	2006-10-30 22:27:23 +00:00
Chris Lattner	dcfee77788	add another target hook for branch folding. llvm-svn: 31262	2006-10-28 17:29:57 +00:00
Chris Lattner	016325f336	Implement support for branch condition reversal. llvm-svn: 31099	2006-10-21 05:52:40 +00:00
Chris Lattner	20bb8bfd45	Simplify code, no functionality change llvm-svn: 31097	2006-10-21 05:42:09 +00:00
Chris Lattner	b638d287f4	allow insertion of a conditional branch with fall-through llvm-svn: 31095	2006-10-21 05:34:23 +00:00
Chris Lattner	596c126372	update assert message llvm-svn: 31093	2006-10-21 04:42:29 +00:00
Chris Lattner	3fb2b87f17	bugfix llvm-svn: 31074	2006-10-20 20:44:34 +00:00
Chris Lattner	62a0f00312	Implement branch analysis/xform hooks required by the branch folding pass. llvm-svn: 31065	2006-10-20 17:42:20 +00:00
Chris Lattner	7353b07275	expose DWARF_LABEL opcode# so the branch folder can update debug info properly. llvm-svn: 31024	2006-10-17 22:41:45 +00:00
Chris Lattner	bea7a9de50	remove some dead code llvm-svn: 30938	2006-10-13 20:40:42 +00:00
Chris Lattner	04ad43b4de	update comments llvm-svn: 30663	2006-09-28 23:33:12 +00:00
Evan Cheng	15dd42884e	Committing X86-64 support. llvm-svn: 30177	2006-09-08 06:48:29 +00:00
Chris Lattner	59a4d8dfcd	Fix a long-standing wart in the code generator: two-address instruction lowering actually removes one of the operands, instead of just assigning both operands the same register. This make reasoning about instructions unnecessarily complex, because you need to know if you are before or after register allocation to match up operand #'s with the target description file. Changing this also gets rid of a bunch of hacky code in various places. This patch also includes changes to fold loads into cmp/test instructions in the X86 backend, along with a significant simplification to the X86 spill folding code. llvm-svn: 30108	2006-09-05 02:12:02 +00:00
Evan Cheng	692215be9c	Can't commute shufps. The high / low parts elements come from different vectors. llvm-svn: 29275	2006-07-25 20:25:40 +00:00
Evan Cheng	e2397256c1	Commute shufps / shufpd. llvm-svn: 28577	2006-05-30 23:34:30 +00:00
Evan Cheng	88bd79b75b	Somehow I lost a condition when I was shuffling some code around. Anyway, only transform a shufps to pshufd when the first two operands are the same. llvm-svn: 28575	2006-05-30 22:13:36 +00:00
Evan Cheng	bdb6af8e7d	Fix a build breaker. llvm-svn: 28574	2006-05-30 21:45:53 +00:00
Evan Cheng	66bfb1dc9a	Oops. PSHUFD is only available with SSE2. llvm-svn: 28573	2006-05-30 21:30:59 +00:00
Evan Cheng	03ca651244	Allow shufps x, x, mask to be converted to pshufd x, mask to save a move. llvm-svn: 28565	2006-05-30 20:26:50 +00:00
Evan Cheng	fb2038985a	These can be transformed into lea as well. Not that we use this feature currently... llvm-svn: 28393	2006-05-19 18:43:41 +00:00
Evan Cheng	6a08dd641a	Add MOV16_rm / MOV32_rm and MOV16_mr / MOV32_mr to isLoadFromStackSlot and isStoreToStackSlot llvm-svn: 28223	2006-05-11 07:33:49 +00:00
Evan Cheng	0fb3fc3626	Fixing truncate. Previously we were emitting truncate from r16 to r8 as movw. That is we promote the destination operand to r16. So %CH = TRUNC_R16_R8 %BP is emitted as movw %bp, %cx. This is incorrect. If %cl is live, it would be clobbered. Ideally we want to do the opposite, that is emitted it as movb ??, %ch But this is not possible since %bp does not have a r8 sub-register. We are now defining a new register class R16_ which is a subclass of R16 containing only those 16-bit registers that have r8 sub-registers (i.e. AX - DX). We isel the truncate to two instructions, a MOV16to16_ to copy the value to the R16_ class, followed by a TRUNC_R16_R8. Due to bug 770, the register colaescer is not going to coalesce between R16 and R16_. That will be fixed later so we can eliminate the MOV16to16_. Right now, it can only be eliminated if we are lucky that source and destination registers are the same. llvm-svn: 28164	2006-05-08 08:01:26 +00:00
Chris Lattner	3e2a664ada	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot llvm-svn: 27801	2006-04-18 16:44:51 +00:00
Evan Cheng	169240beb7	- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc. - Some bug fixes and naming inconsistency fixes. llvm-svn: 27377	2006-04-03 20:53:28 +00:00
Evan Cheng	bdb85b387f	Support for scalar to vector with zero extension. llvm-svn: 27091	2006-03-24 23:15:12 +00:00
Evan Cheng	6ec225863c	- Remove scalar to vector pseudo ops. They are just wrong. - Handle FR32 to VR128:v4f32 and FR64 to VR128:v2f64 with aliases of MOVAPS and MOVAPD. Mark them as move instructions and hope they will be deleted. llvm-svn: 26919	2006-03-21 07:09:35 +00:00
Evan Cheng	bf4008c701	1. Use pxor instead of xoraps / xorapd to clear FR32 / FR64 registers. This proves to be worth 20% on Ptrdist/ks. Might be related to dependency breaking support. 2. Added FsMOVAPSrr and FsMOVAPDrr as aliases to MOVAPSrr and MOVAPDrr. These are used for FR32 / FR64 reg-to-reg copies. 3. Tell reg-allocator to generate MOVSSrm / MOVSDrm and MOVSSmr / MOVSDmr to spill / restore FsMOVAPSrr and FsMOVAPDrr. llvm-svn: 26241	2006-02-16 22:45:17 +00:00
Chris Lattner	452b75a57b	fix operand numbers llvm-svn: 25915	2006-02-02 20:38:12 +00:00
Chris Lattner	15cb732cd7	Move isLoadFrom/StoreToStackSlot from MRegisterInfo to TargetInstrInfo,a far more logical place. Other methods should also be moved if anyoneis interested. :) llvm-svn: 25913	2006-02-02 20:12:32 +00:00
Evan Cheng	8c988b64ae	Tell codegen MOVAPSrr and MOVAPDrr are copies. llvm-svn: 25889	2006-02-01 23:03:16 +00:00
Nate Begeman	3b6c2df603	Properly split f32 and f64 into separate register classes for scalar sse fp fixing a bunch of nasty hackery llvm-svn: 23735	2005-10-14 22:06:00 +00:00
Nate Begeman	7a1bc7318d	Teach the register allocator that movaps is also a move instruction llvm-svn: 22451	2005-07-16 02:00:20 +00:00
Nate Begeman	e5314eb2c2	First round of support for doing scalar FP using the SSE2 ISA extension and XMM registers. There are many known deficiencies and fixmes, which will be addressed ASAP. The major benefit of this work is that it will allow the LLVM register allocator to allocate FP registers across basic blocks. The x86 backend will still default to x87 style FP. To enable this work, you must pass -enable-sse-scalar-fp and either -sse2 or -sse3 to llc. An example before and after would be for: double foo(double *P) { double Sum = 0; int i; for (i = 0; i < 1000; ++i) Sum += P[i]; return Sum; } The inner loop looks like the following: x87: .LBB_foo_1: # no_exit fldl (%esp) faddl (%eax,%ecx,8) fstpl (%esp) incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit SSE2: addsd (%eax,%ecx,8), %xmm0 incl %ecx cmpl $1000, %ecx #FP_REG_KILL jne .LBB_foo_1 # no_exit llvm-svn: 22340	2005-07-06 18:59:04 +00:00
Misha Brukman	bf3f6181fd	* Remove trailing whitespace * Convert tabs to spaces llvm-svn: 21426	2005-04-21 23:38:14 +00:00
Chris Lattner	b75589131d	When commuting these instructions, make sure to actually swap the operands too. llvm-svn: 19694	2005-01-19 16:55:52 +00:00
Chris Lattner	9d5ee289d7	Improve coverage of the X86 instruction set by adding 16-bit shift doubles. llvm-svn: 19687	2005-01-19 07:31:24 +00:00
Chris Lattner	c03f360215	Teach the code generator that shrd/shld is commutable if it has an immediate. This allows us to generate this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %EDX, DWORD PTR [%ESP + 8] shld %EDX, %EDX, 2 shl %EAX, 2 ret instead of this: foo: mov %EAX, DWORD PTR [%ESP + 4] mov %ECX, DWORD PTR [%ESP + 8] mov %EDX, %EAX shrd %EDX, %ECX, 30 shl %EAX, 2 ret Note the magically transmogrifying immediate. llvm-svn: 19686	2005-01-19 07:11:01 +00:00
Chris Lattner	a78fd4726e	Disable 2->3 address promotion of add and inc instructions to LEA's. In addition to being three address, LEA's don't set the flags. This fixes 186.crafty. llvm-svn: 19251	2005-01-02 04:18:17 +00:00
Chris Lattner	d6bc921fa8	Implement the convertToThreeAddress method, add support for inverting JP/JNP branches. llvm-svn: 19247	2005-01-02 02:37:07 +00:00
Chris Lattner	2677b71f64	Fix a warning llvm-svn: 15409	2004-08-01 19:31:30 +00:00
Alkis Evlogimenos	cdcb1c62e5	Align breaks. llvm-svn: 15371	2004-07-31 10:05:44 +00:00
Chris Lattner	0d66480e9e	Add breaks llvm-svn: 15365	2004-07-31 09:53:31 +00:00
Alkis Evlogimenos	1eb8a5dc09	Simplify code a bit. llvm-svn: 15364	2004-07-31 09:44:32 +00:00
Alkis Evlogimenos	de150fb74b	Correctly spell 'unconditional'. llvm-svn: 15363	2004-07-31 09:41:44 +00:00
Alkis Evlogimenos	bc3d550391	Implement insertGoto and reverseBranchCondition for the X86. llvm-svn: 15362	2004-07-31 09:38:47 +00:00
Alkis Evlogimenos	7ecfe0a839	A big X86 instruction rename. The instructions are renamed to make their names more decriptive. A name consists of the base name, a default operand size followed by a character per operand with an optional special size. For example: ADD8rr -> add, 8-bit register, 8-bit register IMUL16rmi -> imul, 16-bit register, 16-bit memory, 16-bit immediate IMUL16rmi8 -> imul, 16-bit register, 16-bit memory, 8-bit immediate MOVSX32rm16 -> movsx, 32-bit register, 16-bit memory llvm-svn: 11995	2004-02-29 08:50:03 +00:00
Chris Lattner	c2977ac665	Adjust to change in TII ctor arguments llvm-svn: 11987	2004-02-29 06:31:44 +00:00
Chris Lattner	cfc8f02250	These two virtual methods are never called. llvm-svn: 11984	2004-02-29 05:59:33 +00:00
Alkis Evlogimenos	7f7d70a53c	Move MOTy::UseType enum into MachineOperand. This eliminates the switch statements in the constructors and simplifies the implementation of the getUseType() member function. You will have to specify defs using MachineOperand::Def instead of MOTy::Def though (similarly for Use and UseAndDef). llvm-svn: 11715	2004-02-22 19:23:26 +00:00
Alkis Evlogimenos	6d6ab846af	Remove getAllocatedRegNum(). Use getReg() instead. llvm-svn: 11393	2004-02-13 21:01:20 +00:00

1 2

71 Commits