llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Chris Lattner	9d7971791b	Start inferring side effect information more aggressively, and fix many bugs in the x86 backend where instructions were not marked maystore/mayload, and perf issues where instructions were not marked neverHasSideEffects. It would be really nice if we could write patterns for copy instructions. I have audited all the x86 instructions down to MOVDQAmr. The flags on others and on other targets are probably not right in all cases, but no clients currently use this info that are enabled by default. llvm-svn: 45829	2008-01-10 07:59:24 +00:00
Chris Lattner	6ad01a9965	remove explicit sets of 'neverHasSideEffects' that can now be inferred from the instr patterns. llvm-svn: 45824	2008-01-10 05:45:39 +00:00
Chris Lattner	14310afe42	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Bill Wendling	e5af8b6e5c	Add "mayHaveSideEffects" and "neverHasSideEffects" flags to some instructions. I based what flag to set on whether it was already marked as "isRematerializable". If there was a further check to determine if it's "really" rematerializable, then I marked it as "mayHaveSideEffects" and created a check in the X86 back-end similar to the remat one. llvm-svn: 45132	2007-12-17 23:07:56 +00:00
Evan Cheng	64a1febf9a	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Chris Lattner	be0c5a0500	Fix a long standing deficiency in the X86 backend: we would sometimes emit "zero" and "all one" vectors multiple times, for example: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 pcmpeqd %mm0, %mm0 movq %mm0, _M2 ret instead of: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 movq %mm0, _M2 ret This patch fixes this by always arranging for zero/one vectors to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be any random type. This ensures they get trivially CSE'd on the dag. This fix is also important for LegalizeDAGTypes, as it gets unhappy when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when 'i64' isn't legal. This patch makes the following changes: 1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into their canonical types. 2) The now-dead patterns are removed from the SSE/MMX .td files. 3) All the patterns in the .td file that referred to immAllOnesV or immAllZerosV in the wrong form now use *_bc to match them with a bitcast wrapped around them. 4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle bitcast'd zero vectors, which simplifies the code actually. 5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that is legal, instead of generating one that is illegal and expecting a later legalize pass to clean it up. 6) isZeroShuffle is generalized to handle bitcast of zeros. 7) several other minor tweaks. This patch is definite goodness, but has the potential to cause random code quality regressions. Please be on the lookout for these and let me know if they happen. llvm-svn: 44310	2007-11-25 00:24:49 +00:00
Evan Cheng	b43255bc68	Remove (somewhat confusing) Imp<> helper, use let Defs = [], Uses = [] instead. llvm-svn: 41863	2007-09-11 19:55:27 +00:00
Evan Cheng	527fe7ab57	Mark load instructions with isLoad = 1. llvm-svn: 41595	2007-08-30 05:49:43 +00:00
Dan Gohman	a599a813d5	Mark the SSE and MMX load instructions that X86InstrInfo::isReallyTriviallyReMaterializable knows how to handle with the isReMaterializable flag so that it is given a chance to handle them. Without hoisting constant-pool loads from loops this isn't very visible, though it does keep CodeGen/X86/constant-pool-remat-0.ll from making a copy of the constant pool on the stack. llvm-svn: 40736	2007-08-02 14:27:55 +00:00
Dan Gohman	e3464e6bec	Change the x86 assembly output to use tab characters to separate the mnemonics from their operands instead of single spaces. This makes the assembly output a little more consistent with various other compilers (f.e. GCC), and slightly easier to read. Also, update the regression tests accordingly. llvm-svn: 40648	2007-07-31 20:11:57 +00:00
Evan Cheng	3493ec0ce1	Redo and generalize previously removed opt for pinsrw: (vextract (v4i32 bc (v4f32 s2v (f32 load ))), 0) -> (i32 load ) llvm-svn: 40628	2007-07-31 08:04:03 +00:00
Evan Cheng	8312ed6f77	Change instruction description to split OperandList into OutOperandList and InOperandList. This gives one piece of important information: # of results produced by an instruction. An example of the change: def ADD32rr : I<0x01, MRMDestReg, (ops GR32:$dst, GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; => def ADD32rr : I<0x01, MRMDestReg, (outs GR32:$dst), (ins GR32:$src1, GR32:$src2), "add{l} {$src2, $dst\|$dst, $src2}", [(set GR32:$dst, (add GR32:$src1, GR32:$src2))]>; llvm-svn: 40033	2007-07-19 01:14:50 +00:00
Bill Wendling	2e66551f22	Support generation of GR64 to MMX code in the JIT. llvm-svn: 37866	2007-07-04 01:29:22 +00:00
Bill Wendling	3600c7a835	Allow a GR64 to be moved into an MMX register via the "movd" instruction. Still need to have JIT generate this code. llvm-svn: 37863	2007-07-04 00:19:54 +00:00
Dan Gohman	9cbc3fb1ab	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). llvm-svn: 37728	2007-06-26 00:48:07 +00:00
Dan Gohman	b60d8a92c9	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. llvm-svn: 37644	2007-06-19 01:48:05 +00:00
Chris Lattner	e67947b38f	implement the missing maskmovq mmx intrinsic that akor hit. llvm-svn: 37100	2007-05-16 06:08:17 +00:00
Bill Wendling	498c102df6	Add the final MMX instructions. Correct a few wrong patterns. llvm-svn: 36405	2007-04-24 21:18:37 +00:00
Bill Wendling	a4aa65bc38	Adding more MMX instructions. llvm-svn: 35638	2007-04-03 23:48:32 +00:00
Bill Wendling	ca2124e5a9	Add FEMMS and ADDQ. Renamed MMX recipes to prepend the MMX_ to them. llvm-svn: 35616	2007-04-03 06:00:37 +00:00
Bill Wendling	1087888176	Unbreak mmx arithmetic. It was barfing trying to do v8i8 arithmetic. llvm-svn: 35392	2007-03-28 00:57:11 +00:00
Bill Wendling	6b555c80c0	Add the "unpack low packed data" instructions. This should be the last of the MMX instructions that are needed... llvm-svn: 35389	2007-03-27 21:20:36 +00:00
Bill Wendling	d43819da2f	Fix so that pandn is emitted instead of an xor/and combo. Add integer comparison operators. llvm-svn: 35385	2007-03-27 20:22:40 +00:00
Bill Wendling	a42484728c	Add support for the v1i64 type. This makes better code for this: #include <mmintrin.h> extern __m64 C; void baz(__v2si A, __v2si B) { *A = C; _mm_empty(); } We get this: _baz: call "L1$pb" "L1$pb": popl %eax movl L_C$non_lazy_ptr-"L1$pb"(%eax), %eax movq (%eax), %mm0 movl 4(%esp), %eax movq %mm0, (%eax) emms ret GCC gives us this: _baz: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx subl $8, %esp movl L_C$non_lazy_ptr-"L00000000001$pb"(%ebx), %eax movl (%eax), %edx movl 4(%eax), %ecx movl 16(%esp), %eax movl %edx, (%eax) movl %ecx, 4(%eax) emms addl $8, %esp popl %ebx ret llvm-svn: 35351	2007-03-26 07:53:08 +00:00
Bill Wendling	124f2c8706	PR1260: Add final support to get the QT example to compile. llvm-svn: 35290	2007-03-23 22:35:46 +00:00
Bill Wendling	e6a9c6dfe6	We generate a shufflevector instruction, so we don't need the builtin intrinsic. llvm-svn: 35269	2007-03-22 20:29:26 +00:00
Bill Wendling	1bcad4c1cd	Support added for shifts and unpacking MMX instructions. llvm-svn: 35266	2007-03-22 18:42:45 +00:00
Bill Wendling	8ced23ee5a	And now support for MMX logical operations. llvm-svn: 35125	2007-03-16 09:44:46 +00:00
Bill Wendling	feaff80149	Multiplication support for MMX. llvm-svn: 35118	2007-03-15 21:24:36 +00:00
Bill Wendling	236cfc4344	Adding more arithmetic operators to MMX. This is an almost exact copy of the addition. Please let me know if you have suggestions. llvm-svn: 35055	2007-03-10 09:57:05 +00:00
Bill Wendling	5fef3fd7e7	Added "padd*" support for MMX. Added MMX move stuff to X86InstrInfo so that moves, loads, etc. are recognized. llvm-svn: 35031	2007-03-08 22:09:11 +00:00
Bill Wendling	8f49ba1000	Remove useless pattern fragments. llvm-svn: 35009	2007-03-07 18:23:09 +00:00
Bill Wendling	3c201ddd02	Properly support v8i8 and v4i16 types. It now converts them to v2i32 for load and stores. llvm-svn: 35002	2007-03-07 05:43:18 +00:00
Bill Wendling	a02d43fbbd	Add LOAD/STORE support for MMX. llvm-svn: 34978	2007-03-06 18:53:42 +00:00
Bill Wendling	c52174dee3	Add the emms intrinsic for MMX support. llvm-svn: 34938	2007-03-05 23:09:45 +00:00
Evan Cheng	a2eaed93a0	INC / DEC instructions have shorter code size than ADD32ri8, etc. llvm-svn: 29194	2006-07-19 00:27:29 +00:00
Evan Cheng	dc9b5f5fc0	X86 integer register classes naming changes. Make them consistent with FP, vector classes. llvm-svn: 28324	2006-05-16 07:21:53 +00:00
Evan Cheng	8768f25c80	SSE / SSE2 conversion intrinsics. llvm-svn: 27637	2006-04-12 23:42:44 +00:00
Evan Cheng	798acd4094	movnt* and maskmovdqu intrinsics llvm-svn: 27587	2006-04-11 06:57:30 +00:00
Evan Cheng	cad90504fe	Instruction encoding bug llvm-svn: 27102	2006-03-25 06:00:03 +00:00
Evan Cheng	b280b34497	Added CVTTPS2PI. llvm-svn: 27095	2006-03-25 01:31:59 +00:00
Evan Cheng	7c8d7bc4b3	Didn't mean to check this in. No MMX support yet. llvm-svn: 26933	2006-03-21 23:04:23 +00:00
Evan Cheng	47dd756c72	- Use movaps to store 128-bit vector integers. - Each scalar to vector v8i16 and v16i8 is a any_extend followed by a movd. llvm-svn: 26932	2006-03-21 23:01:21 +00:00
Evan Cheng	6ec225863c	- Remove scalar to vector pseudo ops. They are just wrong. - Handle FR32 to VR128:v4f32 and FR64 to VR128:v2f64 with aliases of MOVAPS and MOVAPD. Mark them as move instructions and hope they will be deleted. llvm-svn: 26919	2006-03-21 07:09:35 +00:00
Evan Cheng	a4db61ddc1	x86 ISD::SCALAR_TO_VECTOR support. llvm-svn: 26911	2006-03-21 00:33:35 +00:00
Evan Cheng	c63d434203	Move a few things around. llvm-svn: 26893	2006-03-20 06:04:52 +00:00
Evan Cheng	8ab6294f94	One more round of reorg so sabre doesn't freak out. :-) llvm-svn: 26303	2006-02-21 20:00:20 +00:00
Evan Cheng	fee17dfff8	Split instruction info into multiple files, one for each of x87, MMX, and SSE. llvm-svn: 26300	2006-02-21 19:13:53 +00:00

49 Commits