llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Evan Cheng	1c33e83af5	Don't do all the lowering stuff for 2-wide build_vector's. Also, minor optimization for shuffle of undef. llvm-svn: 27946	2006-04-22 08:34:05 +00:00
Evan Cheng	ec33bd04fb	Fix a performance regression. Use {p}shuf* when there are only two distinct elements in a build_vector. llvm-svn: 27945	2006-04-22 06:21:46 +00:00
Evan Cheng	5cb5fdd8eb	Revamp build_vector lowering to take advantage of movss and movd instructions. movd always clear the top 96 bits and movss does so when it's loading the value from memory. The net result is codegen for 4-wide shuffles is much improved. It is near optimal if one or more elements is a zero. e.g. __m128i test(int a, int b) { return _mm_set_epi32(0, 0, b, a); } compiles to _test: movd 8(%esp), %xmm1 movd 4(%esp), %xmm0 punpckldq %xmm1, %xmm0 ret compare to gcc: _test: subl $12, %esp movd 20(%esp), %xmm0 movd 16(%esp), %xmm1 punpckldq %xmm0, %xmm1 movq %xmm1, %xmm0 movhps LC0, %xmm0 addl $12, %esp ret or icc: _test: movd 4(%esp), %xmm0 #5.10 movd 8(%esp), %xmm3 #5.10 xorl %eax, %eax #5.10 movd %eax, %xmm1 #5.10 punpckldq %xmm1, %xmm0 #5.10 movd %eax, %xmm2 #5.10 punpckldq %xmm2, %xmm3 #5.10 punpckldq %xmm3, %xmm0 #5.10 ret #5.10 There are still room for improvement, for example the FP variant of the above example: __m128 test(float a, float b) { return _mm_set_ps(0.0, 0.0, b, a); } _test: movss 8(%esp), %xmm1 movss 4(%esp), %xmm0 unpcklps %xmm1, %xmm0 xorps %xmm1, %xmm1 movlhps %xmm1, %xmm0 ret The xorps and movlhps are unnecessary. This will require post legalizer optimization to handle. llvm-svn: 27939	2006-04-21 23:03:30 +00:00
Evan Cheng	e0289de5ab	Now generating perfect (I think) code for "vector set" with a single non-zero scalar value. e.g. _mm_set_epi32(0, a, 0, 0); ==> movd 4(%esp), %xmm0 pshufd $69, %xmm0, %xmm0 _mm_set_epi8(0, 0, 0, 0, 0, a, 0, 0, 0, 0, 0, 0, 0, 0, 0, 0); ==> movzbw 4(%esp), %ax movzwl %ax, %eax pxor %xmm0, %xmm0 pinsrw $5, %eax, %xmm0 llvm-svn: 27923	2006-04-21 01:05:10 +00:00
Evan Cheng	41f2933444	- Added support to turn "vector clear elements", e.g. pand V, <-1, -1, 0, -1> to a vector shuffle. - VECTOR_SHUFFLE lowering change in preparation for more efficient codegen of vector shuffle with zero (or any splat) vector. llvm-svn: 27875	2006-04-20 08:58:49 +00:00
Evan Cheng	9dcd046bbd	Handle v2i64 BUILD_VECTOR custom lowering correctly. v2i64 is a legal type, but i64 is not. If possible, change a i64 op to a f64 (e.g. load, constant) and then cast it back. llvm-svn: 27849	2006-04-20 00:11:39 +00:00
Evan Cheng	d79f6a9f5a	isSplatMask() bug: first element can be an undef. llvm-svn: 27847	2006-04-19 23:28:59 +00:00
Evan Cheng	019dea6886	- Added support to do aribitrary 4 wide shuffle with no more than three instructions. - Fixed a commute vector_shuff bug. llvm-svn: 27845	2006-04-19 22:48:17 +00:00
Evan Cheng	265831aa45	Commute vector_shuffle to match more movlhps, movlp{s\|d} cases. llvm-svn: 27840	2006-04-19 20:35:22 +00:00
Evan Cheng	98b1ca65dd	Use movss to insert_vector_elt(v, s, 0). llvm-svn: 27782	2006-04-17 22:45:49 +00:00
Evan Cheng	ecf13c5d79	Use two pinsrw to insert an element into v4i32 / v4f32 vector. llvm-svn: 27779	2006-04-17 22:04:06 +00:00
Evan Cheng	4de1805c84	Implement v8i16, v16i8 splat using unpckl + pshufd. llvm-svn: 27768	2006-04-17 20:43:08 +00:00
Chris Lattner	e1d38ad84b	implement returns of a vector, testcase here: CodeGen/X86/vec_return.ll llvm-svn: 27767	2006-04-17 20:32:50 +00:00
Evan Cheng	eb739d0355	FP SETOLT, SETOLT, SETUGE, SETUGT conditions were implemented incorrectly llvm-svn: 27755	2006-04-17 07:24:10 +00:00
Evan Cheng	32e5d4f6bc	Silly bug llvm-svn: 27719	2006-04-15 05:37:34 +00:00
Evan Cheng	f9a93a1d3f	Do not use movs{h\|l}dup for a shuffle with a single non-undef node. llvm-svn: 27718	2006-04-15 03:13:24 +00:00
Evan Cheng	32c4470374	Last few SSE3 intrinsics. llvm-svn: 27711	2006-04-14 21:59:03 +00:00
Evan Cheng	25fcfb9f2d	X86 SSE2 supports v8i16 multiplication llvm-svn: 27644	2006-04-13 05:10:25 +00:00
Evan Cheng	2c2d734efd	All "integer" logical ops (pand, por, pxor) are now promoted to v2i64. Clean up and fix various logical ops issues. llvm-svn: 27633	2006-04-12 21:21:57 +00:00
Evan Cheng	66fb7beed7	Promote v4i32, v8i16, v16i8 load to v2i64 load. llvm-svn: 27612	2006-04-12 17:12:36 +00:00
Evan Cheng	da283be867	Added support for _mm_move_ss and _mm_move_sd. llvm-svn: 27575	2006-04-11 00:19:04 +00:00
Evan Cheng	2b6c899eb2	Conditional move of vector types. llvm-svn: 27556	2006-04-10 07:23:14 +00:00
Evan Cheng	281a7abddf	Code clean up. llvm-svn: 27501	2006-04-07 21:53:05 +00:00
Evan Cheng	9f27046dc9	- movlp{s\|d} and movhp{s\|d} support. - Normalize shuffle nodes so result vector lower half elements come from the first vector, the rest come from the second vector. (Except for the exceptions :-). - Other minor fixes. llvm-svn: 27474	2006-04-06 23:23:56 +00:00
Evan Cheng	6d470008c8	Support for comi / ucomi intrinsics. llvm-svn: 27444	2006-04-05 23:38:46 +00:00
Evan Cheng	056e0af55a	Handle canonical form of e.g. vector_shuffle v1, v1, <0, 4, 1, 5, 2, 6, 3, 7> This is turned into vector_shuffle v1, <undef>, <0, 0, 1, 1, 2, 2, 3, 3> by dag combiner. It would match a {p}unpckl on x86. llvm-svn: 27437	2006-04-05 07:20:06 +00:00
Evan Cheng	d562dfa0db	Bogus assert llvm-svn: 27434	2006-04-05 06:11:20 +00:00
Evan Cheng	9e56e97205	Fallthrough to expand if a VECTOR_SHUFFLE cannot be custom lowered. llvm-svn: 27433	2006-04-05 06:09:26 +00:00
Evan Cheng	849a726354	Handle v8i16 shuffle that must be broken into a pair of pshufhw / pshuflw. llvm-svn: 27427	2006-04-05 01:47:37 +00:00
Evan Cheng	7ff32cd571	Use movlpd to: store lower f64 extracted from v2f64. Use movhpd to: store upper f64 extracted from v2f64. llvm-svn: 27382	2006-04-03 22:30:54 +00:00
Evan Cheng	169240beb7	- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc. - Some bug fixes and naming inconsistency fixes. llvm-svn: 27377	2006-04-03 20:53:28 +00:00
Evan Cheng	4623ebd3d0	Use a X86 target specific node X86ISD::PINSRW instead of a mal-formed INSERT_VECTOR_ELT to insert a 16-bit value in a 128-bit vector. llvm-svn: 27314	2006-03-31 21:55:24 +00:00
Evan Cheng	7b9a0c6d7a	Add support to use pextrw and pinsrw to extract and insert a word element from a 128-bit vector. llvm-svn: 27304	2006-03-31 19:22:53 +00:00
Evan Cheng	4ca9bbc1bb	Expand all INSERT_VECTOR_ELT (obviously bad) for now. llvm-svn: 27275	2006-03-31 01:30:39 +00:00
Evan Cheng	5d9fc9fdd0	Typo llvm-svn: 27272	2006-03-31 00:33:57 +00:00
Evan Cheng	c55052da81	Ok for vector_shuffle mask to contain undef elements. llvm-svn: 27271	2006-03-31 00:30:29 +00:00
Evan Cheng	d3c692650f	Make sure all possible shuffles are matched. Use pshufd, pshuhw, and pshulw to shuffle v4f32 if shufps doesn't match. Use shufps to shuffle v4f32 if pshufd, pshuhw, and pshulw don't match. llvm-svn: 27259	2006-03-30 19:54:57 +00:00
Evan Cheng	7bc3bc8246	- Added some SSE2 128-bit packed integer ops. - Added SSE2 128-bit integer pack with signed saturation ops. - Added pshufhw and pshuflw ops. llvm-svn: 27252	2006-03-29 23:07:14 +00:00
Evan Cheng	d0d3eade59	Need to special case splat after all. Make the second operand of splat vector_shuffle undef. llvm-svn: 27250	2006-03-29 19:02:40 +00:00
Evan Cheng	02b5de9b3e	- More shuffle related bug fixes. - Whenever possible use ops of the right packed types for vector shuffles / splats. llvm-svn: 27246	2006-03-29 03:04:49 +00:00
Evan Cheng	5194a37602	- Only use pshufd for v4i32 vector shuffles. - Other shuffle related fixes. llvm-svn: 27244	2006-03-29 01:30:51 +00:00
Evan Cheng	178e36174a	Fixing buggy code. llvm-svn: 27239	2006-03-28 23:41:33 +00:00
Jim Laskey	fa6dfa9212	Added missing paren on behalf of Ramana Radhakrishnan. llvm-svn: 27223	2006-03-28 10:17:11 +00:00
Evan Cheng	0305cec743	Missed X86::isUNPCKHMask llvm-svn: 27222	2006-03-28 08:27:15 +00:00
Evan Cheng	fb4b2bfc7d	* Prefer using operation of matching types. e.g unpcklpd rather than movlhps. * Bug fixes. llvm-svn: 27218	2006-03-28 06:50:32 +00:00
Evan Cheng	4d554dae17	- Clean up / consoladate various shuffle masks. - Some misc. bug fixes. - Use MOVHPDrm to load from m64 to upper half of a XMM register. llvm-svn: 27210	2006-03-28 02:43:26 +00:00
Evan Cheng	d8d7ec47bd	Model unpack lower and interleave as vector_shuffle so we can lower the intrinsics as such. llvm-svn: 27200	2006-03-28 00:39:58 +00:00
Evan Cheng	0865274fa5	Use pcmpeq to generate vector of all ones. llvm-svn: 27167	2006-03-27 07:00:16 +00:00
Nate Begeman	3d518334b9	SelectionDAGISel can now natively handle Switch instructions, in the same manner that the LowerSwitch LLVM to LLVM pass does: emitting a binary search tree of basic blocks. The new approach has several advantages: it is faster, it generates significantly smaller code in many cases, and it paves the way for implementing dense switch tables as a jump table by handling switches directly in the instruction selector. This functionality is currently only enabled on x86, but should be safe for every target. In anticipation of making it the default, the cfg is now properly updated in the x86, ppc, and sparc select lowering code. llvm-svn: 27156	2006-03-27 01:32:24 +00:00
Evan Cheng	1dfbede1d1	Remove X86:isZeroVector, use ISD::isBuildVectorAllZeros instead; some fixes / cleanups llvm-svn: 27150	2006-03-26 09:53:12 +00:00
Evan Cheng	e5807f6b47	Build arbitrary vector with more than 2 distinct scalar elements with a series of unpack and interleave ops. llvm-svn: 27119	2006-03-25 09:37:23 +00:00
Evan Cheng	234090b386	Added 128-bit packed integer subtraction. llvm-svn: 27096	2006-03-25 01:33:37 +00:00
Evan Cheng	bdb85b387f	Support for scalar to vector with zero extension. llvm-svn: 27091	2006-03-24 23:15:12 +00:00
Evan Cheng	d58d54cf3e	Handle BUILD_VECTOR with all zero elements. llvm-svn: 27056	2006-03-24 07:29:27 +00:00
Chris Lattner	ace2d0d227	Gabor points out that we can't spell. :) llvm-svn: 27049	2006-03-24 07:12:19 +00:00
Evan Cheng	8507228441	All v2f64 shuffle cases can be handled. llvm-svn: 27044	2006-03-24 06:40:32 +00:00
Evan Cheng	3028b04057	More efficient v2f64 shuffle using movlhps, movhlps, unpckhpd, and unpcklpd. llvm-svn: 27040	2006-03-24 02:58:06 +00:00
Evan Cheng	68410804f0	Handle more shuffle cases with SHUFP* instructions. llvm-svn: 27024	2006-03-24 01:18:28 +00:00
Evan Cheng	daa75ed684	Typo llvm-svn: 26997	2006-03-23 20:26:04 +00:00
Evan Cheng	5f7cf963db	Add 128-bit integer vector load and add (for testing). llvm-svn: 26967	2006-03-23 01:57:24 +00:00
Evan Cheng	54215cd1ea	Added a ValueType operand to isShuffleMaskLegal(). For now, x86 will not do 64-bit vector shuffle. llvm-svn: 26964	2006-03-22 22:07:06 +00:00
Evan Cheng	7cb4e14749	Some clean up. llvm-svn: 26957	2006-03-22 19:22:18 +00:00
Evan Cheng	ae6a39ea92	- Supposely movlhps is faster / better than unpcklpd. - Don't forget pshufd is only available with sse2. llvm-svn: 26956	2006-03-22 19:16:21 +00:00
Evan Cheng	cff38e19c3	- Implement X86ISelLowering::isShuffleMaskLegal(). We currently only support splat and PSHUFD cases. - Clean up shuffle / splat matching code. llvm-svn: 26954	2006-03-22 18:59:22 +00:00
Evan Cheng	f6dc0a7f5e	- VECTOR_SHUFFLE of v4i32 / v4f32 with undef second vector always matches PSHUFD. We can make permutes entries which point to the undef pointing anything we want. - Change some names to appease Chris. llvm-svn: 26951	2006-03-22 08:01:21 +00:00
Chris Lattner	1554bf155e	fix a warning llvm-svn: 26941	2006-03-22 04:18:34 +00:00
Evan Cheng	7aac4350c7	Some splat and shuffle support. llvm-svn: 26940	2006-03-22 02:53:00 +00:00
Evan Cheng	47dd756c72	- Use movaps to store 128-bit vector integers. - Each scalar to vector v8i16 and v16i8 is a any_extend followed by a movd. llvm-svn: 26932	2006-03-21 23:01:21 +00:00
Chris Lattner	31a93c7740	These targets don't support EXTRACT_VECTOR_ELT, though, in time, X86 will. llvm-svn: 26930	2006-03-21 20:51:05 +00:00
Chris Lattner	09ede9ec9f	Add a build_vector node llvm-svn: 26895	2006-03-20 06:18:01 +00:00
Chris Lattner	1bd0aaf2b8	rename these nodes llvm-svn: 26848	2006-03-19 01:13:28 +00:00
Evan Cheng	f4774c9091	Darwin should use _setjmp/_longjmp instead of setjmp/longjmp. llvm-svn: 26833	2006-03-17 20:31:41 +00:00
Chris Lattner	647503bccc	Disable x86 fastcc from passing args in registers llvm-svn: 26824	2006-03-17 17:27:47 +00:00
Chris Lattner	a71bc63ced	Parameterize the number of integer arguments to pass in registers llvm-svn: 26818	2006-03-17 05:10:20 +00:00
Nate Begeman	42736d46b2	Remove BRTWOWAY* Make the PPC backend not dependent on BRTWOWAY_CC and make the branch selector smarter about the code it generates, fixing a case in the readme. llvm-svn: 26814	2006-03-17 01:40:33 +00:00
Evan Cheng	0e1abe6e19	Bug fix: condition inverted. llvm-svn: 26804	2006-03-16 22:02:48 +00:00
Evan Cheng	cad75d9f0c	Added a way for TargetLowering to specify what values can be used as the scale component of the target addressing mode. llvm-svn: 26802	2006-03-16 21:47:42 +00:00
Evan Cheng	ed013bd937	Add LSR hooks. llvm-svn: 26740	2006-03-13 23:18:16 +00:00
Evan Cheng	a3e0a7f652	Use rep/stosl; and Count 0x3; rep/stosb for memset with 4 byte aligned dest. and variable value. Similarly for memcpy. llvm-svn: 26603	2006-03-07 23:29:39 +00:00
Evan Cheng	2327759419	Enable Dwarf debugging info. llvm-svn: 26581	2006-03-07 02:02:57 +00:00
Chris Lattner	6b0947c277	Copysign needs to be expanded everywhere. Note that Alpha and IA64 should implement copysign as a native op if they have it. llvm-svn: 26541	2006-03-05 05:08:37 +00:00
Evan Cheng	2b45c57663	MEMSET / MEMCPY lowering bugs: we can't issue a single WORD / DWORD version of rep/stos and rep/mov if the count is not a constant. We could do rep/stosl; and $count, 3; rep/stosb For now, I will lower them to memset / memcpy calls. We will revisit this after a little bit experiment. Also need to take care of the trailing bytes even if the count is a constant. Since the max. number of trailing bytes are 3, we will simply issue loads / stores. llvm-svn: 26517	2006-03-04 02:48:56 +00:00
Evan Cheng	f2a0107221	Typo llvm-svn: 26512	2006-03-04 01:12:00 +00:00
Chris Lattner	999aa36a04	remove the read/write port/io intrinsics. llvm-svn: 26479	2006-03-03 00:19:58 +00:00
Evan Cheng	ebe1f272b7	Vector op lowering. llvm-svn: 26438	2006-03-01 01:11:20 +00:00
Evan Cheng	bd1279fd2a	Added a common about the need for X86ISD::Wrapper. llvm-svn: 26372	2006-02-25 09:55:19 +00:00
Evan Cheng	cb9fb051a5	- Clean up the lowering and selection code of ConstantPool, GlobalAddress, and ExternalSymbol. - Use C++ code (rather than tblgen'd selection code) to match the above mentioned leaf nodes. Do not mutate and nodes and do not record the selection in CodeGenMap. These nodes should be safe to duplicate. This is a performance win. llvm-svn: 26335	2006-02-23 20:41:18 +00:00
Evan Cheng	2977507828	PIC related bug fixes. 1. Various asm printer bug. 2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper. llvm-svn: 26324	2006-02-23 02:43:52 +00:00
Evan Cheng	305141c1ba	- Added option -relocation-model to set relocation model. Valid values include static, pic, dynamic-no-pic, and default. PPC and x86 default is dynamic-no-pic for Darwin, pic for others. - Removed options -enable-pic and -ppc-static. llvm-svn: 26315	2006-02-22 20:19:42 +00:00
Evan Cheng	005de9e2bb	Added MMX, SSE1, and SSE2 vector instructions and some simple patterns. Fixed some existing bugs (wrong predicates, prefixes) at the same time. llvm-svn: 26310	2006-02-22 02:26:30 +00:00
Chris Lattner	6bb2c3e9cd	split register class handling from explicit physreg handling. llvm-svn: 26308	2006-02-22 00:56:39 +00:00
Chris Lattner	a124432746	Updates to match change of getRegForInlineAsmConstraint prototype llvm-svn: 26305	2006-02-21 23:11:00 +00:00
Evan Cheng	063df91002	If SSE3 is available, promote FP_TO_UINT i32 to FP_TO_SINT i64 to take advantage of fisttpll. llvm-svn: 26288	2006-02-18 07:26:17 +00:00
Evan Cheng	bf3558a375	x86 / Darwin PIC support. llvm-svn: 26273	2006-02-18 00:15:05 +00:00
Chris Lattner	3a899dbbc8	unbreak the build llvm-svn: 26260	2006-02-17 07:09:27 +00:00
Evan Cheng	330fd348ff	Unbreak x86 be llvm-svn: 26259	2006-02-17 07:01:52 +00:00
Nate Begeman	9c0ab71f4a	kill ADD_PARTS & SUB_PARTS and replace them with fancy new ADDC, ADDE, SUBC and SUBE nodes that actually expose what's going on and allow for significant simplifications in the targets. llvm-svn: 26255	2006-02-17 05:43:56 +00:00
Nate Begeman	b0ec087c0f	Kill the x86 pattern isel. boom. llvm-svn: 26246	2006-02-17 00:03:04 +00:00
Nate Begeman	0bc71999b9	Rework the SelectionDAG-based implementations of SimplifyDemandedBits and ComputeMaskedBits to match the new improved versions in instcombine. Tested against all of multisource/benchmarks on ppc. llvm-svn: 26238	2006-02-16 21:11:51 +00:00
Evan Cheng	2f3056286a	A bit more memset / memcpy optimization. Turns them into calls to memset / memcpy if 1) buffer(s) are not DWORD aligned, 2) size is not known to be greater or equal to some minimum value (currently 128). llvm-svn: 26224	2006-02-16 00:21:07 +00:00
Evan Cheng	f6c74c0096	Rename maxStoresPerMemSet to maxStoresPerMemset, etc. llvm-svn: 26174	2006-02-14 08:38:30 +00:00
Evan Cheng	80812d1070	Set maxStoresPerMemSet to 16. Ditto for maxStoresPerMemCpy and maxStoresPerMemMove. Although the last one is not used. llvm-svn: 26172	2006-02-14 08:25:08 +00:00
Chris Lattner	599b432345	Switch targets over to using SelectionDAG::getCALLSEQ_START to create CALLSEQ_START nodes. llvm-svn: 26143	2006-02-13 09:00:43 +00:00
Evan Cheng	15369d0b89	Darwin ABI issues: weak, linkonce, etc. dynamic-no-pic support is complete. Also fixed a function stub bug. Added weak and linkonce support for x86 Linux. llvm-svn: 26038	2006-02-07 08:38:37 +00:00
Evan Cheng	078962656b	Separate FILD and FILD_FLAG, the later is only used for SSE2. It produces a flag so it can be flagged to a FST. llvm-svn: 25953	2006-02-04 02:20:30 +00:00
Evan Cheng	d432c17927	Fix a erroneous comment. llvm-svn: 25894	2006-02-02 00:28:23 +00:00
Nate Begeman	0be60963bd	Fix some of the stuff in the PPC README file, and clean up legalization of the SELECT_CC, BR_CC, and BRTWOWAY_CC nodes. llvm-svn: 25875	2006-02-01 07:19:44 +00:00
Evan Cheng	27738f635e	Return's chain should be matching either the chain produced by the value or the chain going into the load. llvm-svn: 25863	2006-02-01 01:19:32 +00:00
Evan Cheng	329e86ddfa	When folding a load into a return of SSE value, check the chain to ensure the memory location has not been clobbered. llvm-svn: 25861	2006-02-01 00:20:21 +00:00
Evan Cheng	7eb36f4721	Be smarter about whether to store the SSE return value in memory. If it is already available in memory, do a fld directly from there. llvm-svn: 25859	2006-01-31 23:19:54 +00:00
Evan Cheng	45ebd632f2	- Allow XMM load (for scalar use) to be folded into ANDP* and XORP. - Use XORP to implement fneg. llvm-svn: 25857	2006-01-31 22:28:30 +00:00
Chris Lattner	5587b270e4	* Fix 80-column violations * Rename hasSSE -> hasSSE1 to avoid my continual confusion with 'has any SSE'. * Add inline asm constraint specification. llvm-svn: 25854	2006-01-31 19:43:35 +00:00
Evan Cheng	49467b6b5b	Added custom lowering of fabs llvm-svn: 25831	2006-01-31 03:14:29 +00:00
Evan Cheng	38aacb5f09	Don't generate complex sequence for SETOLE, SETOLT, SETULT, and SETUGT. Flip the order of the compare operands and generate SETOGT, SETOGE, SETUGE, and SETULE instead. llvm-svn: 25824	2006-01-30 23:41:35 +00:00
Evan Cheng	8ea651a9a4	i64 -> f32, f32 -> i64 and some clean up. llvm-svn: 25818	2006-01-30 22:13:22 +00:00
Evan Cheng	d2d96373dc	Always use FP stack instructions to perform i64 to f64 as well as f64 to i64 conversions. SSE does not have instructions to handle these tasks. llvm-svn: 25817	2006-01-30 08:02:57 +00:00
Chris Lattner	a44182300b	Move MaskedValueIsZero from the DAGCombiner to the TargetLowering interface,making isMaskedValueZeroForTargetNode simpler, and useable from other partsof the compiler. llvm-svn: 25803	2006-01-30 04:09:27 +00:00
Chris Lattner	754bc1f46c	adjust prototype llvm-svn: 25798	2006-01-30 03:49:07 +00:00
Chris Lattner	b66484069a	The FP stack doesn't support UNDEF, ask the legalizer to legalize it instead of lying and saying we have it. llvm-svn: 25775	2006-01-29 06:44:22 +00:00
Chris Lattner	5f0a3df176	Targets all now request ConstantFP to be legalized into TargetConstantFP. 'fpimm' in .td files is now TargetConstantFP. llvm-svn: 25771	2006-01-29 06:26:08 +00:00
Chris Lattner	744d9a40f0	silence a warning llvm-svn: 25745	2006-01-28 10:34:47 +00:00
Evan Cheng	746086dc97	Bye bye Pattern ISel, hello DAG ISel. llvm-svn: 25700	2006-01-27 21:26:54 +00:00
Nate Begeman	d2c6fbef4a	Remove TLI.LowerReturnTo, and just let targets custom lower ISD::RET for the same functionality. This addresses another piece of bug 680. Next, on to fixing Alpha VAARG, which I broke last time. llvm-svn: 25696	2006-01-27 21:09:22 +00:00
Evan Cheng	5891f49c47	x86 CPU detection and proper subtarget support llvm-svn: 25679	2006-01-27 08:10:46 +00:00
Evan Cheng	bf29b90240	When trying to fold X86::SETCC into a Select, make a copy if it has more than one use. This allows more CMOV instructions. llvm-svn: 25634	2006-01-26 02:13:10 +00:00
Nate Begeman	c29fac7fce	First part of bug 680: Remove TLI.LowerVA* and replace it with SDNodes that are lowered the same way as everything else. llvm-svn: 25606	2006-01-25 18:21:52 +00:00
Evan Cheng	46f85ddd84	X86 prefer scheduling for reduced register pressure. llvm-svn: 25602	2006-01-25 09:15:17 +00:00
Evan Cheng	b463e81b83	Fix a selectcc lowering bug. Make a copy of X86ISD::CMP when folding it. llvm-svn: 25596	2006-01-25 09:05:09 +00:00
Chris Lattner	f2a2d62f48	use ESP directly, not a copy of ESP into some other register for fastcc calls llvm-svn: 25584	2006-01-24 06:14:44 +00:00
Chris Lattner	c078165ea6	Emit the copies out of call return registers after the ISD::CALLSEQ_END node, fixing fastcc and the case where a function has a frame pointer due to dynamic allocas. llvm-svn: 25580	2006-01-24 05:17:12 +00:00
Chris Lattner	eacfe56de8	LowerReturn now doesn't have to handle f32 returns. llvm-svn: 25484	2006-01-20 18:41:25 +00:00
Evan Cheng	680a8e070b	Avoid generating a redundant setcc. llvm-svn: 25457	2006-01-19 08:52:46 +00:00
Evan Cheng	aebece2f7b	A obvious typo llvm-svn: 25435	2006-01-19 01:46:14 +00:00
Evan Cheng	92c122b26f	SRA shift amount must be in i8 llvm-svn: 25416	2006-01-18 09:26:46 +00:00
Evan Cheng	4d841aaa98	If a call return type is i1, insert a truncate from X86::AL to i1. llvm-svn: 25415	2006-01-18 08:08:38 +00:00
Evan Cheng	208f5076a0	Fix lowering of calls which return f32 values. llvm-svn: 25413	2006-01-17 21:58:21 +00:00
Evan Cheng	759a96e1a3	SSE does not support i64 SINT_TO_FP (FP stack doesn't either, but we custom expand it), so ask legalizer to expand i32 UINT_TO_FP. llvm-svn: 25386	2006-01-17 02:32:49 +00:00
Evan Cheng	53520a8cee	Added a FIXME comment about why FST is currently flagged to fpGETRESULT. llvm-svn: 25381	2006-01-17 00:37:42 +00:00
Evan Cheng	3ce28c990a	Bug fixes: fpGETRESULT should produces a flag result and X86ISD::FST should read a flag. llvm-svn: 25378	2006-01-17 00:19:47 +00:00
Evan Cheng	de33ca2831	Fix FP_TO_INT**_IN_MEM lowering. llvm-svn: 25368	2006-01-16 21:21:29 +00:00
Chris Lattner	20f25dc8c2	Use the default lowering of ISD::DYNAMIC_STACKALLOC, delete now dead code. llvm-svn: 25333	2006-01-15 09:00:21 +00:00
Nate Begeman	85b2dc0c4e	bswap implementation llvm-svn: 25312	2006-01-14 03:14:10 +00:00
Evan Cheng	a26ed7b9b2	LHS = X86ISD::CMOVcc LHS, RHS means LHS = RHS if cc. So the operands must be flipped around. llvm-svn: 25290	2006-01-13 19:51:46 +00:00
Chris Lattner	344642961e	Enable X86 support for savestack/restorestack llvm-svn: 25278	2006-01-13 18:00:54 +00:00
Chris Lattner	80fed2d66e	expand unsupported stacksave/stackrestore nodes llvm-svn: 25272	2006-01-13 02:42:53 +00:00
Evan Cheng	bed984a1c8	More typo's. I need new eye glasses... llvm-svn: 25261	2006-01-13 01:17:24 +00:00
Evan Cheng	ae2915ac91	Oops. Typo. llvm-svn: 25260	2006-01-13 01:06:49 +00:00
Evan Cheng	3f2ae15472	Fix a SETCC / BRCOND folding bug. llvm-svn: 25259	2006-01-13 01:03:02 +00:00
Evan Cheng	794a7cf6fe	Fix sint_to_fp (fild*) support. llvm-svn: 25257	2006-01-12 22:54:21 +00:00
Evan Cheng	978f5581c4	X86ISD::SETCC (e.g. SETEr) produces a flag (so multiple SETCC can be linked together). llvm-svn: 25247	2006-01-12 08:27:59 +00:00
Evan Cheng	dd45d29b56	* Materialize GlobalAddress and ExternalSym with MOV32ri rather than LEA32r. * Do not lower GlobalAddress to TargetGlobalAddress. Let isel does it. llvm-svn: 25246	2006-01-12 07:56:47 +00:00
Evan Cheng	5841005bdf	Added ROTL and ROTR. llvm-svn: 25232	2006-01-11 23:20:05 +00:00
Evan Cheng	66540aa32c	Support for MEMCPY and MEMSET. llvm-svn: 25226	2006-01-11 22:15:48 +00:00
Nate Begeman	cff96008ac	Add bswap, rotl, and rotr nodes Add dag combiner code to recognize rotl, rotr Add ppc code to match rotl Targets should add rotl/rotr patterns if they have them llvm-svn: 25222	2006-01-11 21:21:00 +00:00
Evan Cheng	e42281bcba	* Add special entry code main() (to set x87 to 64-bit precision). * Allow a register node as SelectAddr() base. * ExternalSymbol -> TargetExternalSymbol as direct function callee. * Use X86::ESP register rather than CopyFromReg(X86::ESP) as stack ptr for call parmater passing. llvm-svn: 25207	2006-01-11 06:09:51 +00:00
Evan Cheng	9adc8e5a3d	SSE cmov support. llvm-svn: 25190	2006-01-11 00:33:36 +00:00
Evan Cheng	8504673bb2	FP_TO_INT*_IN_MEM and x87 FP Select support. llvm-svn: 25188	2006-01-10 20:26:56 +00:00
Evan Cheng	c0b3a2166b	More typos llvm-svn: 25162	2006-01-09 22:29:54 +00:00
Evan Cheng	5baec4d0e2	typo llvm-svn: 25160	2006-01-09 20:49:21 +00:00
Evan Cheng	d3babfe458	Support for ADD_PARTS, SUB_PARTS, SHL_PARTS, SHR_PARTS, and SRA_PARTS. llvm-svn: 25158	2006-01-09 18:33:28 +00:00
Evan Cheng	1e0d7b98f3	* Fast call support. * FP cmp, setcc, etc. llvm-svn: 25117	2006-01-06 00:43:03 +00:00
Jim Laskey	41b3ee3c4f	Had expand logic backward. llvm-svn: 25105	2006-01-05 01:47:43 +00:00
Jim Laskey	5eddaee9f3	Added initial support for DEBUG_LABEL allowing debug specific labels to be inserted in the code. llvm-svn: 25104	2006-01-05 01:25:28 +00:00
Evan Cheng	2329411038	DAG based isel call support. llvm-svn: 25103	2006-01-05 00:27:02 +00:00
Chris Lattner	cee6093ca8	Fix a problem duraid pointed out to me compiling kc++ with -enable-x86-fastcc llvm-svn: 25024	2005-12-27 03:02:18 +00:00
Evan Cheng	995503fc91	More X86 floating point patterns. llvm-svn: 24990	2005-12-23 07:31:11 +00:00
Chris Lattner	8e80a247ff	make sure bit_convert's are expanded llvm-svn: 24979	2005-12-23 05:15:23 +00:00
Evan Cheng	fb6413e05a	* Fix a GlobalAddress lowering bug. * Teach DAG combiner about X86ISD::SETCC by adding a TargetLowering hook. llvm-svn: 24921	2005-12-21 23:05:39 +00:00
Jim Laskey	d82881490c	Disengage DEBUG_LOC from non-PPC targets. llvm-svn: 24919	2005-12-21 20:51:37 +00:00
Evan Cheng	6f15189a77	* Added support for X86 RET with an additional operand to specify number of bytes to pop off stack. * Added support for X86 SETCC. llvm-svn: 24917	2005-12-21 20:21:51 +00:00
Evan Cheng	0226113ed5	* Added lowering hook for external weak global address. It inserts a load for Darwin. * Added lowering hook for ISD::RET. It inserts CopyToRegs for the return value (or store / fld / copy to ST(0) for floating point value). This eliminate the need to write C++ code to handle RET with variable number of operands. llvm-svn: 24888	2005-12-21 02:39:21 +00:00
Evan Cheng	44e4e6a57f	Added a hook to print out names of target specific DAG nodes. llvm-svn: 24877	2005-12-20 06:22:03 +00:00
Evan Cheng	bb34a50cb0	X86 conditional branch support. llvm-svn: 24870	2005-12-19 23:12:38 +00:00
Evan Cheng	a3ff796fda	Remove a few lines of dead code. llvm-svn: 24768	2005-12-17 07:18:44 +00:00
Evan Cheng	d51da93a03	X86 lowers SELECT to a cmp / test followed by a conditional move. llvm-svn: 24754	2005-12-17 01:21:05 +00:00
Evan Cheng	43152cb8b6	* Promote all 1 bit entities to 8 bit. * Handling extload (1 bit -> 8 bit) and remove C++ code that handle 1 bit zextload. llvm-svn: 24726	2005-12-15 19:49:23 +00:00
Chris Lattner	5df0bce13a	X86 doesn't support sextinreg for 8-bit things either. llvm-svn: 24631	2005-12-07 17:59:14 +00:00
Chris Lattner	47feb1ecbb	No targets support line number info yet. llvm-svn: 24513	2005-11-29 06:16:21 +00:00
Chris Lattner	d122fc01dd	Lower READCYCLECOUNTER correctly, preserving the chain result llvm-svn: 24438	2005-11-20 22:57:19 +00:00
Chris Lattner	f4f66fafd9	use chain operands to ensure the copies don't wander from the rdtsc instruction. llvm-svn: 24434	2005-11-20 22:01:40 +00:00
Andrew Lenharth	a369904fc5	The second patch of X86 support for read cycle counter. llvm-svn: 24430	2005-11-20 21:41:10 +00:00
Chris Lattner	792ac11aee	Separate X86ISelLowering stuff out from the X86ISelPattern.cpp file. Patch contributed by Evan Cheng. llvm-svn: 24358	2005-11-15 00:40:23 +00:00

... 2 3 4 5 6 ...

332 Commits