Chris Lattner
70b56ea647
regenerate
llvm-svn: 27512
2006-04-08 01:18:56 +00:00
Chris Lattner
97a47d4c9c
Parse shufflevector
llvm-svn: 27511
2006-04-08 01:18:35 +00:00
Chris Lattner
14b19acd56
Add shufflevector support, todo, implement better constant folding.
llvm-svn: 27510
2006-04-08 01:18:18 +00:00
Chris Lattner
4bd21d8de2
Add shufflevector reading support
llvm-svn: 27509
2006-04-08 01:17:59 +00:00
Evan Cheng
0916c33201
ldmxcsr and stmxcsr.
llvm-svn: 27506
2006-04-08 00:47:44 +00:00
Jim Laskey
bec0d42d8d
Remove section change in function end, preventing override of function's real
section.
llvm-svn: 27503
2006-04-08 00:35:59 +00:00
Evan Cheng
281a7abddf
Code clean up.
llvm-svn: 27501
2006-04-07 21:53:05 +00:00
Evan Cheng
12da231c27
Added patterns for MOVHPSmr and MOVLPSmr.
llvm-svn: 27497
2006-04-07 21:20:58 +00:00
Evan Cheng
0dd7987d36
Keep track of a Mac OS X / x86 ABI bug.
llvm-svn: 27496
2006-04-07 21:19:53 +00:00
Jim Laskey
fabb0ba736
Make sure that debug labels are defined within the same section and after the
entry point of a function.
llvm-svn: 27494
2006-04-07 20:44:42 +00:00
Jim Laskey
b93bc75add
Foundation for call frame information.
llvm-svn: 27491
2006-04-07 16:34:46 +00:00
Chris Lattner
f358863a69
We have an assertion that checks that we do not encode null values into the
.bc file if they are supposed to be implicit. This is cool, except that it
checked *after* constant expr folding: improving constant expr folding could
cause the .bc reader to assert out on old .bc files. Move the check so that
it checks all simple constants, but no constantexprs.
llvm-svn: 27480
2006-04-07 05:00:02 +00:00
Chris Lattner
8c5ac50fb4
Constant fold extractelement(zero, x) -> zero
llvm-svn: 27479
2006-04-07 04:44:06 +00:00
Chris Lattner
32b65613d9
Fix inlining of insert/extract element constantexprs
llvm-svn: 27478
2006-04-07 04:41:03 +00:00
Evan Cheng
37b4263eb8
Add code to RemapOperand() to handle Instruction::ExtractElement and
Instruction::InsertElement.
llvm-svn: 27477
2006-04-07 01:27:42 +00:00
Evan Cheng
aaa0d70b65
A MOVPS2SSmr, i.e. _mm_store_ss, encoding bug.
Also MOVPDI2DIrr.
llvm-svn: 27476
2006-04-06 23:53:29 +00:00
Evan Cheng
9f27046dc9
- movlp{s|d} and movhp{s|d} support.
- Normalize shuffle nodes so result vector lower half elements come from the
first vector, the rest come from the second vector. (Except for the
exceptions :-).
- Other minor fixes.
llvm-svn: 27474
2006-04-06 23:23:56 +00:00
Evan Cheng
e248d318a8
New entries.
llvm-svn: 27473
2006-04-06 23:21:24 +00:00
Evan Cheng
e5eefd369a
1. If both vector operands of a vector_shuffle are undef, turn it into an undef.
2. A shuffle mask element can also be an undef.
llvm-svn: 27472
2006-04-06 23:20:43 +00:00
Andrew Lenharth
892b890d6a
This may be overconservative, but it lets the new cfe compile
llvm-svn: 27471
2006-04-06 23:18:45 +00:00
Chris Lattner
db7dfe8c61
Add an item
llvm-svn: 27470
2006-04-06 23:16:19 +00:00
Chris Lattner
a390188fd4
Make sure to return the result in the right type.
llvm-svn: 27469
2006-04-06 23:12:19 +00:00
Andrew Lenharth
038b30839d
Move this to lib/Analysis.
This reduces Core size, yay.
llvm-svn: 27468
2006-04-06 22:58:58 +00:00
Chris Lattner
c0680ae07e
Match vpku[hw]um(x,x).
Convert vsldoi(x,x) to work the same way other (x,x) cases work.
llvm-svn: 27467
2006-04-06 22:28:36 +00:00
Chris Lattner
a52d88ee89
Add support for matching vmrg(x,x) patterns
llvm-svn: 27463
2006-04-06 22:02:42 +00:00
Andrew Lenharth
95d16ade31
fix some linking problems with the new gcc
llvm-svn: 27460
2006-04-06 21:26:32 +00:00
Chris Lattner
300076cbd8
Pattern match vmrg* instructions, which are now lowered by the CFE into shuffles.
llvm-svn: 27457
2006-04-06 21:11:54 +00:00
Chris Lattner
6cf87c1b01
remove two done items
llvm-svn: 27453
2006-04-06 19:19:38 +00:00
Chris Lattner
bc0489232b
Lower vperm(x,y, mask) -> shuffle(x,y,mask) if mask is constant. This allows
us to compile oh-so-realistic stuff like this:
    vec_vperm(A, B, (vector unsigned char){14});
to:
    vspltb v0, v0, 14
instead of:
    vspltisb v0, 14
    vperm v0, v2, v1, v0
llvm-svn: 27452
2006-04-06 19:19:17 +00:00
Chris Lattner
2875bb116e
Support pattern matching vsldoi(x,y) and vsldoi(x,x), which allows the front end to
lower it and LLVM to have one fewer intrinsic. This implements
CodeGen/PowerPC/vec_shuffle.ll
llvm-svn: 27450
2006-04-06 18:26:28 +00:00
Chris Lattner
10fa7be550
Compile the vpkuhum/vpkuwum intrinsics into vpkuhum/vpkuwum instead of into
vperm with a perm mask lvx'd from the constant pool.
llvm-svn: 27448
2006-04-06 17:23:16 +00:00
Evan Cheng
d2d7aff6ba
POR encoded as PAND, yikes.
llvm-svn: 27446
2006-04-06 01:49:20 +00:00
Evan Cheng
dcf423ad74
An entry about comi / ucomi intrinsics.
llvm-svn: 27445
2006-04-05 23:46:04 +00:00
Evan Cheng
6d470008c8
Support for comi / ucomi intrinsics.
llvm-svn: 27444
2006-04-05 23:38:46 +00:00
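For context, the comi/ucomi instructions compare the low scalar elements and set EFLAGS, which the intrinsics expose as an int result. A minimal C++ usage sketch with the standard SSE header (an illustration, not code from this commit):

    #include <cassert>
    #include <xmmintrin.h>

    int main() {
        // _mm_comieq_ss lowers to comiss plus a flag read; it returns 1
        // when the low floats compare equal (and neither one is a NaN).
        __m128 a = _mm_set_ss(1.0f);
        __m128 b = _mm_set_ss(1.0f);
        assert(_mm_comieq_ss(a, b) == 1);
        return 0;
    }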
Chris Lattner
7f13e50435
Add all of the data stream intrinsics and instructions. woo
llvm-svn: 27442
2006-04-05 22:27:14 +00:00
Chris Lattner
338945e669
Fix a typo
llvm-svn: 27440
2006-04-05 20:15:25 +00:00
Chris Lattner
d1b47b18ed
Fix CodeGen/PowerPC/2006-04-05-splat-ish.ll
llvm-svn: 27439
2006-04-05 17:39:25 +00:00
Evan Cheng
056e0af55a
Handle the canonical form of e.g.
    vector_shuffle v1, v1, <0, 4, 1, 5, 2, 6, 3, 7>
This is turned into
    vector_shuffle v1, <undef>, <0, 0, 1, 1, 2, 2, 3, 3>
by the dag combiner. It would then match a {p}unpckl on x86.
llvm-svn: 27437
2006-04-05 07:20:06 +00:00
Chris Lattner
fe2926cf46
Make a vector that is live across blocks have the correct Vec type. This fixes
CodeGen/X86/2006-04-04-CrossBlockCrash.ll
llvm-svn: 27436
2006-04-05 06:54:42 +00:00
Evan Cheng
d562dfa0db
Bogus assert
llvm-svn: 27434
2006-04-05 06:11:20 +00:00
Evan Cheng
9e56e97205
Fall through to expand if a VECTOR_SHUFFLE cannot be custom lowered.
llvm-svn: 27433
2006-04-05 06:09:26 +00:00
Evan Cheng
abd8dc54c2
Expand a VECTOR_SHUFFLE to a BUILD_VECTOR if the target asks for it to be expanded
or custom lowering fails.
llvm-svn: 27432
2006-04-05 06:07:11 +00:00
Andrew Lenharth
58b9d4af7a
Revert this; it is safe, if conservative. Leave a note to that effect.
llvm-svn: 27428
2006-04-05 02:42:36 +00:00
Evan Cheng
849a726354
Handle v8i16 shuffle that must be broken into a pair of pshufhw / pshuflw.
llvm-svn: 27427
2006-04-05 01:47:37 +00:00
Chris Lattner
ee971bedf2
add vsl
llvm-svn: 27425
2006-04-05 01:16:22 +00:00
Chris Lattner
993209029f
add vmladduhm
llvm-svn: 27423
2006-04-05 00:49:48 +00:00
Chris Lattner
66c3b75644
Add m[tf]vscr instructions.
llvm-svn: 27421
2006-04-05 00:03:57 +00:00
Chris Lattner
10394b1c42
add a note
llvm-svn: 27419
2006-04-04 23:45:11 +00:00
Chris Lattner
e7a52b473f
Add missing byte merges.
llvm-svn: 27418
2006-04-04 23:43:56 +00:00
Chris Lattner
ab137b431f
Add FP -> Int Conversions
llvm-svn: 27417
2006-04-04 23:25:02 +00:00
Chris Lattner
6cf881590f
add average intrinsics
llvm-svn: 27416
2006-04-04 23:14:00 +00:00
Chris Lattner
59c4add58a
add a note
llvm-svn: 27414
2006-04-04 22:43:55 +00:00
Chris Lattner
d1483ca1ad
Fix some broken logic that would cause us to codegen {2147483647,2147483647,2147483647,2147483647} as 'vspltisb v0, -1'.
llvm-svn: 27413
2006-04-04 22:28:35 +00:00
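The underlying issue: vspltisb splats a 5-bit sign-extended immediate into every byte, so "vspltisb v0, -1" produces -1 (0xFFFFFFFF) in each 32-bit lane, which is not 2147483647 (0x7FFFFFFF). A small C++ check of that byte-splat arithmetic (an illustration only):

    #include <cassert>
    #include <cstdint>
    #include <cstring>

    int main() {
        // Splatting the byte 0xFF across a 16-byte register, as
        // "vspltisb v0, -1" does, makes every 32-bit lane 0xFFFFFFFF.
        uint8_t bytes[16];
        memset(bytes, 0xFF, sizeof(bytes));
        uint32_t lane;
        memcpy(&lane, bytes, sizeof(lane));
        assert(lane == 0xFFFFFFFFu);  // -1 in each lane...
        assert(lane != 2147483647u);  // ...which is not INT32_MAX
        return 0;
    }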
Evan Cheng
f745d450c5
Added pslldq and psrldq.
llvm-svn: 27412
2006-04-04 21:49:39 +00:00
Evan Cheng
22dd2900e6
Minor fixes + naming changes.
llvm-svn: 27410
2006-04-04 19:12:30 +00:00
Evan Cheng
3f7a10bee8
PSHUF* encoding bugs.
llvm-svn: 27405
2006-04-04 18:40:36 +00:00
Chris Lattner
cad2bfa3d7
Do not create ZEXTLOADs unless we are before legalize or the operation is
legal.
llvm-svn: 27402
2006-04-04 17:39:18 +00:00
Chris Lattner
4e99e6dfdd
Ask legalize to promote all vector shuffles to be v16i8 instead of having to
handle all 4 PPC vector types. This simplifies the matching code and allows
us to eliminate a bunch of patterns. This also adds cases we were missing,
such as CodeGen/PowerPC/vec_splat.ll:splat_h.
llvm-svn: 27400
2006-04-04 17:25:31 +00:00
Chris Lattner
136a27d0d0
* Add support for SCALAR_TO_VECTOR operations where the input needs to be
promoted/expanded (e.g. SCALAR_TO_VECTOR from i8/i16 on PPC).
* Add support for targets to request that VECTOR_SHUFFLE nodes be promoted
to a canonical type, for example, we only want v16i8 shuffles on PPC.
* Move isShuffleLegal out of TLI into Legalize.
* Teach isShuffleLegal to allow shuffles that need to be promoted.
llvm-svn: 27399
2006-04-04 17:23:26 +00:00
Chris Lattner
020ff34600
Signed shr by a constant is not the same as sdiv by 2^k
llvm-svn: 27395
2006-04-04 06:11:42 +00:00
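A small C++ demonstration of why that fold would be wrong for negative values: arithmetic shift right rounds toward negative infinity, while signed division truncates toward zero (assumes the usual arithmetic-shift behavior of signed >>; illustration only):

    #include <cassert>
    #include <cstdio>

    int main() {
        int x = -5;
        int shifted = x >> 1;  // -3: floor(-5 / 2) on arithmetic-shift targets
        int divided = x / 2;   // -2: trunc(-5 / 2), as C/C++ division requires
        assert(shifted != divided);
        printf("-5 >> 1 = %d, -5 / 2 = %d\n", shifted, divided);
        return 0;
    }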
Evan Cheng
f07104b717
cmpps / cmppd encoding bug
llvm-svn: 27393
2006-04-04 03:04:07 +00:00
Chris Lattner
85dc06c29e
Constant fold bitconvert(undef)
llvm-svn: 27391
2006-04-04 01:02:22 +00:00
Evan Cheng
2be8582ddb
Compact some intrinsic definitions.
llvm-svn: 27388
2006-04-04 00:10:53 +00:00
Chris Lattner
2bf9c8cc18
Plug in the byte and short splats
llvm-svn: 27387
2006-04-04 00:05:13 +00:00
Chris Lattner
0128e4d335
Revert accidentally committed hunks.
llvm-svn: 27386
2006-04-03 23:58:04 +00:00
Chris Lattner
57b9e01b3e
Make sure to mark unsupported SCALAR_TO_VECTOR operations as expand.
llvm-svn: 27385
2006-04-03 23:55:43 +00:00
Evan Cheng
bbe72d932c
Some SSE1 intrinsics: min, max, sqrt, etc.
llvm-svn: 27384
2006-04-03 23:49:17 +00:00
Chris Lattner
ab96554280
revert previous patch
llvm-svn: 27383
2006-04-03 23:14:49 +00:00
Evan Cheng
7ff32cd571
Use movlpd to store the lower f64 extracted from a v2f64.
Use movhpd to store the upper f64 extracted from a v2f64.
llvm-svn: 27382
2006-04-03 22:30:54 +00:00
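In intrinsic form these are the stores that _mm_storel_pd / _mm_storeh_pd request; a minimal C++ sketch (an illustration, not code from the commit):

    #include <cassert>
    #include <emmintrin.h>

    int main() {
        __m128d v = _mm_set_pd(2.0, 1.0);  // upper = 2.0, lower = 1.0
        double lo, hi;
        _mm_storel_pd(&lo, v);  // candidate for movlpd: store the low f64
        _mm_storeh_pd(&hi, v);  // candidate for movhpd: store the high f64
        assert(lo == 1.0 && hi == 2.0);
        return 0;
    }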
Chris Lattner
eb9684f6a4
Force use of a frame pointer if there is anything on the stack that is aligned
more strictly than the OS keeps the stack aligned.
llvm-svn: 27381
2006-04-03 22:03:29 +00:00
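The shape of code that triggers this: a local whose alignment exceeds the OS-guaranteed stack alignment forces the prologue to realign the stack, after which a frame pointer is needed to address the rest of the frame. A hedged C++ illustration (modern alignas syntax; the function is hypothetical):

    #include <cstdint>

    int sumEnds() {
        // 32-byte alignment exceeds a typical 16-byte OS stack alignment,
        // so the compiler must realign the stack for this frame.
        alignas(32) int32_t buf[8] = {1, 2, 3, 4, 5, 6, 7, 8};
        return buf[0] + buf[7];
    }

    int main() { return sumEnds() == 9 ? 0 : 1; }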
Chris Lattner
4601b86ad0
The stack alignment is now computed dynamically; just verify it is correct.
llvm-svn: 27380
2006-04-03 21:39:57 +00:00
Chris Lattner
264e4a438a
Remove unused method
llvm-svn: 27379
2006-04-03 21:39:03 +00:00
Evan Cheng
169240beb7
- More efficient extract_vector_elt with shuffle and movss, movsd, movd, etc.
- Some bug fixes and naming inconsistency fixes.
llvm-svn: 27377
2006-04-03 20:53:28 +00:00
Chris Lattner
758878175b
Align vectors to the size in bytes, not bits.
llvm-svn: 27376
2006-04-03 19:28:50 +00:00
Chris Lattner
d13dd8ef5c
Add a missing check, this fixes UnitTests/Vector/sumarray.c
llvm-svn: 27375
2006-04-03 17:29:28 +00:00
Chris Lattner
d9902c3de0
Add a missing check, which broke a bunch of vector tests.
llvm-svn: 27374
2006-04-03 17:21:50 +00:00
Chris Lattner
c65511b05c
Add the full set of min/max instructions
llvm-svn: 27372
2006-04-03 15:58:28 +00:00
Andrew Lenharth
4760eaae91
Support x * (c1 + c2) where c1 and c2 are pow2s. Special case for c2 == 4.
llvm-svn: 27370
2006-04-03 04:19:17 +00:00
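In scalar terms: a constant that is the sum of two powers of two becomes two shifts and an add, e.g. x * 12 == (x << 3) + (x << 2); the c2 == 4 special case plausibly maps onto a scaled-add instruction such as Alpha's s4addq, though that is an inference. A small C++ check of the identity (helper name is hypothetical):

    #include <cassert>
    #include <cstdint>

    // x * 12 == x * (8 + 4) == (x << 3) + (x << 2): two shifts and an add.
    uint64_t mulBy12(uint64_t x) {
        return (x << 3) + (x << 2);
    }

    int main() {
        for (uint64_t x = 0; x < 1000; ++x)
            assert(mulBy12(x) == x * 12);
        return 0;
    }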
Andrew Lenharth
af4a638eab
mul by const conversion sequences. more coming soon
llvm-svn: 27368
2006-04-03 03:18:59 +00:00
Andrew Lenharth
b133e47444
back this out
llvm-svn: 27367
2006-04-03 03:16:50 +00:00
Andrew Lenharth
91c6f28ad6
This should be a win on every arch
llvm-svn: 27364
2006-04-02 21:42:45 +00:00
Andrew Lenharth
4c950de4c1
This makes McCat/12-IOtest go 8x faster or so
llvm-svn: 27363
2006-04-02 21:08:39 +00:00
Andrew Lenharth
067eaad4d9
This will be needed soon
llvm-svn: 27362
2006-04-02 20:13:57 +00:00
Chris Lattner
fa82c33ae7
add a note
llvm-svn: 27360
2006-04-02 07:20:00 +00:00
Chris Lattner
8ba4723c74
Inform the dag combiner that the predicate compares only return a low bit.
llvm-svn: 27359
2006-04-02 06:26:07 +00:00
Chris Lattner
0b4f7786d2
relax assertion
llvm-svn: 27358
2006-04-02 06:19:46 +00:00
Chris Lattner
6433904644
Allow targets to compute masked bits for intrinsics.
llvm-svn: 27357
2006-04-02 06:15:09 +00:00
Chris Lattner
dbdc830c83
Add a little dag combine to compile this:
int %AreSecondAndThirdElementsBothNegative(<4 x float>* %in) {
entry:
    %tmp1 = load <4 x float>* %in        ; <<4 x float>> [#uses=1]
    %tmp = tail call int %llvm.ppc.altivec.vcmpgefp.p( int 1, <4 x float> < float 0x7FF8000000000000, float 0.000000e+00, float 0.000000e+00, float 0x7FF8000000000000 >, <4 x float> %tmp1 )        ; <int> [#uses=1]
    %tmp = seteq int %tmp, 0        ; <bool> [#uses=1]
    %tmp3 = cast bool %tmp to int        ; <int> [#uses=1]
    ret int %tmp3
}
into this:
_AreSecondAndThirdElementsBothNegative:
    mfspr r2, 256
    oris r4, r2, 49152
    mtspr 256, r4
    li r4, lo16(LCPI1_0)
    lis r5, ha16(LCPI1_0)
    lvx v0, 0, r3
    lvx v1, r5, r4
    vcmpgefp. v0, v1, v0
    mfcr r3, 2
    rlwinm r3, r3, 27, 31, 31
    mtspr 256, r2
    blr
instead of this:
_AreSecondAndThirdElementsBothNegative:
    mfspr r2, 256
    oris r4, r2, 49152
    mtspr 256, r4
    li r4, lo16(LCPI1_0)
    lis r5, ha16(LCPI1_0)
    lvx v0, 0, r3
    lvx v1, r5, r4
    vcmpgefp. v0, v1, v0
    mfcr r3, 2
    rlwinm r3, r3, 27, 31, 31
    xori r3, r3, 1
    cntlzw r3, r3
    srwi r3, r3, 5
    mtspr 256, r2
    blr
llvm-svn: 27356
2006-04-02 06:11:11 +00:00
Chris Lattner
42a1e621f1
Vector casts of casts are eliminable. Transform this:
    %tmp = cast <4 x uint> %tmp to <4 x int>        ; <<4 x int>> [#uses=1]
    %tmp = cast <4 x int> %tmp to <4 x float>       ; <<4 x float>> [#uses=1]
into:
    %tmp = cast <4 x uint> %tmp to <4 x float>      ; <<4 x float>> [#uses=1]
llvm-svn: 27355
2006-04-02 05:43:13 +00:00
Chris Lattner
d9ca6b5bfc
vector casts never reinterpret bits
llvm-svn: 27354
2006-04-02 05:40:28 +00:00
Chris Lattner
3c994295fe
Allow transforming this:
    %tmp = cast <4 x uint>* %testData to <4 x int>*    ; <<4 x int>*> [#uses=1]
    %tmp = load <4 x int>* %tmp                        ; <<4 x int>> [#uses=1]
to this:
    %tmp = load <4 x uint>* %testData                  ; <<4 x uint>> [#uses=1]
    %tmp = cast <4 x uint> %tmp to <4 x int>           ; <<4 x int>> [#uses=1]
llvm-svn: 27353
2006-04-02 05:37:12 +00:00
Chris Lattner
cb26b2dfe8
Turn altivec lvx/stvx intrinsics into loads and stores. This allows the
elimination of one load from this:
int AreSecondAndThirdElementsBothNegative( vector float *in ) {
#define QNaN 0x7FC00000
    const vector unsigned int testData = (vector unsigned int)( QNaN, 0, 0, QNaN );
    vector float test = vec_ld( 0, (float*) &testData );
    return ! vec_any_ge( test, *in );
}
Now generating:
_AreSecondAndThirdElementsBothNegative:
    mfspr r2, 256
    oris r4, r2, 49152
    mtspr 256, r4
    li r4, lo16(LCPI1_0)
    lis r5, ha16(LCPI1_0)
    addi r6, r1, -16
    lvx v0, r5, r4
    stvx v0, 0, r6
    lvx v1, 0, r3
    vcmpgefp. v0, v0, v1
    mfcr r3, 2
    rlwinm r3, r3, 27, 31, 31
    xori r3, r3, 1
    cntlzw r3, r3
    srwi r3, r3, 5
    mtspr 256, r2
    blr
llvm-svn: 27352
2006-04-02 05:30:25 +00:00
Chris Lattner
8967316b8c
Remove done item
llvm-svn: 27351
2006-04-02 05:28:54 +00:00
Chris Lattner
c76f9e8691
Implement promotion for EXTRACT_VECTOR_ELT, allowing v16i8 multiplies to work with PowerPC.
llvm-svn: 27349
2006-04-02 05:06:04 +00:00
Chris Lattner
9c24ec6de5
add a note
llvm-svn: 27348
2006-04-02 03:59:11 +00:00
Chris Lattner
f15063eadf
Implement the Expand action for binary vector operations to break the binop
into elements and operate on each piece. This allows generic vector integer
multiplies to work on PPC, though the generated code is horrible.
llvm-svn: 27347
2006-04-02 03:57:31 +00:00
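What the Expand action amounts to, sketched in plain C++ rather than SelectionDAG code: the vector binop is decomposed into one scalar operation per element and the results are reassembled into a vector (an illustration; names are hypothetical):

    #include <array>
    #include <cassert>
    #include <cstddef>

    // Scalarize a vector multiply: one scalar binop per vector element.
    template <typename T, std::size_t N>
    std::array<T, N> expandedMul(const std::array<T, N> &a,
                                 const std::array<T, N> &b) {
        std::array<T, N> result{};
        for (std::size_t i = 0; i < N; ++i)
            result[i] = a[i] * b[i];
        return result;
    }

    int main() {
        std::array<int, 4> a{1, 2, 3, 4}, b{5, 6, 7, 8};
        std::array<int, 4> r = expandedMul(a, b);
        assert(r[0] == 5 && r[3] == 32);
        return 0;
    }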
Chris Lattner
389e309bfb
Intrinsics that just load from memory can be treated like loads: they don't
have to serialize against each other. This allows us to schedule lvx's
across each other, for example.
llvm-svn: 27346
2006-04-02 03:41:14 +00:00
Chris Lattner
e314cf19ba
Adjust to change in Intrinsics.gen interface.
llvm-svn: 27344
2006-04-02 03:35:01 +00:00
Chris Lattner
104db817c8
Constant fold all of the vector binops. This allows us to compile this:
"vector unsigned char mergeLowHigh = (vector unsigned char)
    ( 8, 9, 10, 11, 16, 17, 18, 19, 12, 13, 14, 15, 20, 21, 22, 23 );
vector unsigned char mergeHighLow = vec_xor( mergeLowHigh, vec_splat_u8(8));"
aka:
void %test2(<16 x sbyte>* %P) {
    store <16 x sbyte> cast (<4 x int> xor (<4 x int> cast (<16 x ubyte> < ubyte 8, ubyte 9, ubyte 10, ubyte 11, ubyte 16, ubyte 17, ubyte 18, ubyte 19, ubyte 12, ubyte 13, ubyte 14, ubyte 15, ubyte 20, ubyte 21, ubyte 22, ubyte 23 > to <4 x int>), <4 x int> cast (<16 x sbyte> < sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8, sbyte 8 > to <4 x int>)) to <16 x sbyte>), <16 x sbyte> * %P
    ret void
}
into this:
_test2:
    mfspr r2, 256
    oris r4, r2, 32768
    mtspr 256, r4
    li r4, lo16(LCPI2_0)
    lis r5, ha16(LCPI2_0)
    lvx v0, r5, r4
    stvx v0, 0, r3
    mtspr 256, r2
    blr
instead of this:
_test2:
    mfspr r2, 256
    oris r4, r2, 49152
    mtspr 256, r4
    li r4, lo16(LCPI2_0)
    lis r5, ha16(LCPI2_0)
    vspltisb v0, 8
    lvx v1, r5, r4
    vxor v0, v1, v0
    stvx v0, 0, r3
    mtspr 256, r2
    blr
... which occurs here:
http://developer.apple.com/hardware/ve/calcspeed.html
llvm-svn: 27343
2006-04-02 03:25:57 +00:00
Chris Lattner
badebf1c9b
Add a new -view-legalize-dags command line option
llvm-svn: 27342
2006-04-02 03:07:27 +00:00