llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Andrew Lenharth	ef07d5c5a5	deal with memchr llvm-svn: 27830	2006-04-19 15:34:02 +00:00
Andrew Lenharth	09948a5331	friendlier error message llvm-svn: 27829	2006-04-19 15:33:35 +00:00
Chris Lattner	62537a04fb	add a note llvm-svn: 27828	2006-04-19 05:55:06 +00:00
Chris Lattner	99c7c3ad2f	Add a note. llvm-svn: 27827	2006-04-19 05:53:27 +00:00
Chris Lattner	411111435e	grammaro llvm-svn: 27826	2006-04-19 04:21:57 +00:00
Chris Lattner	8b3c2c9a9d	Fix a bug owen noticed llvm-svn: 27825	2006-04-19 04:21:16 +00:00
Chris Lattner	c76104e09b	Change wording llvm-svn: 27824	2006-04-19 04:12:01 +00:00
Chris Lattner	bd5802ad9b	add a note llvm-svn: 27823	2006-04-19 04:05:21 +00:00
Chris Lattner	966f0d4424	add some more notes llvm-svn: 27822	2006-04-19 04:02:47 +00:00
Andrew Lenharth	d89cdc541a	stupid stuff llvm-svn: 27821	2006-04-19 03:45:25 +00:00
Andrew Lenharth	90e3af9ab1	fix printing call graphs llvm-svn: 27820	2006-04-18 23:45:19 +00:00
Andrew Lenharth	87c0d713cc	I understand now. Shoot. llvm-svn: 27819	2006-04-18 22:36:11 +00:00
Evan Cheng	7364ee1c92	- PEXTRW cannot take a memory location as its first source operand. - PINSRWrmi encoding bug. llvm-svn: 27818	2006-04-18 21:59:43 +00:00
Evan Cheng	d6fa185be2	SHUFP{S\|D}, PSHUF* encoding bugs. Left out the mask immediate operand. llvm-svn: 27817	2006-04-18 21:56:36 +00:00
Evan Cheng	f16e4bf29d	Name change for clarity sake llvm-svn: 27816	2006-04-18 21:55:35 +00:00
Evan Cheng	82d7cacbbc	Encoding bug: CMPPSrmi, CMPPDrmi dropped operand 2 (condtion immediate). llvm-svn: 27815	2006-04-18 21:31:08 +00:00
Evan Cheng	8e87e9b0db	Name change for clarity sake llvm-svn: 27814	2006-04-18 21:29:50 +00:00
Evan Cheng	838f053b09	Left a pattern out llvm-svn: 27813	2006-04-18 21:29:08 +00:00
Andrew Lenharth	4d432bcb19	llvm.memc* improvements. helps PA a lot in some specmarks llvm-svn: 27812	2006-04-18 20:59:52 +00:00
Andrew Lenharth	d5fd116ede	llvm.memc* improvements. helps PA a lot in some specmarks llvm-svn: 27811	2006-04-18 19:54:11 +00:00
Chris Lattner	f58f727be6	These are correctly encoded by the JIT. I checked :) llvm-svn: 27810	2006-04-18 19:03:38 +00:00
Chris Lattner	5f153584d9	add a note llvm-svn: 27809	2006-04-18 18:30:19 +00:00
Chris Lattner	47a41ae889	Fix a crash on: void foo2(vector float A, vector float B) { vector float C = (vector float)vec_cmpeq(A, B); if (!vec_any_eq(A, B)) B = (vector float){0,0,0,0}; A = C; } llvm-svn: 27808	2006-04-18 18:28:22 +00:00
Evan Cheng	2cd4e2d240	Fixed an encoding bug: movd from XMM to R32. llvm-svn: 27807	2006-04-18 18:19:00 +00:00
Chris Lattner	2bd91746e1	pretty print node name llvm-svn: 27806	2006-04-18 18:05:58 +00:00
Chris Lattner	44ea12c5f8	Implement an important entry from README_ALTIVEC: If an altivec predicate compare is used immediately by a branch, don't use a (serializing) MFCR instruction to read the CR6 register, which requires a compare to get it back to CR's. Instead, just branch on CR6 directly. :) For example, for: void foo2(vector float A, vector float B) { if (!vec_any_eq(A, B)) *B = (vector float){0,0,0,0}; } We now generate: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 bne cr6, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr instead of: _foo2: mfspr r2, 256 oris r5, r2, 12288 mtspr 256, r5 lvx v2, 0, r4 lvx v3, 0, r3 vcmpeqfp. v2, v3, v2 mfcr r3, 2 rlwinm r3, r3, 27, 31, 31 cmpwi cr0, r3, 0 beq cr0, LBB1_2 ; UnifiedReturnBlock LBB1_1: ; cond_true vxor v2, v2, v2 stvx v2, 0, r4 mtspr 256, r2 blr LBB1_2: ; UnifiedReturnBlock mtspr 256, r2 blr This implements CodeGen/PowerPC/vec_br_cmp.ll. llvm-svn: 27804	2006-04-18 17:59:36 +00:00
Chris Lattner	43ef757c0e	new testcase llvm-svn: 27803	2006-04-18 17:56:30 +00:00
Chris Lattner	519001b0ee	move some stuff around, clean things up llvm-svn: 27802	2006-04-18 17:52:36 +00:00
Chris Lattner	3e2a664ada	Teach the codegen about instructions used for SSE spill code, allowing it to optimize cases where it has to spill a lot llvm-svn: 27801	2006-04-18 16:44:51 +00:00
Nate Begeman	7821901005	Fix a copy & paste error from long ago. llvm-svn: 27800	2006-04-18 16:03:18 +00:00
Chris Lattner	a72743cb57	Add some more notes, many still missing llvm-svn: 27799	2006-04-18 06:32:08 +00:00
Reid Spencer	87e24202fb	Have the AutoRegen.sh script prompt the user for the LLVM src and obj directories if it can't find them. Then, replace those values into the configure.ac script and pass them to the LLVM_CONFIG_PROJECT so that the values become the default for llvm_src and llvm_obj variables. In this way the user is required to input this exactly once, and the scripts take it from there. llvm-svn: 27798	2006-04-18 06:27:47 +00:00
Reid Spencer	04ccc317bc	Make it possible to default the llvm_src and llvm_obj variables based on the arguments to the macro. This better supports the AutoRegen.sh script in projects/sample/autoconf. llvm-svn: 27797	2006-04-18 06:25:37 +00:00
Chris Lattner	276916c3bf	add a bunch of stuff, pieces still missing llvm-svn: 27796	2006-04-18 06:18:36 +00:00
Chris Lattner	8228f79eae	Add a warning. llvm-svn: 27795	2006-04-18 05:31:20 +00:00
Chris Lattner	90357afa3d	Add a warning llvm-svn: 27794	2006-04-18 05:26:10 +00:00
Chris Lattner	e90fdf3b98	Use vmladduhm to do v8i16 multiplies which is faster and simpler than doing even/odd halves. Thanks to Nate telling me what's what. llvm-svn: 27793	2006-04-18 04:28:57 +00:00
Chris Lattner	5951b60cb4	Implement v16i8 multiply with this code: vmuloub v5, v3, v2 vmuleub v2, v3, v2 vperm v2, v2, v5, v4 This implements CodeGen/PowerPC/vec_mul.ll. With this, v16i8 multiplies are 6.79x faster than before. Overall, UnitTests/Vector/multiplies.c is now 2.45x faster with LLVM than with GCC. Remove the 'integer multiplies' todo from the README file. llvm-svn: 27792	2006-04-18 03:57:35 +00:00
Chris Lattner	fd1fb64f66	Add tests for v8i16 and v16i8 llvm-svn: 27791	2006-04-18 03:54:50 +00:00
Evan Cheng	6be2e4b419	Correct comments llvm-svn: 27790	2006-04-18 03:45:01 +00:00
Chris Lattner	4d84b56e64	Lower v8i16 multiply into this code: li r5, lo16(LCPI1_0) lis r6, ha16(LCPI1_0) lvx v4, r6, r5 vmulouh v5, v3, v2 vmuleuh v2, v3, v2 vperm v2, v2, v5, v4 where v4 is: LCPI1_0: ; <16 x ubyte> .byte 2 .byte 3 .byte 18 .byte 19 .byte 6 .byte 7 .byte 22 .byte 23 .byte 10 .byte 11 .byte 26 .byte 27 .byte 14 .byte 15 .byte 30 .byte 31 This is 5.07x faster on the G5 (measured) than lowering to scalar code + loads/stores. llvm-svn: 27789	2006-04-18 03:43:48 +00:00
Chris Lattner	613d7fda64	Custom lower v4i32 multiplies into a cute sequence, instead of having legalize scalarize the sequence into 4 mullw's and a bunch of load/store traffic. This speeds up v4i32 multiplies 4.1x (measured) on a G5. This implements PowerPC/vec_mul.ll llvm-svn: 27788	2006-04-18 03:24:30 +00:00
Chris Lattner	ab9ad1cdc1	new testcase llvm-svn: 27787	2006-04-18 03:22:16 +00:00
Evan Cheng	13a5022494	Another entry llvm-svn: 27786	2006-04-18 01:22:57 +00:00
Chris Lattner	d664951c95	Fix a build failure on Vladimir's tester. llvm-svn: 27785	2006-04-18 00:21:25 +00:00
Evan Cheng	2f9011cd87	Another entry. llvm-svn: 27784	2006-04-18 00:21:01 +00:00
Evan Cheng	98b1ca65dd	Use movss to insert_vector_elt(v, s, 0). llvm-svn: 27782	2006-04-17 22:45:49 +00:00
Chris Lattner	76beeb373d	Turn x86 unaligned load/store intrinsics into aligned load/store instructions if the pointer is known aligned. llvm-svn: 27781	2006-04-17 22:26:56 +00:00
Chris Lattner	8c8f1e35c9	Fix handling of calls in functions that use vectors. This fixes a crash on the code in GCC PR26546. llvm-svn: 27780	2006-04-17 22:10:08 +00:00
Evan Cheng	ecf13c5d79	Use two pinsrw to insert an element into v4i32 / v4f32 vector. llvm-svn: 27779	2006-04-17 22:04:06 +00:00

1 2 3 4 5 ...

24220 Commits