llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-29 23:12:55 +01:00

Author	SHA1	Message	Date
Reid Spencer	3ac324ab41	Fix numerous inferred casts. llvm-svn: 32479	2006-12-12 09:18:51 +00:00
Reid Spencer	3d1b0dfba6	Fix the casting for the computation of the Malloc size. llvm-svn: 32477	2006-12-12 09:17:08 +00:00
Reid Spencer	562b83c7df	Change inferred getCast into specific getCast. Passes all tests. llvm-svn: 32469	2006-12-12 05:05:00 +00:00
Chris Lattner	a9b75a7e35	Patch for PR1045 and Transforms/ScalarRepl/2006-12-11-SROA-Crash.ll llvm-svn: 32468	2006-12-12 04:24:41 +00:00
Chris Lattner	28e7eaf6b8	trunc to integer, not to FP. llvm-svn: 32426	2006-12-11 01:17:00 +00:00
Chris Lattner	a8eec08185	implement promotion of unions containing two packed types of the same width. This implements Transforms/ScalarRepl/union-packed.ll llvm-svn: 32422	2006-12-11 00:35:08 +00:00
Chris Lattner	098fb42690	* Eliminate calls to CastInst::createInferredCast. * Add support for promoting unions with fp values in them. This produces our new int<->fp bitcast instructions, implementing Transforms/ScalarRepl/union-fp-int.ll As an example, this allows us to compile this: union intfloat { int i; float f; }; float invsqrt(const float arg_x) { union intfloat x = { .f = arg_x }; const float xhalf = arg_x * 0.5f; x.i = 0x5f3759df - (x.i >> 1); return x.f * (1.5f - xhalf * x.f * x.f); } into: _invsqrt: movss 4(%esp), %xmm0 movd %xmm0, %eax sarl %eax movl $1597463007, %ecx subl %eax, %ecx movd %ecx, %xmm1 mulss LCPI1_0, %xmm0 mulss %xmm1, %xmm0 movss LCPI1_1, %xmm2 mulss %xmm1, %xmm0 subss %xmm0, %xmm2 movl 8(%esp), %eax mulss %xmm2, %xmm1 movss %xmm1, (%eax) ret instead of: _invsqrt: subl $4, %esp movss 8(%esp), %xmm0 movss %xmm0, (%esp) movl (%esp), %eax movl $1597463007, %ecx sarl %eax subl %eax, %ecx movl %ecx, (%esp) mulss LCPI1_0, %xmm0 movss (%esp), %xmm1 mulss %xmm1, %xmm0 mulss %xmm1, %xmm0 movss LCPI1_1, %xmm2 subss %xmm0, %xmm2 mulss %xmm2, %xmm1 movl 12(%esp), %eax movss %xmm1, (%eax) addl $4, %esp ret llvm-svn: 32418	2006-12-10 23:56:50 +00:00
Reid Spencer	069149765d	Incorporate any changes in the successor blocks into the result of MarkAliveBlocks. llvm-svn: 32375	2006-12-08 21:52:01 +00:00
Bill Wendling	f13d78d3b8	What should be the last unnecessary <iostream>s in the library. llvm-svn: 32333	2006-12-07 22:21:48 +00:00
Bill Wendling	23b8b13c9d	Removing even more <iostream> includes. llvm-svn: 32320	2006-12-07 20:04:42 +00:00
Bill Wendling	a3246c4272	Changed llvm_ostream et all to OStream. llvm_cerr, llvm_cout, llvm_null, are now cerr, cout, and NullStream resp. llvm-svn: 32298	2006-12-07 01:30:32 +00:00
Reid Spencer	ff6cd88f93	Update ConstantIntegral Max/Min tests for new interface. llvm-svn: 32288	2006-12-06 20:39:57 +00:00
Chris Lattner	06ba0b8202	add missing #include llvm-svn: 32280	2006-12-06 18:14:47 +00:00
Chris Lattner	a531ce882e	Detemplatize the Statistic class. The only type it is instantiated with is 'unsigned'. llvm-svn: 32279	2006-12-06 17:46:33 +00:00
Chris Lattner	3d1758e08c	Remove the 'printname' argument to WriteAsOperand. It is always true, and passing false would make the asmprinter fail anyway. llvm-svn: 32264	2006-12-06 06:16:21 +00:00
Chris Lattner	8fac1e29c7	counter should be unsigned. llvm-svn: 32252	2006-12-06 01:50:04 +00:00
Chris Lattner	e0738f8f8b	add an instcombine xform. This speeds up 462.libquantum from 9.78s to 7.48s. This regression is due to unforseen consequences of the cast patch. llvm-svn: 32209	2006-12-05 01:26:29 +00:00
Devang Patel	ae17721f63	SCCP does not handle Packed Type properly. Disable Packed Type handling for now. llvm-svn: 32208	2006-12-04 23:54:59 +00:00
Reid Spencer	d727d239f8	Update call to CastInst::getCastOpcode for its new signature. llvm-svn: 32166	2006-12-04 02:48:01 +00:00
Jeff Cohen	f99052befb	Unbreak VC++ build. llvm-svn: 32113	2006-12-02 02:22:01 +00:00
Chris Lattner	1629b0d995	disable transformations that are invalid for fp vectors. This fixes Transforms/InstCombine/2006-12-01-BadFPVectorXform.ll llvm-svn: 32112	2006-12-02 00:13:08 +00:00
Reid Spencer	529fb41272	Remove 4 FIXMEs to hack around cast-to-bool problems which no longer exist. llvm-svn: 32051	2006-11-30 23:13:36 +00:00
Chris Lattner	f3806f1c16	make it clear that this is always a zext llvm-svn: 32044	2006-11-30 17:35:08 +00:00
Chris Lattner	affa9441cd	One more bugfix, 3 cases of making casts explicit. llvm-svn: 32043	2006-11-30 17:32:29 +00:00
Chris Lattner	0b4753a96c	Fix a bug in globalopt due to the recent cast patch. llvm-svn: 32042	2006-11-30 17:26:08 +00:00
Chris Lattner	2fd5719f50	implement cast.ll:test35. With this, we recognize: unsigned short swp(unsigned short a) { return ((a & 0xff00) >> 8 \| (a & 0x00ff) << 8); } as an idiom for bswap. llvm-svn: 32011	2006-11-29 07:18:39 +00:00
Chris Lattner	03fdea2e74	Teach instcombine to turn trunc(srl x, c) -> srl (trunc(x), c) when safe. This implements InstCombine/cast.ll:test34. It fires hundreds of times on 176.gcc. llvm-svn: 32009	2006-11-29 07:04:07 +00:00
Chris Lattner	0409f2c48d	Implement Regression/Transforms/InstCombine/bswap-fold.ll, folding seteq (bswap(x)), c -> seteq(x,bswap(c)) llvm-svn: 32006	2006-11-29 05:02:16 +00:00
Reid Spencer	a866877d2f	Join a split line. llvm-svn: 31996	2006-11-29 01:11:01 +00:00
Reid Spencer	c48fe0fd4d	Undo the last patch until 253.perlbmk passes with these changes. llvm-svn: 31977	2006-11-28 20:23:51 +00:00
Reid Spencer	8587322e79	Remove 4 FIXME's from the CAST patch now that the back end is correctly producing code for "trunc to bool". This passes all tests on Linux. llvm-svn: 31963	2006-11-28 07:23:01 +00:00
Chris Lattner	b391cbb939	Fix PR1014 and InstCombine/2006-11-27-XorBug.ll. llvm-svn: 31941	2006-11-27 19:55:07 +00:00
Reid Spencer	992d9788b3	For PR950: The long awaited CAST patch. This introduces 12 new instructions into LLVM to replace the cast instruction. Corresponding changes throughout LLVM are provided. This passes llvm-test, llvm/test, and SPEC CPUINT2000 with the exception of 175.vpr which fails only on a slight floating point output difference. llvm-svn: 31931	2006-11-27 01:05:10 +00:00
Bill Wendling	999f49061f	Remove #include <iostream> and use llvm_* streams instead. llvm-svn: 31925	2006-11-26 10:17:54 +00:00
Bill Wendling	5c13d56f78	Replace #include <iostream> with llvm_* streams. llvm-svn: 31924	2006-11-26 10:02:32 +00:00
Bill Wendling	1b3a86000a	Removed #include <iostream> and replaced with llvm_* streams. llvm-svn: 31923	2006-11-26 09:46:52 +00:00
Bill Wendling	f2fadcee85	Removed #include <iostream> and used the llvm_cerr/DOUT streams instead. llvm-svn: 31922	2006-11-26 09:17:06 +00:00
Nick Lewycky	cd25e651c2	Update to new predicate simplifier VRP design. Fixes PR966 and PR967. Remove predicate simplifier from default gcc3 pipeline. New design is too slow to enable by default. Add new testcases for problems encountered in development. llvm-svn: 31895	2006-11-22 23:49:16 +00:00
Chris Lattner	632c66b8ef	This xform is handled by FoldOpIntoPhi in visitCastInst in a more elegant way. llvm-svn: 31889	2006-11-21 17:05:13 +00:00
Chris Lattner	cdb67482da	Do not convert massive blocks on phi nodes into select statements. Instead only do these transformations if there are a small number of phi's. This speeds up Ptrdist/ks from 2.35s to 2.19s on my mac pro. llvm-svn: 31853	2006-11-18 19:19:36 +00:00
Chris Lattner	cc4df7e0ab	If an indvar with a variable stride is used by the exit condition, go ahead and handle it like constant stride vars. This fixes some bad codegen in variable stride cases. For example, it compiles this: void foo(int k, int i) { for (k=i+i; k <= 8192; k+=i) flags2[k] = 0; } to: LBB1_1: #bb.preheader movl %eax, %ecx addl %ecx, %ecx movl L_flags2$non_lazy_ptr, %edx LBB1_2: #bb movb $0, (%edx,%ecx) addl %eax, %ecx cmpl $8192, %ecx jle LBB1_2 #bb LBB1_5: #return ret or (if the array is local and we are in dynamic-nonpic or static mode): LBB3_2: #bb movb $0, _flags2(%ecx) addl %eax, %ecx cmpl $8192, %ecx jle LBB3_2 #bb and: lis r2, ha16(L_flags2$non_lazy_ptr) lwz r2, lo16(L_flags2$non_lazy_ptr)(r2) slwi r3, r4, 1 LBB1_2: ;bb li r5, 0 add r6, r4, r3 stbx r5, r2, r3 cmpwi cr0, r6, 8192 bgt cr0, LBB1_5 ;return instead of: leal (%eax,%eax,2), %ecx movl %eax, %edx addl %edx, %edx addl L_flags2$non_lazy_ptr, %edx xorl %esi, %esi LBB1_2: #bb movb $0, (%edx,%esi) movl %eax, %edi addl %esi, %edi addl %ecx, %esi cmpl $8192, %esi jg LBB1_5 #return and: lis r2, ha16(L_flags2$non_lazy_ptr) lwz r2, lo16(L_flags2$non_lazy_ptr)(r2) mulli r3, r4, 3 slwi r5, r4, 1 li r6, 0 add r2, r2, r5 LBB1_2: ;bb li r5, 0 add r7, r3, r6 stbx r5, r2, r6 add r6, r4, r6 cmpwi cr0, r7, 8192 ble cr0, LBB1_2 ;bb This speeds up Benchmarks/Shootout/sieve from 8.533s to 6.464s and implements LoopStrengthReduce/var_stride_used_by_compare.ll llvm-svn: 31809	2006-11-17 06:17:33 +00:00
Chris Lattner	0a2d29b345	Fix a gcc 4.2 warning. llvm-svn: 31751	2006-11-15 04:53:24 +00:00
Chris Lattner	0114b0c20e	implement InstCombine/shift-simplify.ll by transforming: (X >> Z) op (Y >> Z) -> (X op Y) >> Z for all shifts and all ops={and/or/xor}. llvm-svn: 31729	2006-11-14 07:46:50 +00:00
Chris Lattner	616335f272	implement InstCombine/and-compare.ll:test1. This compiles: typedef struct { unsigned prefix : 4; unsigned code : 4; unsigned unsigned_p : 4; } tree_common; int foo(tree_common a, tree_common b) { return a->code == b->code; } into: _foo: movl 4(%esp), %eax movl 8(%esp), %ecx movl (%eax), %eax xorl (%ecx), %eax # TRUNCATE movb %al, %al shrb $4, %al testb %al, %al sete %al movzbl %al, %eax ret instead of: _foo: movl 8(%esp), %eax movb (%eax), %al shrb $4, %al movl 4(%esp), %ecx movb (%ecx), %cl shrb $4, %cl cmpb %al, %cl sete %al movzbl %al, %eax ret saving one cycle by eliminating a shift. llvm-svn: 31727	2006-11-14 06:06:06 +00:00
Chris Lattner	65a873caa2	Fix InstCombine/2006-11-10-ashr-miscompile.ll a miscompilation introduced by the shr -> [al]shr patch. This was reduced from 176.gcc. llvm-svn: 31653	2006-11-10 23:38:52 +00:00
Chris Lattner	4e6c828296	second patch to fix PR992/993. llvm-svn: 31610	2006-11-09 23:36:08 +00:00
Chris Lattner	23d3dac40a	Minimal patch to fix PR992/PR993 llvm-svn: 31608	2006-11-09 23:17:45 +00:00
Chris Lattner	cd3c5f59e2	Teach ShrinkDemandedConstant how to handle X+C. This implements: add.ll:test33, add.ll:test34, shift-sra.ll:test2 llvm-svn: 31586	2006-11-09 05:12:27 +00:00
Chris Lattner	d1bcd014a8	reenable factoring of GEP expressions, being more precise about the case that it bad to do. llvm-svn: 31563	2006-11-08 19:42:28 +00:00
Chris Lattner	77e0e67f23	make this code more efficient by not creating a phi node we are just going to delete in the first place. This also makes it simpler. llvm-svn: 31562	2006-11-08 19:29:23 +00:00

1 2 3 4 5 ...

2670 Commits