llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Chris Lattner	3547371b0b	unbreak the build, apparently without this transformutils starts depending on libipa? llvm-svn: 94102	2010-01-21 21:20:51 +00:00
Chris Lattner	3f91babac8	tidy up llvm-svn: 94101	2010-01-21 21:05:54 +00:00
Victor Hernandez	507d86fcbb	Don't need to include IntrinsicInst.h any more llvm-svn: 94092	2010-01-21 19:33:59 +00:00
Victor Hernandez	3a31114cf9	No need to map NULL operands of metadata llvm-svn: 94091	2010-01-21 19:26:20 +00:00
Dan Gohman	be34c35f32	Re-implement the main strength-reduction portion of LoopStrengthReduction. This new version is much more aggressive about doing "full" reduction in cases where it reduces register pressure, and also more aggressive about rewriting induction variables to count down (or up) to zero when doing so reduces register pressure. It currently uses fairly simplistic algorithms for finding reuse opportunities, but it introduces a new framework allows it to combine multiple strategies at once to form hybrid solutions, instead of doing all full-reduction or all base+index. llvm-svn: 94061	2010-01-21 02:09:26 +00:00
Eric Christopher	939ad8be86	Add strcpy_chk -> strcpy support for "don't know" object size answers. This will update as object size checking gets better information. llvm-svn: 94059	2010-01-21 01:04:38 +00:00
Chris Lattner	5c16b9815b	simplify this code. llvm-svn: 94048	2010-01-20 23:30:28 +00:00
Jakob Stoklund Olesen	ac14f3bf31	Move per-function inline threshold calculation to a method. No functional change except the forgotten test for InlineLimit.getNumOccurrences() == 0 in the CurrentThreshold2 calculation. llvm-svn: 94007	2010-01-20 17:51:28 +00:00
Victor Hernandez	50e7dcdcc2	Switch Elts from vector to SmallVector llvm-svn: 93989	2010-01-20 06:56:16 +00:00
Victor Hernandez	d79e7e0e12	Map operands of all function-local metadata, not just metadata passed to llvm.dbg.declare intrinsics llvm-svn: 93979	2010-01-20 05:49:59 +00:00
Dan Gohman	4ca37472ec	When doing address-mode sinking, expand the base register first, rather than the scaled register. This makes it more likely that subsequent AddrModeMatcher queries will match the new address the same way as the old, instead of accidentally matching what had been the base register as the new scaled register, and then failing to match the scaled register. This fixes some problems with address-mode sinking multiple muls into a block, which will be a lot more common with some upcoming LoopStrengthReduction changes. llvm-svn: 93935	2010-01-19 22:45:06 +00:00
Chris Lattner	e0124f19f9	optimize ~(~X >>s Y) --> (X >>s Y), patch by Edmund Grimley Evans! llvm-svn: 93884	2010-01-19 18:16:19 +00:00
Bob Wilson	e94d77854f	Fix a crash in scalarrepl for memcpy/memmove where the source and destination are the same. I had already fixed a similar problem where the source and destination were different bitcasts derived from the same alloca, but the previous fix still did not handle the case where both operands are exactly the same value. Radar 7552893. llvm-svn: 93848	2010-01-19 04:32:48 +00:00
Eric Christopher	5b602f9bf7	Fix comment. llvm-svn: 93831	2010-01-19 01:20:15 +00:00
Chris Lattner	6cd7a81f86	my instcombine transformations to make extension elimination more aggressive changed the canonical form from sext(trunc(x)) to ashr(lshr(x)), make sure to transform a couple more things into that canonical form, and catch a case where we missed turning zext/shl/ashr into a single sext. llvm-svn: 93787	2010-01-18 22:19:16 +00:00
Devang Patel	2cae5754c6	While mapping llvm.dbg.declare intrinsic manually map its operand, if possible, because it points to an alloca instruction through metadata. llvm-svn: 93757	2010-01-18 19:52:14 +00:00
Owen Anderson	d73ce407a5	Convert some of the dynamic opcode lookups into static ones. llvm-svn: 93693	2010-01-17 19:33:27 +00:00
Owen Anderson	50cacaff8f	Fix comment. llvm-svn: 93679	2010-01-17 06:49:03 +00:00
Bob Wilson	bdd9890c7d	Fix a comment typo. llvm-svn: 93560	2010-01-15 21:55:02 +00:00
Bill Wendling	488a7187b4	When the visitSub method was split into visitSub and visitFSub, this xform was added to the FSub version. However, the original version of this xform guarded against doing this for floating point (!Op0->getType()->isFPOrFPVector()). This is causing LLVM to perform incorrect xforms for code like: void func(double rhi, double rlo, double xh, double xl, double yh, double yl){ double mh, ml; double c = 134217729.0; double up, u1, u2, vp, v1, v2; up = xhc; u1 = (xh - up) + up; u2 = xh - u1; vp = yhc; v1 = (yh - vp) + vp; v2 = yh - v1; mh = xhyh; ml = (((u1v1 - mh) + (u1v2)) + (u2v1)) + (u2v2); ml += xhyl + xlyh; rhi = mh + ml; rlo = (mh - (rhi)) + ml; } The last line was optimized away, but rl is intended to be the difference between the infinitely precise result of mh + ml and after it has been rounded to double precision. llvm-svn: 93369	2010-01-13 23:23:17 +00:00
Chris Lattner	21bbaf49d5	1) Use the new SimplifyInstructionsInBlock routine instead of the copy in JT. 2) When cloning blocks for PHI or xor conditions, use instsimplify to simplify the code as we go. This allows us to squish common cases early in JT which opens up opportunities for subsequent iterations, and allows it to completely simplify the testcase. llvm-svn: 93253	2010-01-12 20:41:47 +00:00
Chris Lattner	87f86498c3	add a helper function. llvm-svn: 93251	2010-01-12 19:40:54 +00:00
Chris Lattner	bc0016437d	tidy up llvm-svn: 93222	2010-01-12 02:07:50 +00:00
Chris Lattner	774e3967ad	Teach jump threading to duplicate small blocks when the branch condition is a xor with a phi node. This eliminates nonsense like this from 176.gcc in several places: LBB166_84: testl %eax, %eax - setne %al - xorb %cl, %al - notb %al - testb $1, %al - je LBB166_85 + je LBB166_69 + jmp LBB166_85 This is rdar://7391699 llvm-svn: 93221	2010-01-12 02:07:17 +00:00
Chris Lattner	d6d8cc7b37	some cleanup, and make it obvious that ProcessJumpOnPHI only works on branches by renaming it and checking for a branch at the call site. llvm-svn: 93208	2010-01-11 23:41:09 +00:00
Chris Lattner	a8cabeeecb	reenable the piece that turns trunc(zext(x)) -> x even if zext has multiple uses, codegen has no apparent problem with the trunc version of this, because it turns into a simple subreg idiom llvm-svn: 93202	2010-01-11 22:49:40 +00:00
Chris Lattner	2749cc2036	Disable folding sext(trunc(x)) -> x (and other similar cast/cast cases) when the trunc has multiple uses. Codegen is not able to coalesce the subreg case correctly and so this leads to higher register pressure and spilling (see PR5997). This speeds up 256.bzip2 from 8.60 -> 8.04s on my machine, ~7%. llvm-svn: 93200	2010-01-11 22:45:25 +00:00
Chris Lattner	85a6f02b94	add one more bitfield optimization, allowing clang to generate good code on PR4216: _test_bitfield: ## @test_bitfield orl $32962, %edi movl $4294941946, %eax andq %rdi, %rax ret instead of: _test_bitfield: movl $4294941696, %ecx movl %edi, %eax orl $194, %edi orl $32768, %eax andq $250, %rdi andq %rax, %rcx movq %rdi, %rax orq %rcx, %rax ret Evan is looking into the remaining andq+imm -> andl optimization. llvm-svn: 93147	2010-01-11 06:55:24 +00:00
Chris Lattner	16e36659f5	Extend CanEvaluateZExtd to handle and/or/xor more aggressively in the BitsToClear case. This allows it to promote expressions which have an and/or/xor after the lshr, promoting cases like test2 (from PR4216) and test3 (random extample extracted from a spec benchmark). clang now compiles the code in PR4216 into: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx orl $32768, %edi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret instead of: _test_bitfield: ## @test_bitfield movl %edi, %eax orl $194, %eax movl $4294902010, %ecx andq %rax, %rcx shrl $8, %edi orl $128, %edi shlq $8, %rdi andq $39936, %rdi movq %rdi, %rax orq %rcx, %rax ret which is still not great, but is progress. llvm-svn: 93145	2010-01-11 04:05:13 +00:00
Chris Lattner	f2ba85eedc	Remove the dead TD argument to CanEvaluateZExtd, and add a new BitsToClear result which allows us to start promoting expressions that end with a lshr-by-constant. This is conservatively correct and better than what we had before (see testcases) but still needs to be extended further. llvm-svn: 93144	2010-01-11 03:32:00 +00:00
Chris Lattner	d7f1b97147	improve comments, remove dead TD argument to CanEvaluateSExtd. llvm-svn: 93143	2010-01-11 02:43:35 +00:00
Chris Lattner	18d753e05f	teach sext optimization to handle truncs from types that are not the dest of the sext. llvm-svn: 93128	2010-01-10 20:30:41 +00:00
Chris Lattner	ca53de1ab7	teach zext optimization how to deal with truncs that don't come from the zext dest type. This allows us to handle test52/53 in cast.ll, and allows llvm-gcc to generate much better code for PR4216 in -m64 mode: _test_bitfield: ## @test_bitfield orl $32962, %edi movl %edi, %eax andl $-25350, %eax ret This also fixes a bug handling vector extends, ensuring that the mask produced is a vector constant, not an integer constant. llvm-svn: 93127	2010-01-10 20:25:54 +00:00
Chris Lattner	67e0ef4d5d	simplify CanEvaluateSExtd to return a bool now that we have a simpler profitability predicate. llvm-svn: 93111	2010-01-10 07:57:20 +00:00
Chris Lattner	fa864affbc	the NumCastsRemoved argument to CanEvaluateSExtd is dead, remove it. llvm-svn: 93110	2010-01-10 07:42:21 +00:00
Chris Lattner	e68d6e61b1	now that the cost model has changed, we can always consider elimination of a sign extend to be a win, which simplifies the client of CanEvaluateSExtd, and allows us to eliminate more casts (examples taken from real code). llvm-svn: 93109	2010-01-10 07:40:50 +00:00
Chris Lattner	e86619ca01	change the preferred canonical form for a sign extension to be lshr+ashr instead of trunc+sext. We want to avoid type conversions whenever possible, it is easier to codegen expressions without truncates and extensions. llvm-svn: 93107	2010-01-10 07:08:30 +00:00
Chris Lattner	00b1e8508e	fix indentation of switch statements, no functionality change. llvm-svn: 93106	2010-01-10 06:59:55 +00:00
Chris Lattner	1332cfaea1	fix pasto that broke bootstrap. llvm-svn: 93105	2010-01-10 06:50:04 +00:00
Chris Lattner	7314e76078	simplify CanEvaluateZExtd now that we don't care about the number of bits known clear in the result and don't care about the # casts eliminated. TD is also dead but keeping it for now. llvm-svn: 93098	2010-01-10 02:50:04 +00:00
Chris Lattner	1106f03886	two changes: 1) don't try to optimize a sext or zext that is only used by a trunc, let the trunc get optimized first. This avoids some pointless effort in some common cases since instcombine scans down a block in the first pass. 2) Change the cost model for zext elimination to consider an 'and' cheaper than a zext. This allows us to do it more aggressively, and for the next patch to simplify the code quite a bit. llvm-svn: 93097	2010-01-10 02:39:31 +00:00
Chris Lattner	a04ed0659f	enhance CanEvaluateZExtd to handle shift left and sext, allowing more expressions to be promoted and casts eliminated. llvm-svn: 93096	2010-01-10 02:22:12 +00:00
Chris Lattner	eb3da2ab86	remove an xform subsumed by EvaluateInDifferentType. llvm-svn: 93095	2010-01-10 01:35:55 +00:00
Julien Lerouge	950d1334d1	Fix nondeterministic behavior. llvm-svn: 93093	2010-01-10 01:07:22 +00:00
Chris Lattner	edb60f0861	clean up this xform by using m_Trunc. llvm-svn: 93092	2010-01-10 01:04:31 +00:00
Chris Lattner	3efde81a44	inline and remove the rest of commonIntCastTransforms. llvm-svn: 93091	2010-01-10 01:00:46 +00:00
Chris Lattner	492e9ca5a4	Inline the expression type promotion/demotion stuff out of commonIntCastTransforms into the callers, eliminating a switch, and allowing the static predicate methods to be moved down to live next to the corresponding function. No functionality change. llvm-svn: 93089	2010-01-10 00:58:42 +00:00
Chris Lattner	b142e13f84	only factor from expressions whose uses are empty and whose base is the right expression type. This fixes PR5981. llvm-svn: 93045	2010-01-09 06:01:36 +00:00
Julien Lerouge	76c75e82d0	Fix nondeterministic behavior. llvm-svn: 93038	2010-01-09 01:06:49 +00:00
Eric Christopher	937778e8d0	Remove unnecessary dyn_cast and add a comment. Part of a WIP. llvm-svn: 93026	2010-01-08 21:37:11 +00:00

1 2 3 4 5 ...

6310 Commits