llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Kay Tiong Khoo	dd9c210766	Use local variable for repeated use rather than 'get' method. No functional change intended. llvm-svn: 196164	2013-12-02 22:23:32 +00:00
Kay Tiong Khoo	40a987d7ca	Move variables to where they are used and give them better names. No functional change intended. llvm-svn: 196163	2013-12-02 22:20:40 +00:00
Kay Tiong Khoo	14489a85ab	Rename variables to be consistent (CST -> Cst). No functional change intended. llvm-svn: 196161	2013-12-02 22:11:56 +00:00
Kay Tiong Khoo	5257afa264	Conservative fix for PR17827 - don't optimize a shift + and + compare sequence where the shift is logical unless the comparison is unsigned llvm-svn: 196129	2013-12-02 18:43:59 +00:00
Stephen Canon	d8aaca93a6	Rein in overzealous InstCombine of fptrunc(OP(fpextend, fpextend)). llvm-svn: 195934	2013-11-28 21:38:05 +00:00
Hal Finkel	79b1387151	Apply the InstCombine fptrunc sqrt optimization to llvm.sqrt InstCombine, in visitFPTrunc, applies the following optimization to sqrt calls: (fptrunc (sqrt (fpext x))) -> (sqrtf x) but does not apply the same optimization to llvm.sqrt. This is a problem because, to enable vectorization, Clang generates llvm.sqrt instead of sqrt in fast-math mode, and because this optimization is being applied to sqrt and not applied to llvm.sqrt, sometimes the fast-math code is slower. This change makes InstCombine apply this optimization to llvm.sqrt as well. This fixes the specific problem in PR17758, although the same underlying issue (optimizations applied to libcalls are not applied to intrinsics) exists for other optimizations in SimplifyLibCalls. llvm-svn: 194935	2013-11-16 21:29:08 +00:00
Benjamin Kramer	0519e29d1b	InstCombine: fold (A >> C) == (B >> C) --> (A^B) < (1 << C) for constant Cs. This is common in bitfield code. llvm-svn: 194925	2013-11-16 16:00:48 +00:00
Matt Arsenault	2c9ec5c652	Add instcombine visitor for addrspacecast llvm-svn: 194786	2013-11-15 05:45:08 +00:00
Nadav Rotem	a2f08b8300	Update the docs to match the function name. llvm-svn: 194537	2013-11-13 01:12:01 +00:00
Nadav Rotem	e15585e8ff	Fold (iszero(A&K1) \| iszero(A&K2)) -> (A&(K1\|K2)) != (K1\|K2) if we know that K1 and K2 are 'one-hot' (only one bit is on). llvm-svn: 194525	2013-11-12 22:38:59 +00:00
Matt Arsenault	1f521e921d	Scalarize select vector arguments when extracted. When the elements are extracted from a select on vectors or a vector select, do the select on the extracted scalars from the input if there is only one use. llvm-svn: 194013	2013-11-04 20:36:06 +00:00
Craig Topper	037594e792	Remove x86_sse42_crc32_64_8 intrinsic. It has no functional difference from x86_sse42_crc32_32_8 and was not mapped to a clang builtin. I'm not even sure why this form of the instruction is even called out explicitly in the docs. Also add AutoUpgrade support to convert it into the other intrinsic with appropriate trunc and zext. llvm-svn: 192672	2013-10-15 05:20:47 +00:00
Owen Anderson	5e2eba8c18	Pull fptrunc's upwards through selects when one of the select's selectands was a constant. This has a number of benefits, including producing small immediates (easier to materialize, smaller constant pools) as well as being more likely to allow the fptrunc to fuse with a preceding instruction (truncating selects are unusual). llvm-svn: 191929	2013-10-03 21:08:05 +00:00
Matt Arsenault	c8e9a77ae3	Make gep i8* X, -(ptrtoint Y) transform work with address spaces llvm-svn: 191920	2013-10-03 18:15:57 +00:00
Matt Arsenault	bc0b70cd8d	Use right address space size in InstCombineCompares The test's output doesn't change, but this ensures this is actually hit with a different address space. llvm-svn: 191701	2013-09-30 21:11:01 +00:00
Matt Arsenault	27f5203bba	Constant fold ptrtoint + compare with address spaces llvm-svn: 191699	2013-09-30 21:06:18 +00:00
Benjamin Kramer	ae9249f56f	InstCombine: Replace manual fast math flag copying with the new IRBuilder RAII helper. Defines away the issue where cast<Instruction> would fail because constant folding happened. Also slightly cleaner. llvm-svn: 191674	2013-09-30 15:39:59 +00:00
Joey Gouly	b2a84bfcc1	Fix a bug in InstCombine where it attempted to cast a Value* to an Instruction* when it was actually a Constant*. There are quite a few other casts to Instruction that might have the same problem, but this is the only one I have a test case for. llvm-svn: 191668	2013-09-30 14:18:35 +00:00
Matt Arsenault	c1629ee7a8	Use type helper functions llvm-svn: 191574	2013-09-27 22:18:51 +00:00
Justin Bogner	d2e08f6deb	InstCombine: Only foldSelectICmpAndOr for integer types Currently foldSelectICmpAndOr asserts if the "or" involves a vector containing several of the same power of two. We can easily avoid this by only performing the fold on integer types, like foldSelectICmpAnd does. Fixes <rdar://problem/15012516> llvm-svn: 191552	2013-09-27 20:35:39 +00:00
Benjamin Kramer	109a525643	Push analysis passes to InstSimplify when they're around anyways. llvm-svn: 191309	2013-09-24 16:37:40 +00:00
Benjamin Kramer	89ff6bf9f0	InstCombine: Remove unused argument. No functionality change. llvm-svn: 191112	2013-09-20 22:12:42 +00:00
Benjamin Kramer	7ea950a209	InstCombine: Canonicalize (gep i8* X, -(ptrtoint Y)) to (sub (ptrtoint X), (ptrtoint Y)) The GEP pattern is what SCEV expander emits for "ugly geps". The latter is what you get for pointer subtraction in C code. The rest of instcombine already knows how to deal with that so just canonicalize on that. llvm-svn: 191090	2013-09-20 14:38:44 +00:00
Shuxin Yang	44b8e62121	[Fast-math] Disable "(C1/X)C2 => (C1C2)/X" if C1/X has multiple uses. If "C1/X" were having multiple uses, the only benefit of this transformation is to potentially shorten critical path. But it is at the cost of instroducing additional div. The additional div may or may not incur cost depending on how div is implemented. If it is implemented using Newton–Raphson iteration, it dosen't seem to incur any cost (FIXME). However, if the div blocks the entire pipeline, that sounds to be pretty expensive. Let CodeGen to take care this transformation. This patch sees 6% on a benchmark. rdar://15032743 llvm-svn: 191037	2013-09-19 21:13:46 +00:00
Benjamin Kramer	b681a45715	InstCombine: Don't allow turning vector-of-pointer loads into vector-of-integer. The code below can't handle any pointers. PR17293. llvm-svn: 191036	2013-09-19 20:59:04 +00:00
Quentin Colombet	1950396a9e	Revert the load slicing done in r190870. To avoid regressions with bitfield optimizations, this slicing should take place later, like ISel time. llvm-svn: 190891	2013-09-17 22:01:26 +00:00
Matt Arsenault	6fd5ad85d0	Cleanup handling of constant function casts. Some of this code is no longer necessary since int<->ptr casts are no longer occur as of r187444. This also fixes handling vectors of pointers, and adds a bunch of new testcases for vectors and address spaces. llvm-svn: 190885	2013-09-17 21:10:14 +00:00
Quentin Colombet	3d996b9289	[InstCombiner] Slice a big load in two loads when the elements are next to each other in memory. The motivation was to get rid of truncate and shift right instructions that get in the way of paired load or floating point load. E.g., Consider the following example: struct Complex { float real; float imm; }; When accessing a complex, llvm was generating a 64-bits load and the imm field was obtained by a trunc(lshr) sequence, resulting in poor code generation, at least for x86. The idea is to declare that two load instructions is the canonical form for loading two arithmetic type, which are next to each other in memory. Two scalar loads at a constant offset from each other are pretty easy to detect for the sorts of passes that like to mess with loads. <rdar://problem/14477220> llvm-svn: 190870	2013-09-17 16:57:34 +00:00
Eli Friedman	3718173153	Get rid of unused isPodLike definitions. llvm-svn: 190461	2013-09-11 00:36:54 +00:00
Quentin Colombet	48ec8d3046	[InstCombiner] Expose opportunities to merge subtract and comparison. Several architectures use the same instruction to perform both a comparison and a subtract. The instruction selection framework does not allow to consider different basic blocks to expose such fusion opportunities. Therefore, these instructions are “merged” by CSE at MI IR level. To increase the likelihood of CSE to apply in such situation, we reorder the operands of the comparison, when they have the same complexity, so that they matches the order of the most frequent subtract. E.g., icmp A, B ... sub B, A <rdar://problem/14514580> llvm-svn: 190352	2013-09-09 20:56:48 +00:00
Matt Arsenault	4c2083b14a	Use type helper functions. llvm-svn: 190113	2013-09-06 00:37:24 +00:00
Matt Arsenault	f022cb3c1a	Consistently use dbgs() in debug printing llvm-svn: 190093	2013-09-05 19:48:28 +00:00
Tim Northover	baf9697e72	InstCombine: allow unmasked icmps to be combined with logical ops "(icmp op i8 A, B)" is equivalent to "(icmp op i8 (A & 0xff), B)" as a degenerate case. Allowing this as a "masked" comparison when analysing "(icmp) &/\| (icmp)" allows us to combine them in more cases. rdar://problem/7625728 llvm-svn: 189931	2013-09-04 11:57:17 +00:00
Tim Northover	9045305fce	InstCombine: look for masked compares with subset relation Even in cases which aren't universally optimisable like "(A & B) != 0 && (A & C) != 0", the masks can make one of the comparisons completely redundant. In this case, since we've gone to the effort of spotting masked comparisons we should combine them. rdar://problem/7625728 llvm-svn: 189930	2013-09-04 11:57:13 +00:00
Matt Arsenault	469d672381	Teach InstCombineLoadCast about address spaces. This is another one that doesn't matter much, but uses the right GEP index types in the first place. llvm-svn: 189854	2013-09-03 21:05:48 +00:00
Matt Arsenault	306cb38abf	Use type form of getIntPtrType in alloca visitor. This doesn't actually matter, since alloca is always 0 address space, but this is more consistent. llvm-svn: 189853	2013-09-03 21:05:15 +00:00
Benjamin Kramer	3a81691558	InstCombine: Check for zero shift amounts before subtracting one causing integer overflow. PR17026. Also avoid undefined shifts and shift amounts larger than 64 bits (those are always undef because we can't represent integer types that large). llvm-svn: 189672	2013-08-30 14:35:35 +00:00
Matt Arsenault	60f6ad8d53	Fix typo. llvm-svn: 189524	2013-08-28 22:17:26 +00:00
Matt Arsenault	95d00423a7	Teach InstCombine about address spaces llvm-svn: 188926	2013-08-21 19:53:10 +00:00
Jakub Staszak	b4a62fef02	Use pop_back_val() instead of both back() and pop_back(). llvm-svn: 188723	2013-08-19 22:47:55 +00:00
Matt Arsenault	77bbadbfcb	Teach InstCombine visitGetElementPtr about address spaces llvm-svn: 188721	2013-08-19 22:17:40 +00:00
Matt Arsenault	2c55fe773b	Cleanup visitGetElementPtr to make address space change easier llvm-svn: 188720	2013-08-19 22:17:34 +00:00
Matt Arsenault	14a3f7be8d	commonPointerCast cleanups to make address space change easier llvm-svn: 188719	2013-08-19 22:17:18 +00:00
Matt Arsenault	22098dc46b	Revert non-test parts of r188507 Re-add the inboundsless tests I didn't add originally llvm-svn: 188710	2013-08-19 21:40:31 +00:00
Jim Grosbach	933ecf8022	InstCombine: Use isAllOnesValue() instead of explicit -1. llvm-svn: 188563	2013-08-16 17:03:36 +00:00
Jim Grosbach	72387340f5	InstCombine: Simplify if(x!=0 && x!=-1). When both constants are positive or both constants are negative, InstCombine already simplifies comparisons like this, but when it's exactly zero and -1, the operand sorting ends up reversed and the pattern fails to match. Handle that special case. Follow up for rdar://14689217 llvm-svn: 188512	2013-08-16 00:15:20 +00:00
Matt Arsenault	9594ef019c	Don't do FoldCmpLoadFromIndexedGlobal for non inbounds GEPs This path wasn't tested before without a datalayout, so add some more tests and re-run with and without one. llvm-svn: 188507	2013-08-15 23:11:07 +00:00
Matt Arsenault	cb3b478d91	Fix always creating GEP with i32 indices Use the pointer size if datalayout is available. Use i64 if it's not, which is consistent with what other places do when the pointer size is unknown. The test doesn't really test this in a useful way since it will be transformed to that later anyway, but this now tests it for non-zero arrays and when datalayout isn't available. The cases in visitGetElementPtrInst should save an extra re-visit to the newly created GEP since it won't need to cleanup after itself. llvm-svn: 188339	2013-08-14 00:24:38 +00:00
Matt Arsenault	6d1daebfff	Use type helper functions instead of cast llvm-svn: 188338	2013-08-14 00:24:34 +00:00
Matt Arsenault	734103e561	Use array initializer, space around operator llvm-svn: 188337	2013-08-14 00:24:05 +00:00

1 2 3 4 5 ...

956 Commits