llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	3b4ac7dc2a	[InstCombine][X86][AVX] Add DemandedElts support for VPERMILPD/VPERMILPS instructions Simplify a vpermilvar shuffle mask based on the elements of the mask that are actually demanded. llvm-svn: 292209	2017-01-17 11:35:03 +00:00
Sanjoy Das	1728d455cf	[InstCombine] Don't DSE across readnone functions that may throw Summary: Depends on D28740 Reviewers: dberlin, chandlerc, hfinkel, majnemer Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D28742 llvm-svn: 292197	2017-01-17 05:45:09 +00:00
David Majnemer	b33ccfb48d	[InstCombine] Fold ((C1-zext(X)) & C2) -> zext((C1-X) & C2) This is valid if C2 fits within the bitwidth of X thanks to two's complement modulo arithmetic. llvm-svn: 292179	2017-01-17 00:45:57 +00:00
Matt Arsenault	a568a0d4a6	Add comment to test file I forgot to save llvm-svn: 292178	2017-01-17 00:35:28 +00:00
Matt Arsenault	018a8adda0	SimplifyLibCalls: Remove checks for fabs Use the intrinsic instead of emitting the libcall which will be replaced by the intrinsic. llvm-svn: 292176	2017-01-17 00:30:31 +00:00
Matt Arsenault	ccfb3dd68e	SimplifyLibCalls: Replace fabs libcalls with intrinsics Add missing fabs(fpext) optimzation that worked with the call, and also fixes it creating a second fpext when there were multiple uses. llvm-svn: 292172	2017-01-17 00:10:40 +00:00
Simon Pilgrim	b3d302c9e0	[InstCombine][AVX] Tests showing missed opportunities to pass demanded elts through a permilpd/permilps shuffle mask llvm-svn: 292165	2017-01-16 21:34:22 +00:00
Sanjay Patel	ab8f315ba8	[InstCombine] use m_APInt to allow shift-shift folds for vectors with splat constants Some existing 'FIXME' tests are still not folded because of splat holes in value tracking. llvm-svn: 292151	2017-01-16 19:35:45 +00:00
Sanjay Patel	1ec279d94f	[InstCombine] add tests to show missed vector folds; NFC The shift-shift possibilities became easier to see after: https://reviews.llvm.org/rL292145 llvm-svn: 292150	2017-01-16 19:23:34 +00:00
Simon Pilgrim	cdda17446c	[InstCombine][SSE] Tests showing missed opportunities to pass demanded elts through a packss/packus truncation llvm-svn: 292144	2017-01-16 17:26:23 +00:00
Simon Pilgrim	9d900b5f6e	[InstCombine][SSE] Add DemandedElts support for PSHUFB instructions Simplify a pshufb shuffle mask based on the elements of the mask that are actually demanded. Differential Revision: https://reviews.llvm.org/D28745 llvm-svn: 292101	2017-01-16 11:30:41 +00:00
Sanjay Patel	3b866f23f8	[InstCombine] add tests to show missed vector folds; NFC Also, add comments and remove bogus comment. llvm-svn: 292082	2017-01-15 23:45:03 +00:00
Simon Pilgrim	c1b9ed731d	[InstCombine][SSE] Tests showing missed opportunities to pass demanded elts through a pshufb shuffle mask llvm-svn: 292072	2017-01-15 17:49:04 +00:00
Sanjay Patel	5495465040	[InstCombine] use m_APInt to allow ashr folds for vectors with splat constants llvm-svn: 292064	2017-01-15 16:38:19 +00:00
Sanjay Patel	f8ca2aa064	[InstCombine] add explanatory comments to tests; NFC llvm-svn: 292063	2017-01-15 16:22:26 +00:00
Chandler Carruth	fa2e854de5	[PM] Fix instcombine's analysis preservation in the new pass manager to cover domtree and alias analysis. These are the pretty clear analyses that we would always want to survive this pass. To make these survive, we also need to preserve the assumption cache. Added a test that verifies the important bits of this preservation. llvm-svn: 292037	2017-01-14 23:25:22 +00:00
Sanjay Patel	286505799a	[InstCombine] add test to show missed vector fold; NFC llvm-svn: 292035	2017-01-14 23:12:29 +00:00
Sanjay Patel	c20cf10406	[InstCombine] optimize unsigned icmp of increment Allows LLVM to optimize sequences like the following: %add = add nuw i32 %x, 1 %cmp = icmp ugt i32 %add, %y Into: %cmp = icmp uge i32 %x, %y Previously, only signed comparisons were being handled. Decrements could also be handled, but 'sub nuw %x, 1' is currently canonicalized to 'add %x, -1' in InstCombineAddSub, losing the nuw flag. Removing that canonicalization seems like it might have far-reaching ramifications so I kept this simple for now. Patch by Matti Niemenmaa! Differential Revision: https://reviews.llvm.org/D24700 llvm-svn: 291975	2017-01-13 23:25:46 +00:00
Sanjay Patel	fcb9044bb8	[InstCombine] use m_APInt to allow lshr folds for vectors with splat constants llvm-svn: 291972	2017-01-13 23:04:10 +00:00
Sanjay Patel	4485ac55e2	[InstCombine / InstSimplify] add and move tests for lshr transforms; NFC llvm-svn: 291970	2017-01-13 22:54:12 +00:00
Sanjay Patel	3edc9b2bd9	[InstCombine] use m_APInt to allow shl folds for vectors with splat constants llvm-svn: 291934	2017-01-13 18:39:09 +00:00
Sanjay Patel	68dd981647	[InstCombine] add tests to show missing transforms for vector shl; NFC llvm-svn: 291926	2017-01-13 18:27:23 +00:00
Sanjay Patel	ee734847d0	[InstCombine] if the condition of a select may be known via assumes, eliminate the select This is a limited solution for PR31512: https://llvm.org/bugs/show_bug.cgi?id=31512 The motivation is that we will need to increase usage of llvm.assume and/or metadata to solve PR28430: https://llvm.org/bugs/show_bug.cgi?id=28430 ...and this kind of simplification is needed to take advantage of that extra information. The 'not' test case would be handled by: https://reviews.llvm.org/D28485 Differential Revision: https://reviews.llvm.org/D28337 llvm-svn: 291915	2017-01-13 17:02:42 +00:00
Matt Arsenault	588e04537c	InstSimplify: Eliminate fabs on known positive llvm-svn: 291624	2017-01-11 00:33:24 +00:00
Matt Arsenault	0be1430dec	InstCombine: fdiv -x, -y -> fdiv x, y llvm-svn: 291611	2017-01-10 23:08:54 +00:00
Davide Italiano	ba6e5d0a9e	[SimplifyLibCalls] Propagate fast math flags while optimizing pow(). llvm-svn: 291577	2017-01-10 18:02:05 +00:00
Davide Italiano	bc2b08fb6d	[SimplifyLibCalls] pow(x, -0.5) -> 1.0 / sqrt(x). Differential Revision: https://reviews.llvm.org/D28479 llvm-svn: 291486	2017-01-09 21:55:23 +00:00
Sanjay Patel	3ac455e916	[InstCombine] add test to show missed fold using llvm.assume; NFC llvm-svn: 291472	2017-01-09 20:18:30 +00:00
Sanjay Patel	a6e75ebdd0	[InstCombine] regenerate checks; NFC llvm-svn: 291469	2017-01-09 19:43:26 +00:00
Sanjay Patel	02b3fa484b	[InstCombine] regenerate checks; NFC llvm-svn: 291464	2017-01-09 19:18:46 +00:00
Sanjay Patel	ac0df93034	[InstCombine] remove unnecessary attribute comments from test files; NFC llvm-svn: 291463	2017-01-09 19:13:38 +00:00
Matt Arsenault	b5e40e0ffe	SimplifyLibCalls: Remove incorrect optimization of fabs fabs(x * x) is not generally safe to assume x is positive if x is a NaN. This is also less general than it could be, so this will be replaced with a transformation on the intrinsic. llvm-svn: 291359	2017-01-07 19:55:12 +00:00
David Majnemer	56f8a7c1a4	[InstSimplify] Optimize away urems in the presence of range metadata We know that urem %V, C can be optimized away to %V if %V is ult C. llvm-svn: 291282	2017-01-06 21:23:51 +00:00
Sanjay Patel	bbb8bbf58b	[InstCombine] add a vector version of a test added in r291262; NFC llvm-svn: 291265	2017-01-06 19:14:05 +00:00
Sanjay Patel	e3702efb5f	[InstCombine] move and add tests for icmp + shl nsw; NFC As discussed here: http://lists.llvm.org/pipermail/llvm-dev/2017-January/108749.html ...we should be able to better optimize this pattern. llvm-svn: 291262	2017-01-06 18:57:54 +00:00
Matt Arsenault	750d75bf50	InstCombine: Fold cos(-x) -> cos(x) Also cos(fabs(x)) -> cos(x) llvm-svn: 291022	2017-01-04 22:49:03 +00:00
David Majnemer	eaee470355	[InstCombine] Add a test for r290733 llvm-svn: 290929	2017-01-04 02:21:37 +00:00
David Majnemer	1199f6c23b	[InstCombine] Move casts around shift operations It is possible to perform a left shift before zero extending if the shift would only shift out zeros. llvm-svn: 290928	2017-01-04 02:21:34 +00:00
David Majnemer	01104cb956	[InstCombine] Combine adds across a zext We can perform the following: (add (zext (add nuw X, C1)), C2) -> (zext (add nuw X, C1+C2)) This is only possible if C2 is negative and C2 is greater than or equal to negative C1. llvm-svn: 290927	2017-01-04 02:21:31 +00:00
Matt Arsenault	9fca19b426	InstCombine: Fold fabs on select of constants llvm-svn: 290913	2017-01-03 22:40:34 +00:00
Sanjay Patel	0428428089	[InstCombine] tighten checks for tests of assume -> metadata transform; NFC llvm-svn: 290903	2017-01-03 19:32:11 +00:00
Matt Arsenault	0915ad05ab	InstCombine: Add fma with constant transforms DAGCombine already does these. llvm-svn: 290860	2017-01-03 04:32:35 +00:00
Matt Arsenault	efa569bc4d	InstCombine: Add fma + fabs/fneg transforms fma (fneg x), (fneg y), z -> fma x, y, z fma (fabs x), (fabs x), z -> fma x, x, z llvm-svn: 290859	2017-01-03 04:32:31 +00:00
Sanjay Patel	d7fde6c48f	[InstCombine] add explanatory comment to test; NFC The test was added at r290797, and a patch to enable the transform is proposed in D28204. llvm-svn: 290798	2017-01-01 18:20:49 +00:00
Sanjay Patel	d9ee257edd	[InstCombine] add test to show potential nonnull attribute propagation; NFC This will change with the current draft of: https://reviews.llvm.org/D28204 llvm-svn: 290797	2017-01-01 17:18:00 +00:00
Craig Topper	2266c1322b	[InstCombine][AVX-512] Teach InstCombine that llvm.x86.avx512.vcomi.sd and llvm.x86.avx512.vcomi.ss don't use the upper elements of their input. This was already done for the SSE/SSE2 version of the intrinsics. llvm-svn: 290776	2016-12-31 00:45:06 +00:00
Craig Topper	412163fcd4	[InstCombine] Fix some of the AVX-512 scalar arithmetic test cases to do a better job of testing what they intended to test. The accidentally had trivially dead code. Also needed to adjust the rounding mode to not CUR_DIRECTION so the intrinsics don't get converted to native operations before going through SimplifyDemandedVectorElts. llvm-svn: 290702	2016-12-29 02:29:04 +00:00
Michael Kuperstein	04dff9bc7a	[InstCombine] Canonicalize insert splat sequences into an insert + shuffle This adds a combine that canonicalizes a chain of inserts which broadcasts a value into a single insert + a splat shufflevector. This fixes PR31286. Differential Revision: https://reviews.llvm.org/D27992 llvm-svn: 290641	2016-12-28 00:18:08 +00:00
George Burgess IV	1ca7f8d821	[Analysis] Ignore `nobuiltin` on `allocsize` function calls. We currently ignore the `allocsize` attribute on functions calls with the `nobuiltin` attribute when trying to lower `@llvm.objectsize`. We shouldn't care about `nobuiltin` here: `allocsize` is explicitly added by the user, not inferred based on a function's symbol. llvm-svn: 290588	2016-12-27 06:32:14 +00:00
Craig Topper	404ccdcfc2	[InstCombine][X86] Add DemandedElts support for 512-bit PMULDQ/PMULUDQ instructions PMULDQ/PMULUDQ vXi64 instructions only use the even numbered v2Xi32 input elements which SimplifyDemandedVectorElts should try and use. This builds on r290554 which added supported for 128 and 256-bit. llvm-svn: 290582	2016-12-27 05:30:09 +00:00

1 2 3 4 5 ...

2523 Commits