llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	3b4ac7dc2a	[InstCombine][X86][AVX] Add DemandedElts support for VPERMILPD/VPERMILPS instructions Simplify a vpermilvar shuffle mask based on the elements of the mask that are actually demanded. llvm-svn: 292209	2017-01-17 11:35:03 +00:00
Sanjoy Das	1728d455cf	[InstCombine] Don't DSE across readnone functions that may throw Summary: Depends on D28740 Reviewers: dberlin, chandlerc, hfinkel, majnemer Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D28742 llvm-svn: 292197	2017-01-17 05:45:09 +00:00
David Majnemer	b33ccfb48d	[InstCombine] Fold ((C1-zext(X)) & C2) -> zext((C1-X) & C2) This is valid if C2 fits within the bitwidth of X thanks to two's complement modulo arithmetic. llvm-svn: 292179	2017-01-17 00:45:57 +00:00
Matt Arsenault	a568a0d4a6	Add comment to test file I forgot to save llvm-svn: 292178	2017-01-17 00:35:28 +00:00
Matt Arsenault	018a8adda0	SimplifyLibCalls: Remove checks for fabs Use the intrinsic instead of emitting the libcall which will be replaced by the intrinsic. llvm-svn: 292176	2017-01-17 00:30:31 +00:00
Matt Arsenault	ccfb3dd68e	SimplifyLibCalls: Replace fabs libcalls with intrinsics Add missing fabs(fpext) optimzation that worked with the call, and also fixes it creating a second fpext when there were multiple uses. llvm-svn: 292172	2017-01-17 00:10:40 +00:00
Simon Pilgrim	b3d302c9e0	[InstCombine][AVX] Tests showing missed opportunities to pass demanded elts through a permilpd/permilps shuffle mask llvm-svn: 292165	2017-01-16 21:34:22 +00:00
Sanjay Patel	ab8f315ba8	[InstCombine] use m_APInt to allow shift-shift folds for vectors with splat constants Some existing 'FIXME' tests are still not folded because of splat holes in value tracking. llvm-svn: 292151	2017-01-16 19:35:45 +00:00
Sanjay Patel	1ec279d94f	[InstCombine] add tests to show missed vector folds; NFC The shift-shift possibilities became easier to see after: https://reviews.llvm.org/rL292145 llvm-svn: 292150	2017-01-16 19:23:34 +00:00
Simon Pilgrim	cdda17446c	[InstCombine][SSE] Tests showing missed opportunities to pass demanded elts through a packss/packus truncation llvm-svn: 292144	2017-01-16 17:26:23 +00:00
Simon Pilgrim	9d900b5f6e	[InstCombine][SSE] Add DemandedElts support for PSHUFB instructions Simplify a pshufb shuffle mask based on the elements of the mask that are actually demanded. Differential Revision: https://reviews.llvm.org/D28745 llvm-svn: 292101	2017-01-16 11:30:41 +00:00
Sanjay Patel	3b866f23f8	[InstCombine] add tests to show missed vector folds; NFC Also, add comments and remove bogus comment. llvm-svn: 292082	2017-01-15 23:45:03 +00:00
Simon Pilgrim	c1b9ed731d	[InstCombine][SSE] Tests showing missed opportunities to pass demanded elts through a pshufb shuffle mask llvm-svn: 292072	2017-01-15 17:49:04 +00:00
Sanjay Patel	5495465040	[InstCombine] use m_APInt to allow ashr folds for vectors with splat constants llvm-svn: 292064	2017-01-15 16:38:19 +00:00
Sanjay Patel	f8ca2aa064	[InstCombine] add explanatory comments to tests; NFC llvm-svn: 292063	2017-01-15 16:22:26 +00:00
Chandler Carruth	ff4cd4e1d6	[PM] Clean up the testing for IVUsers, especially with the new PM. First, I've moved a test of IVUsers from the LSR tree to a dedicated IVUsers test directory. I've also simplified its RUN line now that the new pass manager's loop PM is providing analyses on their own. No functionality changed, but it makes subsequent changes cleaner. llvm-svn: 292060	2017-01-15 09:29:27 +00:00
Chandler Carruth	fa2e854de5	[PM] Fix instcombine's analysis preservation in the new pass manager to cover domtree and alias analysis. These are the pretty clear analyses that we would always want to survive this pass. To make these survive, we also need to preserve the assumption cache. Added a test that verifies the important bits of this preservation. llvm-svn: 292037	2017-01-14 23:25:22 +00:00
Sanjay Patel	286505799a	[InstCombine] add test to show missed vector fold; NFC llvm-svn: 292035	2017-01-14 23:12:29 +00:00
Daniel Berlin	c6a5e7ac11	NewGVN: Fix PR31613 test regex naming llvm-svn: 291979	2017-01-13 23:54:10 +00:00
Sanjay Patel	c20cf10406	[InstCombine] optimize unsigned icmp of increment Allows LLVM to optimize sequences like the following: %add = add nuw i32 %x, 1 %cmp = icmp ugt i32 %add, %y Into: %cmp = icmp uge i32 %x, %y Previously, only signed comparisons were being handled. Decrements could also be handled, but 'sub nuw %x, 1' is currently canonicalized to 'add %x, -1' in InstCombineAddSub, losing the nuw flag. Removing that canonicalization seems like it might have far-reaching ramifications so I kept this simple for now. Patch by Matti Niemenmaa! Differential Revision: https://reviews.llvm.org/D24700 llvm-svn: 291975	2017-01-13 23:25:46 +00:00
Sanjay Patel	fcb9044bb8	[InstCombine] use m_APInt to allow lshr folds for vectors with splat constants llvm-svn: 291972	2017-01-13 23:04:10 +00:00
Sanjay Patel	4485ac55e2	[InstCombine / InstSimplify] add and move tests for lshr transforms; NFC llvm-svn: 291970	2017-01-13 22:54:12 +00:00
Daniel Berlin	2710b40fb3	NewGVN: Move leaders around properly to ensure we have a canonical dominating leader. Fixes PR 31613. Summary: This is a testcase where phi node cycling happens, and because we do not order the leaders by domination or anything similar, the leader keeps changing. Using std::set for the members is too expensive, and we actually don't need them sorted all the time, only at leader changes. We could keep both a set and a vector, and keep them mostly sorted and resort as necessary, or use a set and a fibheap, but all of this seems premature. After running some statistics, we are able to avoid the vast majority of sorting by keeping a "next leader" field. Most congruence classes only have leader changes once or twice during GVN. Reviewers: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D28594 llvm-svn: 291968	2017-01-13 22:40:01 +00:00
David Majnemer	2c4f02adc6	[LoopStrengthReduce] Don't bother rewriting PHIs in catchswitch blocks The catchswitch instruction cannot be split, don't bother trying to rewrite it. This fixes PR31627. llvm-svn: 291966	2017-01-13 22:24:27 +00:00
Sanjay Patel	3edc9b2bd9	[InstCombine] use m_APInt to allow shl folds for vectors with splat constants llvm-svn: 291934	2017-01-13 18:39:09 +00:00
Sanjay Patel	68dd981647	[InstCombine] add tests to show missing transforms for vector shl; NFC llvm-svn: 291926	2017-01-13 18:27:23 +00:00
Sanjay Patel	ee734847d0	[InstCombine] if the condition of a select may be known via assumes, eliminate the select This is a limited solution for PR31512: https://llvm.org/bugs/show_bug.cgi?id=31512 The motivation is that we will need to increase usage of llvm.assume and/or metadata to solve PR28430: https://llvm.org/bugs/show_bug.cgi?id=28430 ...and this kind of simplification is needed to take advantage of that extra information. The 'not' test case would be handled by: https://reviews.llvm.org/D28485 Differential Revision: https://reviews.llvm.org/D28337 llvm-svn: 291915	2017-01-13 17:02:42 +00:00
Adam Nemet	7258bda633	Move test of lazy BFI with ORE to a generic directory llvm-svn: 291862	2017-01-13 00:16:23 +00:00
Piotr Padlewski	9032a3fc4f	[Devirtualization] MemDep returns non-local !invariant.group dependencies Summary: Memory Dependence Analysis was limited to return only local dependencies for invariant.group handling. Now it returns NonLocal when it finds it and then by asking getNonLocalPointerDependency we get found dep. Thanks to this we are able to devirtualize loops! void indirect(A &a, int n) { for (int i = 0 ; i < n; i++) a.foo(); } void test(int n) { A a; indirect(a); } After inlining a.foo() will be changed to direct call, even if foo and A::A() is external (but only if vtable definition is be available). Reviewers: nlewycky, dberlin, chandlerc, rsmith Subscribers: mehdi_amini, davide, llvm-commits Differential Revision: https://reviews.llvm.org/D28137 llvm-svn: 291762	2017-01-12 11:33:58 +00:00
Chandler Carruth	ccc380f775	Make a test actually test what it set out to test. This test seems to have largely been relying on asserts being tripped. It had a very specific and somewhat uninteresting grep of the output, but it never really did anything to cause SCEV to be preserved across loop simplify, certainly not explicitly. And a later addition to it actually added CHECK lines despite the test never running FileCheck. Now we actually print SCEV before and after loop simplify to make sure it is changing and being updated. Which seems to be much more likely the point of the test. llvm-svn: 291740	2017-01-12 03:49:07 +00:00
Michael Kuperstein	7da400e4db	Add test that verifies we don't peel loops in optsize functions. NFC. llvm-svn: 291708	2017-01-11 21:42:51 +00:00
Peter Collingbourne	396740fa88	LowerTypeTests: Represent the memory region size with the constant size-1. This means that we can use a shorter instruction sequence in the case where the size is a power of two and on the boundary between two representations. Differential Revision: https://reviews.llvm.org/D28421 llvm-svn: 291706	2017-01-11 21:32:10 +00:00
Peter Collingbourne	8069f39125	Re-apply r291205, "LowerTypeTests: Split the pass in two: a resolution phase and a lowering phase.", with a fix for an off-by-one error. llvm-svn: 291699	2017-01-11 20:28:46 +00:00
Daniel Berlin	b15f304291	NewGVN: Fix PR31594, by tracking the store count of congruence classes, and updating checking to allow for equivalence through reachability. (Sadly, the checking here is not perfect, and can't be made perfect, so we'll have to disable it after we are satisfied with correctness. Right now it is just "very unlikely" to happen.) llvm-svn: 291698	2017-01-11 20:22:36 +00:00
Rong Xu	48a9ec1cd5	Resubmit "[PGO] Turn off comdat renaming in IR PGO by default" This patch resubmits the changes in r291588. llvm-svn: 291696	2017-01-11 20:19:41 +00:00
Michael Kuperstein	172556c384	[SLP] Remove bogus assert. The removed assert seems bogus - it's perfectly legal for the roots of the vectorized subtrees to be equal even if the original scalar values aren't, if the original scalars happen to be equivalent. This fixes PR31599. Differential Revision: https://reviews.llvm.org/D28539 llvm-svn: 291692	2017-01-11 19:23:57 +00:00
Ivan Krasin	78668e7030	Revert rL291205 because it breaks Chrome tests under CFI. Summary: Revert LowerTypeTests: Split the pass in two: a resolution phase and a lowering phase. This change separates how type identifiers are resolved from how intrinsic calls are lowered. All information required to lower an intrinsic call is stored in a new TypeIdLowering data structure. The idea is that this data structure can either be initialized using the module itself during regular LTO, or using the module summary in ThinLTO backends. Original URL: https://reviews.llvm.org/D28341 Reviewers: pcc Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D28532 llvm-svn: 291684	2017-01-11 16:54:04 +00:00
Mohammed Agabaria	007bbd87af	[X86] fixing failed test in commit: r291657 Missing Requires asserts. llvm-svn: 291659	2017-01-11 09:03:11 +00:00
Mohammed Agabaria	df301aa885	[X86] updating TTI costs for arithmetic instructions on X86\SLM arch. updated instructions: pmulld, pmullw, pmulhw, mulsd, mulps, mulpd, divss, divps, divsd, divpd, addpd and subpd. special optimization case which replaces pmulld with pmullw\pmulhw\pshuf seq. In case if the real operands bitwidth <= 16. Differential Revision: https://reviews.llvm.org/D28104 llvm-svn: 291657	2017-01-11 08:23:37 +00:00
Adam Nemet	f483225ed9	[LICM] Report failing to hoist conditionally-executed loads These are interesting again because the user may not be aware that this is a common reason preventing LICM. A const is removed from an instruction pointer declaration in order to pass it to ORE. Differential Revision: https://reviews.llvm.org/D27940 llvm-svn: 291649	2017-01-11 04:39:49 +00:00
Adam Nemet	8eb8ca7ca4	[LICM] Report failing to hoist a load with an invariant address These are interesting because lack of precision in alias information could be standing in the way of this optimization. An example is the case in the test suite that I showed in the DevMeeting talk: http://lab.llvm.org:8080/artifacts/opt-view_test-suite/build/MultiSource/Benchmarks/FreeBench/distray/CMakeFiles/distray.dir/html/_org_test-suite_MultiSource_Benchmarks_FreeBench_distray_distray.c.html#L236 canSinkOrHoistInst is also used from LoopSink, which does not use opt-remarks so we need to take ORE as an optional argument. Differential Revision: https://reviews.llvm.org/D27939 llvm-svn: 291648	2017-01-11 04:39:45 +00:00
Adam Nemet	7d55194036	[LICM] Report successful hoist/sink/promotion Differential Revision: https://reviews.llvm.org/D27938 llvm-svn: 291646	2017-01-11 04:39:35 +00:00
Matt Arsenault	588e04537c	InstSimplify: Eliminate fabs on known positive llvm-svn: 291624	2017-01-11 00:33:24 +00:00
Rong Xu	7d746ec198	Revert "[PGO] Turn off comdat renaming in IR PGO by default" This patch reverts r291588: [PGO] Turn off comdat renaming in IR PGO by default, as we are seeing some hash mismatches in our internal tests. llvm-svn: 291621	2017-01-10 23:54:31 +00:00
Matt Arsenault	0be1430dec	InstCombine: fdiv -x, -y -> fdiv x, y llvm-svn: 291611	2017-01-10 23:08:54 +00:00
Michael Kuperstein	e2a21ff3e2	[LV] Don't panic when encountering the IV of an outer loop. Bail out instead of asserting when we encounter this situation, which can actually happen. The reason the test uses the new PM is that the "bad" phi, incidentally, gets cleaned up by LoopSimplify. But LICM can create this kind of phi and preserve loop simplify form, so the cleanup has no chance to run. This fixes PR31190. We may want to solve this in a less conservative manner, since this phi is actually uniform within the inner loop (or we may want LICM to output a cleaner promotion to begin with). Differential Revision: https://reviews.llvm.org/D28490 llvm-svn: 291589	2017-01-10 19:32:30 +00:00
Rong Xu	92b1fe0567	[PGO] Turn off comdat renaming in IR PGO by default Summary: In IR PGO we append the function hash to comdat functions to avoid the potential hash mismatch. This turns out not legal in some cases: if the comdat function is address-taken and used in comparison. Renaming changes the semantic. This patch turns off comdat renaming by default. To alleviate the hash mismatch issue, we now rename the profile variable for comdat functions. Profile allows co-existing multiple versions of profiles with different hash value. The inlined copy will always has the correct profile counter. The out-of-line copy might not have the correct count. But we will not have the bogus mismatch warning. Reviewers: davidxl Subscribers: llvm-commits, xur Differential Revision: https://reviews.llvm.org/D28416 llvm-svn: 291588	2017-01-10 19:30:20 +00:00
Davide Italiano	ba6e5d0a9e	[SimplifyLibCalls] Propagate fast math flags while optimizing pow(). llvm-svn: 291577	2017-01-10 18:02:05 +00:00
Serge Pavlov	dda5fe0374	[StructurizeCfg] Update dominator info. In some cases StructurizeCfg updates root node, but dominator info remains unchanges, it causes crash when expensive checks are enabled. To cope with this problem a new method was added to DominatorTreeBase that allows adding new root nodes, it is called in StructurizeCfg to put dominator tree in sync. This change fixes PR27488. Differential Revision: https://reviews.llvm.org/D28114 llvm-svn: 291530	2017-01-10 02:50:47 +00:00
Davide Italiano	bc2b08fb6d	[SimplifyLibCalls] pow(x, -0.5) -> 1.0 / sqrt(x). Differential Revision: https://reviews.llvm.org/D28479 llvm-svn: 291486	2017-01-09 21:55:23 +00:00

1 2 3 4 5 ...

8095 Commits