llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Craig Topper	a9fb877147	[InstCombine] Add test cases for D51398 These tests contain the pattern (neg (max ~X, C)) which we should transform to ((min X, ~C) + 1) llvm-svn: 341023	2018-08-30 06:14:54 +00:00
Craig Topper	d094523cbe	[X86] Make Feature64Bit useful We now only add +64bit to the CPU string for "generic" CPU. All other CPU names are assumed to have the feature flag already set if they support 64-bit. I've remove the implies from CMPXCHG8 so that Feature64Bit only comes in via CPUs or user passing -mattr=+64bit. I've changed the assert to a report_fatal_error so it's not lost in Release builds. The test updates are to fix things that tripped the new error. Differential Revision: https://reviews.llvm.org/D51231 llvm-svn: 341022	2018-08-30 06:01:05 +00:00
Philip Reames	457e69f814	[SimplifyCFG] Fix a cost modeling oversight in branch commoning The cost modeling was not accounting for the fact we were duplicating the instruction once per predecessor. With a default threshold of 1, this meant we were actually creating #pred copies. Adding to the fun, there is absolutely no test coverage for this. Simply bailing for more than one predecessor passes all checked in tests. llvm-svn: 341001	2018-08-30 00:03:02 +00:00
Reid Kleckner	ddf31c570a	Revert r340947 "[InstCombine] Expand the simplification of pow() into exp2()" It broke the clang-cl self-host. llvm-svn: 340991	2018-08-29 22:58:33 +00:00
Philip Reames	b57d5d0305	Add a todo and tests to Address a review commnt from D50925 [NFC] llvm-svn: 340978	2018-08-29 22:09:21 +00:00
Philip Reames	e4d2e304c4	[LICM] Hoist stores of invariant values to invariant addresses out of loops Teach LICM to hoist stores out of loops when the store writes to a location otherwise unused in the loop, writes a value which is invariant, and is guaranteed to execute if the loop is entered. Worth noting is that this transformation is partially overlapping with the existing promotion transformation. Reasons this is worthwhile anyway include: * For multi-exit loops, this doesn't require duplication of the store. * It kicks in for case where we can't prove we exit through a normal exit (i.e. we may throw), but can prove the store executes before that possible side exit. Differential Revision: https://reviews.llvm.org/D50925 llvm-svn: 340974	2018-08-29 21:49:30 +00:00
Fedor Sergeev	c3b4726b4b	[SimpleLoopUnswitch] After unswitch delete dead blocks in parent loops Summary: Assert from PR38737 happens on the dead block inside the parent loop after unswitching nontrivial switch in the inner loop. deleteDeadBlocksFromLoop now takes extra care to detect/remove dead blocks in all the parent loops in addition to the blocks from original loop being unswitched. Reviewers: asbirlea, chandlerc Reviewed By: asbirlea Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51415 llvm-svn: 340955	2018-08-29 19:10:44 +00:00
Sanjay Patel	f86719876c	[InstCombine] canonicalize fneg with llvm.sin This is a follow-up to rL339604 which did the same transform for a sin libcall. The handling of intrinsics vs. libcalls is unfortunately scattered, so I'm just adding this next to the existing transform for llvm.cos for now. This should resolve PR38458: https://bugs.llvm.org/show_bug.cgi?id=38458 If the call was already negated, the negates will cancel each other out. llvm-svn: 340952	2018-08-29 18:27:49 +00:00
Sanjay Patel	e4979e89e5	[InstCombine] add tests for llvm.sin(-x); NFC Also add a corresponding test for llvm.cos with FMF to make sure that was handled correctly. llvm-svn: 340950	2018-08-29 18:11:42 +00:00
Evandro Menezes	ebc4d37280	[InstCombine] Expand the simplification of pow() with nested exp{,2}() Expand the simplification of `pow(exp{,2}(x), y)` to all FP types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D51195 llvm-svn: 340948	2018-08-29 17:59:48 +00:00
Evandro Menezes	3cff71d444	[InstCombine] Expand the simplification of pow() into exp2() Generalize the simplification of `pow(2.0, y)` to `pow(2.0 ** n, y)` for all scalar and vector types. This improvement helps some benchmarks in SPEC CPU2000 and CPU2006, such as 252.eon, 447.dealII, 453.povray. Otherwise, no significant regressions on x86-64 or A64. Differential revision: https://reviews.llvm.org/D49273 llvm-svn: 340947	2018-08-29 17:59:34 +00:00
Sanjay Patel	4f40c047fe	[InstCombine] add test for vector demanded elements + shrinking; NFC llvm-svn: 340933	2018-08-29 15:34:19 +00:00
Hans Wennborg	88ab94c362	LoopSink: Don't sink into blocks without an insertion point (PR38462) In the PR, LoopSink was trying to sink into a catchswitch block, which doesn't have a valid insertion point. Differential Revision: https://reviews.llvm.org/D51307 llvm-svn: 340900	2018-08-29 06:55:27 +00:00
Zhaoshi Zheng	7f59505011	[QTOOL-37352] Consider isLegalAddressingImm in Constant Hoisting In Thumb1, legal imm range is [0, 255] for ADD/SUB instructions. However, the legal imm range for LD/ST in (R+Imm) addressing mode is [0, 127]. Imms in [128, 255] are materialized by mov R, #imm, and LD/STs use them in (R+R) addressing mode. This patch checks if a constant is used as offset in (R+Imm), if so, it checks isLegalAddressingMode passing the constant value as BaseOffset. Differential Revision: https://reviews.llvm.org/D50931 llvm-svn: 340882	2018-08-28 23:00:59 +00:00
Alina Sbirlea	da787801e8	[SimpleLoopUnswitch] Form dedicated exits after trivial unswitches. Summary: Form dedicated exits after trivial unswitches. Fixes PR38737, PR38283. Reviewers: chandlerc, fedor.sergeev Subscribers: sanjoy, jlebar, uabelho, llvm-commits Differential Revision: https://reviews.llvm.org/D51375 llvm-svn: 340871	2018-08-28 20:41:05 +00:00
Matt Arsenault	f5f26c463d	AMDGPU: Remove nan tests in class if src is nnan llvm-svn: 340850	2018-08-28 18:10:02 +00:00
Sanjay Patel	9dffb6589f	[InstCombine] fix baseline assertions rL340842 contained the wrong version of the check lines. llvm-svn: 340846	2018-08-28 17:23:20 +00:00
Sanjay Patel	0965eef0c0	[InstCombine] add tests for select narrowing (PR38691); NFC llvm-svn: 340842	2018-08-28 16:45:00 +00:00
David Bolvansky	64733b19f6	[Inliner] Attribute callsites with inline remarks Summary: Sometimes reading an output *.ll file it is not easy to understand why some callsites are not inlined. We can read output of inline remarks (option --pass-remarks-missed=inline) and try correlating its messages with the callsites. An easier way proposed by this patch is to add to every callsite processed by Inliner an attribute with the latest message that describes the cause of not inlining this callsite. The attribute is called //inline-remark//. By default this feature is off. It can be switched on by the option //-inline-remark-attribute//. For example in the provided test the result method //@test1// has two callsites //@bar// and inline remarks report different inlining missed reasons: remark: <unknown>:0:0: bar not inlined into test1 because too costly to inline (cost=-5, threshold=-6) remark: <unknown>:0:0: bar not inlined into test1 because it should never be inlined (cost=never): recursive It is not clear which remark correspond to which callsite. With the inline remark attribute enabled we get the reasons attached to their callsites: define void @test1() { call void @bar(i1 true) #0 call void @bar(i1 false) #2 ret void } attributes #0 = { "inline-remark"="(cost=-5, threshold=-6)" } .. attributes #2 = { "inline-remark"="(cost=never): recursive" } Patch by: yrouban (Yevgeny Rouban) Reviewers: xbolva00, tejohnson, apilipenko Reviewed By: xbolva00, tejohnson Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D50435 llvm-svn: 340834	2018-08-28 15:27:25 +00:00
Mikael Holmen	4e23f235e0	[CloneFunction] Constant fold terminators before checking single predecessor Summary: This fixes PR31105. There is code trying to delete dead code that does so by e.g. checking if the single predecessor of a block is the block itself. That check fails on a block like this bb: br i1 undef, label %bb, label %bb since that has two (identical) predecessors. However, after the check for dead blocks there is a call to ConstantFoldTerminator on the basic block, and that call simplifies the block to bb: br label %bb Therefore we now do the call to ConstantFoldTerminator before the check if the block is dead, so it can realize that it really is. The original behavior lead to the block not being removed, but it was simplified as above, and then we did a call to Dest->replaceAllUsesWith(&*I); with old and new being equal, and an assertion triggered. Reviewers: chandlerc, fhahn Reviewed By: fhahn Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D51280 llvm-svn: 340820	2018-08-28 12:40:11 +00:00
Craig Topper	a72b09e6fc	[InstCombine] Extend (add (sext x), cst) --> (sext (add x, cst')) and (add (zext x), cst) --> (zext (add x, cst')) to work for vectors Differential Revision: https://reviews.llvm.org/D51236 llvm-svn: 340796	2018-08-28 02:02:29 +00:00
Kit Barton	5fe828ca5e	[PPC] Remove Darwin support from POWER backend. This patch issues an error message if Darwin ABI is attempted with the PPC backend. It also cleans up existing test cases, either converting the test to use an alternative triple or removing the test if the coverage is no longer needed. Updated Tests ------------- The majority of test cases were updated to use a different triple that does not include the Darwin ABI. Many tests were also updated to use FileCheck, in place of grep. Deleted Tests ------------- llvm/test/tools/dsymutil/PowerPC/sibling.test was originally added to test specific functionality of dsymutil using an object file created with an old version of llvm-gcc for a Powerbook G4. After a discussion with @JDevlieghere he suggested removing the test. llvm/test/CodeGen/PowerPC/combine_loads_from_build_pair.ll was converted from a PPC test to a SystemZ test, as the behavior is also reproducible there. All other tests that were deleted were specific to the darwin/ppc ABI and no longer necessary. Phabricator Review: https://reviews.llvm.org/D50988 llvm-svn: 340795	2018-08-28 01:18:29 +00:00
Craig Topper	03eb14a1a6	[InstCombine] Add test cases for D51236. NFC llvm-svn: 340789	2018-08-27 22:55:49 +00:00
Sanjay Patel	3fe1ea1aae	[InstCombine] allow shuffle+binop canonicalization with widening shuffles This lines up with the behavior of an existing transform where if both operands of the binop are shuffled, we allow moving the binop before the shuffle regardless of whether the shuffle changes the size of the vector. llvm-svn: 340787	2018-08-27 22:41:44 +00:00
Evandro Menezes	9e7c2d3864	[PATCH] [InstCombine] Fix issue in the simplification of pow() with nested exp{,2}() Fix the issue of duplicating the call to `exp{,2}()` when it's nested in `pow()`, as exposed by rL340462. Differential revision: https://reviews.llvm.org/D51194 llvm-svn: 340784	2018-08-27 22:11:15 +00:00
Roman Tereshin	91525f438c	Revert "[SCEV][NFC] Check NoWrap flags before lexicographical comparison of SCEVs" This reverts r319889. Unfortunately, wrapping flags are not a part of SCEV's identity (they do not participate in computing a hash value or in equality comparisons) and in fact they could be assigned after the fact w/o rebuilding a SCEV. Grep for const_cast's to see quite a few of examples, apparently all for AddRec's at the moment. So, if 2 expressions get built in 2 slightly different ways: one with flags set in the beginning, the other with the flags attached later on, we may end up with 2 expressions which are exactly the same but have their operands swapped in one of the commutative N-ary expressions, and at least one of them will have "sorted by complexity" invariant broken. 2 identical SCEV's won't compare equal by pointer comparison as they are supposed to. A real-world reproducer is added as a regression test: the issue described causes 2 identical SCEV expressions to have different order of operands and therefore compare not equal, which in its turn prevents LoadStoreVectorizer from vectorizing a pair of consecutive loads. On a larger example (the source of the test attached, which is a bugpoint) I have seen even weirder behavior: adding a constant to an existing SCEV changes the order of the existing terms, for instance, getAddExpr(1, ((A * B) + (C * D))) returns (1 + (C * D) + (A * B)). Differential Revision: https://reviews.llvm.org/D40645 llvm-svn: 340777	2018-08-27 21:41:37 +00:00
Tim Renouf	a41e092d34	[AMDGPU] Add support for multi-dword s.buffer.load intrinsic Summary: Patch by Marek Olsak and David Stuttard, both of AMD. This adds a new amdgcn intrinsic supporting s.buffer.load, in particular multiple dword variants. These are convenient to use from some front-end implementations. Also modified the existing llvm.SI.load.const intrinsic to common up the underlying implementation. This modification also requires that we can lower to non-uniform loads correctly by splitting larger dword variants into sizes supported by the non-uniform versions of the load. V2: Addressed minor review comments. V3: i1 glc is now i32 cachepolicy for consistency with buffer and tbuffer intrinsics, plus fixed formatting issue. V4: Added glc test. Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D51098 Change-Id: I83a6e00681158bb243591a94a51c7baa445f169b llvm-svn: 340684	2018-08-25 14:53:17 +00:00
Sanjay Patel	461e9771b2	[InstCombine] add tests for shuffle+binop transform; NFC llvm-svn: 340683	2018-08-25 14:37:08 +00:00
Philip Reames	c6a0f43fb3	[CVP] Extend tests to illustrate an old patch isn't needed Back in https://reviews.llvm.org/D19559, I tried to teach CVP about range facts implied by value/value icmps (i.e. no constants.) In the meantime, we've implemented the optimization, but I couldn't find tests checked in, so adding them. llvm-svn: 340660	2018-08-24 21:56:43 +00:00
Xinliang David Li	c62cbab33d	[PGO] add target md5sum in warning message for icall Differential revision: http://reviews.llvm.org/D51193 llvm-svn: 340657	2018-08-24 21:38:24 +00:00
Eli Friedman	3be32ec89b	[SafeStack] Set debug location for calls to __safestack_pointer_address. Otherwise, the debug info is incorrect. On its own, this is mostly harmless, but the safe-stack also later inlines the call to __safestack_pointer_address, which leads to debug info with the wrong scope, which eventually causes an assertion failure (and incorrect debug info in release mode). Differential Revision: https://reviews.llvm.org/D51075 llvm-svn: 340651	2018-08-24 20:42:32 +00:00
David Bolvansky	d9c443e857	Revert [Inliner] Attribute callsites with inline remarks llvm-svn: 340619	2018-08-24 16:39:41 +00:00
David Bolvansky	43c40e97d4	[Inliner] Attribute callsites with inline remarks Summary: Sometimes reading an output *.ll file it is not easy to understand why some callsites are not inlined. We can read output of inline remarks (option --pass-remarks-missed=inline) and try correlating its messages with the callsites. An easier way proposed by this patch is to add to every callsite processed by Inliner an attribute with the latest message that describes the cause of not inlining this callsite. The attribute is called //inline-remark//. By default this feature is off. It can be switched on by the option //-inline-remark-attribute//. For example in the provided test the result method //@test1// has two callsites //@bar// and inline remarks report different inlining missed reasons: remark: <unknown>:0:0: bar not inlined into test1 because too costly to inline (cost=-5, threshold=-6) remark: <unknown>:0:0: bar not inlined into test1 because it should never be inlined (cost=never): recursive It is not clear which remark correspond to which callsite. With the inline remark attribute enabled we get the reasons attached to their callsites: define void @test1() { call void @bar(i1 true) #0 call void @bar(i1 false) #2 ret void } attributes #0 = { "inline-remark"="(cost=-5, threshold=-6)" } .. attributes #2 = { "inline-remark"="(cost=never): recursive" } Patch by: yrouban (Yevgeny Rouban) Reviewers: xbolva00, tejohnson, apilipenko Reviewed By: xbolva00, tejohnson Subscribers: eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D50435 llvm-svn: 340618	2018-08-24 16:28:36 +00:00
Philip Reames	7c4a4cfc1e	[LICM] Hoist an invariant_start out of loops if there are no stores executed before it Once the invariant_start is reached, we know that no instruction after it can modify the memory. So, if we can prove the location isn't read between entry into the loop and the execution of the invariant_start, we can execute the invariant_start before entering the loop. Differential Revision: https://reviews.llvm.org/D51181 llvm-svn: 340617	2018-08-24 16:24:48 +00:00
Florian Hahn	0f9f9d79b6	[Local] Make DoesKMove required for combineMetadata. This patch makes the DoesKMove argument non-optional, to force people to think about it. Most cases where it is false are either code hoisting or code sinking, where we pick one instruction from a set of equal instructions among different code paths. Reviewers: dberlin, nlopes, efriedma, davide Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D47475 llvm-svn: 340606	2018-08-24 11:40:04 +00:00
Craig Topper	da9171ac7d	[ValueTracking] Fix assert message and add test case for r340546 and PR38677. The bug was already fixed. This just adds a test case for it. llvm-svn: 340556	2018-08-23 17:45:53 +00:00
David Bolvansky	05aaf88531	[InstCombine] Fold Select with binary op - FP opcodes Summary: Follow up for https://reviews.llvm.org/rL339520 and https://reviews.llvm.org/rL338300 Alive: ``` %A = fcmp oeq float %x, 0.0 %B = fadd nsz float %x, %z %C = select i1 %A, float %B, float %y => %C = select i1 %A, float %z, float %y ---------- %A = fcmp oeq float %x, 0.0 %B = fadd nsz float %x, %z %C = select %A, float %B, float %y => %C = select %A, float %z, float %y Done: 1 Optimization is correct %A = fcmp une float %x, -0.0 %B = fadd nsz float %x, %z %C = select i1 %A, float %y, float %B => %C = select i1 %A, float %y, float %z ---------- %A = fcmp une float %x, -0.0 %B = fadd nsz float %x, %z %C = select %A, float %y, float %B => %C = select %A, float %y, float %z Done: 1 Optimization is correct ``` Reviewers: spatel, lebedev.ri Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D50714 llvm-svn: 340538	2018-08-23 15:22:15 +00:00
John Brawn	e70236adf8	[GVN] Invalidate cached info for phis when setting dead predecessors to undef When GVN sets the incoming value for a phi to undef because the incoming block is unreachable it needs to also invalidate the cached info for that phi in MemoryDependenceAnalysis, otherwise later queries will return stale information. Differential Revision: https://reviews.llvm.org/D51099 llvm-svn: 340529	2018-08-23 12:48:17 +00:00
Florian Hahn	1e1573359c	Recommit r333268: [IPSCCP] Use PredicateInfo to propagate facts from cmp instructions. This version of the patch fixes cleaning up ssa_copy intrinsics, so it does not crash for instructions in blocks that have been marked unreachable. This patch updates IPSCCP to use PredicateInfo to propagate facts to true branches predicated by EQ and to false branches predicated by NE. As a follow up, we should be able to extend it to also propagate additional facts about nonnull. Reviewers: davide, mssimpso, dberlin, efriedma Reviewed By: davide, dberlin Differential Revision: https://reviews.llvm.org/D45330 llvm-svn: 340525	2018-08-23 11:04:00 +00:00
David Bolvansky	4695717fdb	[LibCalls] Added returned attribute to libcalls Reviewers: efriedma Reviewed By: efriedma Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51092 llvm-svn: 340512	2018-08-23 05:18:23 +00:00
Craig Topper	999526d794	[ValueTracking] Teach computeNumSignBits to understand min/max clamp patterns with constant/splat values If we have a min/max pair we can do a better job of counting sign bits if we look at them together. This is similar to what is done in the SelectionDAG version of computeNumSignBits for ISD::SMAX/SMIN. Differential Revision: https://reviews.llvm.org/D51112 llvm-svn: 340480	2018-08-22 23:27:50 +00:00
Evandro Menezes	a86aeb26c4	[NFC] Expand test cases for simplifying pow() llvm-svn: 340462	2018-08-22 22:44:06 +00:00
Eli Friedman	d02128f555	[SafeStack] Handle unreachable code with safe stack coloring. Instead of asserting that the function doesn't have any unreachable code, just ignore it for the purpose of computing liveness. Differential Revision: https://reviews.llvm.org/D51070 llvm-svn: 340456	2018-08-22 21:38:57 +00:00
Alina Sbirlea	facf6eb783	Update MemorySSA in LoopSimplifyCFG. Summary: Add MemorySSA as a dependency to LoopSimplifyCFG and preserve it. Disabled by default until all passes preserve MemorySSA. Reviewers: bogner, chandlerc Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D50911 llvm-svn: 340445	2018-08-22 20:10:21 +00:00
Alina Sbirlea	0cb48ba57d	Update MemorySSA in LoopInstSimplify. Summary: Add MemorySSA as a depency to LoopInstInstSimplify and preserve it. Disabled by default until all passes preserve MemorySSA. Reviewers: chandlerc Subscribers: sanjoy, jlebar, Prazek, george.burgess.iv, llvm-commits Differential Revision: https://reviews.llvm.org/D50906 llvm-svn: 340444	2018-08-22 20:05:21 +00:00
Vedant Kumar	d10dbb804e	[CodeGenPrepare] Set debug locs when folding a comparison into a uadd.with.overflow CGP can replace a branch + select with a uadd.with.overflow. Teach it to set debug locations as it does this. llvm-svn: 340432	2018-08-22 18:15:03 +00:00
Max Kazantsev	c8eea01630	[GuardWidening] Ignore guards with trivial conditions Guard widening should not spend efforts on dealing with guards with trivial true/false conditions. Such guards can easily be eliminated by any further cleanup pass like instcombine. However we should not unconditionally delete them because it may be profitable to widen other conditions into such guards. Differential Revision: https://reviews.llvm.org/D50247 Reviewed By: fedor.sergeev llvm-svn: 340381	2018-08-22 02:40:49 +00:00
Vedant Kumar	5602b833c9	[CodeGenPrepare] Set debug loc when widening a switch condition Set a debug location on the cast instruction used to widen a switch condition. llvm-svn: 340379	2018-08-22 01:23:31 +00:00
Vedant Kumar	c0dd251e86	[CodeGenPrepare] Set debug locations when splitting selects When splitting a select into a diamond, set debug locations on newly-created branch instructions and phi nodes. llvm-svn: 340371	2018-08-22 00:10:37 +00:00
Vedant Kumar	8314ccb02a	[CodeGenPrepare] Clean up dbg.value use-before-def as late as possible CodeGenPrepare has a strategy for moving dbg.values so that a value's definition always dominates its debug users. This cleanup was happening too early (before certain CGP transforms were run), resulting in some dbg.value use-before-def errors. Perform this cleanup as late as possible to avoid use-before-def. llvm-svn: 340370	2018-08-21 23:43:08 +00:00

1 2 3 4 5 ...

11354 Commits