llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Serguei Katkov	0a1b40ee08	[LICM] Avoid duplicate work during building AliasSetTracker Currently we re-use cached info from sub loops or traverse them to populate AliasSetTracker. But after that we traverse all basic blocks from the current loop. This is redundant work. All what we need is traversing the all basic blocks from the loop except those which are used to get the data from the cache. This should improve compile time only. Reviewers: mkazantsev, reames, kariddi, anna Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51715 llvm-svn: 341896	2018-09-11 04:07:36 +00:00
Max Kazantsev	773d488805	[IndVars][NFC] Refactor to make modifications of Changed transparent IndVarSimplify's design is somewhat odd in the way how it reports that some transform has made a change. It has a `Changed` field which can be set from within any function, which makes it hard to track whether or not it was set properly after a transform was made. It leads to oversights in setting this flag where needed, see example in PR38855. This patch removes the `Changed` field, turns it into a local and unifies the signatures of all relevant transform functions to return boolean value which designates whether or not this transform has made a change. Differential Revision: https://reviews.llvm.org/D51850 Reviewed By: skatkov llvm-svn: 341893	2018-09-11 03:57:22 +00:00
Philip Reames	dd0d8caa29	[LICM] (re-)simplify code using MemoryLocation API [NFC] I'd made exactly this same change before, but it appears to have been accidentally reverted in another change. (I'm assuming accidental since it was without comment or test case, and in an unrelated change.) llvm-svn: 341892	2018-09-11 03:28:28 +00:00
Alina Sbirlea	6cbdd18a8b	[InstCombine] Partially revert rL341674 due to PR38897. Summary: Revert min/max changes in rL341674 dues to high compile times causing timeouts (PR38897). Checking in to unblock failing builds. Patch available for post-commit review and re-revert once resolved. Working on a smaller reproducer for PR38897. Reviewers: craig.topper, spatel Subscribers: sanjoy, jlebar, llvm-commits Differential Revision: https://reviews.llvm.org/D51897 llvm-svn: 341883	2018-09-10 23:47:21 +00:00
Sanjay Patel	6ec4a8b575	[InstCombine] use SelectInst operand names to make code clearer; NFC Cleanup step for D51433. llvm-svn: 341850	2018-09-10 18:37:59 +00:00
Sebastian Pop	ccc9f17f1f	HotColdSplitting: check that target supports cold calling convention Before tagging a function with coldcc make sure the target supports cold calling convention. Without this patch HotColdSplitting pass fails on aarch64 with: fatal error: error in backend: Unsupported calling convention. llvm-svn: 341838	2018-09-10 15:08:02 +00:00
Sebastian Pop	4658a91b60	add flag instead of using a constant [NFC] llvm-svn: 341837	2018-09-10 15:07:59 +00:00
Sebastian Pop	b43ec5a3a5	make flag name more specific to gvn [NFC] llvm-svn: 341836	2018-09-10 15:07:56 +00:00
Tim Northover	0f17fdc4c5	InstCombine: move hasOneUse check to the top of foldICmpAddConstant There were two combines not covered by the check before now, neither of which actually differed from normal in the benefit analysis. The most recent seems to be because it was just added at the top of the function (naturally). The older is from way back in 2008 (r46687) when we just didn't put those checks in so routinely, and has been diligently maintained since. llvm-svn: 341831	2018-09-10 14:26:44 +00:00
Benjamin Kramer	f196b4cee0	Don't create a temporary vector of loop blocks just to iterate over them. Loop's getBlocks returns an ArrayRef. llvm-svn: 341821	2018-09-10 12:32:06 +00:00
John Brawn	10ecfe9801	[GVN] Invalidate cached info for values replaced by equality propagation When GVN propagates an equality by replacing one value with another it also needs to invalidate the cached information for the value being replaced. Differential Revision: https://reviews.llvm.org/D51218 llvm-svn: 341820	2018-09-10 12:23:05 +00:00
Max Kazantsev	bddb12a8cf	[IndVars] Set Changed if rewriteFirstIterationLoopExitValues changes IR. PR38863 Currently, `rewriteFirstIterationLoopExitValues` does not set Changed flag even if it makes changes in the IR. There is no clear evidence that it can cause a crash, but it looks highly suspicious and likely invalid. Differential Revision: https://reviews.llvm.org/D51779 Reviewed By: skatkov llvm-svn: 341779	2018-09-10 06:50:16 +00:00
Max Kazantsev	c9fa24d06f	[IndVars] Set Changed if sinkUnusedInvariants changes IR. PR38863 Currently, `sinkUnusedInvariants` does not set Changed flag even if it makes changes in the IR. There is no clear evidence that it can cause a crash, but it looks highly suspicious and likely invalid. Differential Revision: https://reviews.llvm.org/D51777 Reviewed By: skatkov llvm-svn: 341777	2018-09-10 06:32:00 +00:00
Vikram TV	bbc52ae47d	Move a transformation routine from LoopUtils to LoopVectorize. Summary: Move InductionDescriptor::transform() routine from LoopUtils to its only uses in LoopVectorize.cpp. Specifically, the function is renamed as InnerLoopVectorizer::emitTransformedIndex(). This is a child to D51153. Reviewers: dmgreen, llvm-commits Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D51837 llvm-svn: 341776	2018-09-10 06:16:44 +00:00
Vikram TV	d95d133618	Move createMinMaxOp() out of RecurrenceDescriptor. Reviewers: dmgreen, llvm-commits Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D51838 llvm-svn: 341773	2018-09-10 05:05:08 +00:00
Abderrazek Zaafrani	58be3eed37	[SimplifyIndVar] Avoid generating truncate instructions with non-hoisted Laod operand. Differential Revision: https://reviews.llvm.org/D49151 llvm-svn: 341726	2018-09-07 22:41:57 +00:00
Alina Sbirlea	04e7e7b515	[MemorySSA] Update MemoryPhi wiring for block splitting to consider if identical edges were merged. Summary: Block splitting is done with either identical edges being merged, or not. Only critical edges can be split without merging identical edges based on an option. Teach the memoryssa updater to take this into account: for the same edge between two blocks only move one entry from the Phi in Old to the new Phi in New. Reviewers: george.burgess.iv Subscribers: sanjoy, jlebar, Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D51563 llvm-svn: 341709	2018-09-07 21:14:48 +00:00
Sanjay Patel	25432f7c6b	[InstCombine] narrow vector select with padded condition and extracted result (PR38691) shuf (sel (shuf NarrowCond, undef, WideMask), X, Y), undef, NarrowMask) --> sel NarrowCond, (shuf X, undef, NarrowMask), (shuf Y, undef, NarrowMask) The motivating case from: https://bugs.llvm.org/show_bug.cgi?id=38691 ...is the last regression test. In that case, we're just left with the narrow select. Note that if we do create new shuffles, they use the existing extraction identity mask, so there's no danger that this transform creates arbitrary shuffles. Differential Revision: https://reviews.llvm.org/D51496 llvm-svn: 341708	2018-09-07 21:03:34 +00:00
Fangrui Song	489406cdb3	[PGO] Fix some style issue of ControlHeightReduction Reviewers: yamauchi Reviewed By: yamauchi Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51811 llvm-svn: 341702	2018-09-07 20:23:15 +00:00
Hiroshi Yamauchi	96d59fd89c	[PGO][CHR] Build/warning fix llvm-svn: 341692	2018-09-07 18:44:53 +00:00
JF Bastien	ed41f749a7	NFC: remove magic bool in LoopIdiomRecognize Use an enum class instead. llvm-svn: 341684	2018-09-07 18:17:59 +00:00
Hiroshi Yamauchi	0646802061	[PGO][CHR] Small cleanup. Summary: Do away with demangling. It wasn't really necessary. Declared some local functions to be static. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51740 llvm-svn: 341681	2018-09-07 18:00:58 +00:00
Craig Topper	15698b21e0	[InstCombine] Fold (min/max ~X, Y) -> ~(max/min X, ~Y) when Y is freely invertible If the ~X wasn't able to simplify above the max/min, we might be able to simplify it by moving it below the max/min. I had to modify the ~(min/max ~X, Y) transform to prevent getting stuck in a loop when we saw the new ~(max/min X, ~Y) before the ~Y had been folded away to remove the new not. Differential Revision: https://reviews.llvm.org/D51398 llvm-svn: 341674	2018-09-07 16:19:50 +00:00
Anna Thomas	a0399f5bed	[LV] Fix code gen for conditionally executed loads and stores Fix a latent bug in loop vectorizer which generates incorrect code for memory accesses that are executed conditionally. As pointed in review, this bug definitely affects uniform loads and may affect conditional stores that should have turned into scatters as well). The code gen for conditionally executed uniform loads on architectures that support masked gather instructions is broken. Without this patch, we were unconditionally executing the conditional load in the vectorized version. This patch does the following: 1. Uniform conditional loads on architectures with gather support will have correct code generated. In particular, the cost model (setCostBasedWideningDecision) is fixed. 2. For the recipes which are handled after the widening decision is set, we use the isScalarWithPredication(I, VF) form which is added in the patch. 3. Fix the vectorization cost model for scalarization (getMemInstScalarizationCost): implement and use isPredicatedInst to identify all predicated instructions, not just scalar+predicated. So, now the cost for scalarization will be increased for maskedloads/stores and gather/scatter operations. In short, we should be choosing the gather/scatter in place of scalarization on archs where it is profitable. 4. We needed to weaken the assert in useEmulatedMaskMemRefHack. Reviewers: Ayal, hsaito, mkuper Differential Revision: https://reviews.llvm.org/D51313 llvm-svn: 341673	2018-09-07 15:53:48 +00:00
Aditya Kumar	e8a2c26b1b	Hot cold splitting pass Find cold blocks based on profile information (or optionally with static analysis). Forward propagate profile information to all cold-blocks. Outline a cold region. Set calling conv and prof hint for the callsite of the outlined function. Worked in collaboration with: Sebastian Pop <s.pop@samsung.com> Differential Revision: https://reviews.llvm.org/D50658 llvm-svn: 341669	2018-09-07 15:03:49 +00:00
Florian Hahn	4480a19a6e	[InstCombine] Do not fold scalar ops over select with vector condition. If OtherOpT or OtherOpF have scalar types and the condition is a vector, we would create an invalid select. Reviewers: spatel, john.brawn, mssimpso, craig.topper Reviewed By: spatel Differential Revision: https://reviews.llvm.org/D51781 llvm-svn: 341666	2018-09-07 14:40:06 +00:00
Florian Hahn	61fb4d5bf2	[NewGVN] Mark function as changed if we erase instructions. Currently eliminateInstructions only returns true if any instruction got replaced. In the test case for this patch, we eliminate the trivially dead calls, for which eliminateInstructions not do a replacement and the function is not marked as changed, which is why the inliner crashes while traversing the call graph. Alternatively we could also change eliminateInstructions to return true in case we mark instructions for deletion, but that's slightly more code and doing it at the place where the replacement happens seems safer. Fixes PR37517. Reviewers: davide, mcrosier, efriedma, bjope Reviewed By: bjope Differential Revision: https://reviews.llvm.org/D51169 llvm-svn: 341651	2018-09-07 11:41:34 +00:00
Alexander Potapenko	2d11f5dd93	[MSan] don't access MsanCtorFunction when using KMSAN MSan has found a use of uninitialized memory in MSan, fix it. llvm-svn: 341646	2018-09-07 09:56:36 +00:00
Alexander Potapenko	d0446d3d9b	[MSan] Add KMSAN instrumentation to MSan pass Introduce the -msan-kernel flag, which enables the kernel instrumentation. The main differences between KMSAN and MSan instrumentations are: - KMSAN implies msan-track-origins=2, msan-keep-going=true; - there're no explicit accesses to shadow and origin memory. Shadow and origin values for a particular X-byte memory location are read and written via pointers returned by __msan_metadata_ptr_for_load_X(u8 addr) and __msan_store_shadow_origin_X(u8 addr, uptr shadow, uptr origin); - TLS variables are stored in a single struct in per-task storage. A call to a function returning that struct is inserted into every instrumented function before the entry block; - __msan_warning() takes a 32-bit origin parameter; - local variables are poisoned with __msan_poison_alloca() upon function entry and unpoisoned with __msan_unpoison_alloca() before leaving the function; - the pass doesn't declare any global variables or add global constructors to the translation unit. llvm-svn: 341637	2018-09-07 09:10:30 +00:00
Max Kazantsev	187869ae5d	[IndVars] Set Changed when we delete dead instructions. PR38855 IndVars does not set `Changed` flag when it eliminates dead instructions. As result, it may make IR modifications and report that it has done nothing. It leads to inconsistent preserved analyzes results. Differential Revision: https://reviews.llvm.org/D51770 Reviewed By: skatkov llvm-svn: 341633	2018-09-07 07:23:39 +00:00
Wei Mi	47a041af4c	[SampleFDO] Make sample profile loader unaware of compact format change. The patch tries to make sample profile loader independent of profile format change. It moves compact format related code into FunctionSamples and SampleProfileReader classes, and sample profile loader only has to interact with those two classes and will be unaware of profile format changes. The cleanup also contain some fixes to further remove the difference between compactbinary format and binary format. After the cleanup using different formats originated from the same profile will generate the same binaries, which we verified by compiling two large server benchmarks w/wo thinlto. Differential Revision: https://reviews.llvm.org/D51643 llvm-svn: 341591	2018-09-06 22:03:37 +00:00
Sanjay Patel	7acef232ba	[InstCombine] add xor+not folds This fold is needed to avoid a regression when we try to recommit rL300977. We can't see the most basic win currently because demanded bits changes the patterns: https://rise4fun.com/Alive/plpp llvm-svn: 341559	2018-09-06 16:23:40 +00:00
Alexander Potapenko	c01b469493	[MSan] store origins for variadic function parameters in __msan_va_arg_origin_tls Add the __msan_va_arg_origin_tls TLS array to keep the origins for variadic function parameters. Change the instrumentation pass to store parameter origins in this array. This is a reland of r341528. test/msan/vararg.cc doesn't work on Mips, PPC and AArch64 (because this patch doesn't touch them), XFAIL these arches. Also turned out Clang crashed on i80 vararg arguments because of incorrect origin type returned by getOriginPtrForVAArgument() - fixed it and added a test. llvm-svn: 341554	2018-09-06 15:14:36 +00:00
Sanjay Patel	4b83f1a634	[InstCombine] fix formatting in SimplifyDemandedVectorElts->Select; NFCI I'm preparing to add the same functionality both here and to the DAG version of this code in D51696 / D51433, so try to make those cases as similar as possible to avoid bugs. llvm-svn: 341545	2018-09-06 13:19:22 +00:00
Alexander Potapenko	77c2634f8a	[MSan] revert r341528 to unbreak the bots llvm-svn: 341541	2018-09-06 12:19:27 +00:00
Florian Hahn	601db38419	[LoopInterchange] Cleanup unused variables. llvm-svn: 341537	2018-09-06 10:41:01 +00:00
Florian Hahn	e8c0bf5c66	[LoopInterchange] Move preheader creation to transform stage and simplify. There is no need to create preheaders in the analysis stage, we only need them when adjusting the branches. Also, the only cases we need to create our own preheaders is when they have more than 1 predecessors or PHI nodes (even with only 1 predecessor, we could have an LCSSA phi node). I have simplified the conditions and added some assertions to be sure. Because we know the inner and outer loop need to be tightly nested, it is sufficient to check if the inner loop preheader is the outer loop header to check if we need to create a new preheader. Reviewers: efriedma, mcrosier, karthikthecool Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D51703 llvm-svn: 341533	2018-09-06 09:57:27 +00:00
Alexander Potapenko	df6d00d165	[MSan] store origins for variadic function parameters in __msan_va_arg_origin_tls Add the __msan_va_arg_origin_tls TLS array to keep the origins for variadic function parameters. Change the instrumentation pass to store parameter origins in this array. llvm-svn: 341528	2018-09-06 08:50:11 +00:00
Alexander Potapenko	55a97d2686	[MSan] Make sure variadic function arguments do not overflow __msan_va_arg_tls Turns out that calling a variadic function with too many (e.g. >100 i64's) arguments overflows __msan_va_arg_tls, which leads to smashing other TLS data with function argument shadow values. getShadow() already checks for kParamTLSSize and returns clean shadow if the argument does not fit, so just skip storing argument shadow for such arguments. llvm-svn: 341525	2018-09-06 08:21:54 +00:00
Max Kazantsev	26a15d24bf	Revert "[IndVars] Turn isValidRewrite into an assertion" because it seems wrong llvm-svn: 341517	2018-09-06 05:52:47 +00:00
Max Kazantsev	1356fdcbb5	[IndVars] Turn isValidRewrite into an assertion Function rewriteLoopExitValues contains a check on isValidRewrite which is needed to make sure that SCEV does not convert the pattern `gep Base, (&p[n] - &p[0])` into `gep &p[n], Base - &p[0]`. This problem has been fixed in SCEV long ago, so this check is just obsolete. This patch converts it into an assertion to make sure that the SCEV will not mess up this case in the future. Differential Revision: https://reviews.llvm.org/D51582 Reviewed By: atrick llvm-svn: 341516	2018-09-06 05:21:25 +00:00
Benjamin Kramer	40fd323276	[ControlHeightReduction] Remove unused includes Also clang-format them. llvm-svn: 341468	2018-09-05 13:51:05 +00:00
Benjamin Kramer	1aa290ab4d	[Aggressive InstCombine] Move C bindings to their own header file. llvm-svn: 341461	2018-09-05 11:41:12 +00:00
Richard Trieu	5cc0f882a6	Prevent unsigned overflow. The sum of the weights is caculated in an APInt, which has a width smaller than 64. In certain cases, the sum of the widths would overflow when calculations are done inside an APInt, but would not if done with uint64_t. Since the values will be passed as uint64_t in the function call anyways, do all the math in 64 bits. Also added an assert in case the probabilities overflow 64 bits. llvm-svn: 341444	2018-09-05 04:19:15 +00:00
Fangrui Song	36e6b77321	Fix -Wunused-function in release build after rL341386 llvm-svn: 341443	2018-09-05 03:10:20 +00:00
Sanjay Patel	3d57fb8f30	[InstCombine] fix xor-or-xor fold to check uses and handle commutes I'm probably missing some way to use m_Deferred to remove the code duplication, but that can be a follow-up. The improvement in demand_shrink_nsw.ll is an example of missing the fold because the pattern matching was deficient. I didn't try to follow the bits in that test, but Alive says it's correct: https://rise4fun.com/Alive/ugc llvm-svn: 341426	2018-09-04 23:22:13 +00:00
Zhaoshi Zheng	7cb7429e0d	Revert "Revert r341269: [Constant Hoisting] Hoisting Constant GEP Expressions" Reland r341269. Use std::stable_sort when sorting constant condidates. Reverting commit, r341365: Revert r341269: [Constant Hoisting] Hoisting Constant GEP Expressions One of the tests is failing 50% of the time when expensive checks are enabled. Not sure how deep the problem is so just reverting while the author can investigate so that the bots stop repeatedly failing and blaming things incorrectly. Will respond with details on the original commit. Original commit, r341269: [Constant Hoisting] Hoisting Constant GEP Expressions Leverage existing logic in constant hoisting pass to transform constant GEP expressions sharing the same base global variable. Multi-dimensional GEPs are rewritten into single-dimensional GEPs. https://reviews.llvm.org/D51396 Differential Revision: https://reviews.llvm.org/D51654 llvm-svn: 341417	2018-09-04 22:17:03 +00:00
Anna Thomas	d0c6bf40cb	[LV] First order recurrence phis should not be treated as uniform This is fix for PR38786. First order recurrence phis were incorrectly treated as uniform, which caused them to be vectorized as uniform instructions. Patch by Ayal Zaks and Orivej Desh! Reviewed by: Anna Differential Revision: https://reviews.llvm.org/D51639 llvm-svn: 341416	2018-09-04 22:12:23 +00:00
Hiroshi Yamauchi	3b619a8745	Fix a memory leak after rL341386. Reviewers: davidxl Reviewed By: davidxl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D51658 llvm-svn: 341412	2018-09-04 21:28:22 +00:00
Sanjay Patel	e15dec4563	[InstCombine] make ((X & C) ^ C) form consistent for vectors It would be better to create a 'not' here, but that's not possible yet. llvm-svn: 341410	2018-09-04 21:17:14 +00:00

1 2 3 4 5 ...

20586 Commits