llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Tim Northover	f6c9f899a8	GlobalISel: refactor legalization of G_INSERT. Now that G_INSERT instructions can only insert one register, this code was overly general. In another direction it didn't handle registers that crossed split boundaries properly, which needed to be fixed. llvm-svn: 297042	2017-03-06 18:23:04 +00:00
Sanjay Patel	2bda33fccd	[InstSimplify] regenerate checks; NFC llvm-svn: 297040	2017-03-06 18:13:01 +00:00
Krzysztof Parzyszek	34717b9d7f	[TableGen] Ensure proper ordering of subtarget feature names llvm-svn: 297039	2017-03-06 18:08:37 +00:00
Dehao Chen	dc2ec5d444	Remove the sample pgo annotation heuristic that uses call count to annotate basic block count. Summary: We do not need that special handling because the debug info is more accurate now. Performance testing shows no regression on google internal benchmarks. Reviewers: davidxl, aprantl Reviewed By: aprantl Subscribers: llvm-commits, aprantl Differential Revision: https://reviews.llvm.org/D30658 llvm-svn: 297038	2017-03-06 17:49:59 +00:00
Krzysztof Parzyszek	033501f748	[Hexagon] Early-if-convert branches that may exit the loop Merge the tail block into the loop in cases where the main loop body exits early, subject to profitability constraints. This will coalesce the loop body into fewer blocks. For example: loop: loop: // loop body // loop body if (...) jump exit --> // more body more: if (...) jump exit // more body jump loop jump loop llvm-svn: 297033	2017-03-06 17:24:04 +00:00
Krzysztof Parzyszek	90d780b0a6	[Hexagon] Mark dead defs as <dead> in expand-condsets The code in updateDeadFlags removed unnecessary <dead> flags, but there can be cases where such a flag is not set, and yet a register has become dead. For example, if a mux with identical inputs is replaced with a COPY, the predicate register may no longer be used after that. llvm-svn: 297032	2017-03-06 17:09:06 +00:00
Krzysztof Parzyszek	fbbcfd906f	[Hexagon] Pick a dot-old instruction that matches the architecture llvm-svn: 297031	2017-03-06 17:03:16 +00:00
Sanjay Patel	39bc7f12f4	[InstSimplify] remove misleading comments; NFC Div/rem-of-0 does not cause faults/undef (not the same as div/rem-by-0). llvm-svn: 297029	2017-03-06 16:49:35 +00:00
Sanjay Patel	e875a18abf	[DAGCombiner] simplify div/rem-by-0 Refactoring of duplicated code and more fixes to follow. This is motivated by the post-commit comments for r296699: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20170306/435182.html Ie, we can crash if we're missing obvious simplifications like this that exist in the IR simplifier or if these occur later than expected. The x86 change for non-splat division shows a potential opportunity to improve vector codegen: we assumed that since only one lane had meaningful results, we should do the math in scalar. But that means moving back and forth from vector registers. llvm-svn: 297026	2017-03-06 16:36:42 +00:00
Alexey Bataev	e888c6c3d2	[SLP] A test for vectorization of users of extractelement instructions, NFC. llvm-svn: 297024	2017-03-06 16:26:00 +00:00
Tobias Grosser	921c30d131	Fix minor typo introduce in r297014 llvm-svn: 297020	2017-03-06 16:03:26 +00:00
Vassil Vassilev	93bcfdca58	Silence a warning "hiding virtual function". llvm-svn: 297018	2017-03-06 15:50:59 +00:00
Sanjay Patel	819a6f8841	[x86] add tests to show missing div/rem simplifications; NFC These are not x86-specific, but the problem is not visible for all targets because it is masked by other transforms. These can lead to compiler crashes. llvm-svn: 297017	2017-03-06 15:50:07 +00:00
Michael Kruse	753fb870c9	[BasicBlockUtils] Check for nullptr before updating LoopInfo. LoopInfo::getLoopFor returns nullptr if a BB is not in a loop and only then can the loop be updated to contain the newly created BBs. Add the missing nullptr check to SplitBlockAndInsertIfThen. Within LLVM, the only user of this function that also passes a LoopInfo to be updated is InnerLoopVectorizer::predicateInstructions(). As the method's name implies, the BB operataten on will always be within a loop, but out-of-tree users may also use it differently (here: Polly). All other uses of LoopInfo::getLoopFor in the file properly check its return value for nullptr. llvm-svn: 297016	2017-03-06 15:33:05 +00:00
Sanjay Patel	e58df799fe	[DAG] fix formatting; NFC llvm-svn: 297015	2017-03-06 15:27:57 +00:00
Tobias Grosser	b7c673a9c1	New Test-Case for Region Analysis While working on improvements to region info analysis, this test case caused an incorrect region bb2 => bb3 to be detected. Reviewers: grosser Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30652 llvm-svn: 297014	2017-03-06 15:20:38 +00:00
Sanjay Patel	b8be70a569	[DAG] fix typo in comment; NFC llvm-svn: 297011	2017-03-06 15:07:43 +00:00
Tom Stellard	ec0b8c6945	CMake: Add a build target for generating a source RPM Summary: 'make srpm' or 'ninja srpm' will tar up the current source code and then build a source RPM package. By default it will use the llvm.spec file to generate the source RPM, but you can specify your own custom spec file with the LLVM_SRPM_USER_BINARY_SPECFILE option. CMake will perform variable substitution on your custom specfile, so you can reference CMake variables in it. For example: Version: @LLVM_RPM_SPEC_VERSION@ Note that everything in the source directory will be included in the tarball so if you have a clang check out in tools/clang, then all the clang source will end up in the tarball to. It is recommended to only use this build target with a clean source tree. Reviewers: beanz Reviewed By: beanz Subscribers: mgorny, llvm-commits Differential Revision: https://reviews.llvm.org/D30093 llvm-svn: 297007	2017-03-06 14:26:50 +00:00
Nemanja Ivanovic	1a27ab2702	[PowerPC] Fix failure with STBRX when store is narrower than the bswap Fixes a crash caused by r296811 by truncating the input of the STBRX node when the bswap is wider than i32. Fixes https://bugs.llvm.org/show_bug.cgi?id=32140 Differential Revision: https://reviews.llvm.org/D30615 llvm-svn: 297001	2017-03-06 07:32:13 +00:00
Dean Michael Berris	1ece6d4a7e	[XRay] Allow logging the first argument of a function call. Summary: Functions with the "xray-log-args" attribute will have a special XRay sled kind emitted, for compiler-rt to copy any call arguments to your logging handler. For practical and performance reasons, only the first argument is supported, and only up to 64 bits. Reviewers: dberris Reviewed By: dberris Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D29702 llvm-svn: 296998	2017-03-06 06:48:56 +00:00
Craig Topper	e2e31b6a01	[APInt] Move operator~ out of line to make it better able to reused memory allocation from temporary objects Summary: This makes operator~ take the APInt by value so if it came from a temporary APInt the move constructor will get invoked and it will be able to reuse the memory allocation from the temporary. This is similar to what was already done for 2s complement negation. Reviewers: hans, davide, RKSimon Reviewed By: davide Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30614 llvm-svn: 296997	2017-03-06 06:30:47 +00:00
Craig Topper	4e17c0a9f9	[APInt] Remove add and sub functions from APIntOps namespace. They aren't used in tree and using the overloaded operators has more optimization opportunities. llvm-svn: 296995	2017-03-06 04:35:01 +00:00
Craig Topper	ac0e488f8e	[APInt] Remove unused And/Or/Xor methods. They just forward to the corresponding operator overload. I plan to enhance the operator overloads to handle rvalues and these methods would not longer be optimal to use. llvm-svn: 296993	2017-03-06 00:24:53 +00:00
Sanjoy Das	58bb799513	[SCEV] Decrease the recursion threshold for CompareValueComplexity Fixes PR32142. r287232 accidentally increased the recursion threshold for CompareValueComplexity from 2 to 32. This change reverses that change by introducing a separate flag for CompareValueComplexity's threshold. llvm-svn: 296992	2017-03-05 23:49:17 +00:00
Craig Topper	1c33eb3a5e	[APInt] Remove the And/Or/Xor/Not functions from the APIntOps namespace. Summary: They aren't used anywhere in tree and its preferable to use the &, \|, ^, or ~ operators. With my patch to add rvalue reference support to &, \|, ^ operators it also becomes less performant to use these functions. Reviewers: RKSimon, davide, hans Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D30613 llvm-svn: 296990	2017-03-05 16:41:11 +00:00
Simon Pilgrim	a5f0499064	[SelectionDAG] Fix vector splitting for *_EXTEND_VECTOR_INREG instructions Found by fuzz testing after rL296985 landed llvm-svn: 296989	2017-03-05 15:52:18 +00:00
Tobias Grosser	3ed0ada2df	New Test-Case for Region Analysis While working on improvements to the region info analysis, this test case caused an incorrect region 1 => 2 to be detected. It is incorrect because entry has an outgoing edge to 3. This is interesting because 1 dom 2 and 2 pdom 1, which should have been enough to prevent incoming forward edges into the region and outgoing forward edges from the region. Reviewers: grosser Subscribers: llvm-commits Contributed-by: Nandini Singhal <cs15mtech01004@iith.ac.in> Differential Revision: https://reviews.llvm.org/D30603 llvm-svn: 296988	2017-03-05 14:08:28 +00:00
Benjamin Kramer	046137ca72	[X86] Silence GCC enum compare warning. X86ISelLowering.cpp:26506:36: error: enumeral mismatch in conditional expression: 'llvm::X86ISD::NodeType' vs 'llvm::ISD::NodeType' [-Werror=enum-compare] llvm-svn: 296986	2017-03-05 12:53:20 +00:00
Simon Pilgrim	84ae30b32d	[X86][SSE] Lower 128-bit vectors to SIGN/ZERO_EXTEND_VECTOR_IN_REG ops As described on PR31712, we miss a variety of legalization combines because we lower these to X86ISD::VSEXT/VZEXT despite them having the same functionality. This patch makes 128-bit (SSE41) SIGN/ZERO_EXTEND_VECTOR_IN_REG ops legal, adds the necessary tablegen plumbing and uses a helper 'getExtendInVec' to decide when to use SIGN/ZERO_EXTEND_VECTOR_IN_REG or VSEXT/VZEXT. We're missing a couple of shuffle combines that will be added in a future patch for review. Later patches can then support the AVX2 cases as a mixture of SIGN/ZERO_EXTEND and SIGN/ZERO_EXTEND_VECTOR_IN_REG, and then finally deal with the AVX512 cases. Differential Revision: https://reviews.llvm.org/D30549 llvm-svn: 296985	2017-03-05 09:57:20 +00:00
Sylvestre Ledru	8d9fcfbbee	Revert commit r296967, no typo llvm-svn: 296984	2017-03-05 07:46:24 +00:00
Craig Topper	e633888ab8	[SimplifyCFG] Use APInt::operator\| instead of APInt::Or. NFC I'm looking to improve operator\| to support rvalue references and may remove APInt::Or. llvm-svn: 296982	2017-03-05 01:08:19 +00:00
Craig Topper	784f263d08	[DAGCombine] Use APInt::operator\|(uint64_t) instead of creating a temporary APInt and calling APInt::Or. NFC This is more efficient by itself. But this is prep for a future patch that may remove APInt::Or while making operator\| support rvalue references similar to add/sub. llvm-svn: 296981	2017-03-05 01:08:16 +00:00
Sanjay Patel	6ecc3e6175	[x86] don't require a zext when forming ADC/SBB The larger goal is to move the ADC/SBB transforms currently in combineX86SetCC() to combineAddOrSubToADCOrSBB() because we're creating ADC/SBB in lots of places where we shouldn't. This was intended to be an NFC change, but avx-512 has something strange going on. It doesn't seem like any of the affected tests should really be using SET+TEST or ADC; a simple ADD could replace several instructions. But that's another bug... llvm-svn: 296978	2017-03-04 20:35:19 +00:00
Sanjay Patel	c4d2690b90	[DAGCombiner] allow transforming (select Cond, C +/- 1, C) to (add(ext Cond), C) select Cond, C +/- 1, C --> add(ext Cond), C -- with a target hook. This is part of the ongoing process to obsolete D24480. The motivation is to canonicalize to select IR in InstCombine whenever possible, so we need to have a way to undo that easily in codegen. PowerPC is an obvious winner for this kind of transform because it has fast and complete bit-twiddling abilities but generally lousy conditional execution perf (although this might have changed in recent implementations). x86 also sees some wins, but the effect is limited because these transforms already mostly exist in its target-specific combineSelectOfTwoConstants(). The fact that we see any x86 changes just shows that that code is a mess of special-case holes. We may be able to remove some of that logic now. My guess is that other targets will want to enable this hook for most cases. The likely follow-ups would be to add value type and/or the constants themselves as parameters for the hook. As the tests in select_const.ll show, we can transform any select-of-constants to math/logic, but the general transform for any 2 constants needs one more instruction (multiply or 'and'). ARM is one target that I think may not want this for most cases. I see infinite loops there because it wants to use selects to enable conditionally executed instructions. Differential Revision: https://reviews.llvm.org/D30537 llvm-svn: 296977	2017-03-04 19:18:09 +00:00
Zachary Turner	cbbc49a0c6	Try to fix thread name truncation on non-Windows. llvm-svn: 296976	2017-03-04 18:53:09 +00:00
Kamil Rytarowski	03f6f8e5ee	Improve the Threading code on NetBSD Do not include <sys/user.h> on NetBSD. It's dead file and will be removed. No need to include <sys/sysctl.h> in this code context on NetBSD. llvm-svn: 296973	2017-03-04 17:42:46 +00:00
Zachary Turner	cdbc938574	Truncate thread names if they're too long. llvm-svn: 296972	2017-03-04 16:42:25 +00:00
Daniel Berlin	54e6edd65c	DebugCounter: Initialize skip to 0, not -1 llvm-svn: 296971	2017-03-04 14:08:47 +00:00
Sylvestre Ledru	2d8d5d165d	Fix a typo. Patch by fcrick on github https://github.com/llvm-mirror/llvm/pull/23 llvm-svn: 296969	2017-03-04 14:01:38 +00:00
Sylvestre Ledru	7ec49d130a	Remove redundant code block and update comment. By patch zoren here: https://github.com/llvm-mirror/llvm/pull/20 llvm-svn: 296968	2017-03-04 14:00:44 +00:00
Sylvestre Ledru	f1fad05f29	Fix a typo. Thanks to huangml. Reported here: https://github.com/llvm-mirror/llvm/pull/6 llvm-svn: 296967	2017-03-04 13:56:11 +00:00
Simon Pilgrim	d64b8a32e1	[X86][SSE] Enable post-legalize vXi64 shuffle combining on 32-bit targets Long ago (2010 according to svn blame), combineShuffle probably needed to prevent the accidental creation of illegal i64 types but there doesn't appear to be any combines that can cause this any more as they all have their own legality checks. Differential Revision: https://reviews.llvm.org/D30213 llvm-svn: 296966	2017-03-04 12:50:47 +00:00
Florian Hahn	21e448ed02	[legalize-types] Remove stale entries from SoftenedFloats. Summary: When replacing a SDValue, we should remove the replaced value from SoftenedFloats (and possibly the other maps as well?). When we revisit a Node because it needs analyzing again, we have to remove all result values from SoftenedFloats (and possibly other maps?). This fixes the fp128 test failures with expensive checks for X86. I think we probably should also remove the values from the other maps (PromotedIntegers and so on), let me know what you think. Reviewers: baldrick, bogner, davidxl, ab, arsenm, pirama, chh, RKSimon Reviewed By: chh Subscribers: danalbert, wdng, srhines, hfinkel, sepavloff, llvm-commits Differential Revision: https://reviews.llvm.org/D29265 llvm-svn: 296964	2017-03-04 12:00:35 +00:00
Evgeny Stupachenko	ba51722072	Add test missed in r296770. Differential Revision: http://reviews.llvm.org/D27004 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 296962	2017-03-04 05:20:02 +00:00
Daniel Berlin	33909a77e0	Fix bug in bisect-skip-count not using passed-in arguments llvm-svn: 296961	2017-03-04 03:23:41 +00:00
Evgeny Stupachenko	e5edb8e9bf	Set option enabling LSR alternative way to resolve complex solution to false. Differential Revision: http://reviews.llvm.org/D29862 From: Evgeny Stupachenko <evstupac@gmail.com> llvm-svn: 296959	2017-03-04 03:14:05 +00:00
Matthias Braun	fc6f82320d	X86ISelLowering: Only perform copy elision on legal types. This fixes cases where i1 types were not properly legalized yet and lead to the creating of 0-sized stack slots. This fixes http://llvm.org/PR32136 llvm-svn: 296950	2017-03-04 01:40:40 +00:00
Peter Collingbourne	badac7599f	Fix build. llvm-svn: 296949	2017-03-04 01:38:05 +00:00
Peter Collingbourne	9c81e2e149	WholeProgramDevirt: Implement exporting for uniform ret val opt. Differential Revision: https://reviews.llvm.org/D29846 llvm-svn: 296948	2017-03-04 01:34:53 +00:00
Peter Collingbourne	0551aa2e46	WholeProgramDevirt: Implement exporting for single-impl devirtualization. Differential Revision: https://reviews.llvm.org/D29811 llvm-svn: 296945	2017-03-04 01:31:01 +00:00

1 2 3 4 5 ...

145819 Commits