llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 22:12:47 +01:00

Author	SHA1	Message	Date
Tim Northover	ca0f4dc4f0	AArch64/ARM64: move ARM64 into AArch64's place This commit starts with a "git mv ARM64 AArch64" and continues out from there, renaming the C++ classes, intrinsics, and other target-local objects for consistency. "ARM64" test directories are also moved, and tests that began their life in ARM64 use an arm64 triple, those from AArch64 use an aarch64 triple. Both should be equivalent though. This finishes the AArch64 merge, and everyone should feel free to continue committing as normal now. llvm-svn: 209577	2014-05-24 12:50:23 +00:00
Dinesh Dwivedi	ecbf9efc4d	Added inst-combine for 'MIN(MIN(A, 97), 23)' and 'MAX(MAX(A, 23), 97)' This removes TODO added in r208849 [http://reviews.llvm.org/D3629] MIN(MIN(A, 97), 23) -> MIN(A, 23) MAX(MAX(A, 23), 97) -> MAX(A, 97) Differential Revision: http://reviews.llvm.org/D3785 llvm-svn: 209110	2014-05-19 07:08:32 +00:00
NAKAMURA Takumi	7e44bbf27f	Revert r209049 and r209065, "Add support for combining GEPs across PHI nodes" It broke clang selfhosting even after r209065. llvm-svn: 209067	2014-05-17 14:39:21 +00:00
Louis Gerbarg	5be1142768	Fix for sanitizer crash introduced in r209049 This patch fixes 3 issues introduced by r209049 that only showed up in on the sanitizer buildbots. One was a typo in a compare. The other is a check to confirm that the single differing value in the two incoming GEPs is the same type. The final issue was the the IRBuilder under some circumstances would build PHIs in the middle of the block. llvm-svn: 209065	2014-05-17 06:51:36 +00:00
Louis Gerbarg	18e29acd3d	Add support for combining GEPs across PHI nodes Currently LLVM will generally merge GEPs. This allows backends to use more complex addressing modes. In some cases this is not happening because there is PHI inbetween the two GEPs: GEP1--\ \|-->PHI1-->GEP3 GEP2--/ This patch checks to see if GEP1 and GEP2 are similiar enough that they can be cloned (GEP12) in GEP3's BB, allowing GEP->GEP merging (GEP123): GEP1--\ --\ --\ \|-->PHI1-->GEP3 ==> \|-->PHI2->GEP12->GEP3 == > \|-->PHI2->GEP123 GEP2--/ --/ --/ This also breaks certain use chains that are preventing GEP->GEP merges that the the existing instcombine would merge otherwise. Tests included. rdar://15547484 llvm-svn: 209049	2014-05-16 23:47:24 +00:00
Dinesh Dwivedi	2381ae3d5b	Reverting r208848, reason: build failure: sanitizer-x86_64-linux-bootstrap/builds/3399 llvm-svn: 208852	2014-05-15 08:22:55 +00:00
Dinesh Dwivedi	74c9400378	Added instcombine for 'MIN(MIN(A, 27), 93)' and 'MAX(MAX(A, 93), 27)' MIN(MIN(A, 23), 97) -> MIN(A, 23) MAX(MAX(A, 97), 23) -> MAX(A, 97) Differential Revision: http://reviews.llvm.org/D3629 llvm-svn: 208849	2014-05-15 06:13:40 +00:00
Dinesh Dwivedi	de41090110	Added inst combine transforms for single bit tests from Chris's note if ((x & C) == 0) x \|= C becomes x \|= C if ((x & C) != 0) x ^= C becomes x &= ~C if ((x & C) == 0) x ^= C becomes x \|= C if ((x & C) != 0) x &= ~C becomes x &= ~C if ((x & C) == 0) x &= ~C becomes nothing Z3 Verifications code for above transform http://rise4fun.com/Z3/Pmsh Differential Revision: http://reviews.llvm.org/D3717 llvm-svn: 208848	2014-05-15 06:01:33 +00:00
David Majnemer	809b8a331d	InstCombine: Optimize -x s< cst Summary: This gets rid of a sub instruction by moving the negation to the constant when valid. Reviewers: nicholas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3773 llvm-svn: 208827	2014-05-15 00:02:20 +00:00
Jay Foad	e0eac700cb	Rename ComputeMaskedBits to computeKnownBits. "Masked" has been inappropriate since it lost its Mask parameter in r154011. llvm-svn: 208811	2014-05-14 21:14:37 +00:00
Serge Pavlov	a4228593ad	Fix the case when reordering shuffle and binop produces a constant. This resolves PR19737. llvm-svn: 208762	2014-05-14 09:05:09 +00:00
Nick Lewycky	2f1d385135	Optimize integral reciprocal (udiv 1, x and sdiv 1, x) to not use division. This fires exactly once in a clang bootstrap, but covers a few different results from http://www.cs.utah.edu/~regehr/souper/ llvm-svn: 208750	2014-05-14 03:03:05 +00:00
Serge Pavlov	978a422aed	Fix type of shuffle resulted from shuffle merge. This fix resolves PR19730. llvm-svn: 208666	2014-05-13 06:07:21 +00:00
Serge Pavlov	1ff1d49a09	Fix type of shuffle obtained from reordering with binary operation In transformation: BinOp(shuffle(v1,undef), shuffle(v2,undef)) -> shuffle(BinOp(v1, v2),undef) type of the undef argument must be same as type of BinOp. llvm-svn: 208531	2014-05-12 10:11:27 +00:00
Serge Pavlov	8e3b52d51f	Fix reordering of shuffles and binary operations Do not apply transformation: BinOp(shuffle(v1), shuffle(v2)) -> shuffle(BinOp(v1, v2)) if operands v1 and v2 are of different size. This change fixes PR19717, which was caused by r208488. llvm-svn: 208518	2014-05-12 05:44:53 +00:00
Serge Pavlov	d043cc92e5	Reorder shuffle and binary operation. This patch enables transformations: BinOp(shuffle(v1), shuffle(v2)) -> shuffle(BinOp(v1, v2)) BinOp(shuffle(v1), const1) -> shuffle(BinOp, const2) They allow to eliminate extra shuffles in some cases. Differential Revision: http://reviews.llvm.org/D3525 llvm-svn: 208488	2014-05-11 08:46:12 +00:00
Michael Zolotukhin	de5b0a206d	[InstCombine] Some cleanup in optimization of redundant insertvalue instructions. And one more test added. llvm-svn: 208355	2014-05-08 19:50:24 +00:00
Chandler Carruth	36ad1ec2b0	Tidy up whitespace with clang-format prior to making significant changes. llvm-svn: 208229	2014-05-07 17:36:59 +00:00
Michael Zolotukhin	5fe056ff21	[InstCombine] Add optimization of redundant insertvalue instructions. rdar://problem/11861387 llvm-svn: 208214	2014-05-07 14:30:18 +00:00
Rafael Espindola	f399f03ccc	Also handle ConstantAggregateZero when optimizing vpermilvar*. llvm-svn: 207582	2014-04-29 22:20:40 +00:00
Rafael Espindola	b4468db535	Remove tabs. Sorry, new machine and I forgot to change the editor setting. llvm-svn: 207578	2014-04-29 21:02:37 +00:00
Rafael Espindola	7372093fcc	Two fixes to the vpermilvar optimization. The instcomine logic to handle vpermilvar's pd and 256 variants was incorrect. The _256 variants have indexes into the individual 128 bit lanes and in all cases it also has to mask out unused bits. llvm-svn: 207577	2014-04-29 20:41:54 +00:00
Hans Wennborg	6405294846	InstCombine: don't drop 'inalloca' in PromoteCastOfAllocation (PR19569) llvm-svn: 207426	2014-04-28 17:40:03 +00:00
Craig Topper	b663bffa27	[C++] Use 'nullptr'. llvm-svn: 207394	2014-04-28 04:05:08 +00:00
Andrea Di Biagio	815dfb7574	[InstCombine][X86] Teach how to fold calls to SSE2/AVX2 packed logical shift right intrinsics. A packed logical shift right with a shift count bigger than or equal to the element size always produces a zero vector. In all other cases, it can be safely replaced by a 'lshr' instruction. llvm-svn: 207299	2014-04-26 01:03:22 +00:00
Craig Topper	c0a2a29f4e	[C++] Use 'nullptr'. Transforms edition. llvm-svn: 207196	2014-04-25 05:29:35 +00:00
Michael J. Spencer	65065bf94f	[InstCombine][x86] Constant fold psll intrinsics. This excludes avx512 as I don't have hardware to verify. It excludes _dq variants because they are represented in the IR as <{2,4} x i64> when it's actually a byte shift of the entire i{128,265}. This also excludes _dq_bs as they aren't at all supported by the backend. There are also no corresponding instructions in the ISA. I have no idea why they exist... llvm-svn: 207058	2014-04-24 00:58:18 +00:00
Filipe Cabecinhas	696e2aae90	Optimize some special cases for SSE4a insertqi Summary: Since the upper 64 bits of the destination register are undefined when performing this operation, we can substitute it and let the optimizer figure out that only a copy is needed. Also added range merging, if an instruction copies a range that can be merged with a previous copied range. Added test cases for both optimizations. Reviewers: grosbach, nadav CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3357 llvm-svn: 207055	2014-04-24 00:38:14 +00:00
Matt Arsenault	bc017b7e11	Handle addrspacecast when looking at memcpys from globals llvm-svn: 207054	2014-04-24 00:01:09 +00:00
Matt Arsenault	457de98b62	Remove dead code in instcombine. Don't replace shifts greater than the type with the maximum shift. This isn't hit anywhere in the tests, and somewhere else is replacing these with undef. llvm-svn: 207000	2014-04-23 16:48:40 +00:00
Chandler Carruth	6f9ba6a633	[Modules] Fix potential ODR violations by sinking the DEBUG_TYPE definition below all of the header #include lines, lib/Transforms/... edition. This one is tricky for two reasons. We again have a couple of passes that define something else before the includes as well. I've sunk their name macros with the DEBUG_TYPE. Also, InstCombine contains headers that need DEBUG_TYPE, so now those headers #define and #undef DEBUG_TYPE around their code, leaving them well formed modular headers. Fixing these headers was a large motivation for all of these changes, as "leaky" macros of this form are hard on the modules implementation. llvm-svn: 206844	2014-04-22 02:55:47 +00:00
Rafael Espindola	5bfe46aee5	Simplify a vpermil* with constant mask. With a constant mask a vpermil* is just a shufflevector. This patch implements that simplification. This allows us to produce denser code. It should also allow more folding down the line. llvm-svn: 206801	2014-04-21 22:06:04 +00:00
Chandler Carruth	e07407deea	[Modules] Sink all the DEBUG_TYPE defines for InstCombine out of the header files and into the cpp files. These files will require more touches as the header files actually use DEBUG(). Eventually, I'll have to introduce a matched #define and #undef of DEBUG_TYPE for the header files, but that comes as step N of many to clean all of this up. llvm-svn: 206777	2014-04-21 19:51:41 +00:00
Matt Arsenault	2a6aada789	Revert "Revert r206045, "Fix shift by constants for vector."" Fix cases where the Value itself is used, and not the constant value. llvm-svn: 206214	2014-04-14 21:50:37 +00:00
NAKAMURA Takumi	1a21e608ca	Whitespace. llvm-svn: 206154	2014-04-14 07:03:13 +00:00
NAKAMURA Takumi	c6fb0494ea	Revert r206045, "Fix shift by constants for vector." It broke some builders, at least, i686. llvm-svn: 206153	2014-04-14 07:02:57 +00:00
Serge Pavlov	c145d314a1	Use APInt arithmetic, fixed typo. Thanks to Benjamin Kramer for noticing that. llvm-svn: 206144	2014-04-14 02:20:19 +00:00
Serge Pavlov	816d014c52	Recognize test for overflow in integer multiplication. If multiplication involves zero-extended arguments and the result is compared as in the patterns: %mul32 = trunc i64 %mul64 to i32 %zext = zext i32 %mul32 to i64 %overflow = icmp ne i64 %mul64, %zext or %overflow = icmp ugt i64 %mul64 , 0xffffffff then the multiplication may be replaced by call to umul.with.overflow. This change fixes PR4917 and PR4918. Differential Revision: http://llvm-reviews.chandlerc.com/D2814 llvm-svn: 206137	2014-04-13 18:23:41 +00:00
Matt Arsenault	c399a3f659	Fix shift by constants for vector. ashr <N x iM>, <N x iM> M -> undef llvm-svn: 206045	2014-04-11 17:57:53 +00:00
Eli Bendersky	be453afe99	Fix PR19270 - type mismatch caused by invalid optimization. Patch by Jingyue Wu. llvm-svn: 205547	2014-04-03 17:51:58 +00:00
Tim Northover	2f13163a84	ARM64: initial backend import This adds a second implementation of the AArch64 architecture to LLVM, accessible in parallel via the "arm64" triple. The plan over the coming weeks & months is to merge the two into a single backend, during which time thorough code review should naturally occur. Everything will be easier with the target in-tree though, hence this commit. llvm-svn: 205090	2014-03-29 10:18:08 +00:00
Erik Verbruggen	11e61b79e5	Revert "InstCombine: merge constants in both operands of icmp." This reverts commit r204912, and follow-up commit r204948. This introduced a performance regression, and the fix is not completely clear yet. llvm-svn: 205010	2014-03-28 14:50:57 +00:00
Reid Kleckner	c826dce075	InstCombine: Don't combine constants on unsigned icmps Fixes a miscompile introduced in r204912. It would miscompile code like (unsigned)(a + -49) <= 5U. The transform would turn this into (unsigned)a < 55U, which would return true for values in [0, 49], when it should not. llvm-svn: 204948	2014-03-27 17:49:27 +00:00
Erik Verbruggen	5e4efd4306	InstCombine: merge constants in both operands of icmp. Transform: icmp X+Cst2, Cst into: icmp X, Cst-Cst2 when Cst-Cst2 does not overflow, and the add has nsw. llvm-svn: 204912	2014-03-27 11:16:05 +00:00
Richard Osborne	fd123c2caf	[InstCombine] Don't fold bitcast into store if it would need addrspacecast Summary: Previously the code didn't check if the before and after types for the store were pointers to different address spaces. This resulted in instcombine using a bitcast to convert between pointers to different address spaces, causing an assertion due to the invalid cast. It is not be appropriate to use addrspacecast this case because it is not guaranteed to be a no-op cast. Instead bail out and do not do the transformation. CC: llvm-commits Differential Revision: http://llvm-reviews.chandlerc.com/D3117 llvm-svn: 204733	2014-03-25 17:21:41 +00:00
Richard Osborne	db5d56840b	Reuse earlier variables to make it clear the types involved in the cast. No functionality change. llvm-svn: 204732	2014-03-25 17:21:35 +00:00
Owen Anderson	3a006737fe	Fix a bug in InstCombine where we would incorrectly attempt to construct a bitcast between pointers of two different address spaces if they happened to have the same pointer size. llvm-svn: 203862	2014-03-13 22:51:43 +00:00
Chandler Carruth	fad39ebe19	[C++11] Add range based accessors for the Use-Def chain of a Value. This requires a number of steps. 1) Move value_use_iterator into the Value class as an implementation detail 2) Change it to actually be a Use iterator rather than a User iterator. 3) Add an adaptor which is a User iterator that always looks through the Use to the User. 4) Wrap these in Value::use_iterator and Value::user_iterator typedefs. 5) Add the range adaptors as Value::uses() and Value::users(). 6) Update all of the callers to correctly distinguish between whether they wanted a use_iterator (and to explicitly dig out the User when needed), or a user_iterator which makes the Use itself totally opaque. Because #6 requires churning essentially everything that walked the Use-Def chains, I went ahead and added all of the range adaptors and switched them to range-based loops where appropriate. Also because the renaming requires at least churning every line of code, it didn't make any sense to split these up into multiple commits -- all of which would touch all of the same lies of code. The result is still not quite optimal. The Value::use_iterator is a nice regular iterator, but Value::user_iterator is an iterator over Users rather than over the User objects themselves. As a consequence, it fits a bit awkwardly into the range-based world and it has the weird extra-dereferencing 'operator->' that so many of our iterators have. I think this could be fixed by providing something which transforms a range of T&s into a range of Ts, but that can be separated into another patch, and it isn't yet 100% clear whether this is the right move. However, this change gets us most of the benefit and cleans up a substantial amount of code around Use and User. =] llvm-svn: 203364	2014-03-09 03:16:01 +00:00
Tim Northover	b74aa030d9	InstCombine: form shuffles from wider range of insert/extractelements Sequences of insertelement/extractelements are sometimes used to build vectorsr; this code tries to put them back together into shuffles, but could only produce a completely uniform shuffle types (<N x T> from two <N x T> sources). This should allow shuffles with different numbers of elements on the input and output sides as well. llvm-svn: 203229	2014-03-07 10:24:44 +00:00
Chandler Carruth	a48d15a676	[Layering] Move InstVisitor.h into the IR library as it is pretty obviously coupled to the IR. llvm-svn: 203064	2014-03-06 03:23:41 +00:00

1 2 3 4 5 ...

1046 Commits