llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Sanjoy Das	4e36ac5e45	Fix crashes in IRCE caused by mismatched types There are places where the inductive range check elimination pass depends on two llvm::Values or llvm::SCEVs to be of the same llvm::Type when they do not need to be. This patch relaxes those restrictions (by bailing out of the optimization if the types mismatch), and adds test cases to trigger those paths. These issues were found by bootstrapping clang with IRCE running in the -O3 pass ordering. Differential Revision: http://reviews.llvm.org/D7082 llvm-svn: 226793	2015-01-22 08:29:18 +00:00
Erik Eckstein	46b9effbe2	SLPVectorizer: add a second limit for the number of alias checks. Even with the current limit on the number of alias checks, the containing loop has quadratic complexity. This begins to hurt for blocks containing > 1K load/store instructions. This commit introduces a limit for the loop count. It reduces the runtime for such very large blocks. llvm-svn: 226792	2015-01-22 08:20:51 +00:00
Elena Demikhovsky	837f1e39e9	Fixed a bug in masked load/store in reversed loop. Added a test. The bug was submitted to bugzilla: http://llvm.org/bugs/show_bug.cgi?id=22225 llvm-svn: 226791	2015-01-22 08:20:06 +00:00
Chandler Carruth	5b2abcc667	[PM] Rename InstCombine.h to InstCombineInternal.h in preparation for creating a non-internal header file for the InstCombine pass. I thought about calling this InstCombiner.h or in some way more clearly associating it with the InstCombiner clas that it is primarily defining, but there are several other utility interfaces defined within this for InstCombine. If, in the course of refactoring, those end up moving elsewhere or going away, it might make more sense to make this the combiner's header alone. Naturally, this is a bikeshed to a certain degree, so feel free to lobby for a different shade of paint if this name just doesn't suit you. llvm-svn: 226783	2015-01-22 05:25:13 +00:00
Chandler Carruth	001ed18423	[canonicalize] Teach InstCombine to canonicalize loads which are only ever stored to always use a legal integer type if one is available. Regardless of whether this particular type is good or bad, it ensures we don't get weird differences in generated code (and resulting performance) from "equivalent" patterns that happen to end up using a slightly different type. After some discussion on llvmdev it seems everyone generally likes this canonicalization. However, there may be some parts of LLVM that handle it poorly and need to be fixed. I have at least verified that this doesn't impede GVN and instcombine's store-to-load forwarding powers in any obvious cases. Subtle cases are exactly what we need te flush out if they remain. Also note that this IR pattern should already be hitting LLVM from Clang at least because it is exactly the IR which would be produced if you used memcpy to copy a pointer or floating point between memory instead of a variable. llvm-svn: 226781	2015-01-22 05:08:12 +00:00
Saleem Abdulrasool	47bffde92c	ARM: fail less catastrophically on invalid Windows input Windows supports a restricted set of relocations (compared to ARM ELF). In some cases, we may end up generating an unsupported relocation. This can occur with bad input to the assembler in particular (the frontend should never generate code that cannot be compiled). Generate an error rather than just aborting. The change in the API is driven by the desire to provide a slightly more helpful message for debugging purposes. llvm-svn: 226779	2015-01-22 04:03:32 +00:00
Chandler Carruth	83d25106f7	[canonicalize] Move a helper function further up the file so it can be used earlier. NFC. llvm-svn: 226777	2015-01-22 03:34:54 +00:00
Duncan P. N. Exon Smith	69c26f2395	DIBuilder: Make header iterator constructor explicit, NFC llvm-svn: 226775	2015-01-22 03:20:09 +00:00
Duncan P. N. Exon Smith	cb2d333dd9	DIBuilder: Extract header_begin() and header_end(), NFC Use begin/end functions so that users don't need to know how these weird things work. llvm-svn: 226774	2015-01-22 03:17:43 +00:00
Duncan P. N. Exon Smith	bde325101d	DIBuilder: Stop abusing DIExpressionIterator::operator*(), NFC This code was confusing, since it created a `DIExpressionIterator` from an invalid start point (although it wasn't wrong: it never actually iterated). Now that the underlying iterator has `getNumber()`, just use it directly. llvm-svn: 226773	2015-01-22 03:13:35 +00:00
Duncan P. N. Exon Smith	c3ca8cfd07	DIBuilder: Extract DIHeaderFieldIterator::getNumber(), NFC Reduce code duplication between `DIBuilder` and `DIExpressionIterator` by implementing a `getNumber()` directly in the iterator. llvm-svn: 226772	2015-01-22 03:11:13 +00:00
Duncan P. N. Exon Smith	c1facb579f	DIBuilder: Create a getHeaderIterator() helper, NFC Extract this so it can be reused. llvm-svn: 226770	2015-01-22 03:00:01 +00:00
Chris Bieneman	67edb37229	Making deleted copy constructors and operators to be private for better diagnostics when deleted is not available. llvm-svn: 226769	2015-01-22 02:51:33 +00:00
Reid Kleckner	82c69426ca	SEH: Finish writing the catch-all test case llvm-svn: 226768	2015-01-22 02:31:09 +00:00
Reid Kleckner	6b2145b9c9	Win64 SEH: Emit the constant 1 for catch-all into xdata llvm-svn: 226767	2015-01-22 02:27:44 +00:00
Chris Bieneman	9a06c4c87f	Assigning and copying command line option objects shouldn't be allowed. Summary: The default copy and assignment operators for these objects probably don't actually do what the clients intend, so they should be deleted. Places using the assignment operator to set the value of an option should cast to the option's data type first to call into the override for operator=. Places using the copy constructor just need to be changed to not copy (i.e. passing by const reference instead of value). Reviewers: dexonsmith, chandlerc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7114 llvm-svn: 226762	2015-01-22 01:49:59 +00:00
Sanjoy Das	0520741ff9	Make ScalarEvolution less aggressive with respect to no-wrap flags. ScalarEvolution currently lowers a subtraction recurrence to an add recurrence with the same no-wrap flags as the subtraction. This is incorrect because `sub nsw X, Y` is not the same as `add nsw X, -Y` and `sub nuw X, Y` is not the same as `add nuw X, -Y`. This patch fixes the issue, and adds two test cases demonstrating the bug. Differential Revision: http://reviews.llvm.org/D7081 llvm-svn: 226755	2015-01-22 00:48:47 +00:00
Paul Robinson	4e123e1b82	Explicitly describe '///' versus '//' comment delimiters. llvm-svn: 226750	2015-01-22 00:19:56 +00:00
Adrian Prantl	59be0cc8b7	Make DwarfExpression use the new DIExpressionIterator. NFC. llvm-svn: 226748	2015-01-22 00:00:59 +00:00
Adrian Prantl	919936d65f	Rewrite DIExpression::Verify() using an iterator. NFC. Addresses review comments for r226627. llvm-svn: 226747	2015-01-22 00:00:52 +00:00
Chandler Carruth	320c9d4d21	[canonicalization] Refactor how we create new stores into a helper function. This is a bit tidier anyways and will make a subsquent patch simpler as I want to add another case to this combine. llvm-svn: 226746	2015-01-21 23:45:01 +00:00
Simon Pilgrim	c666c3a4b2	[X86][SSE] Missing SSE/AVX1 memory folding integer instructions Added most of the missing integer vector folding patterns for SSE (to SSE42) and AVX1. The most useful of these are probably the i32/i64 extraction, i8/i16/i32/i64 insertions, zero/sign extension, unsigned saturation subtractions, i64 subtractions and the variable mask blends (pblendvb) - others include CLMUL, SSE42 string comparisons and bit tests. Differential Revision: http://reviews.llvm.org/D7094 llvm-svn: 226745	2015-01-21 23:43:30 +00:00
Tim Northover	5b0e908c64	DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N)) It can help with argument juggling on some targets, and is generally a good idea. llvm-svn: 226740	2015-01-21 23:17:19 +00:00
David Blaikie	5ab2c04898	DebugInfo: Use distinct inlinedAt MDLocations to avoid separate inlined calls being coalesced When two calls from the same MDLocation are inlined they currently get treated as one inlined function call (creating difficulty debugging, duplicate variables, etc). Clang worked around this by including column information on inline calls which doesn't address LTO inlining or calls to the same function from the same line and column (such as through a macro). It also didn't address ctor and member function calls. By making the inlinedAt locations distinct, every call site has an explicitly distinct location that cannot be coalesced with any other call. This can produce linearly (2x in the worst case where every call is inlined and the call instruction has a non-call instruction at the same location) more debug locations. Any increase beyond that are in cases where the Clang workaround was insufficient and the new scheme is creating necessary distinct nodes that were being erroneously coalesced previously. After this change to LLVM the incomplete workarounds in Clang. That should reduce the number of debug locations (in a build without column info, the default on Darwin, not the default on Linux) by not creating pseudo-distinct locations for every call to an inline function. (oh, and I made the inlined-at chain rebuilding iterative instead of recursive because I was having trouble wrapping my head around it the way it was - open to discussion on the right design for that function (including going back to a recursive solution)) llvm-svn: 226736	2015-01-21 22:57:29 +00:00
Matt Arsenault	ab16558365	R600: Add checks for urem/srem by a constant Make sure this uses the faster expansion using magic constants to avoid the full division path. llvm-svn: 226734	2015-01-21 22:56:15 +00:00
Matthias Braun	0bd858c6b5	LiveIntervalAnalysis: Mark subregister defs as undef when we determined they are only reading a dead superregister value This was not necessary before as this case can only be detected when the liveness analysis is at subregister level. llvm-svn: 226733	2015-01-21 22:55:13 +00:00
Chris Bieneman	4455c9a3b0	Adding a new cl::HideUnrelatedOptions API to allow clang to migrate off cl::getRegisteredOptions. Summary: cl::getRegisteredOptions really exposes some of the innards of how command line parsing is implemented. Exposing new APIs that allow us to disentangle client code from implementation details will allow us to make more extensive changes to command line parsing. Reviewers: chandlerc, dexonsmith, beanz Reviewed By: dexonsmith Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D7100 llvm-svn: 226729	2015-01-21 22:45:52 +00:00
Simon Pilgrim	b377a5e42b	[X86][SSE] Added support for SSE3 lane duplication shuffle instructions This patch adds shuffle matching for the SSE3 MOVDDUP, MOVSLDUP and MOVSHDUP instructions. The big use of these being that they avoid many single source shuffles from needing to use (pre-AVX) dual source instructions such as SHUFPD/SHUFPS: causing extra moves and preventing load folds. Adding these instructions uncovered an issue in XFormVExtractWithShuffleIntoLoad which crashed on single operand shuffle instructions (now fixed). It also involved fixing getTargetShuffleMask to correctly identify theses instructions as unary shuffles. Also adds a missing tablegen pattern for MOVDDUP. Differential Revision: http://reviews.llvm.org/D7042 llvm-svn: 226716	2015-01-21 22:44:35 +00:00
Matt Arsenault	58bdd73a01	R600: Add missing tests for i64 srem llvm-svn: 226713	2015-01-21 22:43:19 +00:00
Jonathan Roelofs	74e85c180f	Fix load-store optimizer on thumbv4t Thumbv4t does not have lo->lo copies other than MOVS, and that can't be predicated. So emit MOVS when needed and bail if there's a predicate. http://reviews.llvm.org/D6592 llvm-svn: 226711	2015-01-21 22:39:43 +00:00
George Burgess IV	54d138f1fd	Added test to cover the CFLAA bitset indexing bug. llvm-svn: 226710	2015-01-21 22:39:35 +00:00
David Majnemer	f4834fe3ed	InstCombine: Don't strip bitcasts off of callsites marked 'thunk' The return type of a thunk is meaningless, we just want the arguments and return value to be forwarded. llvm-svn: 226708	2015-01-21 22:32:04 +00:00
Simon Pilgrim	5e3d36f19e	[X86][SSE] movddup shuffle mask decodes Patch to provide shuffle decodes and asm comments for the SSE3/AVX1 movddup double duplication instructions. llvm-svn: 226705	2015-01-21 22:02:30 +00:00
Adrian Prantl	64d3d9a495	simplify expression llvm-svn: 226701	2015-01-21 21:23:35 +00:00
Adrian Prantl	c8ae66ee60	Fix a compile issue on MSVC and call finalize(). llvm-svn: 226694	2015-01-21 19:25:35 +00:00
Matthias Braun	3e16064f18	LiveIntervalAnalysis: Factor out code to update liveness on vreg def removal This cleans up code and is more in line with the general philosophy of modifying LiveIntervals through LiveIntervalAnalysis instead of changing them directly. This also fixes a case where SplitEditor::removeBackCopies() would miss the subregister ranges. llvm-svn: 226690	2015-01-21 19:02:30 +00:00
Matthias Braun	938927861a	LiveIntervalAnalysis: document removePhysRegDefAt() function. llvm-svn: 226689	2015-01-21 19:02:26 +00:00
Matthias Braun	1b694229cb	LiveIntervalAnalysis: Factor out code to update liveness on physreg def removal This cleans up code and is more in line with the general philosophy of modifying LiveIntervals through LiveIntervalAnalysis instead of changing them directly. llvm-svn: 226687	2015-01-21 18:50:21 +00:00
Matthias Braun	d3551d5e79	LiveIntervalAnalysis: Remove unused pruneValue() variant. llvm-svn: 226686	2015-01-21 18:45:57 +00:00
Adrian Prantl	a94057833d	Let subprograms with instructions without parent scopes fail the verification. Tested via a unit test. Follow-up to r226616. llvm-svn: 226684	2015-01-21 18:32:56 +00:00
Matt Arsenault	3c5cd0f801	R600/SI: Custom lower fround This fixes it for SI. It also removes the pattern used previously for Evergreen for f32. I'm not sure if the the new R600 output is better or not, but it uses 1 fewer instructions if BFI is available. llvm-svn: 226682	2015-01-21 18:18:25 +00:00
Colin LeMahieu	eacab3ed87	[Hexagon] Converting multiply and accumulate with immediate intrinsics to patterns. llvm-svn: 226681	2015-01-21 18:13:15 +00:00
Ahmed Bougacha	aea688cba4	[X86] Declare SSE4.1/AVX2 vector extloads covered by PMOV[SZ]X legal. Now that we can fully specify extload legality, we can declare them legal for the PMOVSX/PMOVZX instructions. This for instance enables a DAGCombine to fire on code such as (and (<zextload-equivalent> ...), <redundant mask>) to turn it into: (zextload ...) as seen in the testcase changes. There is one regression, in widen_load-2.ll: we're no longer able to do store-to-load forwarding with illegal extload memory types. This will be addressed separately. Differential Revision: http://reviews.llvm.org/D6533 llvm-svn: 226676	2015-01-21 17:07:06 +00:00
Eric Fiselier	b07310b743	[lit] Format JSONMetricValue strings better. llvm-svn: 226672	2015-01-21 16:38:31 +00:00
George Burgess IV	194cda6021	Fixed a bug with how we determine bitset indices. llvm-svn: 226671	2015-01-21 16:37:21 +00:00
Yaron Keren	ba4196c723	Add missing include guards to WindowsSupport.h. llvm-svn: 226669	2015-01-21 16:20:38 +00:00
Tim Northover	c2963c8019	Revert "DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N))" It hadn't gone through review yet, but was still on my local copy. This reverts commit r226663 llvm-svn: 226665	2015-01-21 15:48:52 +00:00
Tim Northover	4b020bd23b	AArch64: add backend option to reserve x18 (platform register) AAPCS64 says that it's up to the platform to specify whether x18 is reserved, and a first step on that way is to add a flag controlling it. From: Andrew Turner <andrew@fubar.geek.nz> llvm-svn: 226664	2015-01-21 15:43:31 +00:00
Tim Northover	c9cc73b336	DAGCombine: fold (or (and X, M), (and X, N)) -> (and X, (or M, N)) llvm-svn: 226663	2015-01-21 15:43:28 +00:00
Michael Kuperstein	de439866fe	[x32] Fast ISel should use LEA64_32r instead of LEA32r to adjust addresses in x32 mode. llvm-svn: 226661	2015-01-21 14:44:05 +00:00

1 2 3 4 5 ...

112050 Commits