llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Renato Golin	ab7412a40d	[ARM] Merging 64-bit divmod lib calls into one When div+rem calls on the same arguments are found, the ARM back-end merges the two calls into one __aeabi_divmod call for up to 32-bit values. However, for 64-bit values, which also have a lib call (__aeabi_ldivmod), it wasn't merging the calls, and thus calling ldivmod twice and spilling the temporary results, which generated pretty bad code. This patch legalises 64-bit lib calls for divmod, so that now all the spilling and the second call are gone. It also relaxes the DivRem combiner a bit on the legal type check, since it was already checking for isLegalOrCustom on every value, so the extra check for isTypeLegal was redundant. This patch fixes PR17193 (and a long-time FIXME in the tests). llvm-svn: 262507	2016-03-02 19:35:45 +00:00
Reid Kleckner	0558b04f57	Revert "[X86] Elide references to _chkstk for dynamic allocas" This reverts commit r262370. It turns out there is code out there that does sequences of allocas greater than 4K: http://crbug.com/591404 The goal of this change was to improve the code size of inalloca call sequences, but we got tangled up in the mess of dynamic allocas. Instead, we should come back later with a separate MI pass that uses dominance to optimize the full sequence. This should also be able to remove the often unneeded stacksave/stackrestore pairs around the call. llvm-svn: 262505	2016-03-02 19:20:59 +00:00
Matthias Braun	5ca7af07c9	ARM: Introduce conservative load/store optimization mode Most of the time ARM has the CCR.UNALIGN_TRP bit set to false which means that unaligned loads/stores do not trap and even extensive testing will not catch these bugs. However the multi/double variants are not affected by this bit and will still trap. In effect a more aggressive load/store optimization will break existing (bad) code. These bugs do not necessarily manifest in the broken code where the misaligned pointer is formed but often later in perfectly legal code where it is accessed. This means recompiling system libraries (which have no alignment bugs) with a newer compiler will break existing applications (with alignment bugs) that worked before. So (under protest) I implemented this safe mode which limits the formation of multi/double operations to cases that are not affected by user code (stack operations like spills/reloads) or cases where the normal operations trap anyway (floating point load/stores). It is disabled by default. Differential Revision: http://reviews.llvm.org/D17015 llvm-svn: 262504	2016-03-02 19:20:00 +00:00
Justin Bogner	fc205457a6	SelectionDAG: Use correctly sized allocation functions for SDNodes The placement new calls here were all calling the allocation function in RecyclingAllocator/Recycler for SDNode, instead of the function for the specific subclass we were constructing. Since this particular allocator always overallocates it more or less worked, but would hide what we're actually doing from any memory tools. Also, if you tried to change this allocator to something like a BumpPtrAllocator or MallocAllocator, the compiler would crash horribly all the time. Part of llvm.org/PR26808. llvm-svn: 262500	2016-03-02 19:01:11 +00:00
Geoff Berry	8e0ed2340c	[AArch64] Enable non-leaf frame pointer elimination. Summary: This change enables frame pointer elimination in non-leaf functions. The -fomit-frame-pointer option still needs to be used when compiling via clang (or an equivalent method of not setting the 'no-frame-pointer-elim*' function attributes if generating llvm IR via some other method) to take advantage of this optimization. This change should be NFC when compiling via clang without -fomit-frame-pointer. Reviewers: t.p.northover Subscribers: aemerson, rengolin, tberghammer, qcolombet, llvm-commits, danalbert, mcrosier, srhines Differential Revision: http://reviews.llvm.org/D17730 llvm-svn: 262495	2016-03-02 17:58:31 +00:00
Chris Bieneman	d5c3a96844	[CMake] Add test-depends target to build dependencies of check-all This is just another convenience target for bots to use. It enables isolation of building and testing. llvm-svn: 262494	2016-03-02 17:56:30 +00:00
Reid Kleckner	9feb4e3a93	[cmake] Check the compiler version first Otherwise users get messages from CheckAtomic about missing libatomic instead of a sensible message that says "use GCC 4.7 or newer". I structured the change along the lines of HandleLLVMStdlib.cmake, so that the standalone build of Clang still gets the compiler version check. Reviewers: beanz Differential Revision: http://reviews.llvm.org/D17789 llvm-svn: 262491	2016-03-02 16:42:56 +00:00
Chandler Carruth	e597ed0112	[AA] Hoist the logic to reformulate various AA queries in terms of other parts of the AA interface out of the base class of every single AA result object. Because this logic reformulates the query in terms of some other aspect of the API, it would easily cause O(n^2) query patterns in alias analysis. These could in turn be magnified further based on the number of call arguments, and then further based on the number of AA queries made for a particular call. This ended up causing problems for Rust that were actually noticeable enough to get a bug (PR26564) and probably other places as well. When originally re-working the AA infrastructure, the desire was to regularize the pattern of refinement without losing any generality. While I think it was successful, that is clearly proving to be too costly. And the cost is needless: we gain no actual improvement for this generality of making a direct query to tbaa actually be able to re-use some other alias analysis's refinement logic for one of the other APIs, or some such. In short, this is entirely wasted work. To the extent possible, delegation to other API surfaces should be done at the aggregation layer so that we can avoid re-walking the aggregation. In fact, this significantly simplifies the logic as we no longer need to smuggle the aggregation layer into each alias analysis (or the TargetLibraryInfo into each alias analysis just so we can form argument memory locations!). However, we also have some delegation logic inside of BasicAA and some of it even makes sense. When the delegation logic is baking in specific knowledge of aliasing properties of the LLVM IR, as opposed to simply reformulating the query to utilize a different alias analysis interface entry point, it makes a lot of sense to restrict that logic to a different layer such as BasicAA. 
So one aspect of the delegation that was in every AA base class is that when we don't have operand bundles, we re-use function AA results as a fallback for callsite alias results. This relies on the IR properties of calls and functions w.r.t. aliasing, and so seems a better fit to BasicAA. I've lifted the logic up to that point where it seems to be a natural fit. This still does a bit of redundant work (we query function attributes twice, once via the callsite and once via the function AA query) but it is exactly twice here, no more. The end result is that all of the delegation logic is hoisted out of the base class and into either the aggregation layer when it is a pure retargeting to a different API surface, or into BasicAA when it relies on the IR's aliasing properties. This should fix the quadratic query pattern reported in PR26564, although I don't have a stand-alone test case to reproduce it. It also seems general goodness. Now the numerous AAs that don't need target library info don't carry it around and depend on it. I think I can even rip out the general access to the aggregation layer and only expose that in BasicAA as it is the only place where we re-query in that manner. However, this is a non-trivial change to the AA infrastructure so I want to get some additional eyes on this before it lands. Sadly, it can't wait long because we should really cherry pick this into 3.8 if we're going to go this route. Differential Revision: http://reviews.llvm.org/D17329 llvm-svn: 262490	2016-03-02 15:56:53 +00:00
Simon Pilgrim	d0f04efed6	[X86][SSSE3] Added combine test for unary shuffle (pshufb) only referencing elements from one of the inputs of a binary shuffle (punpcklbw) llvm-svn: 262486	2016-03-02 14:16:50 +00:00
Michael Zuckerman	823b8e16d6	[LLVM][AVX512]PSRAWI Change imm8 to int. Differential Revision: http://reviews.llvm.org/D17705 llvm-svn: 262480	2016-03-02 12:05:07 +00:00
Simon Pilgrim	f9f7ca4f85	[X86][SSE] Lower 128-bit MOVDDUP with existing VBROADCAST mechanisms We have a number of useful lowering strategies for VBROADCAST instructions (both from memory and register element 0) which the 128-bit form of the MOVDDUP instruction can make use of. This patch tweaks lowerVectorShuffleAsBroadcast to enable it to broadcast 2f64 args using MOVDDUP as well. It does require a slight tweak to the lowerVectorShuffleAsBroadcast mechanism as the existing MOVDDUP lowering uses isShuffleEquivalent which can match binary shuffles that can lower to (unary) broadcasts. Differential Revision: http://reviews.llvm.org/D17680 llvm-svn: 262478	2016-03-02 11:43:05 +00:00
Nikolay Haustov	7908fea386	Revert "[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields" Build failure with clang. llvm-svn: 262477	2016-03-02 11:16:56 +00:00
Nikolay Haustov	c360acc67e	Revert "[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler." Build failure with clang. llvm-svn: 262475	2016-03-02 10:54:21 +00:00
Nikolay Haustov	9479911a82	[AMDGPU] Using table-driven amd_kernel_code_t field parser in assembler. complementary patch to table-driven amd_kernel_code_t field parser/printer utility. lit tests passed. Patch by: Valery Pykhtin Differential Revision: http://reviews.llvm.org/D17151 llvm-svn: 262474	2016-03-02 10:36:30 +00:00
Nikolay Haustov	0f9d70887b	[AMDGPU] table-driven parser/printer for amd_kernel_code_t structure fields This is going to be used in .hsatext disassembler and can be used in current assembler parser (lit tests passed on parsing). Code using these helpers isn't included in this patch. Benefits: unified approach; fast field name lookup on parsing. Later I would like to enhance some of the field naming/syntax using this code. Patch by: Valery Pykhtin Differential Revision: http://reviews.llvm.org/D17150 llvm-svn: 262473	2016-03-02 10:36:25 +00:00
Dmitry Vyukov	d77444bc90	libfuzzer: fix compiler warnings - unused sigaction/setitimer result (used in assert) - unchecked fscanf return value - signed/unsigned comparison llvm-svn: 262472	2016-03-02 09:54:40 +00:00
Craig Topper	e377f9a96e	[X86] Remove unnecessary call to isReg from emitter's DestMem handling for VEX prefix. The operand is always a register. NFC llvm-svn: 262468	2016-03-02 07:32:45 +00:00
Craig Topper	fba583ca7a	[X86] Make X86MCCodeEmitter::DetermineREXPrefix locate operands more like how VEX prefix handling does. llvm-svn: 262467	2016-03-02 07:32:43 +00:00
David Majnemer	572acaa24c	[X86] Permit reading of the FLAGS register without it being previously defined We modeled the RDFLAGS{32,64} operations as "using" {E,R}FLAGS. While technically correct, this is not desirable for folks who want to examine aspects of the FLAGS register which are not related to computation like whether or not CPUID is a valid instruction. Differential Revision: http://reviews.llvm.org/D17782 llvm-svn: 262465	2016-03-02 06:46:52 +00:00
Craig Topper	a32ceccef8	[X86] Remove assertion I accidentally left in. llvm-svn: 262464	2016-03-02 06:35:22 +00:00
Craig Topper	05f010fdc4	[X86] Be more structured about how we capture the register number when it is encoded in bits 7:4 of the immediate. For some instructions the register is not the last operand and the immediate handling had to detect this and hardcode the index to find it. It also required CurOp to be pointing at the last operand handled in the Form switch whereas for any instruction it would be pointing at the next operand. Now we just capture the value in the Form switch when we know exactly where it is and the CurOp pointer can behave normally. llvm-svn: 262462	2016-03-02 06:06:18 +00:00
Sanjoy Das	ce8e76bbc5	[SCEV] Minor naming, braces cleanup; NFC llvm-svn: 262459	2016-03-02 04:52:22 +00:00
Craig Topper	f43237f0ae	[X86] Use MCPhysReg and uint16_t for static arrays of registers and opcodes respectively. Should reduce size a tiny bit. NFC llvm-svn: 262458	2016-03-02 04:42:31 +00:00
Matt Arsenault	b2f04b3a87	AMDGPU: Fix bug 26659. Fix checking the same instruction twice instead of the second branch that uses vccz. I don't think this matters currently because s_branch_vccnz is always used currently. llvm-svn: 262457	2016-03-02 04:12:39 +00:00
Matt Arsenault	f18c3f7466	AMDGPU: Cleanup suggested in bug 23960 llvm-svn: 262456	2016-03-02 04:05:14 +00:00
Matt Arsenault	ce793849bc	Bug 20810: Use report_fatal_error instead of unreachable llvm-svn: 262455	2016-03-02 03:33:55 +00:00
Sanjoy Das	097e45c701	Add a comment with a rationale for the unusual code structure llvm-svn: 262454	2016-03-02 02:56:29 +00:00
Sanjoy Das	f24997d09c	Qualify getRangeForAffineAR with this-> for MSVC llvm-svn: 262453	2016-03-02 02:44:08 +00:00
George Burgess IV	872ab3815a	Attempt to fix ASAN failure in a MemorySSA test. llvm-svn: 262452	2016-03-02 02:35:04 +00:00
Sanjoy Das	ab9ef01214	Perturb code in an attempt to appease MSVC For some reason MSVC seems to think I'm calling getConstant() from a static context. Try to avoid this issue by explicitly specifying 'this->' (though I'm not confident that this will actually work). llvm-svn: 262451	2016-03-02 02:34:20 +00:00
Sanjoy Das	5ad27c32eb	More code permutation to appease MSVC llvm-svn: 262449	2016-03-02 02:15:42 +00:00
Sanjoy Das	559acdcb2c	Remove "auto" to appease the MSVC bots llvm-svn: 262448	2016-03-02 01:59:37 +00:00
Matt Arsenault	ceef9d2175	DAGCombiner: Make sure an integer is being truncated llvm-svn: 262446	2016-03-02 01:36:51 +00:00
Sanjay Patel	afbc4f1551	revert r262424 because there's a clang test for AArch64 that checks -O3 asm output that is broken by this change llvm-svn: 262440	2016-03-02 01:04:09 +00:00
Daniel Berlin	3242d3bab3	Fix SHARED_LIBS build llvm-svn: 262439	2016-03-02 00:58:48 +00:00
Sanjoy Das	88f19f877b	[SCEV] Make getRange smarter around selects Have ScalarEvolution::getRange re-consider cases like "{C?A:B,+,C?P:Q}" by factoring out "C" and computing RangeOf({A,+,P}) union RangeOf({B,+,Q}) instead. The latter can be easier to compute precisely in cases like "{C?0:N,+,C?1:-1}" where N is the backedge taken count of the loop, since in such cases the latter form simplifies to [0,N+1) union [0,N+1). llvm-svn: 262438	2016-03-02 00:57:54 +00:00
Sanjoy Das	dc721f4ae2	[SCEV] Extract out a getRangeForAffineAR; NFC Pure code-motion change. Will be used later in making getRange more clever. llvm-svn: 262437	2016-03-02 00:57:39 +00:00
Chris Bieneman	011a9c3522	[CMake] Add convenience target llvm-test-depends to build test dependencies. This is useful when paired with the distribution targets to build prerequisites for running tests. llvm-svn: 262428	2016-03-02 00:27:14 +00:00
Chris Bieneman	d204338a75	[CMake] Add distribution target that is the "just-build" side of install-distribution This is just a convenience target to allow limiting what you build. llvm-svn: 262427	2016-03-02 00:27:12 +00:00
Sanjay Patel	7266aa16e2	[InstCombine] convert 'isPositive' and 'isNegative' vector comparisons to shifts (PR26701) As noted in the code comment, I don't think we can do the same transform that we do for scalar integers comparisons to vector integers comparisons because it might pessimize the general case. Exhibit A for an incomplete integer comparison ISA remains x86 SSE/AVX: it only has EQ and GT for integer vectors. But we should now recognize all the variants of this construct and produce the optimal code for the cases shown in: https://llvm.org/bugs/show_bug.cgi?id=26701 llvm-svn: 262424	2016-03-01 23:55:18 +00:00
Dehao Chen	9581e2d337	Perform InstructionCombiningPass before SampleProfile pass. Summary: SampleProfile pass needs to be performed after InstructionCombiningPass, which helps eliminate un-inlinable function calls. Reviewers: davidxl, dnovillo Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D17742 llvm-svn: 262419	2016-03-01 22:53:02 +00:00
Kostya Serebryany	96af1208c1	[libFuzzer] deprecate exit_on_first flag llvm-svn: 262417	2016-03-01 22:33:14 +00:00
David Blaikie	8ddd12bfcb	llvm-dwp: Add missing copyright notice to llvm-dwp.cpp Addressing feedback on IRC by Sean Silva. llvm-svn: 262416	2016-03-01 22:29:00 +00:00
Kostya Serebryany	d5755334e5	[libFuzzer] add generic signal handlers so that libFuzzer can report at least something if ASan is not handling the signals for us. Remove abort_on_timeout flag. llvm-svn: 262415	2016-03-01 22:19:21 +00:00
Simon Pilgrim	9cb02b1f3e	[X86][SSE41] Added missing fast-isel intrinsics tests Match IR generated in clang/test/CodeGen/sse41-builtins.c llvm-svn: 262412	2016-03-01 22:05:05 +00:00
Colin LeMahieu	7f4d873f79	[NFC] Convert tabs to spaces. llvm-svn: 262411	2016-03-01 22:05:03 +00:00
Simon Pilgrim	0ed267efba	[X86][XOP] Regenerated intrinsics tests llvm-svn: 262410	2016-03-01 21:58:50 +00:00
Matthias Braun	9215ed1dcf	AArch64: Reenable CompleteModel for A53, A57 and Kryo models The fixes in r262393 completed them as well. llvm-svn: 262408	2016-03-01 21:55:35 +00:00
Simon Pilgrim	aff08276e8	[X86][AVX2] Regenerated 256-bit vector / 64-bit element permute tests llvm-svn: 262406	2016-03-01 21:53:12 +00:00
Tim Northover	352af6b0a6	Fix typo. NFC. llvm-svn: 262405	2016-03-01 21:45:22 +00:00

128325 Commits