llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Joseph Tremoulet	de5c9a8723	[Inliner/WinEH] Honor implicit nounwinds Summary: Funclet EH tables require that a given funclet have only one unwind destination for exceptional exits. The verifier will therefore reject e.g. two cleanuprets with different unwind dests for the same cleanup, or two invokes exiting the same funclet but to different unwind dests. Because catchswitch has no 'nounwind' variant, and because IR producers are not required to annotate calls which will not unwind as 'nounwind', it is legal to nest a call or an "unwind to caller" catchswitch within a funclet pad that has an unwind destination other than caller; it is undefined behavior for such a call or catchswitch to unwind. Normally when inlining an invoke, calls in the inlined sequence are rewritten to invokes that unwind to the callsite invoke's unwind destination, and "unwind to caller" catchswitches in the inlined sequence are rewritten to unwind to the callsite invoke's unwind destination. However, if such a call or "unwind to caller" catchswitch is located in a callee funclet that has another exceptional exit with an unwind destination within the callee, applying the normal transformation would give that callee funclet multiple unwind destinations for its exceptional exits. There would be no way for EH table generation to determine which is the "true" exit, and the verifier would reject the function accordingly. Add logic to the inliner to detect these cases and leave such calls and "unwind to caller" catchswitches as calls and "unwind to caller" catchswitches in the inlined sequence. This fixes PR26147. Reviewers: rnk, andrew.w.kaylor, majnemer Subscribers: alexcrichton, llvm-commits Differential Revision: http://reviews.llvm.org/D16319 llvm-svn: 258273	2016-01-20 02:15:15 +00:00
Tom Stellard	936d0be6f9	AMDGPU/SI: Prevent the DAGCombiner from creating setcc with i1 inputs Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15035 llvm-svn: 258256	2016-01-20 00:13:22 +00:00
Sanjoy Das	d2d9b2b709	[MachineSink] Don't break ImplicitNulls Summary: This teaches MachineSink to not sink instructions that might break the implicit null check optimization that runs later. This should not affect frontends that do not use implicit null checks. Reviewers: aadg, reames, hfinkel, atrick Subscribers: majnemer, llvm-commits Differential Revision: http://reviews.llvm.org/D14632 llvm-svn: 258254	2016-01-20 00:06:14 +00:00
Quentin Colombet	2fd9288cb1	[X86] Do not run shrink-wrapping on function with split-stack attribute or HiPE calling convention. The implementation of the related callbacks in the x86 backend for such functions are not ready to deal with a prologue block that is not the entry block of the function. This fixes PR26107, but the longer term solution would be to fix those callbacks. llvm-svn: 258221	2016-01-19 23:29:03 +00:00
Sanjay Patel	691c821001	add tests to show missing memset/malloc optimizations (PR25892) llvm-svn: 258218	2016-01-19 23:07:10 +00:00
David Majnemer	7c946c0aa3	[MC, COFF] Add .reloc support for WinCOFF This adds rudimentary support for a few relocations that we will use for the CodeView debug format. llvm-svn: 258216	2016-01-19 23:05:27 +00:00
Simon Pilgrim	709595fe14	[X86][SSE] Add VZEXT_MOVL target shuffle decoding. Add support for decoding VZEXT_MOVL target shuffle masks, allowing it to be used as a source in target shuffle combines. llvm-svn: 258215	2016-01-19 23:04:56 +00:00
Simon Pilgrim	f1e3dd87e3	[X86][SSE] Add INSERTPS target shuffle combines. As vector shuffles can only reference two inputs many (V)INSERTPS patterns end up being split over two targets shuffles. This patch adds combines to attempt to combine (V)INSERTPS nodes with input/output nodes that are just zeroing out these additional vector elements. Differential Revision: http://reviews.llvm.org/D16072 llvm-svn: 258205	2016-01-19 22:24:12 +00:00
Sanjoy Das	c6887c7e27	[SCEV] Fix PR26207 In some cases, the max backedge taken count can be more conservative than the exact backedge taken count (for instance, because ScalarEvolution::getRange is not control-flow sensitive whereas computeExitLimitFromICmp can be). In these cases, computeExitLimitFromCond (specifically the bit that deals with `and` and `or` instructions) can create an ExitLimit instance with a `SCEVCouldNotCompute` max backedge count expression, but a computable exact backedge count expression. This violates an implicit SCEV assumption: a computable exact BE count should imply a computable max BE count. This change - Makes the above implicit invariant explicit by adding an assert to ExitLimit's constructor - Changes `computeExitLimitFromCond` to be more robust around conservative max backedge counts llvm-svn: 258184	2016-01-19 20:53:51 +00:00
Michael Zuckerman	553ef84e85	[AVX512] Adding VPERMT2B and VPERMI2B instruction . Differential Revision: http://reviews.llvm.org/D16297 llvm-svn: 258161	2016-01-19 18:47:02 +00:00
Sanjay Patel	a2ab3d6165	[LibCallSimplifier] use instruction-level fast-math-flags to shrink calls This is a continuation of adding FMF to call instructions: http://reviews.llvm.org/rL255555 llvm-svn: 258158	2016-01-19 18:38:52 +00:00
Sanjay Patel	a46637dede	[LibCallSimplifier] use instruction-level fast-math-flags to transform pow(x, [small integer]) calls This is a continuation of adding FMF to call instructions: http://reviews.llvm.org/rL255555 As with D15937, the intent of the patch is to preserve the current behavior of the transform except that we use the pow call's 'fast' attribute as a trigger rather than a function-level attribute. The TODO comment notes a potential follow-on patch that would propagate FMF to the new instructions. Differential Revision: http://reviews.llvm.org/D16122 llvm-svn: 258153	2016-01-19 18:15:12 +00:00
Michael Zuckerman	71a84dc5a5	[AVX512] Adding VPERMB instruction Differential Revision: http://reviews.llvm.org/D16294 llvm-svn: 258144	2016-01-19 17:07:43 +00:00
Dan Gohman	fb437bc669	[WebAssembly] Rematerialize constants rather than hold them live in registers. Teach the register stackifier to rematerialize constants that have multiple uses instead of leaving them in registers. In the WebAssembly encoding, it's the same code size to materialize most constants as it is to read a value from a register. llvm-svn: 258142	2016-01-19 16:59:23 +00:00
Dan Gohman	7a898cc00d	[WebAssembly] Change a FIXME to a TODO in a comment. llvm-svn: 258139	2016-01-19 16:52:50 +00:00
Dan Gohman	1a53c68f6a	[WebAssembly] Re-enable this test, now that interactions with the coalescer are resolved. llvm-svn: 258138	2016-01-19 16:52:09 +00:00
Marina Yatsina	adac739033	[X86] Add support for "xlat m8" According to x86 spec "xlat m8" is a legal instruction and it is equivalent to "xlatb". Differential Revision: http://reviews.llvm.org/D15150 llvm-svn: 258135	2016-01-19 16:35:38 +00:00
Manuel Jacob	a2e0ca38ae	Fix constant folding of constant vector GEPs with undef or null as pointer argument. Reviewers: eddyb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D16321 llvm-svn: 258134	2016-01-19 16:34:31 +00:00
Marina Yatsina	d7dac8fde4	[X86] Adding support for missing variations of X86 string related instructions The following are legal according to X86 spec: ins mem, DX outs DX, mem lods mem stos mem scas mem cmps mem, mem movs mem, mem Differential Revision: http://reviews.llvm.org/D14827 llvm-svn: 258132	2016-01-19 15:37:56 +00:00
Dan Gohman	e8c29f17af	[WebAssembly] Re-enable loop idiom recognition for memcpy et al. llvm-svn: 258125	2016-01-19 14:49:23 +00:00
Asaf Badouh	19e99238a0	[X86][AVX512]fix dag & add intrinsics for fixupimm cover all width and types (pd/ps/sd/ss) of fixupimm instruction and inrtinsics Differential Revision: http://reviews.llvm.org/D16313 llvm-svn: 258124	2016-01-19 14:21:39 +00:00
Tobias Edler von Koch	c19c96e06f	[LTO] Restore original linkage of externals prior to splitting Summary: This is a companion patch for http://reviews.llvm.org/D16124. Internalized symbols increase the size of strongly-connected components in SCC-based module splitting and thus reduce the amount of parallelism. This patch records the original linkage of non-local symbols prior to internalization and then restores it just before splitting/CodeGen. This is also useful for cases where the linker requires symbols to remain external, for instance, so they can be placed according to linker script rules. It's currently under its own flag (-restore-globals) but should eventually share a common flag with D16124. Reviewers: joker.eph, pcc Subscribers: slarin, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D16229 llvm-svn: 258100	2016-01-18 23:24:54 +00:00
Matt Arsenault	348623d27f	AMDGPU: Reduce 64-bit SRAs llvm-svn: 258096	2016-01-18 22:09:04 +00:00
Matt Arsenault	862bf93c73	AMDGPU: Split 64-bit and of constant up This breaks the tests that were meant for testing 64-bit inline immediates, so move those to shl where they won't be broken up. This should be repeated for the other related bit ops. llvm-svn: 258095	2016-01-18 22:01:13 +00:00
Simon Pilgrim	6dde57e745	[X86][AVX2] Ensure integer execution domain for integer blend tests llvm-svn: 258094	2016-01-18 21:58:21 +00:00
Matt Arsenault	97aeb607e4	AMDGPU: Generalize shl combine Reduce 64-bit shl with constant > 32. We already special cased this for the == 32 case, but this also works for any >= 32 constant. llvm-svn: 258092	2016-01-18 21:55:14 +00:00
Simon Pilgrim	5bbdf4402d	[X86][SSE] Regenerate vector blend commutation tests llvm-svn: 258091	2016-01-18 21:46:46 +00:00
Matt Arsenault	e1a6e6ae7f	AMDGPU: Reduce 64-bit lshr by constant to 32-bit 64-bit shifts are very slow on some subtargets. llvm-svn: 258090	2016-01-18 21:43:36 +00:00
Davide Italiano	410fab022e	[JIT] Add small-code model test for ELF. The coverage is almost non-existent, hopefully more will come after this. Differential Revision: http://reviews.llvm.org/D16096 llvm-svn: 258087	2016-01-18 21:14:12 +00:00
Matt Arsenault	7414572d7c	AMDGPU: Cleanup sra test llvm-svn: 258086	2016-01-18 21:13:56 +00:00
Sergei Larin	72115d5fb6	Add to the split module utility an SCC based method which allows not to globalize any local variables. Summary: Currently llvm::SplitModule as the first step globalizes all local objects, which might not be desirable in some scenarios. This change adds a new flag to llvm::SplitModule that uses SCC approach to search for a balanced partition without the need to externalize symbols. Such partition might not be possible or fully balanced for a given number of partitions, and is a function of the module properties (global/local dependencies within the module). Joint development Tobias Edler von Koch (tobias@codeaurora.org) and Sergei Larin (slarin@codeaurora.org) Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D16124 llvm-svn: 258083	2016-01-18 21:07:13 +00:00
Simon Pilgrim	4c1241282f	[X86][AVX2] Broadcast subvectors AVX2 can only broadcast from the zero'th element of a vector, but if the broadcastable element is the zero'th element of a 128-bit subvector its advantageous to extract the subvector, broadcast from that and avoid the loading of shuffle mask data that would be needed for VPERMPS/VPERMD. The only exception being when the source type is 4f64 or 4i64 which can directly use the immediate shuffle VPERMPD/VPERMQ directly. Differential Revision: http://reviews.llvm.org/D16050 llvm-svn: 258081	2016-01-18 20:59:04 +00:00
Igor Breger	7327a3bf3b	AVX512: Masked store intrinsic implementation. Implemented intrinsic for the follow instructions (store) : VMOVDQU8/16/32/64, VMOVDQA32/64, VMOVAPS/PD, VMOVUPS/PD. Differential Revision: http://reviews.llvm.org/D16271 llvm-svn: 258047	2016-01-18 13:52:57 +00:00
Igor Breger	74d74d20c2	AVX512 : Change v8i1 bitconvert GR8 pattern, remove unnecessary movzbl instruction. code example , previous implementation. movzbl %dil, %eax kmovw %eax, %k0 new code kmovw %edi, %k0 Differential Revision: http://reviews.llvm.org/D16287 llvm-svn: 258045	2016-01-18 12:02:45 +00:00
Oliver Stannard	ec1b7475d8	[ARM] Operands for PKHTB alias should be swapped When the shift immediate is zero, PKHTB is an alias for PKHBT, but the order of the input operands needs to be swapped. Differential Revision: http://reviews.llvm.org/D16288 llvm-svn: 258044	2016-01-18 11:56:35 +00:00
Sanjoy Das	aa011535f3	[IndVars] Fix PR25576 `LCSSASafePhiForRAUW` as computed was incorrect -- in cases like these (this exact example does not actually trigger the bug): define i32 @f(i32 %n, i1* %c) { entry: br label %outer.loop outer.loop: br label %inner.loop inner.loop: %iv = phi i32 [ 0, %outer.loop ], [ %iv.inc, %inner.loop ] %iv.inc = add nuw nsw i32 %iv, 1 %tc = udiv i32 %n, 13 %be.cond = icmp ult i32 %iv, %tc br i1 %be.cond, label %inner.loop, label %inner.exit inner.exit: %iv.lcssa = phi i32 [ %iv, %inner.loop ] %outer.be.cond = load volatile i1, i1* %c br i1 %outer.be.cond, label %outer.loop, label %leave leave: %iv.lcssa.lcssa = phi i32 [ %iv.lcssa, %inner.exit ] ret i32 %iv.lcssa.lcssa } `LCSSASafePhiForRAUW` is true for `%iv.lcssa` when re-rewriting the exit value of `%iv` for `%inner.loop` to `%tc` (this can happen due to `SCEVExpander::findExistingExpansion`), but the RAUW breaks LCSSA. To fix this, instead of computing `SafePhi` with special logic, decide the safety of RAUW directly via `replacementPreservesLCSSAForm`. llvm-svn: 258016	2016-01-17 18:12:52 +00:00
Simon Pilgrim	cfe1acc838	[X86][AVX512] Regenerate v1 shuffle tests llvm-svn: 258013	2016-01-17 14:53:17 +00:00
Artur Pilipenko	bb5abf9eb3	Push isDereferenceableAndAlignedPointer down into isSafeToLoadUnconditionally Reviewed By: reames Differential Revision: http://reviews.llvm.org/D16226 llvm-svn: 258010	2016-01-17 12:35:29 +00:00
Michael Zuckerman	04a3249a24	[AVX512] Adding VPERMW/D/Q VPERMPS/D Intrinsics Differential Revision: http://reviews.llvm.org/D16189 llvm-svn: 258008	2016-01-17 11:33:29 +00:00
Michael Zuckerman	365c9dfcf3	[AVX512] Adding VPERMQ VPERMPD Intrinsics Differential Revision: http://reviews.llvm.org/D16194 llvm-svn: 258006	2016-01-17 08:32:14 +00:00
Lang Hames	944463fffc	Remove some stale comments and fix a typo as suggested by David Blaikie in his review of r257343. Thanks Dave! llvm-svn: 258002	2016-01-17 01:49:46 +00:00
Simon Atanasyan	8a2e38211d	[llvm-readobj][ELF] Teach llvm-readobj to show dynamic relocation in REL format MIPS 32-bit ABI uses REL relocation record format to save dynamic relocations. The patch teaches llvm-readobj to show dynamic relocations in this format. Differential Revision: http://reviews.llvm.org/D16114 llvm-svn: 258001	2016-01-16 22:40:09 +00:00
Simon Pilgrim	5698ba2abd	[X86][AVX] Enable extraction of upper 128-bit subvectors for 'half undef' shuffle lowering Added support for the extraction of the upper 128-bit subvectors for lower/upper half undef shuffles if it would reduce the number of extractions/insertions or avoid loads of AVX2 permps/permd shuffle masks. Minor follow up to D15477. llvm-svn: 258000	2016-01-16 22:30:20 +00:00
Simon Pilgrim	65330864cb	[X86][SSE] Added extra 'float3' consecutive load tests llvm-svn: 257998	2016-01-16 19:53:33 +00:00
Manman Ren	efc840cd69	CXX_FAST_TLS calling convention: fix issue on x86-64. %RBP can't be handled explicitly. We generate the following code: pushq %rbp movq %rsp, %rbp ... movq %rbx, (%rbp) ## 8-byte Spill where %rbp will be overwritten by the spilled value. The fix is to let PEI handle %RBP. PR26136 llvm-svn: 257997	2016-01-16 16:39:46 +00:00
Simon Pilgrim	530550f33c	[X86][SSE] Regenerated SSE4 CRC32 and v2i64 comparison tests llvm-svn: 257996	2016-01-16 15:41:42 +00:00
Simon Pilgrim	886ec3f33e	[X86][AVX] Regenerated AVX tests Updated i1 select, vector truncation and subvector extraction tests llvm-svn: 257995	2016-01-16 15:25:02 +00:00
Simon Pilgrim	01fb5769b9	[X86]AVX] Tidyup shift/splat tests Missing comments, fixed bad word wrapping llvm-svn: 257993	2016-01-16 15:13:58 +00:00
Simon Pilgrim	0775d1e714	[X86][SSE] Regenerated HADD/HSUB tests llvm-svn: 257992	2016-01-16 14:03:40 +00:00
Igor Laevsky	1f8ac9245d	[BasicAliasAnalysis] Take into account operand bundles in the getModRefInfo function Differential Revision: http://reviews.llvm.org/D16225 llvm-svn: 257991	2016-01-16 12:15:53 +00:00

1 2 3 4 5 ...

34011 Commits