llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Craig Topper	8bbc225e62	[AVX-512] Promote 512-bit integer loads to v8i64 similar to what is done for 128/256-bit vectors for overall consistency. llvm-svn: 278318	2016-08-11 06:04:07 +00:00
Craig Topper	27e352c1ed	[AVX-512] Add patterns to allow EVEX encoded stores of v16i16/v8i16/v16i8/v32i8 even when BWI is not supported. llvm-svn: 278317	2016-08-11 06:04:04 +00:00
Craig Topper	e8b95618e0	[AVX-512] Fix the 128-bit and 256-bit nontemporal load patterns with elements type other than i64. These loads have all been promoted to v2i64/v4i64 loads so we need bitcasts or we end up selecting VMOVDQA32/VMOVDQU32 instead. llvm-svn: 278316	2016-08-11 06:04:00 +00:00
Dominic Chen	f6f7f50802	[WebAssembly] Cleanup trailing whitespace Summary: Test for commit access. Subscribers: jfb, dschuff Differential Revision: https://reviews.llvm.org/D23392 llvm-svn: 278313	2016-08-11 04:10:56 +00:00
Tim Northover	cd8fd28f8c	GlobalISel: implement simple function calls on AArch64. We're still limited in the arguments we support, but this at least handles the basic cases. llvm-svn: 278293	2016-08-10 21:44:01 +00:00
Changpeng Fang	179532ade9	AMDGPU/SI: Implement amdgcn image intrinsics with sampler Summary: This patch defines and implements amdgcn image intrinsics with sampler. 1. define vdata type to be llvm_anyfloat_ty, address type to be llvm_anyfloat_ty, and rsrc type to be llvm_anyint_ty. As a result, we expect the intrinsics name to have three suffixes to overload each of these three types; 2. D128 as well as two other flags are implied in the three types, for example, if you use v8i32 as resource type, then r128 is 0! 3. don't expose TFE flag, and other flags are exposed in the instruction order: unrm, glc, slc, lwe and da. Differential Revision: http://reviews.llvm.org/D22838 Reviewed by: arsenm and tstellarAMD llvm-svn: 278291	2016-08-10 21:15:30 +00:00
Matt Arsenault	cb12ba3447	AMDGPU: s_setpc_b64 should be an indirect branch llvm-svn: 278278	2016-08-10 19:20:02 +00:00
Matt Arsenault	806c7ea5a9	AMDGPU: Set sizes on control flow pseudos llvm-svn: 278276	2016-08-10 19:11:51 +00:00
Matt Arsenault	8eb3b846e7	AMDGPU: Remove empty file comment llvm-svn: 278275	2016-08-10 19:11:48 +00:00
Matt Arsenault	b6ebde3d1d	AMDGPU: Remove unnecessary cast llvm-svn: 278274	2016-08-10 19:11:45 +00:00
Matt Arsenault	5267e59706	AMDGPU: Change insertion point of si_mask_branch Insert before the skip branch if one is created. This is a somewhat more natural placement relative to the skip branches, and makes it possible to implement analyzeBranch for skip blocks. The test changes are mostly due to a quirk where the block label is not emitted if there is a terminator that is not also a branch. llvm-svn: 278273	2016-08-10 19:11:42 +00:00
Matt Arsenault	d534109bca	AMDGPU: Use CreateStackObject instead of CreateSpillStackObject I'm not sure what the difference is, but no other target uses this for emergency spill slots. llvm-svn: 278272	2016-08-10 19:11:36 +00:00
Sanjay Patel	eb88a9636a	[x86, AVX] allow FP vector select folding to bitwise logic ops (PR28895) This handles the case in: https://llvm.org/bugs/show_bug.cgi?id=28895 ...but we are not getting all of the possibilities yet. Eg, we use 'X86::FANDN' for scalar FP select combines. That enhancement is filed as: https://llvm.org/bugs/show_bug.cgi?id=28925 Differential Revision: https://reviews.llvm.org/D23337 llvm-svn: 278270	2016-08-10 19:00:11 +00:00
Krzysztof Parzyszek	8197d268d7	[Hexagon] Remove unused variants of LO/HI instructions llvm-svn: 278266	2016-08-10 18:40:36 +00:00
Simon Pilgrim	2ca47e753e	[X86][SSE] Dropped blend(insertps(x,y),zero) combine - this is now handled by target shuffle chain combining llvm-svn: 278260	2016-08-10 18:10:29 +00:00
Krzysztof Parzyszek	458d8ce010	[Hexagon] Simplify the SplitConst32/64 pass llvm-svn: 278256	2016-08-10 18:05:47 +00:00
Krzysztof Parzyszek	bdc1668cd8	[Hexagon] Add extra patterns for single-precision min/max instructions llvm-svn: 278252	2016-08-10 17:56:24 +00:00
Krzysztof Parzyszek	57fa692f90	[Hexagon] Fix table-gen decode conflict warnings for CONST32/64 llvm-svn: 278247	2016-08-10 17:22:24 +00:00
Krzysztof Parzyszek	631100a1eb	[Hexagon] Use integer instructions for floating point immediates Floating point instructions use general purpose registers, so the few instructions that can put floating point immediates into registers are, in fact, integer instructions. Use them explicitly instead of having pseudo-instructions specifically for dealing with floating point values. Simplify the constant loading instructions (from sdata) to have only two: one for 32-bit values and one for 64-bit values: CONST32 and CONST64. llvm-svn: 278244	2016-08-10 16:46:36 +00:00
Roger Ferrer Ibanez	4597590001	Fix build break of VS 2013 debug builds In debug mode extra macros are enabled for several C++ algorithms. Some of them may cause unfortunate build failures. This commit adds a redundant operator() to work around one of those troublesome macros which was hit accidentally by change r278012. llvm-svn: 278241	2016-08-10 16:39:58 +00:00
Krzysztof Parzyszek	fc9436e726	[Hexagon] Delete HexagonSelectCCInfo.td This file is not used. The location assignment of call arguments and return values is implemented directly in HexagonISelLowering. llvm-svn: 278237	2016-08-10 16:23:53 +00:00
Krzysztof Parzyszek	b6174c9c27	[Hexagon] Remove unneeded/unused ISD opcodes ARGEXTEND and FCONST32 llvm-svn: 278236	2016-08-10 16:20:33 +00:00
Simon Pilgrim	534f12b752	[X86][SSE] Add support for combining target shuffles to MOVSS/MOVSD Only do this on pre-SSE41 targets where we should be lowering to BLENDPS/BLENDPD instead llvm-svn: 278228	2016-08-10 14:15:41 +00:00
Simon Pilgrim	c0a2a41fc2	[X86][SSE] Only treat SM_SentinelUndef as UNDEF in shuffle mask predicates isUndefOrEqual and isUndefOrInRange treated all -ve shuffle mask values as UNDEF, now it has to be SM_SentinelUndef (-1) We already have asserts to check that lowered SHUFFLE_VECTOR indices are in the range -1 <= index < 2*masksize (or masksize for unary shuffles) llvm-svn: 278218	2016-08-10 12:55:25 +00:00
Simon Pilgrim	cb61cc0978	[X86][SSE] Reorder shuffle mask undef helper predicates. NFCI To make it easier for a more complex helper to use a simpler one llvm-svn: 278216	2016-08-10 12:34:23 +00:00
Sam Parker	306d4457fd	[ARM] Improve sxta{b|h} and uxta{b|h} tests Created a Thumb2 predicated pattern matcher that uses Thumb2 and HasT2ExtractPack and used it to redefine the patterns for sxta{b|h} and uxta{b|h}. Also used the similar patterns to fill in isel pattern gaps for the corresponding instructions in the ARM backend. The patch is mainly changes to tests since most of this functionality appears not to have been tested. Differential Revision: https://reviews.llvm.org/D23273 llvm-svn: 278207	2016-08-10 09:34:34 +00:00
Derek Schuff	f1541fc8c4	[WebAssembly] Add -emscripten-cxx-exceptions-whitelist option This patch adds -emscripten-cxx-exceptions-whitelist option to WebAssemblyLowerEmscriptenExceptions pass. This option is the list of function names in which Emscripten-style exception handling is enabled. This is to support emscripten's EXCEPTION_CATCHING_WHITELIST which exists because of the performance impact of emscripten's non-zero-cost EH method. Patch by Heejin Ahn Differential Revision: https://reviews.llvm.org/D23292 llvm-svn: 278171	2016-08-09 22:37:00 +00:00
David Majnemer	a3d1356e37	[X86] Don't model UD2/UD2B as a terminator A UD2 might make its way into the program via a call to @llvm.trap. Obviously, calls are not terminators. However, we modeled the X86 instruction, UD2, as a terminator. Later on, this confuses the epilogue insertion machinery which results in the epilogue getting inserted before the UD2. For some platforms, like x64, the result is a violation of the ABI. Instead, model UD2/UD2B as a side effecting instruction which may observe memory. llvm-svn: 278144	2016-08-09 17:55:12 +00:00
Simon Pilgrim	49b1b28839	[X86][XOP] Add support for combining target shuffles to VPERMIL2PD/VPERMIL2PS llvm-svn: 278120	2016-08-09 12:56:15 +00:00
Simon Pilgrim	a91df40f10	[X86][XOP] Add support for combining target shuffles to VPPERM llvm-svn: 278114	2016-08-09 10:56:29 +00:00
Dean Michael Berris	5440672d34	[XRay] Test for xray_instr_map in object file. (NFC) This makes a trivial change in the emission of the per-function XRay tables, and makes sure that the xray_instr_map section does show up in the object file. llvm-svn: 278113	2016-08-09 10:42:11 +00:00
Simon Pilgrim	d111e686cc	[X86][SSE] Fix memory folding of (v)roundsd / (v)roundss We only had partial memory folding support for the intrinsic definitions, and (as noted on PR27481) was causing FR32/FR64/VR128 mismatch errors with the machine verifier. This patch adds missing memory folding support for both intrinsics and the ffloor/fnearbyint/fceil/frint/ftrunc patterns and in doing so fixes the failing machine verifier stack folding tests from PR27481. Differential Revision: https://reviews.llvm.org/D23276 llvm-svn: 278106	2016-08-09 09:32:34 +00:00
Craig Topper	08a5d0387c	[X86] Reduce duplicated code in the execution domain lookup functions by passing tables as an argument. llvm-svn: 278098	2016-08-09 05:26:09 +00:00
Craig Topper	edce0939ff	[AVX-512] Add support for execution domain switching masked logical ops between floating point and integer domain. This switches PS<->D and PD<->Q. llvm-svn: 278097	2016-08-09 05:26:07 +00:00
Craig Topper	553c9adc29	[X86] Remove the Fv packed logical operation alias instructions. Replace them with patterns to the regular instructions. This enables execution domain fixing which is why the tests changed. llvm-svn: 278090	2016-08-09 03:06:33 +00:00
Craig Topper	52d9d29633	[X86] Cleanup patterns for AVX/SSE for PS operations. Always try to look for bitcasts from floating point types. If only AVX1 is supported we also need to handle integer types with floating point ops without looking for bitcasts. Previously SSE1 had a pattern that looked for integer types without bitcasts, but the type wasn't legal with only SSE1 and SSE2 add an identical pattern for the integer instructions. llvm-svn: 278089	2016-08-09 03:06:28 +00:00
Craig Topper	95fd988f3f	[X86] Remove unnecessary bitcast from the front of AVX1Only 256-bit logical operation patterns. llvm-svn: 278088	2016-08-09 03:06:26 +00:00
Matthias Braun	d8da32e94b	X86InstrInfo: Update liveness in classifyLea() We need to update liveness information when we create COPYs in classifyLea(). This fixes http://llvm.org/28301 llvm-svn: 278086	2016-08-09 01:47:26 +00:00
Derek Schuff	4212b80989	[WebAssembly] Fix bugs in WebAssemblyLowerEmscriptenExceptions pass * Delete extra '_' prefixes from JS library function names. fixImports() function in JS glue code deals with this for wasm. * Change command-line option names in order to be consistent with asm.js. * Add missing lowering code for llvm.eh.typeid.for intrinsics * Delete commas in mangled function names * Fix a function argument attributes bug. Because we add the pointer to the original callee as the first argument of invoke wrapper, all argument attribute indices have to be incremented by one. Patch by Heejin Ahn Differential Revision: https://reviews.llvm.org/D23258 llvm-svn: 278081	2016-08-09 00:29:55 +00:00
Sanjay Patel	1b873729aa	[x86] split combineVSelectWithAllOnesOrZeros into a helper function; NFCI llvm-svn: 278074	2016-08-09 00:01:11 +00:00
Charles Davis	2ca27b7279	Revert "[X86] Support the "ms-hotpatch" attribute." This reverts commit r278048. Something changed between the last time I built this--it takes awhile on my ridiculously slow and ancient computer--and now that broke this. llvm-svn: 278053	2016-08-08 21:20:15 +00:00
Charles Davis	24439f8d33	[X86] Support the "ms-hotpatch" attribute. Summary: Based on two patches by Michael Mueller. This is a target attribute that causes a function marked with it to be emitted as "hotpatchable". This particular mechanism was originally devised by Microsoft for patching their binaries (which they are constantly updating to stay ahead of crackers, script kiddies, and other ne'er-do-wells on the Internet), but is now commonly abused by Windows programs to hook API functions. This mechanism is target-specific. For x86, a two-byte no-op instruction is emitted at the function's entry point; the entry point must be immediately preceded by 64 (32-bit) or 128 (64-bit) bytes of padding. This padding is where the patch code is written. The two byte no-op is then overwritten with a short jump into this code. The no-op is usually a `movl %edi, %edi` instruction; this is used as a magic value indicating that this is a hotpatchable function. Reviewers: majnemer, sanjoy, rnk Subscribers: dberris, llvm-commits Differential Revision: https://reviews.llvm.org/D19908 llvm-svn: 278048	2016-08-08 21:01:39 +00:00
Krzysztof Parzyszek	24135ad35e	[Hexagon] Add pattern for 64-bit mulhs llvm-svn: 278040	2016-08-08 19:24:25 +00:00
Nirav Dave	a2892f3e2c	[X86] Improve code size on X86 segment moves A move of a value to a segment register from a 16-bit register is equivalent to one from its corresponding 32-bit register. Match gas's behavior and rewrite instructions to the shorter of equivalent forms. Reviewers: rnk, ab Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D23166 llvm-svn: 278031	2016-08-08 18:01:04 +00:00
Oliver Stannard	ffdf511cf4	[ARM] Add support for embedded position-independent code This patch adds support for some new relocation models to the ARM backend: * Read-only position independence (ROPI): Code and read-only data is accessed PC-relative. The offsets between all code and RO data sections are known at static link time. This does not affect read-write data. * Read-write position independence (RWPI): Read-write data is accessed relative to the static base register (r9). The offsets between all writeable data sections are known at static link time. This does not affect read-only data. These two modes are independent (they specify how different objects should be addressed), so they can be used individually or together. They are otherwise the same as the "static" relocation model, and are not compatible with SysV-style PIC using a global offset table. These modes are normally used by bare-metal systems or systems with small real-time operating systems. They are designed to avoid the need for a dynamic linker, the only initialisation required is setting r9 to an appropriate value for RWPI code. I have only added support to SelectionDAG, not FastISel, because FastISel is currently disabled for bare-metal targets where these modes would be used. Differential Revision: https://reviews.llvm.org/D23195 llvm-svn: 278015	2016-08-08 15:28:31 +00:00
Zhan Jun Liau	d6608e9acc	[SystemZ] Add support for the .insn directive Summary: Add support for the .insn directive. .insn is an s390 specific directive that allows encoding of an instruction instead of using a mnemonic. The motivating case is some code in node.js that requires support for the .insn directive. Reviewers: koriakin, uweigand Subscribers: koriakin, llvm-commits Differential Revision: https://reviews.llvm.org/D21809 llvm-svn: 278012	2016-08-08 15:13:08 +00:00
Silviu Baranga	77684f300a	[AArch64] PR28877: Don't assume we're running after legalization when creating vcvtfp2fxs Summary: The DAG combine transformation that was generating the aarch64_neon_vcvtfp2fxs node was assuming that all inputs were legal and wasn't accounting for the fact that the input could be a v4f64 if we're trying to do the transformation before legalization. We now bail out in this case. All illegal types besides v4f64 were already rejected. Fixes https://llvm.org/bugs/show_bug.cgi?id=28877. Reviewers: jmolloy Subscribers: aemerson, rengolin, llvm-commits Differential Revision: https://reviews.llvm.org/D23261 llvm-svn: 278002	2016-08-08 13:13:57 +00:00
Daniel Sanders	001d17f1c7	Re-commit r277988: [mips][ias] Fix all the hacks related to MIPS-specific unary operators (%hi/%lo/%gp_rel/etc.). Hopefully with the MSVC builds fixed. I've added a missing '#include <tuple>' that gcc and clang don't seem to need. llvm-svn: 277995	2016-08-08 11:50:25 +00:00
Simon Pilgrim	2fb1eee1bf	[X86][SSE] Assert if the shuffle mask indices are not -1 or within a valid input range As discussed in post-review rL277959 llvm-svn: 277993	2016-08-08 11:07:34 +00:00
Daniel Sanders	cbe38f2a34	Revert r277988: [mips][ias] Fix all the hacks related to MIPS-specific unary operators (%hi/%lo/%gp_rel/etc.). It seems that MSVC doesn't like std::tie(). llvm-svn: 277990	2016-08-08 09:33:14 +00:00


38801 Commits