llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Craig Topper	c4ce25ed3e	[AVX512] Update tests to show shuffle decoding for vpshuflw/vpshufhw. llvm-svn: 271869	2016-06-06 05:39:07 +00:00
Simon Pilgrim	585db6f6c3	[X86][XOP] Added VPERMIL2PD/VPERMIL2PS raw mask decoding for target shuffle combines llvm-svn: 271834	2016-06-05 15:21:30 +00:00
Simon Pilgrim	94a9893eda	[X86][XOP] Added VPERMIL2PD/VPERMIL2PS as a target shuffle type llvm-svn: 271831	2016-06-05 15:01:45 +00:00
Craig Topper	2249826460	[AVX512] Add support for lowering PALIGNR for v64i8. Could do this for other types to, but this is what's needed to replace the instrinsic with native IR in clang. llvm-svn: 271828	2016-06-05 06:29:12 +00:00
Craig Topper	47e5abb616	[AVX512] Split command lines and regenerate a test to prepare for a future commit. llvm-svn: 271827	2016-06-05 06:29:08 +00:00
Craig Topper	d8c697aad5	[AVX512] Fix PANDN combining for v4i32/v8i32 when VLX is enabled. v4i32/v8i32 ANDs aren't promoted to v2i64/v4i64 when VLX is enabled. llvm-svn: 271826	2016-06-05 05:35:11 +00:00
Simon Pilgrim	2edc73fed4	[X86][XOP] Added VPERMIL2PD/VPERMIL2PS shuffle mask comment decoding llvm-svn: 271809	2016-06-04 21:44:28 +00:00
Saleem Abdulrasool	f999318a81	X86: enable TLS on Windows itanium Windows itanium is nearly identical to windows-msvc (MS ABI for C, itanium for C++). Enable the TLS support for the target similar to the MSVC model. llvm-svn: 271797	2016-06-04 18:27:22 +00:00
Simon Pilgrim	73aea916e6	[X86][AVX2] Fix v16i16 SHL lowering (PR27730) The AVX2 v16i16 shift lowering works by unpacking to 2 x v8i32, performing the shift and then truncating the result. The unpacking is used to place the values in the upper 16-bits so that we can correctly sign-extend for SRA shifts. Unfortunately we weren't ensuring that the lower 16-bits were zero to ensure that SHL correctly shifts in zero bits. llvm-svn: 271796	2016-06-04 16:45:33 +00:00
Simon Pilgrim	d5ca72a493	[X86][AVX512] Fixed 512-bit vector nontemporal load alignment llvm-svn: 271673	2016-06-03 14:12:43 +00:00
Simon Pilgrim	096c6479fc	[X86][AVX512] Added 512-bit vector nontemporal load tests llvm-svn: 271668	2016-06-03 13:42:49 +00:00
Simon Pilgrim	42c22dd5cc	[X86][SSE] Added nontemporal load tests These currently all lower to regular loads, generic nontemporal load support will be added in a future patch llvm-svn: 271659	2016-06-03 11:00:55 +00:00
Simon Pilgrim	9ac3c4e1c9	[X86] Added nontemporal scalar store tests llvm-svn: 271656	2016-06-03 10:30:54 +00:00
Simon Pilgrim	df30ac0194	[X86][SSE] Regenerated nontemporal vector store tests and added extra target types llvm-svn: 271654	2016-06-03 10:24:24 +00:00
Simon Pilgrim	f30fab0fe4	[X86] Regenerated nontemporal store tests and added tests for all 128-bit vector types llvm-svn: 271651	2016-06-03 10:15:36 +00:00
Simon Pilgrim	3bafbaa984	[X86][AVX2] Relaxed alignment on nontemporal store tests llvm-svn: 271646	2016-06-03 10:06:59 +00:00
Simon Pilgrim	c8347b15c6	[X86][AVX2] Regenerated nontemporal store tests and added tests for all 256-bit vector types llvm-svn: 271645	2016-06-03 09:56:24 +00:00
Simon Pilgrim	0c614eb08a	[X86][XOP] Support for VPERMIL2PD/VPERMIL2PS 2-input shuffle instructions This patch begins adding support for lowering to the XOP VPERMIL2PD/VPERMIL2PS shuffle instructions - adding the X86ISD::VPERMIL2 opcode and cleaning up the usage. The internal llvm intrinsics were assuming the shuffle mask operand was the same type as the float/double input operands (I guess to simplify the intrinsic definitions in X86InstrXOP.td to a single value type). These needed changing to integer types (matching the clang builtin and the AMD intrinsics definitions), an auto upgrade path is added to convert old calls. Mask decoding/target shuffle support will be added in future patches. Differential Revision: http://reviews.llvm.org/D20049 llvm-svn: 271633	2016-06-03 08:06:03 +00:00
Craig Topper	7dae6773ea	[AVX512] Ensure EVEX vpshufd, vpshuflw, and vpshufhw have isel priority over the VEX encoded ones. llvm-svn: 271629	2016-06-03 05:31:04 +00:00
Craig Topper	8371de0a99	[AVX512] Fix shuffle comment printing for EVEX encoded PSHUFD, PSHUFHW, and PSHUFLW. llvm-svn: 271628	2016-06-03 05:31:00 +00:00
Simon Pilgrim	29c4e8764f	[X86][SSE] Added SSE41/AVX2 non-temporal tests Useful for when we add MOVNTDQA support llvm-svn: 271552	2016-06-02 18:01:21 +00:00
Dimitry Andric	9beb691de0	Only attempt to detect AVG if SSE2 is available Summary: In PR29973 Sanjay Patel reported an assertion failure when a certain loop was optimized, for a target without SSE2 support. It turned out this was because of the AVG pattern detection introduced in rL253952. Prevent the assertion failure by bailing out early in `detectAVGPattern()`, if the target does not support SSE2. Also add a minimized test case. Reviewers: congh, eli.friedman, spatel Subscribers: emaste, llvm-commits Differential Revision: http://reviews.llvm.org/D20905 llvm-svn: 271548	2016-06-02 17:30:49 +00:00
Sanjay Patel	153b260453	[DAG] use getBitcast() to reduce code Although this was intended to be NFC, the test case wiggle shows a change in code scheduling/RA caused by a difference in the SDLoc() generation. Depending on how you look at it, this is the (dis)advantage of exact checking in regression tests. llvm-svn: 271526	2016-06-02 16:01:15 +00:00
Simon Pilgrim	e49fac5565	[X86][SSE] Added non-temporal load tests for vector types These currently lower to regular loads instead of MOVNTDQA llvm-svn: 271516	2016-06-02 13:51:50 +00:00
Simon Pilgrim	2e72cbb66e	[X86][SSE] Replace (V)CVTTPS2DQ and VCVTTPD2DQ truncating (round to zero) f32/f64 to i32 with generic IR (llvm) This patch removes the llvm intrinsics (V)CVTTPS2DQ and VCVTTPD2DQ truncation (round to zero) conversions and auto-upgrades to FP_TO_SINT calls instead. Note: I looked at updating CVTTPD2DQ as well but this still requires a lot more work to correctly lower. Differential Revision: http://reviews.llvm.org/D20860 llvm-svn: 271510	2016-06-02 10:55:21 +00:00
Craig Topper	f70345a66d	[X86] Add AVX 256-bit load and stores to fast isel. I'm not sure why this was missing for so long. This also exposed that we were picking floating point 256-bit VMOVNTPS for some integer types in normal isel for AVX1 even though VMOVNTDQ is available. In practice it doesn't matter due to the execution dependency fix pass, but it required extra isel patterns. Fixing that in a follow up commit. llvm-svn: 271481	2016-06-02 04:19:45 +00:00
Craig Topper	1887664778	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271478	2016-06-02 04:19:36 +00:00
Sanjay Patel	287ac2dfec	[x86, AVX2] regenerate checks llvm-svn: 271434	2016-06-01 21:32:56 +00:00
Michael Kuperstein	1e7dd66dfc	[DAG] Improve legalization of INSERT_SUBVECTOR When the index is known to be constant 0, insert directly into the the low half, instead of spilling, performing the insert in-memory, and reloading. Differential Revision: http://reviews.llvm.org/D20763 llvm-svn: 271428	2016-06-01 20:49:35 +00:00
Than McIntosh	a03aeb4a97	Better fix for PR27903. Summary: Re-enable lifetime-start-on-first-use for stack coloring, but explicitly disable it for slots with more than one start or end lifetime marker. Bug: 27903 Reviewers: wmi, tejohnson, qcolombet, gbiv Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20739 llvm-svn: 271412	2016-06-01 17:55:10 +00:00
Simon Pilgrim	c1ef71aea6	[X86][SSE] Added non-temporal store tests for all 512-bit vector types llvm-svn: 271393	2016-06-01 13:58:00 +00:00
Simon Pilgrim	8864f582d0	[X86][SSE] Added non-temporal store tests for all 256-bit vector types Also added KNL AVX-512 checks llvm-svn: 271391	2016-06-01 13:20:25 +00:00
Simon Pilgrim	577c6a3c29	[X86][SSE] Added non-temporal store tests for all 128-bit integer vector types llvm-svn: 271389	2016-06-01 13:05:00 +00:00
Michael Zuckerman	e5673d8456	Adding back-end support to two bit scanning intrinsics Adding LLVM back-end support to two intrinsics dealing with bit scan: _bit_scan_forward and _bit_scan_reverse. Their functionality is as described in Intel intrinsics guide: https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_forward&expand=371,370 https://software.intel.com/sites/landingpage/IntrinsicsGuide/#text=_bit_scan_reverse&expand=371,370 Commit on behalf of Omer Paparo Bivas Differential Revision: http://reviews.llvm.org/D19915 llvm-svn: 271386	2016-06-01 12:02:37 +00:00
Craig Topper	bc9e4ba942	Revert r271362 "[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead." Looks like something isn't quite right still. Also forgot to move the test cases to an autoupgrade test. llvm-svn: 271363	2016-06-01 05:57:55 +00:00
Craig Topper	734c8343a6	[AVX512] Remove masked load intrinsics. Clang now emits generic masked load intrinsics instead. The intrinsics will be autoupgraded to the same generic masked loads. llvm-svn: 271362	2016-06-01 05:35:16 +00:00
Kevin B. Smith	21876b39ee	[X86]: Add a pattern that uses GR16_ABCD rather than GR32_ABCD to avoid falsely marking whole 32 bit register as live. Differential Revision: http://reviews.llvm.org/D20649 llvm-svn: 271341	2016-05-31 22:00:12 +00:00
Simon Pilgrim	29a89e51e6	[X86][SSE] Add load-folding patterns for (V)CVTDQ2PD (PR27291) Added patterns for (V)CVTDQ2PD -> 2f64 loading from a 64-bit source. llvm-svn: 271269	2016-05-31 12:04:35 +00:00
Igor Breger	dab6232733	[AVX512] Fix intrinsic vcvtps2ph lowering. Differential Revision: http://reviews.llvm.org/D20788 llvm-svn: 271255	2016-05-31 08:04:21 +00:00
Igor Breger	cc3e7d2dd1	Fix intrinsic vbroadcast{i32\|f32}x2 lowering. Differential Revision: http://reviews.llvm.org/D20780 llvm-svn: 271254	2016-05-31 07:43:39 +00:00
Craig Topper	cb79936a4b	[AVX512] Remove masked store intrinsics. Clang now emits generic masked store intrinsics instead. The intrinsics will be autoupgraded to the same generic masked stores. llvm-svn: 271245	2016-05-31 01:50:02 +00:00
Saleem Abdulrasool	f85a029a33	X86: permit using SjLj EH on x86 targets as an option This adds support to the backed to actually support SjLj EH as an exception model. This is NOT the default model, and requires explicitly opting into it from the frontend. GCC supports this model and for MinGW can still be enabled via the `--using-sjlj-exceptions` options. Addresses PR27749! llvm-svn: 271244	2016-05-31 01:48:07 +00:00
Craig Topper	4f195e8edb	[X86] Remove SSE/AVX unaligned store intrinsics as clang no longer uses them. Auto upgrade to native unaligned store instructions. llvm-svn: 271236	2016-05-30 23:15:56 +00:00
Craig Topper	9d8b6d7b31	[X86] Use update_llc_test_checks.py to re-generate a test in preparation for an upcoming commit. NFC llvm-svn: 271234	2016-05-30 22:54:14 +00:00
Simon Pilgrim	9226b7680f	[X86][XOP] Split off auto-upgraded xop intrinsics llvm-svn: 271228	2016-05-30 19:50:56 +00:00
Simon Pilgrim	5e4ee0eb80	[X86][SSE] Renamed pmovxrm tests These aren't intrinsics anymore - as discussed on D20686 llvm-svn: 271226	2016-05-30 19:14:37 +00:00
Simon Pilgrim	15ddf25bef	[X86][AVX2] Regenerated AVX2 extension tests llvm-svn: 271224	2016-05-30 18:49:57 +00:00
Simon Pilgrim	83a5e7cad9	[X86][SSE] Updated storeu fast-isel tests to match clang builtin tests Since rL271214 the headers have no longer used the storeu intrinsic llvm-svn: 271222	2016-05-30 18:42:51 +00:00
Simon Pilgrim	97b3734c4d	[X86][SSE2] Updated _mm_store_pd1/_mm_store1_pd fast-isel tests to match D20617 llvm-svn: 271220	2016-05-30 18:18:44 +00:00
Simon Pilgrim	6ec0f7efbc	[X86][SSE] (Reapplied) Replace (V)PMOVSX and (V)PMOVZX integer extension intrinsics with generic IR (llvm) This patch removes the llvm intrinsics VPMOVSX and (V)PMOVZX sign/zero extension intrinsics and auto-upgrades to SEXT/ZEXT calls instead. We already did this for SSE41 PMOVSX sometime ago so much of that implementation can be reused. Reapplied now that the the companion patch (D20684) removes/auto-upgrade the clang intrinsics has been committed. Differential Revision: http://reviews.llvm.org/D20686 llvm-svn: 271131	2016-05-28 18:03:41 +00:00

1 2 3 4 5 ...

7575 Commits