llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-25 14:02:52 +02:00

Author	SHA1	Message	Date
Amara Emerson	7ad0409c56	[ARMv8] Add support for the v8 cryptography extensions. llvm-svn: 190996	2013-09-19 11:59:01 +00:00
Jim Grosbach	4f219b5c7e	Revert "Revert "ARM: Improve pattern for isel mul of vector by scalar."" This reverts commit r189648. Fixes for the previously failing clang-side arm_neon_intrinsics test cases will be checked in separately. llvm-svn: 189841	2013-09-03 20:08:17 +00:00
Michael Gottesman	113e9285a1	Revert "ARM: Improve pattern for isel mul of vector by scalar." This reverts commit r189619. The commit was breaking the arm_neon_intrinsic test. llvm-svn: 189648	2013-08-30 05:36:14 +00:00
Jim Grosbach	7089633cb9	ARM: Improve pattern for isel mul of vector by scalar. In addition to recognizing when the multiply's second argument is coming from an explicit VDUPLANE, also look for a plain scalar f32 reference and reference it via the corresponding vector lane. rdar://14870054 llvm-svn: 189619	2013-08-29 22:41:46 +00:00
Tim Northover	490c4c1bda	ARM: remove unused v(add\|sub)hn and vqdml[as]l intrinsics. Clang is now generating cleaner IR, so this removes the old variants which should be completely unused. llvm-svn: 189481	2013-08-28 14:33:33 +00:00
Tim Northover	e4e6bb8e0e	ARM: add patterns for vqdmlal with separate vqdmull and vqadds The vqdmlal and vqdmlsl instructions are really just a fused pair consisting of a vqdmull.sN and a vqadd.sN. This adds patterns to LLVM so that we can switch Clang's CodeGen over to generating these instead of the special vqdmlal intrinsics. llvm-svn: 189480	2013-08-28 12:15:16 +00:00
Joey Gouly	740189864d	[ARMv8] Add some negative tests for the recent VFP/NEON instructions. Fix two issues I found while writing these tests. llvm-svn: 189341	2013-08-27 11:24:16 +00:00
Tim Northover	24c6842d69	ARM: add natural patterns for vaddhn and vsubhn. These instructions aren't particularly complicated and it's well worth having patterns for some reasonably useful LLVM IR that will match them. Soon we should be able to switch Clang over to producing this natural version. llvm-svn: 189335	2013-08-27 10:31:36 +00:00
Mihai Popa	dfdccf5f00	Fix ARM vcvt encoding when the number of fractional bits is zero. The instruction to convert between floating point and fixed point representations takes an immediate operand for the number of fractional bits of the fixed point value. ARMARM specifies that when that number of bits is zero, the assembler should encode floating point/integer conversion instructions. This patch adds the necessary instruction aliases to achieve this behaviour. llvm-svn: 189009	2013-08-22 13:16:07 +00:00
Tim Northover	a6d63d6cc9	ARM: remove now unneeded custom Asm converters After Ulrich's r180677 (thanks!) TableGen is intelligent enough to handle tied constraints involving complex operands properly, so virtually all of the ARM custom converters are now unnecessary. llvm-svn: 186810	2013-07-22 09:06:12 +00:00
Joey Gouly	cfa16b3bc1	[ARMv8] Implement the NEON instructions VRINT{N, X, A, Z, M, P}. llvm-svn: 186688	2013-07-19 16:34:16 +00:00
Joey Gouly	73424fc519	Change 'n' to 'N' to keep consistent with other instructions. llvm-svn: 186576	2013-07-18 12:00:25 +00:00
Joey Gouly	933fb028d7	[ARMv8] Add NEON instructions VCVT{A, N, P, M}. llvm-svn: 186574	2013-07-18 11:53:22 +00:00
Joey Gouly	1ced091dc6	Remove the extra leading 0 from VMAXNMND. The N3VDIntnp pattern takes bits<5> and I gave it 6 bits. Thanks to Jiangning Liu for spotting it! llvm-svn: 186568	2013-07-18 09:34:35 +00:00
Joey Gouly	bc02a480d0	[ARMv8] Add support for the NEON instructions vmaxnm/vminnm. This adds a new class for non-predicable NEON instructions and a new DecoderNamespace for v8 NEON instructions. llvm-svn: 186504	2013-07-17 13:59:38 +00:00
Jim Grosbach	0f0c0ac8be	ARM: Add optional datatype suffix to NEON mvn asm syntax. rdar://14194152 llvm-svn: 184244	2013-06-18 21:49:21 +00:00
Amaury de la Vieuville	334567de5c	ARM: Enforce decoding rules for VLDn instructions llvm-svn: 183731	2013-06-11 08:14:14 +00:00
Tim Northover	6b0f4fd85b	ARM: fix VEXT encoding corner case The disassembly of VEXT instructions was too lax in the bits checked. This fixes the case where the instruction affects Q-registers but a misaligned lane was specified (should be UNDEFINED). Patch by Amaury de la Vieuville llvm-svn: 183003	2013-05-31 13:47:25 +00:00
Mihai Popa	d42b1e0685	VSTn instructions have a number of encoding constraints which are not implemented. I have added these using wrapper methods around the original custom decoder (incidentally - this is a huge poorly written method that should be cleaned up. I have left it as is since the changes would be much too hard to review). llvm-svn: 182281	2013-05-20 14:57:05 +00:00
Benjamin Kramer	40a2d53c85	ARM/NEON: Pattern match vector integer abs to vabs. llvm-svn: 180604	2013-04-26 15:00:57 +00:00
Jim Grosbach	10785fcd52	ARM: Add VACLT and VACLE assembly aliases. These are aliases for VACGT and VACGE, respectively, with the source operands reversed. rdar://13638090 llvm-svn: 179575	2013-04-15 22:42:50 +00:00
Arnold Schwaighofer	81e5b5e18f	ARM NEON: Don't need COPY_TO_REGCLASS in pattern In my previous commit: "Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element, and the bitcast we will unnecessarily generate moves between scalar and vector registers." I added a pattern containing a copy_to_regclass. The copy_to_regclass is actually not needed. radar://13191881 llvm-svn: 175555	2013-02-19 20:16:45 +00:00
Arnold Schwaighofer	3a1cb40149	ARM NEON: Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element, and the bitcast we will unnecessarily generate moves between scalar and vector registers. The patch fixes this by using a COPY_TO_REGCLASS and a EXTRACT_SUBREG to extract the element from the vector instead. radar://13191881 llvm-svn: 175520	2013-02-19 15:27:05 +00:00
Joel Jones	17b64a424b	The ARM NEON vector compare instructions take three arguments. However, the assembler should also accept a two arg form, as the documentation specifies that the first (destination) register is optional. This patch uses TwoOperandAliasConstraint to add the two argument form. It also fixes an 80-column formatting problem in: test/MC/ARM/neon-bitwise-encoding <rdar://problem/12909419> Clang rejects ARM NEON assembly instructions llvm-svn: 175221	2013-02-14 23:18:40 +00:00
Bob Wilson	3cae2545eb	Revert "Adding support for llvm.arm.neon.vaddl[su].* and" This reverts r170694. The operations can be represented in IR without adding any new intrinsics. llvm-svn: 170765	2012-12-20 21:09:38 +00:00
Renato Golin	1fbd598908	Adding support for llvm.arm.neon.vaddl[su].* and llvm.arm.neon.vsubl[su].* intrinsics. Patch by Pete Couperus <pjcoup@gmail.com> llvm-svn: 170694	2012-12-20 13:52:11 +00:00
Anton Korobeynikov	3cd85d754d	Make sure FABS on v2f32 and v4f32 is legal on ARM NEON This fixes PR14359 llvm-svn: 168200	2012-11-16 21:15:20 +00:00
Jakob Stoklund Olesen	4b4db880a3	Revert r163298 "Optimize codegen for VSETLNi{8,16,32} operating on Q registers." Keep the integer_insertelement test case, the new coalescer can handle this kind of lane insertion without help from pseudo-instructions. llvm-svn: 166835	2012-10-26 23:39:46 +00:00
Jim Grosbach	8df1c73056	ARM: v1i64 and v2i64 VBSL intrinsic support. rdar://12502028 llvm-svn: 165981	2012-10-15 21:23:40 +00:00
Evan Cheng	72074df318	Add isel patterns for v2f32 / v4f32 neon.vbsl intrinsics. rdar://12471808 llvm-svn: 165673	2012-10-10 23:06:34 +00:00
Bob Wilson	ee6a40c517	Add LLVM support for Swift. llvm-svn: 164899	2012-09-29 21:43:49 +00:00
Jim Grosbach	135898ebe3	ARM: Use a dedicated intrinsic for vector bitwise select. The expression based expansion too often results in IR level optimizations splitting the intermediate values into separate basic blocks, preventing the formation of the VBSL instruction as the code author intended. In particular, LICM would often hoist part of the computation out of a loop. rdar://11011471 llvm-svn: 164340	2012-09-21 00:18:20 +00:00
Evan Cheng	82c85585f9	Use vld1 / vst1 for unaligned v2f64 load / store. e.g. Use vld1.16 for 2-byte aligned address. Based on patch by David Peixotto. Also use vld1.64 / vst1.64 with 128-bit alignment to take advantage of alignment hints. rdar://12090772, rdar://12238782 llvm-svn: 164089	2012-09-18 01:42:45 +00:00
Tim Northover	1c637c210f	Use correct part of complex operand to encode VST1 alignment. Patch by Chris Lidbury. llvm-svn: 163318	2012-09-06 14:36:55 +00:00
James Molloy	90179e600b	Optimize codegen for VSETLNi{8,16,32} operating on Q registers. Degenerate to a VSETLN on D registers, instead of an (INSERT_SUBREG (VSETLN (EXTRACT_SUBREG ))) sequence to help the register coalescer. llvm-svn: 163298	2012-09-06 09:16:01 +00:00
Evan Cheng	625c0ca5ee	Use vld1/vst1 to load/store f64 if alignment is < 4 and the target allows unaligned access. rdar://12091029 llvm-svn: 161962	2012-08-15 17:44:53 +00:00
Tim Northover	b1f8be6cbe	Use correct loads for vector types during extending-load operations. Previously, we used VLD1.32 in all cases, however there are both 16 and 64-bit accesses being selected, so we need to use an appropriate width load in those cases. llvm-svn: 161748	2012-08-13 09:06:31 +00:00
Joel Jones	4ce75efda5	More replacing of target-dependent intrinsics with target-independent intrinsics. The second instruction(s) to be handled are the vector versions of count set bits (ctpop). The changes here are to clang so that it generates a target independent vector ctpop when it sees an ARM dependent vector bits set count. The changes in llvm are to match the target independent vector ctpop and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector pop counts with target-independent ctpops. There are also changes to an existing test case in llvm for ARM vector count instructions and to a test for the bitcode upgrade. <rdar://problem/11892519> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160410	2012-07-18 00:02:16 +00:00
Joel Jones	12ea066486	This is one of the first steps at moving to replace target-dependent intrinsics with target-independent intrinsics. The first instruction(s) to be handled are the vector versions of count leading zeros (ctlz). The changes here are to clang so that it generates a target independent vector ctlz when it sees an ARM dependent vector ctlz. The changes in llvm are to match the target independent vector ctlz and in VMCore/AutoUpgrade.cpp to update any existing bc files containing ARM dependent vector ctlzs with target-independent ctlzs. There are also changes to an existing test case in llvm for ARM vector count instructions and a new test for the bitcode upgrade. <rdar://problem/11831778> There is deliberately no test for the change to clang, as so far as I know, no consensus has been reached regarding how to test neon instructions in clang; q.v. <rdar://problem/8762292> llvm-svn: 160200	2012-07-13 23:25:25 +00:00
Jim Grosbach	83589b60dc	ARM: Allow more flexible patterns in NEON formats. Some NEON instructions want to match against normal SDNodes for some operand types and Intrinsics for others. For example, CTLZ. To enable this, switch from explicitly requiring Intrinsic on the class templates to using SDPatternOperator instead. llvm-svn: 159974	2012-07-10 00:51:13 +00:00
Jim Grosbach	658b3efc30	ARM: Add missing two-operand VBIC aliases. llvm-svn: 156019	2012-05-02 21:11:56 +00:00
Lang Hames	7d83af4ed0	Fix the order of the operands in the llvm.fma intrinsic patterns for ARM, <rdar://problem/11325085>. llvm-svn: 155724	2012-04-27 18:51:24 +00:00
Tim Northover	876c151146	Use VLD1 in NEON extending-load patterns instead of VLDR. On some cores it's a bad idea for performance to mix VFP and NEON instructions and since these patterns are NEON anyway, the NEON load should be used. llvm-svn: 155630	2012-04-26 08:46:29 +00:00
Jim Grosbach	66edf44403	Tidy up. 80 columns, whitespace, et al. llvm-svn: 155399	2012-04-23 22:04:10 +00:00
Jim Grosbach	4221412829	ARM: VSLI two-operand assembly aliases are tblgen'erated. llvm-svn: 155393	2012-04-23 21:22:04 +00:00
Jim Grosbach	8aac7f6a7c	ARM: tblgen'erate VSRA/VRSRA/VSRI assembly two-operand aliases. llvm-svn: 155392	2012-04-23 21:00:49 +00:00
Jim Grosbach	d377bc4e77	ARM: vqdmulh two-operand aliases are tblgen'erated now. llvm-svn: 155387	2012-04-23 20:37:20 +00:00
Jim Grosbach	ba84724346	ARM: tblgen'erate more NEON two-operand aliases. VMUL and VEXT. llvm-svn: 155258	2012-04-20 23:46:33 +00:00
Jim Grosbach	5329904457	ARM: tblgen'erate more NEON two-operand aliases. llvm-svn: 155254	2012-04-20 23:30:14 +00:00
Jim Grosbach	e33d0c7063	ARM: Update NEON assembly two-operand aliases. Use the new TwoOperandAliasConstraint to handle lots of the two-operand aliases for NEON instructions. There's still more to go, but this is a good chunk of them. llvm-svn: 155210	2012-04-20 18:12:54 +00:00

544 Commits