llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-27 05:53:07 +01:00

Author	SHA1	Message	Date
Bob Wilson	1497601a7b	Remove unused conditional negate operations. llvm-svn: 127090	2011-03-05 16:54:31 +00:00
Evan Cheng	f540b0e0f6	VFP single precision arith instructions can go down to NEON pipeline, but on Cortex-A8 only. llvm-svn: 126238	2011-02-22 19:53:14 +00:00
Evan Cheng	d3928a2c3a	Some single precision VFP instructions may be executed on NEON pipeline, but not double precision ones. llvm-svn: 125624	2011-02-16 00:35:02 +00:00
Bruno Cardoso Lopes	e0f8fee637	Create two new generic classes to represent the following VMRS/VMSR variations: vmrs reg, fpexc vmrs reg, fpsid vmsr fpexc, reg vmsr fpsid, reg llvm-svn: 123783	2011-01-18 21:58:20 +00:00
Bob Wilson	63547ae69e	Fix a comment: We now have intrinsics for vcvtr. llvm-svn: 123246	2011-01-11 17:56:41 +00:00
Chris Lattner	01e8c46349	Flag -> Glue, the ongoing saga llvm-svn: 122513	2010-12-23 18:28:41 +00:00
Evan Cheng	fc78767730	Making use of VFP / NEON floating point multiply-accumulate / subtraction is difficult on current ARM implementations for a few reasons. 1. Even though a single vmla has latency that is one cycle shorter than a pair of vmul + vadd, a RAW hazard during the first (4? on Cortex-a8) can cause additional pipeline stall. So it's frequently better to single codegen vmul + vadd. 2. A vmla folowed by a vmul, vmadd, or vsub causes the second fp instruction to stall for 4 cycles. We need to schedule them apart. 3. A vmla followed vmla is a special case. Obvious issuing back to back RAW vmla + vmla is very bad. But this isn't ideal either: vmul vadd vmla Instead, we want to expand the second vmla: vmla vmul vadd Even with the 4 cycle vmul stall, the second sequence is still 2 cycles faster. Up to now, isel simply avoid codegen'ing fp vmla / vmls. This works well enough but it isn't the optimial solution. This patch attempts to make it possible to use vmla / vmls in cases where it is profitable. A. Add missing isel predicates which cause vmla to be codegen'ed. B. Make sure the fmul in (fadd (fmul)) has a single use. We don't want to compute a fmul and a fmla. C. Add additional isel checks for vmla, avoid cases where vmla is feeding into fp instructions (except for the #3 exceptional case). D. Add ARM hazard recognizer to model the vmla / vmls hazards. E. Add a special pre-regalloc case to expand vmla / vmls when it's likely the vmla / vmls will trigger one of the special hazards. Work in progress, only A+B are enabled. llvm-svn: 120960	2010-12-05 22:04:16 +00:00
Bill Wendling	f0a1acba8c	Proper encoding for VLDM and VSTM instructions. The register lists for these instructions have to distinguish between lists of single- and double-precision registers in order for the ASM matcher to do a proper job. In all other respects, a list of single- or double-precision registers are the same as a list of GPR registers. llvm-svn: 119460	2010-11-17 04:32:08 +00:00
Bill Wendling	64cb4dd72d	vldm and vstm are mnemonics for vldmia and vstmia resp. llvm-svn: 119321	2010-11-16 02:00:24 +00:00
Bill Wendling	b450d320ec	Encode the multi-load/store instructions with their respective modes ('ia', 'db', 'ib', 'da') instead of having that mode as a separate field in the instruction. It's more convenient for the asm parser and much more readable for humans. <rdar://problem/8654088> llvm-svn: 119310	2010-11-16 01:16:36 +00:00
Bill Wendling	fadcb3cded	Add uses of the *_ldst_multi multiclasses. These aren't used yet. llvm-svn: 118999	2010-11-13 10:57:02 +00:00
Bill Wendling	184bc1368d	Convert the modes to lower case. llvm-svn: 118998	2010-11-13 10:43:34 +00:00
Bill Wendling	aa9ca6fcca	Add *_ldst_mult multiclasses to the ARM back-end. These will be used in the future to separate out the ia, ib, da, db variants of the load/store multiple instructions. llvm-svn: 118995	2010-11-13 09:09:38 +00:00
Evan Cheng	b565d1acf9	Add some missing isel predicates on def : pat patterns to avoid generating VFP vmla / vmls (they cause stalls). Disabling them in isel is properly not a right solution, I'll look into a proper solution next. llvm-svn: 118922	2010-11-12 20:32:20 +00:00
Bill Wendling	3e6eee5c35	Add "write back" bit encoding. llvm-svn: 118446	2010-11-08 21:28:03 +00:00
Bill Wendling	990c247994	Add encoding for VSTR. llvm-svn: 118220	2010-11-04 00:59:42 +00:00
Bill Wendling	34599f4aa8	The MC code couldn't handle ARM LDR instructions with negative offsets: vldr.64 d1, [r0, #-32] The problem was with how the addressing mode 5 encodes the offsets. This change makes sure that the way offsets are handled in addressing mode 5 is consistent throughout the MC code. It involves re-refactoring the "getAddrModeImmOpValue" method into an "Imm12" and "addressing mode 5" version. But not to worry! The majority of the duplicated code has been unified. llvm-svn: 118144	2010-11-03 01:49:29 +00:00
Jim Grosbach	c10d3f3d4b	Break ARM addrmode4 (load/store multiple base address) into its constituent parts. Represent the operation mode as an optional operand instead. rdar://8614429 llvm-svn: 118137	2010-11-03 01:01:43 +00:00
Bill Wendling	1546322a9c	Rename getAddrModeImm12OpValue to getAddrModeImmOpValue and expand it to work with immediates up to 16-bits in size. The same logic is applied to other LDR encodings, e.g. VLDR, but which use a different immediate bit width (8-bits in VLDR's case). Removing the "12" allows it to be more generic. llvm-svn: 118094	2010-11-02 22:31:46 +00:00
Bill Wendling	dd4216420a	Missed reverting this bit. llvm-svn: 117971	2010-11-01 23:17:54 +00:00
Bill Wendling	37c9af176d	Minor cleanup. llvm-svn: 117969	2010-11-01 23:11:22 +00:00
Bill Wendling	69e7c09c32	Move the machine operand MC encoding patterns to the parent classes. llvm-svn: 117956	2010-11-01 21:17:06 +00:00
Bill Wendling	da3d0ce7b5	Move instruction encoding bits into the parent class and remove the temporary *_Encode classes. These instructions are the only ones which use those classes, so a subclass isn't necessary. llvm-svn: 117906	2010-11-01 06:00:39 +00:00
Chris Lattner	01acd65875	reapply r117858 with apparent editor malfunction fixed (somehow I got a dulicated line). llvm-svn: 117860	2010-10-31 19:10:56 +00:00
Chris Lattner	8132a182e7	revert r117858 while I check out a failure I missed. llvm-svn: 117859	2010-10-31 19:05:32 +00:00
Chris Lattner	70b05a5b88	the asm matcher can't handle operands with modifiers (like ${foo:bar}). Instead of silently ignoring these instructions, emit a hard error and force the target author to either refactor the target or mark the instruction 'isCodeGenOnly'. Mark a few instructions in ARM and MBlaze as isCodeGenOnly the are doing this. llvm-svn: 117858	2010-10-31 18:48:12 +00:00
Jim Grosbach	b6c76a2662	Add FIXME. llvm-svn: 117787	2010-10-30 14:54:23 +00:00
Bill Wendling	c7ef66fcf2	Add encoding for moving a value between two ARM core registers and a doublework extension register. llvm-svn: 116970	2010-10-20 23:37:40 +00:00
Bill Wendling	0f96ff63b3	Add encodings for movement between ARM core registers and single-precision registers. llvm-svn: 116961	2010-10-20 22:44:54 +00:00
Bill Wendling	64d2bf006c	Reformatting. No functionalogicality changes. llvm-svn: 116625	2010-10-15 21:50:45 +00:00
Bill Wendling	2c335d364c	Add support for vmov.f64/.f32 encoding. There's a bit of a hack going on here. The f32 in FCONSTS is handled as a double instead of a float in the code. So the encoding of the immediate into the instruction isn't exactly in line with the documentation in that regard. But given that we know it's handled as a double, it doesn't cause any harm. llvm-svn: 116471	2010-10-14 02:33:26 +00:00
Bill Wendling	33a2ecd5e4	Add encoding for 'fmstat'. llvm-svn: 116466	2010-10-14 01:19:34 +00:00
Bill Wendling	cd41f22ec1	- Add encodings for multiply add/subtract instructions in all their glory. - Add missing patterns for some multiply add/subtract instructions. - Add encodings for VMRS and VMSR. llvm-svn: 116464	2010-10-14 01:02:08 +00:00
Bill Wendling	bf63d6eb63	Add MC encodings for VCVT* instrunctions. llvm-svn: 116431	2010-10-13 20:58:46 +00:00
Bill Wendling	6d8a23c978	Add encodings for VNEG and VSQRT. Also add encodings for VMOV, but not a test just yet. llvm-svn: 116386	2010-10-13 01:17:33 +00:00
Bill Wendling	ea062d454d	Add encodings for VCVT instructions. llvm-svn: 116385	2010-10-13 00:56:35 +00:00
Bill Wendling	e6c2fdebbd	Add VCMPZ and VABS. llvm-svn: 116383	2010-10-13 00:38:07 +00:00
Bill Wendling	fddde4cc72	Refactor VCMP instructions. llvm-svn: 116379	2010-10-13 00:04:29 +00:00
Bill Wendling	47155cfddd	Add encodings for VNMUL[SD]. llvm-svn: 116375	2010-10-12 23:47:37 +00:00
Bill Wendling	185b548b07	Add encodings for VDIV and VMUL. llvm-svn: 116370	2010-10-12 23:22:27 +00:00
Bill Wendling	d1f06024ce	Refactor some of the encoding logic into a base class. This keeps us from having to add 10+ lines to every instruction. It may turn out that we can move this base class into it's parent class. llvm-svn: 116362	2010-10-12 23:06:54 +00:00
Bill Wendling	cd3cb8da45	Add encoding for VSUB and VCMP. Fear not! I'm going to try a refactoring right now. :) llvm-svn: 116359	2010-10-12 22:55:35 +00:00
Bill Wendling	33a26354c1	Encoding for VADDD. Plus a test for the VFP instructions. llvm-svn: 116348	2010-10-12 22:08:41 +00:00
Jim Grosbach	58ee6f3972	Encoding for ARM-mode VADD.F32 instruction. llvm-svn: 116338	2010-10-12 21:22:40 +00:00
Evan Cheng	1ce29574c2	Model operand cycles of vldm / vstm; also fixes scheduling itineraries of vldr / vstr, etc. llvm-svn: 115898	2010-10-07 01:50:48 +00:00
Eric Christopher	7d87a75fa4	Fix typo. llvm-svn: 114931	2010-09-28 00:35:33 +00:00
Jim Grosbach	27a5b1fd3b	VFP/NEON load/store multiple instructions are addrmode4, not 5. llvm-svn: 113322	2010-09-08 00:25:50 +00:00
Bob Wilson	31d487d235	Change ARM VFP VLDM/VSTM instructions to use addressing mode #4 , just like all the other LDM/STM instructions. This fixes asm printer crashes when compiling with -O0. I've changed one of the NEON tests (vst3.ll) to run with -O0 to check this in the future. Prior to this change VLDM/VSTM used addressing mode #5, but not really. The offset field was used to hold a count of the number of registers being loaded or stored, and the AM5 opcode field was expanded to specify the IA or DB mode, instead of the standard ADD/SUB specifier. Much of the backend was not aware of these special cases. The crashes occured when rewriting a frameindex caused the AM5 offset field to be changed so that it did not have a valid submode. I don't know exactly what changed to expose this now. Maybe we've never done much with -O0 and NEON. Regardless, there's no longer any reason to keep a count of the VLDM/VSTM registers, so we can use addressing mode #4 and clean things up in a lot of places. llvm-svn: 112322	2010-08-27 23:18:17 +00:00
Daniel Dunbar	8311cf950b	ARM: Mark some disassembler only instructions as not available for matching -- for some reason they have a very odd MCInst form where the operands overlap, but I haven't dug in to find out why yet. llvm-svn: 110781	2010-08-11 04:46:13 +00:00
Nate Begeman	b506e13a32	Add support for getting & setting the FPSCR application register on ARM when VFP is enabled. Add support for using the FPSCR in conjunction with the vcvtr instruction, for controlling fp to int rounding. Add support for the FLT_ROUNDS_ node now that the FPSCR is exposed. llvm-svn: 110152	2010-08-03 21:31:55 +00:00

1 2 3

134 Commits