llvm-mirror/lib/Target/Mips/MSA.txt

Code Generation Notes for MSA
=============================

Intrinsics are lowered to SelectionDAG nodes where possible in order to enable
optimisation, reduce the size of the ISel matcher, and reduce repetition in
the implementation. In a small number of cases, this can cause different
(semantically equivalent) instructions to be used in place of the requested
instruction, even when no optimisation has taken place.

Instructions
============

This section describes any quirks of instruction selection for MSA. For
example, two instructions might be equally valid for some given IR and one is
chosen in preference to the other.

bclri.b:
        It is not possible to emit bclri.b since andi.b covers exactly the
        same cases. andi.b should use fractionally less power than bclri.b in
        most hardware implementations so it is used in preference to bclri.b.

vshf.w:
        It is not possible to emit vshf.w when the shuffle description is
        constant since shf.w covers exactly the same cases. shf.w is used
        instead. It is also impossible for the shuffle description to be
        unknown at compile-time due to the definition of shufflevector in
        LLVM IR.

vshf.[bhwd]
        When the shuffle description describes a splat operation, splat.[bhwd]
        instructions will be selected instead of vshf.[bhwd]. Unlike the ilv*,
        and pck* instructions, this is matched from MipsISD::VSHF instead of
        a special-case MipsISD node.

ilvl.d, pckev.d:
        It is not possible to emit ilvl.d, or pckev.d since ilvev.d covers the
        same shuffle. ilvev.d will be emitted instead.

ilvr.d, ilvod.d, pckod.d:
        It is not possible to emit ilvr.d, or pckod.d since ilvod.d covers the
        same shuffle. ilvod.d will be emitted instead.

splat.[bhwd]
        The intrinsic will work as expected. However, unlike other intrinsics
        it lowers directly to MipsISD::VSHF instead of using common IR.

splati.w:
        It is not possible to emit splati.w since shf.w covers the same cases.
        shf.w will be emitted instead.

copy_s.w:
        On MIPS32, the copy_u.d intrinsic will emit this instruction instead of
        copy_u.w. This is semantically equivalent since the general-purpose
        register file is 32-bits wide.

binsri.[bhwd],  binsli.[bhwd]:
        These two operations are equivalent to each other with the operands
        swapped and condition inverted. The compiler may use either one as
        appropriate.
        Furthermore, the compiler may use bsel.[bhwd] for some masks that do
        not survive the legalization process (this is a bug and will be fixed).

bmnz.v, bmz.v, bsel.v:
        These three operations differ only in the operand that is tied to the
        result and the order of the operands.
        It is (currently) not possible to emit bmz.v, or bsel.v since bmnz.v is
        the same operation and will be emitted instead.
        In future, the compiler may choose between these three instructions
        according to register allocation.
        These three operations can be very confusing so here is a mapping
        between the instructions and the vselect node in one place:
                bmz.v  wd, ws, wt/i8 -> (vselect wt/i8, wd, ws)
                bmnz.v wd, ws, wt/i8 -> (vselect wt/i8, ws, wd)
                bsel.v wd, ws, wt/i8 -> (vselect wd, wt/i8, ws)

bmnzi.b, bmzi.b:
        Like their non-immediate counterparts, bmnzi.v and bmzi.v are the same
        operation with the operands swapped. bmnzi.v will (currently) be emitted
        for both cases.

bseli.v:
        Unlike the non-immediate versions, bseli.v is distinguishable from
        bmnzi.b and bmzi.b and can be emitted.
[mips][msa] Added MSA.txt to describe instruction selection quirks. This file contains notes about the instruction selection for MSA. For example, it notes that ilvl.d is cannot be selected because ilvev.d covers the same cases and is selected instead of ilvl.d. llvm-svn: 191507 2013-09-27 12:42:22 +02:00			`Code Generation Notes for MSA`
			`=============================`

			`Intrinsics are lowered to SelectionDAG nodes where possible in order to enable`
			`optimisation, reduce the size of the ISel matcher, and reduce repetition in`
			`the implementation. In a small number of cases, this can cause different`
			`(semantically equivalent) instructions to be used in place of the requested`
			`instruction, even when no optimisation has taken place.`

			`Instructions`
			`============`

			`This section describes any quirks of instruction selection for MSA. For`
			`example, two instructions might be equally valid for some given IR and one is`
			`chosen in preference to the other.`

[mips][msa] Added support for matching bclr, and bclri from normal IR (i.e. not intrinsics) llvm-svn: 194471 2013-11-12 11:45:18 +01:00			`bclri.b:`
			`It is not possible to emit bclri.b since andi.b covers exactly the`
			`same cases. andi.b should use fractionally less power than bclri.b in`
			`most hardware implementations so it is used in preference to bclri.b.`

[mips][msa] Added MSA.txt to describe instruction selection quirks. This file contains notes about the instruction selection for MSA. For example, it notes that ilvl.d is cannot be selected because ilvev.d covers the same cases and is selected instead of ilvl.d. llvm-svn: 191507 2013-09-27 12:42:22 +02:00			`vshf.w:`
			`It is not possible to emit vshf.w when the shuffle description is`
			`constant since shf.w covers exactly the same cases. shf.w is used`
			`instead. It is also impossible for the shuffle description to be`
			`unknown at compile-time due to the definition of shufflevector in`
			`LLVM IR.`

[mips][msa] Added support for matching splat.[bhw] from normal IR (i.e. not intrinsics) splat.d is implemented but this subtest is currently disabled. This is because it is difficult to match the appropriate IR on MIPS32. There is a patch under review that should help with this so I hope to enable the subtest soon. llvm-svn: 193680 2013-10-30 14:07:44 +01:00			`vshf.[bhwd]`
			`When the shuffle description describes a splat operation, splat.[bhwd]`
			`instructions will be selected instead of vshf.[bhwd]. Unlike the ilv*,`
			`and pck* instructions, this is matched from MipsISD::VSHF instead of`
			`a special-case MipsISD node.`

[mips][msa] Added MSA.txt to describe instruction selection quirks. This file contains notes about the instruction selection for MSA. For example, it notes that ilvl.d is cannot be selected because ilvev.d covers the same cases and is selected instead of ilvl.d. llvm-svn: 191507 2013-09-27 12:42:22 +02:00			`ilvl.d, pckev.d:`
			`It is not possible to emit ilvl.d, or pckev.d since ilvev.d covers the`
			`same shuffle. ilvev.d will be emitted instead.`

			`ilvr.d, ilvod.d, pckod.d:`
			`It is not possible to emit ilvr.d, or pckod.d since ilvod.d covers the`
			`same shuffle. ilvod.d will be emitted instead.`
[mips][msa] Added support for matching splati from normal IR (i.e. not intrinsics) Updated some of the vshf since they (correctly) emit splati's now llvm-svn: 191511 2013-09-27 13:48:57 +02:00
[mips][msa] Added support for matching splat.[bhw] from normal IR (i.e. not intrinsics) splat.d is implemented but this subtest is currently disabled. This is because it is difficult to match the appropriate IR on MIPS32. There is a patch under review that should help with this so I hope to enable the subtest soon. llvm-svn: 193680 2013-10-30 14:07:44 +01:00			`splat.[bhwd]`
			`The intrinsic will work as expected. However, unlike other intrinsics`
			`it lowers directly to MipsISD::VSHF instead of using common IR.`

[mips][msa] Added support for matching splati from normal IR (i.e. not intrinsics) Updated some of the vshf since they (correctly) emit splati's now llvm-svn: 191511 2013-09-27 13:48:57 +02:00			`splati.w:`
			`It is not possible to emit splati.w since shf.w covers the same cases.`
			`shf.w will be emitted instead.`
[mips][msa] Implemented copy_[us].d intrinsic. This intrinsic is lowered into equivalent copy_s.w instructions during legalization. llvm-svn: 191518 2013-09-27 15:04:21 +02:00
[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687 2013-10-30 15:45:14 +01:00			`copy_s.w:`
[mips][msa] Implemented copy_[us].d intrinsic. This intrinsic is lowered into equivalent copy_s.w instructions during legalization. llvm-svn: 191518 2013-09-27 15:04:21 +02:00			`On MIPS32, the copy_u.d intrinsic will emit this instruction instead of`
			`copy_u.w. This is semantically equivalent since the general-purpose`
			`register file is 32-bits wide.`
[mips][msa] Added support for matching bins[lr]i.[bhwd] from normal IR (i.e. not intrinsics) This required correcting the definition of the bins[lr]i intrinsics because the result is also the first operand. It also required removing the (arbitrary) check for 32-bit immediates in MipsSEDAGToDAGISel::selectVSplat(). Currently using binsli.d with 2 bits set in the mask doesn't select binsli.d because the constant is legalized into a ConstantPool. Similar things can happen with binsri.d with more than 10 bits set in the mask. The resulting code when this happens is correct but not optimal. llvm-svn: 193687 2013-10-30 15:45:14 +01:00
			`binsri.[bhwd], binsli.[bhwd]:`
			`These two operations are equivalent to each other with the operands`
			`swapped and condition inverted. The compiler may use either one as`
			`appropriate.`
			`Furthermore, the compiler may use bsel.[bhwd] for some masks that do`
			`not survive the legalization process (this is a bug and will be fixed).`
[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693 2013-10-30 16:20:38 +01:00
			`bmnz.v, bmz.v, bsel.v:`
			`These three operations differ only in the operand that is tied to the`
[mips] BSEL's and BINS[RL] operands are reversed compared to the vselect node used in the pattern. Summary: Correct the match patterns and the lowerings that made the CodeGen tests pass despite the mistakes. The original testcase that discovered the problem was SingleSource/UnitTests/SignlessType/factor.c in test-suite. During review, we also found that some of the existing CodeGen tests were incorrect and fixed them: * bitwise.ll: In bsel_v16i8 the IfSet/IfClear were reversed because bsel and bmnz have different operand orders and the test didn't correctly account for this. bmnz goes 'IfClear, IfSet, CondMask', while bsel goes 'CondMask, IfClear, IfSet'. * vec.ll: In the cases where a bsel is emitted as a bmnz (they are the same operation with a different input tied to the result) the operands were in the wrong order. * compare.ll and compare_float.ll: The bsel operand order was correct for a greater-than comparison, but a greater-than comparison instruction doesn't exist. Lowering this operation inverts the condition so the IfSet/IfClear need to be swapped to match. The differences between BSEL, BMNZ, and BMZ and how they map to/from vselect are rather confusing. I've therefore added a note to MSA.txt to explain this in a single place in addition to the comments that explain each case. Reviewers: matheusalmeida, jacksprat Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3028 llvm-svn: 203657 2014-03-12 12:54:00 +01:00			`result and the order of the operands.`
[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693 2013-10-30 16:20:38 +01:00			`It is (currently) not possible to emit bmz.v, or bsel.v since bmnz.v is`
			`the same operation and will be emitted instead.`
			`In future, the compiler may choose between these three instructions`
			`according to register allocation.`
[mips] BSEL's and BINS[RL] operands are reversed compared to the vselect node used in the pattern. Summary: Correct the match patterns and the lowerings that made the CodeGen tests pass despite the mistakes. The original testcase that discovered the problem was SingleSource/UnitTests/SignlessType/factor.c in test-suite. During review, we also found that some of the existing CodeGen tests were incorrect and fixed them: * bitwise.ll: In bsel_v16i8 the IfSet/IfClear were reversed because bsel and bmnz have different operand orders and the test didn't correctly account for this. bmnz goes 'IfClear, IfSet, CondMask', while bsel goes 'CondMask, IfClear, IfSet'. * vec.ll: In the cases where a bsel is emitted as a bmnz (they are the same operation with a different input tied to the result) the operands were in the wrong order. * compare.ll and compare_float.ll: The bsel operand order was correct for a greater-than comparison, but a greater-than comparison instruction doesn't exist. Lowering this operation inverts the condition so the IfSet/IfClear need to be swapped to match. The differences between BSEL, BMNZ, and BMZ and how they map to/from vselect are rather confusing. I've therefore added a note to MSA.txt to explain this in a single place in addition to the comments that explain each case. Reviewers: matheusalmeida, jacksprat Reviewed By: matheusalmeida Differential Revision: http://llvm-reviews.chandlerc.com/D3028 llvm-svn: 203657 2014-03-12 12:54:00 +01:00			`These three operations can be very confusing so here is a mapping`
			`between the instructions and the vselect node in one place:`
			`bmz.v wd, ws, wt/i8 -> (vselect wt/i8, wd, ws)`
			`bmnz.v wd, ws, wt/i8 -> (vselect wt/i8, ws, wd)`
			`bsel.v wd, ws, wt/i8 -> (vselect wd, wt/i8, ws)`
[mips][msa] Added support for matching bmnz, bmnzi, bmz, and bmzi from normal IR (i.e. not intrinsics) Also corrected the definition of the intrinsics for these instructions (the result register is also the first operand), and added intrinsics for bsel and bseli to clang (they already existed in the backend). These four operations are mostly equivalent to bsel, and bseli (the difference is which operand is tied to the result). As a result some of the tests changed as described below. bitwise.ll: - bsel.v test adapted so that the mask is unknown at compile-time. This stops it emitting bmnzi.b instead of the intended bsel.v. - The bseli.b test now tests the right thing. Namely the case when one of the values is an uimm8, rather than when the condition is a uimm8 (which is covered by bmnzi.b) compare.ll: - bsel.v tests now (correctly) emits bmnz.v instead of bsel.v because this is the same operation (see MSA.txt). i8.ll - CHECK-DAG-ized test. - bmzi.b test now (correctly) emits equivalent bmnzi.b with swapped operands because this is the same operation (see MSA.txt). - bseli.b still emits bseli.b though because the immediate makes it distinguishable from bmnzi.b. vec.ll: - CHECK-DAG-ized test. - bmz.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). - bsel.v tests now (correctly) emits bmnz.v with swapped operands (see MSA.txt). llvm-svn: 193693 2013-10-30 16:20:38 +01:00
			`bmnzi.b, bmzi.b:`
			`Like their non-immediate counterparts, bmnzi.v and bmzi.v are the same`
			`operation with the operands swapped. bmnzi.v will (currently) be emitted`
			`for both cases.`

			`bseli.v:`
			`Unlike the non-immediate versions, bseli.v is distinguishable from`
			`bmnzi.b and bmzi.b and can be emitted.`