mirror of
https://github.com/pmret/gcc-papermario.git
synced 2024-11-08 20:02:47 +01:00
1049 lines
47 KiB
Plaintext
1049 lines
47 KiB
Plaintext
|
This is Info file gcc.info, produced by Makeinfo version 1.67 from the
|
|||
|
input file gcc.texi.
|
|||
|
|
|||
|
This file documents the use and the internals of the GNU compiler.
|
|||
|
|
|||
|
Published by the Free Software Foundation 59 Temple Place - Suite 330
|
|||
|
Boston, MA 02111-1307 USA
|
|||
|
|
|||
|
Copyright (C) 1988, 1989, 1992, 1993, 1994, 1995, 1996, 1997, 1998
|
|||
|
Free Software Foundation, Inc.
|
|||
|
|
|||
|
Permission is granted to make and distribute verbatim copies of this
|
|||
|
manual provided the copyright notice and this permission notice are
|
|||
|
preserved on all copies.
|
|||
|
|
|||
|
Permission is granted to copy and distribute modified versions of
|
|||
|
this manual under the conditions for verbatim copying, provided also
|
|||
|
that the sections entitled "GNU General Public License," "Funding for
|
|||
|
Free Software," and "Protect Your Freedom--Fight `Look And Feel'" are
|
|||
|
included exactly as in the original, and provided that the entire
|
|||
|
resulting derived work is distributed under the terms of a permission
|
|||
|
notice identical to this one.
|
|||
|
|
|||
|
Permission is granted to copy and distribute translations of this
|
|||
|
manual into another language, under the above conditions for modified
|
|||
|
versions, except that the sections entitled "GNU General Public
|
|||
|
License," "Funding for Free Software," and "Protect Your Freedom--Fight
|
|||
|
`Look And Feel'", and this permission notice, may be included in
|
|||
|
translations approved by the Free Software Foundation instead of in the
|
|||
|
original English.
|
|||
|
|
|||
|
|
|||
|
File: gcc.info, Node: Standard Names, Next: Pattern Ordering, Prev: Constraints, Up: Machine Desc
|
|||
|
|
|||
|
Standard Pattern Names For Generation
|
|||
|
=====================================
|
|||
|
|
|||
|
Here is a table of the instruction names that are meaningful in the
|
|||
|
RTL generation pass of the compiler. Giving one of these names to an
|
|||
|
instruction pattern tells the RTL generation pass that it can use the
|
|||
|
pattern to accomplish a certain task.
|
|||
|
|
|||
|
`movM'
|
|||
|
Here M stands for a two-letter machine mode name, in lower case.
|
|||
|
This instruction pattern moves data with that machine mode from
|
|||
|
operand 1 to operand 0. For example, `movsi' moves full-word data.
|
|||
|
|
|||
|
If operand 0 is a `subreg' with mode M of a register whose own
|
|||
|
mode is wider than M, the effect of this instruction is to store
|
|||
|
the specified value in the part of the register that corresponds
|
|||
|
to mode M. The effect on the rest of the register is undefined.
|
|||
|
|
|||
|
This class of patterns is special in several ways. First of all,
|
|||
|
each of these names *must* be defined, because there is no other
|
|||
|
way to copy a datum from one place to another.
|
|||
|
|
|||
|
Second, these patterns are not used solely in the RTL generation
|
|||
|
pass. Even the reload pass can generate move insns to copy values
|
|||
|
from stack slots into temporary registers. When it does so, one
|
|||
|
of the operands is a hard register and the other is an operand
|
|||
|
that can need to be reloaded into a register.
|
|||
|
|
|||
|
Therefore, when given such a pair of operands, the pattern must
|
|||
|
generate RTL which needs no reloading and needs no temporary
|
|||
|
registers--no registers other than the operands. For example, if
|
|||
|
you support the pattern with a `define_expand', then in such a
|
|||
|
case the `define_expand' mustn't call `force_reg' or any other such
|
|||
|
function which might generate new pseudo registers.
|
|||
|
|
|||
|
This requirement exists even for subword modes on a RISC machine
|
|||
|
where fetching those modes from memory normally requires several
|
|||
|
insns and some temporary registers. Look in `spur.md' to see how
|
|||
|
the requirement can be satisfied.
|
|||
|
|
|||
|
During reload a memory reference with an invalid address may be
|
|||
|
passed as an operand. Such an address will be replaced with a
|
|||
|
valid address later in the reload pass. In this case, nothing may
|
|||
|
be done with the address except to use it as it stands. If it is
|
|||
|
copied, it will not be replaced with a valid address. No attempt
|
|||
|
should be made to make such an address into a valid address and no
|
|||
|
routine (such as `change_address') that will do so may be called.
|
|||
|
Note that `general_operand' will fail when applied to such an
|
|||
|
address.
|
|||
|
|
|||
|
The global variable `reload_in_progress' (which must be explicitly
|
|||
|
declared if required) can be used to determine whether such special
|
|||
|
handling is required.
|
|||
|
|
|||
|
The variety of operands that have reloads depends on the rest of
|
|||
|
the machine description, but typically on a RISC machine these can
|
|||
|
only be pseudo registers that did not get hard registers, while on
|
|||
|
other machines explicit memory references will get optional
|
|||
|
reloads.
|
|||
|
|
|||
|
If a scratch register is required to move an object to or from
|
|||
|
memory, it can be allocated using `gen_reg_rtx' prior to reload.
|
|||
|
But this is impossible during and after reload. If there are
|
|||
|
cases needing scratch registers after reload, you must define
|
|||
|
`SECONDARY_INPUT_RELOAD_CLASS' and perhaps also
|
|||
|
`SECONDARY_OUTPUT_RELOAD_CLASS' to detect them, and provide
|
|||
|
patterns `reload_inM' or `reload_outM' to handle them. *Note
|
|||
|
Register Classes::.
|
|||
|
|
|||
|
The constraints on a `movM' must permit moving any hard register
|
|||
|
to any other hard register provided that `HARD_REGNO_MODE_OK'
|
|||
|
permits mode M in both registers and `REGISTER_MOVE_COST' applied
|
|||
|
to their classes returns a value of 2.
|
|||
|
|
|||
|
It is obligatory to support floating point `movM' instructions
|
|||
|
into and out of any registers that can hold fixed point values,
|
|||
|
because unions and structures (which have modes `SImode' or
|
|||
|
`DImode') can be in those registers and they may have floating
|
|||
|
point members.
|
|||
|
|
|||
|
There may also be a need to support fixed point `movM'
|
|||
|
instructions in and out of floating point registers.
|
|||
|
Unfortunately, I have forgotten why this was so, and I don't know
|
|||
|
whether it is still true. If `HARD_REGNO_MODE_OK' rejects fixed
|
|||
|
point values in floating point registers, then the constraints of
|
|||
|
the fixed point `movM' instructions must be designed to avoid ever
|
|||
|
trying to reload into a floating point register.
|
|||
|
|
|||
|
`reload_inM'
|
|||
|
`reload_outM'
|
|||
|
Like `movM', but used when a scratch register is required to move
|
|||
|
between operand 0 and operand 1. Operand 2 describes the scratch
|
|||
|
register. See the discussion of the `SECONDARY_RELOAD_CLASS'
|
|||
|
macro in *note Register Classes::..
|
|||
|
|
|||
|
`movstrictM'
|
|||
|
Like `movM' except that if operand 0 is a `subreg' with mode M of
|
|||
|
a register whose natural mode is wider, the `movstrictM'
|
|||
|
instruction is guaranteed not to alter any of the register except
|
|||
|
the part which belongs to mode M.
|
|||
|
|
|||
|
`load_multiple'
|
|||
|
Load several consecutive memory locations into consecutive
|
|||
|
registers. Operand 0 is the first of the consecutive registers,
|
|||
|
operand 1 is the first memory location, and operand 2 is a
|
|||
|
constant: the number of consecutive registers.
|
|||
|
|
|||
|
Define this only if the target machine really has such an
|
|||
|
instruction; do not define this if the most efficient way of
|
|||
|
loading consecutive registers from memory is to do them one at a
|
|||
|
time.
|
|||
|
|
|||
|
On some machines, there are restrictions as to which consecutive
|
|||
|
registers can be stored into memory, such as particular starting or
|
|||
|
ending register numbers or only a range of valid counts. For those
|
|||
|
machines, use a `define_expand' (*note Expander Definitions::.)
|
|||
|
and make the pattern fail if the restrictions are not met.
|
|||
|
|
|||
|
Write the generated insn as a `parallel' with elements being a
|
|||
|
`set' of one register from the appropriate memory location (you may
|
|||
|
also need `use' or `clobber' elements). Use a `match_parallel'
|
|||
|
(*note RTL Template::.) to recognize the insn. See `a29k.md' and
|
|||
|
`rs6000.md' for examples of the use of this insn pattern.
|
|||
|
|
|||
|
`store_multiple'
|
|||
|
Similar to `load_multiple', but store several consecutive registers
|
|||
|
into consecutive memory locations. Operand 0 is the first of the
|
|||
|
consecutive memory locations, operand 1 is the first register, and
|
|||
|
operand 2 is a constant: the number of consecutive registers.
|
|||
|
|
|||
|
`addM3'
|
|||
|
Add operand 2 and operand 1, storing the result in operand 0. All
|
|||
|
operands must have mode M. This can be used even on two-address
|
|||
|
machines, by means of constraints requiring operands 1 and 0 to be
|
|||
|
the same location.
|
|||
|
|
|||
|
`subM3', `mulM3'
|
|||
|
`divM3', `udivM3', `modM3', `umodM3'
|
|||
|
`sminM3', `smaxM3', `uminM3', `umaxM3'
|
|||
|
`andM3', `iorM3', `xorM3'
|
|||
|
Similar, for other arithmetic operations.
|
|||
|
|
|||
|
`mulhisi3'
|
|||
|
Multiply operands 1 and 2, which have mode `HImode', and store a
|
|||
|
`SImode' product in operand 0.
|
|||
|
|
|||
|
`mulqihi3', `mulsidi3'
|
|||
|
Similar widening-multiplication instructions of other widths.
|
|||
|
|
|||
|
`umulqihi3', `umulhisi3', `umulsidi3'
|
|||
|
Similar widening-multiplication instructions that do unsigned
|
|||
|
multiplication.
|
|||
|
|
|||
|
`mulM3_highpart'
|
|||
|
Perform a signed multiplication of operands 1 and 2, which have
|
|||
|
mode M, and store the most significant half of the product in
|
|||
|
operand 0. The least significant half of the product is discarded.
|
|||
|
|
|||
|
`umulM3_highpart'
|
|||
|
Similar, but the multiplication is unsigned.
|
|||
|
|
|||
|
`divmodM4'
|
|||
|
Signed division that produces both a quotient and a remainder.
|
|||
|
Operand 1 is divided by operand 2 to produce a quotient stored in
|
|||
|
operand 0 and a remainder stored in operand 3.
|
|||
|
|
|||
|
For machines with an instruction that produces both a quotient and
|
|||
|
a remainder, provide a pattern for `divmodM4' but do not provide
|
|||
|
patterns for `divM3' and `modM3'. This allows optimization in the
|
|||
|
relatively common case when both the quotient and remainder are
|
|||
|
computed.
|
|||
|
|
|||
|
If an instruction that just produces a quotient or just a remainder
|
|||
|
exists and is more efficient than the instruction that produces
|
|||
|
both, write the output routine of `divmodM4' to call
|
|||
|
`find_reg_note' and look for a `REG_UNUSED' note on the quotient
|
|||
|
or remainder and generate the appropriate instruction.
|
|||
|
|
|||
|
`udivmodM4'
|
|||
|
Similar, but does unsigned division.
|
|||
|
|
|||
|
`ashlM3'
|
|||
|
Arithmetic-shift operand 1 left by a number of bits specified by
|
|||
|
operand 2, and store the result in operand 0. Here M is the mode
|
|||
|
of operand 0 and operand 1; operand 2's mode is specified by the
|
|||
|
instruction pattern, and the compiler will convert the operand to
|
|||
|
that mode before generating the instruction.
|
|||
|
|
|||
|
`ashrM3', `lshrM3', `rotlM3', `rotrM3'
|
|||
|
Other shift and rotate instructions, analogous to the `ashlM3'
|
|||
|
instructions.
|
|||
|
|
|||
|
`negM2'
|
|||
|
Negate operand 1 and store the result in operand 0.
|
|||
|
|
|||
|
`absM2'
|
|||
|
Store the absolute value of operand 1 into operand 0.
|
|||
|
|
|||
|
`sqrtM2'
|
|||
|
Store the square root of operand 1 into operand 0.
|
|||
|
|
|||
|
The `sqrt' built-in function of C always uses the mode which
|
|||
|
corresponds to the C data type `double'.
|
|||
|
|
|||
|
`ffsM2'
|
|||
|
Store into operand 0 one plus the index of the least significant
|
|||
|
1-bit of operand 1. If operand 1 is zero, store zero. M is the
|
|||
|
mode of operand 0; operand 1's mode is specified by the instruction
|
|||
|
pattern, and the compiler will convert the operand to that mode
|
|||
|
before generating the instruction.
|
|||
|
|
|||
|
The `ffs' built-in function of C always uses the mode which
|
|||
|
corresponds to the C data type `int'.
|
|||
|
|
|||
|
`one_cmplM2'
|
|||
|
Store the bitwise-complement of operand 1 into operand 0.
|
|||
|
|
|||
|
`cmpM'
|
|||
|
Compare operand 0 and operand 1, and set the condition codes. The
|
|||
|
RTL pattern should look like this:
|
|||
|
|
|||
|
(set (cc0) (compare (match_operand:M 0 ...)
|
|||
|
(match_operand:M 1 ...)))
|
|||
|
|
|||
|
`tstM'
|
|||
|
Compare operand 0 against zero, and set the condition codes. The
|
|||
|
RTL pattern should look like this:
|
|||
|
|
|||
|
(set (cc0) (match_operand:M 0 ...))
|
|||
|
|
|||
|
`tstM' patterns should not be defined for machines that do not use
|
|||
|
`(cc0)'. Doing so would confuse the optimizer since it would no
|
|||
|
longer be clear which `set' operations were comparisons. The
|
|||
|
`cmpM' patterns should be used instead.
|
|||
|
|
|||
|
`movstrM'
|
|||
|
Block move instruction. The addresses of the destination and
|
|||
|
source strings are the first two operands, and both are in mode
|
|||
|
`Pmode'.
|
|||
|
|
|||
|
The number of bytes to move is the third operand, in mode M.
|
|||
|
Usually, you specify `word_mode' for M. However, if you can
|
|||
|
generate better code knowing the range of valid lengths is smaller
|
|||
|
than those representable in a full word, you should provide a
|
|||
|
pattern with a mode corresponding to the range of values you can
|
|||
|
handle efficiently (e.g., `QImode' for values in the range 0-127;
|
|||
|
note we avoid numbers that appear negative) and also a pattern
|
|||
|
with `word_mode'.
|
|||
|
|
|||
|
The fourth operand is the known shared alignment of the source and
|
|||
|
destination, in the form of a `const_int' rtx. Thus, if the
|
|||
|
compiler knows that both source and destination are word-aligned,
|
|||
|
it may provide the value 4 for this operand.
|
|||
|
|
|||
|
These patterns need not give special consideration to the
|
|||
|
possibility that the source and destination strings might overlap.
|
|||
|
|
|||
|
`clrstrM'
|
|||
|
Block clear instruction. The addresses of the destination string
|
|||
|
is the first operand, in mode `Pmode'. The number of bytes to
|
|||
|
clear is the second operand, in mode M. See `movstrM' for a
|
|||
|
discussion of the choice of mode.
|
|||
|
|
|||
|
The third operand is the known alignment of the destination, in
|
|||
|
the form of a `const_int' rtx. Thus, if the compiler knows that
|
|||
|
the destination is word-aligned, it may provide the value 4 for
|
|||
|
this operand.
|
|||
|
|
|||
|
`cmpstrM'
|
|||
|
Block compare instruction, with five operands. Operand 0 is the
|
|||
|
output; it has mode M. The remaining four operands are like the
|
|||
|
operands of `movstrM'. The two memory blocks specified are
|
|||
|
compared byte by byte in lexicographic order. The effect of the
|
|||
|
instruction is to store a value in operand 0 whose sign indicates
|
|||
|
the result of the comparison.
|
|||
|
|
|||
|
`strlenM'
|
|||
|
Compute the length of a string, with three operands. Operand 0 is
|
|||
|
the result (of mode M), operand 1 is a `mem' referring to the
|
|||
|
first character of the string, operand 2 is the character to
|
|||
|
search for (normally zero), and operand 3 is a constant describing
|
|||
|
the known alignment of the beginning of the string.
|
|||
|
|
|||
|
`floatMN2'
|
|||
|
Convert signed integer operand 1 (valid for fixed point mode M) to
|
|||
|
floating point mode N and store in operand 0 (which has mode N).
|
|||
|
|
|||
|
`floatunsMN2'
|
|||
|
Convert unsigned integer operand 1 (valid for fixed point mode M)
|
|||
|
to floating point mode N and store in operand 0 (which has mode N).
|
|||
|
|
|||
|
`fixMN2'
|
|||
|
Convert operand 1 (valid for floating point mode M) to fixed point
|
|||
|
mode N as a signed number and store in operand 0 (which has mode
|
|||
|
N). This instruction's result is defined only when the value of
|
|||
|
operand 1 is an integer.
|
|||
|
|
|||
|
`fixunsMN2'
|
|||
|
Convert operand 1 (valid for floating point mode M) to fixed point
|
|||
|
mode N as an unsigned number and store in operand 0 (which has
|
|||
|
mode N). This instruction's result is defined only when the value
|
|||
|
of operand 1 is an integer.
|
|||
|
|
|||
|
`ftruncM2'
|
|||
|
Convert operand 1 (valid for floating point mode M) to an integer
|
|||
|
value, still represented in floating point mode M, and store it in
|
|||
|
operand 0 (valid for floating point mode M).
|
|||
|
|
|||
|
`fix_truncMN2'
|
|||
|
Like `fixMN2' but works for any floating point value of mode M by
|
|||
|
converting the value to an integer.
|
|||
|
|
|||
|
`fixuns_truncMN2'
|
|||
|
Like `fixunsMN2' but works for any floating point value of mode M
|
|||
|
by converting the value to an integer.
|
|||
|
|
|||
|
`truncMN2'
|
|||
|
Truncate operand 1 (valid for mode M) to mode N and store in
|
|||
|
operand 0 (which has mode N). Both modes must be fixed point or
|
|||
|
both floating point.
|
|||
|
|
|||
|
`extendMN2'
|
|||
|
Sign-extend operand 1 (valid for mode M) to mode N and store in
|
|||
|
operand 0 (which has mode N). Both modes must be fixed point or
|
|||
|
both floating point.
|
|||
|
|
|||
|
`zero_extendMN2'
|
|||
|
Zero-extend operand 1 (valid for mode M) to mode N and store in
|
|||
|
operand 0 (which has mode N). Both modes must be fixed point.
|
|||
|
|
|||
|
`extv'
|
|||
|
Extract a bit field from operand 1 (a register or memory operand),
|
|||
|
where operand 2 specifies the width in bits and operand 3 the
|
|||
|
starting bit, and store it in operand 0. Operand 0 must have mode
|
|||
|
`word_mode'. Operand 1 may have mode `byte_mode' or `word_mode';
|
|||
|
often `word_mode' is allowed only for registers. Operands 2 and 3
|
|||
|
must be valid for `word_mode'.
|
|||
|
|
|||
|
The RTL generation pass generates this instruction only with
|
|||
|
constants for operands 2 and 3.
|
|||
|
|
|||
|
The bit-field value is sign-extended to a full word integer before
|
|||
|
it is stored in operand 0.
|
|||
|
|
|||
|
`extzv'
|
|||
|
Like `extv' except that the bit-field value is zero-extended.
|
|||
|
|
|||
|
`insv'
|
|||
|
Store operand 3 (which must be valid for `word_mode') into a bit
|
|||
|
field in operand 0, where operand 1 specifies the width in bits and
|
|||
|
operand 2 the starting bit. Operand 0 may have mode `byte_mode' or
|
|||
|
`word_mode'; often `word_mode' is allowed only for registers.
|
|||
|
Operands 1 and 2 must be valid for `word_mode'.
|
|||
|
|
|||
|
The RTL generation pass generates this instruction only with
|
|||
|
constants for operands 1 and 2.
|
|||
|
|
|||
|
`movMODEcc'
|
|||
|
Conditionally move operand 2 or operand 3 into operand 0 according
|
|||
|
to the comparison in operand 1. If the comparison is true,
|
|||
|
operand 2 is moved into operand 0, otherwise operand 3 is moved.
|
|||
|
|
|||
|
The mode of the operands being compared need not be the same as
|
|||
|
the operands being moved. Some machines, sparc64 for example,
|
|||
|
have instructions that conditionally move an integer value based
|
|||
|
on the floating point condition codes and vice versa.
|
|||
|
|
|||
|
If the machine does not have conditional move instructions, do not
|
|||
|
define these patterns.
|
|||
|
|
|||
|
`sCOND'
|
|||
|
Store zero or nonzero in the operand according to the condition
|
|||
|
codes. Value stored is nonzero iff the condition COND is true.
|
|||
|
COND is the name of a comparison operation expression code, such
|
|||
|
as `eq', `lt' or `leu'.
|
|||
|
|
|||
|
You specify the mode that the operand must have when you write the
|
|||
|
`match_operand' expression. The compiler automatically sees which
|
|||
|
mode you have used and supplies an operand of that mode.
|
|||
|
|
|||
|
The value stored for a true condition must have 1 as its low bit,
|
|||
|
or else must be negative. Otherwise the instruction is not
|
|||
|
suitable and you should omit it from the machine description. You
|
|||
|
describe to the compiler exactly which value is stored by defining
|
|||
|
the macro `STORE_FLAG_VALUE' (*note Misc::.). If a description
|
|||
|
cannot be found that can be used for all the `sCOND' patterns, you
|
|||
|
should omit those operations from the machine description.
|
|||
|
|
|||
|
These operations may fail, but should do so only in relatively
|
|||
|
uncommon cases; if they would fail for common cases involving
|
|||
|
integer comparisons, it is best to omit these patterns.
|
|||
|
|
|||
|
If these operations are omitted, the compiler will usually
|
|||
|
generate code that copies the constant one to the target and
|
|||
|
branches around an assignment of zero to the target. If this code
|
|||
|
is more efficient than the potential instructions used for the
|
|||
|
`sCOND' pattern followed by those required to convert the result
|
|||
|
into a 1 or a zero in `SImode', you should omit the `sCOND'
|
|||
|
operations from the machine description.
|
|||
|
|
|||
|
`bCOND'
|
|||
|
Conditional branch instruction. Operand 0 is a `label_ref' that
|
|||
|
refers to the label to jump to. Jump if the condition codes meet
|
|||
|
condition COND.
|
|||
|
|
|||
|
Some machines do not follow the model assumed here where a
|
|||
|
comparison instruction is followed by a conditional branch
|
|||
|
instruction. In that case, the `cmpM' (and `tstM') patterns should
|
|||
|
simply store the operands away and generate all the required insns
|
|||
|
in a `define_expand' (*note Expander Definitions::.) for the
|
|||
|
conditional branch operations. All calls to expand `bCOND'
|
|||
|
patterns are immediately preceded by calls to expand either a
|
|||
|
`cmpM' pattern or a `tstM' pattern.
|
|||
|
|
|||
|
Machines that use a pseudo register for the condition code value,
|
|||
|
or where the mode used for the comparison depends on the condition
|
|||
|
being tested, should also use the above mechanism. *Note Jump
|
|||
|
Patterns::
|
|||
|
|
|||
|
The above discussion also applies to the `movMODEcc' and `sCOND'
|
|||
|
patterns.
|
|||
|
|
|||
|
`call'
|
|||
|
Subroutine call instruction returning no value. Operand 0 is the
|
|||
|
function to call; operand 1 is the number of bytes of arguments
|
|||
|
pushed (in mode `SImode', except it is normally a `const_int');
|
|||
|
operand 2 is the number of registers used as operands.
|
|||
|
|
|||
|
On most machines, operand 2 is not actually stored into the RTL
|
|||
|
pattern. It is supplied for the sake of some RISC machines which
|
|||
|
need to put this information into the assembler code; they can put
|
|||
|
it in the RTL instead of operand 1.
|
|||
|
|
|||
|
Operand 0 should be a `mem' RTX whose address is the address of the
|
|||
|
function. Note, however, that this address can be a `symbol_ref'
|
|||
|
expression even if it would not be a legitimate memory address on
|
|||
|
the target machine. If it is also not a valid argument for a call
|
|||
|
instruction, the pattern for this operation should be a
|
|||
|
`define_expand' (*note Expander Definitions::.) that places the
|
|||
|
address into a register and uses that register in the call
|
|||
|
instruction.
|
|||
|
|
|||
|
`call_value'
|
|||
|
Subroutine call instruction returning a value. Operand 0 is the
|
|||
|
hard register in which the value is returned. There are three more
|
|||
|
operands, the same as the three operands of the `call' instruction
|
|||
|
(but with numbers increased by one).
|
|||
|
|
|||
|
Subroutines that return `BLKmode' objects use the `call' insn.
|
|||
|
|
|||
|
`call_pop', `call_value_pop'
|
|||
|
Similar to `call' and `call_value', except used if defined and if
|
|||
|
`RETURN_POPS_ARGS' is non-zero. They should emit a `parallel'
|
|||
|
that contains both the function call and a `set' to indicate the
|
|||
|
adjustment made to the frame pointer.
|
|||
|
|
|||
|
For machines where `RETURN_POPS_ARGS' can be non-zero, the use of
|
|||
|
these patterns increases the number of functions for which the
|
|||
|
frame pointer can be eliminated, if desired.
|
|||
|
|
|||
|
`untyped_call'
|
|||
|
Subroutine call instruction returning a value of any type.
|
|||
|
Operand 0 is the function to call; operand 1 is a memory location
|
|||
|
where the result of calling the function is to be stored; operand
|
|||
|
2 is a `parallel' expression where each element is a `set'
|
|||
|
expression that indicates the saving of a function return value
|
|||
|
into the result block.
|
|||
|
|
|||
|
This instruction pattern should be defined to support
|
|||
|
`__builtin_apply' on machines where special instructions are needed
|
|||
|
to call a subroutine with arbitrary arguments or to save the value
|
|||
|
returned. This instruction pattern is required on machines that
|
|||
|
have multiple registers that can hold a return value (i.e.
|
|||
|
`FUNCTION_VALUE_REGNO_P' is true for more than one register).
|
|||
|
|
|||
|
`return'
|
|||
|
Subroutine return instruction. This instruction pattern name
|
|||
|
should be defined only if a single instruction can do all the work
|
|||
|
of returning from a function.
|
|||
|
|
|||
|
Like the `movM' patterns, this pattern is also used after the RTL
|
|||
|
generation phase. In this case it is to support machines where
|
|||
|
multiple instructions are usually needed to return from a
|
|||
|
function, but some class of functions only requires one
|
|||
|
instruction to implement a return. Normally, the applicable
|
|||
|
functions are those which do not need to save any registers or
|
|||
|
allocate stack space.
|
|||
|
|
|||
|
For such machines, the condition specified in this pattern should
|
|||
|
only be true when `reload_completed' is non-zero and the function's
|
|||
|
epilogue would only be a single instruction. For machines with
|
|||
|
register windows, the routine `leaf_function_p' may be used to
|
|||
|
determine if a register window push is required.
|
|||
|
|
|||
|
Machines that have conditional return instructions should define
|
|||
|
patterns such as
|
|||
|
|
|||
|
(define_insn ""
|
|||
|
[(set (pc)
|
|||
|
(if_then_else (match_operator
|
|||
|
0 "comparison_operator"
|
|||
|
[(cc0) (const_int 0)])
|
|||
|
(return)
|
|||
|
(pc)))]
|
|||
|
"CONDITION"
|
|||
|
"...")
|
|||
|
|
|||
|
where CONDITION would normally be the same condition specified on
|
|||
|
the named `return' pattern.
|
|||
|
|
|||
|
`untyped_return'
|
|||
|
Untyped subroutine return instruction. This instruction pattern
|
|||
|
should be defined to support `__builtin_return' on machines where
|
|||
|
special instructions are needed to return a value of any type.
|
|||
|
|
|||
|
Operand 0 is a memory location where the result of calling a
|
|||
|
function with `__builtin_apply' is stored; operand 1 is a
|
|||
|
`parallel' expression where each element is a `set' expression
|
|||
|
that indicates the restoring of a function return value from the
|
|||
|
result block.
|
|||
|
|
|||
|
`nop'
|
|||
|
No-op instruction. This instruction pattern name should always be
|
|||
|
defined to output a no-op in assembler code. `(const_int 0)' will
|
|||
|
do as an RTL pattern.
|
|||
|
|
|||
|
`indirect_jump'
|
|||
|
An instruction to jump to an address which is operand zero. This
|
|||
|
pattern name is mandatory on all machines.
|
|||
|
|
|||
|
`casesi'
|
|||
|
Instruction to jump through a dispatch table, including bounds
|
|||
|
checking. This instruction takes five operands:
|
|||
|
|
|||
|
1. The index to dispatch on, which has mode `SImode'.
|
|||
|
|
|||
|
2. The lower bound for indices in the table, an integer constant.
|
|||
|
|
|||
|
3. The total range of indices in the table--the largest index
|
|||
|
minus the smallest one (both inclusive).
|
|||
|
|
|||
|
4. A label that precedes the table itself.
|
|||
|
|
|||
|
5. A label to jump to if the index has a value outside the
|
|||
|
bounds. (If the machine-description macro
|
|||
|
`CASE_DROPS_THROUGH' is defined, then an out-of-bounds index
|
|||
|
drops through to the code following the jump table instead of
|
|||
|
jumping to this label. In that case, this label is not
|
|||
|
actually used by the `casesi' instruction, but it is always
|
|||
|
provided as an operand.)
|
|||
|
|
|||
|
The table is a `addr_vec' or `addr_diff_vec' inside of a
|
|||
|
`jump_insn'. The number of elements in the table is one plus the
|
|||
|
difference between the upper bound and the lower bound.
|
|||
|
|
|||
|
`tablejump'
|
|||
|
Instruction to jump to a variable address. This is a low-level
|
|||
|
capability which can be used to implement a dispatch table when
|
|||
|
there is no `casesi' pattern.
|
|||
|
|
|||
|
This pattern requires two operands: the address or offset, and a
|
|||
|
label which should immediately precede the jump table. If the
|
|||
|
macro `CASE_VECTOR_PC_RELATIVE' is defined then the first operand
|
|||
|
is an offset which counts from the address of the table;
|
|||
|
otherwise, it is an absolute address to jump to. In either case,
|
|||
|
the first operand has mode `Pmode'.
|
|||
|
|
|||
|
The `tablejump' insn is always the last insn before the jump table
|
|||
|
it uses. Its assembler code normally has no need to use the
|
|||
|
second operand, but you should incorporate it in the RTL pattern so
|
|||
|
that the jump optimizer will not delete the table as unreachable
|
|||
|
code.
|
|||
|
|
|||
|
`canonicalize_funcptr_for_compare'
|
|||
|
Canonicalize the function pointer in operand 1 and store the result
|
|||
|
into operand 0.
|
|||
|
|
|||
|
Operand 0 is always a `reg' and has mode `Pmode'; operand 1 may be
|
|||
|
a `reg', `mem', `symbol_ref', `const_int', etc and also has mode
|
|||
|
`Pmode'.
|
|||
|
|
|||
|
Canonicalization of a function pointer usually involves computing
|
|||
|
the address of the function which would be called if the function
|
|||
|
pointer were used in an indirect call.
|
|||
|
|
|||
|
Only define this pattern if function pointers on the target machine
|
|||
|
can have different values but still call the same function when
|
|||
|
used in an indirect call.
|
|||
|
|
|||
|
`save_stack_block'
|
|||
|
`save_stack_function'
|
|||
|
`save_stack_nonlocal'
|
|||
|
`restore_stack_block'
|
|||
|
`restore_stack_function'
|
|||
|
`restore_stack_nonlocal'
|
|||
|
Most machines save and restore the stack pointer by copying it to
|
|||
|
or from an object of mode `Pmode'. Do not define these patterns on
|
|||
|
such machines.
|
|||
|
|
|||
|
Some machines require special handling for stack pointer saves and
|
|||
|
restores. On those machines, define the patterns corresponding to
|
|||
|
the non-standard cases by using a `define_expand' (*note Expander
|
|||
|
Definitions::.) that produces the required insns. The three types
|
|||
|
of saves and restores are:
|
|||
|
|
|||
|
1. `save_stack_block' saves the stack pointer at the start of a
|
|||
|
block that allocates a variable-sized object, and
|
|||
|
`restore_stack_block' restores the stack pointer when the
|
|||
|
block is exited.
|
|||
|
|
|||
|
2. `save_stack_function' and `restore_stack_function' do a
|
|||
|
similar job for the outermost block of a function and are
|
|||
|
used when the function allocates variable-sized objects or
|
|||
|
calls `alloca'. Only the epilogue uses the restored stack
|
|||
|
pointer, allowing a simpler save or restore sequence on some
|
|||
|
machines.
|
|||
|
|
|||
|
3. `save_stack_nonlocal' is used in functions that contain labels
|
|||
|
branched to by nested functions. It saves the stack pointer
|
|||
|
in such a way that the inner function can use
|
|||
|
`restore_stack_nonlocal' to restore the stack pointer. The
|
|||
|
compiler generates code to restore the frame and argument
|
|||
|
pointer registers, but some machines require saving and
|
|||
|
restoring additional data such as register window information
|
|||
|
or stack backchains. Place insns in these patterns to save
|
|||
|
and restore any such required data.
|
|||
|
|
|||
|
When saving the stack pointer, operand 0 is the save area and
|
|||
|
operand 1 is the stack pointer. The mode used to allocate the
|
|||
|
save area is the mode of operand 0. You must specify an integral
|
|||
|
mode, or `VOIDmode' if no save area is needed for a particular
|
|||
|
type of save (either because no save is needed or because a
|
|||
|
machine-specific save area can be used). Operand 0 is the stack
|
|||
|
pointer and operand 1 is the save area for restore operations. If
|
|||
|
`save_stack_block' is defined, operand 0 must not be `VOIDmode'
|
|||
|
since these saves can be arbitrarily nested.
|
|||
|
|
|||
|
A save area is a `mem' that is at a constant offset from
|
|||
|
`virtual_stack_vars_rtx' when the stack pointer is saved for use by
|
|||
|
nonlocal gotos and a `reg' in the other two cases.
|
|||
|
|
|||
|
`allocate_stack'
|
|||
|
Subtract (or add if `STACK_GROWS_DOWNWARD' is undefined) operand 1
|
|||
|
from the stack pointer to create space for dynamically allocated
|
|||
|
data.
|
|||
|
|
|||
|
Store the resultant pointer to this space into operand 0. If you
|
|||
|
are allocating space from the main stack, do this by emitting a
|
|||
|
move insn to copy `virtual_stack_dynamic_rtx' to operand 0. If
|
|||
|
you are allocating the space elsewhere, generate code to copy the
|
|||
|
location of the space to operand 0. In the latter case, you must
|
|||
|
ensure this space gets freed when the corresponding space on the
|
|||
|
main stack is free.
|
|||
|
|
|||
|
Do not define this pattern if all that must be done is the
|
|||
|
subtraction. Some machines require other operations such as stack
|
|||
|
probes or maintaining the back chain. Define this pattern to emit
|
|||
|
those operations in addition to updating the stack pointer.
|
|||
|
|
|||
|
`probe'
|
|||
|
Some machines require instructions to be executed after space is
|
|||
|
allocated from the stack, for example to generate a reference at
|
|||
|
the bottom of the stack.
|
|||
|
|
|||
|
If you need to emit instructions before the stack has been
|
|||
|
adjusted, put them into the `allocate_stack' pattern. Otherwise,
|
|||
|
define this pattern to emit the required instructions.
|
|||
|
|
|||
|
No operands are provided.
|
|||
|
|
|||
|
`check_stack'
|
|||
|
If stack checking cannot be done on your system by probing the
|
|||
|
stack with a load or store instruction (*note Stack Checking::.),
|
|||
|
define this pattern to perform the needed check and signaling an
|
|||
|
error if the stack has overflowed. The single operand is the
|
|||
|
location in the stack furthest from the current stack pointer that
|
|||
|
you need to validate. Normally, on machines where this pattern is
|
|||
|
needed, you would obtain the stack limit from a global or
|
|||
|
thread-specific variable or register.
|
|||
|
|
|||
|
`nonlocal_goto'
|
|||
|
Emit code to generate a non-local goto, e.g., a jump from one
|
|||
|
function to a label in an outer function. This pattern has four
|
|||
|
arguments, each representing a value to be used in the jump. The
|
|||
|
first argument is to be loadedd into the frame pointer, the second
|
|||
|
is the address to branch to (code to dispatch to the actual label),
|
|||
|
the third is the address of a location where the stack is saved,
|
|||
|
and the last is the address of the label, to be placed in the
|
|||
|
location for the incoming static chain.
|
|||
|
|
|||
|
On most machines you need not define this pattern, since GNU CC
|
|||
|
will already generate the correct code, which is to load the frame
|
|||
|
pointer and static chain, restore the stack (using the
|
|||
|
`restore_stack_nonlocal' pattern, if defined), and jump indirectly
|
|||
|
to the dispatcher. You need only define this pattern if this code
|
|||
|
will not work on your machine.
|
|||
|
|
|||
|
`nonlocal_goto_receiver'
|
|||
|
This pattern, if defined, contains code needed at the target of a
|
|||
|
nonlocal goto after the code already generated by GNU CC. You
|
|||
|
will not normally need to define this pattern. A typical reason
|
|||
|
why you might need this pattern is if some value, such as a
|
|||
|
pointer to a global table, must be restored when the frame pointer
|
|||
|
is restored. There are no arguments.
|
|||
|
|
|||
|
`exception_receiver'
|
|||
|
This pattern, if defined, contains code needed at the site of an
|
|||
|
exception handler that isn't needed at the site of a nonlocal
|
|||
|
goto. You will not normally need to define this pattern. A
|
|||
|
typical reason why you might need this pattern is if some value,
|
|||
|
such as a pointer to a global table, must be restored after
|
|||
|
control flow is branched to the handler of an exception. There
|
|||
|
are no arguments.
|
|||
|
|
|||
|
`builtin_setjmp_receiver'
|
|||
|
This pattern, if defined, contains code needed at the site of an
|
|||
|
builtin setjmp that isn't needed at the site of a nonlocal goto.
|
|||
|
You will not normally need to define this pattern. A typical
|
|||
|
reason why you might need this pattern is if some value, such as a
|
|||
|
pointer to a global table, must be restored. This pattern is
|
|||
|
called immediate after the call to `__dummy' has been emitted.
|
|||
|
There are no arguments.
|
|||
|
|
|||
|
|
|||
|
File: gcc.info, Node: Pattern Ordering, Next: Dependent Patterns, Prev: Standard Names, Up: Machine Desc
|
|||
|
|
|||
|
When the Order of Patterns Matters
|
|||
|
==================================
|
|||
|
|
|||
|
Sometimes an insn can match more than one instruction pattern. Then
|
|||
|
the pattern that appears first in the machine description is the one
|
|||
|
used. Therefore, more specific patterns (patterns that will match
|
|||
|
fewer things) and faster instructions (those that will produce better
|
|||
|
code when they do match) should usually go first in the description.
|
|||
|
|
|||
|
In some cases the effect of ordering the patterns can be used to hide
|
|||
|
a pattern when it is not valid. For example, the 68000 has an
|
|||
|
instruction for converting a fullword to floating point and another for
|
|||
|
converting a byte to floating point. An instruction converting an
|
|||
|
integer to floating point could match either one. We put the pattern
|
|||
|
to convert the fullword first to make sure that one will be used rather
|
|||
|
than the other. (Otherwise a large integer might be generated as a
|
|||
|
single-byte immediate quantity, which would not work.) Instead of using
|
|||
|
this pattern ordering it would be possible to make the pattern for
|
|||
|
convert-a-byte smart enough to deal properly with any constant value.
|
|||
|
|
|||
|
|
|||
|
File: gcc.info, Node: Dependent Patterns, Next: Jump Patterns, Prev: Pattern Ordering, Up: Machine Desc
|
|||
|
|
|||
|
Interdependence of Patterns
|
|||
|
===========================
|
|||
|
|
|||
|
Every machine description must have a named pattern for each of the
|
|||
|
conditional branch names `bCOND'. The recognition template must always
|
|||
|
have the form
|
|||
|
|
|||
|
(set (pc)
|
|||
|
(if_then_else (COND (cc0) (const_int 0))
|
|||
|
(label_ref (match_operand 0 "" ""))
|
|||
|
(pc)))
|
|||
|
|
|||
|
In addition, every machine description must have an anonymous pattern
|
|||
|
for each of the possible reverse-conditional branches. Their templates
|
|||
|
look like
|
|||
|
|
|||
|
(set (pc)
|
|||
|
(if_then_else (COND (cc0) (const_int 0))
|
|||
|
(pc)
|
|||
|
(label_ref (match_operand 0 "" ""))))
|
|||
|
|
|||
|
They are necessary because jump optimization can turn direct-conditional
|
|||
|
branches into reverse-conditional branches.
|
|||
|
|
|||
|
It is often convenient to use the `match_operator' construct to
|
|||
|
reduce the number of patterns that must be specified for branches. For
|
|||
|
example,
|
|||
|
|
|||
|
(define_insn ""
|
|||
|
[(set (pc)
|
|||
|
(if_then_else (match_operator 0 "comparison_operator"
|
|||
|
[(cc0) (const_int 0)])
|
|||
|
(pc)
|
|||
|
(label_ref (match_operand 1 "" ""))))]
|
|||
|
"CONDITION"
|
|||
|
"...")
|
|||
|
|
|||
|
In some cases machines support instructions identical except for the
|
|||
|
machine mode of one or more operands. For example, there may be
|
|||
|
"sign-extend halfword" and "sign-extend byte" instructions whose
|
|||
|
patterns are
|
|||
|
|
|||
|
(set (match_operand:SI 0 ...)
|
|||
|
(extend:SI (match_operand:HI 1 ...)))
|
|||
|
|
|||
|
(set (match_operand:SI 0 ...)
|
|||
|
(extend:SI (match_operand:QI 1 ...)))
|
|||
|
|
|||
|
Constant integers do not specify a machine mode, so an instruction to
|
|||
|
extend a constant value could match either pattern. The pattern it
|
|||
|
actually will match is the one that appears first in the file. For
|
|||
|
correct results, this must be the one for the widest possible mode
|
|||
|
(`HImode', here). If the pattern matches the `QImode' instruction, the
|
|||
|
results will be incorrect if the constant value does not actually fit
|
|||
|
that mode.
|
|||
|
|
|||
|
Such instructions to extend constants are rarely generated because
|
|||
|
they are optimized away, but they do occasionally happen in nonoptimized
|
|||
|
compilations.
|
|||
|
|
|||
|
If a constraint in a pattern allows a constant, the reload pass may
|
|||
|
replace a register with a constant permitted by the constraint in some
|
|||
|
cases. Similarly for memory references. Because of this substitution,
|
|||
|
you should not provide separate patterns for increment and decrement
|
|||
|
instructions. Instead, they should be generated from the same pattern
|
|||
|
that supports register-register add insns by examining the operands and
|
|||
|
generating the appropriate machine instruction.
|
|||
|
|
|||
|
|
|||
|
File: gcc.info, Node: Jump Patterns, Next: Insn Canonicalizations, Prev: Dependent Patterns, Up: Machine Desc
|
|||
|
|
|||
|
Defining Jump Instruction Patterns
|
|||
|
==================================
|
|||
|
|
|||
|
For most machines, GNU CC assumes that the machine has a condition
|
|||
|
code. A comparison insn sets the condition code, recording the results
|
|||
|
of both signed and unsigned comparison of the given operands. A
|
|||
|
separate branch insn tests the condition code and branches or not
|
|||
|
according its value. The branch insns come in distinct signed and
|
|||
|
unsigned flavors. Many common machines, such as the Vax, the 68000 and
|
|||
|
the 32000, work this way.
|
|||
|
|
|||
|
Some machines have distinct signed and unsigned compare
|
|||
|
instructions, and only one set of conditional branch instructions. The
|
|||
|
easiest way to handle these machines is to treat them just like the
|
|||
|
others until the final stage where assembly code is written. At this
|
|||
|
time, when outputting code for the compare instruction, peek ahead at
|
|||
|
the following branch using `next_cc0_user (insn)'. (The variable
|
|||
|
`insn' refers to the insn being output, in the output-writing code in
|
|||
|
an instruction pattern.) If the RTL says that is an unsigned branch,
|
|||
|
output an unsigned compare; otherwise output a signed compare. When
|
|||
|
the branch itself is output, you can treat signed and unsigned branches
|
|||
|
identically.
|
|||
|
|
|||
|
The reason you can do this is that GNU CC always generates a pair of
|
|||
|
consecutive RTL insns, possibly separated by `note' insns, one to set
|
|||
|
the condition code and one to test it, and keeps the pair inviolate
|
|||
|
until the end.
|
|||
|
|
|||
|
To go with this technique, you must define the machine-description
|
|||
|
macro `NOTICE_UPDATE_CC' to do `CC_STATUS_INIT'; in other words, no
|
|||
|
compare instruction is superfluous.
|
|||
|
|
|||
|
Some machines have compare-and-branch instructions and no condition
|
|||
|
code. A similar technique works for them. When it is time to "output"
|
|||
|
a compare instruction, record its operands in two static variables.
|
|||
|
When outputting the branch-on-condition-code instruction that follows,
|
|||
|
actually output a compare-and-branch instruction that uses the
|
|||
|
remembered operands.
|
|||
|
|
|||
|
It also works to define patterns for compare-and-branch instructions.
|
|||
|
In optimizing compilation, the pair of compare and branch instructions
|
|||
|
will be combined according to these patterns. But this does not happen
|
|||
|
if optimization is not requested. So you must use one of the solutions
|
|||
|
above in addition to any special patterns you define.
|
|||
|
|
|||
|
In many RISC machines, most instructions do not affect the condition
|
|||
|
code and there may not even be a separate condition code register. On
|
|||
|
these machines, the restriction that the definition and use of the
|
|||
|
condition code be adjacent insns is not necessary and can prevent
|
|||
|
important optimizations. For example, on the IBM RS/6000, there is a
|
|||
|
delay for taken branches unless the condition code register is set three
|
|||
|
instructions earlier than the conditional branch. The instruction
|
|||
|
scheduler cannot perform this optimization if it is not permitted to
|
|||
|
separate the definition and use of the condition code register.
|
|||
|
|
|||
|
On these machines, do not use `(cc0)', but instead use a register to
|
|||
|
represent the condition code. If there is a specific condition code
|
|||
|
register in the machine, use a hard register. If the condition code or
|
|||
|
comparison result can be placed in any general register, or if there are
|
|||
|
multiple condition registers, use a pseudo register.
|
|||
|
|
|||
|
On some machines, the type of branch instruction generated may
|
|||
|
depend on the way the condition code was produced; for example, on the
|
|||
|
68k and Sparc, setting the condition code directly from an add or
|
|||
|
subtract instruction does not clear the overflow bit the way that a test
|
|||
|
instruction does, so a different branch instruction must be used for
|
|||
|
some conditional branches. For machines that use `(cc0)', the set and
|
|||
|
use of the condition code must be adjacent (separated only by `note'
|
|||
|
insns) allowing flags in `cc_status' to be used. (*Note Condition
|
|||
|
Code::.) Also, the comparison and branch insns can be located from
|
|||
|
each other by using the functions `prev_cc0_setter' and `next_cc0_user'.
|
|||
|
|
|||
|
However, this is not true on machines that do not use `(cc0)'. On
|
|||
|
those machines, no assumptions can be made about the adjacency of the
|
|||
|
compare and branch insns and the above methods cannot be used. Instead,
|
|||
|
we use the machine mode of the condition code register to record
|
|||
|
different formats of the condition code register.
|
|||
|
|
|||
|
Registers used to store the condition code value should have a mode
|
|||
|
that is in class `MODE_CC'. Normally, it will be `CCmode'. If
|
|||
|
additional modes are required (as for the add example mentioned above in
|
|||
|
the Sparc), define the macro `EXTRA_CC_MODES' to list the additional
|
|||
|
modes required (*note Condition Code::.). Also define `EXTRA_CC_NAMES'
|
|||
|
to list the names of those modes and `SELECT_CC_MODE' to choose a mode
|
|||
|
given an operand of a compare.
|
|||
|
|
|||
|
If it is known during RTL generation that a different mode will be
|
|||
|
required (for example, if the machine has separate compare instructions
|
|||
|
for signed and unsigned quantities, like most IBM processors), they can
|
|||
|
be specified at that time.
|
|||
|
|
|||
|
If the cases that require different modes would be made by
|
|||
|
instruction combination, the macro `SELECT_CC_MODE' determines which
|
|||
|
machine mode should be used for the comparison result. The patterns
|
|||
|
should be written using that mode. To support the case of the add on
|
|||
|
the Sparc discussed above, we have the pattern
|
|||
|
|
|||
|
(define_insn ""
|
|||
|
[(set (reg:CC_NOOV 0)
|
|||
|
(compare:CC_NOOV
|
|||
|
(plus:SI (match_operand:SI 0 "register_operand" "%r")
|
|||
|
(match_operand:SI 1 "arith_operand" "rI"))
|
|||
|
(const_int 0)))]
|
|||
|
""
|
|||
|
"...")
|
|||
|
|
|||
|
The `SELECT_CC_MODE' macro on the Sparc returns `CC_NOOVmode' for
|
|||
|
comparisons whose argument is a `plus'.
|
|||
|
|
|||
|
|
|||
|
File: gcc.info, Node: Insn Canonicalizations, Next: Peephole Definitions, Prev: Jump Patterns, Up: Machine Desc
|
|||
|
|
|||
|
Canonicalization of Instructions
|
|||
|
================================
|
|||
|
|
|||
|
There are often cases where multiple RTL expressions could represent
|
|||
|
an operation performed by a single machine instruction. This situation
|
|||
|
is most commonly encountered with logical, branch, and
|
|||
|
multiply-accumulate instructions. In such cases, the compiler attempts
|
|||
|
to convert these multiple RTL expressions into a single canonical form
|
|||
|
to reduce the number of insn patterns required.
|
|||
|
|
|||
|
In addition to algebraic simplifications, following canonicalizations
|
|||
|
are performed:
|
|||
|
|
|||
|
* For commutative and comparison operators, a constant is always
|
|||
|
made the second operand. If a machine only supports a constant as
|
|||
|
the second operand, only patterns that match a constant in the
|
|||
|
second operand need be supplied.
|
|||
|
|
|||
|
For these operators, if only one operand is a `neg', `not',
|
|||
|
`mult', `plus', or `minus' expression, it will be the first
|
|||
|
operand.
|
|||
|
|
|||
|
* For the `compare' operator, a constant is always the second operand
|
|||
|
on machines where `cc0' is used (*note Jump Patterns::.). On other
|
|||
|
machines, there are rare cases where the compiler might want to
|
|||
|
construct a `compare' with a constant as the first operand.
|
|||
|
However, these cases are not common enough for it to be worthwhile
|
|||
|
to provide a pattern matching a constant as the first operand
|
|||
|
unless the machine actually has such an instruction.
|
|||
|
|
|||
|
An operand of `neg', `not', `mult', `plus', or `minus' is made the
|
|||
|
first operand under the same conditions as above.
|
|||
|
|
|||
|
* `(minus X (const_int N))' is converted to `(plus X (const_int
|
|||
|
-N))'.
|
|||
|
|
|||
|
* Within address computations (i.e., inside `mem'), a left shift is
|
|||
|
converted into the appropriate multiplication by a power of two.
|
|||
|
|
|||
|
* De`Morgan's Law is used to move bitwise negation inside a bitwise
|
|||
|
logical-and or logical-or operation. If this results in only one
|
|||
|
operand being a `not' expression, it will be the first one.
|
|||
|
|
|||
|
A machine that has an instruction that performs a bitwise
|
|||
|
logical-and of one operand with the bitwise negation of the other
|
|||
|
should specify the pattern for that instruction as
|
|||
|
|
|||
|
(define_insn ""
|
|||
|
[(set (match_operand:M 0 ...)
|
|||
|
(and:M (not:M (match_operand:M 1 ...))
|
|||
|
(match_operand:M 2 ...)))]
|
|||
|
"..."
|
|||
|
"...")
|
|||
|
|
|||
|
Similarly, a pattern for a "NAND" instruction should be written
|
|||
|
|
|||
|
(define_insn ""
|
|||
|
[(set (match_operand:M 0 ...)
|
|||
|
(ior:M (not:M (match_operand:M 1 ...))
|
|||
|
(not:M (match_operand:M 2 ...))))]
|
|||
|
"..."
|
|||
|
"...")
|
|||
|
|
|||
|
In both cases, it is not necessary to include patterns for the many
|
|||
|
logically equivalent RTL expressions.
|
|||
|
|
|||
|
* The only possible RTL expressions involving both bitwise
|
|||
|
exclusive-or and bitwise negation are `(xor:M X Y)' and `(not:M
|
|||
|
(xor:M X Y))'.
|
|||
|
|
|||
|
* The sum of three items, one of which is a constant, will only
|
|||
|
appear in the form
|
|||
|
|
|||
|
(plus:M (plus:M X Y) CONSTANT)
|
|||
|
|
|||
|
* On machines that do not use `cc0', `(compare X (const_int 0))'
|
|||
|
will be converted to X.
|
|||
|
|
|||
|
* Equality comparisons of a group of bits (usually a single bit)
|
|||
|
with zero will be written using `zero_extract' rather than the
|
|||
|
equivalent `and' or `sign_extract' operations.
|
|||
|
|