mirror of
https://github.com/pmret/gcc-papermario.git
synced 2024-11-08 11:53:01 +01:00
1094 lines
48 KiB
Plaintext
1094 lines
48 KiB
Plaintext
This is Info file gcc.info, produced by Makeinfo version 1.67 from the
|
||
input file gcc.texi.
|
||
|
||
This file documents the use and the internals of the GNU compiler.
|
||
|
||
Published by the Free Software Foundation 59 Temple Place - Suite 330
|
||
Boston, MA 02111-1307 USA
|
||
|
||
Copyright (C) 1988, 1989, 1992, 1993, 1994, 1995, 1996, 1997, 1998
|
||
Free Software Foundation, Inc.
|
||
|
||
Permission is granted to make and distribute verbatim copies of this
|
||
manual provided the copyright notice and this permission notice are
|
||
preserved on all copies.
|
||
|
||
Permission is granted to copy and distribute modified versions of
|
||
this manual under the conditions for verbatim copying, provided also
|
||
that the sections entitled "GNU General Public License," "Funding for
|
||
Free Software," and "Protect Your Freedom--Fight `Look And Feel'" are
|
||
included exactly as in the original, and provided that the entire
|
||
resulting derived work is distributed under the terms of a permission
|
||
notice identical to this one.
|
||
|
||
Permission is granted to copy and distribute translations of this
|
||
manual into another language, under the above conditions for modified
|
||
versions, except that the sections entitled "GNU General Public
|
||
License," "Funding for Free Software," and "Protect Your Freedom--Fight
|
||
`Look And Feel'", and this permission notice, may be included in
|
||
translations approved by the Free Software Foundation instead of in the
|
||
original English.
|
||
|
||
|
||
File: gcc.info, Node: Inline, Next: Extended Asm, Prev: Alignment, Up: C Extensions
|
||
|
||
An Inline Function is As Fast As a Macro
|
||
========================================
|
||
|
||
By declaring a function `inline', you can direct GNU CC to integrate
|
||
that function's code into the code for its callers. This makes
|
||
execution faster by eliminating the function-call overhead; in
|
||
addition, if any of the actual argument values are constant, their known
|
||
values may permit simplifications at compile time so that not all of the
|
||
inline function's code needs to be included. The effect on code size is
|
||
less predictable; object code may be larger or smaller with function
|
||
inlining, depending on the particular case. Inlining of functions is an
|
||
optimization and it really "works" only in optimizing compilation. If
|
||
you don't use `-O', no function is really inline.
|
||
|
||
To declare a function inline, use the `inline' keyword in its
|
||
declaration, like this:
|
||
|
||
inline int
|
||
inc (int *a)
|
||
{
|
||
(*a)++;
|
||
}
|
||
|
||
(If you are writing a header file to be included in ANSI C programs,
|
||
write `__inline__' instead of `inline'. *Note Alternate Keywords::.)
|
||
|
||
You can also make all "simple enough" functions inline with the
|
||
option `-finline-functions'. Note that certain usages in a function
|
||
definition can make it unsuitable for inline substitution.
|
||
|
||
Note that in C and Objective C, unlike C++, the `inline' keyword
|
||
does not affect the linkage of the function.
|
||
|
||
GNU CC automatically inlines member functions defined within the
|
||
class body of C++ programs even if they are not explicitly declared
|
||
`inline'. (You can override this with `-fno-default-inline'; *note
|
||
Options Controlling C++ Dialect: C++ Dialect Options..)
|
||
|
||
When a function is both inline and `static', if all calls to the
|
||
function are integrated into the caller, and the function's address is
|
||
never used, then the function's own assembler code is never referenced.
|
||
In this case, GNU CC does not actually output assembler code for the
|
||
function, unless you specify the option `-fkeep-inline-functions'.
|
||
Some calls cannot be integrated for various reasons (in particular,
|
||
calls that precede the function's definition cannot be integrated, and
|
||
neither can recursive calls within the definition). If there is a
|
||
nonintegrated call, then the function is compiled to assembler code as
|
||
usual. The function must also be compiled as usual if the program
|
||
refers to its address, because that can't be inlined.
|
||
|
||
When an inline function is not `static', then the compiler must
|
||
assume that there may be calls from other source files; since a global
|
||
symbol can be defined only once in any program, the function must not
|
||
be defined in the other source files, so the calls therein cannot be
|
||
integrated. Therefore, a non-`static' inline function is always
|
||
compiled on its own in the usual fashion.
|
||
|
||
If you specify both `inline' and `extern' in the function
|
||
definition, then the definition is used only for inlining. In no case
|
||
is the function compiled on its own, not even if you refer to its
|
||
address explicitly. Such an address becomes an external reference, as
|
||
if you had only declared the function, and had not defined it.
|
||
|
||
This combination of `inline' and `extern' has almost the effect of a
|
||
macro. The way to use it is to put a function definition in a header
|
||
file with these keywords, and put another copy of the definition
|
||
(lacking `inline' and `extern') in a library file. The definition in
|
||
the header file will cause most calls to the function to be inlined.
|
||
If any uses of the function remain, they will refer to the single copy
|
||
in the library.
|
||
|
||
GNU C does not inline any functions when not optimizing. It is not
|
||
clear whether it is better to inline or not, in this case, but we found
|
||
that a correct implementation when not optimizing was difficult. So we
|
||
did the easy thing, and turned it off.
|
||
|
||
|
||
File: gcc.info, Node: Extended Asm, Next: Asm Labels, Prev: Inline, Up: C Extensions
|
||
|
||
Assembler Instructions with C Expression Operands
|
||
=================================================
|
||
|
||
In an assembler instruction using `asm', you can specify the
|
||
operands of the instruction using C expressions. This means you need
|
||
not guess which registers or memory locations will contain the data you
|
||
want to use.
|
||
|
||
You must specify an assembler instruction template much like what
|
||
appears in a machine description, plus an operand constraint string for
|
||
each operand.
|
||
|
||
For example, here is how to use the 68881's `fsinx' instruction:
|
||
|
||
asm ("fsinx %1,%0" : "=f" (result) : "f" (angle));
|
||
|
||
Here `angle' is the C expression for the input operand while `result'
|
||
is that of the output operand. Each has `"f"' as its operand
|
||
constraint, saying that a floating point register is required. The `='
|
||
in `=f' indicates that the operand is an output; all output operands'
|
||
constraints must use `='. The constraints use the same language used
|
||
in the machine description (*note Constraints::.).
|
||
|
||
Each operand is described by an operand-constraint string followed by
|
||
the C expression in parentheses. A colon separates the assembler
|
||
template from the first output operand and another separates the last
|
||
output operand from the first input, if any. Commas separate the
|
||
operands within each group. The total number of operands is limited to
|
||
ten or to the maximum number of operands in any instruction pattern in
|
||
the machine description, whichever is greater.
|
||
|
||
If there are no output operands but there are input operands, you
|
||
must place two consecutive colons surrounding the place where the output
|
||
operands would go.
|
||
|
||
Output operand expressions must be lvalues; the compiler can check
|
||
this. The input operands need not be lvalues. The compiler cannot
|
||
check whether the operands have data types that are reasonable for the
|
||
instruction being executed. It does not parse the assembler instruction
|
||
template and does not know what it means or even whether it is valid
|
||
assembler input. The extended `asm' feature is most often used for
|
||
machine instructions the compiler itself does not know exist. If the
|
||
output expression cannot be directly addressed (for example, it is a
|
||
bit field), your constraint must allow a register. In that case, GNU CC
|
||
will use the register as the output of the `asm', and then store that
|
||
register into the output.
|
||
|
||
The ordinary output operands must be write-only; GNU CC will assume
|
||
that the values in these operands before the instruction are dead and
|
||
need not be generated. Extended asm supports input-output or read-write
|
||
operands. Use the constraint character `+' to indicate such an operand
|
||
and list it with the output operands.
|
||
|
||
When the constraints for the read-write operand (or the operand in
|
||
which only some of the bits are to be changed) allows a register, you
|
||
may, as an alternative, logically split its function into two separate
|
||
operands, one input operand and one write-only output operand. The
|
||
connection between them is expressed by constraints which say they need
|
||
to be in the same location when the instruction executes. You can use
|
||
the same C expression for both operands, or different expressions. For
|
||
example, here we write the (fictitious) `combine' instruction with
|
||
`bar' as its read-only source operand and `foo' as its read-write
|
||
destination:
|
||
|
||
asm ("combine %2,%0" : "=r" (foo) : "0" (foo), "g" (bar));
|
||
|
||
The constraint `"0"' for operand 1 says that it must occupy the same
|
||
location as operand 0. A digit in constraint is allowed only in an
|
||
input operand and it must refer to an output operand.
|
||
|
||
Only a digit in the constraint can guarantee that one operand will
|
||
be in the same place as another. The mere fact that `foo' is the value
|
||
of both operands is not enough to guarantee that they will be in the
|
||
same place in the generated assembler code. The following would not
|
||
work reliably:
|
||
|
||
asm ("combine %2,%0" : "=r" (foo) : "r" (foo), "g" (bar));
|
||
|
||
Various optimizations or reloading could cause operands 0 and 1 to
|
||
be in different registers; GNU CC knows no reason not to do so. For
|
||
example, the compiler might find a copy of the value of `foo' in one
|
||
register and use it for operand 1, but generate the output operand 0 in
|
||
a different register (copying it afterward to `foo''s own address). Of
|
||
course, since the register for operand 1 is not even mentioned in the
|
||
assembler code, the result will not work, but GNU CC can't tell that.
|
||
|
||
Some instructions clobber specific hard registers. To describe this,
|
||
write a third colon after the input operands, followed by the names of
|
||
the clobbered hard registers (given as strings). Here is a realistic
|
||
example for the VAX:
|
||
|
||
asm volatile ("movc3 %0,%1,%2"
|
||
: /* no outputs */
|
||
: "g" (from), "g" (to), "g" (count)
|
||
: "r0", "r1", "r2", "r3", "r4", "r5");
|
||
|
||
If you refer to a particular hardware register from the assembler
|
||
code, you will probably have to list the register after the third colon
|
||
to tell the compiler the register's value is modified. In some
|
||
assemblers, the register names begin with `%'; to produce one `%' in the
|
||
assembler code, you must write `%%' in the input.
|
||
|
||
If your assembler instruction can alter the condition code register,
|
||
add `cc' to the list of clobbered registers. GNU CC on some machines
|
||
represents the condition codes as a specific hardware register; `cc'
|
||
serves to name this register. On other machines, the condition code is
|
||
handled differently, and specifying `cc' has no effect. But it is
|
||
valid no matter what the machine.
|
||
|
||
If your assembler instruction modifies memory in an unpredictable
|
||
fashion, add `memory' to the list of clobbered registers. This will
|
||
cause GNU CC to not keep memory values cached in registers across the
|
||
assembler instruction.
|
||
|
||
You can put multiple assembler instructions together in a single
|
||
`asm' template, separated either with newlines (written as `\n') or
|
||
with semicolons if the assembler allows such semicolons. The GNU
|
||
assembler allows semicolons and most Unix assemblers seem to do so.
|
||
The input operands are guaranteed not to use any of the clobbered
|
||
registers, and neither will the output operands' addresses, so you can
|
||
read and write the clobbered registers as many times as you like. Here
|
||
is an example of multiple instructions in a template; it assumes the
|
||
subroutine `_foo' accepts arguments in registers 9 and 10:
|
||
|
||
asm ("movl %0,r9;movl %1,r10;call _foo"
|
||
: /* no outputs */
|
||
: "g" (from), "g" (to)
|
||
: "r9", "r10");
|
||
|
||
Unless an output operand has the `&' constraint modifier, GNU CC may
|
||
allocate it in the same register as an unrelated input operand, on the
|
||
assumption the inputs are consumed before the outputs are produced.
|
||
This assumption may be false if the assembler code actually consists of
|
||
more than one instruction. In such a case, use `&' for each output
|
||
operand that may not overlap an input. *Note Modifiers::.
|
||
|
||
If you want to test the condition code produced by an assembler
|
||
instruction, you must include a branch and a label in the `asm'
|
||
construct, as follows:
|
||
|
||
asm ("clr %0;frob %1;beq 0f;mov #1,%0;0:"
|
||
: "g" (result)
|
||
: "g" (input));
|
||
|
||
This assumes your assembler supports local labels, as the GNU assembler
|
||
and most Unix assemblers do.
|
||
|
||
Speaking of labels, jumps from one `asm' to another are not
|
||
supported. The compiler's optimizers do not know about these jumps, and
|
||
therefore they cannot take account of them when deciding how to
|
||
optimize.
|
||
|
||
Usually the most convenient way to use these `asm' instructions is to
|
||
encapsulate them in macros that look like functions. For example,
|
||
|
||
#define sin(x) \
|
||
({ double __value, __arg = (x); \
|
||
asm ("fsinx %1,%0": "=f" (__value): "f" (__arg)); \
|
||
__value; })
|
||
|
||
Here the variable `__arg' is used to make sure that the instruction
|
||
operates on a proper `double' value, and to accept only those arguments
|
||
`x' which can convert automatically to a `double'.
|
||
|
||
Another way to make sure the instruction operates on the correct data
|
||
type is to use a cast in the `asm'. This is different from using a
|
||
variable `__arg' in that it converts more different types. For
|
||
example, if the desired type were `int', casting the argument to `int'
|
||
would accept a pointer with no complaint, while assigning the argument
|
||
to an `int' variable named `__arg' would warn about using a pointer
|
||
unless the caller explicitly casts it.
|
||
|
||
If an `asm' has output operands, GNU CC assumes for optimization
|
||
purposes the instruction has no side effects except to change the output
|
||
operands. This does not mean instructions with a side effect cannot be
|
||
used, but you must be careful, because the compiler may eliminate them
|
||
if the output operands aren't used, or move them out of loops, or
|
||
replace two with one if they constitute a common subexpression. Also,
|
||
if your instruction does have a side effect on a variable that otherwise
|
||
appears not to change, the old value of the variable may be reused later
|
||
if it happens to be found in a register.
|
||
|
||
You can prevent an `asm' instruction from being deleted, moved
|
||
significantly, or combined, by writing the keyword `volatile' after the
|
||
`asm'. For example:
|
||
|
||
#define get_and_set_priority(new) \
|
||
({ int __old; \
|
||
asm volatile ("get_and_set_priority %0, %1": "=g" (__old) : "g" (new)); \
|
||
__old; })
|
||
b
|
||
|
||
If you write an `asm' instruction with no outputs, GNU CC will know the
|
||
instruction has side-effects and will not delete the instruction or
|
||
move it outside of loops. If the side-effects of your instruction are
|
||
not purely external, but will affect variables in your program in ways
|
||
other than reading the inputs and clobbering the specified registers or
|
||
memory, you should write the `volatile' keyword to prevent future
|
||
versions of GNU CC from moving the instruction around within a core
|
||
region.
|
||
|
||
An `asm' instruction without any operands or clobbers (and "old
|
||
style" `asm') will not be deleted or moved significantly, regardless,
|
||
unless it is unreachable, the same wasy as if you had written a
|
||
`volatile' keyword.
|
||
|
||
Note that even a volatile `asm' instruction can be moved in ways
|
||
that appear insignificant to the compiler, such as across jump
|
||
instructions. You can't expect a sequence of volatile `asm'
|
||
instructions to remain perfectly consecutive. If you want consecutive
|
||
output, use a single `asm'.
|
||
|
||
It is a natural idea to look for a way to give access to the
|
||
condition code left by the assembler instruction. However, when we
|
||
attempted to implement this, we found no way to make it work reliably.
|
||
The problem is that output operands might need reloading, which would
|
||
result in additional following "store" instructions. On most machines,
|
||
these instructions would alter the condition code before there was time
|
||
to test it. This problem doesn't arise for ordinary "test" and
|
||
"compare" instructions because they don't have any output operands.
|
||
|
||
If you are writing a header file that should be includable in ANSI C
|
||
programs, write `__asm__' instead of `asm'. *Note Alternate Keywords::.
|
||
|
||
|
||
File: gcc.info, Node: Asm Labels, Next: Explicit Reg Vars, Prev: Extended Asm, Up: C Extensions
|
||
|
||
Controlling Names Used in Assembler Code
|
||
========================================
|
||
|
||
You can specify the name to be used in the assembler code for a C
|
||
function or variable by writing the `asm' (or `__asm__') keyword after
|
||
the declarator as follows:
|
||
|
||
int foo asm ("myfoo") = 2;
|
||
|
||
This specifies that the name to be used for the variable `foo' in the
|
||
assembler code should be `myfoo' rather than the usual `_foo'.
|
||
|
||
On systems where an underscore is normally prepended to the name of
|
||
a C function or variable, this feature allows you to define names for
|
||
the linker that do not start with an underscore.
|
||
|
||
You cannot use `asm' in this way in a function *definition*; but you
|
||
can get the same effect by writing a declaration for the function
|
||
before its definition and putting `asm' there, like this:
|
||
|
||
extern func () asm ("FUNC");
|
||
|
||
func (x, y)
|
||
int x, y;
|
||
...
|
||
|
||
It is up to you to make sure that the assembler names you choose do
|
||
not conflict with any other assembler symbols. Also, you must not use a
|
||
register name; that would produce completely invalid assembler code.
|
||
GNU CC does not as yet have the ability to store static variables in
|
||
registers. Perhaps that will be added.
|
||
|
||
|
||
File: gcc.info, Node: Explicit Reg Vars, Next: Alternate Keywords, Prev: Asm Labels, Up: C Extensions
|
||
|
||
Variables in Specified Registers
|
||
================================
|
||
|
||
GNU C allows you to put a few global variables into specified
|
||
hardware registers. You can also specify the register in which an
|
||
ordinary register variable should be allocated.
|
||
|
||
* Global register variables reserve registers throughout the program.
|
||
This may be useful in programs such as programming language
|
||
interpreters which have a couple of global variables that are
|
||
accessed very often.
|
||
|
||
* Local register variables in specific registers do not reserve the
|
||
registers. The compiler's data flow analysis is capable of
|
||
determining where the specified registers contain live values, and
|
||
where they are available for other uses.
|
||
|
||
These local variables are sometimes convenient for use with the
|
||
extended `asm' feature (*note Extended Asm::.), if you want to
|
||
write one output of the assembler instruction directly into a
|
||
particular register. (This will work provided the register you
|
||
specify fits the constraints specified for that operand in the
|
||
`asm'.)
|
||
|
||
* Menu:
|
||
|
||
* Global Reg Vars::
|
||
* Local Reg Vars::
|
||
|
||
|
||
File: gcc.info, Node: Global Reg Vars, Next: Local Reg Vars, Up: Explicit Reg Vars
|
||
|
||
Defining Global Register Variables
|
||
----------------------------------
|
||
|
||
You can define a global register variable in GNU C like this:
|
||
|
||
register int *foo asm ("a5");
|
||
|
||
Here `a5' is the name of the register which should be used. Choose a
|
||
register which is normally saved and restored by function calls on your
|
||
machine, so that library routines will not clobber it.
|
||
|
||
Naturally the register name is cpu-dependent, so you would need to
|
||
conditionalize your program according to cpu type. The register `a5'
|
||
would be a good choice on a 68000 for a variable of pointer type. On
|
||
machines with register windows, be sure to choose a "global" register
|
||
that is not affected magically by the function call mechanism.
|
||
|
||
In addition, operating systems on one type of cpu may differ in how
|
||
they name the registers; then you would need additional conditionals.
|
||
For example, some 68000 operating systems call this register `%a5'.
|
||
|
||
Eventually there may be a way of asking the compiler to choose a
|
||
register automatically, but first we need to figure out how it should
|
||
choose and how to enable you to guide the choice. No solution is
|
||
evident.
|
||
|
||
Defining a global register variable in a certain register reserves
|
||
that register entirely for this use, at least within the current
|
||
compilation. The register will not be allocated for any other purpose
|
||
in the functions in the current compilation. The register will not be
|
||
saved and restored by these functions. Stores into this register are
|
||
never deleted even if they would appear to be dead, but references may
|
||
be deleted or moved or simplified.
|
||
|
||
It is not safe to access the global register variables from signal
|
||
handlers, or from more than one thread of control, because the system
|
||
library routines may temporarily use the register for other things
|
||
(unless you recompile them specially for the task at hand).
|
||
|
||
It is not safe for one function that uses a global register variable
|
||
to call another such function `foo' by way of a third function `lose'
|
||
that was compiled without knowledge of this variable (i.e. in a
|
||
different source file in which the variable wasn't declared). This is
|
||
because `lose' might save the register and put some other value there.
|
||
For example, you can't expect a global register variable to be
|
||
available in the comparison-function that you pass to `qsort', since
|
||
`qsort' might have put something else in that register. (If you are
|
||
prepared to recompile `qsort' with the same global register variable,
|
||
you can solve this problem.)
|
||
|
||
If you want to recompile `qsort' or other source files which do not
|
||
actually use your global register variable, so that they will not use
|
||
that register for any other purpose, then it suffices to specify the
|
||
compiler option `-ffixed-REG'. You need not actually add a global
|
||
register declaration to their source code.
|
||
|
||
A function which can alter the value of a global register variable
|
||
cannot safely be called from a function compiled without this variable,
|
||
because it could clobber the value the caller expects to find there on
|
||
return. Therefore, the function which is the entry point into the part
|
||
of the program that uses the global register variable must explicitly
|
||
save and restore the value which belongs to its caller.
|
||
|
||
On most machines, `longjmp' will restore to each global register
|
||
variable the value it had at the time of the `setjmp'. On some
|
||
machines, however, `longjmp' will not change the value of global
|
||
register variables. To be portable, the function that called `setjmp'
|
||
should make other arrangements to save the values of the global register
|
||
variables, and to restore them in a `longjmp'. This way, the same
|
||
thing will happen regardless of what `longjmp' does.
|
||
|
||
All global register variable declarations must precede all function
|
||
definitions. If such a declaration could appear after function
|
||
definitions, the declaration would be too late to prevent the register
|
||
from being used for other purposes in the preceding functions.
|
||
|
||
Global register variables may not have initial values, because an
|
||
executable file has no means to supply initial contents for a register.
|
||
|
||
On the Sparc, there are reports that g3 ... g7 are suitable
|
||
registers, but certain library functions, such as `getwd', as well as
|
||
the subroutines for division and remainder, modify g3 and g4. g1 and
|
||
g2 are local temporaries.
|
||
|
||
On the 68000, a2 ... a5 should be suitable, as should d2 ... d7. Of
|
||
course, it will not do to use more than a few of those.
|
||
|
||
|
||
File: gcc.info, Node: Local Reg Vars, Prev: Global Reg Vars, Up: Explicit Reg Vars
|
||
|
||
Specifying Registers for Local Variables
|
||
----------------------------------------
|
||
|
||
You can define a local register variable with a specified register
|
||
like this:
|
||
|
||
register int *foo asm ("a5");
|
||
|
||
Here `a5' is the name of the register which should be used. Note that
|
||
this is the same syntax used for defining global register variables,
|
||
but for a local variable it would appear within a function.
|
||
|
||
Naturally the register name is cpu-dependent, but this is not a
|
||
problem, since specific registers are most often useful with explicit
|
||
assembler instructions (*note Extended Asm::.). Both of these things
|
||
generally require that you conditionalize your program according to cpu
|
||
type.
|
||
|
||
In addition, operating systems on one type of cpu may differ in how
|
||
they name the registers; then you would need additional conditionals.
|
||
For example, some 68000 operating systems call this register `%a5'.
|
||
|
||
Defining such a register variable does not reserve the register; it
|
||
remains available for other uses in places where flow control determines
|
||
the variable's value is not live. However, these registers are made
|
||
unavailable for use in the reload pass; excessive use of this feature
|
||
leaves the compiler too few available registers to compile certain
|
||
functions.
|
||
|
||
This option does not guarantee that GNU CC will generate code that
|
||
has this variable in the register you specify at all times. You may not
|
||
code an explicit reference to this register in an `asm' statement and
|
||
assume it will always refer to this variable.
|
||
|
||
|
||
File: gcc.info, Node: Alternate Keywords, Next: Incomplete Enums, Prev: Explicit Reg Vars, Up: C Extensions
|
||
|
||
Alternate Keywords
|
||
==================
|
||
|
||
The option `-traditional' disables certain keywords; `-ansi'
|
||
disables certain others. This causes trouble when you want to use GNU C
|
||
extensions, or ANSI C features, in a general-purpose header file that
|
||
should be usable by all programs, including ANSI C programs and
|
||
traditional ones. The keywords `asm', `typeof' and `inline' cannot be
|
||
used since they won't work in a program compiled with `-ansi', while
|
||
the keywords `const', `volatile', `signed', `typeof' and `inline' won't
|
||
work in a program compiled with `-traditional'.
|
||
|
||
The way to solve these problems is to put `__' at the beginning and
|
||
end of each problematical keyword. For example, use `__asm__' instead
|
||
of `asm', `__const__' instead of `const', and `__inline__' instead of
|
||
`inline'.
|
||
|
||
Other C compilers won't accept these alternative keywords; if you
|
||
want to compile with another compiler, you can define the alternate
|
||
keywords as macros to replace them with the customary keywords. It
|
||
looks like this:
|
||
|
||
#ifndef __GNUC__
|
||
#define __asm__ asm
|
||
#endif
|
||
|
||
`-pedantic' causes warnings for many GNU C extensions. You can
|
||
prevent such warnings within one expression by writing `__extension__'
|
||
before the expression. `__extension__' has no effect aside from this.
|
||
|
||
|
||
File: gcc.info, Node: Incomplete Enums, Next: Function Names, Prev: Alternate Keywords, Up: C Extensions
|
||
|
||
Incomplete `enum' Types
|
||
=======================
|
||
|
||
You can define an `enum' tag without specifying its possible values.
|
||
This results in an incomplete type, much like what you get if you write
|
||
`struct foo' without describing the elements. A later declaration
|
||
which does specify the possible values completes the type.
|
||
|
||
You can't allocate variables or storage using the type while it is
|
||
incomplete. However, you can work with pointers to that type.
|
||
|
||
This extension may not be very useful, but it makes the handling of
|
||
`enum' more consistent with the way `struct' and `union' are handled.
|
||
|
||
This extension is not supported by GNU C++.
|
||
|
||
|
||
File: gcc.info, Node: Function Names, Next: Return Address, Prev: Incomplete Enums, Up: C Extensions
|
||
|
||
Function Names as Strings
|
||
=========================
|
||
|
||
GNU CC predefines two string variables to be the name of the current
|
||
function. The variable `__FUNCTION__' is the name of the function as
|
||
it appears in the source. The variable `__PRETTY_FUNCTION__' is the
|
||
name of the function pretty printed in a language specific fashion.
|
||
|
||
These names are always the same in a C function, but in a C++
|
||
function they may be different. For example, this program:
|
||
|
||
extern "C" {
|
||
extern int printf (char *, ...);
|
||
}
|
||
|
||
class a {
|
||
public:
|
||
sub (int i)
|
||
{
|
||
printf ("__FUNCTION__ = %s\n", __FUNCTION__);
|
||
printf ("__PRETTY_FUNCTION__ = %s\n", __PRETTY_FUNCTION__);
|
||
}
|
||
};
|
||
|
||
int
|
||
main (void)
|
||
{
|
||
a ax;
|
||
ax.sub (0);
|
||
return 0;
|
||
}
|
||
|
||
gives this output:
|
||
|
||
__FUNCTION__ = sub
|
||
__PRETTY_FUNCTION__ = int a::sub (int)
|
||
|
||
These names are not macros: they are predefined string variables.
|
||
For example, `#ifdef __FUNCTION__' does not have any special meaning
|
||
inside a function, since the preprocessor does not do anything special
|
||
with the identifier `__FUNCTION__'.
|
||
|
||
|
||
File: gcc.info, Node: Return Address, Prev: Function Names, Up: C Extensions
|
||
|
||
Getting the Return or Frame Address of a Function
|
||
=================================================
|
||
|
||
These functions may be used to get information about the callers of a
|
||
function.
|
||
|
||
`__builtin_return_address (LEVEL)'
|
||
This function returns the return address of the current function,
|
||
or of one of its callers. The LEVEL argument is number of frames
|
||
to scan up the call stack. A value of `0' yields the return
|
||
address of the current function, a value of `1' yields the return
|
||
address of the caller of the current function, and so forth.
|
||
|
||
The LEVEL argument must be a constant integer.
|
||
|
||
On some machines it may be impossible to determine the return
|
||
address of any function other than the current one; in such cases,
|
||
or when the top of the stack has been reached, this function will
|
||
return `0'.
|
||
|
||
This function should only be used with a non-zero argument for
|
||
debugging purposes.
|
||
|
||
`__builtin_frame_address (LEVEL)'
|
||
This function is similar to `__builtin_return_address', but it
|
||
returns the address of the function frame rather than the return
|
||
address of the function. Calling `__builtin_frame_address' with a
|
||
value of `0' yields the frame address of the current function, a
|
||
value of `1' yields the frame address of the caller of the current
|
||
function, and so forth.
|
||
|
||
The frame is the area on the stack which holds local variables and
|
||
saved registers. The frame address is normally the address of the
|
||
first word pushed on to the stack by the function. However, the
|
||
exact definition depends upon the processor and the calling
|
||
convention. If the processor has a dedicated frame pointer
|
||
register, and the function has a frame, then
|
||
`__builtin_frame_address' will return the value of the frame
|
||
pointer register.
|
||
|
||
The caveats that apply to `__builtin_return_address' apply to this
|
||
function as well.
|
||
|
||
|
||
File: gcc.info, Node: C++ Extensions, Next: Gcov, Prev: C Extensions, Up: Top
|
||
|
||
Extensions to the C++ Language
|
||
******************************
|
||
|
||
The GNU compiler provides these extensions to the C++ language (and
|
||
you can also use most of the C language extensions in your C++
|
||
programs). If you want to write code that checks whether these
|
||
features are available, you can test for the GNU compiler the same way
|
||
as for C programs: check for a predefined macro `__GNUC__'. You can
|
||
also use `__GNUG__' to test specifically for GNU C++ (*note Standard
|
||
Predefined Macros: (cpp.info)Standard Predefined.).
|
||
|
||
* Menu:
|
||
|
||
* Naming Results:: Giving a name to C++ function return values.
|
||
* Min and Max:: C++ Minimum and maximum operators.
|
||
* Destructors and Goto:: Goto is safe to use in C++ even when destructors
|
||
are needed.
|
||
* C++ Interface:: You can use a single C++ header file for both
|
||
declarations and definitions.
|
||
* Template Instantiation:: Methods for ensuring that exactly one copy of
|
||
each needed template instantiation is emitted.
|
||
* C++ Signatures:: You can specify abstract types to get subtype
|
||
polymorphism independent from inheritance.
|
||
|
||
|
||
File: gcc.info, Node: Naming Results, Next: Min and Max, Up: C++ Extensions
|
||
|
||
Named Return Values in C++
|
||
==========================
|
||
|
||
GNU C++ extends the function-definition syntax to allow you to
|
||
specify a name for the result of a function outside the body of the
|
||
definition, in C++ programs:
|
||
|
||
TYPE
|
||
FUNCTIONNAME (ARGS) return RESULTNAME;
|
||
{
|
||
...
|
||
BODY
|
||
...
|
||
}
|
||
|
||
You can use this feature to avoid an extra constructor call when a
|
||
function result has a class type. For example, consider a function
|
||
`m', declared as `X v = m ();', whose result is of class `X':
|
||
|
||
X
|
||
m ()
|
||
{
|
||
X b;
|
||
b.a = 23;
|
||
return b;
|
||
}
|
||
|
||
Although `m' appears to have no arguments, in fact it has one
|
||
implicit argument: the address of the return value. At invocation, the
|
||
address of enough space to hold `v' is sent in as the implicit argument.
|
||
Then `b' is constructed and its `a' field is set to the value 23.
|
||
Finally, a copy constructor (a constructor of the form `X(X&)') is
|
||
applied to `b', with the (implicit) return value location as the
|
||
target, so that `v' is now bound to the return value.
|
||
|
||
But this is wasteful. The local `b' is declared just to hold
|
||
something that will be copied right out. While a compiler that
|
||
combined an "elision" algorithm with interprocedural data flow analysis
|
||
could conceivably eliminate all of this, it is much more practical to
|
||
allow you to assist the compiler in generating efficient code by
|
||
manipulating the return value explicitly, thus avoiding the local
|
||
variable and copy constructor altogether.
|
||
|
||
Using the extended GNU C++ function-definition syntax, you can avoid
|
||
the temporary allocation and copying by naming `r' as your return value
|
||
at the outset, and assigning to its `a' field directly:
|
||
|
||
X
|
||
m () return r;
|
||
{
|
||
r.a = 23;
|
||
}
|
||
|
||
The declaration of `r' is a standard, proper declaration, whose effects
|
||
are executed *before* any of the body of `m'.
|
||
|
||
Functions of this type impose no additional restrictions; in
|
||
particular, you can execute `return' statements, or return implicitly by
|
||
reaching the end of the function body ("falling off the edge"). Cases
|
||
like
|
||
|
||
X
|
||
m () return r (23);
|
||
{
|
||
return;
|
||
}
|
||
|
||
(or even `X m () return r (23); { }') are unambiguous, since the return
|
||
value `r' has been initialized in either case. The following code may
|
||
be hard to read, but also works predictably:
|
||
|
||
X
|
||
m () return r;
|
||
{
|
||
X b;
|
||
return b;
|
||
}
|
||
|
||
The return value slot denoted by `r' is initialized at the outset,
|
||
but the statement `return b;' overrides this value. The compiler deals
|
||
with this by destroying `r' (calling the destructor if there is one, or
|
||
doing nothing if there is not), and then reinitializing `r' with `b'.
|
||
|
||
This extension is provided primarily to help people who use
|
||
overloaded operators, where there is a great need to control not just
|
||
the arguments, but the return values of functions. For classes where
|
||
the copy constructor incurs a heavy performance penalty (especially in
|
||
the common case where there is a quick default constructor), this is a
|
||
major savings. The disadvantage of this extension is that you do not
|
||
control when the default constructor for the return value is called: it
|
||
is always called at the beginning.
|
||
|
||
|
||
File: gcc.info, Node: Min and Max, Next: Destructors and Goto, Prev: Naming Results, Up: C++ Extensions
|
||
|
||
Minimum and Maximum Operators in C++
|
||
====================================
|
||
|
||
It is very convenient to have operators which return the "minimum"
|
||
or the "maximum" of two arguments. In GNU C++ (but not in GNU C),
|
||
|
||
`A <? B'
|
||
is the "minimum", returning the smaller of the numeric values A
|
||
and B;
|
||
|
||
`A >? B'
|
||
is the "maximum", returning the larger of the numeric values A and
|
||
B.
|
||
|
||
These operations are not primitive in ordinary C++, since you can
|
||
use a macro to return the minimum of two things in C++, as in the
|
||
following example.
|
||
|
||
#define MIN(X,Y) ((X) < (Y) ? : (X) : (Y))
|
||
|
||
You might then use `int min = MIN (i, j);' to set MIN to the minimum
|
||
value of variables I and J.
|
||
|
||
However, side effects in `X' or `Y' may cause unintended behavior.
|
||
For example, `MIN (i++, j++)' will fail, incrementing the smaller
|
||
counter twice. A GNU C extension allows you to write safe macros that
|
||
avoid this kind of problem (*note Naming an Expression's Type: Naming
|
||
Types.). However, writing `MIN' and `MAX' as macros also forces you to
|
||
use function-call notation for a fundamental arithmetic operation.
|
||
Using GNU C++ extensions, you can write `int min = i <? j;' instead.
|
||
|
||
Since `<?' and `>?' are built into the compiler, they properly
|
||
handle expressions with side-effects; `int min = i++ <? j++;' works
|
||
correctly.
|
||
|
||
|
||
File: gcc.info, Node: Destructors and Goto, Next: C++ Interface, Prev: Min and Max, Up: C++ Extensions
|
||
|
||
`goto' and Destructors in GNU C++
|
||
=================================
|
||
|
||
In C++ programs, you can safely use the `goto' statement. When you
|
||
use it to exit a block which contains aggregates requiring destructors,
|
||
the destructors will run before the `goto' transfers control.
|
||
|
||
The compiler still forbids using `goto' to *enter* a scope that
|
||
requires constructors.
|
||
|
||
|
||
File: gcc.info, Node: C++ Interface, Next: Template Instantiation, Prev: Destructors and Goto, Up: C++ Extensions
|
||
|
||
Declarations and Definitions in One Header
|
||
==========================================
|
||
|
||
C++ object definitions can be quite complex. In principle, your
|
||
source code will need two kinds of things for each object that you use
|
||
across more than one source file. First, you need an "interface"
|
||
specification, describing its structure with type declarations and
|
||
function prototypes. Second, you need the "implementation" itself. It
|
||
can be tedious to maintain a separate interface description in a header
|
||
file, in parallel to the actual implementation. It is also dangerous,
|
||
since separate interface and implementation definitions may not remain
|
||
parallel.
|
||
|
||
With GNU C++, you can use a single header file for both purposes.
|
||
|
||
*Warning:* The mechanism to specify this is in transition. For the
|
||
nonce, you must use one of two `#pragma' commands; in a future
|
||
release of GNU C++, an alternative mechanism will make these
|
||
`#pragma' commands unnecessary.
|
||
|
||
The header file contains the full definitions, but is marked with
|
||
`#pragma interface' in the source code. This allows the compiler to
|
||
use the header file only as an interface specification when ordinary
|
||
source files incorporate it with `#include'. In the single source file
|
||
where the full implementation belongs, you can use either a naming
|
||
convention or `#pragma implementation' to indicate this alternate use
|
||
of the header file.
|
||
|
||
`#pragma interface'
|
||
`#pragma interface "SUBDIR/OBJECTS.h"'
|
||
Use this directive in *header files* that define object classes,
|
||
to save space in most of the object files that use those classes.
|
||
Normally, local copies of certain information (backup copies of
|
||
inline member functions, debugging information, and the internal
|
||
tables that implement virtual functions) must be kept in each
|
||
object file that includes class definitions. You can use this
|
||
pragma to avoid such duplication. When a header file containing
|
||
`#pragma interface' is included in a compilation, this auxiliary
|
||
information will not be generated (unless the main input source
|
||
file itself uses `#pragma implementation'). Instead, the object
|
||
files will contain references to be resolved at link time.
|
||
|
||
The second form of this directive is useful for the case where you
|
||
have multiple headers with the same name in different directories.
|
||
If you use this form, you must specify the same string to `#pragma
|
||
implementation'.
|
||
|
||
`#pragma implementation'
|
||
`#pragma implementation "OBJECTS.h"'
|
||
Use this pragma in a *main input file*, when you want full output
|
||
from included header files to be generated (and made globally
|
||
visible). The included header file, in turn, should use `#pragma
|
||
interface'. Backup copies of inline member functions, debugging
|
||
information, and the internal tables used to implement virtual
|
||
functions are all generated in implementation files.
|
||
|
||
If you use `#pragma implementation' with no argument, it applies to
|
||
an include file with the same basename(1) as your source file.
|
||
For example, in `allclass.cc', giving just `#pragma implementation'
|
||
by itself is equivalent to `#pragma implementation "allclass.h"'.
|
||
|
||
In versions of GNU C++ prior to 2.6.0 `allclass.h' was treated as
|
||
an implementation file whenever you would include it from
|
||
`allclass.cc' even if you never specified `#pragma
|
||
implementation'. This was deemed to be more trouble than it was
|
||
worth, however, and disabled.
|
||
|
||
If you use an explicit `#pragma implementation', it must appear in
|
||
your source file *before* you include the affected header files.
|
||
|
||
Use the string argument if you want a single implementation file to
|
||
include code from multiple header files. (You must also use
|
||
`#include' to include the header file; `#pragma implementation'
|
||
only specifies how to use the file--it doesn't actually include
|
||
it.)
|
||
|
||
There is no way to split up the contents of a single header file
|
||
into multiple implementation files.
|
||
|
||
`#pragma implementation' and `#pragma interface' also have an effect
|
||
on function inlining.
|
||
|
||
If you define a class in a header file marked with `#pragma
|
||
interface', the effect on a function defined in that class is similar to
|
||
an explicit `extern' declaration--the compiler emits no code at all to
|
||
define an independent version of the function. Its definition is used
|
||
only for inlining with its callers.
|
||
|
||
Conversely, when you include the same header file in a main source
|
||
file that declares it as `#pragma implementation', the compiler emits
|
||
code for the function itself; this defines a version of the function
|
||
that can be found via pointers (or by callers compiled without
|
||
inlining). If all calls to the function can be inlined, you can avoid
|
||
emitting the function by compiling with `-fno-implement-inlines'. If
|
||
any calls were not inlined, you will get linker errors.
|
||
|
||
---------- Footnotes ----------
|
||
|
||
(1) A file's "basename" was the name stripped of all leading path
|
||
information and of trailing suffixes, such as `.h' or `.C' or `.cc'.
|
||
|
||
|
||
File: gcc.info, Node: Template Instantiation, Next: C++ Signatures, Prev: C++ Interface, Up: C++ Extensions
|
||
|
||
Where's the Template?
|
||
=====================
|
||
|
||
C++ templates are the first language feature to require more
|
||
intelligence from the environment than one usually finds on a UNIX
|
||
system. Somehow the compiler and linker have to make sure that each
|
||
template instance occurs exactly once in the executable if it is needed,
|
||
and not at all otherwise. There are two basic approaches to this
|
||
problem, which I will refer to as the Borland model and the Cfront
|
||
model.
|
||
|
||
Borland model
|
||
Borland C++ solved the template instantiation problem by adding
|
||
the code equivalent of common blocks to their linker; the compiler
|
||
emits template instances in each translation unit that uses them,
|
||
and the linker collapses them together. The advantage of this
|
||
model is that the linker only has to consider the object files
|
||
themselves; there is no external complexity to worry about. This
|
||
disadvantage is that compilation time is increased because the
|
||
template code is being compiled repeatedly. Code written for this
|
||
model tends to include definitions of all templates in the header
|
||
file, since they must be seen to be instantiated.
|
||
|
||
Cfront model
|
||
The AT&T C++ translator, Cfront, solved the template instantiation
|
||
problem by creating the notion of a template repository, an
|
||
automatically maintained place where template instances are
|
||
stored. A more modern version of the repository works as follows:
|
||
As individual object files are built, the compiler places any
|
||
template definitions and instantiations encountered in the
|
||
repository. At link time, the link wrapper adds in the objects in
|
||
the repository and compiles any needed instances that were not
|
||
previously emitted. The advantages of this model are more optimal
|
||
compilation speed and the ability to use the system linker; to
|
||
implement the Borland model a compiler vendor also needs to
|
||
replace the linker. The disadvantages are vastly increased
|
||
complexity, and thus potential for error; for some code this can be
|
||
just as transparent, but in practice it can been very difficult to
|
||
build multiple programs in one directory and one program in
|
||
multiple directories. Code written for this model tends to
|
||
separate definitions of non-inline member templates into a
|
||
separate file, which should be compiled separately.
|
||
|
||
When used with GNU ld version 2.8 or later on an ELF system such as
|
||
Linux/GNU or Solaris 2, or on Microsoft Windows, g++ supports the
|
||
Borland model. On other systems, g++ implements neither automatic
|
||
model.
|
||
|
||
A future version of g++ will support a hybrid model whereby the
|
||
compiler will emit any instantiations for which the template definition
|
||
is included in the compile, and store template definitions and
|
||
instantiation context information into the object file for the rest.
|
||
The link wrapper will extract that information as necessary and invoke
|
||
the compiler to produce the remaining instantiations. The linker will
|
||
then combine duplicate instantiations.
|
||
|
||
In the mean time, you have the following options for dealing with
|
||
template instantiations:
|
||
|
||
1. Compile your code with `-fno-implicit-templates' to disable the
|
||
implicit generation of template instances, and explicitly
|
||
instantiate all the ones you use. This approach requires more
|
||
knowledge of exactly which instances you need than do the others,
|
||
but it's less mysterious and allows greater control. You can
|
||
scatter the explicit instantiations throughout your program,
|
||
perhaps putting them in the translation units where the instances
|
||
are used or the translation units that define the templates
|
||
themselves; you can put all of the explicit instantiations you
|
||
need into one big file; or you can create small files like
|
||
|
||
#include "Foo.h"
|
||
#include "Foo.cc"
|
||
|
||
template class Foo<int>;
|
||
template ostream& operator <<
|
||
(ostream&, const Foo<int>&);
|
||
|
||
for each of the instances you need, and create a template
|
||
instantiation library from those.
|
||
|
||
If you are using Cfront-model code, you can probably get away with
|
||
not using `-fno-implicit-templates' when compiling files that don't
|
||
`#include' the member template definitions.
|
||
|
||
If you use one big file to do the instantiations, you may want to
|
||
compile it without `-fno-implicit-templates' so you get all of the
|
||
instances required by your explicit instantiations (but not by any
|
||
other files) without having to specify them as well.
|
||
|
||
g++ has extended the template instantiation syntax outlined in the
|
||
Working Paper to allow forward declaration of explicit
|
||
instantiations, explicit instantiation of members of template
|
||
classes and instantiation of the compiler support data for a
|
||
template class (i.e. the vtable) without instantiating any of its
|
||
members:
|
||
|
||
extern template int max (int, int);
|
||
template void Foo<int>::f ();
|
||
inline template class Foo<int>;
|
||
|
||
2. Do nothing. Pretend g++ does implement automatic instantiation
|
||
management. Code written for the Borland model will work fine, but
|
||
each translation unit will contain instances of each of the
|
||
templates it uses. In a large program, this can lead to an
|
||
unacceptable amount of code duplication.
|
||
|
||
3. Add `#pragma interface' to all files containing template
|
||
definitions. For each of these files, add `#pragma implementation
|
||
"FILENAME"' to the top of some `.C' file which `#include's it.
|
||
Then compile everything with `-fexternal-templates'. The
|
||
templates will then only be expanded in the translation unit which
|
||
implements them (i.e. has a `#pragma implementation' line for the
|
||
file where they live); all other files will use external
|
||
references. If you're lucky, everything should work properly. If
|
||
you get undefined symbol errors, you need to make sure that each
|
||
template instance which is used in the program is used in the file
|
||
which implements that template. If you don't have any use for a
|
||
particular instance in that file, you can just instantiate it
|
||
explicitly, using the syntax from the latest C++ working paper:
|
||
|
||
template class A<int>;
|
||
template ostream& operator << (ostream&, const A<int>&);
|
||
|
||
This strategy will work with code written for either model. If
|
||
you are using code written for the Cfront model, the file
|
||
containing a class template and the file containing its member
|
||
templates should be implemented in the same translation unit.
|
||
|
||
A slight variation on this approach is to instead use the flag
|
||
`-falt-external-templates'; this flag causes template instances to
|
||
be emitted in the translation unit that implements the header
|
||
where they are first instantiated, rather than the one which
|
||
implements the file where the templates are defined. This header
|
||
must be the same in all translation units, or things are likely to
|
||
break.
|
||
|
||
*Note Declarations and Definitions in One Header: C++ Interface,
|
||
for more discussion of these pragmas.
|
||
|