llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Anton Korobeynikov	3399dc61e5	Unbreak llvm-svn: 53317	2008-07-09 13:27:37 +00:00
Anton Korobeynikov	1f697cd97b	Another bunch of hacks for named sections support llvm-svn: 53315	2008-07-09 13:26:52 +00:00
Anton Korobeynikov	933bf0ecc4	Add hacky way to distinguish named and named sections. This will be generalized in the future. llvm-svn: 53311	2008-07-09 13:25:26 +00:00
Anton Korobeynikov	751cfda7dd	Handle ELF mergeable sections llvm-svn: 53306	2008-07-09 13:23:37 +00:00
Anton Korobeynikov	f42d75201a	Provide general hook for section name calculation llvm-svn: 53304	2008-07-09 13:22:46 +00:00
Anton Korobeynikov	c421fcddb4	Print entity size for mergeable sections llvm-svn: 53303	2008-07-09 13:22:17 +00:00
Anton Korobeynikov	059999d321	Move flag decoding stuff into special hook llvm-svn: 53297	2008-07-09 13:20:07 +00:00
Anton Korobeynikov	ca271dd426	Properly handle linkonce stuff llvm-svn: 53296	2008-07-09 13:19:38 +00:00
Anton Korobeynikov	782a69505d	Provide skeleton code for calculation of the section where a global should be emitted llvm-svn: 53295	2008-07-09 13:19:08 +00:00
Anton Korobeynikov	c150d11dbd	Make hooks virtual llvm-svn: 53293	2008-07-09 13:18:21 +00:00
Anton Korobeynikov	5bc572ad96	Add default section name resolution routine llvm-svn: 53292	2008-07-09 13:18:02 +00:00
Anton Korobeynikov	552675bb0f	Constify llvm-svn: 53291	2008-07-09 13:17:36 +00:00
Anton Korobeynikov	651865851c	Add code for default section flags computation llvm-svn: 53290	2008-07-09 13:16:59 +00:00
Duncan Sands	3ea6f15708	Rather than having a different custom legalization hook for each way in which a result type can be legalized (promotion, expansion, softening etc), just use one: ReplaceNodeResults, which returns a node with exactly the same result types as the node passed to it, but presumably with a bunch of custom code behind the scenes. No change if the new LegalizeTypes infrastructure is not turned on. llvm-svn: 53137	2008-07-04 11:47:58 +00:00
Duncan Sands	aac5c915ed	Linux also does not require exception handling moves in order to get correct debug info. Since I can't imagine how any target could possibly be any different, I've just stripped out the option: now all the world's like Darwin! llvm-svn: 53134	2008-07-04 09:55:48 +00:00
Owen Anderson	604f9f722d	Make LiveVariables even more optional, by making it optional in the call to TargetInstrInfo::convertToThreeAddressInstruction Also, if LV isn't around, then TwoAddr doesn't need to be updating flags, since they won't have been set in the first place. llvm-svn: 53058	2008-07-02 23:41:07 +00:00
Bill Wendling	27c38cee90	Darwin doesn't need exception handling information for the "move" info when debug information is being output, because it's leet! llvm-svn: 52994	2008-07-01 23:34:48 +00:00
Evan Cheng	5389352c98	Avoid creating expensive comment string if it's not going to be printed. llvm-svn: 52992	2008-07-01 23:18:29 +00:00
Dan Gohman	83c1b4cede	Prune a few dependencies on MachineFunction.h. llvm-svn: 52976	2008-07-01 18:15:35 +00:00
Owen Anderson	fe73cb09c8	Make the subregister hashtable output more readable by wrapping the lines, and mark it const along with the associated changes to TargetRegisterInfo. llvm-svn: 52966	2008-07-01 17:34:38 +00:00
Owen Anderson	9e88ac84bf	Implement suggestions from Chris: - Use a more accurate heuristic for the size of the hashtable. - Use bitwise and instead of modulo since the size is a power of two. - Use new[] instead of malloc(). llvm-svn: 52951	2008-07-01 07:02:30 +00:00
Owen Anderson	059d64938b	Replace the dynamically computed std::set lookup method for subregisters with a hashtable-based version that is computed by tblgen at the time LLVM is compiled. llvm-svn: 52945	2008-07-01 00:18:52 +00:00
Dan Gohman	c8097f8c8c	Split ISD::LABEL into ISD::DBG_LABEL and ISD::EH_LABEL, eliminating the need for a flavor operand, and add a new SDNode subclass, LabelSDNode, for use with them to eliminate the need for a label id operand. Change instruction selection to let these label nodes through unmodified instead of creating copies of them. Teach the MachineInstr emitter how to emit a MachineInstr directly from an ISD label node. This avoids the need for allocating SDNodes for the label id and flavor value, as well as SDNodes for each of the post-isel label, label id, and label flavor. llvm-svn: 52943	2008-07-01 00:05:16 +00:00
Dan Gohman	e58f07e5d6	Update comments to new-style syntax. llvm-svn: 52925	2008-06-30 21:00:56 +00:00
Dan Gohman	a44757308c	Reorder the fields in TargetLowering to require less padding. llvm-svn: 52919	2008-06-30 20:36:26 +00:00
Dan Gohman	7c7639816b	Change bools to 1-bit bitfields to shrink ArgListEntry slightly. llvm-svn: 52918	2008-06-30 20:33:57 +00:00
Dan Gohman	295abfe228	Replace some std::vectors that showed up in heap profiling with SmallVectors. Change the signature of TargetLowering::LowerArguments to avoid returning a vector by value, and update the two targets which still use this directly, Sparc and IA64, accordingly. llvm-svn: 52917	2008-06-30 20:31:15 +00:00
Evan Cheng	421919a049	Eliminate TargetRegisterDesc::ImmSubRegs. It's no longer in use. llvm-svn: 52892	2008-06-30 07:32:56 +00:00
Anton Korobeynikov	9e8c154272	Start refactoring of asmprinters: provide a TAI hook, which will select a 'section kind' for a global. llvm-svn: 52868	2008-06-28 13:45:57 +00:00
Owen Anderson	59ed35b2c2	Cache subregister relationships in a set in TargetRegisterInfo to allow faster lookups. This speeds up LiveVariables from 0.6279s to 0.6165s on kimwitu++. llvm-svn: 52818	2008-06-27 06:56:04 +00:00
Dale Johannesen	f170e29cf5	Fixes the last x86-64 test failure in compat.exp: <16 x float> is 64-byte aligned (for some reason), which gets us into the stack realignment code. The computation changing FP-relative offsets to SP-relative was broken, assigning a spill temp to a location also used for parameter passing. This fixes it by rounding up the stack frame to a multiple of the largest alignment (I concluded it wasn't fixable without doing this, but I'm not very sure.) llvm-svn: 52750	2008-06-26 01:51:13 +00:00
Chris Lattner	ef1aa7c9db	Switch the PPC backend and target-independent JIT to use the libsystem InvalidateInstructionCache method instead of calling through a hook on the JIT. This is a host feature, not a target feature. llvm-svn: 52734	2008-06-25 17:18:44 +00:00
Anton Korobeynikov	5b04f6d658	Provide generic hooks for icache invalidation. Add PPC implementation. Patch by Gary Benson! llvm-svn: 52418	2008-06-17 17:30:05 +00:00
Evan Cheng	2dfe8c2435	Add option to commuteInstruction() which forces it to create a new (commuted) instruction. llvm-svn: 52308	2008-06-16 07:33:11 +00:00
Duncan Sands	40c8db881a	Disable some DAG combiner optimizations that may be wrong for volatile loads and stores. In fact this is almost all of them! There are three types of problems: (1) it is wrong to change the width of a volatile memory access. These may be used to do memory mapped i/o, in which case a load can have an effect even if the result is not used. Consider loading an i32 but only using the lower 8 bits. It is wrong to change this into a load of an i8, because you are no longer tickling the other three bytes. It is also unwise to make a load/store wider. For example, changing an i16 load into an i32 load is wrong no matter how aligned things are, since the fact of loading an additional 2 bytes can have i/o side-effects. (2) it is wrong to change the number of volatile load/stores: they may be counted by the hardware. (3) it is wrong to change a volatile load/store that requires one memory access into one that requires several. For example on x86-32, you can store a double in one processor operation, but to store an i64 requires two (two i32 stores). In a multi-threaded program you may want to bitcast an i64 to a double and store as a double because that will occur atomically, and be indivisible to other threads. So it would be wrong to convert the store-of-double into a store of an i64, because this will become two i32 stores - no longer atomic. My policy here is to say that the number of processor operations for an illegal operation is undefined. So it is alright to change a store of an i64 (requires at least two stores; but could be validly lowered to memcpy for example) into a store of double (one processor op). In short, if the new store is legal and has the same size then I say that the transform is ok. It would also be possible to say that transforms are always ok if before they were illegal, whether after they are illegal or not, but that's more awkward to do and I doubt it buys us anything much. 
However this exposed an interesting thing - on x86-32 a store of i64 is considered legal! That is because operations are marked legal by default, regardless of whether the type is legal or not. In some ways this is clever: before type legalization this means that operations on illegal types are considered legal; after type legalization there are no illegal types so now operations are only legal if they really are. But I consider this to be too cunning for mere mortals. Better to do things explicitly by testing AfterLegalize. So I have changed things so that operations with illegal types are considered illegal - indeed they can never map to a machine operation. However this means that the DAG combiner is more conservative because before it was "accidentally" performing transforms where the type was illegal because the operation was nonetheless marked legal. So in a few such places I added a check on AfterLegalize, which I suppose was actually just forgotten before. This causes the DAG combiner to do slightly more than it used to, which resulted in the X86 backend blowing up because it got a slightly surprising node it wasn't expecting, so I tweaked it. llvm-svn: 52254	2008-06-13 19:07:40 +00:00
Duncan Sands	e52bc54496	If queried as to whether an operation is legal for a particular MVT, return false if the type is illegal rather than barfing. llvm-svn: 52229	2008-06-12 09:01:30 +00:00
Duncan Sands	fe2a970a5c	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Duncan Sands	d634afe3aa	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dale Johannesen	5b060922a3	Expand documentation of StringConstantPrefix. llvm-svn: 51911	2008-06-03 18:15:03 +00:00
Dale Johannesen	c0cd6cd4d4	Add StringConstantPrefix to control what the assembler names of string constants look like. llvm-svn: 51909	2008-06-03 18:09:06 +00:00
Evan Cheng	ce2c3b07ba	Revert 51775. llvm-svn: 51795	2008-05-30 22:47:19 +00:00
Evan Cheng	8ebaae9d1c	Patches for building llvm on Solaris x86. Contributed by Nathan Keynes. llvm-svn: 51775	2008-05-30 17:16:20 +00:00
Dan Gohman	3ea2dceca0	Prune and tidy #includes. llvm-svn: 51697	2008-05-29 19:52:31 +00:00
Bill Wendling	b56bc61cf4	Add a flag to indicate that an instruction is as cheap (or cheaper) than a move instruction to execute. This can be used for transformations (like two-address conversion) to remat an instruction instead of generating a "move" instruction. The idea is to decrease the live ranges and register pressure and all that jazz. llvm-svn: 51660	2008-05-28 22:54:52 +00:00
Bill Wendling	1866fa680f	Remove warnings about unused parameters and shadowed variables. llvm-svn: 51266	2008-05-19 20:15:12 +00:00
Evan Cheng	73dadf21ce	Fix typos and comments. llvm-svn: 51165	2008-05-15 22:13:02 +00:00
Evan Cheng	778a5e27b0	Make use of vector load and store operations to implement memcpy, memmove, and memset. Currently only X86 target is taking advantage of these. llvm-svn: 51140	2008-05-15 08:39:06 +00:00
Evan Cheng	fcbdc8bd6e	Xform bitconvert(build_pair(load a, load b)) to a single load if the load locations are at the right offset from each other. llvm-svn: 51008	2008-05-12 23:04:07 +00:00
Bill Wendling	646f3458c4	Constify the machine instruction passed into the "is{Trivially,Really}ReMaterializable" methods. llvm-svn: 51001	2008-05-12 20:54:26 +00:00
Evan Cheng	228cce5b2d	Forgot this. llvm-svn: 50993	2008-05-12 20:08:05 +00:00
Evan Cheng	f87942325f	Make OpActionsCapacity multiple of 4. llvm-svn: 50917	2008-05-09 21:50:23 +00:00
Duncan Sands	6f4e916c6a	Output correct exception handling and frame info on x86-64 linux. This causes no regressions on 32 bit linux and 32 bit ppc. More tests pass on 64 bit ppc with no regressions. I didn't turn on eh on 64 bit linux because the intrinsics needed to compile the eh runtime aren't done yet. But if you turn it on and link with the mainline runtime then eh seems to work fine on x86-64 linux with this patch. Thanks to Dale for testing. The main point of the patch is that if you output that some object is encoded using 4 bytes you had better not output 8 bytes for it: the patch makes everything consistent. llvm-svn: 50825	2008-05-07 19:11:09 +00:00
Mon P Wang	84a269e023	Added additional atomic intrinsics: and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Evan Cheng	d9353009b7	Fix more -Wshorten-64-to-32 warnings. llvm-svn: 50659	2008-05-05 18:30:58 +00:00
Dan Gohman	c860d9c77c	Add AsmPrinter support for emitting a directive to declare that the code being generated does not require an executable stack. Also, add target-specific code to make use of this on Linux on x86. llvm-svn: 50634	2008-05-05 00:28:39 +00:00
Arnold Schwaighofer	f58a35e2ec	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if caller and callee are fastcc call is a tail call (in tail call position, call followed by ret) no variable argument lists or byval arguments option -tailcallopt is enabled Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Chris Lattner	39a4281deb	Implement a significant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	b83aaaa855	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Chris Lattner	b5bd654163	A few inline asm cleanups: - Make targetlowering.h fit in 80 cols. - Make LowerAsmOperandForConstraint const. - Make lowerXConstraint -> LowerXConstraint - Make LowerXConstraint return a const char* instead of taking a string byref. llvm-svn: 50312	2008-04-26 23:02:14 +00:00
Anton Korobeynikov	b1ad6979dc	Add facility for pre-RA passes llvm-svn: 50165	2008-04-23 18:22:28 +00:00
Anton Korobeynikov	73935826d4	Make stack alignment options global for all targets llvm-svn: 50157	2008-04-23 18:18:10 +00:00
Nicolas Geoffray	1f3211af01	Correlate stubs with functions in JIT: when emitting a stub, the JIT tells the memory manager which function the stub will resolve. llvm-svn: 49814	2008-04-16 20:46:05 +00:00
Nicolas Geoffray	82baa2d2c6	Infrastructure for getting the machine code size of a function and an instruction. X86, PowerPC and ARM are implemented llvm-svn: 49809	2008-04-16 20:10:13 +00:00
Evan Cheng	cff9295e43	Sort sub-registers and super-registers lists according to super-sub register relations. e.g. X86::RAX sub-register list is EAX, AX, AL, AH (order of last two are not guaranteed). llvm-svn: 49714	2008-04-15 07:56:03 +00:00
Dan Gohman	8d46278998	Fix const-correctness issues with the SrcValue handling in the memory intrinsic expansion code. llvm-svn: 49666	2008-04-14 17:55:48 +00:00
Dale Johannesen	edcba1161f	Reverse sense of unwind-tables option. This means stack tracebacks on Darwin x86-64 won't work by default; nevertheless, everybody but me thinks this is a good idea. llvm-svn: 49663	2008-04-14 17:54:17 +00:00
Dan Gohman	237a69b49c	Clean up some comments. llvm-svn: 49661	2008-04-14 17:45:20 +00:00
Dan Gohman	15edbf989f	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dan Gohman	b3a511b236	Make isVectorClearMaskLegal's operand list const. llvm-svn: 49446	2008-04-09 20:09:42 +00:00
Dan Gohman	b05ea92f80	Fix some minor errors in comments. llvm-svn: 49445	2008-04-09 20:08:06 +00:00
Dale Johannesen	ec0fe04044	Implement new llc flag -disable-required-unwind-tables. Corresponds to -fno-unwind-tables (usually default in gcc). llvm-svn: 49361	2008-04-08 00:10:24 +00:00
Evan Cheng	38a755499d	Move reMaterialize() from TargetRegisterInfo to TargetInstrInfo. llvm-svn: 48995	2008-03-31 20:40:39 +00:00
Chris Lattner	49e9edd6f6	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Dale Johannesen	4524d3a1d1	Fix a bug in Darwin EH: FDE->CIE pointer must be relocatable. Describe why .set is needed better. llvm-svn: 48848	2008-03-26 23:31:39 +00:00
Dan Gohman	22002efa15	A quick nm audit turned up several fixed tables and objects that were marked read-write. Use const so that they can be allocated in a read-only segment. llvm-svn: 48800	2008-03-25 21:45:14 +00:00
Devang Patel	71f3156ea3	Add optimize-for-size knob. llvm-svn: 48793	2008-03-25 21:02:35 +00:00
Christopher Lamb	b4f4b41048	Make insert_subreg a two-address instruction, vastly simplifying LowerSubregs pass. Add a new TII, subreg_to_reg, which is like insert_subreg except that it takes an immediate implicit value to insert into rather than a register. llvm-svn: 48412	2008-03-16 03:12:01 +00:00
Evan Cheng	9af68f56c5	Remove isImplicitDef TargetInstrDesc flag. llvm-svn: 48381	2008-03-15 00:19:36 +00:00
Evan Cheng	11d2c09adc	Replace all target specific implicit def instructions with a target independent one: TargetInstrInfo::IMPLICIT_DEF. llvm-svn: 48380	2008-03-15 00:03:38 +00:00
Christopher Lamb	0f1c32eb63	Get rid of a pseudo instruction and replace it with subreg based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side effects. Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes. llvm-svn: 48329	2008-03-13 05:47:01 +00:00
Dan Gohman	8e3c88c4e9	Use PassManagerBase instead of FunctionPassManager for functions that merely add passes. This allows them to be used with either FunctionPassManager or PassManager, or even with a custom new kind of pass manager. llvm-svn: 48256	2008-03-11 22:29:46 +00:00
Evan Cheng	af1c76846d	When the register allocator runs out of registers, spill a physical register around the def's and use's of the interval being allocated to make it possible for the interval to target a register and spill it right away and restore a register for uses. This likely generates terrible code but is better than aborting. llvm-svn: 48218	2008-03-11 07:19:34 +00:00
Scott Michel	bb8e8fca47	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend on its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Evan Cheng	e0b3c221ab	Add a target lowering hook to control whether it's worthwhile to compress fp constant. For x86, if sse2 is available, it's not a good idea since cvtss2sd is slower than a movsd load and it prevents load folding. On x87, it's important to shrink fp constant since fldt is very expensive. llvm-svn: 47931	2008-03-05 01:30:59 +00:00
Anton Korobeynikov	875da2420a	Use enumeration for preferred EH dwarf encoding reason llvm-svn: 47770	2008-02-29 22:09:08 +00:00
Dale Johannesen	310201ba32	Interface of getByValTypeAlignment differed between generic & x86 versions; change generic to follow x86 and improve comments. Add PPC version (not right for non-Darwin.) llvm-svn: 47734	2008-02-28 22:31:51 +00:00
Evan Cheng	f8b1257d2e	Add a quick and dirty "loop aligner pass". x86 uses it to align its loops to 16-byte boundaries. llvm-svn: 47703	2008-02-28 00:43:03 +00:00
Anton Korobeynikov	7bd4b77e30	Preparation step for some cleanup/generalization in EH information emission: provide TAI hook for selection of EH data emission format. Currently unused. llvm-svn: 47699	2008-02-27 23:33:50 +00:00
Dan Gohman	689d8cac04	Convert SimplifyDemandedMask and ShrinkDemandedConstant to use APInt. Change several cases in SimplifyDemandedMask that don't ever do any simplifying to reuse the logic in ComputeMaskedBits instead of duplicating it. llvm-svn: 47648	2008-02-27 00:25:32 +00:00
Bill Wendling	8fb166bf6c	Rename PrintableName to Name. llvm-svn: 47629	2008-02-26 21:47:57 +00:00
Bill Wendling	50f5c4be14	Change "Name" to "AsmName" in the target register info. Gee, a refactoring tool would have been a Godsend here! llvm-svn: 47625	2008-02-26 21:11:01 +00:00
Evan Cheng	40c26c71c0	Refactor inline asm constraint matching code out of SDIsel into TargetLowering. llvm-svn: 47587	2008-02-26 02:33:44 +00:00
Bill Wendling	a369a6add8	Some platforms use the same name for 32-bit and 64-bit registers (like %r3 on PPC) in their ASM files. However, it's hard for humans to read during debugging. Adding a new field to the register data that lets you specify a different name to be printed than the one that goes into the ASM file -- %x3 instead of %r3, for instance. llvm-svn: 47534	2008-02-24 00:56:13 +00:00
Evan Cheng	c7ef6dc2db	Remove an invalid assertion now that there are implicit virtual register operands. llvm-svn: 47493	2008-02-22 19:25:04 +00:00
Dale Johannesen	a96eb3a1d8	Pass alignment on ByVal parameters, from FE, all the way through. It is now used for codegen. llvm-svn: 47484	2008-02-22 17:49:45 +00:00
Andrew Lenharth	db9cd46f5d	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Anton Korobeynikov	0c5e186924	Unbreak build with gcc 4.3: provide missed includes and silence most annoying warnings. llvm-svn: 47367	2008-02-20 11:08:44 +00:00
Evan Cheng	2cb3fd8f72	Added CommuteChangesDestination(). This returns true if commuting the specified machine instr will change its definition register. llvm-svn: 47166	2008-02-15 18:21:33 +00:00
Duncan Sands	0056f1e823	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Dan Gohman	99b38405e3	Simplify some logic in ComputeMaskedBits. And change ComputeMaskedBits to pass the mask APInt by value, not by reference. llvm-svn: 47096	2008-02-13 22:28:48 +00:00
Dan Gohman	09023887f8	Convert SelectionDAG::ComputeMaskedBits to use APInt instead of uint64_t. Add an overload that supports the uint64_t interface for use by clients that haven't been updated yet. llvm-svn: 47039	2008-02-13 00:35:47 +00:00
Duncan Sands	b9bf0dcb7c	Add arbitrary integer support to getRegisterType and getNumRegisters. This is needed for calling functions with apint parameters or return values. llvm-svn: 46956	2008-02-11 11:09:23 +00:00
Duncan Sands	204c89cafa	Add a isBigEndian method to complement isLittleEndian. llvm-svn: 46954	2008-02-11 10:37:04 +00:00
Dan Gohman	cabaec582f	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Evan Cheng	90f03a0b88	It's not always safe to fold movsd into xorpd, etc. Check the alignment of the load address first to make sure it's 16 byte aligned. llvm-svn: 46893	2008-02-08 21:20:40 +00:00
Evan Cheng	c57ec111f2	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc. Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes. For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Evan Cheng	9ff6b89bd9	Frame index can be negative. llvm-svn: 46655	2008-02-02 00:17:00 +00:00
Evan Cheng	a63f6736f3	MRegisterInfo::getLocation() is a really bad idea. Its function is to calculate the offset from frame pointer to a stack slot and then storing the delta in a MachineLocation object. The name is bad (it implies a getter), and MRegisterInfo doesn't need to know about MachineLocation. Replace getLocation() with getFrameIndexOffset() which returns the delta from frame pointer to stack slot. Dwarf writer can then use the information for whatever it wants. llvm-svn: 46597	2008-01-31 03:37:28 +00:00
Evan Cheng	918b9c9335	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Duncan Sands	390baa691d	Use getPreferredAlignmentLog or getPreferredAlignment to get the alignment of global variables, rather than using hand-made versions. llvm-svn: 46495	2008-01-29 06:23:44 +00:00
Dale Johannesen	f12104ce4b	Handle 'X' constraint in asm's better. llvm-svn: 46485	2008-01-29 02:21:21 +00:00
Duncan Sands	b9f1e3df90	Add more assertions to catch accesses outside of arrays. Also, as a convenience, don't barf, just return false, if someone calls isTruncStoreLegal or isLoadXLegal with an extended type for the in memory type. llvm-svn: 46352	2008-01-25 10:20:53 +00:00
Evan Cheng	91089e6d66	Let each target decide byval alignment. For X86, it's 4-byte unless the aggregate contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type. llvm-svn: 46286	2008-01-23 23:17:41 +00:00
Chris Lattner	5c5e3031b0	remove magic numbers. llvm-svn: 46162	2008-01-18 17:13:03 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Dale Johannesen	319c7bb405	Fix and enable EH for x86-64 Darwin. Adds ShortenEHDataFor64Bits as a not-very-accurate abstraction to cover all the changes in DwarfWriter. Some cosmetic changes to Darwin assembly code for gcc testsuite compatibility. llvm-svn: 46029	2008-01-15 23:24:56 +00:00
Chris Lattner	bfffa4f21e	Simplify the side effect stuff a bit more and make licm/sinking both work right according to the new flags. This removes the TII::isReallySideEffectFree predicate, and adds TII::isInvariantLoad. It removes NeverHasSideEffects+MayHaveSideEffects and adds UnmodeledSideEffects as machine instr flags. Now the clients can decide everything they need. I think isRematerializable can be implemented in terms of the flags we have now, though I will let others tackle that. llvm-svn: 45843	2008-01-10 23:08:24 +00:00
Dale Johannesen	fdd4b3846f	Emit unused EH frames for weak definitions on Darwin, because assembler/linker can't cope with weak absolutes. PR 1880. llvm-svn: 45811	2008-01-10 02:03:30 +00:00
Chris Lattner	b02074514e	Fix PR1845 and rdar://5676945. Generic vectors smaller than hardware supported type will be scalarized, so we can infer their alignment from that info. We now codegen pr1845 into: _boolVectorSelect: lbz r2, 0(r3) stb r2, -16(r1) blr llvm-svn: 45796	2008-01-10 00:30:57 +00:00
Chris Lattner	9b7b3ade8f	add a mayLoad property for machine instructions, a corollary to mayStore. This is currently not set by anything. llvm-svn: 45748	2008-01-08 18:05:21 +00:00
Chris Lattner	ba567fa77b	split TargetInstrDesc out into its own header file. llvm-svn: 45696	2008-01-07 07:33:08 +00:00
Chris Lattner	f83aae613c	rename TargetInstrDescriptor -> TargetInstrDesc. Make MachineInstr::getDesc return a reference instead of a pointer, since it can never be null. llvm-svn: 45695	2008-01-07 07:27:27 +00:00
Chris Lattner	c9e870d7c6	remove a dead method. llvm-svn: 45694	2008-01-07 06:47:10 +00:00
Chris Lattner	57e851edfe	Rename all the M_* flags to be namespace qualified enums, and switch all clients over to using predicates instead of these flags directly. These are now private values which are only to be used to statically initialize the tables. llvm-svn: 45692	2008-01-07 06:42:05 +00:00
Chris Lattner	c745aa59b3	add more and significantly better comments to the rest of the machineinstr flags that can be set. Add predicates for the ones lacking it, and switch some clients over to using the predicates instead of Flags directly. llvm-svn: 45690	2008-01-07 06:21:53 +00:00
Chris Lattner	1cdb8f4da1	add some more comments, add an isImplicitDef() method, add isConditionalBranch() and isUnconditionalBranch() methods. llvm-svn: 45688	2008-01-07 05:38:38 +00:00
Chris Lattner	9b987de2c5	rename hasVariableOperands() -> isVariadic(). Add some comments. Evan, please review the comments I added to getNumDefs to make sure that they are accurate, thx. llvm-svn: 45687	2008-01-07 05:19:29 +00:00
Chris Lattner	b0e50db817	Move M_* flags down in the file. Move SchedClass up in the TargetInstrDescriptor class and shrink to 16-bits, saving a word in TargetInstrDescriptor. Add some comments. llvm-svn: 45686	2008-01-07 05:06:49 +00:00
Chris Lattner	9d38dfa4a5	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	55343065e3	remove MachineOpCode typedef. llvm-svn: 45679	2008-01-07 02:48:55 +00:00
Chris Lattner	96d0a93f8e	remove some uses of MachineOpCode, move getSchedClass into TargetInstrDescriptor from TargetInstrInfo. llvm-svn: 45678	2008-01-07 02:46:03 +00:00
Chris Lattner	93e1e6ee12	Add predicates methods to TargetOperandInfo, and switch all clients over to using them, instead of diddling Flags directly. Change the various flags from const variables to enums. llvm-svn: 45677	2008-01-07 02:39:19 +00:00
Chris Lattner	f7f96d818f	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Owen Anderson	f19692b2f6	Move even more functionality from MRegisterInfo into TargetInstrInfo. Some day I'll get it all moved over... llvm-svn: 45672	2008-01-07 01:35:02 +00:00
Chris Lattner	14310afe42	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	5489888580	rename isStore -> mayStore to more accurately reflect what it captures. llvm-svn: 45656	2008-01-06 08:36:04 +00:00
Chris Lattner	06c02cdcbc	describe isStore and simplify the implementation of hasUnmodelledSideEffects. No functionality change. llvm-svn: 45651	2008-01-06 05:43:21 +00:00
Evan Cheng	759f389846	X86 JIT PIC jumptable support. llvm-svn: 45616	2008-01-05 02:26:58 +00:00
Owen Anderson	2adf8c5533	Move some more functionality from MRegisterInfo to TargetInstrInfo. llvm-svn: 45603	2008-01-04 23:57:37 +00:00
Evan Cheng	7322e4dec4	X86 PIC JIT support fixes: encoding bugs, add lazy pointer stubs support. llvm-svn: 45575	2008-01-04 10:46:51 +00:00
Owen Anderson	e6856128ab	Move some more instruction creation methods from RegisterInfo into InstrInfo. llvm-svn: 45484	2008-01-01 21:11:32 +00:00
Chris Lattner	1285ec2ae7	Fix a problem where lib/Target/TargetInstrInfo.h would include and use a header file from libcodegen. This violates a layering order: codegen depends on target, not the other way around. The fix to this is to split TII into two classes, TII and TargetInstrInfoImpl, which defines stuff that depends on libcodegen. It is defined in libcodegen, where the base is not. llvm-svn: 45475	2008-01-01 01:03:04 +00:00
Owen Anderson	ae7e2c1e03	Move copyRegToReg from MRegisterInfo to TargetInstrInfo. This is part of the Machine-level API cleanup instigated by Chris. llvm-svn: 45470	2007-12-31 06:32:00 +00:00
Chris Lattner	96167aa93c	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LiveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	e0b1ee937a	Don't attribute in file headers anymore. See llvmdev for the discussion of this change. Boy are my fingers tired. ;-) llvm-svn: 45411	2007-12-29 19:59:42 +00:00
Chris Lattner	a8f6fac7a3	Tell TargetLoweringOpt whether it is running before or after legalize. llvm-svn: 45321	2007-12-22 20:56:36 +00:00
Bill Wendling	0df69490dd	s/hasSideEffects/hasUnmodelledSideEffects/g llvm-svn: 45133	2007-12-17 23:19:54 +00:00
Bill Wendling	2d672998c5	Add "hasSideEffects" method to MachineInstrInfo class. llvm-svn: 45126	2007-12-17 21:53:30 +00:00
Bill Wendling	ec8be72a8b	As per feedback, revised comments to (hopefully) make the different side effect flags clearer. llvm-svn: 45120	2007-12-17 21:02:07 +00:00
Dan Gohman	a0d3f7d88c	Fix a typo in a comment. llvm-svn: 45032	2007-12-14 15:13:08 +00:00
Bill Wendling	c8c611e88f	Add flags to indicate that there are "never" side effects or that there "may be" side effects for machine instructions. llvm-svn: 45022	2007-12-14 01:48:59 +00:00
Evan Cheng	64a1febf9a	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Duncan Sands	47526c4a42	Remove host endianness info from TargetData and put it in a new header System/Host.h instead. Instead of getting the endianness from configure, calculate it directly. llvm-svn: 44959	2007-12-12 23:03:45 +00:00
Dan Gohman	4bf237b584	Remove a forward-declaration for a non-existent class. llvm-svn: 44955	2007-12-12 22:25:09 +00:00
Bill Wendling	e8eea25ad3	Bit masks conflicted. Needed to bump them by one. llvm-svn: 44903	2007-12-12 01:51:58 +00:00
Chris Lattner	f7c53191c0	Move TargetData::hostIsLittleEndian out of line, which means we don't have to #include config.h in it. #including config.h breaks other projects that have their own autoconf stuff and try to #include the llvm headers. One obscure example is llvm-gcc. llvm-svn: 44825	2007-12-11 00:28:59 +00:00
Duncan Sands	1279851352	Fix PR1836: in the interpreter, read and write apints using the minimum possible number of bytes. For little endian targets run on little endian machines, apints are stored in memory from LSB to MSB as before. For big endian targets on big endian machines they are stored from MSB to LSB which wasn't always the case before (if the target and host endianness doesn't match values are stored according to the host's endianness). Doing this requires knowing the endianness of the host, which is determined when configuring - thanks go to Anton for this. Only having access to little endian machines I was unable to properly test the big endian part, which is also the most complicated... llvm-svn: 44796	2007-12-10 17:43:13 +00:00
Bill Wendling	8d8d9a2f5e	Reverting 44702. It wasn't correct to rename them. llvm-svn: 44727	2007-12-08 23:58:46 +00:00
Bill Wendling	d10837def7	Renaming: isTriviallyReMaterializable -> hasNoSideEffects isReallyTriviallyReMaterializable -> isTriviallyReMaterializable llvm-svn: 44702	2007-12-08 07:17:56 +00:00
Evan Cheng	8464a0bf00	Add an argument to storeRegToStackSlot and storeRegToAddr to specify whether the stored register is killed. llvm-svn: 44600	2007-12-05 03:14:33 +00:00
Evan Cheng	58b387dfb0	Remove redundant foldMemoryOperand variants and other code clean up. llvm-svn: 44517	2007-12-02 08:30:39 +00:00
Evan Cheng	79e8b92dc3	Allow some reloads to be folded in multi-use cases. Specifically testl r, r -> cmpl [mem], 0. llvm-svn: 44479	2007-12-01 02:07:52 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Dale Johannesen	3f01467781	File missing from previous patch. llvm-svn: 44259	2007-11-20 23:25:17 +00:00
Dan Gohman	27ac53cc23	Remove meaningless qualifiers from return types, avoiding compiler warnings. llvm-svn: 44240	2007-11-19 20:46:23 +00:00
Dale Johannesen	5fd9e7a615	Add parameter to getDwarfRegNum to permit targets to use different mappings for EH and debug info; no functional change yet. Fix warning in X86CodeEmitter. llvm-svn: 44056	2007-11-13 19:13:01 +00:00
Owen Anderson	aba398a5ce	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Hartmut Kaiser	a23dfa2c76	Fixed compilation errors on VC++. llvm-svn: 43836	2007-11-07 19:33:31 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Duncan Sands	d1bdbd010b	Eliminate the remaining uses of getTypeSize. This should only affect x86 when using long double. Now 12/16 bytes are output for long double globals (the exact amount depends on the alignment). This brings globals in line with the rest of LLVM: the space reserved for an object is now always the ABI size. One tricky point is that only 10 bytes should be output for long double if it is a field in a packed struct, which is the reason for the additional argument to EmitGlobalConstant. llvm-svn: 43688	2007-11-05 00:04:43 +00:00
Duncan Sands	662fb070a7	Change uses of getTypeSize to getABITypeSize, getTypeStoreSize or getTypeSizeInBits as appropriate in ScalarReplAggregates. The right change to make was not always obvious, so it would be good to have an sroa guru review this. While there I noticed some bugs, and fixed them: (1) arrays of x86 long double have holes due to alignment padding, but this wasn't being spotted by HasStructPadding (renamed to HasPadding). The same goes for arrays of oddly sized ints. Vectors also suffer from this, in fact the problem for vectors is much worse because basic vector assumptions seem to be broken by vectors of type with alignment padding. I didn't try to fix any of these vector problems. (2) The code for extracting smaller integers from larger ones (in the "int union" case) was wrong on big-endian machines for integers with size not a multiple of 8, like i1. Probably this is impossible to hit via llvm-gcc, but I fixed it anyway while there and added a testcase. I also got rid of some trailing whitespace and changed a function name which had an obvious typo in it. llvm-svn: 43672	2007-11-04 14:43:57 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwritten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. 
This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only affects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
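The three notions of size described in the commit message above can be illustrated with a small arithmetic sketch. This is not LLVM code: the helper names (round_up, type_sizes, array_size_bytes) are hypothetical, and the alignments passed in (64 bits for i36, 32 or 128 bits for x86 long double) are assumptions chosen so that the results match the figures quoted in the message.

```python
def round_up(value, multiple):
    """Round value up to the nearest multiple of `multiple`."""
    return (value + multiple - 1) // multiple * multiple


def type_sizes(size_in_bits, abi_align_bits):
    """Return (size, store size, ABI size) in bits for a type.

    size_in_bits   -- minimum bits needed to hold every value of the type
    abi_align_bits -- ABI alignment in bits (an assumed input, not queried
                      from any real target description)
    """
    # Store size: bits touched by a load or store, i.e. whole bytes.
    store_bits = round_up(size_in_bits, 8)
    # ABI size: store size rounded up to a multiple of the alignment.
    abi_bits = round_up(store_bits, abi_align_bits)
    return size_in_bits, store_bits, abi_bits


def array_size_bytes(length, size_in_bits, abi_align_bits):
    """Array size is the element count times the ABI size of one element."""
    _, _, abi_bits = type_sizes(size_in_bits, abi_align_bits)
    return length * abi_bits // 8


if __name__ == "__main__":
    print("i36:", type_sizes(36, 64))
    print("x86 long double, 32-bit alignment:", type_sizes(80, 32))
    print("x86 long double, 128-bit alignment:", type_sizes(80, 128))
    print("4 x long double (32-bit alignment), bytes:", array_size_bytes(4, 80, 32))
```

Under these assumed alignments, an i36 comes out as 36, 40 and 64 bits, and an x86 long double as 80, 80 and 96 or 128 bits, which agrees with the numbers given in the commit message.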
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Evan Cheng	dc2f1b1741	isSubRegOf() is a dup of isSubRegister. llvm-svn: 43249	2007-10-23 06:51:50 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Chris Lattner	45b8558ec5	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Chris Lattner	f02434cdaf	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Evan Cheng	0449186690	- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding. - Fix some copy+paste bugs. llvm-svn: 43153	2007-10-18 22:40:57 +00:00
Evan Cheng	c852780685	Use SmallVectorImpl instead of SmallVector with hardcoded size in MRegister public interface. llvm-svn: 43150	2007-10-18 21:29:24 +00:00
Gordon Henriksen	422d66e53e	Missing 'public' keyword. llvm-svn: 43121	2007-10-18 11:31:21 +00:00
Gordon Henriksen	a6050b38d2	Switching TargetMachineRegistry to use the new generic Registry. llvm-svn: 43094	2007-10-17 21:28:48 +00:00
Duncan Sands	0a5a15c3a0	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Duncan Sands	9d622a6de1	Initial infrastructure for arbitrary precision integer codegen support. This should have no effect on codegen for other types. Debatable bits: (1) the use (abuse?) of a set in SDNode::getValueTypeList; (2) the length of getTypeToTransformTo, which maybe should be refactored with a non-inline part for extended value types. llvm-svn: 43030	2007-10-16 09:56:48 +00:00
Chris Lattner	828830d360	Fix 80 col violation llvm-svn: 42976	2007-10-15 05:30:27 +00:00
Evan Cheng	2e2d6358bc	Change unfoldMemoryOperand(). User is now responsible for passing in the register used by the unfolded instructions. User can also specify whether to unfold the load, the store, or both. llvm-svn: 42946	2007-10-13 02:35:06 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Bill Wendling	19a314f8ac	Fix 80-column violations llvm-svn: 42823	2007-10-10 05:45:59 +00:00
Dan Gohman	7fa473514d	Add explicit keywords. llvm-svn: 42747	2007-10-08 15:08:41 +00:00
Evan Cheng	f536e2f41e	- Added a few target hooks to generate load / store instructions from / to any address (not just from / to frameindexes). - Added target hooks to unfold load / store instructions / SDNodes into separate load, data processing, store instructions / SDNodes. llvm-svn: 42621	2007-10-05 01:32:41 +00:00
Dale Johannesen	ef488c7b0e	Add getABITypeSize, getABITypeSizeInBits llvm-svn: 42488	2007-10-01 16:03:14 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Dan Gohman	50747737a5	TargetAsmInfo::getAddressSize() was incorrect for x86-64 and 64-bit targets other than PPC64. Instead of fixing it, just remove it and fix all the places that use it to use TargetData::getPointerSize() instead, as there aren't very many. Most of the references were in DwarfWriter.cpp. llvm-svn: 42419	2007-09-27 23:12:31 +00:00
Evan Cheng	0f9e307353	- Added MRegisterInfo::getCrossCopyRegClass() hook. For register classes where reg to reg copies are not possible, this returns another register class which registers in the specified register class can be copied to (and copy back from). - X86 copyRegToReg() now supports copying between EFLAGS and GR32 / GR64 registers. llvm-svn: 42372	2007-09-26 21:31:07 +00:00
Evan Cheng	5f9e291240	Allow copyRegToReg to emit cross register classes copies. Tested with "make check"! llvm-svn: 42346	2007-09-26 06:25:56 +00:00
Evan Cheng	6bfaf13dd9	New temporary option -new-cc-modeling-scheme to test the new cc modeling scheme. llvm-svn: 42283	2007-09-25 01:50:04 +00:00
Dan Gohman	b642a1ddf8	Merge hasDotLoc and hasDotFile into hasDotLocAndDotFile since .loc and .file aren't really usable without each other. llvm-svn: 42274	2007-09-24 21:09:53 +00:00
Dale Johannesen	ea6ffa0b36	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Evan Cheng	859720c33b	Add CopyCost to TargetRegisterClass. This specifies the cost of copying a value between two registers in the specific class. llvm-svn: 42123	2007-09-19 01:35:01 +00:00
Evan Cheng	827f269fac	Added getNumDefs(). llvm-svn: 41901	2007-09-13 00:16:29 +00:00
Bill Wendling	bde1aacffa	Add accessor method. llvm-svn: 41854	2007-09-11 17:24:38 +00:00
Bill Wendling	74888e63e7	Add a bool to indicate if we should set the "indirect encoding" bit in the Dwarf information for EH. llvm-svn: 41852	2007-09-11 17:20:55 +00:00
Bill Wendling	3c6a8b2814	The personality function on Darwin needs a global stub. We then refer to that global stub instead of doing the ".set" thingy we were doing before. llvm-svn: 41838	2007-09-11 08:27:17 +00:00
Rafael Espindola	8c57e70f93	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Owen Anderson	4b71e55287	Add lengthof and endof templates that hide a lot of sizeof computations. Patch by Sterling Stein! llvm-svn: 41758	2007-09-07 04:06:50 +00:00
Evan Cheng	58ea935f6f	Add a variant of foldMemoryOperand to fold any load / store, not just load / store from / to stack slots. llvm-svn: 41597	2007-08-30 05:52:20 +00:00
Dale Johannesen	e91a908971	Change LegalFPImmediates to use APFloat. Add APFloat interfaces to ConstantFP, SelectionDAG. Fix integer bit in double->APFloat conversion. Convert LegalizeDAG to use APFloat interface in ConstantFPSDNode uses. llvm-svn: 41587	2007-08-30 00:23:21 +00:00
Dan Gohman	e2c92effea	Add explicit keywords and remove spurious trailing semicolons. llvm-svn: 41482	2007-08-27 14:50:10 +00:00
Chris Lattner	1e089aac3a	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Rafael Espindola	68d95ff2b1	Partial implementation of calling functions with byval arguments: the needed information is propagated to the DAG, and the X86-64 backend detects it and aborts. llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Reid Spencer	8c5c7c8453	Change casts from old style to new style. This helps document the details better, gives the compiler a chance to validate the cast and reduces warnings if the user turns on -Wold-style-cast option. llvm-svn: 41033	2007-08-12 08:12:35 +00:00
Christopher Lamb	e0c9bd8d2e	Move isSubRegOf into MRegisterInfo. Fix a missed move elimination in LowerSubregs and add more debugging output there. llvm-svn: 41005	2007-08-10 21:11:55 +00:00
Dale Johannesen	a1340d5916	Update per review comments. llvm-svn: 40965	2007-08-09 17:27:48 +00:00
Dale Johannesen	79551baaad	long double 9 of N. This finishes up the X86-32 bits (constants are still not handled). Adds ConvertActions to control fp-to-fp conversions (these are currently defaulted for all other targets, so no changes there). llvm-svn: 40958	2007-08-09 01:04:01 +00:00
Reid Spencer	3e54e72ef1	Fix minor doxygen nits. llvm-svn: 40854	2007-08-05 20:06:04 +00:00
Evan Cheng	8f126e59f5	Added TargetInstrDescriptor::numDefs - num of results. llvm-svn: 40709	2007-08-02 00:20:17 +00:00
Dan Gohman	7dd04fb91c	More explicit keywords. llvm-svn: 40673	2007-08-01 15:32:29 +00:00
Dan Gohman	51e7be7b8f	Fix the comments for the 'fast' parameter in addPassesToEmitFile. llvm-svn: 40592	2007-07-30 15:04:59 +00:00
Dan Gohman	23f9a3ad3b	Fix the comment for getClosestTargetForJIT to reflect the fact that it does not have a Module parameter. llvm-svn: 40590	2007-07-30 14:58:59 +00:00
Dan Gohman	75473b49ea	More explicit keywords. llvm-svn: 40589	2007-07-30 14:51:59 +00:00
Duncan Sands	e8bb2c6d32	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Christopher Lamb	7bef240f69	Have register info provide the inverse mapping of register->superregisters. PR1350 llvm-svn: 40519	2007-07-26 08:01:58 +00:00
Christopher Lamb	9a0d88efde	Add target independent MachineInstr's to represent subreg insert/extract in MBB's. PR1350 llvm-svn: 40518	2007-07-26 07:48:21 +00:00
Evan Cheng	cdb4e7949f	Added -print-emitted-asm to print out JIT generated asm to cerr. llvm-svn: 40123	2007-07-20 21:56:13 +00:00
Dan Gohman	0ba554c0c8	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Anton Korobeynikov	5635277c36	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Evan Cheng	6125079452	Add OptionalDefOperand. Remove clobbersPred. Also add DefinesPredicate to be used by if-converter. llvm-svn: 38499	2007-07-10 18:06:29 +00:00
Dan Gohman	3f7558673e	Fix a typo in a comment. llvm-svn: 38456	2007-07-09 15:15:24 +00:00
Dan Gohman	61966b8551	Remove redundant declarations. llvm-svn: 37946	2007-07-06 13:59:28 +00:00
Dan Gohman	c6bdcfa8c0	Add new TargetLowering code to provide the final register type that an illegal value type will be transformed to, for code that needs the register type after all transformations instead of just after the first transformation. Factor out the code that uses this information to do copy-from-regs and copy-to-regs for various purposes into separate functions so that they are done consistently. llvm-svn: 37781	2007-06-28 23:29:44 +00:00
Dan Gohman	cb89e19a6d	Rename ("shrinkify") MVT::isExtendedValueType to MVT::isExtendedVT. llvm-svn: 37758	2007-06-27 16:08:04 +00:00
Evan Cheng	30adb50aaf	Add comment. llvm-svn: 37741	2007-06-26 21:19:07 +00:00
Evan Cheng	ac10d44736	Add immediate sub-registers. llvm-svn: 37738	2007-06-26 20:59:16 +00:00
Dan Gohman	eea90f90af	Replace ?: with if statements, for clarity. llvm-svn: 37735	2007-06-26 16:19:08 +00:00
Dan Gohman	fb8c9beba3	Simplify the expression for TargetLowering::isTypeLegal. llvm-svn: 37732	2007-06-26 15:16:27 +00:00
Dan Gohman	9cbc3fb1ab	Revert the earlier change that removed the M_REMATERIALIZABLE machine instruction flag, and use the flag along with a virtual member function hook for targets to override if there are instructions that are only trivially rematerializable with specific operands (i.e. constant pool loads). llvm-svn: 37728	2007-06-26 00:48:07 +00:00
Dan Gohman	354f02e03d	Generalize MVT::ValueType and associated functions to be able to represent extended vector types. Remove the special SDNode opcodes used for pre-legalize vector operations, and the special MVT::Vector type used with them. Adjust lowering and legalize to work with the normal SDNode kinds instead, and to use the normal MVT functions to work with vector types instead of using the two special operands that the pre-legalize nodes held. This allows pre-legalize and post-legalize DAGs, and the code that operates on them, to be more consistent. Pre-legalize vector operators can be handled more consistently with scalar operators. And, -view-dag-combine1-dags and -view-legalize-dags now look prettier for vector code. llvm-svn: 37719	2007-06-25 16:23:39 +00:00
Dan Gohman	a62327ea40	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Dan Gohman	1815e9bdb3	Rename TargetLowering::getNumElements and friends to TargetLowering::getNumRegisters and similar, to avoid confusion with the actual number of elements for vector types. llvm-svn: 37687	2007-06-21 14:42:22 +00:00
Dan Gohman	b60d8a92c9	Replace M_REMATERIALIZIBLE and the newly-added isOtherReMaterializableLoad with a general target hook to identify rematerializable instructions. Some instructions are only rematerializable with specific operands, such as loads from constant pools, while others are always rematerializable. This hook allows both to be identified as being rematerializable with the same mechanism. llvm-svn: 37644	2007-06-19 01:48:05 +00:00
Evan Cheng	4358a7f1e9	Replace CanBeDuplicated() with a M_NOT_DUPLICABLE bit. llvm-svn: 37642	2007-06-19 01:21:41 +00:00
Evan Cheng	34a6cb0b2e	Added CanBeDuplicated(). It returns true if an instruction can be safely duplicated (e.g. during ifcvt). llvm-svn: 37605	2007-06-15 21:13:54 +00:00
Dale Johannesen	62f49dd524	Do not treat FP_REG_KILL as terminator in branch analysis (X86). llvm-svn: 37578	2007-06-14 22:03:45 +00:00
Dan Gohman	35f2b4d716	Add a target hook to allow loads from constant pools to be rematerialized, and an implementation for x86. llvm-svn: 37576	2007-06-14 20:50:44 +00:00
Dan Gohman	74c92798d8	Eliminate some redundant newlines in asm output. llvm-svn: 37574	2007-06-14 15:00:27 +00:00
Christopher Lamb	68017d151b	Add support to tablegen for specifying subregister classes on a per register class basis. llvm-svn: 37572	2007-06-13 22:20:15 +00:00
Evan Cheng	1a6c0341fd	Add a utility routine to check for unpredicated terminator instruction. llvm-svn: 37528	2007-06-08 21:59:56 +00:00
Evan Cheng	4de6599e83	Add a machine instruction flag indicating the instruction can clobber condition code / register(s) used to predicate instructions. llvm-svn: 37464	2007-06-06 10:13:55 +00:00

1097 Commits