llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Evan Cheng	40c26c71c0	Refactor inline asm constraint matching code out of SDIsel into TargetLowering. llvm-svn: 47587	2008-02-26 02:33:44 +00:00
Bill Wendling	a369a6add8	Some platforms use the same name for 32-bit and 64-bit registers (like %r3 on PPC) in their ASM files. However, it's hard for humans to read during debugging. Adding a new field to the register data that lets you specify a different name to be printed than the one that goes into the ASM file -- %x3 instead of %r3, for instance. llvm-svn: 47534	2008-02-24 00:56:13 +00:00
Evan Cheng	c7ef6dc2db	Remove an invalid assertion now that there are implicit virtual register operands. llvm-svn: 47493	2008-02-22 19:25:04 +00:00
Dale Johannesen	a96eb3a1d8	Pass alignment on ByVal parameters, from FE, all the way through. It is now used for codegen. llvm-svn: 47484	2008-02-22 17:49:45 +00:00
Andrew Lenharth	db9cd46f5d	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Anton Korobeynikov	0c5e186924	Unbreak build with gcc 4.3: provide missed includes and silence most annoying warnings. llvm-svn: 47367	2008-02-20 11:08:44 +00:00
Evan Cheng	2cb3fd8f72	Added CommuteChangesDestination(). This returns true if commuting the specified machine instr will change its definition register. llvm-svn: 47166	2008-02-15 18:21:33 +00:00
Duncan Sands	0056f1e823	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Dan Gohman	99b38405e3	Simplify some logic in ComputeMaskedBits. And change ComputeMaskedBits to pass the mask APInt by value, not by reference. llvm-svn: 47096	2008-02-13 22:28:48 +00:00
Dan Gohman	09023887f8	Convert SelectionDAG::ComputeMaskedBits to use APInt instead of uint64_t. Add an overload that supports the uint64_t interface for use by clients that haven't been updated yet. llvm-svn: 47039	2008-02-13 00:35:47 +00:00
Duncan Sands	b9bf0dcb7c	Add arbitrary integer support to getRegisterType and getNumRegisters. This is needed for calling functions with apint parameters or return values. llvm-svn: 46956	2008-02-11 11:09:23 +00:00
Duncan Sands	204c89cafa	Add a isBigEndian method to complement isLittleEndian. llvm-svn: 46954	2008-02-11 10:37:04 +00:00
Dan Gohman	cabaec582f	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Evan Cheng	90f03a0b88	It's not always safe to fold movsd into xorpd, etc. Check the alignment of the load address first to make sure it's 16 byte aligned. llvm-svn: 46893	2008-02-08 21:20:40 +00:00
Evan Cheng	c57ec111f2	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc. Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes. For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Evan Cheng	9ff6b89bd9	Frame index can be negative. llvm-svn: 46655	2008-02-02 00:17:00 +00:00
Evan Cheng	a63f6736f3	MRegisterInfo::getLocation() is a really bad idea. Its function is to calculate the offset from frame pointer to a stack slot and then storing the delta in a MachineLocation object. The name is bad (it implies a getter), and MRegisterInfo doesn't need to know about MachineLocation. Replace getLocation() with getFrameIndexOffset() which returns the delta from frame pointer to stack slot. Dwarf writer can then use the information for whatever it wants. llvm-svn: 46597	2008-01-31 03:37:28 +00:00
Evan Cheng	918b9c9335	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Duncan Sands	390baa691d	Use getPreferredAlignmentLog or getPreferredAlignment to get the alignment of global variables, rather than using hand-made versions. llvm-svn: 46495	2008-01-29 06:23:44 +00:00
Dale Johannesen	f12104ce4b	Handle 'X' constraint in asm's better. llvm-svn: 46485	2008-01-29 02:21:21 +00:00
Duncan Sands	b9f1e3df90	Add more assertions to catch accesses outside of arrays. Also, as a convenience, don't barf, just return false, if someone calls isTruncStoreLegal or isLoadXLegal with an extended type for the in memory type. llvm-svn: 46352	2008-01-25 10:20:53 +00:00
Evan Cheng	91089e6d66	Let each target decide byval alignment. For X86, it's 4-byte unless the aggregare contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type. llvm-svn: 46286	2008-01-23 23:17:41 +00:00
Chris Lattner	5c5e3031b0	remove magic numbers. llvm-svn: 46162	2008-01-18 17:13:03 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Dale Johannesen	319c7bb405	Fix and enable EH for x86-64 Darwin. Adds ShortenEHDataFor64Bits as a not-very-accurate abstraction to cover all the changes in DwarfWriter. Some cosmetic changes to Darwin assembly code for gcc testsuite compatibility. llvm-svn: 46029	2008-01-15 23:24:56 +00:00
Chris Lattner	bfffa4f21e	Simplify the side effect stuff a bit more and make licm/sinking both work right according to the new flags. This removes the TII::isReallySideEffectFree predicate, and adds TII::isInvariantLoad. It removes NeverHasSideEffects+MayHaveSideEffects and adds UnmodeledSideEffects as machine instr flags. Now the clients can decide everything they need. I think isRematerializable can be implemented in terms of the flags we have now, though I will let others tackle that. llvm-svn: 45843	2008-01-10 23:08:24 +00:00
Dale Johannesen	fdd4b3846f	Emit unused EH frames for weak definitions on Darwin, because assembler/linker can't cope with weak absolutes. PR 1880. llvm-svn: 45811	2008-01-10 02:03:30 +00:00
Chris Lattner	b02074514e	Fix PR1845 and rdar://5676945. Generic vectors smaller than hardware supported type will be scalarized, so we can infer their alignment from that info. We now codegen pr1845 into: _boolVectorSelect: lbz r2, 0(r3) stb r2, -16(r1) blr llvm-svn: 45796	2008-01-10 00:30:57 +00:00
Chris Lattner	9b7b3ade8f	add a mayLoad property for machine instructions, a correlary to mayStore. This is currently not set by anything. llvm-svn: 45748	2008-01-08 18:05:21 +00:00
Chris Lattner	ba567fa77b	split TargetInstrDesc out into its own header file. llvm-svn: 45696	2008-01-07 07:33:08 +00:00
Chris Lattner	f83aae613c	rename TargetInstrDescriptor -> TargetInstrDesc. Make MachineInstr::getDesc return a reference instead of a pointer, since it can never be null. llvm-svn: 45695	2008-01-07 07:27:27 +00:00
Chris Lattner	c9e870d7c6	remove a dead method. llvm-svn: 45694	2008-01-07 06:47:10 +00:00
Chris Lattner	57e851edfe	Rename all the M_* flags to be namespace qualified enums, and switch all clients over to using predicates instead of these flags directly. These are now private values which are only to be used to statically initialize the tables. llvm-svn: 45692	2008-01-07 06:42:05 +00:00
Chris Lattner	c745aa59b3	add more and significantly better comments to the rest of the machineinstr flags that can be set. Add predicates for the ones lacking it, and switch some clients over to using the predicates instead of Flags directly. llvm-svn: 45690	2008-01-07 06:21:53 +00:00
Chris Lattner	1cdb8f4da1	add some mroe comments, add a isImplicitDef() method, add isConditionalBranch() and isUnconditionalBranch() methods. llvm-svn: 45688	2008-01-07 05:38:38 +00:00
Chris Lattner	9b987de2c5	rename hasVariableOperands() -> isVariadic(). Add some comments. Evan, please review the comments I added to getNumDefs to make sure that they are accurate, thx. llvm-svn: 45687	2008-01-07 05:19:29 +00:00
Chris Lattner	b0e50db817	Move M_* flags down in the file. Move SchedClass up in the TargetInstrDescriptor class and shrink to 16-bits, saving a word in TargetInstrDescriptor. Add some comments. llvm-svn: 45686	2008-01-07 05:06:49 +00:00
Chris Lattner	9d38dfa4a5	Move a bunch more accessors from TargetInstrInfo to TargetInstrDescriptor llvm-svn: 45680	2008-01-07 03:13:06 +00:00
Chris Lattner	55343065e3	remove MachineOpCode typedef. llvm-svn: 45679	2008-01-07 02:48:55 +00:00
Chris Lattner	96d0a93f8e	remove some uses of MachineOpCode, move getSchedClass into TargetInstrDescriptor from TargetInstrInfo. llvm-svn: 45678	2008-01-07 02:46:03 +00:00
Chris Lattner	93e1e6ee12	Add predicates methods to TargetOperandInfo, and switch all clients over to using them, instead of diddling Flags directly. Change the various flags from const variables to enums. llvm-svn: 45677	2008-01-07 02:39:19 +00:00
Chris Lattner	f7f96d818f	Rename MachineInstr::getInstrDescriptor -> getDesc(), which reflects that it is cheap and efficient to get. Move a variety of predicates from TargetInstrInfo into TargetInstrDescriptor, which makes it much easier to query a predicate when you don't have TII around. Now you can use MI->getDesc()->isBranch() instead of going through TII, and this is much more efficient anyway. Not all of the predicates have been moved over yet. Update old code that used MI->getInstrDescriptor()->Flags to use the new predicates in many places. llvm-svn: 45674	2008-01-07 01:56:04 +00:00
Owen Anderson	f19692b2f6	Move even more functionality from MRegisterInfo into TargetInstrInfo. Some day I'll get it all moved over... llvm-svn: 45672	2008-01-07 01:35:02 +00:00
Chris Lattner	14310afe42	rename isLoad -> isSimpleLoad due to evan's desire to have such a predicate. llvm-svn: 45667	2008-01-06 23:38:27 +00:00
Chris Lattner	5489888580	rename isStore -> mayStore to more accurately reflect what it captures. llvm-svn: 45656	2008-01-06 08:36:04 +00:00
Chris Lattner	06c02cdcbc	describe isStore and simplify the implementation of hasUnmodelledSideEffects. No functionality change. llvm-svn: 45651	2008-01-06 05:43:21 +00:00
Evan Cheng	759f389846	X86 JIT PIC jumptable support. llvm-svn: 45616	2008-01-05 02:26:58 +00:00
Owen Anderson	2adf8c5533	Move some more functionality from MRegisterInfo to TargetInstrInfo. llvm-svn: 45603	2008-01-04 23:57:37 +00:00
Evan Cheng	7322e4dec4	X86 PIC JIT support fixes: encoding bugs, add lazy pointer stubs support. llvm-svn: 45575	2008-01-04 10:46:51 +00:00
Owen Anderson	e6856128ab	Move some more instruction creation methods from RegisterInfo into InstrInfo. llvm-svn: 45484	2008-01-01 21:11:32 +00:00
Chris Lattner	1285ec2ae7	Fix a problem where lib/Target/TargetInstrInfo.h would include and use a header file from libcodegen. This violates a layering order: codegen depends on target, not the other way around. The fix to this is to split TII into two classes, TII and TargetInstrInfoImpl, which defines stuff that depends on libcodegen. It is defined in libcodegen, where the base is not. llvm-svn: 45475	2008-01-01 01:03:04 +00:00
Owen Anderson	ae7e2c1e03	Move copyRegToReg from MRegisterInfo to TargetInstrInfo. This is part of the Machine-level API cleanup instigated by Chris. llvm-svn: 45470	2007-12-31 06:32:00 +00:00
Chris Lattner	96167aa93c	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LoveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	e0b1ee937a	Don't attribute in file headers anymore. See llvmdev for the discussion of this change. Boy are my fingers tired. ;-) llvm-svn: 45411	2007-12-29 19:59:42 +00:00
Chris Lattner	a8f6fac7a3	Tell TargetLoweringOpt whether it is running before or after legalize. llvm-svn: 45321	2007-12-22 20:56:36 +00:00
Bill Wendling	0df69490dd	s/hasSideEffects/hasUnmodelledSideEffects/g llvm-svn: 45133	2007-12-17 23:19:54 +00:00
Bill Wendling	2d672998c5	Add "hasSideEffects" method to MachineInstrInfo class. llvm-svn: 45126	2007-12-17 21:53:30 +00:00
Bill Wendling	ec8be72a8b	As per feedback, revised comments to (hopefully) make the different side effect flags clearer. llvm-svn: 45120	2007-12-17 21:02:07 +00:00
Dan Gohman	a0d3f7d88c	Fix a typo in a comment. llvm-svn: 45032	2007-12-14 15:13:08 +00:00
Bill Wendling	c8c611e88f	Add flags to indicate that there are "never" side effects or that there "may be" side effects for machine instructions. llvm-svn: 45022	2007-12-14 01:48:59 +00:00
Evan Cheng	64a1febf9a	Implicit def instructions, e.g. X86::IMPLICIT_DEF_GR32, are always re-materializable and they should not be spilled. llvm-svn: 44960	2007-12-12 23:12:09 +00:00
Duncan Sands	47526c4a42	Remove host endianness info from TargetData and put it in a new header System/Host.h instead. Instead of getting the endianness from configure, calculate it directly. llvm-svn: 44959	2007-12-12 23:03:45 +00:00
Dan Gohman	4bf237b584	Remove a forward-declaration for a non-existant class. llvm-svn: 44955	2007-12-12 22:25:09 +00:00
Bill Wendling	e8eea25ad3	Bit masks conflicted. Needed to bump them by one. llvm-svn: 44903	2007-12-12 01:51:58 +00:00
Chris Lattner	f7c53191c0	Move TargetData::hostIsLittleEndian out of line, which means we don't have to #include config.h in it. #including config.h breaks other projects that have their own autoconf stuff and try to #include the llvm headers. One obscure example is llvm-gcc. llvm-svn: 44825	2007-12-11 00:28:59 +00:00
Duncan Sands	1279851352	Fix PR1836: in the interpreter, read and write apints using the minimum possible number of bytes. For little endian targets run on little endian machines, apints are stored in memory from LSB to MSB as before. For big endian targets on big endian machines they are stored from MSB to LSB which wasn't always the case before (if the target and host endianness doesn't match values are stored according to the host's endianness). Doing this requires knowing the endianness of the host, which is determined when configuring - thanks go to Anton for this. Only having access to little endian machines I was unable to properly test the big endian part, which is also the most complicated... llvm-svn: 44796	2007-12-10 17:43:13 +00:00
Bill Wendling	8d8d9a2f5e	Reverting 44702. It wasn't correct to rename them. llvm-svn: 44727	2007-12-08 23:58:46 +00:00
Bill Wendling	d10837def7	Renaming: isTriviallyReMaterializable -> hasNoSideEffects isReallyTriviallyReMaterializable -> isTriviallyReMaterializable llvm-svn: 44702	2007-12-08 07:17:56 +00:00
Evan Cheng	8464a0bf00	Add a argument to storeRegToStackSlot and storeRegToAddr to specify whether the stored register is killed. llvm-svn: 44600	2007-12-05 03:14:33 +00:00
Evan Cheng	58b387dfb0	Remove redundant foldMemoryOperand variants and other code clean up. llvm-svn: 44517	2007-12-02 08:30:39 +00:00
Evan Cheng	79e8b92dc3	Allow some reloads to be folded in multi-use cases. Specifically testl r, r -> cmpl [mem], 0. llvm-svn: 44479	2007-12-01 02:07:52 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Dale Johannesen	3f01467781	File missing from previous patch. llvm-svn: 44259	2007-11-20 23:25:17 +00:00
Dan Gohman	27ac53cc23	Remove meaningless qualifiers from return types, avoiding compiler warnings. llvm-svn: 44240	2007-11-19 20:46:23 +00:00
Dale Johannesen	5fd9e7a615	Add parameter to getDwarfRegNum to permit targets to use different mappings for EH and debug info; no functional change yet. Fix warning in X86CodeEmitter. llvm-svn: 44056	2007-11-13 19:13:01 +00:00
Owen Anderson	aba398a5ce	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Hartmut Kaiser	a23dfa2c76	Fixed compilation errors on VC++. llvm-svn: 43836	2007-11-07 19:33:31 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Duncan Sands	d1bdbd010b	Eliminate the remaining uses of getTypeSize. This should only effect x86 when using long double. Now 12/16 bytes are output for long double globals (the exact amount depends on the alignment). This brings globals in line with the rest of LLVM: the space reserved for an object is now always the ABI size. One tricky point is that only 10 bytes should be output for long double if it is a field in a packed struct, which is the reason for the additional argument to EmitGlobalConstant. llvm-svn: 43688	2007-11-05 00:04:43 +00:00
Duncan Sands	662fb070a7	Change uses of getTypeSize to getABITypeSize, getTypeStoreSize or getTypeSizeInBits as appropriate in ScalarReplAggregates. The right change to make was not always obvious, so it would be good to have an sroa guru review this. While there I noticed some bugs, and fixed them: (1) arrays of x86 long double have holes due to alignment padding, but this wasn't being spotted by HasStructPadding (renamed to HasPadding). The same goes for arrays of oddly sized ints. Vectors also suffer from this, in fact the problem for vectors is much worse because basic vector assumptions seem to be broken by vectors of type with alignment padding. I didn't try to fix any of these vector problems. (2) The code for extracting smaller integers from larger ones (in the "int union" case) was wrong on big-endian machines for integers with size not a multiple of 8, like i1. Probably this is impossible to hit via llvm-gcc, but I fixed it anyway while there and added a testcase. I also got rid of some trailing whitespace and changed a function name which had an obvious typo in it. llvm-svn: 43672	2007-11-04 14:43:57 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Evan Cheng	dc2f1b1741	isSubRegOf() is a dup of isSubRegister. llvm-svn: 43249	2007-10-23 06:51:50 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Chris Lattner	45b8558ec5	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Chris Lattner	f02434cdaf	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Evan Cheng	0449186690	- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding. - Fix some copy+paste bugs. llvm-svn: 43153	2007-10-18 22:40:57 +00:00
Evan Cheng	c852780685	Use SmallVectorImpl instead of SmallVector with hardcoded size in MRegister public interface. llvm-svn: 43150	2007-10-18 21:29:24 +00:00
Gordon Henriksen	422d66e53e	Missing 'public' keyword. llvm-svn: 43121	2007-10-18 11:31:21 +00:00
Gordon Henriksen	a6050b38d2	Switching TargetMachineRegistry to use the new generic Registry. llvm-svn: 43094	2007-10-17 21:28:48 +00:00
Duncan Sands	0a5a15c3a0	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems to correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Duncan Sands	9d622a6de1	Initial infrastructure for arbitrary precision integer codegen support. This should have no effect on codegen for other types. Debatable bits: (1) the use (abuse?) of a set in SDNode::getValueTypeList; (2) the length of getTypeToTransformTo, which maybe should be refactored with a non-inline part for extended value types. llvm-svn: 43030	2007-10-16 09:56:48 +00:00
Chris Lattner	828830d360	Fix 80 col violation llvm-svn: 42976	2007-10-15 05:30:27 +00:00
Evan Cheng	2e2d6358bc	Change unfoldMemoryOperand(). User is now responsible for passing in the register used by the unfolded instructions. User can also specify whether to unfold the load, the store, or both. llvm-svn: 42946	2007-10-13 02:35:06 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Bill Wendling	19a314f8ac	Fix 80-column violations llvm-svn: 42823	2007-10-10 05:45:59 +00:00
Dan Gohman	7fa473514d	Add explicit keywords. llvm-svn: 42747	2007-10-08 15:08:41 +00:00

1 2 3 4 5 ...

856 Commits