llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Hartmut Kaiser	a23dfa2c76	Fixed compilation errors on VC++. llvm-svn: 43836	2007-11-07 19:33:31 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Duncan Sands	d1bdbd010b	Eliminate the remaining uses of getTypeSize. This should only effect x86 when using long double. Now 12/16 bytes are output for long double globals (the exact amount depends on the alignment). This brings globals in line with the rest of LLVM: the space reserved for an object is now always the ABI size. One tricky point is that only 10 bytes should be output for long double if it is a field in a packed struct, which is the reason for the additional argument to EmitGlobalConstant. llvm-svn: 43688	2007-11-05 00:04:43 +00:00
Duncan Sands	662fb070a7	Change uses of getTypeSize to getABITypeSize, getTypeStoreSize or getTypeSizeInBits as appropriate in ScalarReplAggregates. The right change to make was not always obvious, so it would be good to have an sroa guru review this. While there I noticed some bugs, and fixed them: (1) arrays of x86 long double have holes due to alignment padding, but this wasn't being spotted by HasStructPadding (renamed to HasPadding). The same goes for arrays of oddly sized ints. Vectors also suffer from this, in fact the problem for vectors is much worse because basic vector assumptions seem to be broken by vectors of type with alignment padding. I didn't try to fix any of these vector problems. (2) The code for extracting smaller integers from larger ones (in the "int union" case) was wrong on big-endian machines for integers with size not a multiple of 8, like i1. Probably this is impossible to hit via llvm-gcc, but I fixed it anyway while there and added a testcase. I also got rid of some trailing whitespace and changed a function name which had an obvious typo in it. llvm-svn: 43672	2007-11-04 14:43:57 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Evan Cheng	dc2f1b1741	isSubRegOf() is a dup of isSubRegister. llvm-svn: 43249	2007-10-23 06:51:50 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Chris Lattner	45b8558ec5	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Chris Lattner	f02434cdaf	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Evan Cheng	0449186690	- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding. - Fix some copy+paste bugs. llvm-svn: 43153	2007-10-18 22:40:57 +00:00
Evan Cheng	c852780685	Use SmallVectorImpl instead of SmallVector with hardcoded size in MRegister public interface. llvm-svn: 43150	2007-10-18 21:29:24 +00:00
Gordon Henriksen	422d66e53e	Missing 'public' keyword. llvm-svn: 43121	2007-10-18 11:31:21 +00:00
Gordon Henriksen	a6050b38d2	Switching TargetMachineRegistry to use the new generic Registry. llvm-svn: 43094	2007-10-17 21:28:48 +00:00
Duncan Sands	0a5a15c3a0	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems to correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Duncan Sands	9d622a6de1	Initial infrastructure for arbitrary precision integer codegen support. This should have no effect on codegen for other types. Debatable bits: (1) the use (abuse?) of a set in SDNode::getValueTypeList; (2) the length of getTypeToTransformTo, which maybe should be refactored with a non-inline part for extended value types. llvm-svn: 43030	2007-10-16 09:56:48 +00:00
Chris Lattner	828830d360	Fix 80 col violation llvm-svn: 42976	2007-10-15 05:30:27 +00:00
Evan Cheng	2e2d6358bc	Change unfoldMemoryOperand(). User is now responsible for passing in the register used by the unfolded instructions. User can also specify whether to unfold the load, the store, or both. llvm-svn: 42946	2007-10-13 02:35:06 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Bill Wendling	19a314f8ac	Fix 80-column violations llvm-svn: 42823	2007-10-10 05:45:59 +00:00
Dan Gohman	7fa473514d	Add explicit keywords. llvm-svn: 42747	2007-10-08 15:08:41 +00:00
Evan Cheng	f536e2f41e	- Added a few target hooks to generate load / store instructions from / to any address (not just from / to frameindexes). - Added target hooks to unfold load / store instructions / SDNodes into separate load, data processing, store instructions / SDNodes. llvm-svn: 42621	2007-10-05 01:32:41 +00:00
Dale Johannesen	ef488c7b0e	Add getABITypeSize, getABITypeSizeInBits llvm-svn: 42488	2007-10-01 16:03:14 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Dan Gohman	50747737a5	TargetAsmInfo::getAddressSize() was incorrect for x86-64 and 64-bit targets other than PPC64. Instead of fixing it, just remove it and fix all the places that use it to use TargetData::getPointerSize() instead, as there aren't very many. Most of the references were in DwarfWriter.cpp. llvm-svn: 42419	2007-09-27 23:12:31 +00:00
Evan Cheng	0f9e307353	- Added MRegisterInfo::getCrossCopyRegClass() hook. For register classes where reg to reg copies are not possible, this returns another register class which registers in the specified register class can be copied to (and copy back from). - X86 copyRegToReg() now supports copying between EFLAGS and GR32 / GR64 registers. llvm-svn: 42372	2007-09-26 21:31:07 +00:00
Evan Cheng	5f9e291240	Allow copyRegToReg to emit cross register classes copies. Tested with "make check"! llvm-svn: 42346	2007-09-26 06:25:56 +00:00
Evan Cheng	6bfaf13dd9	New temporary option -new-cc-modeling-scheme to test the new cc modeling scheme. llvm-svn: 42283	2007-09-25 01:50:04 +00:00
Dan Gohman	b642a1ddf8	Merge hasDotLoc and hasDotFile into hasDotLocAndDotFile since .loc and .file aren't really usable without each other. llvm-svn: 42274	2007-09-24 21:09:53 +00:00
Dale Johannesen	ea6ffa0b36	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Evan Cheng	859720c33b	Add CopyCost to TargetRegisterClass. This specifies the cost of copying a value between two registers in the specific class. llvm-svn: 42123	2007-09-19 01:35:01 +00:00
Evan Cheng	827f269fac	Added getNumDefs(). llvm-svn: 41901	2007-09-13 00:16:29 +00:00
Bill Wendling	bde1aacffa	Add accessor method. llvm-svn: 41854	2007-09-11 17:24:38 +00:00
Bill Wendling	74888e63e7	Add a bool to indicate if we should set the "indirect encoding" bit in the Dwarf information for EH. llvm-svn: 41852	2007-09-11 17:20:55 +00:00
Bill Wendling	3c6a8b2814	The personality function on Darwin needs a global stub. We then refer to that global stub instead of doing the ".set" thingy we were doing before. llvm-svn: 41838	2007-09-11 08:27:17 +00:00
Rafael Espindola	8c57e70f93	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Owen Anderson	4b71e55287	Add lengthof and endof templates that hide a lot of sizeof computations. Patch by Sterling Stein! llvm-svn: 41758	2007-09-07 04:06:50 +00:00
Evan Cheng	58ea935f6f	Add a variant of foldMemoryOperand to fold any load / store, not just load / store from / to stack slots. llvm-svn: 41597	2007-08-30 05:52:20 +00:00
Dale Johannesen	e91a908971	Change LegalFPImmediates to use APFloat. Add APFloat interfaces to ConstantFP, SelectionDAG. Fix integer bit in double->APFloat conversion. Convert LegalizeDAG to use APFloat interface in ConstantFPSDNode uses. llvm-svn: 41587	2007-08-30 00:23:21 +00:00
Dan Gohman	e2c92effea	Add explicit keywords and remove spurious trailing semicolons. llvm-svn: 41482	2007-08-27 14:50:10 +00:00
Chris Lattner	1e089aac3a	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Rafael Espindola	68d95ff2b1	Partial implementation of calling functions with byval arguments: ) The needed information is propagated to the DAG ) The X86-64 backend detects it and aborts llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Reid Spencer	8c5c7c8453	Change casts from old style to new style. This helps document the details better, gives the compiler a chance to validate the cast and reduces warnings if the user turns on -Wold-style-cast option. llvm-svn: 41033	2007-08-12 08:12:35 +00:00
Christopher Lamb	e0c9bd8d2e	Move isSubRegOf into MRegisterInfo. Fix a missed move elimination in LowerSubregs and add more debugging output there. llvm-svn: 41005	2007-08-10 21:11:55 +00:00
Dale Johannesen	a1340d5916	Update per review comments. llvm-svn: 40965	2007-08-09 17:27:48 +00:00
Dale Johannesen	79551baaad	long double 9 of N. This finishes up the X86-32 bits (constants are still not handled). Adds ConvertActions to control fp-to-fp conversions (these are currently defaulted for all other targets, so no changes there). llvm-svn: 40958	2007-08-09 01:04:01 +00:00
Reid Spencer	3e54e72ef1	Fix minor doxygen nits. llvm-svn: 40854	2007-08-05 20:06:04 +00:00

1 2 3 4 5 ...

730 Commits