llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Bruno Cardoso Lopes	77e5c419ec	Better processor definition llvm-svn: 43749	2007-11-06 03:15:20 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Lauro Ramos Venancio	f5081ba980	[ARM] Fix code generation for: static __thread struct { int a; int b; } teste = {0, 0}; llvm-svn: 43722	2007-11-05 18:33:37 +00:00
Evan Cheng	c49995c027	Use movups to spill / restore SSE registers on targets where stacks alignment is less than 16. This is a temporary solution until dynamic stack alignment is implemented. llvm-svn: 43703	2007-11-05 07:30:01 +00:00
Bruno Cardoso Lopes	569b5512b0	Added support for PIC code with "explicit relocations" only. Removed all macro code for PIC (goodbye "la"). Support tested with shootout bench. llvm-svn: 43697	2007-11-05 03:02:32 +00:00
Duncan Sands	d1bdbd010b	Eliminate the remaining uses of getTypeSize. This should only effect x86 when using long double. Now 12/16 bytes are output for long double globals (the exact amount depends on the alignment). This brings globals in line with the rest of LLVM: the space reserved for an object is now always the ABI size. One tricky point is that only 10 bytes should be output for long double if it is a field in a packed struct, which is the reason for the additional argument to EmitGlobalConstant. llvm-svn: 43688	2007-11-05 00:04:43 +00:00
Chris Lattner	8fac63c8b5	Fix PR1761 by not printing (rip) suffix when in -static mode. Evan, please review this. llvm-svn: 43680	2007-11-04 19:23:28 +00:00
Nick Lewycky	36047b0b5b	Fix crash before main on ppc/linux with static constructors. PR1771 llvm-svn: 43676	2007-11-04 17:32:10 +00:00
Chris Lattner	67cd357fb8	Fix PR1763 by allowing the 'q' constraint to work with 64-bit regs on x86-64. llvm-svn: 43669	2007-11-04 06:51:12 +00:00
Evan Cheng	bf8e7c6644	Unbreak tailcall opt. llvm-svn: 43646	2007-11-02 17:45:40 +00:00
Chris Lattner	679e22d547	add a note llvm-svn: 43642	2007-11-02 17:04:20 +00:00
Evan Cheng	b50cc64eb0	Missing a getNumOperands check. llvm-svn: 43630	2007-11-02 01:26:22 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwriten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only effects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Bill Wendling	df2eaa8a55	Silence, accersed warning llvm-svn: 43609	2007-11-01 08:51:44 +00:00
Rafael Espindola	27a8907a7c	Make ARM and X86 LowerMEMCPY identical by moving the isThumb check into getMaxInlineSizeThreshold and by restructuring the X86 version. New I just have to move this to a common place :-) llvm-svn: 43554	2007-10-31 14:39:58 +00:00
Rafael Espindola	fae98471a9	Make ARM an X86 memcpy expansion more similar to each other. Now both subtarget define getMaxInlineSizeThreshold and the expansion uses it. This should not change generated code. llvm-svn: 43552	2007-10-31 11:52:06 +00:00
Dale Johannesen	9bc04ae496	Make i64=expand_vector_elt(v2i64) work in 32-bit mode. llvm-svn: 43535	2007-10-31 00:32:36 +00:00
Dale Johannesen	7167117945	Add missing SSE builtins: CVTPD2PI, CVTPS2PI, CVTTPD2PI, CVTTPS2PI, CVTPI2PD, CVTPI2PS. llvm-svn: 43523	2007-10-30 22:15:38 +00:00
Duncan Sands	f6837e8634	Fix for visibility warnings generated by gcc-4.2. llvm-svn: 43500	2007-10-30 13:14:37 +00:00
Dale Johannesen	461a0c47f8	Add missing MMX PSUBQ. llvm-svn: 43488	2007-10-30 01:18:38 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	1113931fd8	Avoid doing something dumb like rewriting using a 64-bit iv in 32-bit mode. llvm-svn: 43446	2007-10-29 07:57:50 +00:00
Chris Lattner	be8379fac5	add a note. llvm-svn: 43444	2007-10-29 06:19:48 +00:00
Chris Lattner	1503362624	Add support for the x86-64 'q' regigster modifier, and add support for the b/h/w/k/q inline asm memory modifiers, which are just ignored. This fixes PR1748 and CodeGen/X86/2007-10-28-inlineasm-q-modifier.ll llvm-svn: 43430	2007-10-29 03:09:07 +00:00
Chris Lattner	7e3a8a7604	Fix PR1749 and InstCombine/2007-10-28-EmptyField.ll by handling zero-length fields better. llvm-svn: 43427	2007-10-29 02:40:02 +00:00
Evan Cheng	053178440a	New entry. llvm-svn: 43420	2007-10-28 04:01:09 +00:00
Anton Korobeynikov	0d3f43480e	Fix off-by-one stack offset computations (dwarf information) for callee-saved registers in case, when FP pointer was eliminated. This should fixes misc. random EH-related crahses, when stuff is compiled with -fomit-frame-pointer. Thanks Duncan for nailing this bug! llvm-svn: 43381	2007-10-26 09:13:24 +00:00
Eric Christopher	82c77dd85b	clo/clz aren't supported on mips I. Keep them around for when we'll want them later (mips32/64). llvm-svn: 43380	2007-10-26 04:00:13 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Dale Johannesen	0774a9c549	Support non-POSIX hosts by removing use of strncasecmp. llvm-svn: 43364	2007-10-25 21:54:43 +00:00
Dale Johannesen	94241a8d3a	Disable a couple more things for ppcf128. llvm-svn: 43267	2007-10-23 23:20:14 +00:00
Evan Cheng	0590c75f18	Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64. llvm-svn: 43248	2007-10-23 06:42:42 +00:00
Evan Cheng	252d9ddb4d	Fix memcpy lowering when addresses are 4-byte aligned but size is not multiple of 4. llvm-svn: 43234	2007-10-22 22:11:27 +00:00
Dan Gohman	76e104c8ad	Fix the folding of multiplication into addresses on x86, which was broken by the recent {U,S}MUL_LOHI changes. llvm-svn: 43230	2007-10-22 20:22:24 +00:00
Evan Cheng	85eb733eff	Use ptr type in the immediate field of a BxA instruction so we don't end up selecting 32-bit call instruction for ppc64. llvm-svn: 43228	2007-10-22 19:46:19 +00:00
Evan Cheng	ddeab10144	Fix an unfolding bug. llvm-svn: 43212	2007-10-22 03:03:20 +00:00
Dale Johannesen	2edd0fb69d	Allow for copysign having f80 second argument. Fixes 5550319. llvm-svn: 43205	2007-10-21 01:07:44 +00:00
Evan Cheng	b56784f9ea	Resolve unfold tables ambiguity. llvm-svn: 43194	2007-10-19 23:50:58 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Rafael Espindola	c751cbdb02	split LowerMEMCPY into LowerMEMCPYCall and LowerMEMCPYInline in the ARM backend. llvm-svn: 43176	2007-10-19 14:35:17 +00:00
Rafael Espindola	d8d4372845	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	4354f2db6a	comment fixes llvm-svn: 43168	2007-10-19 04:08:28 +00:00
Chris Lattner	57e2fa4ba0	Add an easy microoptimization I noticed. llvm-svn: 43164	2007-10-19 03:29:26 +00:00
Dale Johannesen	b23b0bfa8f	More ppcf128 issues (maybe the last)? llvm-svn: 43160	2007-10-19 00:59:18 +00:00
Evan Cheng	0449186690	- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding. - Fix some copy+paste bugs. llvm-svn: 43153	2007-10-18 22:40:57 +00:00
Evan Cheng	c852780685	Use SmallVectorImpl instead of SmallVector with hardcoded size in MRegister public interface. llvm-svn: 43150	2007-10-18 21:29:24 +00:00
Christopher Lamb	7f21e45b06	Fix a misnamed parameter. llvm-svn: 43145	2007-10-18 19:29:45 +00:00
Christopher Lamb	a26b82ea94	Fix a typo llvm-svn: 43144	2007-10-18 19:28:55 +00:00
Gordon Henriksen	3b309c68d1	Work around downrev gccs which do not inherit visibility of the Registry<>::iterator member class. llvm-svn: 43122	2007-10-18 11:53:05 +00:00
Chris Lattner	374b185092	legalizing the ret operation on f64 shouldn't introduce a new i64 bit convert needlessly. llvm-svn: 43116	2007-10-18 06:17:07 +00:00

1 2 3 4 5 ...

7566 Commits