llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-24 21:42:54 +02:00

Author	SHA1	Message	Date
Chris Lattner	e3f1487574	add a note llvm-svn: 44638	2007-12-05 23:05:06 +00:00
Chris Lattner	011d2aab51	add a note llvm-svn: 44637	2007-12-05 22:58:19 +00:00
Scott Michel	a9a40d4347	Minor updates: - Fix typo in SPUCallingConv.td - Credit myself for CellSPU work - Add CellSPU to 'all' host target list llvm-svn: 44627	2007-12-05 21:23:16 +00:00
Evan Cheng	27986f1ac7	Added canFoldMemoryOperand for PPC. llvm-svn: 44623	2007-12-05 18:41:29 +00:00
Evan Cheng	aecb76bcc2	Update foldMemoryOperand. llvm-svn: 44621	2007-12-05 18:36:37 +00:00
Chris Lattner	0914ad3008	fix warnings llvm-svn: 44620	2007-12-05 18:32:18 +00:00
Chris Lattner	df5cd03710	allow this to build llvm-svn: 44619	2007-12-05 18:30:11 +00:00
Evan Cheng	8464a0bf00	Add an argument to storeRegToStackSlot and storeRegToAddr to specify whether the stored register is killed. llvm-svn: 44600	2007-12-05 03:14:33 +00:00
Scott Michel	871b3a4fd4	More stuff for CellSPU -- this should be enough to get an error-free compilation (no files missing). Test cases remain to be checked in. llvm-svn: 44598	2007-12-05 02:01:41 +00:00
Scott Michel	8a2cb11b05	Updated source file headers to llvm coding standard. llvm-svn: 44597	2007-12-05 01:40:25 +00:00
Scott Michel	026ace10b2	Two missing files. llvm-svn: 44596	2007-12-05 01:31:18 +00:00
Scott Michel	191775d31f	Main CellSPU backend files checked in. Intrinsics and autoconf files remain. llvm-svn: 44595	2007-12-05 01:24:05 +00:00
Scott Michel	512cb025cc	More files in the CellSPU drop... llvm-svn: 44584	2007-12-04 22:35:58 +00:00
Scott Michel	774da2e74c	More of the Cell SPU code drop from "Team Aerospace". llvm-svn: 44582	2007-12-04 22:23:35 +00:00
Scott Michel	3996f647d2	More CellSPU files... more to follow. llvm-svn: 44559	2007-12-03 23:14:43 +00:00
Scott Michel	c312b999e6	Makefile fragment for CellSPU. llvm-svn: 44558	2007-12-03 23:12:49 +00:00
Scott Michel	34987128e0	First commit to CellSPU. More to follow llvm-svn: 44557	2007-12-03 23:09:49 +00:00
Duncan Sands	1e2e4972ff	Rather than having special rules like "intrinsics cannot throw exceptions", just mark intrinsics with the nounwind attribute. Likewise, mark intrinsics as readnone/readonly and get rid of special aliasing logic (which didn't use anything more than this anyway). llvm-svn: 44544	2007-12-03 20:06:50 +00:00
Evan Cheng	58b387dfb0	Remove redundant foldMemoryOperand variants and other code clean up. llvm-svn: 44517	2007-12-02 08:30:39 +00:00
Evan Cheng	79e8b92dc3	Allow some reloads to be folded in multi-use cases. Specifically testl r, r -> cmpl [mem], 0. llvm-svn: 44479	2007-12-01 02:07:52 +00:00
Chris Lattner	906683b821	Work around a GCC bug, producing this code: unsigned char llvm_cbe_X; ... llvm_cbe_X = 0; ((void**)&llvm_cbe_X) = __builtin_stack_save(); instead of: llvm_cbe_X = __builtin_stack_save(); See PR1809 for details. llvm-svn: 44415	2007-11-28 21:26:17 +00:00
Chris Lattner	e59a7ee26a	Implement ExpandOperationResult for ppc i64 fp->int, which fixes CodeGen/Generic/fp_to_int.ll among others. It's unclear why this just started failing... llvm-svn: 44407	2007-11-28 18:44:47 +00:00
Duncan Sands	1b0feb42e2	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Chris Lattner	706eb604ae	several entries got significantly better, though they still aren't done. llvm-svn: 44382	2007-11-27 22:41:52 +00:00
Chris Lattner	d2ee2dad04	implement a trivial readme entry. llvm-svn: 44380	2007-11-27 22:36:16 +00:00
Chris Lattner	5e0cabc90e	Fix a crash on invalid code due to memcpy lowering. llvm-svn: 44378	2007-11-27 22:14:42 +00:00
Nate Begeman	4278967588	Support returning non-power-of-2 vectors to unblock some work llvm-svn: 44371	2007-11-27 19:28:48 +00:00
Andrew Lenharth	6e449dc482	something wrong with this opt llvm-svn: 44370	2007-11-27 18:31:30 +00:00
Duncan Sands	3602011bec	Fix PR1146: parameter attributes are no longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	be0c5a0500	Fix a long standing deficiency in the X86 backend: we would sometimes emit "zero" and "all one" vectors multiple times, for example: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 pcmpeqd %mm0, %mm0 movq %mm0, _M2 ret instead of: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 movq %mm0, _M2 ret This patch fixes this by always arranging for zero/one vectors to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be any random type. This ensures they get trivially CSE'd on the dag. This fix is also important for LegalizeDAGTypes, as it gets unhappy when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when 'i64' isn't legal. This patch makes the following changes: 1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into their canonical types. 2) The now-dead patterns are removed from the SSE/MMX .td files. 3) All the patterns in the .td file that referred to immAllOnesV or immAllZerosV in the wrong form now use *_bc to match them with a bitcast wrapped around them. 4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle bitcast'd zero vectors, which simplifies the code actually. 5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that is legal, instead of generating one that is illegal and expecting a later legalize pass to clean it up. 6) isZeroShuffle is generalized to handle bitcast of zeros. 7) several other minor tweaks. This patch is definite goodness, but has the potential to cause random code quality regressions. Please be on the lookout for these and let me know if they happen. llvm-svn: 44310	2007-11-25 00:24:49 +00:00
Chris Lattner	8a1dfeecab	add a immAllZerosV_bc pattern fragment for consistency with others. llvm-svn: 44303	2007-11-24 19:02:07 +00:00
Chris Lattner	3862759b53	remove bogus assertion that broke CodeGen/Generic/cast-fp.ll on x86 among others. llvm-svn: 44302	2007-11-24 18:37:20 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Chris Lattner	9020367ea0	add a note llvm-svn: 44299	2007-11-24 06:13:33 +00:00
Dale Johannesen	6293438d50	Fix compiler warning. llvm-svn: 44261	2007-11-21 00:45:00 +00:00
Dale Johannesen	8c3541787f	Fix .eh table linkage issues on Darwin. Some EH support for Darwin PPC, but it's not fully working yet. llvm-svn: 44258	2007-11-20 23:24:42 +00:00
Dan Gohman	27ac53cc23	Remove meaningless qualifiers from return types, avoiding compiler warnings. llvm-svn: 44240	2007-11-19 20:46:23 +00:00
Nate Begeman	2a8ef3f29a	Add support for vectors to int <-> float casts. llvm-svn: 44204	2007-11-17 03:58:34 +00:00
Anton Korobeynikov	cd9b16df61	Implement codegen for flt_rounds on x86 llvm-svn: 44183	2007-11-16 01:31:51 +00:00
Evan Cheng	c0dc7b6e61	Oops. Debugging code shouldn't have been checked in. llvm-svn: 44128	2007-11-14 19:08:32 +00:00
Anton Korobeynikov	58298cb9cc	Fix PIC jump table codegen on x86-32/linux. In fact, the same fix should be applied to all targets that use GOT-relative offsets for PIC (Alpha?) llvm-svn: 44108	2007-11-14 09:18:41 +00:00
Duncan Sands	e6821dd990	Eliminate the recently introduced CCAssignToStackABISizeAlign in favour of teaching CCAssignToStack that size 0 and/or align 0 means to use the ABI values. This seems a neater solution. It is safe since no legal value type has size 0. llvm-svn: 44107	2007-11-14 08:29:13 +00:00
Evan Cheng	fd33cb316f	Clean up sub-register implementation by moving subReg information back to MachineOperand auxInfo. The previous clunky implementation used an external map to track sub-register uses. That works because the register allocator uses a new virtual register for each spilled use. With interval splitting (coming soon), we may have multiple uses of the same register, some of which use different sub-registers from others. It's too fragile to constantly update the information. llvm-svn: 44104	2007-11-14 07:59:08 +00:00
Dale Johannesen	70ca3c1f03	Revert previous; these files aren't ready to go in yet. llvm-svn: 44057	2007-11-13 19:16:02 +00:00
Dale Johannesen	5fd9e7a615	Add parameter to getDwarfRegNum to permit targets to use different mappings for EH and debug info; no functional change yet. Fix warning in X86CodeEmitter. llvm-svn: 44056	2007-11-13 19:13:01 +00:00
Evan Cheng	994043f515	Fix x86-64 jit: remove reliance on Dwarf numbers. llvm-svn: 44048	2007-11-13 17:54:34 +00:00
Bill Wendling	934fcd87e7	Unifacalize the CALLSEQ{START,END} stuff. llvm-svn: 44045	2007-11-13 09:19:02 +00:00
Bill Wendling	cc75435ebf	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Anton Korobeynikov	c58fa8584b	Completely forgot, that we have some debug information emission on PPC. This should fix some regressions on ppc nightly tests. llvm-svn: 44029	2007-11-12 23:36:13 +00:00
Bruno Cardoso Lopes	0661f1be90	Added JumpTable support Fixed some AsmPrinter issues Added GLOBAL_OFFSET_TABLE Node handle. llvm-svn: 44024	2007-11-12 19:49:57 +00:00
Owen Anderson	aba398a5ce	Add a flag for indirect branch instructions. Target maintainers: please check that the instructions for your target are correctly marked. llvm-svn: 44012	2007-11-12 07:39:39 +00:00
Anton Korobeynikov	a4eb4336d2	Clarify the meaning of '-2' register number llvm-svn: 43998	2007-11-11 19:53:50 +00:00
Anton Korobeynikov	8e8473c783	Use TableGen to emit information for dwarf register numbers. This makes DwarfRegNum accept a list of numbers instead. Added three different "flavours", but only slightly tested on x86-32/linux. Please check other subtargets if possible. llvm-svn: 43997	2007-11-11 19:50:10 +00:00
Dale Johannesen	2e9b020e89	Add CCAssignToStackABISizeAlign for convenience in dealing with types whose size & alignment are different on different subtargets. Use it for x86 f80. llvm-svn: 43988	2007-11-10 22:07:15 +00:00
Arnold Schwaighofer	64ad6fa1fa	Update tailcall code to include inline attribute operand for memcpy. llvm-svn: 43978	2007-11-10 10:48:01 +00:00
Evan Cheng	946afd2f6c	Unbreak x86-64 jumptable. llvm-svn: 43955	2007-11-09 19:11:23 +00:00
Anton Korobeynikov	dcc6077439	Silence a warning llvm-svn: 43954	2007-11-09 19:06:14 +00:00
Dale Johannesen	eca19e7eca	Revert previous rewrite per chris's comments. llvm-svn: 43950	2007-11-09 18:07:11 +00:00
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Dale Johannesen	8a9ec1582b	Rewrite Dwarf number handling per review comments. llvm-svn: 43918	2007-11-09 00:47:10 +00:00
Lauro Ramos Venancio	d8f2190c19	[ARM] Implement __builtin_thread_pointer. llvm-svn: 43892	2007-11-08 17:20:05 +00:00
Dale Johannesen	b11aca8a92	Complete conditionalization of Dwarf reg numbers. Would somebody not on Darwin please make sure this doesn't break anything. Exception handling failures would be the most likely symptom. llvm-svn: 43844	2007-11-07 21:48:35 +00:00
Dale Johannesen	a863789700	Interchange Dwarf numbers of ESP and EBP on x86 Darwin. Much improvement in exception handling. llvm-svn: 43794	2007-11-07 00:25:05 +00:00
Bruno Cardoso Lopes	77e5c419ec	Better processor definition llvm-svn: 43749	2007-11-06 03:15:20 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Lauro Ramos Venancio	f5081ba980	[ARM] Fix code generation for: static __thread struct { int a; int b; } teste = {0, 0}; llvm-svn: 43722	2007-11-05 18:33:37 +00:00
Evan Cheng	c49995c027	Use movups to spill / restore SSE registers on targets where stacks alignment is less than 16. This is a temporary solution until dynamic stack alignment is implemented. llvm-svn: 43703	2007-11-05 07:30:01 +00:00
Bruno Cardoso Lopes	569b5512b0	Added support for PIC code with "explicit relocations" only. Removed all macro code for PIC (goodbye "la"). Support tested with shootout bench. llvm-svn: 43697	2007-11-05 03:02:32 +00:00
Duncan Sands	d1bdbd010b	Eliminate the remaining uses of getTypeSize. This should only affect x86 when using long double. Now 12/16 bytes are output for long double globals (the exact amount depends on the alignment). This brings globals in line with the rest of LLVM: the space reserved for an object is now always the ABI size. One tricky point is that only 10 bytes should be output for long double if it is a field in a packed struct, which is the reason for the additional argument to EmitGlobalConstant. llvm-svn: 43688	2007-11-05 00:04:43 +00:00
Chris Lattner	8fac63c8b5	Fix PR1761 by not printing (rip) suffix when in -static mode. Evan, please review this. llvm-svn: 43680	2007-11-04 19:23:28 +00:00
Nick Lewycky	36047b0b5b	Fix crash before main on ppc/linux with static constructors. PR1771 llvm-svn: 43676	2007-11-04 17:32:10 +00:00
Chris Lattner	67cd357fb8	Fix PR1763 by allowing the 'q' constraint to work with 64-bit regs on x86-64. llvm-svn: 43669	2007-11-04 06:51:12 +00:00
Evan Cheng	bf8e7c6644	Unbreak tailcall opt. llvm-svn: 43646	2007-11-02 17:45:40 +00:00
Chris Lattner	679e22d547	add a note llvm-svn: 43642	2007-11-02 17:04:20 +00:00
Evan Cheng	b50cc64eb0	Missing a getNumOperands check. llvm-svn: 43630	2007-11-02 01:26:22 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwritten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize. 
This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So allocas and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only affects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Bill Wendling	df2eaa8a55	Silence, accursed warning llvm-svn: 43609	2007-11-01 08:51:44 +00:00
Rafael Espindola	27a8907a7c	Make ARM and X86 LowerMEMCPY identical by moving the isThumb check into getMaxInlineSizeThreshold and by restructuring the X86 version. Now I just have to move this to a common place :-) llvm-svn: 43554	2007-10-31 14:39:58 +00:00
Rafael Espindola	fae98471a9	Make ARM and X86 memcpy expansion more similar to each other. Now both subtargets define getMaxInlineSizeThreshold and the expansion uses it. This should not change generated code. llvm-svn: 43552	2007-10-31 11:52:06 +00:00
Dale Johannesen	9bc04ae496	Make i64=expand_vector_elt(v2i64) work in 32-bit mode. llvm-svn: 43535	2007-10-31 00:32:36 +00:00
Dale Johannesen	7167117945	Add missing SSE builtins: CVTPD2PI, CVTPS2PI, CVTTPD2PI, CVTTPS2PI, CVTPI2PD, CVTPI2PS. llvm-svn: 43523	2007-10-30 22:15:38 +00:00
Duncan Sands	f6837e8634	Fix for visibility warnings generated by gcc-4.2. llvm-svn: 43500	2007-10-30 13:14:37 +00:00
Dale Johannesen	461a0c47f8	Add missing MMX PSUBQ. llvm-svn: 43488	2007-10-30 01:18:38 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	1113931fd8	Avoid doing something dumb like rewriting using a 64-bit iv in 32-bit mode. llvm-svn: 43446	2007-10-29 07:57:50 +00:00
Chris Lattner	be8379fac5	add a note. llvm-svn: 43444	2007-10-29 06:19:48 +00:00
Chris Lattner	1503362624	Add support for the x86-64 'q' register modifier, and add support for the b/h/w/k/q inline asm memory modifiers, which are just ignored. This fixes PR1748 and CodeGen/X86/2007-10-28-inlineasm-q-modifier.ll llvm-svn: 43430	2007-10-29 03:09:07 +00:00
Chris Lattner	7e3a8a7604	Fix PR1749 and InstCombine/2007-10-28-EmptyField.ll by handling zero-length fields better. llvm-svn: 43427	2007-10-29 02:40:02 +00:00
Evan Cheng	053178440a	New entry. llvm-svn: 43420	2007-10-28 04:01:09 +00:00
Anton Korobeynikov	0d3f43480e	Fix off-by-one stack offset computations (dwarf information) for callee-saved registers in the case where the FP pointer was eliminated. This should fix misc. random EH-related crashes when stuff is compiled with -fomit-frame-pointer. Thanks Duncan for nailing this bug! llvm-svn: 43381	2007-10-26 09:13:24 +00:00
Eric Christopher	82c77dd85b	clo/clz aren't supported on mips I. Keep them around for when we'll want them later (mips32/64). llvm-svn: 43380	2007-10-26 04:00:13 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Dale Johannesen	0774a9c549	Support non-POSIX hosts by removing use of strncasecmp. llvm-svn: 43364	2007-10-25 21:54:43 +00:00
Dale Johannesen	94241a8d3a	Disable a couple more things for ppcf128. llvm-svn: 43267	2007-10-23 23:20:14 +00:00
Evan Cheng	0590c75f18	Temporary solution: added a different set of BCTRL_Macho / BCTRL_ELF with right callee-saved defs set for ppc64. llvm-svn: 43248	2007-10-23 06:42:42 +00:00
Evan Cheng	252d9ddb4d	Fix memcpy lowering when addresses are 4-byte aligned but size is not multiple of 4. llvm-svn: 43234	2007-10-22 22:11:27 +00:00
Dan Gohman	76e104c8ad	Fix the folding of multiplication into addresses on x86, which was broken by the recent {U,S}MUL_LOHI changes. llvm-svn: 43230	2007-10-22 20:22:24 +00:00
Evan Cheng	85eb733eff	Use ptr type in the immediate field of a BxA instruction so we don't end up selecting 32-bit call instruction for ppc64. llvm-svn: 43228	2007-10-22 19:46:19 +00:00
Evan Cheng	ddeab10144	Fix an unfolding bug. llvm-svn: 43212	2007-10-22 03:03:20 +00:00
Dale Johannesen	2edd0fb69d	Allow for copysign having f80 second argument. Fixes 5550319. llvm-svn: 43205	2007-10-21 01:07:44 +00:00
Evan Cheng	b56784f9ea	Resolve unfold tables ambiguity. llvm-svn: 43194	2007-10-19 23:50:58 +00:00
Evan Cheng	ded6550885	Local spiller optimization: Turn a store folding instruction into a load folding instruction. e.g. xorl %edi, %eax movl %eax, -32(%ebp) movl -36(%ebp), %eax orl %eax, -32(%ebp) => xorl %edi, %eax orl -36(%ebp), %eax mov %eax, -32(%ebp) This enables the unfolding optimization for a subsequent instruction which will also eliminate the newly introduced store instruction. llvm-svn: 43192	2007-10-19 21:23:22 +00:00
Rafael Espindola	c751cbdb02	split LowerMEMCPY into LowerMEMCPYCall and LowerMEMCPYInline in the ARM backend. llvm-svn: 43176	2007-10-19 14:35:17 +00:00
Rafael Espindola	d8d4372845	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add an "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	4354f2db6a	comment fixes llvm-svn: 43168	2007-10-19 04:08:28 +00:00
Chris Lattner	57e2fa4ba0	Add an easy microoptimization I noticed. llvm-svn: 43164	2007-10-19 03:29:26 +00:00
Dale Johannesen	b23b0bfa8f	More ppcf128 issues (maybe the last)? llvm-svn: 43160	2007-10-19 00:59:18 +00:00
Evan Cheng	0449186690	- Added getOpcodeAfterMemoryUnfold(). It doesn't unfold an instruction, but only returns the opcode of the instruction post unfolding. - Fix some copy+paste bugs. llvm-svn: 43153	2007-10-18 22:40:57 +00:00
Evan Cheng	c852780685	Use SmallVectorImpl instead of SmallVector with hardcoded size in MRegister public interface. llvm-svn: 43150	2007-10-18 21:29:24 +00:00
Christopher Lamb	7f21e45b06	Fix a misnamed parameter. llvm-svn: 43145	2007-10-18 19:29:45 +00:00
Christopher Lamb	a26b82ea94	Fix a typo llvm-svn: 43144	2007-10-18 19:28:55 +00:00
Gordon Henriksen	3b309c68d1	Work around downrev gccs which do not inherit visibility of the Registry<>::iterator member class. llvm-svn: 43122	2007-10-18 11:53:05 +00:00
Chris Lattner	374b185092	legalizing the ret operation on f64 shouldn't introduce a new i64 bit convert needlessly. llvm-svn: 43116	2007-10-18 06:17:07 +00:00
Gordon Henriksen	a6050b38d2	Switching TargetMachineRegistry to use the new generic Registry. llvm-svn: 43094	2007-10-17 21:28:48 +00:00
Chris Lattner	3a19e981f5	Change fp to sint legalization on x86-32 to do 2 x i32 loads instead of 1 x i64 loads. This doesn't change any functionality yet. llvm-svn: 43068	2007-10-17 06:17:29 +00:00
Chris Lattner	ba2d55a564	fix some funny indentation, add comments. llvm-svn: 43066	2007-10-17 06:02:13 +00:00
Dale Johannesen	63411d36bf	Check for invalid cc's in f80 select. llvm-svn: 43033	2007-10-16 18:09:08 +00:00
Chris Lattner	45d9c7aa07	Fix a bug handling frame references in ppc inline asm when the frame offset doesn't fit into 16 bits. llvm-svn: 43032	2007-10-16 18:00:18 +00:00
Arnold Schwaighofer	f0d4d73bf6	Correction to tail call optimization code. The new return address was stored to the actual stack slot before the parameters were lowered to their stack slot. This could cause arguments to be overwritten by the return address if the called function had fewer parameters than the caller function. The update should remove the last failing test case of llc-beta: SPASS. llvm-svn: 43027	2007-10-16 09:05:00 +00:00
Chris Lattner	c641c8c6ec	Change LowerFP_TO_SINT to create the specific code it needs instead of unconditionally creating an i64 bitcast. With the future legalizer design, operation legalization can't introduce new nodes with illegal types. This fixes the rest of olden on ppc32. llvm-svn: 43005	2007-10-15 20:14:52 +00:00
Evan Cheng	f5bcd3d737	LowerFP_TO_SINT must not create a stack object if it's not needed. llvm-svn: 43004	2007-10-15 20:11:21 +00:00
Dale Johannesen	28beae4a4f	Handle PPC long double in CBackend. llvm-svn: 42972	2007-10-15 01:05:37 +00:00
Evan Cheng	90645f30db	Unbreak x86-64. llvm-svn: 42962	2007-10-14 10:09:39 +00:00
Evan Cheng	33df6a6bed	Revert 42908 for now. llvm-svn: 42960	2007-10-14 05:57:21 +00:00
Dale Johannesen	6c89945eb8	Fix type mismatch error in PPC Altivec (only causes a problem when asserts are on). From vecLib. llvm-svn: 42959	2007-10-14 01:58:32 +00:00
Duncan Sands	bf31a19c62	Clarify that fastcc has a problem with nested function trampolines, rather than with nested functions themselves. llvm-svn: 42955	2007-10-13 07:38:37 +00:00
Evan Cheng	2e2d6358bc	Change unfoldMemoryOperand(). User is now responsible for passing in the register used by the unfolded instructions. User can also specify whether to unfold the load, the store, or both. llvm-svn: 42946	2007-10-13 02:35:06 +00:00
Arnold Schwaighofer	50d2c33530	Correcting the corrections. Bad bad baaad emacs! llvm-svn: 42935	2007-10-12 21:53:12 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Duncan Sands	d781ed9d21	Due to the new tail call optimization, trampolines can no longer be created for fastcc functions. llvm-svn: 42925	2007-10-12 19:37:31 +00:00
Evan Cheng	c36fdf163a	Update. llvm-svn: 42922	2007-10-12 18:22:55 +00:00
Dan Gohman	a75e4a62e6	Change the names used for internal labels to use the current function symbol name instead of a codegen-assigned function number. Thanks Evan! :-) llvm-svn: 42908	2007-10-12 14:53:36 +00:00
Dan Gohman	ad3e823efa	Mark vector ctpop, cttz, and ctlz as Expand on x86. llvm-svn: 42905	2007-10-12 14:09:42 +00:00
Dan Gohman	171fb68ae0	Mark vector pow, ctpop, cttz, and ctlz as Expand on PowerPC. llvm-svn: 42904	2007-10-12 14:08:57 +00:00
Evan Cheng	c7b7a3cb74	Fold load / store into MOV32to32_ and MOV16to16_. llvm-svn: 42895	2007-10-12 08:38:01 +00:00
Evan Cheng	f1ead16fd5	Flag MOV32to32_ with EXTRACT_SUBREG. They should not be scheduled apart. llvm-svn: 42894	2007-10-12 07:55:53 +00:00
Dan Gohman	edc841fb53	Set ISD::FPOW to Expand. llvm-svn: 42881	2007-10-11 23:21:31 +00:00
Dale Johannesen	9486be1cf2	Add missing argument to PALIGNR llvm-svn: 42874	2007-10-11 20:58:37 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Chris Lattner	88dcf28e45	Fix CodeGen/Generic/BasicInstrs.llx on sparc by marking divrem illegal. Thanks to gabor for pointing this out! llvm-svn: 42832	2007-10-10 18:10:57 +00:00
Dale Johannesen	76458ddf1e	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Dan Gohman	6c3e0cdd36	LowerIntegerDivOrRem no longer exists. llvm-svn: 42787	2007-10-09 15:45:13 +00:00
Dan Gohman	cc317de0f5	Fix grammar in a comment. llvm-svn: 42786	2007-10-09 15:44:37 +00:00
Dan Gohman	9546d48e97	This is done. llvm-svn: 42785	2007-10-09 15:42:21 +00:00
Evan Cheng	c00dbfc5bc	Under 64-bit mode use LEA64_32r instead of LEA64r to save a byte. llvm-svn: 42783	2007-10-09 07:14:53 +00:00
Bruno Cardoso Lopes	257d5a5127	Position Independent Code (PIC) support [3] llvm-svn: 42780	2007-10-09 03:15:11 +00:00
Bruno Cardoso Lopes	627ba10946	Position Independent Code (PIC) support [2] - Added a function to hold the stack location where GP must be stored during LowerCALL - AsmPrinter now emits directives based on relocation type - PIC_ set to default relocation type (same as GCC) llvm-svn: 42779	2007-10-09 03:01:19 +00:00
Bruno Cardoso Lopes	3a48664e98	Position Independent Code (PIC) support [1] - Modified instruction format to handle pseudo instructions - Added LoadAddr SDNode to load symbols. llvm-svn: 42778	2007-10-09 02:55:31 +00:00
Evan Cheng	90aa032f98	Bug fix. X86 was emitting redundant setcc and test instructions before a conditional move. llvm-svn: 42774	2007-10-08 22:16:29 +00:00
Dan Gohman	6df332f0cb	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_LOHI. Move the lowering code associated with these operators into target-independent code in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Evan Cheng	090bfbebd1	Allow x86 compare to be commutable by default. llvm-svn: 42761	2007-10-08 18:27:46 +00:00
Dan Gohman	ae3b47b06f	When we start enabling SMUL_LOHI/UMUL_LOHI or SDIVREM/UDIVREM in target-independent lowering, don't use them on PowerPC. llvm-svn: 42755	2007-10-08 17:28:24 +00:00
Dan Gohman	d7c8b98426	Simplify getIntPtrType, allowing it to work for arbitrary pointer sizes. llvm-svn: 42751	2007-10-08 15:16:25 +00:00
Chris Lattner	fcccf4b6c4	disable this entirely: it is causing use of invalidated iterators and infinite looping. llvm-svn: 42739	2007-10-07 22:00:31 +00:00
Chris Lattner	39dbb82db2	Fix many regressions on x86 by avoiding dereferencing the end iterator. llvm-svn: 42738	2007-10-07 21:53:12 +00:00
Anton Korobeynikov	54ecd77023	Oops, I really wanted to commit this part also :) llvm-svn: 42700	2007-10-06 16:39:43 +00:00
Anton Korobeynikov	34fefcf678	Move merge code into new helper function. llvm-svn: 42699	2007-10-06 16:17:49 +00:00
Evan Cheng	dc95020e30	Added DAG xforms. e.g. (vextract (v4f32 s2v (f32 load $addr)), 0) -> (f32 load $addr) (vextract (v4i32 bc (v4f32 s2v (f32 load $addr))), 0) -> (i32 load $addr) Remove x86 specific patterns. llvm-svn: 42677	2007-10-06 02:46:29 +00:00
Dale Johannesen	9b7ac95116	Next powerpc long double bits. Comparisons work, although not well, and shortening FP converts. llvm-svn: 42672	2007-10-06 01:24:11 +00:00
Evan Cheng	9af50ee6ef	Commute x86 cmove instructions by swapping the operands and change the condition to its inverse. Testing this as llcbeta llvm-svn: 42661	2007-10-05 23:13:21 +00:00
Evan Cheng	e0e36e4a0e	This is done. llvm-svn: 42656	2007-10-05 22:34:59 +00:00
Evan Cheng	dc467c6323	Enable convertToThreeAddress for X86 by default. llvm-svn: 42655	2007-10-05 22:31:10 +00:00
Evan Cheng	2b3122e56e	INC64_32r -> LEA64_32r is better than INC64_32r -> LEA32r, but it still can cause performance degradation. llvm-svn: 42653	2007-10-05 21:55:32 +00:00
Evan Cheng	688f34a273	In 64-bit mode, avoid using leal with 32-bit address size, e.g. leal 1(%ecx), %edi, which requires 67H prefix. llvm-svn: 42647	2007-10-05 20:34:26 +00:00
Dale Johannesen	c7b51b678d	First round of ppc long double. call/return and basic arithmetic works. Rename RTLIB long double functions to distinguish different flavors of long double; the lib functions have different names, alas. llvm-svn: 42644	2007-10-05 20:04:43 +00:00
Evan Cheng	b069dd6a25	Add support to convert more 64-bit instructions to 3-address instructions. llvm-svn: 42642	2007-10-05 18:20:36 +00:00
Evan Cheng	f658191412	ADC and SBB use EFLAGS. llvm-svn: 42640	2007-10-05 17:59:57 +00:00
Dan Gohman	821635b63f	Change a few more spaces to tabs in assembly output. llvm-svn: 42638	2007-10-05 15:58:41 +00:00
Dan Gohman	950f96e456	Change a space to a tab in the assembly output of a .globl directive for consistency. llvm-svn: 42637	2007-10-05 15:54:58 +00:00
Evan Cheng	4e46ad06fe	Testing convertToThreeAddress as X86 llcbeta. llvm-svn: 42630	2007-10-05 08:04:01 +00:00
Evan Cheng	6e5205d379	Added storeRegToAddr, loadRegFromAddr, and unfoldMemoryOperand's. llvm-svn: 42624	2007-10-05 01:34:55 +00:00
Evan Cheng	32766d3518	Not needed any more. llvm-svn: 42623	2007-10-05 01:34:14 +00:00
Evan Cheng	d2ef8c689e	Forgot these. llvm-svn: 42622	2007-10-05 01:33:45 +00:00
Evan Cheng	f536e2f41e	- Added a few target hooks to generate load / store instructions from / to any address (not just from / to frameindexes). - Added target hooks to unfold load / store instructions / SDNodes into separate load, data processing, store instructions / SDNodes. llvm-svn: 42621	2007-10-05 01:32:41 +00:00
Chris Lattner	4224151a44	add a note. llvm-svn: 42607	2007-10-04 15:47:27 +00:00
Dan Gohman	30ba45b569	Use empty() member functions when that's what's being tested for instead of comparing begin() and end(). llvm-svn: 42585	2007-10-03 19:26:29 +00:00
Chris Lattner	a31fa80185	add a note llvm-svn: 42579	2007-10-03 17:10:03 +00:00
Chris Lattner	17feaa781c	add a note llvm-svn: 42573	2007-10-03 06:10:59 +00:00
Chris Lattner	dfcb750656	Bill's example is still not enough to repro this, but it has other issues that seem significant as well. llvm-svn: 42564	2007-10-03 03:40:24 +00:00
Bill Wendling	c5fbf331ff	Another micro-opt. llvm-svn: 42554	2007-10-02 21:49:31 +00:00
Bill Wendling	c4a53b617f	Another missed optimization with LICM. llvm-svn: 42552	2007-10-02 21:43:06 +00:00
Bill Wendling	36f033e53e	Small label changes. llvm-svn: 42549	2007-10-02 21:02:53 +00:00
Bill Wendling	a7d5c36215	Now with source code. llvm-svn: 42548	2007-10-02 21:01:16 +00:00
Bill Wendling	0159f0c5ba	Now with LL code! llvm-svn: 42547	2007-10-02 20:54:32 +00:00
Bill Wendling	48c27bf598	Another missed optimization. llvm-svn: 42546	2007-10-02 20:42:59 +00:00
Bill Wendling	5e50716a6b	Micro-optimization -- missed LICM opportunity. llvm-svn: 42542	2007-10-02 19:55:05 +00:00
Dale Johannesen	a4e3643cb3	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Evan Cheng	3537dbbd1e	Refactor code to add load / store folded instructions -> register only instructions reverse map. llvm-svn: 42509	2007-10-01 23:44:33 +00:00
Evan Cheng	c863779cd4	Typo. X86comi doesn't read / write chain's. llvm-svn: 42492	2007-10-01 18:12:48 +00:00
Dale Johannesen	ef488c7b0e	Add getABITypeSize, getABITypeSizeInBits llvm-svn: 42488	2007-10-01 16:03:14 +00:00
Gordon Henriksen	9b5a117d01	AsmPrinters overriding getAnalysisUsage should call super. And not super's super, either. llvm-svn: 42482	2007-09-30 13:39:29 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Rafael Espindola	01b306e575	Refactor the memcpy lowering for the x86 target. The only generated code difference is that now we call memcpy when the size of the array is unknown. This matches GCC behavior and is better since the run time value can be arbitrarily large. llvm-svn: 42433	2007-09-28 12:53:01 +00:00
Evan Cheng	c2acb6f2e5	Stop inventing new words. :-) llvm-svn: 42429	2007-09-28 01:35:02 +00:00
Evan Cheng	d3ff9d3ff7	Pessimistically assume ADJCALLSTACKDOWN / ADJCALLSTACKUP (which become sub / add) clobber EFLAGS. llvm-svn: 42426	2007-09-28 01:19:48 +00:00
Dan Gohman	50747737a5	TargetAsmInfo::getAddressSize() was incorrect for x86-64 and 64-bit targets other than PPC64. Instead of fixing it, just remove it and fix all the places that use it to use TargetData::getPointerSize() instead, as there aren't very many. Most of the references were in DwarfWriter.cpp. llvm-svn: 42419	2007-09-27 23:12:31 +00:00
Evan Cheng	d8ab90ae1f	Use GR64 in 64-bit mode. llvm-svn: 42417	2007-09-27 21:50:05 +00:00
Evan Cheng	826f0f94df	Doh. Calls clobber EFLAGS. llvm-svn: 42413	2007-09-27 19:01:55 +00:00
Dale Johannesen	089d2e760f	Make temporaries explicit to avoid premature destruction of compiler-created ones. llvm-svn: 42383	2007-09-26 23:20:33 +00:00
Evan Cheng	29817845b3	- Move getPhysicalRegisterRegClass() from ScheduleDAG to MRegisterInfo. - Added ability to emit cross class register copies to the BBRU scheduler. - More aggressive backtracking. llvm-svn: 42375	2007-09-26 21:36:17 +00:00

7779 Commits