llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Dan Gohman	65be3b6502	Add explicit keywords. llvm-svn: 43464	2007-10-29 19:52:04 +00:00
Duncan Sands	b494fb97a4	The guaranteed alignment of ptr+offset is only the minimum of of offset and the alignment of ptr if these are both powers of 2. While the ptr alignment is guaranteed to be a power of 2, there is no reason to think that offset is. For example, if offset is 12 (the size of a long double on x86-32 linux) and the alignment of ptr is 8, then the alignment of ptr+offset will in general be 4, not 8. Introduce a function MinAlign, lifted from gcc, for computing the minimum guaranteed alignment. I've tried to fix up everywhere under lib/CodeGen/SelectionDAG/. I also changed some places that weren't wrong (because both values were a power of 2), as a defensive change against people copying and pasting the code. Hopefully someone who cares about alignment will review the rest of LLVM and fix up the remaining places. Since I'm on x86 I'm not very motivated to do this myself... llvm-svn: 43421	2007-10-28 12:59:45 +00:00
Bill Wendling	8d329ff809	- Remove the hacky code that forces a memcpy. Alignment is taken care of in the FE. - Explicitly pass in the alignment of the load & store. - XFAIL 2007-10-23-UnalignedMemcpy.ll because llc has a bug that crashes on unaligned pointers. llvm-svn: 43398	2007-10-26 20:24:42 +00:00
Duncan Sands	fcfc9fdd5c	Small formatting changes. Add a sanity check. Use NVT rather than looking it up, since we have it to hand. llvm-svn: 43341	2007-10-25 12:35:51 +00:00
Duncan Sands	15f9f7d669	Promote SETCC operands. llvm-svn: 43340	2007-10-25 12:32:31 +00:00
Duncan Sands	28582a76eb	Correctly extract the ValueType from a VTSDNode. llvm-svn: 43339	2007-10-25 12:30:51 +00:00
Dale Johannesen	53ca1384b0	Another expansion for i64 multiply, suitable for PPC. llvm-svn: 43314	2007-10-24 22:26:08 +00:00
Bill Wendling	e5f534148e	Fix comment and use the "Size" variable that's already provided. llvm-svn: 43271	2007-10-23 23:36:57 +00:00
Bill Wendling	a420d660c8	If there's an unaligned memcpy to/from the stack, don't lower it. Just call the memcpy library function instead. llvm-svn: 43270	2007-10-23 23:32:40 +00:00
Bill Wendling	34950e1291	This broke lots. Reverting. llvm-svn: 43264	2007-10-23 22:04:26 +00:00
Bill Wendling	34c16a1b2d	Lowering a memcpy to the stack is killing PPC. The ARM and X86 backends already have their own custom memcpy lowering code. This code needs to be factored out into a target-independent lowering method with hooks to the backend. In the meantime, just call memcpy if we're trying to copy onto a stack. llvm-svn: 43262	2007-10-23 21:30:25 +00:00
Duncan Sands	b47c73b341	Support for expanding extending loads of integers with funky bit-widths. llvm-svn: 43225	2007-10-22 19:00:05 +00:00
Duncan Sands	4df76bb946	Fix up the logic for result expanding the various extension operations so they work right for integers with funky bit-widths. For example, consider extending i48 to i64 on a 32 bit machine. The i64 result is expanded to 2 x i32. We know that the i48 operand will be promoted to i64, then also expanded to 2 x i32. If we had the expanded promoted operand to hand, then expanding the result would be trivial. Unfortunately at this stage we can only get hold of the promoted operand. So instead we kind of hand-expand, doing explicit shifting and truncating to get the top and bottom halves of the i64 operand into 2 x i32, which are then used to expand the result. This is harmless, because when the promoted operand is finally expanded all this bit fiddling turns into trivial operations which are eliminated either by the expansion code itself or the DAG combiner. llvm-svn: 43223	2007-10-22 18:26:21 +00:00
Chris Lattner	34bb3728ff	Add promote operand support for [su]int_to_fp. llvm-svn: 43204	2007-10-20 22:57:56 +00:00
Chris Lattner	1c4c6a384e	Add result promotion of FP_TO_*INT, fixing CodeGen/X86/trunc-to-bool.ll with the new legalizer. llvm-svn: 43199	2007-10-20 04:32:38 +00:00
Chris Lattner	aa6d58c766	simplify some code. llvm-svn: 43198	2007-10-20 04:09:48 +00:00
Chris Lattner	70abd7943f	Implement promote and expand for operands of memcpy and friends. This fixes CodeGen/X86/mem*.ll. llvm-svn: 43197	2007-10-20 04:07:07 +00:00
Dale Johannesen	f28404f7e8	Fix a few places vector operations were not getting the operand's type from the right place. llvm-svn: 43195	2007-10-20 00:07:52 +00:00
Duncan Sands	4dcd783a69	Add support for a few more nodes. llvm-svn: 43190	2007-10-19 20:29:48 +00:00
Dale Johannesen	4ae755d15c	Redo "last ppc long double fix" as Chris wants. llvm-svn: 43189	2007-10-19 20:29:00 +00:00
Chris Lattner	8c40f019c3	Fix a really nasty vector miscompilation bill recently introduced. llvm-svn: 43181	2007-10-19 16:47:35 +00:00
Chris Lattner	45b8558ec5	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Duncan Sands	1d41485be4	Support for expanding ADDE and SUBE. llvm-svn: 43175	2007-10-19 13:06:17 +00:00
Duncan Sands	1bc7997ce7	If the value types are equal then this routine asserts in later checks rather than producing the ordinary load it is supposed to. Avoid all such hassles by directly returning an ordinary load in this case. llvm-svn: 43174	2007-10-19 13:05:40 +00:00
Rafael Espindola	d8d4372845	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	d459e119ba	Implement a few new operations. llvm-svn: 43171	2007-10-19 04:46:45 +00:00
Chris Lattner	fb5bc2fee1	Implement expansion of SINT_TO_FP and UINT_TO_FP operands. llvm-svn: 43170	2007-10-19 04:32:47 +00:00
Chris Lattner	890221835b	implement support for custom expansion of any node type, in one place. llvm-svn: 43169	2007-10-19 04:14:36 +00:00
Chris Lattner	e066099b95	Make use of TLI.ExpandOperation, remove softfloat stuff. llvm-svn: 43167	2007-10-19 03:58:25 +00:00
Chris Lattner	a4505cae9f	add expand support for bit_convert result, even allowing custom expansion. llvm-svn: 43166	2007-10-19 03:33:14 +00:00
Chris Lattner	f02434cdaf	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Bill Wendling	84baa3a5b5	Negative indices aren't allowed here. llvm-svn: 43161	2007-10-19 01:10:49 +00:00
Dale Johannesen	b23b0bfa8f	More ppcf128 issues (maybe the last)? llvm-svn: 43160	2007-10-19 00:59:18 +00:00
Bill Wendling	32c9cd9e94	Pointer arithmetic should be done with the index the same size as the pointer. llvm-svn: 43120	2007-10-18 08:32:37 +00:00
Duncan Sands	68026c73d6	Support for ADDC/SUBC. llvm-svn: 43119	2007-10-18 08:22:16 +00:00
Dan Gohman	2903f7fc26	Add support for ISD::SELECT in SplitVectorOp. llvm-svn: 43072	2007-10-17 14:48:28 +00:00
Duncan Sands	0a5a15c3a0	Return Expand from getOperationAction for all extended types. This is needed for SIGN_EXTEND_INREG at least. It is not clear if this is correct for other operations. On the other hand, for the various load/store actions it seems to correct to return the type action, as is currently done. Also, it seems that SelectionDAG::getValueType can be called for extended value types; introduce a map for holding these, since we don't really want to extend the vector to be 2^32 pointers long! Generalize DAGTypeLegalizer::PromoteResult_TRUNCATE and DAGTypeLegalizer::PromoteResult_INT_EXTEND to handle the various funky possibilities that apints introduce, for example that you can promote to a type that needs to be expanded. llvm-svn: 43071	2007-10-17 13:49:58 +00:00
Dale Johannesen	fdb488d4b5	Disable attempts to constant fold PPC f128. Remove the assumption that this will happen from various places. llvm-svn: 43053	2007-10-16 23:38:29 +00:00
Duncan Sands	9d622a6de1	Initial infrastructure for arbitrary precision integer codegen support. This should have no effect on codegen for other types. Debatable bits: (1) the use (abuse?) of a set in SDNode::getValueTypeList; (2) the length of getTypeToTransformTo, which maybe should be refactored with a non-inline part for extended value types. llvm-svn: 43030	2007-10-16 09:56:48 +00:00
Duncan Sands	12d0747c70	Fixes due to lack of type-safety for ValueType: (1) ValueType being passed instead of an opcode; (2) ValueType being passed for isVolatile (!) in getLoad. llvm-svn: 43028	2007-10-16 09:07:20 +00:00
Chris Lattner	427c187f46	implement promotion of select and select_cc, allowing MallocBench/gs to work with type promotion on x86. llvm-svn: 43025	2007-10-16 03:00:22 +00:00
Evan Cheng	3650ad3e51	Make CalcLatency() non-recursive. llvm-svn: 43017	2007-10-15 21:33:22 +00:00
Chris Lattner	542fa12f9a	Move CreateStackTemporary out to SelectionDAG llvm-svn: 42995	2007-10-15 17:48:57 +00:00
Chris Lattner	292bebbb6f	add a new CreateStackTemporary helper method. llvm-svn: 42994	2007-10-15 17:47:20 +00:00
Chris Lattner	2a93fc3ffb	implement promotion of BR_CC operands, fixing bisort on ppc. llvm-svn: 42992	2007-10-15 17:16:12 +00:00
Chris Lattner	a2ebe2c07b	updates from duncan llvm-svn: 42991	2007-10-15 16:46:29 +00:00
Duncan Sands	14abc66cfe	Fix some typos. Call getTypeToTransformTo rather than getTypeToExpandTo. The difference is that getTypeToExpandTo gives the final result of expansion (eg: i128 -> i32 on a 32 bit machine) while getTypeToTransformTo does just one step (i128 -> i64). llvm-svn: 42982	2007-10-15 13:30:18 +00:00
Chris Lattner	452ebc199e	One mundane change: Change ReplaceAllUsesOfValueWith to optionally take a deleted nodes vector, instead of requiring it. One more significant change: Implement the start of a legalizer that just works on types. This legalizer is designed to run before the operation legalizer and ensure just that the input dag is transformed into an output dag whose operand and result types are all legal, even if the operations on those types are not. This design/impl has the following advantages: 1. When finished, this will significantly reduce the amount of code in LegalizeDAG.cpp. It will remove all the code related to promotion and expansion as well as splitting and scalarizing vectors. 2. The new code is very simple, idiomatic, and modular: unlike LegalizeDAG.cpp, it has no 3000 line long functions. :) 3. The implementation is completely iterative instead of recursive, good for hacking on large dags without blowing out your stack. 4. The implementation updates nodes in place when possible instead of deallocating and reallocating the entire graph that points to some mutated node. 5. The code nicely separates out handling of operations with invalid results from operations with invalid operands, making some cases simpler and easier to understand. 6. The new -debug-only=legalize-types option is very very handy :), allowing you to easily understand what legalize types is doing. This is not yet done. Until the ifdef added to SelectionDAGISel.cpp is enabled, this does nothing. However, this code is sufficient to legalize all of the code in 186.crafty, olden and freebench on an x86 machine. The biggest issues are: 1. Vectors aren't implemented at all yet 2. SoftFP is a mess, I need to talk to Evan about it. 3. No lowering to libcalls is implemented yet. 4. Various operations are missing etc. 5. There are FIXME's for stuff I hax0r'd out, like softfp. Hey, at least it is a step in the right direction :). If you'd like to help, just enable the #ifdef in SelectionDAGISel.cpp and compile code with it. If this explodes it will tell you what needs to be implemented. Help is certainly appreciated. Once this goes in, we can do three things: 1. Add a new pass of dag combine between the "type legalizer" and "operation legalizer" passes. This will let us catch some long-standing isel issues that we miss because operation legalization often obfuscates the dag with target-specific nodes. 2. We can rip out all of the type legalization code from LegalizeDAG.cpp, making it much smaller and simpler. When that happens we can then reimplement the core functionality left in it in a much more efficient and non-recursive way. 3. Once the whole legalizer is non-recursive, we can implement whole-function selectiondags maybe... llvm-svn: 42981	2007-10-15 06:10:22 +00:00
Chris Lattner	d987dc41fd	One xform performed by LegalizeDAG is transformation of "store of fp" to "store of int". Make two changes: 1) only xform "store of f32" if i32 is a legal type for the target. 2) only xform "store of f64" if either i64 or i32 are legal for the target. 3) if i64 isn't legal, manually lower to 2 stores of i32 instead of letting a later pass of legalize do it. This is ugly, but helps future changes I'm about to commit. llvm-svn: 42980	2007-10-15 05:46:06 +00:00
Chris Lattner	0841469ff0	Add a (disabled by default) way to view the ID of a node. llvm-svn: 42978	2007-10-15 05:32:43 +00:00
Chris Lattner	95def2ca5a	remove misleading comment. llvm-svn: 42970	2007-10-14 20:35:12 +00:00
Chris Lattner	ac17bf4c0f	If a target doesn't have HasMULHU or HasUMUL_LOHI, ExpandOp would return without lo/hi set. Fall through to making a libcall instead. llvm-svn: 42969	2007-10-14 18:35:05 +00:00
Dale Johannesen	4cbce377c6	Disable some compile-time optimizations on PPC long double. llvm-svn: 42958	2007-10-14 01:56:47 +00:00
Chris Lattner	c146f449b5	Enhance the truncstore optimization code to handle shifted values and propagate demanded bits through them in simple cases. This allows this code: void foo(char *P) { strcpy(P, "abc"); } to compile to: _foo: ldrb r3, [r1] ldrb r2, [r1, #+1] ldrb r12, [r1, #+2]! ldrb r1, [r1, #+1] strb r1, [r0, #+3] strb r2, [r0, #+1] strb r12, [r0, #+2] strb r3, [r0] bx lr instead of: _foo: ldrb r3, [r1, #+3] ldrb r2, [r1, #+2] orr r3, r2, r3, lsl #8 ldrb r2, [r1, #+1] ldrb r1, [r1] orr r2, r1, r2, lsl #8 orr r3, r2, r3, lsl #16 strb r3, [r0] mov r2, r3, lsr #24 strb r2, [r0, #+3] mov r2, r3, lsr #16 strb r2, [r0, #+2] mov r3, r3, lsr #8 strb r3, [r0, #+1] bx lr testcase here: test/CodeGen/ARM/truncstore-dag-combine.ll This also helps occasionally for X86 and other cases not involving unaligned load/stores. llvm-svn: 42954	2007-10-13 06:58:48 +00:00
Chris Lattner	133baf6012	Add a simple optimization to simplify the input to truncate and truncstore instructions, based on the knowledge that they don't demand the top bits. llvm-svn: 42952	2007-10-13 06:35:54 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Dale Johannesen	2f62d5da32	ppc long double. Implement fabs and fneg. llvm-svn: 42924	2007-10-12 19:02:17 +00:00
Dale Johannesen	296cc4ca22	Implement i64->ppcf128 conversions. llvm-svn: 42919	2007-10-12 17:52:03 +00:00
Dan Gohman	330b7915da	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	b0b156e238	Add support to SplitVectorOp for powi, where the second operand is a scalar integer. llvm-svn: 42906	2007-10-12 14:13:46 +00:00
Evan Cheng	d11cd4a095	EXTRACT_SUBREG coalescing support. The coalescer now treats EXTRACT_SUBREG like (almost) a register copy. However, it always coalesced to the register of the RHS (the super-register). All uses of the result of a EXTRACT_SUBREG are sub- register uses which adds subtle complications to load folding, spiller rewrite, etc. llvm-svn: 42899	2007-10-12 08:50:34 +00:00
Dale Johannesen	4ade2701e0	PPC long double. Implement a couple more conversions. llvm-svn: 42888	2007-10-12 01:37:08 +00:00
Dan Gohman	ab5c3ed0d1	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Dan Gohman	b9863c4738	Codegen support for vector intrinsics. Factor out the code that expands the "nasty scalar code" for unrolling vectors into a separate routine, teach it how to handle mixed vector/scalar operands, as seen in powi, and use it for several operators, including sin, cos, powi, and pow. Add support in SplitVectorOp for fpow, fpowi and for several unary operators. llvm-svn: 42884	2007-10-11 23:57:53 +00:00
Dale Johannesen	781403c410	Implement ppc long double->uint conversion. Make ppc long double constants print. llvm-svn: 42882	2007-10-11 23:32:15 +00:00
Dan Gohman	f8ab690988	Add runtime library names for pow. llvm-svn: 42880	2007-10-11 23:09:10 +00:00
Dan Gohman	bc5fc4f519	Add an ISD::FPOW node type. llvm-svn: 42879	2007-10-11 23:06:37 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dale Johannesen	0ee2a2fb59	Next PPC long double bits. First cut at constants. No compile-time support for constant operations yet, just format transformations. Make readers and writers work. Split constants into 2 doubles in Legalize. llvm-svn: 42865	2007-10-11 18:07:22 +00:00
Duncan Sands	a8baeb6dab	Correct swapped arguments to getConstant. llvm-svn: 42824	2007-10-10 09:54:50 +00:00
Dale Johannesen	76458ddf1e	Next PPC long double bits: ppcf128->i32 conversion. Surprisingly complicated. Adds getTargetNode for 2 outputs, no inputs (missing). llvm-svn: 42822	2007-10-10 01:01:31 +00:00
Dan Gohman	6df332f0cb	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Dan Gohman	6f2a2b45fa	DAGCombiner support for UDIVREM/SDIVREM and UMUL_LOHI/SMUL_LOHI. Check if one of the two results unneeded so see if a simpler operator could bs used. Also check to see if each of the two computations could be simplified if they were split into separate operators. Factor out the code that calls visit() so that it can be used for this purpose. llvm-svn: 42759	2007-10-08 17:57:15 +00:00
Dan Gohman	9ee1c4eee7	Add convenience overloads of SelectionDAG::getNode that take a SDVTList and individual SDOperand operands. llvm-svn: 42753	2007-10-08 15:49:58 +00:00
Dan Gohman	58512cb6e2	In -debug mode, dump SelectionDAGs both before and after the optimization passes. llvm-svn: 42749	2007-10-08 15:12:17 +00:00
Neil Booth	6e01c5df37	convertFromInteger, as originally written, expected sign-extended input. APInt unfortunately zero-extends signed integers, so Dale modified the function to expect zero-extended input. Make this assumption explicit in the function name. llvm-svn: 42732	2007-10-07 11:45:55 +00:00
Evan Cheng	0eed7948ce	Reapply 42677. llvm-svn: 42692	2007-10-06 08:19:55 +00:00
Chris Lattner	63443a5bc3	revert evan's patch until the header is committed llvm-svn: 42686	2007-10-06 06:08:17 +00:00
Evan Cheng	dc95020e30	Added DAG xforms. e.g. (vextract (v4f32 s2v (f32 load $addr)), 0) -> (f32 load $addr) (vextract (v4i32 bc (v4f32 s2v (f32 load $addr))), 0) -> (i32 load $addr) Remove x86 specific patterns. llvm-svn: 42677	2007-10-06 02:46:29 +00:00
Dale Johannesen	9b7ac95116	Next powerpc long double bits. Comparisons work, although not well, and shortening FP converts. llvm-svn: 42672	2007-10-06 01:24:11 +00:00
Dale Johannesen	c7b51b678d	First round of ppc long double. call/return and basic arithmetic works. Rename RTLIB long double functions to distinguish different flavors of long double; the lib functions have different names, alas. llvm-svn: 42644	2007-10-05 20:04:43 +00:00
Dan Gohman	06e6956edb	Legalize support for MUL_LOHI and DIVREM. llvm-svn: 42636	2007-10-05 14:17:22 +00:00
Dan Gohman	6132098d91	Fix a typo in a comment. llvm-svn: 42635	2007-10-05 14:11:58 +00:00
Dan Gohman	dab5fa8685	Provide names for MUL_LOHI and DIVREM operators. llvm-svn: 42634	2007-10-05 14:11:04 +00:00
Evan Cheng	ed7614b4d6	Chain producing nodes cannot be moved, not chain reading nodes. llvm-svn: 42627	2007-10-05 01:42:35 +00:00
Evan Cheng	20239e9725	Oops. Didn't mean to leave this in. llvm-svn: 42626	2007-10-05 01:39:40 +00:00
Evan Cheng	de07843bf3	If a node that defines a physical register that is expensive to copy. The scheduler will try a number of tricks in order to avoid generating the copies. This may not be possible in case the node produces a chain value that prevent movement. Try unfolding the load from the node before to allow it to be moved / cloned. llvm-svn: 42625	2007-10-05 01:39:18 +00:00
Evan Cheng	5ae84c0fa6	Add a variant of getTargetNode() that takes a vector of MVT::ValueType. llvm-svn: 42620	2007-10-05 01:10:49 +00:00
Evan Cheng	ec6d5655c6	Silence a warning. llvm-svn: 42619	2007-10-05 01:09:32 +00:00
Dan Gohman	30ba45b569	Use empty() member functions when that's what's being tested for instead of comparing begin() and end(). llvm-svn: 42585	2007-10-03 19:26:29 +00:00
Dale Johannesen	a4e3643cb3	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Dale Johannesen	d94f00234f	Fix stride computations for long double arrays. llvm-svn: 42508	2007-10-01 23:08:35 +00:00
Evan Cheng	d1a77589e9	Remove simple scheduler. llvm-svn: 42499	2007-10-01 20:44:07 +00:00
Dale Johannesen	aa97ad6250	remove dup comment llvm-svn: 42486	2007-09-30 19:08:12 +00:00
Dale Johannesen	2a0b2ab2fc	Constant fold int-to-long-double conversions; use APFloat for int-to-float/double; use round-to-nearest for these (implementation-defined, seems to match gcc). llvm-svn: 42484	2007-09-30 18:19:03 +00:00
Dan Gohman	02f80006f8	Teach SplitVectorOp how to split INSERT_VECTOR_ELT. llvm-svn: 42457	2007-09-28 23:53:40 +00:00
Evan Cheng	1a48cdc61e	If two instructions are both two-address code, favors (schedule closer to terminator) the one that has a CopyToReg use. This fixes 2006-05-11-InstrSched.ll with -new-cc-modeling-scheme. llvm-svn: 42453	2007-09-28 22:32:30 +00:00
Evan Cheng	2fca1e4f84	Remove a poor scheduling heuristic. llvm-svn: 42443	2007-09-28 19:37:35 +00:00
Evan Cheng	71904c241e	Trim some unneeded fields. llvm-svn: 42442	2007-09-28 19:24:24 +00:00
Dale Johannesen	8660e0ed82	Fix long double -> uint64 conversion. llvm-svn: 42440	2007-09-28 18:44:17 +00:00
Dale Johannesen	e61886cee4	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Evan Cheng	02e1749295	Avoid inserting a live register more than once. llvm-svn: 42410	2007-09-27 18:46:06 +00:00
Evan Cheng	7a9237420b	Silence a compiler warning. llvm-svn: 42389	2007-09-27 07:35:39 +00:00
Evan Cheng	2d4b603a75	Boogs. llvm-svn: 42388	2007-09-27 07:29:27 +00:00
Evan Cheng	f2cd163dc5	Be smarter about which node to force schedule. Reduce # of duplications + copies; Added statistics. llvm-svn: 42387	2007-09-27 07:09:03 +00:00
Evan Cheng	788683ab56	Backtracking only when it won't create a cycle. llvm-svn: 42384	2007-09-27 00:25:29 +00:00
Evan Cheng	29817845b3	- Move getPhysicalRegisterRegClass() from ScheduleDAG to MRegisterInfo. - Added ability to emit cross class register copies to the BBRU scheduler. - More aggressive backtracking. llvm-svn: 42375	2007-09-26 21:36:17 +00:00
Dale Johannesen	69595b587f	Enable codegen for long double abs, sin, cos llvm-svn: 42368	2007-09-26 21:10:55 +00:00
Dale Johannesen	20674e6c52	Fix f80 UNDEF. llvm-svn: 42359	2007-09-26 17:26:49 +00:00
Evan Cheng	5f9e291240	Allow copyRegToReg to emit cross register classes copies. Tested with "make check"! llvm-svn: 42346	2007-09-26 06:25:56 +00:00
Dan Gohman	8385890394	Move the setOperationAction(ISD::DEBUG_LOC, MVT::Other, Expand) and the check to see if the assembler supports .loc from X86TargetLowering into the superclass TargetLowering. llvm-svn: 42297	2007-09-25 15:10:49 +00:00
Evan Cheng	6d8f155a63	Added major new capabilities to scheduler (only BURR for now) to support physical register dependency. The BURR scheduler can now backtrace and duplicate instructions in order to avoid "expensive / impossible to copy" values (e.g. status flag EFLAGS for x86) from being clobbered. llvm-svn: 42284	2007-09-25 01:54:36 +00:00
Dan Gohman	dd675a5064	Use the correct result value type instead of using getValueType(0) in ExpandEXTRACT_VECTOR_ELT and SplitVectorOp. This fixes an abort in the included testcase. llvm-svn: 42264	2007-09-24 15:54:53 +00:00
Chris Lattner	12ed5081b8	initialize isstore/isload fields in ctor, fixing PR1695 llvm-svn: 42222	2007-09-22 07:02:12 +00:00
Dale Johannesen	c7279629cc	Change APFloat::convertFromInteger to take the incoming bit width instead of number of words allocated, which makes it actually work for int->APF conversions. Adjust callers. Add const to one of the APInt constructors to prevent surprising match when called with const argument. llvm-svn: 42210	2007-09-21 22:09:37 +00:00
Chris Lattner	eed3e42816	initialize SetCCResultContents, fixing PR1693 llvm-svn: 42193	2007-09-21 17:06:39 +00:00
Dale Johannesen	04682bdc81	More long double fixes. x86_64 should build now. llvm-svn: 42155	2007-09-19 23:55:34 +00:00
Dale Johannesen	9e04c2d5af	Fix longdouble -> uint conversion. llvm-svn: 42143	2007-09-19 17:53:26 +00:00
Evan Cheng	4a117958df	Use struct SDep instead of std::pair for SUnit pred and succ lists. First step in tracking physical register output dependencies. llvm-svn: 42125	2007-09-19 01:38:40 +00:00
Evan Cheng	2716b97b13	Fix a bogus splat xform: shuffle <undef, undef, x, undef>, <undef, undef, undef, undef>, <2, 2, 2, 2> != <undef, undef, x, undef> llvm-svn: 42111	2007-09-18 21:54:37 +00:00
Dale Johannesen	83b2001d42	Prevent crash on long double. llvm-svn: 42103	2007-09-18 18:36:59 +00:00
Devang Patel	547d418b8e	Do not hide APInt::dump() inside #ifndef NDEBUG. llvm-svn: 42068	2007-09-17 22:24:00 +00:00
Devang Patel	c7cfb7ebfc	This is not ideal but unbreaks build failure. APInt::dump() is inside #ifndef NDEBUG, however SelectionDAG dump() routines are not. llvm-svn: 42047	2007-09-17 20:03:03 +00:00
Dale Johannesen	78628a9108	Adjust per revew comments. llvm-svn: 42002	2007-09-16 16:51:49 +00:00
Dale Johannesen	575bd6070a	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Chris Lattner	f672250ee0	Fix build problems on Cygwin (PR1652), patch by Patrick Walton. llvm-svn: 41923	2007-09-13 06:09:48 +00:00
Evan Cheng	7adc0f3eff	Bug fixes. llvm-svn: 41900	2007-09-13 00:06:00 +00:00
Evan Cheng	d9d3176de3	Remove dead code. llvm-svn: 41899	2007-09-12 23:45:46 +00:00
Evan Cheng	e88f30877d	Yet another getTargetNode variant. llvm-svn: 41898	2007-09-12 23:39:49 +00:00
Dale Johannesen	4784ee3431	Revise previous patch per review comments. Next round of x87 long double stuff. Getting close now, basically works. llvm-svn: 41875	2007-09-12 03:30:33 +00:00
Dale Johannesen	7bc3969cea	Add APInt interfaces to APFloat (allows directly access to bits). Use them in place of float and double interfaces where appropriate. First bits of x86 long double constants handling (untested, probably does not work). llvm-svn: 41858	2007-09-11 18:32:33 +00:00
Duncan Sands	c358890f73	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Chris Lattner	2add65570c	Emit: cmpl %eax, %ecx setae %al movzbl %al, %eax instead of: cmpl %eax, %ecx setb %al xorb $1, %al movzbl %al, %eax when using logical not of a C comparison. llvm-svn: 41807	2007-09-10 21:39:07 +00:00
Chris Lattner	ca656d2007	1. Don't call Value::getName(), which is slow. 2. Lower calls to fabs and friends to FABS nodes etc unless the function has internal linkage. Before we wouldn't lower if it had a definition, which is incorrect. This allows us to compile: define double @fabs(double %f) { %tmp2 = tail call double @fabs( double %f ) ret double %tmp2 } into: _fabs: fabs f1, f1 blr llvm-svn: 41805	2007-09-10 21:15:22 +00:00
Dale Johannesen	9dfdc452d9	Implement misaligned FP loads and stores. llvm-svn: 41786	2007-09-08 19:29:23 +00:00
Rafael Espindola	8c57e70f93	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Anton Korobeynikov	899c0c9c8d	Split eh.select / eh.typeid.for intrinsics into i32/i64 versions. This is needed, because they just "mark" register liveins and we let frontend solve type issue, not lowering code :) llvm-svn: 41763	2007-09-07 11:39:35 +00:00
Owen Anderson	4b71e55287	Add lengthof and endof templates that hide a lot of sizeof computations. Patch by Sterling Stein! llvm-svn: 41758	2007-09-07 04:06:50 +00:00
Dale Johannesen	86f367a6b7	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Duncan Sands	ab8eb598be	Fix PR1628. When exception handling is turned on, labels are generated bracketing each call (not just invokes). This is used to generate entries in the exception table required by the C++ personality. However it gets in the way of tail-merging. This patch solves the problem by no longer placing labels around ordinary calls. Instead we generate entries in the exception table that cover every instruction in the function that wasn't covered by an invoke range (the range given by the labels around the invoke). As an optimization, such entries are only generated for parts of the function that contain a call, since for the moment those are the only instructions that can throw an exception [1]. As a happy consequence, we now get a smaller exception table, since the same region can cover many calls. While there, I also implemented folding of invoke ranges - successive ranges are merged when safe to do so. Finally, if a selector contains only a cleanup, there's a special shorthand for it - place a 0 in the call-site entry. I implemented this while there. As a result, the exception table output (excluding filters) is now optimal - it cannot be made smaller [2]. The problem with throw filters is that folding them optimally is hard, and the benefit of folding them is minimal. [1] I tested that having trapping instructions (eg divide by zero) in such a region doesn't cause trouble. [2] It could be made smaller with the help of higher layers, eg by having branch folding reorder basic blocks ending in invokes with the same landing pad so they follow each other. I don't know if this is worth doing. llvm-svn: 41718	2007-09-05 11:27:52 +00:00
Evan Cheng	bb21883dd3	Fix for PR1632. EHSELECTION always produces a i32 value. llvm-svn: 41712	2007-09-04 20:39:26 +00:00
Dale Johannesen	b34e6b4898	Add mod, copysign, abs operations to APFloat. Implement some constant folding in SelectionDAG and DAGCombiner using APFloat. Remove double versions of constructor and getValue from ConstantFPSDNode. llvm-svn: 41664	2007-08-31 23:34:27 +00:00
Dale Johannesen	a79f7d4068	Revise per review of previous patch. llvm-svn: 41645	2007-08-31 17:03:33 +00:00
Dale Johannesen	81d6ecb886	Enhance APFloat to retain bits of NaNs (fixes oggenc). Use APFloat interfaces for more references, mostly of ConstantFPSDNode. llvm-svn: 41632	2007-08-31 04:03:46 +00:00
Dale Johannesen	e91a908971	Change LegalFPImmediates to use APFloat. Add APFloat interfaces to ConstantFP, SelectionDAG. Fix integer bit in double->APFloat conversion. Convert LegalizeDAG to use APFloat interface in ConstantFPSDNode uses. llvm-svn: 41587	2007-08-30 00:23:21 +00:00
Anton Korobeynikov	5845c41e4d	Fix use of declaration inside case block llvm-svn: 41584	2007-08-29 23:18:48 +00:00
Anton Korobeynikov	a3531f71ca	Lower FRAME_TO_ADDR_OFFSET to zero by default (if not custom lowered) llvm-svn: 41578	2007-08-29 19:28:29 +00:00
Dan Gohman	cbb2ee9062	Add an option, -view-sunit-dags, for viewing the actual SUnit DAGs used by scheduling. llvm-svn: 41556	2007-08-28 20:32:58 +00:00
Dan Gohman	123b8effaa	Make DAGCombiner's global alias analysis query more precise in the case where both pointers have non-zero offsets. llvm-svn: 41491	2007-08-27 16:32:11 +00:00
Dan Gohman	2e7e251f24	If the source and destination pointers in an llvm.memmove are known to not alias each other, it can be translated as an llvm.memcpy. llvm-svn: 41489	2007-08-27 16:26:13 +00:00

1 2 3 4 5 ...

2028 Commits