llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Mon P Wang	c4bf9b94d5	Cleaned up and fix bugs in convert_rndsat node llvm-svn: 59025	2008-11-11 05:40:06 +00:00
Bill Wendling	891f177dd0	Temporarily revert r58979 and related patch. It's causing a failure in X86 bootstrap: Comparing stages 2 and 3 warning: ./cc1-checksum.o differs warning: ./cc1obj-checksum.o differs warning: ./cc1objplus-checksum.o differs warning: ./cc1plus-checksum.o differs Bootstrap comparison failure! ./alias.o differs ./alloc-pool.o differs ./attribs.o differs ./bb-reorder.o differs ./bitmap.o differs ./build/errors.o differs ./build/genattrtab.o differs ./build/genautomata.o differs ./build/genemit.o differs ./build/genextract.o differs ... -bw llvm-svn: 59003	2008-11-10 21:22:06 +00:00
Mon P Wang	6792115592	Added CONVERT_RNDSAT (conversion with rounding and saturation) SDNode to support targets that support these conversions. Users should avoid using this node as the current targets don't generating code for it. llvm-svn: 59001	2008-11-10 20:54:11 +00:00
Duncan Sands	22e8a45a01	Fix PR2667: add soft float support for sint_to_fp/uint_to_fp where the argument is an apint, or smaller than the minimum size for which there is a libcall (i32). llvm-svn: 58994	2008-11-10 17:36:26 +00:00
Duncan Sands	eca6e696ca	Tweak some comments. llvm-svn: 58993	2008-11-10 17:31:56 +00:00
Duncan Sands	b6c3634c90	Small cleanups. No functionality change intended! llvm-svn: 58992	2008-11-10 17:29:56 +00:00
Duncan Sands	1d0b7dccf7	When promoting the result of fp_to_uint/fp_to_sint, inform the optimizers that the result must be zero/ sign extended from the smaller type. For example, if a fp to unsigned i16 is promoted to fp to i32, then we are allowed to assume that the extra 16 bits are zero (because the result of fp to i16 is undefined if the result does not fit in an i16). This is quite aggressive, but should help the optimizers produce better code. This requires correcting a test which thought that fp_to_uint is some kind of truncation, which it is not: in the testcase (which does fp to i1), either the fp value converts to 0 or 1 or the result is undefined, which is quite different to truncation. llvm-svn: 58991	2008-11-10 17:28:30 +00:00
Dale Johannesen	8a43172ff1	Really fix testb optimization on big-endian. Fixes ppc32 bootstrap. llvm-svn: 58979	2008-11-10 07:16:42 +00:00
Mon P Wang	911ee5bf8b	Added support for the following definition of shufflevector <result> = shufflevector <n x <ty>> <v1>, <n x <ty>> <v2>, <m x i32> <mask> llvm-svn: 58964	2008-11-10 04:46:22 +00:00
Dale Johannesen	27c03be35e	Temporarily revert 58825, which breaks PPC bootstrap. xs llvm-svn: 58930	2008-11-09 06:48:10 +00:00
Duncan Sands	da4e03de04	Try to produce better code when scalarizing VSETCC. llvm-svn: 58920	2008-11-08 18:26:48 +00:00
Dale Johannesen	e0608af6a4	Make testb optimization work on big-endian targets. llvm-svn: 58874	2008-11-08 00:01:16 +00:00
Dale Johannesen	bc914a7cf9	Make FP tests requiring two compares work on PPC (PR 642). This is Chris' patch from the PR, modified to realize that SETUGT/SETULT occur legitimately with integers, plus two fixes in LegalizeDAG to pass a valid result type into LegalizeSetCC. The argument of TLI.getSetCCResultType is ignored on PPC, but I think I'm following usage elsewhere. llvm-svn: 58871	2008-11-07 22:54:33 +00:00
Duncan Sands	60220e7127	Sign-extend rather than zero-extend when promoting the condition for a BRCOND, according to what is returned by getSetCCResultContents. Since all targets return the same thing (ZeroOrOneSetCCResult), this should be harmless! The point is that all over the place the result of SETCC is fed directly into BRCOND. On machines for which getSetCCResultContents returns ZeroOrNegativeOneSetCCResult, this is a sign-extended boolean. So it seems dangerous to also feed BRCOND zero-extended booleans in some circumstances - for example, when promoting the condition. llvm-svn: 58861	2008-11-07 20:13:04 +00:00
Dale Johannesen	4510149fd2	Fix unsigned->ppcf128 conversion. llvm-svn: 58856	2008-11-07 19:11:43 +00:00
Dale Johannesen	89da8440e1	When we're doing a compare of load-AND-constant to 0 (e.g. a bitfield test) narrow the load as much as possible. The has the potential to avoid unnecessary partial-word load-after-store conflicts, which cause stalls on several targets. Also a size win on x86 (testb vs testl). llvm-svn: 58825	2008-11-07 01:28:02 +00:00
Bill Wendling	3fe9fef0da	- Modify the stack protector algorithm so that the stack slot is allocated in LLVM IR code and not in the selection DAG ISel. This is a cleaner solution. - Fix the heuristic for determining if protectors are necessary. The previous one wasn't checking the proper type size. llvm-svn: 58824	2008-11-07 01:23:58 +00:00
Mon P Wang	888f4e6fb0	Fixed scalarizing an extract subvector and prevent an infinite loop when simplify a vector. llvm-svn: 58820	2008-11-06 22:52:21 +00:00
Devang Patel	8640fd500a	Emit label for llvm.dbg.func.start of the inlined function. llvm-svn: 58814	2008-11-06 21:28:20 +00:00
Duncan Sands	50acaf2367	Formating/comment changes - no functionality change. llvm-svn: 58801	2008-11-06 08:51:32 +00:00
Bill Wendling	b6e2d60e7a	- Rename stackprotector_{prologue,epilogue} to stackprotector_{create,check}. - Get rid of "HasStackProtector" in MachineFrameInfo. - Modify intrinsics to tell which are doing what with memory. llvm-svn: 58799	2008-11-06 07:23:03 +00:00
Mon P Wang	41f90a3ee5	Widening cleanup llvm-svn: 58796	2008-11-06 05:31:54 +00:00
Bill Wendling	08905ed703	Implement the stack protector stack accesses via intrinsics: - stackprotector_prologue creates a stack object and stores the guard there. - stackprotector_epilogue reads the stack guard from the stack position created by stackprotector_prologue. - The PrologEpilogInserter was changed to make sure that the stack guard is first on the stack frame. llvm-svn: 58791	2008-11-06 02:29:10 +00:00
Devang Patel	ec135e1f33	Emit label for llvm.dbg.func.start of the inlined function. llvm-svn: 58786	2008-11-06 00:30:09 +00:00
Duncan Sands	f56e2fb5c2	Fix thinko in ppcf128 expansion of truncating store. llvm-svn: 58753	2008-11-05 07:17:27 +00:00
Evan Cheng	1bde698192	Type of shuffle mask has changed. llvm-svn: 58751	2008-11-05 06:04:18 +00:00
Dale Johannesen	eee3a8a2e0	80 columns llvm-svn: 58717	2008-11-04 20:52:49 +00:00
Duncan Sands	58ebf09772	Fix PR3011: LegalizeTypes support for scalarizing SELECT_CC. llvm-svn: 58706	2008-11-04 17:31:08 +00:00
Dale Johannesen	d9906b90d0	Fix some ppcf128 regressions: make ExpandFloatRes_LOAD work correctly, and bring over a late change to ppcf128 SetCC handling. llvm-svn: 58642	2008-11-03 20:47:45 +00:00
Duncan Sands	8a94be8c5b	Make VAARG promotion work correctly with large funky sized integers like i129, and also reduce the number of assumptions made about how vaarg is implemented. This still doesn't work correctly for small integers like (eg) i1 on x86, since x86 passes each of them (essentially an i8) in a 4 byte stack slot, so the pointer needs to be advanced by 4 bytes not by 1 byte as now. But this is no longer a LegalizeTypes problem (it was also wrong in LT before): it is a bug in the operation expansion in LegalizeDAG: now LegalizeTypes turns an i1 vaarg into an i8 vaarg which would work fine if only the i8 vaarg was turned into correct code later. llvm-svn: 58635	2008-11-03 20:22:12 +00:00
Duncan Sands	a9047944bc	Make VAARG work with x86 long double (which is 10 bytes long, but is passed in 12/16 bytes). llvm-svn: 58608	2008-11-03 11:51:11 +00:00
Mon P Wang	0d137a1c51	Added interface to allow clients to create a MemIntrinsicNode for target intrinsics that touches memory llvm-svn: 58548	2008-11-01 20:24:53 +00:00
Dan Gohman	f46431018c	Remove some unused virtual function bodies. llvm-svn: 58524	2008-10-31 19:06:33 +00:00
Duncan Sands	d2500010a3	Add a bunch of libcalls for ppcf128 that were somehow completely forgotten about when writing LegalizeTypes. llvm-svn: 58508	2008-10-31 14:06:52 +00:00
Duncan Sands	615567edc6	Fix PR2986: do not use a potentially illegal type for the shift amount type. Add a check that shifts and rotates use the type returned by getShiftAmountTy for the amount. This exposed some problems in CellSPU and PPC, which have already been fixed. llvm-svn: 58455	2008-10-30 20:26:50 +00:00
Mon P Wang	64e6e15947	Add missing vsetcc expansion for widening llvm-svn: 58443	2008-10-30 18:21:52 +00:00
Mon P Wang	d7e34cd378	Add initial support for vector widening. Logic is set to widen for X86. One will only see an effect if legalizetype is not active. Will move support to LegalizeType soon. llvm-svn: 58426	2008-10-30 08:01:45 +00:00
Duncan Sands	4f4d9d24a4	Uniformize capitalization of NodeId. llvm-svn: 58386	2008-10-29 17:52:12 +00:00
Duncan Sands	fd032c5bef	Fix PR2977: LegalizeTypes support for expanding VAARG. llvm-svn: 58379	2008-10-29 14:25:28 +00:00
Duncan Sands	ada9e7a16d	Add sanity checking for BUILD_PAIR (I noticed the other day that PPC custom lowering could create a BUILD_PAIR of two f64 with a result type of... f64! - already fixed). Fix a place that triggers the sanity check. llvm-svn: 58378	2008-10-29 14:22:20 +00:00
Duncan Sands	3faee6737e	Fix a FIXME: in ReplaceNodeWith, if the new node is morphed by AnalyzeNewNode into a previously processed node, and different result values of that node are remapped to values with different nodes, then we could end up using wrong values here [we were assuming that all results remap to values with the same underlying node]. This seems theoretically possible, but I don't have a testcase. The meat of the patch is in the changes to AnalyzeNewNode/AnalyzeNewValue and ReplaceNodeWith. While there, I changed names like RemapNode to RemapValue, since it really remaps values. To tell the truth, I would be much happier if we were only remapping nodes (it would simplify a bunch of logic, and allow for some cute speedups) but I haven't yet worked out how to do that. llvm-svn: 58372	2008-10-29 06:42:19 +00:00
Duncan Sands	cb5432cdb4	Fix 80 column violations. llvm-svn: 58371	2008-10-29 06:33:00 +00:00
Duncan Sands	790e7e655b	Fix 80 column violations. llvm-svn: 58370	2008-10-29 06:31:03 +00:00
Dan Gohman	eb869eb116	Take Chris' suggestion and define EnableFastISelVerbose and EnableFastISelAbort variables for Release mode instead of using ifdefs in the code. llvm-svn: 58350	2008-10-28 20:35:31 +00:00
Dan Gohman	5a2a8f4b9b	Protect the code for fast-isel debugging with #ifndef NDEBUG. llvm-svn: 58340	2008-10-28 19:08:46 +00:00
Duncan Sands	a64641fbd2	Fix darwin ppc llvm-gcc build breakage: intercept ppcf128 to i32 conversion and expand it into a code sequence like in LegalizeDAG. This needs custom ppc lowering of FP_ROUND_INREG, so turn that on and make it work with LegalizeTypes. Probably PPC should simply custom lower the original conversion. llvm-svn: 58329	2008-10-28 15:00:32 +00:00
Duncan Sands	ce82e0aa82	Fix a testcase provided by Bill in which the node id could end up being wrong mostly because of forgetting to remap new nodes that morphed into processed nodes through CSE. llvm-svn: 58323	2008-10-28 09:38:36 +00:00
Chris Lattner	508a62823e	Don't produce invalid comparisons after legalize. llvm-svn: 58320	2008-10-28 07:11:07 +00:00
Chris Lattner	e39269e22a	fix some whitespace stuff llvm-svn: 58319	2008-10-28 07:10:51 +00:00
Ted Kremenek	03c067710c	Fix bogus comparison of "const char *" with c-string literal. Use strcmp instead. llvm-svn: 58290	2008-10-27 22:43:07 +00:00
David Greene	5015610892	Add setSubgraphColor to color an entire portion of a SelectionDAG. This will be used to support debug features in TableGen. llvm-svn: 58257	2008-10-27 18:17:03 +00:00
Duncan Sands	22451e0303	Fix UpdateNodeOperands so that it does CSE of calls (and a bunch of other node types). While there, I added a doNotCSE predicate and used it to reduce code duplication (some of the duplicated code was wrong...). This fixes ARM/cse-libcalls.ll when using LegalizeTypes. llvm-svn: 58249	2008-10-27 15:30:53 +00:00
Duncan Sands	039edb065f	Fix a bug in which a node could be added to the worklist twice: UpdateNodeOperands could morph a new node into a node already on the worklist. We would then recalculate the NodeId for this existing node and add it to the worklist. The testcase is ARM/cse-libcalls.ll, the problem showing up once UpdateNodeOperands is taught to do CSE for calls. llvm-svn: 58246	2008-10-27 13:18:32 +00:00
Duncan Sands	a6bbc047d5	Turn on LegalizeTypes, the new type legalization codegen infrastructure, by default. Please report any breakage to the mailing lists. llvm-svn: 58232	2008-10-27 08:42:46 +00:00
Dan Gohman	e7c43e94b0	SDNodes may have at most one Flag result. Update this comment to reflect that. llvm-svn: 58145	2008-10-25 17:51:24 +00:00
Dale Johannesen	9edd60f710	Initialize uninitialized variable. llvm-svn: 58057	2008-10-24 01:06:58 +00:00
Duncan Sands	d4ea54fd77	Fix thinko - the operand number has nothing to do with the result number. llvm-svn: 58041	2008-10-23 19:34:23 +00:00
Duncan Sands	91535074e9	LegalizeTypes soft-float support for fpow. llvm-svn: 57973	2008-10-22 11:49:09 +00:00
Duncan Sands	0d122150ce	Be nice to CellSPU: for this target getSetCCResultType may return i8, which can result in SELECT nodes for which the type of the condition is i8, but there are no patterns for select with i8 condition. Tweak the LegalizeTypes logic to avoid this as much as possible. This isn't a real fix because it is still perfectly possible to end up with such select nodes - CellSPU needs to be fixed IMHO. llvm-svn: 57968	2008-10-22 09:23:20 +00:00
Duncan Sands	ebf65ef3f9	Port from LegalizeDAG the logic to only generate ADDC/ADDE/SUBC/SUBE if the target supports it. llvm-svn: 57967	2008-10-22 09:07:29 +00:00
Duncan Sands	81c4c88859	Add some comments explaining the meaning of a boolean that is not of type MVT::i1 in SELECT and SETCC nodes. Relax the LegalizeTypes SELECT condition promotion sanity checks to allow other condition types than i1. llvm-svn: 57966	2008-10-22 09:06:24 +00:00
Duncan Sands	7ba0cc16c1	Temporarily allow the operands of a BUILD_VECTOR to have a different type to the vector element type. This should be fairly harmless because in the past guys like this were being built all over the place (and were cleaned up when I added this check). The reason for relaxing this check is that it helps LegalizeTypes legalize vector shuffles: the mask is a BUILD_VECTOR that it is not always possible to legalize while keeping it a BUILD_VECTOR (vector_shuffle requires the mask to be a BUILD_VECTOR, as opposed to a vector with the right vector type). With this check it is even harder to legalize the mask - turning the check off means that LegalizeTypes manages to legalize almost all vector shuffles encountered in practice. The correct solution is to change vector_shuffle to be a variadic node with the mask built into it as operands. While waiting for that change, this hack stops the problem with vector_shuffle from blocking the turning on of LegalizeTypes. llvm-svn: 57965	2008-10-22 09:00:33 +00:00
Dale Johannesen	eb7e2deb1d	Add an SSE2 algorithm for uint64->f64 conversion. The same one Apple gcc uses, faster. Also gets the extreme case in gcc.c-torture/execute/ieee/rbug.c correct which we weren't before; this is not sufficient to get the test to pass though, there is another bug. llvm-svn: 57926	2008-10-21 20:50:01 +00:00
Dan Gohman	b6f073ce21	Fix SelectionDAGBuild lowering of Select instructions to handle first-class aggregate values. Also, fix a bug in the Ret handling for empty aggregates. llvm-svn: 57925	2008-10-21 20:00:42 +00:00
Dan Gohman	847a83dbad	Don't create TargetGlobalAddress nodes with offsets that don't fit in the 32-bit signed offset field of addresses. Even though this may be intended, some linkers refuse to relocate code where the relocated address computation overflows. Also, fix the sign-extension of constant offsets to use the actual pointer size, rather than the size of the GlobalAddress node, which may be different, for example on x86-64 where MVT::i32 is used when the address is being fit into the 32-bit displacement field. llvm-svn: 57885	2008-10-21 03:38:42 +00:00
Dan Gohman	d9b79484e0	Make the NaN test come second, heuristically assuming that NaNs are less common. llvm-svn: 57871	2008-10-21 03:12:54 +00:00
Chris Lattner	c4a880e03c	Fix gcc.c-torture/compile/920520-1.c by inserting bitconverts for strange asm conditions earlier. In this case, we have a double being passed in an integer reg class. Convert to like sized integer register so that we allocate the right number for the class (two i32's for the f64 in this case). llvm-svn: 57862	2008-10-21 00:45:36 +00:00
Dan Gohman	204cc4e5ff	Fast-isel no longer an experiment. llvm-svn: 57845	2008-10-20 21:30:12 +00:00
Duncan Sands	9a3acf8d88	Support operations like fp_to_uint with a vector result type when the result type is legal but not the operand type. Add additional support for EXTRACT_SUBVECTOR and CONCAT_VECTORS, needed to handle such cases. llvm-svn: 57840	2008-10-20 16:31:21 +00:00
Duncan Sands	53a9bbae16	LegalizeTypes support for atomic operation promotion. llvm-svn: 57838	2008-10-20 16:17:42 +00:00
Duncan Sands	e2c4d654e3	Use DAG.getIntPtrConstant rather than DAG.getConstant with TLI.getPointerTy for a small simplification. llvm-svn: 57837	2008-10-20 16:14:43 +00:00
Duncan Sands	b912b4c4c4	Always use either MVT::i1 or getSetCCResultType for the condition of a SELECT node. Make sure that the correct extension type (any-, sign- or zero-extend) is used. llvm-svn: 57836	2008-10-20 16:13:04 +00:00
Duncan Sands	81b834c160	Formatting - no functional change. llvm-svn: 57834	2008-10-20 16:06:47 +00:00
Duncan Sands	1872cc22b0	Don't use a random type for the select condition, use an MVT::i1 and simplify the code while there. llvm-svn: 57833	2008-10-20 16:04:57 +00:00
Bill Wendling	ed477995f1	Set N->OperandList to 0 after deletion. Otherwise, it's possible that it will be either deleted or referenced afterwards. llvm-svn: 57786	2008-10-19 20:51:12 +00:00
Bill Wendling	980c8ad152	Fix comment. Other formatting changes. No functionality changes. llvm-svn: 57785	2008-10-19 20:34:04 +00:00
Duncan Sands	0a9525febd	Vector shuffle mask elements may be "undef". Handle this everywhere in LegalizeTypes. llvm-svn: 57783	2008-10-19 15:00:25 +00:00
Duncan Sands	65f39e9819	Use a legal integer type for vector shuffle mask elements. Otherwise LegalizeTypes will, reasonably enough, legalize the mask, which may result in it no longer being a BUILD_VECTOR node (LegalizeDAG simply ignores the legality or not of vector masks). llvm-svn: 57782	2008-10-19 14:58:05 +00:00
Chris Lattner	c369db13cc	Reapply r57699 with a fix to not crash on asms with multiple results. Unlike the previous patch this one actually passes make check. "Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand." llvm-svn: 57771	2008-10-18 18:49:30 +00:00
Dan Gohman	ea1d0d8823	Don't truncate GlobalAddress offsets to int in debug output. llvm-svn: 57770	2008-10-18 18:22:42 +00:00
Dan Gohman	15597f07b2	Teach DAGCombine to fold constant offsets into GlobalAddress nodes, and add a TargetLowering hook for it to use to determine when this is legal (i.e. not in PIC mode, etc.) This allows instruction selection to emit folded constant offsets in more cases, such as the included testcase, eliminating the need for explicit arithmetic instructions. This eliminates the need for the C++ code in X86ISelDAGToDAG.cpp that attempted to achieve the same effect, but wasn't as effective. Also, fix handling of offsets in GlobalAddressSDNodes in several places, including changing GlobalAddressSDNode's offset from int to int64_t. The Mips, Alpha, Sparc, and CellSPU targets appear to be unaware of GlobalAddress offsets currently, so set the hook to false on those targets. llvm-svn: 57748	2008-10-18 02:06:02 +00:00
Dan Gohman	2eaf4f1c48	Revert r57699. It's causing regressions in test/CodeGen/X86/2008-09-17-inline-asm-1.ll and a few others, and it breaks the llvm-gcc build. llvm-svn: 57747	2008-10-18 01:03:45 +00:00
Dan Gohman	e1e1c5197e	Factor out the code for mapping LLVM IR condition opcodes to ISD condition opcodes into helper functions. llvm-svn: 57726	2008-10-17 21:16:08 +00:00
Chris Lattner	75618cbb6f	add support for 128 bit aggregates. llvm-svn: 57715	2008-10-17 19:59:51 +00:00
Mon P Wang	fdfc9a2c4f	Added MemIntrinsicNode which is useful to represent target intrinsics that touches memory and need an associated MemOperand llvm-svn: 57712	2008-10-17 18:22:58 +00:00
Dan Gohman	96269ec52a	Factor out the code for mapping LLVM IR condition opcodes to ISD condition opcodes into helper functions. llvm-svn: 57710	2008-10-17 18:18:45 +00:00
Chris Lattner	e2342cd790	Fix PR2356 on PowerPC: if we have an input and output that are tied together that have different sizes (e.g. i32 and i64) make sure to reserve registers for the bigger operand. llvm-svn: 57699	2008-10-17 17:52:49 +00:00
Chris Lattner	bb4ae53b94	refactor some code into a helper method, no functionality change. llvm-svn: 57690	2008-10-17 17:05:25 +00:00
Chris Lattner	d748d12000	Keep track of which input constraint matches an output constraint. Reject asms where an output has multiple input constraints tied to it. llvm-svn: 57687	2008-10-17 16:47:46 +00:00
Chris Lattner	1d0742a530	add an assert so that PR2356 explodes instead of running off an array. Improve some minor comments, refactor some helpers in AsmOperandInfo. No functionality change for valid code. llvm-svn: 57686	2008-10-17 16:21:11 +00:00
Dan Gohman	5d83bd89a5	Define patterns for shld and shrd that match immediate shift counts, and patterns that match dynamic shift counts when the subtract is obscured by a truncate node. Add DAGCombiner support for recognizing rotate patterns when the shift counts are defined by truncate nodes. Fix and simplify the code for commuting shld and shrd instructions to work even when the given instruction doesn't have a parent, and when the caller needs a new instruction. These changes allow LLVM to use the shld, shrd, rol, and ror instructions on x86 to replace equivalent code using two shifts and an or in many more cases. llvm-svn: 57662	2008-10-17 01:23:35 +00:00
Evan Cheng	cb8b4e9dd4	- Add target lowering hooks that specify which setcc conditions are illegal, i.e. conditions that cannot be checked with a single instruction. For example, SETONE and SETUEQ on x86. - Teach legalizer to implement illegal setcc as a and / or of a number of legal setcc nodes. For now, only implement FP conditions. e.g. SETONE is implemented as SETO & SETNE, SETUEQ is SETUO \| SETEQ. - Move x86 target over. llvm-svn: 57542	2008-10-15 02:05:31 +00:00
Dan Gohman	c070ffc493	FastISel support for exception-handling constructs. - Move the EH landing-pad code and adjust it so that it works with FastISel as well as with SDISel. - Add FastISel support for @llvm.eh.exception and @llvm.eh.selector. llvm-svn: 57539	2008-10-14 23:54:11 +00:00
Evan Cheng	3faedff2de	Rename LoadX to LoadExt. llvm-svn: 57526	2008-10-14 21:26:46 +00:00
Dan Gohman	9543edc4ef	Fix command-line option printing to print two spaces where needed, instead of requiring all "short description" strings to begin with two spaces. This makes these strings less mysterious, and it fixes some cases where short description strings mistakenly did not begin with two spaces. llvm-svn: 57521	2008-10-14 20:25:08 +00:00
Evan Cheng	de99d94c58	FIX PR2794. Make sure SIGN_EXTEND_INREG nodes introduced by LegalizeSetCCOperands are leglized. Patch by Richard Pennington. llvm-svn: 57460	2008-10-13 18:46:18 +00:00
Matthijs Kooijman	1ea1008e1f	* Make TargetLowering not crash when TargetMachine::getTargetAsmInfo() returns null. This assumes that any target that does not have AsmInfo, does not support "LocAndDot". llvm-svn: 57438	2008-10-13 12:41:46 +00:00
Chris Lattner	16c0109fd4	calls can be supported. llvm-svn: 57428	2008-10-13 01:59:13 +00:00
Chris Lattner	7910d59d44	Change CALLSEQ_BEGIN and CALLSEQ_END to take TargetConstant's as parameters instead of raw Constants. This prevents the constants from being selected by the isel pass, fixing PR2735. llvm-svn: 57385	2008-10-11 22:08:30 +00:00
Chris Lattner	2f2ef3f25f	simplify comparison llvm-svn: 57371	2008-10-11 00:08:02 +00:00
Dale Johannesen	075a62519f	Add a "loses information" return value to APFloat::convert and APFloat::convertToInteger. Restore return value to IEEE754. Adjust all users accordingly. llvm-svn: 57329	2008-10-09 23:00:39 +00:00
Dale Johannesen	9e57068854	Rename APFloat::convertToAPInt to bitcastToAPInt to make it clearer what the function does. No functional change. llvm-svn: 57325	2008-10-09 18:53:47 +00:00
Dan Gohman	6d98c5ab31	Avoid emitting redundant materializations of integer constants for things like null pointers, which at this level aren't different from regular integer constants. llvm-svn: 57265	2008-10-07 22:03:27 +00:00
Andrew Lenharth	66cc9d5e53	Use Dan's supperior check llvm-svn: 57255	2008-10-07 18:27:23 +00:00
Andrew Lenharth	950618a347	No need for \|= llvm-svn: 57249	2008-10-07 17:11:29 +00:00
Andrew Lenharth	2399c6cb95	Use ADDC if it is valid at any smaller size. Do it right this time llvm-svn: 57248	2008-10-07 17:09:16 +00:00
Andrew Lenharth	84166420cc	Use ADDC if it is valid at any smaller size. fixes test/Codegen/Generic/i128-addsub.ll on x86 llvm-svn: 57247	2008-10-07 17:03:15 +00:00
Andrew Lenharth	c00c2a0058	Expand arith on machines without carry flags llvm-svn: 57243	2008-10-07 14:15:42 +00:00
Dan Gohman	188af3ae0d	Correctly handle calls with no return values. This fixes 2006-01-23-UnionInit on x86-64 when inlining is not enabled. llvm-svn: 57223	2008-10-07 00:12:37 +00:00
Chris Lattner	576e29c87d	wrap some long lines and expand i32 mul's to libcalls, inspired by a patch by Mikael Lepisto! llvm-svn: 57077	2008-10-04 21:27:46 +00:00
Dan Gohman	5944450d5f	Fix fast-isel's handling of atomic instructions. They may expand to multiple basic blocks, in which case fast-isel needs to informed of which block to use as it resumes inserting instructions. llvm-svn: 57040	2008-10-04 00:56:36 +00:00
Dale Johannesen	27d8955b8f	Pass MemOperand through for 64-bit atomics on 32-bit, incidentally making the case where the memop is a pointer deref work. Fix cmp-and-swap regression. llvm-svn: 57027	2008-10-03 19:41:08 +00:00
Dan Gohman	dc99a744fd	Use -1ULL instead of uint64_t(-1), at Anton's suggestion. llvm-svn: 57021	2008-10-03 17:56:45 +00:00
Duncan Sands	00b25fea88	The result of getSetCCResultType (eg: i32) may be larger than the type an i1 is promoted to (eg: i8). Account for this. Noticed by Tilmann Scheller on CellSPU; he will hopefully take care of fixing this in LegalizeDAG and adding a testcase! llvm-svn: 56997	2008-10-03 07:41:46 +00:00
Dan Gohman	68e27d0b27	Implement fast-isel support for zero-extending from i1. It turns out that this is a fairly common operation, and it's easy enough to handle. llvm-svn: 56990	2008-10-03 01:28:47 +00:00
Dan Gohman	e75d14f8b0	Optimize conditional branches in X86FastISel. This replaces sequences like this: sete %al testb %al, %al jne LBB11_1 with this: je LBB11_1 llvm-svn: 56969	2008-10-02 22:15:21 +00:00
Dale Johannesen	dbd7b1bd33	Handle some 64-bit atomics on x86-32, some of the time. llvm-svn: 56963	2008-10-02 18:53:47 +00:00
Dan Gohman	e1d0930044	Make some implicit conversions explicit, to avoid compiler warnings. llvm-svn: 56927	2008-10-01 19:58:59 +00:00
Dan Gohman	6466b44315	Fold trivial two-operand tokenfactors where the operands are equal immediately. llvm-svn: 56921	2008-10-01 15:11:19 +00:00
Dan Gohman	81ade97fe0	Fix typos in comments. llvm-svn: 56919	2008-10-01 15:07:49 +00:00
Bill Wendling	d7effcf8da	Implement the -fno-builtin option in the front-end, not in the back-end. llvm-svn: 56900	2008-10-01 00:59:58 +00:00
Bill Wendling	86f6fdc7e3	- Initialize "--no-builtin" to "false". - Testcase for r56885. llvm-svn: 56886	2008-09-30 21:40:30 +00:00
Bill Wendling	9ad453e943	Add the new `-no-builtin' flag. This flag is meant to mimic the GCC `-fno-builtin' flag. Currently, it's used to replace "memset" with "_bzero" instead of "__bzero" on Darwin10+. This arguably violates the meaning of this flag, but is currently sufficient. The meaning of this flag should become more specific over time. llvm-svn: 56885	2008-09-30 21:22:07 +00:00
Dan Gohman	19530c810d	Move the primary fast-isel top-level comments to FastISel.cpp, where they'll be a little more visible. Also, update and reword them a bit. llvm-svn: 56877	2008-09-30 20:48:29 +00:00
Dan Gohman	5a2169ee6e	Optimize SelectionDAG's AssignTopologicalOrder even further. Completely eliminate the TopOrder std::vector. Instead, sort the AllNodes list in place. This also eliminates the need to call AllNodes.size(), a linear-time operation, before performing the sort. Also, eliminate the Sources temporary std::vector, since it essentially duplicates the sorted result as it is being built. This also changes the direction of the topological sort from bottom-up to top-down. The AllNodes list starts out in roughly top-down order, so this reduces the amount of reordering needed. Top-down is also more convenient for Legalize, and ISel needed only minor adjustments. llvm-svn: 56867	2008-09-30 18:30:35 +00:00
Dale Johannesen	52987eab6e	Remove misuse of ReplaceNodeResults for atomics with valid types. No functional change. llvm-svn: 56808	2008-09-29 22:25:26 +00:00
Dan Gohman	f78676d0b6	Fix FastISel to not initialize the PIC-base register multiple times in functions with PIC references from more than one basic block. llvm-svn: 56807	2008-09-29 21:55:50 +00:00
Bill Wendling	7273078850	Temporarily reverting r56683. This is causing a failure during the build of llvm-gcc: /Volumes/Gir/devel/llvm/clean/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.obj/./gcc/ -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/bin/ -B/Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/lib/ -isystem /Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/include -isystem /Volumes/Gir/devel/llvm/clean/llvm-gcc.install/i386-apple-darwin9.5.0/sys-include -mmacosx-version-min=10.4 -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Gir/devel/llvm/clean/llvm.obj/include -I/Volumes/Gir/devel/llvm/clean/llvm.src/include -fexceptions -fvisibility=hidden -DHIDE_EXPORTS -c ../../llvm-gcc.src/gcc/unwind-dw2-fde-darwin.c -o libgcc/./unwind-dw2-fde-darwin.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Gir/devel/llvm/clean/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:3521:non-relocatable subtraction expression, "_dwarf_reg_size_table" minus "L20$pb" {standard input}:3521:symbol: "_dwarf_reg_size_table" can't be undefined in a subtraction expression {standard input}:3520:non-relocatable subtraction expression, "_dwarf_reg_size_table" minus "L20$pb" ... llvm-svn: 56703	2008-09-26 22:10:44 +00:00
Dan Gohman	989db64c93	Rename ConstantSDNode's getSignExtended to getSExtValue, for consistancy with ConstantInt, and re-implement it in terms of ConstantInt's getSExtValue. llvm-svn: 56700	2008-09-26 21:54:37 +00:00
Evan Cheng	9946443460	Fix @llvm.frameaddress codegen. FP elimination optimization should be disabled when frame address is desired. Also add support for depth > 0. llvm-svn: 56683	2008-09-26 19:48:35 +00:00
Dale Johannesen	3f62c40108	Add "inreg" field to CallSDNode (doesn't increase its size). Adjust various lowering functions to pass this info through from CallInst. Use it to implement sseregparm returns on X86. Remove X86_ssecall calling convention. llvm-svn: 56677	2008-09-26 19:31:26 +00:00
Devang Patel	64dd7a2e89	Large mechanical patch. s/ParamAttr/Attribute/g s/PAList/AttrList/g s/FnAttributeWithIndex/AttributeWithIndex/g s/FnAttr/Attribute/g This sets the stage - to implement function notes as function attributes and - to distinguish between function attributes and return value attributes. This requires corresponding changes in llvm-gcc and clang. llvm-svn: 56622	2008-09-25 21:00:45 +00:00
Dale Johannesen	62f64ab4c8	Accept 'inreg' attribute on x86 functions as meaning sse_regparm (i.e. float/double values go in XMM0 instead of ST0). Update documentation to reflect reality. llvm-svn: 56619	2008-09-25 20:47:45 +00:00
Dan Gohman	9bb92ce6e9	Support for i1 XOR in FastISel. It is actually safe because i1 operands are assumed to already by zero-extended. llvm-svn: 56615	2008-09-25 17:22:52 +00:00
Dan Gohman	51cd436ad8	Don't print fast-isel debug messages by default. Thanks Chris! llvm-svn: 56614	2008-09-25 17:21:42 +00:00
Dan Gohman	5ee9d9af7f	Don't forget the newline in debug output. llvm-svn: 56613	2008-09-25 17:17:27 +00:00
Dan Gohman	b63e6de501	FastISel support for debug info. llvm-svn: 56610	2008-09-25 17:05:24 +00:00
Richard Pennington	1db9263b05	bug 2812: Segmentation fault on a big emdiam processor. llvm-svn: 56609	2008-09-25 16:15:10 +00:00
Dan Gohman	1776533304	Fix a recent fast-isel coverage regression - don't bail out before giving the target a chance to materialize constants. llvm-svn: 56605	2008-09-25 01:28:51 +00:00
Dan Gohman	ed3216739e	Enable DeadMachineInstructionElim when Fast-ISel is enabled. llvm-svn: 56604	2008-09-25 01:14:49 +00:00
Evan Cheng	bac8250ba4	<rdar://problem/6234798> Assertion failed: (!OpInfo.AssignedRegs.Regs.empty() && "Couldn't allocate input reg!") llvm-svn: 56597	2008-09-25 00:14:04 +00:00
Dale Johannesen	4184c23365	Remove SelectionDag early allocation of registers for earlyclobbers. Teach Local RA about earlyclobber, and add some tests for it. llvm-svn: 56592	2008-09-24 23:13:09 +00:00
Bill Wendling	7c60c6e7bf	Reapplying r56550 llvm-svn: 56553	2008-09-24 10:25:02 +00:00
Bill Wendling	236c4d0204	Forgot this part with my last patch. Sorry about the breakage. llvm-svn: 56552	2008-09-24 10:16:24 +00:00
Eric Christopher	8ffa64fdb5	Temporarily revert r56550 until missing commit can be added. llvm-svn: 56551	2008-09-24 08:30:44 +00:00
Bill Wendling	456b33b615	Refactor the constant folding code into it's own function. And call it from both the SelectionDAG and DAGCombiner code. The only functionality change is that now the DAG combiner is performing the constant folding for these operations instead of being a no-op. This is not in response to a bug, so there isn't a testcase. llvm-svn: 56550	2008-09-24 07:11:26 +00:00
Dale Johannesen	bc29bec7f8	Next round of earlyclobber handling. Approach the RA problem by expanding the live interval of an earlyclobber def back one slot. Remove overlap-earlyclobber throughout. Remove earlyclobber bits and their handling from live internals. llvm-svn: 56539	2008-09-24 01:07:17 +00:00
Evan Cheng	f942615847	Properly handle 'm' inline asm constraints. If a GV is being selected for the addressing mode, it requires the same logic for PIC relative addressing, etc. llvm-svn: 56526	2008-09-24 00:05:32 +00:00
Devang Patel	a3e9bf1bca	s/ParameterAttributes/Attributes/g llvm-svn: 56513	2008-09-23 23:03:40 +00:00
Dan Gohman	01a070f9c7	Arrange for FastISel code to have access to the MachineModuleInfo object. This will be needed to support debug info. llvm-svn: 56508	2008-09-23 21:53:34 +00:00
Dan Gohman	52a9adb3fa	Replace the LiveRegs SmallSet with a simple counter that keeps track of the number of live registers, which is all the set was being used for. llvm-svn: 56498	2008-09-23 18:50:48 +00:00
Dan Gohman	28970ed355	Fix the alignment of loads from constant pool entries when the load address has an offset from the base of the constant pool entry. llvm-svn: 56479	2008-09-22 22:40:08 +00:00
Dale Johannesen	3722f4c14c	Make log, log2, log10, exp, exp2 use Expand by default. llvm-svn: 56471	2008-09-22 21:57:32 +00:00
Evan Cheng	638ae1be58	Per review feedback: Only perform (srl x, (trunc (and y, c))) -> (srl x, (and (trunc y), c)) etc. when both "trunc" and "and" have single uses. llvm-svn: 56452	2008-09-22 18:19:24 +00:00
Oscar Fuentes	0f25988689	Initial support for the CMake build system. llvm-svn: 56419	2008-09-22 01:08:49 +00:00
Bill Wendling	3ee08ff81e	Add helper function to get a 32-bit floating point constant. No functionality change. llvm-svn: 56418	2008-09-22 00:44:35 +00:00
Chris Lattner	01cab96cba	don't print GlobalAddressSDNode's with an offset of zero as "foo0". llvm-svn: 56399	2008-09-21 18:38:31 +00:00
Dan Gohman	f66b3277d3	Refactor X86SelectConstAddr, folding it into X86SelectAddress. This results in better code for globals. Also, unbreak the local CSE for GlobalValue stub loads. llvm-svn: 56371	2008-09-19 22:16:54 +00:00
Dan Gohman	b7c5b0f44b	Add a new "fast" scheduler. This is currently basically just a copy of the BURRList scheduler, but with several parts ripped out, such as backtracking, online topological sort maintenance (needed by backtracking), the priority queue, and Sethi-Ullman number computation and maintenance (needed by the priority queue). As a result of all this, it generates somewhat lower quality code, but that's its tradeoff for running about 30% faster than list-burr in -fast mode in many cases. This is somewhat experimental. Moving forward, major pieces of this can be refactored with pieces in common with ScheduleDAGRRList.cpp. llvm-svn: 56307	2008-09-18 16:26:26 +00:00
Dale Johannesen	99091ed94f	Add a bit to mark operands of asm's that conflict with an earlyclobber operand elsewhere. Propagate this bit and the earlyclobber bit through SDISel. Change linear-scan RA not to allocate regs in a way that conflicts with an earlyclobber. See also comments. llvm-svn: 56290	2008-09-17 21:13:11 +00:00
Dan Gohman	4bef47e8c3	Don't worry about clobbering physical register defs that aren't used. llvm-svn: 56281	2008-09-17 15:25:49 +00:00
Evan Cheng	6cfbecd1fa	When converting a CopyFromReg to a copy instruction, use the register class of its uses to determine the right destination register class of the copy. This is important for targets where a physical register may belong to multiple register classes. llvm-svn: 56258	2008-09-16 23:12:11 +00:00
Dan Gohman	9cbb3f591a	Change SelectionDAG::getConstantPool to always set the alignment of the ConstantPoolSDNode, using the target's preferred alignment for the constant type. In LegalizeDAG, when performing loads from the constant pool, the ConstantPoolSDNode's alignment is used in the calls to getLoad and getExtLoad. This change prevents SelectionDAG::getLoad/getExtLoad from incorrectly choosing the ABI alignment for constant pool loads when Alignment == 0. The incorrect alignment is only a performance issue when ABI alignment does not equal preferred alignment (i.e., on x86 it was generating MOVUPS instead of MOVAPS for v4f32 constant loads when the default ABI alignment for 128bit vectors is forced to 1 byte.) Patch by Paul Redmond! llvm-svn: 56253	2008-09-16 22:05:41 +00:00
Bill Wendling	932818c75a	Reverting r56249. On further investigation, this functionality isn't needed. Apologies for the thrashing. llvm-svn: 56251	2008-09-16 21:48:12 +00:00
Dan Gohman	6da3227304	Include the alignment value when displaying ConstantPoolSDNodes. llvm-svn: 56250	2008-09-16 21:18:22 +00:00
Bill Wendling	1a240c8033	- Change "ExternalSymbolSDNode" to "SymbolSDNode". - Add linkage to SymbolSDNode (default to external). - Change ISD::ExternalSymbol to ISD::Symbol. - Change ISD::TargetExternalSymbol to ISD::TargetSymbol These changes pave the way to allowing SymbolSDNodes with non-external linkage. llvm-svn: 56249	2008-09-16 21:12:30 +00:00
Dan Gohman	37457ca74b	Don't take the time to CheckDAGForTailCallsAndFixThem when tail calls are not enabled. Instead just omit the tail call flag when calls are created. llvm-svn: 56235	2008-09-16 01:42:28 +00:00
Dan Gohman	f38d63884f	Re-enable SelectionDAG CSE for calls. It matters in the case of libcalls, as in this testcase on ARM. llvm-svn: 56226	2008-09-15 19:46:03 +00:00
Dan Gohman	3450a8252f	Define CallSDNode, an SDNode subclass for use with ISD::CALL. Currently it just holds the calling convention and flags for isVarArgs and isTailCall. And it has several utility methods, which eliminate magic 5+2*i and similar index computations in several places. CallSDNodes are not CSE'd. Teach UpdateNodeOperands to handle nodes that are not CSE'd gracefully. llvm-svn: 56183	2008-09-13 01:54:27 +00:00
Dan Gohman	082879cfde	Change ConstantSDNode and ConstantFPSDNode to use ConstantInt* and ConstantFP* instead of APInt and APFloat directly. This reduces the amount of time to create ConstantSDNode and ConstantFPSDNode nodes when ConstantInt* and ConstantFP* respectively are already available, as is the case in SelectionDAGBuild.cpp. Also, it reduces the amount of time to legalize constants into constant pools, and the amount of time to add ConstantFP operands to MachineInstrs, due to eliminating ConstantInt::get and ConstantFP::get calls. It increases the amount of work needed to create new constants in cases where the client doesn't already have a ConstantInt* or ConstantFP*, such as legalize expanding 64-bit integer constants to 32-bit constants. And it adds a layer of indirection for the accessor methods. But these appear to be outweight by the benefits in most cases. It will also make it easier to make ConstantSDNode and ConstantFPNode more consistent with ConstantInt and ConstantFP. llvm-svn: 56162	2008-09-12 18:08:03 +00:00
Dale Johannesen	6395da3510	Pass "earlyclobber" bit through to machine representation; coalescer and RA need to know about it. No functional change. llvm-svn: 56161	2008-09-12 17:49:03 +00:00
Dan Gohman	89660301e3	Rename ConstantSDNode::getValue to getZExtValue, for consistency with ConstantInt. This led to fixing a bug in TargetLowering.cpp using getValue instead of getAPIntValue. llvm-svn: 56159	2008-09-12 16:56:44 +00:00
Dale Johannesen	fbc17046ff	The sequence for ppcf128 compares was not IEEE safe in the presence of NaNs. llvm-svn: 56136	2008-09-12 00:30:56 +00:00
Dan Gohman	ad5824104b	FastISel support for i1 PHI nodes. llvm-svn: 56069	2008-09-10 21:01:31 +00:00
Dan Gohman	5a6134a875	FastISel support for i1 constants. llvm-svn: 56068	2008-09-10 21:01:08 +00:00
Dan Gohman	3ccdde5eef	Add X86FastISel support for static allocas, and refences to static allocas. As part of this change, refactor the address mode code for laods and stores. llvm-svn: 56066	2008-09-10 20:11:02 +00:00
Dan Gohman	5211f1f5fc	Add a break statement that I accidentally deleted when I shuffled the fast-isel command-line options around. This fixes a bunch of fast-isel failures. llvm-svn: 56057	2008-09-10 15:52:34 +00:00
Bill Wendling	c7c5d73866	Remove unnecessary bit-wise AND from the limited precision work. llvm-svn: 56049	2008-09-10 06:26:10 +00:00
Daniel Dunbar	df44fb835c	Fix 80 col violation. llvm-svn: 56048	2008-09-10 04:16:29 +00:00
Bill Wendling	4dc9d148b5	Check that both operands are f32 before attempting to lower. llvm-svn: 56036	2008-09-10 00:24:59 +00:00
Bill Wendling	aa39e64468	Implement "visitPow". This is mainly used to see if we have a pow() call of this form: powf(10.0f, x); If this is the case, and also we want limited precision floating-point calculations, then lower to do the limited-precision stuff. llvm-svn: 56035	2008-09-10 00:20:20 +00:00
Evan Cheng	150ee094e2	A few more places where FPOW is being ignored. llvm-svn: 56032	2008-09-09 23:35:53 +00:00
Dan Gohman	5b55c46044	Change -fast-isel-no-abort to -fast-isel-abort, which now defaults to being off by default. Also, add assertion checks to check that the various fast-isel-related command-line options are only used when -fast-isel itself is enabled. llvm-svn: 56029	2008-09-09 23:05:00 +00:00
Evan Cheng	ba11945234	Legalizer was missing code that expand fpow to a libcall. llvm-svn: 56028	2008-09-09 23:02:14 +00:00
Bill Wendling	5d6d774240	Adding 6-, 12-, and 18-bit limited-precision floating-point support for exp2 function. llvm-svn: 56025	2008-09-09 22:39:21 +00:00
Bill Wendling	103d08d4ce	Add support for 6-, 12-, and 18-bit limited precision calculations of exp for floating-point numbers. llvm-svn: 56023	2008-09-09 22:13:54 +00:00
Dan Gohman	2799e1dc63	Add a new option, -fast-isel-verbose, that can be used with -fast-isel-no-abort to get a dump of all unhandled instructions, without an abort. llvm-svn: 56021	2008-09-09 22:06:46 +00:00
Owen Anderson	0bdc9407ca	Clean this up, based on Evan's suggestions. llvm-svn: 56009	2008-09-09 20:47:17 +00:00
Bill Wendling	929486349f	- Add support for 6-, 12-, and 18-bit limited precision floating-point "log" values. - Refactored some of the code. llvm-svn: 56008	2008-09-09 20:39:27 +00:00
Anton Korobeynikov	6ad8b060d0	Make safer variant of alias resolution routine to be default llvm-svn: 56005	2008-09-09 20:05:04 +00:00
Bill Wendling	727b25981a	Add limited precision floating-point conversions of log10 for 6- and 18-bit precisions. llvm-svn: 56000	2008-09-09 18:42:23 +00:00
Owen Anderson	22debc8bec	Check for type legality before materializing integer constants in fast isel. With this change, all of MultiSource/Applications passes on Darwin/X86 under FastISel. llvm-svn: 55982	2008-09-09 06:32:02 +00:00
Dan Gohman	4b334f02b1	Remove the code that protected FastISel from aborting in the case of loads, stores, and conditional branches. It can handle those now, so any that aren't handled should trigger the abort. llvm-svn: 55977	2008-09-09 02:40:04 +00:00
Evan Cheng	dc011a1b10	Fix a constant lowering bug. Now we can do load and store instructions with funky getelementptr embedded in the address operand. llvm-svn: 55975	2008-09-09 01:26:59 +00:00
Bill Wendling	701da64da7	Add support for floating-point calculations of log2 with limited precisions of 6 and 18. llvm-svn: 55968	2008-09-09 00:28:24 +00:00
Anton Korobeynikov	d82cd01929	Reapply 55904: Unbreak and fix indentation llvm-svn: 55958	2008-09-08 21:13:56 +00:00
Dan Gohman	121cc3c111	Fix a few I's that were meant to be renamed to BI's. llvm-svn: 55942	2008-09-08 20:37:59 +00:00
Dale Johannesen	78617b727e	Redo the 3 existing low-precision expansions to use float constants. An oversight by the numerics people who supplied this. llvm-svn: 55930	2008-09-08 18:00:26 +00:00
Bill Wendling	4cc4caab72	Reverting r55898 to r55909. One of these patches was causing an ICE during the full bootstrap on Darwin: /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_negdi2 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_negdi2_s.o Assertion failed: (TargetRegisterInfo::isVirtualRegister(regA) && TargetRegisterInfo::isVirtualRegister(regB) && "cannot update physical register live information"), function runOnMachineFunction, file /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/lib/CodeGen/TwoAddressInstructionPass.cpp, line 311. /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/xgcc -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.obj/./gcc/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/bin/ -B/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/lib/ -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/include -isystem /Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm-gcc.install/i386-apple-darwin9.4.0/sys-include -O2 -O2 -g -O2 -DIN_GCC -W -Wall -Wwrite-strings -Wstrict-prototypes -Wmissing-prototypes -Wold-style-definition -isystem ./include -fPIC -pipe -g -DHAVE_GTHR_DEFAULT -DIN_LIBGCC2 -D__GCC_FLOAT_NOT_NEEDED -I. -I. -I../../llvm-gcc.src/gcc -I../../llvm-gcc.src/gcc/. -I../../llvm-gcc.src/gcc/../include -I./../intl -I../../llvm-gcc.src/gcc/../libcpp/include -I../../llvm-gcc.src/gcc/../libdecnumber -I../libdecnumber -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.obj/include -I/Volumes/Sandbox/Buildbot/llvm/full-llvm/build/llvm.src/include -DSHARED -m64 -DL_lshrdi3 -c ../../llvm-gcc.src/gcc/libgcc2.c -o libgcc/x86_64/_lshrdi3_s.o ../../llvm-gcc.src/gcc/unwind-dw2.c:1527: internal compiler error: Abort trap Please submit a full bug report, with preprocessed source if appropriate. See <URL:http://developer.apple.com/bugreporter> for instructions. {standard input}:unknown:Undefined local symbol LBB21_11 {standard input}:unknown:Undefined local symbol LBB21_12 {standard input}:unknown:Undefined local symbol LBB21_13 {standard input}:unknown:Undefined local symbol LBB21_8 llvm-svn: 55928	2008-09-08 17:59:12 +00:00
Dan Gohman	c9fdcfb189	In visitUREM, arrange for the temporary UDIV node to be revisited, consistent with the code in visitSREM. llvm-svn: 55923	2008-09-08 16:59:01 +00:00
Daniel Dunbar	ddd9b2aeb0	Add VISIBILITY_HIDDEN on SDISelAsmOperandInfo llvm-svn: 55922	2008-09-08 16:56:08 +00:00
Dan Gohman	afe7e3f3b1	Fix the string for ISD::UDIVREM. llvm-svn: 55917	2008-09-08 16:30:29 +00:00
Evan Cheng	432600a47a	Avoid redefinition and nnbreak windows build. llvm-svn: 55911	2008-09-08 16:01:27 +00:00
Anton Korobeynikov	5ea45bb9eb	Unbreak and fix indentation llvm-svn: 55904	2008-09-08 14:23:34 +00:00
Evan Cheng	8cb490d2f3	Add fast isel physical register definition support. llvm-svn: 55892	2008-09-08 08:38:20 +00:00
Bill Wendling	2239de4290	Revert my previous change -- the subtraction of two constants was a no-op before. This is taken care of in the selection DAG pass. In my opinion, this should be in one place or the other. I.e., it should probably be removed from the DAG combiner (along with the other arithmetic transformations on constants that are essentially no-ops). llvm-svn: 55889	2008-09-08 01:56:32 +00:00
Bill Wendling	91e9abe370	Convert // fold (sub c1, c2) -> c1-c2 from a no-op into an actual transformation. llvm-svn: 55886	2008-09-07 11:34:47 +00:00
Evan Cheng	396c744dfa	Indentation. llvm-svn: 55880	2008-09-07 09:04:52 +00:00
Evan Cheng	285350703c	- Doh. Pass vector by value is bad. - Add a AnalyzeCallResult specialized for calls which produce a single value. This is used by fastisel. llvm-svn: 55879	2008-09-07 09:02:18 +00:00
Dale Johannesen	3be45974bb	Next limited float precision expansion (log2 12 bits) llvm-svn: 55866	2008-09-05 23:49:37 +00:00
Owen Anderson	453bcfcf8d	Revert r55859. This is breaking the build in the abscence of its companion commit. llvm-svn: 55865	2008-09-05 23:36:01 +00:00
Dan Gohman	85d35b92df	Move the code that inserts copies for function livein registers out of ScheduleDAGEmit.cpp and into SelectionDAGISel.cpp. This allows it to be run exactly once per function, even if multiple SelectionDAG iterations happen in the entry block, as may happen with FastISel. llvm-svn: 55863	2008-09-05 22:59:21 +00:00
Dale Johannesen	6b48790d88	Add the next limited-precision expansion. llvm-svn: 55856	2008-09-05 21:27:19 +00:00
Dan Gohman	b22e9b050f	FastISel support for AND and OR with type i1. llvm-svn: 55846	2008-09-05 18:44:22 +00:00
Dale Johannesen	116163ab21	Add hooks for other intrinsics to get low-precision expansions. llvm-svn: 55845	2008-09-05 18:38:42 +00:00
Dan Gohman	525caf83e5	FastISel support for ConstantExprs. llvm-svn: 55843	2008-09-05 18:18:20 +00:00
Dan Gohman	13d7484b4a	Revert r55817. It broke PIC. FastISel will need to find a different approach here. llvm-svn: 55842	2008-09-05 18:13:01 +00:00
Evan Cheng	339a06f29e	Add a variant of AnalyzeCallOperands that can be used by fast isel. llvm-svn: 55838	2008-09-05 16:59:26 +00:00
Duncan Sands	566e0f1053	"Fix" PR2762. The testcase now crashes codegen elsewhere due to a missing pattern for v2f64 = sint_to_fp v2i32. That is PR2687. llvm-svn: 55828	2008-09-05 08:13:35 +00:00
Dan Gohman	a3987ed4e2	Fix a search+replace-o. llvm-svn: 55824	2008-09-05 01:58:21 +00:00
Dale Johannesen	ce63ed5b47	Add -flimit-float-precision to enable some faster, but less accurate (non-IEEE) code sequences for certain math library functions. Add the first of several such expansions. Don't worry, if you don't turn it on it won't affect you. llvm-svn: 55823	2008-09-05 01:48:15 +00:00
Dan Gohman	e64326ff34	FastISel support for unreachable. llvm-svn: 55818	2008-09-05 01:08:41 +00:00
Dan Gohman	fcd2cbd985	In FastISel mode, the scheduler may be invoked multiple times in the same block. Fix the entry-block handling to only run at at the beginning of the entry block, and not any other times. llvm-svn: 55817	2008-09-05 01:07:48 +00:00
Owen Anderson	6d5b72d45a	Add initial support for selecting constant materializations that require constant pool loads on X86 in fast isel. This isn't actually used yet. llvm-svn: 55814	2008-09-05 00:06:23 +00:00
Dan Gohman	1ffb4ad3a8	Add an include of SmallSet.h. llvm-svn: 55793	2008-09-04 20:49:27 +00:00
Dan Gohman	e1f9be27bc	Tidy up several unbeseeming casts from pointer to intptr_t. llvm-svn: 55779	2008-09-04 17:05:41 +00:00
Dan Gohman	7ee14837e6	Clean up uses of TargetLowering::getTargetMachine. llvm-svn: 55769	2008-09-04 15:39:15 +00:00
Dale Johannesen	9e4d101fab	Add intrinsics for log, log2, log10, exp, exp2. No functional change (and no FE change to generate them). llvm-svn: 55753	2008-09-04 00:47:13 +00:00
Dan Gohman	18f659d804	Do trivial local CSE for constants and other non-Instruction values in FastISel. llvm-svn: 55748	2008-09-03 23:32:19 +00:00
Dan Gohman	f86538dd3c	Put RegsForValue in the llvm namespace to avoid warnings about classes in the llvm namespace having members with types from anonymous namespaces. llvm-svn: 55747	2008-09-03 23:18:39 +00:00
Dan Gohman	18cc2a26df	Create HandlePHINodesInSuccessorBlocksFast, a version of HandlePHINodesInSuccessorBlocks that works FastISel-style. This allows PHI nodes to be updated correctly while using FastISel. This also involves some code reorganization; ValueMap and MBBMap are now members of the FastISel class, so they needn't be passed around explicitly anymore. Also, SelectInstructions is changed to SelectInstruction, and only does one instruction at a time. llvm-svn: 55746	2008-09-03 23:12:08 +00:00
Owen Anderson	d08396955c	Oops, I accidentally broke the fallback case with my last commit. llvm-svn: 55704	2008-09-03 17:51:57 +00:00
Owen Anderson	906c590bb8	Fix an issue where we were reusing materializations of constants in blocks not dominated by the materialization. This is the simple fix, materializing the constant before every use. It might be better to either track domination of uses or to materialize all constants and the beginning of the function and let remat sort when to do materialization at uses. llvm-svn: 55703	2008-09-03 17:37:03 +00:00
Dan Gohman	3ee4edfe9c	Split the SelectionDAG-building code, including the FunctionLoweringInfo and SelectionDAGLowering classes, out of SelectionDAGISel.cpp and put it in a separate file, SelectionDAGBuild.cpp. llvm-svn: 55701	2008-09-03 16:12:24 +00:00
Dan Gohman	c58897359b	Separate MachineInstr-emitting routines from actual scheduling routines and move them into a separate file, ScheduleDAGEmit.cpp. llvm-svn: 55699	2008-09-03 16:01:59 +00:00
Evan Cheng	f993be4cc8	If TargetSelectInstruction returns true, move to next instruction. llvm-svn: 55692	2008-09-03 06:43:41 +00:00
Evan Cheng	5e0e6dfc7f	80 col violations. llvm-svn: 55668	2008-09-02 21:59:13 +00:00
Dan Gohman	9969b223c7	Ensure that HandlePHINodesInSuccessorBlocks is run for all blocks, even in FastISel mode in the case where FastISel successfully selects all the instructions. llvm-svn: 55641	2008-09-02 20:17:56 +00:00
Gabor Greif	632fa3a318	Provide two overloads of AnalyzeNewNode. The first can update the SDNode in an SDValue while the second is called with SDNode* and returns a possibly updated SDNode*. This patch has no intended functional impact, but helps eliminating ugly temporary SDValues. llvm-svn: 55608	2008-09-01 15:10:19 +00:00
Duncan Sands	efc82024e0	Even though no caller actually uses the new value (what matters is that it is added to the worklist), it seems more logical to return it. llvm-svn: 55606	2008-09-01 13:11:13 +00:00
Bill Wendling	3f918b3603	Another situation where ROTR is cheaper than ROTL. llvm-svn: 55577	2008-08-31 01:13:31 +00:00
Bill Wendling	ef64d4333e	For this pattern, ROTR is the cheaper option. llvm-svn: 55576	2008-08-31 01:04:56 +00:00
Bill Wendling	08690f06b2	- Fix comment so that it describes how the code really works: // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotl x, y) // fold (or (shl x, (ext y)), (srl x, (ext (sub 32, y)))) -> // (rotr x, (sub 32, y)) Example: (x == 0xDEADBEEF and y == 4) (x << 4) \| (x >> 28) => 0xEADBEEF0 \| 0x0000000D => 0xEADBEEFD (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => 0xEADBEEFD - Fix comment and code for second version. It wasn't using the rot* propertly. // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotr x, y) // fold (or (shl x, (ext (sub 32, y))), (srl x, (ext r))) -> // (rotl x, (sub 32, y)) (x << 28) \| (x >> 4) => 0xD0000000 \| 0x0DEADBEE => 0xDDEADBEE (rotl x, 4) => 0xEADBEEFD (rotr x, 28) => (0xEADBEEFD) llvm-svn: 55575	2008-08-31 00:37:27 +00:00
Gabor Greif	fa6e220233	typo llvm-svn: 55574	2008-08-30 22:16:05 +00:00
Gabor Greif	2aef1d5e4c	fix some 80-col violations llvm-svn: 55571	2008-08-30 19:29:20 +00:00
Evan Cheng	4bc8c9652e	Transform (x << (y&31)) -> (x << y). This takes advantage of the fact x86 shift instructions 2nd operand (shift count) is limited to 0 to 31 (or 63 in the x86-64 case). llvm-svn: 55558	2008-08-30 02:03:58 +00:00
Owen Anderson	6c8f04643e	Fix an issue where a use might be selected before a def, and then we didn't respect the pre-chosen vreg assignment when selecting the def. This is the naive solution to the problem: insert a copy to the pre-chosen vreg. Other solutions might be preferable, such as: 1) Passing the dest reg into FastEmit_. However, this would require the higher level code to know about reg classes, which they don't currently. 2) Selecting blocks in reverse postorder. This has some compile time cost for computing the order, and we'd need to measure its impact. llvm-svn: 55555	2008-08-30 00:38:46 +00:00
Evan Cheng	9ee227f1df	Fix 80 col. violations. llvm-svn: 55551	2008-08-29 23:20:46 +00:00
Evan Cheng	2a3e05b519	Back out 55498. It broken Apple style bootstrapping. llvm-svn: 55549	2008-08-29 22:21:44 +00:00
Dan Gohman	c7b8401b77	Add a target callback for FastISel. llvm-svn: 55512	2008-08-28 23:21:34 +00:00

... 3 4 5 6 7 ...

3129 Commits