llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00

Author	SHA1	Message	Date
Dan Gohman	d789392934	Handle empty aggregate values. llvm-svn: 52150	2008-06-09 15:21:47 +00:00
Duncan Sands	fe2a970a5c	Remove comparison methods for MVT. The main cause of apint codegen failure is the DAG combiner doing the wrong thing because it was comparing MVT's using < rather than comparing the number of bits. Removing the < method makes this mistake impossible to commit. Instead, add helper methods for comparing bits and use them. llvm-svn: 52098	2008-06-08 20:54:56 +00:00
Dan Gohman	d4e2736532	CodeGen support for insertvalue and extractvalue, and for loads and stores of aggregate values. llvm-svn: 52069	2008-06-07 02:02:36 +00:00
Owen Anderson	a18629b9c6	Connect successors before creating the DAG node for the branch. This has no visible functionality change, but enables a future patch where node creation will update the CFG if it decides to create an unconditional rather than a conditional branch. llvm-svn: 52067	2008-06-07 00:00:23 +00:00
Duncan Sands	d634afe3aa	Wrap MVT::ValueType in a struct to get type safety and better control the abstraction. Rename the type to MVT. To update out-of-tree patches, the main thing to do is to rename MVT::ValueType to MVT, and rewrite expressions like MVT::getSizeInBits(VT) in the form VT.getSizeInBits(). Use VT.getSimpleVT() to extract a MVT::SimpleValueType for use in switch statements (you will get an assert failure if VT is an extended value type - these shouldn't exist after type legalization). This results in a small speedup of codegen and no new testsuite failures (x86-64 linux). llvm-svn: 52044	2008-06-06 12:08:01 +00:00
Dan Gohman	144390078f	Use isSingleValueType instead of isFirstClassType to exclude struct and array types. llvm-svn: 51460	2008-05-23 00:34:04 +00:00
Dan Gohman	821bf58428	IR support for extractvalue and insertvalue instructions. Also, begin moving toward making structs and arrays first-class types. llvm-svn: 51157	2008-05-15 19:50:34 +00:00
Evan Cheng	9d22a90b0b	Really silence compiler warnings. llvm-svn: 51126	2008-05-14 20:29:30 +00:00
Evan Cheng	e7684b9e91	Silence some compiler warnings. llvm-svn: 51115	2008-05-14 20:07:51 +00:00
Dan Gohman	bab18cae46	Clean up the use of static and anonymous namespaces. This turned up several things that were neither in an anonymous namespace nor static but not intended to be global. llvm-svn: 51017	2008-05-13 00:00:25 +00:00
Nate Begeman	11c0772a30	Add support for vicmp/vfcmp codegen, more legalize support coming. This is necessary to unbreak the build. llvm-svn: 50988	2008-05-12 19:40:03 +00:00
Anton Korobeynikov	ddb93e7a02	Turn StripPointerCast() into a method llvm-svn: 50836	2008-05-07 22:54:15 +00:00
Anton Korobeynikov	90ee6d6616	Make StripPointerCast a common function (should we make it a method of Value instead?) llvm-svn: 50775	2008-05-06 22:52:30 +00:00
Dan Gohman	d4a670284c	Make several variable declarations static. llvm-svn: 50696	2008-05-06 01:53:16 +00:00
Mon P Wang	84a269e023	Added additional atomic intrinsics: and, or, xor, min, and max. llvm-svn: 50663	2008-05-05 19:05:59 +00:00
Dan Gohman	a55bbcacce	Use push_back(...) instead of resize(1, ...), per review feedback. llvm-svn: 50561	2008-05-02 00:03:54 +00:00
Dan Gohman	148b1904fe	Fix uninitialized uses of the FPC variable. llvm-svn: 50558	2008-05-01 23:40:44 +00:00
Chris Lattner	e9bbe8e6b6	don't randomly miscompile seto/setuo just because we are in ffastmath mode. This fixes rdar://5902801, a miscompilation of gcc.dg/builtins-8.c. Bill, please pull this into Tak. llvm-svn: 50523	2008-05-01 07:26:11 +00:00
Arnold Schwaighofer	f58a35e2ec	Tail call optimization improvements: Move platform independent code (lowering of possibly overwritten arguments, check for tail call optimization eligibility) from target X86ISelectionLowering.cpp to TargetLowering.h and SelectionDAGISel.cpp. Initial PowerPC tail call implementation: Support ppc32 implemented and tested (passes my tests and test-suite llvm-test). Support ppc64 implemented and half tested (passes my tests). On ppc tail call optimization is performed if: caller and callee are fastcc; call is a tail call (in tail call position, call followed by ret); no variable argument lists or byval arguments; option -tailcallopt is enabled. Supported: * non pic tail calls on linux/darwin * module-local tail calls on linux(PIC/GOT)/darwin(PIC) * inter-module tail calls on darwin(PIC) If constraints are not met a normal call will be emitted. A test checking the argument lowering behaviour on x86-64 was added. llvm-svn: 50477	2008-04-30 09:16:33 +00:00
Chris Lattner	0f63b8fecc	make the vector conversion magic handle multiple results. We now compile test2/test3 to: _test2: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End addps %xmm1, %xmm0 ret _test3: ## InlineAsm Start set %xmm0, %xmm1 ## InlineAsm End paddd %xmm1, %xmm0 ret as expected. llvm-svn: 50389	2008-04-29 04:48:56 +00:00
Chris Lattner	e75d09711d	add support for multiple return values in inline asm. This is a step towards PR2094. It now compiles the attached .ll file to: _sad16_sse2: movslq %ecx, %rax ## InlineAsm Start %ecx %rdx %rax %rax %r8d %rdx %rsi ## InlineAsm End ## InlineAsm Start set %eax ## InlineAsm End ret which is pretty decent for a 3 output, 4 input asm. llvm-svn: 50386	2008-04-29 04:29:54 +00:00
Evan Cheng	a2e4ffcd8a	Fix a bug in RegsForValue::getCopyToRegs() that causes cyclical scheduling units. If it's creating multiple CopyToReg nodes that are "flagged" together, it should not create a TokenFactor for its chain outputs: c1, f1 = CopyToReg c2, f2 = CopyToReg c3 = TokenFactor c1, c2 ... = user c3, ..., f2 Now that the two CopyToReg's and the user are "flagged" together, they effectively form a single scheduling unit. The TokenFactor is now both an operand and a successor of the Flagged nodes. llvm-svn: 50376	2008-04-28 22:07:13 +00:00
Dan Gohman	d67d878df0	Delete an unused constructor. llvm-svn: 50367	2008-04-28 18:28:49 +00:00
Dan Gohman	733bb3e992	Add a comment to CreateRegForValue that clarifies the handling of aggregate types. llvm-svn: 50366	2008-04-28 18:19:43 +00:00
Dan Gohman	2f0476499c	Rewrite the comments for RegsForValue and its members, and reorder some of the members for clarity. llvm-svn: 50365	2008-04-28 18:10:39 +00:00
Dan Gohman	5d36cd74b0	Don't call size() on each iteration of the loop. llvm-svn: 50361	2008-04-28 17:42:03 +00:00
Chris Lattner	c83326d89f	Another collection of random cleanups. No functionality change. llvm-svn: 50341	2008-04-28 07:16:35 +00:00
Chris Lattner	27fa922841	Remove the SmallVector ctor that converts from a SmallVectorImpl. This conversion opens the door for many nasty implicit conversion issues, and can be easily solved by initializing with (V.begin(), V.end()) when needed. This patch includes many small cleanups for sdisel also. llvm-svn: 50340	2008-04-28 06:44:42 +00:00
Chris Lattner	d6315b68f2	switch RegsForValue::Regs to be a SmallVector to avoid heap thrash on tiny (usually single-element) vectors. llvm-svn: 50335	2008-04-28 06:02:19 +00:00
Chris Lattner	459f6ed05c	move static function out of anon namespace, no functionality change. llvm-svn: 50330	2008-04-27 23:48:12 +00:00
Chris Lattner	113de6b3a8	Another step to getting multiple result inline asm to work. llvm-svn: 50329	2008-04-27 23:44:28 +00:00
Chris Lattner	39a4281deb	Implement a significant optimization for inline asm: When choosing between constraints with multiple options, like "ir", test to see if we can use the 'i' constraint and go with that if possible. This produces more optimal ASM in all cases (sparing a register and an instruction to load it), and fixes inline asm like this: void test () { asm volatile (" %c0 %1 " : : "imr" (42), "imr"(14)); } Previously we would dump "42" into a memory location (which is ok for the 'm' constraint) which would cause a problem because the 'c' modifier is not valid on memory operands. Isn't it great how inline asm turns 'missed optimization' into 'compile failed'?? Incidentally, this was the todo in PowerPC/2007-04-24-InlineAsm-I-Modifier.ll Please do NOT pull this into Tak. llvm-svn: 50315	2008-04-27 00:37:18 +00:00
Chris Lattner	42aa4f9620	isa+cast -> dyn_cast llvm-svn: 50314	2008-04-27 00:16:18 +00:00
Chris Lattner	b83aaaa855	Move a bunch of inline asm code out of line. llvm-svn: 50313	2008-04-27 00:09:47 +00:00
Dan Gohman	c4b6768db4	Remove the code from CodeGenPrepare that moved getresult instructions to the block that defines their operands. This doesn't work in the case that the operand is an invoke, because invoke is a terminator and must be the last instruction in a block. Replace it with support in SelectionDAGISel for copying struct values into sequences of virtual registers. llvm-svn: 50279	2008-04-25 18:27:55 +00:00
Dan Gohman	37f4dc9ab4	Use isa instead of dyn_cast. llvm-svn: 50181	2008-04-23 20:25:16 +00:00
Dan Gohman	afa475f207	Add support to codegen for getresult instructions with undef operands. llvm-svn: 50180	2008-04-23 20:21:29 +00:00
Nicolas Geoffray	7e0110f724	Change Divided flag to Split, as suggested by Evan llvm-svn: 49715	2008-04-15 08:08:50 +00:00
Nicolas Geoffray	5d04329f4b	Fix /test/CodeGen/PowerPC/big-endian-actual-args.ll for linux/ppc32 llvm-svn: 49652	2008-04-14 17:17:14 +00:00
Nicolas Geoffray	ad5556e8ba	Add a divided flag for the first piece of an argument divided into multiple parts. Fixes PR1643 llvm-svn: 49611	2008-04-13 13:40:22 +00:00
Dan Gohman	15edbf989f	Drop ISD::MEMSET, ISD::MEMMOVE, and ISD::MEMCPY, which are not Legal on any current target and aren't optimized in DAGCombiner. Instead of using intermediate nodes, expand the operations, choosing between simple loads/stores, target-specific code, and library calls, immediately. Previously, the code to emit optimized code for these operations was only used at initial SelectionDAG construction time; now it is used at all times. This fixes some cases where rep;movs was being used for small copies where simple loads/stores would be better. This also cleans up code that checks for alignments less than 4; let the targets make that decision instead of doing it in target-independent code. This allows x86 to use rep;movs in low-alignment cases. Also, this fixes a bug that resulted in the use of rep;stos for memsets of 0 with non-constant memory size when the alignment was at least 4. It's better to use the library in this case, which can be significantly faster when the size is large. This also preserves more SourceValue information when memory intrinsics are lowered into simple loads/stores. llvm-svn: 49572	2008-04-12 04:36:06 +00:00
Dale Johannesen	6ebaf848f4	Make sure both PendingLoads and PendingExports are flushed before an invoke. Failure to do this causes references in the landing pad to variables that were not set. Fixes g++.dg/eh/delayslot1.C g++.dg/eh/fp-regs.C g++.old-deja/g++.brendan/eh1.C llvm-svn: 49243	2008-04-04 23:48:31 +00:00
Dale Johannesen	79633a914f	Recommitting EH patch; this should answer most of the review feedback. -enable-eh is still accepted but doesn't do anything. EH intrinsics use Dwarf EH if the target supports that, and are handled by LowerInvoke otherwise. The separation of the EH table and frame move data is, I think, logically figured out, but either one still causes full EH info to be generated (not sure how to split the metadata correctly). MachineModuleInfo::needsFrameInfo is no longer used and is removed. llvm-svn: 49064	2008-04-02 00:25:04 +00:00
Dale Johannesen	8813206b7f	Revert 49006 for the moment. llvm-svn: 49046	2008-04-01 20:00:57 +00:00
Dale Johannesen	fa4433be71	Emit exception handling info for functions which are not marked nounwind, or for all functions when -enable-eh is set, provided the target supports Dwarf EH. llvm-gcc generates nounwind in the right places; other FEs will need to do so also. Given such a FE, -enable-eh should no longer be needed. llvm-svn: 49006	2008-03-31 23:40:23 +00:00
Chris Lattner	49e9edd6f6	Fix "Control reaches the end of non-void function" warnings, patch by David Chisnall. llvm-svn: 48963	2008-03-30 18:22:13 +00:00
Dan Gohman	199ab29337	Avoid creating chain dependencies from CopyToReg nodes to load and store nodes. This doesn't currently have much impact on the generated code, but it does produce simpler-looking SelectionDAGs, and consequently simpler-looking ScheduleDAGs, because there are fewer spurious dependencies. In particular, CopyValueToVirtualRegister now uses the entry node as the input chain dependency for new CopyToReg nodes instead of calling getRoot and depending on the most recent memory reference. Also, rename UnorderedChains to PendingExports and pull it up from being a local variable in SelectionDAGISel::BuildSelectionDAG to being a member variable of SelectionDAGISel, so that it doesn't have to be passed around to all the places that need it. llvm-svn: 48893	2008-03-27 19:56:19 +00:00
Duncan Sands	4153fc30c9	Introduce a new node for holding call argument flags. This is needed by the new legalize types infrastructure which wants to expand the 64 bit constants previously used to hold the flags on 32 bit machines. There are two functional changes: (1) in LowerArguments, if a parameter has the zext attribute set then that is marked in the flags; before it was being ignored; (2) PPC had some bogus code for handling two word arguments when using the ELF 32 ABI, which was hard to convert because of the bogusness. As suggested by the original author (Nicolas Geoffray), I've disabled it for the moment. Tested with "make check" and the Ada ACATS testsuite. llvm-svn: 48640	2008-03-21 09:14:45 +00:00
Duncan Sands	3760c87373	Do not generate special entries in the dwarf eh table for nounwind calls. llvm-svn: 48373	2008-03-14 21:36:24 +00:00
Duncan Sands	05eb212b2d	Don't try to extract an i32 from an f64. This getCopyToParts problem was noticed by the new LegalizeTypes infrastructure. In order to avoid this kind of thing in the future I've added a check that EXTRACT_ELEMENT is only used with integers. Once LegalizeTypes is up and running most likely BUILD_PAIR and EXTRACT_ELEMENT can be removed, in favour of using apints instead. llvm-svn: 48294	2008-03-12 20:30:08 +00:00
Dan Gohman	55a443d612	Initial codegen support for functions and calls with multiple return values. llvm-svn: 48244	2008-03-11 21:11:25 +00:00
Scott Michel	bb8e8fca47	Give TargetLowering::getSetCCResultType() a parameter so that ISD::SETCC's return ValueType can depend on its operands' ValueType. This is a cosmetic change, no functionality impacted. llvm-svn: 48145	2008-03-10 15:42:14 +00:00
Dale Johannesen	e6b0009792	Increase ISD::ParamFlags to 64 bits. Increase the ByValSize field to 32 bits, thus enabling correct handling of ByVal structs bigger than 0x1ffff. Abstract interface a bit. Fixes gcc.c-torture/execute/pr23135.c and gcc.c-torture/execute/pr28982b.c in gcc testsuite (were ICE'ing on ppc32, quietly producing wrong code on x86-32.) llvm-svn: 48122	2008-03-10 02:17:22 +00:00
Chris Lattner	9d93efcfbd	remove an extraneous (and ugly) default argument, thanks Duncan. llvm-svn: 48117	2008-03-09 20:04:36 +00:00
Chris Lattner	a85b0b591e	fp_round's produced by getCopyFromParts should always be exact, because they are produced by calls (which are known exact) and by cross block copies which are known to be produced by extends. This improves: define double @test2() { %tmp85 = call double asm sideeffect "fld0", "={st(0)}"() ret double %tmp85 } from: _test2: subl $20, %esp # InlineAsm Start fld0 # InlineAsm End fstpl 8(%esp) movsd 8(%esp), %xmm0 movsd %xmm0, (%esp) fldl (%esp) addl $20, %esp #FP_REG_KILL ret to: _test2: # InlineAsm Start fld0 # InlineAsm End #FP_REG_KILL ret by avoiding a f64 <-> f80 trip llvm-svn: 48108	2008-03-09 09:38:46 +00:00
Chris Lattner	9e5cf6dc21	extend fp values with FP_EXTEND not FP_ROUND. llvm-svn: 48097	2008-03-09 07:47:22 +00:00
Evan Cheng	dba1dfe962	Implement x86 support for @llvm.prefetch. It corresponds to prefetcht{0\|1\|2} and prefetchnta instructions. llvm-svn: 48042	2008-03-08 00:58:38 +00:00
Dan Gohman	43f51f5cb2	Use the new APInt-enabled form of getConstant instead of converting an APInt into a uint64_t to call getConstant. llvm-svn: 47742	2008-02-29 01:41:59 +00:00
Evan Cheng	40c26c71c0	Refactor inline asm constraint matching code out of SDIsel into TargetLowering. llvm-svn: 47587	2008-02-26 02:33:44 +00:00
Dan Gohman	012abf0109	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Dale Johannesen	a96eb3a1d8	Pass alignment on ByVal parameters, from FE, all the way through. It is now used for codegen. llvm-svn: 47484	2008-02-22 17:49:45 +00:00
Chris Lattner	b3c8d120dc	Make the clobber analysis a bit more smart: we only are careful about early clobbers if the clobber list contains a register not some thing like {memory}, {dirflag} etc. llvm-svn: 47457	2008-02-21 20:54:31 +00:00
Chris Lattner	4f87f1c087	Treat clobber operands like early clobbers: if we have any, we force sdisel to do all regalloc for an asm. This leads to gross but correct codegen. This fixes the rest of PR2078. llvm-svn: 47454	2008-02-21 19:43:13 +00:00
Andrew Lenharth	db9cd46f5d	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Chris Lattner	702abbeb51	Add support for matching mem operands. This fixes PR1133, patch by Eli Friedman. This implements CodeGen/Generic/2008-02-20-MatchingMem.ll. llvm-svn: 47428	2008-02-21 05:27:19 +00:00
Chris Lattner	99b5a37d39	Fix a (harmless) bug where vregs were added to the used reg lists for inline asms. Fix PR2078 by marking aliases of registers used when a register is marked used. This prevents EAX from being allocated when AX is listed in the clobber set for the asm. llvm-svn: 47426	2008-02-21 04:55:52 +00:00
Devang Patel	8a80334c8a	An assert is a more effective reminder than a FIXME tag for unimplemented features. llvm-svn: 47388	2008-02-20 18:37:40 +00:00
Anton Korobeynikov	7dd00942cc	Update gcc 4.3 warnings fix patch with recent head changes llvm-svn: 47368	2008-02-20 11:10:28 +00:00
Devang Patel	a74d2cbb6f	Add GetResultInst. First step for multiple return value support. llvm-svn: 47348	2008-02-19 22:15:16 +00:00
Andrew Lenharth	c178981b85	llvm.memory.barrier, and impl for x86 and alpha llvm-svn: 47204	2008-02-16 01:24:58 +00:00
Duncan Sands	0056f1e823	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Chris Lattner	a30946c576	In SDISel, for targets that support FORMAL_ARGUMENTS nodes, lower this node as soon as we create it in SDISel. Previously we would lower it in legalize. The problem with this is that it only exposes the argument loads implied by FORMAL_ARGUMENTs after legalize, so that only dag combine 2 can hack on them. This causes us to miss some optimizations because datatype expansion also happens here. Exposing the loads early allows us to do optimizations on them. For example we now compile arg-cast.ll to: _foo: movl $2147483647, %eax andl 8(%esp), %eax ret where we previously produced: _foo: subl $12, %esp movsd 16(%esp), %xmm0 movsd %xmm0, (%esp) movl $2147483647, %eax andl 4(%esp), %eax addl $12, %esp ret It might also make sense to do this for ISD::CALL nodes, which have implicit stores on many targets. llvm-svn: 47054	2008-02-13 07:39:09 +00:00
Duncan Sands	1122ee4ded	Generalize getCopyFromParts and getCopyToParts to handle arbitrary precision integers and any number of parts. For example, on a 32 bit machine an i50 corresponds to two i32 parts. getCopyToParts will extend the i50 to an i64 then write half of the i64 to each part; getCopyFromParts will combine the two i32 parts into an i64 then truncate the result to i50. llvm-svn: 47024	2008-02-12 20:46:31 +00:00
Duncan Sands	7916fcbe27	Generalize the handling of call and return arguments, in preparation for apint support. These changes are intended to have no functional effect. llvm-svn: 46967	2008-02-11 20:58:28 +00:00
Dan Gohman	cabaec582f	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Evan Cheng	c57ec111f2	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameters, etc. Added ISD::DECLARE node type to represent the llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into an SDNode and live on throughout the codegen passes. For now, since all the debugging information recording is done at isel time, when an ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Evan Cheng	d6222fc11d	Remove the nasty LABEL hack with a much less evil one. Now llvm.dbg.func.start implies a stoppoint is set. SelectionDAGISel records a new source line but does not create a ISD::LABEL node for this special stoppoint. Asm printer will magically print this label. This ensures nothing is emitted before. llvm-svn: 46635	2008-02-01 09:10:45 +00:00
Evan Cheng	705212577d	Add an extra operand to LABEL nodes which distinguishes between debug, EH, or misc labels. This fixes the EH breakage. However I am not convinced this is the solution. llvm-svn: 46609	2008-01-31 09:59:15 +00:00
Dan Gohman	3993809a0c	Rename ISD::FLT_ROUNDS to ISD::FLT_ROUNDS_ to avoid conflicting with the real FLT_ROUNDS (defined in <float.h>). llvm-svn: 46587	2008-01-31 00:41:03 +00:00
Evan Cheng	918b9c9335	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Dale Johannesen	f12104ce4b	Handle 'X' constraint in asm's better. llvm-svn: 46485	2008-01-29 02:21:21 +00:00
Chris Lattner	8c621a474e	fix long lines. llvm-svn: 46355	2008-01-25 17:24:52 +00:00
Evan Cheng	59cdf8bb16	Forgot these. llvm-svn: 46292	2008-01-24 00:22:01 +00:00
Chris Lattner	d033200a8f	* Introduce a new SelectionDAG::getIntPtrConstant method and switch various codegen pieces and the X86 backend over to using it. * Add some comments to SelectionDAGNodes.h * Introduce a second argument to FP_ROUND, which indicates whether the FP_ROUND changes the value of its input. If not it is safe to xform things like fp_extend(fp_round(x)) -> x. llvm-svn: 46125	2008-01-17 07:00:52 +00:00
Anton Korobeynikov	08ea121968	For PR1839: add initial support for __builtin_trap. The llvm-gcc part is still missing, as is PPC codegen. llvm-svn: 46001	2008-01-15 07:02:33 +00:00
Duncan Sands	423244deda	Remove the assumption that byval has been applied to a pointer to a struct. llvm-svn: 45939	2008-01-13 21:19:59 +00:00
Gordon Henriksen	88a41c672b	Enabling the target-independent garbage collection infrastructure by hooking it up to the various compiler pipelines. This doesn't actually add support for any GC algorithms, which means it temporarily breaks a few tests. To be fixed shortly. llvm-svn: 45669	2008-01-07 01:30:38 +00:00
Chris Lattner	96167aa93c	Rename SSARegMap -> MachineRegisterInfo in keeping with the idea that "machine" classes are used to represent the current state of the code being compiled. Given this expanded name, we can start moving other stuff into it. For now, move the UsedPhysRegs and LiveIn/LiveOuts vectors from MachineFunction into it. Update all the clients to match. This also reduces some needless #includes, such as MachineModuleInfo from MachineFunction. llvm-svn: 45467	2007-12-31 04:13:23 +00:00
Chris Lattner	c2f0543beb	use simplified operand addition methods. llvm-svn: 45436	2007-12-30 00:57:42 +00:00
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Duncan Sands	b2e0a67cc0	Simplify LowerCallTo by using a callsite. llvm-svn: 45198	2007-12-19 09:48:52 +00:00
Duncan Sands	09250d2dff	The C++ exception handling personality function wants to know about calls that cannot throw ('nounwind'): if such a call does throw for some reason then the personality will terminate the program. The distinction between an ordinary call and a nounwind call is that an ordinary call gets an entry in the exception table but a nounwind call does not. This patch sets up the exception table appropriately. One oddity is that I've chosen to bracket nounwind calls with labels (like invokes) - the other choice would have been to bracket ordinary calls with labels. While bracketing ordinary calls is more natural (because bracketing by labels would then correspond exactly to getting an entry in the exception table), I didn't do it because introducing labels impedes some optimizations and I'm guessing that ordinary calls occur more often than nounwind calls. This fixes the gcc filter2 eh test, at least at -O0 (the inliner needs some tweaking at higher optimization levels). llvm-svn: 45197	2007-12-19 07:36:31 +00:00
Duncan Sands	3a0d757bd5	Make invokes of inline asm legal. Teach codegen how to lower them (with no attempt made to be efficient, since they should only occur for unoptimized code). llvm-svn: 45108	2007-12-17 18:08:19 +00:00
Duncan Sands	1e2e4972ff	Rather than having special rules like "intrinsics cannot throw exceptions", just mark intrinsics with the nounwind attribute. Likewise, mark intrinsics as readnone/readonly and get rid of special aliasing logic (which didn't use anything more than this anyway). llvm-svn: 44544	2007-12-03 20:06:50 +00:00
Duncan Sands	1b0feb42e2	Add some convenience methods for querying attributes, and use them. llvm-svn: 44403	2007-11-28 17:07:01 +00:00
Duncan Sands	3602011bec	Fix PR1146: parameter attributes are no longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	ab5cd0b1c5	err, no really. llvm-svn: 44352	2007-11-27 06:14:32 +00:00
Chris Lattner	a2be558b75	don't depend on ADL. llvm-svn: 44351	2007-11-27 06:14:12 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Anton Korobeynikov	b6c3255d80	Implement necessary bits for flt_rounds gcc builtin. Codegen bits and llvm-gcc support will follow. llvm-svn: 44182	2007-11-15 23:25:33 +00:00
Duncan Sands	895e6284a9	This assertion was bogus. llvm-svn: 44167	2007-11-15 09:54:37 +00:00
Dale Johannesen	1f70f86c7a	Make labels work in asm blocks; allow labels as parameters. Rename ValueRefList to ParamList in AsmParser, since its only use is for parameters. llvm-svn: 43734	2007-11-05 21:20:28 +00:00
Dan Gohman	19d88d511b	Add std:: to sort calls. llvm-svn: 43652	2007-11-02 22:24:01 +00:00
Dan Gohman	26c8800fbd	Change illegal uses of ++ to uses of STLExtra.h's next function. llvm-svn: 43651	2007-11-02 22:22:02 +00:00
Duncan Sands	eb464e976f	Executive summary: getTypeSize -> getTypeStoreSize / getABITypeSize. The meaning of getTypeSize was not clear - clarifying it is important now that we have x86 long double and arbitrary precision integers. The issue with long double is that it requires 80 bits, and this is not a multiple of its alignment. This gives a primitive type for which getTypeSize differed from getABITypeSize. For arbitrary precision integers it is even worse: there is the minimum number of bits needed to hold the type (eg: 36 for an i36), the maximum number of bits that will be overwritten when storing the type (40 bits for i36) and the ABI size (i.e. the storage size rounded up to a multiple of the alignment; 64 bits for i36). This patch removes getTypeSize (not really - it is still there but deprecated to allow for a gradual transition). Instead there is: (1) getTypeSizeInBits - a number of bits that suffices to hold all values of the type. For a primitive type, this is the minimum number of bits. For an i36 this is 36 bits. For x86 long double it is 80. This corresponds to gcc's TYPE_PRECISION. (2) getTypeStoreSizeInBits - the maximum number of bits that is written when storing the type (or read when reading it). For an i36 this is 40 bits, for an x86 long double it is 80 bits. This is the size alias analysis is interested in (getTypeStoreSize returns the number of bytes). There doesn't seem to be anything corresponding to this in gcc. (3) getABITypeSizeInBits - this is getTypeStoreSizeInBits rounded up to a multiple of the alignment. For an i36 this is 64, for an x86 long double this is 96 or 128 depending on the OS. This is the spacing between consecutive elements when you form an array out of this type (getABITypeSize returns the number of bytes). This is TYPE_SIZE in gcc. Since successive elements in a SequentialType (arrays, pointers and vectors) need to be aligned, the spacing between them will be given by getABITypeSize.
This means that the size of an array is the length times the getABITypeSize. It also means that GEP computations need to use getABITypeSize when computing offsets. Furthermore, if an alloca allocates several elements at once then these too need to be aligned, so the size of the alloca has to be the number of elements multiplied by getABITypeSize. Logically speaking this doesn't have to be the case when allocating just one element, but it is simpler to also use getABITypeSize in this case. So alloca's and mallocs should use getABITypeSize. Finally, since gcc's only notion of size is that given by getABITypeSize, if you want to output assembler etc the same as gcc then getABITypeSize is the size you want. Since a store will overwrite no more than getTypeStoreSize bytes, and a read will read no more than that many bytes, this is the notion of size appropriate for alias analysis calculations. In this patch I have corrected all type size uses except some of those in ScalarReplAggregates, lib/Codegen, lib/Target (the hard cases). I will get around to auditing these too at some point, but I could do with some help. Finally, I made one change which I think wise but others might consider pointless and suboptimal: in an unpacked struct the amount of space allocated for a field is now given by the ABI size rather than getTypeStoreSize. I did this because every other place that reserves memory for a type (eg: alloca) now uses getABITypeSize, and I didn't want to make an exception for unpacked structs, i.e. I did it to make things more uniform. This only affects structs containing long doubles and arbitrary precision integers. If someone wants to pack these types more tightly they can always use a packed struct. llvm-svn: 43620	2007-11-01 20:53:16 +00:00
Bill Wendling	8d329ff809	- Remove the hacky code that forces a memcpy. Alignment is taken care of in the FE. - Explicitly pass in the alignment of the load & store. - XFAIL 2007-10-23-UnalignedMemcpy.ll because llc has a bug that crashes on unaligned pointers. llvm-svn: 43398	2007-10-26 20:24:42 +00:00
Bill Wendling	e5f534148e	Fix comment and use the "Size" variable that's already provided. llvm-svn: 43271	2007-10-23 23:36:57 +00:00
Bill Wendling	a420d660c8	If there's an unaligned memcpy to/from the stack, don't lower it. Just call the memcpy library function instead. llvm-svn: 43270	2007-10-23 23:32:40 +00:00
Bill Wendling	34950e1291	This broke lots. Reverting. llvm-svn: 43264	2007-10-23 22:04:26 +00:00
Bill Wendling	34c16a1b2d	Lowering a memcpy to the stack is killing PPC. The ARM and X86 backends already have their own custom memcpy lowering code. This code needs to be factored out into a target-independent lowering method with hooks to the backend. In the meantime, just call memcpy if we're trying to copy onto a stack. llvm-svn: 43262	2007-10-23 21:30:25 +00:00
Chris Lattner	45b8558ec5	rename ExpandOperation to ExpandOperationResult, as suggested by Duncan llvm-svn: 43177	2007-10-19 15:28:47 +00:00
Rafael Espindola	d8d4372845	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add an "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	f02434cdaf	add a new target hook. llvm-svn: 43165	2007-10-19 03:31:45 +00:00
Chris Lattner	452ebc199e	One mundane change: Change ReplaceAllUsesOfValueWith to optionally take a deleted nodes vector, instead of requiring it. One more significant change: Implement the start of a legalizer that just works on types. This legalizer is designed to run before the operation legalizer and ensure just that the input dag is transformed into an output dag whose operand and result types are all legal, even if the operations on those types are not. This design/impl has the following advantages: 1. When finished, this will significantly reduce the amount of code in LegalizeDAG.cpp. It will remove all the code related to promotion and expansion as well as splitting and scalarizing vectors. 2. The new code is very simple, idiomatic, and modular: unlike LegalizeDAG.cpp, it has no 3000 line long functions. :) 3. The implementation is completely iterative instead of recursive, good for hacking on large dags without blowing out your stack. 4. The implementation updates nodes in place when possible instead of deallocating and reallocating the entire graph that points to some mutated node. 5. The code nicely separates out handling of operations with invalid results from operations with invalid operands, making some cases simpler and easier to understand. 6. The new -debug-only=legalize-types option is very very handy :), allowing you to easily understand what legalize types is doing. This is not yet done. Until the ifdef added to SelectionDAGISel.cpp is enabled, this does nothing. However, this code is sufficient to legalize all of the code in 186.crafty, olden and freebench on an x86 machine. The biggest issues are: 1. Vectors aren't implemented at all yet 2. SoftFP is a mess, I need to talk to Evan about it. 3. No lowering to libcalls is implemented yet. 4. Various operations are missing etc. 5. There are FIXME's for stuff I hax0r'd out, like softfp. Hey, at least it is a step in the right direction :). 
If you'd like to help, just enable the #ifdef in SelectionDAGISel.cpp and compile code with it. If this explodes it will tell you what needs to be implemented. Help is certainly appreciated. Once this goes in, we can do three things: 1. Add a new pass of dag combine between the "type legalizer" and "operation legalizer" passes. This will let us catch some long-standing isel issues that we miss because operation legalization often obfuscates the dag with target-specific nodes. 2. We can rip out all of the type legalization code from LegalizeDAG.cpp, making it much smaller and simpler. When that happens we can then reimplement the core functionality left in it in a much more efficient and non-recursive way. 3. Once the whole legalizer is non-recursive, we can implement whole-function selectiondags maybe... llvm-svn: 42981	2007-10-15 06:10:22 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Dan Gohman	330b7915da	Fix some corner cases with vectors in copyToRegs and copyFromRegs. llvm-svn: 42907	2007-10-12 14:33:11 +00:00
Dan Gohman	ab5c3ed0d1	Add intrinsics for sin, cos, and pow. These use llvm_anyfloat_ty, and so may be overloaded with vector types. And add a testcase for codegen for these. llvm-svn: 42885	2007-10-12 00:01:22 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
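The eligibility conditions listed in the tail call commit above read naturally as a single predicate. The following is a rough Python sketch of that logic, not the backend's actual check: `Callee`, its fields and `is_tail_call_eligible` are hypothetical names introduced for illustration.

```python
from dataclasses import dataclass


@dataclass
class Callee:
    calling_conv: str   # e.g. "fastcc" or "ccc"
    in_module: bool     # callee is defined in the current module
    visibility: str     # "default", "protected" or "hidden"


def is_tail_call_eligible(caller_cc: str, callee: Callee, elf_pic_enabled: bool) -> bool:
    """Mirror the conditions from the commit message (hypothetical model)."""
    # Both caller and callee must use fastcc.
    if caller_cc != "fastcc" or callee.calling_conv != "fastcc":
        return False
    # Without ELF/PIC there is no further restriction.
    if not elf_pic_enabled:
        return True
    # With ELF/PIC the callee must be local and not preemptible.
    return callee.in_module and callee.visibility in ("protected", "hidden")
```

For example, a fastcc call to a hidden function in the same module qualifies with ELF/PIC enabled, while the same call to a default-visibility function does not.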
Dan Gohman	58512cb6e2	In -debug mode, dump SelectionDAGs both before and after the optimization passes. llvm-svn: 42749	2007-10-08 15:12:17 +00:00
Dale Johannesen	a4e3643cb3	Rewrite sqrt and powi to use anyfloat. By popular demand. llvm-svn: 42537	2007-10-02 17:43:59 +00:00
Dale Johannesen	d94f00234f	Fix stride computations for long double arrays. llvm-svn: 42508	2007-10-01 23:08:35 +00:00
Dale Johannesen	e61886cee4	Add sqrt and powi intrinsics for long double. llvm-svn: 42423	2007-09-28 01:08:20 +00:00
Dale Johannesen	69595b587f	Enable codegen for long double abs, sin, cos llvm-svn: 42368	2007-09-26 21:10:55 +00:00
Dale Johannesen	575bd6070a	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Duncan Sands	c358890f73	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Chris Lattner	ca656d2007	1. Don't call Value::getName(), which is slow. 2. Lower calls to fabs and friends to FABS nodes etc unless the function has internal linkage. Before we wouldn't lower if it had a definition, which is incorrect. This allows us to compile: define double @fabs(double %f) { %tmp2 = tail call double @fabs( double %f ) ret double %tmp2 } into: _fabs: fabs f1, f1 blr llvm-svn: 41805	2007-09-10 21:15:22 +00:00
Rafael Espindola	8c57e70f93	Add support for having different alignment for objects on call frames. The x86-64 ABI states that objects passed on the stack have 8 byte alignment. Implement that. llvm-svn: 41768	2007-09-07 14:52:14 +00:00
Anton Korobeynikov	899c0c9c8d	Split eh.select / eh.typeid.for intrinsics into i32/i64 versions. This is needed, because they just "mark" register liveins and we let frontend solve type issue, not lowering code :) llvm-svn: 41763	2007-09-07 11:39:35 +00:00
Dale Johannesen	86f367a6b7	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Duncan Sands	ab8eb598be	Fix PR1628. When exception handling is turned on, labels are generated bracketing each call (not just invokes). This is used to generate entries in the exception table required by the C++ personality. However it gets in the way of tail-merging. This patch solves the problem by no longer placing labels around ordinary calls. Instead we generate entries in the exception table that cover every instruction in the function that wasn't covered by an invoke range (the range given by the labels around the invoke). As an optimization, such entries are only generated for parts of the function that contain a call, since for the moment those are the only instructions that can throw an exception [1]. As a happy consequence, we now get a smaller exception table, since the same region can cover many calls. While there, I also implemented folding of invoke ranges - successive ranges are merged when safe to do so. Finally, if a selector contains only a cleanup, there's a special shorthand for it - place a 0 in the call-site entry. I implemented this while there. As a result, the exception table output (excluding filters) is now optimal - it cannot be made smaller [2]. The problem with throw filters is that folding them optimally is hard, and the benefit of folding them is minimal. [1] I tested that having trapping instructions (eg divide by zero) in such a region doesn't cause trouble. [2] It could be made smaller with the help of higher layers, eg by having branch folding reorder basic blocks ending in invokes with the same landing pad so they follow each other. I don't know if this is worth doing. llvm-svn: 41718	2007-09-05 11:27:52 +00:00
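The folding of invoke ranges mentioned in the commit above can be pictured as a single pass that merges neighbouring call-site entries. The sketch below is an illustration only: the `CallSite` record is hypothetical, and the merge condition (the ranges touch, and share the same landing pad and action) is an assumption about what "safe" means, since the message does not spell it out.

```python
from typing import List, NamedTuple, Optional


class CallSite(NamedTuple):
    begin: int                  # start offset of the covered region
    end: int                    # end offset (exclusive)
    landing_pad: Optional[str]  # None when there is no landing pad
    action: int                 # index into the action table


def fold_ranges(sites: List[CallSite]) -> List[CallSite]:
    """Merge successive entries that are contiguous and behave identically."""
    folded: List[CallSite] = []
    for site in sites:
        if folded:
            prev = folded[-1]
            same_behaviour = (prev.landing_pad == site.landing_pad
                              and prev.action == site.action)
            if prev.end == site.begin and same_behaviour:
                # Extend the previous entry instead of emitting a new one.
                folded[-1] = prev._replace(end=site.end)
                continue
        folded.append(site)
    return folded
```

Two back-to-back ranges that unwind to the same landing pad with the same action collapse into one table entry, which is the sense in which one region can cover many calls.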
Evan Cheng	bb21883dd3	Fix for PR1632. EHSELECTION always produces a i32 value. llvm-svn: 41712	2007-09-04 20:39:26 +00:00
Dan Gohman	cbb2ee9062	Add an option, -view-sunit-dags, for viewing the actual SUnit DAGs used by scheduling. llvm-svn: 41556	2007-08-28 20:32:58 +00:00
Dan Gohman	2e7e251f24	If the source and destination pointers in an llvm.memmove are known to not alias each other, it can be translated as an llvm.memcpy. llvm-svn: 41489	2007-08-27 16:26:13 +00:00
Duncan Sands	883740b39f	There is an impedance matching problem between LLVM and gcc exception handling: if an exception unwinds through an invoke, then execution must branch to the invoke's unwind target. We previously tried to enforce this by appending a cleanup action to every selector, however this does not always work correctly due to an optimization in the C++ unwinding runtime: if only cleanups would be run while unwinding an exception, then the program just terminates without actually executing the cleanups, as invoke semantics would require. I was hoping this wouldn't be a problem, but in fact it turns out to be the cause of all the remaining failures in the LLVM testsuite (these also fail with -enable-correct-eh-support, so turning on -enable-eh didn't make things worse!). Instead we need to append a full-blown catch-all to the end of each selector. The correct way of doing this depends on the personality function, i.e. it is language dependent, so can only be done by gcc. Thus this patch which generalizes the eh.selector intrinsic so that it can handle all possible kinds of action table entries (before it didn't accommodate cleanups): now 0 indicates a cleanup, and filters have to be specified using the number of type infos plus one rather than the number of type infos. Related gcc patches will cause Ada to pass a cleanup (0) to force the selector to always fire, while C++ will use a C++ catch-all (null). llvm-svn: 41484	2007-08-27 15:47:50 +00:00
Chris Lattner	1e089aac3a	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Anton Korobeynikov	9451c871c5	Perform correct codegen for eh_dwarf_cfa intrinsic. llvm-svn: 41316	2007-08-23 07:21:06 +00:00
Rafael Espindola	68d95ff2b1	Partial implementation of calling functions with byval arguments: the needed information is propagated to the DAG, and the X86-64 backend detects it and aborts. llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Evan Cheng	9a05381a81	- If a dynamic_stackalloc alignment requirement is <= stack alignment, then the alignment argument is ignored. - Always round up the size of the allocation to multiples of stack alignment to ensure the stack ptr is never left in an invalid state after a dynamic_stackalloc. llvm-svn: 41132	2007-08-16 23:46:29 +00:00
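The two rules in the dynamic_stackalloc commit above amount to a small piece of arithmetic. Here is a minimal Python sketch of it, assuming sizes and alignments are given in bytes; `plan_dynamic_alloca` is a hypothetical helper, not a function from the code generator.

```python
from typing import Optional, Tuple


def round_up(value: int, multiple: int) -> int:
    """Round value up to the nearest multiple of `multiple`."""
    return -(-value // multiple) * multiple


def plan_dynamic_alloca(size: int, requested_align: int,
                        stack_align: int) -> Tuple[int, Optional[int]]:
    """Return (bytes to reserve, extra alignment to enforce or None)."""
    # Always reserve a multiple of the stack alignment so the stack
    # pointer stays validly aligned after the allocation.
    reserved = round_up(size, stack_align)
    # A request no stricter than the stack alignment needs no extra work.
    extra_align = requested_align if requested_align > stack_align else None
    return reserved, extra_align
```

With a 16-byte stack alignment, a 20-byte request is padded to 32 bytes, and an alignment request of 8 is simply dropped.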
Dan Gohman	f18e94535f	Fix EXTRACT_ELEMENT, EXTRACT_SUBVECTOR, and EXTRACT_VECTOR_ELT to use an intptr ValueType instead of i32 for the index operand in getCopyToParts. llvm-svn: 40987	2007-08-10 14:59:38 +00:00
Rafael Espindola	b20b9e985a	propagate struct size and alignment of byval arguments to the DAG llvm-svn: 40986	2007-08-10 14:44:42 +00:00
Chandler Carruth	00e56b0e81	This is the patch to provide clean intrinsic function overloading support in LLVM. It cleans up the intrinsic definitions and generally smooths the process for more complicated intrinsic writing. It will be used by the upcoming atomic intrinsics as well as vector and float intrinsics in the future. This also changes the syntax for llvm.bswap, llvm.part.set, llvm.part.select, and llvm.ct* intrinsics. They are automatically upgraded by both the LLVM ASM reader and the bitcode reader. The test cases have been updated, with special tests added to ensure the automatic upgrading is supported. llvm-svn: 40807	2007-08-04 01:51:18 +00:00
Chris Lattner	9319dfc93a	don't redefine a parameter llvm-svn: 40748	2007-08-02 18:08:16 +00:00
Dan Gohman	375d541183	Fix a bug in getCopyFromParts turned up in the testcase for PR1132. llvm-svn: 40598	2007-07-30 19:09:17 +00:00
Duncan Sands	e8bb2c6d32	Support for trampolines, except for X86 codegen which is still under discussion. llvm-svn: 40549	2007-07-27 12:58:54 +00:00
Dan Gohman	1444c5840b	Add const to CanBeFoldedBy, CheckAndMask, and CheckOrMask. llvm-svn: 40480	2007-07-24 23:00:27 +00:00
Dan Gohman	4c140b7128	It's not necessary to do rounding for alloca operations when the requested alignment is equal to the stack alignment. llvm-svn: 40004	2007-07-18 16:29:46 +00:00
Dan Gohman	0ba554c0c8	Fix comments about vectors to use the current wording. llvm-svn: 39921	2007-07-16 14:29:03 +00:00
Anton Korobeynikov	5635277c36	Long live exception handling! This patch fills in the last bits needed to enable exception handling in LLVM, currently only on x86-32/linux. In fact, this patch adds the necessary intrinsics (and their lowering), which represent really weird target-specific gcc builtins used inside the unwinder. Once the corresponding llvm-gcc patch lands (easy), exceptions should be more or less workable. However, exception handling support should not be thought of as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dale Johannesen	469ed8e17e	Skeleton of post-RA scheduler; doesn't do anything yet. Change name of -sched option and DEBUG_TYPE to pre-RA-sched; adjust testcases. llvm-svn: 39816	2007-07-13 17:13:54 +00:00
Dan Gohman	81cfdc2f19	Change getCopyToParts and getCopyFromParts to always use target-endian register ordering, for both physical and virtual registers. Update the PPC target lowering for calls to expect registers for the call result to already be in target order. llvm-svn: 38471	2007-07-09 20:59:04 +00:00
Duncan Sands	f926a3080c	The exception handling intrinsics return values, so must be lowered to a value, not nothing at all. Subtle point: I made eh_selector return 0 and eh_typeid_for return 1. This means that only cleanups (destructors) will be run as the exception unwinds [if eh_typeid_for returned 0 then it would be as if the first catch always matched, and the corresponding handler would be run], which is probably what you want in the CBE. llvm-svn: 37947	2007-07-06 14:46:23 +00:00
Rafael Espindola	7b3de98989	Add the byval attribute llvm-svn: 37940	2007-07-06 10:57:03 +00:00
Duncan Sands	7e50c11edd	Remove propagateEHRegister in favour of a more limited fix, that is adequate while PR1508 remains unresolved. llvm-svn: 37938	2007-07-06 09:18:59 +00:00
Duncan Sands	e7650d2b1e	Remove ExtractGlobalVariable - use StripPointerCasts instead. llvm-svn: 37937	2007-07-06 09:10:03 +00:00
Evan Cheng	bae19254f0	Workaround of getCopyToRegs and getCopyFromRegs bugs for big-endian machines. llvm-svn: 37935	2007-07-06 01:47:35 +00:00
Dan Gohman	90c6b87b31	Add a parameter to getCopyToParts and getCopyFromParts to specify whether endian swapping should be done, and update the code to use it. This fixes some register ordering issues on big-endian systems, such as PowerPC, introduced by the recent illegal by-val arguments changes. llvm-svn: 37921	2007-07-05 20:12:34 +00:00
Duncan Sands	4441eff1ac	Extend eh.selector to support both catches and filters. Drop the eh.filter intrinsic. llvm-svn: 37875	2007-07-04 20:52:51 +00:00
Dale Johannesen	7af19491d3	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvements forthcoming. llvm-svn: 37847	2007-07-03 00:53:03 +00:00
Dan Gohman	68f0cbccfb	Replace ExpandScalarFormalArgs and ExpandScalarCallArgs with the newly refactored getCopyFromParts and getCopyToParts, which are more general. This effectively adds support for lowering illegal by-val vector call arguments. llvm-svn: 37843	2007-07-02 16:18:06 +00:00
Evan Cheng	a1a06d1763	Only do FNEG xform when the vector type is a floating point type. llvm-svn: 37818	2007-06-29 21:44:35 +00:00
David Greene	0942edd414	Remove unnecessary attributions in comments. llvm-svn: 37799	2007-06-29 03:42:23 +00:00
David Greene	df6a87ea1c	Fix reference to cached end iterator invalidated by an erase operation. Uncovered by _GLIBCXX_DEBUG. llvm-svn: 37795	2007-06-29 02:49:11 +00:00
Dan Gohman	c6bdcfa8c0	Add new TargetLowering code to provide the final register type that an illegal value type will be transformed to, for code that needs the register type after all transformations instead of just after the first transformation. Factor out the code that uses this information to do copy-from-regs and copy-to-regs for various purposes into separate functions so that they are done consistently. llvm-svn: 37781	2007-06-28 23:29:44 +00:00
Evan Cheng	26542e347a	Partial fix for PR1502: If a EH register is needed in a successor of landing pad, add it as livein to all the blocks in the paths between the landing pad and the specified block. llvm-svn: 37763	2007-06-27 18:45:32 +00:00
Dan Gohman	c1c4b0972f	Use getVectorTypeBreakdown in FunctionLoweringInfo::CreateRegForValue to compute the number and type of registers needed for vector values instead of computing it manually. This fixes PR1529. llvm-svn: 37755	2007-06-27 14:34:07 +00:00
Dan Gohman	354f02e03d	Generalize MVT::ValueType and associated functions to be able to represent extended vector types. Remove the special SDNode opcodes used for pre-legalize vector operations, and the special MVT::Vector type used with them. Adjust lowering and legalize to work with the normal SDNode kinds instead, and to use the normal MVT functions to work with vector types instead of using the two special operands that the pre-legalize nodes held. This allows pre-legalize and post-legalize DAGs, and the code that operates on them, to be more consistent. Pre-legalize vector operators can be handled more consistently with scalar operators. And, -view-dag-combine1-dags and -view-legalize-dags now look prettier for vector code. llvm-svn: 37719	2007-06-25 16:23:39 +00:00
Dan Gohman	a62327ea40	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Dan Gohman	1815e9bdb3	Rename TargetLowering::getNumElements and friends to TargetLowering::getNumRegisters and similar, to avoid confusion with the actual number of elements for vector types. llvm-svn: 37687	2007-06-21 14:42:22 +00:00
Tanya Lattner	4c078c1ace	Codegen support (stripped out) for the annotate attribute. llvm-svn: 37608	2007-06-15 22:26:58 +00:00
Chris Lattner	c08e8abb53	Fix CodeGen/X86/inline-asm-x-scalar.ll:test4, by retaining regclass info for tied register constraints. llvm-svn: 37601	2007-06-15 19:11:01 +00:00
Duncan Sands	deef6fe78b	Workaround for PR1508. llvm-svn: 37597	2007-06-15 19:04:19 +00:00
Dan Gohman	2fd7d26df8	Rename MVT::getVectorBaseType to MVT::getVectorElementType. llvm-svn: 37579	2007-06-14 22:58:02 +00:00
Duncan Sands	51e6294637	Only correctly lower exception handling intrinsics if exception handling is turned on. Likewise for scanning of invokes to mark landing pads. llvm-svn: 37570	2007-06-13 16:53:21 +00:00
Dan Gohman	6415b2548e	Introduce new SelectionDAG node opcodes VEXTRACT_SUBVECTOR and VCONCAT_VECTORS. Use these for CopyToReg and CopyFromReg legalizing in the case that the full register is to be split into subvectors instead of scalars. This replaces uses of VBIT_CONVERT to present values as vector-of-vector types in order to make whole subvectors accessible via BUILD_VECTOR and EXTRACT_VECTOR_ELT. This is in preparation for adding extended ValueType values, where having vector-of-vector types is undesirable. llvm-svn: 37569	2007-06-13 15:12:02 +00:00
Dan Gohman	3bc1455d49	When creating CopyFromReg nodes, always use legal types. And use the correct types for the result vector, even though it is currently bitcasted to a different type immediately. llvm-svn: 37568	2007-06-13 14:55:16 +00:00
Duncan Sands	32eaa9b30a	The fix that was applied for PR1224 stops the compiler crashing but breaks exception handling. The problem described in PR1224 is that invoke is a terminator that can produce a value. The value may be needed in other blocks. The code that writes to registers values needed in other blocks runs before terminators are lowered (in this case invoke) so asserted because the value was not yet available. The fix that was applied was to do invoke lowering earlier, before writing values to registers. The problem this causes is that the code to copy values to registers can be output after the invoke call. If an exception is raised and control is passed to the landing pad then this copy-code will never execute. If the value is needed in some code path reached via the landing pad then that code will get something bogus. So revert the original fix and simply skip invoke values in the general copying to registers code. Instead copy the invoke value to a register in the invoke lowering code. llvm-svn: 37567	2007-06-13 05:51:31 +00:00
Dale Johannesen	0f3c011fd5	Do not change the size of function arguments. PR 1489. llvm-svn: 37496	2007-06-07 21:07:15 +00:00
Duncan Sands	484bdb927b	Additional fix for PR1422: make sure the landing pad label is placed in the correct machine basic block - do not rely on the eh.exception intrinsic being in the landing pad: the loop optimizers can move it out. llvm-svn: 37463	2007-06-06 10:05:18 +00:00
Duncan Sands	37925d3f39	Integrate exception filter support and exception catch support. This simplifies the code in DwarfWriter, allows for multiple filters and makes it trivial to specify filters accompanied by cleanups or catch-all specifications (see next patch). What a deal! Patch blessed by Anton. llvm-svn: 37398	2007-06-02 16:53:42 +00:00
Duncan Sands	b1cc3f9881	Since TypeInfos are passed as i8 pointers, a NULL TypeInfo should be passed as a null i8 pointer not as a 0 i32. llvm-svn: 37383	2007-06-01 08:18:30 +00:00
Dan Gohman	2c5d31ee81	Minor comment cleanups. llvm-svn: 37321	2007-05-24 14:36:04 +00:00
Anton Korobeynikov	0f184e86ab	Mark all calls as "could throw", when exceptions are enabled. Emit necessary LP info too. This fixes PR1439 llvm-svn: 37311	2007-05-23 11:08:31 +00:00
Dan Gohman	d6a33914fb	Qualify several calls to functions in the MVT namespace, for consistency. llvm-svn: 37230	2007-05-18 17:52:13 +00:00
Chris Lattner	ba648e0d45	Fix some subtle issues handling immediate values. This fixes test/CodeGen/ARM/2007-05-14-InlineAsmCstCrash.ll llvm-svn: 37069	2007-05-15 01:33:58 +00:00
Anton Korobeynikov	4423f1a3fd	Do not assert, when case range split metric is zero and JTs are not allowed: just emit binary tree in this case. This fixes PR1403. llvm-svn: 36959	2007-05-09 20:07:08 +00:00
Duncan Sands	20a9ed0e20	Parameter attributes on invoke calls were being lost due to the wrong attribute index being used. Fix proposed by Anton Korobeynikov, who asked me to implement and commit it for him. This is PR1398. llvm-svn: 36906	2007-05-07 20:49:28 +00:00
Anton Korobeynikov	3765489a61	Detabify llvm-svn: 36891	2007-05-06 20:14:21 +00:00
Duncan Sands	f2323ca89c	A bitcast of a global variable may have been constant folded to a GEP - handle this case too. llvm-svn: 36745	2007-05-04 17:12:26 +00:00
Devang Patel	cd45427a87	Drop 'const' llvm-svn: 36662	2007-05-03 01:11:54 +00:00
Anton Korobeynikov	ca71b83e50	Properly set arguments bitwidth of EHSELECT node llvm-svn: 36654	2007-05-02 22:15:48 +00:00
Devang Patel	8ee9065162	Use 'static const char' instead of 'static const int'. Due to a darwin gcc bug, one version of the darwin linker coalesces static const int, which defeats PassID-based pass identification. llvm-svn: 36652	2007-05-02 21:39:20 +00:00
Devang Patel	38a66bc82e	Do not use typeinfo to identify pass in pass manager. llvm-svn: 36632	2007-05-01 21:15:47 +00:00
Chris Lattner	3b2c717e6a	Continue refactoring inline asm code. If there is an earlyclobber output register, preallocate all input registers and the early clobbered output. This fixes PR1357 and CodeGen/PowerPC/2007-04-30-InlineAsmEarlyClobber.ll llvm-svn: 36599	2007-04-30 21:11:17 +00:00
Chris Lattner	4ac6ba02d1	refactor GetRegistersForValue to take OpInfo as an argument instead of various pieces of it. No functionality change. llvm-svn: 36592	2007-04-30 17:29:31 +00:00
Chris Lattner	e3f7f2afcb	refactor some code, no functionality change llvm-svn: 36590	2007-04-30 17:16:27 +00:00
Chris Lattner	087a1aaaab	generalize aggregate handling llvm-svn: 36568	2007-04-29 18:58:03 +00:00
Chris Lattner	85e1cac6b7	memory operands that have a direct operand should have their stores created before the copies into physregs are done. This avoids having flag operands skip the store, causing cycles in the dag at sched time. This fixes infinite loops on these tests: test/CodeGen/Generic/2007-04-08-MultipleFrameIndices.ll for PR1308 test/CodeGen/PowerPC/2007-01-29-lbrx-asm.ll test/CodeGen/PowerPC/2007-01-31-InlineAsmAddrMode.ll test/CodeGen/X86/2006-07-12-InlineAsmQConstraint.ll for PR828 llvm-svn: 36547	2007-04-28 21:12:06 +00:00
Chris Lattner	97d396d928	eliminate more redundant constraint type analysis llvm-svn: 36546	2007-04-28 21:03:16 +00:00
Chris Lattner	4153538017	merge constraint type analysis stuff together. llvm-svn: 36545	2007-04-28 21:01:43 +00:00
Chris Lattner	0e0d379b5d	Significant refactoring of the inline asm stuff, to support future changes. No functionality change. llvm-svn: 36544	2007-04-28 20:49:53 +00:00
Chris Lattner	4c7178e326	memory inputs to an inline asm are required to have an address available. If the operand is not already an indirect operand, spill it to a constant pool entry or a stack slot. This fixes PR1356 and CodeGen/X86/2007-04-27-InlineAsm-IntMemInput.ll llvm-svn: 36536	2007-04-28 06:42:38 +00:00
Chris Lattner	0e2a4a7890	Fix CodeGen/Generic/2007-04-27-LargeMemObject.ll and CodeGen/Generic/2007-04-27-InlineAsm-X-Dest.ll llvm-svn: 36534	2007-04-28 06:08:13 +00:00
Chris Lattner	0a0983b493	Fix this to match change to InlineAsm class. llvm-svn: 36524	2007-04-28 04:05:59 +00:00
Chris Lattner	eef11c75d4	improve EH global handling, patch by Duncan Sands. llvm-svn: 36499	2007-04-27 01:20:11 +00:00
Chris Lattner	92bff16acb	enable Anton's shift/and switch lowering stuff! It now passes ppc bootstrap successfully! woohoo... llvm-svn: 36496	2007-04-26 21:09:43 +00:00
Anton Korobeynikov	3e0f9076d0	Fix off-by-one bug, which prevents llvm-gcc bootstrap on ppc32 llvm-svn: 36490	2007-04-26 20:44:04 +00:00
Evan Cheng	c713a5ea36	This was left out. Fixed sumarray-dbl. llvm-svn: 36445	2007-04-25 18:33:21 +00:00
Chris Lattner	8860742825	allow support for 64-bit stack objects llvm-svn: 36420	2007-04-25 04:08:28 +00:00
Bill Wendling	b3b0427654	Assertion when using a 1-element vector for an add operation. Get the real vector type in this case. llvm-svn: 36402	2007-04-24 21:13:23 +00:00
Scott Michel	5a33297ae3	Use '-1U' where '-1UL' is obvious overkill, eliminating gcc warnings about tests always being true in the process. llvm-svn: 36387	2007-04-24 01:24:20 +00:00
Christopher Lamb	a157874a8a	PR400 phase 2. Propagate attributed load/store information through DAGs. llvm-svn: 36356	2007-04-22 23:15:30 +00:00
Reid Spencer	81070d52da	Revert Christopher Lamb's load/store alignment changes. llvm-svn: 36309	2007-04-21 18:36:27 +00:00
Christopher Lamb	b56b6a7ad7	add support for alignment attributes on load/store instructions llvm-svn: 36301	2007-04-21 08:16:25 +00:00
Chris Lattner	357a11fcbb	disable switch lowering using shift/and. It still breaks ppc bootstrap for some reason. :( Will investigate. llvm-svn: 36011	2007-04-14 19:39:41 +00:00
Anton Korobeynikov	bdb4f560da	Fix PR1325: Case range optimization was performed in a case where it shouldn't have been. Also fix a "latent" bug on 64-bit platforms llvm-svn: 35990	2007-04-14 13:25:55 +00:00
Chris Lattner	6e71d21892	disable shift/and lowering to work around PR1325 for now. llvm-svn: 35985	2007-04-14 02:26:56 +00:00
Anton Korobeynikov	5bb6590218	Fix PR1323 : we weren't updating phi nodes properly :) llvm-svn: 35963	2007-04-13 06:53:51 +00:00
Chris Lattner	0da8de5848	the result of an inline asm copy can be an arbitrary VT that the register class supports. In the case of vectors, this means we often get the wrong type (e.g. we get v4f32 instead of v8i16). Make sure to convert the vector result to the right type. This fixes CodeGen/X86/2007-04-11-InlineAsmVectorResult.ll llvm-svn: 35944	2007-04-12 06:00:20 +00:00
Reid Spencer	82da0eb67c	For PR1284: Implement the "part_set" intrinsic. llvm-svn: 35938	2007-04-12 02:48:46 +00:00
Reid Spencer	2792e203c5	For PR1146: Put the parameter attributes in their own ParamAttr name space. Adjust the rest of llvm as a result. llvm-svn: 35877	2007-04-11 02:44:20 +00:00
Chris Lattner	e2444f7ec8	apparently some people commit without building the tree, or they forget to commit a LOT of files. llvm-svn: 35858	2007-04-10 03:20:39 +00:00
Jeff Cohen	bd7d060e79	No longer needed. llvm-svn: 35850	2007-04-09 23:42:32 +00:00
Anton Korobeynikov	6e6b2d493a	Use integer log for metric calculation llvm-svn: 35834	2007-04-09 21:57:03 +00:00
Jeff Cohen	b3d61e6c05	Unbreak VC++ build. llvm-svn: 35817	2007-04-09 14:32:59 +00:00
Anton Korobeynikov	6ee97ee42a	Next stage into switch lowering refactoring 1. Fix some bugs in the jump table lowering threshold 2. Implement much better metric for optimal pivot selection 3. Tune thresholds for different lowering methods 4. Implement shift-and trick for lowering small (<machine word length) cases with few destinations. Good testcase will follow. llvm-svn: 35816	2007-04-09 12:31:58 +00:00
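The shift-and trick named in item 4 of the commit above replaces a chain of comparisons with one bitmask test per destination. The sketch below shows the general idea in Python rather than the code the patch emits: `build_masks` and `dispatch` are hypothetical helpers, case values are assumed to lie in the range 0 up to the machine word length, and a 32-bit word is assumed by default.

```python
from typing import Dict


def build_masks(cases: Dict[int, str], word_bits: int = 32) -> Dict[str, int]:
    """Build one bitmask per destination; bit v is set if case v goes there."""
    masks: Dict[str, int] = {}
    for value, dest in cases.items():
        if not 0 <= value < word_bits:
            raise ValueError("case value does not fit in a machine word mask")
        masks[dest] = masks.get(dest, 0) | (1 << value)
    return masks


def dispatch(x: int, masks: Dict[str, int], default: str, word_bits: int = 32) -> str:
    """Select a destination with a shift and one AND per destination."""
    if not 0 <= x < word_bits:
        return default              # the range check guards the shift
    bit = 1 << x
    for dest, mask in masks.items():
        if bit & mask:
            return dest
    return default


masks = build_masks({0: "A", 2: "A", 5: "B"})
print(dispatch(2, masks, "default"), dispatch(3, masks, "default"))
```

The number of tests grows with the number of destinations rather than the number of case values, which is why the trick suits switches with few destinations.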
Reid Spencer	2660b8dccb	For PR1146: Adapt handling of parameter attributes to use the new ParamAttrsList class. llvm-svn: 35814	2007-04-09 06:17:21 +00:00
Chris Lattner	13a530ec7f	implement CodeGen/X86/inline-asm-x-scalar.ll:test3 llvm-svn: 35802	2007-04-09 05:31:20 +00:00
Chris Lattner	5f8b0c2acc	Fix PR1316 llvm-svn: 35783	2007-04-09 00:33:58 +00:00
Chris Lattner	f9506a185b	Fix for CodeGen/X86/2007-04-08-InlineAsmCrash.ll and PR1314 llvm-svn: 35779	2007-04-08 22:23:26 +00:00
Chris Lattner	b39a2df066	minor comment fix llvm-svn: 35696	2007-04-06 17:47:14 +00:00
Reid Spencer	aad0b4536b	Change the bit_part_select (non)implementation from "return 0" to abort. llvm-svn: 35679	2007-04-05 01:20:18 +00:00
Reid Spencer	6eb55df794	Implement the llvm.bit.part_select.iN.iN.iN overloaded intrinsic. llvm-svn: 35678	2007-04-04 23:48:25 +00:00
Anton Korobeynikov	e16f421e0e	Properly emit range comparisons for switch cases, where neighbour cases go to the same destination. Now we're producing really good code for switch-lower-feature.ll testcase llvm-svn: 35672	2007-04-04 21:14:49 +00:00
Reid Spencer	4a28a16efb	For PR1297: Adjust for changes in the bit counting intrinsics. They all return i32 now so we have to trunc/zext the DAG node accordingly. llvm-svn: 35546	2007-04-01 07:34:11 +00:00
Chris Lattner	f01b0a800b	move a bunch of code out of the sdisel pass into its own opt pass "codegenprepare". llvm-svn: 35529	2007-03-31 04:18:03 +00:00
Evan Cheng	13cc34e91b	Scale 1 is always ok. llvm-svn: 35407	2007-03-28 01:55:52 +00:00
Evan Cheng	6056fd729d	GEP index sinking fixes: 1) Take address scale into consideration. e.g. i32* -> scale 4. 2) Examine all the users of GEP. 3) Generalize to inter-block GEP's (no longer uses loopinfo). 4) Don't do xform if GEP has other variable index(es). llvm-svn: 35403	2007-03-28 01:49:39 +00:00
Anton Korobeynikov	64622a0ddf	Remove dead code llvm-svn: 35380	2007-03-27 12:05:48 +00:00
Anton Korobeynikov	b58a93156f	Split big monster into small helpers. No functionality change. llvm-svn: 35379	2007-03-27 11:29:11 +00:00
Evan Cheng	7218d782fe	SDISel does not preserve all, it changes CFG and other info. llvm-svn: 35376	2007-03-27 00:53:36 +00:00
Anton Korobeynikov	6f78c59650	First step of switch lowering refactoring: perform worklist-driven strategy, emit JT's where possible. llvm-svn: 35338	2007-03-25 15:07:15 +00:00
Chris Lattner	6f17a615cb	Implement support for vector operands to inline asm, implementing CodeGen/X86/2007-03-24-InlineAsmVectorOp.ll llvm-svn: 35332	2007-03-25 05:00:54 +00:00
Chris Lattner	b19069959d	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Dan Gohman	d0a0ea9916	Change uses of Function::front to Function::getEntryBlock for readability. llvm-svn: 35265	2007-03-22 16:38:57 +00:00
Evan Cheng	fe301e0f29	Minor bug. llvm-svn: 35219	2007-03-20 19:32:11 +00:00
Evan Cheng	65d69fe08d	Use SmallSet instead of std::set. llvm-svn: 35133	2007-03-17 08:53:30 +00:00
Evan Cheng	8552300ab1	If sdisel has decided to sink GEP index expression into any BB. Replace all uses in that BB. llvm-svn: 35132	2007-03-17 08:22:49 +00:00
Evan Cheng	77099bef05	Turn on GEP index sinking by default. llvm-svn: 35127	2007-03-16 18:32:30 +00:00
Evan Cheng	449900b988	Stupid bug. llvm-svn: 35126	2007-03-16 17:50:20 +00:00
Evan Cheng	c3e7d4b884	Sink a binary expression into its use blocks if it is a loop invariant computation used as GEP indexes and if the expression can be folded into target addressing mode of GEP load / store use types. llvm-svn: 35123	2007-03-16 08:46:27 +00:00
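
Note on the switch-lowering entries above: commit e16f421e0e mentions range comparisons for neighbouring cases that share a destination, and commit 6ee97ee42a mentions a shift-and trick for small case sets with few destinations. The sketch below illustrates both ideas in plain C++. It is not taken from the LLVM sources; the helper names `inRange` and `hitsMask`, the 32-bit value type, and the 64-bit mask width are assumptions made for the example.

```cpp
#include <cstdint>

// Hypothetical helper: tests lo <= x <= hi with one unsigned comparison.
// Assumes lo <= hi. Unsigned subtraction wraps, so any x below lo
// turns into a large value and fails the comparison.
bool inRange(uint32_t x, uint32_t lo, uint32_t hi) {
  return (x - lo) <= (hi - lo);
}

// Hypothetical helper: "shift-and" membership test. Bit i of mask is set
// when case value i jumps to the destination being tested. Only case
// values below 64 can be represented in this sketch.
bool hitsMask(uint32_t x, uint64_t mask) {
  return x < 64 && ((uint64_t{1} << x) & mask) != 0;
}
```

For example, cases 3 through 7 sharing one destination could be checked with `inRange(x, 3, 7)`, and cases 1 and 3 sharing a destination with `hitsMask(x, 0b1010)`, instead of one comparison per case.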

834 Commits