llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Dan Gohman	012abf0109	Convert MaskedValueIsZero and all its users to use APInt. Also add a SignBitIsZero function to simplify a common use case. llvm-svn: 47561	2008-02-25 21:11:39 +00:00
Dale Johannesen	574bb5e1e2	Expand removal of MMX memory copies to allow 1 level of TokenFactor underneath chain (seems to be enough) llvm-svn: 47554	2008-02-25 19:20:14 +00:00
Bill Wendling	a369a6add8	Some platforms use the same name for 32-bit and 64-bit registers (like %r3 on PPC) in their ASM files. However, it's hard for humans to read during debugging. Adding a new field to the register data that lets you specify a different name to be printed than the one that goes into the ASM file -- %x3 instead of %r3, for instance. llvm-svn: 47534	2008-02-24 00:56:13 +00:00
Scott Michel	36bbc39272	Merge current work back to tree to minimize diffs and drift. Major highlights for CellSPU modifications: - SPUInstrInfo.td refactoring: "multiclass" really is _your_ friend. - Other improvements based on refactoring effort in SPUISelLowering.cpp, esp. in SPUISelLowering::PerformDAGCombine(), where zero amount shifts and rotates are now eliminiated, other scalar-to-vector-to-scalar silliness is also eliminated. - 64-bit operations are being implemented, _muldi3.c gcc runtime now compiles and generates the right code. More work still needs to be done. llvm-svn: 47532	2008-02-23 18:41:37 +00:00
Evan Cheng	d3e9e78aa0	Turning on remat of pic loads. llvm-svn: 47524	2008-02-23 02:07:42 +00:00
Evan Cheng	97ca29e99a	No need recognize load from a fixed argument slot as re-materializable. LiveIntervalAnalysis already handles it as a special case. llvm-svn: 47522	2008-02-23 01:47:44 +00:00
Dale Johannesen	ae08bdb4cf	Split ParameterAttributes.h, putting the complicated stuff into ParamAttrsList.h. Per feedback from ParamAttrs changes. llvm-svn: 47504	2008-02-22 22:17:59 +00:00
Dale Johannesen	23a6f7fc1f	MMX vectors are passed 4-byte aligned. llvm-svn: 47483	2008-02-22 17:47:28 +00:00
Evan Cheng	1b417c4d84	Allow re-materialization of pic load (controlled by -remat-pic-load for now). llvm-svn: 47476	2008-02-22 09:25:47 +00:00
Chris Lattner	a64d4179d4	copy mmx values from/to memory with GPRs on x86-32 instead of with mmx registers. This horribleness is apparently done by gcc to avoid having to insert emms in places that really should have it. This is the second half of rdar://5741668. llvm-svn: 47474	2008-02-22 05:18:04 +00:00
Chris Lattner	e70bc39d74	Start using GPR's to copy around mmx value instead of mmx regs. GCC apparently does this, and code depends on not having to do emms when this happens. This is x86-64 only so far, second half should handle x86-32. rdar://5741668 llvm-svn: 47470	2008-02-22 02:09:43 +00:00
Eli Friedman	123fc4b97d	A few minor updates, removing implemented stuff and adding a couple of new things. llvm-svn: 47458	2008-02-21 21:16:49 +00:00
Chris Lattner	b25a98e364	Dan implemented one multiply issue. Replace it with another. :) llvm-svn: 47431	2008-02-21 06:51:29 +00:00
Andrew Lenharth	db9cd46f5d	Atomic op support. If any gcc test uses __sync builtins, it might start failing on archs that haven't implemented them yet llvm-svn: 47430	2008-02-21 06:45:13 +00:00
Evan Cheng	f3a7cd1c62	Poorly named option. llvm-svn: 47400	2008-02-20 20:57:32 +00:00
Evan Cheng	51789192ce	Temporarily backing out r47337. It breaks a number of CBE tests. llvm-svn: 47385	2008-02-20 18:32:05 +00:00
Anton Korobeynikov	c41f5b6af4	Fix newly-introduced 4.3 warnings llvm-svn: 47375	2008-02-20 12:07:57 +00:00
Anton Korobeynikov	9b1c5f2cac	Fix code style llvm-svn: 47370	2008-02-20 11:24:05 +00:00
Anton Korobeynikov	4f6e612973	Remove bunch of gcc 4.3-related warnings from Target llvm-svn: 47369	2008-02-20 11:22:39 +00:00
Anton Korobeynikov	0c5e186924	Unbreak build with gcc 4.3: provide missed includes and silence most annoying warnings. llvm-svn: 47367	2008-02-20 11:08:44 +00:00
Evan Cheng	e9708c997f	Disable for now. This is pessimizing code. llvm-svn: 47354	2008-02-20 02:29:17 +00:00
Evan Cheng	35253f2c22	Add hidden option -x86-fold-and-in-test to test the effect the test / and folding change. llvm-svn: 47351	2008-02-19 23:36:51 +00:00
Andrew Lenharth	8e5c7e0bd9	fix some byval problems in the cbe. Closes PR2065 llvm-svn: 47337	2008-02-19 19:47:54 +00:00
Chris Lattner	3a4ac3a69e	Don't fold and's into test instructions if they have multiple uses. This compiles test-nofold.ll into: _test: movl $15, %ecx andl 4(%esp), %ecx testl %ecx, %ecx movl $42, %eax cmove %ecx, %eax ret instead of: _test: movl 4(%esp), %eax movl %eax, %ecx andl $15, %ecx testl $15, %eax movl $42, %eax cmove %ecx, %eax ret llvm-svn: 47330	2008-02-19 17:37:35 +00:00
Evan Cheng	ece0db124f	Me not like duplicated comments. llvm-svn: 47300	2008-02-19 02:05:16 +00:00
Evan Cheng	bb577266bf	- When DAG combiner is folding a bit convert into a BUILD_VECTOR, it should check if it's essentially a SCALAR_TO_VECTOR. Avoid turning (v8i16) <10, u, u, u> to <10, 0, u, u, u, u, u, u>. Instead, simply convert it to a SCALAR_TO_VECTOR of the proper type. - X86 now normalize SCALAR_TO_VECTOR to (BIT_CONVERT (v4i32 SCALAR_TO_VECTOR)). Get rid of X86ISD::S2VEC. llvm-svn: 47290	2008-02-18 23:04:32 +00:00
Dan Gohman	0d0f16ca85	Chris pointed out that it's not necessary to set i64 MUL to Expand on x86-32 since i64 itself is not a Legal type. And, update some comments. llvm-svn: 47282	2008-02-18 19:34:53 +00:00
Chris Lattner	79ecc053ca	upgrade some tests. llvm-svn: 47280	2008-02-18 18:46:39 +00:00
Nate Begeman	66df9740df	Add a note llvm-svn: 47279	2008-02-18 18:39:23 +00:00
Chris Lattner	6ab061dd2d	Add a note about sext from i1 plus flags use. llvm-svn: 47278	2008-02-18 18:30:13 +00:00
Dan Gohman	70b9b2f77f	Don't mark scalar integer multiplication as Expand on x86, since x86 has plain one-result scalar integer multiplication instructions. This avoids expanding such instructions into MUL_LOHI sequences that must be special-cased at isel time, and avoids the problem with that code that provented memory operands from being folded. This fixes PR1874, addressesing the most common case. The uncommon cases of optimizing multiply-high operations will require work in DAGCombiner. llvm-svn: 47277	2008-02-18 17:55:26 +00:00
Chris Lattner	3e886fa85a	move PR2053 to here. llvm-svn: 47237	2008-02-17 19:43:57 +00:00
Andrew Lenharth	da54523742	I cannot find a libgcc function for this builtin. Therefor expanding it to a noop (which is how it use to be treated). If someone who knows the x86 backend better than me could tell me how to get a lock prefix on an instruction, that would be nice to complete x86 support. llvm-svn: 47213	2008-02-16 14:46:26 +00:00
Andrew Lenharth	c178981b85	llvm.memory.barrier, and impl for x86 and alpha llvm-svn: 47204	2008-02-16 01:24:58 +00:00
Chris Lattner	d55c26a77d	Handle \n's in value names for more targets. The asm printers really really really need refactoring :( llvm-svn: 47171	2008-02-15 19:04:54 +00:00
Chris Lattner	cf98f7291b	If the llvm name contains an unprintable character, don't print it in the global comment. This prevents printing things like: ... # foo bar when the name is "foo\nbar". llvm-svn: 47170	2008-02-15 18:56:05 +00:00
Dale Johannesen	b9e1a37497	Cosmetics. llvm-svn: 47168	2008-02-15 18:40:53 +00:00
Dale Johannesen	de53aaec39	Remove warning about 64-bit code on processor that doesn't support it. Per Chris. llvm-svn: 47162	2008-02-15 18:09:51 +00:00
Dale Johannesen	da9de4b3f0	nocona, core2 and penryn support 64 bit. llvm-svn: 47149	2008-02-15 01:22:41 +00:00
Dale Johannesen	6cb8a628a2	Rewrite tblgen handling of subtarget features so it follows the order of the enum, not alphabetical. The motivation is to make -mattr=+ssse3,+sse41 select SSE41 as it ought to. Added "ignored" enum values of 0 to PPC and SPU to avoid compiler warnings. llvm-svn: 47143	2008-02-14 23:35:16 +00:00
Nate Begeman	9deedb0114	Fix single precision FP constants on SPU. They are actually legal, which allows us to kill a target-specific node. llvm-svn: 47127	2008-02-14 18:43:04 +00:00
Duncan Sands	0056f1e823	In TargetLowering::LowerCallTo, don't assert that the return value is zero-extended if it isn't sign-extended. It may also be any-extended. Also, if a floating point value was returned in a larger floating point type, pass 1 as the second operand to FP_ROUND, which tells it that all the precision is in the original type. I think this is right but I could be wrong. Finally, when doing libcalls, set isZExt on a parameter if it is "unsigned". Currently isSExt is set when signed, and nothing is set otherwise. This should be right for all calls to standard library routines. llvm-svn: 47122	2008-02-14 17:28:50 +00:00
Nate Begeman	1ef1013b6c	Change how FP immediates are handled. 1) ConstantFP is now expand by default 2) ConstantFP is not turned into TargetConstantFP during Legalize if it is legal. This allows ConstantFP to be handled like Constant, allowing for targets that can encode FP immediates as MachineOperands. As a bonus, fix up Itanium FP constants, which now correctly match, and match more constants! Hooray. llvm-svn: 47121	2008-02-14 08:57:00 +00:00
Nate Begeman	5d61361bb9	Move some useful operands up into the all-targets .td llvm-svn: 47115	2008-02-14 07:25:46 +00:00
Chris Lattner	b714906acf	upgrade some entries, remove stuff that is done. llvm-svn: 47109	2008-02-14 06:19:02 +00:00
Chris Lattner	037aa64987	the mid-level optimizer removes this stuff. llvm-svn: 47108	2008-02-14 05:43:18 +00:00
Chris Lattner	10dc770a36	this one is easy. llvm-svn: 47107	2008-02-14 05:41:38 +00:00
Chris Lattner	d696c25db5	This readme entry is done, testcase here: CodeGen/X86/zero-remat.ll llvm-svn: 47106	2008-02-14 05:39:46 +00:00
Dan Gohman	737856bd0d	Assigning an APInt to 0 with plain assignment gives it a one-bit size. Initialize these APInts to properly-sized zero values. llvm-svn: 47099	2008-02-13 23:07:24 +00:00
Dan Gohman	99b38405e3	Simplify some logic in ComputeMaskedBits. And change ComputeMaskedBits to pass the mask APInt by value, not by reference. llvm-svn: 47096	2008-02-13 22:28:48 +00:00
Nicolas Geoffray	72fa78e195	Enable exception handling int JIT llvm-svn: 47079	2008-02-13 18:39:37 +00:00
Chris Lattner	80b3a56774	Fix the PPC JIT regressions by encoding zeroreg as 0 for BLR. llvm-svn: 47067	2008-02-13 17:24:14 +00:00
Chris Lattner	57f2088225	don't try to avoid inserting loads when lowering FORMAL_ARGUMENTS. DAGCombine is now quite good at zapifying them. llvm-svn: 47053	2008-02-13 07:35:30 +00:00
Nate Begeman	5f18794295	readme updates llvm-svn: 47051	2008-02-13 07:06:12 +00:00
Nate Begeman	1867c6c264	Make register scavenging happy by not using a reg (CR0) that isn't defined llvm-svn: 47045	2008-02-13 02:58:33 +00:00
Evan Cheng	4b37f5ff05	commuteInstr() can now commute non-ssa machine instrs. llvm-svn: 47043	2008-02-13 02:46:49 +00:00
Dan Gohman	09023887f8	Convert SelectionDAG::ComputeMaskedBits to use APInt instead of uint64_t. Add an overload that supports the uint64_t interface for use by clients that haven't been updated yet. llvm-svn: 47039	2008-02-13 00:35:47 +00:00
Dale Johannesen	4621e574c9	__DATA not __DATA__ is the right segment name on darwin. Spotted by Nick Kledzik. llvm-svn: 47037	2008-02-12 23:35:09 +00:00
Nate Begeman	589ecad41d	Remove some dead code llvm-svn: 47036	2008-02-12 22:54:40 +00:00
Nate Begeman	810b85bde8	SSE4.1 64b integer insert/extract pattern support Move formats into the formats file llvm-svn: 47035	2008-02-12 22:51:28 +00:00
Evan Cheng	b05b05aba5	Revert r46916 PPCTargetAsmInfo.cpp. llvm-svn: 47020	2008-02-12 19:25:12 +00:00
Evan Cheng	e3ddcfa588	Only using x86-64 rip relative addressing in non-staic mode? llvm-svn: 47019	2008-02-12 19:20:46 +00:00
Evan Cheng	c3875c88a7	Update comment. llvm-svn: 47002	2008-02-12 07:59:55 +00:00
Evan Cheng	075ce702eb	Unbreak various insert_vector_elt and extract_vector_elt tests in presence of SSE4. llvm-svn: 47001	2008-02-12 07:59:45 +00:00
Nate Begeman	5c59b16468	Stuff noticed while grepping code llvm-svn: 46979	2008-02-11 23:47:56 +00:00
Nate Begeman	5a4e290b70	Enable SSE4 codegen and pattern matching. Add some notes to the README. llvm-svn: 46949	2008-02-11 04:19:36 +00:00
Nate Begeman	9e8b2ffd52	additional missing feature llvm-svn: 46948	2008-02-11 04:16:09 +00:00
Nate Begeman	297d683980	xmm0 variable blends llvm-svn: 46931	2008-02-10 18:47:57 +00:00
Dan Gohman	cabaec582f	Rename MRegisterInfo to TargetRegisterInfo. llvm-svn: 46930	2008-02-10 18:45:23 +00:00
Nick Lewycky	b072c0b3ed	Match GCC's behaviour for these sections. llvm-svn: 46916	2008-02-10 00:03:54 +00:00
Nate Begeman	2627ffd14b	memopv16i8 had wrong alignment requirement, would have broken pabsb pabs{b,w,d} are not two address fix extract-to-mem sse4 ops add sse4 vector sign extend nodes llvm-svn: 46915	2008-02-09 23:46:37 +00:00
Nate Begeman	a78c35a368	Skeleton of insert and extract matching, more to come llvm-svn: 46902	2008-02-09 01:38:08 +00:00
Nate Begeman	336fba2146	Tablegen support for insert & extract element matching llvm-svn: 46901	2008-02-09 01:37:05 +00:00
Evan Cheng	90f03a0b88	It's not always safe to fold movsd into xorpd, etc. Check the alignment of the load address first to make sure it's 16 byte aligned. llvm-svn: 46893	2008-02-08 21:20:40 +00:00
Dale Johannesen	9bbfeaea4d	64-bit (MMX) vectors do not need restrictive alignment. 128-bit vectors need it only when SSE is on. llvm-svn: 46890	2008-02-08 19:48:20 +00:00
Dan Gohman	d1cc100aef	Avoid needlessly casting away const qualifiers. llvm-svn: 46877	2008-02-08 03:29:40 +00:00
Evan Cheng	b2bc19ee5b	Added missing entries in X86 load / store folding tables. llvm-svn: 46866	2008-02-08 00:12:56 +00:00
Dan Gohman	eb7c8e4f6b	Follow Chris' suggestion; change the PseudoSourceValue accessors to return pointers instead of references, since this is always what is needed. llvm-svn: 46857	2008-02-07 18:41:25 +00:00
Dan Gohman	3af6eba3dd	Add SourceValue information for outgoing argument stores on x86. llvm-svn: 46854	2008-02-07 16:28:05 +00:00
Evan Cheng	a377b2bbd1	Fix a x86-64 codegen deficiency. Allow gv + offset when using rip addressing mode. Before: _main: subq $8, %rsp leaq _X(%rip), %rax movsd 8(%rax), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Now: _main: subq $8, %rsp movsd _X+8(%rip), %xmm1 movss _X(%rip), %xmm0 call _t xorl %ecx, %ecx movl %ecx, %eax addq $8, %rsp ret Notice there is another idiotic codegen issue that needs to be fixed asap: xorl %ecx, %ecx movl %ecx, %eax llvm-svn: 46850	2008-02-07 08:53:49 +00:00
Evan Cheng	92635b3d94	In some cases, e.g. ADD32ri, no transformation is made. Guide against it. llvm-svn: 46849	2008-02-07 08:29:53 +00:00
Dan Gohman	f00842e086	Re-apply the memory operand changes, with a fix for the static initializer problem, a minor tweak to the way the DAGISelEmitter finds load/store nodes, and a renaming of the new PseudoSourceValue objects. llvm-svn: 46827	2008-02-06 22:27:42 +00:00
Evan Cheng	e509232483	Move to getCALLSEQ_END to ensure CALLSEQ_END node produces a flag. This is consistent with the definition in td file. llvm-svn: 46775	2008-02-05 22:44:06 +00:00
Dale Johannesen	e8fa3130f8	Implement sseregparm. llvm-svn: 46764	2008-02-05 20:46:33 +00:00
Nate Begeman	9f4245c16a	Ident mnemonics appropriately llvm-svn: 46746	2008-02-05 08:49:09 +00:00
Evan Cheng	1c67dcaae7	Dwarf requires variable entries to be in the source order. Right now, since we are recording variable information at isel time this means parameters would appear in the reverse order. The short term fix is to issue recordVariable() at asm printing time instead. llvm-svn: 46724	2008-02-04 23:06:48 +00:00
Nate Begeman	2b00217d58	This method should be virtual llvm-svn: 46723	2008-02-04 23:04:24 +00:00
Nate Begeman	8285430ed7	Eliminate some redundant code. llvm-svn: 46720	2008-02-04 21:44:06 +00:00
Nate Begeman	16830a34c4	The rest of the SSE4.1 intrinsic patterns that are obvious to me. Getting Evan's help with the rest. llvm-svn: 46697	2008-02-04 06:00:24 +00:00
Nate Begeman	ee7503810f	Some more SSE 4.1 intrinsic patterns. llvm-svn: 46696	2008-02-04 05:34:34 +00:00
Nate Begeman	ead8dfeef2	SSE 4.1 Intrinsics and detection llvm-svn: 46681	2008-02-03 07:18:54 +00:00
Chris Lattner	e926ece00b	explicitly include Compiler.h instead of getting it from tblgen in the middle of a class. llvm-svn: 46676	2008-02-03 05:43:57 +00:00
Chris Lattner	df36b95e00	don't do ReplaceUses on a result that doesn't exist. llvm-svn: 46673	2008-02-03 03:20:59 +00:00
Evan Cheng	f61d1115af	Get rid of the annoying blank lines before labels. llvm-svn: 46667	2008-02-02 08:39:46 +00:00
Nick Lewycky	78e2e2cd07	Don't use uninitialized values. Fixes vec_align.ll on X86 Linux. llvm-svn: 46666	2008-02-02 08:29:58 +00:00
Evan Cheng	dd8d07749a	Unbreak ppc debug support. llvm-svn: 46665	2008-02-02 05:06:29 +00:00
Evan Cheng	c57ec111f2	SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameter, etc. Added ISD::DECLARE node type to represent llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into a SDNode and lives on through out the codegen passes. For now, since all the debugging information recording is done at isel time, when a ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short term solution that should be fixed in time. llvm-svn: 46659	2008-02-02 04:07:54 +00:00
Evan Cheng	9ff6b89bd9	Frame index can be negative. llvm-svn: 46655	2008-02-02 00:17:00 +00:00
Lauro Ramos Venancio	563e0a3ea3	CBackend: Implement unaligned load/store. llvm-svn: 46646	2008-02-01 21:25:59 +00:00
Evan Cheng	d6222fc11d	Remove the nasty LABEL hack with a much less evil one. Now llvm.dbg.func.start implies a stoppoint is set. SelectionDAGISel records a new source line but does not create a ISD::LABEL node for this special stoppoint. Asm printer will magically print this label. This ensures nothing is emitted before. llvm-svn: 46635	2008-02-01 09:10:45 +00:00
Evan Cheng	2a533e6894	Revert 46556 and 46585. Dan please fix the PseudoSourceValue problem and re-commit. llvm-svn: 46623	2008-01-31 21:00:00 +00:00
Evan Cheng	705212577d	Add an extra operand to LABEL nodes which distinguishes between debug, EH, or misc labels. This fixes the EH breakage. However I am not convinced this is the solution. llvm-svn: 46609	2008-01-31 09:59:15 +00:00
Christopher Lamb	1a102eecb0	Allow ComplexExpressions in InstrInfo.td files to be slightly more... complex! ComplexExpressions can now have attributes which affect how TableGen interprets the pattern when generating matchin code. The first (and currently, only) attribute causes the immediate parent node of the ComplexPattern operand to be passed into the matching code rather than the node at the root of the entire DAG containing the pattern. llvm-svn: 46606	2008-01-31 07:27:46 +00:00
Evan Cheng	d1bed85965	Add x86 specific getFrameIndexOffset(). This fixes local variable debugging info. llvm-svn: 46598	2008-01-31 04:06:00 +00:00
Evan Cheng	a63f6736f3	MRegisterInfo::getLocation() is a really bad idea. Its function is to calculate the offset from frame pointer to a stack slot and then storing the delta in a MachineLocation object. The name is bad (it implies a getter), and MRegisterInfo doesn't need to know about MachineLocation. Replace getLocation() with getFrameIndexOffset() which returns the delta from frame pointer to stack slot. Dwarf writer can then use the information for whatever it wants. llvm-svn: 46597	2008-01-31 03:37:28 +00:00
Evan Cheng	f9d2cf4cae	Makes the same change in ppc backend: avoid inserting prologue before debug labels. llvm-svn: 46596	2008-01-31 03:33:38 +00:00
Dan Gohman	018ea14d87	Avoid unnecessarily casting away const. llvm-svn: 46590	2008-01-31 01:01:48 +00:00
Dan Gohman	3993809a0c	Rename ISD::FLT_ROUNDS to ISD::FLT_ROUNDS_ to avoid conflicting with the real FLT_ROUNDS (defined in <float.h>). llvm-svn: 46587	2008-01-31 00:41:03 +00:00
Dan Gohman	4326d513ab	Create a new class, MemOperand, for describing memory references in the backend. Introduce a new SDNode type, MemOperandSDNode, for holding a MemOperand in the SelectionDAG IR, and add a MemOperand list to MachineInstr, and code to manage them. Remove the offset field from SrcValueSDNode; uses of SrcValueSDNode that were using it are all all using MemOperandSDNode now. Also, begin updating some getLoad and getStore calls to use the PseudoSourceValue objects. Most of this was written by Florian Brander, some reorganization and updating to TOT by me. llvm-svn: 46585	2008-01-31 00:25:39 +00:00
Evan Cheng	b2b94f7a81	Treat the label for the first @llvm.dbg.stoppoint the same way as the dbg_func_start label. Make sure nothing else is inserted before them. Note this solution might be somewhat fragile since ISD::LABEL may be used for other purposes. If that ends up to be an issue, we may need to introduce a different node for debug labels. llvm-svn: 46571	2008-01-30 20:08:35 +00:00
Evan Cheng	918b9c9335	Even though InsertAtEndOfBasicBlock is an ugly hack it still deserves a proper name. Rename it to EmitInstrWithCustomInserter since it does not necessarily insert instruction at the end. llvm-svn: 46562	2008-01-30 18:18:23 +00:00
Evan Cheng	84adda6b67	Skip over the label which marks the beginning of the function before inserting prologue code. llvm-svn: 46546	2008-01-30 03:57:33 +00:00
Scott Michel	ab3702fba9	More cleanups for CellSPU: - Expand tabs... (poss 80-col violations, will get them later...) - Consolidate logic for SelectDFormAddr and SelectDForm2Addr into a single function, simplifying maintenance. Also reduced custom instruction generation for SPUvecinsert/INSERT_MASK. llvm-svn: 46544	2008-01-30 02:55:46 +00:00
Dan Gohman	13d1327796	Factor the addressing mode and the load/store VT out of LoadSDNode and StoreSDNode into their common base class LSBaseSDNode. Member functions getLoadedVT and getStoredVT are replaced with the common getMemoryVT to simplify code that will handle both loads and stores. llvm-svn: 46538	2008-01-30 00:15:11 +00:00
Evan Cheng	618761903d	Work in progress. This patch fixes x86-64 calls which are modelled as StructRet but really should be return in registers, e.g. _Complex long double, some 128-bit aggregates. This is a short term solution that is necessary only because llvm, for now, cannot model i128 nor call's with multiple results. Status: This only works for direct calls, and only the caller side is done. Disabled for now. llvm-svn: 46527	2008-01-29 19:34:22 +00:00
Duncan Sands	390baa691d	Use getPreferredAlignmentLog or getPreferredAlignment to get the alignment of global variables, rather than using hand-made versions. llvm-svn: 46495	2008-01-29 06:23:44 +00:00
Dale Johannesen	f12104ce4b	Handle 'X' constraint in asm's better. llvm-svn: 46485	2008-01-29 02:21:21 +00:00
Scott Michel	dc780aeb57	Overhaul Cell SPU's addressing mode internals so that there are now only two addressing mode nodes, SPUaform and SPUindirect (vice the three previous ones, SPUaform, SPUdform and SPUxform). This improves code somewhat because we now avoid using reg+reg addressing when it can be avoided. It also simplifies the address selection logic, which was the main point for doing this. Also, for various global variables that would be loaded using SPU's A-form addressing, prefer D-form offs[reg] addressing, keeping the base in a register if the variable is used more than once. llvm-svn: 46483	2008-01-29 02:16:57 +00:00
Bill Wendling	5b6f587a80	If the function has no machine instructions, then emit a "nop" so that the function label isn't associated with something it shouldn't be. llvm-svn: 46449	2008-01-28 09:15:03 +00:00
Chris Lattner	39c52e030b	add a note llvm-svn: 46413	2008-01-27 07:31:41 +00:00
Chris Lattner	f4bc2c5718	Use fldz and fld1 for long double constants instead of a constant pool load. llvm-svn: 46411	2008-01-27 06:19:31 +00:00
Chris Lattner	00183edf55	Add some notes. llvm-svn: 46405	2008-01-26 20:12:07 +00:00
Chris Lattner	6124c0eb2d	Remove some code for inferring alignment info from the x86 backend now that the dag combiner does it. llvm-svn: 46404	2008-01-26 20:07:42 +00:00
Bill Wendling	7b83688c73	If there's no instructions being emitted on X86 for a function, emit a nop. Emit the nop directly for PPC. llvm-svn: 46398	2008-01-26 09:03:52 +00:00
Bill Wendling	1c92468074	If there are no machine instructions emitted for a function, then insert a "nop" instruction so that we don't have the function's label associated with something that it's not supposed to be associated with. llvm-svn: 46394	2008-01-26 06:51:24 +00:00
Chris Lattner	00ead854ef	JITEmitter.cpp was trying to sync the icache for function stubs, but was actually passing a completely incorrect size to sys_icache_invalidate. Instead of having the JITEmitter do this (which doesn't have the correct size), just make the target sync its own stubs. llvm-svn: 46354	2008-01-25 16:41:09 +00:00
Chris Lattner	726a4e45e5	optimize fxor like for llvm-svn: 46345	2008-01-25 06:14:17 +00:00
Chris Lattner	79076fdf2a	Add target-specific dag combines for FAND(x,0) and FOR(x,0). This allows us to compile: double test(double X) { return copysign(0.0, X); } into: _test: andpd LCPI1_0(%rip), %xmm0 ret instead of: _test: pxor %xmm1, %xmm1 andpd LCPI1_0(%rip), %xmm1 movapd %xmm0, %xmm2 andpd LCPI1_1(%rip), %xmm2 movapd %xmm1, %xmm0 orpd %xmm2, %xmm0 ret llvm-svn: 46344	2008-01-25 05:46:26 +00:00
Anton Korobeynikov	37309ed741	Provide correct DWARF register numbering for debug information emission on x86-32/Darwin. This should fix bunch of issues. llvm-svn: 46337	2008-01-25 00:34:13 +00:00
Chris Lattner	16a8f126d3	Significantly simplify and improve handling of FP function results on x86-32. This case returns the value in ST(0) and then has to convert it to an SSE register. This causes significant codegen ugliness in some cases. For example in the trivial fp-stack-direct-ret.ll testcase we used to generate: _bar: subl $28, %esp call L_foo$stub fstpl 16(%esp) movsd 16(%esp), %xmm0 movsd %xmm0, 8(%esp) fldl 8(%esp) addl $28, %esp ret because we move the result of foo() into an XMM register, then have to move it back for the return of bar. Instead of hacking ever-more special cases into the call result lowering code we take a much simpler approach: on x86-32, fp return is modeled as always returning into an f80 register which is then truncated to f32 or f64 as needed. Similarly for a result, we model it as an extension to f80 + return. This exposes the truncate and extensions to the dag combiner, allowing target independent code to hack on them, eliminating them in this case. This gives us this code for the example above: _bar: subl $12, %esp call L_foo$stub addl $12, %esp ret The nasty aspect of this is that these conversions are not legal, but we want the second pass of dag combiner (post-legalize) to be able to hack on them. To handle this, we lie to legalize and say they are legal, then custom expand them on entry to the isel pass (PreprocessForFPConvert). This is gross, but less gross than the code it is replacing :) This also allows us to generate better code in several other cases. For example on fp-stack-ret-conv.ll, we now generate: _test: subl $12, %esp call L_foo$stub fstps 8(%esp) movl 16(%esp), %eax cvtss2sd 8(%esp), %xmm0 movsd %xmm0, (%eax) addl $12, %esp ret where before we produced (incidentally, the old bad code is identical to what gcc produces): _test: subl $12, %esp call L_foo$stub fstpl (%esp) cvtsd2ss (%esp), %xmm0 cvtss2sd %xmm0, %xmm0 movl 16(%esp), %eax movsd %xmm0, (%eax) addl $12, %esp ret Note that we generate slightly worse code on pr1505b.ll due to a scheduling deficiency that is unrelated to this patch. llvm-svn: 46307	2008-01-24 08:07:48 +00:00
Evan Cheng	91089e6d66	Let each target decide byval alignment. For X86, it's 4-byte unless the aggregare contains SSE vector(s). For x86-64, it's max of 8 or alignment of the type. llvm-svn: 46286	2008-01-23 23:17:41 +00:00
Duncan Sands	aff4eef6df	The last pieces needed for loading arbitrary precision integers. This won't actually work (and most of the code is dead) unless the new legalization machinery is turned on. While there, I rationalized the handling of i1, and removed some bogus (and unused) sextload patterns. For i1, this could result in microscopically better code for some architectures (not X86). It might also result in worse code if annotating with AssertZExt nodes turns out to be more harmful than helpful. llvm-svn: 46280	2008-01-23 20:39:46 +00:00
Dale Johannesen	a54401ee30	Honor explicit section information on Darwin. llvm-svn: 46267	2008-01-23 00:58:14 +00:00
Evan Cheng	d436c2e724	SSE varargs arguments are passed in memory. llvm-svn: 46262	2008-01-22 23:26:53 +00:00
Chris Lattner	c964ccb2c4	Trivial patch to fix two warnings, please pull into llvm 2.2 llvm-svn: 46243	2008-01-22 04:47:47 +00:00
Anton Korobeynikov	ad17c6bbe6	Honour ByVal parameter attribute for name decoration llvm-svn: 46200	2008-01-20 14:00:07 +00:00
Anton Korobeynikov	539129f881	Remove Darwin'ism llvm-svn: 46199	2008-01-20 13:59:37 +00:00
Anton Korobeynikov	5ac8a5b72c	Enable PIC codegen on x86-64/linux llvm-svn: 46198	2008-01-20 13:58:16 +00:00
Duncan Sands	5e1cbc1ad7	Need to handle any 'nest' parameter before integer parameters, since otherwise it won't be passed in the right register. With this change trampolines work on x86-64 (thanks to Luke Guest for providing access to an x86-64 box). llvm-svn: 46192	2008-01-19 16:42:10 +00:00
Dale Johannesen	7807e86260	Implement flt_rounds for PowerPC. llvm-svn: 46174	2008-01-18 19:55:37 +00:00
Chris Lattner	b3be660985	get symbolic information for ppc ldbl nodes. llvm-svn: 46165	2008-01-18 18:51:16 +00:00
Chris Lattner	febc7ea9bf	Fix a latent bug exposed by my truncstore patch. We compiled stfiwx-2.ll to: _test: fctiwz f0, f1 stfiwx f0, 0, r4 blr instead of: _test: fctiwz f0, f1 stfd f0, -8(r1) nop nop lwz r2, -4(r1) stb r2, 0(r4) blr The former is not correct (stores 4 bytes, not 1). llvm-svn: 46161	2008-01-18 16:54:56 +00:00
Chris Lattner	eb7d65d073	make a method public llvm-svn: 46159	2008-01-18 06:52:41 +00:00
Dale Johannesen	0e1328e880	Revert the part of 45849 that treated weak globals as weak globals rather than commons. While not wrong, this change tickled a latent bug in Darwin's strip, so revert it for now as a workaround. llvm-svn: 46147	2008-01-17 23:36:04 +00:00
Dale Johannesen	08c757e707	Revert the part of 45848 that treated weak globals as weak globals rather than commons. While not wrong, this change tickled a latent bug in Darwin's strip, so revert it for now as a workaround. llvm-svn: 46144	2008-01-17 23:04:07 +00:00
Scott Michel	506e61bad1	Forward progress: crtbegin.c now compiles successfully! Fixed CellSPU's A-form (local store) address mode, so that all globals, externals, constant pool and jump table symbols are now wrapped within a SPUISD::AFormAddr pseudo-instruction. This now identifies all local store memory addresses, although it requires a bit of legerdemain during instruction selection to properly select loads to and stores from local store, properly generating "LQA" instructions. Also added mul_ops.ll test harness for exercising integer multiplication. llvm-svn: 46142	2008-01-17 20:38:41 +00:00
Chris Lattner	41717f6989	This commit changes: 1. Legalize now always promotes truncstore of i1 to i8. 2. Remove patterns and gunk related to truncstore i1 from targets. 3. Rename the StoreXAction stuff to TruncStoreAction in TLI. 4. Make the TLI TruncStoreAction table a 2d table to handle from/to conversions. 5. Mark a wide variety of invalid truncstores as such in various targets, e.g. X86 currently doesn't support truncstore of any of its integer types. 6. Add legalize support for truncstores with invalid value input types. 7. Add a dag combine transform to turn store(truncate) into truncstore when safe. The later allows us to compile CodeGen/X86/storetrunc-fp.ll to: _foo: fldt 20(%esp) fldt 4(%esp) faddp %st(1) movl 36(%esp), %eax fstps (%eax) ret instead of: _foo: subl $4, %esp fldt 24(%esp) fldt 8(%esp) faddp %st(1) fstps (%esp) movl 40(%esp), %eax movss (%esp), %xmm0 movss %xmm0, (%eax) addl $4, %esp ret llvm-svn: 46140	2008-01-17 19:59:44 +00:00
Chris Lattner	d033200a8f	* Introduce a new SelectionDAG::getIntPtrConstant method and switch various codegen pieces and the X86 backend over to using it. * Add some comments to SelectionDAGNodes.h * Introduce a second argument to FP_ROUND, which indicates whether the FP_ROUND changes the value of its input. If not it is safe to xform things like fp_extend(fp_round(x)) -> x. llvm-svn: 46125	2008-01-17 07:00:52 +00:00
Duncan Sands	78e448d8b4	Trampoline support for x86-64. This looks like it should work, but I have no machine to test it on. Committed because it will at least cause no harm, and maybe someone can test it for me! llvm-svn: 46098	2008-01-16 22:55:25 +00:00
Chris Lattner	2825f8fff0	make it more clear that this predicate only applies to scalar FP types. llvm-svn: 46058	2008-01-16 06:24:21 +00:00

1 2 3 4 5 ...

8091 Commits