llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 14:33:02 +02:00

Author	SHA1	Message	Date
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Chris Lattner	f8e408b7b1	Codegen: as: _bar: pushl %esi subl $8, %esp movl 16(%esp), %esi call L_foo$stub fstps (%esi) addl $8, %esp popl %esi #FP_REG_KILL ret instead of: _bar: pushl %esi subl $8, %esp movl 16(%esp), %esi call L_foo$stub fstpl (%esi) cvtsd2ss (%esi), %xmm0 movss %xmm0, (%esi) addl $8, %esp popl %esi #FP_REG_KILL ret llvm-svn: 45401	2007-12-29 06:57:38 +00:00
Chris Lattner	e3515220d2	avoid going through a stack slot to convert from fpstack to xmm reg if we are just going to store it back anyway. This improves things like: double foo(); void bar(double P) { P = foo(); } llvm-svn: 45399	2007-12-29 06:41:28 +00:00
Chris Lattner	f04ce286e2	fix a questionable cast, thanks to Mike Stump for pointing this out. llvm-svn: 45075	2007-12-16 20:26:54 +00:00
Evan Cheng	1d95b669b6	Make better use of instructions that clear high bits; fix various 2-wide shuffle bugs. llvm-svn: 45058	2007-12-15 03:00:47 +00:00
Evan Cheng	6909ff8c4b	Fix ctlz and cttz. llvm definition requires them to return number of bits in of the src type when value is zero. llvm-svn: 45029	2007-12-14 08:30:15 +00:00
Evan Cheng	51cf86ded0	Implement ctlz and cttz with bsr and bsf. llvm-svn: 45024	2007-12-14 02:13:44 +00:00
Dan Gohman	0075ea1f5f	Allow vector integer constants to be created with SelectionDAG::getConstant, in the same way as vector floating-point constants. This allows the legalize expansion code for @llvm.ctpop and friends to be usable with vector types. llvm-svn: 44954	2007-12-12 22:21:26 +00:00
Evan Cheng	ad3e7f3286	Use shuffles to implement insert_vector_elt for i32, i64, f32, and f64. llvm-svn: 44929	2007-12-12 07:55:34 +00:00
Evan Cheng	d36d69fe92	Lower a build_vector with all constants into a constpool load unless it can be done with a move to low part. llvm-svn: 44921	2007-12-12 06:45:40 +00:00
Evan Cheng	f6c2838f36	- Improved v8i16 shuffle lowering. It now uses pshuflw and pshufhw as much as possible before resorting to pextrw and pinsrw. - Better codegen for v4i32 shuffles masquerading as v8i16 or v16i8 shuffles. - Improves (i16 extract_vector_element 0) codegen by recognizing (i32 extract_vector_element 0) does not require a pextrw. llvm-svn: 44836	2007-12-11 01:46:18 +00:00
Nate Begeman	8b194d1718	x86 doesn't actually want to custom lower v3i32 llvm-svn: 44835	2007-12-11 01:41:33 +00:00
Evan Cheng	c4db072c74	Add comment. llvm-svn: 44686	2007-12-07 21:30:01 +00:00
Evan Cheng	34c7b35135	Much improved v8i16 shuffles. (Step 1). llvm-svn: 44676	2007-12-07 08:07:39 +00:00
Evan Cheng	4dc538449d	Remove a bogus optimization. It's not possible to do a move to low element to a <8 x i16> or <16 x i8> vector. llvm-svn: 44669	2007-12-06 22:14:22 +00:00
Duncan Sands	3602011bec	Fix PR1146: parameter attributes are longer part of the function type, instead they belong to functions and function calls. This is an updated and slightly corrected version of Reid Spencer's original patch. The only known problem is that auto-upgrading of bitcode files doesn't seem to work properly (see test/Bitcode/AutoUpgradeIntrinsics.ll). Hopefully a bitcode guru (who might that be? :) ) will fix it. llvm-svn: 44359	2007-11-27 13:23:08 +00:00
Chris Lattner	be0c5a0500	Fix a long standing deficiency in the X86 backend: we would sometimes emit "zero" and "all one" vectors multiple times, for example: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 pcmpeqd %mm0, %mm0 movq %mm0, _M2 ret instead of: _test2: pcmpeqd %mm0, %mm0 movq %mm0, _M1 movq %mm0, _M2 ret This patch fixes this by always arranging for zero/one vectors to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be any random type. This ensures they get trivially CSE'd on the dag. This fix is also important for LegalizeDAGTypes, as it gets unhappy when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when 'i64' isn't legal. This patch makes the following changes: 1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into their canonical types. 2) The now-dead patterns are removed from the SSE/MMX .td files. 3) All the patterns in the .td file that referred to immAllOnesV or immAllZerosV in the wrong form now use *_bc to match them with a bitcast wrapped around them. 4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle bitcast'd zero vectors, which simplifies the code actually. 5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that is legal, instead of generating one that is illegal and expecting a later legalize pass to clean it up. 6) isZeroShuffle is generalized to handle bitcast of zeros. 7) several other minor tweaks. This patch is definite goodness, but has the potential to cause random code quality regressions. Please be on the lookout for these and let me know if they happen. llvm-svn: 44310	2007-11-25 00:24:49 +00:00
Chris Lattner	3862759b53	remove bogus assertion that broke CodeGen/Generic/cast-fp.ll on x86 among others. llvm-svn: 44302	2007-11-24 18:37:20 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Anton Korobeynikov	cd9b16df61	Implement codegen for flt_rounds on x86 llvm-svn: 44183	2007-11-16 01:31:51 +00:00
Bill Wendling	cc75435ebf	Unify CALLSEQ_{START,END}. They take 4 parameters: the chain, two stack adjustment fields, and an optional flag. If there is a "dynamic_stackalloc" in the code, make sure that it's bracketed by CALLSEQ_START and CALLSEQ_END. If not, then there is the potential for the stack to be changed while the stack's being used by another instruction (like a call). This can only result in tears... llvm-svn: 44037	2007-11-13 00:44:25 +00:00
Arnold Schwaighofer	64ad6fa1fa	Update tailcall code to include inline attribute operand for memcpy. llvm-svn: 43978	2007-11-10 10:48:01 +00:00
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Chris Lattner	67cd357fb8	Fix PR1763 by allowing the 'q' constraint to work with 64-bit regs on x86-64. llvm-svn: 43669	2007-11-04 06:51:12 +00:00
Evan Cheng	bf8e7c6644	Unbreak tailcall opt. llvm-svn: 43646	2007-11-02 17:45:40 +00:00
Evan Cheng	b50cc64eb0	Missing a getNumOperands check. llvm-svn: 43630	2007-11-02 01:26:22 +00:00
Rafael Espindola	27a8907a7c	Make ARM and X86 LowerMEMCPY identical by moving the isThumb check into getMaxInlineSizeThreshold and by restructuring the X86 version. New I just have to move this to a common place :-) llvm-svn: 43554	2007-10-31 14:39:58 +00:00
Rafael Espindola	fae98471a9	Make ARM an X86 memcpy expansion more similar to each other. Now both subtarget define getMaxInlineSizeThreshold and the expansion uses it. This should not change generated code. llvm-svn: 43552	2007-10-31 11:52:06 +00:00
Dale Johannesen	9bc04ae496	Make i64=expand_vector_elt(v2i64) work in 32-bit mode. llvm-svn: 43535	2007-10-31 00:32:36 +00:00
Dale Johannesen	461a0c47f8	Add missing MMX PSUBQ. llvm-svn: 43488	2007-10-30 01:18:38 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	1113931fd8	Avoid doing something dumb like rewriting using a 64-bit iv in 32-bit mode. llvm-svn: 43446	2007-10-29 07:57:50 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Dale Johannesen	2edd0fb69d	Allow for copysign having f80 second argument. Fixes 5550319. llvm-svn: 43205	2007-10-21 01:07:44 +00:00
Rafael Espindola	d8d4372845	Add support for byval function whose argument is not 32 bit aligned. To do this it is necessary to add a "always inline" argument to the memcpy node. For completeness I have also added this node to memmove and memset. I have also added getMem* functions, because the extra argument makes it cumbersome to use getNode and because I get confused by it :-) llvm-svn: 43172	2007-10-19 10:41:11 +00:00
Chris Lattner	3a19e981f5	Change fp to sint legalization on x86-32 to do 2 x i32 loads instead of 1 x i64 loads. This doesn't change any functionality yet. llvm-svn: 43068	2007-10-17 06:17:29 +00:00
Chris Lattner	ba2d55a564	fix some funny indentation, add comments. llvm-svn: 43066	2007-10-17 06:02:13 +00:00
Dale Johannesen	63411d36bf	Check for invalid cc's in f80 select. llvm-svn: 43033	2007-10-16 18:09:08 +00:00
Arnold Schwaighofer	f0d4d73bf6	Correction to tail call optimization code. The new return address was stored to the acutal stack slot before the parameters were lowered to their stack slot. This could cause arguments to be overwritten by the return address if the called function had less parameters than the caller function. The update should remove the last failing test case of llc-beta: SPASS. llvm-svn: 43027	2007-10-16 09:05:00 +00:00
Evan Cheng	f5bcd3d737	LowerFP_TO_SINT must not create a stack object if it's not needed. llvm-svn: 43004	2007-10-15 20:11:21 +00:00
Evan Cheng	90645f30db	Unbreak x86-64. llvm-svn: 42962	2007-10-14 10:09:39 +00:00
Arnold Schwaighofer	50d2c33530	Correcting the corrections. Bad bad baaad emacs! llvm-svn: 42935	2007-10-12 21:53:12 +00:00
Arnold Schwaighofer	6bcd9e7ec2	Corrected many typing errors. And removed 'nest' parameter handling for fastcc from X86CallingConv.td. This means that nested functions are not supported for calling convention 'fastcc'. llvm-svn: 42934	2007-10-12 21:30:57 +00:00
Duncan Sands	d781ed9d21	Due to the new tail call optimization, trampolines can no longer be created for fastcc functions. llvm-svn: 42925	2007-10-12 19:37:31 +00:00
Dan Gohman	ad3e823efa	Mark vector ctpop, cttz, and ctlz as Expand on x86. llvm-svn: 42905	2007-10-12 14:09:42 +00:00
Dan Gohman	edc841fb53	Set ISD::FPOW to Expand. llvm-svn: 42881	2007-10-11 23:21:31 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Evan Cheng	90aa032f98	Bug fix. X86 was emitting redundant setcc and test instructions before a conditional move. llvm-svn: 42774	2007-10-08 22:16:29 +00:00
Dan Gohman	6df332f0cb	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Evan Cheng	32766d3518	Not needed any more. llvm-svn: 42623	2007-10-05 01:34:14 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Rafael Espindola	01b306e575	Refactor the memcpy lowering for the x86 target. The only generated code difference is that now we call memcpy when the size of the array is unknown. This matches GCC behavior and is better since the run time value can be arbitrarily large. llvm-svn: 42433	2007-09-28 12:53:01 +00:00
Dale Johannesen	69595b587f	Enable codegen for long double abs, sin, cos llvm-svn: 42368	2007-09-26 21:10:55 +00:00
Evan Cheng	38c7d5082a	translateX86CC updates the last two operands. llvm-svn: 42333	2007-09-26 00:45:55 +00:00
Dan Gohman	1bb346f9f1	When both x/y and x%y are needed (x and y both scalar integer), compute both results with a single div or idiv instruction. This uses new X86ISD nodes for DIV and IDIV which are introduced during the legalize phase so that the SelectionDAG's CSE can automatically eliminate redundant computations. llvm-svn: 42308	2007-09-25 18:23:27 +00:00
Dan Gohman	8385890394	Move the setOperationAction(ISD::DEBUG_LOC, MVT::Other, Expand) and the check to see if the assembler supports .loc from X86TargetLowering into the superclass TargetLowering. llvm-svn: 42297	2007-09-25 15:10:49 +00:00
Evan Cheng	36b3babfde	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. llvm-svn: 42285	2007-09-25 01:57:46 +00:00
Dan Gohman	96d5f979bc	Add support on x86 for having Legalize lower ISD::LOCATION to ISD::DEBUG_LOC instead of ISD::LABEL with a manual .debug_line entry when the assembler supports .file and .loc directives. llvm-svn: 42278	2007-09-24 21:54:14 +00:00
Chris Lattner	594d3aa066	claim that "st" is from the 80-bit register file. This causes x87-using inline asm to die with: ScheduleDAG.cpp:269: failed assertion `false && "Couldn't find the register class"' instead of: failed assertion `RegMap->getRegClass(VReg) == RC && "Register class of operand and regclass of use don't agree!"' yay. llvm-svn: 42259	2007-09-24 05:27:37 +00:00
Dale Johannesen	ea6ffa0b36	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Rafael Espindola	11ee0898b9	Don't add a default STACK_ALIGN (use the generic ABI alignment) Implement calls to functions with byval arguments on X86 llvm-svn: 42192	2007-09-21 15:50:22 +00:00
Rafael Espindola	b0b536b597	small cleanup: use LowerMemArgument in LowerFastCCArguments also llvm-svn: 42189	2007-09-21 14:55:38 +00:00
Dale Johannesen	04682bdc81	More long double fixes. x86_64 should build now. llvm-svn: 42155	2007-09-19 23:55:34 +00:00
Dan Gohman	1aeaeec570	Emit integer x<1 as x<=0, as comparisons with zero (now includeing 64-bit) can use test instead of cmp with an immediate. llvm-svn: 42026	2007-09-17 14:49:27 +00:00
Dale Johannesen	575bd6070a	Remove the assumption that FP's are either float or double from some of the many places in the optimizers it appears, and do something reasonable with x86 long double. Make APInt::dump() public, remove newline, use it to dump ConstantSDNode's. Allow APFloats in FoldingSet. Expand X86 backend handling of long doubles (conversions to/from int, mostly). llvm-svn: 41967	2007-09-14 22:26:36 +00:00
Rafael Espindola	5d8b225881	Add support for functions with byval arguments on x86 llvm-svn: 41953	2007-09-14 15:48:13 +00:00
Dale Johannesen	7bc3969cea	Add APInt interfaces to APFloat (allows directly access to bits). Use them in place of float and double interfaces where appropriate. First bits of x86 long double constants handling (untested, probably does not work). llvm-svn: 41858	2007-09-11 18:32:33 +00:00
Duncan Sands	c358890f73	Fold the adjust_trampoline intrinsic into init_trampoline. There is now only one trampoline intrinsic. llvm-svn: 41841	2007-09-11 14:10:23 +00:00
Dale Johannesen	86f367a6b7	Next round of APFloat changes. Use APFloat in UpgradeParser and AsmParser. Change all references to ConstantFP to use the APFloat interface rather than double. Remove the ConstantFP double interfaces. Use APFloat functions for constant folding arithmetic and comparisons. (There are still way too many places APFloat is just a wrapper around host float/double, but we're getting there.) llvm-svn: 41747	2007-09-06 18:13:44 +00:00
Anton Korobeynikov	cf91be2c79	Reapply r41578 with proper fix llvm-svn: 41680	2007-09-03 00:36:06 +00:00
Rafael Espindola	4ddaad4de0	Initial support for calling functions with byval arguments on x86-64 llvm-svn: 41643	2007-08-31 15:06:30 +00:00
Dale Johannesen	81d6ecb886	Enhance APFloat to retain bits of NaNs (fixes oggenc). Use APFloat interfaces for more references, mostly of ConstantFPSDNode. llvm-svn: 41632	2007-08-31 04:03:46 +00:00
Dale Johannesen	e91a908971	Change LegalFPImmediates to use APFloat. Add APFloat interfaces to ConstantFP, SelectionDAG. Fix integer bit in double->APFloat conversion. Convert LegalizeDAG to use APFloat interface in ConstantFPSDNode uses. llvm-svn: 41587	2007-08-30 00:23:21 +00:00
Duncan Sands	26ef2a1767	Move getX86RegNum into X86RegisterInfo and use it in the trampoline lowering. Lookup the jump and mov opcodes for the trampoline rather than hard coding them. llvm-svn: 41577	2007-08-29 19:01:20 +00:00
Rafael Espindola	dc5450f7fb	Add a comment about using libc memset/memcpy or generating inline code. llvm-svn: 41502	2007-08-27 17:48:26 +00:00
Rafael Espindola	3d52fe3ef3	call libc memcpy/memset if array size is bigger then threshold. Coping 100MB array (after a warmup) shows that glibc 2.6.1 implementation on x86-64 (core 2) is 30% faster (from 0.270917s to 0.188079s) llvm-svn: 41479	2007-08-27 10:18:20 +00:00
Chris Lattner	1e089aac3a	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Rafael Espindola	68d95ff2b1	Partial implementation of calling functions with byval arguments: ) The needed information is propagated to the DAG ) The X86-64 backend detects it and aborts llvm-svn: 41179	2007-08-20 15:18:24 +00:00
Anton Korobeynikov	3094846993	Move ReturnAddrIndex variable to X86MachineFunctionInfo structure. This fixed hard to catch bugs with retaddr lowering llvm-svn: 41104	2007-08-15 17:12:32 +00:00
Evan Cheng	eef13203e7	Fix a typo pointd out by Maarten ter Huurne. llvm-svn: 41059	2007-08-13 23:27:11 +00:00
Christopher Lamb	450f6815b9	Increase efficiency of sign_extend_inreg by using subregisters for truncation. As the README suggests sign_extend_subreg is selected to (sext(trunc)). llvm-svn: 41010	2007-08-10 21:48:46 +00:00
Rafael Espindola	b20b9e985a	propagate struct size and alignment of byval arguments to the DAG llvm-svn: 40986	2007-08-10 14:44:42 +00:00
Dale Johannesen	79551baaad	long double 9 of N. This finishes up the X86-32 bits (constants are still not handled). Adds ConvertActions to control fp-to-fp conversions (these are currently defaulted for all other targets, so no changes there). llvm-svn: 40958	2007-08-09 01:04:01 +00:00
Dale Johannesen	2c35d56edd	Long double patch 7 of N, unless I lost count:). Last x87 bits for full functionality (not thoroughly tested, and long doubles do not work in SSE modes at all - use -mcpu=i486 for now) llvm-svn: 40886	2007-08-07 01:17:37 +00:00
Dale Johannesen	a85f11d870	Long double patch 4 of N: initial x87 implementation. Lots of problems yet but some simple things work. llvm-svn: 40847	2007-08-05 18:49:15 +00:00
Dan Gohman	1afde4166e	Fix the alignment requirements of several unpck and shuf instructions. Generalize isPSHUFDMask and add a unary SHUFPD pattern so that SHUFPD's memory operand alignment can be tested as well, with a fix to avoid breaking MMX's use of isPSHUFDMask. llvm-svn: 40756	2007-08-02 21:17:01 +00:00
Evan Cheng	019ecf3b91	Can't handle offset and scale if rip-relative addressing is to be used. llvm-svn: 40703	2007-08-01 23:46:47 +00:00
Evan Cheng	e90ad40aa1	This isn't safe when there are uses of load's chain result. llvm-svn: 40617	2007-07-31 06:21:44 +00:00
Duncan Sands	35a77d857b	Trampoline codegen support for X86-32. llvm-svn: 40566	2007-07-27 20:02:49 +00:00
Dan Gohman	0252aa07ee	Re-apply 40504, but with a fix for the segfault it caused in oggenc: Make the alignedload and alignedstore patterns always require 16-byte alignment. This way when they are used in the "Fs" instructions, in which a vector instruction is used for a scalar purpose, they can still require the full vector alignment. And add a regression test for this. llvm-svn: 40555	2007-07-27 17:16:43 +00:00
Evan Cheng	cb8f08ebca	Reverting 40504 for now. It's breaking oggenc. llvm-svn: 40547	2007-07-27 01:37:47 +00:00
Dan Gohman	513dcba4f8	Remove X86ISD::LOAD_PACK and X86ISD::LOAD_UA and associated code from the x86 target, replacing them with the new alignment attributes on memory references. llvm-svn: 40504	2007-07-26 00:31:09 +00:00
Dan Gohman	a2e07a38bc	Use movaps to load a v4f32 build_vector of all-constant values into a register instead of loading each element individually. llvm-svn: 40478	2007-07-24 22:55:08 +00:00
Dan Gohman	54b8032d64	Fix some uses of dyn_cast to be uses of cast. llvm-svn: 40443	2007-07-23 20:24:29 +00:00
Evan Cheng	ba990bbc3f	Fix custom lowering of SSE FXOR. llvm-svn: 40071	2007-07-19 23:36:01 +00:00
Anton Korobeynikov	5635277c36	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dan Gohman	928144b051	Define non-intrinsic instructions for vector min, max, sqrt, rsqrt, and rcp, in addition to the intrinsic forms. Add spill-folding entries for these new instructions, and for the scalar min and max instrinsic instructions which were missing. And add some preliminary ISelLowering code for using the new non-intrinsic vector sqrt instruction, and fneg and fabs. llvm-svn: 38478	2007-07-10 00:05:58 +00:00
Anton Korobeynikov	e8215d1780	Proper flag __alloca call llvm-svn: 37923	2007-07-05 20:36:08 +00:00
Dale Johannesen	9072b65b0b	Refactor X87 instructions. As a side effect, all their names are changed. llvm-svn: 37876	2007-07-04 21:07:47 +00:00
Dale Johannesen	7af19491d3	Fix for PR 1505 (and 1489). Rewrite X87 register model to include f32 variants. Some factoring improvments forthcoming. llvm-svn: 37847	2007-07-03 00:53:03 +00:00
Evan Cheng	992f296a71	No vector fneg. llvm-svn: 37786	2007-06-29 00:18:15 +00:00
Evan Cheng	e233ec5e46	Type of vector extract / insert index operand should be iPTR. llvm-svn: 37784	2007-06-29 00:01:20 +00:00
Dan Gohman	354f02e03d	Generalize MVT::ValueType and associated functions to be able to represent extended vector types. Remove the special SDNode opcodes used for pre-legalize vector operations, and the special MVT::Vector type used with them. Adjust lowering and legalize to work with the normal SDNode kinds instead, and to use the normal MVT functions to work with vector types instead of using the two special operands that the pre-legalize nodes held. This allows pre-legalize and post-legalize DAGs, and the code that operates on them, to be more consistent. Pre-legalize vector operators can be handled more consistently with scalar operators. And, -view-dag-combine1-dags and -view-legalize-dags now look prettier for vector code. llvm-svn: 37719	2007-06-25 16:23:39 +00:00
Dan Gohman	a62327ea40	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Chris Lattner	e13fac05d7	If a function is vararg, never pass inreg arguments in registers. Thanks to Anton for half of this patch. llvm-svn: 37641	2007-06-19 00:13:10 +00:00
Evan Cheng	80f0d5ae45	Look for VECTOR_SHUFFLE that's identity operation on either LHS or RHS. This can happen before DAGCombiner catches it. llvm-svn: 37636	2007-06-19 00:02:56 +00:00
Bill Wendling	94f3474832	Revert patch. It regresses: define double @test2(i64 %A) { %B = bitcast i64 %A to double ret double %B } $ llvm-as < t.ll \| llc -march=x86-64 before: .align 4 .globl _test2 _test2: movd %rdi, %xmm0 ret after: _test2: subq $8, %rsp movq %rdi, (%rsp) movsd (%rsp), %xmm0 addq $8, %rsp ret llvm-svn: 37617	2007-06-16 23:57:15 +00:00
Bill Wendling	a1f8f0aa97	Fix a failure to bit_convert from integer GPR to MMX register. llvm-svn: 37611	2007-06-16 06:17:31 +00:00
Dan Gohman	2fd7d26df8	Rename MVT::getVectorBaseType to MVT::getVectorElementType. llvm-svn: 37579	2007-06-14 22:58:02 +00:00
Chris Lattner	5f85da00bb	fix x86-64 mmx calling convention for real, which passes in integer gprs. llvm-svn: 37534	2007-06-09 05:08:10 +00:00
Chris Lattner	e965432273	fix mmx handling bug llvm-svn: 37533	2007-06-09 05:01:50 +00:00
Dan Gohman	1b1932dda5	Add explicit qualification for namespace MVT members. llvm-svn: 37320	2007-05-24 14:33:05 +00:00
Dan Gohman	ec87afe526	Use MVT::FIRST_VECTOR_VALUETYPE and MVT::LAST_VECTOR_VALUETYPE. llvm-svn: 37234	2007-05-18 18:44:07 +00:00
Evan Cheng	1b4af5f975	Fix a bogus check that prevented folding VECTOR_SHUFFLE to UNDEF; add an optimization to fold VECTOR_SHUFFLE to a zero vector. llvm-svn: 37173	2007-05-17 18:45:50 +00:00
Chris Lattner	9a53871650	This is the correct fix for PR1427. This fixes mmx-shuffle.ll and doesn't cause other regressions. llvm-svn: 37160	2007-05-17 17:13:13 +00:00
Anton Korobeynikov	375cafc275	Revert patch for PR1427. It breaks almost all vector tests. llvm-svn: 37159	2007-05-17 07:50:14 +00:00
Chris Lattner	f65fe1d931	Fix PR1427 and test/CodeGen/X86/mmx-shuffle.ll llvm-svn: 37141	2007-05-17 03:29:42 +00:00
Chris Lattner	ce20a357f1	fix subtle bugs in inline asm operand selection llvm-svn: 37065	2007-05-15 01:28:08 +00:00
Chris Lattner	60cd08c23e	Fix two classes of bugs: 1. x86 backend rejected (&gv+c) for the 'i' constraint when in static mode. 2. the matcher didn't correctly reject and accept some global addresses. the right predicate is GVRequiresExtraLoad, not "relomodel = pic". llvm-svn: 36670	2007-05-03 16:52:29 +00:00
Anton Korobeynikov	44aa4c588b	Emit correct register move information in eh frames for X86. This allows Shootout-C++/except to pass on x86/linux with non-llvm-compiled (e.g. "native") unwind runtime. llvm-svn: 36647	2007-05-02 19:53:33 +00:00
Bill Wendling	6856e741fa	Support for the special case of a vector with the canonical form: vector_shuffle v1, v2, <2, 6, 3, 7> I.e. vector_shuffle v, undef, <2, 2, 3, 3> MMX only has a shuffle for v4i16 vectors. It needs to use the unpackh for this type of operation. llvm-svn: 36403	2007-04-24 21:16:55 +00:00
Lauro Ramos Venancio	b1a101f0e7	X86 TLS: fix and optimize the implementation of "initial exec" model. llvm-svn: 36355	2007-04-22 22:50:52 +00:00
Lauro Ramos Venancio	3b60b9546e	X86 TLS: Implement review feedback. llvm-svn: 36318	2007-04-21 20:56:26 +00:00
Lauro Ramos Venancio	bc32d90b46	Implement "general dynamic", "initial exec" and "local exec" TLS models for X86 32 bits. llvm-svn: 36283	2007-04-20 21:38:10 +00:00
Anton Korobeynikov	60de2ce283	Add comment llvm-svn: 36213	2007-04-17 19:34:00 +00:00
Chris Lattner	c7109ece27	rename X86FunctionInfo to X86MachineFunctionInfo to match the header file it is defined in. llvm-svn: 36196	2007-04-17 17:21:52 +00:00
Anton Korobeynikov	9bc4b792bf	Implemented correct stack probing on mingw/cygwin for dynamic alloca's. Also, fixed static case in presence of eax livin. This fixes PR331 PS: Why don't we still have push/pop instructions? :) llvm-svn: 36195	2007-04-17 09:20:00 +00:00
Anton Korobeynikov	f3e62a428a	Removed tabs everywhere except autogenerated & external files. Add make target for tabs checking. llvm-svn: 36146	2007-04-16 18:10:23 +00:00
Chris Lattner	2b6b79b896	Fix mmx paddq, add support for the 'y' register class, though it isn't tested. llvm-svn: 35940	2007-04-12 04:14:49 +00:00
Chris Lattner	3f9ff05309	remove some dead hooks llvm-svn: 35845	2007-04-09 23:31:19 +00:00
Chris Lattner	ae6e2c0ee5	remove some dead target hooks, subsumed by isLegalAddressingMode llvm-svn: 35840	2007-04-09 22:27:04 +00:00
Chris Lattner	de148c7887	move a bunch of register constraints from being handled by getRegClassForInlineAsmConstraint to being handled by getRegForInlineAsmConstraint. This allows us to let the llvm register allocator allocate, which gives us better code. For example, X86/2007-01-29-InlineAsm-ir.ll used to compile to: _run_init_process: subl $4, %esp movl %ebx, (%esp) xorl %ebx, %ebx movl $11, %eax movl %ebx, %ecx movl %ebx, %edx # InlineAsm Start push %ebx ; movl %ebx,%ebx ; int $0x80 ; pop %ebx # InlineAsm End Now we get: _run_init_process: xorl %ecx, %ecx movl $11, %eax movl %ecx, %edx # InlineAsm Start push %ebx ; movl %ecx,%ebx ; int $0x80 ; pop %ebx # InlineAsm End llvm-svn: 35804	2007-04-09 05:49:22 +00:00
Chris Lattner	b940a717ac	implement support for CodeGen/X86/inline-asm-x-scalar.ll:test3 - i32/i64 values used with x constraints. llvm-svn: 35803	2007-04-09 05:31:48 +00:00
Chris Lattner	e2d3bf8ecf	implement CodeGen/X86/inline-asm-x-scalar.ll llvm-svn: 35799	2007-04-09 05:11:28 +00:00
Chris Lattner	c0405a348d	implement the new addressing mode description hook. llvm-svn: 35521	2007-03-30 23:15:24 +00:00
Bill Wendling	e8eccb1684	Remove cruft I put in there... llvm-svn: 35394	2007-03-28 01:02:54 +00:00
Bill Wendling	1087888176	Unbreak mmx arithmetic. It was barfing trying to do v8i8 arithmetic. llvm-svn: 35392	2007-03-28 00:57:11 +00:00
Bill Wendling	d43819da2f	Fix so that pandn is emitted instead of an xor/and combo. Add integer comparison operators. llvm-svn: 35385	2007-03-27 20:22:40 +00:00
Bill Wendling	8065cc3173	Promote to v1i64 type... llvm-svn: 35353	2007-03-26 08:03:33 +00:00
Bill Wendling	a42484728c	Add support for the v1i64 type. This makes better code for this: #include <mmintrin.h> extern __m64 C; void baz(__v2si A, __v2si B) { *A = C; _mm_empty(); } We get this: _baz: call "L1$pb" "L1$pb": popl %eax movl L_C$non_lazy_ptr-"L1$pb"(%eax), %eax movq (%eax), %mm0 movl 4(%esp), %eax movq %mm0, (%eax) emms ret GCC gives us this: _baz: pushl %ebx call L3 "L00000000001$pb": L3: popl %ebx subl $8, %esp movl L_C$non_lazy_ptr-"L00000000001$pb"(%ebx), %eax movl (%eax), %edx movl 4(%eax), %ecx movl 16(%esp), %eax movl %edx, (%eax) movl %ecx, 4(%eax) emms addl $8, %esp popl %ebx ret llvm-svn: 35351	2007-03-26 07:53:08 +00:00
Chris Lattner	b19069959d	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Chris Lattner	104e73382c	enforce the proper range for the i386 N constraint. llvm-svn: 35319	2007-03-25 01:57:35 +00:00
Bill Wendling	1bcad4c1cd	Support added for shifts and unpacking MMX instructions. llvm-svn: 35266	2007-03-22 18:42:45 +00:00
Dale Johannesen	44c0a5d545	repair x86 performance, dejagnu problems from previous change llvm-svn: 35245	2007-03-21 21:51:52 +00:00
Chris Lattner	59fe2be1c4	fix a warning llvm-svn: 35152	2007-03-19 00:39:32 +00:00
Devang Patel	2dabb16eac	Support 'I' inline asm constraint. llvm-svn: 35129	2007-03-17 00:13:28 +00:00
Bill Wendling	8ced23ee5a	And now support for MMX logical operations. llvm-svn: 35125	2007-03-16 09:44:46 +00:00
Bill Wendling	feaff80149	Multiplication support for MMX. llvm-svn: 35118	2007-03-15 21:24:36 +00:00
Evan Cheng	00edaa08b5	Under X86-64 large code model, do not emit 32-bit pc relative calls. llvm-svn: 35108	2007-03-14 22:11:11 +00:00
Evan Cheng	0eeb8b59eb	More flexible TargetLowering LSR hooks for testing whether an immediate is a legal target address immediate or scale. llvm-svn: 35073	2007-03-12 23:28:50 +00:00
Evan Cheng	4224fa3617	Stupid bug: SSE2 supports v2i64 add / sub. llvm-svn: 35070	2007-03-12 22:58:52 +00:00
Bill Wendling	236cfc4344	Adding more arithmetic operators to MMX. This is an almost exact copy of the addition. Please let me know if you have suggestions. llvm-svn: 35055	2007-03-10 09:57:05 +00:00
Bill Wendling	5fef3fd7e7	Added "padd*" support for MMX. Added MMX move stuff to X86InstrInfo so that moves, loads, etc. are recognized. llvm-svn: 35031	2007-03-08 22:09:11 +00:00
Anton Korobeynikov	85d6c1ebad	Refactoring of formal parameter flags. Enable properly use of zext/sext/aext stuff. llvm-svn: 35008	2007-03-07 16:25:09 +00:00
Bill Wendling	3c201ddd02	Properly support v8i8 and v4i16 types. It now converts them to v2i32 for load and stores. llvm-svn: 35002	2007-03-07 05:43:18 +00:00
Bill Wendling	a02d43fbbd	Add LOAD/STORE support for MMX. llvm-svn: 34978	2007-03-06 18:53:42 +00:00
Anton Korobeynikov	6da6c8c48b	Use new SDIselParamAttr enumeration. This removes "magick" constants from formal attributes' flags processing. llvm-svn: 34963	2007-03-06 08:12:33 +00:00
Evan Cheng	2fb461c1b5	X86-64 VACOPY needs custom expansion. va_list is a struct { i32, i32, i8, i8 }. llvm-svn: 34857	2007-03-02 23:16:35 +00:00
Anton Korobeynikov	7cec92bcd2	Simplify things llvm-svn: 34849	2007-03-02 21:50:27 +00:00
Chris Lattner	55dcf58453	argument lowering should copy from the vreg shadows of live-in arguments passed in registers, not directly from the pregs themselves. llvm-svn: 34838	2007-03-02 05:12:29 +00:00
Anton Korobeynikov	eaf27d276a	Ensure that fastcall'ed function is correctly mangled & stack is properly aligned llvm-svn: 34788	2007-03-01 16:29:22 +00:00
Chris Lattner	bcc44762bc	remove dead option llvm-svn: 34754	2007-02-28 18:39:53 +00:00
Chris Lattner	a66d550298	use high-level functions in CCState llvm-svn: 34739	2007-02-28 07:09:55 +00:00
Chris Lattner	3663b6e73a	make use of helper functions in CCState for analyzing formals and calls. llvm-svn: 34737	2007-02-28 07:00:42 +00:00
Chris Lattner	3762b44a0c	switch LowerFastCCCallTo over to using the new fastcall description. llvm-svn: 34734	2007-02-28 06:26:33 +00:00
Chris Lattner	a8dd712470	switch LowerFastCCArguments over to using the autogenerated Fastcall description. llvm-svn: 34733	2007-02-28 06:21:19 +00:00
Chris Lattner	3b16744840	rearrange code llvm-svn: 34731	2007-02-28 06:10:12 +00:00
Chris Lattner	023751c20b	remove fastcc (not fastcall) support llvm-svn: 34730	2007-02-28 06:05:16 +00:00
Chris Lattner	012066f78b	switch LowerCCCArguments over to using autogenerated CC. llvm-svn: 34729	2007-02-28 05:46:49 +00:00
Chris Lattner	6424f8e245	simplify sret handling llvm-svn: 34728	2007-02-28 05:39:26 +00:00
Chris Lattner	76147834d6	switch LowerCCCCallTo over to using an autogenerated callingconv llvm-svn: 34727	2007-02-28 05:31:48 +00:00
Chris Lattner	eef57fed6e	switch return value passing and the x86-64 calling convention information over to being autogenerated from the X86CallingConv.td file. llvm-svn: 34722	2007-02-28 04:55:35 +00:00
Chris Lattner	9117648533	switch x86-64 return value lowering over to using same mechanism as argument lowering uses. llvm-svn: 34657	2007-02-27 05:28:59 +00:00
Chris Lattner	11a1c2113c	Minor refactoring of CC Lowering interfaces llvm-svn: 34656	2007-02-27 05:13:54 +00:00
Chris Lattner	e34136f6d5	move CC Lowering stuff to its own public interface llvm-svn: 34655	2007-02-27 04:43:02 +00:00
Chris Lattner	cac44e283d	refactor x86-64 argument lowering yet again, this time eliminating templates, 'clients', etc, and adding CCValAssign instead. llvm-svn: 34654	2007-02-27 04:18:15 +00:00
Chris Lattner	7165ee9b6b	switch to smallvector llvm-svn: 34633	2007-02-26 07:59:53 +00:00
Chris Lattner	3fe1132dcd	initial hack at splitting the x86-64 calling convention info out from the mechanics that process it. I'm still not happy with this, but it's a step in the right direction. llvm-svn: 34631	2007-02-26 07:50:02 +00:00
Chris Lattner	d0c941c89e	the truncate must always be done, it's only the assert that is conditional. llvm-svn: 34628	2007-02-26 05:21:05 +00:00
Chris Lattner	2e7125dc74	in X86-64 CCC, i8/i16 arguments are already properly zext/sext'd on input. Capture this so that downstream zext/sext's are optimized out. This compiles: int test(short X) { return (int)X; } to: _test: movl %edi, %eax ret instead of: _test: movswl %di, %eax ret GCC produces this bizarre code: _test: movw %di, -12(%rsp) movswl -12(%rsp),%eax ret llvm-svn: 34623	2007-02-26 03:18:56 +00:00
Chris Lattner	ad14e21b97	Fix an X86-64 abi bug. We now compile: void foo(short); void bar(unsigned short A) { foo(A); } into: _bar: subq $8, %rsp movswl %di, %edi call _foo addq $8, %rsp ret instead of: _bar: subq $8, %rsp call _foo addq $8, %rsp ret Testcase here: test/CodeGen/X86/x86-64-shortint.ll llvm-svn: 34615	2007-02-25 23:10:46 +00:00
Chris Lattner	15c167cc61	fix CodeGen/X86/2007-02-25-FastCCStack.ll, a regression from my patch last night: fastcc returns should only go in XMM0 if we have SSE2 or above. llvm-svn: 34613	2007-02-25 22:23:46 +00:00
Chris Lattner	65ba08d627	fastcc functions that return double values now return them in xmm0 on x86-32. This implements CodeGen/X86/fp-stack-ret.ll:test[23] llvm-svn: 34592	2007-02-25 09:31:16 +00:00
Chris Lattner	e4ba88824d	allow vectors to be passed to stdcall/fastcall functions llvm-svn: 34590	2007-02-25 09:14:25 +00:00
Chris Lattner	fac0b30da0	move LowerRET into the 'Return Value Calling Convention Implementation' section of the file. llvm-svn: 34589	2007-02-25 09:12:39 +00:00
Chris Lattner	65d915a3b6	make all Lower*CallTo implementations use LowerCallResult to handle their result value stuff. This eliminates a bunch of duplicated code and now GetRetValueLocs is the sole place that decides where a value is returned. llvm-svn: 34588	2007-02-25 09:10:05 +00:00
Chris Lattner	423224a7b4	pass the calling convention into Lower*CallTo, instead of using ad-hoc flags. llvm-svn: 34587	2007-02-25 09:06:15 +00:00
Chris Lattner	8fa75c3ae8	factor a bunch of code out of LowerCCCCallTo into a new LowerCallResult function. This function now uses GetRetValueLocs to determine where the result values are located and concerns itself with how to pull the values out. llvm-svn: 34586	2007-02-25 08:59:22 +00:00
Chris Lattner	3bfbc23ccd	move some code around, pass in calling conv, even though it is unused llvm-svn: 34585	2007-02-25 08:29:00 +00:00
Chris Lattner	f119813ff4	simplify result value lowering by splitting the selection of where to return registers out from the logic of how to return them. This changes X86-64 to mark EAX live out when returning a 32-bit value, where before it marked RAX liveout. llvm-svn: 34582	2007-02-25 08:15:11 +00:00
Chris Lattner	bcce79717b	make void-return not a special case llvm-svn: 34579	2007-02-25 07:18:38 +00:00
Chris Lattner	d00fcb3277	eliminate a bunch more temporary vectors from X86 lowering. llvm-svn: 34578	2007-02-25 07:10:00 +00:00
Chris Lattner	f7eeef816d	eliminate temporary vectors created during X86 lowering. llvm-svn: 34577	2007-02-25 06:40:16 +00:00
Chris Lattner	6f25082e67	remove std::vector's in RET lowering. llvm-svn: 34576	2007-02-25 06:21:57 +00:00
Jim Laskey	b57ee1fc37	Simplify lowering and selection of exception ops. llvm-svn: 34488	2007-02-22 14:56:36 +00:00
Jim Laskey	6a937ad320	Support to provide exception and selector registers. llvm-svn: 34482	2007-02-21 22:54:50 +00:00
Evan Cheng	0e7be3c4e0	ELF / PIC requires GOT be in the EBX register during calls via PLT GOT pointer. Add implicit uses of EBX to calls to ensure liveintervalanalysis does not treat the GOT in EBX move as dead upon definition. This should fix PR1207. llvm-svn: 34470	2007-02-21 21:18:14 +00:00
Anton Korobeynikov	c469cbc2e7	Fixed uninitialized stuff inside LegalizeDAG. Fortunately, the only affected part is codegen of "memove" inside x86 backend. This fixes PR1144 llvm-svn: 33752	2007-02-01 08:39:52 +00:00
Nate Begeman	dc46021355	Finish off bug 680, allowing targets to custom lower frame and return address nodes. llvm-svn: 33636	2007-01-29 22:58:52 +00:00
Nick Lewycky	e788dc93d5	Fix compile error "jump to case label crosses initialization". What compiler are people using that accepts this code? llvm-svn: 33603	2007-01-28 15:39:16 +00:00
Anton Korobeynikov	611d5e2eda	Propagate changes from my local tree. This patch includes: 1. New parameter attribute called 'inreg'. It has meaning "place this parameter in registers, if possible". This is some generalization of gcc's regparm(n) attribute. It's currently used only in X86-32 backend. 2. Completely rewritten CC handling/lowering code inside X86 backend. Merged stdcall + c CCs and fastcall + fast CC. 3. Dropped CSRET CC. We cannot add struct return variant for each target-specific CC (e.g. stdcall + csretcc and so on). 4. Instead of CSRET CC introduced 'sret' parameter attribute. Setting in on first attribute has meaning 'This is hidden pointer to structure return. Handle it gently'. 5. Fixed small bug in llvm-extract + add new feature to FunctionExtraction pass, which relinks all internal-linkaged callees from deleted function to external linkage. This will allow further linking everything together. NOTEs: 1. Documentation will be updated soon. 2. llvm-upgrade should be improved to translate csret => sret. Before this, there will be some unexpected test fails. llvm-svn: 33597	2007-01-28 13:31:35 +00:00
Jim Laskey	23ed7d2625	Make LABEL a builtin opcode. llvm-svn: 33537	2007-01-26 14:34:52 +00:00
Evan Cheng	818c6bdfa2	Linux GOT indirect reference is only necessary in PIC mode. llvm-svn: 33441	2007-01-22 21:34:25 +00:00
Anton Korobeynikov	548b9af9c2	* PIC codegen for X86/Linux has been implemented * PIC-aware internal structures in X86 Codegen have been refactored * Visibility (default/weak) has been added * Docs fixes (external weak linkage, visibility, formatting) llvm-svn: 33136	2007-01-12 19:20:47 +00:00
Evan Cheng	df277336b8	- FCOPYSIGN custom lowering bug. Clear the sign bit of operand 0 first before or'ing in the sign bit of operand 1. - Tweaking: rather than left shift the sign bit, fp_extend operand 1 first before taking its sign bit if its type is smaller than that of operand 0. llvm-svn: 32932	2007-01-05 21:37:56 +00:00
Evan Cheng	c273c9c830	Typo llvm-svn: 32902	2007-01-05 08:32:24 +00:00
Evan Cheng	bcf3d2bd15	With SSE2, expand FCOPYSIGN to a series of SSE bitwise operations. llvm-svn: 32900	2007-01-05 07:55:56 +00:00
Reid Spencer	c2da5d3d97	Fix a comment that referred to the now defunct ubyte type. llvm-svn: 32840	2007-01-03 17:24:59 +00:00
Anton Korobeynikov	2b39939053	Really big cleanup. - New target type "mingw" was introduced - Same things for both mingw & cygwin are marked as "cygming" (as in gcc) - .lcomm is supported here, so allow LLVM to use it - Correctly use underscored versions of setjmp & _longjmp for both mingw & cygwin llvm-svn: 32833	2007-01-03 11:43:14 +00:00
Reid Spencer	dda168599d	For PR950: Three changes: 1. Convert signed integer types to signless versions. 2. Implement the @sext and @zext parameter attributes. Previously the type of an function parameter was used to determine whether it should be sign extended or zero extended before the call. This information is now communicated via the function type's parameter attributes. 3. The interface to LowerCallTo had to be changed in order to accommodate the parameter attribute information. Although it would have been convenient to pass in the FunctionType itself, there isn't always one present in the caller. Consequently, a signedness indication for the result type and for each parameter was provided for in the interface to this method. All implementations were changed to make the adjustment necessary. llvm-svn: 32788	2006-12-31 05:55:36 +00:00
Anton Korobeynikov	3a6faf0b96	Refactored JIT codegen for mingw32. Now we're using standart relocation type for distinguish JIT & non-JIT instead of "dirty" hacks :) llvm-svn: 32745	2006-12-22 22:29:05 +00:00
Evan Cheng	5effab79f3	f64 <-> i64 bit_convert using movq in 64-bit mode. llvm-svn: 32587	2006-12-14 21:55:39 +00:00
Anton Korobeynikov	e76b69846d	Cleaned setjmp/longjmp lowering interfaces. Now we're producing right code (both asm & cbe) for Mingw32 target. Removed autoconf checks for underscored versions of setjmp/longjmp. llvm-svn: 32415	2006-12-10 23:12:42 +00:00
Chris Lattner	6a9de21df5	If we have ScalarSSE, we can select bitconvert into single instructions. This compiles bitcast.ll:test3/test4 into: _test3: movd %xmm0, %eax ret _test4: movd %edi, %xmm0 ret llvm-svn: 32230	2006-12-05 18:45:06 +00:00
Chris Lattner	4ad68ab4d7	Fix PR1033 and CodeGen/X86/bitcast.ll, by expanding bitcast to a load/store pair. This could be better, readme entry pending. llvm-svn: 32228	2006-12-05 18:22:22 +00:00
Chris Lattner	7368984a3d	Fix typo noticed by Lauro Ramos Venancio, thanks! llvm-svn: 32223	2006-12-05 17:29:40 +00:00
Evan Cheng	2c35691a02	- Fix X86-64 JIT by temporarily disabling code that treats GV address as 32-bit immediate in small code model. The JIT cannot ensure GV's are placed in the lower 4G. - Some preliminary support for large code model. llvm-svn: 32215	2006-12-05 04:01:03 +00:00
Chris Lattner	0be88afd90	Add support for 64-bit 'r' constraint. Patch by by Rafael ~Avila de EspÃndol! This fixes PR1029 and Regression/CodeGen/X86/x86-64-asm.ll llvm-svn: 32206	2006-12-04 22:38:21 +00:00
Evan Cheng	456101ebb9	- Use a different wrapper node for RIP-relative GV, etc. - Proper support for both small static and PIC modes under X86-64 - Some (non-optimal) support for medium modes. llvm-svn: 32046	2006-11-30 21:55:46 +00:00
Evan Cheng	f64e341522	In PIC mode, GV not requiring an extra load can be used as address immediate. llvm-svn: 32028	2006-11-29 23:48:14 +00:00
Evan Cheng	7e20347607	Fix for PR1018 - Better support for X86-64 Linux in small code model. llvm-svn: 32026	2006-11-29 23:19:46 +00:00
Evan Cheng	f968824cdb	Custom lower READCYCLECOUNTER for x86-64. llvm-svn: 32017	2006-11-29 08:28:13 +00:00
Evan Cheng	98fa7ab4d7	Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead of opcode and number of operands. llvm-svn: 31947	2006-11-27 23:37:22 +00:00
Anton Korobeynikov	ddb73b4f1e	Refactored *GVRequiresExtraLoad() to Subtarget method. llvm-svn: 31887	2006-11-21 00:01:06 +00:00
Anton Korobeynikov	2449b29bdd	We should perform extra load for dllimported calls llvm-svn: 31874	2006-11-20 10:46:14 +00:00
Evan Cheng	0e82270ff2	Matches MachineInstr changes. llvm-svn: 31712	2006-11-13 23:36:35 +00:00
Evan Cheng	b9e2ae9e37	Add implicit use / def operands to created MI's. llvm-svn: 31676	2006-11-11 10:21:44 +00:00
Evan Cheng	ae1f3758bd	Don't dag combine floating point select to max and min intrinsics. Those take v4f32 / v2f64 operands and may end up causing larger spills / restores. Added X86 specific nodes X86ISD::FMAX, X86ISD::FMIN instead. This fixes PR996. llvm-svn: 31645	2006-11-10 21:43:37 +00:00
Anton Korobeynikov	23ffdb1971	Fixing PR990: http://llvm.org/PR990 . This should unbreak csretcc on Linux & mingw targets. Several tests from llvm-test should be also restored (fftbench, bigfib). llvm-svn: 31613	2006-11-10 00:48:11 +00:00
Evan Cheng	7ca1f47a96	Fixed a bug which causes x86 be to incorrectly match shuffle v, undef, <2, ?, 3, ?> to movhlps It should match to unpckhps instead. Added proper matching code for shuffle v, undef, <2, 3, 2, 3> llvm-svn: 31519	2006-11-07 22:14:24 +00:00
Reid Spencer	4bafa71dc1	For PR786: Turn on -Wunused and -Wno-unused-parameter. Clean up most of the resulting fall out by removing unused variables. Remaining warnings have to do with unused functions (I didn't want to delete code without review) and unused variables in generated code. Maintainers should clean up the remaining issues when they see them. All changes pass DejaGnu tests and Olden. llvm-svn: 31380	2006-11-02 20:25:50 +00:00
Chris Lattner	def30d3eda	allow the address of a global to be used with the "i" constraint when in -static mode. This implements PR882. llvm-svn: 31326	2006-10-31 20:13:11 +00:00
Chris Lattner	3bed109ed9	handle "st" as "st(0)" llvm-svn: 31320	2006-10-31 19:42:44 +00:00
Anton Korobeynikov	e6ba8a819c	1. Clean up code due to changes in SwitchTo*Section(2) 2. Added partial debug support for mingw\cygwin targets (the same as Linux\ELF). Please note, that currently mingw\cygwin uses 'stabs' format for storing debug info by default, thus many (runtime) libraries has this information included. These formats shouldn't be mixed in one binary ('stabs' & 'DWARF'), otherwise binutils tools will be confused. llvm-svn: 31311	2006-10-31 08:31:24 +00:00
Reid Spencer	db06ed9156	Add debug support for X86/ELF targets (Linux). This allows llvm-gcc4 generated object modules to be debugged with gdb. Hopefully this helps pre-release debugging. llvm-svn: 31299	2006-10-30 22:32:30 +00:00
Evan Cheng	5766dd6455	All targets expand BR_JT for now. llvm-svn: 31294	2006-10-30 08:02:39 +00:00
Evan Cheng	090e9abaee	Fixed a significant bug where unpcklpd is incorrectly used to extract element 1 from a v2f64 value. llvm-svn: 31228	2006-10-27 21:08:32 +00:00
Evan Cheng	a1ce4523e5	Fix for PR968: expand vector sdiv, udiv, srem, urem. llvm-svn: 31220	2006-10-27 18:49:08 +00:00
Evan Cheng	1abe8bd233	During vector shuffle lowering, we sometimes commute a vector shuffle to try to match MOVL (movss, movsd, etc.). Don't forget to commute it back and try unpck* and shufp* if that doesn't pan out. llvm-svn: 31186	2006-10-25 21:49:50 +00:00
Evan Cheng	01529405c9	Remove -disable-x86-shuffle-opti llvm-svn: 31183	2006-10-25 20:48:19 +00:00
Chris Lattner	62a0f00312	Implement branch analysis/xform hooks required by the branch folding pass. llvm-svn: 31065	2006-10-20 17:42:20 +00:00
Evan Cheng	ca5eaf4020	Avoid getting into an infinite loop when -disable-x86-shuffle-opti is specified. llvm-svn: 30974	2006-10-16 06:36:00 +00:00
Evan Cheng	fe5bb5dbe6	Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode. llvm-svn: 30945	2006-10-13 21:14:26 +00:00
Evan Cheng	d07e2f081a	Some X86ISD::CMP were created with wrong ValueType's. llvm-svn: 30913	2006-10-12 19:12:56 +00:00
Evan Cheng	8f6c6b19e6	Don't convert to MOVLP if using shufps etc. may allow load folding. llvm-svn: 30847	2006-10-09 21:39:25 +00:00
Evan Cheng	d22f3dd3ed	Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes. llvm-svn: 30844	2006-10-09 20:57:25 +00:00
Evan Cheng	275825195a	Make use of getStore(). llvm-svn: 30759	2006-10-05 23:01:46 +00:00
Chris Lattner	7f98896c02	Lower some min/max idioms to minss/maxss when unsafe fp math is enabled. llvm-svn: 30748	2006-10-05 04:11:26 +00:00
Evan Cheng	a77dd83caf	Added option -disable-x86-shuffle-opti to disable X86 specific vector shuffle optimizations. llvm-svn: 30723	2006-10-04 18:33:38 +00:00

... 3 4 5 6 7 ...

718 Commits