llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Chris Lattner	ad9a6ccb83	Remove attribution from file headers, per discussion on llvmdev. llvm-svn: 45418	2007-12-29 20:36:04 +00:00
Evan Cheng	51cf86ded0	Implement ctlz and cttz with bsr and bsf. llvm-svn: 45024	2007-12-14 02:13:44 +00:00
Chris Lattner	28262fbaf2	Several changes: 1) Change the interface to TargetLowering::ExpandOperationResult to take and return entire NODES that need a result expanded, not just the value. This allows us to handle things like READCYCLECOUNTER, which returns two values. 2) Implement (extremely limited) support in LegalizeDAG::ExpandOp for MERGE_VALUES. 3) Reimplement custom lowering in LegalizeDAGTypes in terms of the new ExpandOperationResult. This makes the result simpler and fully general. 4) Implement (fully general) expand support for MERGE_VALUES in LegalizeDAGTypes. 5) Implement ExpandOperationResult support for ARM f64->i64 bitconvert and ARM i64 shifts, allowing them to work with LegalizeDAGTypes. 6) Implement ExpandOperationResult support for X86 READCYCLECOUNTER and FP_TO_SINT, allowing them to work with LegalizeDAGTypes. LegalizeDAGTypes now passes several more X86 codegen tests when enabled and when type legalization in LegalizeDAG is ifdef'd out. llvm-svn: 44300	2007-11-24 07:07:01 +00:00
Anton Korobeynikov	cd9b16df61	Implement codegen for flt_rounds on x86 llvm-svn: 44183	2007-11-16 01:31:51 +00:00
Evan Cheng	7d8deec92f	Much improved pic jumptable codegen: Then: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry imull $4, %ecx, %ecx leal LJTI1_0-"L1$pb"(%eax), %edx addl LJTI1_0-"L1$pb"(%ecx,%eax), %edx jmpl %edx .align 2 .set L1_0_set_3,LBB1_3-LJTI1_0 .set L1_0_set_2,LBB1_2-LJTI1_0 .set L1_0_set_5,LBB1_5-LJTI1_0 .set L1_0_set_4,LBB1_4-LJTI1_0 LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 Now: call "L1$pb" "L1$pb": popl %eax ... LBB1_1: # entry addl LJTI1_0-"L1$pb"(%eax,%ecx,4), %eax jmpl %eax .align 2 .set L1_0_set_3,LBB1_3-"L1$pb" .set L1_0_set_2,LBB1_2-"L1$pb" .set L1_0_set_5,LBB1_5-"L1$pb" .set L1_0_set_4,LBB1_4-"L1$pb" LJTI1_0: .long L1_0_set_3 .long L1_0_set_2 llvm-svn: 43924	2007-11-09 01:32:10 +00:00
Rafael Espindola	ec025c3042	Move the LowerMEMCPY and LowerMEMCPYCall to a common place. Thanks for the suggestions Bill :-) llvm-svn: 43742	2007-11-05 23:12:20 +00:00
Evan Cheng	5fe81cf64e	Enable more fold (sext (load x)) -> (sext (truncate (sextload x))) transformation. Previously, it's restricted by ensuring the number of load uses is one. Now the restriction is loosened up by allowing setcc uses to be "extended" (e.g. setcc x, c, eq -> setcc sext(x), sext(c), eq). llvm-svn: 43465	2007-10-29 19:58:20 +00:00
Evan Cheng	53696b7e9f	Loosen up iv reuse to allow reuse of the same stride but a larger type when truncating from the larger type to smaller type is free. e.g. Turns this loop: LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx movw %dx, %si LBB1_2: # bb movl L_X$non_lazy_ptr, %edi movw %si, (%edi) movl L_Y$non_lazy_ptr, %edi movw %dx, (%edi) addw $4, %dx incw %si incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb into LBB1_1: # entry.bb_crit_edge xorl %ecx, %ecx xorw %dx, %dx LBB1_2: # bb movl L_X$non_lazy_ptr, %esi movw %cx, (%esi) movl L_Y$non_lazy_ptr, %esi movw %dx, (%esi) addw $4, %dx incl %ecx cmpl %eax, %ecx jne LBB1_2 # bb llvm-svn: 43375	2007-10-26 01:56:11 +00:00
Arnold Schwaighofer	d47210011e	Added tail call optimization to the x86 back end. It can be enabled by passing -tailcallopt to llc. The optimization is performed if the following conditions are satisfied: * caller/callee are fastcc * elf/pic is disabled OR elf/pic enabled + callee is in module + callee has visibility protected or hidden llvm-svn: 42870	2007-10-11 19:40:01 +00:00
Dan Gohman	6c3e0cdd36	LowerIntegerDivOrRem no longer exists. llvm-svn: 42787	2007-10-09 15:45:13 +00:00
Dan Gohman	6df332f0cb	Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to use ISD::{S,U}DIVREM and ISD::{S,U}MUL_HIO. Move the lowering code associated with these operators into target-independent in LegalizeDAG.cpp and TargetLowering.cpp. llvm-svn: 42762	2007-10-08 18:33:35 +00:00
Evan Cheng	f3c130a8b6	Enabling new condition code modeling scheme. llvm-svn: 42459	2007-09-29 00:00:36 +00:00
Rafael Espindola	01b306e575	Refactor the memcpy lowering for the x86 target. The only generated code difference is that now we call memcpy when the size of the array is unknown. This matches GCC behavior and is better since the run time value can be arbitrarily large. llvm-svn: 42433	2007-09-28 12:53:01 +00:00
Dan Gohman	a01dd49472	Fix a typo in a comment. llvm-svn: 42313	2007-09-25 19:37:26 +00:00
Dan Gohman	1bb346f9f1	When both x/y and x%y are needed (x and y both scalar integer), compute both results with a single div or idiv instruction. This uses new X86ISD nodes for DIV and IDIV which are introduced during the legalize phase so that the SelectionDAG's CSE can automatically eliminate redundant computations. llvm-svn: 42308	2007-09-25 18:23:27 +00:00
Evan Cheng	36b3babfde	Added support for new condition code modeling scheme (i.e. physical register dependency). These are a bunch of instructions that are duplicated so the x86 backend can support both the old and new schemes at the same time. They will be deleted after all the kinks are worked out. llvm-svn: 42285	2007-09-25 01:57:46 +00:00
Dale Johannesen	ea6ffa0b36	Fix PR 1681. When X86 target uses +sse -sse2, keep f32 in SSE registers and f64 in x87. This is effectively a new codegen mode. Change addLegalFPImmediate to permit float and double variants to do different things. Adjust callers. llvm-svn: 42246	2007-09-23 14:52:20 +00:00
Evan Cheng	d7742e7939	X86ISD::TEST is dead. llvm-svn: 42037	2007-09-17 17:42:53 +00:00
Rafael Espindola	5d8b225881	Add support for functions with byval arguments on x86 llvm-svn: 41953	2007-09-14 15:48:13 +00:00
Rafael Espindola	4ddaad4de0	Initial support for calling functions with byval arguments on x86-64 llvm-svn: 41643	2007-08-31 15:06:30 +00:00
Chris Lattner	1e089aac3a	rename isOperandValidForConstraint to LowerAsmOperandForConstraint, changing the interface to allow for future changes. llvm-svn: 41384	2007-08-25 00:47:38 +00:00
Anton Korobeynikov	3094846993	Move ReturnAddrIndex variable to X86MachineFunctionInfo structure. This fixed hard to catch bugs with retaddr lowering llvm-svn: 41104	2007-08-15 17:12:32 +00:00
Dan Gohman	352de5aeb9	More explicit keywords. llvm-svn: 40757	2007-08-02 21:21:54 +00:00
Duncan Sands	35a77d857b	Trampoline codegen support for X86-32. llvm-svn: 40566	2007-07-27 20:02:49 +00:00
Dan Gohman	0252aa07ee	Re-apply 40504, but with a fix for the segfault it caused in oggenc: Make the alignedload and alignedstore patterns always require 16-byte alignment. This way when they are used in the "Fs" instructions, in which a vector instruction is used for a scalar purpose, they can still require the full vector alignment. And add a regression test for this. llvm-svn: 40555	2007-07-27 17:16:43 +00:00
Evan Cheng	cb8f08ebca	Reverting 40504 for now. It's breaking oggenc. llvm-svn: 40547	2007-07-27 01:37:47 +00:00
Dan Gohman	513dcba4f8	Remove X86ISD::LOAD_PACK and X86ISD::LOAD_UA and associated code from the x86 target, replacing them with the new alignment attributes on memory references. llvm-svn: 40504	2007-07-26 00:31:09 +00:00
Anton Korobeynikov	5635277c36	Long live the exception handling! This patch fills the last necessary bits to enable exceptions handling in LLVM. Currently only on x86-32/linux. In fact, this patch adds necessary intrinsics (and their lowering) which represent really weird target-specific gcc builtins used inside unwinder. After corresponding llvm-gcc patch will land (easy) exceptions should be more or less workable. However, exceptions handling support should not be thought as 'finished': I expect many small and not so small glitches everywhere. llvm-svn: 39855	2007-07-14 14:06:15 +00:00
Dan Gohman	928144b051	Define non-intrinsic instructions for vector min, max, sqrt, rsqrt, and rcp, in addition to the intrinsic forms. Add spill-folding entries for these new instructions, and for the scalar min and max intrinsic instructions which were missing. And add some preliminary ISelLowering code for using the new non-intrinsic vector sqrt instruction, and fneg and fabs. llvm-svn: 38478	2007-07-10 00:05:58 +00:00
Dan Gohman	a62327ea40	Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from TargetLowering to SelectionDAG so that they have more convenient access to the current DAG, in preparation for the ValueType routines being changed from standalone functions to members of SelectionDAG for the pre-legalize vector type changes. llvm-svn: 37704	2007-06-22 14:59:07 +00:00
Bill Wendling	6856e741fa	Support for the special case of a vector with the canonical form: vector_shuffle v1, v2, <2, 6, 3, 7> I.e. vector_shuffle v, undef, <2, 2, 3, 3> MMX only has a shuffle for v4i16 vectors. It needs to use the unpackh for this type of operation. llvm-svn: 36403	2007-04-24 21:16:55 +00:00
Lauro Ramos Venancio	bc32d90b46	Implement "general dynamic", "initial exec" and "local exec" TLS models for X86 32 bits. llvm-svn: 36283	2007-04-20 21:38:10 +00:00
Anton Korobeynikov	9bc4b792bf	Implemented correct stack probing on mingw/cygwin for dynamic alloca's. Also, fixed static case in presence of eax livin. This fixes PR331 PS: Why don't we still have push/pop instructions? :) llvm-svn: 36195	2007-04-17 09:20:00 +00:00
Chris Lattner	3f9ff05309	remove some dead hooks llvm-svn: 35845	2007-04-09 23:31:19 +00:00
Chris Lattner	ae6e2c0ee5	remove some dead target hooks, subsumed by isLegalAddressingMode llvm-svn: 35840	2007-04-09 22:27:04 +00:00
Chris Lattner	c0405a348d	implement the new addressing mode description hook. llvm-svn: 35521	2007-03-30 23:15:24 +00:00
Chris Lattner	b19069959d	switch TargetLowering::getConstraintType to take the entire constraint, not just the first letter. No functionality change. llvm-svn: 35322	2007-03-25 02:14:49 +00:00
Dale Johannesen	44c0a5d545	repair x86 performance, dejagnu problems from previous change llvm-svn: 35245	2007-03-21 21:51:52 +00:00
Evan Cheng	0eeb8b59eb	More flexible TargetLowering LSR hooks for testing whether an immediate is a legal target address immediate or scale. llvm-svn: 35073	2007-03-12 23:28:50 +00:00
Evan Cheng	2fb461c1b5	X86-64 VACOPY needs custom expansion. va_list is a struct { i32, i32, i8, i8 }. llvm-svn: 34857	2007-03-02 23:16:35 +00:00
Chris Lattner	023751c20b	remove fastcc (not fastcall) support llvm-svn: 34730	2007-02-28 06:05:16 +00:00
Chris Lattner	decf97fae2	add an accessor. llvm-svn: 34625	2007-02-26 04:01:25 +00:00
Chris Lattner	423224a7b4	pass the calling convention into Lower*CallTo, instead of using ad-hoc flags. llvm-svn: 34587	2007-02-25 09:06:15 +00:00
Chris Lattner	8fa75c3ae8	factor a bunch of code out of LowerCCCCallTo into a new LowerCallResult function. This function now uses GetRetValueLocs to determine where the result values are located and concerns itself with how to pull the values out. llvm-svn: 34586	2007-02-25 08:59:22 +00:00
Chris Lattner	f119813ff4	simplify result value lowering by splitting the selection of where to return registers out from the logic of how to return them. This changes X86-64 to mark EAX live out when returning a 32-bit value, where before it marked RAX liveout. llvm-svn: 34582	2007-02-25 08:15:11 +00:00
Nate Begeman	dc46021355	Finish off bug 680, allowing targets to custom lower frame and return address nodes. llvm-svn: 33636	2007-01-29 22:58:52 +00:00
Anton Korobeynikov	611d5e2eda	Propagate changes from my local tree. This patch includes: 1. New parameter attribute called 'inreg'. It has meaning "place this parameter in registers, if possible". This is some generalization of gcc's regparm(n) attribute. It's currently used only in X86-32 backend. 2. Completely rewritten CC handling/lowering code inside X86 backend. Merged stdcall + c CCs and fastcall + fast CC. 3. Dropped CSRET CC. We cannot add struct return variant for each target-specific CC (e.g. stdcall + csretcc and so on). 4. Instead of CSRET CC introduced 'sret' parameter attribute. Setting in on first attribute has meaning 'This is hidden pointer to structure return. Handle it gently'. 5. Fixed small bug in llvm-extract + add new feature to FunctionExtraction pass, which relinks all internal-linkaged callees from deleted function to external linkage. This will allow further linking everything together. NOTEs: 1. Documentation will be updated soon. 2. llvm-upgrade should be improved to translate csret => sret. Before this, there will be some unexpected test fails. llvm-svn: 33597	2007-01-28 13:31:35 +00:00
Evan Cheng	df277336b8	- FCOPYSIGN custom lowering bug. Clear the sign bit of operand 0 first before or'ing in the sign bit of operand 1. - Tweaking: rather than left shift the sign bit, fp_extend operand 1 first before taking its sign bit if its type is smaller than that of operand 0. llvm-svn: 32932	2007-01-05 21:37:56 +00:00
Evan Cheng	bcf3d2bd15	With SSE2, expand FCOPYSIGN to a series of SSE bitwise operations. llvm-svn: 32900	2007-01-05 07:55:56 +00:00
Evan Cheng	456101ebb9	- Use a different wrapper node for RIP-relative GV, etc. - Proper support for both small static and PIC modes under X86-64 - Some (non-optimal) support for medium modes. llvm-svn: 32046	2006-11-30 21:55:46 +00:00
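
The FCOPYSIGN fix in df277336b8 describes a two-step bit manipulation: clear the sign bit of operand 0 first, then OR in the sign bit of operand 1. The following is a minimal standalone sketch of that idea for 64-bit doubles, not the LLVM lowering code itself. The function name `copysign_bits` and the constant `kSignMask` are hypothetical names chosen for illustration, and the sketch assumes IEEE 754 binary64 doubles.

```cpp
#include <cassert>
#include <cstdint>
#include <cstring>

// Hypothetical helper (not LLVM code): copysign for doubles using integer
// bit masks. Assumes IEEE 754 binary64, where bit 63 is the sign bit.
double copysign_bits(double magnitude, double sign) {
    static_assert(sizeof(double) == sizeof(std::uint64_t),
                  "sketch assumes a 64-bit double");
    const std::uint64_t kSignMask = 0x8000000000000000ULL;

    std::uint64_t m, s;
    std::memcpy(&m, &magnitude, sizeof m);  // reinterpret bits without UB
    std::memcpy(&s, &sign, sizeof s);

    m &= ~kSignMask;       // step 1: clear the sign bit of operand 0
    m |= (s & kSignMask);  // step 2: OR in the sign bit of operand 1

    double out;
    std::memcpy(&out, &m, sizeof out);
    return out;
}
```

Skipping step 1 would leave a negative operand 0 negative regardless of operand 1, which appears to be the kind of bug the commit message refers to.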

130 Commits