mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 04:22:57 +02:00
Commit Graph

377 Commits

Author SHA1 Message Date
Dan Gohman
b91bef08a7 Add titles to the various SelectionDAG viewGraph calls
that include useful information like the name of the
block being viewed and the current phase of compilation.

llvm-svn: 53872
2008-07-21 20:00:07 +00:00
Dan Gohman
8981962672 Add a new function, ReplaceAllUsesOfValuesWith, which handles bulk
replacement of multiple values. This is slightly more efficient
than doing multiple ReplaceAllUsesOfValueWith calls, and theoretically
could be optimized even further. However, an important property of this
new function is that it handles the case where the source value set and
destination value set overlap. This makes it feasible for isel to use
SelectNodeTo in many very common cases, which is advantageous because
SelectNodeTo avoids a temporary node and it doesn't require CSEMap
updates for users of values that don't change position.
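
As a rough, hypothetical sketch of the bulk form (illustrative only; the exact
types and signature in this revision may differ), a caller replacing two
results of an old node at once might look like:

    SDValue From[] = { SDValue(OldNode, 0), SDValue(OldNode, 1) };
    SDValue To[]   = { NewValue0, NewValue1 };
    CurDAG->ReplaceAllUsesOfValuesWith(From, To, 2);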

Revamp MorphNodeTo, which is what does all the work of SelectNodeTo, to
handle operand lists more efficiently, and to correctly handle a number
of corner cases to which its new wider use exposes it.

This commit also includes a change to the encoding of post-isel opcodes
in SDNodes; now instead of being sandwiched between the target-independent
pre-isel opcodes and the target-dependent pre-isel opcodes, post-isel
opcodes are now represented as negative values. This makes it possible
to test if an opcode is pre-isel or post-isel without having to know
the size of the current target's post-isel instruction set.
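
A trivial illustrative check this enables (hypothetical helper, not the exact
API of this revision): given the opcode stored as a signed value,

    bool isPostISelOpcode(int Opcode) { return Opcode < 0; }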

These changes speed up llc overall by 3% and reduce memory usage by 10%
on the InstructionCombining.cpp testcase with -fast and -regalloc=local.

llvm-svn: 53728
2008-07-17 19:10:17 +00:00
Dan Gohman
4c8c8e3aad Fix the result type of X86's truncate to i8.
llvm-svn: 53688
2008-07-16 16:20:48 +00:00
Evan Cheng
67ce381ffe Do not use computationally expensive scheduling heuristics with -fast.
llvm-svn: 52971
2008-07-01 18:05:03 +00:00
Evan Cheng
3f664b6fd3 Split scheduling from instruction selection.
llvm-svn: 52923
2008-06-30 20:45:06 +00:00
Evan Cheng
deb754898b Unbreak DECLARE isel in pic mode.
llvm-svn: 52439
2008-06-18 02:48:27 +00:00
Evan Cheng
89e2e3292d Rather than avoiding wrapping the ISD::DECLARE GV operand in X86ISD::Wrapper, simply handle it at dagisel time with x86-specific isel code.
llvm-svn: 52377
2008-06-17 02:01:22 +00:00
Duncan Sands
d634afe3aa Wrap MVT::ValueType in a struct to get type safety
and better control the abstraction.  Rename the type
to MVT.  To update out-of-tree patches, the main
thing to do is to rename MVT::ValueType to MVT, and
rewrite expressions like MVT::getSizeInBits(VT) in
the form VT.getSizeInBits().  Use VT.getSimpleVT()
to extract a MVT::SimpleValueType for use in switch
statements (you will get an assert failure if VT is
an extended value type - these shouldn't exist after
type legalization).
This results in a small speedup of codegen and no
new testsuite failures (x86-64 linux).
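
An illustrative sketch of the update described above (hypothetical out-of-tree
code; VT is assumed to be a value-type variable):

    // before:
    unsigned Bits = MVT::getSizeInBits(VT);
    // after:
    unsigned Bits = VT.getSizeInBits();
    // and in switch statements, switch over the simple type:
    switch (VT.getSimpleVT()) {
    case MVT::i32: /* ... */ break;
    default: break;
    }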

llvm-svn: 52044
2008-06-06 12:08:01 +00:00
Dan Gohman
4e87d82476 Fix a tblgen problem handling variable_ops in tblgen instruction
definitions. This adds a new construct, "discard", for indicating
that a named node in the input matching pattern is to be discarded,
instead of corresponding to a node in the output pattern. This
allows tblgen to know where the arguments for the variable_ops are
supposed to begin.

This fixes "rdar://5791600", whatever that is ;-).

llvm-svn: 51699
2008-05-29 19:57:41 +00:00
Evan Cheng
4f660778f0 Use movlps / movhps to modify low / high half of 16-byte memory location.
llvm-svn: 51501
2008-05-23 21:23:16 +00:00
Evan Cheng
3493e43afd Handle a few more cases of folding load i64 into xmm and zero top bits.
Note, some of the code will be moved into the target-independent part of the DAG combiner in a subsequent patch.

llvm-svn: 50918
2008-05-09 21:53:03 +00:00
Evan Cheng
f97e716511 Handle vector move / load which zero the destination register top bits (i.e. movd, movq, movss (addr), movsd (addr)) with X86 specific dag combine.
llvm-svn: 50838
2008-05-08 00:57:18 +00:00
Evan Cheng
37ca5de3b7 Not checking for intrinsics which do not have a chain operand.
llvm-svn: 50260
2008-04-25 08:55:28 +00:00
Evan Cheng
e177dc6696 - Switch from std::set to SmallPtrSet.
- Add comments.

llvm-svn: 50259
2008-04-25 08:22:20 +00:00
Chris Lattner
8c9f6c929a Loosen up an assertion to allow intrinsics. I really have no
idea what this code (findNonImmUse) does, so I'm only guessing 
that this is the right thing.  It would be really really nice
if this had comments and perhaps switched to SmallPtrSet
(hint hint) :)

This fixes rdar://5886601, a crash on gcc.target/i386/sse4_1-pblendw.c

llvm-svn: 50252
2008-04-25 05:13:01 +00:00
Roman Levenstein
b40d332929 Re-commit of r48822, with the infinite looping problem discovered
by Dan Gohman now fixed.

llvm-svn: 49330
2008-04-07 10:06:32 +00:00
Evan Cheng
12d2bbde0d Cosmetic
llvm-svn: 49156
2008-04-03 07:45:18 +00:00
Evan Cheng
497c607fae Backing out 48222 temporarily.
llvm-svn: 49124
2008-04-03 03:13:16 +00:00
Roman Levenstein
55b8822511 Use a linked data structure for the use lists of an SDNode, just like
LLVM Value/Use and MachineRegisterInfo/MachineOperand do.
This allows constant time for all uses list maintenance operations.

The idea was suggested by Chris. Reviewed by Evan and Dan.
Patch is tested and approved by Dan.

On normal use-cases compilation speed is not affected. On very big basic
blocks there are compilation speedups in the range of 15-20% or even better. 

llvm-svn: 48822
2008-03-26 12:39:26 +00:00
Chris Lattner
edfc239ced remove Evan's "ugly hack" that sorta attempted to get
x86-64 return conventions correct, but was never enabled.
We can now do the "right thing" with multiple return values.

llvm-svn: 48635
2008-03-21 06:50:21 +00:00
Christopher Lamb
b4f4b41048 Make insert_subreg a two-address instruction, vastly simplifying LowerSubregs pass. Add a new TII, subreg_to_reg, which is like insert_subreg except that it takes an immediate implicit value to insert into rather than a register.
llvm-svn: 48412
2008-03-16 03:12:01 +00:00
Christopher Lamb
0f1c32eb63 Get rid of a pseudo instruction and replace it with a subreg-based operation on real instructions, ridding the asm printers of the hack used to do this previously. In the process, update LowerSubregs to be careful about eliminating copies that have side effects.
Note: the coalescer will have to be careful about this too, when it starts coalescing insert_subreg nodes.
llvm-svn: 48329
2008-03-13 05:47:01 +00:00
Christopher Lamb
74f4d837df Recommitting parts of r48130. These do not appear to cause the observed failures.
llvm-svn: 48223
2008-03-11 10:09:17 +00:00
Chris Lattner
9826c9365e Change the model for FP Stack return to use fp operands on the
RET instruction instead of using FpSET_ST0_32.  This also generalizes
the code to handling returning of multiple FP results.

llvm-svn: 48209
2008-03-11 03:23:40 +00:00
Chris Lattner
f0684bfd16 Don't emit FP_REG_KILL into a block that just returns. Nothing
can be live out of the block anyway, so it isn't needed.

llvm-svn: 48192
2008-03-10 23:34:12 +00:00
Evan Cheng
067ecbc341 Revert 48125, 48126, and 48130 for now to unbreak some x86-64 tests.
llvm-svn: 48167
2008-03-10 19:31:26 +00:00
Christopher Lamb
32e5ce3d96 Allow insert_subreg into implicit, target-specific values.
Change insert/extract subreg instructions to be able to be used in TableGen patterns.
Use the above features to reimplement an x86-64 pseudo instruction as a pattern.

llvm-svn: 48130
2008-03-10 06:12:08 +00:00
Chris Lattner
826402e365 rename FpGETRESULT32 -> FpGET_ST0_32 etc. Add support for
isel'ing value preserving FP roundings from one fp stack reg to another
into a noop, instead of stack traffic.

llvm-svn: 48093
2008-03-09 07:05:32 +00:00
Evan Cheng
139517b682 Remove -always-fold-and-in-test.
llvm-svn: 47871
2008-03-04 00:40:35 +00:00
Evan Cheng
e1d3e0958b Set to default: x86 no longer folds 'and' into 'test' if it has more than one use.
llvm-svn: 47711
2008-02-28 07:46:38 +00:00
Dan Gohman
b8e7fea22f Revert the assert for MUL_LOHI with an unused high result; Chris
pointed out that this isn't correct at -O0.

llvm-svn: 47575
2008-02-25 22:43:48 +00:00
Dan Gohman
2585bc7c46 Add an assert to verify that we don't see an
{S,U}MUL_LOHI with an unused high value.

llvm-svn: 47569
2008-02-25 22:15:55 +00:00
Dan Gohman
1914fa6932 Remove the hack that turned an {S,U}MUL_LOHI with an unused high
result into a MUL late in the X86 codegen process. ISD::MUL is
once again Legal on X86, so this is no longer needed. And, the
hack was suboptimal; see PR1874 for details.

llvm-svn: 47567
2008-02-25 21:57:04 +00:00
Dan Gohman
012abf0109 Convert MaskedValueIsZero and all its users to use APInt. Also add
a SignBitIsZero function to simplify a common use case.
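
A hedged sketch of the simplification this enables (illustrative call sites;
the exact signatures in this revision may differ):

    // before: prove the sign bit is zero with an explicit APInt mask
    bool NonNegOld = DAG.MaskedValueIsZero(Op, APInt::getSignBit(BitWidth));
    // after: the new helper expresses the same check directly
    bool NonNegNew = DAG.SignBitIsZero(Op);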

llvm-svn: 47561
2008-02-25 21:11:39 +00:00
Evan Cheng
f3a7cd1c62 Poorly named option.
llvm-svn: 47400
2008-02-20 20:57:32 +00:00
Evan Cheng
e9708c997f Disable for now. This is pessimizing code.
llvm-svn: 47354
2008-02-20 02:29:17 +00:00
Evan Cheng
35253f2c22 Add hidden option -x86-fold-and-in-test to test the effect of the test / and folding change.
llvm-svn: 47351
2008-02-19 23:36:51 +00:00
Evan Cheng
e3ddcfa588 Only use x86-64 rip relative addressing in non-static mode?
llvm-svn: 47019
2008-02-12 19:20:46 +00:00
Dan Gohman
cabaec582f Rename MRegisterInfo to TargetRegisterInfo.
llvm-svn: 46930
2008-02-10 18:45:23 +00:00
Evan Cheng
a377b2bbd1 Fix a x86-64 codegen deficiency. Allow gv + offset when using rip addressing mode.
Before:
_main:
        subq    $8, %rsp
        leaq    _X(%rip), %rax
        movsd   8(%rax), %xmm1
        movss   _X(%rip), %xmm0
        call    _t
        xorl    %ecx, %ecx
        movl    %ecx, %eax
        addq    $8, %rsp
        ret
Now:
_main:
        subq    $8, %rsp
        movsd   _X+8(%rip), %xmm1
        movss   _X(%rip), %xmm0
        call    _t
        xorl    %ecx, %ecx
        movl    %ecx, %eax
        addq    $8, %rsp
        ret

Notice there is another idiotic codegen issue that needs to be fixed asap:
xorl    %ecx, %ecx
movl    %ecx, %eax

llvm-svn: 46850
2008-02-07 08:53:49 +00:00
Evan Cheng
1c67dcaae7 Dwarf requires variable entries to be in the source order. Right now, since we are recording variable information at isel time this means parameters would appear in the reverse order. The short term fix is to issue recordVariable() at asm printing time instead.
llvm-svn: 46724
2008-02-04 23:06:48 +00:00
Evan Cheng
c57ec111f2 SDIsel processes llvm.dbg.declare by recording the variable debug information descriptor and its corresponding stack frame index in MachineModuleInfo. This only works if the local variable is "homed" in the stack frame. It does not work for byval parameters, etc.
Added ISD::DECLARE node type to represent the llvm.dbg.declare intrinsic. Now the intrinsic calls are lowered into an SDNode and live on throughout the codegen passes.
For now, since all the debugging information recording is done at isel time, when an ISD::DECLARE node is selected, it has the side effect of also recording the variable. This is a short-term solution that should be fixed in time.

llvm-svn: 46659
2008-02-02 04:07:54 +00:00
Evan Cheng
618761903d Work in progress. This patch *fixes* x86-64 calls which are modelled as StructRet but really should be returned in registers, e.g. _Complex long double, some 128-bit aggregates. This is a short-term solution that is necessary only because llvm, for now, cannot model i128 nor calls with multiple results.
Status: This only works for direct calls, and only the caller side is done. Disabled for now.

llvm-svn: 46527
2008-01-29 19:34:22 +00:00
Chris Lattner
16a8f126d3 Significantly simplify and improve handling of FP function results on x86-32.
This case returns the value in ST(0) and then has to convert it to an SSE
register.  This causes significant codegen ugliness in some cases.  For 
example in the trivial fp-stack-direct-ret.ll testcase we used to generate:

_bar:
	subl	$28, %esp
	call	L_foo$stub
	fstpl	16(%esp)
	movsd	16(%esp), %xmm0
	movsd	%xmm0, 8(%esp)
	fldl	8(%esp)
	addl	$28, %esp
	ret

because we move the result of foo() into an XMM register, then have to
move it back for the return of bar.

Instead of hacking ever-more special cases into the call result lowering code
we take a much simpler approach: on x86-32, fp return is modeled as always 
returning into an f80 register which is then truncated to f32 or f64 as needed.
Similarly for a result, we model it as an extension to f80 + return.

This exposes the truncate and extensions to the dag combiner, allowing target
independent code to hack on them, eliminating them in this case.  This gives 
us this code for the example above:

_bar:
	subl	$12, %esp
	call	L_foo$stub
	addl	$12, %esp
	ret

The nasty aspect of this is that these conversions are not legal, but we want
the second pass of dag combiner (post-legalize) to be able to hack on them.
To handle this, we lie to legalize and say they are legal, then custom expand
them on entry to the isel pass (PreprocessForFPConvert).  This is gross, but
less gross than the code it is replacing :)

This also allows us to generate better code in several other cases.  For 
example on fp-stack-ret-conv.ll, we now generate:

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstps	8(%esp)
	movl	16(%esp), %eax
	cvtss2sd	8(%esp), %xmm0
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

where before we produced (incidentally, the old bad code is identical to what
gcc produces):

_test:
	subl	$12, %esp
	call	L_foo$stub
	fstpl	(%esp)
	cvtsd2ss	(%esp), %xmm0
	cvtss2sd	%xmm0, %xmm0
	movl	16(%esp), %eax
	movsd	%xmm0, (%eax)
	addl	$12, %esp
	ret

Note that we generate slightly worse code on pr1505b.ll due to a scheduling 
deficiency that is unrelated to this patch.

llvm-svn: 46307
2008-01-24 08:07:48 +00:00
Evan Cheng
165d39b8e4 Fix a x86-64 static codegen bug. This fixes a lot of x86-64 jit failures.
llvm-svn: 45733
2008-01-08 02:06:11 +00:00
Evan Cheng
5b9282f3b5 Combine MovePCtoStack + POP32r into one instruction MOVPC32r so it can be moved if needed.
llvm-svn: 45605
2008-01-05 00:41:47 +00:00
Chris Lattner
96167aa93c Rename SSARegMap -> MachineRegisterInfo in keeping with the idea
that "machine" classes are used to represent the current state of
the code being compiled.  Given this expanded name, we can start 
moving other stuff into it.  For now, move the UsedPhysRegs and
LiveIn/LiveOut vectors from MachineFunction into it.

Update all the clients to match.

This also reduces some needless #includes, such as MachineModuleInfo
from MachineFunction.

llvm-svn: 45467
2007-12-31 04:13:23 +00:00
Chris Lattner
ad9a6ccb83 Remove attribution from file headers, per discussion on llvmdev.
llvm-svn: 45418
2007-12-29 20:36:04 +00:00
Evan Cheng
8f4ec948d3 Fix JIT code emission of X86::MovePCtoStack.
llvm-svn: 45307
2007-12-22 02:26:46 +00:00
Evan Cheng
343929c773 Fold some and + shift in x86 addressing mode.
llvm-svn: 44970
2007-12-13 00:43:27 +00:00
Chris Lattner
12fca81026 aesthetic changes, no functionality change. Evan, it's not clear
what 'Available' is, please add a comment near it and rename it
if appropriate.

llvm-svn: 44703
2007-12-08 07:22:58 +00:00
Chris Lattner
be0c5a0500 Fix a long standing deficiency in the X86 backend: we would
sometimes emit "zero" and "all one" vectors multiple times,
for example:

_test2:
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M1
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M2
	ret

instead of:

_test2:
	pcmpeqd	%mm0, %mm0
	movq	%mm0, _M1
	movq	%mm0, _M2
	ret

This patch fixes this by always arranging for zero/one vectors
to be defined as v4i32 or v2i32 (SSE/MMX) instead of letting them be
any random type.  This ensures they get trivially CSE'd on the dag.
This fix is also important for LegalizeDAGTypes, as it gets unhappy
when the x86 backend wants BUILD_VECTOR(i64 0) to be legal even when
'i64' isn't legal.

This patch makes the following changes:

1) X86TargetLowering::LowerBUILD_VECTOR now lowers 0/1 vectors into
   their canonical types.
2) The now-dead patterns are removed from the SSE/MMX .td files.
3) All the patterns in the .td file that referred to immAllOnesV or
   immAllZerosV in the wrong form now use *_bc to match them with a
   bitcast wrapped around them.
4) X86DAGToDAGISel::SelectScalarSSELoad is generalized to handle 
   bitcast'd zero vectors, which simplifies the code actually.
5) getShuffleVectorZeroOrUndef is updated to generate a shuffle that
   is legal, instead of generating one that is illegal and expecting
   a later legalize pass to clean it up.
6) isZeroShuffle is generalized to handle bitcast of zeros.
7) several other minor tweaks.

This patch is definite goodness, but has the potential to cause random
code quality regressions.  Please be on the lookout for these and let 
me know if they happen.

llvm-svn: 44310
2007-11-25 00:24:49 +00:00
Bill Wendling
df2eaa8a55 Silence, accursed warning
llvm-svn: 43609
2007-11-01 08:51:44 +00:00
Dan Gohman
76e104c8ad Fix the folding of multiplication into addresses on x86, which was broken
by the recent {U,S}MUL_LOHI changes.

llvm-svn: 43230
2007-10-22 20:22:24 +00:00
Evan Cheng
f1ead16fd5 Flag MOV32to32_ with EXTRACT_SUBREG. They should not be scheduled apart.
llvm-svn: 42894
2007-10-12 07:55:53 +00:00
Dan Gohman
cc317de0f5 Fix grammar in a comment.
llvm-svn: 42786
2007-10-09 15:44:37 +00:00
Dan Gohman
6df332f0cb Migrate X86 and ARM from using X86ISD::{,I}DIV and ARMISD::MULHILO{U,S} to
use ISD::{S,U}DIVREM and ISD::{S,U}MUL_LOHI. Move the lowering code
associated with these operators into target-independent code in LegalizeDAG.cpp
and TargetLowering.cpp.

llvm-svn: 42762
2007-10-08 18:33:35 +00:00
Anton Korobeynikov
ca03aec919 Partly revert invalid r41774
llvm-svn: 42322
2007-09-25 21:52:30 +00:00
Dan Gohman
1bb346f9f1 When both x/y and x%y are needed (x and y both scalar integer), compute
both results with a single div or idiv instruction. This uses new X86ISD
nodes for DIV and IDIV which are introduced during the legalize phase
so that the SelectionDAG's CSE can automatically eliminate redundant
computations.
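
A minimal source-level example (illustrative, not from the commit) where both
results are wanted and CSE now leaves a single idiv:

    int quotrem(int x, int y) {
      // The quotient and the remainder both come from one idiv instruction.
      return x / y + x % y;
    }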

llvm-svn: 42308
2007-09-25 18:23:27 +00:00
Dale Johannesen
5ea6a9bc3a When mixing SSE and x87 codegen, it's possible to
have situations where an SSE instruction turns into
multiple blocks, with the live range of an x87
register crossing them.  To do this correctly make
sure we examine all blocks when inserting
FP_REG_KILL.  PR 1697.  (This was exposed by my
fix for PR 1681, but the same thing could happen
mixing x87 long double with SSE.)

llvm-svn: 42281
2007-09-24 22:52:39 +00:00
Evan Cheng
65df926ced TableGen no longer emits CopyFromReg nodes for implicit results in physical
registers. The scheduler is now responsible for emitting them.

llvm-svn: 41781
2007-09-07 23:59:02 +00:00
Dale Johannesen
783215c630 Apply feedback from previous patch.
llvm-svn: 41774
2007-09-07 21:07:57 +00:00
Dale Johannesen
81d6ecb886 Enhance APFloat to retain bits of NaNs (fixes oggenc).
Use APFloat interfaces for more references, mostly
of ConstantFPSDNode.

llvm-svn: 41632
2007-08-31 04:03:46 +00:00
Dan Gohman
2390ff5060 When x86 addresses matching exceeds its recursion limit, check to
see if the base register is already occupied before assuming it can be
used. This fixes bogus code generation in the accompanying testcase.

llvm-svn: 41049
2007-08-13 20:03:06 +00:00
Christopher Lamb
7e52a97df5 Use subregs to improve any_extend code generation when feasible.
llvm-svn: 41013
2007-08-10 22:22:41 +00:00
Christopher Lamb
450f6815b9 Increase efficiency of sign_extend_inreg by using subregisters for truncation. As the README suggests sign_extend_subreg is selected to (sext(trunc)).
llvm-svn: 41010
2007-08-10 21:48:46 +00:00
Evan Cheng
a58ebc46dd divb / mulb output to AH. Under x86-64 it's not legal to read AH if the instruction requires a REX prefix (i.e. outputs to r8b, etc.), so issue a shift right by 8 on AX and then truncate it to 8 bits instead.
llvm-svn: 40972
2007-08-09 21:59:35 +00:00
Dale Johannesen
6b8e91e7e3 Long double patch 8 of N: make it partially work in
SSE mode (all but conversions <-> other FP types, I think):
- Do not mark all-80-bit operations as "Requires[FPStack]"
  (which really means "not SSE").
- Refactor load-and-extend to facilitate this.
- Update comments.
- Handle long double in SSE when computing FP_REG_KILL.

llvm-svn: 40906
2007-08-07 20:29:26 +00:00
Dale Johannesen
3ea9879011 Get X86 long double calling convention to work
(on Darwin, anyway).  Fix some table omissions for
LD arithmetic.

llvm-svn: 40877
2007-08-06 21:31:06 +00:00
Evan Cheng
3163814591 Switch some multiplication instructions over to the new scheme for testing.
llvm-svn: 40723
2007-08-02 05:48:35 +00:00
Evan Cheng
0fa6cdbff5 Mac OS X X86-64 low 4G address not available.
llvm-svn: 40701
2007-08-01 23:45:51 +00:00
Christopher Lamb
919ce03da6 Change the x86 backend to use extract_subreg for truncation operations. Passes DejaGnu, SingleSource and MultiSource.
llvm-svn: 40578
2007-07-29 01:24:57 +00:00
Evan Cheng
9802b13b38 Minor bug.
llvm-svn: 40535
2007-07-26 17:02:45 +00:00
Evan Cheng
413d222576 Same goes for constantpool, etc.
llvm-svn: 40517
2007-07-26 07:35:15 +00:00
Evan Cheng
9588231d34 Mac OS X x86-64 lower 4G address is not available.
llvm-svn: 40502
2007-07-25 23:41:36 +00:00
Dan Gohman
1444c5840b Add const to CanBeFoldedBy, CheckAndMask, and CheckOrMask.
llvm-svn: 40480
2007-07-24 23:00:27 +00:00
Dale Johannesen
7af19491d3 Fix for PR 1505 (and 1489). Rewrite X87 register
model to include f32 variants.  Some factoring
improvements forthcoming.

llvm-svn: 37847
2007-07-03 00:53:03 +00:00
Dan Gohman
a62327ea40 Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from
TargetLowering to SelectionDAG so that they have more convenient
access to the current DAG, in preparation for the ValueType routines
being changed from standalone functions to members of SelectionDAG for
the pre-legalize vector type changes.
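
A hedged before/after sketch of what the move means for call sites
(hypothetical variables; the depth parameter and exact types are elided):

    // before: the helpers lived on TargetLowering
    TLI.ComputeMaskedBits(Op, Mask, KnownZero, KnownOne);
    // after: they are members of SelectionDAG
    DAG.ComputeMaskedBits(Op, Mask, KnownZero, KnownOne);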

llvm-svn: 37704
2007-06-22 14:59:07 +00:00
Chris Lattner
b97b122176 Fix CodeGen/X86/2007-03-24-InlineAsmPModifier.ll
llvm-svn: 35926
2007-04-11 22:29:46 +00:00
Anton Korobeynikov
1a8740c88b Oops :)
llvm-svn: 35438
2007-03-28 18:38:33 +00:00
Anton Korobeynikov
d59c4e54c7 Don't allow MatchAddress to recurse too much. This trims exponential
behaviour in some cases.

llvm-svn: 35437
2007-03-28 18:36:33 +00:00
Chris Lattner
b9cc0ade43 Two changes:
1) codegen a shift of a register as a shift, not an LEA.
2) teach the RA to convert a shift to an LEA instruction if it wants something
   in three-address form.

This gives us asm diffs like:

-       leal (,%eax,4), %eax
+       shll $2, %eax

which is faster on some processors and smaller on all of them.

and, more interestingly:

-       movl 24(%esi), %eax
-       leal (,%eax,4), %edi
+       movl 24(%esi), %edi
+       shll $2, %edi

Without #2, #1 was a significant pessimization in some cases.

This implements CodeGen/X86/shift-codegen.ll

llvm-svn: 35204
2007-03-20 06:08:29 +00:00
Chris Lattner
3e01807e87 Fix a miscompilation in the addr mode code trying to implement X | C and
X + C to promote LEA formation.  We would incorrectly apply it in some cases
(test) and miss it in others.

This fixes CodeGen/X86/2007-02-04-OrAddrMode.ll

llvm-svn: 33884
2007-02-04 20:18:17 +00:00
Evan Cheng
818c6bdfa2 Linux GOT indirect reference is only necessary in PIC mode.
llvm-svn: 33441
2007-01-22 21:34:25 +00:00
Reid Spencer
09efdecc2d Adjust #includes to compensate for the loss of DerivedTypes.h in
TargetLowering.h

llvm-svn: 33154
2007-01-12 23:22:14 +00:00
Anton Korobeynikov
548b9af9c2 * PIC codegen for X86/Linux has been implemented
* PIC-aware internal structures in X86 Codegen have been refactored
* Visibility (default/weak) has been added
* Docs fixes (external weak linkage, visibility, formatting)

llvm-svn: 33136
2007-01-12 19:20:47 +00:00
Anton Korobeynikov
2b39939053 Really big cleanup.
- New target type "mingw" was introduced
- Same things for both mingw & cygwin are marked as "cygming" (as in gcc)
- .lcomm is supported here, so allow LLVM to use it
- Correctly use underscored versions of setjmp & _longjmp for both mingw & cygwin

llvm-svn: 32833
2007-01-03 11:43:14 +00:00
Chris Lattner
8896b6cb46 eliminate static ctors for Statistic objects.
llvm-svn: 32703
2006-12-19 22:59:26 +00:00
Evan Cheng
f5c9f4c3c9 Fix for PR1062 by Dan Gohman.
llvm-svn: 32688
2006-12-19 21:31:42 +00:00
Bill Wendling
f13d78d3b8 What should be the last unnecessary <iostream>s in the library.
llvm-svn: 32333
2006-12-07 22:21:48 +00:00
Chris Lattner
a531ce882e Detemplatize the Statistic class. The only type it is instantiated with
is 'unsigned'.

llvm-svn: 32279
2006-12-06 17:46:33 +00:00
Evan Cheng
40a5de9cd9 Revert an unintended change.
llvm-svn: 32239
2006-12-05 22:03:40 +00:00
Evan Cheng
adeea85f7d - Switch X86-64 JIT to large code size model.
- Re-enable some codegen niceties for X86-64 static relocation model codegen.
- Clean ups, etc.

llvm-svn: 32238
2006-12-05 19:50:18 +00:00
Evan Cheng
2c35691a02 - Fix X86-64 JIT by temporarily disabling code that treats GV address as 32-bit
immediate in small code model. The JIT cannot ensure GV's are placed in the
lower 4G.
- Some preliminary support for large code model.

llvm-svn: 32215
2006-12-05 04:01:03 +00:00
Evan Cheng
456101ebb9 - Use a different wrapper node for RIP-relative GV, etc.
- Proper support for both small static and PIC modes under X86-64
- Some (non-optimal) support for medium modes.

llvm-svn: 32046
2006-11-30 21:55:46 +00:00
Evan Cheng
1e3f41acde Clean up.
llvm-svn: 32027
2006-11-29 23:46:27 +00:00
Evan Cheng
7e20347607 Fix for PR1018 - Better support for X86-64 Linux in small code model.
llvm-svn: 32026
2006-11-29 23:19:46 +00:00
Evan Cheng
98fa7ab4d7 Change MachineInstr ctor's to take a TargetInstrDescriptor reference instead
of opcode and number of operands.

llvm-svn: 31947
2006-11-27 23:37:22 +00:00
Evan Cheng
a9176b38f9 For unsigned 8-bit division, use movzbw to set the lower 8 bits of AX while
clearing the upper 8 bits instead of issuing two instructions. This also
eliminates the need to target the AH register which can be problematic on
x86-64.

llvm-svn: 31832
2006-11-17 22:10:14 +00:00
Bill Wendling
b6061e32fa Removed even more std::cerr and #include <iostream> things.
llvm-svn: 31813
2006-11-17 07:52:03 +00:00
Evan Cheng
0e82270ff2 Matches MachineInstr changes.
llvm-svn: 31712
2006-11-13 23:36:35 +00:00
Evan Cheng
b9e2ae9e37 Add implicit use / def operands to created MI's.
llvm-svn: 31676
2006-11-11 10:21:44 +00:00
Evan Cheng
f880ed86ff Add all implicit defs to FP_REG_KILL mi.
llvm-svn: 31674
2006-11-11 07:19:36 +00:00
Evan Cheng
3a017e8abd Fix a bug in SelectScalarSSELoad. Since the load is wrapped in a
SCALAR_TO_VECTOR, even if the hasOneUse() check passes we may end up folding
the load into two instructions. Make sure we check the SCALAR_TO_VECTOR
has only one use as well.

llvm-svn: 31641
2006-11-10 21:23:04 +00:00
Evan Cheng
736a8eb3cd Match tblgen changes.
llvm-svn: 31571
2006-11-08 20:34:28 +00:00
Jeff Cohen
e1003da1a2 Unbreak VC++ build.
llvm-svn: 31464
2006-11-05 19:31:28 +00:00
Chris Lattner
24e8fdc1f6 silence warning
llvm-svn: 31393
2006-11-03 01:13:15 +00:00
Evan Cheng
c73547a71d SelectScalarSSELoad should call CanBeFoldedBy as well.
llvm-svn: 30973
2006-10-16 06:34:55 +00:00
Evan Cheng
6c8de88f88 Corrected load folding check. We need to start from the root of the sub-dag
being matched and ensure there isn't a non-direct path to the load (i.e. a
path that goes out of the sub-dag).

llvm-svn: 30958
2006-10-14 08:33:25 +00:00
Evan Cheng
fe5bb5dbe6 Merge ISD::TRUNCSTORE to ISD::STORE. Switch to using StoreSDNode.
llvm-svn: 30945
2006-10-13 21:14:26 +00:00
Evan Cheng
76d365ac84 Doh. It was only by luck that this wasn't causing problems.
llvm-svn: 30914
2006-10-12 19:13:59 +00:00
Chris Lattner
fde6859201 fix compilation failure of smg2000
llvm-svn: 30900
2006-10-12 03:55:48 +00:00
Chris Lattner
502246c4a6 Fold "zero extending vector loads" now that Evan added the chain manipulation stuff.
This compiles both tests in X86/vec_ss_load_fold.ll into:

_test1:
        movss 4(%esp), %xmm0
        subss LCPI1_0, %xmm0
        mulss LCPI1_1, %xmm0
        minss LCPI1_2, %xmm0
        xorps %xmm1, %xmm1
        maxss %xmm1, %xmm0
        cvttss2si %xmm0, %eax
        andl $65535, %eax
        ret

instead of:

_test1:
        movss LCPI1_0, %xmm0
        movss 4(%esp), %xmm1
        subss %xmm0, %xmm1
        movss LCPI1_1, %xmm0
        mulss %xmm0, %xmm1
        movss LCPI1_2, %xmm0
        minss %xmm0, %xmm1
        xorps %xmm0, %xmm0
        maxss %xmm0, %xmm1
        cvttss2si %xmm1, %eax
        andl $65535, %eax
        ret

llvm-svn: 30894
2006-10-11 22:09:58 +00:00
Evan Cheng
95140c9c64 ComplexPatterns sse_load_f32 and sse_load_f64 return in / out chain operands.
llvm-svn: 30892
2006-10-11 21:06:01 +00:00
Evan Cheng
b2998e15f2 More isel time load folding checking for nodes that produce flag values.
See comment in CanBeFoldedBy() for detailed explanation.

llvm-svn: 30851
2006-10-10 01:46:56 +00:00
Evan Cheng
d22f3dd3ed Reflects ISD::LOAD / ISD::LOADX / LoadSDNode changes.
llvm-svn: 30844
2006-10-09 20:57:25 +00:00
Chris Lattner
3cd1d08ac6 completely disable folding of loads into scalar sse instructions and provide
a framework for doing it right.  This fixes
CodeGen/X86/2006-10-07-ScalarSSEMiscompile.ll.

Once X86DAGToDAGISel::SelectScalarSSELoad is implemented right, this task
will be done.

llvm-svn: 30817
2006-10-07 21:55:32 +00:00
Evan Cheng
82dcacb63d Not needed.
llvm-svn: 30674
2006-09-29 22:05:10 +00:00
Anton Korobeynikov
7c2118575c Added some eye-candy for Subtarget type checking
Added X86 StdCall & FastCall calling conventions. Codegen will follow.

llvm-svn: 30446
2006-09-17 20:25:45 +00:00
Evan Cheng
e9bbf85e5e Remove an unnecessary check.
llvm-svn: 30382
2006-09-14 23:55:02 +00:00
Chris Lattner
89d7fe3917 Fix a regression in the 32-bit port from the 64-bit port landing.
We now compile CodeGen/X86/lea-2.ll into:

_test:
        movl 4(%esp), %eax
        movl 8(%esp), %ecx
        leal -5(%ecx,%eax,4), %eax
        ret

instead of:

_test:
        movl 4(%esp), %eax
        leal (,%eax,4), %eax
        addl 8(%esp), %eax
        addl $4294967291, %eax
        ret

llvm-svn: 30288
2006-09-13 04:45:25 +00:00
Evan Cheng
dd52a60189 Reflects MachineConstantPoolEntry changes.
llvm-svn: 30279
2006-09-12 21:04:05 +00:00
Evan Cheng
15dd42884e Committing X86-64 support.
llvm-svn: 30177
2006-09-08 06:48:29 +00:00
Evan Cheng
69ef4ae2a1 Oops. Bad typo. Without the check of N1.hasOneUse() bad things can happen.
Suppose the TokenFactor can reach the Op:

       [Load chain]
           ^
           |
         [Load]
         ^    ^
         |    |
        /      \-
       /         |
      /          [Op]
     /          ^ ^
     |        ..  |
     |       /    |
   [TokenFactor]  |
       ^          |
       |          |
        \        /
         \      /
         [Store]

If we move the Load below the TokenFactor, we would have created a cycle in
the DAG.

llvm-svn: 30040
2006-09-01 22:52:28 +00:00
Evan Cheng
9fda1129ce Remove dead code.
llvm-svn: 29962
2006-08-29 21:42:58 +00:00
Evan Cheng
b747dc2ab0 Don't perform the load/op/store transformation if the op produces a floating point
or vector result. X86 does not have load/mod/store variants of those
instructions.

llvm-svn: 29957
2006-08-29 18:37:37 +00:00
Evan Cheng
4c63a7ed05 - Enable x86 isel preprocessing by default unless -fast is specified.
- Also disable isel load folding if -fast.

llvm-svn: 29956
2006-08-29 18:28:33 +00:00
Evan Cheng
3f6a206f01 Avoid making unneeded load/mod/store transformation which can hurt performance.
llvm-svn: 29952
2006-08-29 06:44:17 +00:00
Evan Cheng
25d25dd384 Add an optional pass to preprocess the DAG before x86 isel to allow selecting more load/mod/store instructions.
llvm-svn: 29943
2006-08-28 20:10:17 +00:00
Chris Lattner
33bd5dcfb7 s|llvm/Support/Visibility.h|llvm/Support/Compiler.h|
llvm-svn: 29911
2006-08-27 12:54:02 +00:00
Evan Cheng
a6f81f1863 Do not use getTargetNode() and SelectNodeTo() which take more than 3
SDOperand arguments. Use the variants which take an array and number instead.

llvm-svn: 29907
2006-08-27 08:14:06 +00:00
Evan Cheng
1c3d571e4b SelectNodeTo now returns a SDNode*.
llvm-svn: 29901
2006-08-26 08:00:10 +00:00
Evan Cheng
2db7799507 Select() no longer requires the Result operand by reference.
llvm-svn: 29898
2006-08-26 05:34:46 +00:00
Evan Cheng
57893e39fe Match tblgen changes; clean up.
llvm-svn: 29894
2006-08-26 01:05:16 +00:00
Evan Cheng
33c5017ffb Doh. Incorrectly inverted condition. Also add an isOnlyUse check to match tablegen.
llvm-svn: 29741
2006-08-16 23:59:00 +00:00
Evan Cheng
7fb75bbc8d SelectNodeTo() may return a SDOperand that is different from the input.
llvm-svn: 29726
2006-08-16 07:30:09 +00:00
Evan Cheng
6053206580 Match tablegen changes.
llvm-svn: 29604
2006-08-11 09:08:15 +00:00
Evan Cheng
01cd84d113 Eliminate reachability matrix. It has to be calculated before any instruction
selection is done. That's rather expensive especially in situations where it
isn't really needed.
Move back to searching the predecessors, but make use of topological order
to trim the search space.

llvm-svn: 29559
2006-08-08 00:31:00 +00:00
Evan Cheng
d18be1d9c1 Match tablegen isel changes.
llvm-svn: 29549
2006-08-07 22:28:20 +00:00
Evan Cheng
445674348f Reflect change to AssignTopologicalOrder().
llvm-svn: 29480
2006-08-02 22:01:32 +00:00
Evan Cheng
6fd2b20b8a Use of vector<bool> causes a horrendous compile time regression (2x)!
Looks like the libstdc++ implementation does not scale very well. Switch back
to using directly managed arrays.

llvm-svn: 29469
2006-08-02 09:18:33 +00:00
Evan Cheng
29d6f9d252 Factor topological order code to SelectionDAG. Clean up.
llvm-svn: 29430
2006-08-01 08:17:22 +00:00
Evan Cheng
e4c19806cd Can't spell.
llvm-svn: 29383
2006-07-28 06:33:41 +00:00
Evan Cheng
8ea5ac0abd Some clean up.
llvm-svn: 29382
2006-07-28 06:05:06 +00:00
Evan Cheng
5f0e94c299 Rename IsFoldableBy to CanBeFoldedBy
llvm-svn: 29376
2006-07-28 01:03:48 +00:00
Evan Cheng
c43a75b7d4 Node selected into address mode cannot be folded.
llvm-svn: 29374
2006-07-28 00:49:31 +00:00
Evan Cheng
8920047e85 Another duh. Determine topological order before any target node is added.
llvm-svn: 29371
2006-07-28 00:10:59 +00:00
Evan Cheng
9d43eb616a Brain cramp..
llvm-svn: 29370
2006-07-27 23:35:40 +00:00
Evan Cheng
24b41a766b Allocating too large an array for ReachabilityMatrix.
llvm-svn: 29367
2006-07-27 22:35:40 +00:00
Evan Cheng
17ccdcc415 Calculate portions of the reachability matrix on demand.
llvm-svn: 29366
2006-07-27 22:10:00 +00:00
Evan Cheng
6a126e3adb isNonImmUse is replaced by IsFoldableBy
llvm-svn: 29365
2006-07-27 21:19:10 +00:00
Evan Cheng
dbcca8f422 Use reachability information to determine whether a node can be folded into another during isel.
llvm-svn: 29346
2006-07-27 16:44:36 +00:00
Chris Lattner
adc7078c98 Hide x86 symbols
llvm-svn: 28976
2006-06-28 23:27:49 +00:00
Chris Lattner
82b121e762 Add support for "m" inline asm constraints.
llvm-svn: 28728
2006-06-08 18:03:49 +00:00
Evan Cheng
696779cea0 Cygwin support. Patch by Anton Korobeynikov!
llvm-svn: 28672
2006-06-02 22:38:37 +00:00
Evan Cheng
ddb0525a32 Use xor to clear a register.
llvm-svn: 28667
2006-06-02 21:20:34 +00:00
Evan Cheng
d7e0bab7f0 Remove bogus comment.
llvm-svn: 28564
2006-05-30 20:24:48 +00:00
Evan Cheng
f7637e403f An addressing mode folding enhancement:
Fold c2 in (x << c1) | c2 where c2 < (1 << c1)
e.g.
int test(int x) {
  return (x << 3) + 7;
}

This can be codegen'd as:
leal 7(,%eax,8), %eax

llvm-svn: 28550
2006-05-30 06:59:36 +00:00
Evan Cheng
09942d3f8b Assert if InflightSet is not cleared after instruction selecting a BB.
llvm-svn: 28459
2006-05-25 00:24:28 +00:00
Evan Cheng
b040dd86af Clear HandleMap and ReplaceMap after instruction selection. Otherwise it may cause
non-deterministic behavior.

llvm-svn: 28454
2006-05-24 20:46:25 +00:00
Chris Lattner
f604017e47 Patches to make the LLVM sources more -pedantic clean. Patch provided
by Anton Korobeynikov!  This is a step towards closing PR786.

llvm-svn: 28447
2006-05-24 17:04:05 +00:00
Evan Cheng
52fde7f5ce Back out indirect branch load folding hack. It broke some tests.
llvm-svn: 28425
2006-05-21 06:28:50 +00:00
Evan Cheng
a0bbbba168 - Uses of the load's chain result should be redirected to the load's chain operand.
If it reads the chain result of the call, then the use, callseq_start,
  and call would form a cycle!
- Don't forget to handle node replacement!
- There could also be a TokenFactor between the load and the callseq_start.

llvm-svn: 28420
2006-05-20 09:21:39 +00:00
Evan Cheng
95259a0f68 Missing break statements.
llvm-svn: 28418
2006-05-20 07:44:28 +00:00
Evan Cheng
550e73a900 Remove unused patterns.
llvm-svn: 28417
2006-05-20 01:40:16 +00:00
Evan Cheng
c68ea538e7 Handle an indirect call which folds a load manually. This is never matched by
the TableGen-generated code since the load's chain result is read by
the callseq_start node.

llvm-svn: 28416
2006-05-20 01:36:52 +00:00
Evan Cheng
dc9b5f5fc0 X86 integer register class naming changes. Make them consistent with the FP and vector classes.
llvm-svn: 28324
2006-05-16 07:21:53 +00:00
Evan Cheng
871a83d4d0 Remove dead code
llvm-svn: 28261
2006-05-12 19:03:56 +00:00
Evan Cheng
0fb3fc3626 Fixing truncate. Previously we were emitting truncate from r16 to r8 as
movw. That is we promote the destination operand to r16. So
        %CH = TRUNC_R16_R8 %BP
is emitted as
        movw %bp, %cx.

This is incorrect. If %cl is live, it would be clobbered.
Ideally we want to do the opposite, that is emitted it as
        movb ??, %ch
But this is not possible since %bp does not have a r8 sub-register.

We are now defining a new register class R16_ which is a subclass of R16
containing only those 16-bit registers that have r8 sub-registers (i.e.
AX - DX). We isel the truncate to two instructions, a MOV16to16_ to copy the
value to the R16_ class, followed by a TRUNC_R16_R8.

Due to bug 770, the register coalescer is not going to coalesce between R16 and
R16_. That will be fixed later so we can eliminate the MOV16to16_. Right now, it
can only be eliminated if we are lucky that source and destination registers are
the same.

llvm-svn: 28164
2006-05-08 08:01:26 +00:00
Evan Cheng
84612a59c2 Better implementation of truncate. ISel matches it to a pseudo instruction
that gets emitted as movl (for r32 to i16, i8) or a movw (for r16 to i8). And
if the destination gets allocated a subregister of the source operand, then
the instruction will not be emitted at all.

llvm-svn: 28119
2006-05-05 05:40:20 +00:00
Chris Lattner
e199d55073 #include Intrinsics.h into all dag isels
llvm-svn: 27109
2006-03-25 06:47:10 +00:00
Evan Cheng
7ec94f2ff7 Added getTargetLowering() to TargetMachine. Refactored targets to support this.
llvm-svn: 26742
2006-03-13 23:20:37 +00:00
Evan Cheng
fab8a53944 Don't match x << 1 to LEAL. It's better to emit x + x.
llvm-svn: 26429
2006-02-28 21:13:57 +00:00
Evan Cheng
de768027d9 * Cleaned up addressing mode matching code.
* Cleaned up and tweaked LEA cost analysis code. Removed some hacks.
* Handle ADD $X, c to MOV32ri $X+c. These patterns cannot be autogen'd and
  they need to be matched before LEA.

llvm-svn: 26376
2006-02-25 10:09:08 +00:00
Evan Cheng
cb9fb051a5 - Clean up the lowering and selection code of ConstantPool, GlobalAddress,
and ExternalSymbol.
- Use C++ code (rather than tblgen'd selection code) to match the
  above-mentioned leaf nodes. Do not mutate the nodes and do not record the
  selection in CodeGenMap. These nodes should be safe to duplicate. This is
  a performance win.

llvm-svn: 26335
2006-02-23 20:41:18 +00:00
Evan Cheng
2977507828 PIC-related bug fixes.
1. Various asm printer bugs.
2. Lowering bug. Now TargetGlobalAddress is wrapped in X86ISD::TGAWrapper.

llvm-svn: 26324
2006-02-23 02:43:52 +00:00
Evan Cheng
b8000b03aa X86 codegen tweak to use lea in another case:
Suppose base == %eax and it has multiple uses, then instead of
  movl %eax, %ecx
  addl $8, %ecx
use
  leal 8(%eax), %ecx.

llvm-svn: 26323
2006-02-23 00:13:58 +00:00
Evan Cheng
bf3558a375 x86 / Darwin PIC support.
llvm-svn: 26273
2006-02-18 00:15:05 +00:00
Evan Cheng
1b8029264a Prevent certain nodes that have already been selected from being folded into
the X86 addressing mode. Currently we do not allow any node whose target node
produces a chain as well as any node that is at the root of the addressing
mode expression tree.

llvm-svn: 26117
2006-02-11 02:05:36 +00:00
Evan Cheng
eaaafb36c8 Nicer code. :-)
llvm-svn: 26111
2006-02-10 22:46:26 +00:00
Evan Cheng
d141f84c34 Added X86 isel debugging stuff.
llvm-svn: 26110
2006-02-10 22:24:32 +00:00
Evan Cheng
d491020c15 Match tblgen change.
llvm-svn: 26096
2006-02-09 22:12:53 +00:00
Evan Cheng
6bd0f9c4ba Match getTargetNode() changes (it now returns SDNode* instead of SDOperand).
llvm-svn: 26085
2006-02-09 07:17:49 +00:00
Evan Cheng
521e5a1bfe Change Select() from
SDOperand Select(SDOperand N);
to
void Select(SDOperand &Result, SDOperand N);

llvm-svn: 26067
2006-02-09 00:37:58 +00:00
Evan Cheng
080996281c - Update load folding checks to match those auto-generated by tblgen.
- Manually select SDOperands returned by TryFoldLoad which make up the
  load address.

llvm-svn: 26012
2006-02-06 06:02:33 +00:00
Evan Cheng
fb902782e8 Use SelectRoot() as entry of any tblgen based isel.
llvm-svn: 25997
2006-02-05 06:46:41 +00:00
Evan Cheng
c23d3cd6c3 Re-commit the last bit of change that was backed out.
llvm-svn: 25983
2006-02-05 05:25:07 +00:00
Chris Lattner
15ab5c6d0c Temporarily revert this patch, which probably breaks with the
tblgen patch reverted.

llvm-svn: 25971
2006-02-04 09:24:16 +00:00
Evan Cheng
6a366c5eda A complex pattern's custom matcher should not call Select() on any operands.
Select them afterwards if it returns true.

llvm-svn: 25968
2006-02-04 08:50:49 +00:00
Evan Cheng
45ebd632f2 - Allow XMM load (for scalar use) to be folded into ANDP* and XORP*.
- Use XORP* to implement fneg.
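
For illustration (not from the commit), the kind of source that now selects to
an xorps against a sign-bit constant:

    float fneg(float x) {
      return -x;   // implemented as xorps with a sign-bit mask, no FP subtract needed
    }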

llvm-svn: 25857
2006-01-31 22:28:30 +00:00
Evan Cheng
5891f49c47 x86 CPU detection and proper subtarget support
llvm-svn: 25679
2006-01-27 08:10:46 +00:00
Chris Lattner
aafc339b4e Add explicit #includes of <iostream>
llvm-svn: 25515
2006-01-22 23:41:00 +00:00
Evan Cheng
ddb170b73e Didn't mean to check that in.
llvm-svn: 25436
2006-01-19 01:52:56 +00:00
Evan Cheng
aebece2f7b An obvious typo
llvm-svn: 25435
2006-01-19 01:46:14 +00:00
Evan Cheng
de33ca2831 Fix FP_TO_INT**_IN_MEM lowering.
llvm-svn: 25368
2006-01-16 21:21:29 +00:00
Chris Lattner
20f25dc8c2 Use the default lowering of ISD::DYNAMIC_STACKALLOC, delete now dead code.
llvm-svn: 25333
2006-01-15 09:00:21 +00:00
Chris Lattner
95882698e5 silence a warning
llvm-svn: 25322
2006-01-14 20:11:13 +00:00
Evan Cheng
3c3391632d Select DYNAMIC_STACKALLOC
llvm-svn: 25225
2006-01-11 22:15:18 +00:00
Evan Cheng
e42281bcba * Add special entry code to main() (to set x87 to 64-bit precision).
* Allow a register node as SelectAddr() base.
* ExternalSymbol -> TargetExternalSymbol as direct function callee.
* Use X86::ESP register rather than CopyFromReg(X86::ESP) as stack ptr for
  call parameter passing.

llvm-svn: 25207
2006-01-11 06:09:51 +00:00
Chris Lattner
1d0926075f implement FP_REG_KILL insertion for the dag-to-dag instruction selector
llvm-svn: 25192
2006-01-11 01:15:34 +00:00