llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Dale Johannesen	b423c8f205	Do not force asm's to be chained if they don't touch memory and aren't volatile. This was interfering with good scheduling. llvm-svn: 69008	2009-04-14 00:56:56 +00:00
Daniel Dunbar	e2a54c13b3	Make these errors more noticable in build logs. llvm-svn: 68998	2009-04-13 22:26:09 +00:00
Bob Wilson	355508e70f	Change SelectionDAG type legalization to allow BUILD_VECTOR operands to be promoted to legal types without changing the type of the vector. This is following a suggestion from Duncan (http://lists.cs.uiuc.edu/pipermail/llvmdev/2009-February/019923.html). The transformation that used to be done during type legalization is now postponed to DAG legalization. This allows the BUILD_VECTORs to be optimized and potentially handled specially by target-specific code. It turns out that this is also consistent with an optimization done by the DAG combiner: a BUILD_VECTOR and INSERT_VECTOR_ELT may be combined by replacing one of the BUILD_VECTOR operands with the newly inserted element; but INSERT_VECTOR_ELT allows its scalar operand to be larger than the element type, with any extra high bits being implicitly truncated. The result is a BUILD_VECTOR where one of the operands has a type larger the the vector element type. Any code that operates on BUILD_VECTORs may now need to be aware of the potential type discrepancy between the vector element type and the BUILD_VECTOR operands. This patch updates all of the places that I could find to handle that case. llvm-svn: 68996	2009-04-13 22:05:19 +00:00
Dan Gohman	8393d29bc8	Rename COPY_TO_SUBCLASS to COPY_TO_REGCLASS, and generalize it accordingly. Thanks to Jakob Stoklund Olesen for pointing out how this might be useful. llvm-svn: 68986	2009-04-13 21:06:25 +00:00
Bob Wilson	58529085d2	Refactor some code in SelectionDAGLegalize::ExpandBUILD_VECTOR. llvm-svn: 68981	2009-04-13 20:20:30 +00:00
Devang Patel	92d79ef835	Right now, Debugging information to encode scopes (DW_TAG_lexical_block) relies on DBG_LABEL. Unfortunately this intefers with the quality of optimized code. This patch updates dwarf writer to encode scoping information in DWARF only in FastISel mode. llvm-svn: 68973	2009-04-13 18:13:16 +00:00
Devang Patel	ad7f61c279	Reapply 68847. Now debug_inlined section is covered by TAI->doesDwarfUsesInlineInfoSection(), which is false by default. llvm-svn: 68964	2009-04-13 17:02:03 +00:00
Dan Gohman	3873cb7a36	Add a new TargetInstrInfo MachineInstr opcode, COPY_TO_SUBCLASS. This will be used to replace things like X86's MOV32to32_. Enhance ScheduleDAGSDNodesEmit to be more flexible and robust in the presense of subregister superclasses and subclasses. It can now cope with the definition of a virtual register being in a subclass of a use. Re-introduce the code for recording register superreg classes and subreg classes. This is needed because when subreg extracts and inserts get coalesced away, the virtual registers are left in the correct subclass. llvm-svn: 68961	2009-04-13 15:38:05 +00:00
Chris Lattner	d1b365fc66	make UpdateValueMap handle the possiblity that we could be copying into the right register, avoiding a copy. llvm-svn: 68889	2009-04-12 07:46:30 +00:00
Chris Lattner	ed355b8551	optimize FastISel::UpdateValueMap to avoid duplicate map lookups, and make it return the assigned register. llvm-svn: 68888	2009-04-12 07:45:01 +00:00
Dan Gohman	ac11c8d30f	Revert r68847. It breaks the build on non-Darwin targets, with this message from the assembler: Error: unknown pseudo-op: `.debug_inlined' llvm-svn: 68863	2009-04-11 15:57:04 +00:00
Devang Patel	6f907173e0	Keep track of inlined functions and their locations. This information is collected when nested llvm.dbg.func.start intrinsics are seen. (Right now, inliner removes nested llvm.dbg.func.start intrinisics during inlining.) Create debug_inlined dwarf section using these information. This info is used by gdb, at least on Darwin, to enable better experience debugging inlined functions. See DwarfWriter.cpp for more information on structure of debug_inlined section. llvm-svn: 68847	2009-04-11 00:16:47 +00:00
Bob Wilson	3b4991e76d	Clean up a bunch of whitespace issues and fix a comment typo. No functional changes. llvm-svn: 68808	2009-04-10 18:48:47 +00:00
Dan Gohman	b85fd685f8	Now that register classes have names, include the name in debug output. llvm-svn: 68786	2009-04-10 15:59:38 +00:00
Dan Gohman	8121b3f88d	Remove the obsolete SelectionDAG::getNodeValueTypes and simplify code that uses it by using SelectionDAG::getVTList instead. llvm-svn: 68744	2009-04-09 23:54:40 +00:00
Devang Patel	500dc5157e	Silence unused variable warning. llvm-svn: 68735	2009-04-09 23:45:17 +00:00
Devang Patel	78fd57741b	llvm.dbg.func_start also defines beginning of function scope. llvm-svn: 68727	2009-04-09 21:42:11 +00:00
Dan Gohman	68de98eef3	Generalize ExtendUsesToFormExtLoad to be usable for ANY_EXTEND, in addition to ZERO_EXTEND and SIGN_EXTEND. Fix a bug in the way it checked for live-out values, and simplify the way it find users by using SDNode::use_iterator's (relatively) new features. Also, make it slightly more permissive on targets with free truncates. In SelectionDAGBuild, avoid creating ANY_EXTEND nodes that are larger than necessary. If the target's SwitchAmountTy has enough bits, use it. This exposes the truncate to optimization early, enabling more optimizations. llvm-svn: 68670	2009-04-09 03:51:29 +00:00
Dan Gohman	6479d0cbb7	Don't copy the operand of a SwitchInst into virtual registers as eagerly. This helps avoid CopyToReg nodes in some cases where they aren't needed, and also helps subsequent optimizer heuristics in cases where the extra nodes would cause the node to appear to have multiple results. This doesn't have a significant impact currently; it'll help an upcoming change. llvm-svn: 68667	2009-04-09 02:33:36 +00:00
Duncan Sands	d0e186d90f	Soft float support for FREM. llvm-svn: 68614	2009-04-08 16:20:57 +00:00
Duncan Sands	ee34b0d05d	Soft float support for undef. Reported by Xerxes Rånby. llvm-svn: 68607	2009-04-08 13:33:37 +00:00
Dan Gohman	c9ce27d6b7	Implement support for using modeling implicit-zero-extension on x86-64 with SUBREG_TO_REG, teach SimpleRegisterCoalescing to coalesce SUBREG_TO_REG instructions (which are similar to INSERT_SUBREG instructions), and teach the DAGCombiner to take advantage of this on targets which support it. This eliminates many redundant zero-extension operations on x86-64. This adds a new TargetLowering hook, isZExtFree. It's similar to isTruncateFree, except it only applies to actual definitions, and not no-op truncates which may not zero the high bits. Also, this adds a new optimization to SimplifyDemandedBits: transform operations like x+y into (zext (add (trunc x), (trunc y))) on targets where all the casts are no-ops. In contexts where the high part of the add is explicitly masked off, this allows the mask operation to be eliminated. Fix the DAGCombiner to avoid undoing these transformations to eliminate casts on targets where the casts are no-ops. Also, this adds a new two-address lowering heuristic. Since two-address lowering runs before coalescing, it helps to be able to look through copies when deciding whether commuting and/or three-address conversion are profitable. Also, fix a bug in LiveInterval::MergeInClobberRanges. It didn't handle the case that a clobber range extended both before and beyond an existing live range. In that case, multiple live ranges need to be added. This was exposed by the new subreg coalescing code. Remove 2008-05-06-SpillerBug.ll. It was bugpoint-reduced, and the spiller behavior it was looking for no longer occurrs with the new instruction selection. llvm-svn: 68576	2009-04-08 00:15:30 +00:00
Devang Patel	74fc1ab23d	Revert prev. patch for now. llvm-svn: 68569	2009-04-07 23:00:04 +00:00
Devang Patel	6fce3ad779	Right now DBG_LABEL are required for llvm.dbg.region_start and llvm.dbg.region_end in non-fast mode also. llvm-svn: 68559	2009-04-07 22:27:56 +00:00
Dan Gohman	e98c3b1ea1	Don't attempt to handle aggregate argument values in FastISel; let SelectionDAG do those. This fixes PR3955. llvm-svn: 68546	2009-04-07 20:40:11 +00:00
Dan Gohman	ea48adc739	Fix a TargetLowering optimization so that it doesn't duplicate loads when an input node has multiple uses. llvm-svn: 68398	2009-04-03 20:11:30 +00:00
Dan Gohman	f431de416e	Delete ISD::INSERT_SUBREG and ISD::EXTRACT_SUBREG, which are unused. Note that these are distinct from TargetInstrInfo::INSERT_SUBREG and TargetInstrInfo::EXTRACT_SUBREG, which are used. llvm-svn: 68355	2009-04-03 00:25:26 +00:00
Sanjiv Gupta	01fb90085c	To convert the StopPoint insn into an assembler directive by ISel, we need to have access to the line number field. So we convert that info as an operand by custom handling DBG_STOPPOINT in legalize. llvm-svn: 68329	2009-04-02 18:03:10 +00:00
Evan Cheng	0674a512bf	Fully general expansion of integer shift of any size. llvm-svn: 68134	2009-03-31 19:39:24 +00:00
Dan Gohman	5f64e10d8d	Minor top-level comment fix. llvm-svn: 68113	2009-03-31 16:51:18 +00:00
Dan Gohman	cba99ee717	Fix live-out reg logic to not insert over-aggressive AssertZExt instructions. This fixes lua. llvm-svn: 68083	2009-03-31 01:38:29 +00:00
Duncan Sands	602234cdf3	Fix PR3899: add support for extracting floats from vectors when using -soft-float. Based on a patch by Jakob Stoklund Olesen. llvm-svn: 67996	2009-03-29 13:51:06 +00:00
Arnold Schwaighofer	76188bc8a1	Make check in CheckTailCallReturnConstraints for ignorable instructions between a CALL and a RET node more generic. Add a test for tail calls with a void return. llvm-svn: 67943	2009-03-28 12:36:29 +00:00
Arnold Schwaighofer	636127325b	Enable tail call optimization for functions that return a struct (bug 3664) and for functions that return types that need extending (e.g i1). llvm-svn: 67934	2009-03-28 08:33:27 +00:00
Evan Cheng	a15fdaa292	Optimize some 64-bit multiplication by constants into two lea's or one lea + shl since imulq is slow (latency 5). e.g. x * 40 => shlq $3, %rdi leaq (%rdi,%rdi,4), %rax This has the added benefit of allowing more multiply to be folded into addressing mode. e.g. a * 24 + b => leaq (%rdi,%rdi,2), %rax leaq (%rsi,%rax,8), %rax llvm-svn: 67917	2009-03-28 05:57:29 +00:00
Dan Gohman	b360b0390a	Fix what surely must be a copy+pasto. llvm-svn: 67881	2009-03-27 23:55:04 +00:00
Dan Gohman	401df03e2d	Initialize LiveOutInfo's APInt members to zero, as APInt's default constructor produces an uninitialized APInt. This fixes PR3896. llvm-svn: 67879	2009-03-27 23:51:02 +00:00
Bill Wendling	f4247ff478	Pull transform from target-dependent code into target-independent code. llvm-svn: 67742	2009-03-26 06:14:09 +00:00
Evan Cheng	7e4217176a	Revert 67132. This is breaking some objective-c apps. Also fixes SDISel so it does not force promote return value if the function is not marked signext / zeroext. llvm-svn: 67701	2009-03-25 20:20:11 +00:00
Dale Johannesen	9fb1dcd077	When optimizing with debug info, don't keep the stoppoint nodes around until Legalize; doing this imposed an ordering on a sequence of loads that came from different lines, interfering with scheduling. llvm-svn: 67692	2009-03-25 17:36:08 +00:00
Chris Lattner	135eeefe66	more tidying: name the components of PhysReg in the case when the target constraint specifies a specific physreg. llvm-svn: 67618	2009-03-24 15:27:37 +00:00
Chris Lattner	8793e812ef	Tidy a bit more. llvm-svn: 67617	2009-03-24 15:25:07 +00:00
Chris Lattner	a60dd19c3e	simplify this code a bit now that "allocation to a vreg class" can never fail. llvm-svn: 67616	2009-03-24 15:22:11 +00:00
Dan Gohman	547cfc882e	Minor compile-time optimization; don't bother checking canClobberPhysRegDefs if the successor node doesn't clobber any physical registers. llvm-svn: 67587	2009-03-24 00:50:07 +00:00
Dan Gohman	e6d7478bc1	Add a pre-pass to the burr-list scheduler which makes adjustments to help out the register pressure reduction heuristics in the case of nodes with multiple uses. Currently this uses very conservative heuristics, so it doesn't have a broad impact, but in cases where it does help it can make a big difference. llvm-svn: 67586	2009-03-24 00:49:12 +00:00
Dan Gohman	60c652de57	When unfolding a load during scheduling, the new operator node has a data dependency on the load node, so it really needs a data-dependence edge to the load node, even if the load previously existed. And add a few comments. llvm-svn: 67554	2009-03-23 20:20:43 +00:00
Dan Gohman	a6842708e6	Don't set SUnit::hasPhysRegDefs to true unless the defs are actually have uses, which reflects the way it's used. llvm-svn: 67540	2009-03-23 17:39:36 +00:00
Dan Gohman	b6b24c5fc1	Fix canClobberPhysRegDefs to check all SDNodes grouped together in an SUnit, instead of just the first one. This fix is needed by some upcoming scheduler changes. llvm-svn: 67531	2009-03-23 16:23:01 +00:00
Dan Gohman	78a1698ac0	Add a new bit to SUnit to record whether a node has implicit physreg defs, regardless of whether they are actually used. llvm-svn: 67528	2009-03-23 16:10:52 +00:00
Dan Gohman	18daca0895	Now that errs() is properly non-buffered, there's no need to explicitly flush it. llvm-svn: 67526	2009-03-23 15:57:19 +00:00
Evan Cheng	2ec94dd447	Model inline asm constraint which ties an input to an output register as machine operand TIED_TO constraint. This eliminated the need to pre-allocate registers for these. This also allows register allocator can eliminate the unneeded copies. llvm-svn: 67512	2009-03-23 08:01:15 +00:00
Dan Gohman	60cfa194d4	Simplify this code; use a while instead of an if and a do-while. llvm-svn: 67400	2009-03-20 20:42:23 +00:00
Evan Cheng	0c629db2aa	For inline asm output operand that matches an input. Encode the input operand index in the high bits. llvm-svn: 67387	2009-03-20 18:03:34 +00:00
Sanjiv Gupta	5957bb99ba	Fixed the comment. No functionality change. llvm-svn: 67370	2009-03-20 09:38:50 +00:00
Mon P Wang	3d7fb6738a	Added missing support for widening when splitting an unary op (PR3683) and expanding a bit convert (PR3711). In both cases, we extract the valid part of the widen vector and then do the conversion. llvm-svn: 67175	2009-03-18 06:24:04 +00:00
Rafael Espindola	6a6d9e48dd	Don't force promotion of return arguments on the callee. Some architectures (like x86) don't require it. This fixes bug 3779. llvm-svn: 67132	2009-03-17 23:43:59 +00:00
Chris Lattner	e3c442050d	Fix codegen to compute the size of an allocation by multiplying the size by the array amount as an i32 value instead of promoting from i32 to i64 then doing the multiply. Not doing this broke wrap-around assumptions that the optimizers (validly) made. The ultimate real fix for this is to introduce i64 version of alloca and remove mallocinst. This fixes PR3829 llvm-svn: 67093	2009-03-17 19:36:00 +00:00
Mon P Wang	7184e30bdb	Fix a problem with DAGCombine where we were building an illegal build vector shuffle mask. Forced the mask to be built using i32. Note: this will be irrelevant once vector_shuffle no longer takes a build vector for the shuffle mask. llvm-svn: 67076	2009-03-17 06:33:10 +00:00
Mon P Wang	1cd6172342	Avoid doing the transformation c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4 if FPConstant is legal because if the FPConstant doesn't need to be stored in a constant pool, the transformation is unlikely to be profitable. llvm-svn: 66994	2009-03-14 00:25:19 +00:00
Dan Gohman	fa0a3504ba	Improve FastISel's handling of truncates to i1, and implement ptrtoint and inttoptr in X86FastISel. These casts aren't always handled in the generic FastISel code because X86 sometimes needs custom code to do truncation and zero-extension. llvm-svn: 66988	2009-03-13 23:53:06 +00:00
Dan Gohman	790659c0d6	Fix FastISel's assumption that i1 values are always zero-extended by inserting explicit zero extensions where necessary. Included is a testcase where SelectionDAG produces a virtual register holding an i1 value which FastISel previously mistakenly assumed to be zero-extended. llvm-svn: 66941	2009-03-13 20:42:20 +00:00
Evan Cheng	f9951d1557	Fix some significant problems with constant pools that resulted in unnecessary paddings between constant pool entries, larger than necessary alignments (e.g. 8 byte alignment for .literal4 sections), and potentially other issues. 1. ConstantPoolSDNode alignment field is log2 value of the alignment requirement. This is not consistent with other SDNode variants. 2. MachineConstantPool alignment field is also a log2 value. 3. However, some places are creating ConstantPoolSDNode with alignment value rather than log2 values. This creates entries with artificially large alignments, e.g. 256 for SSE vector values. 4. Constant pool entry offsets are computed when they are created. However, asm printer group them by sections. That means the offsets are no longer valid. However, asm printer uses them to determine size of padding between entries. 5. Asm printer uses expensive data structure multimap to track constant pool entries by sections. 6. Asm printer iterate over SmallPtrSet when it's emitting constant pool entries. This is non-deterministic. Solutions: 1. ConstantPoolSDNode alignment field is changed to keep non-log2 value. 2. MachineConstantPool alignment field is also changed to keep non-log2 value. 3. Functions that create ConstantPool nodes are passing in non-log2 alignments. 4. MachineConstantPoolEntry no longer keeps an offset field. It's replaced with an alignment field. Offsets are not computed when constant pool entries are created. They are computed on the fly in asm printer and JIT. 5. Asm printer uses cheaper data structure to group constant pool entries. 6. Asm printer compute entry offsets after grouping is done. 7. Change JIT code to compute entry offsets on the fly. llvm-svn: 66875	2009-03-13 07:51:59 +00:00
Bill Wendling	5499163a0a	Oops...I committed too much. llvm-svn: 66867	2009-03-13 04:39:26 +00:00
Bill Wendling	02a239b837	Temporarily XFAIL this test. llvm-svn: 66866	2009-03-13 04:37:11 +00:00
Dan Gohman	71245ea439	Fix a typo in a comment. llvm-svn: 66843	2009-03-12 23:55:10 +00:00
Chris Lattner	26a971c4ec	Move 3 "(add (select cc, 0, c), x) -> (select cc, x, (add, x, c))" related transformations out of target-specific dag combine into the ARM backend. These were added by Evan in r37685 with no testcases and only seems to help ARM (e.g. test/CodeGen/ARM/select_xform.ll). Add some simple X86-specific (for now) DAG combines that turn things like cond ? 8 : 0 -> (zext(cond) << 3). This happens frequently with the recently added cp constant select optimization, but is a very general xform. For example, we now compile the second example in const-select.ll to: _test: movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 seta %al movzbl %al, %eax movl 4(%esp), %ecx movsbl (%ecx,%eax,4), %eax ret instead of: _test: movl 4(%esp), %eax leal 4(%eax), %ecx movsd LCPI2_0, %xmm0 ucomisd 8(%esp), %xmm0 cmovbe %eax, %ecx movsbl (%ecx), %eax ret This passes multisource and dejagnu. llvm-svn: 66779	2009-03-12 06:52:53 +00:00
Evan Cheng	47778669f5	Enable Chris' value propagation change. It make available known sign, zero, one bits information for values that are live out of basic blocks. The goal is to eliminate unnecessary sext, zext, truncate of values that are live-in to blocks. This does not handle PHI nodes yet. llvm-svn: 66777	2009-03-12 06:29:49 +00:00
Chris Lattner	a0c4a99ec2	reapply my previous patch (r66358) with a tweak to set the alignment of the generated constant pool entry to the desired alignment of a type. If we don't do this, we end up trying to do movsd from 4-byte alignment memory. This fixes 450.soplex and 456.hmmer. llvm-svn: 66641	2009-03-11 05:08:08 +00:00
Evan Cheng	535dbf4ffd	Revert 66358 for now. It's breaking povray, 450.soplex, and 456.hmmer on x86 / Darwin. llvm-svn: 66574	2009-03-10 20:47:18 +00:00
Chris Lattner	93662edfb8	Fix PR3763 by using proper APInt methods instead of uint64_t's. llvm-svn: 66434	2009-03-09 20:22:18 +00:00
Bill Wendling	13fcab1ef3	Pass in a std::string when getting the names of debugging things. This cuts down on the number of times a std::string is created and copied. llvm-svn: 66396	2009-03-09 05:04:40 +00:00
Chris Lattner	578b634f56	implement an optimization to codegen c ? 1.0 : 2.0 as load { 2.0, 1.0 } + c*4. For 2009-03-07-FPConstSelect.ll we now produce: _f: xorl %eax, %eax testl %edi, %edi movl $4, %ecx cmovne %rax, %rcx leaq LCPI1_0(%rip), %rax movss (%rcx,%rax), %xmm0 ret previously we produced: _f: subl $4, %esp cmpl $0, 8(%esp) movss LCPI1_0, %xmm0 je LBB1_2 ## entry LBB1_1: ## entry movss LCPI1_1, %xmm0 LBB1_2: ## entry movss %xmm0, (%esp) flds (%esp) addl $4, %esp ret on PPC the code also improves to: _f: cntlzw r2, r3 srwi r2, r2, 5 li r3, lo16(LCPI1_0) slwi r2, r2, 2 addis r3, r3, ha16(LCPI1_0) lfsx f1, r3, r2 blr from: _f: li r2, lo16(LCPI1_1) cmplwi cr0, r3, 0 addis r2, r2, ha16(LCPI1_1) beq cr0, LBB1_2 ; entry LBB1_1: ; entry li r2, lo16(LCPI1_0) addis r2, r2, ha16(LCPI1_0) LBB1_2: ; entry lfs f1, 0(r2) blr This also improves the existing pic-cpool case from: foo: subl $12, %esp call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax cmpl $0, 16(%esp) movsd .LCPI1_0@GOTOFF(%eax), %xmm0 je .LBB1_2 # entry .LBB1_1: # entry movsd .LCPI1_1@GOTOFF(%eax), %xmm0 .LBB1_2: # entry movsd %xmm0, (%esp) fldl (%esp) addl $12, %esp ret to: foo: call .Lllvm$1.$piclabel .Lllvm$1.$piclabel: popl %eax addl $_GLOBAL_OFFSET_TABLE_ + [.-.Lllvm$1.$piclabel], %eax xorl %ecx, %ecx cmpl $0, 4(%esp) movl $8, %edx cmovne %ecx, %edx fldl .LCPI1_0@GOTOFF(%eax,%edx) ret This triggers a few dozen times in spec FP 2000. llvm-svn: 66358	2009-03-08 01:51:30 +00:00
Chris Lattner	f89a9c9d74	random cleanups. llvm-svn: 66357	2009-03-08 01:47:41 +00:00
Duncan Sands	5ab54d488f	Introduce new linkage types linkonce_odr, weak_odr, common_odr and extern_weak_odr. These are the same as the non-odr versions, except that they indicate that the global will only be overridden by an equivalent global. In C, a function with weak linkage can be overridden by a function which behaves completely differently. This means that IP passes have to skip weak functions, since any deductions made from the function definition might be wrong, since the definition could be replaced by something completely different at link time. This is not allowed in C++, thanks to the ODR (One-Definition-Rule): if a function is replaced by another at link-time, then the new function must be the same as the original function. If a language knows that a function or other global can only be overridden by an equivalent global, it can give it the weak_odr linkage type, and the optimizers will understand that it is alright to make deductions based on the function body. The code generators on the other hand map weak and weak_odr linkage to the same thing. llvm-svn: 66339	2009-03-07 15:45:40 +00:00
Dan Gohman	42049dafd7	Fix ScheduleDAGRRList::CopyAndMoveSuccessors' handling of nodes with multiple chain operands. This can occur when the scheduler has added chain operands to a node that already has a chain operand, in order to handle physical register dependencies. This fixes an llvm-gcc bootstrap failure on x86-64 introduced in r66058. llvm-svn: 66240	2009-03-06 02:23:01 +00:00
Bob Wilson	b75225556d	Fix BuildVectorSDNode::isConstantSplat to handle one-element vectors. It is an error to call APInt::zext with a size that is equal to the value's current size, so use zextOrTrunc instead. llvm-svn: 66039	2009-03-04 17:47:01 +00:00
Eli Friedman	1bed40b86a	PR3686: make the legalizer handle bitcast from i80 to x86 long double. llvm-svn: 66021	2009-03-04 06:23:34 +00:00
Evan Cheng	db402a7a49	Fix PR3701. 1. X86 target renamed eflags register to flags. This matches what llvm-gcc generates so codegen knows flags register is being clobbered by inline asm. 2. BURR scheduler should also check if inline asm nodes can clobber "live" physical registers. Previously it was only checking target nodes with implicit defs. llvm-svn: 65996	2009-03-04 01:41:49 +00:00
Bill Wendling	93eeea0493	The DAG combiner was performing a BT combine. The BT combine had a value of -1, so it changed it into a 31 via the TLO.ShrinkDemandedConstant() call. Then it would go through the DAG combiner again. This time it had a value of 31, which was turned into a -1 by TLI.SimplifyDemandedBits(). This would ping pong forever. Teach the TLO.ShrinkDemandedConstant() call not to lower a value if the demanded value is an XOR of all ones. llvm-svn: 65985	2009-03-04 00:18:06 +00:00
Bob Wilson	5b276b6bc9	Generalize BuildVectorSDNode::isConstantSplat to use APInts and handle arbitrary vector sizes. Add an optional MinSplatBits parameter to specify a minimum for the splat element size. Update the PPC target to use the revised interface. llvm-svn: 65899	2009-03-02 23:24:16 +00:00
Nate Begeman	6b41d33726	Fix a problem with DAGCombine on 64b targets where folding extracts + build_vector into a shuffle would fail, because the type of the new build_vector would not be legal. Try harder to create a legal build_vector type. Note: this will be totally irrelevant once vector_shuffle no longer takes a build_vector for shuffle mask. New: _foo: xorps %xmm0, %xmm0 xorps %xmm1, %xmm1 subps %xmm1, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 Old: _foo: xorps %xmm0, %xmm0 movss %xmm0, %xmm1 xorps %xmm2, %xmm2 unpcklps %xmm1, %xmm2 pshufd $80, %xmm1, %xmm1 unpcklps %xmm1, %xmm2 pslldq $16, %xmm2 pshufd $57, %xmm2, %xmm1 subps %xmm0, %xmm1 mulps %xmm0, %xmm1 addps %xmm0, %xmm1 movaps %xmm1, 0 llvm-svn: 65791	2009-03-01 23:44:07 +00:00
Bob Wilson	7482f84ae6	Combine PPC's GetConstantBuildVectorBits and isConstantSplat functions to a new method in a BuildVectorSDNode "pseudo-class". llvm-svn: 65747	2009-03-01 01:13:55 +00:00
Rafael Espindola	880e63bf01	Refactor TLS code and add some tests. The tests and expected results are: pic \| declaration \| linkage \| visibility \| !pic \| declaration \| external \| default \| tls1.ll tls2.ll \| local exec pic \| declaration \| external \| default \| tls1-pic.ll tls2-pic.ll \| general dynamic !pic \| !declaration \| external \| default \| tls3.ll tls4.ll \| initial exec pic \| !declaration \| external \| default \| tls3-pic.ll tls4-pic.ll \| general dynamic !pic \| declaration \| external \| hidden \| tls7.ll tls8.ll \| local exec pic \| declaration \| external \| hidden \| X \| local dynamic !pic \| !declaration \| external \| hidden \| tls9.ll tls10.ll \| local exec pic \| !declaration \| external \| hidden \| X \| local dynamic !pic \| declaration \| internal \| default \| tls5.ll tls6.ll \| local exec pic \| declaration \| internal \| default \| X \| local dynamic The ones marked with an X have not been implemented since local dynamic is not implemented. llvm-svn: 65632	2009-02-27 13:37:18 +00:00
Evan Cheng	ec34226c2b	Revert BuildVectorSDNode related patches: 65426, 65427, and 65296. llvm-svn: 65482	2009-02-25 22:49:59 +00:00
Dale Johannesen	9a6ca304f1	Fix big-endian codegen bug. We're splitting up overly long ints, e.g. i96, into pieces at PHIs and the nodes that feed into them; however big-endian reverses the order of the pieces (for some reason), and wasn't doing it the same way on both sides, so the pieces didn't match and runtime failures ensued. Fixes 188.ammp and sqlite3 on ppc32. llvm-svn: 65481	2009-02-25 22:39:13 +00:00
Evan Cheng	befebff8fc	Clean up dwarf writer, part 1. This eliminated the horrible recursive getGlobalVariablesUsing and replaced it something readable. It eliminated use of slow UniqueVector and replaced it with StringMap, SmallVector, and DenseMap, etc. It also fixed some non-deterministic behavior. This is a very minor compile time win. llvm-svn: 65438	2009-02-25 07:04:34 +00:00
Scott Michel	3f273a8412	Expand tabs to spaces (overlooked in previous commit) llvm-svn: 65427	2009-02-25 03:57:49 +00:00
Scott Michel	29d7559c09	Remove all "cached" data from BuildVectorSDNode, preferring to retrieve results via reference parameters. This patch also appears to fix Evan's reported problem supplied as a reduced bugpoint test case. llvm-svn: 65426	2009-02-25 03:12:50 +00:00
Bill Wendling	9d4eb136da	Overhaul my earlier submission due to feedback. It's a large patch, but most of them are generic changes. - Use the "fast" flag that's already being passed into the asm printers instead of shoving it into the DwarfWriter. - Instead of calling "MI->getParent()->getParent()" for every MI, set the machine function when calling "runOnMachineFunction" in the asm printers. llvm-svn: 65379	2009-02-24 08:30:20 +00:00
Bill Wendling	0d8316c5e0	- Use the "Fast" flag instead of "OptimizeForSize" to determine whether to emit a DBG_LABEL or not. We want to fall back to the original way of emitting debug info when we're in -O0/-fast mode. - Add plumbing in to pass the "Fast" flag to places that need it. - XFAIL DebugInfo/deaddebuglabel.ll. This is finding 11 labels instead of 8. I need to investigate still. llvm-svn: 65367	2009-02-24 02:35:30 +00:00
Dan Gohman	1197d46ccf	Fix a ValueTracking rule: RHS means operand 1, not 0. Add a simple ashr instcombine to help expose this code. And apply the fix to SelectionDAG's copy of this code too. llvm-svn: 65364	2009-02-24 02:00:40 +00:00
Scott Michel	3f8637305f	Introduce the BuildVectorSDNode class that encapsulates the ISD::BUILD_VECTOR instruction. The class also consolidates the code for detecting constant splats that's shared across PowerPC and the CellSPU backends (and might be useful for other backends.) Also introduces SelectionDAG::getBUID_VECTOR() for generating new BUILD_VECTOR nodes. llvm-svn: 65296	2009-02-22 23:36:09 +00:00
Richard Pennington	7a2ebd7d09	bug 3610: Floating point vaarg not softened. llvm-svn: 65239	2009-02-21 19:11:18 +00:00
Dan Gohman	c9cfc596a3	Fix a bug that David Greene found in the DAGCombiner's logic that checks whether it's safe to transform a store of a bitcast value into a store of the original value. llvm-svn: 65201	2009-02-20 23:29:13 +00:00
Bill Wendling	983f839148	Temporarily revert r65065. It was causing test failures. llvm-svn: 65068	2009-02-19 21:57:07 +00:00
Bill Wendling	9af8f830c0	Check for -fast here too. llvm-svn: 65065	2009-02-19 21:23:54 +00:00
Bill Wendling	4a86875529	Generate these labels when we're in "fast" mode, not simply when we're no in "optimize-for-size" mode. llvm-svn: 65064	2009-02-19 21:12:54 +00:00
Scott Michel	4c5fa6c982	Remove trailing whitespace to reduce later commit patch noise. (Note: Eventually, commits like this will be handled via a pre-commit hook that does this automagically, as well as expand tabs to spaces and look for 80-col violations.) llvm-svn: 64827	2009-02-17 22:15:04 +00:00
Bill Wendling	266c8bc98f	--- Merging (from foreign repository) r64714 into '.': U include/llvm/CodeGen/DebugLoc.h U lib/CodeGen/SelectionDAG/LegalizeDAG.cpp U lib/CodeGen/SelectionDAG/SelectionDAGBuild.cpp U lib/Target/X86/AsmPrinter/X86ATTAsmPrinter.cpp Enable debug location generation at -Os. This goes with the reapplication of the r63639 patch. llvm-svn: 64715	2009-02-17 01:04:54 +00:00
Dan Gohman	c47d143107	Don't assume that a left-shift of a value with one bit set will have one bit set, because the bit may be shifted off the end. Instead, just check for a constant 1 being shifted. This is still sufficient to handle all the cases in test/CodeGen/X86/bt.ll. This fixes PR3583. llvm-svn: 64622	2009-02-15 23:59:32 +00:00
Cedric Venet	48be6b5e15	Unbreak the build on win32. Cleanup some warning. Remark: when struct/class are declared differently than they are defined, this make problem for VC++ since it seems to mangle class differently that struct. These error are very hard to understand and find. So please, try to keep your definition/declaration in sync. Only tested with VS2008. hope it does not break anything. feel free to revert. llvm-svn: 64554	2009-02-14 16:06:42 +00:00
Bill Wendling	40e4b271af	Revert this. It was breaking stuff. llvm-svn: 64428	2009-02-13 02:16:35 +00:00
Bill Wendling	83b6edd760	Turn off the old way of handling debug information in the code generator. Use the new way, where all of the information is passed on SDNodes and machine instructions. llvm-svn: 64427	2009-02-13 02:01:04 +00:00
Dale Johannesen	47321cf01f	Arrange to print constants that match "n" and "i" constraints in inline asm as signed (what gcc does). Add partial support for x86-specific "e" and "Z" constraints, with appropriate signedness for printing. llvm-svn: 64400	2009-02-12 20:58:09 +00:00
Chris Lattner	4e2ed3e5e7	make fast isel fall back to selectiondags for VLA llvm.declare intrinsics. llvm-svn: 64379	2009-02-12 17:23:20 +00:00
Evan Cheng	521b8b5ef3	Oops. Last second clean up messed things up. llvm-svn: 64373	2009-02-12 09:52:13 +00:00
Evan Cheng	19a1ba3680	Replace one of burr scheduling heuristic with something more sensible. Now calcMaxScratches simply compute the number of true data dependencies. This actually improve a couple of tests in dejagnu suite as many tests in llvm nightly test suite. llvm-svn: 64369	2009-02-12 08:59:45 +00:00
Dan Gohman	c4f5d4d35d	Fix a comment. llvm-svn: 64328	2009-02-11 21:32:08 +00:00
Dan Gohman	d252329703	Don't use special heuristics for nodes with no data predecessors unless they actually have data successors, and likewise for nodes with no data successors unless they actually have data precessors. llvm-svn: 64327	2009-02-11 21:29:39 +00:00
Dan Gohman	35db81715f	Delete the heuristic for non-livein CopyFromReg nodes. Non-liveinness is determined by whether the node has a Flag operand. However, if the node does have a Flag operand, it will be glued to its register's def, so the heuristic would end up spuriously applying to whatever node is the def. llvm-svn: 64319	2009-02-11 20:25:59 +00:00
Dale Johannesen	f367ef04af	Make a transformation added in 63266 a bit less aggressive. It was transforming (x&y)==y to (x&y)!=0 in the case where y is variable and known to have at most one bit set (e.g. z&1). This is not correct; the expressions are not equivalent when y==0. I believe this patch salvages what can be salvaged, including all the cases in bt.ll. Dan, please review. Fixes gcc.c-torture/execute/20040709-[12].c llvm-svn: 64314	2009-02-11 19:19:41 +00:00
Dan Gohman	7e48e82adb	When scheduling a block in parts, keep track of the overall instruction index across each part. Instruction indices are used to make live range queries, and live ranges can extend beyond scheduling region boundaries. Refactor the ScheduleDAGSDNodes class some more so that it doesn't have to worry about this additional information. llvm-svn: 64288	2009-02-11 04:27:20 +00:00
Dan Gohman	f2b9543ee5	Factor out more code for computing register live-range informationfor scheduling, and generalize is so that preserves state across scheduling regions. This fixes incorrect live-range information around terminators and labels, which are effective region boundaries. In place of looking for terminators to anchor inter-block dependencies, introduce special entry and exit scheduling units for this purpose. llvm-svn: 64254	2009-02-10 23:27:53 +00:00
Evan Cheng	5be1afd928	Fix PR3457: Ignore control successors when looking for closest scheduled successor. A control successor doesn't read result(s) produced by the scheduling unit being evaluated. llvm-svn: 64210	2009-02-10 08:30:11 +00:00
Evan Cheng	acd0fd545d	If the target cannot issue a copy for the given source and dest registers, abort instead of silently continue. llvm-svn: 64184	2009-02-09 22:47:36 +00:00
Evan Cheng	773303a763	Simplify code. llvm-svn: 64164	2009-02-09 21:01:06 +00:00
Evan Cheng	44e58653af	Make sure constant subscript is truncated to ptr size if it may not fit. llvm-svn: 64163	2009-02-09 20:54:38 +00:00
Dale Johannesen	b22cb23f6f	Use getDebugLoc forwarder instead of getNode()->getDebugLoc. No functional change. llvm-svn: 64026	2009-02-07 19:59:05 +00:00
Dan Gohman	4105a38248	Constify TargetInstrInfo::EmitInstrWithCustomInserter, allowing ScheduleDAG's TLI member to use const. llvm-svn: 64018	2009-02-07 16:15:20 +00:00
Dale Johannesen	4fc8a5c0b5	Make SDNode constructors take a DebugLoc always. Adjust derived classes to pass UnknownLoc where a DebugLoc does not make sense. Pick one of DebugLoc and non-DebugLoc variants to survive for all such classes. llvm-svn: 64000	2009-02-07 02:15:05 +00:00
Dale Johannesen	71d758eead	Remove now-unused constructors. llvm-svn: 63995	2009-02-07 01:27:09 +00:00
Dale Johannesen	a259483aae	Get rid of the last non-DebugLoc versions of getNode! Many targets build placeholder nodes for special operands, e.g. GlobalBaseReg on X86 and PPC for the PIC base. There's no sensible way to associate debug info with these. I've left them built with getNode calls with explicit DebugLoc::getUnknownLoc operands. I'm not too happy about this but don't see a good improvement; I considered adding a getPseudoOperand or something, but it seems to me that'll just make it harder to read. llvm-svn: 63992	2009-02-07 00:55:49 +00:00
Dale Johannesen	1580ab6b7f	Remove more non-DebugLoc getNode variants. Use getCALLSEQ_{END,START} to permit passing no DebugLoc there. UNDEF doesn't logically have DebugLoc; add getUNDEF to encapsulate this. llvm-svn: 63978	2009-02-06 23:05:02 +00:00
Dale Johannesen	f9268f8523	And one more file. llvm-svn: 63971	2009-02-06 21:55:48 +00:00
Dale Johannesen	c405486235	Remove more non-DebugLoc versions of getNode. llvm-svn: 63969	2009-02-06 21:50:26 +00:00
Bill Wendling	cd6bb3e248	Clear out the CurDebugLoc info when doing a 'clear' on the SDL object. llvm-svn: 63967	2009-02-06 21:36:23 +00:00
Dale Johannesen	7099fa18a4	Eliminate remaining non-DebugLoc version of getTargetNode. llvm-svn: 63951	2009-02-06 19:16:40 +00:00
Dan Gohman	c097f9628e	Rename SelectionDAGISel::Schedule to SelectionDAGISel::CreateScheduler, and make it just create the scheduler. Leave running the scheduler to the higher-level code. This makes the higher-level code a little more explicit and easier to follow, and will help enable some future refactoring. llvm-svn: 63944	2009-02-06 18:26:51 +00:00
Dan Gohman	2ae1006f93	Delete an unused member function. llvm-svn: 63941	2009-02-06 18:19:52 +00:00
Evan Cheng	e00df1d39c	Move getPointerRegClass from TargetInstrInfo to TargetRegisterInfo. llvm-svn: 63938	2009-02-06 17:43:24 +00:00
Dan Gohman	5784cf7f6d	Move ScheduleDAGSDNodes.h to be a private header. Front-ends that previously included this header should include SchedulerRegistry.h instead. llvm-svn: 63937	2009-02-06 17:22:58 +00:00
Dale Johannesen	cda84103ff	get rid of some non-DebugLoc getTargetNode variants. llvm-svn: 63909	2009-02-06 02:08:06 +00:00
Dale Johannesen	e95c76b65e	Get rid of one more non-DebugLoc getNode and its corresponding getTargetNode. Lots of caller changes. llvm-svn: 63904	2009-02-06 01:31:28 +00:00
Dale Johannesen	c96091a672	Remove a non-DebugLoc version of getNode. llvm-svn: 63889	2009-02-05 22:07:54 +00:00
Dale Johannesen	b218273a7a	Remove 3 non-DebugLoc variants of getNode. llvm-svn: 63886	2009-02-05 21:20:44 +00:00
Mon P Wang	50369c6847	Fix a bug where we were not emitting a cvt rnd sat node for converting between a unsigned integer and signed integer. llvm-svn: 63831	2009-02-05 04:47:42 +00:00
Dale Johannesen	fa28929927	Reapply 63765. Patches for clang and llvm-gcc to follow. llvm-svn: 63812	2009-02-05 01:49:45 +00:00
Dale Johannesen	c901d013fb	Get rid of 3 non-DebugLoc getNode variants. llvm-svn: 63808	2009-02-05 01:01:16 +00:00
Dale Johannesen	c2896f9310	Remove non-DebugLoc versions of getMergeValues, ZeroExtendInReg. llvm-svn: 63800	2009-02-05 00:20:09 +00:00
Dale Johannesen	d27bc65e74	Remove non-DebugLoc forms of CopyToReg and CopyFromReg. Adjust callers. llvm-svn: 63789	2009-02-04 23:02:30 +00:00
Dale Johannesen	f6e1822ccd	Reverting 63765. This broke the build of both clang and llvm-gcc. llvm-svn: 63786	2009-02-04 22:47:25 +00:00
Stuart Hastings	010b5858c1	80 column rule. llvm-svn: 63768	2009-02-04 20:30:10 +00:00
Dale Johannesen	15a801f11d	Remove non-DebugLoc versions of getLoad and getStore. Adjust the many callers of those versions. llvm-svn: 63767	2009-02-04 20:06:27 +00:00
Nate Begeman	66f10b55ed	New feature: add support for target intrinsics being defined in the target directories themselves. This also means that VMCore no longer needs to know about every target's list of intrinsics. Future work will include converting the PowerPC target to this interface as an example implementation. llvm-svn: 63765	2009-02-04 19:47:21 +00:00
Mon P Wang	83f8f82e8a	Avoids generating a legalization assert for the case where a vector type is legal but when legalizing the operation, we split the vector type and generate a library call whose type needs to be promoted. For example, X86 with SSE on but MMX off, a divide v2i64 will be scalarized to 2 calls to a library using i64. llvm-svn: 63760	2009-02-04 19:38:14 +00:00
Stuart Hastings	9d89863489	Since I'm obliged to work with a development OS that currently doesn't support GraphViz, I've been using the foo->dump() facility. This patch is a minor rewrite to the SelectionDAG dump() stuff to make it a little more helpful. The existing foo->dump() functionality does not change; this patch adds foo->dumpr(). All of this is only useful when running LLVM under a debugger. llvm-svn: 63736	2009-02-04 16:46:19 +00:00
Dale Johannesen	19fc835580	Remove non-DebugLoc forms of the exotic forms of Lod and Sto; patch uses. llvm-svn: 63716	2009-02-04 02:34:38 +00:00
Dale Johannesen	52d1d8d5a0	Remove some more non-DebugLoc versions of construction functions, with callers adjusted to fit. llvm-svn: 63705	2009-02-04 01:48:28 +00:00
Dale Johannesen	94df14c046	Check in file I forgot. llvm-svn: 63704	2009-02-04 01:33:20 +00:00
Dale Johannesen	2c4af04551	Remove a few non-DebugLoc versions of node creation functions. llvm-svn: 63703	2009-02-04 01:17:06 +00:00

1 2 3 4 5 ...

3494 Commits