mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00
Commit Graph

317 Commits

Author SHA1 Message Date
Dan Gohman
5fe8266d04 Teach GetNegatedExpression to negate 0-B to B in UnsafeFPMath mode, and
visitFSUB to fold 0-B to -B in UnsafeFPMath mode. Also change visitFNEG
to use isNegatibleForFree/GetNegatedExpression instead of doing a subset
of the same thing manually.

This fixes test/CodeGen/X86/negative-sin.ll.

llvm-svn: 37842
2007-07-02 15:48:56 +00:00
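
A minimal C illustration of the fold (a sketch, assuming -ffast-math-style unsafe FP; the function name is hypothetical):

#include <math.h>

/* With UnsafeFPMath, 0.0 - sin(x) may be folded to -sin(x). Strict
   IEEE arithmetic cannot do this: 0.0 - (+0.0) is +0.0, while
   -(+0.0) is -0.0, so the fold is only legal in unsafe mode. */
double neg_sin(double x) {
    return 0.0 - sin(x);
}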
Dan Gohman
354f02e03d Generalize MVT::ValueType and associated functions to be able to represent
extended vector types. Remove the special SDNode opcodes used for pre-legalize
vector operations, and the special MVT::Vector type used with them. Adjust
lowering and legalize to work with the normal SDNode kinds instead, and to
use the normal MVT functions to work with vector types instead of using the
two special operands that the pre-legalize nodes held.

This allows pre-legalize and post-legalize DAGs, and the code that operates
on them, to be more consistent. Pre-legalize vector operators can be handled
more consistently with scalar operators. And, -view-dag-combine1-dags and
-view-legalize-dags now look prettier for vector code.

llvm-svn: 37719
2007-06-25 16:23:39 +00:00
Dan Gohman
a62327ea40 Move ComputeMaskedBits, MaskedValueIsZero, and ComputeNumSignBits from
TargetLowering to SelectionDAG so that they have more convenient
access to the current DAG, in preparation for the ValueType routines
being changed from standalone functions to members of SelectionDAG for
the pre-legalize vector type changes.

llvm-svn: 37704
2007-06-22 14:59:07 +00:00
Evan Cheng
f26fc091ac Xforms:
(add (select cc, 0, c), x) -> (select cc, x, (add x, c))
(sub x, (select cc, 0, c)) -> (select cc, x, (sub x, c))

llvm-svn: 37685
2007-06-21 07:39:16 +00:00
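
In C terms, the first xform corresponds to something like this hedged sketch (the function name and constant are hypothetical):

/* (add (select cc, 0, c), x) -> (select cc, x, (add x, c)):
   the add is pushed into the non-zero select arm, so when cc is
   true the result is just x and no add executes at all. */
int select_add(int x, int cc) {
    return (cc ? 0 : 10) + x;   /* becomes: cc ? x : (x + 10) */
}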
Dan Gohman
485cb57eab Pass a SelectionDAG into SDNode::dump everywhere it's used, in preparation
for needing the DAG node to print pre-legalize extended value types, and
to get better debug messages with target-specific nodes.

llvm-svn: 37656
2007-06-19 14:13:56 +00:00
Dan Gohman
2fd7d26df8 Rename MVT::getVectorBaseType to MVT::getVectorElementType.
llvm-svn: 37579
2007-06-14 22:58:02 +00:00
Chris Lattner
5161203d52 tighten up recursion depth again
llvm-svn: 37330
2007-05-25 02:19:06 +00:00
Evan Cheng
439bf58dc2 Fix a typo that caused the combiner to create a malformed pre-indexed store where the value stored is the same as the base pointer.
llvm-svn: 37318
2007-05-24 02:35:39 +00:00
Chris Lattner
d540c6429f prevent exponential recursion in isNegatibleForFree
llvm-svn: 37310
2007-05-23 07:35:22 +00:00
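
The guard pattern behind this commit (and the depth tightening above), as a self-contained C sketch; this is not the LLVM code, and the cap value is illustrative:

/* Every recursive call passes depth + 1 and bails out past a fixed
   cap, so walking both operands of each node cannot blow up
   exponentially on deep DAGs. */
struct node { struct node *lhs, *rhs; int is_fneg; };

static int is_negatible_for_free(const struct node *n, unsigned depth) {
    if (!n || depth > 6)                 /* illustrative cap */
        return 0;
    if (n->is_fneg)                      /* fneg X is free to negate */
        return 1;
    return is_negatible_for_free(n->lhs, depth + 1) ||
           is_negatible_for_free(n->rhs, depth + 1);
}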
Dan Gohman
b593ad9fb0 Qualify calls to getTypeForValueType with MVT:: too.
llvm-svn: 37233
2007-05-18 18:41:29 +00:00
Dale Johannesen
cc99a6fc32 Don't fold bitconvert(load) for preinc/postdec loads. Likewise stores.
llvm-svn: 37130
2007-05-16 22:45:30 +00:00
Chris Lattner
a18b36cf45 Use a ptr set instead of a linear search to unique TokenFactor operands.
This fixes PR1423.

llvm-svn: 37102
2007-05-16 06:37:59 +00:00
Evan Cheng
a781c7788a Bug fix: should check ABI alignment, not preferred alignment.
llvm-svn: 37094
2007-05-16 02:04:50 +00:00
Lauro Ramos Venancio
251ea5ab65 Fix an infinite recursion in GetNegatedExpression.
llvm-svn: 37086
2007-05-15 17:05:43 +00:00
Chris Lattner
eba1b74df3 implement a simple fneg optimization/propagation thing. This compiles:
CodeGen/PowerPC/fneg.ll into:

_t4:
        fmul f0, f3, f4
        fmadd f1, f1, f2, f0
        blr

instead of:

_t4:
        fneg f0, f3
        fmul f0, f0, f4
        fmsub f1, f1, f2, f0
        blr

llvm-svn: 37054
2007-05-14 22:04:50 +00:00
Evan Cheng
649f25dad1 Can't fold the bit_convert if the store is a truncating store.
llvm-svn: 36962
2007-05-09 21:49:47 +00:00
Evan Cheng
e18b87868d Forgot a check.
llvm-svn: 36910
2007-05-07 21:36:06 +00:00
Evan Cheng
18d994d6d6 Enable a couple of xforms:
- (store (bitconvert v)) -> (store v) if the resultant store does not require
higher alignment
- (bitconvert (load v)) -> (load (bitconvert*)v) if the resultant load does not
require higher alignment

llvm-svn: 36908
2007-05-07 21:27:48 +00:00
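
A hedged C sketch of what the first xform buys; memcpy is the portable spelling of a bitconvert, and the names are hypothetical:

#include <string.h>

/* (store (bitconvert v)) -> (store v): the float's bits are stored
   through an int pointer. If the i32 store does not require higher
   alignment than the f32 store, the conversion can be dropped and
   the float stored directly. */
void store_bits(int *p, float f) {
    int bits;
    memcpy(&bits, &f, sizeof bits);  /* bitconvert f32 -> i32 */
    *p = bits;
}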
Evan Cheng
8c8b6ce116 Don't create indexed load / store with zero offset!
llvm-svn: 36716
2007-05-03 23:52:19 +00:00
Evan Cheng
6dc02c2b07 Forgot about the chain result; also, UNDEF cannot have multiple values.
llvm-svn: 36622
2007-05-01 08:53:39 +00:00
Evan Cheng
fe933cd6ca * Only turn a load into UNDEF if all of its outputs have no uses (indexed
loads produce two results).
* Do not touch volatile loads.

llvm-svn: 36604
2007-05-01 00:38:21 +00:00
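
The volatile caveat in C terms (a hedged example; the function is hypothetical):

/* The loaded value is dead, so a normal load could be replaced by
   UNDEF and deleted -- but a volatile load has a side effect (e.g.
   a device-register read) and the access itself must remain. */
int poll_once(volatile int *reg) {
    (void)*reg;   /* dead result, yet the load must not be removed */
    return 0;
}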
Christopher Lamb
a157874a8a PR400 phase 2. Propagate attributed load/store information through DAGs.
llvm-svn: 36356
2007-04-22 23:15:30 +00:00
Reid Spencer
81070d52da Revert Christopher Lamb's load/store alignment changes.
llvm-svn: 36309
2007-04-21 18:36:27 +00:00
Christopher Lamb
b56b6a7ad7 add support for alignment attributes on load/store instructions
llvm-svn: 36301
2007-04-21 08:16:25 +00:00
Chris Lattner
ea3c945817 allow SRL to simplify its operands, as it doesn't demand all bits as input.
llvm-svn: 36245
2007-04-18 03:06:49 +00:00
Chris Lattner
4ce8602d58 When replacing a node in SimplifyDemandedBits, if the old node used any
single-use nodes, they will be dead soon.  Make sure to remove them before
processing other nodes.  This implements CodeGen/X86/shl_elim.ll

llvm-svn: 36244
2007-04-18 03:05:22 +00:00
Chris Lattner
9ad682ad80 SIGN_EXTEND_INREG does not demand its top bits. Give SimplifyDemandedBits
a chance to hack on it.  This compiles:

int baz(long long a) { return (short)(((int)(a >> 24)) >> 9); }

into:
_baz:
        slwi r2, r3, 8
        srwi r2, r2, 9
        extsh r3, r2
        blr

instead of:

_baz:
        srwi r2, r4, 24
        rlwimi r2, r3, 8, 0, 23
        srwi r2, r2, 9
        extsh r3, r2
        blr

This implements CodeGen/PowerPC/sign_ext_inreg1.ll

llvm-svn: 36212
2007-04-17 19:03:21 +00:00
Chris Lattner
f29ad16397 fix an infinite loop compiling ldecod, noticed by JeffC.
llvm-svn: 35910
2007-04-11 16:51:53 +00:00
Chris Lattner
1d20292190 Fix this harder.
llvm-svn: 35888
2007-04-11 06:50:51 +00:00
Chris Lattner
01ebc25b36 don't create shifts by zero; fix some problems with my previous patch
llvm-svn: 35887
2007-04-11 06:43:25 +00:00
Chris Lattner
0289490285 Teach the codegen to turn [aez]ext (setcc) -> selectcc of 1/0, which often
allows other simplifications.  For example, this compiles:
int isnegative(unsigned int X) {
   return !(X < 2147483648U);
}

Into this code:

x86:
        movl 4(%esp), %eax
        shrl $31, %eax
        ret
arm:
        mov r0, r0, lsr #31
        bx lr
thumb:
        lsr r0, r0, #31
        bx lr

instead of:

x86:
        cmpl $0, 4(%esp)
        sets %al
        movzbl %al, %eax
        ret

arm:
        mov r3, #0
        cmp r0, #0
        movlt r3, #1
        mov r0, r3
        bx lr

thumb:
        mov r2, #1
        mov r1, #0
        cmp r0, #0
        blt LBB1_2      @entry
LBB1_1: @entry
        cpy r2, r1
LBB1_2: @entry
        cpy r0, r2
        bx lr

Testcase here: test/CodeGen/Generic/ispositive.ll

llvm-svn: 35883
2007-04-11 05:32:27 +00:00
Chris Lattner
3f0e49403c Codegen integer abs more efficiently using the trick from the PPC CWG
(Compiler Writer's Guide). This improves codegen on many architectures. Tests
committed as CodeGen/*/iabs.ll.

X86 Old:			X86 New:
_test:				_test:
   movl 4(%esp), %ecx		   movl 4(%esp), %eax
   movl %ecx, %eax		   movl %eax, %ecx
   negl %eax			   sarl $31, %ecx
   testl %ecx, %ecx		   addl %ecx, %eax
   cmovns %ecx, %eax		   xorl %ecx, %eax
   ret				   ret

PPC Old:			PPC New:
_test:				_test:
   cmpwi cr0, r3, -1		   srawi r2, r3, 31
   neg r2, r3			   add r3, r3, r2
   bgt cr0, LBB1_2 ;		   xor r3, r3, r2
LBB1_1: ;			   blr
   mr r3, r2
LBB1_2: ;
   blr

ARM Old:			ARM New:
_test:				_test:
   rsb r3, r0, #0		   add r3, r0, r0, asr #31
   cmp r0, #0			   eor r0, r3, r0, asr #31
   movge r3, r0			   bx lr
   mov r0, r3
   bx lr

Thumb Old:			Thumb New:
_test:				_test:
   neg r2, r0			   asr r2, r0, #31
   cmp r0, #0			   add r0, r0, r2
   bge LBB1_2			   eor r0, r2
LBB1_1: @			   bx lr
   cpy r0, r2
LBB1_2: @
   bx lr


Sparc Old:			Sparc New:
test:				test:
   save -96, %o6, %o6		   save -96, %o6, %o6
   sethi 0, %l0			   sra %i0, 31, %l0
   sub %l0, %i0, %l0		   add %i0, %l0, %l1
   subcc %i0, -1, %l1		   xor %l1, %l0, %i0
   bg .BB1_2			   restore %g0, %g0, %g0
   nop				   retl
.BB1_1:				   nop
   or %g0, %l0, %i0
.BB1_2:
   restore %g0, %g0, %g0
   retl
   nop

It also helps alpha/ia64 :)

llvm-svn: 35881
2007-04-11 05:11:38 +00:00
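
The branchless trick itself, as a C sketch matching the sarl/addl/xorl sequence in the X86 column (assuming 32-bit int and an arithmetic right shift on signed values):

/* mask is 0 for non-negative x and -1 (all ones) for negative x.
   (x + mask) ^ mask is x when mask == 0, and ~(x - 1) == -x when
   mask == -1: absolute value with no branch or cmov. */
int iabs(int x) {
    int mask = x >> 31;       /* arithmetic shift: 0 or -1 */
    return (x + mask) ^ mask;
}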
Scott Michel
d6b3d3d6ab 1. Insert custom lowering hooks for ISD::ROTR and ISD::ROTL.
2. Help DAGCombiner recognize zero/sign/any-extended versions of ROTR and ROTL
patterns. This was motivated by the X86/rotate.ll testcase, which should now
generate code for other platforms (and soon-to-come platforms). Rewrote the
code slightly to make it easier to read.

llvm-svn: 35605
2007-04-02 21:36:32 +00:00
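
The C idiom such rotate patterns come from, sketched with masked shift amounts so it stays well-defined at n == 0 (a hedged example, assuming a 32-bit unsigned):

/* (x << n) | (x >> (32 - n)) is the classic ROTL pattern; masking
   both shift amounts avoids an undefined shift by 32 when n == 0
   while still computing the same rotate. */
unsigned rotl32(unsigned x, unsigned n) {
    n &= 31;
    return (x << n) | (x >> ((32 - n) & 31));
}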
Dale Johannesen
d4ab7d28e9 Fix incorrect combination of different loads. Reenable zext-over-truncate
combination.

llvm-svn: 35517
2007-03-30 21:38:07 +00:00
Evan Cheng
2d09850760 Disable the load-width-reduction xform of the variant (zext (truncate load x))
for big-endian targets until the llvm-gcc build issue has been resolved.

llvm-svn: 35449
2007-03-29 07:56:46 +00:00
Evan Cheng
5b1c21d27b SIGN_EXTEND_INREG requires one extra operand, a ValueType node.
llvm-svn: 35350
2007-03-26 07:12:51 +00:00
Evan Cheng
a484f31d4b Adjust offset to compensate for big endian machines.
llvm-svn: 35293
2007-03-24 00:02:43 +00:00
Evan Cheng
84aecc56e7 Make sure SEXTLOAD of the specific type is supported on the target.
llvm-svn: 35289
2007-03-23 22:13:36 +00:00
Evan Cheng
7dd7666120 Also replace uses of SRL if that's also folded during ReduceLoadWidth().
llvm-svn: 35286
2007-03-23 20:55:21 +00:00
Evan Cheng
62ccdaea67 A couple of bug fixes for the load-width-reduction xform:
1. The address offset is in bytes.
2. Make sure the truncate node's uses are replaced with the new load.

llvm-svn: 35274
2007-03-23 02:16:52 +00:00
Evan Cheng
d7be4893f4 More opportunities to reduce load size.
llvm-svn: 35254
2007-03-22 01:54:19 +00:00
Evan Cheng
9867632e64 fold (truncate (srl (load x), c)) -> (smaller load (x+c/vt bits))
llvm-svn: 35239
2007-03-21 20:14:05 +00:00
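
In C, the pattern looks like the hedged sketch below; on a little-endian target the narrow load simply reads c/8 bytes past x, and the big-endian fixes listed above adjust that offset for the opposite byte order:

/* (truncate (srl (load x), 16)) -> a 2-byte load of the upper half:
   rather than loading all 4 bytes and shifting, load only the two
   bytes holding bits [16..31] (at p + 2 on little endian). */
short high_half(const int *p) {
    return (short)(*p >> 16);
}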
Evan Cheng
2f55532e84 Avoid combining an indexed load further.
llvm-svn: 35005
2007-03-07 08:07:03 +00:00
Chris Lattner
8c7d418eaf fold away addc nodes when we know there cannot be a carry-out.
llvm-svn: 34913
2007-03-04 20:40:38 +00:00
Chris Lattner
7021449d8b generalize
llvm-svn: 34910
2007-03-04 20:08:45 +00:00
Chris Lattner
06e4ea2b21 canonicalize constants to the RHS of addc/adde. If nothing uses the carry out of
addc, turn it into add.

This allows us to compile:

long long test(long long A, unsigned B) {
  return (A + ((long long)B << 32)) & 123;
}

into:

_test:
        movl $123, %eax
        andl 4(%esp), %eax
        xorl %edx, %edx
        ret

instead of:
_test:
        xorl %edx, %edx
        movl %edx, %eax
        addl 4(%esp), %eax   ;; add of zero
        andl $123, %eax
        ret

llvm-svn: 34909
2007-03-04 20:03:15 +00:00
Chris Lattner
65de507797 Fold (sext (truncate x)) more aggressively by avoiding creation of a
sextinreg when it isn't needed. This is useful in two cases: before legalize,
it avoids creating a sextinreg that would be trivially removed anyway; after
legalize, if the target doesn't support sextinreg, the trunc/sext pair would
otherwise never have been removed.

llvm-svn: 34621
2007-02-26 03:13:59 +00:00
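
A hedged C example of the case where no sextinreg is needed at all (assuming an arithmetic right shift on signed values):

/* After the arithmetic shift, bits [31..63] of (x >> 32) are all
   copies of x's sign bit, so (sext (trunc ...)) is a no-op and the
   int round trip can be removed entirely. */
long long high_word(long long x) {
    return (long long)(int)(x >> 32);
}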
Evan Cheng
1b155ac243 Move SimplifySetCC to TargetLowering and allow it to be shared with the legalizer.
llvm-svn: 34065
2007-02-08 22:13:59 +00:00
Evan Cheng
ae02dfb090 Fix for PR1108: type of insert_vector_elt index operand is PtrVT, not MVT::i32.
llvm-svn: 33398
2007-01-20 10:10:26 +00:00
Evan Cheng
ced4fcb608 Remove this xform:
(shl (add x, c1), c2) -> (add (shl x, c2), c1<<c2)
Replace it with:
(add (shl (add x, c1), c2), ) -> (add (add (shl x, c2), c1<<c2), )

This fixes test/CodeGen/ARM/smul.ll

llvm-svn: 33361
2007-01-19 17:51:44 +00:00
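
For reference, the identity the removed xform applied (a hedged illustration using unsigned to sidestep overflow; the constants are arbitrary):

/* The removed fold distributed the shift over the add:
   (x + c1) << c2  ==  (x << c2) + (c1 << c2),
   e.g. (x + 3) << 2  ==  (x << 2) + 12. */
unsigned distribute(unsigned x) {
    return (x + 3u) << 2;    /* equivalent to (x << 2) + 12 */
}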