llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Benjamin Kramer	4e04c2e69f	MCELF: Always overwrite FixedValue. llvm-svn: 112259	2010-08-27 10:38:39 +00:00
Daniel Dunbar	f642d43594	X86: Fix an encoding issue with LOCK_ADD64mr, which could lead to very hard to find miscompiles with the integrated assembler. llvm-svn: 112250	2010-08-27 01:30:14 +00:00
Devang Patel	24ad069401	Revert r112213. It is not needed. llvm-svn: 112242	2010-08-26 23:35:15 +00:00
Jim Grosbach	2b81a07dc7	Simplify eliminateFrameIndex() interface back down now that PEI doesn't need to try to re-use scavenged frame index reference registers. rdar://8277890 llvm-svn: 112241	2010-08-26 23:32:16 +00:00
Devang Patel	9d4933f6e0	If node is not available then use FuncInfo.ValueMap to emit debug info for byval parameter. llvm-svn: 112238	2010-08-26 22:53:27 +00:00
Jim Grosbach	352d312320	Remove the now obsolete frame index virtual re-use algorithm from PEI. Pre-RA virtual base registers handle this function, and more. A bit more cleanup to do on the interface to eliminateFrameIndex() after this. llvm-svn: 112237	2010-08-26 22:42:12 +00:00
Chris Lattner	d5d68438c1	optimize "integer extraction out of the middle of a vector" as produced by SRoA. This is part of rdar://7892780, but needs another xform to expose this. llvm-svn: 112232	2010-08-26 22:14:59 +00:00
Jim Grosbach	d21756ab1e	tidy up a bit. no functional change. llvm-svn: 112228	2010-08-26 21:56:30 +00:00
Chris Lattner	19a5dc488b	optimize bitcast(trunc(bitcast(x))) where the result is a float and 'x' is a vector to be a vector element extraction. This allows clang to compile: struct S { float A, B, C, D; }; float foo(struct S A) { return A.A + A.B+A.C+A.D; } into: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movapd %xmm1, %xmm3 addss %xmm2, %xmm3 movd %xmm1, %rax shrq $32, %rax movd %eax, %xmm0 addss %xmm3, %xmm0 ret instead of: _foo: ## @foo ## BB#0: ## %entry movd %xmm0, %rax movd %eax, %xmm0 shrq $32, %rax movd %eax, %xmm2 addss %xmm0, %xmm2 movd %xmm1, %rax movd %eax, %xmm1 addss %xmm2, %xmm1 shrq $32, %rax movd %eax, %xmm0 addss %xmm1, %xmm0 ret ... eliminating half of the horribleness. llvm-svn: 112227	2010-08-26 21:55:42 +00:00
Jim Grosbach	5b8e21eaa6	Turn off the scavenging based frame reg reuse briefly to measure whether it's still having a significant effect. It shouldn't be now that the pre-RA virtual base reg stuff is in. Assuming that's validated by the nightly testers, we can simplify a lot of the PEI frame index code. llvm-svn: 112220	2010-08-26 21:29:54 +00:00
Bruno Cardoso Lopes	6150648a64	zap the now unused MVT::getIntVectorWithNumElements llvm-svn: 112218	2010-08-26 20:53:12 +00:00
Devang Patel	81e0e2622b	Speculatively revert r112207. llvm-svn: 112216	2010-08-26 20:33:42 +00:00
Devang Patel	4dd806f321	80 col. llvm-svn: 112215	2010-08-26 20:32:32 +00:00
Devang Patel	d2b70bf32b	Update DanglingDebugInfo so that it can be used to track llvm.dbg.declare also. llvm-svn: 112213	2010-08-26 20:06:46 +00:00
Bob Wilson	efc503afd2	Use pseudo instructions for VST3. llvm-svn: 112208	2010-08-26 18:51:29 +00:00
Devang Patel	9565184a05	Do not forget to resolve dangling debug info in the case where a virtual register, used for a value, is initialized after a dbg intrinsic is seen. llvm-svn: 112207	2010-08-26 18:36:14 +00:00
Bill Wendling	c76a3e317c	Reapply r112176 without removing the other CMN patterns (that was unintentional). llvm-svn: 112206	2010-08-26 18:33:51 +00:00
Benjamin Kramer	537450a5e8	MCELF: Fix a thinko of mine. llvm-svn: 112203	2010-08-26 18:12:04 +00:00
Bob Wilson	640cc8ce83	Fix comment typos. llvm-svn: 112202	2010-08-26 18:08:11 +00:00
Owen Anderson	77fcf53657	Make JumpThreading smart enough to properly thread StrSwitch when it's compiled with clang++. llvm-svn: 112198	2010-08-26 17:40:24 +00:00
Benjamin Kramer	a1f93ec939	MCELF: Compensate for the addend on i386. Patch by Roman Divacky, with some cleanups. llvm-svn: 112197	2010-08-26 17:23:02 +00:00
Jim Grosbach	5b1ce460ec	Restrict the register to tGPR to make sure the str instruction will be encodable as a 16-bit wide instruction. llvm-svn: 112195	2010-08-26 17:02:47 +00:00
Dan Gohman	b1020bb551	Revert r112176; it broke test/CodeGen/Thumb2/thumb2-cmn.ll. llvm-svn: 112191	2010-08-26 15:50:25 +00:00
Dan Gohman	8088d5e31d	Reapply r112091 and r111922, support for metadata linking, with a fix: add a flag to MapValue and friends which indicates whether any module-level mappings are being made. In the common case of inlining, no module-level mappings are needed, so MapValue doesn't need to examine non-function-local metadata, which can be very expensive in the case of a large module with really deep metadata (e.g. a large C++ program compiled with -g). This flag is a little awkward; perhaps eventually it can be moved into the ClonedCodeInfo class. llvm-svn: 112190	2010-08-26 15:41:53 +00:00
Benjamin Kramer	7dcd303d9c	StringRef::compare_numeric also differed from StringRef::compare for characters > 127. llvm-svn: 112189	2010-08-26 15:25:35 +00:00
Benjamin Kramer	25003daf90	Do unsigned char comparisons in StringRef::compare_lower to be more consistent with compare in corner cases. llvm-svn: 112185	2010-08-26 14:21:08 +00:00
Bill Wendling	a125fb1689	There seems to be a (potential) hardware bug with the CMN instruction and comparison with 0. These two pieces of code should give identical results: rsbs r1, r1, 0 cmp r0, r1 mov r0, #0 it ls mov r0, #1 and: cmn r0, r1 mov r0, #0 it ls mov r0, #1 However, the CMN gives the opposite result when r1 is 0. This is because the carry flag is set in the CMP case but not in the CMN case. In short, the CMP instruction doesn't perform a truncate of the (logical) NOT of 0 plus the value of r0 and the carry bit (because the "carry bit" parameter to AddWithCarry is defined as 1 in this case, the carry flag will always be set when r0 >= 0). The CMN instruction doesn't perform a NOT of 0 so there is never a "carry" when this AddWithCarry is performed (because the "carry bit" parameter to AddWithCarry is defined as 0). The AddWithCarry in the CMP case seems to be relying upon the identity: ~x + 1 = -x However when x is 0 and unsigned, this doesn't hold: x = 0 ~x = 0xFFFF FFFF ~x + 1 = 0x1 0000 0000 (-x = 0) != (0x1 0000 0000 = ~x + 1) Therefore, we should disable all versions of CMN, especially when comparing against zero, until we can limit when the CMN instruction is used (when we know that the RHS is not 0) or when we have a hardware fix for this. (See the ARM docs for the "AddWithCarry" pseudo-code.) This is related to <rdar://problem/7569620>. llvm-svn: 112176	2010-08-26 09:07:33 +00:00
Chris Lattner	bc2f7bb5f3	Add a hackaround for PR7993 which is causing failures on x86 builders that lack sse2. llvm-svn: 112175	2010-08-26 06:57:07 +00:00
Chris Lattner	148485f707	implement SplitVecOp_CONCAT_VECTORS, fixing the included testcase with SSE1. llvm-svn: 112171	2010-08-26 05:51:22 +00:00
Bob Wilson	e74da18e57	Use pseudo instructions for VST1d64Q. llvm-svn: 112170	2010-08-26 05:33:30 +00:00
Chris Lattner	5256226fc8	fix sse1 only codegen in x86-64 mode, which is something we apparently try to support. llvm-svn: 112168	2010-08-26 05:24:29 +00:00
Daniel Dunbar	3839de70f5	Revert r111922, "MapValue support for MDNodes. This is similar to r109117, except ...", it is causing massive performance regressions when building Clang with itself (-O3 -g). llvm-svn: 112158	2010-08-26 03:48:11 +00:00
Daniel Dunbar	aeb8abb0e0	Revert r112091, "Remap metadata attached to instructions when remapping individual ...", which depends on r111922, which I am reverting. llvm-svn: 112157	2010-08-26 03:48:08 +00:00
Chris Lattner	2a233bf011	zap dead code. llvm-svn: 112155	2010-08-26 02:57:35 +00:00
Chris Lattner	467be04ce6	remove dead proto llvm-svn: 112131	2010-08-26 01:14:37 +00:00
Chris Lattner	164c35930a	zap dead code. llvm-svn: 112130	2010-08-26 01:13:54 +00:00
Bruno Cardoso Lopes	8bb7c79c1a	Fix PR7748 without using microsoft extensions llvm-svn: 112128	2010-08-26 01:02:53 +00:00
Jim Grosbach	6500a1a2f9	Enable pre-RA virtual frame base register allocation. rdar://8277890 llvm-svn: 112127	2010-08-26 00:58:06 +00:00
Dan Gohman	c7605a66b7	Rewrite ExtractGV, removing a bunch of stuff that didn't fully work, and was over-complicated, and replacing it with a simple implementation. llvm-svn: 112120	2010-08-26 00:22:55 +00:00
Bob Wilson	1df383d9cb	Revert svn 107892 (with changes to work with trunk). It caused a crash if a VLD result was not used (Radar 8355607). It should also fix pr7988, but I haven't verified that yet. llvm-svn: 112118	2010-08-26 00:13:36 +00:00
Chris Lattner	eb4c7e43cc	we should pattern match the SSE complex arithmetic ops. llvm-svn: 112109	2010-08-25 23:31:42 +00:00
Bob Wilson	b85b3cf91f	Start converting NEON load/stores to use pseudo instructions, beginning here with the VST4 instructions. Until after register allocation, we want to represent sets of adjacent registers by a single super-register. These VST4 pseudo instructions have a single QQ or QQQQ source register operand. They get expanded to the real VST4 instructions with 4 separate D register operands. Once this conversion is complete, we'll be able to remove the NEONPreAllocPass and avoid some fragile and hacky code elsewhere. llvm-svn: 112108	2010-08-25 23:27:42 +00:00
Chris Lattner	ef3055ca05	remove some llvmcontext arguments that are now dead post-refactoring. llvm-svn: 112104	2010-08-25 23:00:45 +00:00
Chris Lattner	fe7c4ec039	Change handling of illegal vector types to widen when possible instead of expanding: e.g. <2 x float> -> <4 x float> instead of -> 2 floats. This affects two places in the code: handling cross block values and handling function return and arguments. Since vectors are already widened by legalizetypes, this gives us much better code and unblocks x86-64 abi and SPU abi work. For example, this (which is a silly example of a cross-block value): define <4 x float> @test2(<4 x float> %A) nounwind { %B = shufflevector <4 x float> %A, <4 x float> undef, <2 x i32> <i32 0, i32 1> %C = fadd <2 x float> %B, %B br label %BB BB: %D = fadd <2 x float> %C, %C %E = shufflevector <2 x float> %D, <2 x float> undef, <4 x i32> <i32 0, i32 1, i32 undef, i32 undef> ret <4 x float> %E } Now compiles into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 addps %xmm0, %xmm0 ret previously it compiled into: _test2: ## @test2 ## BB#0: addps %xmm0, %xmm0 pshufd $1, %xmm0, %xmm1 ## kill: XMM0<def> XMM0<kill> XMM0<def> insertps $0, %xmm0, %xmm0 insertps $16, %xmm1, %xmm0 addps %xmm0, %xmm0 ret This implements rdar://8230384 llvm-svn: 112101	2010-08-25 22:49:25 +00:00
Dan Gohman	d19a0a49d1	Remap metadata attached to instructions when remapping individual instructions, not when remapping modules. llvm-svn: 112091	2010-08-25 21:36:50 +00:00
Bruno Cardoso Lopes	28f3261dbd	Revert this for now, PUNPCKLDQ doesn't operate on v4f32 llvm-svn: 112090	2010-08-25 21:26:37 +00:00
Daniel Dunbar	1a881a3eca	X86: Fix misencode of RI64mi8. This fixes OpenSSL / x86_64-apple-darwin10 / clang -O3. llvm-svn: 112089	2010-08-25 21:11:02 +00:00
Devang Patel	3abb5dbc91	Fix comment. llvm-svn: 112086	2010-08-25 20:41:24 +00:00
Devang Patel	25b17ca1c3	Remove dead argument. llvm-svn: 112085	2010-08-25 20:39:26 +00:00
Jim Grosbach	eab4de864e	Add some statistics for PEI register scavenging llvm-svn: 112084	2010-08-25 20:34:28 +00:00
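
The CMN message in the log above (llvm-svn: 112176) reasons about the carry flag produced by the ARM "AddWithCarry" pseudo-code. The sketch below is a rough illustration of that arithmetic, not code from the commit: it models 32-bit AddWithCarry in Python, and the helper names (add_with_carry, cmp_carry, cmn_carry, negate) are hypothetical. It assumes, as the message describes, that CMP passes the NOT of its operand with a carry-in of 1 and that CMN passes the operand unchanged with a carry-in of 0.

```python
MASK32 = 0xFFFFFFFF  # model 32-bit registers


def add_with_carry(x, y, carry_in):
    """Return (result, carry_out) for a 32-bit add with carry-in."""
    unsigned_sum = (x & MASK32) + (y & MASK32) + carry_in
    result = unsigned_sum & MASK32
    carry_out = 1 if unsigned_sum > MASK32 else 0
    return result, carry_out


def negate(x):
    """Model 'rsbs r1, r1, 0': two's-complement negation, truncated to 32 bits."""
    return (-x) & MASK32


def cmp_carry(rn, op2):
    """Carry flag after 'cmp rn, op2': AddWithCarry(rn, NOT(op2), 1)."""
    return add_with_carry(rn, ~op2 & MASK32, 1)[1]


def cmn_carry(rn, op2):
    """Carry flag after 'cmn rn, op2': AddWithCarry(rn, op2, 0)."""
    return add_with_carry(rn, op2, 0)[1]


if __name__ == "__main__":
    # Compare the two sequences from the message for a few register values.
    for r0 in (0, 7, 0xFFFFFFFE):
        for r1 in (0, 5):
            via_cmp = cmp_carry(r0, negate(r1))  # rsbs r1, r1, 0 ; cmp r0, r1
            via_cmn = cmn_carry(r0, r1)          # cmn r0, r1
            print(f"r0={r0:#x} r1={r1:#x} cmp carry={via_cmp} cmn carry={via_cmn}")
```

In this model the two sequences produce the same carry for every nonzero r1 and differ only when r1 is 0, which matches the case the message singles out.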


40947 Commits