llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-02 00:42:52 +01:00

Author	SHA1	Message	Date
Andrew Trick	a53688c65c	Comment correction. llvm-svn: 134958	2011-07-12 03:39:22 +00:00
Jim Grosbach	93f2ebb5e7	Simplify printing of ARM shifted immediates. Print shifted immediate values directly rather than as a payload+shifter value pair. This makes for more readable output assembly code, simplifies the instruction printer, and is consistent with how Thumb immediates are displayed. llvm-svn: 134902	2011-07-11 16:48:36 +00:00
NAKAMURA Takumi	183ec41f4a	test/CodeGen/PowerPC/vector.ll: Tweak redirection >%t >%t to >%t >>%t. See also r134814 (test/CodeGen/X86/vector.ll). llvm-svn: 134900	2011-07-11 16:21:52 +00:00
Cameron Zwarich	1efde78890	Add a missing test for r134882. llvm-svn: 134889	2011-07-11 08:35:17 +00:00
Chris Lattner	a106725fc5	Land the long talked about "type system rewrite" patch. This patch brings numerous advantages to LLVM. One way to look at it is through diffstat: 109 files changed, 3005 insertions(+), 5906 deletions(-) Removing almost 3K lines of code is a good thing. Other advantages include: 1. Value::getType() is a simple load that can be CSE'd, not a mutating union-find operation. 2. Types are uniqued and never move once created, defining away PATypeHolder. 3. Structs can be "named" now, and their name is part of the identity that uniques them. This means that the compiler doesn't merge them structurally which makes the IR much less confusing. 4. Now that there is no way to get a cycle in a type graph without a named struct type, "upreferences" go away. 5. Type refinement is completely gone, which should make LTO much MUCH faster in some common cases with C++ code. 6. Types are now generally immutable, so we can use "Type *" instead of "const Type *" everywhere. Downsides of this patch are that it removes some functions from the C API, so people using those will have to upgrade to (not yet added) new API. "LLVM 3.0" is the right time to do this. There are still some cleanups pending after this; this patch is large enough as-is. llvm-svn: 134829	2011-07-09 17:41:24 +00:00
Chris Lattner	4ddffa2acc	more tests not making the jump into the brave new world. llvm-svn: 134820	2011-07-09 16:57:10 +00:00
NAKAMURA Takumi	2cbabf301a	test/CodeGen/X86/vector.ll: Tweak temporary output to appease Win32 hosts. With Lit (not bash) in a test, multiple redirects >%t might open(%t, "w") multiple times. It can be avoided if the latter redirect is >>%t. It might work even if ">/dev/null" were used. llvm-svn: 134814	2011-07-09 10:22:28 +00:00
Jakob Stoklund Olesen	fe41eb3bda	Hoist spills within a basic block. Try to move spills as early as possible in their basic block. This can help eliminate interferences by shortening the live range being spilled. This fixes PR10221. llvm-svn: 134776	2011-07-09 00:25:03 +00:00
Evan Cheng	9719ca7c76	Fix broken x86_64 tests which specify non-64-bit cpu's. llvm-svn: 134756	2011-07-08 22:29:33 +00:00
Eli Friedman	0ea2c325a9	Default 64-bit target features and SSE2 on when a triple specifies x86-64. Clean up all the other hacks which are now unnecessary. llvm-svn: 134753	2011-07-08 22:16:47 +00:00
Jim Grosbach	2b8103505a	Make tBX_RET and tBX_RET_vararg predicable. The normal tBX instruction is predicable, so there's no reason the pseudos for using it as a return shouldn't be. Gives us some nice code-gen improvements as can be seen by the test changes. In particular, several tests now have to disable if-conversion because it works too well and defeats the test. llvm-svn: 134746	2011-07-08 21:50:04 +00:00
Julien Lerouge	75e462e164	Add _allrem, _aullrem and _allmul to the runtime for MSVC. http://llvm.org/bugs/show_bug.cgi?id=10305 llvm-svn: 134744	2011-07-08 21:40:25 +00:00
Cameron Zwarich	c23366d357	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Jakob Stoklund Olesen	acaf9e9ce1	Be more aggressive about following hints. RAGreedy::tryAssign will now evict interference from the preferred register even when another register is free. To support this, add the EvictionCost struct that counts how many hints are broken by an eviction. We don't want to break one hint just to satisfy another. Rename canEvict to shouldEvict, and add the first bit of eviction policy that doesn't depend on spill weights: Always make room in the preferred register as long as the evictees can be split and aren't already assigned to their preferred register. Also make the CSR avoidance more accurate. When looking for a cheaper register it is OK to use a new volatile register. Only CSR aliases that have never been used before should be avoided. llvm-svn: 134735	2011-07-08 20:46:18 +00:00
Jim Grosbach	435ca7304c	Use ARMPseudoExpand for ARM tail calls. llvm-svn: 134719	2011-07-08 18:50:22 +00:00
Benjamin Kramer	44c76d239a	Emit a more efficient magic number multiplication for exact sdivs. We have to do this in DAGBuilder instead of DAGCombiner, because the exact bit is lost after building. struct foo { char x[24]; }; long bar(struct foo *a, struct foo *b) { return a-b; } is now compiled into: movl 4(%esp), %eax; subl 8(%esp), %eax; sarl $3, %eax; imull $-1431655765, %eax, %eax instead of: movl 4(%esp), %eax; subl 8(%esp), %eax; movl $715827883, %ecx; imull %ecx; movl %edx, %eax; shrl $31, %eax; sarl $2, %edx; addl %eax, %edx; movl %edx, %eax llvm-svn: 134695	2011-07-08 10:31:30 +00:00
Jakob Stoklund Olesen	99c67603c7	Fix more register allocation sensitive tests. llvm-svn: 134667	2011-07-08 00:24:06 +00:00
Jakob Stoklund Olesen	47bc41b3c3	Remove a test that no longer makes sense. It was testing a linear scan feature: Test if linearscan is unfavoring registers for allocation to allow more reuse of reloads from stack slots. The greedy register allocator doesn't access any stack slots in this function, so the linear scan feature was not being tested. llvm-svn: 134666	2011-07-08 00:24:03 +00:00
Nick Lewycky	a82f7a687e	Let the inline asm 'q' constraint match float, and on 64-bit double too. Fixes PR9602! llvm-svn: 134665	2011-07-08 00:19:27 +00:00
Eric Christopher	5fb023bb10	Go ahead and emit the barrier on x86-64 even without sse2. The processor supports it just fine. Fixes PR9675 and rdar://9740801 llvm-svn: 134664	2011-07-08 00:04:56 +00:00
Eric Christopher	b7597bc669	Add support for the X86 'l' constraint. Fixes PR10149 and rdar://9738585 llvm-svn: 134648	2011-07-07 22:29:07 +00:00
Evan Cheng	bbed81df25	Add Mode64Bit feature and sink it down to MC layer. llvm-svn: 134641	2011-07-07 21:06:52 +00:00
Evan Cheng	952943f744	Change some ARM subtarget features to be single bit yes/no in order to sink them down to MC layer. Also fix tests. llvm-svn: 134590	2011-07-07 03:55:05 +00:00
Lang Hames	2c2f6ed1f7	Added a testcase for PR10220. llvm-svn: 134573	2011-07-07 00:36:02 +00:00
Jakub Staszak	28bcc8673e	Introduce "expect" intrinsic instructions. llvm-svn: 134516	2011-07-06 18:22:43 +00:00
Dan Gohman	151e8ce446	Revert r134366 and add an explicit triple to make this test host-independent. llvm-svn: 134447	2011-07-05 22:09:19 +00:00
Jakob Stoklund Olesen	f95a1068bd	Fix PR10277. Remat during spilling triggers dead code elimination. If a phi-def becomes unused, that may also cause live ranges to split into separate connected components. This type of splitting is different from normal live range splitting. In particular, there may not be a common original interval. When the split range is its own original, make sure that the new siblings are also their own originals. The range being split cannot be used as an original since it doesn't cover the new siblings. llvm-svn: 134413	2011-07-05 15:38:41 +00:00
NAKAMURA Takumi	c0837d703b	test/CodeGen/X86/lsr-nonaffine.ll: Relax expressions for Win64 CC to appease Win32 hosts. llvm-svn: 134366	2011-07-03 09:26:14 +00:00
Chandler Carruth	e07bb36a9e	FileCheck-ize another test. Reduces the llc invocations from 8 to 1, and makes one of the tests actually mean something (as the string 'add' will always appear in the output of this file). llvm-svn: 134358	2011-07-02 21:34:52 +00:00
Chandler Carruth	78b12b3ed4	FileCheck-ize another X86 test, making it more precisely verify the desired result based on the comments in the file. llvm-svn: 134354	2011-07-02 20:43:16 +00:00
Chandler Carruth	1926e141f1	FileCheck-ize and simplify RUN lines. llvm-svn: 134352	2011-07-02 20:43:11 +00:00
Chandler Carruth	5de1d825e4	FileCheck-ize llvm-svn: 134351	2011-07-02 20:43:08 +00:00
Chandler Carruth	01e8f9314e	FileCheck-ize and tighten up assertions to only check the relevant sections. llvm-svn: 134350	2011-07-02 20:43:04 +00:00
Chandler Carruth	500b05b1bb	FileCheck-ize and cleanup IR. llvm-svn: 134349	2011-07-02 20:43:01 +00:00
Chandler Carruth	c674fb38ef	FileCheck-ize llvm-svn: 134348	2011-07-02 20:42:59 +00:00
Chandler Carruth	341ed5f0a0	Remove a grep that is already checked with FileCheck. llvm-svn: 134346	2011-07-02 20:42:56 +00:00
Chandler Carruth	88e183829b	FileCheck-ize llvm-svn: 134345	2011-07-02 20:42:53 +00:00
Chandler Carruth	7a0f51e003	FileCheck-ize and modernize IR. llvm-svn: 134344	2011-07-02 20:42:50 +00:00
Chandler Carruth	4af34fe339	FileCheck-ize and simplify RUNs. llvm-svn: 134343	2011-07-02 20:42:48 +00:00
Chandler Carruth	9e114fc3ee	FileCheck-ize and modernize the RUN line. llvm-svn: 134342	2011-07-02 20:42:44 +00:00
Chandler Carruth	df1690a113	FileCheck-ize, tightening checks and avoiding a temporary file. llvm-svn: 134341	2011-07-02 20:42:42 +00:00
Chandler Carruth	a5b1de166b	FileCheck-ize, tightening checks and avoiding a temporary file. llvm-svn: 134340	2011-07-02 20:42:39 +00:00
Chandler Carruth	c041ee0766	FileCheck-ize llvm-svn: 134339	2011-07-02 20:42:36 +00:00
Chandler Carruth	4f82b948fd	FileCheck-ize llvm-svn: 134338	2011-07-02 20:42:33 +00:00
Chandler Carruth	e344d9c676	FileCheck-ize a test, avoiding a temporary file. llvm-svn: 134337	2011-07-02 20:42:31 +00:00
Chandler Carruth	d939fba46d	FileCheck-ize and simplify this test. llvm-svn: 134336	2011-07-02 20:42:28 +00:00
Chandler Carruth	b870175dd5	FileCheck-ize llvm-svn: 134335	2011-07-02 20:42:25 +00:00
Chandler Carruth	d98a57cc5a	FileCheck-ize another codegen test. llvm-svn: 134334	2011-07-02 20:42:22 +00:00
Chandler Carruth	4c7e28777b	Partially FileCheck-ize a test to remove a weird quoting situation. llvm-svn: 134333	2011-07-02 20:42:20 +00:00
Chandler Carruth	0d1da937eb	FileCheck-ize another test, and upgrade its syntax a bit. llvm-svn: 134332	2011-07-02 20:42:17 +00:00
Chandler Carruth	4fd8502d12	FileCheck-ize another codegen test, tightening it up. llvm-svn: 134331	2011-07-02 20:42:14 +00:00
Chandler Carruth	b74aff3ce8	FileCheck-ize another test, making it much more precise for testing the individual cases, while hard coding less about registers in use. llvm-svn: 134330	2011-07-02 20:42:11 +00:00
Chandler Carruth	70fa55f478	FileCheck-ize another test. This one is more clear and runs fewer commands as a result. llvm-svn: 134329	2011-07-02 20:42:08 +00:00
Chandler Carruth	72358a4bf8	FileCheck-ize a test, no functionality changed. llvm-svn: 134328	2011-07-02 20:42:06 +00:00
Jakob Stoklund Olesen	b94d989634	Better diagnostics when inline asm fails to allocate. asm.c:2:7: error: ran out of registers during register allocation asm(""::"r"(0), "r"(1), "r"(2), "r"(3), "r"(4), "r"(5), "r"(6), "r"(7), "r"(8), "r"(9)); ^ llvm-svn: 134310	2011-07-02 07:17:37 +00:00
Eric Christopher	9689f96b1e	Be less specific about register allocation ordering. llvm-svn: 134308	2011-07-02 04:06:41 +00:00
Eric Christopher	7260817287	TargetConstant immediates won't be placed into registers so tighten up the valid constant check earlier. rdar://9692967 llvm-svn: 134286	2011-07-01 23:04:38 +00:00
Dan Gohman	c093f48834	Teach IVUsers to stop at non-affine expressions unless they are both outside the loop and reducible. This more completely hides them from LSR, which isn't usually able to do anything meaningful with non-affine expressions anyway, and this consequently hides them from SCEVExpander, which is acutely unprepared for non-affine expressions. Replace test/CodeGen/X86/lsr-nonaffine.ll with a new test that tests the new behavior. This works around the bug in PR10117 / rdar://problem/9633149, and is generally an improvement besides. llvm-svn: 134268	2011-07-01 22:05:19 +00:00
Jim Grosbach	461adc233e	ARMv7M vs. ARMv7E-M support. The DSP instructions in the Thumb2 instruction set are an optional extension in the Cortex-M* architecture. When present, the implementation is considered an "ARMv7E-M implementation," and when not, an "ARMv7-M implementation." Add a subtarget feature hook for the v7e-m instructions and hook it up. The cortex-m3 cpu is an example of a v7m implementation, while the cortex-m4 is a v7e-m implementation. rdar://9572992 llvm-svn: 134261	2011-07-01 21:12:19 +00:00
Eric Christopher	d369a9fe83	Add support for the 'j' immediate constraint. This is conditionalized on supporting the instruction that the constraint is for 'movw'. Part of rdar://9119939 llvm-svn: 134222	2011-07-01 01:00:07 +00:00
Eric Christopher	4bc6b7e1a6	Add support for the ARM 't' register constraint. And another testcase for the 'x' register constraint. Part of rdar://9119939 llvm-svn: 134220	2011-07-01 00:30:46 +00:00
Eric Christopher	d40f06b48f	Add support for the 'x' constraint. Part of rdar://9307836 and rdar://9119939 llvm-svn: 134215	2011-07-01 00:14:47 +00:00
Jakob Stoklund Olesen	8b22811785	Fix a problem with fast-isel return values introduced in r134018. We would put the return value from long double functions in the wrong register. This fixes gcc.c-torture/execute/conversion.c llvm-svn: 134205	2011-06-30 23:42:18 +00:00
Eric Christopher	2582061ec1	Add support for the 'h' constraint. Part of rdar://9119939 llvm-svn: 134203	2011-06-30 23:23:01 +00:00
Jim Grosbach	32d3b2625b	Thumb1 register to register MOV instruction is predicable. Fix a FIXME and allow predication (in Thumb2) for the T1 register to register MOV instructions. This allows some better codegen with if-conversion (as seen in the test updates), plus it lays the groundwork for pseudo-izing the tMOVCC instructions. llvm-svn: 134197	2011-06-30 22:10:46 +00:00
Jim Grosbach	8c1fb3c4e1	Pseudo-ize the t2LDMIA_RET instruction. It's just a t2LDMIA_UPD instruction with extra codegen properties, so it doesn't need the encoding information. As a side-benefit, we now correctly recognize for instruction printing as a 'pop' instruction. llvm-svn: 134173	2011-06-30 18:25:42 +00:00
Eric Christopher	7ce905754f	Fix a small thinko for constant i64 lock/orq optimization where we didn't have an opcode for 64-bit constant or expressions. Fixes rdar://9692967 llvm-svn: 134121	2011-06-30 00:48:30 +00:00
Devang Patel	66c4bc1dda	Revert r133953 for now. llvm-svn: 134116	2011-06-29 23:50:13 +00:00
Cameron Zwarich	2ffbcf9b96	In the ARM global merging pass, allow extraneous alignment specifiers. This pass already makes the assumption, which is correct on ARM, that a type's alignment is less than its alloc size. This improves codegen with Clang (which inserts a lot of extraneous alignment specifiers) and fixes <rdar://problem/9695089>. llvm-svn: 134106	2011-06-29 22:24:25 +00:00
Benjamin Kramer	d97872524b	Don't depend on the optimization reverted in r134067. llvm-svn: 134068	2011-06-29 14:07:18 +00:00
Benjamin Kramer	cc91642a94	Revert a part of r126557 which could create unschedulable DAGs. llvm-svn: 134067	2011-06-29 13:47:25 +00:00
Jakob Stoklund Olesen	7d3e1553d2	Clean up the handling of the x87 fp stack to make it more robust. Drop the FpMov instructions, use plain COPY instead. Drop the FpSET/GET instruction for accessing fixed stack positions. Instead use normal COPY to/from ST registers around inline assembly, and provide a single new FpPOP_RETVAL instruction that can access the return value(s) from a call. This is still necessary since you cannot tell from the CALL instruction alone if it returns anything on the FP stack. Teach fast isel to use this. This provides a much more robust way of handling fixed stack registers - we can tolerate arbitrary FP stack instructions inserted around calls and inline assembly. Live range splitting could sometimes break x87 code by inserting spill code in unfortunate places. As a bonus we handle floating point inline assembly correctly now. llvm-svn: 134018	2011-06-28 18:32:28 +00:00
Roman Divacky	736e37d9b9	Implement ISD::VAARG lowering on PPC32. llvm-svn: 134005	2011-06-28 15:30:42 +00:00
Jakob Stoklund Olesen	55a0ce1776	FileCheckize a couple of tests. Also add a test for popping dead return values and avoid testing the spill precision. llvm-svn: 133997	2011-06-28 06:25:03 +00:00
Chandler Carruth	910d35b98b	FileCheck-ize a test that had the strangest TCL quote I've seen yet: an opening single quote with no closing single quote, and with {} quotes "inside" of it. This broke some of our tools that scrape test cases. Also, while here, make the test actually assert what the comment says it asserts. This was essentially authored by Nick Lewycky, and merely typed in by myself. Let me know if this is still missing the mark, but the previous test only succeeded due to the improper quoting preventing anything from matching the grep -- it had a '4(%...)' sequence in the output! llvm-svn: 133980	2011-06-28 02:03:10 +00:00
Evan Cheng	7df851a4ff	Remove the experimental (and unused) pre-ra splitting pass. Greedy regalloc can split live ranges. llvm-svn: 133962	2011-06-27 23:40:45 +00:00
Devang Patel	8fbd4b55ea	During bottom up fast-isel, instructions emitted to materialize registers are at the top of the basic block and do not have a debug location. This may misguide the debugger while entering the basic block, and sometimes the debugger provides a semi-useful view of the current location to the developer by picking up the previous known location as the current location. Assign a sensible location to the first instruction in a basic block, if it does not have a location derived from the source file, so that the debugger can provide a meaningful user experience to developers in edge cases. llvm-svn: 133953	2011-06-27 22:32:04 +00:00
Eric Christopher	bb65f96b18	Allow lr in the register options here. llvm-svn: 133935	2011-06-27 20:31:01 +00:00
Jakob Stoklund Olesen	58c34c0e80	Move all inline-asm-fpstack tests to a single file. Also fix some of the tests that were actually testing wrong behavior - An input operand in {st} is only popped by the inline asm when {st} is also in the clobber list. The original bug reports all had ~{st} clobbers as they should. llvm-svn: 133916	2011-06-27 17:27:37 +00:00
Dan Bailey	8de16fa817	PTX: corrected tests that were failing llvm-svn: 133875	2011-06-25 19:41:17 +00:00
Dan Bailey	5b68fc5126	PTX: Reverting implementation of i8. The .b8 operations in PTX are far more limiting than I first thought. The mov operation isn't even supported, so there's no way of converting a .pred value into a .b8 without going via .b16, which is not sensible. An improved implementation needs to use the fact that loads and stores automatically extend and truncate to implement support for EXTLOAD and TRUNCSTORE in order to correctly support boolean values. llvm-svn: 133873	2011-06-25 18:16:28 +00:00
Chad Rosier	2c0dc1fb19	Test case for r133858 (tail call optimize in the presence of byval). llvm-svn: 133863	2011-06-25 02:44:56 +00:00
Devang Patel	91fee59b74	Handle debug info for i128 constants. llvm-svn: 133821	2011-06-24 20:46:11 +00:00
Dan Bailey	2237ea06fb	PTX: Add support for i8 type and introduce associated .b8 registers The i8 type is required for boolean values, but can only use ld, st and mov instructions. The i1 type continues to be used for predicates. llvm-svn: 133814	2011-06-24 19:27:10 +00:00
Chad Rosier	3127a19140	The Neon VCVT (between floating-point and fixed-point, Advanced SIMD) instructions can be used to match combinations of multiply/divide and VCVT (between floating-point and integer, Advanced SIMD). Basically the VCVT immediate operand that specifies the number of fraction bits corresponds to a floating-point multiply or divide by the corresponding power of 2. For example, VCVT (floating-point to fixed-point, Advanced SIMD) can replace a combination of VMUL and VCVT (floating-point to integer) as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vmul.f32 d16, d17, d16 vcvt.s32.f32 d16, d16 becomes: vcvt.s32.f32 d16, d16, #3 Similarly, VCVT (fixed-point to floating-point, Advanced SIMD) can replace a combination of VCVT (integer to floating-point) and VDIV as follows: Example (assume d17 = <float 8.000000e+00, float 8.000000e+00>): vcvt.f32.s32 d16, d16 vdiv.f32 d16, d17, d16 becomes: vcvt.f32.s32 d16, d16, #3 llvm-svn: 133813	2011-06-24 19:23:04 +00:00
Akira Hatanaka	539ba34c25	Change the chain input of nodes that load the address of a function. This change enables SelectionDAG::getLoad at MipsISelLowering.cpp:1914 to return a pre-existing node instead of redundantly creating a new node every time it is called. llvm-svn: 133811	2011-06-24 19:01:25 +00:00
Akira Hatanaka	3a3e7dfd84	Prevent generation of redundant addiu instructions that compute address of static variables or functions. llvm-svn: 133803	2011-06-24 17:55:19 +00:00
Justin Holewinski	a1dd1dd26e	PTX: Always use registers for return values, but use .param space for device parameters if SM >= 2.0 - Update test cases to be more robust against register allocation changes - Bump up the number of registers to 128 per type - Include Python script to re-generate register file with any number of registers llvm-svn: 133736	2011-06-23 18:10:13 +00:00
Justin Holewinski	acf53a172e	PTX: Fixup test cases for device param changes llvm-svn: 133735	2011-06-23 18:10:08 +00:00
Andrew Trick	aec8bc23bf	lit support for REQUIRES: asserts. Take #2. Don't piggyback on the existing config.build_mode. Instead, define a new lit feature for each build feature we need (currently just "asserts"). Teach both autoconf'd and cmake'd Makefiles to define this feature within test/lit.site.cfg. This doesn't require any lit harness changes and should be more robust across build systems. llvm-svn: 133664	2011-06-22 23:23:19 +00:00
Rafael Espindola	e57d6977be	Reenable tail duplication of bb with just an unconditional jump, but don't remove blocks that have their address taken. llvm-svn: 133659	2011-06-22 22:31:57 +00:00
Nick Lewycky	7f45c2bd84	Needs a triple. llvm-svn: 133634	2011-06-22 19:42:14 +00:00
Nick Lewycky	bf55e4b776	Emit trailing padding on constant vectors when TargetData says that the vector is larger than the sum of the elements (including per-element padding). llvm-svn: 133631	2011-06-22 18:55:03 +00:00
Justin Holewinski	376f1d46d4	PTX: Add signed integer comparisons llvm-svn: 133599	2011-06-22 02:09:50 +00:00
Justin Holewinski	0844ac41b6	PTX: Add .address_size directive if PTX version >= 2.3 Patch by Wei-Ren Chen llvm-svn: 133589	2011-06-22 00:43:56 +00:00
Devang Patel	f610afdefb	Test case for r133560. llvm-svn: 133585	2011-06-22 00:03:42 +00:00
Bob Wilson	5b04895bb8	Revert r133452: "Emit movq for 64-bit register to XMM register moves..." This is breaking compiler-rt and llvm-gcc builds on MacOSX when not using the integrated assembler. llvm-svn: 133524	2011-06-21 17:35:13 +00:00
Anna Zaks	488fc45c84	Add support for sadd.with.overflow and uadd.with.overflow intrinsics to the CBackend by emitting definitions for each intrinsic that occurs in the module. llvm-svn: 133522	2011-06-21 17:18:15 +00:00
Evan Cheng	40adfc21f6	Teach dag combine to match halfword byteswap patterns. 1. (((x) & 0xFF00) >> 8) | (((x) & 0x00FF) << 8) => (bswap x) >> 16 2. ((x&0xff)<<8)|((x&0xff00)>>8)|((x&0xff000000)>>8)|((x&0x00ff0000)<<8) => (rotl (bswap x) 16) This allows us to eliminate most of the def : Pat patterns for ARM rev16 revsh instructions. It catches many more cases for ARM and x86. rdar://9609108 llvm-svn: 133503	2011-06-21 06:01:08 +00:00
Akira Hatanaka	1e08980a21	Re-apply 132758 and 132768 which were speculatively reverted in 132777. llvm-svn: 133494	2011-06-21 00:40:49 +00:00
Justin Holewinski	e62da847fa	PTX: Fix conversion between predicates and value types llvm-svn: 133454	2011-06-20 18:42:48 +00:00
Nick Lewycky	831fb8200d	Emit movq for 64-bit register to XMM register moves, but continue to accept movd when assembling. llvm-svn: 133452	2011-06-20 18:33:26 +00:00
Roman Divacky	79578394f5	Don't apply on PPC64 the 32bit ADDIC optimizations as there's no overflow with 32bit values. llvm-svn: 133439	2011-06-20 15:28:39 +00:00
Nadav Rotem	ea7e393b4e	Fix PromoteIntRes_TRUNCATE: Add support for cases where the source vector type is to be split while the target vector is to be promoted. (eg: <4 x i64> -> <4 x i8> ) llvm-svn: 133424	2011-06-20 07:15:58 +00:00
Benjamin Kramer	c20d8728fc	Update test. llvm-svn: 133390	2011-06-19 12:14:34 +00:00
Nadav Rotem	07b7d6858d	Reduce the runtime of the test. Keep only the interesting cases. llvm-svn: 133381	2011-06-19 08:12:43 +00:00
Chris Lattner	6aa403748e	Remove support for parsing the "type i32" syntax for defining a numbered top level type without a specified number. This syntax isn't documented and blocks forward progress. llvm-svn: 133371	2011-06-19 00:03:46 +00:00
Chris Lattner	ad5400fa72	rip out a ton of intrinsic modernization logic from AutoUpgrade.cpp, which is for pre-2.9 bitcode files. We keep x86 unaligned loads, movnt, crc32, and the target indep prefetch change. As usual, updating the testsuite is a PITA. llvm-svn: 133337	2011-06-18 06:05:24 +00:00
Jakob Stoklund Olesen	6346426b8c	Switch ARM to using AltOrders instead of MethodBodies. This slightly changes the GPR allocation order on Darwin where R9 is not a callee-saved register: Before: %R0 %R1 %R2 %R3 %R12 %R9 %LR %R4 %R5 %R6 %R8 %R10 %R11 After: %R0 %R1 %R2 %R3 %R9 %R12 %LR %R4 %R5 %R6 %R8 %R10 %R11 llvm-svn: 133326	2011-06-18 01:14:46 +00:00
Galina Kistanova	36039fd720	Moved to the right place. llvm-svn: 133324	2011-06-18 00:59:37 +00:00
Eric Christopher	169d53e1e0	Fix UMULO support for 2x register width to allow the full range without a libcall to a new mulo<mode> libcall that we'd have to create. Finishes the rest of rdar://9090077 and rdar://9210061 llvm-svn: 133318	2011-06-18 00:09:57 +00:00
Nadav Rotem	0cec5ab356	Fix a bug in the type-lowering of integer-promoted elements. Add a check that the newly created simple type is valid before checking its legality. Re-commit the test file. llvm-svn: 133291	2011-06-17 20:54:12 +00:00
Evan Cheng	df9192b200	Add an alternative rev16 pattern. We should figure out a better way to handle these complex rev patterns. rdar://9609108 llvm-svn: 133289	2011-06-17 20:47:21 +00:00
Eric Christopher	25aa04466a	Lower multiply with overflow checking to __mulo<mode> calls if we haven't been able to lower them any other way. Fixes rdar://9090077 and rdar://9210061 llvm-svn: 133288	2011-06-17 20:41:29 +00:00
Galina Kistanova	b46dac6e3d	Test 2008-06-04-indirectmem.ll is X86-specific. Move to X86 folder. llvm-svn: 133275	2011-06-17 18:26:23 +00:00
Chris Lattner	2e2fad280a	Stop accepting and ignoring attributes in function types. Attributes are applied to functions and call/invokes, not to types. llvm-svn: 133266	2011-06-17 17:37:13 +00:00
Roman Divacky	6778c94b24	Fix a few places where 32bit instructions/registerset were used on PPC64. llvm-svn: 133260	2011-06-17 15:21:10 +00:00
Justin Holewinski	c515f1b903	PTX: Adjust rounding modes * rounding modes for fp add, mul, sub now use .rn * float -> int rounding correctly uses .rzi not .rni * 32bit fdiv for sm13 uses div.rn (instead of div.approx) * 32bit fdiv for sm10 now uses div (instead of div.approx) Approx is not IEEE 754 compatible (and should be optionally set by a flag to the backend instead). The .rn rounding modifier is the PTX default anyway, but it's better to be explicit. All these modifiers should be available by using __fmul_rz functions for example, but support will need to be added for this in the backend. Patch by Dan Bailey llvm-svn: 133253	2011-06-17 12:12:42 +00:00
Chris Lattner	0899957b99	make the asmparser reject function and type redefinitions. 'Merging' hasn't been needed since llvm-gcc 3.4 days. llvm-svn: 133248	2011-06-17 07:06:44 +00:00
Chris Lattner	385977c252	remove asmparser support for the old getresult instruction, which has been subsumed by extractvalue. llvm-svn: 133247	2011-06-17 06:57:15 +00:00
Chris Lattner	9e7c036d09	remove parser support for the obsolete "multiple return values" syntax, which was replaced with return of a "first class aggregate". llvm-svn: 133245	2011-06-17 06:49:41 +00:00
Chris Lattner	4eb6f76fa6	Remove support for using "foo" as symbols instead of %"foo". This is ancient syntax and has been long obsolete. As usual, updating the tests is the nasty part of this. llvm-svn: 133242	2011-06-17 06:36:20 +00:00
Chris Lattner	9ec82f54d4	manually upgrade a bunch of tests to modern syntax, and remove some that are either unreduced or only test old syntax. llvm-svn: 133228	2011-06-17 03:14:27 +00:00
Cameron Zwarich	681f02ec26	Update an insertion point iterator after replacing a return instruction with a tail call pseudoinstruction. This fixes <rdar://problem/9624333>. llvm-svn: 133227	2011-06-17 02:16:43 +00:00
Jakob Stoklund Olesen	91874697b3	Don't use register classes larger than TLI->getRegClassFor(VT). In Thumb mode we cannot handle GPR virtual registers, even though some instructions can. When isel is lowering a CopyFromReg, it should limit itself to subclasses of getRegClassFor(VT). <rdar://problem/9624323> llvm-svn: 133210	2011-06-16 22:50:38 +00:00
Nick Lewycky	ba962a7115	There's no need to be so picky about the particular register. llvm-svn: 133189	2011-06-16 21:00:00 +00:00
Justin Holewinski	32a7bad9db	PTX: Finish new calling convention implementation llvm-svn: 133172	2011-06-16 17:50:00 +00:00
Bruno Cardoso Lopes	f52f4dd0b8	Add AVX support for fpextend. Original patch by Syoyo Fujita with more comments by me. llvm-svn: 133153	2011-06-16 07:03:21 +00:00
Eli Friedman	4594e0f01a	FileCheck-ize test, and make it work on EABI hosts, like clang-native-arm-cortex-a9. llvm-svn: 133139	2011-06-16 02:36:32 +00:00
Eli Friedman	014d4feac5	Force a triple here so this test doesn't fail on EABI hosts (like clang-native-arm-cortex-a9). llvm-svn: 133134	2011-06-16 01:49:31 +00:00
Nick Lewycky	c62f935caf	Commit the right set of tests for r133124. Sorry 'bout that! llvm-svn: 133133	2011-06-16 01:35:45 +00:00
Andrew Trick	8c37d99180	Reenabling this test with REQUIRES: Asserts llvm-svn: 133132	2011-06-16 01:34:41 +00:00
Chad Rosier	26513932a2	Typos. llvm-svn: 133128	2011-06-16 01:24:24 +00:00
Chad Rosier	66fa658a4b	Revision r128665 added an optimization to make use of NEON multiplier accumulator forwarding. Specifically (from SVN log entry): Distribute (A + B) * C to (A * C) + (B * C) to make use of NEON multiplier accumulator forwarding: vadd d3, d0, d1 vmul d3, d3, d2 => vmul d3, d0, d2 vmla d3, d1, d2 Make sure it catches cases where operand 1 is add/fadd/sub/fsub, which was intended in the original revision. llvm-svn: 133127	2011-06-16 01:21:54 +00:00
Nick Lewycky	f4886c7374	Add a DAGCombine for (ext (binop (load x), cst)). llvm-svn: 133124	2011-06-16 01:15:49 +00:00
Anna Zaks	73f1ba0a88	Rename the test. Thanks Cameron! Use shorter/generic names. llvm-svn: 133115	2011-06-16 00:34:10 +00:00
Anna Zaks	e2a947a4f7	Function::getNumBlockIDs() should be used instead of Function::size() to set the upper limit on the block IDs since basic blocks might get removed (simplified away) after being initially numbered. Plus the test case, in which SelectionDAGBuilder::visitBr() calls llvm::MachineFunction::removeFromMBBNumbering(), which introduces the hole in numbering leading to an assert in llc (prior to the fix). llvm-svn: 133113	2011-06-16 00:03:21 +00:00
Rafael Espindola	8edd93b519	Testcase for previous commit. llvm-svn: 133089	2011-06-15 21:18:51 +00:00
John McCall	e6835ee44e	Add a new function attribute, nonlazybind, which inhibits lazy-loading optimizations when emitting calls to the function; instead those calls may use faster relocations which require the function to be immediately resolved upon loading the dynamic object featuring the call. This is useful when it is known that the function will be called frequently and pervasively and therefore there is no merit in delaying binding of the function. Currently only implemented for x86-64, where it turns into a call through the global offset table. Patch by Dan Gohman, who assures me that he's going to add LangRef documentation for this once it's committed. llvm-svn: 133080	2011-06-15 20:36:13 +00:00
Andrew Trick	6d96f3a7a2	Disabling this test until I can figure out the right lit flags. llvm-svn: 133068	2011-06-15 18:25:38 +00:00
Jakob Stoklund Olesen	7b0de9a9e0	Remove custom allocation orders in SystemZ. Note that this actually changes code generation, and someone who understands this target better should check the changes. - R12Q is now allocatable. I think it was omitted from the allocation order by mistake since it isn't reserved. It is apparently used as a GOT pointer sometimes, and it should probably be reserved if that is the case. - The GR64 registers are allocated in a different order now. The register allocator will automatically put the CSRs last. There were other changes to the order that may have been significant. The test fix is because r0 and r1 swapped places in the allocation order. llvm-svn: 133067	2011-06-15 18:02:56 +00:00
Evan Cheng	30f84a59ae	Another revsh pattern. rdar://9609059 llvm-svn: 133064	2011-06-15 17:17:48 +00:00
Andrew Trick	ce93f28a36	Added -stress-sched flag in the Asserts build. Added a test case for handling physreg aliases during pre-RA-sched. llvm-svn: 133063	2011-06-15 17:16:12 +00:00
Chad Rosier	f6c1b3b81f	TargetLoweringOpt is a struct used by DAGCombine, not a pass. llvm-svn: 133062	2011-06-15 16:48:02 +00:00
Nadav Rotem	72e51c94b1	This test was failing on X86 machines which do not have SSE4. Fixed the test by specifying that the target CPU is corei7. llvm-svn: 133053	2011-06-15 12:26:53 +00:00
Evan Cheng	7624839811	PerformBFICombine - (bfi A, (and B, Mask1), Mask2) -> (bfi A, B, Mask2) iff the bits being cleared by the AND are not demanded by the BFI. The previous BFI dag combine rule was actually incorrect (or used to be correct until BFI representation changed). rdar://9609030 llvm-svn: 133034	2011-06-15 01:12:31 +00:00
Tanya Lattner	5ee64fc868	Add an optimization that looks for a specific pair-wise add pattern and generates a vpaddl instruction instead of scalarizing the add. Includes a test case. llvm-svn: 133027	2011-06-14 23:48:48 +00:00
Rafael Espindola	0811c47a3a	Add triple. llvm-svn: 133026	2011-06-14 23:47:36 +00:00
Chad Rosier	30333c668f	When pattern matching during instruction selection, make sure shl x,1 is not converted to add x,x if x is undef. add undef, undef does not guarantee that the resulting low order bit is zero. Fixes <rdar://problem/9453156> and <rdar://problem/9487392>. llvm-svn: 133022	2011-06-14 22:29:10 +00:00
Rafael Espindola	d133b092e2	Check the llc output. llvm-svn: 133021	2011-06-14 22:24:32 +00:00
Stuart Hastings	63da197d28	Test case for x86 MMX inline asm. rdar://problem/8886707 llvm-svn: 133014	2011-06-14 21:51:38 +00:00
Rafael Espindola	4e8b511063	Add a test for the recent regression. llvm-svn: 133009	2011-06-14 20:38:50 +00:00
Dan Gohman	2071b00bdf	This test is still failing. Delete the rest of it. llvm-svn: 133001	2011-06-14 18:07:36 +00:00
Dan Gohman	d6dcf3e5e3	Revert r132991. This test is failing on the llvm-gcc-x86_64-linux-selfhost buildbot and others. llvm-svn: 133000	2011-06-14 18:03:11 +00:00
Rafael Espindola	1e809f99ad	Add 132986 back, but avoid non-determinism if a bb address gets reused. llvm-svn: 132995	2011-06-14 15:31:54 +00:00
Nadav Rotem	7b529545b7	Add a testcase for #9623 llvm-svn: 132991	2011-06-14 13:23:10 +00:00
Rafael Espindola	b90ea8a8c7	revert 132986 to see if the bots go green. llvm-svn: 132988	2011-06-14 12:48:26 +00:00
Nadav Rotem	b638c3c037	This testcase causes a failure on some bots. Remove the failing test until further investigation. llvm-svn: 132986	2011-06-14 09:10:37 +00:00
Nadav Rotem	1b92c3d96c	Add a testcase for checking the integer-promotion of many different vector types (with power of two types such as 8,16,32 .. 512). Fix a bug in the integer promotion of bitcast nodes. Enable integer expanding only if the target of the conversion is an integer (when the type action is scalarize). Add handling to the legalization of vector load/store in cases where the saved vector is integer-promoted. llvm-svn: 132985	2011-06-14 08:11:52 +00:00
Rafael Espindola	434d19ff30	Implement Jakob's suggestion on how to detect fall through without calling AnalyzeBranch. llvm-svn: 132981	2011-06-14 06:08:32 +00:00
Bruno Cardoso Lopes	15b9096112	Since ARM's prefetch implementation predicted the presence of an instruction cache prefetch and now that the info from "prefetch" to "ARMPreload" is present, only add a testcase for PLI. llvm-svn: 132978	2011-06-14 05:11:46 +00:00
Bruno Cardoso Lopes	b6afc5168f	Add one more argument to the prefetch intrinsic to indicate whether it's a data or instruction cache access. Update the targets to match it and also teach autoupgrade. llvm-svn: 132976	2011-06-14 04:58:37 +00:00
Rafael Espindola	56a82c5ef8	Make the threshold used by branch folding softer. Before we would get a sharp all or nothing transition when one extra predecessor was added. Now we still test first ones for merging. llvm-svn: 132974	2011-06-14 04:41:17 +00:00
Bill Wendling	77d4d62693	Heuristic: If the number of operands in the alias are more than the number of operands in the aliasee, don't print the alias. llvm-svn: 132963	2011-06-14 03:17:20 +00:00
Jakob Stoklund Olesen	2cac2ea7a1	Be less aggressive about hinting in RAFast. In particular, don't spill dirty registers only to satisfy a hint. It is not worth it. The attached test case provides an example where the fast allocator would spill a register when other registers are available. llvm-svn: 132900	2011-06-13 03:26:46 +00:00
Rafael Espindola	8d0f7518b2	Really fix the fall-through logic. Add a triple to the tests. llvm-svn: 132885	2011-06-12 05:57:01 +00:00
Rafael Espindola	f73c2dc8f6	Test for the previous commit. llvm-svn: 132884	2011-06-12 05:35:39 +00:00
Rafael Espindola	db58547906	AnalyzeBranch doesn't change which successors a bb has, just the order we try to branch to them. Before we were creating successor lists with duplicated entries. Fixing that found a bug in isBlockOnlyReachableByFallthrough that would cause it to return the wrong answer for ----------- ... jne foo jmp bar foo: ---------- llvm-svn: 132882	2011-06-12 03:20:32 +00:00
Eli Friedman	0bb1c525fd	Add full x86 fast-isel support for memcpy and memset. rdar://9431466 llvm-svn: 132864	2011-06-10 23:39:36 +00:00
Eli Friedman	b3764b7c97	Add -mattr=+sse2 to make the buildbots happy. llvm-svn: 132839	2011-06-10 08:26:26 +00:00
Chad Rosier	670af01484	Adding a test case for revision 132825. llvm-svn: 132830	2011-06-10 02:44:19 +00:00
Eli Friedman	96581336e6	Add a simple test which makes sure folding immediate float zero to a memory operand works. llvm-svn: 132824	2011-06-10 00:30:08 +00:00
Cameron Zwarich	af47f4a117	A CCState was being created without setting whether it is in the Call or Prologue state, causing an assertion failure downstream. This fixes <rdar://problem/9562908>. This really seems like it should always be set at CCState creation time, so mistakes like this can never happen. I'll take a look at doing that. llvm-svn: 132811	2011-06-09 22:30:07 +00:00
Eli Friedman	14c6ce9041	Change this DAGCombine to build AND of SHR instead of SHR of AND; this matches the ordering we prefer in instcombine. Part of rdar://9562809. The potential DAGCombine which enforces this more generally messes up some other very fragile patterns, so I'm leaving that alone, at least for now. llvm-svn: 132809	2011-06-09 22:14:44 +00:00
Eric Christopher	24dafa3dbc	Speculatively revert 132758 and 132768 to try to fix the Windows buildbots. llvm-svn: 132777	2011-06-09 16:03:19 +00:00
Eric Christopher	88088d9b8a	Recommit r132764 since it didn't cause the windows buildbot failures. llvm-svn: 132776	2011-06-09 15:39:01 +00:00
Eric Christopher	386e80f51e	Temporarily revert 132764 to see if it fixes the Windows buildbot. llvm-svn: 132771	2011-06-09 06:29:54 +00:00
Akira Hatanaka	33ec063f3b	Initial support for inline asm memory operand constraints. llvm-svn: 132768	2011-06-09 03:31:05 +00:00
Eric Christopher	65f7ea8a35	If the alignment of the byval argument is greater than the alignment of the frame then increase the maximum alignment of the frame to match. Fixes PR6965 llvm-svn: 132764	2011-06-09 00:15:19 +00:00
Akira Hatanaka	38115eb019	Fix bug in lowering of DYNAMIC_STACKALLOC nodes. The correct offset of the dynamically allocated stack area was not set. llvm-svn: 132758	2011-06-08 21:28:09 +00:00
Cameron Zwarich	e7e6bc3a33	Fix an issue where the two-address conversion pass incorrectly rewrites untied operands to an early clobber register. This fixes <rdar://problem/9566076>. llvm-svn: 132738	2011-06-07 23:54:00 +00:00
Rafael Espindola	12efa298a0	Fix a silly error I introduced in r131951. Fixes PR10095. llvm-svn: 132735	2011-06-07 23:26:45 +00:00
Stuart Hastings	ddd47ea403	Tweak this test for ARM-hosted 'bot. llvm-svn: 132711	2011-06-07 15:23:11 +00:00
Nadav Rotem	0f5e672008	Move the legalizer tests to the X86 directory because the test uses the x86 codegen. Thanks Galina. llvm-svn: 132706	2011-06-07 05:23:58 +00:00
Akira Hatanaka	fbeb14925f	Add test case for C++ exception handling and fix the following mistakes in MipsFrameLowering::emitPrologue: - cfi directives are not inserted at the right location or in the right order. - The source MachineLocation for the cfi directive that changes the cfa register to $fp should be MachineLocation::VirtualFP. - A PROLOG_LABEL that marks the beginning of cfi_offset directives for callee-saved register is emitted even when no callee-saved registers are saved. - When a callee-saved double precision register is saved, two cfi_offset directives, one for each of the paired single precision registers, should be emitted. llvm-svn: 132703	2011-06-07 02:17:21 +00:00
Jakob Stoklund Olesen	cf00a6d764	Simplify local live range splitting's safeguard to fix PR10070. When local live range splitting creates a live range with the same number of instructions as the old range, mark it as RS_Local. When such a range is seen again, require that it be split in a way that reduces the number of instructions. That guarantees we are making progress while still being able to perform 3 -> 2+3 splits as required by PR10070. This also means that the PrevSlot map is no longer needed. This was also used to estimate new spill weights, but that is no longer necessary after slotIndexes::insertMachineInstrInMaps() got the extra Late insertion argument. llvm-svn: 132697	2011-06-06 23:55:20 +00:00
Stuart Hastings	d044ba7a9f	Followup to 132458, omit unnecessary stack copy when x87 input is a load. rdar://problem/6373334 llvm-svn: 132696	2011-06-06 23:15:58 +00:00
Nadav Rotem	bfff2bd65a	Add methods to support the integer-promotion of vector types. Methods to legalize SDNodes such as BUILD_VECTOR, EXTRACT_VECTOR_ELT, etc. llvm-svn: 132689	2011-06-06 20:55:56 +00:00
Stuart Hastings	ecfa8a1a74	Test case for PR10085. llvm-svn: 132682	2011-06-06 20:03:22 +00:00
Eli Friedman	69da49c53a	PR10077: fix fast-isel of extractvalue of aggregate constants. llvm-svn: 132676	2011-06-06 05:46:34 +00:00
Benjamin Kramer	d15bc54757	Harden tests for windows path separators. llvm-svn: 132671	2011-06-05 18:20:05 +00:00
Jakob Stoklund Olesen	d513e33c69	Fix a test that keeps breaking when allocation orders change. Who said FileCheck couldn't handle arbitrarily complex conditions? llvm-svn: 132654	2011-06-04 23:34:40 +00:00
Nadav Rotem	5a64a09036	TypeLegalizer: Add support for passing of vector-promoted types in registers (copyFromParts/copyToParts). llvm-svn: 132649	2011-06-04 20:58:08 +00:00
Stuart Hastings	ea8b49dff3	Reapply 132424 with fixes. This fixes PR10068. rdar://problem/5993888 llvm-svn: 132606	2011-06-03 23:53:54 +00:00
Jakob Stoklund Olesen	1db0c48cba	Fix some tests that depend on register allocation. llvm-svn: 132602	2011-06-03 22:45:21 +00:00
Eric Christopher	bd0677f8db	Another possible bug. Stopgap until we can autogenerate tables and constraint lengths. Part of rdar://9037836 and rdar://9119939 llvm-svn: 132598	2011-06-03 22:09:12 +00:00
Eric Christopher	51ff48ad30	Fix an off by one error. Part of rdar://9037836 and rdar://9119939 llvm-svn: 132590	2011-06-03 20:44:52 +00:00
Jakob Stoklund Olesen	449aaba5b0	Switch AllocationOrder to using RegisterClassInfo instead of a BitVector of reserved registers. Use RegisterClassInfo in RABasic as well. This slightly changes some allocation orders because RegisterClassInfo puts CSR aliases last. llvm-svn: 132581	2011-06-03 20:34:53 +00:00
Eric Christopher	e831655dd9	Make the Uv constraint a memory operand. This doesn't solve the addressing mode problem mentioned in r132559. Backend part of rdar://9037836 and part of rdar://9119939 llvm-svn: 132561	2011-06-03 17:24:37 +00:00
Roman Divacky	3624922127	Fix wrong usages of CTR/MCTR where CTR8/MCTR8 was meant. - Check for MTCTR8 in addition to MTCTR when looking up a hazard. - When lowering an indirect call use CTR8 when targeting 64bit. - Introduce BCTR8 that uses CTR8 and use it on 64bit when expanding ISD::BRIND. The last change fixes PR8487. With those changes, we are able to compile a running "ls" and "sh" on FreeBSD/PowerPC64. llvm-svn: 132552	2011-06-03 15:47:49 +00:00
Eli Friedman	eae10d6163	Add ARM fast-isel support for materializing the address of a global in cases where the global uses an indirect symbol. rdar://9431157 llvm-svn: 132522	2011-06-03 01:13:19 +00:00
Devang Patel	1c30f3ac27	During post RA scheduling, do not try to chase reg defs. to preserve DBG_VALUEs. This approach has several downsides, for example, it does not work when dbg value is a constant integer, it does not work if reg is defined more than once, it places end of debug value range markers in the wrong place. It even causes misleading incorrect debug info when duplicate DBG_VALUE instructions point to same reg def. Instead, use simpler approach and let DBG_VALUE follow its predecessor instruction. After live debug value analysis pass, all DBG_VALUE instructions are placed at the right place. Thanks Jakob for the hint! llvm-svn: 132483	2011-06-02 20:07:12 +00:00
Rafael Espindola	2eab5458f6	Add test for PR10068. llvm-svn: 132482	2011-06-02 20:02:48 +00:00
Rafael Espindola	1299f014d4	Revert 132424 to fix PR10068. llvm-svn: 132479	2011-06-02 19:57:47 +00:00
Stuart Hastings	bf1b4a2e2e	Andy pointed out a dumb omission in this test case. Thanks Andy! llvm-svn: 132477	2011-06-02 19:26:49 +00:00
Stuart Hastings	af7e57f485	Jakob pointed out a dumb omission in this test case. Thanks Jakob! llvm-svn: 132472	2011-06-02 18:44:05 +00:00
Stuart Hastings	8447f18f85	Omit unnecessary stack copy when x87 input is a load. rdar://problem/6373334 llvm-svn: 132458	2011-06-02 15:57:11 +00:00
Stuart Hastings	cf5c3fdc33	Tweak testcase for ARM bot. rdar://problem/5993888 llvm-svn: 132454	2011-06-02 05:05:39 +00:00
Akira Hatanaka	1f91013bcb	Detect FI|cst pattern in MipsDAGToDAGISel::SelectAddr. Patch by Sasa Stankovic. llvm-svn: 132448	2011-06-02 01:03:14 +00:00
Akira Hatanaka	77501e89a7	Test case for r132444. llvm-svn: 132445	2011-06-02 00:25:53 +00:00
Devang Patel	1a3058d727	Do not drop constant values when a variable's content is described using .debug_loc entries. llvm-svn: 132427	2011-06-01 22:03:25 +00:00
Stuart Hastings	9a085fb9d8	Recommit 132404 with fixes. rdar://problem/5993888 llvm-svn: 132424	2011-06-01 21:33:14 +00:00
Eric Christopher	9fe91039e4	Allow bitcasts between valid types of the same size and vector types if the vector type is legal. Fixes rdar://9306086 llvm-svn: 132420	2011-06-01 19:55:10 +00:00
Stuart Hastings	4b33767382	Revert 132404 to appease a buildbot. rdar://problem/5993888 llvm-svn: 132419	2011-06-01 19:52:20 +00:00
Stuart Hastings	cd336a4ee0	Cleanup test case. rdar://problem/5660695 llvm-svn: 132408	2011-06-01 18:23:14 +00:00
Stuart Hastings	23f5ceda96	Add support for x86 CMPEQSS and friends. These instructions do a floating-point comparison, generate a mask of 0s or 1s, and generally DTRT with NaNs. Only profitable when the user wants a materialized 0 or 1 at runtime. rdar://problem/5993888 llvm-svn: 132404	2011-06-01 17:17:45 +00:00
Stuart Hastings	d81819d57b	A forthcoming SSE patch will break this test; since the test is also valid for x87, re-target to x87. rdar://problem/5993888 llvm-svn: 132401	2011-06-01 16:13:09 +00:00
Stuart Hastings	159bfd2d1c	Test case for 132396. rdar://problem/5660695 llvm-svn: 132399	2011-06-01 15:50:29 +00:00
Nadav Rotem	111ad2f6ce	This patch is another step in the direction of adding vector select. In this patch we add a flag to enable a new type legalization decision - to promote integer elements in vectors. Currently, the rest of the codegen does not support this kind of legalization. This flag will be removed when the transition is complete. llvm-svn: 132394	2011-06-01 12:51:46 +00:00
Richard Osborne	4293c93896	Add XCore intrinsic for crc8. llvm-svn: 132340	2011-05-31 16:24:49 +00:00
Richard Osborne	34a4652dcd	Add XCore intrinsic for crc32. llvm-svn: 132336	2011-05-31 14:47:36 +00:00
Richard Osborne	d84d3d1068	Convert test to FileCheck. llvm-svn: 132335	2011-05-31 14:00:05 +00:00
Bruno Cardoso Lopes	728ea362c3	This patch implements atomic intrinsics atomic.load.add (sub,and,or,xor, nand), atomic.swap and atomic.cmp.swap, all in i8, i16 and i32 versions. The intrinsics are implemented by creating pseudo-instructions, which are then expanded in the method MipsTargetLowering::EmitInstrWithCustomInserter. Patch by Sasa Stankovic. llvm-svn: 132323	2011-05-31 02:54:07 +00:00
Bruno Cardoso Lopes	f6fa29e7a1	This patch implements the thread local storage. Implemented are General Dynamic, Initial Exec and Local Exec TLS models. Patch by Sasa Stankovic llvm-svn: 132322	2011-05-31 02:53:58 +00:00
Rafael Espindola	33f7d7f9fa	Use the dwarf->llvm mapping to print register names in the cfi directives. Fixes PR9826. llvm-svn: 132317	2011-05-30 20:20:15 +00:00
Jakob Stoklund Olesen	49bf4dd965	Fix PR10046 by updating LiveVariables kill info when splitting live ranges. This only affects targets like Mips where branch instructions may kill virtual registers. Most other targets branch on flag values, so virtual registers are not involved. The problem is that MachineBasicBlock::updateTerminator deletes branches and inserts new ones while LiveVariables keeps a list of pointers to instructions that kill virtual registers. That list wasn't properly updated in MBB::SplitCriticalEdge. llvm-svn: 132298	2011-05-29 20:10:28 +00:00
John McCall	64ff21faa7	On Darwin ARM, set the UNWIND_RESUME libcall to _Unwind_SjLj_Resume. This is important for the correct lowering of unwind instructions (which doesn't matter at all) and llvm.eh.resume calls (which does). Take 2, now with more basic competence. llvm-svn: 132295	2011-05-29 19:50:32 +00:00
John McCall	ffdb2d5e70	I didn't mean to commit these residues of a personal project. llvm-svn: 132293	2011-05-29 19:41:56 +00:00
John McCall	46c7b963b2	On Darwin ARM, set the UNWIND_RESUME libcall to _Unwind_SjLj_Resume. This is important for the correct lowering of unwind instructions (which doesn't matter at all) and llvm.eh.resume calls (which does). llvm-svn: 132291	2011-05-29 19:39:04 +00:00
Bruno Cardoso Lopes	6d5e369a10	Add support for ARM ldrexd/strexd intrinsics. They both use i32 register pairs to load/store i64 values. Since there's no current support to explicitly declare such restrictions, implement it by using specific hardcoded register pairs during isel. llvm-svn: 132248	2011-05-28 04:07:29 +00:00
Eric Christopher	000dd7d0e6	Implement the 'M' output modifier for arm inline asm. This is fairly register allocation dependent and will occasionally break. WIP in the register allocator to model paired/etc registers. rdar://9119939 llvm-svn: 132242	2011-05-28 01:40:44 +00:00
Akira Hatanaka	1590e4eab1	Define a wrapper node for target constant nodes (tglobaladdr, etc.). Need this to prevent emitting illegal conditional move instructions. llvm-svn: 132240	2011-05-28 01:07:07 +00:00
Cameron Zwarich	cd3c1b5829	Fix the remaining atomic intrinsics to use the right register classes on Thumb2, and add some basic tests for them. llvm-svn: 132235	2011-05-27 23:54:00 +00:00
Eli Friedman	a64ba39381	Force a triple to make this test pass on Darwin. llvm-svn: 132228	2011-05-27 23:12:48 +00:00
Cameron Zwarich	ded03d4e24	Add a GR32_NOREX_NOSP register class and fix a bug where getMatchingSuperRegClass() was saying that the matching superregister class of GR32_NOREX in GR64_NOREX_NOSP is GR64_NOREX, which drops the NOSP constraint. This fixes PR10032. llvm-svn: 132225	2011-05-27 22:26:04 +00:00
Rafael Espindola	2230168a0f	Make size computation less brittle. llvm-svn: 132222	2011-05-27 22:05:41 +00:00
Jakob Stoklund Olesen	516eb93107	Make room for register allocation to improve. llvm-svn: 132213	2011-05-27 20:15:06 +00:00
Evan Cheng	0fcb465bab	Don't use movw / movt for iOS static codegen for now to workaround some tools issues. rdar://9514789 llvm-svn: 132211	2011-05-27 20:11:27 +00:00
Jakob Stoklund Olesen	fb206b98bd	Delete a test that is no longer relevant. According to PR2536, the old spiller had trouble with the IMPLICIT_DEF in this code: %reg1028<def> = MOV16rm %reg0, 1, %reg0, <ga:g_5>, Mem:LD(2,2) [g_5 + 0] %reg1039<def> = IMPLICIT_DEF %reg1038<def> = INSERT_SUBREG %reg1039, %reg1028, 2 %reg1025<def> = AND32ri %reg1038, 65534, %%EFLAGS<imp-def> However, today we emit a zero-extending load instead: %vreg10<def> = MOVZX32rm16 %noreg, 1, %noreg, <ga:@g_5>, %noreg; %mem:LD2[@g_5] GR32:%vreg10 %vreg0<def> = AND32ri %vreg10, 65534, %%EFLAGS<imp-def,dead>; %GR32:%vreg0,%vreg10 This makes the test pointless since it no longer creates the spiller hazard. llvm-svn: 132210	2011-05-27 20:02:42 +00:00
Evan Cheng	4192d53d1e	Add iOS test llvm-svn: 132203	2011-05-27 19:04:21 +00:00
Eli Friedman	55343ef7bb	And fix the test in r132194. llvm-svn: 132196	2011-05-27 18:14:28 +00:00
Eli Friedman	560532051b	Fix a silly mistake (which trips over an assertion) in r132099. rdar://9515076 llvm-svn: 132194	2011-05-27 18:02:04 +00:00
Devang Patel	62a7038a9f	Select DW_AT_const_value size based on variable size. llvm-svn: 132193	2011-05-27 16:45:18 +00:00
Cameron Zwarich	a9c418b1c3	Fix PR10029 - VerifyCoalescing failure on patterns_dfa.c of 445.gobmk. llvm-svn: 132181	2011-05-27 05:04:51 +00:00
Chad Rosier	b87c4a6945	Renamed llvm.x86.sse42.crc32 intrinsics; crc64 doesn't exist. crc32.[8|16|32] have been renamed to .crc32.32.[8|16|32] and crc64.[8|16|32] have been renamed to .crc32.64.[8|64]. llvm-svn: 132163	2011-05-26 23:13:19 +00:00
Devang Patel	e0b7ab9296	During branch folding avoid inserting redundant DBG_VALUE machine instructions. llvm-svn: 132148	2011-05-26 21:47:59 +00:00
Akira Hatanaka	5bfbea9ef2	Add support for C++ exception handling. llvm-svn: 132131	2011-05-26 18:59:03 +00:00
Eli Friedman	15dc009422	Fix test on Windows. llvm-svn: 132126	2011-05-26 18:00:32 +00:00
Stuart Hastings	837a958ff6	Reverting 132105: it broke some LLVM-GCC DejaGNU tests. llvm-svn: 132108	2011-05-26 04:09:49 +00:00
Stuart Hastings	e704bfb21e	Correctly handle a one-word struct passed byval on x86_64. rdar://problem/6920088 llvm-svn: 132105	2011-05-26 02:44:56 +00:00
Eli Friedman	93ffb875ad	Rewrite fast-isel integer cast handling to handle more cases, and to be simpler and more consistent. The practical effects here are that x86-64 fast-isel can now handle trunc from i8 to i1, and ARM fast-isel can handle many more constructs involving integers narrower than 32 bits (including loads, stores, and many integer casts). rdar://9437928 . llvm-svn: 132099	2011-05-25 23:49:02 +00:00
Akira Hatanaka	3f49cbeb37	Define WeakRefDirective. llvm-svn: 132098	2011-05-25 23:30:30 +00:00
Eric Christopher	807da21e47	Implement the 'm' modifier. Note that it only works for memory operands. Part of rdar://9119939 llvm-svn: 132081	2011-05-25 20:51:58 +00:00
Akira Hatanaka	32b5043265	Custom-lower FCOPYSIGN nodes. llvm-svn: 132074	2011-05-25 19:32:07 +00:00
Cameron Zwarich	beae5f20e8	Make tTAILJMPr/tTAILJMPrND emit a tBX without a preceding MOV of PC to LR. This fixes <rdar://problem/9495913> llvm-svn: 132042	2011-05-25 04:45:27 +00:00
Rafael Espindola	70213c7c5f	Replace the -unwind-tables option with a per function flag. This is more LTO friendly as we can now correctly merge files compiled with or without -fasynchronous-unwind-tables. llvm-svn: 132033	2011-05-25 03:44:17 +00:00
Akira Hatanaka	a5b11ee449	Fix lowering of DYNAMIC_STACKALLOC nodes. llvm-svn: 132030	2011-05-25 02:20:00 +00:00
Eric Christopher	4f193f9555	Implement the arm 'L' asm modifier. Part of rdar://9119939 llvm-svn: 132024	2011-05-24 23:27:13 +00:00
Eric Christopher	a6d7ccb170	Implement the immediate part of the 'B' modifier. Part of rdar://9119939 llvm-svn: 132023	2011-05-24 23:15:43 +00:00
Eric Christopher	03965fa3b6	Add support for the arm 'y' asm modifier. Fixes part of rdar://9444657 llvm-svn: 132011	2011-05-24 22:10:34 +00:00
Akira Hatanaka	4ef318f1e1	Test case for r132003. llvm-svn: 132005	2011-05-24 21:28:18 +00:00
Akira Hatanaka	21f003a8d4	Fix test case. llvm-svn: 131988	2011-05-24 19:37:15 +00:00
Akira Hatanaka	98951fd8b5	Revision 131986 test case. llvm-svn: 131987	2011-05-24 19:29:37 +00:00
Rafael Espindola	176fe6a0e0	Fix the defaults for .eh_frame. We were marking it as writable. llvm-svn: 131951	2011-05-24 02:50:20 +00:00
Evan Cheng	b5950697e8	- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is non-zero. - Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero. rdar://9490949 llvm-svn: 131948	2011-05-24 01:48:22 +00:00
Akira Hatanaka	5b696387f7	Add pattern for double-to-integer conversion. Patch by Sasa Stankovic. llvm-svn: 131927	2011-05-23 22:16:43 +00:00
Dan Gohman	e6a4a2aa6f	When checking for signed multiplication overflow, watch out for INT_MIN and -1. This fixes PR9845. llvm-svn: 131919	2011-05-23 21:07:39 +00:00
Akira Hatanaka	6ddbe02441	Change StackDirection from StackGrowsUp to StackGrowsDown. The following improvements are accomplished as a result of applying this patch: - Fixed frame objects' offsets (relative to either the virtual frame pointer or the stack pointer) are set before instruction selection is completed. There is no need to wait until Prologue/Epilogue Insertion is run to set them. - Calculation of final offsets of fixed frame objects is straightforward. It is no longer necessary to assign negative offsets to fixed objects for incoming arguments in order to distinguish them from the others. - Since a fixed object has its relative offset set during instruction selection, there is no need to conservatively set its alignment to 4. - It is no longer necessary to reorder non-fixed frame objects in MipsFrameLowering::adjustMipsStackFrame. llvm-svn: 131915	2011-05-23 20:16:59 +00:00
Devang Patel	ac809854cc	Test case for r131908. llvm-svn: 131909	2011-05-23 17:49:29 +00:00
Devang Patel	5920de5c8c	While replacing all uses of a SDValue with another value, do not forget to transfer SDDbgValue. llvm-svn: 131907	2011-05-23 17:35:08 +00:00
Cameron Zwarich	5a416bda73	Fix <rdar://problem/9476260> by having tail calls always generate 32-bit branches in Darwin Thumb2 code. Tail calls are already disabled on Thumb1. llvm-svn: 131894	2011-05-23 01:57:17 +00:00
Renato Golin	759db3cbe3	RTABI chapter 4.3.4 specifies __eabi_mem* calls. Specifically, __eabi_memset accepts parameters (ptr, size, value) in a different order than GNU's memset (ptr, value, size), therefore the special lowering in AAPCS mode. Implementation by Evzen Muller. llvm-svn: 131868	2011-05-22 21:41:23 +00:00
Benjamin Kramer	df3070e83d	Implement mulo x, 2 -> addo x, x in DAGCombiner. llvm-svn: 131800	2011-05-21 18:31:55 +00:00
Benjamin Kramer	e55e27c161	Merge and FileCheckize test cases. llvm-svn: 131799	2011-05-21 18:31:48 +00:00
Eli Friedman	dfd96ebe52	Add fast-isel support for byval calls on x86. llvm-svn: 131764	2011-05-20 22:21:04 +00:00
Stuart Hastings	e3158f93ec	Re-commit 131641 with fixes; de-pseudoize MOVSX16rr8 and friends. rdar://problem/8614450 llvm-svn: 131746	2011-05-20 19:04:40 +00:00
Akira Hatanaka	89801a318c	Make $fp and $ra callee-saved registers and let PrologEpilogInserter handle saving and restoring them. llvm-svn: 131745	2011-05-20 18:39:33 +00:00
Chad Rosier	fa78ec1a3e	Fixed regression due to commit 131709, which disables vararg tail call optimizations on Win64 llvm-svn: 131740	2011-05-20 17:49:39 +00:00
Benjamin Kramer	83096d1db1	Rename the "sandybridge" subtarget to "corei7-avx", for GCC compatibility. llvm-svn: 131730	2011-05-20 15:11:26 +00:00
Cameron Zwarich	a487989a73	Fix PR9960 by teaching SimpleRegisterCoalescing::AdjustCopiesBackFrom() to preserve the phikill flag. llvm-svn: 131717	2011-05-20 03:54:04 +00:00
Akira Hatanaka	9736ffe863	Fix bug in which nodes that write to argument registers do not get glued with the JALR node. Patch by Sasa Stankovic llvm-svn: 131714	2011-05-20 02:30:51 +00:00
Chad Rosier	a5f0bb3719	Don't attempt to tail call optimize for Win64. llvm-svn: 131709	2011-05-20 00:59:28 +00:00
Evan Cheng	a3f5204c82	Revert r131664 and fix it in instcombine instead. rdar://9467055 llvm-svn: 131708	2011-05-20 00:54:37 +00:00
Eli Friedman	ecdbb58b95	Add fast-isel support for zeroext and signext ret instructions on x86. llvm-svn: 131689	2011-05-19 22:16:13 +00:00
Eric Christopher	74a9e350d2	Oddly people want to use the 'r' constraint for fp constants on x86. Fixes rdar://9218925 Fixes PR9601 llvm-svn: 131682	2011-05-19 21:33:47 +00:00
Eli Friedman	97fda21a43	Fix up this test to use explicit triples (Win64 passes a different number of arguments in registers). llvm-svn: 131676	2011-05-19 21:13:08 +00:00
Akira Hatanaka	669a518bab	Align i64 arguments to 64 bit boundaries. llvm-svn: 131668	2011-05-19 20:29:48 +00:00
Evan Cheng	efcc06b08f	crc32 with 64-bit output zeros upper 32-bits. rdar://9467055 llvm-svn: 131664	2011-05-19 18:57:12 +00:00
Stuart Hastings	b524e73afc	Move test to Transforms/InstCombine. llvm-svn: 131634	2011-05-19 05:53:22 +00:00
Tanya Lattner	6814933ea6	Handle perfect shuffle case that generates a vrev for vectors of floats. Add test case. llvm-svn: 131582	2011-05-18 21:44:54 +00:00
Chad Rosier	be943c5d9a	Enables vararg functions that pass all arguments via registers to be optimized into tail-calls when possible. llvm-svn: 131560	2011-05-18 19:59:50 +00:00
Stuart Hastings	c77046f8d7	An imminent fix to the x86_64 byval logic will expose a flaw in the x86_64 sibcall logic. I've filed PR9943 for the sibcall problem, and this patch alters the testcase to work around the flaw. When PR9943 is fixed, this patch should be reverted. llvm-svn: 131557	2011-05-18 19:19:17 +00:00
Eli Friedman	0703b8a293	Force a triple on a couple of tests; we don't support fast-isel of ret on Win64. llvm-svn: 131540	2011-05-18 17:16:37 +00:00
Stuart Hastings	03a1927217	Merge pmovzx test case into existing file. llvm-svn: 131539	2011-05-18 17:02:04 +00:00
Justin Holewinski	eb209f0916	PTX: add flag to disable mad/fma selection Patch by Dan Bailey llvm-svn: 131537	2011-05-18 15:42:23 +00:00
Tanya Lattner	06cb9cbf98	In r131488 I misunderstood how VREV works. It splits the vector in half and splits each half. Therefore, the real problem was that we were using a VREV64 for a 4xi16, when we should have been using a VREV32. Updated test case and reverted change to the PerfectShuffle Table. llvm-svn: 131529	2011-05-18 06:42:21 +00:00
Eli Friedman	97233814a0	Make some of the fast-isel tests actually test fast-isel (and fix test failures). llvm-svn: 131510	2011-05-18 00:00:10 +00:00
Stuart Hastings	719cee1aa8	X86 pmovsx/pmovzx ignore the upper half of their inputs. rdar://problem/6945110 llvm-svn: 131493	2011-05-17 22:13:31 +00:00
Tanya Lattner	7145d69427	vrev is incorrectly defined in the perfect shuffle table. The ordering is backwards (should be 0x3210 versus 0x1032) which exposed a bug when doing a shuffle on a 4xi16. I've attached a test case. llvm-svn: 131488	2011-05-17 20:48:40 +00:00
Galina Kistanova	96a3ce9a6f	Move test to appropriate directory. llvm-svn: 131477	2011-05-17 19:06:43 +00:00
Eli Friedman	ba315a4fcc	Add x86 fast-isel for calls returning first-class aggregates. rdar://9435872. This is r131438 with a couple small fixes. llvm-svn: 131474	2011-05-17 18:29:03 +00:00
Eli Friedman	3aa2fe389f	Back out r131444 and r131438; they're breaking nightly tests. I'll look into it more tomorrow. llvm-svn: 131451	2011-05-17 02:36:59 +00:00
Eli Friedman	410094a937	Fix test. llvm-svn: 131444	2011-05-17 00:39:14 +00:00
Evan Cheng	24322e6f0a	Add target triple so test doesn't fail on Windows machines. llvm-svn: 131439	2011-05-17 00:15:58 +00:00
Eli Friedman	8c4de16d2b	Add x86 fast-isel for calls returning first-class aggregates. rdar://9435872. llvm-svn: 131438	2011-05-17 00:13:47 +00:00
Jakob Stoklund Olesen	16f11212fc	Teach LiveInterval::isZeroLength about null SlotIndexes. When instructions are deleted, they leave tombstone SlotIndex entries. The isZeroLength method should ignore these null indexes. This causes RABasic to sometimes spill a callee-saved register in the abi-isel.ll test, so don't run that test with -regalloc=basic. Prioritizing register allocation according to spill weight can cause more registers to be used. llvm-svn: 131436	2011-05-16 23:50:05 +00:00
Eli Friedman	23e7691f59	Remove dead code. Fix associated test to use FileCheck. llvm-svn: 131424	2011-05-16 21:28:22 +00:00
Eli Friedman	cb60e2293f	Make fast-isel work correctly with s/uadd.with.overflow intrinsics. llvm-svn: 131420	2011-05-16 21:06:17 +00:00
Eli Friedman	5f1b7e4153	Basic fast-isel of extractvalue. Not too helpful on its own, given the IR clang generates for cases like this, but it should become more useful soon. llvm-svn: 131417	2011-05-16 20:27:46 +00:00
Rafael Espindola	98372d430c	Don't produce a vmovntdq if we don't have AVX support. llvm-svn: 131330	2011-05-14 00:30:01 +00:00
Rafael Espindola	95d9ad78ea	Make codegen able to handle values of empty types. This is one way to fix PR9900. I will keep it open until sable is able to comment on it. llvm-svn: 131294	2011-05-13 15:18:06 +00:00
Stuart Hastings	5fb280fd39	Since I can't reproduce the failures from 131261, re-trying with a simplified version. <rdar://problem/9298790> llvm-svn: 131274	2011-05-13 00:51:54 +00:00
Stuart Hastings	b362a9bcc6	Revert 131266 and 131261 due to buildbot complaints. rdar://problem/9298790 llvm-svn: 131269	2011-05-13 00:15:17 +00:00
Stuart Hastings	002f9765c6	Tweak 131261 (thumb2-cbnz.ll) to generate the intended cbnz. rdar://problem/9298790 llvm-svn: 131266	2011-05-13 00:10:03 +00:00
Stuart Hastings	d106d72681	Non-fast-isel followup to 129634; correctly handle branches controlled by non-CMP expressions. The executable test case (129821) would test this as well, if we had an "-O0 -disable-arm-fast-isel" LLVM-GCC tester. Alas, the ARM assembly would be very difficult to check with FileCheck. The thumb2-cbnz.ll test is affected; it generates larger code (tst.w vs. cmp #0), but I believe the new version is correct. rdar://problem/9298790 llvm-svn: 131261	2011-05-12 23:36:41 +00:00
Galina Kistanova	f8f6de03c6	Correction. Use explicit target triple in the test. llvm-svn: 131252	2011-05-12 21:55:34 +00:00
Evan Cheng	f3eb9e3262	Re-enable branchfolding common code hoisting optimization. Fixed a liveness test bug and also taught it to update liveins. llvm-svn: 131241	2011-05-12 20:30:01 +00:00
Stuart Hastings	1d08224543	Move this test to CodeGen/Thumb. rdar://problem/9416774 llvm-svn: 131196	2011-05-11 19:41:28 +00:00
Devang Patel	344808fbe5	Identify end of prologue (and beginning of function body) using DW_LNS_set_prologue_end line table opcode. llvm-svn: 131194	2011-05-11 19:22:19 +00:00
Nadav Rotem	57dd315a3b	Fixes a bug in the DAGCombiner. LoadSDNodes have two values (data, chain). If there is a store after the load node, then there is a chain, which means that there is another user. Thus, asking hasOneUser would fail. Instead we ask hasNUsesOfValue on the 'data' value. llvm-svn: 131183	2011-05-11 14:40:50 +00:00
Nadav Rotem	2a654a69ed	Add custom lowering of X86 vector SRA/SRL/SHL when the shift amount is a splat vector. llvm-svn: 131179	2011-05-11 08:12:09 +00:00
Rafael Espindola	dfc30289f1	Revert 131172 as it is causing clang to miscompile itself. I will try to provide a reduced testcase. llvm-svn: 131176	2011-05-11 03:27:17 +00:00
Evan Cheng	271e0ebf0a	Add a late optimization to BranchFolding that hoists common instruction sequences at the start of basic blocks to their common predecessor. It's actually quite common (e.g. about 50 times in JM/lencod) and has been shown to be a nice code size benefit. e.g. pushq %rax testl %edi, %edi jne LBB0_2 ## BB#1: xorb %al, %al popq %rdx ret LBB0_2: xorb %al, %al callq _foo popq %rdx ret => pushq %rax xorb %al, %al testl %edi, %edi je LBB0_2 ## BB#1: callq _foo LBB0_2: popq %rdx ret rdar://9145558 llvm-svn: 131172	2011-05-11 01:03:01 +00:00
Rafael Espindola	46b0ce1b5f	Produce a __debug_frame section on darwin ARM when appropriate. llvm-svn: 131151	2011-05-10 21:04:45 +00:00
Justin Holewinski	4afbfa1796	PTX: add test cases for cvt, fneg, and selp Patch by Dan Bailey llvm-svn: 131128	2011-05-10 14:53:13 +00:00
Benjamin Kramer	ba7c9948e8	X86: Add a bunch of peeps for add and sub of SETB. "b + ((a < b) ? 1 : 0)" compiles into cmpl %esi, %edi adcl $0, %esi instead of cmpl %esi, %edi sbbl %eax, %eax andl $1, %eax addl %esi, %eax This saves a register, a false dependency on %eax (Intel's CPUs still don't ignore it) and it's shorter. llvm-svn: 131070	2011-05-08 18:36:07 +00:00
Jakob Stoklund Olesen	bb09bbccb8	Emit a proper error message when register allocators run out of registers. This can't be just an assertion, users can always write impossible inline assembly. Such an assembly statement should be included in the error message. llvm-svn: 131024	2011-05-06 21:58:30 +00:00
Justin Holewinski	e4a7007565	PTX: add PTX 2.3 language target Patch by Wei-Ren Chen llvm-svn: 130980	2011-05-06 11:40:36 +00:00
Eli Friedman	f7b4d848ae	Re-revert r130877; it's apparently causing a regression on 197.parser, possibly related to cbnz formation. llvm-svn: 130977	2011-05-06 05:23:07 +00:00
Rafael Espindola	ab39b8319b	Don't produce a __debug_frame. I tested both gdb on a bootstrapped clang and the gdb testsuite on OS X (snow leopard) and both are happy using __eh_frame. llvm-svn: 130937	2011-05-05 18:43:39 +00:00
Eli Friedman	09ec41fcde	Avoid extra vreg copies for arguments passed in registers. Specifically, this can make MachineCSE more effective in some cases (especially in small functions). PR8361 / part of rdar://problem/8259436 . llvm-svn: 130928	2011-05-05 16:53:34 +00:00
Jakob Stoklund Olesen	f27731bf40	Prepare remaining tests for -join-physregs going away. llvm-svn: 130893	2011-05-04 23:54:59 +00:00
Jakob Stoklund Olesen	e964058440	Fix a batch of x86 tests to be coalescer independent. Most of these tests require a single mov instruction that can come either before or after a 2-addr instruction. -join-physregs changes the behavior, but the results are equivalent. llvm-svn: 130891	2011-05-04 23:54:51 +00:00
Dan Gohman	62dbd536c0	Give this test an explicit register allocator, so that it can work even if the default register allocator is changed. llvm-svn: 130883	2011-05-04 23:14:02 +00:00
Bill Wendling	279e17e523	SjLj EH could produce a machine basic block that legitimately has more than one landing pad as its successor. SjLj exception handling jumps to the correct landing pad via a switch statement that's generated right before code-gen. Loosen the constraint in the machine instruction verifier to allow for this. Note, this isn't the most rigorous check since we cannot determine where that switch statement came from. But it's marginally better than turning this check off when SjLj exceptions are used. <rdar://problem/9187612> llvm-svn: 130881	2011-05-04 22:54:05 +00:00
Eli Friedman	5b78092546	Re-commit r130862 with a minor change to avoid an iterator running off the edge in some cases. Original message: Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130877	2011-05-04 22:10:36 +00:00
Galina Kistanova	3b29721cf4	This test fails on ARM. The test shouldn't explicitly specify alignment (and alignment 4 is wrong) and requires hard-float. llvm-svn: 130875	2011-05-04 21:57:44 +00:00
Eli Friedman	cc74616be6	Back out r130862; it appears to be breaking bootstrap. llvm-svn: 130867	2011-05-04 20:48:42 +00:00
Eli Friedman	e086e00208	Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130862	2011-05-04 19:54:24 +00:00
Jakob Stoklund Olesen	5ea6203ea6	Fix more register and coalescing dependencies. llvm-svn: 130859	2011-05-04 19:02:11 +00:00
Jakob Stoklund Olesen	780e6d1f64	Explicitly request physreg coalescing for a bunch of Thumb2 unit tests. These tests all follow the same pattern: mov r2, r0 movs r0, #0 $CMP r2, r1 it eq moveq r0, #1 bx lr The first 'mov' can be eliminated by rematerializing 'movs r0, #0' below the test instruction: $CMP r0, r1 mov.w r0, #0 it eq moveq r0, #1 bx lr So far, only physreg coalescing can do that. The register allocators won't yet split live ranges just to eliminate copies. They can learn, but this particular problem is not likely to show up in real code. It only appears because r0 is used for both the function argument and return value. llvm-svn: 130858	2011-05-04 19:02:07 +00:00
Jakob Stoklund Olesen	b85b71f4de	FileCheckize and break dependence on coalescing order. llvm-svn: 130856	2011-05-04 19:02:01 +00:00
Jakob Stoklund Olesen	8a075ce7ea	Explicitly request -join-physregs for some tests that depend on it. llvm-svn: 130855	2011-05-04 19:01:59 +00:00
Devang Patel	8823e24dde	Do not emit location expression size twice. llvm-svn: 130854	2011-05-04 19:00:57 +00:00
Akira Hatanaka	8ab44cecb4	Remove LLVM IR metadata in test case committed in r130847. llvm-svn: 130849	2011-05-04 18:28:36 +00:00
Akira Hatanaka	e9a2d2a78f	Prevent instructions using $gp from being placed between a jalr and the instruction that restores the clobbered $gp. llvm-svn: 130847	2011-05-04 17:54:27 +00:00
Jakob Stoklund Olesen	4d020cd8e5	Don't depend on the physreg coalescing order. llvm-svn: 130818	2011-05-04 01:01:47 +00:00
Jakob Stoklund Olesen	307c86d1ee	Don't run this test through -regalloc=basic. The basic allocator is really bad about hinting, so it doesn't eliminate all copies when physreg joining is disabled. llvm-svn: 130817	2011-05-04 01:01:44 +00:00
Jakob Stoklund Olesen	20af0b593b	Fix register-dependent XCore tests llvm-svn: 130816	2011-05-04 01:01:41 +00:00
Jakob Stoklund Olesen	55e2c5ec6e	Fix register-dependent test in MSP430. llvm-svn: 130815	2011-05-04 01:01:39 +00:00

5046 Commits