llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 20:43:44 +02:00

Author	SHA1	Message	Date
Hal Finkel	7e324aee83	Implement builtin_{setjmp/longjmp} on PPC This implements SJLJ lowering on PPC, making the Clang functions __builtin_{setjmp/longjmp} functional on PPC platforms. The implementation strategy is similar to that on X86, with the exception that a branch-and-link variant is used to get the right jump address. Credit goes to Bill Schmidt for suggesting the use of the unconditional bcl form (instead of the regular bl instruction) to limit return-address-cache pollution. Benchmarking the speed at -O3 of: static jmp_buf env_sigill; void foo() { __builtin_longjmp(env_sigill,1); } main() { ... for (int i = 0; i < c; ++i) { if (__builtin_setjmp(env_sigill)) { goto done; } else { foo(); } done:; } ... } vs. the same code using the libc setjmp/longjmp functions on a P7 shows that this builtin implementation is ~4x faster with Altivec enabled and ~7.25x faster with Altivec disabled. This comparison is somewhat unfair because the libc version must also save/restore the VSX registers which we don't yet support. llvm-svn: 177666	2013-03-21 21:37:52 +00:00
Renato Golin	1fca3efc0b	Fix Darwin NEON FP and increase coverage llvm-svn: 177664	2013-03-21 21:30:49 +00:00
David Blaikie	67c9dc82dc	Remove unused field in DISubprogram llvm-svn: 177661	2013-03-21 20:28:52 +00:00
Hal Finkel	7e6dc78317	Add support for spilling VRSAVE on PPC Although there is only one Altivec VRSAVE register, it is a member of a register class, and we need the ability to spill it. Because this register is normally callee-preserved and handled by special code this has never before been necessary. However, this capability will be required by a forthcoming commit adding SjLj support. llvm-svn: 177654	2013-03-21 19:03:21 +00:00
Hal Finkel	2043b2adae	Correct PPC FRAMEADDR lowering using a pseudo-register The old code used to lower FRAMEADDR tried to replicate the logic in the real frame-lowering code that determines whether or not the frame pointer (r31) will be used. When it seemed as though the frame pointer would not be used, the stack pointer (r1) was used instead. Unfortunately, because the stack size is not yet known, this does not work. Instead, this change introduces new always-reserved pseudo-registers (FP and FP8) that are replaced during prologue insertion with the real frame-pointer register (either r1 or r31). It is important that this intrinsic always return a valid frame address because it is used by Clang to store the frame address as part of code generation for __builtin_setjmp. llvm-svn: 177653	2013-03-21 19:03:19 +00:00
Renato Golin	0854fd9bef	Avoid NEON SP-FP unless unsafe-math or Darwin NEON is not IEEE 754 compliant, so we should avoid lowering single-precision floating point operations with NEON unless unsafe-math is turned on. The equivalent VFP instructions are IEEE 754 compliant, but in some cores they're much slower, so some archs/OSs might still request it to be on by default, such as Swift and Darwin. llvm-svn: 177651	2013-03-21 18:47:47 +00:00
David Blaikie	b2d35852ea	Debug info: refactor the first field of DICompileUnit to be a raw file/directory pair This removes the DICompileUnit special case from DIScope. llvm-svn: 177610	2013-03-20 23:58:12 +00:00
Nadav Rotem	3a9f2d7de8	When computing the demanded bits of Load SDNodes, make sure that we are looking at the loaded-value operand and not the ptr result (in case of pre-inc loads). rdar://13348420 llvm-svn: 177596	2013-03-20 22:53:44 +00:00
David Blaikie	78d3bdea74	Debug Info: Swap the 2nd and 3rd parameters to DICompileUnit to match the common DIScope prefix llvm-svn: 177595	2013-03-20 22:52:54 +00:00
David Blaikie	30abbc718f	Remove unused field in DICompileUnit llvm-svn: 177590	2013-03-20 22:34:33 +00:00
Hao Liu	00e59a535f	Add a test case for PR15318 fixed in r177472 llvm-svn: 177489	2013-03-20 06:18:06 +00:00
Michael Liao	d0e167edfb	Fix PR15296 - Move SRA/SRL/SHL lowering support from DAG combination to DAG lowering to support extended 256-bit integer in AVX but not AVX2. llvm-svn: 177478	2013-03-20 02:33:21 +00:00
David Blaikie	b9f490e28c	Refactor the DIFile (2nd) parameter to DITypes to be an MDNode reference to a raw directory/file pair This makes DIType's first non-tag parameter the same as DIFile's, allowing them to both share the common implementation of getFilename/getDirectory in DIScope. llvm-svn: 177467	2013-03-20 00:26:26 +00:00
Justin Holewinski	d1c0859c87	Propagate DAG node ordering during type legalization and instruction selection A node's ordering is only propagated during legalization if (a) the new node does not have an ordering (is not a CSE'd node), or (b) the new node has an ordering that is higher than the node being legalized. llvm-svn: 177465	2013-03-20 00:10:32 +00:00
David Blaikie	dd2f7e5b88	Move the DIFile operand to DITypes from the 4th operand to the 2nd. This is another step along the way to making all DIScopes have a common prefix which can be added to in a general manner to support using directives (DW_TAG_imported_module). llvm-svn: 177462	2013-03-19 23:25:22 +00:00
Hal Finkel	6c0ef5bcb5	Add a comment to the CodeGen/PowerPC/asym-regclass-copy.ll test llvm-svn: 177434	2013-03-19 20:22:32 +00:00
Ulrich Weigand	d5787350ad	Rewrite pre-increment store patterns to use standard memory operands. Currently, pre-increment store patterns are written to use two separate operands to represent address base and displacement: stwu $rS, $ptroff($ptrreg) This causes problems when implementing the assembler parser, so this commit changes the patterns to use standard (complex) memory operands like in all other memory access instruction patterns: stwu $rS, $dst To still match those instructions against the appropriate pre_store SelectionDAG nodes, the patch uses the new feature that allows a Pat to match multiple DAG operands against a single (complex) instruction operand. Approved by Hal Finkel. llvm-svn: 177429	2013-03-19 19:52:04 +00:00
Hal Finkel	08d0f0125c	Prepare to make r0 an allocatable register on PPC Currently the PPC r0 register is unconditionally reserved. There are two reasons for this: 1. r0 is treated specially (as the constant 0) by certain instructions, and so cannot be used with those instructions as a regular register. 2. r0 is used as a temporary register in the CR-register spilling process (where, under some circumstances, we require two GPRs). This change addresses the first reason by introducing a restricted register class (without r0) for use by those instructions that treat r0 specially. These register classes have a new pseudo-register, ZERO, which represents the r0-as-0 use. This has the side benefit of making the existing target code simpler (and easier to understand), and will make it clear to the register allocator that uses of r0 as 0 don't conflict with real uses of the r0 register. Once the CR spilling code is improved, we'll be able to allocate r0. Adding these extra register classes, for some reason unclear to me, causes requests to the target to copy 32-bit registers to 64-bit registers. The resulting code seems correct (and causes no test-suite failures), and the new test case covers this new kind of asymmetric copy. As r0 is still reserved, no functionality change intended. llvm-svn: 177423	2013-03-19 18:51:05 +00:00
Nadav Rotem	317ff20b46	Optimize sext <4 x i8> and <4 x i16> to <4 x i64>. Patch by Ahmad, Muhammad T <muhammad.t.ahmad@intel.com> llvm-svn: 177421	2013-03-19 18:38:27 +00:00
Hal Finkel	b4208059c6	Cleanup PPC64 unaligned i64 load/store Remove an accidentally-added instruction definition and add a comment in the test case. This is in response to a post-commit review by Bill Schmidt. No functionality change intended. llvm-svn: 177404	2013-03-19 15:23:39 +00:00
Renato Golin	6d0295565e	Improve long vector sext/zext lowering on ARM The ARM backend currently has poor codegen for long sext/zext operations, such as v8i8 -> v8i32. This patch addresses this by performing a custom expansion in ARMISelLowering. It also adds/changes the cost of such lowering in ARMTTI. This partially addresses PR14867. Patch by Pete Couperus llvm-svn: 177380	2013-03-19 08:15:38 +00:00
Hal Finkel	5fd6394c16	Don't reserve R31 on PPC64 unless the frame pointer is needed llvm-svn: 177379	2013-03-19 08:09:38 +00:00
Hal Finkel	b4a799cf7e	Fix a sign-extension bug in PPCCTRLoops Don't sign extend the immediate value from the OR instruction in an LIS/OR pair. llvm-svn: 177361	2013-03-18 23:58:28 +00:00
Hal Finkel	42f72e7756	Fix PPC unaligned 64-bit loads and stores PPC64 supports unaligned loads and stores of 64-bit values, but in order to use the r+i forms, the offset must be a multiple of 4. Unfortunately, this cannot always be determined by examining the immediate itself because it might be available only via a TOC entry. In order to get around this issue, we additionally predicate the selection of the r+i form on the alignment of the load or store (forcing it to be at least 4 in order to select the r+i form). llvm-svn: 177338	2013-03-18 23:00:58 +00:00
Quentin Colombet	bb36556d97	Extend global merge pass to optionally consider global constant variables. Also add some checks to not merge globals used within landing pad instructions or marked as "used". llvm-svn: 177331	2013-03-18 22:30:07 +00:00
Bill Schmidt	532eac0ca2	Change test cases to handle unaligned references. Hal Finkel recently added code to allow unaligned memory references for PowerPC. Two tests were temporarily modified with -disable-ppc-unaligned to keep them from failing. This patch adjusts the expected code generation for the unaligned references. llvm-svn: 177328	2013-03-18 22:12:04 +00:00
David Blaikie	928fd30ba7	Remove unnecessary leading comment characters in lit-only file llvm-svn: 177327	2013-03-18 22:08:16 +00:00
David Blaikie	ae14af22c5	Include '.test' suffix in target specific lit configs that need it Apparently my final cleanup to use a relevant suffix for these tests before committing r176831 caused them to stop running since lit wasn't configured to run tests with that suffix in those directories (why don't we just have a global suffix list?). So, add the suffix to the relevant directories & fix the test that has bitrotted over the last week due to my debug info schema changes. llvm-svn: 177315	2013-03-18 20:31:44 +00:00
Hal Finkel	ad2997da12	Fix large count and negative constant count handling in PPCCTRLoops This commit fixes an assert that would occur on loops with large constant counts (like looping for ((uint32_t) -1) iterations on PPC64). The existing code did not handle counts that it computed to be negative (asserting instead), but these can be created with valid inputs. This bug was discovered by bugpoint while I was attempting to isolate a completely different problem. Also, in writing test cases for the negative-count problem, I discovered that the ori/lis handling was broken (there was a typo which caused the logic that was supposed to detect these pairs and extract the iteration count to always fail). This has now also been corrected (and is covered by one of the new test cases). llvm-svn: 177295	2013-03-18 17:40:44 +00:00
Hal Finkel	2ab64cdbb2	Cleanup initial-value constants in PPCCTRLoops Because the initial-value constants had not been added to the list of instructions considered for DCE the resulting code had redundant constant-materialization instructions. llvm-svn: 177294	2013-03-18 17:40:27 +00:00
David Blaikie	3193e0599a	Split out filename & directory from DIFile to start generalizing over DIScopes This is the first step to making all DIScopes have a common metadata prefix (so that things (using directives, for example) that can appear in any scope can be added to that common prefix). DIFile is itself a DIScope so the common prefix of all DIScopes cannot be a DIFile - instead it's the raw filename/directory name pair. llvm-svn: 177239	2013-03-17 21:13:55 +00:00
Hal Finkel	54a73d3443	Improve PPC VR (Altivec) register spilling This change cleans up two issues with Altivec register spilling: 1. The spilling code was inefficient (using two instructions, an add and a load, when just one would do) 2. The code assumed that r0 would always be available (true for now, but this will change) The new code handles VR spilling just like GPR spills but forced into r+r mode. As a result, when any VR spills are present, we must now always allocate the register-scavenger spill slot. llvm-svn: 177231	2013-03-17 04:43:44 +00:00
Hal Finkel	e729872345	Remove FIXMEs in PPC test cases related to unaligned loads/stores As pointed out by Bill in response to r177160, these two FIXMEs can also be removed. llvm-svn: 177229	2013-03-16 23:02:31 +00:00
Craig Topper	bcf2bc336a	Add X86 code emitter support for AVX encoded MRMDestReg instructions. Previously we weren't skipping the VVVV encoded register. Based on patch by Michael Liao. llvm-svn: 177221	2013-03-16 03:44:31 +00:00
Arnold Schwaighofer	c83f5b493e	ARM cost model: Fix costs for some vector selects I was too pessimistic in r177105. Vector selects that fit into a legal register type lower just fine. I was misled by the code fragment that I was using. The stores/loads that I saw in those cases came from lowering the conditional off an address. Changing the code fragment to: %T0_3 = type <8 x i18> %T1_3 = type <8 x i1> define void @func_blend3(%T0_3* %loadaddr, %T0_3* %loadaddr2, %T1_3* %blend, %T0_3* %storeaddr) { %v0 = load %T0_3* %loadaddr %v1 = load %T0_3* %loadaddr2 ==> FROM: ;%c = load %T1_3* %blend ==> TO: %c = icmp slt %T0_3 %v0, %v1 ==> USE: %r = select %T1_3 %c, %T0_3 %v0, %T0_3 %v1 store %T0_3 %r, %T0_3* %storeaddr ret void } revealed this mistake. radar://13403975 llvm-svn: 177170	2013-03-15 18:31:01 +00:00
Silviu Baranga	ff316abe9d	Adding an A15 specific optimization pass for interactions between S/D/Q registers. The pass handles all the required transformations pre-regalloc. llvm-svn: 177169	2013-03-15 18:28:25 +00:00
Benjamin Kramer	2294ee0960	ARM: Fix an old refacto. Fixes PR15520. llvm-svn: 177167	2013-03-15 17:27:39 +00:00
Hal Finkel	a5a86f0a8e	Enable unaligned memory access on PPC for scalar types Unaligned access is supported on PPC for non-vector types, and is generally more efficient than manually expanding the loads and stores. A few of the existing test cases were using expanded unaligned loads and stores to test other features (like load/store with update), and for these test cases, unaligned access remains disabled. llvm-svn: 177160	2013-03-15 15:27:13 +00:00
Hal Finkel	2ecb85412e	Protect PPC Altivec patterns with a predicate In preparation for the addition of other SIMD ISA extensions (such as QPX) we need to make sure that all Altivec patterns are properly predicated on having Altivec support. No functionality change intended (one test case needed to be updated b/c it assumed that Altivec intrinsics would be supported without enabling Altivec support). llvm-svn: 177152	2013-03-15 13:21:21 +00:00
Hal Finkel	503a3723d1	Allocate the RS spill slot for any PPC function with spills and a large stack frame For spills into a large stack frame, the FI-elimination code uses the register scavenger to obtain a free GPR for use with an r+r-addressed load or store. When there are no available GPRs, the scavenger gets one by using its spill slot. Previously, we were not always allocating that spill slot and the RS would assert when the spill slot was needed. I don't currently have a small test that triggered the assert, but I've created a small regression test that verifies that the spill slot is now added when the stack frame is sufficiently large. llvm-svn: 177140	2013-03-15 05:06:04 +00:00
Nadav Rotem	6500dc0dd2	Add a triple to the test. llvm-svn: 177131	2013-03-15 00:10:23 +00:00
Nadav Rotem	03b60b8657	Unaligned loads should use the VMOVUPS opcode. llvm-svn: 177130	2013-03-14 23:49:44 +00:00
Chad Rosier	aca0d2a5a0	[fast-isel] The X86FastISel::FastLowerArguments function doesn't properly handle the win64 calling convention. rdar://13423768 llvm-svn: 177113	2013-03-14 21:25:04 +00:00
Hal Finkel	37a5522734	Not all PPC functions with a frame pointer need a RS spill slot We used to add a spill slot for the register scavenger whenever the function has a frame pointer. This is unnecessarily conservative: We may need the spill slot for dynamic stack allocations, and functions with dynamic stack allocations always have a FP, but we might also have a FP for other reasons (such as the user explicitly disabling frame-pointer elimination), and we don't necessarily need a spill slot for those functions. The structsinregs test needed adjustment because it disables FP elimination. llvm-svn: 177106	2013-03-14 19:34:32 +00:00
Arnold Schwaighofer	63a59d3be8	ARM cost model: Increase cost of some vector selects we do terrible on By terrible I mean we store/load from the stack. This matters on PAQp8 in _Z5trainPsS_ii (which is inlined into Mixer::update) where we decide to vectorize a loop with a VF of 8 resulting in a 25% degradation on a cortex-a8. LV: Found an estimated cost of 2 for VF 8 For instruction: icmp slt i32 LV: Found an estimated cost of 2 for VF 8 For instruction: select i1, i32, i32 The bug that tracks the CodeGen part is PR14868. radar://13403975 llvm-svn: 177105	2013-03-14 19:17:02 +00:00
Jyotsna Verma	f2d3c71cf4	Hexagon: Removed asserts regarding alignment and offset. We are warning the user about the alignment, so we should not assert. llvm-svn: 177103	2013-03-14 19:08:03 +00:00
Vincent Lejeune	cd12dadb5c	R600: Factorize code handling Const Read Port limitation llvm-svn: 177078	2013-03-14 15:50:45 +00:00
Michael Liao	89d165e673	Fix PR15309 - Fix the typo on type checking llvm-svn: 177010	2013-03-14 06:57:42 +00:00
Jiong Wang	f4d5a4cd79	test commit: remove blank line. llvm-svn: 177009	2013-03-14 05:43:59 +00:00
David Blaikie	27dd933a64	Simplify file/directory name handling in DILexicalBlock llvm-svn: 176993	2013-03-13 22:52:59 +00:00
David Blaikie	127d79d573	Remove the unused 4th operand for DIFile debug info metadata llvm-svn: 176983	2013-03-13 22:05:21 +00:00
Arnold Schwaighofer	3294ca42bf	ARM cost model: Add test case to make sure we would notice a change in CodeGen In r176898 I updated the cost model to reflect the fact that sext/zext/cast on v8i32 <-> v8i8 and v16i32 <-> v16i8 are expensive. This test case is so that we make sure to update the cost model once we fix CodeGen. llvm-svn: 176955	2013-03-13 16:25:55 +00:00
David Blaikie	3c701e7671	Refactor filename/directory in DICompileUnit into a DIFile This is the next step towards making the metadata for DIScopes have a common prefix rather than having to delegate based on their tag type. llvm-svn: 176913	2013-03-13 00:01:35 +00:00
David Blaikie	98d9ccffb8	Remove unused "isMain" field from DICompileUnit llvm-svn: 176910	2013-03-12 22:43:04 +00:00
David Blaikie	c37a0a822a	Update debug info test cases with empty SplitDebugFilename field. This could be 'null' or the empty string, DIDescriptor::getStringField coalesces the two cases anyway so it's just a matter of legible/efficient representation. The change in behavior of the DICompileUnit::get* functions could be subsumed by the full verification check - but ideally that should just be an assertion if we could front-load the actual debug info metadata failure paths. llvm-svn: 176907	2013-03-12 22:25:36 +00:00
Jan Wen Voung	74d9647d18	Revert the test moves from 176733. Use "REQUIRES: asserts" instead. llvm-svn: 176873	2013-03-12 16:27:52 +00:00
Hal Finkel	3edf100dda	Don't reserve R2 on Darwin/PPC Now that only the register-scavenger version of the CR spilling code remains, we no longer need the Darwin R2 hack. Darwin can use R0 as a spare register in any case where the System V ABI uses it (R0 is special architecturally, and so is reserved under all common ABIs). A few test cases needed to be updated to reflect the register-allocation changes. llvm-svn: 176868	2013-03-12 15:18:14 +00:00
NAKAMURA Takumi	7bb11c0e1f	llvm/test/CodeGen/R600/schedule-*.ll: Let them require +Asserts. llvm-svn: 176835	2013-03-11 23:16:30 +00:00
David Blaikie	00170a5a62	Upgrading debug info test cases to be (more) compatible with the current debug info format. These cases were found by further work to remove support for debug info versioning. Common cleanups (other than changing the version info in the tag field) included adding the last parameter to compile_units (recently added for fission support) and other cases of trailing fields in lexical blocks, compile units, and subprograms. llvm-svn: 176834	2013-03-11 22:37:40 +00:00
David Blaikie	b8d3b70835	Remove duplicate test contents. llvm-svn: 176831	2013-03-11 22:10:14 +00:00
Nick Lewycky	9bfa310e10	Fix a crasher newly introduced in r176659/r176649, where fast-isel tries to lower an expect intrinsic that is a constant expression. llvm-svn: 176830	2013-03-11 21:44:37 +00:00
Vincent Lejeune	712c6f4f44	R600: Fix JUMP handling so that MachineInstr verification can occur This allows R600 Target to use the newly created -verify-misched llc flag llvm-svn: 176819	2013-03-11 18:15:06 +00:00
NAKAMURA Takumi	6b526cc81f	llvm/test/CodeGen/X86/handle-move.ll: Mark it as XFAIL:cygming. Investigating. llvm-svn: 176808	2013-03-11 16:30:26 +00:00
NAKAMURA Takumi	32506912d9	Suppress atomic(32|64).ll as XFAIL on win32 codegen. Investigating. llvm-svn: 176798	2013-03-11 08:39:48 +00:00
Lang Hames	57a19e2cc0	Remove date from test case file name. The PR number provides a unique ID already. llvm-svn: 176796	2013-03-11 03:49:23 +00:00
Lang Hames	aed76c2308	Don't glue users to extract_subreg when selecting the llvm.arm.ldrexd intrinsic - it can cause impossible-to-schedule subgraphs to be introduced. PR15053. llvm-svn: 176777	2013-03-09 22:56:09 +00:00
Benjamin Kramer	202c1b8357	Test case hygiene. llvm-svn: 176772	2013-03-09 18:25:40 +00:00
Jan Wen Voung	2346df4d41	Disable statistics on Release builds and move tests that depend on -stats. Summary: Statistics are still available in Release+Asserts (any +Asserts builds), and stats can also be turned on with LLVM_ENABLE_STATS. Move some of the FastISel stats that were moved under DEBUG() back out of DEBUG(), since stats are disabled across the board now. Many tests depend on grepping "-stats" output. Move those into a orig_dir/Stats/. so that they can be marked as unsupported when building without statistics. Differential Revision: http://llvm-reviews.chandlerc.com/D486 llvm-svn: 176733	2013-03-08 22:56:31 +00:00
Jakob Stoklund Olesen	7528b1268b	Rewrite the physreg part of findLastUseBefore(). To find the last use of a register unit, start from the bottom and scan upwards until a user is found. <rdar://problem/13353090> llvm-svn: 176706	2013-03-08 18:08:57 +00:00
Tom Stellard	963bae7608	R600: Optimize another selectcc case fold selectcc (selectcc x, y, a, b, cc), b, a, b, setne -> selectcc x, y, a, b, cc Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176700	2013-03-08 15:37:11 +00:00
Tom Stellard	2eae31f648	R600: Improve custom lowering of select_cc Two changes: 1. Prefer SET* instructions when possible 2. Handle the CND*_INT case with floating-point args Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176699	2013-03-08 15:37:09 +00:00
Tom Stellard	3f88348d66	R600: Change operation action from Custom to Expand for BR_CC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176698	2013-03-08 15:37:07 +00:00
Tom Stellard	54e0b366e8	R600: Change operation action from Custom to Expand for SETCC Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 176697	2013-03-08 15:37:05 +00:00
Tom Stellard	3cb97749ac	LegalizeDAG: Respect the result of TLI.getBooleanContents() when expanding SETCC llvm-svn: 176695	2013-03-08 15:37:02 +00:00
Vincent Lejeune	7cd72eed68	R600: Change addresspace in fold-kcache.ll AddressSpace definition has changed in a previous commit, reflect it to avoid false failure. llvm-svn: 176693	2013-03-08 15:34:07 +00:00
Tim Northover	35bab190c4	AArch64: specify full triple in test as only Linux works for now. llvm-svn: 176692	2013-03-08 15:27:30 +00:00
Christian Konig	dcd30c46b9	R600/SI: adjust test to recent changes Signed-off-by: Christian König <christian.koenig@amd.com> llvm-svn: 176691	2013-03-08 14:44:00 +00:00
Jyotsna Verma	6ba1f8e0d8	Hexagon: Add patterns for zero extended loads from i1->i64. llvm-svn: 176689	2013-03-08 14:15:15 +00:00
Tim Northover	215762f13b	AArch64: expand sincos operations, we don't support them. Patch based on Mans Rullgard's. llvm-svn: 176688	2013-03-08 13:55:07 +00:00
David Blaikie	8873f284b2	Another test fix for r176671. llvm-svn: 176679	2013-03-08 02:27:40 +00:00
David Blaikie	c1ad732102	Couple of test fixes for r176671. Not sure why these aren't failing on my linux machine, but this should cover it. llvm-svn: 176678	2013-03-08 02:26:16 +00:00
Bill Wendling	8c7ceb2a0e	Revert r176154 in favor of a better approach. Code generation makes some basic assumptions about the IR it's been given. In particular, if there is only one 'invoke' in the function, then that invoke won't be going away. However, with the advent of the `llvm.donothing' intrinsic, those invokes may go away. If all of them go away, the landing pad no longer has any users. This confuses the back-end, which asserts. This happens with SjLj exceptions, because that's the model that modifies the IR based on there being invokes, etc. in the function. Remove any invokes of `llvm.donothing' during SjLj EH preparation. This will give us a CFG that the back-end won't be confused about. If all of the invokes in a function are removed, then the SjLj EH prepare pass won't insert the bogus code that relies upon the invokes being there. <rdar://problem/13228754&13316637> llvm-svn: 176677	2013-03-08 02:21:08 +00:00
David Blaikie	41f29ff448	Upgrade tests to the latest debug info format. Mostly this is just changing the named metadata (llvm.dbg.sp, llvm.dbg.gv, llvm.dbg.<func>.lv, etc -> llvm.dbg.cu), adding a few fields to older records (DIVariable: flags/inlined-at, DICompileUnit: sp/gv/types, DISubprogram: local variables list) The tests to update were discovered by a change I'm working on to remove debug info version support - so any tests using old debug info versions I haven't updated probably are bad tests or just not actually designed to test debug info. llvm-svn: 176671	2013-03-08 00:23:31 +00:00
Chad Rosier	bd6edf2054	[fast-isel] Add support for the expect intrinsic. rdar://13370942 llvm-svn: 176649	2013-03-07 20:42:17 +00:00
Jyotsna Verma	50e5f420d7	Hexagon: Handle i8, i16 and i1 Var Args. llvm-svn: 176647	2013-03-07 20:28:34 +00:00
Jyotsna Verma	c3d8f08545	Hexagon: Add support to lower block address. llvm-svn: 176637	2013-03-07 19:10:28 +00:00
Benjamin Kramer	a99a00f2bf	Move testcase, this is testing extraction not inserting. llvm-svn: 176635	2013-03-07 18:51:02 +00:00
Benjamin Kramer	d2f85ae895	X86: Fold EXTRACT_SUBVECTORs of a BUILD_VECTOR into a smaller BUILD_VECTOR. That can usually be lowered efficiently and is common in sandybridge code. It would be nice to do this in DAGCombiner but we can't insert arbitrary BUILD_VECTORs this late. Fixes PR15462. llvm-svn: 176634	2013-03-07 18:48:40 +00:00
Jim Grosbach	bd1a513b55	SDAG: Handle scalarizing an extend of a <1 x iN> vector. Just scalarize the element and rebuild a vector of the result type from that. rdar://13281568 llvm-svn: 176614	2013-03-07 05:47:54 +00:00
Michael Liao	32f3aca77c	Fix two remaining issues after fixing PR15355 when CMOV is not available - Phi nodes should be replaced/updated after lowering CMOV into branch because 'mainMBB' updating operand in Phi node is changed. - Add EFLAGS in livein before lowering the 2nd CMOV. It's necessary as we will reuse the EFLAGS generated before the 1st lowered CMOV, which won't clobber EFLAGS. However, we need to explicitly specify that. - '-attr=-cmov' test cases are added. llvm-svn: 176598	2013-03-07 01:01:29 +00:00
Akira Hatanaka	46c323ab23	[mips] Custom-legalize BR_JT. In N64-static, GOT address is needed to compute the branch address. llvm-svn: 176580	2013-03-06 21:32:03 +00:00
Akira Hatanaka	41da8c24c4	[mips] Add a line which checks function name. Rename file. llvm-svn: 176543	2013-03-06 01:58:03 +00:00
Michael Liao	5859ab0234	Fix PR15355 - Clear 'mayStore' flag when loading from the atomic variable before the spin loop - Clear kill flag from one use to multiple use in registers forming the address to that atomic variable - don't use a physical register as live-in register in BB (neither entry nor landing pad.) by copying it into virtual register (patch by Cameron Zwarich) llvm-svn: 176538	2013-03-06 00:17:04 +00:00
Akira Hatanaka	449c09d59d	[mips] Remove android calling convention. This calling convention was added just to handle functions which return vector of floats. The fix committed in r165585 solves the problem. llvm-svn: 176530	2013-03-05 23:22:30 +00:00
Akira Hatanaka	22fc44e180	[mips] Fix MipsCC::analyzeReturn so that, in soft-float mode, fp128 gets returned in registers $2 and $4. llvm-svn: 176527	2013-03-05 22:54:59 +00:00
Akira Hatanaka	583e235871	[mips] Fix MipsTargetLowering::LowerCallResult and LowerReturn to correctly handle fp128 returns. llvm-svn: 176523	2013-03-05 22:41:55 +00:00
Akira Hatanaka	50ca6f8bf7	[mips] Fix MipsTargetLowering::LowerCall to pass fp128 arguments in floating point registers. llvm-svn: 176521	2013-03-05 22:20:28 +00:00
Akira Hatanaka	5d48741407	[mips] Correct handling of fp128 (long double) formals and read long double parameters from floating point registers if target is mips64 hard float. llvm-svn: 176520	2013-03-05 22:13:04 +00:00
Jyotsna Verma	e27b88fb08	reverting patch 176508. llvm-svn: 176513	2013-03-05 20:29:23 +00:00
Jyotsna Verma	ef3cf2b345	Hexagon: Add support for lowering block address. llvm-svn: 176508	2013-03-05 19:37:46 +00:00
Jyotsna Verma	baf7212bfe	Hexagon: Expand addc, adde, subc and sube. llvm-svn: 176505	2013-03-05 19:04:47 +00:00
Jyotsna Verma	c096fbaa7d	Hexagon: Add encoding bits to the TFR64 instructions. Set isMoveImm, isAsCheapAsAMove flags for TFRI instructions. llvm-svn: 176499	2013-03-05 18:42:28 +00:00
Vincent Lejeune	9ca3635aac	R600: Turn BUILD_VECTOR into Reg_Sequence Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176487	2013-03-05 15:04:49 +00:00
Vincent Lejeune	d0d37f790e	R600: Use MUL_IEEE for trig/fdiv intrinsic Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 176485	2013-03-05 15:04:37 +00:00
NAKAMURA Takumi	7ab1b62c2b	llvm/test/CodeGen/Mips/mips64-f128.ll: Add explicit -mtriple=mips64el-unknown-unknown to appease win32. FIXME: Is it expected for win32 to affect mips targets? llvm-svn: 176471	2013-03-05 02:18:59 +00:00
NAKAMURA Takumi	7c27fc3cf0	llvm/test/CodeGen/Thumb/iabs.ll: Add explicit -mtriple=thumb-unknown-unknown to appease win32 hosts. llvm-svn: 176470	2013-03-05 02:18:52 +00:00
Akira Hatanaka	8d412f5a8a	[mips] Print move instructions. "move $4, $5" is printed instead of "or $4, $5, $zero". llvm-svn: 176455	2013-03-04 22:25:01 +00:00
Jack Carter	44abaa390d	Mips specific inline assembler constraint 'R' 'R' An address that can be used in a non-macro load or store. This patch includes a positive test case. llvm-svn: 176452	2013-03-04 21:33:15 +00:00
Eli Bendersky	f241522533	Reapply r176381, writing the CHECKs in a more forgiving manner to account for running llvm-objdump on Darwin. llvm-svn: 176443	2013-03-04 18:20:31 +00:00
Preston Gurd	66b9c4fcf9	Bypass Slow Divides * Only apply divide bypass optimization when not optimizing for size. * Fixed bug caused by constant for 0 value of type Int32, used dividend type to generate the constant instead. * For atom x86-64 apply the divide bypass to use 16-bit divides instead of 64-bit divides when operand values are small enough. * Added lit tests for 64-bit divide bypass. Patch by Tyler Nowicki! llvm-svn: 176442	2013-03-04 18:13:57 +00:00
Jim Grosbach	2b831fb8d3	ARM: Creating a vector from a lane of another. The VDUP instruction source register doesn't allow a non-constant lane index, so make sure we don't construct a ARM::VDUPLANE node asking it to do so. rdar://13328063 http://llvm.org/bugs/show_bug.cgi?id=13963 llvm-svn: 176413	2013-03-02 20:16:24 +00:00
Arnold Schwaighofer	c633bf302e	ARM NEON: Fix v2f32 float intrinsics Mark them as expand, they are not legal as our backend does not match them. llvm-svn: 176410	2013-03-02 19:38:33 +00:00
Michael Gottesman	a4c89f27cc	Revert "Rewrite a test to count emitted instructions without using -stats" This reverts commit aac7922b8fe7ae733d3fe6697e6789fd730315dc. I am reverting the commit since it broke the phase 1 public buildbot for a few hours. http://lab.llvm.org:8013/builders/clang-x86_64-darwin11-nobootstrap-RA/builds/2137 llvm-svn: 176394	2013-03-02 00:53:20 +00:00
Akira Hatanaka	d2f7ed089c	[mips] Fix inefficient code generation. This patch eliminates the need to emit a constant move instruction when this pattern is matched: (select (setgt a, Constant), T, F) The pattern above effectively turns into this: (conditional-move (setlt a, Constant + 1), F, T) llvm-svn: 176384	2013-03-01 21:52:08 +00:00
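A minimal sketch of why the rewrite described in the entry above preserves behaviour, written in plain Python rather than as SelectionDAG code. The function names are hypothetical, and the sketch assumes `Constant + 1` does not overflow the integer width, a condition the log entry does not discuss.

```python
def select_setgt(a, constant, t, f):
    # (select (setgt a, Constant), T, F): T when a > Constant, else F
    return t if a > constant else f


def cmov_setlt(a, constant, t, f):
    # (conditional-move (setlt a, Constant + 1), F, T):
    # F when a < Constant + 1, else T
    return f if a < constant + 1 else t


# For integers, a > C holds exactly when a < C + 1 does not,
# so both forms pick the same operand.
for a in range(-8, 9):
    for c in range(-8, 9):
        assert select_setgt(a, c, "T", "F") == cmov_setlt(a, c, "T", "F")
```

Inverting the comparison and swapping the selected operands lets the constant appear as an immediate of a less-than compare, which is presumably where the saved constant move comes from.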
Eli Bendersky	f364fecbb8	Rewrite a test to count emitted instructions without using -stats Also removed the comments of "should produce..." because they completely don't match the actually produced output. llvm-svn: 176381	2013-03-01 21:34:37 +00:00
Akira Hatanaka	e2ccd1b4d6	Set properties for f128 type. llvm-svn: 176378	2013-03-01 21:11:44 +00:00
Michael Liao	fde72e5106	Add regression tests (WORKSFORME) - These tests won't crash on trunk but would be better to add them so that they don't break again in the future. llvm-svn: 176369	2013-03-01 19:23:37 +00:00
Chad Rosier	25ffc43c38	Generate an error message instead of asserting or segfaulting when we can't handle indirect register inputs. rdar://13322011 llvm-svn: 176367	2013-03-01 19:12:05 +00:00
Michael Liao	1e621fbd2f	Fix PR10475 - ISD::SHL/SRL/SRA must have either both scalar or both vector operands but TLI.getShiftAmountTy() so far only returns scalar type. As a result, backend logic assuming that breaks. - Rename the original TLI.getShiftAmountTy() to TLI.getScalarShiftAmountTy() and re-define TLI.getShiftAmountTy() to return target-specified scalar type or the same vector type as the 1st operand. - Fix most TICG logic assuming TLI.getShiftAmountTy() a simple scalar type. llvm-svn: 176364	2013-03-01 18:40:30 +00:00
Chad Rosier	313ffa4bc0	Add support for using non-pic code for arm and thumb1 when emitting the sjlj dispatch code. As far as I can tell the thumb2 code is behaving as expected. I was able to compile and run the associated test case for both arm and thumb1. rdar://13066352 llvm-svn: 176363	2013-03-01 18:30:38 +00:00
Christian Konig	1a86119413	R600/SI: fix sampler tests after fixing wait insertions Signed-off-by: Christian König <christian.koenig@amd.com> Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 176359	2013-03-01 17:39:05 +00:00
Jyotsna Verma	359daa8ad9	Hexagon: Add constant extender support framework. llvm-svn: 176358	2013-03-01 17:37:13 +00:00
Akira Hatanaka	04b651d332	[mips] Remove unused option. Fix 80-column violations. llvm-svn: 176330	2013-03-01 02:17:02 +00:00
Akira Hatanaka	9449ff1c6e	[mips] Add the capability to search delay slot filling instructions in successor basic blocks. Currently this is off by default. llvm-svn: 176329	2013-03-01 02:03:51 +00:00
Akira Hatanaka	01c253f718	[mips] Add capability to search in the forward direction for instructions that can fill the delay slot. Currently, this is off by default. llvm-svn: 176320	2013-03-01 00:50:52 +00:00
Akira Hatanaka	581402232c	[mips] Define class MemDefsUses. This class tracks dependence between memory instructions using underlying objects of memory operands. llvm-svn: 176313	2013-03-01 00:16:31 +00:00
Tim Northover	779708f861	AArch64: be more careful resorting to inefficient addressing for weak vars. If an otherwise weak var is actually defined in this unit, it can't be undefined at runtime so we can use normal global variable sequences (ADRP/ADD) to access it. llvm-svn: 176259	2013-02-28 14:36:31 +00:00
Tim Northover	b24657b0c5	AArch64: don't drop GlobalAddress offset when handling extern_weak decls. llvm-svn: 176258	2013-02-28 14:36:24 +00:00
Tim Northover	e2cf283c3e	AArch64: Use cbnz instead of cmp/b.ne pair for atomic operations. llvm-svn: 176253	2013-02-28 13:52:07 +00:00
Jim Grosbach	4d945565f7	ARM: FMA is legal only if VFP4 is available. rdar://13306723 llvm-svn: 176212	2013-02-27 21:31:12 +00:00
Manman Ren	894d0f9fc3	SelectionDAG: If llvm.donothing has a landingpad, we should clear CurrentCallSite to avoid an assertion failure: assert(MMI.getCurrentCallSite() == 0 && "Overlapping call sites!"); rdar://problem/13228754 llvm-svn: 176154	2013-02-27 02:11:57 +00:00
Bill Schmidt	5440b8eaca	Fix PR15332 (patch by Florian Zeitz). There's no need to generate a stack frame for PPC32 SVR4 when there are no local variables assigned to the stack, i.e., when no red zone is needed. (PPC64 supports a red zone, but PPC32 does not.) llvm-svn: 176124	2013-02-26 21:28:57 +00:00
Chad Rosier	0370d957b0	Add a test case for r176066. llvm-svn: 176119	2013-02-26 20:22:30 +00:00
Chad Rosier	3c39a1292b	Remove a few unused arguments. llvm-svn: 176109	2013-02-26 18:39:31 +00:00
Bill Schmidt	76befd83d4	Fix PR15359. The PowerPC TLS relocation types were not previously added to the necessary list in MCELFStreamer::fixSymbolsInTLSFixups(). Now they are! llvm-svn: 176094	2013-02-26 16:41:03 +00:00
Kostya Serebryany	f560b78692	Unify clang/llvm attributes for asan/tsan/msan (LLVM part) These are two related changes (one in llvm, one in clang). LLVM: - rename address_safety => sanitize_address (the enum value is the same, so we preserve binary compatibility with old bitcode) - rename thread_safety => sanitize_thread - rename no_uninitialized_checks -> sanitize_memory CLANG: - add __attribute__((no_sanitize_address)) as a synonym for __attribute__((no_address_safety_analysis)) - add __attribute__((no_sanitize_thread)) - add __attribute__((no_sanitize_memory)) for S in address thread memory If -fsanitize=S is present and __attribute__((no_sanitize_S)) is not set llvm attribute sanitize_S llvm-svn: 176075	2013-02-26 06:58:09 +00:00
Michael Liao	ff7d7ec88b	Fix PR10499 - Check whether SSE is available before lowering all 1s vector building with PCMPEQD, which is only available from SSE2 llvm-svn: 176058	2013-02-25 23:01:03 +00:00
Chad Rosier	f38b2c410b	Remove extraneous attribute number. llvm-svn: 176053	2013-02-25 22:06:05 +00:00
Chad Rosier	37142b6930	[fast-isel] Add X86FastIsel::FastLowerArguments to handle functions with 6 or fewer scalar integer (i32 or i64) arguments. It completely eliminates the need for SDISel for trivial functions. Also, add the new llc -fast-isel-abort-args option, which is similar to -fast-isel-abort option, but for formal argument lowering. llvm-svn: 176052	2013-02-25 21:59:35 +00:00
Andrew Trick	9dd0c20307	pre-RA-sched fix: only reevaluate physreg interferences when necessary. Fixes rdar:13279013: scheduler was blowing up on select instructions. llvm-svn: 176037	2013-02-25 19:11:48 +00:00
Bill Schmidt	a7e4a58051	Fix missing relocation for TLS addressing peephole optimization. Report and fix due to Kai Nacke. Testcase update by me. llvm-svn: 176029	2013-02-25 16:44:35 +00:00
Chandler Carruth	aea541125e	Fix the root cause of PR15348 by correctly handling alignment 0 on memory intrinsics in the SDAG builder. When alignment is zero, the lang ref says that no alignment assumptions can be made. This is the exact opposite of the internal API contracts of the DAG where alignment 0 indicates that the alignment can be made to be anything desired. There is another, more explicit alignment that is better suited for the role of "no alignment at all": an alignment of 1. Map the intrinsic alignment to this early so that we don't end up generating aligned DAGs. It is really terrifying that we've never seen this before, but we suddenly started generating a large number of alignment 0 memcpys due to the new code to do memcpy-based copying of POD class members. That patch contains a bug that rounds bitfield alignments down when they are the first field. This can in turn produce zero alignments. This fixes weird crashes I've seen in library users of LLVM on 32-bit hosts, etc. llvm-svn: 176022	2013-02-25 14:20:21 +00:00
Nadav Rotem	0740239f87	Revert r169638 because it broke Mesa llvmpipe tests. Fix PR15239. llvm-svn: 175985	2013-02-24 07:09:35 +00:00
Benjamin Kramer	bdb1d9aad3	X86: Disable cmov-memory patterns on subtargets without cmov. Fixes PR15115. llvm-svn: 175962	2013-02-23 10:40:58 +00:00
Reed Kotler	65cb21ddd8	Expand pseudos/macros for Selt. This is the last of the complex macros. The rest is some small misc. stuff. llvm-svn: 175950	2013-02-23 03:09:56 +00:00
Akira Hatanaka	8f0f207217	[mips] Emit call16 operator instead of got_disp. The former allows lazy binding. llvm-svn: 175920	2013-02-22 21:10:03 +00:00
Peter Collingbourne	276de50188	Fix test by matching movaps instead of AVX-only vmovaps llvm-svn: 175914	2013-02-22 19:53:30 +00:00
Peter Collingbourne	7dc1ee08f5	x86_64: designate most general purpose and SSE registers as callee save under coldcc llvm-svn: 175911	2013-02-22 19:19:44 +00:00
Pete Cooper	b4726c928e	Remove unused CHECK lines copied from another test llvm-svn: 175905	2013-02-22 18:16:21 +00:00
Kristof Beyls	a686678676	Make ARMAsmPrinter generate the correct alignment specifier syntax in instructions. The Printer will now print instructions with the correct alignment specifier syntax, like vld1.8 {d16}, [r0:64] llvm-svn: 175884	2013-02-22 10:01:33 +00:00
Reed Kotler	340c9d39ce	Expand mips16 SelT form pseudos/macros. llvm-svn: 175862	2013-02-22 05:10:51 +00:00
Pete Cooper	6da577a986	Fix isa<> check which could never be true. It was incorrectly checking a Function* being an IntrinsicInst* which isn't possible. It should always have been checking the CallInst* instead. Added test case for x86 which ensures we only get one constant load. It was 2 before this change. rdar://problem/13267920 llvm-svn: 175853	2013-02-22 01:50:38 +00:00
Anshuman Dasgupta	810cccb843	Hexagon: Expand cttz, ctlz, and ctpop for now. llvm-svn: 175783	2013-02-21 19:39:40 +00:00
Jakob Stoklund Olesen	38b12c2ce2	Make RAFast::UsedInInstr indexed by register units. This fixes some problems with too conservative checking where we were marking all aliases of a register as used, and then also checking all aliases when allocating a register. <rdar://problem/13249625> llvm-svn: 175782	2013-02-21 19:35:21 +00:00
Bill Schmidt	049ba390f5	Large code model support for PowerPC. Large code model is identical to medium code model except that the addis/addi sequence for "local" accesses is never used. All accesses use the addis/ld sequence. The coding changes are straightforward; most of the patch is taken up with creating variants of the medium model tests for large model. llvm-svn: 175767	2013-02-21 17:12:27 +00:00
Benjamin Kramer	9de866701b	DAGCombiner: Make the post-legalize vector op optimization more aggressive. A legal BUILD_VECTOR goes in and gets constant folded into another legal BUILD_VECTOR so we don't lose any legality here. The problematic PPC optimization that made this check necessary was fixed recently. llvm-svn: 175759	2013-02-21 15:24:35 +00:00
Tom Stellard	aa63f0e8d4	R600: Fix for Unigine when MachineSched is enabled Fixes for-loop.cl piglit test Patch By: Vincent Lejeune Reviewed-by: Tom Stellard <thomas.stellard@amd.com> NOTE: This is a candidate for the Mesa stable branch. llvm-svn: 175742	2013-02-21 15:06:59 +00:00
Michel Danzer	756af8b106	R600/SI: Make sure M0 is loaded for V_INTERP_MOV_F32 NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard@amd.com> llvm-svn: 175733	2013-02-21 08:57:10 +00:00
Reed Kotler	276bb6b70b	Expand the sel pseudo/macro. This generates basic blocks where previously there were inline br .+4 instructions. Soon everything can enjoy the full instruction scheduling experience. llvm-svn: 175718	2013-02-21 04:22:38 +00:00
Bill Schmidt	0e7935e723	PPCDAGToDAGISel::PostprocessISelDAG() This patch implements the PPCDAGToDAGISel::PostprocessISelDAG virtual method to perform post-selection peephole optimizations on the DAG representation. One optimization is implemented here: folds to clean up complex addressing expressions for thread-local storage and medium code model. It will also be useful for large code model sequences when those are added later. I originally thought about doing this on the MI representation prior to register assignment, but it's difficult to do effective global dead code elimination at that point. DCE is trivial on the DAG representation. A typical example of a candidate code sequence in assembly: addis 3, 2, globalvar@toc@ha addi 3, 3, globalvar@toc@l lwz 5, 0(3) When the final instruction is a load or store with an immediate offset of zero, the offset from the add-immediate can replace the zero, provided the relocation information is carried along: addis 3, 2, globalvar@toc@ha lwz 5, globalvar@toc@l(3) Since the addi can in general have multiple uses, we need to only delete the instruction when the last use is removed. llvm-svn: 175697	2013-02-21 00:38:25 +00:00
Bill Schmidt	9e8b42e2f9	Stabilize vec_constants.ll llvm-svn: 175683	2013-02-20 22:43:03 +00:00
Arnold Schwaighofer	170d2a8c25	DAGCombiner: Fold pointless truncate, bitcast, buildvector series (2xi32) (truncate ((2xi64) bitcast (buildvector i32 a, i32 x, i32 b, i32 y))) can be folded into a (2xi32) (buildvector i32 a, i32 b). Such a DAG would cause unnecessary vdup instructions followed by vmovn instructions. We generate this code on ARM NEON for a setcc olt, 2xf64, 2xf64. For example, in the vectorized version of the code below. double A[N]; double B[N]; void test_double_compare_to_double() { int i; for(i=0;i<N;i++) A[i] = (double)(A[i] < B[i]); } radar://13191881 Fixes bug 15283. llvm-svn: 175670	2013-02-20 21:33:32 +00:00
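The fold in the entry above can be checked on concrete values. This is an illustrative sketch only, not DAGCombiner code: the function name is hypothetical, and it assumes a little-endian lane layout, where the low half of each i64 lane comes from the even-numbered i32 element.

```python
import struct


def truncate_bitcast_buildvector(a, x, b, y):
    # buildvector i32 a, i32 x, i32 b, i32 y (little-endian bytes)
    raw = struct.pack("<4I", a, x, b, y)
    # (2xi64) bitcast: reinterpret the same 16 bytes as two 64-bit lanes
    wide = struct.unpack("<2Q", raw)
    # (2xi32) truncate: keep the low 32 bits of each lane
    return tuple(lane & 0xFFFFFFFF for lane in wide)


# Only a and b survive, matching (buildvector i32 a, i32 b).
assert truncate_bitcast_buildvector(1, 2, 3, 4) == (1, 3)
```

The x and y elements land in the high halves of the 64-bit lanes and are discarded by the truncate, which is why the whole series reduces to a two-element buildvector.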
Bill Schmidt	bcb4fa48fa	Additional fixes for bug 15155. This handles the cases where the 6-bit splat element is odd, converting to a three-instruction sequence to add or subtract two splats. With this fix, the XFAIL in test/CodeGen/PowerPC/vec_constants.ll is removed. llvm-svn: 175663	2013-02-20 20:41:42 +00:00
Michael Liao	a500005adc	Fix PR15267 - When extloading from a vector with non-byte-addressable element, e.g. <4 x i1>, the current logic breaks. Extend the current logic to fix the case where the element type is not byte-addressable by loading all bytes, bit-extracting/packing each element. llvm-svn: 175642	2013-02-20 18:04:21 +00:00
Bill Schmidt	358367c60f	Fix bug 14779 for passing anonymous aggregates [patch by Kai Nacke]. The PPC backend doesn't handle these correctly. This patch uses logic similar to that in the X86 and ARM backends to track these arguments properly. llvm-svn: 175635	2013-02-20 17:31:41 +00:00
Jyotsna Verma	84136133e3	Hexagon: Move HexagonMCInst.h to MCTargetDesc/HexagonMCInst.h. Add HexagonMCInst class which adds various Hexagon VLIW annotations. In addition, this class also includes some APIs related to the constant extenders. llvm-svn: 175634	2013-02-20 16:13:27 +00:00
Bill Schmidt	93b2fc9f50	Fix PR15155: lost vadd/vsplat optimization. During lowering of a BUILD_VECTOR, we look for opportunities to use a vector splat. When the splatted value fits in 5 signed bits, a single splat does the job. When it doesn't fit in 5 bits but does fit in 6, and is an even value, we can splat on half the value and add the result to itself. This last optimization hasn't been working recently because of improved constant folding. To circumvent this, create a pseudo VADD_SPLAT that can be expanded during instruction selection. llvm-svn: 175632	2013-02-20 15:50:31 +00:00
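A small sketch of the case split described in the entry above, in plain Python rather than backend code. The function name and the returned tuples are hypothetical; the ranges are the usual two's-complement bounds for 5 and 6 signed bits. Odd 6-bit values are left unhandled here because this entry does not cover them.

```python
def splat_plan(value):
    # Fits in 5 signed bits: a single splat does the job.
    if -16 <= value <= 15:
        return ("splat", value)
    # Fits in 6 signed bits and is even: splat half the value
    # and add the result to itself.
    if -32 <= value <= 31 and value % 2 == 0:
        return ("splat_add_self", value // 2)
    # Anything else is outside what this entry describes.
    return None


assert splat_plan(7) == ("splat", 7)
assert splat_plan(30) == ("splat_add_self", 15)
assert splat_plan(17) is None
```

Halving an even 6-bit value always lands back inside the 5-bit range, so the second case can reuse the single-splat form for its operand.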
Elena Demikhovsky	0886fb4d55	I optimized the following patterns: sext <4 x i1> to <4 x i64> sext <4 x i8> to <4 x i64> sext <4 x i16> to <4 x i64> I'm running Combine on SIGN_EXTEND_IN_REG and revert SEXT patterns: (sext_in_reg (v4i64 anyext (v4i32 x )), ExtraVT) -> (v4i64 sext (v4i32 sext_in_reg (v4i32 x , ExtraVT))) The sext_in_reg (v4i32 x) may be lowered to shl+sar operations. The "sar" does not exist on 64-bit operation, so lowering sext_in_reg (v4i64 x) has no vector solution. I also added a cost of this operations to the AVX costs table. llvm-svn: 175619	2013-02-20 12:42:54 +00:00
Logan Chien	740a4514e2	Fix thumbv5e frame lowering assertion failure. It is possible that frame pointer is not found in the callee saved info, thus FramePtrSpillFI may be incorrect if we don't check the result of hasFP(MF). Besides, if we enable the stack coloring algorithm, there will be an assertion to ensure the slot is live. But in the test case, %var1 is not live in the prologue of the function, and we will get the assertion failure. Note: There is similar code in ARMFrameLowering.cpp. llvm-svn: 175616	2013-02-20 12:21:33 +00:00
Reed Kotler	030e941124	Expand pseudos/macros: SltCCRxRy16, SltiCCRxImmX16, SltiuCCRxImmX16, SltuCCRxRy16 $T8 shows up as register $24 when emitted from C++ code so we had to change some tests that were already there for this functionality. llvm-svn: 175593	2013-02-20 05:45:15 +00:00
Chad Rosier	2be41be7b9	[ms-inline asm] Force the use of a base pointer if the MachineFunction includes MS-style inline assembly. This is a follow-on to r175334. Forcing a FP to be emitted doesn't ensure it will be used. Therefore, force the base pointer as well. We now treat MS inline assembly in the same way we treat functions with dynamic stack realignment and VLAs. This guarantees the BP will be used to reference parameters and locals. rdar://13218191 llvm-svn: 175576	2013-02-19 23:50:45 +00:00
Jim Grosbach	0d47c3335f	ARM: Allocation hints must make sure to be in the alloc order. When creating an allocation hint for a register pair, make sure the hint for the physical register reference is still in the allocation order. rdar://13240556 llvm-svn: 175541	2013-02-19 18:55:36 +00:00
Eli Bendersky	1523eabc7e	Fix typo llvm-svn: 175530	2013-02-19 17:11:48 +00:00
Benjamin Kramer	d0bfa4e8dc	Fix GCMetadataPrinter::finishAssembly not executed, patch by Yiannis Tsiouris. Due to the execution order of doFinalization functions, the GC information was deleted before AsmPrinter::doFinalization was executed. Thus, the GCMetadataPrinter::finishAssembly was never called. The patch fixes that by moving the code of the GCInfoDeleter::doFinalization to Printer::doFinalization. llvm-svn: 175528	2013-02-19 16:51:44 +00:00
Arnold Schwaighofer	3a1cb40149	ARM NEON: Merge a f32 bitcast of a v2i32 extractelt A vectorized sitofp on doubles will get scalarized to a sequence of an extract_element of <2 x i32>, a bitcast to f32 and a sitofp. Due to the extract_element and the bitcast, we will unnecessarily generate moves between scalar and vector registers. The patch fixes this by using a COPY_TO_REGCLASS and an EXTRACT_SUBREG to extract the element from the vector instead. radar://13191881 llvm-svn: 175520	2013-02-19 15:27:05 +00:00
Reed Kotler	d849980705	Expand pseudos/macros BteqzT8SltiX16, BteqzT8SltiuX16, BtnezT8SltiX16, BtnezT8SltiuX16 . llvm-svn: 175486	2013-02-19 03:56:57 +00:00
Reed Kotler	7ddfd1de27	Expand pseudos BteqzT8CmpiX16 and BtnezT8CmpiX16. llvm-svn: 175474	2013-02-19 00:20:58 +00:00
Chad Rosier	5babcb4a4b	Comment out the rdar number. llvm-svn: 175460	2013-02-18 21:59:15 +00:00
Chad Rosier	81ced58e28	[fast-isel] Remove an invalid assert. If the memcpy has an odd length with an alignment of 2, this would incorrectly assert on the last 1 byte copy. rdar://13202135 llvm-svn: 175459	2013-02-18 21:46:28 +00:00
Benjamin Kramer	462b555ebe	Support for HiPE-compatible code emission, patch by Yiannis Tsiouris. llvm-svn: 175457	2013-02-18 20:55:12 +00:00
Vincent Lejeune	9328de1e18	R600/SI: Use MULADD_IEEE/V_MAD_F32 instruction for mad pattern llvm-svn: 175446	2013-02-18 14:11:28 +00:00
Reed Kotler	a23b2388d3	Expand macro/pseudo instructions BtnezT8SltX16 and BtnezT8SltuX16. llvm-svn: 175420	2013-02-18 05:43:03 +00:00
Reed Kotler	8e9b3f2984	Expand pseudo/macro BteqzT8SltX16. llvm-svn: 175417	2013-02-18 04:04:26 +00:00
Reed Kotler	6faf1b4290	Expand macro/pseudo BteqzT8CmpX16. llvm-svn: 175416	2013-02-18 03:06:29 +00:00
Reed Kotler	1ca4a75d36	Beginning of expanding all current mips16 macro/pseudo instruction sequences. This expansion will be moved to expandISelPseudos as soon as I can figure out how to do that. There are other instructions which use this ExpandFEXT_T8I816_ins and as soon as I have finished expanding them all, I will delete the macro asm string text so it has no way to be used in the future. llvm-svn: 175413	2013-02-18 00:59:04 +00:00
Benjamin Kramer	49d8149a57	Force a cpu for test. It failed on atom due to different scheduling decisions. llvm-svn: 175401	2013-02-17 18:26:11 +00:00
Jakub Staszak	8a35ccb6e4	Replace "check:" with "CHECK:". Also fix one test by changing "vpermilps" to "vpshufd". llvm-svn: 175357	2013-02-16 12:16:56 +00:00
Bill Wendling	fb87157cc8	Reinitialize the ivars in the subtarget so that they can be reset with the new features. llvm-svn: 175336	2013-02-16 01:36:26 +00:00
Chad Rosier	1062ec80b5	[ms-inline asm] Do not omit the frame pointer if we have ms-inline assembly. If the frame pointer is omitted, and any stack changes occur in the inline assembly, e.g.: "pusha", then any C local variable or C argument references will be incorrect. I pass no judgement on anyone who would do such a thing. ;) rdar://13218191 llvm-svn: 175334	2013-02-16 01:25:28 +00:00
Bill Wendling	3e17fc6664	Temporary revert of 175320. llvm-svn: 175322	2013-02-15 23:22:32 +00:00
Bill Wendling	ecc7822c1e	Reinitialize the ivars in the subtarget. When we're recalculating the feature set of the subtarget, we need to have the ivars in their initial state. llvm-svn: 175320	2013-02-15 23:18:01 +00:00
Paul Redmond	09a6b11f75	enable SDISel sincos optimization for GNU environments - add sincos to runtime library if target triple environment is GNU - added canCombineSinCosLibcall() which checks that sincos is in the RTL and if the environment is GNU then unsafe fpmath is enabled (required to preserve errno) - extended sincos-opt lit test Reviewed by: Hal Finkel llvm-svn: 175283	2013-02-15 18:45:18 +00:00
Tim Northover	04e9446751	AArch64: remove ConstantIsland pass & put literals in separate section. This implements the review suggestion to simplify the AArch64 backend. If we later discover that we really need the extra complexity of the ConstantIslands pass for performance reasons it can be resurrected. llvm-svn: 175258	2013-02-15 09:33:43 +00:00
Tim Northover	9f3ff5cc4c	AArch64: refactor frame handling to use movz/movk for overlarge offsets. In the near future litpools will be in a different section, which means that any access to them is at least two instructions. This makes the case for a movz/movk pair (if total offset <= 32-bits) even more compelling. llvm-svn: 175257	2013-02-15 09:33:26 +00:00
Reed Kotler	45e1076551	Fix minor mips16 issues in directives for function prologue. Probably this does not matter but makes it more gcc compatible which avoids possible subtle problems. Also, turned back on a disabled check in helloworld.ll. llvm-svn: 175237	2013-02-15 01:04:38 +00:00
Nadav Rotem	da8ef29d81	Don't merge consecutive loads/stores into vectors when noimplicitfloat is used. llvm-svn: 175190	2013-02-14 18:28:52 +00:00
Weiming Zhao	c1d92fe42d	Re-apply r175088 for bug fix 13622: Add paired register support for inline asm with 64-bit data on ARM Update test case to use -mtriple=arm-linux-gnueabi llvm-svn: 175186	2013-02-14 18:10:21 +00:00
Vincent Lejeune	f597698c9c	R600: Do not fold single instruction with more than 3 kcache reads It fixes around 100 tfb piglit tests and 16 glean tests. NOTE: This is a candidate for the Mesa stable branch. Reviewed-by: Tom Stellard <thomas.stellard at amd.com> llvm-svn: 175183	2013-02-14 16:57:19 +00:00
Kristof Beyls	d33917748d	Make ARMAsmParser accept the correct alignment specifier syntax in instructions. The parser will now accept instructions with alignment specifiers written like vld1.8 {d16}, [r0:64] , while also still accepting the incorrect syntax vld1.8 {d16}, [r0, :64] llvm-svn: 175164	2013-02-14 14:46:12 +00:00
Elena Demikhovsky	3a155506e7	Fixed a bug in X86TargetLowering::LowerVectorIntExtend() (assertion failure). Added a test. llvm-svn: 175144	2013-02-14 08:20:26 +00:00

7244 Commits