llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Craig Topper	1548551887	Allow pinsrw/pinsrb/pextrb/pextrw/movmskps/movmskpd/pmovmskb/extractps instructions to parse either GR32 or GR64 without resorting to duplicating instructions. llvm-svn: 192567	2013-10-14 04:55:01 +00:00
Craig Topper	ba1540e28a	Add disassembler support for SSE4.1 register/register form of PEXTRW. There is a shorter encoding that was part of SSE2, but a memory form was added in SSE4.1. This is the register form of that encoding. llvm-svn: 192566	2013-10-14 01:42:32 +00:00
Craig Topper	c9050b2d46	Mark MOVMSKPS/MOVMSKPD/VPINSRWrr64i as AsmParserOnly to remove them from the disassembler tables. Add PINSRWrr64i to complement the AVX version. llvm-svn: 192565	2013-10-14 01:21:22 +00:00
Vincent Lejeune	7594bd2071	R600: improve dump of S_WAITCNT llvm-svn: 192557	2013-10-13 17:56:28 +00:00
Vincent Lejeune	316b632e03	R600: Use masked read sel for texture instructions llvm-svn: 192554	2013-10-13 17:56:10 +00:00
Vincent Lejeune	b337ac16bc	R600: fix swizzle export llvm-svn: 192553	2013-10-13 17:56:04 +00:00
Arnold Schwaighofer	38ec37faba	SLPVectorizer: Sort PHINodes based on their opcode Before this patch we relied on the order of phi nodes when we looked for phi nodes of the same type. This could prevent vectorization of cases where there was a phi node of a second type in between phi nodes of some type. This is important for vectorization of an internal graphics kernel. On the test suite + external on x86_64 (and on a run on armv7s) it showed no impact on either performance or compile time. radar://15024459 llvm-svn: 192537	2013-10-12 18:56:27 +00:00
Benjamin Kramer	3000b81f9a	Force a CPU on test so it doesn't depend on microarchitectural scheduling decisions. llvm-svn: 192532	2013-10-12 11:17:12 +00:00
Reed Kotler	9efb450361	For Mips16, start to consolidate all forms of 32 bit literal loading so that they can be better handled and optimized in the Mips16 constant island code. llvm-svn: 192520	2013-10-12 02:19:08 +00:00
Andrew Kaylor	54a20b979d	Fixing problems in lli's RemoteMemoryManager. This fixes a problem from a previous check-in where a return value was omitted. Previously the remote/stubs-remote.ll and remote/stubs-sm-pic.ll tests were reporting passes, but they should have been failing. Those tests attempt to link against an external symbol and remote symbol resolution is not supported. The old RemoteMemoryManager implementation resulted in local symbols being used for resolution and the child process crashed but the test didn't notice. With this check-in remote symbol resolution fails, and so the test (correctly) fails. llvm-svn: 192514	2013-10-11 22:47:10 +00:00
Andrew Kaylor	0f28749d52	Adding multiple object support to MCJIT EH frame handling llvm-svn: 192504	2013-10-11 21:25:48 +00:00
Matt Arsenault	289accc07f	R600: Add scalar i32 add test llvm-svn: 192501	2013-10-11 21:03:41 +00:00
Matt Arsenault	d5c3e13cc5	Use CHECK-LABEL llvm-svn: 192500	2013-10-11 21:03:39 +00:00
Benjamin Kramer	3e32e13c0b	Mips: Disassemble sign-extended 64 bit immediates properly. This doesn't change the meaning of the output, but makes look right. PR17539. llvm-svn: 192483	2013-10-11 19:05:08 +00:00
Matthias Braun	f96d183309	Remove kill flags after if conversion if necessary When if converting something like: true: ... = R0<kill> false: ... = R0<kill> then the instructions of the true block must not have a <kill> flag anymore, as the instruction of the false block follow and do still read the R0 value. Specifically this patch determines the set of register live-in in the false block (possibly after simulating the liveness changes of the duplicated instructions). Each of these live-in registers mustn't be killed. llvm-svn: 192482	2013-10-11 19:04:37 +00:00
Quentin Colombet	7ba3455dfc	[DAGCombiner] Load slicing test case: attempt to really fix the buildbots (used sse4.2 instead of avx!). <rdar://problem/14477220> llvm-svn: 192480	2013-10-11 18:54:49 +00:00
Manman Ren	dc257d1644	Debug Info Testing Case: check for the name of a structure. llvm-svn: 192478	2013-10-11 18:50:00 +00:00
Stephen Lin	4a5c56d804	Really fix CHECK-LABEL and CHECK-DAG interaction. This actually just restores the initial implementation that was in r186162 but got lost in some subsequent refactoring. More explicit variable names and comments are present now to hopefully prevent repeat regression, as well as another test. llvm-svn: 192477	2013-10-11 18:38:36 +00:00
Quentin Colombet	c02e5604f4	[DAGCombiner] Reapply load slicing (192471) with a test that explicitly set sse4.2 support. This should fix the buildbots. Original commit message: [DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192476	2013-10-11 18:29:42 +00:00
Quentin Colombet	fd0097531f	[DAGCombiner] Revert load slicing (r192471), until I figure out why it fails on ubuntu. llvm-svn: 192474	2013-10-11 18:17:17 +00:00
Matthias Braun	434fbd854b	Revert "Tests: Be less dependent on a specific schedule/regalloc" This reverts r192454 Apparently FileCheck isn't as smart as I though and does not enforce a topological order between variable defs+uses. llvm-svn: 192472	2013-10-11 18:09:19 +00:00
Quentin Colombet	b60dc81c8b	[DAGCombiner] Slice a big load in two loads when the element are next to each other in memory and the target has paired load and performs post-isel loads combining. E.g., this optimization will transform something like this: a = load i64* addr b = trunc i64 a to i32 c = lshr i64 a, 32 d = trunc i64 c to i32 into: b = load i32* addr1 d = load i32* addr2 Where addr1 = addr2 +/- sizeof(i32), if the target supports paired load and performs post-isel loads combining. One should overload TargetLowering::hasPairedLoad to provide this information. The default is false. <rdar://problem/14477220> llvm-svn: 192471	2013-10-11 18:01:14 +00:00
Rafael Espindola	d708c218d0	Fix handling of CHECK-DAG inside of CHECK-LABEL. llvm-svn: 192463	2013-10-11 16:48:02 +00:00
Amara Emerson	bf6dcda63c	[ARM] Fix FP ABI attributes with no VFP enabled. llvm-svn: 192458	2013-10-11 16:03:43 +00:00
Matthias Braun	4beef11e35	Tests: Be less dependent on a specific schedule/regalloc llvm-svn: 192454	2013-10-11 15:40:12 +00:00
Matheus Almeida	c2695f53f2	This reverts 192447 because of compiler warning generated on darwin build. llvm-svn: 192451	2013-10-11 13:58:32 +00:00
Matheus Almeida	ee726b70e1	This reverts r192449 because of compiler warning generated on darwin build. llvm-svn: 192450	2013-10-11 13:56:12 +00:00
Matheus Almeida	394228ce3f	[mips][msa] Direct Object Emission for the majority of the ELM instructions. llvm-svn: 192449	2013-10-11 13:39:49 +00:00
Matheus Almeida	4a18a3f38a	[mips][msa] Direct Object Emission of INSERT.{B,H,W} instruction. INSERT is the first type of MSA instruction that requires a change to the way MSA registers are parsed. This happens because MSA registers may be suffixed by an index in the form of an immediate or a general purpose register. The changes to parseMSARegs reflect that requirement. llvm-svn: 192447	2013-10-11 13:29:36 +00:00
Matheus Almeida	73759d3a3b	[mips][msa] Improves robustness of the test by enhancing pattern matching. llvm-svn: 192446	2013-10-11 13:18:01 +00:00
Justin Holewinski	9769d1f0ef	[NVPTX] Switch from StrongPHIElimination to PHIElimination in NVPTXTargetMachine, and add some missing optimization passes to addOptimizedRegAlloc Fixes PR17529 llvm-svn: 192445	2013-10-11 12:39:39 +00:00
Justin Holewinski	f7d6ae0d5b	Make AsmPrinter::emitImplicitDef a virtual method so targets can emit custom comments for implicit defs For NVPTX, this fixes a crash where the emitImplicitDef implementation was expecting physical registers, while NVPTX uses virtual registers (with a couple of exceptions). Now, the implicit def comment will be emitted as a true PTX register name. Other targets can use this to customize the output of implicit def comments. Fixes PR17519 llvm-svn: 192444	2013-10-11 12:39:36 +00:00
Amara Emerson	83afefcfe3	[ARM] Add a test case for disabled neon/fpu features. llvm-svn: 192440	2013-10-11 11:07:00 +00:00
Daniel Sanders	3649e05b17	[mips][msa] Added support for matching maddv.[bhwd], and msubv.[bhwd] from normal IR (i.e. not intrinsics) llvm-svn: 192438	2013-10-11 10:50:42 +00:00
Daniel Sanders	9bec7b823b	[mips][msa] Added support for matching fmsub.[wd] from normal IR (i.e. not intrinsics) llvm-svn: 192435	2013-10-11 10:27:32 +00:00
Robert Lytton	864d2bd56d	XCore target fix bug in emitArrayBound() causing segmentation fault llvm-svn: 192434	2013-10-11 10:27:13 +00:00
Robert Lytton	12def987ea	XCore target does not emit '.hidden' or '.protected' attributes llvm-svn: 192433	2013-10-11 10:27:00 +00:00
Robert Lytton	b441cef9c5	XCore target: fix bug in XCoreLowerThreadLocal.cpp When a ConstantExpr which uses a thread local is part of a PHI node instruction, the insruction that replaces the ConstantExpr must be inserted in the predecessor block, in front of the terminator instruction. If the predecessor block has multiple successors, the edge is first split. llvm-svn: 192432	2013-10-11 10:26:48 +00:00
Robert Lytton	e5a2d050ac	XCore target: add XCoreTargetLowering::isZExtFree() llvm-svn: 192431	2013-10-11 10:26:29 +00:00
Daniel Sanders	253e018134	[mips][msa] Added support for matching fmadd.[wd] from normal IR (i.e. not intrinsics) llvm-svn: 192430	2013-10-11 10:14:25 +00:00
Daniel Sanders	4971ec128b	[mips][msa] Added support for matching ffint_[us].[wd], and ftrunc_[us].[wd] from normal IR (i.e. not intrinsics) llvm-svn: 192429	2013-10-11 10:00:06 +00:00
Kevin Qin	e90902acc5	Implement aarch64 neon instruction set AdvSIMD (copy). llvm-svn: 192410	2013-10-11 02:33:55 +00:00
Matthias Braun	9e9e0f5d4c	Tests: Do not unnecessarily depend on kill comments llvm-svn: 192404	2013-10-10 22:37:49 +00:00
Matthias Braun	fd7da2a38d	Tests: Use CHECK-LABEL where possible llvm-svn: 192403	2013-10-10 22:37:47 +00:00
Manman Ren	1f6bdc7436	Debug Info: In DIBuilder, the context field of subprogram is updated to use DIScopeRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192378	2013-10-10 18:40:01 +00:00
Manman Ren	615619229d	Add comments to debug info testing case. llvm-svn: 192376	2013-10-10 18:13:17 +00:00
Matt Arsenault	6f45619203	R600: Fix trunc i64 to i32 on SI llvm-svn: 192375	2013-10-10 18:04:16 +00:00
Tom Stellard	582bb030db	R600/SI: Use -verify-machineinstrs for most tests We can't enable the verifier for tests with SI_IF and SI_ELSE, because these instructions are always followed by a COPY which copies their result to the next basic block. This violates the machine verifier's rule that non-terminators can not folow terminators. Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 192366	2013-10-10 17:11:46 +00:00
Hao Liu	d0ab407a23	Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 192361	2013-10-10 17:00:52 +00:00
Rafael Espindola	bb93e39fe2	Revert "Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4)." This reverts commit r192352. It broke the build. llvm-svn: 192354	2013-10-10 15:15:17 +00:00
Hao Liu	0ff11c9c71	Implement AArch64 vector load/store multiple N-element structure class SIMD(lselem). Including following 14 instructions: 4 ld1 insts: load multiple 1-element structure to sequential 1/2/3/4 registers. ld2/ld3/ld4: load multiple N-element structure to sequential N registers (N=2,3,4). 4 st1 insts: store multiple 1-element structure from sequential 1/2/3/4 registers. st2/st3/st4: store multiple N-element structure from sequential N registers (N = 2,3,4). llvm-svn: 192352	2013-10-10 15:01:24 +00:00
Benjamin Kramer	11421cc493	Disable function padding to get this test to pass on atom. llvm-svn: 192348	2013-10-10 12:46:23 +00:00
Tim Northover	50b95fa75d	ARM: correct liveness flags during ARMLoadStoreOpt When we had a sequence like: s1 = VLDRS [r0, 1], Q0<imp-def> s3 = VLDRS [r0, 2], Q0<imp-use,kill>, Q0<imp-def> s0 = VLDRS [r0, 0], Q0<imp-use,kill>, Q0<imp-def> s2 = VLDRS [r0, 4], Q0<imp-use,kill>, Q0<imp-def> we were gathering the {s0, s1} loads below the s3 load. This is fine, but confused the verifier since now the s3 load had Q0<imp-use> with no definition above it. This should mark such uses <undef> as well. The liveness structure at the beginning and end of the block is unaffected, and the true sN definitions should prevent any dodgy reorderings being introduced elsewhere. rdar://problem/15124449 llvm-svn: 192344	2013-10-10 09:28:20 +00:00
Craig Topper	60ef08db39	Allow non-AVX form of pmovmskb to take a GR64 operand. llvm-svn: 192341	2013-10-10 05:33:31 +00:00
Akira Hatanaka	d7e78a8926	[mips] Do not generate INS/EXT nodes if target does not have support for ins/ext. llvm-svn: 192330	2013-10-09 23:36:17 +00:00
Manman Ren	992ab551ae	Debug Info: In DIBuilder, the context and type fields of template_type and template_value are updated to use DIRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192320	2013-10-09 19:46:28 +00:00
Manman Ren	25dbd15627	Debug Info: In DIBuilder, the context field of a forward decl is updated to use DIScopeRef. llvm-svn: 192309	2013-10-09 18:10:55 +00:00
Shuxin Yang	708f0cc2e4	Fix a bug in Dead Argument Elimination. If a function seen at compile time is not necessarily the one linked to the binary being built, it is illegal to change the actual arguments passing to it. e.g. -------------------------- void foo(int lol) { // foo() has linkage satisifying isWeakForLinker() // "lol" is not used at all. } void bar(int lo2) { // xform to foo(undef) is illegal, as compiler dose not know which // instance of foo() will be linked to the the binary being built. foo(lol2); } ----------------------------- Such functions can be captured by isWeakForLinker(). NOTE that mayBeOverridden() is insufficient for this purpose as it dosen't include linkage types like AvailableExternallyLinkage and LinkOnceODRLinkage. Take link_odr* as an example, it indicates a set of EQUIVALENT globals that can be merged at link-time. However, the semantic of EQUIVALENT-functions includes parameters. Changing parameters breaks the assumption. Thank John McCall for help, especially for the explanation of subtle difference between linkage types. rdar://11546243 llvm-svn: 192302	2013-10-09 17:21:44 +00:00
Venkatraman Govindaraju	aedc12be2e	[Sparc] Disable tail call optimization for sparc64. This patch fixes PR17506. llvm-svn: 192294	2013-10-09 12:50:39 +00:00
Elena Demikhovsky	f24ecf7862	AVX-512: Added VRCP28 and VRSQRT28 instructions and intrinsics. llvm-svn: 192283	2013-10-09 08:16:14 +00:00
Tim Northover	87db53ff7a	AArch64: enable MISched by default. Substantial SelectionDAG scheduling is going away soon, and is interfering with Hao's attempts to implement LDn/STn instructions, so I say we make the leap first. There were a few reorderings (inevitably) which broke some tests. I tried to replace them with CHECK-DAG variants mostly, but some too complex for that to be useful and I just reordered them. llvm-svn: 192282	2013-10-09 07:53:57 +00:00
Tim Northover	a9df6657ee	AArch64: migrate ADRP relaxation test to be llvm-mc only. llvm-svn: 192281	2013-10-09 07:53:49 +00:00
Craig Topper	d5082631e1	Add in64BitMode/in32BitMode to the MMX/SSE2/AVX maskmovq/dq instructions. This way the asm parser will pick the right one based on the mode. Instruction selection already did the right thing based on the pointer size. llvm-svn: 192266	2013-10-09 02:18:34 +00:00
NAKAMURA Takumi	942a2fc0cb	llvm/test/LTO should run also on cygwin. llvm-svn: 192262	2013-10-09 01:07:31 +00:00
Manman Ren	15624ed0ea	Debug Info: In DIBuilder, the context field of a DICompositeType is updated to use DIScopeRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192256	2013-10-09 00:17:04 +00:00
Manman Ren	e31c6f626f	Debug Info: In DIBuilder, the context fields of a static member and a typedef are updated to use DIScopeRef. llvm-svn: 192254	2013-10-08 23:49:38 +00:00
Manman Ren	7259773e8e	Debug Info: In DIBuilder, the derived-from field of DICompositeType is updated to use DITypeRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192251	2013-10-08 23:28:51 +00:00
Manman Ren	d3dd477f4b	Debug Info: In DIBuilder, the derived-from field of DIDerivedType is updated to use DITypeRef. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192246	2013-10-08 22:56:31 +00:00
Chad Rosier	d30c4af71b	[AArch64] Add support for NEON scalar floating-point reciprocal estimate, reciprocal exponent, and reciprocal square root estimate instructions. llvm-svn: 192242	2013-10-08 22:09:04 +00:00
Chad Rosier	e281a17b84	[AArch64] Add support for NEON scalar signed/unsigned integer to floating-point convert instructions. llvm-svn: 192231	2013-10-08 20:43:30 +00:00
Manman Ren	5b259edab8	Debug Info: update testing to reflect r192018. llvm-svn: 192224	2013-10-08 20:06:43 +00:00
Reed Kotler	0b1b97d48b	Add fabsf to the list of inlined functions; otherwise Mips16 will try and create a stub for it and this will result in a link error because that function does not exist in libc. llvm-svn: 192223	2013-10-08 19:55:01 +00:00
Matt Arsenault	ed8ec1a52a	Add some xfaild R600 tests. These are bugs to fix later. llvm-svn: 192212	2013-10-08 18:06:36 +00:00
Reed Kotler	57455fdc7c	Let rotr and bswap be handled by expansion for Mips16 since we don't have native instructions for this. llvm-svn: 192207	2013-10-08 17:32:33 +00:00
Craig Topper	2d70555027	Fix a typo in the mattr part of the run line. llvm-svn: 192174	2013-10-08 06:12:26 +00:00
Craig Topper	3d7c6afb79	Explicitly disable AVX on a bunch of tests so they won't fail on AVX machines post r192171. llvm-svn: 192173	2013-10-08 06:06:57 +00:00
Craig Topper	aa1a4d51f0	Remove some instructions that existed to provide aliases to the assembler. Can be done with InstAlias instead. Unfortunately, this was causing printer to use 'vmovq' or 'vmovd' based on what was parsed. To cleanup the inconsistencies convert all 'vmovd' with 64-bit registers to 'vmovq', but provide an alias so that 'vmovd' will still parse. llvm-svn: 192171	2013-10-08 05:53:50 +00:00
Adrian Prantl	c584bc7266	typo. llvm-svn: 192158	2013-10-08 02:30:54 +00:00
Adrian Prantl	82c328eb30	typo. llvm-svn: 192157	2013-10-08 02:28:20 +00:00
Adrian Prantl	e399829d39	Reduce testcase from 1r92011. llvm-svn: 192156	2013-10-08 02:21:44 +00:00
Akira Hatanaka	6c2bf15c93	[mips] Test case for r192124. llvm-svn: 192135	2013-10-07 21:32:57 +00:00
Arnold Schwaighofer	bfe48b104a	LoopVectorize: External uses must use the last value in a reduction cycle Otherwise, we don't perform operations that would have been performed on the scalar version. Fixes PR17498. llvm-svn: 192133	2013-10-07 21:05:43 +00:00
Reed Kotler	33301878d0	Add Mips16 patterns for sign extend byte and sign extend halfword. llvm-svn: 192130	2013-10-07 20:46:19 +00:00
Manman Ren	b284db0070	Struct byval: use the correct alignment for loads generated to load from struct byval to registers. We used to pass 0 which means the alignment of PtrVT. Even when the alignment of the struct is smaller than 4, the LOADs would have alignment of 4, and further optimizations could combine the LOADs into a ldm, which would cause crash. The fix is to pass the alignment of the struct byval. rdar://problem/15144402 llvm-svn: 192126	2013-10-07 19:47:53 +00:00
Benjamin Kramer	feace9b737	X86: Fix type check. Just because an integer type is illegal doesn't mean it's i64. Fixes PR17495, where an i24 triggered this code. It's intended to optimize i64 loads on 32 bit x86. llvm-svn: 192123	2013-10-07 19:11:35 +00:00
Alexey Samsonov	a04533615d	Revert r191834 until we measure the effect of this benchmarks and maybe find a better way to fix it llvm-svn: 192121	2013-10-07 19:03:24 +00:00
Matt Arsenault	9c8541d286	Change objectsize intrinsic to accept different address spaces. Bitcasting everything to i8* won't work. Autoupgrade the old intrinsic declarations to use the new mangling. llvm-svn: 192117	2013-10-07 18:06:48 +00:00
Amara Emerson	688cdc2151	[ARM] Improve build attributes emission. llvm-svn: 192111	2013-10-07 16:55:23 +00:00
Chad Rosier	128d9134e7	[AArch64] Add support for NEON scalar arithmetic instructions: SQDMULH, SQRDMULH, FMULX, FRECPS, and FRSQRTS. llvm-svn: 192107	2013-10-07 16:36:15 +00:00
Joey Gouly	3ecfaf19bb	[ARMv8] Add some disassembly tests for Thumb sevl/sevl.w llvm-svn: 192106	2013-10-07 16:13:03 +00:00
Tim Northover	1979375a30	ARM: allow cortex-m0 to use hint instructions The hint instructions ("nop", "yield", etc) are mostly Thumb2-only, but have been ported across to the v6M architecture. Fortunately, v6M seems to sit nicely between v6 (thumb-1 only) and v6T2, so we can add a feature for it fairly easily. rdar://problem/15144406 llvm-svn: 192097	2013-10-07 11:10:47 +00:00
Simon Atanasyan	724835dff7	[Mips] Teach llvm-readobj to print MIPS-specific ELF program headers. The patch reviewed by Michael Spencer. http://llvm-reviews.chandlerc.com/D1846 llvm-svn: 192093	2013-10-07 08:58:27 +00:00
Craig Topper	6e389a510f	Remove some instructions that seem to only exist to trick the filtering checks in the disassembler table creation. Just fix up the filter to let the real instruction through instead. llvm-svn: 192090	2013-10-07 07:19:47 +00:00
Craig Topper	4a7ff81d5f	Teach X86 asm parser that VMOVAPSrr and other VEX-encoded register to register moves should be switched from using the MRMSrcReg form to the MRMDestReg form if the source register is a 64-bit extended register and the destination register is not. This allows the instruction to be encoded using the 2-byte VEX form instead of the 3-byte VEX form. The GNU assembler has similar behavior and instruction selection already does this. llvm-svn: 192088	2013-10-07 05:42:48 +00:00
Craig Topper	b5918acf04	Add disassembler support for long encodings for INC/DEC in 32-bit mode. llvm-svn: 192086	2013-10-07 04:28:06 +00:00
Rafael Espindola	499aaf305a	Add support for aliases with linkonce_odr. This will be used to extend constructor aliases in clang. llvm-svn: 192066	2013-10-06 15:10:43 +00:00
Benjamin Kramer	44710574cb	Force a CPU that doesn't have AVX, otherwise this test fails. llvm-svn: 192065	2013-10-06 13:52:41 +00:00
Benjamin Kramer	a7e734d765	X86: Don't fold spills into SSE operations if the stack is unaligned. Regalloc can emit unaligned spills nowadays, but we can't fold the spills into SSE ops if we can't guarantee alignment. PR12250. llvm-svn: 192064	2013-10-06 13:48:22 +00:00
Elena Demikhovsky	cb8eaca2e4	AVX-512: added scalar convert instructions and intrinsics. Fixed load folding in VPERM2I instruction. llvm-svn: 192063	2013-10-06 13:11:09 +00:00
Venkatraman Govindaraju	2d62beab83	[Sparc] Do not emit nop after fcmp* instruction with V9. llvm-svn: 192056	2013-10-06 07:06:44 +00:00
Elena Demikhovsky	0ff833ab99	AVX-512: fixed shuffle lowering in case of BLEND and added VSHUFPS patterns. llvm-svn: 192055	2013-10-06 06:11:18 +00:00
Venkatraman Govindaraju	aacd252702	[Sparc] Custom lower addc/adde/subc/sube on i64 in sparc64. This is required because i64 is a legal type but addxcc/subxcc reads icc carry bit, which are 32 bit conditional codes. llvm-svn: 192054	2013-10-06 03:36:18 +00:00
Venkatraman Govindaraju	fa75d8536b	[Sparc] Use addxcc/subxcc for adde/sube instead of addx/subx. addx/subx does not modify conditional codes whereas addxcc/subxx does. llvm-svn: 192053	2013-10-06 02:11:10 +00:00
Benjamin Kramer	3a6afef4e7	Emit a better error when running out of registers on inline asm. The most likely case where this error happens is when the user specifies too many register operands. Don't make it look like an internal LLVM bug when we can see that the error is coming from an inline asm instruction. For other instructions we keep the "ran out of registers" error. llvm-svn: 192041	2013-10-05 19:33:37 +00:00
Craig Topper	0a8f3fc996	Remove unneeded TBM intrinsics. The arithmetic/logical operation patterns are sufficient. llvm-svn: 192039	2013-10-05 19:22:59 +00:00
Craig Topper	d0a63f6722	Add an additional pattern for BLCI since opt can turn (not (add x, 1)) into (sub -2, x). llvm-svn: 192037	2013-10-05 17:17:53 +00:00
Rafael Espindola	a1a1d34e51	Remove some really nasty uses of hasRawTextSupport. When MC was first added, targets could use hasRawTextSupport to keep features working before they were added to the MC interface. The design goal of MC is to provide an uniform api for printing assembly and object files. Short of relaxations and other corner cases, a object file is just another representation of the assembly. It was never the intention that targets would keep doing things like if (hasRawTextSupport()) Set flags in one way. else Set flags in another way. When they do that they create two code paths and the object file is no longer just another representation of the assembly. This also then requires testing with llc -filetype=obj, which is extremelly brittle. This patch removes some of these hacks by replacing them with smaller ones. The ARM flag setting is trivial, so I just moved it to the constructor. For Mips, the patch adds two temporary hack directives that allow the assembly to represent the same things as the object file was already able to. The hope is that the mips developers will replace the hack directives with the same ones that gas uses and drop the -print-hack-directives flag. I will also try to implement a target streamer interface, so that we can move this out of the common code. In summary, for any new work, two rules of the thumb are * Don't use "llc -filetype=obj" in tests. * Don't add calls to hasRawTextSupport. llvm-svn: 192035	2013-10-05 16:42:21 +00:00
Jiangning Liu	6d9b4a0e25	Implement aarch64 neon instruction set AdvSIMD (Across). llvm-svn: 192028	2013-10-05 08:22:10 +00:00
Rafael Espindola	86d473eceb	Convert test to FileCheck. llvm-svn: 192025	2013-10-05 02:58:36 +00:00
Venkatraman Govindaraju	179e7e6dea	[Sparc] Use correct alignment while loading/storing fp128 values. llvm-svn: 192023	2013-10-05 02:29:47 +00:00
Andrew Kaylor	72249ec342	Updating XFAILs for recent GOT tests llvm-svn: 192022	2013-10-05 01:56:50 +00:00
Andrew Kaylor	c62977e263	Adding tests for multiple GOTs with MCJIT llvm-svn: 192021	2013-10-05 01:53:19 +00:00
Manman Ren	b6cdd4e959	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 192018	2013-10-05 01:43:03 +00:00
Venkatraman Govindaraju	cf869e9b2a	[Sparc] Respect hasHardQuad parameter correctly when lowering SINT_TO_FP with fp128 operand. llvm-svn: 192015	2013-10-05 00:31:41 +00:00
Adrian Prantl	a2e9135101	Debug info: Don't crash in SelectionDAGISel when a vreg that is being pointed to by a dbg_value belonging to a function argument is eliminated during instruction selection. rdar://problem/15094721. llvm-svn: 192011	2013-10-05 00:08:27 +00:00
Venkatraman Govindaraju	271e9485db	[Sparc] Correct the floating point conditional code mapping in GetOppositeBranchCondition(). llvm-svn: 192006	2013-10-04 23:54:30 +00:00
Hal Finkel	83df3058ed	UpdatePHINodes in BasicBlockUtils should not crash on duplicate predecessors UpdatePHINodes has an optimization to reuse an existing PHI node, where it first deletes all of its entries and then replaces them. Unfortunately, in the case where we had duplicate predecessors (which are allowed so long as the associated PHI entries have the same value), the loop removing the existing PHI entries from the to-be-reused PHI would assert (if that PHI was not the one which had the duplicates). llvm-svn: 192001	2013-10-04 23:41:05 +00:00
Jack Carter	6dc369450f	reverting per request llvm-svn: 191992	2013-10-04 22:52:31 +00:00
Eric Christopher	03e669d0f5	Use addFlag to add the enum class attribute. This has the side effect of using DW_FORM_flag_present on dwarf4 and above. llvm-svn: 191991	2013-10-04 22:40:10 +00:00
Reed Kotler	13ebdc7d9c	Support tblockaddr for static compilation in Mips16. llvm-svn: 191986	2013-10-04 22:01:40 +00:00
Rafael Espindola	8b2f4200cf	Fix object file writing in llvm-lto on Windows. We were writing in text mode. Patch by Greg Bedwell. llvm-svn: 191985	2013-10-04 21:40:54 +00:00
Jack Carter	70c25749d2	[MC][AsmParser] Hook for post assembly file processing This patch handles LLVM standalone assembler (llvm-mc) ELF flag setting based on input file directive processing. Mips assembly requires processing inline directives that directly and indirectly affect the output ELF header flags. This patch handles one ".abicalls". To process these directives we are following the model the code generator uses by storing state in a container as we go through processing and when we detect the end of input file processing, AsmParser is notified and we update the ELF header flags through a MipsELFStreamer method with a call from MCTargetAsmParser::emitEndOfAsmFile(MCStreamer &OutStreamer). This patch will allow other targets the same functionality. Jack llvm-svn: 191982	2013-10-04 21:26:15 +00:00
Akira Hatanaka	e85ca33e98	[mips] Fix a bug in MipsLongBranch::replaceBranch, which was erasing instructions in delay slots along with the original branch instructions. llvm-svn: 191978	2013-10-04 20:51:40 +00:00
Arnold Schwaighofer	6fab174840	SLPVectorizer: Sort inputs to commutative binary operations Sort the operands of the other entries in the current vectorization root according to the first entry's operands opcodes. %conv0 = uitofp ... %load0 = load float ... = fmul %conv0, %load0 = fmul %load0, %conv1 = fmul %load0, %conv2 Make sure that we recursively vectorize <%conv0, %conv1, %conv2> and <%load0, %load0, %load0>. This makes it more likely to obtain vectorizable trees. We have to be careful when we sort that we don't destroy 'good' existing ordering implied by source order. radar://15080067 llvm-svn: 191977	2013-10-04 20:39:16 +00:00
Eric Christopher	88a8082ea0	Temporarily revert r191792 as it is causing some LTO debug failures on platforms with relocations in debug info and also temporarily revert r191800 due to conflicts with the revert of r191792. llvm-svn: 191967	2013-10-04 17:08:38 +00:00
Matthias Braun	f7ddf86363	ARM: optimizeSelect has to consider the previous register class optimizeSelect folds (predicated) copy instructions, it must not ignore the original register class of the operand when replacing the register with the copies dest register. llvm-svn: 191963	2013-10-04 16:52:56 +00:00
Matthias Braun	fbba53e45c	ARM: do not add a regmask for TAILJUMPs The jump doesn't really kill the registers, the following call does but we never get back anyway. This avoids some verify-machineinstrs problems when TAILJUMPs are if-converted. llvm-svn: 191962	2013-10-04 16:52:54 +00:00
Matthias Braun	ae6465eb28	ARM: preserve undef flag in pseudo instruction expanders Copy over the whole register machine operand instead of creating a new one with an incomplete set of flags. llvm-svn: 191961	2013-10-04 16:52:51 +00:00
Rafael Espindola	c246c7547b	Fix this test. llvm-svn: 191958	2013-10-04 14:53:58 +00:00
Jiangning Liu	9f33a743ab	Implement aarch64 neon instruction set AdvSIMD (3V elem). llvm-svn: 191944	2013-10-04 09:20:44 +00:00
David Blaikie	ca9c6bd8ed	DebugInfo: Fix ordering of members after r191928 In the case (shown in the attached test) where a member function definition was emitted into debug info the following could occur: 1) build the debug info for the member function definition 2) in (1), build the debug info for the member function declaration 3) construct and add the member function declaration DIE 4) add it to its context 5) build its context (the type it is a member of) 6) construct the members and add them to the type 7) except don't add member functions because "getOrCreateSubprogram" adds the function to its parent anyway 8) except we're only partway through building this subprogram declaration so it hasn't been added yet - but we returned the partially constructed DIE (since it's already in the MDNode->DIE mapping to avoid infinitely recursing trying to create the member function DIE) 9) once the type is constructed, add the member function to it 10) now the members are out of order (the member function being defined is listed as the last member, even though it was declared as the first) To avoid this, construct the context of the subprogram DIE before we query to see if it exists. That way we never end up creating it before creating its context and ending up in this situation. Alternatively, the type construction that visits/builds all the members could call something like getOrCreateSubprogram, but that doesn't ever do the "add to context" step. Then the type building code would always be responsible for adding members (and the subprogram "addToContextDIE" would no-op because the context building would have added the subprogram declaration to the type/context DIE already). (the test cases updated were overly-sensitive to offsets or abbreviation numbers. We don't have a nice way to make these tests more robust as yet - multiline FileCheck matches would be required) llvm-svn: 191939	2013-10-04 01:39:59 +00:00
Andrew Kaylor	4c916e5212	Adding support and tests for multiple module handling in lli llvm-svn: 191938	2013-10-04 00:49:38 +00:00
Richard Mitton	476b8f3036	Fixed a bug with section names containing special characters. Changed the dwarf aranges code to not use getLabelEndName, as it turns out it's not reliable to call that given user-defined section names. Section names can have characters in that aren't representable as symbol names. The dwarf-aranges test case has been updated to include a special character, to check this. This fixes pr17416. llvm-svn: 191932	2013-10-03 22:07:08 +00:00
Owen Anderson	5e2eba8c18	Pull fptrunc's upwards through selects when one of the select's selectands was a constant. This has a number of benefits, including producing small immediates (easier to materialize, smaller constant pools) as well as being more likely to allow the fptrunc to fuse with a preceding instruction (truncating selects are unusual). llvm-svn: 191929	2013-10-03 21:08:05 +00:00
Rafael Espindola	a88c415234	Optimize linkonce_odr unnamed_addr functions during LTO. Generalize the API so we can distinguish symbols that are needed just for a DSO symbol table from those that are used from some native .o. The symbols that are only wanted for the dso symbol table can be dropped if llvm can prove every other dso has a copy (linkonce_odr) and the address is not important (unnamed_addr). llvm-svn: 191922	2013-10-03 18:29:09 +00:00
Matt Arsenault	c8e9a77ae3	Make gep i8* X, -(ptrtoint Y) transform work with address spaces llvm-svn: 191920	2013-10-03 18:15:57 +00:00
Eric Christopher	f7dc939b6e	Make sure we emit a section for pubnames even if that section is going to be empty. This is particularly important for the gnu pubnames case since we're emitting a relocation to the section. llvm-svn: 191915	2013-10-03 17:41:20 +00:00
Logan Chien	b51d0cd53b	[arm] Enhance the test case by checking .fpu directive. llvm-svn: 191891	2013-10-03 12:18:56 +00:00
Amara Emerson	ece8c5f612	[ARM] Warn on deprecated IT blocks in v8 AArch32 assembly. Patch by Artyom Skrobov. llvm-svn: 191885	2013-10-03 09:31:51 +00:00
Alexey Samsonov	78295ae841	Remove wild .debug_aranges entries generated from unimportant labels r191052 added emitting .debug_aranges to Clang, but this functionality is broken: it uses all MC labels added in DWARF Asm printer, including the labels for build relocations between different DWARF sections, like .Lsection_line or .Ldebug_loc0. As a result, if any DIE .debug_info would contain "DW_AT_location=0x123" attribute, .debug_aranges would also contain a range starting from 0x123, breaking tools that rely on this section. This patch fixes this by using only MC labels that corresponds to the addresses in the user program. llvm-svn: 191884	2013-10-03 08:54:43 +00:00
Craig Topper	6fb0648c41	Add XOP disassembler support. Fixes PR13933. llvm-svn: 191874	2013-10-03 05:17:48 +00:00
Craig Topper	5b62ea95ec	Remove duplicated test cases that occurred when I applied the same patch file to my model twice. llvm-svn: 191873	2013-10-03 04:27:14 +00:00
Craig Topper	5ac188d0f2	Add patterns for selecting TBM instructions from logical operations. Patch from Yunzhong Gao. llvm-svn: 191871	2013-10-03 04:16:45 +00:00
Matt Arsenault	2c180f66a9	Don't use runtime bounds check between address spaces. Don't vectorize with a runtime check if it requires a comparison between pointers with different address spaces. The values can't be assumed to be directly comparable. Previously it would create an illegal bitcast. llvm-svn: 191862	2013-10-02 22:38:17 +00:00
Andrew Kaylor	009a3cd324	Fixing lli-child-target build llvm-svn: 191861	2013-10-02 22:27:23 +00:00
Matt Arsenault	c43958bf36	Fix missing CHECK-LABELs llvm-svn: 191853	2013-10-02 20:29:00 +00:00
Yi Jiang	09195de8f3	Apply slp vectorization on fully-vectorizable tree of height 2 llvm-svn: 191852	2013-10-02 20:20:39 +00:00
Benjamin Kramer	4458ae839d	SLPVectorizer: Make store chain finding more aggressive with GetUnderlyingObject. This recursively strips all GEPs like the existing code. It also handles bitcasts and other operations that do not change the pointer value. llvm-svn: 191847	2013-10-02 19:06:06 +00:00
Andrew Kaylor	948eef739c	Adding out-of-process execution support to lli. At this time only Unix-based systems are supported. Windows has stubs and should re-route to the simulated mode. Thanks to Sriram Murali for contributions to this patch. llvm-svn: 191843	2013-10-02 17:12:36 +00:00
Tom Stellard	b9cbfd3959	StructurizeCFG: Add dependency on LowerSwitch pass Switch instructions were crashing the StructurizeCFG pass, and it's probably easier anyway if we don't need to handle them in this pass. Reviewed-by: Christian König <christian.koenig@amd.com> llvm-svn: 191841	2013-10-02 17:04:59 +00:00
Rafael Espindola	c55c7f41ff	Try harder to disable the LTO tests on windows. llvm-svn: 191836	2013-10-02 15:47:30 +00:00
Chandler Carruth	ee12d58370	Remove the very substantial, largely unmaintained legacy PGO infrastructure. This was essentially work toward PGO based on a design that had several flaws, partially dating from a time when LLVM had a different architecture, and with an effort to modernize it abandoned without being completed. Since then, it has bitrotted for several years further. The result is nearly unusable, and isn't helping any of the modern PGO efforts. Instead, it is getting in the way, adding confusion about PGO in LLVM and distracting everyone with maintenance on essentially dead code. Removing it paves the way for modern efforts around PGO. Among other effects, this removes the last of the runtime libraries from LLVM. Those are being developed in the separate 'compiler-rt' project now, with somewhat different licensing specifically more approriate for runtimes. llvm-svn: 191835	2013-10-02 15:42:23 +00:00
Alexey Samsonov	3291ccfed7	Remove "localize global" optimization Summary: As discussed in http://llvm-reviews.chandlerc.com/D1754, this optimization isn't really valid for C, and fires too rarely anyway. Reviewers: rafael, nicholas Reviewed By: nicholas CC: rnk, llvm-commits, nicholas Differential Revision: http://llvm-reviews.chandlerc.com/D1769 llvm-svn: 191834	2013-10-02 15:31:34 +00:00
Rafael Espindola	32fc94f28e	Add test I forgot to git add in r191824. llvm-svn: 191831	2013-10-02 14:49:41 +00:00
Rafael Espindola	881d008653	Disable this test on Win32 for now. llvm-svn: 191830	2013-10-02 14:48:35 +00:00
Chandler Carruth	33de408c73	Don't layout items in a list in columns. That requires changing every line just to add or remove a single element. What I wouldn't give to have clang-format here an be able to format this more densely without caring... Re-group and sort the entries while here to make the whole thing more clear. llvm-svn: 191828	2013-10-02 14:31:21 +00:00
Rafael Espindola	2625113559	Add a -exported-symbol option to llvm-lto. Patch by Tom Roeder. llvm-svn: 191825	2013-10-02 14:12:56 +00:00
Rafael Espindola	f84c89a432	Enable building LTO on WIN32. Enable building the LTO library (.lib and.dll) and llvm-lto.exe on Windows with MSVC and Mingw as well as re-enabling the associated test. Patch by Greg Bedwell! llvm-svn: 191823	2013-10-02 14:04:38 +00:00
Elena Demikhovsky	ee11e148e9	AVX-512: fixed a bug in getLoadStoreRegOpcode() for AVX-512 target llvm-svn: 191818	2013-10-02 12:20:42 +00:00
Manman Ren	2771d4ef9c	Debug Info: In DIBuilder, the derived-from field of a DW_TAG_pointer_type is updated to use DITypeRef. Move isUnsignedDIType and getOriginalTypeSize from DebugInfo.h to be static helper functions in DwarfCompileUnit. We already have a static helper function "isTypeSigned" in DwarfCompileUnit, and a pointer to DwarfDebug is added to resolve the derived-from field. All three functions need to go across link for derived-from fields, so we need to get hold of a type identifier map. A pointer to DwarfDebug is also added to DbgVariable in order to resolve the derived-from field. Debug info verifier is updated to check a derived-from field is a TypeRef. Verifier will not go across link for derived-from fields, in debug info finder, we go across the link to add derived-from fields to types. Function getDICompositeType is only used by dragonegg and since dragonegg does not generate identifier for types, we use an empty map to resolve the derived-from field. When printing a derived-from field, we use DITypeRef::getName to either return the type identifier or getName of the DIType. A paired commit at clang is required due to changes to DIBuilder. llvm-svn: 191800	2013-10-01 23:45:54 +00:00
Manman Ren	674295347e	Remove triple from type unique testing cases. llvm-svn: 191794	2013-10-01 20:27:56 +00:00
Manman Ren	2ab308bb6c	Try to fix native-arm bot llvm-svn: 191793	2013-10-01 20:23:12 +00:00
Manman Ren	ba5cd65c4c	Debug Info: remove duplication of DIEs when a DIE is part of the type system and it is shared across CUs. We add a few maps in DwarfDebug to map MDNodes for the type system to the corresponding DIEs: MDTypeNodeToDieMap, MDSPNodeToDieMap, and MDStaticMemberNodeToDieMap. These DIEs can be shared across CUs, that is why we keep the maps in DwarfDebug instead of CompileUnit. Sometimes, when we try to add an attribute to a DIE, the DIE is not yet added to its owner yet, so we don't know whether we should use ref_addr or ref4. We create a worklist that will be processed during finalization to add attributes with the correct form (ref_addr or ref4). We add addDIEEntry to DwarfDebug to be a wrapper around DIE->addValue. It checks whether we know the correct form, if not, we update the worklist (DIEEntryWorklist). A testing case is added to show that we only create a single DIE for a type MDNode and we use ref_addr to refer to the type DIE. llvm-svn: 191792	2013-10-01 19:52:23 +00:00
Vincent Lejeune	c7c1075d49	R600: add a pass that merges clauses. llvm-svn: 191790	2013-10-01 19:32:58 +00:00
Vincent Lejeune	0321b7798e	R600: Put PRED_X instruction in its own clause llvm-svn: 191789	2013-10-01 19:32:49 +00:00
Vincent Lejeune	e0ac07a3cb	R600: Enable -verify-machineinstrs in some tests. llvm-svn: 191788	2013-10-01 19:32:38 +00:00
Matt Arsenault	e06d79aa83	Don't merge tiny functions. It's silly to merge functions like these: define void @foo(i32 %x) { ret void } define void @bar(i32 %x) { ret void } to get define void @bar(i32) { tail call void @foo(i32 %0) ret void } llvm-svn: 191786	2013-10-01 18:05:30 +00:00
Preston Gurd	c52dfda610	Add test case for PR16785. Thanks for Dimitry Andric, Rafael Espindola, and Benjamin Kramer for providing and progressively reducing the test case! llvm-svn: 191782	2013-10-01 17:02:48 +00:00
Richard Sandiford	8ac2bcbe80	[SystemZ] Add comparisons of high words and memory llvm-svn: 191777	2013-10-01 15:00:44 +00:00
Richard Sandiford	2ed79fb1d7	[SystemZ] Add comparisons of large immediates using high words There are no corresponding patterns for small immediates because they would prevent the use of fused compare-and-branch instructions. llvm-svn: 191775	2013-10-01 14:56:23 +00:00
Richard Sandiford	3b7b53e6f4	[SystemZ] Add immediate addition involving high words llvm-svn: 191774	2013-10-01 14:53:46 +00:00
Richard Sandiford	884566de6e	[SystemZ] Extend test-under-mask support to high GR32s llvm-svn: 191773	2013-10-01 14:41:52 +00:00
Richard Sandiford	d2e34690a4	[SystemZ] Extend 32-bit RISBG optimizations to high words This involves using RISB[LH]G, whereas the equivalent z10 optimization uses RISBG. llvm-svn: 191770	2013-10-01 14:36:20 +00:00
Richard Sandiford	e2f5332463	[SystemZ] Extend pseudo conditional 8- and 16-bit stores to high words As the comment says, we always want to use STOC for 32-bit stores. llvm-svn: 191767	2013-10-01 14:33:55 +00:00
Tim Northover	684a0e633d	ARM: support interrupt attribute This function-attribute modifies the callee-saved register list and function epilogue (specifically the return instruction) so that a routine is suitable for use as an interrupt-handler of the specified type without disrupting user-mode applications. rdar://problem/14207019 llvm-svn: 191766	2013-10-01 14:33:28 +00:00
Richard Sandiford	5df2380d20	[SystemZ] Add test missing from r191764. llvm-svn: 191765	2013-10-01 14:31:50 +00:00
Richard Sandiford	7125240faa	[SystemZ] Allow integer AND involving high words llvm-svn: 191762	2013-10-01 14:20:41 +00:00
Richard Sandiford	9d3cacb101	[SystemZ] Allow integer XOR involving high words llvm-svn: 191759	2013-10-01 14:08:44 +00:00
Richard Sandiford	d2a449d3de	[SystemZ] Allow integer OR involving high words llvm-svn: 191755	2013-10-01 13:22:41 +00:00
Richard Sandiford	3af32e8cab	[SystemZ] Allow integer insertions with a high-word destination llvm-svn: 191753	2013-10-01 13:18:56 +00:00
Richard Sandiford	497097c027	[SystemZ] Allow selects with a high-word destination llvm-svn: 191751	2013-10-01 13:10:16 +00:00
Richard Sandiford	8c8e2f0237	[SystemZ] Add patterns to load a constant into a high word (IIHF) Similar to low words, we can use the shorter LLIHL and LLIHH if it turns out that the other half of the GR64 isn't live. llvm-svn: 191750	2013-10-01 13:02:28 +00:00
Richard Sandiford	ac3360b004	[SystemZ] Add register zero extensions involving at least one high word llvm-svn: 191746	2013-10-01 12:49:07 +00:00
Joey Gouly	12afb60cf2	[ARM] Introduce the 'sevl' instruction in ARMv8. This also removes the restriction on the immediate field of the 'hint' instruction. llvm-svn: 191744	2013-10-01 12:39:11 +00:00
Richard Sandiford	192be1070b	[SystemZ] Add truncating high-word stores (STCH and STHH) llvm-svn: 191743	2013-10-01 12:22:49 +00:00
Richard Sandiford	de433bf58d	[SystemZ] Add zero-extending high-word loads (LLCH and LLHH) llvm-svn: 191742	2013-10-01 12:19:08 +00:00
Benjamin Kramer	e04c4c3faf	SCEVExpander: Fix a regression I introduced by to eagerly adding RAII objects. PR17425. llvm-svn: 191741	2013-10-01 12:17:11 +00:00
Richard Sandiford	dd8ae7a617	[SystemZ] Add sign-extending high-word loads (LBH and LHH) llvm-svn: 191740	2013-10-01 12:11:47 +00:00
Richard Sandiford	c2e496f7ba	[SystemZ] Use upper words of GR64s for codegen This just adds the basics necessary for allocating the upper words to virtual registers (move, load and store). The move support is parameterised in a way that makes it easy to handle zero extensions, but the associated zero-extend patterns are added by a later patch. The easiest way of testing this seemed to be add a new "h" register constraint for high words. I don't expect the constraint to be useful in real inline asms, but it should work, so I didn't try to hide it behind an option. llvm-svn: 191739	2013-10-01 11:26:28 +00:00
Richard Sandiford	21933530e5	[SystemZ] Reapply: Add definitions of LFH and STFH Originally committed as r191661, but reverted because it changed the matching order of comparisons on some hosts. That should have been fixed by r191735. llvm-svn: 191738	2013-10-01 10:31:04 +00:00
Daniel Sanders	6ffe6fc99c	[mips][msa] Added support for matching mod_[us] from normal IR (i.e. not intrinsics) llvm-svn: 191737	2013-10-01 10:22:35 +00:00
Vladimir Medic	fe4fd5260f	This patch adds aliases for Mips sub instruction with immediate operands. Corresponding test cases are added. llvm-svn: 191734	2013-10-01 09:48:56 +00:00
Elena Demikhovsky	84c6cd222d	AVX-512: Added X86vzmovl patterns llvm-svn: 191733	2013-10-01 08:38:02 +00:00
Eric Christopher	f7c4656096	Add the DW_AT_GNU_ranges_base attribute if we've emitted any ranges into the debug_ranges section. llvm-svn: 191721	2013-10-01 00:43:36 +00:00
Matt Arsenault	19f03c0f9c	Use CHECK-LABEL llvm-svn: 191713	2013-09-30 23:31:55 +00:00
Eric Christopher	d9670853b9	The DW_AT_GNU_pubnames/pubtypes attributes are actually form SEC_OFFSET from the beginning of the section so go ahead and emit a label at the beginning of each one. llvm-svn: 191710	2013-09-30 23:14:16 +00:00
Eric Christopher	aa86376c19	Add llvm-readobj to the list of programs to find in the freshly built toolchain. Patch by Richard Pennington. llvm-svn: 191706	2013-09-30 21:55:01 +00:00
Matt Arsenault	bc0b70cd8d	Use right address space size in InstCombineCompares The test's output doesn't change, but this ensures this is actually hit with a different address space. llvm-svn: 191701	2013-09-30 21:11:01 +00:00
Matt Arsenault	27f5203bba	Constant fold ptrtoint + compare with address spaces llvm-svn: 191699	2013-09-30 21:06:18 +00:00
Tilmann Scheller	72c5c7344c	[ARM] Fix Thumb(-2) diagnostic tests. Changing the diagnostic message for out of range branch targets in 191686 broke the tests. The diagnostic message for out of range branch targets was changed to be more consistent with the other diagnostics. llvm-svn: 191691	2013-09-30 18:50:51 +00:00
Manman Ren	ad317a135a	TBAA: update tbaa format from scalar format to struct-path aware format. llvm-svn: 191690	2013-09-30 18:17:55 +00:00
Manman Ren	799fd39420	TBAA: remove !tbaa from testing cases when they are not needed. llvm-svn: 191689	2013-09-30 18:17:35 +00:00
Jack Carter	87696d33e7	[mips][msa] Direct Object Emission for I8 instructions. This patch adds Direct Object Emission support for I8 instructions: andi.b, bmnzi.b, bmzi.b, bseli.b, nori.b, ori.b, shf.{b,h,w} and xori.b. Patch by Matheus Almeida llvm-svn: 191688	2013-09-30 18:05:18 +00:00
Jack Carter	ef8714ee72	[mips][msa] Direct Object Emission for I5 instructions. This patch adds Direct Object Emission support for I5 instructions: addvi.{b,h,w,d}, ceqi.{b,h,w,d}, clei_s.{b,h,w,d}, clei_u.{b,h,w,d}, clti_s.{b,h,w,d}, clti_u.{b,h,w,d}, maxi_s.{b,h,w,d}, maxi_u.{b,h,w,d}, mini_s.{b,h,w,d}, mini_u.{b,h,w,d}, subvi.{b,h,w,d}. Patch by Matheus Almeida llvm-svn: 191687	2013-09-30 17:58:07 +00:00
Jack Carter	edd6f0149e	[mips][msa] Direct Object Emission for 2R instructions. This patch adds Direct Object Emission support for 2R instructions: nloc.{b,h,w}, nlzc.{b,h,w}, pcnt.{b,w,d}. Patch by Matheus Almeida llvm-svn: 191685	2013-09-30 17:52:33 +00:00
Jack Carter	0d832949ac	[PATCH 1/4] [mips][msa] Source register of FILL instructions is GPR and not an MSA register Patch by Matheus Almeida llvm-svn: 191684	2013-09-30 17:43:04 +00:00
Tilmann Scheller	3f19743d37	[ARM] Use FileCheck instead of grep for ARM LDRD negative tests. llvm-svn: 191683	2013-09-30 17:31:26 +00:00
Rafael Espindola	bed99fa456	Revert "Enable building LTO on WIN32." This reverts commit r191670. It was causing build failures on the msvc bots: http://bb.pgr.jp/builders/ninja-clang-i686-msc17-R/builds/5166/steps/compile/logs/stdio llvm-svn: 191679	2013-09-30 16:32:51 +00:00
Tilmann Scheller	e69a5118c3	[ARM] Assembler: ARM LDRD with writeback requires the base register to be different from the destination registers. See ARM ARM A8.8.72. Violating this constraint results in unpredictable behavior. llvm-svn: 191678	2013-09-30 16:11:48 +00:00
Benjamin Kramer	33abdcddb3	IRBuilder: Add RAII objects to reset insertion points or fast math flags. Inspired by the object from the SLPVectorizer. This found a minor bug in the debug loc restoration in the vectorizer where the location of a following instruction was attached instead of the location from the original instruction. llvm-svn: 191673	2013-09-30 15:39:48 +00:00
Rafael Espindola	5a90437273	Enable building LTO on WIN32. Enable building the LTO library (.lib and.dll) and llvm-lto.exe on Windows with MSVC and Mingw as well as re-enabling the associated test. Patch by Greg Bedwell! llvm-svn: 191670	2013-09-30 15:28:14 +00:00
Joey Gouly	b2a84bfcc1	Fix a bug in InstCombine where it attempted to cast a Value* to an Instruction* when it was actually a Constant*. There are quite a few other casts to Instruction that might have the same problem, but this is the only one I have a test case for. llvm-svn: 191668	2013-09-30 14:18:35 +00:00
Tilmann Scheller	d1b1de4b4c	[ARM] Assembler: Add more negative tests for ARM LDRD. llvm-svn: 191664	2013-09-30 13:04:22 +00:00
Richard Sandiford	981be3bcbe	[SystemZ] Revert r191661: Add definitions of LFH and STFH For some reason, adding definitions for these load and store instructions changed whether some of the build bots matched comparisons as signed or unsigned. llvm-svn: 191663	2013-09-30 12:01:35 +00:00
Richard Sandiford	5f554d415e	[SystemZ] Add definitions of LFH and STFH llvm-svn: 191661	2013-09-30 10:50:33 +00:00
Craig Topper	13fe261777	Add a few more FMA4 disassembler test cases to match the scalar set with regards to combinations of L and W-bits. llvm-svn: 191650	2013-09-30 02:50:51 +00:00
Craig Topper	a4bd7d9c3c	Various x86 disassembler fixes. Add VEX_LIG to scalar FMA4 instructions. Use VEX_LIG in some of the inheriting checks in disassembler table generator. Make use of VEX_L_W, VEX_L_W_XS, VEX_L_W_XD contexts. Don't let VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE inherit from their non-L forms unless VEX_LIG is set. Let VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE inherit from all of their non-L or non-W cases. Increase ranking on VEX_L_W, VEX_L_W_XS, VEX_L_W_XD, VEX_L_W_OPSIZE so they get chosen over non-L/non-W forms. llvm-svn: 191649	2013-09-30 02:46:36 +00:00
Benjamin Kramer	1dab382232	ObjectSizeOffsetEvaluator: Don't run into infinite recursion if we have a cyclic GEP. Those can occur in dead code. PR17402. llvm-svn: 191644	2013-09-29 19:39:13 +00:00
Craig Topper	f8804865a0	Revert accidental commit. llvm-svn: 191633	2013-09-29 08:35:51 +00:00
Craig Topper	3844406801	Change type of XOP flag in code emitters to a bool. Remove a some unneeded cases from switch. llvm-svn: 191632	2013-09-29 08:33:34 +00:00
Benjamin Kramer	17653a5c14	Add a test that large offsets on GEPs on 32 bits targets are handled correctly. llvm-svn: 191628	2013-09-28 21:27:49 +00:00
Robert Wilhelm	6b36431ffa	Fix spelling intruction -> instruction. llvm-svn: 191610	2013-09-28 11:46:15 +00:00
Tom Stellard	1cb4ba2a4d	R600: Fix handling of NAN in comparison instructions We were completely ignoring the unorder/ordered attributes of condition codes and also incorrectly lowering seto and setuo. Reviewed-by: Vincent Lejeune<vljn at ovi.com> llvm-svn: 191603	2013-09-28 02:50:50 +00:00
Manman Ren	5124fcbf05	AutoUpgrade: upgrade from scalar TBAA format to struct-path aware TBAA format. We treat TBAA tags as struct-path aware TBAA format when the first operand is a MDNode and the tag has 3 or more operands. llvm-svn: 191593	2013-09-28 00:22:27 +00:00
Akira Hatanaka	e5351a10fe	[mips] Make sure loads from lazy-binding entries do not get CSE'd or hoisted out of loops. Previously, two consecutive calls to function "func" would result in the following sequence of instructions: 1. load $16, %got(func)($gp) // load address of lazy-binding stub. 2. move $25, $16 3. jalr $25 // jump to lazy-binding stub. 4. nop 5. move $25, $16 6. jalr $25 // jump to lazy-binding stub again. With this patch, the second call directly jumps to func's address, bypassing the lazy-binding resolution routine: 1. load $25, %got(func)($gp) // load address of lazy-binding stub. 2. jalr $25 // jump to lazy-binding stub. 3. nop 4. load $25, %got(func)($gp) // load resolved address of func. 5. jalr $25 // directly jump to func. llvm-svn: 191591	2013-09-28 00:12:32 +00:00
Matt Arsenault	f672e722f2	Use right pointer type in DebugIR llvm-svn: 191576	2013-09-27 22:26:25 +00:00
Rui Ueyama	6ba4300bf8	Resurrect lit.local.cfg to un-break hexagon buildbot. llvm-svn: 191565	2013-09-27 21:26:38 +00:00
Matt Arsenault	0dc0668061	Fix SLPVectorizer using wrong address space for load/store llvm-svn: 191564	2013-09-27 21:24:57 +00:00
Rui Ueyama	10d27e5fc0	Re-submit r191472 with a fix for big endian. llvm-objdump: Dump COFF import table if -private-headers option is given. llvm-svn: 191557	2013-09-27 21:04:00 +00:00
Justin Bogner	d2e08f6deb	InstCombine: Only foldSelectICmpAndOr for integer types Currently foldSelectICmpAndOr asserts if the "or" involves a vector containing several of the same power of two. We can easily avoid this by only performing the fold on integer types, like foldSelectICmpAnd does. Fixes <rdar://problem/15012516> llvm-svn: 191552	2013-09-27 20:35:39 +00:00
Yunzhong Gao	e51da27a74	Adding intrinsics to the llvm backend for TBM instruction set. Phabricator code review is located here: http://llvm-reviews.chandlerc.com/D1750 llvm-svn: 191539	2013-09-27 18:38:42 +00:00
Manman Ren	2ef9ca7627	TBAA: handle scalar TBAA format and struct-path aware TBAA format. Remove the command line argument "struct-path-tbaa" since we should not depend on command line argument to decide which format the IR file is using. Instead, we check the first operand of the tbaa tag node, if it is a MDNode, we treat it as struct-path aware TBAA format, otherwise, we treat it as scalar TBAA format. When clang starts to use struct-path aware TBAA format no matter whether struct-path-tbaa is no, and we can auto-upgrade existing bc files, the support for scalar TBAA format can be dropped. Existing testing cases are updated to use the struct-path aware TBAA format. llvm-svn: 191538	2013-09-27 18:34:27 +00:00
Justin Bogner	db7f32982e	Transforms: Use getFirstNonPHI to set the insertion point for PHIs We were previously using getFirstInsertionPt to insert PHI instructions when vectorizing, but getFirstInsertionPt also skips past landingpads, causing this to generate invalid IR. We can avoid this issue by using getFirstNonPHI instead. llvm-svn: 191526	2013-09-27 15:30:25 +00:00
Richard Sandiford	e1db330ce8	[SystemZ] Rein back the use of block operations The backend tries to use block operations like MVC, NC, OC and XC for simple scalar operations. For correctness reasons, it rejects any case in which the regions might partially overlap. However, for performance reasons, it should also reject cases where the regions might be equal, since the instruction might then not use the fast path. This fixes a performance regression seen in bzip2. We may want to limit the optimisation even more in future, or even remove it entirely, but I'll try with this for now. llvm-svn: 191525	2013-09-27 15:29:20 +00:00
Richard Sandiford	cae9d29151	[SystemZ] Improve handling of PC-relative addresses The backend previously folded offsets into PC-relative addresses whereever possible. That's the right thing to do when the address can be used directly in a PC-relative memory reference (using things like LRL). But if we have a register-based memory reference and need to load the PC-relative address separately, it's better to use an anchor point that could be shared with other accesses to the same area of the variable. Fixes a FIXME. llvm-svn: 191524	2013-09-27 15:14:04 +00:00
Daniel Sanders	0987676281	[mips][msa] Implemented insert.d intrinsic. This intrinsic is lowered into an equivalent INSERT_VECTOR_ELT which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191521	2013-09-27 13:36:54 +00:00
Tilmann Scheller	758b35d6b8	ARM: Teach assembler to enforce constraints for ARM LDRD destination register operands. As specified in A8.8.72/A8.8.73/A8.8.74 in the ARM ARM, all variants of the ARM LDRD instruction have the following two constraints: LDRD<c> <Rt>, <Rt2>, ... (a) Rt must be even-numbered and not r14 (b) Rt2 must be R(t+1) If those two constraints are not met the result of executing the instruction will be unpredictable. Constraint (b) was already enforced, this commit adds support for constraint (a). Fixes rdar://14479793. llvm-svn: 191520	2013-09-27 13:28:17 +00:00
Daniel Sanders	3c43957555	[mips][msa] Implemented fill.d intrinsic. This intrinsic is lowered into an equivalent BUILD_VECTOR which is further lowered into a sequence of insert.w's on MIPS32. llvm-svn: 191519	2013-09-27 13:20:41 +00:00
Daniel Sanders	935673af60	[mips][msa] Implemented copy_[us].d intrinsic. This intrinsic is lowered into equivalent copy_s.w instructions during legalization. llvm-svn: 191518	2013-09-27 13:04:21 +00:00
Daniel Sanders	8c83ddcdd2	[mips][msa] Implemented insert_vector_elt for v4f32 and v2f64. For v4f32 and v2f64, INSERT_VECTOR_ELT is matched by a pseudo-insn which is later expanded to appropriate insve.[wd] insns. llvm-svn: 191515	2013-09-27 12:31:32 +00:00
Daniel Sanders	0bb1b5a37f	[mips][msa] Implemented extract_vector_elt for v4f32 or v2f64 For v4f32 and v2f64, EXTRACT_VECTOR_ELT is matched by a pseudo-insn which may be expanded to subregister copies and/or instructions as appropriate. llvm-svn: 191514	2013-09-27 12:17:32 +00:00
Andrea Di Biagio	a10165167b	Remove superfluous comment accidentally checked-in. llvm-svn: 191513	2013-09-27 12:13:58 +00:00
Daniel Sanders	0f009e6be5	[mips][msa] Added support for MSA registers to copyPhysReg llvm-svn: 191512	2013-09-27 12:03:51 +00:00
Daniel Sanders	8e7e5fd076	[mips][msa] Added support for matching splati from normal IR (i.e. not intrinsics) Updated some of the vshf since they (correctly) emit splati's now llvm-svn: 191511	2013-09-27 11:48:57 +00:00
Andrea Di Biagio	a96ff5eeac	Re-apply the change from r191393 with fix for pr17380. This change fixes the problem reported in pr17380 and re-add the dagcombine transformation ensuring that the value types are always legal if the transformation is triggered after Legalization took place. Added the test case from pr17380. llvm-svn: 191509	2013-09-27 11:37:05 +00:00
Tilmann Scheller	f2c27b743a	ARM: Teach assembler to enforce constraint for Thumb2 LDRD (literal/immediate) destination register operands. LDRD<c> <Rt>, <Rt2>, <label> LDRD<c> <Rt>, <Rt2>, [<Rn>{, #+/-<imm>}] LDRD<c> <Rt>, <Rt2>, [<Rn>], #+/-<imm> LDRD<c> <Rt>, <Rt2>, [<Rn>, #+/-<imm>]! As specified in A8.8.72/A8.8.73 in the ARM ARM, the T1 encoding has a constraint which enforces that Rt != Rt2. If this constraint is not met the result of executing the instruction will be unpredictable. Fixes rdar://14479780. llvm-svn: 191504	2013-09-27 10:30:18 +00:00
Daniel Sanders	d13fea547a	[mips][msa] MSA requires FR=1 mode (64-bit FPU register file). Report fatal error when using it in FR=0 mode. llvm-svn: 191498	2013-09-27 10:08:31 +00:00
Daniel Sanders	6a20248b3a	[mips][msa] Expand all truncstores and loadexts for MSA as well as DSP llvm-svn: 191496	2013-09-27 09:44:59 +00:00
Daniel Sanders	27836999cd	[mips][msa] Added missing check in performSRACombine Reviewers: jacksprat, dsanders Reviewed By: dsanders Differential Revision: http://llvm-reviews.chandlerc.com/D1755 llvm-svn: 191495	2013-09-27 09:25:29 +00:00
Yunzhong Gao	54d338bb6a	Fixing Intel format of the vshufpd instruction. Phabricator code review is located at: http://llvm-reviews.chandlerc.com/D1759 llvm-svn: 191481	2013-09-27 01:44:23 +00:00

... 3 4 5 6 7 ...

21405 Commits