llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 19:42:54 +02:00

Author	SHA1	Message	Date
Lang Hames	1bc9d3795a	[MCJIT] [AArch64] Make sure to propegate ARM64_RELOC_ADDEND values into the RelocationEntry. No test case yet, as this primarily hits GOT entries, which RuntimeDyldChecker can't examine yet. I'm actively working on features that will enable us to test this. llvm-svn: 213408	2014-07-18 20:29:36 +00:00
Eric Christopher	62ccb2f8d9	Make non-module passes unconditionally added in the pass manager for mips, and early exit if we don't want to do anything because of the current subtarget. llvm-svn: 213407	2014-07-18 20:29:02 +00:00
Eli Bendersky	71f651043f	Add tests for atomic adds on floats. llvm-svn: 213406	2014-07-18 20:11:26 +00:00
Tyler Nowicki	9f432ebc6e	Rename DiagnosticInfoOptimizationWarning to DiagnosticInfoOptimizationFailure so the severity of the message is not part of the type name. Reviewed by Alp Toker llvm-svn: 213399	2014-07-18 19:36:04 +00:00
Eli Bendersky	c9ff12e4fa	Use CHECK-LABEL where appropriate in this test. llvm-svn: 213398	2014-07-18 19:32:09 +00:00
Mark Heffernan	ffff58c308	Add loop unrolling metadata descriptions to docs/LangRef.rst. llvm-svn: 213397	2014-07-18 19:24:51 +00:00
Gerolf Hoflehner	5fa7774dfd	MergedLoadStoreMotion pass Merges equivalent loads on both sides of a hammock/diamond and hoists into into the header. Merges equivalent stores on both sides of a hammock/diamond and sinks it to the footer. Can enable if conversion and tolerate better load misses and store operand latencies. llvm-svn: 213396	2014-07-18 19:13:09 +00:00
David Blaikie	5f06450187	Reapply "DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself.""" Recommits 212776 which was reverted in r212793. This has been committed and recommitted a few times as I try to test it harder and find/fix more issues. The most recent revert was due to an asan bot failure which I can't seem to reproduce locally, though I believe I'm following all the steps the buildbot does. So I'm going to recommit this in the hopes of investigating the failure on the buildbot itself... apologies in advance for the bot noise. If anyone sees failures with this /please/ provide me with any reproductions, etc. llvm-svn: 213391	2014-07-18 17:49:10 +00:00
David Peixotto	e3bc4f6f83	Fix build failure on windows Add explicit constructor to struct instead of using brace initialization. llvm-svn: 213389	2014-07-18 16:41:58 +00:00
David Peixotto	569d73691e	MC: support different sized constants in constant pools On AArch64 the pseudo instruction ldr <reg>, =... supports both 32-bit and 64-bit constants. Add support for 64 bit constants for the pools to support the pseudo instruction fully. Changes the AArch64 ldr-pseudo tests to use 32-bit registers and adds tests with 64-bit registers. Patch by Janne Grunau! Differential Revision: http://reviews.llvm.org/D4279 llvm-svn: 213387	2014-07-18 16:05:14 +00:00
Hal Finkel	000be1bc2f	Add a dereferenceable attribute This attribute indicates that the parameter or return pointer is dereferenceable. Practically speaking, loads from such a pointer within the associated byte range are safe to speculatively execute. Such pointer parameters are common in source languages (C++ references, for example). llvm-svn: 213385	2014-07-18 15:51:28 +00:00
Daniel Sanders	b7f4e6b616	Add MIPS Technologies to the vendors in llvm::Triple. This is a prerequisite for checking for 'mti' and 'img' in a consistent way in clang. Previously 'img' could use Triple::getVendor() but 'mti' could only use Triple::getVendorName(). llvm-svn: 213381	2014-07-18 14:28:19 +00:00
Tim Northover	be8b73df94	AArch64: implement efficient f16 bitcasts Because i16 is illegal, there's no native DAG method to represent a bitcast to or from an f16 type. This meant LLVM was inserting a stack store/load pair which is really not ideal. llvm-svn: 213378	2014-07-18 13:07:05 +00:00
Tim Northover	0380ef9c00	NVPTX: support fpext/fptrunc to and from f16. llvm-svn: 213377	2014-07-18 13:01:43 +00:00
Tim Northover	854fe649af	R600: support fpext/fptrunc operations to and from f16. llvm-svn: 213376	2014-07-18 13:01:37 +00:00
Tim Northover	1b266803fb	AArch64: support f16 extend/trunc operations. llvm-svn: 213375	2014-07-18 13:01:31 +00:00
Tim Northover	07cb1b71c9	X86: support fpext/fptrunc operations to and from 16-bit floats. llvm-svn: 213374	2014-07-18 13:01:25 +00:00
Tim Northover	25c770b7c4	ARM: support legalisation of "fptrunc ... to half" operations. llvm-svn: 213373	2014-07-18 13:01:19 +00:00
Tim Northover	e4c93c0798	CodeGen: soften f16 type by default instead of marking legal. Actual support for softening f16 operations is still limited, and can be added when it's needed. But Soften is much closer to being a useful thing to try than keeping it Legal when no registers can actually hold such values. Longer term, we probably want something between Soften and Promote semantics for most targets, it'll be more efficient to promote the 4 basic operations to f32 than libcall them. llvm-svn: 213372	2014-07-18 12:41:46 +00:00
Renato Golin	af1eb026d1	Suppress 'not handled in switch' warning llvm-svn: 213371	2014-07-18 12:13:04 +00:00
Tilmann Scheller	d04d5fde2e	[ARM] Add earlyclobber constraint to pre/post-indexed ARM STR instructions. The post-indexed instructions were missing the constraint, causing unpredictable STR instructions to be emitted. The earlyclobber constraint on the pre-indexed STR instructions is not strictly necessary, as the instruction selection for pre-indexed STR instructions goes through an additional layer of pseudo instructions which have the constraint defined, however it doesn't hurt to specify the constraint directly on the pre-indexed instructions as well, since at some point someone might create instances of them programmatically and then the constraint is definitely needed. This fixes PR20323. llvm-svn: 213369	2014-07-18 12:05:49 +00:00
Renato Golin	05cd608eea	Refactor ARM subarchitecture parsing Re-commit of a patch to rework the triple parsing on ARM to a more sane model. Patch by Gabor Ballabas. llvm-svn: 213367	2014-07-18 12:00:48 +00:00
Artyom Skrobov	140aab7a40	extracting swapStruct into include/llvm/Support/MachO.h (no functional change) llvm-svn: 213361	2014-07-18 09:26:16 +00:00
Tim Northover	62ad6904d9	R600: rename misleading fp16 test. This test is actually going in the opposite direction to what the filename and function name suggested. llvm-svn: 213358	2014-07-18 08:43:30 +00:00
Tim Northover	de7867151d	R600: support f16 -> f64 conversion intrinsic. Unfortunately, we don't seem to have a direct truncation, but the extension can be legally split into two operations so we should support that. llvm-svn: 213357	2014-07-18 08:43:24 +00:00
Tim Northover	86458323c0	NVPTX: support direct f16 <-> f64 conversions via intrinsics. Clang may well start emitting these soon, and while it may not be directly relevant for OpenCL or GLSL, the instructions were just sitting there waiting to be used. llvm-svn: 213356	2014-07-18 08:30:10 +00:00
Hal Finkel	2587ff3060	Rename AlignAttribute to IntAttribute Currently the only kind of integer IR attributes that we have are alignment attributes, and so the attribute kind that takes an integer parameter is called AlignAttr, but that will change (we'll soon be adding a dereferenceable attribute that also takes an integer value). Accordingly, rename AlignAttribute to IntAttribute (class names, enums, etc.). No functionality change intended. llvm-svn: 213352	2014-07-18 06:51:55 +00:00
Matt Arsenault	b71d4aab48	R600: Implement TTI:getPopcntSupport The test is just copied from X86, and I don't know of a better way to test it. llvm-svn: 213351	2014-07-18 06:07:13 +00:00
Jim Grosbach	7a17678ea4	X86: Constant fold converting vector setcc results to float. Since the result of a SETCC for X86 is 0 or -1 in each lane, we can move unary operations, in this case [su]int_to_fp through the mask operation and constant fold the operation away. Generally speaking: UNARYOP(AND(VECTOR_CMP(x,y), constant)) --> AND(VECTOR_CMP(x,y), constant2) where constant2 is UNARYOP(constant). This implements the transform where UNARYOP is [su]int_to_fp. For example, consider the simple function: define <4 x float> @foo(<4 x float> %val, <4 x float> %test) nounwind { %cmp = fcmp oeq <4 x float> %val, %test %ext = zext <4 x i1> %cmp to <4 x i32> %result = sitofp <4 x i32> %ext to <4 x float> ret <4 x float> %result } Before this change, the SSE code is generated as: LCPI0_0: .long 1 ## 0x1 .long 1 ## 0x1 .long 1 ## 0x1 .long 1 ## 0x1 .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo cmpeqps %xmm1, %xmm0 andps LCPI0_0(%rip), %xmm0 cvtdq2ps %xmm0, %xmm0 retq After, the code is improved to: LCPI0_0: .long 1065353216 ## float 1.000000e+00 .long 1065353216 ## float 1.000000e+00 .long 1065353216 ## float 1.000000e+00 .long 1065353216 ## float 1.000000e+00 .section __TEXT,__text,regular,pure_instructions .globl _foo .align 4, 0x90 _foo: ## @foo cmpeqps %xmm1, %xmm0 andps LCPI0_0(%rip), %xmm0 retq The cvtdq2ps has been constant folded away and the floating point 1.0f vector lanes are materialized directly via the ModRM operand of andps. llvm-svn: 213342	2014-07-18 00:40:56 +00:00
Jim Grosbach	2f665f1cd7	AArch64: Constant fold converting vector setcc results to float. Since the result of a SETCC for AArch64 is 0 or -1 in each lane, we can move unary operations, in this case [su]int_to_fp through the mask operation and constant fold the operation away. Generally speaking: UNARYOP(AND(VECTOR_CMP(x,y), constant)) --> AND(VECTOR_CMP(x,y), constant2) where constant2 is UNARYOP(constant). This implements the transform where UNARYOP is [su]int_to_fp. For example, consider the simple function: define <4 x float> @foo(<4 x float> %val, <4 x float> %test) nounwind { %cmp = fcmp oeq <4 x float> %val, %test %ext = zext <4 x i1> %cmp to <4 x i32> %result = sitofp <4 x i32> %ext to <4 x float> ret <4 x float> %result } Before this change, the code is generated as: fcmeq.4s v0, v0, v1 movi.4s v1, #0x1 // Integer splat value. and.16b v0, v0, v1 // Mask lanes based on the comparison. scvtf.4s v0, v0 // Convert each lane to f32. ret After, the code is improved to: fcmeq.4s v0, v0, v1 fmov.4s v1, #1.00000000 // f32 splat value. and.16b v0, v0, v1 // Mask lanes based on the comparison. ret The svvtf.4s has been constant folded away and the floating point 1.0f vector lanes are materialized directly via fmov.4s. Rather than do the folding manually in the target code, teach getNode() in the generic SelectionDAG to handle folding constant operands of vector [su]int_to_fp nodes. It is reasonable (as noted in a FIXME) to do additional constant folding there as well, but I don't have test cases for those operations, so leaving them for another time when it becomes appropriate. rdar://17693791 llvm-svn: 213341	2014-07-18 00:40:52 +00:00
Michael J. Spencer	ff351b2bdb	Revert "[x86] Fold extract_vector_elt of a load into the Load's address computation." There's a bug where this can create cycles in the DAG. It will take a bit to fix, so I'm backing it out for now. llvm-svn: 213339	2014-07-18 00:15:50 +00:00
Eric Christopher	1ff1279d6f	Reset the Subtarget in the AsmPrinter for each machine function and add explanatory comment about dual initialization. Fix use of the Subtarget to grab the information off of the target machine. llvm-svn: 213336	2014-07-18 00:08:53 +00:00
Eric Christopher	29cda30f93	Avoid resetting the UseSoftFloat and FloatABIType on the TargetMachine Options struct and move the comment to inMips16HardFloat. Use the fact that we now know whether or not we cared about soft float to set the libcalls. Accordingly rename mipsSEUsesSoftFloat to abiUsesSoftFloat and propagate since it's no longer CPU specific. llvm-svn: 213335	2014-07-18 00:08:50 +00:00
Lang Hames	b9404be62f	[MCJIT] Fix the alignment requirements for ARM and AArch64 which were mistakenly relaxed in the big RuntimeDyldMachO cleanup of r213293. No test case yet - this was found via inspection and there's no easy way to test GOT alignment in RuntimeDyldChecker at the moment. I'm working on adding support for this now, and hope to have a test case for this soon. llvm-svn: 213331	2014-07-17 23:11:30 +00:00
Kevin Enderby	0d0ec178f4	Tweak formating to match what clang-format would be for llvm-nm.cpp . No functional change. llvm-svn: 213330	2014-07-17 22:56:27 +00:00
Kevin Enderby	abaf0d20c2	Add printing of Mach-O stabs in llvm-nm. llvm-svn: 213327	2014-07-17 22:47:16 +00:00
Reid Kleckner	cabffe9de0	Remove rules against std::function from the programmer's manual Clarify that llvm::function_ref is like StringRef for callables. llvm-svn: 213326	2014-07-17 22:43:00 +00:00
Nico Weber	5a62ce559b	ms inline asm: Don't add x86 segment registers to the clobber list. Clang tries to check the clobber list but doesn't list segment registers in its x86 register list. This fixes PR20343. llvm-svn: 213303	2014-07-17 20:24:55 +00:00
Lang Hames	fa39401520	Make myself code owner of MCJIT. llvm-svn: 213302	2014-07-17 20:23:31 +00:00
Alp Toker	1bfa029bc4	Drop the udis86 wrapper from llvm::sys This optional dependency on the udis86 library was added some time back to aid JIT development, but doesn't make much sense to link into LLVM binaries these days. llvm-svn: 213300	2014-07-17 20:05:29 +00:00
Reid Kleckner	a0033713ef	TableGen: Add 'static' to a large array to avoid a huge stack allocation Speculative fix for a -Wframe-larger-than warning from gcc. Clang will implicitly promote such constant arrays to globals, so in theory it won't hit this. llvm-svn: 213298	2014-07-17 19:43:40 +00:00
Arnaud A. de Grandmaison	32cb6202b9	[AArch64] Cleanup AsmParser: no need to use dyn_cast + assert. cast does it for us. llvm-svn: 213296	2014-07-17 19:08:14 +00:00
Suyog Sarda	7302ccd41e	Rectify r213231. Use proper version of 'ComputeNumSignBits'. Earlier when the code was in InstCombine, we were calling the version of ComputeNumSignBits in InstCombine.h that automatically added the DataLayout* before calling into ValueTracking. When the code moved to InstSimplify, we are calling into ValueTracking directly without passing in the DataLayout*. This patch rectifies the same by passing DataLayout in ComputeNumSignBits. llvm-svn: 213295	2014-07-17 19:07:00 +00:00
Lang Hames	8eaed621d6	[MCJIT] Significantly refactor the RuntimeDyldMachO class. The previous implementation of RuntimeDyldMachO mixed logic for all targets within a single class, creating problems for readability, maintainability, and performance. To address these issues, this patch strips the RuntimeDyldMachO class down to just target-independent functionality, and moves all target-specific functionality into target-specific subclasses RuntimeDyldMachO. The new class hierarchy is as follows: class RuntimeDyldMachO Implemented in RuntimeDyldMachO.{h,cpp} Contains logic that is completely independent of the target. This consists mostly of MachO helper utilities which the derived classes use to get their work done. template <typename Impl> class RuntimeDyldMachOCRTPBase<Impl> : public RuntimeDyldMachO Implemented in RuntimeDyldMachO.h Contains generic MachO algorithms/data structures that defer to the Impl class for target-specific behaviors. RuntimeDyldMachOARM : public RuntimeDyldMachOCRTPBase<RuntimeDyldMachOARM> RuntimeDyldMachOARM64 : public RuntimeDyldMachOCRTPBase<RuntimeDyldMachOARM64> RuntimeDyldMachOI386 : public RuntimeDyldMachOCRTPBase<RuntimeDyldMachOI386> RuntimeDyldMachOX86_64 : public RuntimeDyldMachOCRTPBase<RuntimeDyldMachOX86_64> Implemented in their respective *.h files in lib/ExecutionEngine/RuntimeDyld/MachOTargets Each of these contains the relocation logic specific to their target architecture. llvm-svn: 213293	2014-07-17 18:54:50 +00:00
Alexey Samsonov	7fb0be9dd0	[ASan] Don't instrument load/stores with !nosanitize metadata. This is used to avoid instrumentation of instructions added by UBSan in Clang frontend (see r213291). This fixes PR20085. Reviewed in http://reviews.llvm.org/D4544. llvm-svn: 213292	2014-07-17 18:48:12 +00:00
Hans Wennborg	ebf51704c5	Typo: exists -> exits llvm-svn: 213290	2014-07-17 18:33:44 +00:00
Justin Holewinski	3021ef095b	[NVPTX] Improve handling of FP fusion We now consider the FPOpFusion flag when determining whether to fuse ops. We also explicitly emit add.rn when fusion is disabled to prevent ptxas from fusing the operations on its own. llvm-svn: 213287	2014-07-17 18:10:09 +00:00
Matt Arsenault	0968995f30	Fix typos llvm-svn: 213285	2014-07-17 17:50:22 +00:00
Zinovy Nis	c10a269e5f	[BUG] Due to a typo introduced in r199933 and r200027 two tests for FMA were never even started. llvm-svn: 213283	2014-07-17 17:14:35 +00:00
Adam Nemet	09fcf8939c	[X86] AVX512: Add disassembler support for compressed displacement There are two parts here. First is to modify tablegen to adjust the encoding type ENCODING_RM with the scaling factor. The second is to use the new encoding types to compute the correct displacement in the decoder. Fixes <rdar://problem/17608489> llvm-svn: 213281	2014-07-17 17:04:56 +00:00

1 2 3 4 5 ...

105706 Commits