llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Changpeng Fang	8727650a31	AMDGPU/SI: Use flat for global load/store when targeting HSA Summary: For some reason doing executing an MUBUF instruction with the addr64 bit set and a zero base pointer in the resource descriptor causes the memory operation to be dropped when the shader is executed using the HSA runtime. This kind of MUBUF instruction is commonly used when the pointer is stored in VGPRs. The base pointer field in the resource descriptor is set to zero and and the pointer is stored in the vaddr field. This patch resolves the issue by only using flat instructions for global memory operations when targeting HSA. This is an overly conservative fix as all other configurations of MUBUF instructions appear to work. NOTE: re-commit by fixing a failure in Codegen/AMDGPU/llvm.dbg.value.ll Reviewers: tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15543 llvm-svn: 256282	2015-12-22 20:55:23 +00:00
Rafael Espindola	067bfb9e99	Also add unnamed_addr to functions. llvm-svn: 256281	2015-12-22 20:43:30 +00:00
Akira Hatanaka	dfd76e927a	Revert r256277 and r256279. Some of the bots failed again. llvm-svn: 256280	2015-12-22 20:29:09 +00:00
Akira Hatanaka	f5b453640f	Add a .td file I forgot to add in r256277. llvm-svn: 256279	2015-12-22 20:06:50 +00:00
Akira Hatanaka	fa235f0243	Provide a way to specify inliner's attribute compatibility and merging. This reapplies r252990 and r252949. I've added member function getKind to the Attr classes which returns the enum or string of the attribute. Original commit message for r252949: Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 256277	2015-12-22 20:00:05 +00:00
Rafael Espindola	f795707790	Delete dead GlobalAliases. llvm-svn: 256276	2015-12-22 19:50:22 +00:00
Rafael Espindola	6880ff7f5d	Revert "AMDGPU/SI: Use flat for global load/store when targeting HSA" This reverts commit r256273. It broke CodeGen/AMDGPU/llvm.dbg.value.ll llvm-svn: 256275	2015-12-22 19:46:44 +00:00
Rafael Espindola	2af3ff098d	Merge duplicated code. The code for deleting dead global variables and functions was duplicated. This is in preparation for also deleting dead global aliases. llvm-svn: 256274	2015-12-22 19:38:07 +00:00
Changpeng Fang	36fd3caea4	AMDGPU/SI: Use flat for global load/store when targeting HSA Summary: For some reason doing executing an MUBUF instruction with the addr64 bit set and a zero base pointer in the resource descriptor causes the memory operation to be dropped when the shader is executed using the HSA runtime. This kind of MUBUF instruction is commonly used when the pointer is stored in VGPRs. The base pointer field in the resource descriptor is set to zero and and the pointer is stored in the vaddr field. This patch resolves the issue by only using flat instructions for global memory operations when targeting HSA. This is an overly conservative fix as all other configurations of MUBUF instructions appear to work. Reviewers: tstellarAMD Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15543 llvm-svn: 256273	2015-12-22 19:32:28 +00:00
Rafael Espindola	7f6f7bca5d	Use early continue to reduce indentation. llvm-svn: 256272	2015-12-22 19:26:18 +00:00
Rafael Espindola	611a6e336d	Simplify iterator management. NFC. Not passing an iterator to processGlobal will allow it to work with other GlobalValues. llvm-svn: 256271	2015-12-22 19:16:50 +00:00
Paul Robinson	70ef8b0ecf	Add advice on choosing reviewers llvm-svn: 256265	2015-12-22 18:59:02 +00:00
Cong Hou	50c405416c	[BPI] Replace weights by probabilities in BPI. This patch removes all weight-related interfaces from BPI and replace them by probability versions. With this patch, we won't use edge weight anymore in either IR or MC passes. Edge probabilitiy is a better representation in terms of CFG update and validation. Differential revision: http://reviews.llvm.org/D15519 llvm-svn: 256263	2015-12-22 18:56:14 +00:00
Manuel Jacob	3a4569b878	Remove deprecated llvm.experimental.gc.result.{int,float,ptr} intrinsics. Summary: These were deprecated 11 months ago when a generic llvm.experimental.gc.result intrinsic, which works for all types, was added. Reviewers: sanjoy, reames Subscribers: sanjoy, chenli, llvm-commits Differential Revision: http://reviews.llvm.org/D15719 llvm-svn: 256262	2015-12-22 18:44:45 +00:00
Vedant Kumar	4a1d86d7e2	[Support] Allow multiple paired calls to {start,stop}Timer() Differential Revision: http://reviews.llvm.org/D15619 Reviewed-by: rafael llvm-svn: 256258	2015-12-22 17:36:17 +00:00
Manuel Jacob	b94cce35d1	[RS4GC] Fix crash in the case that a live variable has a constant base. Summary: Previously, RS4GC crashed in CreateGCRelocates() because it assumed that every base is also in the array of live variables, which isn't true if a live variable has a constant base. This change fixes the crash by making sure CreateGCRelocates() won't try to relocate a live variable with a constant base. This would be unnecessary anyway because anything with a constant base won't move. Reviewers: reames Subscribers: llvm-commits, sanjoy Differential Revision: http://reviews.llvm.org/D15556 llvm-svn: 256252	2015-12-22 16:50:44 +00:00
Jun Bum Lim	3cdf780204	[AArch64] Promote loads from stored This is a recommit of r256004 which was reverted in r256160. The issue was the incorrect promotion for half and byte loads transformed into mov instructions. This fix will replace half and byte type loads only with bit field extracts. Original commit message: This change promotes load instructions which directly read from stored by replacing them with mov instructions. If the store is wider than the load, the load will be replaced with a bitfield extract. For example : STRWui %W1, %X0, 1 %W0 = LDRHHui %X0, 3 becomes STRWui %W1, %X0, 1 %W0 = UBFMWri %W1, 16, 31 llvm-svn: 256249	2015-12-22 16:36:16 +00:00
Chad Rosier	6822e0bdd8	Typo. NFC. llvm-svn: 256242	2015-12-22 15:06:47 +00:00
Asaf Badouh	d891bbfe44	[X86][AVX512] Add rcp14 and rsqrt14 intrinsics Differential Revision: http://reviews.llvm.org/D15414 llvm-svn: 256237	2015-12-22 11:40:04 +00:00
Keno Fischer	81cbfe48f4	[ASMPrinter] Fix missing handling of DW_OP_bit_piece In r256077, I added printing for DIExpressions in DEBUG_VALUE comments, but neglected to handle DW_OP_bit_piece operands. Thanks to Mikael Holmen and Joerg Sonnenberger for spotting this. llvm-svn: 256236	2015-12-22 07:14:50 +00:00
Kostya Serebryany	066e99fd12	[libFuzzer] add AFL-style dictionary for C++, remove the old file with tokens llvm-svn: 256229	2015-12-22 01:50:51 +00:00
David Majnemer	54b9760f86	[MC] Don't use the architecture to govern which object file format to use InitMCObjectFileInfo was trying to override the triple in awkward ways. For example, a triple specifying COFF but not Windows was forced as ELF. This makes it easy for internal invariants to get violated, such as those which triggered PR25912. This fixes PR25912. llvm-svn: 256226	2015-12-22 01:39:04 +00:00
Kostya Serebryany	e58a61dc83	Partial fix for PR25912, see comment 13. Should fix the sanitizer bootstrap bot llvm-svn: 256225	2015-12-22 01:18:49 +00:00
Teresa Johnson	4fc9fb1957	Handle empty Subprogram list when linking metadata. Use an iterator that handles an empty subprogram list. Fixes PR25915. llvm-svn: 256224	2015-12-22 01:17:19 +00:00
Easwaran Raman	66e5fa28c2	Determine callee's hotness and adjust threshold based on that. NFC. This uses the same criteria used in CFE's CodeGenPGO to identify hot and cold callees and uses values of inlinehint-threshold and inlinecold-threshold respectively as the thresholds for such callees. Differential Revision: http://reviews.llvm.org/D15245 llvm-svn: 256222	2015-12-22 00:32:35 +00:00
Evgeniy Stepanov	4c984e7582	[safestack] Add option for non-TLS unsafe stack pointer. This patch adds an option, -safe-stack-no-tls, for using normal storage instead of thread-local storage for the unsafe stack pointer. This can be useful when SafeStack is applied to an operating system kernel. http://reviews.llvm.org/D15673 Patch by Michael LeMay. llvm-svn: 256221	2015-12-22 00:13:11 +00:00
Xinliang David Li	899e22e96f	[PGO] Fix another comdat related issue for COFF The linker requires that a comdat section must be associated with a another comdat section that precedes it. This means the comdat section's name needs to use the profile name var's name. Patch tested by Johan Engelen. llvm-svn: 256220	2015-12-22 00:11:15 +00:00
Vedant Kumar	bd22e09f55	[Support] Timer: Use emplace_back() and range-based loops (NFC) llvm-svn: 256217	2015-12-21 23:41:38 +00:00
Vedant Kumar	e4167de423	[Support] Timer: simplify the init() method llvm-svn: 256215	2015-12-21 23:27:44 +00:00
Dylan McKay	9c39fe2bb2	[AVR] Added configuration file and machine function information class This commit adds the 'AVRMachineFunctionInfo' class, which simply stores basic properties about generated machine functions. llvm-svn: 256213	2015-12-21 23:13:15 +00:00
Eric Christopher	7774315fd8	Fix line endings after r256155. NFC. llvm-svn: 256211	2015-12-21 23:04:27 +00:00
Xinliang David Li	be295b9d9e	Fix test case comment (NFC) llvm-svn: 256206	2015-12-21 22:26:49 +00:00
Evgeniy Stepanov	7ed9f33690	[cfi] Fix LowerBitSets on 32-bit targets. This code attempts to truncate IntPtrTy to i32, which may be the same type. llvm-svn: 256205	2015-12-21 22:14:04 +00:00
David Majnemer	47d3d1e5ef	[MC, COFF] Support link /incremental conditionally Today, we always take into account the possibility that object files produced by MC may be consumed by an incremental linker. This results in us initialing fields which vary with time (TimeDateStamp) which harms hermetic builds (e.g. verifying a self-host went well) and produces sub-optimal code because we cannot assume anything about the relative position of functions within a section (call sites can get redirected through incremental linker thunks). Let's provide an MCTargetOption which controls this behavior so that we can disable this functionality if we know a-priori that the build will not rely on /incremental. llvm-svn: 256203	2015-12-21 22:09:27 +00:00
Jun Bum Lim	8b973fac5e	Enhance BranchProbabilityInfo::calcUnreachableHeuristics for InvokeInst This is recommit of r256028 with minor fixes in unittests: CodeGen/Mips/eh.ll CodeGen/Mips/insn-zero-size-bb.ll Original commit message: When identifying blocks post-dominated by an unreachable-terminated block in BranchProbabilityInfo, consider only the edge to the normal destination block if the terminator is InvokeInst and let calcInvokeHeuristics() decide edge weights for the InvokeInst. llvm-svn: 256202	2015-12-21 22:00:51 +00:00
Xinliang David Li	383d55359b	Resubmit r256193 with test fix: assertion failure analyzed llvm-svn: 256201	2015-12-21 21:52:27 +00:00
Xinliang David Li	05d5ed17a7	Revert r256193: build bot failure triggered llvm-svn: 256198	2015-12-21 21:00:33 +00:00
Cong Hou	ac19620238	[X86][SSE] Transform truncations between vectors of integers into X86ISD::PACKUS/PACKSS operations during DAG combine. This patch transforms truncation between vectors of integers into X86ISD::PACKUS/PACKSS operations during DAG combine. We don't do it in lowering phase because after type legalization, the original truncation will be turned into a BUILD_VECTOR with each element that is extracted from a vector and then truncated, and from them it is difficult to do this optimization. This greatly improves the performance of truncations on some specific types. Cost table is updated accordingly. Differential revision: http://reviews.llvm.org/D14588 llvm-svn: 256194	2015-12-21 20:42:43 +00:00
Xinliang David Li	43eab391e6	[PGO] Fix profile var comdat generation problem with COFF When targeting COFF, it is required that a comdat section to have a global obj with the same name as the comdat (except for comdats with select kind to be associative). This fix makes sure that the comdat is keyed on the data variable for COFF. Also improved test coverage for this. llvm-svn: 256193	2015-12-21 20:41:20 +00:00
Michael Zolotukhin	fa6d622c80	[ValueTracking] Properly handle non-sized types in isAligned function. Reviewers: apilipenko, reames, sanjoy, hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15597 llvm-svn: 256192	2015-12-21 20:38:18 +00:00
Adrian Prantl	062efd8b18	Fix PR24563 (LiveDebugVariables unconditionally propagates all DBG_VALUEs) LiveDebugVariables unconditionally propagates all DBG_VALUE down the dominator tree, which happens to work fine if there already is another DBG_VALUE or the DBG_VALUE happends to describe a single-assignment vreg but is otherwise wrong if the DBG_VALUE is coming from only one of the predecessors. In r255759 we introduced a proper data flow analysis scheduled after LiveDebugVariables that correctly propagates DBG_VALUEs across basic block boundaries. With the new pass in place, the incorrect propagation in LiveDebugVariables can be retired witout loosing any of the benefits where LiveDebugVariables happened to do the right thing. llvm-svn: 256188	2015-12-21 20:03:00 +00:00
Adrian Prantl	cab6697eb0	Convert the CodeGen/ARM/sched-it-debug-nodes.ll testcase from IR -> MIR. NFC PR24563 llvm-svn: 256187	2015-12-21 19:44:42 +00:00
Adrian Prantl	1f94dd1efc	Teach ARMLoadStoreOptimizer to ignore DBG_VALUE instructions when merging instructions. As noted in PR24563. rdar://problem/23963293 llvm-svn: 256183	2015-12-21 19:25:03 +00:00
Kostya Serebryany	8270e2df22	fix leak in a test, make the sanitizer bot green llvm-svn: 256179	2015-12-21 19:09:01 +00:00
Tom Stellard	e81c016153	AMDGPU/SI: Fix encoding for FLAT_SCRATCH registers on VI Summary: These register has different encodings on CI and VI, so we add pseudo FLAT_SCRACTH registers to be used before MC, and subtarget specific registers to be used by the MC layer. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15661 llvm-svn: 256178	2015-12-21 18:44:27 +00:00
Tom Stellard	e5c23ab1eb	AMDGPU/SI: Change assembly name for flat scratch registers to flat_scratch This matches what the assembler accepts. llvm-svn: 256177	2015-12-21 18:44:21 +00:00
Matthew Simpson	f8053302ae	[AArch64] Add additional extract-extend patterns for smov This patch adds to the target description two additional patterns for matching extract-extend operations to SMOV. The patterns catch the v16i8-to-i64 and v8i16-to-i64 cases. The existing patterns miss these cases because the extracted elements must first be legalized to i32, resulting in any_extend nodes. This was originally implemented as a DAG combine (r255895), but was reverted due to failing out-of-tree tests. llvm-svn: 256176	2015-12-21 18:31:25 +00:00
Teresa Johnson	34957cd924	Add testcase for r256161 (PR25907) llvm-svn: 256174	2015-12-21 18:24:35 +00:00
Chad Rosier	609c3edd2d	Remove extra whitespace. NFC. llvm-svn: 256173	2015-12-21 18:08:05 +00:00
Teresa Johnson	c59bb76cad	[ThinLTO] Rename variable to reflect bulk importing change (NFC) llvm-svn: 256171	2015-12-21 17:33:24 +00:00

... 2 3 4 5 6 ...

125463 Commits