llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 21:13:02 +02:00

Author	SHA1	Message	Date
Matt Arsenault	a418419fae	AMDGPU: Use SGPR_64 for argument lowerings llvm-svn: 288190	2016-11-29 19:39:48 +00:00
Geoff Berry	980b0ef812	[LiveRangeEdit] Handle instructions with no defs correctly. Summary: The code in LiveRangeEdit::eliminateDeadDef() that computes isOrigDef doesn't handle instructions in which operand 0 is not a def (e.g. KILL) correctly. Add a check that operand 0 is a def before doing the rest of the isOrigDef computation. Reviewers: qcolombet, MatzeB, wmi Subscribers: mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D27174 llvm-svn: 288189	2016-11-29 19:31:35 +00:00
Matt Arsenault	c97932df14	AMDGPU: Rename flat operands to match mubuf Use vaddr/vdst for the same purposes. This also fixes a beg in SIInsertWaits for the operand check. The stored value operand is currently called data0 in the single offset case, not data. llvm-svn: 288188	2016-11-29 19:30:44 +00:00
Matt Arsenault	75513f4636	AMDGPU: Use else if llvm-svn: 288187	2016-11-29 19:30:41 +00:00
Matt Arsenault	3c5076a42d	AMDGPU: Materialize frame index before add It isn't generally safe to fold the frame index directly into the operand since it will possibly not be an inline immediate after it is expanded. This surprisingly seems to produce better code, since the FI doesn't prevent folding other immediate operands. llvm-svn: 288185	2016-11-29 19:20:48 +00:00
Matt Arsenault	a89faeec9c	AMDGPU: Refactor immediate folding logic Change the logic for when to fold immediates to consider the destination operand rather than the source of the materializing mov instruction. No change yet, but this will allow for correctly handling i16/f16 operands. Since 32-bit moves are used to materialize constants for these, the same bitvalue will not be in the register. llvm-svn: 288184	2016-11-29 19:20:42 +00:00
Sanjay Patel	223fb45a69	[AArch64] add tests for bics; NFC llvm-svn: 288183	2016-11-29 19:15:27 +00:00
Sanjay Patel	e0bd5a33fc	[AArch64] add tests to show select transforms; NFC llvm-svn: 288180	2016-11-29 18:35:04 +00:00
Adam Nemet	aa8eea6427	Revert "[GVN] Basic optimization remark support" This reverts commit r288046. Trying to see if the revert fixes a compiler crash during a stage2 LTO build with a GVN backtrace. llvm-svn: 288179	2016-11-29 18:32:04 +00:00
Adam Nemet	dca038fdfb	Revert "[GVN, OptDiag] Include the value that is forwarded in load elimination" This reverts commit r288047. Trying to see if the revert fixes a compiler crash during a stage2 LTO build with a GVN backtrace. llvm-svn: 288178	2016-11-29 18:32:00 +00:00
Adam Nemet	ee7ff0a4cd	Revert "[GVN, OptDiag] Print the interesting instructions involved in missed load-elimination" This reverts commit r288090. Trying to see if the revert fixes a compiler crash during a stage2 LTO build with a GVN backtrace. llvm-svn: 288177	2016-11-29 18:31:53 +00:00
Geoff Berry	5ed377ecc1	[AArch64] Fold spills of COPY of WZR/XZR Summary: In AArch64InstrInfo::foldMemoryOperandImpl, catch more cases where the COPY being spilled is copying from WZR/XZR, but the source register is not in the COPY destination register's regclass. For example, when spilling: %vreg0 = COPY %XZR ; %vreg0:GPR64common without this change, the code in TargetInstrInfo::foldMemoryOperand() and canFoldCopy() that normally handles cases like this would fail to optimize since %XZR is not in GPR64common. So the spill code generated would be: %vreg0 = COPY %XZR STR %vreg instead of the new code generated: STR %XZR Reviewers: qcolombet, MatzeB Subscribers: mcrosier, aemerson, t.p.northover, llvm-commits, rengolin Differential Revision: https://reviews.llvm.org/D26976 llvm-svn: 288176	2016-11-29 18:28:32 +00:00
Mehdi Amini	6e479ee202	[docs] Typos and whitespace fixed in LTO docs. While reading the LTO docs I fixed few small typos and whitespace issues. Patch by: Jonas Devlieghere <jonas@devlieghere.com> Differential Revision: https://reviews.llvm.org/D27196 llvm-svn: 288171	2016-11-29 18:00:31 +00:00
Simon Pilgrim	7d5e5aa9e9	Avoid repeated calls to MVT getSizeInBits and getScalarSizeInBits(). NFCI. llvm-svn: 288170	2016-11-29 17:57:48 +00:00
NAKAMURA Takumi	fd2258bf45	Suppress abi-breaking.h on cygming, for now. FIXME: Implement checks without weak for them. llvm-svn: 288168	2016-11-29 17:32:58 +00:00
NAKAMURA Takumi	3b9d01fa78	Fix a linefeed at eof. llvm-svn: 288167	2016-11-29 17:32:43 +00:00
Artur Pilipenko	1f94190a69	[CVP] Remove use of removed flag (-cvp-dont-process-adds) from the test The flag was removed by 288154 llvm-svn: 288161	2016-11-29 16:43:30 +00:00
Artur Pilipenko	edf365ba79	[CVP] Remove cvp-dont-process-adds flag The flag was introduced because the optimization controlled by the flag initially caused regressions. All the regressions were fixed some time ago and the flag has been false for quite a while. llvm-svn: 288154	2016-11-29 16:24:57 +00:00
Nemanja Ivanovic	325b871295	[PowerPC] Improvements for BUILD_VECTOR Vol. 1 This patch corresponds to review: https://reviews.llvm.org/D25912 This is the first patch in a series of 4 that improve the lowering and combining for BUILD_VECTOR nodes on PowerPC. llvm-svn: 288152	2016-11-29 16:11:34 +00:00
Alexey Bataev	a4bfd1d3ea	[SLP] Add a new test for tree vectorization starting from insertelement instruction. llvm-svn: 288148	2016-11-29 15:37:52 +00:00
Simon Pilgrim	f1dcbdb14d	[X86] Moved getTargetConstantFromNode function so a future patch is more understandable. NFCI. llvm-svn: 288147	2016-11-29 15:32:58 +00:00
Aditya Kumar	4982e25469	[GVNHoist] Rename variables. Differential Revision: https://reviews.llvm.org/D27110 llvm-svn: 288142	2016-11-29 14:36:27 +00:00
Aditya Kumar	830876ec2a	[GVNHoist] Enable aggressive hoisting when optimizing for code-size Enable scalar hoisting at -Oz as it is safe to hoist scalars to a place where they are partially needed. Differential Revision: https://reviews.llvm.org/D27111 llvm-svn: 288141	2016-11-29 14:34:01 +00:00
Simon Pilgrim	b228a39a42	[X86][SSE] Add initial support for combining target shuffles to (V)PMOVZX. We can only handle 128-bit vectors until we support target shuffle inputs of different size to the output. llvm-svn: 288140	2016-11-29 14:18:51 +00:00
Simon Pilgrim	b9fd0b9690	Avoid repeated calls to MVT::getScalarSizeInBits(). NFCI. llvm-svn: 288138	2016-11-29 13:43:08 +00:00
Simon Pilgrim	190dd3eff7	[X86][SSE] Added tests showing missed combines to (V)PMOVZX llvm-svn: 288136	2016-11-29 13:16:11 +00:00
Chandler Carruth	2001edc7fd	[PM] Fix a bad invalid densemap iterator bug in the new invalidation logic. Yup, the invalidation logic has an invalid iterator bug. Can't make this stuff up. We can recursively insert things into the map so we can't cache the iterator into that map across those recursive calls. We did this differently in two places. I have an end-to-end test that triggers at least one of them. I'm going to work on a nice minimal test case that triggers these, but I didn't want to leave the bug in the tree while I tried to trigger it. Also, the dense map iterator checking stuff we have now is awesome. =D llvm-svn: 288135	2016-11-29 12:54:34 +00:00
Malcolm Parsons	2eb7118041	[StringRef] Use default member initializers and = default. Summary: This makes the default constructor implicitly constexpr and noexcept. Reviewers: zturner, beanz Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D27094 llvm-svn: 288131	2016-11-29 10:53:18 +00:00
Alexey Bataev	62124599cb	[SLPVectorizer] Improved support of partial tree vectorization. Currently SLP vectorizer tries to vectorize a binary operation and dies immediately after unsuccessful the first unsuccessfull attempt. Patch tries to improve the situation, trying to vectorize all binary operations of all children nodes in the binop tree. Differential Revision: https://reviews.llvm.org/D25517 llvm-svn: 288115	2016-11-29 08:21:14 +00:00
Warren Ristow	61eb446384	Test commit. Comment changes. NFC. llvm-svn: 288100	2016-11-29 02:37:13 +00:00
Peter Collingbourne	73ec8f79de	Bitcode: Change expected layout of module blocks. We now expect each module's identification block to appear immediately before the module block. Any module block that appears without an identification block immediately before it is interpreted as if it does not have a module block. Also change the interpretation of VST and function offsets in bitcode. The offset is always taken as relative to the start of the identification (or module if not present) block, minus one word. This corresponds to the historical interpretation of offsets, i.e. relative to the start of the file. These changes allow for bitcode modules to be concatenated by copying bytes. Differential Revision: https://reviews.llvm.org/D27184 llvm-svn: 288098	2016-11-29 02:27:04 +00:00
Reid Kleckner	f93801b9e8	[asan/win] Align global registration metadata to its size This way, when the linker adds padding between globals, we can skip over the zero padding bytes and reliably find the start of the next metadata global. llvm-svn: 288096	2016-11-29 01:32:21 +00:00
Tom Stellard	4f879c1dc0	AMDGPU/SI: Avoid moving PHIs to VALU when phi values are defined in scalar branches Reviewers: arsenm Subscribers: arsenm, llvm-commits, kzhuravl Differential Revision: https://reviews.llvm.org/D23417 llvm-svn: 288095	2016-11-29 00:46:46 +00:00
Reid Kleckner	fb316ff4a3	Recognize ${:uid} escapes in intel syntax inline asm It looks like this logic was duplicated long ago and the GCC side of things has grown additional functionality. We need ${:uid} at least to generate unique MS inline asm labels (PR23715), so expose these. llvm-svn: 288092	2016-11-29 00:29:27 +00:00
Adam Nemet	cc71c27f30	[GVN, OptDiag] Print the interesting instructions involved in missed load-elimination This includes the intervening store and the load/store that we're trying to forward from in the optimization remark for the missed load elimination. This is hooked up under a new mode in ORE that allows for compile-time budget for a bit more analysis to print more insightful messages. This mode is currently enabled for -fsave-optimization-record (-Rpass is trickier since it is controlled in the front-end). With this we can now print the red remark in http://lab.llvm.org:8080/artifacts/opt-view_test-suite/build/SingleSource/Benchmarks/Dhrystone/CMakeFiles/dry.dir/html/_org_test-suite_SingleSource_Benchmarks_Dhrystone_dry.c.html#L446 Differential Revision: https://reviews.llvm.org/D26490 llvm-svn: 288090	2016-11-29 00:09:22 +00:00
Sanjay Patel	bca0e42bb8	[DAG] clean up foldSelectCCToShiftAnd(); NFCI llvm-svn: 288088	2016-11-28 23:05:55 +00:00
Mehdi Amini	c518a52036	Put ABI breaking test in Error checking behind LLVM_ENABLE_ABI_BREAKING_CHECKS This macro is supposed to be the one controlling the compatibility of ABI breaks induced when enabling or disabling assertions in LLVM. The macro is enabled by default in assertions build, so this commit won't disable the tests. Differential Revision: https://reviews.llvm.org/D26700 llvm-svn: 288087	2016-11-28 22:57:11 +00:00
Kevin Enderby	e2cc943705	Add error checking for Mach-O universal files. Add the checking for both the MachO::fat_header and the MachO::fat_arch struct values in the constructor for MachOUniversalBinary. Such that when the constructor for ObjectForArch is called it can assume the values in the MachO::fat_arch for the offset and size are contained in the file after the MachOUniversalBinary constructor is called for the Parent. llvm-svn: 288084	2016-11-28 22:40:50 +00:00
Mehdi Amini	cfa184f7d2	Add link-time detection of LLVM_ABI_BREAKING_CHECKS mismatch The macro LLVM_ENABLE_ABI_BREAKING_CHECKS is moved to a new header abi-breaking.h, from llvm-config.h. Only headers that are using the macro are including this new header. LLVM will define a symbol, either EnableABIBreakingChecks or DisableABIBreakingChecks depending on the configuration setting for LLVM_ABI_BREAKING_CHECKS. The abi-breaking.h header will add weak references to these symbols in every clients that includes this header. This should ensure that a mismatch triggers a link failure (or a load time failure for DSO). On MSVC, the pragma "detect_mismatch" is used instead. Differential Revision: https://reviews.llvm.org/D26876 llvm-svn: 288082	2016-11-28 22:23:53 +00:00
Chandler Carruth	84780666b4	[PM] Extend the explicit 'invalidate' method API on analysis results to accept an Invalidator that allows them to invalidate themselves if their dependencies are in turn invalidated. Rather than recording the dependency graph ahead of time when analysis get results from other analyses, this simply lets each result trigger the immediate invalidation of any analyses they actually depend on. They do this in a way that has three nice properties: 1) They don't have to handle transitive dependencies because the infrastructure will recurse for them. 2) The invalidate methods are still called only once. We just dynamically discover the necessary topological ordering, everything is memoized nicely. 3) The infrastructure still provides a default implementation and can access it so that only analyses which have dependencies need to do anything custom. To make this work at all, the invalidation logic also has to defer the deletion of the result objects themselves so that they can remain alive until we have collected the complete set of results to invalidate. A unittest is added here that has exactly the dependency pattern we are concerned with. It hit the use-after-free described by Sean in much detail in the long thread about analysis invalidation before this change, and even in an intermediate form of this change where we failed to defer the deletion of the result objects. There is an important problem with doing dependency invalidation that isn't solved here: we don't enforce that results correctly invalidate all the analyses whose results they depend on. I actually looked at what it would take to do that, and it isn't as hard as I had thought but the complexity it introduces seems very likely to outweigh the benefit. The technique would be to provide a base class for an analysis result that would be populated with other results, and automatically provide the invalidate method which immediately does the correct thing. This approach has some nice pros IMO: - Handles the case we care about and nothing else: only results that depend on other analyses trigger extra invalidation. - Localized to the result rather than centralized in the analysis manager. - Ties the storage of the reference to another result to the triggering of the invalidation of that analysis. - Still supports extending invalidation in customized ways. But the down sides here are: - Very heavy-weight meta-programming is needed to provide this base class. - Requires a pretty awful API for accessing the dependencies. Ultimately, I fear it will not pull its weight. But we can re-evaluate this at any point if we start discovering consistent problems where the invalidation and dependencies get out of sync. It will fit as a clean layer on top of the facilities in this patch that we can add if and when we need it. Note that I'm not really thrilled with the names for these APIs... The name "Invalidator" seems ok but not great. The method name "invalidate" also. In review some improvements were suggested, but they really need other uses of these terms to be updated as well so I'm going to do that in a follow-up commit. I'm working on the actual fixes to various analyses that need to use these, but I want to try to get tests for each of them so we don't regress. And those changes are seperable and obvious so once this goes in I should be able to roll them out throughout LLVM. Many thanks to Sean, Justin, and others for help reviewing here. Differential Revision: https://reviews.llvm.org/D23738 llvm-svn: 288077	2016-11-28 22:04:31 +00:00
Peter Collingbourne	1c65e3e9b2	cmake: Set rpath for loadable modules as well as shared libraries. This fixes a regression introduced by r285714: we weren't setting the rpath on LLVMgold.so correctly. Spotted by mark@chromium.org! Differential Revision: https://reviews.llvm.org/D27176 llvm-svn: 288076	2016-11-28 21:59:14 +00:00
Eli Friedman	2a06f6629d	[SROA] Drop lifetime.start/end intrinsics when they block promotion. Preserving lifetime markers isn't as important as allowing promotion, so just drop the lifetime markers if necessary. This also fixes an assertion failure where other parts of SROA assumed that lifetime markers never block promotion. Fixes https://llvm.org/bugs/show_bug.cgi?id=29139. Differential Revision: https://reviews.llvm.org/D24854 llvm-svn: 288074	2016-11-28 21:50:34 +00:00
Sanjay Patel	8da2f123d2	[DAG] add helper function for selectcc --> and+shift transforms; NFC llvm-svn: 288073	2016-11-28 21:47:41 +00:00
Mehdi Amini	59c07301c1	Improve error handling in YAML parsing Some scanner errors were not checked and reported by the parser. Fix PR30934. Recommit r288014 after fixing unittest. Patch by: Serge Guelton <serge.guelton@telecom-bretagne.eu> Differential Revision: https://reviews.llvm.org/D26419 llvm-svn: 288071	2016-11-28 21:38:52 +00:00
David Blaikie	04a4839cfd	[DebugInfo] Add support for DW_AT_main_subprogram on subprograms Patch by Tom Tromey! (for use with Rust) llvm-svn: 288068	2016-11-28 21:32:19 +00:00
Matthias Braun	ce011a4aed	MachineScheduler: Export function to construct "default" scheduler. This makes the createGenericSchedLive() function that constructs the default scheduler available for the public API. This should help when you want to get a scheduler and the default list of DAG mutations. This also shrinks the list of default DAG mutations: {Load\|Store}ClusterDAGMutation and MacroFusionDAGMutation are no longer added by default. Targets can easily add them if they need them. It also makes it easier for targets to add alternative/custom macrofusion or clustering mutations while staying with the default createGenericSchedLive(). It also saves the callback back and forth in TargetInstrInfo::enableClusterLoads()/enableClusterStores(). Differential Revision: https://reviews.llvm.org/D26986 llvm-svn: 288057	2016-11-28 20:11:54 +00:00
Artem Belevich	f09406906a	Revert r287637 "[wasm] hack around test failure after r287553." -cgp-freq-ratio-to-skip-merge option was removed by rollback in r288052. llvm-svn: 288055	2016-11-28 19:55:46 +00:00
Stanislav Mekhanoshin	92eac85076	[AMDGPU] Allow hoisting of comparisons out of a loop and eliminate condition copies Codegen prepare sinks comparisons close to a user is we have only one register for conditions. For AMDGPU we have many SGPRs capable to hold vector conditions. Changed BE to report we have many condition registers. That way IR LICM pass would hoist an invariant comparison out of a loop and codegen prepare will not sink it. With that done a condition is calculated in one block and used in another. Current behavior is to store workitem's condition in a VGPR using v_cndmask_b32 and then restore it with yet another v_cmp instruction from that v_cndmask's result. To mitigate the issue a propagation of source SGPR pair in place of v_cmp is implemented. Additional side effect of this is that we may consume less VGPRs at a cost of more SGPRs in case if holding of multiple conditions is needed, and that is a clear win in most cases. Differential Revision: https://reviews.llvm.org/D26114 llvm-svn: 288053	2016-11-28 18:58:49 +00:00
Joerg Sonnenberger	d647090837	Revert r287553: [CodeGenPrep] Skip merging empty case blocks It results in assertions in lib/Analysis/BlockFrequencyInfoImpl.cpp line 670 ("Expected irreducible CFG"). llvm-svn: 288052	2016-11-28 18:56:54 +00:00
Justin Lebar	4f982330de	[StructurizeCFG] Use range-based for loops. Reviewers: arsenm Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D27000 llvm-svn: 288051	2016-11-28 18:50:03 +00:00

1 2 3 4 5 ...

141261 Commits