llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	7b9017da12	[X86] combineUIntToFP - Fix UINT_TO_FP(vXi1) comment (PR39078) llvm-svn: 343026	2018-09-25 20:52:08 +00:00
Lang Hames	bd45c5f4db	Remove 'orc' namespace from MSVCErrorWorkarounds.h, fix some typos that were breaking windows builds. The 'orc' namespace was accidentally left in when the workarounds were moved out of orc in r343011. llvm-svn: 343025	2018-09-25 20:48:57 +00:00
Lang Hames	5324e0c217	Fix missing includes and a use of the MSVC promise/future workaround that were left out of r343011/r343012. llvm-svn: 343022	2018-09-25 20:16:06 +00:00
Teresa Johnson	44ebb78e5e	[ThinLTO] Efficiency fix for writing type id records in per-module indexes Summary: In D49565/r337503, the type id record writing was fixed so that only referenced type ids were emitted into each per-module index for ThinLTO distributed builds. However, this still left an efficiency issue: each per-module index checked all type ids for membership in the referenced set, yielding O(M*N) performance (M indexes and N type ids). Change the TypeIdMap in the summary to be indexed by GUID, to facilitate correlating with type identifier GUIDs referenced in the function summary TypeIdInfo structures. This allowed simplifying other places where a map from type id GUID to type id map entry was previously being used to aid this correlation. Also fix AsmWriter code to handle the rare case of type id GUID collision. For a large internal application, this reduced the thin link time by almost 15%. Reviewers: pcc, vitalybuka Subscribers: mehdi_amini, inglorion, steven_wu, dexonsmith, llvm-commits Differential Revision: https://reviews.llvm.org/D51330 llvm-svn: 343021	2018-09-25 20:14:40 +00:00
Craig Topper	2008068a9a	[MC] Return a std::string instead of taking it as an out parameter. Make two parser methods into static functions at file scope. NFC llvm-svn: 343020	2018-09-25 20:13:55 +00:00
Heejin Ahn	ab02666df3	Unify landing pad information adding routines (NFC) Summary: We have `llvm::addLandingPadInfo` and `MachineFunction::addLandingPad`, both of which add landing pad information to populate `LandingPadInfo` but are called from different locations, which was confusing. This patch unifies them with one `MachineFunction::addLandingPad` function, which now has functionalities of both functions. Reviewers: rnk Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52428 llvm-svn: 343018	2018-09-25 19:56:44 +00:00
Lang Hames	276d90bd36	[ORC] Reapply r342939 with a fix for MSVC's promise/future restrictions. llvm-svn: 343012	2018-09-25 19:48:46 +00:00
Lang Hames	d1f4173f73	Move MSVC workarounds for future<Error>/future<Expected<T>> out of ORC and into a header in support. MSVC's std::future implementation requires types to be default constructible, but Error and Expected are not. This issue came up once before in ORC's RPCUtils.h header and was worked around there but came up again in r342939, so I am moving the workaround to Support to make it available to other clients. llvm-svn: 343011	2018-09-25 19:48:44 +00:00
Craig Topper	4a4034fc60	[MC] Fix bad indentation and 80 column violations. Use StringRef::front instead of dereferencing StringRef::begin. NFC llvm-svn: 343010	2018-09-25 19:37:35 +00:00
Sanjay Patel	50d6ec057c	[x86] avoid 256-bit andnp that requires insert/extract with AVX1 (PR37749) This is the final (I hope!) problem pattern mentioned in PR37749: https://bugs.llvm.org/show_bug.cgi?id=37749 We are trying to avoid an AVX1 sinkhole caused by having 256-bit bitwise logic ops but no other 256-bit integer ops. We've already solved the simple logic ops, but 'andn' is an x86 special. I looked at alternative solutions like extending the generic DAG combine or trying to wait until the ANDNP node is created, but those are bigger patches that can over-reach. I.e., splitting to 128-bit does not look like a win in most cases with >1 256-bit op. The pattern matching is cluttered with bitcasts because of our i64 element canonicalization. For the affected test, we have this vector-type-legalized sequence: t29: v8i32 = concat_vectors t27, t28 t30: v4i64 = bitcast t29 t18: v8i32 = BUILD_VECTOR Constant:i32<-1>, Constant:i32<-1>, ... t31: v4i64 = bitcast t18 t32: v4i64 = xor t30, t31 t9: v8i32 = BUILD_VECTOR Constant:i32<255>, Constant:i32<255>, ... t34: v4i64 = bitcast t9 t35: v4i64 = and t32, t34 t36: v8i32 = bitcast t35 t37: v4i32 = extract_subvector t36, Constant:i64<0> t38: v4i32 = extract_subvector t36, Constant:i64<4> Differential Revision: https://reviews.llvm.org/D52318 llvm-svn: 343008	2018-09-25 19:09:34 +00:00
Yury Delendik	da8773b8d1	[WebAssembly] Move/clone DBG_VALUE during WebAssemblyRegStackify pass Summary: The MoveForSingleUse or MoveAndTeeForMultiUse functions move wasm instructions, however the associated DBG_VALUE instructions stay unchanged. This patch moves or clones them as well. Reviewers: dschuff Reviewed By: dschuff Subscribers: mattd, MatzeB, dschuff, sbc100, jgravelle-google, aheejin, sunfish, llvm-commits, aardappel Tags: #debug-info Differential Revision: https://reviews.llvm.org/D49034 llvm-svn: 343007	2018-09-25 18:59:34 +00:00
Jessica Paquette	2539089260	Revert "[ConstHoist] Do not rebase single (or few) dependent constant" This caused a couple test failures on a bot: CodeGen/X86/constant-hoisting-bfi.ll Transforms/ConstantHoisting/X86/ehpad.ll Example: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/53575/ llvm-svn: 343005	2018-09-25 18:41:40 +00:00
Daniil Fukalov	991f9ea5b7	[RegAllocGreedy] avoid using physreg candidates that cannot be correctly spilled For the AMDGPU target, if an MBB contains an exec mask restore preamble, SplitEditor may get into a state where it cannot insert a spill instruction. E.g. for a MIR bb.100: %1 = S_OR_SAVEEXEC_B64 %2, implicit-def $exec, implicit-def $scc, implicit $exec and if the regalloc tries to allocate a virtreg to the physreg already assigned to virtreg %1, it should insert a spill instruction before the S_OR_SAVEEXEC_B64 instruction. But this is not possible, since it can generate incorrect code in terms of the exec mask. The change makes regalloc ignore such physreg candidates. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D52052 llvm-svn: 343004	2018-09-25 18:37:38 +00:00
Craig Topper	3bf4ee9d1e	[MC] Replace NULL constant in code with nullptr. llvm-svn: 343003	2018-09-25 18:33:00 +00:00
Daniel Sanders	091f90c030	[globalisel][tblgen] Table optimization should consider the C++ code in C++ predicates This fixes PR39045 llvm-svn: 342997	2018-09-25 17:59:02 +00:00
Zhaoshi Zheng	bf32af5c9a	[ConstHoist] Do not rebase single (or few) dependent constant If an instance (InsertionPoint or IP) of Base constant A has only one or few rebased constants depending on it, do NOT rebase. One extra ADD instruction is required to materialize each rebased constant, assuming A and the rebased have the same materialization cost. Differential Revision: https://reviews.llvm.org/D52243 llvm-svn: 342994	2018-09-25 17:45:37 +00:00
Justin Bogner	245caff12b	Revert "[DebugInfo] Do not generate address info for removed debug labels." The added test is failing on macOS: http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/53550/ This reverts r342943. llvm-svn: 342993	2018-09-25 17:29:30 +00:00
Craig Topper	e41e84b57b	[X86] Add AVX512 support to combineVectorSizedSetCCEquality. Reviewers: spatel, RKSimon Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52424 llvm-svn: 342989	2018-09-25 16:27:12 +00:00
Sanjay Patel	55fffb978e	[InstCombine] narrow binops on concatenated vectors (PR33026) The motivating case from: https://bugs.llvm.org/show_bug.cgi?id=33026 ...has no shuffles now. This kind of pattern may occur during vectorization when targets have lumpy ISAs like SSE/AVX. llvm-svn: 342988	2018-09-25 15:57:37 +00:00
Nirav Dave	36a936ebf7	[ARM] Share predecessor bookkeeping in CombineBaseUpdate. NFCI. llvm-svn: 342987	2018-09-25 15:30:47 +00:00
Nirav Dave	c26720b88c	[AArch64] Share search bookkeeping in combines. NFCI. Share predecessor search bookkeeping in both performPostLD1Combine and performNEONPostLDSTCombine. This should be approximately a 4x and 2x performance improvement. llvm-svn: 342986	2018-09-25 15:30:22 +00:00
Nirav Dave	9c9fb30cc3	[LegalizeDAG] Prune Predecessor check in ExpandExtractFromVectorThroughStack. NFCI. llvm-svn: 342985	2018-09-25 15:29:57 +00:00
Nirav Dave	b68f8ddcb9	[DAGCombine] Improve Predecessor check in SimplifySelectOps. NFCI. Reuse search space bookkeeping across multiple predecessor checks to avoid redundancy. This should cut search cost by ~4x. llvm-svn: 342984	2018-09-25 15:29:30 +00:00
Nirav Dave	049e71fd4e	[DAGCombine] Share predecessor bookkeeping in CombineToPostIndexedLoadStore. NFCI. llvm-svn: 342983	2018-09-25 15:29:04 +00:00
Guillaume Chatelet	d867442cfe	[llvm-exegesis] Serializes registers initial values. Summary: Adds the registers initial values to the YAML output of llvm-exegesis. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52460 llvm-svn: 342982	2018-09-25 15:15:54 +00:00
Guillaume Chatelet	ad6af5f3a5	[llvm-exegesis] Fix missing document separator in YAML output. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52496 llvm-svn: 342981	2018-09-25 14:48:24 +00:00
Nirav Dave	ba65b2d43a	[DAGCombine] Don't fold dependent loads across SELECT_CC. DAGCombine will try to fold two loads that feed a SELECT or SELECT_CC after the select, resulting in a select of an address and a single load after. If either of the loads depend on the other, this is not legal as it could introduce cycles. However, it only checked this if the opcode was a SELECT, and not for a SELECT_CC. Unfortunately, the only reproducer I have for this is for our downstream target. I've tried getting it to trigger on an upstream one but haven't been successful. Patch thanks to Bevin Hansson. llvm-svn: 342980	2018-09-25 14:43:05 +00:00
Clement Courbet	4d1bc5dc43	[llvm-exegesis] Add lit tests (v2). Summary: This revisits rL342953 by adding detection of host support. Reviewers: gchatelet, lebedev.ri, alexshap Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52464 llvm-svn: 342975	2018-09-25 13:59:35 +00:00
Guillaume Chatelet	b798312f4c	[llvm-exegesis] Fix broken test. llvm-svn: 342971	2018-09-25 13:18:10 +00:00
Simon Pilgrim	4a581314fe	Revert rL342916: [X86] Remove shift/rotate by CL memory (RMW) overrides As suggested by Craig Topper - I'm going to look at cleaning up the RMW sequences instead. The uops are slightly different to the register variant, so requires a +1uop tweak llvm-svn: 342969	2018-09-25 13:01:26 +00:00
Guillaume Chatelet	d1656d7dab	[llvm-exegesis][NFC] Rewrite of the YAML serialization. Summary: This is a NFC in preparation of exporting the initial registers as part of the YAML dump Reviewers: courbet Reviewed By: courbet Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52427 llvm-svn: 342967	2018-09-25 12:18:08 +00:00
Hans Wennborg	22f1fa8070	Revert r342637 "[ADT] Try again to use the same version of llvm::Optional on all compilers" and also revert follow-ups r342643 and r342723. This caused Clang to be miscompiled by GCC 4.8.4 (Ubuntu 14.04's default compiler) and break the Chromium build (see https://crbug.com/888061). llvm-svn: 342966	2018-09-25 12:08:56 +00:00
David Green	685f324068	[LoopUnroll] Add check to Latch's terminator in UnrollRuntimeLoopRemainder In this patch, I'm adding an extra check to the Latch's terminator in llvm::UnrollRuntimeLoopRemainder, similar to how it is already done in the llvm::UnrollLoop. The compiler would crash if this function is called with a malformed loop. Patch by Rodrigo Caetano Rocha! Differential Revision: https://reviews.llvm.org/D51486 llvm-svn: 342958	2018-09-25 10:08:47 +00:00
Sameer Sahasrabuddhe	4d0a123031	[AMDGPU] restore r342722 which was reverted with r342743 [AMDGPU] lower-switch in preISel as a workaround for legacy DA Summary: The default target of the switch instruction may sometimes be an "unreachable" block, when it is guaranteed that one of the cases is always taken. The dominator tree concludes that such a switch instruction does not have an immediate post dominator. This confuses divergence analysis, which is unable to propagate sync dependence to the targets of the switch instruction. As a workaround, the AMDGPU target now invokes lower-switch as a preISel pass. LowerSwitch is designed to handle the unreachable default target correctly, allowing the divergence analysis to locate the correct immediate dominator of the now-lowered switch. llvm-svn: 342956	2018-09-25 09:39:21 +00:00
Clement Courbet	4ed360691f	Revert rL342953 "[llvm-exegesis] Add lit tests." We also need to make sure that we're on the right subtarget. llvm-svn: 342955	2018-09-25 09:36:44 +00:00
Clement Courbet	4916d38dfd	[llvm-exegesis] Add lit tests. Summary: Right now we only have unit tests. This will allow testing the whole tool. Even though we can't really check actual values, this will avoid regressions such as PR39055. Reviewers: gchatelet, alexshap Subscribers: mgorny, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52407 llvm-svn: 342953	2018-09-25 09:27:43 +00:00
Heejin Ahn	b7573e319d	[llvm-exegesis] Add MCParser to LLVM_LINK_COMPONENTS We need this to make builds with `-DBUILD_SHARED_LIBS=ON` work. llvm-svn: 342952	2018-09-25 08:25:29 +00:00
Fangrui Song	cd25254f64	Add macro LLVM_ATTRIBUTE_REINITIALIZES Summary: This marks legitimate use-after-move (e.g. `Found.clear()` in rC342925) which would otherwise be caught by bugprone-use-after-move. bugprone-use-after-move recognizes this attribute after rCTE339571. Reviewers: aaron.ballman, rsmith, mboehme, hokein Reviewed By: mboehme Subscribers: kristina, llvm-commits Differential Revision: https://reviews.llvm.org/D52451 llvm-svn: 342949	2018-09-25 08:06:32 +00:00
Clement Courbet	519dab966b	[llvm-exegesis] Fix doc in r342947. llvm-exegesis.rst was using invalid indentation for bullet points. llvm-svn: 342948	2018-09-25 07:48:38 +00:00
Clement Courbet	0301f9855d	[llvm-exegesis] Allow benchmarking arbitrary code snippets. Summary: This is a step towards fixing PR38048. Note that right now the measurements are given per instruction. We'll need to give measurements per code snippet and update the analysis (PR38731). Reviewers: gchatelet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D52041 llvm-svn: 342947	2018-09-25 07:31:44 +00:00
Stefan Maksimovic	6defe0ebcf	[mips] Correct MUL pattern for mips64 Guard existing pattern with a predicate, introduce a new one for revision 6. Differential Revision: https://reviews.llvm.org/D51684 llvm-svn: 342946	2018-09-25 06:27:49 +00:00
Fangrui Song	556d8103ca	Use unique_ptr to hold AsmInfo,MRI,MII,STI Reviewers: pcc, dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52389 llvm-svn: 342945	2018-09-25 06:19:31 +00:00
Mikael Holmen	49af72e55b	Use TRI->regsOverlap() in MachineBasicBlock::computeRegisterLiveness Summary: For the loop that used MCRegAliasIterator this should be NFC. For the loop that previously used MCSubRegIterator we should now detect more cases where the register is actually live out that we previously missed. Reviewers: MatzeB, arsenm Reviewed By: MatzeB Subscribers: wdng, llvm-commits Differential Revision: https://reviews.llvm.org/D52410 llvm-svn: 342944	2018-09-25 06:10:04 +00:00
Hsiangkai Wang	e307d30c46	[DebugInfo] Do not generate address info for removed debug labels. In some scenarios, LLVM will remove llvm.dbg.labels in IR. For example, when the labels are in unreachable blocks, these labels will not be generated in LLVM IR. In that case, these debug labels will have address zero as their address. It is not a legal address for a debugger to set breakpoints or query sources. So, the patch inhibits the address info (DW_AT_low_pc) of removed labels. Differential Revision: https://reviews.llvm.org/D51908 llvm-svn: 342943	2018-09-25 06:09:50 +00:00
Justin Bogner	c175a1ad14	[MachineCopyPropagation] Reimplement CopyTracker in terms of register units Change the copy tracker to keep a single map of register units instead of 3 maps of registers. This gives a very significant compile time performance improvement to the pass. I measured a 30-40% decrease in time spent in MCP on x86 and AArch64 and much more significant improvements on out of tree targets with more registers. Differential Revision: https://reviews.llvm.org/D52374 llvm-svn: 342942	2018-09-25 05:16:44 +00:00
Lang Hames	67265a68fd	Revert "[ORC] Switch to asynchronous resolution in JITSymbolResolver." This reverts commit r342939. MSVC's promise/future implementation does not like types that are not default constructible. Reverting while I figure out a solution. llvm-svn: 342941	2018-09-25 04:54:03 +00:00
Justin Bogner	48d4cf5b0d	[MachineCopyPropagation] Rework how we manage RegMask clobbers Instead of updating the CopyTracker's maps each time we come across a RegMask, defer checking for this kind of interference until we're actually trying to propagate a copy. This avoids the need to repeatedly iterate over maps in the cases where we don't end up doing any work. This is a slight compile time improvement for MachineCopyPropagation as is, but it also enables a much bigger improvement that I'll follow up with soon. Differential Revision: https://reviews.llvm.org/D52370 llvm-svn: 342940	2018-09-25 04:45:25 +00:00
Lang Hames	c1dc6f5684	[ORC] Switch to asynchronous resolution in JITSymbolResolver. Asynchronous resolution (where the caller receives a callback once the requested set of symbols are resolved) is a core part of the new concurrent ORC APIs. This change extends the asynchronous resolution model down to RuntimeDyld, which is necessary to prevent deadlocks when compiling/linking on a fixed number of threads: If RuntimeDyld's linking process were a blocking operation, then any complete K-graph in a program will require at least K threads to link in the worst case, as each thread would block waiting for all the others to complete. Using callbacks instead allows the work to be passed between dependent threads until it is complete. For backwards compatibility, all existing RuntimeDyld functions will continue to operate in blocking mode as before. This change will enable the introduction of a new async finalization process in a subsequent patch to enable asynchronous JIT linking. llvm-svn: 342939	2018-09-25 04:43:38 +00:00
Thomas Lively	3503ba6394	[WebAssembly] SIMD sqrt Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52387 llvm-svn: 342937	2018-09-25 03:39:28 +00:00
Stanislav Mekhanoshin	a3275cd945	[AMDGPU] Remove useless check from test. NFC. The check for assignment of zero is practically useless while the assignment moves around with different scheduling. llvm-svn: 342935	2018-09-25 01:24:54 +00:00
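
The ThinLTO entry above (llvm-svn: 343021) describes replacing a check of every type id against the referenced set with a table indexed by GUID, while still handling GUID collisions. The following is a minimal sketch of that idea only, not the code from the commit: `GUID`, `TypeIdTable`, and `referencedTypeIds` are hypothetical names, and plain strings stand in for the real summary records.

```cpp
#include <cstdint>
#include <map>
#include <set>
#include <string>
#include <vector>

// Hypothetical stand-ins for illustration; these are not LLVM's types.
using GUID = std::uint64_t;

// GUID -> type id name. A multimap is used so that two type ids whose
// GUIDs collide can both be kept under the same key.
using TypeIdTable = std::multimap<GUID, std::string>;

// Collect the type ids referenced by one module by looking up each
// referenced GUID directly, instead of testing every entry in the table
// for membership in the referenced set.
std::vector<std::string> referencedTypeIds(const TypeIdTable &Table,
                                           const std::set<GUID> &Referenced) {
  std::vector<std::string> Out;
  for (GUID G : Referenced) {
    // equal_range yields every entry stored under G, including collisions.
    auto Range = Table.equal_range(G);
    for (auto It = Range.first; It != Range.second; ++It)
      Out.push_back(It->second);
  }
  return Out;
}
```

With this layout, the work per module scales with the number of GUIDs that module references (times a logarithmic lookup) rather than with the total number of type ids, which is the shape of the improvement the commit message reports.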


169586 Commits