llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

Author	SHA1	Message	Date
Fangrui Song	2e32c23e9d	[X86] Fix -Wunused-variable in -DLLVM_ENABLE_ASSERTIONS=off builds after r373174 llvm-svn: 373175	2019-09-30 02:06:23 +00:00
Craig Topper	429917acf2	[X86] Remove -x86-experimental-vector-widening-legalization command line flag This was added back to allow some performance regressions to be investigated. The main perf issue was fixed shortly after adding this back and no other major issues have been reported. So I think its safe to remove this again. llvm-svn: 373174	2019-09-29 23:32:37 +00:00
Craig Topper	52834729d8	[X86] Add custom isel logic to match VPTERNLOG from 2 logic ops. There's room from improvement here, but this is a decent starting point. There are a few minor regressions in the vector-rotate tests, where we are now forming a vpternlog from an and before we get a chance to form it for a bitselect that we were matching previously. This results in an AND and an ANDN feeding the vpternlog where previously we just had an AND after the vpternlog. I think we can probably DAG combine the AND with the bitselect to get back to similar codegen. llvm-svn: 373172	2019-09-29 18:43:08 +00:00
Amaury Sechet	f1f66201d3	Add test case peeking through vector concat when combining insert into shuffles. NFC llvm-svn: 373171	2019-09-29 17:54:03 +00:00
Aditya Kumar	909ec9e622	[LLVM-C][Ocaml] Add MergeFunctions and DCE pass MergeFunctions and DCE pass are missing from OCaml/C-api. This patch adds them. Differential Revision: https://reviews.llvm.org/D65071 Reviewers: whitequark, hiraditya, deadalnix Reviewed By: whitequark Subscribers: llvm-commits Tags: #llvm Authored by: kren1 llvm-svn: 373170	2019-09-29 16:06:22 +00:00
DeForest Richards	1d0a9f010f	[Docs] Moves article links to new pages Moves existing article links on the Programming, Subsystem, and Reference documentation pages to new locations. Also moves Github Repository and Publications links to the sidebar. llvm-svn: 373169	2019-09-29 15:31:52 +00:00
Fangrui Song	f3ea7f2495	[MC] Emit unused undefined symbol even if its binding is not set For the following two cases, we currently suppress the symbols. This patch emits them (compatible with GNU as). * `test2_a = undef`: if `undef` is otherwise unused. * `.hidden hidden`: if `hidden` is unused. This is the main point of the patch, because omitting the symbol would cause a linker semantic difference. It causes a behavior change that is not compatible with GNU as: .weakref foo1, bar1 When neither foo1 nor bar1 is used, we now emit bar1, which is arguably more consistent. Another change is that we will emit .TOC. for .TOC.@tocbase . For this directive, suppressing .TOC. can be seen as a size optimization, but we choose to drop it for simplicity and consistency. llvm-svn: 373168	2019-09-29 15:26:12 +00:00
Roman Lebedev	408398f5f4	[DivRemPairs] Don't assert that we won't ever get expanded-form rem pairs in different BB's (PR43500) If we happen to have the same div in two basic blocks, and in one of those we also happen to have the rem part, we'd match the div-rem pair, but the wrong ones. So let's drop overly-ambiguous assert. Fixes https://bugs.llvm.org/show_bug.cgi?id=43500 llvm-svn: 373167	2019-09-29 15:25:24 +00:00
Alexey Bataev	4cfaf409b9	[SLP] Fix for PR31847: Assertion failed: (isLoopInvariant(Operands[i], L) && "SCEVAddRecExpr operand is not loop-invariant!") Initially SLP vectorizer replaced all going-to-be-vectorized instructions with Undef values. It may break ScalarEvaluation and may cause a crash. Reworked SLP vectorizer so that it does not replace vectorized instructions by UndefValue anymore. Instead vectorized instructions are marked for deletion inside if BoUpSLP class and deleted upon class destruction. Reviewers: mzolotukhin, mkuper, hfinkel, RKSimon, davide, spatel Subscribers: RKSimon, Gerolf, anemet, hans, majnemer, llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D29641 llvm-svn: 373166	2019-09-29 14:18:06 +00:00
Jinsong Ji	27057936f6	[PowerPC] Fix conditions of assert in PPCAsmPrinter Summary: g++ build emits warning: llvm/lib/Target/PowerPC/PPCAsmPrinter.cpp:667:77: error: suggest parentheses around ?&&? within ?\|\|? [-Werror=parentheses] assert(MO.isGlobal() \|\| MO.isCPI() \|\| MO.isJTI() \|\| MO.isBlockAddress() && ~~~~~~~~~~~~~~~~~~~~^~ "Unexpected operand type for LWZtoc pseudo."); I believe the intension is to assert all different types, so we should add a parentheses to include all '\|\|'. Reviewers: #powerpc, sfertile, hubert.reinterpretcast, Xiangling_L Reviewed By: Xiangling_L Subscribers: wuzish, nemanjai, hiraditya, kbarton, MaskRay, shchenz, steven.zhang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68180 llvm-svn: 373164	2019-09-29 12:43:46 +00:00
David Green	2f19a7836e	[ARM] Cortex-M4 schedule additions This is an attempt to fill in some of the missing instructions from the Cortex-M4 schedule, and make it easier to do the same for other ARM cpus. - Some instructions are marked as hasNoSchedulingInfo as they are pseudos or otherwise do not require scheduling info - A lot of features have been marked not supported - Some WriteRes's have been added for cvt instructions. - Some extra instruction latencies have been added, notably by relaxing the regex for dsp instruction to catch more cases, and some fp instructions. This goes a long way to get the CompleteModel working for this CPU. It does not go far enough as to get all scheduling info for all output operands correct. Differential Revision: https://reviews.llvm.org/D67957 llvm-svn: 373163	2019-09-29 08:38:48 +00:00
DeForest Richards	6e8bb199f9	[Docs] Adds sections for Command Line and LibFuzzer articles Adds sections for Command Line and Libfuzzer articles on Programming Documentation page. llvm-svn: 373158	2019-09-29 02:16:38 +00:00
Craig Topper	9c39cbffb4	[X86] Enable isel to fold broadcast loads that have been bitcasted from FP into a vpternlog. llvm-svn: 373157	2019-09-29 01:24:33 +00:00
Craig Topper	0bed363d8a	[X86] Move bitselect matching to vpternlog into X86ISelDAGToDAG.cpp This allows us to reduce the use count on the condition node before the match. This enables load folding for that operand without relying on the peephole pass. This will be improved on for broadcast load folding in a subsequent commit. This still requires a bunch of isel patterns for vXi16/vXi8 types though. llvm-svn: 373156	2019-09-29 01:24:29 +00:00
Craig Topper	d4425268f7	[X86] Enable canonicalizeBitSelect for AVX512 since we can use VPTERNLOG now. llvm-svn: 373155	2019-09-29 01:24:22 +00:00
Craig Topper	9caaea8d34	[X86] Match (or (and A, B), (andn (A, C))) to VPTERNLOG with AVX512. This uses a similar isel pattern as we used for vpcmov with XOP. llvm-svn: 373154	2019-09-29 01:24:16 +00:00
Aditya Kumar	95be4dd95f	[NFC] Move hot cold splitting class to header file Summary: This is to facilitate unittests Reviewers: compnerd, vsk, tejohnson, sebpop, brzycki, SirishP Reviewed By: tejohnson Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68079 llvm-svn: 373151	2019-09-28 18:13:33 +00:00
Sanjay Patel	784406598d	[PowerPC] make tests immune to improved undef handling The fma mutate test will not exercise what it was intended to test once we simplify those ops immediately, but the test will still pass with the existing CHECKs, so I'm leaving it in case that still has minimal value. llvm-svn: 373149	2019-09-28 13:34:53 +00:00
Amara Emerson	d75f9d2a8e	[GlobalISel Enable memcpy inlining with optsize. We should be disabling inline for minsize, not optsize. llvm-svn: 373143	2019-09-28 07:55:42 +00:00
Anton Afanasyev	d113fba9a5	[TimeProfiler] Fix "OptModule" section and add new "Backend" sections Remove unnecessary "OptModule" section. Add "PerFunctionPasses", "PerModulePasses" and "CodeGenPasses" sections under "Backend" section. llvm-svn: 373142	2019-09-28 07:14:12 +00:00
Amara Emerson	bcfd2edd61	Add an operand to memory intrinsics to denote the "tail" marker. We need to propagate this information from the IR in order to be able to safely do tail call optimizations on the intrinsics during legalization. Assuming it's safe to do tail call opt without checking for the marker isn't safe because the mem libcall may use allocas from the caller. This adds an extra immediate operand to the end of the intrinsics and fixes the legalizer to handle it. Differential Revision: https://reviews.llvm.org/D68151 llvm-svn: 373140	2019-09-28 05:33:21 +00:00
Matt Arsenault	c982aa7a0b	AMDGPU/GlobalISel: Avoid getting MRI in every function Store it in AMDGPUInstructionSelector to avoid boilerplate in nearly every select function. llvm-svn: 373139	2019-09-28 03:41:13 +00:00
Craig Topper	666a1a40ae	[X86] Add broadcast load unfolding support for VPTESTMD/Q and VPTESTNMD/Q. llvm-svn: 373138	2019-09-28 01:56:36 +00:00
Craig Topper	e3b69f4175	[X86] Stop using UpdateNodeOperands in combineGatherScatter. Create new nodes like most other DAG combines. Creating new nodes is what we usually do. Have to explicitly check that we don't update to an existing node and having to manually manage the worklist is unusual. We can probably add a helper function to reduce the duplication of having to check if we should create a gather or scatter, but I wanted to just get the simple thing done. llvm-svn: 373137	2019-09-28 01:08:46 +00:00
Craig Topper	a379ee1929	[X86] Split combineGatherScatter into a version for generic ISD nodes and another version for X86 specific nodes. The majority of the code doesn't run on the X86 nodes today since its gated by isBeforeLegalizeOps and we don't formm X86 nodes until after that. Except for a couple special case in type legalization. But I think we would probably break those if some of the transforms fire on them. I want to remove the hardcoded operand numbers and the unusual use of UpdateNodeOperands. Being able to know which ISD opcodes are present should help with that. llvm-svn: 373136	2019-09-28 01:06:58 +00:00
Wei Mi	dd45f5f726	[SampleFDO] Create a separate flag profile-accurate-for-symsinlist to handle profile symbol list. Currently many existing users using profile-sample-accurate want to reduce code size as much as possible. Their use cases are different from the scenario profile symbol list tries to handle -- the major motivation of adding profile symbol list is to get the major memory/code size saving without introduce performance regression. So to keep the behavior of profile-sample-accurate unchanged, we think decoupling these two things and using a new flag to control the handling of profile symbol list may be better. When profile-sample-accurate and the new flag profile-accurate-for-symsinlist are both present, since profile-sample-accurate is a user assertion we let it have a higher precedence. Differential Revision: https://reviews.llvm.org/D68047 llvm-svn: 373133	2019-09-27 22:33:59 +00:00
Alexander Shaposhnikov	5d5e87b41e	[llvm-lipo] Add support for -arch Add support for -arch. Differential revision: https://reviews.llvm.org/D68116 Test plan: make check-all llvm-svn: 373132	2019-09-27 22:33:18 +00:00
Craig Topper	59afa86678	[X86] Add test case to show missed opportunity to turn (add (zext (vXi1 X)), Y) -> (sub Y, (sext (vXi1 X))) with avx512. With avx512, the vXi1 type is legal. And we can more easily sign extend them to vector registers. zext requires a sign extend and a shift. If we can easily turn the zext into a sext we should. llvm-svn: 373131	2019-09-27 22:30:24 +00:00
Roman Lebedev	13a563c646	[PatternMatch] Add m_SExtOrSelf(), m_ZExtOrSExtOrSelf() matchers + unittests m_SExtOrSelf() is for consistency. m_ZExtOrSExtOrSelf() is motivated by the D68103/r373106 : sometimes it is useful to look past any extensions of the shift amount, and m_ZExtOrSExtOrSelf() may be exactly the tool to do that. llvm-svn: 373128	2019-09-27 21:53:04 +00:00
Yi Kong	f7a791d80b	[llvm-readobj] Rename --arm-attributes to --arch-specific This is for compatibility with GNU readobj. --arm-attributes option is left as a hidden alias due to large number of tests using it. Differential Revision: https://reviews.llvm.org/D68110 llvm-svn: 373125	2019-09-27 20:38:18 +00:00
Sanjay Patel	31f480507c	[InstSimplify] generalize FP folds with undef/NaN; NFC We can reuse this logic for things like fma. llvm-svn: 373119	2019-09-27 20:09:09 +00:00
Jakub Kuderski	3bff328a07	Revert [Dominators][CodeGen] Clean up MachineDominators This reverts r373101 (git commit 72c57ec3e6b320c31274dadb888dc16772b8e7b6) llvm-svn: 373117	2019-09-27 19:33:39 +00:00
Jakub Kuderski	a411fb008a	Revert XFAIL a codegen test AArch64/tailmerging_in_mbp.ll This reverts r373103 (git commit a524e630a793e18e7d5fabc2262781f310eb0279) llvm-svn: 373116	2019-09-27 19:33:35 +00:00
Roman Lebedev	7229fdbc81	[NFC][PhaseOrdering] Add end-to-end tests for the 'two shifts by sext' problem We start with two separate sext's, but EarlyCSE runs before InstCombine, so when we get them, they are a single sext, and we just ignore that. Likewise, if we had a single sext, we don't do anything there. llvm-svn: 373115	2019-09-27 19:32:43 +00:00
DeForest Richards	e4f8869b72	[Docs] Adds new section to User Guides page Adds a section to the User Guides page for articles related to building, packaging, and distributing LLVM. Includes sub-sections for CMake, Clang, and Docker. llvm-svn: 373113	2019-09-27 19:12:00 +00:00
Sanjay Patel	58281c58fc	[InstSimplify] add tests for fma/fmuladd with undef operand; NFC llvm-svn: 373109	2019-09-27 18:38:51 +00:00
Craig Topper	db855efbed	[X86] Call SimplifyDemandedBits in combineGatherScatter any time the mask element is wider than i1, not just when AVX512 is disabled. The AVX2 intrinsics can still be used when AVX512 is enabled and those go through this path. So we should simplify them. llvm-svn: 373108	2019-09-27 18:23:55 +00:00
Craig Topper	a2a144c550	[X86] Add test case to show failure to perform SimplifyDemandedBits on mask of avx2 gather intrinsics when avx512 is enabled. llvm-svn: 373107	2019-09-27 18:23:46 +00:00
Roman Lebedev	120da12eeb	[InstCombine] Simplify shift-by-sext to shift-by-zext Summary: This is valid for any `sext` bitwidth pair: ``` Processing /tmp/opt.ll.. ---------------------------------------- %signed = sext %y %r = shl %x, %signed ret %r => %unsigned = zext %y %r = shl %x, %unsigned ret %r %signed = sext %y Done: 2016 Optimization is correct! ``` (This isn't so for funnel shifts, there it's illegal for e.g. i6->i7.) Main motivation is the C++ semantics: ``` int shl(int a, char b) { return a << b; } ``` ends as ``` %3 = sext i8 %1 to i32 %4 = shl i32 %0, %3 ``` https://godbolt.org/z/0jgqUq which is, as this shows, too pessimistic. There is another problem here - we can only do the fold if sext is one-use. But we can trivially have cases where several shifts have the same sext shift amount. This should be resolved, later. Reviewers: spatel, nikic, RKSimon Reviewed By: spatel Subscribers: efriedma, hiraditya, nlopes, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68103 llvm-svn: 373106	2019-09-27 18:12:15 +00:00
Jakub Kuderski	610e2b3c9d	XFAIL a codegen test AArch64/tailmerging_in_mbp.ll This test fails when machine dominator tree verifier is run. Needs more investigation, as this is not a new failure. llvm-svn: 373103	2019-09-27 17:41:17 +00:00
Jakub Kuderski	040d8dbfa9	[Dominators][CodeGen] Clean up MachineDominators Summary: This is a cleanup patch for MachineDominatorTree. It would be an NFC, except for replacing custom DomTree verification with the generic one. Reviewers: tstellar, tpr, nhaehnle, arsenm, NutshellySima, grosser, hliao Reviewed By: arsenm Subscribers: wdng, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D67976 llvm-svn: 373101	2019-09-27 17:25:39 +00:00
Simon Pilgrim	434994afd7	Fix MSVC "not all control paths return a value" warning. NFCI. llvm-svn: 373100	2019-09-27 16:56:07 +00:00
Simon Pilgrim	d0072a6ced	ModuleUtils - silence static analyzer dyn_cast<> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<> directly and if not assert will fire for us. llvm-svn: 373099	2019-09-27 16:55:49 +00:00
Kai Nacke	b3f72a7ae7	Change -march=systemz to triple and fix test These two test cases use -march=systemz instead of a triple. In particular, the used file format is then based on the default host triple. This leads to different behaviour on different platforms. The SystemZ implementation uses the integrated assembler for a long time now. The mature-mc-support test can be fully enabled. Differential Revision: https://reviews.llvm.org/D68129 llvm-svn: 373098	2019-09-27 16:19:15 +00:00
Simon Pilgrim	763eac1a90	FunctionImportGlobalProcessing::processGlobalForThinLTO - silence static analyzer dyn_cast<FunctionSummary> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<FunctionSummary> directly and if not assert will fire for us. llvm-svn: 373097	2019-09-27 15:49:19 +00:00
Luis Marques	bc3e25486a	[RISCV] Rename FPRs and use Register arithmetic The new names for FPRs ensure that the Register values within the same class are enumerated consecutively (the order is determined by the `LessRecordRegister` function object). Where there were tables mapping between 32- and 64-bit FPRs (and vice versa) this patch replaces them with Register arithmetic. The enumeration order between different register classes is expected to continue to be arbitrary, although it does impact the conversion from the (overloaded) asm FPR names to Register values, and therefore might require updates to the target if the sorting algorithm is changed. Static asserts were added to ensure that changes to the ordering that would impact the current implementation are detected. Differential Revision: https://reviews.llvm.org/D67423 llvm-svn: 373096	2019-09-27 15:49:10 +00:00
Simon Pilgrim	a9e2380a5d	SCCP - silence static analyzer dyn_cast<StructType> null dereference warning. NFCI. The static analyzer is warning about a potential null dereference, but we should be able to use cast<StructType> directly and if not assert will fire for us. llvm-svn: 373095	2019-09-27 15:49:10 +00:00
Dmitry Preobrazhensky	09506da3e3	[AMDGPU][MC] Corrected parsing of registers Summary of changes: refactored code for better readability and future improvements; fixed bug 41281: https://bugs.llvm.org/show_bug.cgi?id=41281 Reviewers: artem.tamazov, arsenm Differential Revision: https://reviews.llvm.org/D65224 llvm-svn: 373094	2019-09-27 15:41:31 +00:00
Djordje Todorovic	7c429fc379	[DebugInfo] Exclude memory location values as parameter entry values Abandon describing of loaded values due to safety concerns. Loaded values are described as derefed memory location at caller point. At callee we can unintentionally change that memory location which would lead to different entry being printed value before and after the memory location clobbering. This problem is described in llvm.org/PR43343. Patch by Nikola Prica Differential Revision: https://reviews.llvm.org/D67717 llvm-svn: 373089	2019-09-27 13:52:43 +00:00
Clement Courbet	8a234db13c	[llvm-exegesis] Fix r373083: Module -> Mod. SnippetRepetitorTest.cpp:66:27: error: declaration of ‘std::unique_ptr<llvm::Module> llvm::exegesis::{anonymous}::X86SnippetRepetitorTest::Module’ [-fpermissive] std::unique_ptr<Module> Module; llvm-svn: 373087	2019-09-27 13:21:37 +00:00

1 2 3 4 5 ...

185544 Commits