llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Sanjay Patel	be1f63bc5d	[DAGCombiner] clean up in mergeConsecutiveStores(); NFC	2020-07-08 14:48:05 -04:00
Sanjay Patel	dd6322ea43	[DAGCombiner] fix code comment and improve readability; NFC	2020-07-08 14:48:05 -04:00
Jay Foad	29d5e085f3	SILoadStoreOptimizer: add support for GFX10 image instructions GFX10 image instructions use one or more address operands starting at vaddr0, instead of a single vaddr operand, to allow for NSA forms. Differential Revision: https://reviews.llvm.org/D81675	2020-07-08 19:15:46 +01:00
Jay Foad	54ba85c6ed	[AMDGPU] Fix and simplify AMDGPULegalizerInfo::legalizeUDIV_UREM32Impl Use the algorithm from AMDGPUCodeGenPrepare::expandDivRem32. Differential Revision: https://reviews.llvm.org/D83383	2020-07-08 19:14:49 +01:00
Jay Foad	875c9d1f56	[AMDGPU] Fix and simplify AMDGPUTargetLowering::LowerUDIVREM Use the algorithm from AMDGPUCodeGenPrepare::expandDivRem32. Differential Revision: https://reviews.llvm.org/D83382	2020-07-08 19:14:49 +01:00
Jay Foad	8b9c9b1aec	[AMDGPU] Fix and simplify AMDGPUCodeGenPrepare::expandDivRem32 Fix the division/remainder algorithm by adding a second quotient refinement step, which is required in some cases like 0xFFFFFFFFu / 0x11111111u (https://bugs.llvm.org/show_bug.cgi?id=46212). Also document, rewrite and simplify it by ensuring that we always have a lower bound on inv(y), which simplifies the UNR step and the quotient refinement steps. Differential Revision: https://reviews.llvm.org/D83381	2020-07-08 19:14:48 +01:00
Christopher Tetreault	486978ecc4	[SVE] Remove calls to VectorType::getNumElements from Scalar Reviewers: efriedma, fhahn, reames, kmclaughlin, sdesmalen Reviewed By: sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, dantrushin, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82243	2020-07-08 11:08:20 -07:00
Fangrui Song	6d85737ae7	[test] Add REQUIRES: x86-registered-target to tools/obj2yaml/COFF/bss.s	2020-07-08 10:53:30 -07:00
Fangrui Song	3be3eb48cd	[Support] Define llvm::parallel::strategy for -DLLVM_ENABLE_THREADS=off builds after D76885	2020-07-08 10:51:20 -07:00
Fangrui Song	0bd7e34508	[test] Run llvm/test/*/.yaml & don't run llvm/test/*/.cxx (not exist) This patch extends D58439 (`llvm/test/{yaml2obj,obj2yaml}/*/.yaml`) and runs all `llvm/test/*/.yaml` Many directories have configured `.yaml` (see the deleted lit.local.cfg files). Yet still some don't configure .yaml and have caused stale tests: * 8c5825befb7bbb2e76f7eccedc6d3bf26e9b2a6a test/llvm-readobj * bdc3134e237737dd46b51cd1ecd41ecbbe9f921a test/ExecutionEngine Just hoist .yaml to `llvm/test/lit.cfg.py`. Also delete .cxx which is not used. The number of tests running on my machine increases from 38304 to 38309. The list of new tests: ``` ExecutionEngine/RuntimeDyld/X86/ELF_x86-64_none.yaml Object/archive-error-tmp.txt tools/llvm-ar/coff-weak.yaml tools/llvm-readobj/ELF/verneed-flags.yaml tools/obj2yaml/COFF/bss.s ``` Reviewed By: grimar, jhenderson, rupprecht Differential Revision: https://reviews.llvm.org/D83350	2020-07-08 10:22:49 -07:00
Sanjay Patel	3e51e7ee3e	[DAGCombiner] fix function-name formatting; NFC	2020-07-08 12:49:59 -04:00
Sanjay Patel	f709c202f9	[DAGCombiner] add enum for store source value; NFC This removes existing code duplication and allows us to assert that we are handling the expected cases. We have a list of outstanding bugs that could benefit by handling truncated source values, so that's a possible addition going forward.	2020-07-08 12:49:59 -04:00
Simon Pilgrim	eadfc75a6c	[X86][SSE] Pull out PACK(SHUFFLE(),SHUFFLE()) folds into its own function. NFC. Future patches will extend this so declutter combineVectorPack before we start.	2020-07-08 17:42:42 +01:00
Simon Pilgrim	e72a1824bc	[X86] Fix copy+paste typo in combineVectorPack assert message. NFC.	2020-07-08 17:42:42 +01:00
Arthur Eubanks	c51ed34bad	[NewPM] Add PredicateInfoPrinterPass to PassRegistry.def Fixes tests under NPM in Transforms/Util/PredicateInfo.	2020-07-08 09:32:46 -07:00
Wei Mi	1f3e158bbf	[SampleFDO] Enable sample-profile-top-down-load and sample-profile-merge-inlinee by default. sample-profile-top-down-load is an internal option which can enable top-down order of inlining and profile annotation in sample profile load pass. It was found to be beneficial for better profile annotation. Recently we found it could also solve some build time issue. Suppose function A has many callsites in function B. In the last release binary where sample profile was collected, the outline copy of A is large because there are many other functions inlined into A. However although all the callsites calling A in B are inlined, but every inlined body is small (A was inlined into B before other functions are inlined into A), there is no build time issue in last release. In an optimized build using the sample profile collected from last release, without top-down inlining, we saw a case that A got very large because of inlining, and then multiple callsites of A got inlined into B, and that led to a huge B which caused significant build time issue besides profile annotation issue. To solve that problem, the patch enables the flag sample-profile-top-down-load by default. sample-profile-top-down-load can have better performance when it is enabled together with sample-profile-merge-inlinee so in this patch we also enable sample-profile-merge-inlinee by default. Differential Revision: https://reviews.llvm.org/D82919	2020-07-08 09:23:18 -07:00
Ulrich Weigand	e075c2a8b5	[SystemZ] Allow specifying integer registers as part of the address calculation Revision e1de2773a534957305d7a559c6d88c4b5ac354e2 provided support for accepting integer registers in inline asm i.e. __asm("lhi %r0, 5") -> lhi %r0, 5 __asm("lhi 0, 5") -> lhi 0,5 This patch aims to extend this support to instructions which compute addresses as well. (i.e instructions of type BDMem and BD[X\|R\|V\|L]Mem) Author: anirudhp Differential Revision: https://reviews.llvm.org/D83251	2020-07-08 18:20:24 +02:00
Nicolai Hähnle	1bea5e0ab7	DomTree: remove explicit use of DomTreeNodeBase::iterator Summary: Almost all uses of these iterators, including implicit ones, really only need the const variant (as it should be). The only exception is in NewGVN, which changes the order of dominator tree child nodes. Change-Id: I4b5bd71e32d71b0c67b03d4927d93fe9413726d4 Reviewers: arsenm, RKSimon, mehdi_amini, courbet, rriddle, aartbik Subscribers: wdng, Prazek, hiraditya, kuhar, rogfer01, rriddle, jpienaar, shauheen, antiagainst, nicolasvasilache, arpith-jacob, mgester, lucyrfox, aartbik, liufengdb, stephenneuendorffer, Joonsoo, grosul1, vkmr, Kayjukh, jurahul, msifontes, cfe-commits, llvm-commits Tags: #clang, #mlir, #llvm Differential Revision: https://reviews.llvm.org/D83087	2020-07-08 18:18:49 +02:00
serge-sans-paille	ab87c3ee5f	Revert "Double check that passes correctly set their Modified status" This reverts commit 37afd99c768b29c7df7c5f2eb645362fb61f9915.	2020-07-08 18:14:40 +02:00
Sjoerd Meijer	24b093524a	[matrix] Add some more Verifier negative tests. NFC.	2020-07-08 17:11:53 +01:00
Arthur Eubanks	e8d13f6c1b	[NewPM][opt] Translate "-O#" to NPM's "default<O#>" Fixes 52 check-llvm tests under NPM. Reviewed By: hans Differential Revision: https://reviews.llvm.org/D83367	2020-07-08 09:01:20 -07:00
Arthur Eubanks	bcce08feb7	[NewPM][opt] Share -disable-loop-unrolling between pass managers There's no reason to introduce a new option for the NPM. The various PGO options are shared in this manner. Reviewed By: echristo Differential Revision: https://reviews.llvm.org/D83368	2020-07-08 08:50:56 -07:00
Evgeny Leviant	dc367e49cc	[MIR] Speedup parsing of function with large number of basic blocks Patch eliminates string length calculation when lexing a token. Speedup can be up to 1000x. Differential revision: https://reviews.llvm.org/D83389	2020-07-08 18:50:00 +03:00
Arthur Eubanks	9f384ed588	[Preallocated] Add @llvm.call.preallocated.teardown This cleans up the stack allocated by a @llvm.call.preallocated.setup. Should either call the teardown or the preallocated call to clean up the stack. Calling both is UB. Add LangRef. Add verifier check that the token argument is a @llvm.call.preallocated.setup. Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D83354	2020-07-08 08:48:44 -07:00
Paul Walker	4882063b26	[SelectionDAG] Fix incorrect offset when expanding CONCAT_VECTORS. ExpandVectorBuildThroughStack is also used for CONCAT_VECTORS. However, when calculating the offsets for each of the operands we incorrectly use the element size rather than actual size and thus the stores overlap. Differential Revision: https://reviews.llvm.org/D83303	2020-07-08 15:39:25 +00:00
serge-sans-paille	bd5694a496	Double check that passes correctly set their Modified status The approach is simple: if a pass reports that it's not modifying a Function/Module, compute a loose hash of that Function/Module and compare it with the original one. If we report no change but there's a hash change, then we have an error. This approach misses a lot of change but it's not super intrusive and can detect most of the simple mistakes. Differential Revision: https://reviews.llvm.org/D80916	2020-07-08 17:36:13 +02:00
sstefan1	7cc76647fc	[OpenMPIRBuilder][Fix] Move llvm::omp::types to OpenMPIRBuilder. Summary: D82193 exposed a problem with global type definitions in `OMPConstants.h`. This causes a race when running in thinLTO mode. Types now live inside of OpenMPIRBuilder to prevent this from happening. Reviewers: jdoerfert Subscribers: yaxunl, hiraditya, guansong, dexonsmith, aaron.ballman, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D83176	2020-07-08 17:23:55 +02:00
Stanislav Mekhanoshin	6a04f24d67	SLP: honor requested max vector size merging PHIs At the moment this place does not check maximum size set by TTI and just creates a maximum possible vectors. Differential Revision: https://reviews.llvm.org/D82227	2020-07-08 08:06:15 -07:00
Ties Stuij	f246100fbd	[CodeGen] Don't combine extract + concat vectors with non-legal types Summary: The following combine currently breaks in the DAGCombiner: ``` extract_vector_elt (concat_vectors v4i16:a, v4i16:b), x -> extract_vector_elt a, x ``` This happens because after we have combined these nodes we have inserted nodes that use individual instances of the vector element type. In the above example i16. However this isn't a legal type on all backends, and when the combining pass calls the legalizer it breaks as it expects types to already be legal. The type legalizer has already been run, and running it again would make a mess of the nodes. In the example code at least, the generated code is still efficient after the change. Reviewers: miyuki, arsenm, dmgreen, lebedev.ri Reviewed By: miyuki, lebedev.ri Subscribers: lebedev.ri, wdng, hiraditya, steven.zhang, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83231	2020-07-08 15:29:57 +01:00
Alexey Bataev	ec602ab5d2	[NVPTX]Add a test for debug info for packed bitfields, NFC.	2020-07-08 09:10:42 -04:00
Sanjay Patel	fdd6b46286	[x86] improve codegen for non-splat bit-masked vector compare and select (PR46531) vselect ((X & Pow2C) == 0), LHS, RHS --> vselect ((shl X, C') < 0), RHS, LHS Follow-up to D83073 - the non-splat mask cases where we actually see an improvement are quite limited from what I can tell. AVX1 needs multiply and blend capabilities and AVX2 needs vector shift and blend capabilities. The intersection of those 2 constraints is only vectors with 32-bit or 64-bit elements. XOP is/was better. Differential Revision: https://reviews.llvm.org/D83181	2020-07-08 08:20:49 -04:00
Michael Forney	b4e49c027c	[cmake] Use CMAKE_GENERATOR to determine if Ninja is used The name of the make program does not necessarily match "ninja", especially if an alternative implementation like samurai is used. Using CMAKE_GENERATOR is a more robust detection method, and is already used elsewhere in this file. Differential revision: https://reviews.llvm.org/D77091	2020-07-08 13:04:13 +02:00
Simon Pilgrim	9c3fe8e8e8	[X86][AVX] SimplifyDemandedVectorEltsForTargetShuffle - ensure mask is same size as constant size Fixes test regression reported on D81791	2020-07-08 11:47:59 +01:00
Alex Richardson	735c4b39a5	Add missing REQUIRES: x86-registered-target This should fix build bot failures after a80afc032859ebe65af283f76b38a0f5921b683f	2020-07-08 11:39:29 +01:00
Petar Avramovic	98aed41242	[GlobalISel][InlineAsm] Fix matching input constraints to mem operand Mark matching input constraint to mem operand as not supported. Differential Revision: https://reviews.llvm.org/D83235	2020-07-08 12:32:17 +02:00
Simon Pilgrim	98cae15779	[X86][AVX] Add SimplifyDemandedVectorEltsForTargetShuffle test for v32i8->v16i8 PSHUFB On SKX targets we end up loading a v16i8 PSHUFB mask from a v32i8 constant and scaling incorrectly indexes the demanded elts mask - we're missing a check that the constant pool is the same size as the loaded mask. Test case from D81791 post-commit review.	2020-07-08 11:26:33 +01:00
Oliver Stannard	d3a77878d6	[Support] Fix signed/unsigned comparison warning	2020-07-08 11:26:10 +01:00
Alex Richardson	e7f8cb8118	[UpdateTestChecks] Add UTC_ARGS support for update_{llc,cc}_test_checks.py https://reviews.llvm.org/D69701 added support for on-the-fly argument changes for update scripts. I recently wanted to keep some manual check lines in a test generated by update_cc_test_checks.py in our CHERI fork, so this commit adds support for UTC_ARGS in update_cc_test_checks.py. And since I was refactoring the code to be in common.py, I also added it for update_llc_test_checks.py. Reviewed By: jdoerfert, MaskRay Differential Revision: https://reviews.llvm.org/D78478	2020-07-08 11:00:10 +01:00
Alex Richardson	199e49d00e	[UpdateTestChecks] Move more update_test_checks.py logic to common.py I intend to reuse this to add UTC_ARGS support for update_llc_test_checks.py and update_cc_test_checks.py in D78478. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D78618	2020-07-08 10:59:28 +01:00
Paul Walker	f7143dfdb4	[SVE] Custom ISel for fixed length extract/insert_subvector. We use extact_subvector and insert_subvector to "cast" between fixed length and scalable vectors. This patch adds custom c++ based ISel for the following cases: fixed_vector = ISD::EXTRACT_SUBVECTOR scalable_vector, 0 scalable_vector = ISD::INSERT_SUBVECTOR undef(scalable_vector), fixed_vector, 0 Which result in either EXTRACT_SUBREG/INSERT_SUBREG for NEON sized vectors or COPY_TO_REGCLASS otherwise. Differential Revision: https://reviews.llvm.org/D82871	2020-07-08 09:49:28 +00:00
Jeremy Morse	5ea70edf9e	[DWARF] Add cuttoff guarding quadratic validThroughout behaviour Occasionally we see absolutely massive basic blocks, typically in global constructors that are vulnerable to heavy inlining. When these blocks are dense with DBG_VALUE instructions, we can hit near quadratic complexity in DwarfDebug's validThroughout function. The problem is caused by: * validThroughout having to step through all instructions in the block to examine their lexical scope, * and a high proportion of instructions in that block being DBG_VALUEs for a unique variable fragment, Leading to us stepping through every instruction in the block, for (nearly) each instruction in the block. By adding this guard, we force variables in large blocks to use a location list rather than a single-location expression, as shown in the added test. This shouldn't change the meaning of the output DWARF at all: instead we use a less efficient DWARF encoding to avoid a poor-performance code path. Differential Revision: https://reviews.llvm.org/D83236	2020-07-08 10:30:09 +01:00
Roman Lebedev	986cd9219d	[NFCI][llvm-reduce] Cleanup Delta passes to use Oracle abstraction Summary: I think, this results in much more understandable/readable flow. At least the original logic was perhaps the most hard thing for me to grasp when taking an initial look on the delta passes. Reviewers: nickdesaulniers, dblaikie, diegotf, george.burgess.iv Reviewed By: nickdesaulniers Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D83287	2020-07-08 12:26:00 +03:00
Simon Pilgrim	aa90871f1e	Fix MSVC "not all control paths return a value" warnings. NFC.	2020-07-08 10:18:36 +01:00
Simon Pilgrim	fa3b4e792c	[X86][AVX] Remove redundant EXTRACT_VECTOR_ELT(VBROADCAST(SCALAR())) fold Noticed while looking for similar cases to rG931ec74f7a29 - SimplifyDemandedVectorElts and shuffle combining both should handle this now.	2020-07-08 10:18:36 +01:00
Georgii Rymar	6e1353517c	[DebugInfo/DWARF] - Test invalid CFI opcodes properly and refine related `CFIProgram::parse` code. There are following issues with `CFIProgram::parse` code: 1) Invalid CFI opcodes were never tested. And currently a test would fail when the `LLVM_ENABLE_ABI_BREAKING_CHECKS` is enabled. It happens because the `DataExtractor::Cursor C` remains unchecked when the "Invalid extended CFI opcode" error is reported: ``` .eh_frame section at offset 0x1128 address 0x0: Program aborted due to an unhandled Error: Error value was Success. (Note: Success values must still be checked prior to being destroyed). ``` 2) It is impossible to reach the "Invalid primary CFI opcode" error with the current code. There are 3 possible primary opcode values and all of them are handled. Hence this error should be replaced with llvm_unreachable. 3) Errors currently reported are upper-case. This patch refines the code in the `CFIProgram::parse` method to fix all issues mentioned and adds unit tests for all possible invalid extended CFI opcodes. Differential revision: https://reviews.llvm.org/D82868	2020-07-08 12:10:23 +03:00
Georgii Rymar	54223457e2	[llvm-readobj] - Refine error reporting in MipsGOTParser<ELFT> helper. This is a follow-up for D83225. This does the following: 1) Adds missing tests for existent errors. 2) Stops using `unwrapOrError` to propagate errors to caller. (I am trying to get rid of all `unwrapOrErr` calls in the llvm-readelf code). 3) Improves error messages reported slightly. Differential revision: https://reviews.llvm.org/D83314	2020-07-08 12:05:52 +03:00
David Sherwood	89db14b5e3	[CodeGen] Fix wrong use of getVectorNumElements() in DAGTypeLegalizer::SplitVecRes_ExtendOp In DAGTypeLegalizer::SplitVecRes_ExtendOp I have replaced an invalid call to getVectorNumElements() with a call to getVectorMinNumElements(), since the code path works for both fixed and scalable vectors. This fixes up a warning in the following test: sve-sext-zext.ll Differential Revision: https://reviews.llvm.org/D83197	2020-07-08 09:53:20 +01:00
David Sherwood	e7d954b756	[CodeGen] Fix wrong use of getVectorNumElements in PromoteIntRes_EXTRACT_SUBVECTOR Calling getVectorNumElements() is not safe for scalable vectors and we should normally use getVectorElementCount() instead. However, for the code changed in this patch I decided to simply move the instantiation of the variable 'OutNumElems' lower down to the place where only fixed-width vectors are used, and hence it is safe to call getVectorNumElements(). Fixes up one warning in this test: sve-sext-zext.ll Differential Revision: https://reviews.llvm.org/D83195	2020-07-08 09:36:34 +01:00
David Sherwood	bd3697d837	[CodeGen] Fix warnings in sve-ld1-addressing-mode-reg-imm.ll For the GetElementPtr case in function AddressingModeMatcher::matchOperationAddr I've changed the code to use the TypeSize class instead of relying upon the implicit conversion to a uint64_t. As part of this we now check for scalable types and if we encounter one just bail out for now as the subsequent optimisations doesn't currently support them. This changes fixes up all warnings in the following tests: llvm/test/CodeGen/AArch64/sve-ld1-addressing-mode-reg-imm.ll llvm/test/CodeGen/AArch64/sve-st1-addressing-mode-reg-imm.ll Differential Revision: https://reviews.llvm.org/D83124	2020-07-08 09:16:00 +01:00
Heejin Ahn	07746c07ec	[WebAssembly] Generate unreachable after __stack_chk_fail `__stack_chk_fail` does not return, but `unreachable` was not generated following `call __stack_chk_fail`. This had a possibility to generate an invalid binary for functions with a return type, because `__stack_chk_fail`'s return type is void and `call __stack_chk_fail` can be the last instruction in the function whose return type is non-void. Generating `unreachable` after it makes sure CFGStackify's `fixEndsAtEndOfFunction` handles it correctly. Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D83277	2020-07-08 01:02:05 -07:00

... 4 5 6 7 8 ...

200009 Commits