llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 12:41:49 +01:00

Author	SHA1	Message	Date
Florian Hahn	ca7c734d47	[DSE] Eliminate stores at the end of the function. This patch add support for eliminating MemoryDefs that do not have any aliasing users, which indicates that there are no reads/writes to the memory location until the end of the function. To eliminate such defs, we have to ensure that the underlying object is not visible in the caller and does not escape via returning. We need a separate check for that, as InvisibleToCaller does not consider returns. Reviewers: dmgreen, rnk, efriedma, bryant, asbirlea, Tyker, george.burgess.iv Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D72631	2020-06-24 12:58:20 +01:00
sstefan1	bc3cd62620	[OpenMPOpt] ICV macro definitions Summary: This defines some basic information about ICVs in `OMPKinds.def`. We also emit remarks with initial values for each function (which are default for now) as a way to test this. Reviewers: jdoerfert, JonChesterfield, hamax97, jhuber6 Subscribers: yaxunl, hiraditya, guansong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82193	2020-06-24 13:43:35 +02:00
Simon Pilgrim	495bb9acd5	ObjCARC.h - remove unnecessary includes. NFC. Add implicit InstIterator.h dependency in ObjCARCContract.cpp	2020-06-24 12:30:59 +01:00
Simon Pilgrim	27e32d5530	StackLifetime.h - remove unused AliasAnalysis.h include. NFC.	2020-06-24 12:30:59 +01:00
Georgii Rymar	5a1dd77522	[llvm-readelf] - Don't crash when e_shstrndx==SHN_XINDEX, but there is no section header. Currently we crash when trying to print --sections and the SHN_XINDEX escape value is used for the e_shstrndx field, but there is no section header at index 0 to read the value from. Differential revision: https://reviews.llvm.org/D82374	2020-06-24 14:09:34 +03:00
Cullen Rhodes	f2a50c987a	[AArch64][SVE2] Add bfloat16 support to whilerw/whilewr intrinsics Reviewed By: fpetrogalli Differential Revision: https://reviews.llvm.org/D82399	2020-06-24 10:06:31 +00:00
Cullen Rhodes	8543c38ff5	[AArch64][SVE] Add bfloat16 support to perm and select intrinsics Summary: Added for following intrinsics: * zip1, zip2, zip1q, zip2q * trn1, trn2, trn1q, trn2q * uzp1, uzp2, uzp1q, uzp2q * splice * rev * sel Reviewed By: david-arm Differential Revision: https://reviews.llvm.org/D82182	2020-06-24 10:04:51 +00:00
Kerry McLaughlin	31c721c75b	[AArch64][SVE] Add bfloat16 support to load intrinsics Summary: Bfloat16 support added for the following intrinsics: - LD1 - LD1RQ - LDNT1 - LDNF1 - LDFF1 Reviewers: sdesmalen, c-rhodes, efriedma, stuij, fpetrogalli, david-arm Reviewed By: fpetrogalli Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D82298	2020-06-24 10:32:19 +01:00
Florian Hahn	0963627075	[DSE,MSSA] Precommit small test changes for D72631.	2020-06-24 10:17:09 +01:00
alex-t	c89646a3e3	[AMDGPU] Enable compare operations to be selected by divergence Summary: Details: This patch enables SETCC to be selected to S_CMP_* if uniform and V_CMP_* if divergent. Reviewers: rampitec, arsenm Reviewed By: rampitec Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82194	2020-06-24 11:50:40 +03:00
Simon Tatham	e3b33eadce	[ARM][BFloat] Legalize bf16 type even without fullfp16. Summary: This change permits scalar bfloats to be loaded, stored, moved and used as function call arguments and return values, whenever the bf16 feature is supported by the subtarget. Previously that was only supported in the presence of the fullfp16 feature, because the code generation strategy depended on instructions from that extension. This change adds alternative code generation strategies so that those operations can be done even without fullfp16. The strategy for loads and stores is to replace VLDRH/VSTRH with integer LDRH/STRH plus a move between register classes. I've written isel patterns for those, conditional on //not// having the fullfp16 feature (so that in the fullfp16 case, the existing patterns will still be used). For function arguments and returns, instead of writing isel patterns to match `VMOVhr` and `VMOVrh`, I've avoided generating those SDNodes in the first place, by factoring out the code that constructs them into helper functions `MoveToHPR` and `MoveFromHPR` which have a fallback for non-fullfp16 subtargets. The current output code is not especially pretty: in the new test file you can see unnecessary store/load pairs implementing no-op bitcasts, and lots of pointless moves back and forth between FP registers and GPRs. But it at least works, which is an improvement on the previous situation. Reviewers: dmgreen, SjoerdMeijer, stuij, chill, miyuki, labrinea Reviewed By: dmgreen, labrinea Subscribers: labrinea, kristof.beyls, hiraditya, danielkiss, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82372	2020-06-24 09:36:26 +01:00
LLVM GN Syncbot	fe9dbca461	[gn build] Port 96d4ccf00c8	2020-06-24 08:17:48 +00:00
Craig Topper	479811648a	[X86] Speculatively fix to X86AvoidStoreForwardingBlocks not deference a machine mem operand if there isn't one present. Eric Christopher informed me that FastISel memcpy handling creates load/store instructions without mem operands. We should fix that, but I doubt that's the only case of missed mem operands so seems better to be defensive here. I don't have a test case yet, but I'll try to add one if i get a test from Eric.	2020-06-24 00:13:58 -07:00
Craig Topper	ee765be827	[X86] Add mayLoad/mayStore flags to some X87 instructions that don't have isel patterns to infer them from. Should remove part of the differences in D81833 due to some some of these getting isel patterns.	2020-06-23 23:40:30 -07:00
Alex Lorenz	f1629757c5	[cmake] configure the host triple on an Apple Silicon machine correctly The cmake build of LLVM now uses the appropriate arm64 arch for the host triple when building llvm-project on an Apple Silicon mac. Differential Revision: https://reviews.llvm.org/D82428	2020-06-23 21:08:11 -07:00
Eli Friedman	be8b475c4d	[BitcodeReader] Fix DelayedShuffle handling for ConstantExpr shuffles. The indexing was messed up, so the result was completely broken. Shuffle constant exprs are rare in practice; without vscale types, constant folding generally elminates them. So sort of hard to trip over. Fixes regression from D72467. Differential Revision: https://reviews.llvm.org/D80330	2020-06-23 19:50:30 -07:00
Amara Emerson	7f4328cecf	[AArch64][GlobalISel] Improve codegen for some constant vectors by using constant pool loads. There's more smarts in AArch64ISelLowering that we don't have yet, but this change incrementally improves some of the more common patterns. I think future iterations will want to use some combination of PostLegalizerCombiner and the selector to catch the other cases. Differential Revision: https://reviews.llvm.org/D82340	2020-06-23 19:23:47 -07:00
Eli Friedman	9d315e1c2b	Remove GlobalValue::getAlignment(). This function is deceptive at best: it doesn't return what you'd expect. If you have an arbitrary GlobalValue and you want to determine the alignment of that pointer, Value::getPointerAlignment() returns the correct value. If you want the actual declared alignment of a function or variable, GlobalObject::getAlignment() returns that. This patch switches all the users of GlobalValue::getAlignment to an appropriate alternative. Differential Revision: https://reviews.llvm.org/D80368	2020-06-23 19:13:42 -07:00
Vedant Kumar	8bce4cf299	[SimplifyCFG] Drop debug loc in SpeculativelyExecuteBB Summary: According to HowToUpdateDebugInfo.rst: ``` Preserving the debug locations of speculated instructions can make it seem like a condition is true when it's not (or vice versa), which leads to a confusing single-stepping experience ``` This patch follows the recommendation to drop debug locations on speculated instructions. Reviewers: aprantl, davide Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82420	2020-06-23 18:25:52 -07:00
Matt Arsenault	4b2eead6d5	AMDGPU/GlobalISel: Fix fixed ABI special VGPR function arguments I forgot to copy the new fixed function ABI into GlobalISel, so this was mismatched with the DAG compiled calling function. This was allocating part of the argument list to v31, which was supposed to be reserved for the workitem IDs.	2020-06-23 21:21:35 -04:00
Amy Huang	a7e7de3de6	[NFC] Remove outdated comment in llvm-symbolizer test case.	2020-06-23 17:10:46 -07:00
Eli Friedman	bd863e1c80	[AArch64][SVE] Add legalization support for i32/i64 vector srem/urem Implement them on top of sdiv/udiv, similar to what we do for integer types. Potential future work: implementing i8/i16 srem/urem, optimizations for constant divisors, optimizing the mul+sub to mls. Differential Revision: https://reviews.llvm.org/D81511	2020-06-23 16:27:52 -07:00
Eli Friedman	0b4cac4d83	[IR] Prefer scalar type for struct indexes in GEP constant expressions. This has two advantages: one, it's simpler, and two, it doesn't require heroic pattern matching with scalable vectors. Also includes a small fix to DataLayout to allow the scalable vector testcase to work correctly. Differential Revision: https://reviews.llvm.org/D82061	2020-06-23 16:14:36 -07:00
Tony	21efe1c804	[AMDGPU] Update AMD GPU processor information Summary: - Add product names for some processors. - Correct XNACK support for a processor. Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82348	2020-06-23 18:47:56 -04:00
Sam Clegg	28b30ccf89	[WebAssembly] Fix for use of uninitialized member in WasmObjectWriter.cpp Currently, section indices may be passed uninitialized by value if writing the section fails. Removes section indices form class initialization and returns them from the write{Code,Data}Section function calls instead. Patch by Gui Andrade! Differential Revision: https://reviews.llvm.org/D81702	2020-06-23 15:26:18 -07:00
Jonas Devlieghere	5f2e7e7476	[lldb] Fix the modules build Fixes error: invalid operands to binary expression ('llvm::StringRef' and 'const char [6]')	2020-06-23 15:13:30 -07:00
Luís Marques	05cc619556	[RISCV][NFC] Add tests for folds of ADDIs into load/stores This patch adds tests for folds of ADDIs into load/stores, focusing on load/stores with nonzero offsets. When the offset is nonzero we currently don't do the fold. A follow-up patch will improve on that. Differential Revision: https://reviews.llvm.org/D79689	2020-06-23 22:59:54 +01:00
David Green	680cf8ff46	[ARM] Mark more integer instructions as not having side effects. LDRD and STRD along with UBFX and SBFX are selected from DAGToDAG transforms, so do not have tblgen patterns. They don't get marked as having side effects so cannot be scheduled as efficiently as you would like. This specifically marks then as not having side effects. Differential Revision: https://reviews.llvm.org/D82358	2020-06-23 22:45:51 +01:00
Christopher Tetreault	d945366684	[SVE] Remove calls to VectorType::getNumElements from AsmParser Reviewers: efriedma, RKSimon, c-rhodes, fpetrogalli Reviewed By: fpetrogalli Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82208	2020-06-23 14:31:49 -07:00
David Green	b31f621535	[ARM] Cortex-M4 integer instructions scheduler info test. NFC Most useful at the moment for showing where unpredicatable instructions are.	2020-06-23 22:26:23 +01:00
Zequan Wu	027c7186ad	[ASan][MSan] Remove EmptyAsm and set the CallInst to nomerge to avoid from merging. Summary: `nomerge` attribute was added at D78659. So, we can remove the EmptyAsm workaround in ASan the MSan and use this attribute. Reviewers: vitalybuka Reviewed By: vitalybuka Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82322	2020-06-23 14:22:53 -07:00
Ryan Santhiraraja	a010b09b66	Preserve GlobalsAA analysis result in InjectTLIMappings InjectTLIMappings fails to preserve the analysis result of GlobalsAA. Not preserving the analysis might affect benchmark performance. This change fixes this issue. Patch by: Ryan Santhiraraja <rsanthir@quicinc.com> Reviewers: fpetrogalli, joerg, fhahn Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D82343	2020-06-23 22:05:42 +01:00
Adrian Prantl	186de7d4d3	Add missing string conversions to fix a compile error in Local.h	2020-06-23 13:36:56 -07:00
Nikita Popov	0aa6512bc2	[IR] Remove MSVC warning workaround (NFC) While LLVM does fold this to x+1, GCC does not. As this is hot code, let's try to avoid that. According to https://developercommunity.visualstudio.com/content/problem/211134/unsigned-integer-overflows-in-constexpr-functionsa.html this spurious warning in MSVC has been fixed in Visual Studio 2019 Version 16.4. Let's see if there are any build bots running old MSVC versions with warnings treated as errors...	2020-06-23 22:33:57 +02:00
Christopher Tetreault	0ab258ac79	[SVE] Remove calls to VectorType::getNumElements from Bitcode Reviewers: efriedma, evgeny777, tejohnson, david-arm, kmclaughlin Reviewed By: david-arm Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82209	2020-06-23 13:21:40 -07:00
Nikita Popov	be6c1c439a	[IR] Remove unnecessary uint64_t casts (NFC) As pointed out by foad, it's not necessary to work on uint64_t here. The values used here fit uint8_t.	2020-06-23 22:20:15 +02:00
Florian Hahn	7b6e066ca8	[DSE,MSSA] Treat `store 0` after calloc as noop stores. This patch extends storeIsNoop to also detect stores of 0 to an calloced object. This basically ports the logic from legacy DSE to the MemorySSA backed version. It triggers in a few cases on MultiSource, SPEC2000, SPEC2006 with -O3 LTO: Same hash: 218 (filtered out) Remaining: 19 Metric: dse.NumNoopStores Program base patch2 diff test-suite...CFP2000/177.mesa/177.mesa.test 1.00 15.00 1400.0% test-suite...6/482.sphinx3/482.sphinx3.test 1.00 14.00 1300.0% test-suite...lications/ClamAV/clamscan.test 2.00 28.00 1300.0% test-suite...CFP2006/433.milc/433.milc.test 1.00 8.00 700.0% test-suite...pplications/oggenc/oggenc.test 2.00 9.00 350.0% test-suite.../CINT2000/176.gcc/176.gcc.test 6.00 6.00 0.0% test-suite.../CINT2006/403.gcc/403.gcc.test NaN 137.00 nan% test-suite...libquantum/462.libquantum.test NaN 3.00 nan% test-suite...6/464.h264ref/464.h264ref.test NaN 7.00 nan% test-suite...decode/alacconvert-decode.test NaN 2.00 nan% test-suite...encode/alacconvert-encode.test NaN 2.00 nan% test-suite...ications/JM/ldecod/ldecod.test NaN 9.00 nan% test-suite...ications/JM/lencod/lencod.test NaN 39.00 nan% test-suite.../Applications/lemon/lemon.test NaN 2.00 nan% test-suite...pplications/treecc/treecc.test NaN 4.00 nan% test-suite...hmarks/McCat/08-main/main.test NaN 4.00 nan% test-suite...nsumer-lame/consumer-lame.test NaN 3.00 nan% test-suite.../Prolangs-C/bison/mybison.test NaN 1.00 nan% test-suite...arks/mafft/pairlocalalign.test NaN 30.00 nan% Reviewers: efriedma, zoecarver, asbirlea Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D82204	2020-06-23 21:01:39 +01:00
Your Name	47713f7b79	[AMDGPU/MemOpsCluster] Implement new heuristic for computing max mem ops cluster size Summary: Make use of both the - (1) clustered bytes and (2) cluster length, to decide on the max number of mem ops that can be clustered. On an average, when loads are dword or smaller, consider `5` as max threshold, otherwise `4`. This heuristic is purely based on different experimentation conducted, and there is no analytical logic here. Reviewers: foad, rampitec, arsenm, vpykhtin Reviewed By: rampitec Subscribers: llvm-commits, kerbowa, hiraditya, t-tye, Anastasia, tpr, dstuttard, yaxunl, nhaehnle, wdng, jvesely, kzhuravl, thakis Tags: #llvm Differential Revision: https://reviews.llvm.org/D82393	2020-06-24 00:39:41 +05:30
Zion Nimchuk	1527d4f60f	Change CMake so that we only look for Z3 when LLVM_ENABLE_Z3_SOLVER is enabled Reviewers: mikhail.ramalho Reviewed By: mikhail.ramalho Subscribers: mehdi_amini, mgorny, mikhail.ramalho, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75544	2020-06-23 14:49:56 -04:00
Mehdi Amini	d46e92d6cb	Fix incorrect "REQUIRE" (default_target->default_triple) introduced in 59f45a1cdb3 Adding `default_target` fixed the build by excluding these tests... but this excluded these tests from ever running! The correct feature check is `default_triple`	2020-06-23 18:22:39 +00:00
Christopher Tetreault	34144cdb66	[SVE] Remove calls to VectorType::getNumElements from FuzzMutate Reviewers: efriedma, bkramer, kmclaughlin, sdesmalen Reviewed By: sdesmalen Subscribers: tschuett, hiraditya, rkruppe, psnobl, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82212	2020-06-23 11:02:20 -07:00
Simon Pilgrim	1b7526b7e8	[X86][AVX] Attempt to lower v16i32/v16f32 shuffles with lowerShuffleAsRepeatedMaskAndLanePermute Avoids prematurely creating permps/permd variable shuffles. Fixes PR46249	2020-06-23 18:33:50 +01:00
Simon Pilgrim	d1c97bc19b	[X86][AVX] Add v16f32 variant of PR46249 test case	2020-06-23 18:14:57 +01:00
Arthur Eubanks	39765fa3ed	[NewPM] Attempt to run opt passes specified via -foo-pass under NPM Summary: In order to enable mass testing of opt under NPM, specifically passes specified via -foo-pass. This is gated under a new opt flag -enable-new-pm. Currently the pass flag parser looks for legacy PM passes with the name "foo" (for opt arg "-foo") and creates a PassInfo for each one. Here we take the (legacy PM) pass name and try to match it with one defined in (NPM) PassRegistry.def. Ultimately if we want all tests to pass like this, we'll need to port all passes to NPM and register them in PassRegistry.def under the same name as they were reigstered in the legacy PM. Maybe at some point we'll migrate all -foo to --passes=foo, but that would be after the NPM switch. Flipping on the flag causes 2XXX failures under check-llvm. By far most of them are passes either not ported to NPM or don't have the same name in PassRegistry.def as their old name. Reviewers: hans, echristo, asbirlea, leonardchan Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D82320	2020-06-23 10:10:40 -07:00
Simon Pilgrim	12aae3e8af	WithColor.h - reduce CommandLine.h include to forward declaration. NFC. WithColor.h is one of the most common headers, we can severely reduce its frontend impact (in ClangBuildAnalyzer reports) by removing the bulky CommandLine.h include, forward declaring llvm:🆑:OptionCategory and just including raw_ostream.h instead.	2020-06-23 17:07:53 +01:00
Simon Pilgrim	115dd458c9	[X86][AVX] Add PR46249 test case	2020-06-23 17:07:53 +01:00
Xing GUO	98dc9021fb	[ObjectYAML][DWARF] Remove unused context. NFC. The context is unused. This patch helps remove it. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82351	2020-06-24 00:02:51 +08:00
Sanjay Patel	5e2f6a06b2	[PhaseOrdering] add test for missed vectorization; NFC (PR43745) Either SLP or VectorCombine should be able to form vector compares reliably on this example.	2020-06-23 11:57:32 -04:00
Xing GUO	708b87a1e4	[ObjectYAML][ELF] Add support for emitting the .debug_pubtypes section. This patch helps add support for emitting the .debug_pubtypes section. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D82347	2020-06-24 00:01:07 +08:00
Momchil Velikov	631e350306	[ARM] Describe defs/uses of VLLDM and VLSTM The VLLDM and VLSTM instructions are incompletely specified. They (potentially) write (or read, respectively) registers Q0-Q7, VPR, and FPSCR, but the compiler is unaware of it. In the new test case `cmse-vlldm-no-reorder.ll` case the compiler missed an anti-dependency and reordered a `VLLDM` ahead of the instruction, which stashed the return value from the non-secure call, effectively clobbering said value. This test case does not fail with upstream LLVM, because of scheduling differences and I couldn't find a test case for the VLSTM either. Differential Revision: https://reviews.llvm.org/D81586	2020-06-23 16:04:23 +01:00

1 2 3 4 5 ...

198923 Commits