llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Craig Topper	c3928d2bd0	[RISCV] Use (not X) in instead of (xor X, -1) in isel patterns to improve readability. NFC	2020-11-07 11:50:52 -08:00
LLVM GN Syncbot	2f720b3a66	[gn build] Port d725f1ce531	2020-11-07 19:18:18 +00:00
Jonas Devlieghere	85b15f0ad5	[DWARFLinker] Convert analyzeContextInfo to a work list (NFC) Convert analyzeContextInfo to a work list using the same approach I used to remove the recursion from lookForDIEsToKeep. This fixes the crash reported in https://llvm.org/PR48029. Tested using the reproducer attached to PR48029 as well as by comparing the clang MD5 hashes before and after the change (with and without gmodules). Differential revision: https://reviews.llvm.org/D90873	2020-11-07 10:46:09 -08:00
Nikita Popov	80a7041502	[BasicAA] Unify struct/other offset (NFC) The distinction between StructOffset and OtherOffset has been originally introduced by 82069c44ca39df9d506e16bfb0ca2481866dd0bb, which applied different reasoning to both offset kinds. However, this distinction was not actually correct, and has been fixed by c84e77aeaefccb8d0c4c508b8017dcad80607f53. Since then, we only ever consider the sum StructOffset + OtherOffset, so we may as well store it in that form directly.	2020-11-07 18:56:05 +01:00
Nikita Popov	c44d13b67c	[BasicAA] Use smul_ov helper (NFCI) Instead of performing the multiplication in double the bit width and using active bits to determine overflow, use the existing smul_ov() APInt method to detect overflow. The smul_ov() implementation is not particularly efficient, but it's still better than doing this a wide, usually 128-bit, type.	2020-11-07 18:14:48 +01:00
Nikita Popov	c77875582f	[CaptureTracking] Add statistics (NFC) Add basic statistics on the number of pointers that have been determined to maybe capture / not capture.	2020-11-07 12:57:00 +01:00
Nikita Popov	32c88b4c53	[CaptureTracking] Early abort on too many uses (NFCI) If there are too many uses, we should directly return -- there's no point in inspecting the remaining uses in the worklist, as we have to conservatively assume a capture anyway. This also means that tooManyUses() gets called exactly once, rather than potentially many times. This restores the behavior prior to e9832dfdf366ddffba68164adb6855d17c9f87c1, where this was accidentally changed while moving the AddUses logic into a closure, thus making the return a return from the closure rather than the whole function.	2020-11-07 11:52:08 +01:00
Nikita Popov	160413ec80	[CaptureTrackingTest] Add missing override marker (NFC)	2020-11-07 11:44:02 +01:00
Nikita Popov	c9414e5876	[CaptureTracking] Correctly handle multiple uses in one instruction If the same value is used multiple times in the same instruction, CaptureTracking may end up reporting the wrong use as being captured, and/or report the same use as being captured multiple times. Make sure that all checks take the use operand number into account, rather than performing unreliable comparisons against the used value. I'm not sure whether this can cause any problems in practice, but at least some capture trackers (ArgUsesTracker, AACaptureUseTracker) do care about which call argument is captured.	2020-11-07 11:31:20 +01:00
Nikita Popov	5c9c72b83d	[CaptureTracking] Avoid duplicate shouldExplode() check (NFCI) We check shouldExplore() before adding uses to the worklist, so uses that should not be explored will not reach captured() in the first place.	2020-11-07 10:16:58 +01:00
Jonas Devlieghere	0a89c7f7ee	[DWARFLinker] Use union to reduce sizeof(WorklistItem) (NFC) Reduce the size of the WorklistItem struct by using a struct.	2020-11-06 23:24:41 -08:00
Kazu Hirata	f462bf9b0e	[BranchProbabilityInfo] Simplify getEdgeProbability (NFC) The patch simplifies BranchProbabilityInfo::getEdgeProbability by handling two cases separately, depending on whether we have edge probabilities. - If we have edge probabilities, then add up probabilities for successors being equal to Dst. - Otherwise, return the number of ocurrences divided by the total number of successors. Differential Revision: https://reviews.llvm.org/D90980	2020-11-06 22:47:22 -08:00
Fangrui Song	8b06f86d2f	[test] Fix Other/new-pass-manager.ll with has different behaviors whether or not Polly is enabled after D89158	2020-11-06 22:19:37 -08:00
Fangrui Song	5445a20663	[test] Fix Other/new-pass-manager.ll & clang/test/Misc/loop-opt-setup.c	2020-11-06 21:55:11 -08:00
Atmn Patel	3dd5790777	Revert "[LoopDeletion] Allows deletion of possibly infinite side-effect free loops" This reverts commit 0b17c6e4479d62bd4ff05c48d6cdf340b198832f. This patch causes a compile-time error in SCEV.	2020-11-07 00:32:12 -05:00
Fangrui Song	840325e8ac	AsmPrinter/Dwarf*: Use llvm::Register instead of unsigned	2020-11-06 21:00:28 -08:00
Fangrui Song	d460b6115e	[AsmPrinter] Rename ByteStreamer::EmitInt8 to emitInt8 to be consistent with other emit*	2020-11-06 20:02:56 -08:00
Jonas Devlieghere	34b67b4c77	[DWARFLinker] Add CompileUnit::getInfo helper that takes a DWARFDie (NFC) Eliminate the need to go through the DIE index by passing the DIE to CompileUnit::getInfo directly. Before: unsigned Idx = Unit->getOrigUnit().getDIEIndex(Die); CompileUnit::DIEInfo &Info = Unit->getInfo(Idx); After: CompileUnit::DIEInfo &Info = Unit->getInfo(Die);	2020-11-06 19:37:44 -08:00
Atmn Patel	51ad1efef5	[LoopDeletion] Allows deletion of possibly infinite side-effect free loops From C11 and C++11 onwards, a forward-progress requirement has been introduced for both languages. In the case of C, loops with non-constant conditionals that do not have any observable side-effects (as defined by 6.8.5p6) can be assumed by the implementation to terminate, and in the case of C++, this assumption extends to all functions. The clang frontend will emit the `mustprogress` function attribute for C++ functions (D86233, D85393, D86841) and emit the loop metadata `llvm.loop.mustprogress` for every loop in C11 or later that has a non-constant conditional. This patch modifies LoopDeletion so that only loops with the `llvm.loop.mustprogress` metadata or loops contained in functions that are required to make progress (`mustprogress` or `willreturn`) are checked for observable side-effects. If these loops do not have an observable side-effect, then we delete them. Loops without observable side-effects that do not satisfy the above conditions will not be deleted. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D86844	2020-11-06 22:06:58 -05:00
Atmn Patel	2dde5b4c98	[Inliner] Handle `mustprogress` functions When inlining `mustprogress` functions, if the caller or the callee has the attribute, we drop the function attribute. The loops that have the `llvm.loop.mustprogress` metadata keep their metadata. We do not need to add new loop metadata to inlined functions because the patch in D86841 already adds the relevant loop metadata in all of the necessary places. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D87262	2020-11-06 20:03:46 -05:00
Fangrui Song	d485a2798c	[test] -mtriple=x86_64-* -> -mtriple=x86_64	2020-11-06 16:49:52 -08:00
Zequan Wu	f88097c935	[llvm-cov] Fix missing slash in -path-equivalence	2020-11-06 14:54:11 -08:00
Elvina Yakubova	7acee7fe26	[AArch64] Add pipeline model for HiSilicon's TSV110 This patch adds the scheduling and cost model for TSV110. Reviewed by: SjoerdMeijer, bryanpkc Differential Revision: https://reviews.llvm.org/D89972	2020-11-07 01:23:00 +03:00
Kazu Hirata	e786f8b19e	[TableGen] Use llvm::is_contained (NFC)	2020-11-06 14:18:01 -08:00
Eric Astor	fb1b9af1ea	[ms] [llvm-ml] Allow arbitrary strings as integer constants MASM interprets strings in expression contexts as integers expressed in big-endian base-256, treating each character as its ASCII representation. This completely eliminates the need to special-case single-character strings. Reviewed By: thakis Differential Revision: https://reviews.llvm.org/D90788	2020-11-06 17:15:49 -05:00
Atmn Patel	f15e3a4579	[LoopDeletion] Remove dead loops with no exit blocks Currently, LoopDeletion refuses to remove dead loops with no exit blocks because it cannot statically determine the control flow after it removes the block. This leads to miscompiles if the loop is an infinite loop and should've been removed. Differential Revision: https://reviews.llvm.org/D90115	2020-11-06 17:08:34 -05:00
Alexander Shaposhnikov	a0b02387ac	[llvm-objcopy][MachO] Skip sections with zero offset Some binaries can contain regular sections with zero offset and zero size. This diff makes llvm-objcopy's handling of such sections consistent with cctools's strip (which doesn't modify them), previously the tool would allocate file space for them. Test plan: make check-all Differential revision: https://reviews.llvm.org/D90796	2020-11-06 13:29:43 -08:00
Rahman Lavaee	2405269073	[obj2yaml] [yaml2obj] Add yaml support for SHT_LLVM_BB_ADDR_MAP section. YAML support allows us to better test the feature in the subsequent patches. The implementation is quite similar to the .stack_sizes section. Reviewed By: jhenderson, grimar Differential Revision: https://reviews.llvm.org/D88717	2020-11-06 12:44:42 -08:00
Valentin Churavy	b77e3682b3	Fix unwind info relocation with large code model on AArch64 Makes sure that the unwind info uses 64bits pcrel relocation if a large code model is specified and handle the corresponding relocation in the ExecutionEngine. This can happen with certain kernel configuration (the same as the one in https://reviews.llvm.org/D27609, found at least on the ArchLinux stock kernel and the one used on https://www.packet.net/) using the builtin JIT memory manager. Co-authored-by: Yichao Yu <yyc1992@gmail.com> Differential Revision: https://reviews.llvm.org/D27629	2020-11-06 14:41:30 -05:00
Valentin Churavy	63578da85a	[RTDYLD] support absolute relocations where needed These appear in some sections, such as DWARF tables, since RuntimeDyldELF explicitly maps to this as a sentinel value: `29d1fba7b5/llvm/lib/ExecutionEngine/RuntimeDyld/RuntimeDyldELF.cpp (L1199)` That could then be a source of problems if it tried to examine these sections (for example, with either setProcessAllSections(true) or ORCv2 on i686). Replaces https://reviews.llvm.org/D89241 Reviewed By: lhames, vchuravy Differential Revision: https://reviews.llvm.org/D90722	2020-11-06 14:08:59 -05:00
Kazu Hirata	879db4ed4b	[BranchProbabilityInfo] Use succ_size (NFC)	2020-11-06 11:05:35 -08:00
Craig Topper	3c7db00c2f	[RISCV] Add test case to show incorrect matching to sroiw when the or mask does not have 1s in the upper 32 bits. The matching code for sroiw is truncating the mask to 32 bits before checking its value. We need to check all 64 bits.	2020-11-06 10:58:59 -08:00
Quentin Colombet	c2874d6a67	Prevent LICM and machineLICM from hoisting convergent operations Results of convergent operations are implicitly affected by the enclosing control flows and should not be hoisted out of arbitrary loops. Patch by Xiaoqing Wu <xiaoqing_wu@apple.com> Differential Revision: https://reviews.llvm.org/D90361	2020-11-06 10:26:39 -08:00
Simon Pilgrim	c107acf938	[InstCombine] computeKnownBitsMul - use KnownBits::isNonZero() helper. Avoid an expensive isKnownNonZero() call - this is a small cleanup before moving the extra NSW functionality from computeKnownBitsMul into KnownBits::computeForMul.	2020-11-06 17:27:13 +00:00
Simon Pilgrim	964dad51f4	[SLP][AMDGPU] Regenerate packed-math tests and remove unused check prefix	2020-11-06 17:27:13 +00:00
Simon Pilgrim	1274be49b8	[VectorCombine][X86] Removed unused check prefixes	2020-11-06 17:27:12 +00:00
Kevin P. Neal	941987a165	[FPEnv] Use strictfp metadata in casting nodes The strictfp metadata was added to the casting AST nodes in D85960, but we aren't using that metadata yet. This patch adds that support. In order to avoid lots of ad-hoc passing around of the strictfp bits I updated the IRBuilder when moving from a function that has the Expr* to a function that lacks it. I believe we should switch to this pattern to keep the strictfp support from being overly invasive. For the purpose of testing that we're picking up the right metadata, I also made my tests use a pragma to make the AST's strictfp metadata not match the global strictfp metadata. This exposes issues that we need to deal with in subsequent patches, and I believe this is the right method for most all of our clang strictfp tests. Differential Revision: https://reviews.llvm.org/D88913	2020-11-06 11:56:12 -05:00
Jay Foad	0cb73d61a1	[TableGen] Indentation and whitespace fixes in generated code. NFC. Some of these were found by running clang-format over the generated code, although that complains about far more issues than I have fixed here. Differential Revision: https://reviews.llvm.org/D90937	2020-11-06 16:10:57 +00:00
Jay Foad	10e740a528	[AMDGPU] Simplify exp target parsing Treat any identifier as a potential exp target and diagnose them all the same way as "invalid exp target"s. Differential Revision: https://reviews.llvm.org/D90947	2020-11-06 16:09:34 +00:00
David Spickett	4187c2aea8	[Arm][MC] Remove unused prefixes in .arch_extension fp tests idiv: There is no difference between Armv7m and Thumbv7M behaviour so the specific CHECKs are not needed. The errors for Armv7-a and Thumbv7-a will always include "ARM" or "THUMB" respectively so they need their own CHECK prefix, making CHECK-V7 redundant. mp: Behaviour is dependent on whether the triple is v6/v7/v7M regardless of being Arm or Thumb. So we don't need the more specific CHECK-ARMv7M etc. simd: Errors are either v7 only, or v7 and v8 so CHECK-V8 is not needed. fp: Same as simd Reviewed By: ostannard Differential Revision: https://reviews.llvm.org/D90918	2020-11-06 15:13:07 +00:00
Jay Foad	d74fd5721f	[AMDGPU] Run exp tests on GFX9 and GFX10 too. NFC.	2020-11-06 15:03:05 +00:00
Roman Lebedev	bba9249b3e	[NFC][InstCombine] Update few comment updates i missed in 0ac56e8eaaeb As pointed out in post-commit review in that commit	2020-11-06 17:38:00 +03:00
Arnold Schwaighofer	3fe61868a9	llvm.coro.id.async lowering: Parameterize how-to restore the current's continutation context and restart the pipeline after splitting The `llvm.coro.suspend.async` intrinsic takes a function pointer as its argument that describes how-to restore the current continuation's context from the context argument of the continuation function. Before we assumed that the current context can be restored by loading from the context arguments first pointer field (`first_arg->caller_context`). This allows for defining suspension points that reuse the current context for example. Also: llvm.coro.id.async lowering: Add llvm.coro.preprare.async intrinsic Blocks inlining until after the async coroutine was split. Also, change the async function pointer's context size position struct async_function_pointer { uint32_t relative_function_pointer_to_async_impl; uint32_t context_size; } And make the position of the `async context` argument configurable. The position is specified by the `llvm.coro.id.async` intrinsic. rdar://70097093 Differential Revision: https://reviews.llvm.org/D90783	2020-11-06 06:22:46 -08:00
Paul C. Anagnostopoulos	72e317cd3c	[NVPTX] [TableGen] Use new features of TableGen to simplify and clarify. Differential Revision: https://reviews.llvm.org/D90861	2020-11-06 09:20:19 -05:00
Simon Moll	6dd86e401d	[VE] Add v(m)regs to preserve_all reg mask V(m)regs where defined before CSR_preserve_all was, add them now. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90912	2020-11-06 15:16:11 +01:00
Than McIntosh	b387ac6f07	[NFC] Fix typo in comment. Differential Revision: https://reviews.llvm.org/D90846	2020-11-06 09:03:07 -05:00
Paul C. Anagnostopoulos	af8abf220b	[TableGen] Clarify text and fix errors in the Programmer's Reference Differential Revision: https://reviews.llvm.org/D90881	2020-11-06 08:56:29 -05:00
Simon Moll	3379eef9b9	[VE][NFC] Refactor to support more than one calling conv Prepare for supporting different calling conventions by factoring out things into CC-dependent selection functions (getParamCC, getReturnCC). Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D90911	2020-11-06 14:25:25 +01:00
Florian Hahn	c9d167829a	[SLP] Also try to vectorize incoming values of PHIs . Currently we do not consider incoming values of PHIs as roots for SLP vectorization. This means we miss scenarios like the one in the test case and PR47670. It appears quite straight-forward to consider incoming values of PHIs as roots for vectorization, but I might be missing something that makes this problematic. In terms of vectorized instructions, this applies to quite a few benchmarks across MultiSource/SPEC2000/SPEC2006 on X86 with -O3 -flto Same hash: 185 (filtered out) Remaining: 52 Metric: SLP.NumVectorInstructions Program base patch diff test-suite...ProxyApps-C++/HPCCG/HPCCG.test 9.00 27.00 200.0% test-suite...C/CFP2000/179.art/179.art.test 8.00 22.00 175.0% test-suite...T2006/458.sjeng/458.sjeng.test 14.00 30.00 114.3% test-suite...ce/Benchmarks/PAQ8p/paq8p.test 11.00 18.00 63.6% test-suite...s/FreeBench/neural/neural.test 12.00 18.00 50.0% test-suite...rimaran/enc-3des/enc-3des.test 65.00 95.00 46.2% test-suite...006/450.soplex/450.soplex.test 63.00 89.00 41.3% test-suite...ProxyApps-C++/CLAMR/CLAMR.test 177.00 250.00 41.2% test-suite...nchmarks/McCat/18-imp/imp.test 13.00 18.00 38.5% test-suite.../Applications/sgefa/sgefa.test 26.00 35.00 34.6% test-suite...pplications/oggenc/oggenc.test 100.00 133.00 33.0% test-suite...6/482.sphinx3/482.sphinx3.test 103.00 134.00 30.1% test-suite...oxyApps-C++/miniFE/miniFE.test 169.00 213.00 26.0% test-suite.../Benchmarks/Olden/tsp/tsp.test 59.00 73.00 23.7% test-suite...TimberWolfMC/timberwolfmc.test 503.00 622.00 23.7% test-suite...T2006/456.hmmer/456.hmmer.test 65.00 79.00 21.5% test-suite...libquantum/462.libquantum.test 58.00 68.00 17.2% test-suite...ternal/HMMER/hmmcalibrate.test 84.00 98.00 16.7% test-suite...ications/JM/ldecod/ldecod.test 351.00 401.00 14.2% test-suite...arks/VersaBench/dbms/dbms.test 52.00 57.00 9.6% test-suite...ce/Benchmarks/Olden/bh/bh.test 118.00 128.00 8.5% test-suite.../Benchmarks/Bullet/bullet.test 6355.00 6880.00 8.3% test-suite...nsumer-lame/consumer-lame.test 480.00 519.00 8.1% test-suite...000/183.equake/183.equake.test 226.00 244.00 8.0% test-suite...chmarks/Olden/power/power.test 105.00 113.00 7.6% test-suite...6/471.omnetpp/471.omnetpp.test 92.00 99.00 7.6% test-suite...ications/JM/lencod/lencod.test 1173.00 1261.00 7.5% test-suite...0/253.perlbmk/253.perlbmk.test 55.00 59.00 7.3% test-suite...oxyApps-C/miniAMR/miniAMR.test 92.00 98.00 6.5% test-suite...chmarks/MallocBench/gs/gs.test 446.00 473.00 6.1% test-suite.../CINT2006/403.gcc/403.gcc.test 464.00 491.00 5.8% test-suite...6/464.h264ref/464.h264ref.test 998.00 1055.00 5.7% test-suite...006/453.povray/453.povray.test 5711.00 6007.00 5.2% test-suite...FreeBench/distray/distray.test 102.00 107.00 4.9% test-suite...:: External/Povray/povray.test 4184.00 4378.00 4.6% test-suite...DOE-ProxyApps-C/CoMD/CoMD.test 112.00 117.00 4.5% test-suite...T2006/445.gobmk/445.gobmk.test 104.00 108.00 3.8% test-suite...CI_Purple/SMG2000/smg2000.test 789.00 819.00 3.8% test-suite...yApps-C++/PENNANT/PENNANT.test 233.00 241.00 3.4% test-suite...marks/7zip/7zip-benchmark.test 417.00 428.00 2.6% test-suite...arks/mafft/pairlocalalign.test 627.00 643.00 2.6% test-suite.../Benchmarks/nbench/nbench.test 259.00 265.00 2.3% test-suite...006/447.dealII/447.dealII.test 4641.00 4732.00 2.0% test-suite...lications/ClamAV/clamscan.test 106.00 108.00 1.9% test-suite...CFP2000/177.mesa/177.mesa.test 1639.00 1664.00 1.5% test-suite...oxyApps-C/RSBench/rsbench.test 66.00 65.00 -1.5% test-suite.../CINT2000/252.eon/252.eon.test 3416.00 3444.00 0.8% test-suite...CFP2000/188.ammp/188.ammp.test 1846.00 1861.00 0.8% test-suite.../CINT2000/176.gcc/176.gcc.test 152.00 153.00 0.7% test-suite...CFP2006/444.namd/444.namd.test 3528.00 3544.00 0.5% test-suite...T2006/473.astar/473.astar.test 98.00 98.00 0.0% test-suite...frame_layout/frame_layout.test NaN 39.00 nan% On ARM64, there appears to be a slight regression on SPEC2006, which might be interesting to investigate: test-suite...T2006/473.astar/473.astar.test 0.9% Reviewed By: ABataev Differential Revision: https://reviews.llvm.org/D88735	2020-11-06 12:50:32 +00:00
Simon Pilgrim	fbc5698619	[InstCombine] Regenerate narrow-math.ll tests	2020-11-06 11:35:54 +00:00

1 2 3 4 5 ...

206428 Commits