llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 18:54:02 +01:00

Author	SHA1	Message	Date
Benjamin Kramer	1b5743af8d	Move DFA tables into the read-only data segmant.	2020-02-18 14:36:56 +01:00
Miloš Stojanović	ed803f6dcb	[llvm-exegesis] Improve error reporting in Assembler.cpp Followup to D74085. Replace the use of `report_fatal_error()` with returning the error to `llvm-exegesis.cpp` and handling it there. Differential Revision: https://reviews.llvm.org/D74325	2020-02-18 14:30:56 +01:00
Brian Gesiak	1c9c59e475	[IR] Set name when inserting 'llvm::Value' Summary: I noticed a small regression in a toy project of mine after applying D73835, in which instruction names weren't being set properly. In the example test case included with this patch, `llvm::IRBuilderBase::CreateAdd` returns an `llvm::Value ` that is then passed as an argument to `llvm::IRBuilderBase::Insert`. The overloaded function that is selected for that call then ignores the `Name` parameter that is given. This patch addresses that issue. Reviewers: nikic, Meinersbur, nhaehnle, fhahn, thakis, teemperor Reviewed By: nikic, fhahn Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74754	2020-02-18 08:22:03 -05:00
James Clarke	38b58c28d1	Use SETNE directly rather than SUB/SETNE 0 for stack guard check Summary: Backends should fold the subtraction into the comparison, but not all seem to. Moreover, on targets where pointers are not integers, such as CHERI, an integer subtraction is not appropriate. Instead we should just compare the two pointers directly, as this should work everywhere and potentially generate more efficient code. Reviewers: bogner, lebedev.ri, efriedma, t.p.northover, uweigand, sunfish Reviewed By: lebedev.ri Subscribers: dschuff, sbc100, arichardson, jgravelle-google, hiraditya, aheejin, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74454	2020-02-18 13:21:26 +00:00
Cristian Adam	7d153b6b68	llvm: Use quotes around MSVC_DIA_SDK_DIR CMake variable MSVC_DIA_SDK_DIR variable will point to a path which contains spaces, and without quotes it will fail to configure the project.	2020-02-18 14:42:19 +02:00
Florian Hahn	296c8157b6	[CGP] Add uaddo test with math used, SPARC/AArch64 variants.	2020-02-18 12:49:08 +01:00
Georgii Rymar	eef8b0bedf	[llvm-readobj] - Report a warning when an unexpected DT_SYMENT tag value is met. There was a short discussion about this: https://reviews.llvm.org/D73484#inline-676942 To summarize: It is a bit unclear to me why the `DT_SYMENT` tag exist. LLD has the code that does: "addInt(DT_SYMENT, sizeof(Elf_Sym));" and I guess other linkers has the same logic. It is unclear why it can be possible to have other values rather than values of a size of platform symbol. Seems it is not possible, and atm for me it looks that this tag should not be used. This patch starts reporting the warning when the value it contains differs from a symbol size for a 32/64 bit platform for safety. It keeps the rest of the logic we have unchanged. Before this patch we did not handle the tag at all. Differential review: https://reviews.llvm.org/D74479	2020-02-18 14:36:17 +03:00
Djordje Todorovic	2799d4faca	[CSInfo][TailDuplicator] Delete the call site info when removing dead MBBs This is needed for the debug entry values feature. Differential Revision: https://reviews.llvm.org/D74702	2020-02-18 12:29:51 +01:00
Kerry McLaughlin	e88a817d42	[AArch64][SVE] Add remaining SVE2 intrinsics for widening DSP operations Summary: Implements the following intrinsics: - llvm.aarch64.sve.[s\|u]mullb_lane - llvm.aarch64.sve.[s\|u]mullt_lane - llvm.aarch64.sve.sqdmullb_lane - llvm.aarch64.sve.sqdmullt_lane - llvm.aarch64.sve.[s\|u]addwb - llvm.aarch64.sve.[s\|u]addwt - llvm.aarch64.sve.[s\|u]shllb - llvm.aarch64.sve.[s\|u]shllt - llvm.aarch64.sve.[s\|u]subwb - llvm.aarch64.sve.[s\|u]subwt Reviewers: sdesmalen, dancgr, efriedma, c-rhodes, rengolin Reviewed By: sdesmalen Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cameron.mcinally, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73903	2020-02-18 10:28:00 +00:00
Mikhail Maltsev	5ee57aca2e	[ARM,CDE] Cosmetic changes, additonal driver tests Summary: This is a follow-up patch addressing post-commit comments in https://reviews.llvm.org/D74044: * Add more Clang driver tests (-march=armv8.1m.main and -march=armv8.1m.main+mve.fp) * Clang-format a chunk in ARMAsmParser.cpp * Add a missing copyright header to ARMInstrCDE.td Reviewers: SjoerdMeijer, simon_tatham, dmgreen Reviewed By: SjoerdMeijer Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74732	2020-02-18 10:23:09 +00:00
Simon Tatham	d407baeecc	[ARM,MVE] Add the vmovnbq,vmovntq intrinsic family. Summary: These are in some sense the inverse of vmovl[bt]q: they take a vector of n wide elements and truncate each to half its width. So they only write half a vector's worth of output data, and therefore they also take an 'inactive' parameter to provide the other half of the data in the output vector. So vmovnb overwrites the even lanes of 'inactive' with the narrowed values from the main input, and vmovnt overwrites the odd lanes. LLVM had existing codegen which generates these MVE instructions in response to IR that takes two vectors of wide elements, or two vectors of narrow ones. But in this case, we have one vector of each. So my clang codegen strategy is to narrow the input vector of wide elements by simply reinterpreting it as the output type, and then we have two narrow vectors and can represent the operation as a vector shuffle that interleaves lanes from both of them. Even so, not all the cases I needed ended up being selected as a single MVE instruction, so I've added a couple more patterns that spot combinations of the 'MVEvmovn' and 'ARMvrev32' SDNodes which can be generated as a VMOVN instruction with operands swapped. This commit adds the unpredicated forms only. Reviewers: dmgreen, miyuki, MarkMurrayARM, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74337	2020-02-18 09:34:50 +00:00
Simon Tatham	2eda64cc3d	[ARM,MVE] Add the vmovlbq,vmovltq intrinsic family. Summary: These intrinsics take a vector of 2n elements, and return a vector of n wider elements obtained by sign- or zero-extending every other element of the input vector. They're represented in IR as a shufflevector that extracts the odd or even elements of the input, followed by a sext or zext. Existing LLVM codegen already matches this pattern and generates the VMOVLB instruction (which widens the even-index input lanes). But no existing isel rule was generating VMOVLT, so I've added some. However, the new rules currently only work in little-endian MVE, because the pattern they expect from isel lowering includes a bitconvert which doesn't have the right semantics in big-endian. The output of one existing codegen test is improved by those new rules. This commit adds the unpredicated forms only. Reviewers: dmgreen, miyuki, MarkMurrayARM, ostannard Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74336	2020-02-18 09:34:50 +00:00
Simon Tatham	c18b60acd4	[ARM] Allow `ARMVectorRegCast` to match bitconverts too. (NFC) Summary: When we start putting instances of `ARMVectorRegCast` in complex isel patterns, it will be awkward that they're often turned into the more standard `bitconvert` in little-endian mode. We'd rather not have to write separate isel patterns for the two endiannesses, matching different but equivalent cast operations. This change aims to fix that awkwardness in advance, by turning the Tablegen record `ARMVectorRegCast` from a simple `SDNode` instance into a `PatFrags` that can match either kind of cast – with a predicate that prevents it matching a bitconvert in the big-endian case, where bitconvert isn't semantically identical. No existing code generation should be affected by this change, but it will enable the patterns introduced by D74336 to work in both endiannesses. Reviewers: dmgreen Reviewed By: dmgreen Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74716	2020-02-18 09:34:50 +00:00
Simon Tatham	a5cb7f1640	[ARM,MVE] Add intrinsics vclzq and vclsq. Summary: vclzq maps nicely to the existing target-independent @llvm.ctlz IR intrinsic. But vclsq ('count leading sign bits') has no corresponding target-independent intrinsic, so I've made up @llvm.arm.mve.vcls. This commit adds the unpredicated forms only. Reviewers: dmgreen, miyuki, MarkMurrayARM, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74335	2020-02-18 09:34:50 +00:00
Simon Tatham	2d913ae276	[ARM,MVE] Add intrinsics for FP rounding operations. Summary: This adds the unpredicated forms of six different MVE intrinsics which all round a vector of floating-point numbers to integer values, leaving them still in FP format, differing only in rounding mode and exception settings. Five of them map to existing target-independent intrinsics in LLVM IR, such as @llvm.trunc and @llvm.rint. The sixth, mapping to the `vrintn` instruction, is done by inventing a target-specific intrinsic. (`vrintn` behaves the same as `vrintx` in terms of the output value: the side effects on the FPSCR flags are the only difference between the two. But ACLE specifies separate user-callable intrinsics for the two, so the side effects matter enough to make sure we generate the right one of the two instructions in each case.) Reviewers: dmgreen, miyuki, MarkMurrayARM, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D74333	2020-02-18 09:34:50 +00:00
Florian Hahn	4d0e92ae7b	[InstCombin] Avoid nested Create calls, to guarantee order. The original code allowed creating the != checks in unpredictable order, causing http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/34014 to fail.	2020-02-18 09:44:11 +01:00
Florian Hahn	9599c6b985	[InstCombine] Simplify a umul overflow check to a != 0 && b != 0. This patch adds a simplification if an OR weakens the overflow condition for umul.with.overflow by treating any non-zero result as overflow. In that case, we overflow if both umul.with.overflow operands are != 0, as in that case the result can only be 0, iff the multiplication overflows. Code like this is generated by code using __builtin_mul_overflow with negative integer constants, e.g. bool test(unsigned long long v, unsigned long long *res) { return __builtin_mul_overflow(v, -4775807LL, res); } ``` ---------------------------------------- Name: D74141 %res = umul_overflow {i8, i1} %a, %b %mul = extractvalue {i8, i1} %res, 0 %overflow = extractvalue {i8, i1} %res, 1 %cmp = icmp ne %mul, 0 %ret = or i1 %overflow, %cmp ret i1 %ret => %t0 = icmp ne i8 %a, 0 %t1 = icmp ne i8 %b, 0 %ret = and i1 %t0, %t1 ret i1 %ret %res = umul_overflow {i8, i1} %a, %b %mul = extractvalue {i8, i1} %res, 0 %cmp = icmp ne %mul, 0 %overflow = extractvalue {i8, i1} %res, 1 Done: 1 Optimization is correct! ``` Reviewers: nikic, lebedev.ri, spatel, Bigcheese, dexonsmith, aemerson Reviewed By: lebedev.ri Differential Revision: https://reviews.llvm.org/D74141	2020-02-18 09:11:55 +01:00
Gokturk Yuksek	1826a5d26b	[Support] Check for atomics64 when deciding if '-latomic' is needed The CheckAtomic module performs two tests to determine if passing '-latomic' to the linker is required: one for 64-bit atomics, and another for non-64-bit atomics. Include the missing check for 64-bit atomics. Reviewers: beanz, compnerd Reviewed By: beanz, compnerd Tags: #llvm Differential Revision: https://reviews.llvm.org/D69444	2020-02-18 07:54:54 +00:00
Florian Hahn	72aa083958	[InstCombine] Precommit umul.with.overflow sign check test. Precommit tests for D74141.	2020-02-18 08:46:50 +01:00
Alexey Lapshin	fbee76b150	[Debuginfo][NFC] add comments for WithColor routines. Summary: This patch is follow-up for D74481. It adds comments to WithColor::defaultErrorHandler() and WithColor::defaultWarningHandler(). Reviewers: jhenderson, dblaikie, JDevlieghere Reviewed By: JDevlieghere Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74742	2020-02-18 10:36:10 +03:00
Craig Topper	982740ed98	[X86] Move avx512 code that forces zeros to the false side of vselects above a check for legal types. This helps this transform occur earlier so we can fold the not with setcc. If we delay it until after type legalization we might have introduced instructions to widen the mask if the vselect was widened. This can prevent the not from making it to the setcc. We could of course add more DAG combines to handle that, but moving this earlier is easier.	2020-02-17 22:24:21 -08:00
Brian Gesiak	c1b2f58c10	Revert new files from new pass manager coro-split/coro-elide This reverts https://reviews.llvm.org/rG7125d66f9969605d886b5286780101a45b5bed67 and https://reviews.llvm.org/rG00fec8004aca6588d8d695a2c3827c3754c380a0 due to buildbot failures: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/34004 Previous revert 11053a1cc61afaabf2df2b8345d8d392c88cd508 missed newly added files, this commit removes those as well.	2020-02-18 00:34:01 -05:00
Brian Gesiak	a50ed36c0c	Revert new pass manager coro-split and coro-elide This reverts https://reviews.llvm.org/rG7125d66f9969605d886b5286780101a45b5bed67 and https://reviews.llvm.org/rG00fec8004aca6588d8d695a2c3827c3754c380a0 due to buildbot failures: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-sde-avx512-linux/builds/34004	2020-02-17 23:55:10 -05:00
Brian Gesiak	6bab7ef5db	[Coroutines][3/6] New pass manager: coro-elide Summary: Depends on https://reviews.llvm.org/D71899. The third in a series of patches that ports the LLVM coroutines passes to the new pass manager infrastructure. This patch implements 'coro-elide'. The new pass manager infrastructure does not implicitly repeat CGSCC pass pipelines when a function is devirtualized, and so the tests for the new pass manager that rely on that behavior now explicitly specify `repeat<2>`. Reviewers: GorNishanov, lewissbaker, chandlerc, jdoerfert, junparser, deadalnix, wenlei Reviewed By: wenlei Subscribers: wenlei, EricWF, Prazek, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71900	2020-02-17 23:41:57 -05:00
Brian Gesiak	7c90f67d48	[Coroutines][2/6] New pass manager: coro-split Summary: This patch has four dependencies: 1. The first in this series of patches that implement coroutine passes in the new pass manager: https://reviews.llvm.org/D71898. 2. A patch that introduces an API for CGSCC passes to add new reference edges to a `LazyCallGraph`, `updateCGAndAnalysisManagerForCGSCCPass`: https://reviews.llvm.org/D72025. 3. A patch that introduces a `CallGraphUpdater` helper class that is capable of mutating internal `LazyCallGraph` state in order to insert new function nodes into a specific SCC: https://reviews.llvm.org/D70927. 4. And finally, a small edge case fix for updating `LazyCallGraph` that patch 3 above happens to run into: https://reviews.llvm.org/D72226. This is the second in a series of patches that ports the LLVM coroutines passes to the new pass manager infrastructure. This patch implements 'coro-split'. Some notes: * Using the new CGSCC pass manager resulted in IR being printed in the reverse order in some tests. To prevent FileCheck checks from failing due to these reversed orders, this patch splits up test files that test multiple different coroutine functions: specifically coro-alloc-with-param.ll, coro-split-eh.ll, and coro-eh-aware-edge-split.ll. * CoroSplit.cpp contained 2 overloads of `splitCoroutine`, one of which dispatched to the other based on the coroutine ABI being used (C++20 switch-based versus Swift returned-continuation-based). I found this confusing, especially with the additional branching based on `CallGraph` vs. `LazyCallGraph`, so I removed the ABI-checking overload of `splitCoroutine`. Reviewers: GorNishanov, lewissbaker, chandlerc, jdoerfert, junparser, deadalnix, wenlei Reviewed By: wenlei Subscribers: wenlei, qcolombet, EricWF, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71899	2020-02-17 23:35:27 -05:00
Craig Topper	1b63b53dd5	[X86] Use isScalarFPTypeInSSEReg to simplify code in LowerSELECT. NFC	2020-02-17 19:43:57 -08:00
Jim Lin	0596dad096	[NFC] Remove trailing space sed -Ei 's/[[:space:]]+$//' include/*/.{def,h,td} lib/*/.{cpp,h,td}	2020-02-18 10:49:13 +08:00
Jim Lin	dfbf17f1a2	[XCore][NFC] Remove trailing space	2020-02-18 10:32:58 +08:00
Craig Topper	4feb22500d	[X86] Add one use check to '0-x == y --> x+y == 0' in EmitCmp. I failed to copy it when I moved this in b62de210cf50ccb6822260e4075dd93333adb23e	2020-02-17 18:16:42 -08:00
Vedant Kumar	c09998b436	[HotColdSplit] Mark entire function cold when entry block is cold rdar://58855712	2020-02-17 15:57:50 -08:00
Stanislav Mekhanoshin	e58b532e8c	[TBLGEN] Inhibit generation of unneeded psets Differential Revision: https://reviews.llvm.org/D74744	2020-02-17 15:38:08 -08:00
Nicolai Hähnle	6d960a3b2c	LowerMatrixIntrinsics: Avoid use of deprecated CreateCall methods Reviewers: t.p.northover Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74675	2020-02-18 00:24:09 +01:00
Tim Northover	6b64c16d13	Coroutines: avoid use of deprecated CreateLoad and CreateCall methods Summary: Patch originally by Tim Northover Reviewers: t.p.northover Subscribers: EricWF, hiraditya, modocache, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D74674	2020-02-18 00:24:09 +01:00
Gokturk Yuksek	715ddfd799	[dsymutil] Explicitly link against libatomic when necessary In some systems, such as RISC-V, atomic support requires explicit linking against '-latomic' (see https://github.com/riscv/riscv-gcc/issues/12). Reviewers: davezarzycki, hhb, beanz, jfb, JDevlieghere Reviewed By: beanz, JDevlieghere Tags: #llvm Differential Revision: https://reviews.llvm.org/D69003	2020-02-17 22:28:18 +00:00
Vedant Kumar	e5608f4ac0	[LiveDebugValues] Visit open var locs just once in transferRegisterDef, NFC For a file in WebKit, this brings the time spent in LiveDebugValues down from 16 minutes to 2 minutes. The reduction comes from iterating the set of open variable locations just once in transferRegisterDef. Post-patch, the most expensive item inside of transferRegisterDef is a call to VarLoc::isDescribedByReg, which we have to do. Testing: I built LNT using the Os-g cmake cache with & without this patch, then diffed the object files to verify there was no binary diff. rdar://59446577 Differential Revision: https://reviews.llvm.org/D74633	2020-02-17 14:04:22 -08:00
Brian Gesiak	a9f4cb4ac7	Re-land "Add LazyCallGraph API to add function to RefSCC" This re-commits https://reviews.llvm.org/D70927, which I reverted in https://reviews.llvm.org/rG28213680b2a7d1fdeea16aa3f3a368879472c72a due to a buildbot error: http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux/builds/13251 I no longer include a test case that appears to crash when built with the buildbot's compiler, GCC 5.4.0.	2020-02-17 16:59:25 -05:00
Craig Topper	71b2ecb78b	[X86] Add missing isel pattern for BLCFILL producing flags.	2020-02-17 13:20:13 -08:00
Matt Arsenault	c13dcaa52f	AMDGPU/GlobalISel: Fix RegBankSelect for G_SHUFFLE_VECTOR	2020-02-17 15:11:25 -05:00
Matt Arsenault	4547afd953	AMDGPU/GlobalISel: Custom lower 32-bit G_SDIV/G_SREM	2020-02-17 15:09:51 -05:00
Nico Weber	179e6ab402	[gn build] (manually) merge e9849d519	2020-02-17 14:37:43 -05:00
Matt Arsenault	7ca27c32cc	AMDGPU/GlobalISel: Allow arbitrary global values Treat unknown address spaces as global	2020-02-17 11:32:28 -08:00
Craig Topper	00de94bc04	[X86] Change how the alignment for the stack object is created in LowerFLT_ROUNDS_. We don't need FrameInfo's concept of the stack alignment. We just need to tell it the desired alignment. Which in this case is 2.	2020-02-17 11:27:34 -08:00
Craig Topper	7a044e58ab	[X86] Move '0-x == y --> x+y == 0' and similar combines to EmitCmp. AArch64 handles this pattern in their lowering code. By emitting CMN. ARM handles it as an isel pattern.	2020-02-17 11:27:34 -08:00
Brian Gesiak	175b662e49	Revert "Add LazyCallGraph API to add function to RefSCC" This reverts commit https://reviews.llvm.org/rG449a13509190b1c57e5fcf5cd7e8f0f647f564b4, due to buildbot failures such as http://lab.llvm.org:8011/builders/clang-cmake-x86_64-avx2-linux/builds/13251.	2020-02-17 14:25:10 -05:00
Matt Arsenault	936ccc1714	GlobalISel: Allow running localizer earlier This required legal and regbankselected MIR for seemingly no reason. For AMDGPU this wouldn't see legalized G_GLOBAL_VALUEs.	2020-02-17 11:24:06 -08:00
Vedant Kumar	cc8dec63fe	Fix modules build after https://reviews.llvm.org/D73835 (IRBuilder virtualization change) I readily admit that I don't know why this fixes the modules build, but it seems to get things building again. Previously I saw the error message: http://lab.llvm.org:8080/green/view/LLDB/job/lldb-cmake/9404/consoleFull#-361314398a1ca8a51-895e-46c6-af87-ce24fa4cd561 ``` /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/include/llvm/IR/IRBuilderFolder.h:18:10: fatal error: cyclic dependency in module 'LLVM_intrinsic_gen': LLVM_intrinsic_gen -> LLVM_IR -> LLVM_intrinsic_gen ^ While building module 'LLVM_intrinsic_gen' imported from /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/lib/IR/IRBuilder.cpp:14: In file included from <module-includes>:1: /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/include/llvm/IR/Argument.h:19:10: fatal error: could not build module 'LLVM_IR' ~~~~~~~~^~~~~~~~~~~~~~~~~ /Users/buildslave/jenkins/workspace/lldb-cmake/llvm-project/llvm/lib/IR/IRBuilder.cpp:14:10: fatal error: could not build module 'LLVM_intrinsic_gen' ``` And reproduced with: cmake -G Ninja /Users/vsk/src/llvm-backup-master/llvm -DCLANG_ENABLE_ARCMT=Off -DCLANG_ENABLE_STATIC_ANALYZER=Off -DLLVM_ENABLE_PROJECTS='clang;clang-tools-extra;lld;libcxx;libcxxabi;compiler-rt;libunwind;lldb' -DLLDB_USE_SYSTEM_DEBUGSERVER=On -DCMAKE_BUILD_TYPE=RelWithDebInfo -DLLVM_ENABLE_ASSERTIONS=On -DLLVM_ENABLE_MODULES=On	2020-02-17 11:22:44 -08:00
Matt Arsenault	4b811ff523	AMDGPU/GlobalISel: Custom lower 32-bit G_UDIV/G_UREM AMDGPUCodeGenPrepare expands this most of the time, but not always. We will always at least need a fallback option here. This is the 3rd implementation of the same expansion in the backend. Eventually I would like to eliminate the IR expansion (and the DAG version obviously). Currently the new legalizer path produces a better result, since the IR expansion results in extra operations which need to be combined out. Notably, the IR expansion results in multiplies by 0.	2020-02-17 11:05:50 -08:00
Gokturk Yuksek	c05d814fb9	[CMake] CheckAtomic.cmake: catch false positives in RISC-V The check for 'HAVE_CXX_ATOMICS_WITHOUT_LIB' may create false positives in RISC-V. This is reproducible when compiling LLVM natively using GCC on a rv64gc (rv64imafdgc) host. Due to the 'A' (atomic) extension, g++ replaces calls to libatomic operations on the std::atomic<int> type with the native hardware instructions. As a result, the compilation succeeds and the build system thinks it doesn't need to pass '-latomic'. Improve the reliability of the 'HAVE_CXX_ATOMICS_WITHOUT_LIB' test in two steps: 1. Force a pre-increment on x (++x), which should force a call to a libatomic function; 2. Because step 1 would resolve the increment to 'amoadd.w.aq' under the 'A' extension, force the same operation on sub-word types, for which there is no hardware support. Reviewers: jfb, hintonda, smeenai, mgorny, JDevlieghere, jyknight Reviewed By: jfb Tags: #llvm Differential Revision: https://reviews.llvm.org/D68964	2020-02-17 18:53:41 +00:00
Matt Arsenault	54c8963c23	GlobalISel: Extend narrowing to G_ASHR	2020-02-17 10:42:59 -08:00
Brian Gesiak	3a80e0bb03	[Coroutines][1/6] New pass manager: coro-early Summary: The first in a series of patches that ports the LLVM coroutines passes to the new pass manager infrastructure. This patch implements 'coro-early'. NB: All coroutines passes begin by checking that coroutine intrinsics are declared within the LLVM IR module they're operating on. To do so, they call `coro::declaresIntrinsics`. The next 3 patches in this series, which add new pass manager implementations of the 'coro-split', 'coro-elide', and 'coro-cleanup' passes, use a similar pattern as the one used here: a static function is shared across both old and new passes to detect if relevant coroutine intrinsics are delcared. To make this pattern easier to read, this patch adds `const` keywords to the parameters of `coro::declaresIntrinsics`. Reviewers: GorNishanov, lewissbaker, junparser, chandlerc, deadalnix, wenlei Reviewed By: wenlei Subscribers: ychen, wenlei, EricWF, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71898	2020-02-17 13:27:48 -05:00

... 5 6 7 8 9 ...

192396 Commits