llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Amara Emerson	59c2440372	[llvm][mlir] Promote the experimental reduction intrinsics to be first class intrinsics. This change renames the intrinsics to not have "experimental" in the name. The autoupgrader will handle legacy intrinsics. Relevant ML thread: http://lists.llvm.org/pipermail/llvm-dev/2020-April/140729.html Differential Revision: https://reviews.llvm.org/D88787	2020-10-07 10:36:44 -07:00
Fanbo Meng	aeea490c2a	[NFC] Add contributors names to CREDITS.TXT	2020-10-07 13:22:55 -04:00
Heejin Ahn	0ab2923b30	[WebAssembly] Rename Emscripten EH functions Renaming for some Emscripten EH functions has so far been done in wasm-emscripten-finalize tool in Binaryen. But recently we decided to make a compilation/linking path that does not rely on wasm-emscripten-finalize for modifications, so here we move that functionality to LLVM. Invoke wrappers are generated in LowerEmscriptenEHSjLj pass, but final wasm types are not available in the IR pass, we need to rename them at the end of the pipeline. This patch also removes uses of `emscripten_longjmp_jmpbuf` in LowerEmscriptenEHSjLj pass, replacing that with `emscripten_longjmp`. `emscripten_longjmp_jmpbuf` is lowered to `emscripten_longjmp`, but previously we generated calls to `emscripten_longjmp_jmpbuf` in LowerEmscriptenEHSjLj pass because it takes `jmp_buf*` instead of `i32`. But we were able use `ptrtoint` to make it use `emscripten_longjmp` directly here. Addresses: https://github.com/WebAssembly/binaryen/issues/3043 https://github.com/WebAssembly/binaryen/issues/3081 Companions: https://github.com/WebAssembly/binaryen/pull/3191 https://github.com/emscripten-core/emscripten/pull/12399 Reviewed By: dschuff, tlively, sbc100 Differential Revision: https://reviews.llvm.org/D88697	2020-10-07 09:42:49 -07:00
Daniel Sanders	4940183748	[json] Provide a means to delegate writing a value to another API (Based on D87170 by dsanders) I recently had need to call out to an external API to emit a JSON object as part of one an LLVM tool was emitting. However, our JSON support didn't provide a way to delegate part of the JSON output to that API. Add rawValueBegin() and rawValueEnd() to maintain and check the internal state while something else is writing to the stream. It's the users responsibility to ensure that the resulting JSON output is still valid. Differential Revision: https://reviews.llvm.org/D88902	2020-10-07 18:31:45 +02:00
Sam McCall	a23b9aed3d	Reapply [ADT] function_ref's constructor is unavailable if the argument is not callable. This reverts commit 281703e67ffaee8e26efef86e0df3e145477f4cb. GCC 5.4 bugs are worked around by avoiding use of variable templates. Differential Revision: https://reviews.llvm.org/D88977	2020-10-07 18:31:12 +02:00
Nikita Popov	50cd460a1b	[MemCpyOpt] Add additional callslot test cases (NFC) For cases where the destination is captured.	2020-10-07 18:06:29 +02:00
Roman Lebedev	716be9de3a	[NFC][InstCombine] Autogenerate a few tests being affected by upcoming patch	2020-10-07 19:00:08 +03:00
Philip Reames	4f50d7bd91	[Tests] Precommit test showing gap around load forwarding of vectors in instcombine	2020-10-07 08:57:24 -07:00
LLVM GN Syncbot	c887e89634	[gn build] Port ddf1864ace4	2020-10-07 15:50:43 +00:00
Yonghong Song	a703509947	BPF: add AdjustOpt IR pass to generate verifier friendly codes Add an IR phase right before main module optimization. This is to modify IR to restrict certain downward optimizations in order to generate verifier friendly code. > prevent certain instcombine optimizations, handling both in-block/cross-block instcombines. > avoid speculative code motion if the variable used in condition is also used in the later blocks. Internally, a bpf IR builtin result = __builtin_bpf_passthrough(seq_num, result) is used to enforce ordering. This builtin is only used during target independent IR optimizations and it will be removed at the beginning of target dependent IR optimizations. For example, removing the following workaround, --- a/tools/testing/selftests/bpf/progs/test_sysctl_loop1.c +++ b/tools/testing/selftests/bpf/progs/test_sysctl_loop1.c @@ -47,7 +47,7 @@ int sysctl_tcp_mem(struct bpf_sysctl ctx) / a workaround to prevent compiler from generating * codes verifier cannot handle yet. */ - volatile int ret; + int ret; this patch is able to generate code which passed the verifier. To disable optimization, users need to use "opt" command like below: clang -target bpf -O2 -S -emit-llvm -Xclang -disable-llvm-passes test.c // disable icmp serialization opt -O2 -bpf-disable-serialize-icmp test.ll \| llvm-dis > t.ll // disable avoid-speculation opt -O2 -bpf-disable-avoid-speculation test.ll \| llvm-dis > t.ll llc t.ll Differential Revision: https://reviews.llvm.org/D85570	2020-10-07 08:49:10 -07:00
Ronak Chauhan	e33484a234	[AMDGPU] Support disassembly for AMDGPU kernel descriptors Decode AMDGPU Kernel descriptors as assembler directives. Reviewed By: scott.linder, jhenderson, kzhuravl Differential Revision: https://reviews.llvm.org/D80713	2020-10-07 20:39:43 +05:30
Cameron McInally	ac18071719	[SVE] Lower fixed length VECREDUCE_OR operation Differential Revision: https://reviews.llvm.org/D88847	2020-10-07 09:56:25 -05:00
Jay Foad	7685826746	[AMDGPU] Use @LINE for error checking in gfx10.3 assembler tests	2020-10-07 15:48:01 +01:00
Sam McCall	dc3a1be2c9	Revert "[ADT] function_ref's constructor is unavailable if the argument is not callable." This reverts commit 4cae6228d129d4c4dfb156c043977bb6b5690031. Breaks GCC build: http://lab.llvm.org:8011/#/builders/8/builds/33/steps/6/logs/stdio	2020-10-07 16:37:13 +02:00
Nico Weber	667a8e234b	[gn build] (manually) port ce1365f8f7e	2020-10-07 10:33:51 -04:00
Sam McCall	360a9fdce4	[ADT] function_ref's constructor is unavailable if the argument is not callable. This allows overload sets containing function_ref arguments to work correctly Otherwise they're ambiguous as anything "could be" converted to a function_ref. This matches proposed std::function_ref, absl::function_ref, etc. Differential Revision: https://reviews.llvm.org/D88901	2020-10-07 16:31:09 +02:00
Georgii Rymar	331305b3b2	[obj2yaml] - Rename `Group` to `GroupSection`. NFC. The `Group` class represents a group section and it is named inconsistently with other sections which all has the "Section" suffix. It is sometimes confusing, this patch addresses the issue. Differential revision: https://reviews.llvm.org/D88892	2020-10-07 17:04:15 +03:00
Georgii Rymar	67f382cd89	[llvm-readelf] - Implement --addrsig option. We have `--addrsig` implemented for `llvm-readobj`. Usually it is convenient to use a single tool for dumping, so it seems we might want to implement `--addrsig` for `llvm-readelf` too. I've selected a simple output format which is a bit similar to one, used for dumping of the symbol table. It looks like: ``` Address-significant symbols section '.llvm_addrsig' contains 2 entries: Num: Name 1: foo 2: bar ``` Differential revision: https://reviews.llvm.org/D88835	2020-10-07 16:45:30 +03:00
Dmitry Preobrazhensky	dec39256b3	[AMDGPU][MC] Improved diagnostics for instructions with missing features Reviewers: rampitec Differential Revision: https://reviews.llvm.org/D88887	2020-10-07 16:31:29 +03:00
Roman Lebedev	6ff5a01f80	InstCombine: Negator: don't rely on complexity sorting already being performed (PR47752) In some cases, we can negate instruction if only one of it's operands negates. Previously, we assumed that constants would have been canonicalized to RHS already, but that isn't guaranteed to happen, because of InstCombine worklist visitation order, as the added test (previously-hanging) shows. So if we only need to negate a single operand, we should ensure ourselves that we try constant operand first. Do that by re-doing the complexity sorting ourselves, when we actually care about it. Fixes https://bugs.llvm.org/show_bug.cgi?id=47752	2020-10-07 15:09:50 +03:00
Rodrigo Dominguez	83d858a534	[AMDGPU] Implement hardware bug workaround for image instructions Summary: This implements a workaround for a hardware bug in gfx8 and gfx9, where register usage is not estimated correctly for image_store and image_gather4 instructions when D16 is used. Change-Id: I4e30744da6796acac53a9b5ad37ac1c2035c8899 Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D81172	2020-10-07 07:39:52 -04:00
Simon Pilgrim	4e1a82c240	[InstCombine] Tweak funnel by constant tests for better shl/lshr commutation coverage	2020-10-07 11:47:03 +01:00
Simon Pilgrim	ee4778c45b	[ARM] Regenerate vldlane tests To help make the diffs in D88569 clearer	2020-10-07 11:47:03 +01:00
Florian Hahn	424d2206a7	[LAA] Add test for PR47751, which currently uses wrong bounds.	2020-10-07 11:22:22 +01:00
Jay Foad	c63e985f6f	[SDag] SimplifyDemandedBits: simplify to FP constant if all bits known We were already doing this for integer constants. This patch implements the same thing for floating point constants. Differential Revision: https://reviews.llvm.org/D88570	2020-10-07 09:24:38 +01:00
Max Kazantsev	fe21571576	[Test] Add one more test where we can avoid creating trunc	2020-10-07 15:06:38 +07:00
Rainer Orth	5116ffc6e0	[Support][unittests] Enforce alignment in ConvertUTFTest `LLVM-Unit :: Support/./SupportTests/ConvertUTFTest.ConvertUTF16LittleEndianToUTF8String` `FAIL`s on Solaris/sparcv9: In `llvm/lib/Support/ConvertUTFWrapper.cpp` (`convertUTF16ToUTF8String`) the `SrcBytes` arg is reinterpreted/accessed as `UTF16` (`unsigned short`, which requires 2-byte alignment on strict-alignment targets like Sparc) without anything guaranteeing the alignment, so the access yields a `SIGBUS`. This patch avoids this by enforcing the required alignment in the callers. Tested on `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D88824	2020-10-07 09:08:41 +02:00
Max Kazantsev	5a5039d1f8	[NFC] Use getZero instead of getConstant(0)	2020-10-07 13:53:36 +07:00
Roman Lebedev	034e32765e	[SROA] rewritePartition()/findCommonType(): if uses have conflicting type, try getTypePartition() before falling back to largest integral use type (PR47592) And another step towards transformss not introducing inttoptr and/or ptrtoint casts that weren't there already. In this case, when load/store uses have conflicting types, instead of falling back to the iN, we can try to use allocated sub-type. As disscussed, this isn't the best idea overall (we shouldn't rely on allocated type), but it works fine as a temporary measure. I've measured, and @ `-O3` as of vanilla llvm test-suite + RawSpeed, this results in +0.05% more bitcasts, -5.51% less inttoptr and -1.05% less ptrtoint (at the end of middle-end opt pipeline) See https://bugs.llvm.org/show_bug.cgi?id=47592 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D88788	2020-10-07 09:20:19 +03:00
Yonghong Song	a6506984a7	BPF: avoid duplicated globals for CORE relocations This patch fixed two issues related with relocation globals. In LLVM, if a global, e.g. with name "g", is created and conflict with another global with the same name, LLVM will rename the global, e.g., with a new name "g.2". Since relocation global name has special meaning, we do not want llvm to change it, so internally we have logic to check whether duplication happens or not. If happens, just reuse the previous global. The first bug is related to non-btf-id relocation (BPFAbstractMemberAccess.cpp). Commit 54d9f743c8b0 ("BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible") changed ModulePass to FunctionPass, i.e., handling each function at a time. But still just one BPFAbstractMemberAccess object is created so module level de-duplication still possible. Commit 40251fee0084 ("[BPF][NewPM] Make BPFTargetMachine properly adjust NPM optimizer pipeline") made a change to create a BPFAbstractMemberAccess object per function so module level de-duplication is not possible any more without going through all module globals. This patch simply changed the map which holds reloc globals as class static, so it will be available to all BPFAbstractMemberAccess objects for different functions. The second bug is related to btf-id relocation (BPFPreserveDIType.cpp). Before Commit 54d9f743c8b0, the pass is a ModulePass, so we have a local variable, incremented for each instance, and works fine. But after Commit 54d9f743c8b0, the pass becomes a FunctionPass. Local variable won't work properly since different functions will start with the same initial value. Fix the issue by change the local count variable as static, so it will be truely unique across the whole module compilation. Differential Revision: https://reviews.llvm.org/D88942	2020-10-06 22:37:49 -07:00
Max Kazantsev	725c655687	[Test] Add test showing that we can avoid inserting trunc/zext	2020-10-07 12:19:01 +07:00
Chen Zheng	937f27d268	[MachineInstr] exclude call instruction in mayAlias we now get noAlias result for a call instruction and other load/store/call instructions if we query mayAlias. This is not right as call instruction is not with mayloadorstore, but it may alter the memory. This patch fixes this wrong alias query. Differential Revision: https://reviews.llvm.org/D87490	2020-10-07 00:12:21 -04:00
Chen Zheng	a20254fe88	[PowerPC] implement target hook getTgtMemIntrinsic This patch can make pass recognize Powerpc related memory intrinsics. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D88373	2020-10-07 00:02:44 -04:00
Chen Zheng	60fa7c7101	[PowerPC] add more builtins for PPCTargetLowering::getTgtMemIntrinsic Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D88374	2020-10-06 23:48:33 -04:00
Bill Wendling	0be97dba3f	[CodeGen][TailDuplicator] Don't duplicate blocks with INLINEASM_BR Tail duplication of a block with an INLINEASM_BR may result in a PHI node on the indirect branch. This is okay, but it also introduces a copy for that PHI node after the INLINEASM_BR, which is not okay. See: https://github.com/ClangBuiltLinux/linux/issues/1125 Differential Revision: https://reviews.llvm.org/D88823	2020-10-06 18:44:59 -07:00
Valentin Clement	865511fe51	[flang][openacc] Fix device_num and device_type clauses for init directive This patch fix the device_num and device_type clauses used in the init clause. device_num was not spelled correctly in the parser and was to restrictive with scalarIntConstantExpr instead of scalarIntExpr. device_type is now taking a list of ScalarIntExpr. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D88571	2020-10-06 21:27:01 -04:00
Johannes Doerfert	b71edafce2	[Attributor] Use smarter way to determine alignment of GEPs Use same logic existing in other places to deal with base case GEPs. Add the original Attributor talk example.	2020-10-06 19:31:08 -05:00
Johannes Doerfert	66f0aafaf0	[Attributor] Ignore read accesses to constant memory The old function attribute deduction pass ignores reads of constant memory and we need to copy this behavior to replace the pass completely. First step are constant globals. TBAA can also describe constant accesses and there are other possibilities. We might want to consider asking the alias analyses that are available but for now this is simpler and cheaper.	2020-10-06 19:31:07 -05:00
Johannes Doerfert	cfcde5122b	[Attributor] Give up early on AANoReturn::initialize If the function is not assumed `noreturn` we should not wait for an update to mark the call site as "may-return". This has two kinds of consequences: - We have less iterations in many tests. - We have less deductions based on "known information" (since we ask earlier, point 1, and therefore assumed information is not "known" yet). The latter is an artifact that we might want to tackle properly at some point but which is not easily fixable right now.	2020-10-06 19:31:07 -05:00
Nico Weber	2e9cf00e3d	[gn build] manually port 5e4409f308177	2020-10-06 18:43:49 -04:00
Dave Airlie	fd4efdea0f	Fix out-of-tree clang build due to sysexits change The sysexists change broke clang building out of tree against llvm. https://reviews.llvm.org/D88467	2020-10-06 18:21:17 -04:00
Lang Hames	c9187b8c21	[RuntimeDyld][COFF] Report fatal error on error, rather than emiting diagnostic. Report a fatal error if an IMAGE_REL_AMD64_ADDR32NB cannot be applied due to an out-of-range target. Previously we emitted a diagnostic to llvm::errs and continued. Patch by Dale Martin. Thanks Dale!	2020-10-06 15:16:29 -07:00
Duncan P. N. Exon Smith	8fbfc2b8c5	docs: Emphasize ArrayRef over SmallVectorImpl The section on SmallVector has a note about preferring SmallVectorImpl for APIs but doesn't mention ArrayRef. Although ArrayRef is discussed elsewhere, let's re-emphasize here. Differential Revision: https://reviews.llvm.org/D49881	2020-10-06 18:13:52 -04:00
Alexandre Ganea	8eda5d6321	Revert [lit] Support running tests on Windows without GnuWin32 This reverts b3418cb4eb1456c41606f4621dcfa362fe54183c and d12ae042e17b27ebc8d2b5ae3d8dd5f88384d093 This breaks some external bots, see discussion in https://reviews.llvm.org/D84380 In the meanwhile, please use `cmake -DLLVM_LIT_TOOLS_DIR="C:/Program Files/Git/usr/bin"` or add it to %PATH%.	2020-10-06 15:38:18 -04:00
Mircea Trofin	c33f12cf26	[NFC][MC] Type uses of MCRegUnitIterator as MCRegister This is one of many subsequent similar changes. Note that we're ok with the parameter being typed as MCPhysReg, as MCPhysReg -> MCRegister is a correct conversion; Register -> MCRegister assumes the former is indeed physical, so we stop relying on the implicit conversion and use the explicit, value-asserting asMCReg(). Differential Revision: https://reviews.llvm.org/D88862	2020-10-06 12:09:56 -07:00
Scott Linder	709b3d8e67	[AMDGPU] Fix remaining kernel descriptor test Follow up on e4a9e4ef554a to fix a test I missed in the original patch. Committed as obvious.	2020-10-06 18:45:04 +00:00
Scott Linder	265f34d2aa	[AMDGPU] Emit correct kernel descriptor on big-endian hosts Previously we wrote multi-byte values out as-is from host memory. Use the `emitIntN` helpers in `MCStreamer` to produce a valid descriptor irrespective of the host endianness. Reviewed By: arsenm, rochauha Differential Revision: https://reviews.llvm.org/D88858	2020-10-06 17:29:38 +00:00
Stanislav Mekhanoshin	1aaa1eb0d8	[AMDGPU] Create isGFX9Plus utility function Introduce a utility function to make it more convenient to write code that is the same on the GFX9 and GFX10 subtargets. Use isGFX9Plus in the AsmParser for AMDGPU. Authored By: Joe_Nash Differential Revision: https://reviews.llvm.org/D88908	2020-10-06 10:18:43 -07:00
Simon Pilgrim	c076cb5935	[X86][SSE] combineX86ShuffleChain add 'CanonicalizeShuffleInput' helper. NFCI. As part of PR45974, we're getting closer to not creating 'padded' vectors on-the-fly in combineX86ShufflesRecursively, and only pad the source inputs if we have a definite match inside combineX86ShuffleChain. At the moment combineX86ShuffleChain just has to bitcast an input to the correct shuffle type, but eventually we'll need to pad them as well. So, move the bitcast into a 'CanonicalizeShuffleInput helper for now, making the diff for future padding support a lot smaller.	2020-10-06 17:47:24 +01:00
Sebastian Neubauer	2d9c389a77	[AMDGPU] Remove SIInstrInfo::calculateLDSSpillAddress This function does not seem to be used anymore. Differential Revision: https://reviews.llvm.org/D88904	2020-10-06 18:45:22 +02:00

1 2 3 4 5 ...

204765 Commits