llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Roman Lebedev	8b2d3e8b88	[X86] AMD Zen 3: same-reg AVX YMM VXORPS is a zero-cycle(!) dep-breaking zero-idiom As confirmed by exegesis, and ref docs.	2021-05-14 11:56:06 +03:00
Roman Lebedev	942ccdfd9c	[NFC][X86][MCA] AMD Zen 3: add same-reg AVX YMM VXORPS tests	2021-05-14 11:56:06 +03:00
Roman Lebedev	af80d479c4	[X86] AMD Zen 3: same-reg AVX XMM VXORPS is a zero-cycle(!) dep-breaking zero-idiom Unlike it's legacy SSE XMM XORPS version, which measures as being 1-cycle, this one is certainly a zero-cycle instruction, in addition to both of them being dependency breaking. As confirmed by exegesis measurements, and ref docs.	2021-05-14 11:56:06 +03:00
Roman Lebedev	6d05f6c67e	[NFC][X86][MCA] AMD Zen 3: add same-reg AVX XMM VXORPS tests	2021-05-14 11:56:06 +03:00
Pooja Yadav	5296db8df0	[docs] Added llvm/cmake section Added information about the cmake inside llvm. Reviewed By: xgupta, jroelofs Differential Revision: https://reviews.llvm.org/D101925	2021-05-14 14:10:56 +05:30
David Stuttard	5ca0f8e582	[AMDGPU] Fix codegen of image intrinsics for g16 and a16 For gfx10 gradient (g16) and address (a16) can be independent. Previous implementation assumed that a16 implied g16. There are some other changes that fix the verification (as well as asm/disasm) that are required for the included test to pass - the XFAIL will be removed in those changes. This also includes required fixes for GlobalISel Differential Revision: https://reviews.llvm.org/D102066 Change-Id: I7d171cc90994de05f41669b66a6d0ffa2ed05d09	2021-05-14 09:28:15 +01:00
David Stuttard	0a28768900	[AMDGPU][AsmParser/Disassembler] Correct A16 and G16 handling A16 support for image instructions assembly/disassembly (gfx10) was missing Also refactor MIMG op addr size calcs to common function We'd got 3 places where the same operation was being done. One test is now marked XFAIL until a related codegen patch is in place Differential Revision: https://reviews.llvm.org/D102231 Change-Id: I7e86e730ef8c71901457855cba570581f4f576bb	2021-05-14 09:25:44 +01:00
David Spickett	3d7d76ad6f	[llvm][AsmPrinter] Restore source location to register clobber warning Since 5de2d189e6ad466a1f0616195e8c524a4eb3cbc0 this particular warning hasn't had the location of the source file containing the inline assembly. Fix this by reporting via LLVMContext. Which means that we no longer have the "instantiated into assembly here" lines but they were going to point to the start of the inline asm string anyway. This message is already tested via IR in llvm. However we won't have the required location info there so I've added a C file test in clang to cover it. (though strictly, this is testing llvm code) Reviewed By: ychen Differential Revision: https://reviews.llvm.org/D102244	2021-05-14 08:22:57 +00:00
Alexey Bader	ae75421161	New tag for ittapi - fix an error related to cross-compiling ITTAPI in LLVM with mingw Fix was implemented in the ittap repo to solve an error about cross-compiling ITTAPI in LLVM with mingw. The problem occurred in the cross-compilation environment for Julia's dependencies. The corresponding issue item in ittapi repo: https://github.com/intel/ittapi/issues/19 A new tag was created in ittapi repo for that fix. This patch contains changes to update the ittapi tag in LLVM. Reviewed By: bader Differential Revision: https://reviews.llvm.org/D102471	2021-05-14 08:18:49 +03:00
dfukalov	3f7e516e28	[GVN] Clobber partially aliased loads. Use offsets stored in `AliasResult` implemented in D98718. Updated with fix of issue reported in https://reviews.llvm.org/D95543#2745161 Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D95543	2021-05-14 11:17:14 +03:00
David Green	d997870c3e	[DSE] Move isOverwrite into DSEState. NFC This moves the isOverwrite function into the DSEState so that it can share the analyses and members from the state. A few extra loop tests were also added to test stores in and around multi block loops for D100464.	2021-05-14 09:16:51 +01:00
Lang Hames	5fae3540d3	[ORC] Add JITLink dependence for ObjectLinkingLayerTest. This aims to fix the failure at https://lab.llvm.org/buildbot/#/builders/61/builds/9590.	2021-05-13 22:48:30 -07:00
LLVM GN Syncbot	12903dbf9e	[gn build] Port 0fda4c4745b8	2021-05-14 04:56:03 +00:00
Lang Hames	d5111c292d	[ORC] Add support for adding LinkGraphs directly to ObjectLinkingLayer. This is separate from (but builds on) the support added in ec6b71df70a for emitting LinkGraphs in the context of an active materialization. This commit makes LinkGraphs a first-class data structure with features equivalent to object files within ObjectLinkingLayer.	2021-05-13 21:44:13 -07:00
Lang Hames	736891b041	[JITLink] Fix missing 'static' keyword in unit test.	2021-05-13 21:44:13 -07:00
Carl Ritson	6f78a47930	[AMDGPU] Do not clause NSA instructions To ensure correct behaviour NSA instructions should not be claused. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D102211	2021-05-14 12:54:56 +09:00
Lang Hames	ec3bd41065	[ORC] Remove the OrcExecutionTest class. It is no longer used.	2021-05-13 18:32:36 -07:00
Lang Hames	30e7a462fb	[ORC] Remove unused RTDyldObjectLinkingLayerExecutionTest class from unit test.	2021-05-13 18:32:35 -07:00
Lang Hames	7eb0435795	[ORC] Remove some stale unit test utils. This code was used to test ORCv1, which has been removed. It is not useful for testing ORCv2.	2021-05-13 18:32:35 -07:00
Chen Zheng	4eed210a79	[Debug-Info] change Tag type to dwarf::Tag for createAndAddDIE; NFC Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102207	2021-05-13 21:15:06 -04:00
Arthur Eubanks	6e23cf88e5	[test] Fix new-pm-lto-defaults.ll to work on all platforms https://lab.llvm.org/buildbot/#/builders/119/builds/3775/steps/8/logs/FAIL__LLVM__new-pm-lto-defaults_ll Followup to D102345.	2021-05-13 18:12:55 -07:00
Chen Zheng	b9ce1812f9	[Debug-Info] make DIE attributes generation under strict DWARF control Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D101024	2021-05-13 20:34:07 -04:00
Amara Emerson	0ab348ad5b	[AArch64][GlobalISel] Fix a crash during unsuccessful G_CTPOP <2 x s64> legalization. The legalization rule for scalar-same-as doesn't handle vectors. Until we implement custom legalization for this, at least fall back properly.	2021-05-13 17:28:11 -07:00
Reid Kleckner	b409d78956	[gn] Don't pass -fprofile-instr-generate to linker on Windows Avoids a warning from the linker. The user still has to put the resource directory on the linker search path, and I can't find a clean way to do that automatically in gn.	2021-05-13 16:04:11 -07:00
Matt Arsenault	b8b464cdd8	AMDGPU/GlobalISel: Don't hardcode stack alignment in assert message	2021-05-13 19:00:13 -04:00
Matt Arsenault	2f4beff49d	AMDGPU/GlobalISel: Implement tail calls Or at least the sibling call cases which the DAG already handles.	2021-05-13 18:57:42 -04:00
Arthur Eubanks	1b32fba3b3	[IR] Introduce the opaque pointer type The opaque pointer type is essentially just a normal pointer type with a null pointee type. This also adds support for the opaque pointer type to the bitcode reader/writer, as well as to textual IR. To avoid confusion with existing pointer types, we disallow creating a pointer to an opaque pointer. Opaque pointer types should not be widely used at this point since many parts of LLVM still do not support them. The next steps are to add some very simple use cases of opaque pointers to make sure they work, then start pretending that all pointers are opaque pointers and see what breaks. https://lists.llvm.org/pipermail/llvm-dev/2021-May/150359.html Reviewed By: dblaikie, dexonsmith, pcc Differential Revision: https://reviews.llvm.org/D101704	2021-05-13 15:22:27 -07:00
cynecx	aac2298d34	Don't run MachineVerifier on sjlj-unwind-inline-asm test because of known issue (PR39439) Fixes buildbot failure (https://lab.llvm.org/buildbot/#/builders/16/builds/10825). Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D102433	2021-05-13 23:14:05 +01:00
Arthur Eubanks	88e8f8e83e	[docs] Add page on opaque pointer types Reviewed By: dblaikie, dexonsmith Differential Revision: https://reviews.llvm.org/D102292	2021-05-13 15:10:27 -07:00
Nikita Popov	475b09053f	[CaptureTracking] Use isIdentifiedFunctionLocal() (NFC) These conditions together exactly match isIdentifiedFunctionLocal(), and this is also what we logically want to check for here.	2021-05-13 23:06:42 +02:00
Nikita Popov	9577207705	[AA] Use isIdentifiedFunctionLocal() (NFC) This condition is equivalent to isIdentifiedFunctionLocal(), and this is also what we semantically want to check here.	2021-05-13 23:06:42 +02:00
Roman Lebedev	7a524b3d94	Revert "[X86][CostModel] X86TTIImpl::getMemoryOpCost(): rewrite vector handling again" As reported in post-commit feedback, this has issues with e.g. <16 x i1>: https://llvm.godbolt.org/z/jxPvdGEW4 This reverts commit c02476f3158f2908ef0a6f628210b5380bd33695.	2021-05-14 00:03:36 +03:00
Roman Lebedev	014eba5801	Revert "[X86] X86TTIImpl::getInterleavedMemoryOpCostAVX2(): use getMemoryOpCost()" Depends on a commit that is about to be reverted. This reverts commit 69ed93a4355123a45c1d7216aea7cd53d07a361b.	2021-05-14 00:03:36 +03:00
Roman Lebedev	5b84e10115	[X86] AMD Zen 3: same-reg SSE XMM XORPS is a 1-cycle(!) dep-breaking one-idiom While both the SOG and Agner insist that it is zero-cycle, i can not confirm that claim. While it clearly breaks the dependency, i can not come up with a snippet, or measurement approach, to end up with IPC bigger than 4, which, to me, means that it actually consumes execution resource of an FP unit for a cycle.	2021-05-14 00:03:36 +03:00
Roman Lebedev	ad2976f8c7	[NFC][X86][MCA] AMD Zen 3: add same-reg SSE XMM XORPS test	2021-05-14 00:03:36 +03:00
Fangrui Song	00a3d20bf3	[CMake][ELF] Link libLLVM.so and libclang-cpp.so with -Bsymbolic-functions llvm-dev message: https://lists.llvm.org/pipermail/llvm-dev/2021-May/150465.html In an ELF shared object, a default visibility defined symbol is preemptible by default. This creates some missed optimization opportunities. -Bsymbolic-functions is more aggressive than our current -fvisibility-inlines-hidden (present since 2012) as it applies to all function definitions. It can * avoid PLT for cross-TU function calls && reduce dynamic symbol lookup * reduce dynamic symbol lookup for taking function addresses and optimize out GOT/TOC on x86-64/ppc64 In a -DLLVM_TARGETS_TO_BUILD=X86 build, the number of JUMP_SLOT decreases from 12716 to 1628, and the number of GLOB_DAT decreases from 1918 to 1313 The built clang with `-DLLVM_LINK_LLVM_DYLIB=on -DCLANG_LINK_CLANG_DYLIB=on` is significantly faster. See the Linux kernel build result https://bugs.archlinux.org/task/70697 Note: the performance of -fno-semantic-interposition -Bsymbolic-functions libLLVM.so and libclang-cpp.so is close to a PIE binary linking against `libLLVM.a` and `libclang.a`. When the host compiler is Clang, -Bsymbolic-functions is the major contributor. On x86-64 (with GOTPCRELX) and ppc64 ELFv2, the GOT/TOC relocations can be optimized. Some implication: Interposing a subset of functions is no longer supported. (This is fragile on ELF and unsupported on Mach-O at all. For Mach-O we don't use `ld -interpose` or `-flat_namespace`) Compiling a program which takes the address of any LLVM function with `{gcc,clang} -fno-pic` and expects the address to equal to the address taken from libLLVM.so or libclang-cpp.so is unsupported. I am fairly confident that llvm-project shouldn't have different behaviors depending on such pointer equality (as we've been using -fvisibility-inlines-hidden which applies to inline functions for a long time), but if we accidentally do, users should be aware that they should not make assumption on pointer equality in `-fno-pic` mode. See more on https://maskray.me/blog/2021-05-09-fno-semantic-interposition Reviewed By: phosek Differential Revision: https://reviews.llvm.org/D102090	2021-05-13 13:44:57 -07:00
Joseph Huber	21cc6bcd03	[OpenMP] Prevent Attributor from deleting functions in OpenMPOptCGSCC pass Summary: This patch prevents the Attributor instances made in the CGSCC pass from deleting functions. This prevents the attributor from changing the call graph while OpenMPOpt is working with it. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D102363	2021-05-13 16:35:23 -04:00
Martin Storsjö	b2438b1473	[llvm-nm] Support the -V option, print that the tool is compatible with GNU nm This unlocks some codepaths in libtool. Differential Revision: https://reviews.llvm.org/D102321	2021-05-13 22:36:25 +03:00
Aakanksha Patil	e5759b8eca	[AMDGPU] Add gfx1034 target Differential Revision: https://reviews.llvm.org/D102306	2021-05-13 14:25:18 -04:00
cynecx	729fe8041c	Support unwinding from inline assembly I've taken the following steps to add unwinding support from inline assembly: 1) Add a new `unwind` "attribute" (like `sideeffect`) to the asm syntax: ``` invoke void asm sideeffect unwind "call thrower", "~{dirflag},~{fpsr},~{flags}"() to label %exit unwind label %uexit ``` 2.) Add Bitcode writing/reading support + LLVM-IR parsing. 3.) Emit EHLabels around inline assembly lowering (SelectionDAGBuilder + GlobalISel) when `InlineAsm::canThrow` is enabled. 4.) Tweak InstCombineCalls/InlineFunction pass to not mark inline assembly "calls" as nounwind. 5.) Add clang support by introducing a new clobber: "unwind", which lower to the `canThrow` being enabled. 6.) Don't allow unwinding callbr. Reviewed By: Amanieu Differential Revision: https://reviews.llvm.org/D95745	2021-05-13 19:13:03 +01:00
Stefan Pintilie	20d928df72	[PowerPC] Add ROP Protection to prologue and epilogue Added hashst to the prologue and hashchk to the epilogue. The hash for the prologue and epilogue must always be stored as the first element in the local variable space on the stack. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D99377	2021-05-13 12:54:44 -05:00
David Green	65a6f9a60e	[ARM] Constrain CMPZ shift combine to a single use We currently prefer t2CMPrs over t2CMPri when the node contains a shift. This can introduce more nodes if the shift has multiple uses though, as value from the shift will be needed anyway, and in the case of a t2CMPri compared with zero will more readily be removed entirely. Differential Revision: https://reviews.llvm.org/D101688	2021-05-13 18:31:01 +01:00
Joe Ellis	3dc0c1e7a7	[InstSimplify] Remove redundant {insert,extract}_vector intrinsic chains This commit removes some redundant {insert,extract}_vector intrinsic chains by implementing the following patterns as instsimplifies: (insert_vector _, (extract_vector X, 0), 0) -> X (extract_vector (insert_vector _, X, 0), 0) -> X Reviewed By: peterwaller-arm Differential Revision: https://reviews.llvm.org/D101986	2021-05-13 16:09:50 +00:00
Stanislav Mekhanoshin	e5d6c898b4	[AMDGPU] Only allow global fp atomics with unsafe option Previously we were allowing to use FP atomics without -amdgpu-unsafe-fp-atomics option if a scope is less then system. This is not safe just as well if we have UC memory. This change only allows global and flat FP atomics with the unsafe option. Consequentially that makes a check for denorm mode redundant since we skip it with the unsafe option and do not have a way to produce these instructions without it anyway. Differential Revision: https://reviews.llvm.org/D102347	2021-05-13 08:52:20 -07:00
Bradley Smith	72cb3b6e1a	[AArch64][SVE] Fix missed immediate selection due to mishandling of signedness The complex selection pattern for add/sub shifted immediates is incorrect in it's handling of incoming constant values, in that it does not properly anticipate the values to be signed extended to 32-bits. Co-authored-by: Graham Hunter <graham.hunter@arm.com> Differential Revision: https://reviews.llvm.org/D101833	2021-05-13 16:02:49 +01:00
Paul C. Anagnostopoulos	7b982bc8b5	[TableGen] Make the NUL character invalid in .td files Now uses tr instead of sed. Differential Revision: https://reviews.llvm.org/D102254	2021-05-13 10:17:45 -04:00
Juneyoung Lee	3b0d18824a	Reapply [ConstantFold] Fold more operations to poison This was reverted to mitigate mitigate miscompiles caused by the logical and/or to bitwise and/or fold. Reapply it now that the underlying issue has been fixed by D101191. ----- This patch folds more operations to poison. Alive2 proof: https://alive2.llvm.org/ce/z/mxcb9G (it does not contain tests about div/rem because they fold to poison when raising UB) Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D92270	2021-05-13 16:04:12 +02:00
Krzysztof Parzyszek	78ddcf78f9	Fix section title underlining in the release notes	2021-05-13 08:37:06 -05:00
Oliver Stannard	260194e7a2	Revert "[CMake][ELF] Add -fno-semantic-interposition and -Bsymbolic-functions" This reverts commit 3bf1acab5b454ad7fb2074b34663108b53620695. This is causing the test `gcov-shared-flush.c' to fail on the 2-stage aarch64 buildbots (https://lab.llvm.org/buildbot/#/builders/7/builds/2720).	2021-05-13 14:31:17 +01:00
Krzysztof Parzyszek	5562e5f41c	Add entry about Hexagon V68 support to the release notes	2021-05-13 08:28:55 -05:00

1 2 3 4 5 ...

215689 Commits