llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 10:42:39 +01:00

Author	SHA1	Message	Date
Simon Moll	dcdd80355c	[VE] Vector 'and' isel and tests Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D93709	2020-12-23 13:29:29 +01:00
Sebastian Neubauer	7a6d989537	[AMDGPU][GlobalISel] Fold flat vgpr + constant addresses Use getPtrBaseWithConstantOffset in selectFlatOffsetImpl to fold more vgpr+constant addresses. Differential Revision: https://reviews.llvm.org/D93692	2020-12-23 10:40:30 +01:00
ShihPo Hung	07cf8b0588	[RISCV] Add intrinsics for vfwmacc, vfwnmacc, vfwmsac, vfwnmsac instructions This patch defines vfwmacc, vfwnmacc, vfwmsc, vfwnmsac intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93693	2020-12-23 00:42:04 -08:00
Georgii Rymar	baa62a1798	[llvm-readobj] - Dump the ELF file type better. Currently llvm-readelf might print "OS Specific/Processor Specific/<unknown>" hint when dumping the ELF file type. The patch teaches llvm-readobj to do the same. This fixes https://bugs.llvm.org/show_bug.cgi?id=40868 I am removing `Object/elf-unknown-type.test` test because it is not in the right place, it is outdated and very limited. The `readobj/ELF/file-types.test` checks the functionality much better. Differential revision: https://reviews.llvm.org/D93689	2020-12-23 11:13:19 +03:00
Zakk Chen	26eb06f03f	[RISCV] Define vmerge/vfmerge intrinsics. Define vmerge/vfmerge intrinsics and lower to V instructions. Include support for vector-vector vfmerge by vmerge.vvm. We work with @rogfer01 from BSC to come out this patch. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D93674	2020-12-23 00:07:09 -08:00
Evandro Menezes	33fa319e35	[RISCV] Define the vfmin, vfmax RVV intrinsics Define the vfmin, vfmax IR intrinsics for the respective V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: Evandro Menezes <evandro.menezes@sifive.com> Differential Revision: https://reviews.llvm.org/D93673	2020-12-23 00:27:38 -06:00
Arthur Eubanks	00b1ba347d	[NewPM] Fix objc-arc-apelim pass typo	2020-12-22 21:40:43 -08:00
Kazu Hirata	0be0e8002c	[NewGVN] Remove for_each_found (NFC) The last use of the function was removed on Sep 30, 2017 in commit 9b926e90d33e0f71c16618365333fc7b330b6bb5.	2020-12-22 20:13:27 -08:00
Thomas Lively	48e96cf778	[WebAssembly][NFC] Refactor SIMD load/store tablegen defs Introduce `Vec` records, each bundling all information related to a single SIMD lane interpretation. This lets TableGen definitions take a single Vec parameter from which they can extract information rather than taking multiple redundant parameters. This commit refactors all of the SIMD load and store instruction definitions to use the new `Vec`s. Subsequent commits will similarly refactor additional instruction definitions. Differential Revision: https://reviews.llvm.org/D93660	2020-12-22 20:06:12 -08:00
Kazu Hirata	c74e8374d5	[MemorySSA] Use is_contained (NFC)	2020-12-22 19:58:54 -08:00
Matt Arsenault	7df923493b	GlobalISel: Return APInt from getConstantVRegVal Returning int64_t was arbitrarily limiting for wide integer types, and the functions should handle the full generality of the IR. Also changes the full form which returns the originally defined vreg. Add another wrapper for the common case of just immediately converting to int64_t (arguably this would be useful for the full return value case as well). One possible issue with this change is some of the existing uses did break without conversion to getConstantVRegSExtVal, and it's possible some without adequate test coverage are now broken.	2020-12-22 22:23:58 -05:00
Matt Arsenault	4e9da4ff71	AMDGPU: Use Register	2020-12-22 21:55:59 -05:00
Matt Arsenault	3419b798ef	AMDGPU: Add spilled CSR SGPRs to entry block live ins	2020-12-22 21:55:59 -05:00
ShihPo Hung	a268fdb29f	[RISCV] Add intrinsics for vf[n]macc/vf[n]msac/vf[n]madd/vf[n]msub instructions This patch defines vfmadd/vfnmacc, vfmsac/vfnmsac, vfmadd/vfnmadd, and vfmsub/vfnmsub lower to V instructions. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93691	2020-12-22 18:34:00 -08:00
ShihPo Hung	ec91727e53	[RISCV] Add intrinsics for vwmacc[u\|su\|us] instructions This patch defines vwmacc[u\|su\|us] intrinsics and lower to V instructions. We work with @rogfer01 from BSC to come out this patch. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93675	2020-12-22 18:17:39 -08:00
ShihPo Hung	819a1811bd	[RISCV] Add intrinsics for vslide1up/down, vfslide1up/down instruction This patch adds intrinsics for vslide1up, vslide1down, vfslide1up, vfslide1down. Authored-by: Roger Ferrer Ibanez <rofirrim@gmail.com> Co-Authored-by: ShihPo Hung <shihpo.hung@sifive.com> Differential Revision: https://reviews.llvm.org/D93608	2020-12-22 18:14:22 -08:00
Matt Arsenault	a33259aebe	AMDGPU: Fix assert when checking for implicit operand legality	2020-12-22 20:56:24 -05:00
Matt Arsenault	43e94d4c6a	VirtRegMap: Use Register	2020-12-22 20:56:14 -05:00
Arthur O'Dwyer	bf9fb2bea1	Replace `T(x)` with `reinterpret_cast<T>(x)` everywhere it means reinterpret_cast. NFC. Differential Revision: https://reviews.llvm.org/D76572	2020-12-22 19:54:29 -05:00
Stanislav Mekhanoshin	09ef5f7bec	[AMDGPU][GlobalISel] GlobalISel for flat scratch It does not seem to fold offsets but this is not specific to the flat scratch as getPtrBaseWithConstantOffset() does not return the split for these tests unlike its SDag counterpart. Differential Revision: https://reviews.llvm.org/D93670	2020-12-22 16:33:06 -08:00
Stanislav Mekhanoshin	b2c1643118	[AMDGPU] Support unaligned flat scratch in TLI Adjust SITargetLowering::allowsMisalignedMemoryAccessesImpl for unaligned flat scratch support. Mostly needed for global isel. Differential Revision: https://reviews.llvm.org/D93669	2020-12-22 16:12:31 -08:00
Thomas Lively	92eadd3cde	[WebAssembly][SIMD] Rename shuffle, swizzle, and load_splats These instructions previously used prefixes like v8x16 to signify that they were agnostic between float and int interpretations. We renamed these instructions to remove this form of prefix in https://github.com/WebAssembly/simd/issues/297 and https://github.com/WebAssembly/simd/issues/316 and this commit brings the names in LLVM up to date. Differential Revision: https://reviews.llvm.org/D93722	2020-12-22 14:29:06 -08:00
Sanjay Patel	4ddc126029	[SLP] add reduction tests for maxnum/minnum intrinsics; NFC	2020-12-22 16:05:39 -05:00
Sanjay Patel	39bfb797e5	[SLP] use operand index abstraction for number of operands I think this is NFC currently, but the bug would be exposed when we allow binary intrinsics (maxnum, etc) as candidates for reductions. The code in matchAssociativeReduction() is using OperationData::getNumberOfOperands() when comparing whether the "EdgeToVisit" iterator is in-bounds, so this code must use the same (potentially offset) operand value to set the "EdgeToVisit".	2020-12-22 16:05:39 -05:00
Craig Topper	ab5054f7c8	[RISCV] Remove unneeded !eq comparing a single bit value to 0/1 in RISCVInstrInfoVPseudos.td. NFC Instead we can either use the bit directly. If it was checking for 0 we need to swap the operands or use !not.	2020-12-22 11:57:16 -08:00
Arnold Schwaighofer	213a4b317b	Add a llvm.coro.end.async intrinsic The llvm.coro.end.async intrinsic allows to specify a function that is to be called as the last action before returning. This function will be inlined after coroutine splitting. This function can contain a 'musttail' call to allow for guaranteed tail calling as the last action. Differential Revision: https://reviews.llvm.org/D93568	2020-12-22 10:52:28 -08:00
Stanislav Mekhanoshin	90da9cd31d	[AMDGPU] Folding of FI operand with flat scratch Differential Revision: https://reviews.llvm.org/D93501	2020-12-22 10:48:04 -08:00
sameeran joshi	1f7bfce5b3	Revert "[Flang][openmp][5.0] Add task_reduction clause." This reverts commit 9a7895dc20852b662a66976d06871ec2a0b968c8. Reverting due to missing Co-author attribution. https://reviews.llvm.org/D93105	2020-12-22 23:53:51 +05:30
Nathan James	201e2329a7	[ADT] Fix some tests after 5d10b8ad Some bots were failing due to signed/unsigned comparison.	2020-12-22 18:06:19 +00:00
Florian Hahn	b4471f9735	[LoopDeletion] Add test case where outer loop needs to be deleted. In the test case @test1, the inner loop cannot be removed, because it has a live-out value. But the outer loop is a no-op and can be removed.	2020-12-22 17:49:20 +00:00
Philip Reames	9dce9d4c76	[tests] precommit a test mentioned in review for D93317	2020-12-22 09:47:19 -08:00
Nathan James	b6e73d2ad1	[ADT] Add resize_for_overwrite method to SmallVector. Analagous to the std::make_(unqiue\|shared)_for_overwrite added in c++20. If T is POD, and the container gets larger, any new values added wont be initialized. This is useful when using SmallVector as a buffer where its planned to overwrite any potential new values added. If T is not POD, `new (Storage) T` functions identically to `new (Storage) T()` so this will function identically to `resize(size_type)`. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D93532	2020-12-22 17:18:59 +00:00
Paul Walker	70073e607b	Fix some misnamed variables in sve-fixed-length-int-minmax.ll.	2020-12-22 17:11:23 +00:00
Kamau Bridgeman	86aa55c858	[PowerPC][Power10] Exploit store rightmost vector element instructions Using the store rightmost vector element instructions to do vector element extraction and store. The rightmost vector element on little endian is the zeroth vector element, with these patterns that element can be extracted and stored in one instruction for all vector types. Differential Revision: https://reviews.llvm.org/D89195	2020-12-22 12:06:43 -05:00
sameeran joshi	d525c0db00	[Flang][openmp][5.0] Add task_reduction clause. See OMP-5.0 2.19.5.5 task_reduction Clause. To add a positive test case we need `taskgroup` directive which is not added hence skipping the test. This is a dependency for `taskgroup` construct. Reviewed By: clementval Differential Revision: https://reviews.llvm.org/D93105	2020-12-22 22:34:38 +05:30
Paul Walker	1499ac0fc7	[SVE] Lower vector BITREVERSE and BSWAP operations. These operations are lowered to RBIT and REVB instructions respectively. In the case of fixed-length support using SVE we also lower BITREVERSE operating on NEON sized vectors as this results in fewer instructions. Differential Revision: https://reviews.llvm.org/D93606	2020-12-22 16:49:50 +00:00
Nandor Licker	b6ede64cc1	[RISCV] Basic jump table lowering This patch enables jump table lowering in the RISC-V backend. In addition to the test case included, the new lowering was tested by compiling the OCaml runtime and running it under qemu. Differential Revision: https://reviews.llvm.org/D92097	2020-12-22 15:05:54 +00:00
clementval	503c7c65fe	[openacc][openmp][NFC] Fix typo in comments	2020-12-22 09:59:50 -05:00
Florian Hahn	4cdccd0b93	[LV] Use ScalarEvolution::getURemExpr to reduce duplication. ScalarEvolution should be able to handle both constant and variable trip counts using getURemExpr, so we do not have to handle them separately. This is a small simplification of a56280094e08. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D93677	2020-12-22 14:48:42 +00:00
Paul C. Anagnostopoulos	e6292b9b96	[MCInstrDesc] [TableGen] Reduce size of MCOperandInfo instances. Differential Revision: https://reviews.llvm.org/D93326	2020-12-22 09:44:30 -05:00
Jan Svoboda	f04bf685e6	[clang][cli] Implement `getAllArgValues` marshalling This infrastructure can be used ~30 more command line options. Reviewed By: dexonsmith Differential Revision: https://reviews.llvm.org/D93631	2020-12-22 14:11:16 +01:00
Sjoerd Meijer	e92d6b8dd8	[AArch64] Add a test for MachineLICM SinkIntoLoop. NFC.	2020-12-22 12:22:24 +00:00
Nico Weber	101d5213c5	Revert "-fstack-clash-protection: Return an actual error when used on unsupported OS" This reverts commit 4d59c8fdb955ea0d668b854f467e12bce05a8857. Breaks tens of thousands of tests, and had pending review comments, see comments on https://reviews.llvm.org/D92245 (and e.g. http://lab.llvm.org:8011/#/builders/109/builds/5236 for failures).	2020-12-22 06:51:19 -05:00
Nemanja Ivanovic	437b6055f0	[PowerPC] Restore stack ptr from base ptr when available On subtargets that have a red zone, we will copy the stack pointer to the base pointer in the prologue prior to updating the stack pointer. There are no other updates to the base pointer after that. This suggests that we should be able to restore the stack pointer from the base pointer rather than loading it from the back chain or adding the frame size back to either the stack pointer or the frame pointer. This came about because functions that call setjmp need to restore the SP from the FP because the back chain might have been clobbered (see https://reviews.llvm.org/D92906). However, if the stack is realigned, the restored SP might be incorrect (which is what caused the failures in the two ASan test cases). This patch was tested quite extensivelly both with sanitizer runtimes and general code. Differential revision: https://reviews.llvm.org/D93327	2020-12-22 05:44:03 -06:00
Nico Weber	88d23da9ea	[gn build] (manually) port b8c37153d5393	2020-12-22 06:35:40 -05:00
David Spickett	fe844f277e	[llvm][Arm/AArch64] Format extension flags in CPU test failures Previously you just two hex numbers you had to decode manually. This change adds a predicate formatter for extension flags to produce failure messages like: ``` [ RUN ] AArch64CPUTests/AArch64CPUTestFixture.testAArch64CPU/2 <...>llvm/unittests/Support/TargetParserTest.cpp:862: Failure Expected extension flags: +fp-armv8, +crc, +crypto (0xe) Got extension flags: +fp-armv8, +neon, +crc, +crypto (0x1e) [ FAILED ] AArch64CPUTests/AArch64CPUTestFixture.testAArch64CPU/2, where GetParam() = "cortex-a34", "armv8-a", <...> ``` From there you can take the feature name and map it back to the enum in ARM/AArch64TargetParser.def. (which isn't perfect but you've probably got both files open if you're editing these tests) Note that AEK_NONE is not meant to be user facing in the compiler but here it is part of the tests. So failures may show an extension "none" where the normal target parser wouldn't. The formatter is implemented as a template on ARM::ISAKind because the predicate formatters assume all parameters are used for comparison. (e.g. PRED_FORMAT3 is for comparing 3 values, not having 3 arguments in general) Reviewed By: MarkMurrayARM Differential Revision: https://reviews.llvm.org/D93448	2020-12-22 11:13:36 +00:00
Sylvestre Ledru	3ce38f4632	-fstack-clash-protection: Return an actual error when used on unsupported OS $ clang-12: error: -fstack-clash-protection is not supported on Windows or Mac OS X Differential Revision: https://reviews.llvm.org/D92245	2020-12-22 12:06:08 +01:00
Siddhesh Poyarekar	f256168c20	Fold comparison of __builtin_object_size expression with -1 for non-const size When __builtin_dynamic_object_size returns a non-constant expression, it cannot be -1 since that is an invalid return value for object size. However since passes running after the substitution don't know this, they are unable to optimize away the comparison and hence the comparison and branch stays in there. This change generates an appropriate call to llvm.assume to help the optimizer folding the test. glibc is considering adopting __builtin_dynamic_object_size for additional protection[1] and this change will help reduce branching overhead in fortified implementations of all of the functions that don't have the __builtin___*_chk type builtins, e.g. __ppoll_chk. Also remove the test limit-max-iterations.ll because it was deemed unnecessary during review. [1] https://sourceware.org/pipermail/libc-alpha/2020-November/120191.html Differential Revision: https://reviews.llvm.org/D93015	2020-12-22 10:56:31 +01:00
Florian Hahn	f84e692f2b	[VPlan] Make VPInstruction a VPDef This patch turns updates VPInstruction to manage the value it defines using VPDef. The VPValue is used during VPlan construction and codegeneration instead of the plain IR reference where possible. Reviewed By: gilr Differential Revision: https://reviews.llvm.org/D90565	2020-12-22 09:53:47 +00:00
Sjoerd Meijer	cf31f6d3c6	[MachineLICM] Add llvm debug messages to SinkIntoLoop. NFC. I am investigating sinking instructions back into the loop under high register pressure. This is just a first NFC step to add some debug messages that allows tracing of the decision making.	2020-12-22 09:19:43 +00:00

1 2 3 4 5 ...

208673 Commits