llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 11:33:24 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	4b200d97f1	[X86] Regenerate tbm intrinsics tests. NFCI. Merge prefixes where possible, use 'X86' instead of 'X32' (which we try to only use for gnux32 triple tests).	2020-10-27 16:45:47 +00:00
Simon Pilgrim	0bef6245da	[X86] Regenerate popcnt tests. NFCI. Merge prefixes where possible, use 'X86' instead of 'X32' (which we try to only use for gnux32 triple tests).	2020-10-27 16:45:46 +00:00
Simon Pilgrim	b0dfafed55	[X86] Regenerate xop tests with common prefixes.	2020-10-27 16:45:46 +00:00
Yashaswini Hegde	40798dd5b1	[Flang][OpenMP 4.5] Add semantic check for OpenMP default clause	2020-10-27 12:38:47 -04:00
Jay Foad	2ab75283cb	[AMDGPU] Add llvm.amdgcn.div.scale with fneg tests	2020-10-27 16:05:51 +00:00
Tony	f0f576d14d	[AMDGPU] Add missing support for targets - Add missing tests. Differential Revision: https://reviews.llvm.org/D90212	2020-10-27 15:36:31 +00:00
Florian Hahn	34968e9ca6	[AArch64] Add additional tests for vector inserts with common element.	2020-10-27 14:58:56 +00:00
Raphael Isemann	80dc6586fb	Revert "[IndVars] Remove monotonic checks with unknown exit count" This reverts commit c6ca26c0bfedb8f80d6f8cb9adde25b1d6aac1c5. This breaks stage2 builds due to hitting this assert: ``` Assertion failed: (WeightSum <= UINT32_MAX && "Expected weights to scale down to 32 bits"), function calcMetadataWeights ``` when compiling AArch64RegisterBankInfo.cpp in LLVM.	2020-10-27 15:31:37 +01:00
Raphael Isemann	f3c8e5c597	Revert "[NFC] Factor away lambda's redundant parameter" This reverts commit fdc845b36130d162e5a66e427bf69b2c37b6c6bb. It seems to be a follow-up to c6372b3fb495 which will be reverted.	2020-10-27 15:30:52 +01:00
Michael Liao	ebdef472f9	[amdgpu] Enable use of AA during codegen. - Add an internal option `-amdgpu-use-aa-in-codegen` to enable or disable this feature. By Default, it's enabled. Differential Revision: https://reviews.llvm.org/D89320	2020-10-27 09:46:23 -04:00
Simon Pilgrim	ef550a5251	Revert rG0905bd5c2fa42bd4c "[InstCombine] collectBitParts - add trunc support." This reverts commit 0905bd5c2fa42bd4c0e6e0aaa08b966f165b9dfa. Causing failures in multistage buildbots that I need to investigate	2020-10-27 13:43:54 +00:00
Benjamin Kramer	b4071a353d	[X86] Don't crash on CVTPS2PH with wide vector inputs.	2020-10-27 14:42:02 +01:00
Simon Pilgrim	60eeed7af0	[X86] Regenerate all-ones vector tests with common prefixes.	2020-10-27 13:41:27 +00:00
Nico Weber	1ea6033a22	Revert "Use uint64_t for branch weights instead of uint32_t" This reverts commit e5766f25c62c185632e3a75bf45b313eadab774b. Makes clang assert when building Chromium, see https://crbug.com/1142813 for a repro.	2020-10-27 09:26:21 -04:00
Simon Pilgrim	29e9d9a1e2	[X86] Regenerate vector shift tests. NFCI. Merge prefixes where possible, use 'X86' instead of 'X32' (which we try to only use for gnux32 triple tests).	2020-10-27 13:14:54 +00:00
Simon Pilgrim	3e4f2ad439	[InstCombine] collectBitParts - add trunc support. This should allow us to remove the rather limited matchOrConcat fold and just use recognizeBSwapOrBitReverseIdiom.	2020-10-27 13:14:54 +00:00
Djordje Todorovic	d3e8393700	[NFC][IntrRefLDV] Some code clean up As reading the source code, I've found some minor nits: -Use using instead of typedef -Fix a comment -Refactor Differential Revision: https://reviews.llvm.org/D90155	2020-10-27 05:31:24 -07:00
Sven van Haastregt	a77c70490e	[TargetLowering] Add i1 condition for bit comparison fold For i1 types, boolean false is represented identically regardless of the boolean content, so we can allow optimizations that otherwise would not be correct for booleans with false represented as a negative one. Patch by Erik Hogeman. Differential Revision: https://reviews.llvm.org/D90145	2020-10-27 12:22:20 +00:00
LLVM GN Syncbot	3188a13350	[gn build] Port 850325348ae	2020-10-27 12:17:41 +00:00
Alex Richardson	ab6e2d9347	[ValueTracking][NFC] Use Log2(Align) instead of countTrailingZeroes The latter can probably be optimized to the same final code, but this might help -O0 builds.	2020-10-27 12:16:45 +00:00
Alex Richardson	58c20890b0	[ValueTracking] Add more tests for alignment assume bundles I noticed that alignment was no longer inferred as well after I last merged our CHERI fork from upstream. I opened this review before seeing that D88669 already fixes the same problem, so this commit simply adds the new test that I added as part of this change. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D89830	2020-10-27 12:16:45 +00:00
Shimin Cui	29a0e7a508	[ValueTracking] Add tracking of the alignment assume bundle This patch is to add the support of the value tracking of the alignment assume bundle. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D88669	2020-10-27 12:16:45 +00:00
Sebastian Neubauer	d2b71753ac	msgpack: Improve error for empty node	2020-10-27 12:57:00 +01:00
Roman Lebedev	49368a63f8	[InstCombine] Fold `(X >>? C1) << C2` patterns to shift+bitmask (PR37872) This is essentially finalizes a revert of rL155136, because nowadays the situation has improved, SCEV can model all these patterns well, and we canonicalize rotate-like patterns into a funnel shift intrinsics in InstCombine. So this should not cause any pessimization. I've verified the canonicalize-{a,l}shr-shl-to-masking.ll transforms with alive, which confirms that we can freely preserve exact-ness, and no-wrap flags. Profs: * base: https://rise4fun.com/Alive/gPQ * exact-ness preservation: https://rise4fun.com/Alive/izi * nuw preservation: https://rise4fun.com/Alive/DmD * nsw preservation: https://rise4fun.com/Alive/SLN6N * nuw nsw preservation: https://rise4fun.com/Alive/Qp7 Refs. https://reviews.llvm.org/D46760	2020-10-27 14:42:53 +03:00
Roman Lebedev	27b5264858	[NFC][PhaseOrdering] Autogenerate basic.ll test	2020-10-27 14:42:53 +03:00
Roman Lebedev	ff2b16b568	[NFC][InstCombine] Autogenerate cast.ll test	2020-10-27 14:42:52 +03:00
Roman Lebedev	7c2811a760	[NFC][InstCombine] Add more exhaustive test coverage for `(x >>? X1) << C2` pattern (PR37872)	2020-10-27 14:42:52 +03:00
Kazushi (Jam) Marukawa	7caf0af059	[VE] Add vector float instructions Add VFAD/VFSB/VFMP/VFDV/VFSQRT/VFCP/VFCM/VFMAD/VFMSB/VFNMAD/VFNMSB/ VRCP/VRSQRT/VRSQRTNEX/VFIX/VFIXX/VFLT/VFLTX/VCVS/VCVD instructions. Add regression tests too. Also add additional AsmParser for VFIX and VFIXX instructions to parse their mnemonic. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90166	2020-10-27 20:42:24 +09:00
Kazushi (Jam) Marukawa	aa30484b49	[VE] Add missing regression test In the previous "Add vector shift instructions", I forgot to add regression tests for VSRL and VSRD instructions. This patch is adding them. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D90167	2020-10-27 20:40:30 +09:00
Georgii Rymar	4708221cf7	[llvm-readelf] - Implement --section-details option. --section-details/-t is a GNU readelf option that produce an output that is an alternative to --sections. Differential revision: https://reviews.llvm.org/D89304	2020-10-27 13:29:39 +03:00
Med Ismail Bennani	a554ad6fde	[llvm/DebugInfo] Simplify DW_OP_implicit_value condition (NFC) Signed-off-by: Med Ismail Bennani <medismail.bennani@gmail.com>	2020-10-27 11:25:19 +01:00
Jay Foad	1e89a40c5d	[AMDGPU] Use DPP instead of Ext in a couple of class names. NFC.	2020-10-27 10:22:30 +00:00
Georgii Rymar	76df633769	[yaml2obj] - Add a way to override the sh_addralign field of a section. Imagine the following declaration of a section: ``` Sections: - Name: .dynsym Type: SHT_DYNSYM AddressAlign: 0x1111111111111111 ``` The aligment is large and yaml2obj reports an error currently: "the desired output size is greater than permitted. Use the --max-size option to change the limit" This patch implements the "ShAddrAlign" key, which is similar to other "Sh*" keys we have. With it it is possible to override the `sh_addralign` field, ignoring the writing of alignment bytes. Differential revision: https://reviews.llvm.org/D90019	2020-10-27 13:03:38 +03:00
Florian Hahn	5d85b8dafe	[LoopRotation] Allow loop header duplication if vectorization is forced. -Oz normally does not allow loop header duplication so this loop wouldn't be vectorized. However the vectorization pragma should override this and allow for loop rotation. rdar://problem/49281061 Original patch by Adam Nemet. Reviewed By: Meinersbur Differential Revision: https://reviews.llvm.org/D59832	2020-10-27 09:28:01 +00:00
David Green	a094c34600	[ARM][AArch64] Add VLDN shuffled interleaving tests. NFC	2020-10-27 09:27:32 +00:00
Max Kazantsev	a367d72019	[Test] One more range check test	2020-10-27 14:51:36 +07:00
Craig Topper	b7f1619d2b	[X86] Alternate implementation of D88194. This uses PreprocessISelDAG to replace the constant before instruction selection instead of matching opcodes after. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D89178	2020-10-27 00:20:03 -07:00
Wei Wang	16ed7005a2	[X86] Encode global address in small code model In small code model, program and its symbols are linked in the lower 2 GB of the address space. Try encoding global address even when the range is unknown in such case. Differential Revision: https://reviews.llvm.org/D89341	2020-10-26 23:14:06 -07:00
Max Kazantsev	158b2e3e50	[NFC] Factor away lambda's redundant parameter	2020-10-27 12:56:52 +07:00
Serguei Katkov	fd4f36fcb9	[GVN LoadPRE] Add an option to disable splitting backedge GVN Load PRE can split the backedge causing breaking the loop structure where the latch contains the conditional branch with for example induction variable. Different optimizations expect this form of the loop, so it is better to preserve it for some time. This CL adds an option to control an ability to split backedge. Default value is true so technically it is NFC and current behavior is not changed. Reviewers: fedor.sergeev, mkazantsev, nikic, reames, fhahn Reviewed By: mkazasntsev Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D89854	2020-10-27 11:59:52 +07:00
Max Kazantsev	e22ba25d54	[IndVars] Remove monotonic checks with unknown exit count Even if the exact exit count is unknown, we can still prove that this exit will not be taken. If we can prove that the predicate is monotonic, fulfilled on first & last iteration, and no overflow happened in between, then the check can be removed. Differential Revision: https://reviews.llvm.org/D87832 Reviewed By: apilipenko	2020-10-27 11:35:16 +07:00
Jonas Devlieghere	7274d684e5	Fix calls to (p)read on macOS when size > INT32_MAX On macOS, the read and pread syscalls return EINVAL when the number of bytes to read exceeds INT32_MAX: `a449c6a3b8/bsd/kern/sys_generic.c (L355)` rdar://68751407 Differential revision: https://reviews.llvm.org/D90201	2020-10-26 20:51:44 -07:00
Arthur Eubanks	f956540dd4	Reland [AlwaysInliner] Pass callee AAResults to InlineFunction() Test copied from noalias-calls.ll with small changes. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D89609	2020-10-26 20:40:46 -07:00
Arthur Eubanks	3273c1a681	Use uint64_t for branch weights instead of uint32_t CallInst::updateProfWeight() creates branch_weights with i64 instead of i32. To be more consistent everywhere and remove lots of casts from uint64_t to uint32_t, use i64 for branch_weights. Reviewed By: davidxl Differential Revision: https://reviews.llvm.org/D88609	2020-10-26 20:24:04 -07:00
Arthur Eubanks	17b16c6882	Revert "[AlwaysInliner] Pass callee AAResults to InlineFunction()" This reverts commit 504fbec7a61cdfbb5f6e1b25cf14afe5195ccaf6. Test failure.	2020-10-26 20:23:38 -07:00
Bing1 Yu	450c5e89bb	[CostModel][X86] teach TTI calculate cost of chain of vector inserts/extracts more precisely and correctly:In each 128-lane, if there is at least one index is demanded and not all indices are demanded... In each 128-lane, if there is at least one index is demanded and not all indices are demanded and this 128-lane is not the first 128-lane of the legalized-vector, then this 128-lane needs a extracti128; If in each 128-lane, there is at least one index is demanded, this 128-lane needs a inserti128. The following cases will help you build a better understanding: Assume we insert several elements into a v8i32 vector in avx2, Case#1: inserting into 1th index needs vpinsrd + inserti128 Case#2: inserting into 5th index needs extracti128 + vpinsrd + inserti128 Case#3: inserting into 4,5,6,7 index needs 4*vpinsrd + inserti128. Reviewed By: pengfei, RKSimon Differential Revision: https://reviews.llvm.org/D89767	2020-10-27 11:21:13 +08:00
Arthur Eubanks	b58f72c901	[AlwaysInliner] Pass callee AAResults to InlineFunction() Test copied from noalias-calls.ll with small changes. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D89609	2020-10-26 20:10:09 -07:00
Arthur Eubanks	e86f39b76e	[PlaceSafepoints] Pin tests to legacy PM This pass isn't used in tree and can be ported to the NPM later on if desired. Differential Revision: https://reviews.llvm.org/D90189	2020-10-26 20:07:37 -07:00
Arthur Eubanks	7e6a52443b	Port -objc-arc-expand to NPM Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D90182	2020-10-26 20:05:10 -07:00
Arthur Eubanks	6085a8c54a	Port -objc-arc-apelim to NPM Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D90181	2020-10-26 20:01:46 -07:00

1 2 3 4 5 ...

205804 Commits