llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Nico Weber	47138b2f83	Suppress a few -Wunreachable-code warnings. No behavior change. Also fix a comment to say match reality.	2020-03-25 13:55:42 -04:00
Simon Pilgrim	18ad2e6d54	[X86][AVX] Combine shuffles to TRUNCATE/VTRUNC patterns Add support for combining shuffles to AVX512 truncate instructions - another step toward fixing D56387/D66004. It also fixes SKX code on PR31443. We could probably extend this further to handle non-VLX truncation cases.	2020-03-25 17:41:51 +00:00
Gil Rapaport	41cef53616	[LV] Replace stored value with a VPValue (NFCI) InnerLoopVectorizer's code called during VPlan execution still relies on original IR's def-use relations to decide which vector code to generate, limiting VPlan transformations ability to modify def-use relations and still have ILV generate the vector code. This commit introduces a VPValue for VPWidenMemoryInstructionRecipe to use as the stored value. The recipe is generated with a VPValue wrapping the stored value of the scalar store. This reduces ingredient def-use usage by ILV as a step towards full VPlan-based def-use relations. Differential Revision: https://reviews.llvm.org/D76373	2020-03-25 19:36:55 +02:00
Tyker	e193e13b1b	[NFC] Rename function to match Coding Convention and fix typo in KnowledgeRetention	2020-03-25 18:31:13 +01:00
Nico Weber	9ce67593a9	[gn build] try removing a duplicate include dir	2020-03-25 13:23:47 -04:00
Mikhail Maltsev	3301f6f1d2	[ARM,CDE] Implement predicated Q-register CDE intrinsics Summary: This patch implements the following CDE intrinsics: T __arm_vcx1q_m(int coproc, T inactive, uint32_t imm, mve_pred_t p); T __arm_vcx2q_m(int coproc, T inactive, U n, uint32_t imm, mve_pred_t p); T __arm_vcx3q_m(int coproc, T inactive, U n, V m, uint32_t imm, mve_pred_t p); T __arm_vcx1qa_m(int coproc, T acc, uint32_t imm, mve_pred_t p); T __arm_vcx2qa_m(int coproc, T acc, U n, uint32_t imm, mve_pred_t p); T __arm_vcx3qa_m(int coproc, T acc, U n, V m, uint32_t imm, mve_pred_t p); The intrinsics are not part of the released ACLE spec, but internally at Arm we have reached consensus to add them to the next ACLE release. Reviewers: simon_tatham, MarkMurrayARM, ostannard, dmgreen Reviewed By: simon_tatham Subscribers: kristof.beyls, hiraditya, danielkiss, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76610	2020-03-25 17:08:19 +00:00
Sam McCall	a95e919d47	[clangd] Support multiple cursors in selectionRange. Summary: One change: because there's no way to signal failure individually for each cursor, we now "succeed" with an empty range with no parent if a cursor doesn't point at anything. Reviewers: usaxena95 Subscribers: ilya-biryukov, MaskRay, jkorous, arphaman, kadircet, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76741	2020-03-25 17:59:09 +01:00
Yvan Roux	c9838f5f9f	[ARM] Move ConstantIsland and LowOverheadLoops Passes. Move ARM ConstantIsland and LowOverheadLopps passes later in the pipeline such that they will be run after the upcoming Machine Outlining pass. Differential Revision: https://reviews.llvm.org/D76065	2020-03-25 16:49:21 +01:00
LLVM GN Syncbot	6d7506ae1e	[gn build] Port ce984129eaa	2020-03-25 15:36:51 +00:00
cdevadas	63a2b80308	[AMDGPU] Add SIPreEmitPeephole pass. This pass can handle all the optimization opportunities found just before code emission. Presently it includes the handling of vcc branch optimization that was handled earlier in SIInsertSkips. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D76712	2020-03-25 15:35:35 +00:00
Simon Pilgrim	7dbe7d4c84	[X86][AVX] Add common prefix to merge 32/64-bit AVX1 checks	2020-03-25 15:33:58 +00:00
Jonas Paulsson	8975f60913	[SystemZ] Improve foldMemoryOperandImpl() A spilled load of an immediate can use MVHI/MVGHI instead. A compare of a spilled register against an immediate can use CHSI/CGHSI. A logical compare can use CLFHSI/CLGHSI. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D76055	2020-03-25 16:21:08 +01:00
Fangrui Song	f77570d407	[llvm-objdump] Replace array_pod_sort with llvm::stable_sort llvm-objdump.cpp has 3 array_pod_sort() calls used for symbolization. array_pod_start() calls qsort() internally and can have different behaviors across different libcs. Use llvm::stable_sort instead. Reviewed By: davidb, thopre Differential Revision: https://reviews.llvm.org/D76739	2020-03-25 08:13:40 -07:00
Sean Fertile	f0b5c97abf	[PowerPC][AIX] ByVal formal arguments in a single register. Adds support for passing ByVal formal arguments as long as they fit in a single register. Differential Revision: https://reviews.llvm.org/D76401	2020-03-25 11:09:40 -04:00
Sanjay Patel	127c0e46d6	[VectorCombine] add shuffle tests; NFC Goes with DD76727.	2020-03-25 10:35:03 -04:00
sstefan1	952aa40b93	OpenMP] Adding InaccessibleMemOnly and InaccessibleMemOrArgMemOnly for runtime calls. Summary: Attempt to add more attributes for runtime calls. Reviewers: jdoerfert Subscribers: guansong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75010	2020-03-25 14:08:50 +00:00
Kerry McLaughlin	1b630dbb38	[AArch64][SVE] Add SVE intrinsics for masked loads & stores Summary: Implements the following intrinsics for contiguous loads & stores: - @llvm.aarch64.sve.ld1 - @llvm.aarch64.sve.st1 Reviewers: sdesmalen, andwar, efriedma, cameron.mcinally, dancgr, rengolin Reviewed By: cameron.mcinally Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76688	2020-03-25 11:48:40 +00:00
Juneyoung Lee	644dc31c55	Rename test name, add more tests for codegenprepare	2020-03-25 20:31:12 +09:00
Sam Parker	6f6eed9671	[ARM][MVE] Add HorizontalReduction flag Add a target flag for instructions that reduce into one, or more, scalar reg(s), including variants of: - VADDV - VABAV - VMINV/VMAXV - VMLADAV Differential Revision: https://reviews.llvm.org/D76683	2020-03-25 11:12:03 +00:00
Simon Tatham	bda88afd34	[ARM,MVE] Add missing tests for vqdmlash intrinsics. Summary: These were accidentally left out of D76123. I added tests for the other three instructions in this small cross-product family (vqdmlah, vqrdmlah, vqrdmlash) but missed this one. Reviewers: miyuki Reviewed By: miyuki Subscribers: kristof.beyls, dmgreen, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76714	2020-03-25 09:46:16 +00:00
Juneyoung Lee	01f7b0a379	Add freeze(and x, const) case to codegenprepare's freeze-cmp.ll	2020-03-25 17:29:01 +09:00
Kazushi (Jam) Marukawa	69765715bb	[VE] Change name of enum to CondCode Summary: Change enum name for condition codes from CondCodes to CondCode. Reviewers: arsenm, simoll, k-ishizaka Reviewed By: arsenm Subscribers: wdng, hiraditya, llvm-commits Tags: #llvm, #ve Differential Revision: https://reviews.llvm.org/D76747	2020-03-25 09:20:05 +01:00
Juneyoung Lee	1f40e952e7	Minor fixes to a comment in CodeGenPrepare	2020-03-25 16:34:43 +09:00
Craig Topper	029e75ad61	[X86] Split masked instruction tests to enable D60940. We need to split tests that rely on isel duplicating operations for different masking conditions. Repeating the operation is more costly than emitting the masking separately. The change here is a mechanical splitting of tests that call multiple intrinsics in one function into separate functions that call one intrinsic. We could obviously avoid the splitting by giving the intrinsics different operands, but that would need closer scrutiny than just splitting.	2020-03-24 23:44:16 -07:00
Kai Luo	2d5117a026	[PowerPC] Pre-commit reduced test case for PR45297. NFC.	2020-03-25 06:19:59 +00:00
LLVM GN Syncbot	f619eba211	[gn build] Port ba1f4405c68	2020-03-25 03:27:56 +00:00
QingShan Zhang	7b103ebd13	[NFC][Test][PowerPC] Add one test to verify the behavior of vector mul/add for v8i16	2020-03-25 02:37:26 +00:00
Matt Arsenault	1b734b0a35	AMDGPU/GlobalISel: Add a testcase for G_UNMERGE_VALUES legalization I had a note that this doesn't work, but it seems to now.	2020-03-24 21:54:43 -04:00
Matt Arsenault	5f7ce12de0	AMDGPU/GlobalISel: Add some end to end tests for fma selection	2020-03-24 21:23:37 -04:00
Matt Arsenault	7c485c0fb9	AMDGPU/GlobalISel: Add select patterns for v_and_or_b32	2020-03-24 20:47:54 -04:00
Matt Arsenault	88f96ba9ed	AMDGPU/GlobalISel: Add load legalization tests	2020-03-24 20:41:01 -04:00
Matt Arsenault	c8eead66c5	AMDGPU/GlobalISel: Add missing tests for G_FRINT selection	2020-03-24 20:41:01 -04:00
Adrian Prantl	49996d074b	Add an -object-path-prefix option to dsymutil to remap object file paths (but no source paths) before processing. This is meant to be used for Clang objects where the module cache location was remapped using ``-fdebug-prefix-map``; to help dsymutil find the Clang module cache. <rdar://problem/55685132> Differential Revision: https://reviews.llvm.org/D76391	2020-03-24 17:13:42 -07:00
Matt Arsenault	c9dcead077	GlobalISel: Introduce bitcast legalize action For some operations, the type is unimportant and only the number of bits matters. For example I don't want to treat <4 x s8> as a legal type, but I also don't want to decompose loads of this into smaller pieces to get legal register types. On AMDGPU in SelectionDAG, we legalize a number of operations (most notably load and store) by coercing all types to vectors of i32. For GlobalISel, I'm trying very hard to avoid doing this for every type, but I don't think this strategy can be completely avoided. I'm trying to avoid bitcasts for any legitimately legal type we can operate on, since the intervening bitcasts have proven to be a hassle. For loads, I think I can get away without ever casting the result type, and handling any arbitrary bitwidth during selection (I will eventually want new tablegen support to help with this, rather than having to add every possible type as legal). The unmerge required to do anything with the value should expand to the expected shifts. This is trickier for stores, since it would now require handling a wide array of truncates during selection which I don't want. Future potentially interesting case are for vector indexing, where sub-dword type should be indexed in s32 pieces.	2020-03-24 19:33:33 -04:00
Nikita Popov	4edeaf3fd8	[LVI] Convert some checks to assertions; NFC solveBlockValue() should only be called if the value isn't cached yet. Similarly, it does not make sense to "solve" a constant.	2020-03-24 23:11:13 +01:00
Amara Emerson	9cd6b80807	[AArch64][GlobalISel] Don't localize TLS G_GLOBAL_VALUEs on Darwin. On Darwin these need to be selected into a function call for the TLS address lookup. As a result, they can't be moved below a physreg write, which happens in call sequences. In the long term, we should have some mechanism in the localizer to prevent localizing into target-specific atomic instruction sequences. rdar://60056248 Differential Revision: https://reviews.llvm.org/D76652	2020-03-24 13:35:50 -07:00
Johannes Doerfert	31d276a1c1	[Attributor] Use knowledge retained in llvm.assume (operand bundles) This patch integrates operand bundle llvm.assumes [0] with the Attributor. Most IRAttributes will now look at uses of the associated value and if there are llvm.assume operand bundle uses with the right tag we will check if they are in the must-be-executed-context (around the context instruction). Droppable users, which is currently only llvm::assume, are handled special in some places now as well. [0] http://lists.llvm.org/pipermail/llvm-dev/2019-December/137632.html Reviewed By: uenoku Differential Revision: https://reviews.llvm.org/D74888	2020-03-24 15:33:40 -05:00
Craig Topper	7db5b91467	[X86] Disable autoupgrade support for avx512.mask.broadcasti32x2.* and avx512.mask.broadcastf32x2.*. These intrinsics take a v4i32/v4f32 input and are supposed to broadcast elements 0 and 1. Instead the autoupgrade code was broadcasting elements 0, 1, 2, and 3. I could fix the autoupgrade, but since its been broken for years it seemed better just to steer anyone still trying to use it away completely.	2020-03-24 12:35:24 -07:00
Sanjay Patel	1dafd42317	[VectorCombine] add tests for bitcast (shuffle); NFC	2020-03-24 15:18:32 -04:00
Reid Kleckner	a7424b53fb	Re-land "Avoid emitting unreachable SP adjustments after `throw`" This reverts commit 4e0fe038f438ae1679eae9e156e1f248595b2373. Re-lands 65b21282c710afe9c275778820c6e3c1cf46734b. After landing 5ff5ddd0adc89f8827b345577bbb3e7eb74fc644 to add int3 into trailing unreachable blocks, we can now remove these extra stack adjustments without confusing the Win64 unwinder. See https://llvm.org/45064#c4 or X86AvoidTrailingCall.cpp for a full explanation. Fixes PR45064.	2020-03-24 12:04:43 -07:00
Louis Dionne	e8ebe9cd4f	[lit] Allow passing extra commands to executeShTest This allows creating custom test formats on top of `executeShTest` that inject commands at the beginning of the file being parsed, without requiring these commands to physically appear in the test file itself. For example, one could define a test format that prints out additional debug information at the beginning of each test. More realistically, this has been used to define custom test formats like one that supports compilation failure tests (e.g. with the extension `compile.fail.cpp`) by injecting a command that calls the compiler on the file itself and expects it to fail. Without this change, the only alternative is to create a temporary file with the same content as the original test, then prepend the desired `// RUN:` lines to that file, and call `executeShTest` on that file instead. This is both slow and cumbersome to do. Differential Revision: https://reviews.llvm.org/D76290	2020-03-24 15:02:37 -04:00
Vedant Kumar	baf8348499	[DWARF] Emit DW_AT_call_pc for tail calls Record the address of a tail-calling branch instruction within its call site entry using DW_AT_call_pc. This allows a debugger to determine the address to use when creating aritificial frames. This creates an extra attribute + relocation at tail call sites, which constitute 3-5% of all call sites in xnu/clang respectively. rdar://60307600 Differential Revision: https://reviews.llvm.org/D76336	2020-03-24 12:01:55 -07:00
Louis Dionne	0bf724260d	NFC: Fix typos in TestingGuide documentation	2020-03-24 14:54:55 -04:00
Louis Dionne	eaf303b7d3	[lit] NFC: Document missing result codes These result codes already exist, but they were not documented. I assume this is an oversight when adding these result codes.	2020-03-24 14:46:54 -04:00
Juneyoung Lee	49bbd5d17a	[DivRemPairs] Freeze operands if they can be undef values Summary: DivRemPairs is unsound with respect to undef values. ``` // bb1: // %rem = srem %x, %y // bb2: // %div = sdiv %x, %y // --> // bb1: // %div = sdiv %x, %y // %mul = mul %div, %y // %rem = sub %x, %mul ``` If X can be undef, X should be frozen first. For example, let's assume that Y = 1 & X = undef: ``` %div = sdiv undef, 1 // %div = undef %rem = srem undef, 1 // %rem = 0 => %div = sdiv undef, 1 // %div = undef %mul = mul %div, 1 // %mul = undef %rem = sub %x, %mul // %rem = undef - undef = undef ``` http://volta.cs.utah.edu:8080/z/m7Xrx5 Same for Y. If X = 1 and Y = (undef \| 1), %rem in src is either 1 or 0, but %rem in tgt can be one of many integer values. This resolves https://bugs.llvm.org/show_bug.cgi?id=42619 . This miscompilation disappears if undef value is removed, but it may take a while. DivRemPair happens pretty late during the optimization pipeline, so this optimization seemed as a good candidate to fix without major regression using freeze than other broken optimizations. Reviewers: spatel, lebedev.ri, george.burgess.iv Reviewed By: spatel Subscribers: wuzish, regehr, nlopes, nemanjai, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76483	2020-03-25 03:46:14 +09:00
Benjamin Kramer	43c419e4aa	[SelectionDAG] Don't crash when freezing illegal float types	2020-03-24 19:45:19 +01:00
Simon Pilgrim	7433341c72	[X86][AVX] Add some v32i16 to v32i8 style truncation shuffle tests	2020-03-24 18:38:13 +00:00
Matt Arsenault	962631ded7	AMDGPU/GlobalISel: Add more tests for add3 folding Forget to squash into 2ea46051055b37faf95c58daad57608bb7610f58	2020-03-24 14:30:24 -04:00
Matt Arsenault	7ae688c9cc	AMDGPU/GlobalISel: Add some more tests for add3 folding These currently fail to form add3 due to the pointer type, but they should be handle.	2020-03-24 14:26:23 -04:00
Matt Arsenault	66c5ce183c	AMDGPU/GlobalISel: Fix smrd loads of v4i64	2020-03-24 13:44:41 -04:00

1 2 3 4 5 ...

193837 Commits