llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Fangrui Song	f77570d407	[llvm-objdump] Replace array_pod_sort with llvm::stable_sort llvm-objdump.cpp has 3 array_pod_sort() calls used for symbolization. array_pod_start() calls qsort() internally and can have different behaviors across different libcs. Use llvm::stable_sort instead. Reviewed By: davidb, thopre Differential Revision: https://reviews.llvm.org/D76739	2020-03-25 08:13:40 -07:00
Sean Fertile	f0b5c97abf	[PowerPC][AIX] ByVal formal arguments in a single register. Adds support for passing ByVal formal arguments as long as they fit in a single register. Differential Revision: https://reviews.llvm.org/D76401	2020-03-25 11:09:40 -04:00
Sanjay Patel	127c0e46d6	[VectorCombine] add shuffle tests; NFC Goes with DD76727.	2020-03-25 10:35:03 -04:00
sstefan1	952aa40b93	OpenMP] Adding InaccessibleMemOnly and InaccessibleMemOrArgMemOnly for runtime calls. Summary: Attempt to add more attributes for runtime calls. Reviewers: jdoerfert Subscribers: guansong, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75010	2020-03-25 14:08:50 +00:00
Kerry McLaughlin	1b630dbb38	[AArch64][SVE] Add SVE intrinsics for masked loads & stores Summary: Implements the following intrinsics for contiguous loads & stores: - @llvm.aarch64.sve.ld1 - @llvm.aarch64.sve.st1 Reviewers: sdesmalen, andwar, efriedma, cameron.mcinally, dancgr, rengolin Reviewed By: cameron.mcinally Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, danielkiss, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76688	2020-03-25 11:48:40 +00:00
Juneyoung Lee	644dc31c55	Rename test name, add more tests for codegenprepare	2020-03-25 20:31:12 +09:00
Sam Parker	6f6eed9671	[ARM][MVE] Add HorizontalReduction flag Add a target flag for instructions that reduce into one, or more, scalar reg(s), including variants of: - VADDV - VABAV - VMINV/VMAXV - VMLADAV Differential Revision: https://reviews.llvm.org/D76683	2020-03-25 11:12:03 +00:00
Simon Tatham	bda88afd34	[ARM,MVE] Add missing tests for vqdmlash intrinsics. Summary: These were accidentally left out of D76123. I added tests for the other three instructions in this small cross-product family (vqdmlah, vqrdmlah, vqrdmlash) but missed this one. Reviewers: miyuki Reviewed By: miyuki Subscribers: kristof.beyls, dmgreen, cfe-commits Tags: #clang Differential Revision: https://reviews.llvm.org/D76714	2020-03-25 09:46:16 +00:00
Juneyoung Lee	01f7b0a379	Add freeze(and x, const) case to codegenprepare's freeze-cmp.ll	2020-03-25 17:29:01 +09:00
Kazushi (Jam) Marukawa	69765715bb	[VE] Change name of enum to CondCode Summary: Change enum name for condition codes from CondCodes to CondCode. Reviewers: arsenm, simoll, k-ishizaka Reviewed By: arsenm Subscribers: wdng, hiraditya, llvm-commits Tags: #llvm, #ve Differential Revision: https://reviews.llvm.org/D76747	2020-03-25 09:20:05 +01:00
Juneyoung Lee	1f40e952e7	Minor fixes to a comment in CodeGenPrepare	2020-03-25 16:34:43 +09:00
Craig Topper	029e75ad61	[X86] Split masked instruction tests to enable D60940. We need to split tests that rely on isel duplicating operations for different masking conditions. Repeating the operation is more costly than emitting the masking separately. The change here is a mechanical splitting of tests that call multiple intrinsics in one function into separate functions that call one intrinsic. We could obviously avoid the splitting by giving the intrinsics different operands, but that would need closer scrutiny than just splitting.	2020-03-24 23:44:16 -07:00
Kai Luo	2d5117a026	[PowerPC] Pre-commit reduced test case for PR45297. NFC.	2020-03-25 06:19:59 +00:00
LLVM GN Syncbot	f619eba211	[gn build] Port ba1f4405c68	2020-03-25 03:27:56 +00:00
QingShan Zhang	7b103ebd13	[NFC][Test][PowerPC] Add one test to verify the behavior of vector mul/add for v8i16	2020-03-25 02:37:26 +00:00
Matt Arsenault	1b734b0a35	AMDGPU/GlobalISel: Add a testcase for G_UNMERGE_VALUES legalization I had a note that this doesn't work, but it seems to now.	2020-03-24 21:54:43 -04:00
Matt Arsenault	5f7ce12de0	AMDGPU/GlobalISel: Add some end to end tests for fma selection	2020-03-24 21:23:37 -04:00
Matt Arsenault	7c485c0fb9	AMDGPU/GlobalISel: Add select patterns for v_and_or_b32	2020-03-24 20:47:54 -04:00
Matt Arsenault	88f96ba9ed	AMDGPU/GlobalISel: Add load legalization tests	2020-03-24 20:41:01 -04:00
Matt Arsenault	c8eead66c5	AMDGPU/GlobalISel: Add missing tests for G_FRINT selection	2020-03-24 20:41:01 -04:00
Adrian Prantl	49996d074b	Add an -object-path-prefix option to dsymutil to remap object file paths (but no source paths) before processing. This is meant to be used for Clang objects where the module cache location was remapped using ``-fdebug-prefix-map``; to help dsymutil find the Clang module cache. <rdar://problem/55685132> Differential Revision: https://reviews.llvm.org/D76391	2020-03-24 17:13:42 -07:00
Matt Arsenault	c9dcead077	GlobalISel: Introduce bitcast legalize action For some operations, the type is unimportant and only the number of bits matters. For example I don't want to treat <4 x s8> as a legal type, but I also don't want to decompose loads of this into smaller pieces to get legal register types. On AMDGPU in SelectionDAG, we legalize a number of operations (most notably load and store) by coercing all types to vectors of i32. For GlobalISel, I'm trying very hard to avoid doing this for every type, but I don't think this strategy can be completely avoided. I'm trying to avoid bitcasts for any legitimately legal type we can operate on, since the intervening bitcasts have proven to be a hassle. For loads, I think I can get away without ever casting the result type, and handling any arbitrary bitwidth during selection (I will eventually want new tablegen support to help with this, rather than having to add every possible type as legal). The unmerge required to do anything with the value should expand to the expected shifts. This is trickier for stores, since it would now require handling a wide array of truncates during selection which I don't want. Future potentially interesting case are for vector indexing, where sub-dword type should be indexed in s32 pieces.	2020-03-24 19:33:33 -04:00
Nikita Popov	4edeaf3fd8	[LVI] Convert some checks to assertions; NFC solveBlockValue() should only be called if the value isn't cached yet. Similarly, it does not make sense to "solve" a constant.	2020-03-24 23:11:13 +01:00
Amara Emerson	9cd6b80807	[AArch64][GlobalISel] Don't localize TLS G_GLOBAL_VALUEs on Darwin. On Darwin these need to be selected into a function call for the TLS address lookup. As a result, they can't be moved below a physreg write, which happens in call sequences. In the long term, we should have some mechanism in the localizer to prevent localizing into target-specific atomic instruction sequences. rdar://60056248 Differential Revision: https://reviews.llvm.org/D76652	2020-03-24 13:35:50 -07:00
Johannes Doerfert	31d276a1c1	[Attributor] Use knowledge retained in llvm.assume (operand bundles) This patch integrates operand bundle llvm.assumes [0] with the Attributor. Most IRAttributes will now look at uses of the associated value and if there are llvm.assume operand bundle uses with the right tag we will check if they are in the must-be-executed-context (around the context instruction). Droppable users, which is currently only llvm::assume, are handled special in some places now as well. [0] http://lists.llvm.org/pipermail/llvm-dev/2019-December/137632.html Reviewed By: uenoku Differential Revision: https://reviews.llvm.org/D74888	2020-03-24 15:33:40 -05:00
Craig Topper	7db5b91467	[X86] Disable autoupgrade support for avx512.mask.broadcasti32x2.* and avx512.mask.broadcastf32x2.*. These intrinsics take a v4i32/v4f32 input and are supposed to broadcast elements 0 and 1. Instead the autoupgrade code was broadcasting elements 0, 1, 2, and 3. I could fix the autoupgrade, but since its been broken for years it seemed better just to steer anyone still trying to use it away completely.	2020-03-24 12:35:24 -07:00
Sanjay Patel	1dafd42317	[VectorCombine] add tests for bitcast (shuffle); NFC	2020-03-24 15:18:32 -04:00
Reid Kleckner	a7424b53fb	Re-land "Avoid emitting unreachable SP adjustments after `throw`" This reverts commit 4e0fe038f438ae1679eae9e156e1f248595b2373. Re-lands 65b21282c710afe9c275778820c6e3c1cf46734b. After landing 5ff5ddd0adc89f8827b345577bbb3e7eb74fc644 to add int3 into trailing unreachable blocks, we can now remove these extra stack adjustments without confusing the Win64 unwinder. See https://llvm.org/45064#c4 or X86AvoidTrailingCall.cpp for a full explanation. Fixes PR45064.	2020-03-24 12:04:43 -07:00
Louis Dionne	e8ebe9cd4f	[lit] Allow passing extra commands to executeShTest This allows creating custom test formats on top of `executeShTest` that inject commands at the beginning of the file being parsed, without requiring these commands to physically appear in the test file itself. For example, one could define a test format that prints out additional debug information at the beginning of each test. More realistically, this has been used to define custom test formats like one that supports compilation failure tests (e.g. with the extension `compile.fail.cpp`) by injecting a command that calls the compiler on the file itself and expects it to fail. Without this change, the only alternative is to create a temporary file with the same content as the original test, then prepend the desired `// RUN:` lines to that file, and call `executeShTest` on that file instead. This is both slow and cumbersome to do. Differential Revision: https://reviews.llvm.org/D76290	2020-03-24 15:02:37 -04:00
Vedant Kumar	baf8348499	[DWARF] Emit DW_AT_call_pc for tail calls Record the address of a tail-calling branch instruction within its call site entry using DW_AT_call_pc. This allows a debugger to determine the address to use when creating aritificial frames. This creates an extra attribute + relocation at tail call sites, which constitute 3-5% of all call sites in xnu/clang respectively. rdar://60307600 Differential Revision: https://reviews.llvm.org/D76336	2020-03-24 12:01:55 -07:00
Louis Dionne	0bf724260d	NFC: Fix typos in TestingGuide documentation	2020-03-24 14:54:55 -04:00
Louis Dionne	eaf303b7d3	[lit] NFC: Document missing result codes These result codes already exist, but they were not documented. I assume this is an oversight when adding these result codes.	2020-03-24 14:46:54 -04:00
Juneyoung Lee	49bbd5d17a	[DivRemPairs] Freeze operands if they can be undef values Summary: DivRemPairs is unsound with respect to undef values. ``` // bb1: // %rem = srem %x, %y // bb2: // %div = sdiv %x, %y // --> // bb1: // %div = sdiv %x, %y // %mul = mul %div, %y // %rem = sub %x, %mul ``` If X can be undef, X should be frozen first. For example, let's assume that Y = 1 & X = undef: ``` %div = sdiv undef, 1 // %div = undef %rem = srem undef, 1 // %rem = 0 => %div = sdiv undef, 1 // %div = undef %mul = mul %div, 1 // %mul = undef %rem = sub %x, %mul // %rem = undef - undef = undef ``` http://volta.cs.utah.edu:8080/z/m7Xrx5 Same for Y. If X = 1 and Y = (undef \| 1), %rem in src is either 1 or 0, but %rem in tgt can be one of many integer values. This resolves https://bugs.llvm.org/show_bug.cgi?id=42619 . This miscompilation disappears if undef value is removed, but it may take a while. DivRemPair happens pretty late during the optimization pipeline, so this optimization seemed as a good candidate to fix without major regression using freeze than other broken optimizations. Reviewers: spatel, lebedev.ri, george.burgess.iv Reviewed By: spatel Subscribers: wuzish, regehr, nlopes, nemanjai, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76483	2020-03-25 03:46:14 +09:00
Benjamin Kramer	43c419e4aa	[SelectionDAG] Don't crash when freezing illegal float types	2020-03-24 19:45:19 +01:00
Simon Pilgrim	7433341c72	[X86][AVX] Add some v32i16 to v32i8 style truncation shuffle tests	2020-03-24 18:38:13 +00:00
Matt Arsenault	962631ded7	AMDGPU/GlobalISel: Add more tests for add3 folding Forget to squash into 2ea46051055b37faf95c58daad57608bb7610f58	2020-03-24 14:30:24 -04:00
Matt Arsenault	7ae688c9cc	AMDGPU/GlobalISel: Add some more tests for add3 folding These currently fail to form add3 due to the pointer type, but they should be handle.	2020-03-24 14:26:23 -04:00
Matt Arsenault	66c5ce183c	AMDGPU/GlobalISel: Fix smrd loads of v4i64	2020-03-24 13:44:41 -04:00
Sanjay Patel	446f29b2c2	[ValueTracking] improve undef/poison analysis for constant vectors Differential Revision: https://reviews.llvm.org/D76702	2020-03-24 13:35:47 -04:00
LLVM GN Syncbot	caa4525525	[gn build] Port b91905a2637	2020-03-24 16:46:53 +00:00
Hiroshi Yamauchi	2be2675a03	Revert "Include static prof data when collecting loop BBs" This reverts commit 129c911efaa492790c251b3eb18e4db36b55cbc5. Due to an internal benchmark regression.	2020-03-24 09:41:16 -07:00
Nico Weber	b5e7c63121	[gn build] (manually) port 8140f6bcde4 better	2020-03-24 12:39:49 -04:00
Nico Weber	45c217b8bb	[gn build] (manually) port 8140f6bcde4	2020-03-24 12:38:25 -04:00
Nico Weber	8375fac139	[gn build] Port 49e5a97ec36	2020-03-24 12:36:08 -04:00
David Green	b8d11aabd8	[ARM] Fold VMOVrh VLDR to LDRH This adds a simple fold to combine VMOVrh load to a integer load. Similar to what is already performed for BITCAST, but needs to account for the types being of different sizes, creating an zero extending load. Differential Revision: https://reviews.llvm.org/D76485	2020-03-24 15:51:03 +00:00
Sanjay Patel	f8f875fc08	[InstSimplify] add tests for freeze(constexpr); NFC	2020-03-24 11:39:19 -04:00
Lama	5cc4cf3bbd	[MachinePipeliner] Fix a bug in Output Dependency chains The current implementation collects all Preds/Succs of a Dep of kind Output, creating a long chain and subsequently a schedule with an unnecessarily large II. Was this done on purpose for a reason I'm missing? Reviewed By: bcahoon Differential Revision: https://reviews.llvm.org/D75424	2020-03-24 14:37:50 +00:00
Simon Pilgrim	eca2dede42	[X86][SSE1] Add support for logic+movmsk patterns (PR42870) rL368506 handled the basic case, but we need to account for boolean logic patterns as well.	2020-03-24 14:28:40 +00:00
Pavel Labath	58fd4ef93a	[DWARF] Fix v5 debug_line parsing of prologues with many files Summary: The directory_count and file_name_count fields are (section 6.2.4 of DWARF5 spec) supposed to be uleb128s, not bytes. This bug meant that it was not possible to correctly parse headers with more than 128 files or directories. I've found this bug by code inspection, though the limit is so small someone would have run into it for real sooner or later. I've verified that the producer side handles many files correctly, and that we are able to parse such files after this fix. Reviewers: dblaikie, jhenderson Subscribers: aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76498	2020-03-24 15:11:54 +01:00
Juneyoung Lee	c23deb9eda	[SelDag] Add FREEZE Summary: - Add FREEZE node to SelDag - Lower FreezeInst (in IR) to FREEZE node - Add Legalization for FREEZE node Reviewers: qcolombet, bogner, efriedma, lebedev.ri, nlopes, craig.topper, arsenm Reviewed By: lebedev.ri Subscribers: wdng, xbolva00, Petar.Avramovic, liuz, lkail, dylanmckay, hiraditya, Jim, arsenm, craig.topper, RKSimon, spatel, lebedev.ri, regehr, trentxintong, nlopes, mkuper, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D29014	2020-03-24 23:04:58 +09:00

1 2 3 4 5 ...

193925 Commits