llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Matt Arsenault	c6e1c3ad9f	AMDGPU: Forgot to commit test file for r358890 llvm-svn: 359885	2019-05-03 13:55:40 +00:00
Simon Pilgrim	2628d88f52	Avoid cppcheck operator precedence warnings. NFCI. Prefer ((X & Y) ? A : B) to (X & Y ? A : B) llvm-svn: 359884	2019-05-03 13:50:38 +00:00
Matt Arsenault	9606e8658d	AMDGPU: Fix incorrect commute with sub when folding immediates When a fold of an immediate into a sub/subrev required shrinking the instruction, the wrong VOP2 opcode was used. This was using the VOP2 equivalent of the original instruction, not the commuted instruction with the inverted opcode. llvm-svn: 359883	2019-05-03 13:42:56 +00:00
Matt Arsenault	0ce08418dc	AMDGPU: Fix test verification This should run the verifier, and needs to enable trackRegLiveness. llvm-svn: 359882	2019-05-03 13:42:55 +00:00
Sanjay Patel	67b626ea1e	[LICM] auto-generate complete test checks; NFC llvm-svn: 359881	2019-05-03 13:25:06 +00:00
Sanjay Patel	93f4e020e6	[CodeGenPrepare] limit overflow intrinsic matching to a single basic block Using/updating a dominator tree to match math overflow patterns may be very expensive in compile-time (because of the way CGP uses a DT), so just handle the single-block case. Also, we were restarting the iterator loops when doing the overflow intrinsic transforms by marking the dominator tree for update. That was done to prevent iterating over a removed instruction. But we can postpone the deletion using the existing "RemovedInsts" structure, and that means we don't need to update the DT. See post-commit thread for rL354298 for more details: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20190422/646276.html Differential Revision: https://reviews.llvm.org/D61075 llvm-svn: 359879	2019-05-03 13:09:18 +00:00
Sean Fertile	b7773a156f	[Object][XCOFF] Add an XCOFF dumper for llvm-readobj. Patch adds support for dumping of file headers with llvm-readobj. XCOFF object files are added to test dumping a well formed file, and dumping both negative timestamps and negative symbol counts, both of which are allowed in the XCOFF definition. Differential Revision: https://reviews.llvm.org/D60878 llvm-svn: 359878	2019-05-03 12:57:07 +00:00
Simon Pilgrim	72f050c770	[TargetLowering] expandUnalignedStore - cleanup EVT variables. NFCI. Avoid duplicated EVTs and rename Store/Load VTs to avoid -Wshadow warnings. llvm-svn: 359877	2019-05-03 12:55:25 +00:00
Anton Afanasyev	c7acc390c4	Revert "[MIR] Add simple PRE pass to MachineCSE" This reverts commit 9c20156de39b377190d7a91783d61877b303fe35. It breaks stage 2 of clang-ppc64be-linux-multistage. llvm-svn: 359875	2019-05-03 12:36:22 +00:00
Simon Pilgrim	194aa9f9b3	[SelectionDAG] Use INT_MIN as (1 << 31) is UB for signed integers. NFCI. llvm-svn: 359873	2019-05-03 11:32:00 +00:00
Simon Pilgrim	c94b6a071b	[SelectionDAG] computeKnownBits - remove some duplicate/shadow variables. NFCI. llvm-svn: 359872	2019-05-03 11:11:03 +00:00
Simon Pilgrim	6b0005160a	[X86] LowerMULH - remove unused Lo/Hi vector indices. NFCI. Leftover from before we had the extract128BitVector helpers. llvm-svn: 359871	2019-05-03 10:32:07 +00:00
Anton Afanasyev	e767893524	[MIR] Add simple PRE pass to MachineCSE This is the second part of the commit fixing PR38917 (hoisting partially redundant machine instructions). Most of PRE (partial redundancy elimination) and CSE work is done on LLVM IR, but some redundancy arises during DAG legalization. Machine CSE is not enough to deal with it. This simple PRE implementation works a little bit intricately: it runs before CSE, looking for partial redundancy and transforming it into full redundancy, anticipating that the next CSE step will eliminate this created redundancy. If CSE doesn't eliminate it, then the created instruction will remain dead and be eliminated later by the Remove Dead Machine Instructions pass. The third part of the commit is supposed to refactor MachineCSE, to make it clearer and to merge MachinePRE with MachineCSE, so one need not rely on a further Remove Dead pass to clear instrs not eliminated by CSE. First step: https://reviews.llvm.org/D54839 Fixes llvm.org/PR38917 Reviewers: RKSimon Subscribers: hfinkel, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D56772 llvm-svn: 359870	2019-05-03 10:30:59 +00:00
Simon Pilgrim	ebc0f7236b	Reduce variable scope to just the if() block it's actually used in. NFCI. llvm-svn: 359869	2019-05-03 10:13:41 +00:00
Craig Topper	ba89375968	[X86] Add more one checks to masked compare patterns that were missed in r358358. This covers the patterns we use for widening 128/256 comparisons to 512-bit when AVX512VL isn't supported. llvm-svn: 359863	2019-05-03 07:14:05 +00:00
Igor Kudrin	17113999e5	[docs] Add support for Markdown documentation when creating man pages rL358749 added a documentation page in the Markdown format. Currently, such pages are ignored in the configuration script for manual pages. This patch fixes that. Differential Revision: https://reviews.llvm.org/D60964 llvm-svn: 359860	2019-05-03 05:11:48 +00:00
Quentin Colombet	cf8d0ecc05	[IRTranslator] Use the alloc size instead of the store size when translating allocas We used to incorrectly use the store size instead of the alloc size when creating the stack slot for allocas. On aarch64 this can be demonstrated by allocating weirdly sized types. For instance, in the added test case, we use an alloca for i19. We used to allocate a slot of size 24-bit (19 rounded up to the next byte), whereas we really want to use a full 32-bit slot for this type. llvm-svn: 359856	2019-05-03 01:23:56 +00:00
Eli Friedman	f714f84cd7	[AArch64][MC] Reject "add x0, x1, w2, lsl #1 " etc. Looks like just a minor oversight in the parsing code. Fixes https://bugs.llvm.org/show_bug.cgi?id=41504. Differential Revision: https://reviews.llvm.org/D60840 llvm-svn: 359855	2019-05-03 00:59:52 +00:00
Eric Christopher	7706a4c905	Tidy up a comment, fix a typo, remove a comment that's obsolete. llvm-svn: 359852	2019-05-03 00:15:23 +00:00
Eli Friedman	92fda57ccc	[AArch64][Windows] Compute function length correctly in unwind tables. The primary fix here is to WinException.cpp: we need to exclude jump tables when computing the length of a function, or else we fail to correctly compute the length. (We can only compute the number of bytes consumed by certain assembler directives after the entire file is parsed. ".p2align" is one of those directives, and is used by jump table generation.) The secondary fix, to MCWin64EH, is to make sure we don't silently miscompile if we hit a similar situation in the future. It's possible we could extend ARM64EmitUnwindInfo so it allows function bodies that contain assembler directives, but that's a lot more complicated; see the FIXME in MCWin64EH.cpp. Fixes https://bugs.llvm.org/show_bug.cgi?id=41581 . Differential Revision: https://reviews.llvm.org/D61095 llvm-svn: 359849	2019-05-03 00:10:45 +00:00
Alina Sbirlea	abd3f639d8	[MemorySSA] Check that block is reachable when adding phis. Summary: Originally the insertDef method was only used when building MemorySSA, and was limiting the number of Phi nodes that it created. Now it's used for updates as well, and it can create additional Phis needed for correctness. Make sure no Phis are created in unreachable blocks (condition met during MSSA build), otherwise the renamePass will find a null DTNode. Resolves PR41640. Reviewers: george.burgess.iv Subscribers: jlebar, Prazek, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61410 llvm-svn: 359845	2019-05-02 23:41:58 +00:00
Alina Sbirlea	45212263d0	[MemorySSA] Refactor removing multiple trivial phis [NFC]. Summary: Create a method to clean up multiple potentially trivial phis, since we will need this often. Reviewers: george.burgess.iv Subscribers: jlebar, Prazek, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61471 llvm-svn: 359842	2019-05-02 23:12:49 +00:00
Craig Topper	bdd0a08703	[X86] Remove LEA16r references from X86FixupLEAs. NFCI As far as I know, we never emit LEA16r llvm-svn: 359840	2019-05-02 22:46:23 +00:00
Craig Topper	4c479b950c	[X86] Correct the register class for specific mask register constraints in getRegForInlineAsmConstraint when the VT is a scalar type The default implementation in the base class for TargetLowering::getRegForInlineAsmConstraint doesn't work for mask registers when the VT is a scalar integer type, since the only legal mask types are vXi1. So we end up just getting whatever the first register class that contains the register is. Currently this appears to be VK1, but it's really dependent on the order tablegen outputs the register classes. Some code in the caller ends up looking up the type for this register class, finds v1i1, and then generates a copyfromreg from the physical k-register with the v1i1 type. Then it generates an any_extend from v1i1 to the scalar VT, which isn't legal. This bad any_extend sticks around until isel, where it selects a MOVZX32rr8 with a v1i1 input or maybe an i8 input. Not sure, but eventually we pick up a copy from VK1 to GR8 in MachineIR, which isn't supported. This leads to a failure in physical register copying. This patch uses the scalar type to find a VK class of the right size. In the attached test case this will be VK16. This causes a bitcast from vk16 to i16 to be generated instead of an any_extend. This will be properly iseled to a VK16 to GR32 copy and a GR32->GR16 extract_subreg. Fixes PR41678 Differential Revision: https://reviews.llvm.org/D61453 llvm-svn: 359837	2019-05-02 22:26:40 +00:00
Craig Topper	353bb1a1d2	[SelectionDAG] Add asserts to verify the vectorness of input and output types of TRUNCATE/ZERO_EXTEND/ANY_EXTEND/SIGN_EXTEND agree As a result of the underlying cause of PR41678 we created an ANY_EXTEND node with a scalar result type and v1i1 input type. Ideally we would have asserted for this instead of letting it go through to instruction selection and generate bad machine IR Differential Revision: https://reviews.llvm.org/D61463 llvm-svn: 359836	2019-05-02 22:26:26 +00:00
Evandro Menezes	46d8190766	[AArch64] Update for Exynos Fix the forwarding of multiplication results for Exynos M4. llvm-svn: 359834	2019-05-02 22:01:39 +00:00
Craig Topper	0af3e0ceab	[X86] Remove string literal from an if. NFC This if used to be an assert that got refactored into an if, but left the string literal behind. Fixes PR41718 llvm-svn: 359833	2019-05-02 21:57:18 +00:00
Jordan Rupprecht	124c73d901	Revert [ThinLTO] Fix X86/strong_non_prevailing.ll after llvm-nm 'r' change This reverts r359314 (git commit 5015aa854dc043b2ae5d95e04e00d98518207ce5) llvm-svn: 359831	2019-05-02 21:48:04 +00:00
Jordan Rupprecht	dc435f6b24	Revert [llvm-nm] Fix handling of symbol types + [llvm-nm] Generalize symbol types This reverts r359311 and r359312 (git commit 0bf06a8f59b0074a60871865e828d92db8930c59 and 5f184f17800ea2ac27be5e4ab540cb94a46e80c7) llvm-svn: 359830	2019-05-02 21:42:46 +00:00
Nico Weber	ce03ba5572	lld-link: Add /force:multipleres extension to make dupe resource diag non-fatal As a side benefit, lld-link now reports more than one duplicate resource entry before exiting with an error even if the new flag is not passed. llvm-svn: 359829	2019-05-02 21:21:55 +00:00
Petr Hosek	a954e23d64	[gn] Include the missing BUILD.gn file for libcxxabi includes This was omitted in r359805. Differential Revision: https://reviews.llvm.org/D61462 llvm-svn: 359828	2019-05-02 21:20:08 +00:00
Petr Hosek	296e01372b	[gn] Update the clangd test lit site configuration This reflects changes made in r359763. llvm-svn: 359825	2019-05-02 20:34:54 +00:00
Eric Christopher	56e65fa084	Typo Functino->Function. llvm-svn: 359821	2019-05-02 19:49:35 +00:00
Saleem Abdulrasool	3c0a69a728	build: add option to disable unwind tables The unwind tables (`.eh_frame`, `.arm.extab`) add a significant chunk of data to the final binaries. These should not be needed normally, particularly when exceptions are disabled. This enables shrinking `lldb-server` by ~18% (3 MiB) when built with gold. llvm-svn: 359819	2019-05-02 19:37:26 +00:00
George Rimar	e09eddae5d	[yaml2obj] - Make interface of `NameToIdxMap` class be human friendly and fix users. This patch inverses the values returned by `addName` and `lookup` methods of the class mentioned so that they now return true on success and false on failure. Also, it does minor code cleanup. Differential revision: https://reviews.llvm.org/D61190 llvm-svn: 359818	2019-05-02 19:28:04 +00:00
Petr Hosek	52777ec83d	[gn] Include libcxx configuration file This was omitted in r359806 but is already referenced in the GN build. llvm-svn: 359815	2019-05-02 19:07:23 +00:00
Petr Hosek	fdc10c3a09	[gn] Support for building libc++ This change introduces support for building libc++. The library build should be complete, but not all CMake options have been replicated in GN. We also don't support tests yet. We only support two stage build at the moment. Differential Revision: https://reviews.llvm.org/D61143 llvm-svn: 359806	2019-05-02 17:29:41 +00:00
Petr Hosek	180a8a461c	[gn] Support for building libcxxabi This change introduces support for building libcxxabi. The library build should be complete, but not all CMake options have been replicated in GN. We also don't support tests yet. We only support two stage build at the moment. Differential Revision: https://reviews.llvm.org/D60372 llvm-svn: 359805	2019-05-02 17:29:39 +00:00
Petr Hosek	89e4e6043e	[gn] Support for building libunwind This change introduces support for building libunwind. The library build should be complete, but not all CMake options have been replicated in GN. We also don't support tests yet. We only support two stage build at the moment. Differential Revision: https://reviews.llvm.org/D60370 llvm-svn: 359804	2019-05-02 17:29:37 +00:00
Sanjay Patel	25fd630e98	[DAGCombiner] try repeated fdiv divisor transform before building estimate (2nd try) The original patch was committed at rL359398 and reverted at rL359695 because of infinite looping. This includes a fix to check for a vector splat of "1.0" to avoid the infinite loop. Original commit message: This was originally part of D61028, but it's an independent diff. If we try the repeated divisor reciprocal transform before producing an estimate sequence, then we have an opportunity to use scalar fdiv. On x86, the trade-off is 1 divss vs. 5 vector FP ops in the default estimate sequence. On recent chips (Skylake, Ryzen), the full-precision division is only 3 cycle throughput, so that's probably the better perf default option and avoids problems from x86's inaccurate estimates. The last 2 tests show that users still have the option to override the defaults by using the function attributes for reciprocal estimates, but those patterns are potentially made faster by converting the vector ops (including ymm ops) to scalar math. Differential Revision: https://reviews.llvm.org/D61149 llvm-svn: 359793	2019-05-02 15:02:08 +00:00
Sanjay Patel	cb50bfc476	[SelectionDAG] remove constant folding limitations based on FP exceptions We don't have FP exception limits in the IR constant folder for the binops (apart from strict ops), so it does not make sense to have them here in the DAG either. Nothing else in the backend tries to preserve exceptions (again outside of strict ops), so I don't see how this could have ever worked for real code that cares about FP exceptions. There are still cases (examples: unary opcodes in SDAG, FMA in IR) where we are trying (at least partially) to preserve exceptions without even asking if the target supports FP exceptions. Those should be corrected in subsequent patches. Real support for FP exceptions requires several changes to handle the constrained/strict FP ops. Differential Revision: https://reviews.llvm.org/D61331 llvm-svn: 359791	2019-05-02 14:47:59 +00:00
Simon Pilgrim	f357de2ce2	[X86][SSE] lowerAddSubToHorizontalOp - enable ymm extraction+fold Limiting scalar hadd/hsub generation to the lowest xmm looks to be unnecessary - we will be extracting one upper xmm whatever, and we can remove a shuffle by using the hop which is inline with what shouldUseHorizontalOp expects to happen anyway. Testing on btver2 (the main target for fast-hops) shows this is beneficial even for float ops where we have a 'shuffle' to extract the float result: https://godbolt.org/z/0R-U-K Differential Revision: https://reviews.llvm.org/D61426 llvm-svn: 359786	2019-05-02 14:00:55 +00:00
Simon Pilgrim	ebbf2b37ff	[X86][SSE] Move shouldUseHorizontalOp inside isHorizontalBinOp. NFCI. Matches what we do for lowerAddSubToHorizontalOp and will make it easier to peek through subvectors to help fix PR39921 llvm-svn: 359782	2019-05-02 12:18:24 +00:00
James Henderson	cecec4697a	[llvm-strip]Add --no-strip-all to disable --strip-all behaviour (including default stripping) If certain switches are not specified, llvm-strip behaves as if --strip-all were specified. This means that for testing, when we don't want the stripping behaviour, we have to specify one of these switches, which can be confusing. This change adds --no-strip-all to allow an alternative way of suppressing the default stripping, in a less confusing manner. Reviewed by: jakehehrlich, MaskRay Differential Revision: https://reviews.llvm.org/D61377 llvm-svn: 359781	2019-05-02 11:53:02 +00:00
Fangrui Song	4ded17c871	[LTO] Migrate typedef to using using has been used in several places in the file. Migrate the rest for consistency. llvm-svn: 359776	2019-05-02 10:52:34 +00:00
Fangrui Song	8cbafadfc6	[llvm-readobj] Delete and inline relocAddressLess It is used only once in COFFDumper.cpp. Deleting it from the public interface seems better. llvm-svn: 359775	2019-05-02 10:49:27 +00:00
Fangrui Song	228858c00c	[Object] Change getSectionName() to return Expected<StringRef> Summary: It currently receives an output parameter and returns std::error_code. Expected<StringRef> fits for this purpose perfectly. Differential Revision: https://reviews.llvm.org/D61421 llvm-svn: 359774	2019-05-02 10:32:03 +00:00
Diana Picus	262e71c77d	[ARM GlobalISel] Fixup r359768 Get rid of local variable used only in assertion. llvm-svn: 359772	2019-05-02 10:08:29 +00:00
Diana Picus	e8b91df048	[ARM GlobalISel] Select extensions to < 32 bits Select G_SEXT and G_ZEXT with destination types smaller than 32 bits in the exact same way as 32 bits. This overwrites the higher bits, but that should be ok since all legal users of types smaller than 32 bits ignore those bits anyway. llvm-svn: 359768	2019-05-02 09:28:00 +00:00
Diana Picus	3d1035c103	[ARM GlobalISel] Rename some inst selector tests. NFC Prepare to add support for extensions to types smaller than 32 bits. llvm-svn: 359767	2019-05-02 09:24:47 +00:00
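
Note on cf8d0ecc05 ([IRTranslator] alloc size vs. store size): the message describes i19 getting a 24-bit slot where a 32-bit slot was wanted. The arithmetic can be sketched roughly as below. This is only an illustration, not the code from the commit: `storeSizeInBytes` and `allocSizeInBytes` are hypothetical helpers rather than LLVM's DataLayout API, and the 4-byte ABI alignment for i19 is an assumption chosen to match the 32-bit slot mentioned in the message.

```cpp
#include <cstdint>

// Hypothetical helpers for illustration only; all sizes are in bytes.

// Store size: the bit width rounded up to the next whole byte.
constexpr std::uint64_t storeSizeInBytes(std::uint64_t Bits) {
  return (Bits + 7) / 8;
}

// Alloc size: the store size rounded up to a multiple of the ABI alignment.
// AbiAlignBytes is supplied by the caller here; it is an assumed input.
constexpr std::uint64_t allocSizeInBytes(std::uint64_t Bits,
                                         std::uint64_t AbiAlignBytes) {
  return (storeSizeInBytes(Bits) + AbiAlignBytes - 1) / AbiAlignBytes *
         AbiAlignBytes;
}

// i19: 3 bytes (24 bits) to store, 4 bytes (32 bits) to allocate,
// assuming a 4-byte ABI alignment.
static_assert(storeSizeInBytes(19) == 3, "i19 stores in 24 bits");
static_assert(allocSizeInBytes(19, 4) == 4, "i19 occupies a 32-bit slot");
```

The checks are written as `static_assert` so the arithmetic is verified at compile time; for a type such as i32 the two sizes coincide, which is likely why the difference only shows up with oddly sized types.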

178333 Commits