llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
Jonas Devlieghere	4c6cebe9e5	Re-land "[FileCollector] Add a method to add a whole directory and it contents." Extend the FileCollector's API with addDirectory which adds a directory and its contents to the VFS mapping. Differential revision: https://reviews.llvm.org/D76671	2020-03-30 13:19:18 -07:00
Daan Sprenkels	83fa53f614	[InstCombine] Update assertions in InstCombine test; NFC	2020-03-30 22:15:50 +02:00
Sanjay Patel	baad469559	[InstCombine] do not exclude min/max from icmp with casted operand fold InstCombine has a mess of logic that tries to preserve min/max patterns, but AFAICT, this one is not necessary because we can always narrow the corresponding select in this sequence to match the narrow compare. The biggest danger for this patch is inducing infinite looping or assert from exceeding max iterations. If any bots hit that in the vicinity of this commit, this is the likely patch to blame.	2020-03-30 16:10:51 -04:00
Sam Clegg	7e969f4e53	[ADT] Allow empty string in StringSet Also add a test case to wasm-ld that asserts without this change. Internally wasm-ld builds a StringMap of exported functions and it seems like allowing empty string in the set is preferable to adding checks. This assert looks like it was most likely just a historical accident. It started life here purely to support InputLanguagesSet: eeac27e38c5c567d63bbfa5410620d955696491b Then got extracted here: e57a4033385c5976cbb17af1e962b1224a61183b Then got moved to AST here 5c48bae209bcbd261886f63abac695b1e30544e6 With the `InLang` paramater name still intact which suggested is InputLanguagesSet origins. Differential Revision: https://reviews.llvm.org/D74589	2020-03-30 12:59:34 -07:00
Eli Friedman	12acf42d57	[llvm-cov] Improve error message for missing profdata I got a report recently that a user was having trouble interpreting the meaning of the error message. Hopefully this is more readable; produces something like the following: error: No such file or directory: Could not read profile data! Differential Revision: https://reviews.llvm.org/D76796	2020-03-30 12:54:07 -07:00
Julian Lettner	245d123cd8	[lit] Use Python's support for None in array slice indexing	2020-03-30 12:44:03 -07:00
Matt Arsenault	a754d4d9c0	CodeGen: Add missing MachineOperand setter	2020-03-30 15:27:17 -04:00
Matt Arsenault	b5a136377e	AMDGPU/GlobalISel: Basic legalize rules for G_FSHR Only handles easy 32-bit cases.	2020-03-30 11:53:01 -07:00
Bill Wendling	5e28cd3473	[Intrinsic] Give "is.constant" the "convergent" attribute Summary: Code frequently relies upon the results of "is.constant" intrinsics to DCE invalid code paths. We don't want the intrinsic to be made control- dependent on any additional values. For instance, we can't split a PHI into a "constant" and "non-constant" part via jump threading in order to "optimize" the constant part, because the "is.constant" intrinsic is meant to return "false". Reviewers: wmi, kazu, MaskRay Reviewed By: kazu Subscribers: jdoerfert, efriedma, joerg, lebedev.ri, nikic, xbolva00, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75799	2020-03-30 11:47:12 -07:00
Nico Weber	ae71f17f3f	fix a comment grammar-o	2020-03-30 14:40:15 -04:00
Matt Arsenault	34960265f6	GlobalISel: Add accessor to known bits to CombinerHelper I need to pass known bits to a target combine matcher (which for some reason aren't methods in a subclass of CombinerHelper?)	2020-03-30 11:34:42 -07:00
Matt Arsenault	d97f166c5b	GlobalISel: Translate llvm.fshl/llvm.fshr	2020-03-30 11:34:42 -07:00
Thomas Raoux	88fda15a68	[ConstantFold][NFC] Compile time optimization for large vectors Optimize the common case of splat vector constant. For large vector going through all elements is expensive. For splatr/broadcast cases we can skip going through all elements. Differential Revision: https://reviews.llvm.org/D76664	2020-03-30 11:27:09 -07:00
LLVM GN Syncbot	737fd97ee6	[gn build] Port 3cbbded68c2	2020-03-30 18:16:33 +00:00
Nico Weber	6fbcc674fe	Move CLANG_SYSTEMZ_DEFAULT_ARCH to config.h. Instead of using a global define; see comments on D75914. While here, port 9c9d88d8b1b to the GN build.	2020-03-30 14:16:17 -04:00
Jakub Kuderski	a8ac91189c	[AMDGPU] Add Relocation Constant Support Summary: This change adds amdgcn.reloc.constant intrinsic to the amdgpu backend, which will compile into a relocation entry in the resulting elf. The intrinsics takes a MetadataNode (String) as its only argument, which specifies the symbol name of the relocation entry. `SelectionDAGBuilder::getValueImpl` is changed to allow metadata operands passed through to ISel. Author: csyonghe <yonghe@google.com> Reviewers: tpr, nhaehnle Reviewed By: nhaehnle Subscribers: arsenm, kzhuravl, jvesely, wdng, yaxunl, dstuttard, t-tye, hiraditya, kerbowa, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76440	2020-03-30 13:49:20 -04:00
Sameer Sahasrabuddhe	b4d5045713	Introduce unify-loop-exits pass. For each natural loop with multiple exit blocks, this pass creates a new block N such that all exiting blocks now branch to N, and then control flow is redistributed to all the original exit blocks. The bulk of the tranformation is a new function introduced in BasicBlockUtils that an redirect control flow from a set of incoming blocks to a set of outgoing blocks via a common "hub". This is a useful workaround for a limitation in the structurizer which incorrectly orders blocks when processing a nest of loops. This pass bypasses that issue by ensuring that each natural loop is recognized as a separate region. Since the structurizer is a region pass, it no longer sees a nest of loops in a single region, and instead processes each "level" in the nesting as a separate region. The AMDGPU backend provides a new option to enable this pass before the structurizer, which may eventually be enabled by default. Reviewers: madhur13490, arsenm, nhaehnle Reviewed By: nhaehnle Differential Revision: https://reviews.llvm.org/D75865	2020-03-30 13:23:56 -04:00
Vedant Kumar	5db978be0e	[LoopVectorize] Fix crash on "getNoopOrZeroExtend cannot truncate!" (PR45259) In InnerLoopVectorizer::getOrCreateTripCount, when the backedge taken count is a SCEV add expression, its type is defined by the type of the last operand of the add expression. In the test case from PR45259, this last operand happens to be a pointer, which (according to llvm::Type) does not have a primitive size in bits. In this case, LoopVectorize fails to truncate the SCEV and crashes as a result. Uing ScalarEvolution::getTypeSizeInBits makes the truncation work as expected. https://bugs.llvm.org/show_bug.cgi?id=45259 Differential Revision: https://reviews.llvm.org/D76669	2020-03-30 10:14:14 -07:00
Yuanfang Chen	5676208119	[X86] make sure POP has implicit def/use of stack pointer when materializing 8-bit immediates for minsize Summary: Otherwise PostRA list scheduler may reorder instruction, such as schedule this ''' pushq $0x8 pop %rbx lea 0x2a0(%rsp),%r15 ''' to ''' pushq $0x8 lea 0x2a0(%rsp),%r15 pop %rbx ''' by mistake. The patch is to prevent this to happen by making sure POP has implicit use of SP. Reviewers: craig.topper Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77031	2020-03-30 09:25:31 -07:00
Guillaume Chatelet	61ed715c3a	[Alignment][NFC] Use Align version of getMachineMemOperand Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: jyknight, sdardis, nemanjai, hiraditya, kbarton, fedor.sergeev, asb, rbar, johnrusso, simoncook, sabuasal, niosHD, jrtc27, MaskRay, zzheng, edward-jones, atanasyan, rogfer01, MartinMosbeck, brucehoult, the_o, jfb, PkmX, jocewei, Jim, lenary, s.egerton, pzheng, sameer.abuasal, apazos, luismarques, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77059	2020-03-30 15:46:27 +00:00
Matt Arsenault	ee8a6b7a61	GlobalISel: Minor cleanups	2020-03-30 11:26:22 -04:00
Matt Arsenault	b4f2a15cb5	AMDGPU/GlobalISel: Hack to fix i24 argument lowering I still think the call lowering type legalization logic split between the generic code and target is too confusing, but largely induced by the reliance on the DAG infrastructure.	2020-03-30 11:00:45 -04:00
Matt Arsenault	64e3cb670a	AMDGPU/GlobalISel: Legalize 64-bit G_UDIV/G_UREM Mostly ported from the DAG version. This results in much worse code than the DAG version, largely due to a much worse expansion for G_UMULH.	2020-03-30 10:57:37 -04:00
Chris Jackson	65cfb4a16b	[DebugInfo] Ensure that a demanded bits optimisation in InstCombine does not result in an incorrect debuginfo variable value - Add an additional salvage and a test. Reviewers: aprantl, djtodoro Differential Revision: https://reviews.llvm.org/D76854 Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=44371	2020-03-30 15:39:22 +01:00
Florian Hahn	18e00645a2	Revert "[Darwin] Respect -fno-unroll-loops during LTO." As per post-commit comment at https://reviews.llvm.org/D76916, this should better be done at the TU level. This reverts commit 9ce198d6ed371399e9bd9ba8b48fbab0f4e60240.	2020-03-30 15:20:30 +01:00
Chris Jackson	2d8c535e77	[DebugInfo] Ensure dead store elimination can mark an operand value as undefined - Correct a debug info salvage and add a test Reviewers: aprantl, vsk Differential Revision: https://reviews.llvm.org/D76930 Bugzilla: https://bugs.llvm.org/show_bug.cgi?id=45080	2020-03-30 14:58:14 +01:00
Sanjay Patel	c41823bb5a	[InstCombine] add test for trunc-extelt; NFC Goes with D76983	2020-03-30 09:43:03 -04:00
Guillaume Chatelet	f993ddbf7d	[Alignment][NFC] Provide tightened up functions in SelectionDAG, MachineFunction and MachineMemOperand Summary: This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, jfb, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D77046	2020-03-30 13:03:27 +00:00
Georgii Rymar	6e7e465fc6	[llvm-readobj] - Improve test of --elf-hash-histogram option. This test missed the check of histograms printed for .hash sections. It was removed by mistake in D71606 where I tried to get rid of precompiled objects and did not realize that time that both SHT_GNU_HASH and SHT_HASH sections were tested and not just GNU version. Also it never tested aliases for the --elf-hash-histogram option. Differential revision: https://reviews.llvm.org/D76920	2020-03-30 15:46:45 +03:00
Georgii Rymar	6e3c5fe844	[llvm-readobj][test] - Simplify hash-symbols test. We are able to reduce `-DBITS=32/64` to reduce this test case. I've rewrote the comments we had to generalize them and fix wrong computations they contained. Differential revision: https://reviews.llvm.org/D76924	2020-03-30 14:44:30 +03:00
Simon Pilgrim	2b1f75c079	[X86][AVX] lowerV4X128Shuffle - attempt to widen to 2x256 to simplify shuffles If we are lowering to X86ISD::SHUF128 we are going to lose track of individual 128-bit lanes that are UNDEF, so if we can widen these to guarantee that they are sequential with their neighbour we should. This helps with later shuffle combines.	2020-03-30 12:22:26 +01:00
Florian Hahn	fcea358269	[Matrix] Rename emitChainedMatrixMultiply to emitMatrixMultiply (NFC). The Chained in the name potentially leads to confusion. Also updated the comment to drop the unnecessary mention of tile-sized.	2020-03-30 11:17:25 +01:00
Florian Hahn	fdeed10483	[AMDGPU] Drop const for value that is copied (NFC). This fixes warning: loop variable 'Def' of type 'const llvm::Register' creates a copy from type 'const llvm::Register' [-Wrange-loop-analysis] llvm::Register just contains a single unsigned and should be copied. Reviewers: rampitec Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D77011	2020-03-30 10:59:59 +01:00
Florian Hahn	49b3d2c46a	[CVP] Add additional icmp for ranges with undef to test.	2020-03-30 10:59:25 +01:00
Qiu Chaofan	de43cc4915	[NFC] [PowerPC] Update and add tests for ori Use script to update test for ori with 32-bit imms, and add test for ori with 64-bit imms.	2020-03-30 17:46:12 +08:00
Sam Parker	a88c160cb1	[ARM][LowOverheadLoops] Add horizontal reduction support Add a bit more logic into the 'FalseLaneZeros' tracking to enable horizontal reductions and also make the VADDV variants validForTailPredication. Differential Revision: https://reviews.llvm.org/D76708	2020-03-30 09:55:41 +01:00
Guillaume Chatelet	d8275ca792	[Alignment][NFC] Return Align for SelectionDAGNodes::getOriginalAlignment/getAlignment Summary: Also deprecate getOriginalAlignment, getAlignment will take much more time as it is pervasive through the codebase (including TableGened files). This is patch is part of a series to introduce an Alignment type. See this thread for context: http://lists.llvm.org/pipermail/llvm-dev/2019-July/133851.html See this patch for the introduction of the type: https://reviews.llvm.org/D64790 Reviewers: courbet Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76933	2020-03-30 07:26:48 +00:00
David Green	b1c13d77ac	[ARM] MVE VMOV.i64 In the original batch of MVE VMOVimm code generation VMOV.i64 was left out due to the way it was done downstream. It turns out that it's fairly simple though. This adds the codegen for it, similar to NEON. Bigendian is technically incorrect in this version, which John is fixing in a Neon patch.	2020-03-30 07:44:23 +01:00
Craig Topper	2289b000f6	[TTI][X86] Fix the value passed to IsUnsigned for cost modeling of experimental.vector.reduce.smin/smax/umin/umax. We were passing true for smax/smin and false for umax/umin.	2020-03-29 23:34:22 -07:00
Max Kazantsev	8794887cea	[NFC] Remove obsolete checks followed by fix of isGuaranteedToTransferExecutionToSuccessor In past, isGuaranteedToTransferExecutionToSuccessor contained some weird logic for volatile loads/stores that was ultimately removed by patch D65375. It's time to remove a piece of dependent logic that used to be a workaround for the code which is now deleted. Reviewed By: uenoku Differential Revision: https://reviews.llvm.org/D76918	2020-03-30 12:24:41 +07:00
Juneyoung Lee	2cd7fcb259	[LangRef] Clarify the semantics of branch on undef Summary: This patch clarifies the semantics of branching on undef value. Defining `br undef` as undefined behavior explains optimizations that use branch conditions, such as CVP (D76931) and GVN (propagateEquality). For `switch cond`, it is defined to raise UB if cond is an expression containing undef && cond is not frozen && it may yield different values. This allows that at the destination block the branch condition can be assumed to be frozen already (otherwise UB was already triggered). This condition is slightly stricter than MemorySanitizer, which allows undef-y condition if it always leads to the same destination, but it does not break MemorySanitizer because we are giving stricter constraint. Reviewers: efriedma, fhahn, nikic, spatel, jdoerfert, nlopes Reviewed By: nlopes Subscribers: regehr, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D76973	2020-03-30 11:41:47 +09:00
Jun Ma	1519f8d30a	[Coroutines 2/2] Improve symmetric control transfer feature Differential Revision: https://reviews.llvm.org/D76913	2020-03-30 09:53:09 +08:00
Jun Ma	9f7a51855e	[Coroutines 1/2] Improve symmetric control transfer feature Differential Revision: https://reviews.llvm.org/D76911	2020-03-30 09:53:09 +08:00
Craig Topper	8bfc3e764e	[X86] Add sse4.1 RUNs lines to the min/max reduction cost model tests. Mostly this matches the sse4.2 we already had command lines for. Except in the i64 case since sse4.1 doesn't have pcmpgtq.	2020-03-29 16:05:35 -07:00
Daan Sprenkels	c80b7e3f8e	[InstCombine] Add tests for trunc (extelt x); (NFC) Baseline tests for D76983 (PR45314) Differential Revision: https://reviews.llvm.org/D77024	2020-03-29 17:30:54 -04:00
Craig Topper	af68664485	[X86] Add sse4.2 command lines to min/max reduction tests. SSE4.2 has the pcmpgtq instruction which we will use in vXi64 reductions when its available.	2020-03-29 13:51:03 -07:00
David Green	fbea30d2df	[ARM] VMOV.64 immediate tests. NFC	2020-03-29 21:08:43 +01:00
LLVM GN Syncbot	cf4881fd2f	[gn build] Port 854f268ca62	2020-03-29 19:24:34 +00:00
Benjamin Kramer	4c79d49be9	[MC] Move deprecation infos from MCTargetDesc to MCInstrInfo This allows emitting it only when the feature is used by a target. Shrinks Release+Asserts clang by 900k.	2020-03-29 21:20:40 +02:00
Simon Pilgrim	5bc79f436c	[X86][AVX] Combine 128/256-bit lane shuffles with zeroable upper subvectors to EXTRACT_SUBVECTOR (PR40720) As explained on PR40720, EXTRACTF128 is always as good/better than VPERM2F128/SHUF128, and we can use the implicit zeroing of the uppers.	2020-03-29 19:51:38 +01:00

1 2 3 4 5 ...

194067 Commits