llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 20:51:52 +01:00

Author	SHA1	Message	Date
Alexey Lapshin	073a1e428c	[AARCH64][NEON] Allow to sink operands of aarch64_neon_pmull64. Summary: This patch fixes a problem when pmull2 instruction is not generated for vmull_high_p64 intrinsic. ISel has a pattern for int_aarch64_neon_pmull64 intrinsic to generate PMULL2 instruction. That pattern assumes that extraction operations are located in the same basic block. We need to sink them if they are not. Handle operands of int_aarch64_neon_pmull64 into AArch64TargetLowering::shouldSinkOperands. Reviewed by: efriedma Differential Revision: https://reviews.llvm.org/D80320	2020-05-22 01:35:24 +03:00
Craig Topper	b8040080d8	[Target] Use Align in TargetLoweringObjectFile::getSectionForConstant. Differential Revision: https://reviews.llvm.org/D80363	2020-05-21 15:23:29 -07:00
Arthur Eubanks	7fa9fcec62	Don't jump to landing pads in Control Flow Optimizer Summary: Likely fixes https://bugs.llvm.org/show_bug.cgi?id=45858. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80047	2020-05-21 15:19:10 -07:00
Jinsong Ji	efd35f1fad	[docs] Fix buildbot failures Buildbot has been failing since http://lab.llvm.org:8011/builders/llvm-sphinx-docs/builds/44711 This patch fix the minor issues that cause warnings.	2020-05-21 22:07:33 +00:00
Dominic Chen	69d36b0ea5	llvm-diff: Avoid crash with complex expressions Summary: With complex IR, the difference engine can enter an inconsistent state where the instruction and block difference engines return different results. Prevent a crash by refusing to pop beyond the end of the vector. Reviewers: rjmccall Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80351	2020-05-21 17:43:47 -04:00
Tim Renouf	4ee52612b3	[AMDGPU] Fixed incorrect PAL metadata register naming This only affects assembly and -filetype=asm codegen of PAL metadata. Differential Revision: https://reviews.llvm.org/D78860 Change-Id: I7b822e1917bf7b403486820d31afc483be207652	2020-05-21 22:13:19 +01:00
Tim Renouf	9ec51ac7fd	[MsgPack] Added convenience assignment to MsgPackDocument This commit increases the convenience of using the MsgPackDocument API, especially when creating a document for writing out. It adds direct assignment of bool, integer and string types to a DocNode, as long as that DocNode is already inside a document, e.g. the result of a map lookup. It also adds map lookup given an integer type (it already had that for string). So, to assign a string to a map element whose key is an int, you can now write MyMap[42] = "towel"; instead of MyMap[MyMap.getDocument()->getNode(42)] = MyMap.getDocument()->getNode("towel"); Also added MapDocNode::erase methods. Differential Revision: https://reviews.llvm.org/D80121 Change-Id: I17301fa15bb9802231c52542798af5b54beb583e	2020-05-21 22:13:19 +01:00
Jean-Michel Gorius	b6e158e140	[CodeGen] Add support for multiple memory operands in MachineInstr::mayAlias Summary: To support all targets, the mayAlias member function needs to support instructions with multiple operands. This revision also changes the order of the emitted instructions in some test cases. Reviewers: efriedma, hfinkel, craig.topper, dmgreen Reviewed By: efriedma Subscribers: MatzeB, dmgreen, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D80161	2020-05-21 23:02:54 +02:00
Stanislav Mekhanoshin	fbee044fb9	[AMDGPU] Promote alloca to vector in opt Promote alloca to vector before SROA and loop unroll. If we manage to eliminate allocas before unroll we may choose to unroll less. Differential Revision: https://reviews.llvm.org/D80386	2020-05-21 13:49:51 -07:00
Eli Friedman	23a7c73a2f	[AArch64][SVE] Fill out missing unpredicated load/store patterns. The set of patterns for unpredicated load/store was incomplete: it only included non-extending stores. Fill out the remaining patterns for extending stores, and add the corresponding support to frame offset lowering. Differential Revision: https://reviews.llvm.org/D80349	2020-05-21 13:29:30 -07:00
Tim Renouf	7bca68b8dc	[MsgPack] MsgPackDocument::readFromBlob now merges The readFromBlob method can now be used to read MsgPack into a Document that already contains something, merging the two. There is a new Merger argument to readFromBlob, a callback function to resolve conflicts. Differential Revision: https://reviews.llvm.org/D79671 Change-Id: Icf3e959217fe33cd907a41516c0386aef2847c0c	2020-05-21 21:26:26 +01:00
Hendrik Greving	14c4cee2d8	[ModuloSchedule] Add missing comma. This is a test commit as per Chris to verify commit access. Thanks!	2020-05-21 13:18:07 -07:00
Marcello Maggioni	9e2b961780	[SelectionDAG] Add the option of disabling generic combines. Summary: For some targets generic combines don't really do much and they consume a disproportionate amount of time. There's not really a mechanism in SDISel to tactically disable combines, but we can have a switch to disable all of them and let the targets just implement what they specifically need. Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79112	2020-05-21 20:11:29 +00:00
Jonas Devlieghere	e5fd992382	[dsymutil] Add llvm_unreachable to silence warning Fixes warning: control reaches end of non-void function [-Wreturn-type]	2020-05-21 12:27:52 -07:00
Stanislav Mekhanoshin	ed3bd1410f	[AMDGPU] Added opt pipeline test. NFC.	2020-05-21 11:58:35 -07:00
Jonas Devlieghere	435f397d00	[dsymutil] Fix conversion between unique_ptr and Expected Reproducer.cpp:70:12: error: could not convert ‘Repro’ from ‘std::unique_ptr<llvm::dsymutil::ReproducerGenerate, std::default_delete<llvm::dsymutil::ReproducerGenerate> >’ to ‘llvm::Expected<std::unique_ptr<llvm::dsymutil::Reproducer> >’	2020-05-21 11:42:04 -07:00
Hiroshi Yamauchi	4123cb0132	[IR] Make Module::setProfileSummary to replace an existing ProfileSummary flag. Summary: Module::setProfileSummary currently calls addModuelFlag. This prevents from updating the ProfileSummary metadata in the module and results in a second ProfileSummary added instead of replacing an existing one. I don't think this is the expected behavior. It prevents updating the ProfileSummary and it does not make sense to have more than one. To address this, add Module::setModuleFlag and use it from setProfileSummary. Reviewers: davidxl Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79902	2020-05-21 11:38:39 -07:00
LLVM GN Syncbot	837832af2a	[gn build] Port 92fd3971e0d	2020-05-21 18:20:26 +00:00
Jonas Devlieghere	c85ea496cb	[dsymutil] Fix include-style	2020-05-21 11:19:57 -07:00
Jonas Devlieghere	b03521b98d	[dsymutil] Add reproducers to dsymutil Add support for generating a dsymutil reproducer. The result is a folder containing all the object files for linking. When --gen-reproducer is passed, dsymutil uses a FileCollectorFileSystem which keeps track of all the files used by dsymutil. These files are copied into a temporary directory when dsymutil exists. When this path is passed to --use-reproducer, dsymutil uses a RedirectingFileSystem that will use the files from the reproducer directory instead of the actual paths. This means you don't need to mess with the OSO path prefix. Differential revision: https://reviews.llvm.org/D79398	2020-05-21 10:59:49 -07:00
Simon Atanasyan	08e3b796f7	[mips] Reorganize check directives in the test. NFC	2020-05-21 20:57:04 +03:00
Jonas Devlieghere	6dd20e6e2d	Revert "Revert "[YAMLTraits] Add trait for char"" Reverting this to unblock all the LLDB bots while we try to figure out a solution for Solaris in https://reviews.llvm.org/D79745.	2020-05-21 10:33:09 -07:00
Benjamin Kramer	86b9f45429	[ImmutableSet] Use IntrusiveRefCntPtr to eliminate some manual refcounting Still not ideal as the refcounting leaks to users, but better than before. NFCI.	2020-05-21 19:10:22 +02:00
Jean-Michel Gorius	e77d83d2b0	[ADT][Analysis] NFC: Fix some more typos	2020-05-21 18:53:43 +02:00
Hiroshi Yamauchi	4d59570b59	[ProfileSummary] Add the PartialProfileRatio field in ProfileSummary metadata. Summary: PartialProfileRatio approximately represents the ratio of the number of profile counters of the program being built to the number of profile counters in the partial sample profile. It is used to scale the working set size under the partial sample profile to reflect the size of the program being built and to improve the working set size heuristics. This is a split from D79831. Reviewers: davidxl Subscribers: eraman, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D79951	2020-05-21 09:12:23 -07:00
Stanislav Mekhanoshin	6d06d957a7	[AMDGPU] Tune threshold for cmp/select vector lowering It was set in total vector size while the idea was to limit a number of instructions. Now it started to work with doubles and thresholds needs to be updated. Differential Revision: https://reviews.llvm.org/D80322	2020-05-21 08:59:35 -07:00
Jean-Michel Gorius	bd64a1c8db	[ADT] NFC: Fix typos in header comments	2020-05-21 17:43:00 +02:00
Rainer Orth	aa8832a483	Revert "[YAMLTraits] Add trait for char" This reverts commit fab08bf4899e40d02d8bf394a63499ac679ac61c. It has left the Solaris buildbots broken for a week and a half as reported in https://reviews.llvm.org/D79745.	2020-05-21 17:33:42 +02:00
Jon Roelofs	ff25a24574	[llvm][test] Add missing FileCheck colons. NFC	2020-05-21 09:29:27 -06:00
Jon Roelofs	aae486e5a4	[llvm][test] Add COM: directives before colon-less non-CHECKs in comments. NFC Differential Revision: https://reviews.llvm.org/D79963	2020-05-21 09:29:27 -06:00
Dinar Temirbulatov	d037f0f025	[SLP][NFC] PR45269 getVectorElementSize() is slow The algorithm inside getVectorElementSize() is almost O(x^2) complexity and when, for example, we compile MultiSource/Applications/ClamAV/shared_sha256.c with 1k instructions inside sha256_transform() function that resulted in almost ~800k iterations. The following change improves the algorithm with the map to a liner complexity. Differential Revision: https://reviews.llvm.org/D80241	2020-05-21 17:26:50 +02:00
Thomas Raoux	ce046e33ef	[ModuloSchedule] Trivial fix for instruction with more than one destination in modulo peeler. When moving an instruction into a block where it was referenced by a phi when peeling, refer to the phi's register number and assert that the instruction has it in its destinations. This way, it also covers instructions with more than one destination. Patch by Hendrik Greving! Differential Revision: https://reviews.llvm.org/D80027	2020-05-21 08:14:42 -07:00
Jean-Michel Gorius	5163d6baec	[x86] NFC: Fix typo in command line option description	2020-05-21 16:53:25 +02:00
Simon Pilgrim	7c091722a6	GenericDomTree.h - remove unused PointerIntPair.h include. NFC.	2020-05-21 15:48:36 +01:00
Benjamin Kramer	b3fcfd789a	[BitcodeReader] Simplify code. NFCI.	2020-05-21 16:03:09 +02:00
Benjamin Kramer	6bd4f52a3c	[StringRef] Use some trickery to avoid initializing the std::string returned by upper()/lower()	2020-05-21 16:03:09 +02:00
James Henderson	813a9c4567	On Windows, handle interrupt signals without crash message For LLVM on *nix systems, the signal handlers are not run on signals such as SIGINT due to CTRL-C. See sys::CleanupOnSignal. This makes sense, as such signals are not really crashes. Prior to this change, this wasn't the case on Windows, however. This patch changes the Windows behaviour to be consistent with Linux, and adds testing that verifies this. The test uses llvm-symbolizer, but any tool with an interactive mode would do the job. Fixes https://bugs.llvm.org/show_bug.cgi?id=45754. Reviewed by: MaskRay, rnk, aganea Differential Revision: https://reviews.llvm.org/D79847	2020-05-21 13:27:10 +01:00
Sam Parker	15a2182f6d	[CostModel] Sink intrinsic costs to base TTI. Recommitting part of "[CostModel] Unify Intrinsic Costs." de71def3f59dc9f12f67141b5040d8e15c84d08a Move the switch statement from TTImpl::getIntrinsicCost to TTI::getIntrinsicInstrCost. This enables BasicTTI to understand more 'free' intrinsics instead of defaulting to a cost of 1. Differential Revision: https://reviews.llvm.org/D80012	2020-05-21 13:16:05 +01:00
Sam Parker	ea6298dde2	Revert "[CostModel] Unify Intrinsic Costs." This reverts commit de71def3f59dc9f12f67141b5040d8e15c84d08a. This is causing some very large changes, so I'm first going to break this patch down and re-commit in parts.	2020-05-21 12:50:24 +01:00
Ehud Katz	14ed97da5e	[FlattenCFG] Fix `MergeIfRegion` in case then-path is empty In case the then-path of an if-region is empty, then merging with the else-path should be handled with the inverse of the condition (leading to that path). Fix PR37662 Differential Revision: https://reviews.llvm.org/D78881	2020-05-21 14:06:44 +03:00
Simon Pilgrim	c76a10c58f	MachineMemOperand.h - reduce GlobalValue.h include to just DerivedTypes.h. NFC. We don't need anything specifically from GlobalValue.h	2020-05-21 11:38:25 +01:00
Roman Lebedev	509d4884e7	[IndVarSimplify][LoopUtils] Avoid TOCTOU/ordering issues (PR45835) Summary: Currently, `rewriteLoopExitValues()`'s logic is roughly as following: > Loop over each incoming value in each PHI node. > Query whether the SCEV for that incoming value is high-cost. > Expand the SCEV. > Perform sanity check (`isValidRewrite()`, D51582) > Record the info > Afterwards, see if we can drop the loop given replacements. > Maybe perform replacements. The problem is that we interleave SCEV cost checking and expansion. This is A Problem, because `isHighCostExpansion()` takes special care to not bill for the expansions that were already expanded, and we can reuse. While it makes sense in general - if we know that we will expand some SCEV, all the other SCEV's costs should account for that, which might cause some of them to become non-high-cost too, and cause chain reaction. But that isn't what we are doing here. We expand all SCEV's, unconditionally. So every next SCEV's cost will be affected by the already-performed expansions for previous SCEV's. Even if we are not planning on keeping some of the expansions we performed. Worse yet, this current "bonus" depends on the exact PHI node incoming value processing order. This is completely wrong. As an example of an issue, see @dmajor's `pr45835.ll` - if we happen to have a PHI node with two(!) identical high-cost incoming values for the same basic blocks, we would decide first time around that it is high-cost, expand it, and immediately decide that it is not high-cost because we have an expansion that we could reuse (because we expanded it right before, temporarily), and replace the second incoming value but not the first one; thus resulting in a broken PHI. What we instead should do for now, is not perform any expansions until after we've queried all the costs. Later, in particular after `isValidRewrite()` is an assertion (D51582) we could improve upon that, but in a more coherent fashion. See [[ https://bugs.llvm.org/show_bug.cgi?id=45835 \| PR45835 ]] Reviewers: dmajor, reames, mkazantsev, fhahn, efriedma Reviewed By: dmajor, mkazantsev Subscribers: smeenai, nikic, hiraditya, javed.absar, llvm-commits, dmajor Tags: #llvm Differential Revision: https://reviews.llvm.org/D79787	2020-05-21 13:05:55 +03:00
Sjoerd Meijer	a5cc4d095a	[HardwareLoops] llvm.loop.decrement.reg definition This is split off from D80316, slightly tightening the definition of overloaded hardwareloop intrinsic llvm.loop.decrement.reg specifying that both operands its result have the same type.	2020-05-21 10:48:16 +01:00
Denis Antrushin	7476f090de	[Statepoint] Constant fold FP deopt args. We do not have any special handling for constant FP deopt arguments. They are just spilled to stack or generated in register by MOVS instruction. This is inefficient and, when we have too many such constant arguments, may result in register allocation failure. Instead, we can bitcast such constant FP operands to appropriately sized integer and record as constant into statepoint and later, into StackMap. Reviewed By: skatkov Differential Revision: https://reviews.llvm.org/D80318	2020-05-21 11:02:54 +03:00
Benjamin Kramer	84e5f451b2	Fix a layering violation by not depending from Transforms/Utils on Transforms/Scalar. NFC.	2020-05-21 09:51:58 +02:00
David Sherwood	bdccb69dac	[SVE] Remove IITDescriptor::ScalableVecArgument I have refactored the code so that we no longer need the ScalableVecArgument descriptor - the scalable property of vectors is now encoded using the ElementCount class in IITDescriptor. This means that when matching intrinsics we know precisely how to match the arguments and return values. Differential Revision: https://reviews.llvm.org/D80107	2020-05-21 08:15:10 +01:00
Chen Zheng	8094c83ddb	[PowerPC] add more high latency opcodes for machine combiner pass Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D80097	2020-05-21 02:39:20 -04:00
Sam Parker	84147d2076	[CostModel] Unify Intrinsic Costs. With the two getIntrinsicInstrCosts folded into one, now fold in the scalar/code-size orientated getIntrinsicCost. This involved sinking cost of the TTIImpl into the base implementation, as it performs no target checks. The opcodes remaining were memcpy, cttz and ctlz which now have special handling in the BasicTTI implementation. getInstructionThroughput can now directly return the result of getUserCost. This had required a change in the AMDGPU backend for fabs and its always 'free'. I've also changed the X86 backend to return '1' for any intrinsic when the CostKind isn't RecipThroughput. Though this intended to be a non-functional change, there are many paths being combined here so I would be very surprised if this didn't have an effect. Differential Revision: https://reviews.llvm.org/D80012	2020-05-21 07:38:25 +01:00
Jonas Devlieghere	769a6f0193	Revert "[lit] GoogleTest framework should report failures if test binary crashes" This reverts commit ef2103182244c96f5206b02164b62b9c9e0cbce8 because it breaks the Windows bot: http://lab.llvm.org:8011/builders/lldb-x64-windows-ninja/builds/16447 Failing Tests (2): ... lldb-unit :: API/./APITests.exe/failed_to_discover_tests_from_gtest	2020-05-20 23:22:47 -07:00
Sam Parker	1bd0055ef6	[CostModel] Remove getExtCost This has not been implemented by any backends which appear to cover the functionality through getCastInstrCost. Sink what there is in the default implementation into BasicTTI. Differential Revision: https://reviews.llvm.org/D78922	2020-05-21 07:18:06 +01:00

1 2 3 4 5 ...

197027 Commits