llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Tim Northover	47e1a53df5	AArch64: mark x22 livein if it's an async context that gets stored. This fixes a crash with expensive checks enabled (the verifier was not happy).	2021-05-17 11:56:03 +01:00
Max Kazantsev	a6b214f432	[Test] Fix test to make the transform for which it was added legal. %limit in these tests is supposed to be positive.	2021-05-17 17:19:01 +07:00
Simon Pilgrim	633bafaf5f	[TargetLowering] prepareUREMEqFold/prepareSREMEqFold - account for non legal shift types Ensure we tell getShiftAmountTy that we're working with pre-legalized types to prevent cases where the (legalized) shift type can no longer handle the (non-legalized) type width. Fixes https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=34366	2021-05-17 11:03:27 +01:00
Tim Northover	fc5daa6083	IR/AArch64/X86: add "swifttailcc" calling convention. Swift's new concurrency features are going to require guaranteed tail calls so that they don't consume excessive amounts of stack space. This would normally mean "tailcc", but there are also Swift-specific ABI desires that don't naturally go along with "tailcc" so this adds another calling convention that's the combination of "swiftcc" and "tailcc". Support is added for AArch64 and X86 for now.	2021-05-17 10:48:34 +01:00
Jacob Bramley	2320a8cd94	[AArch64] Lower fptoi.sat intrinsics. AArch64's fcvt instructions implement the saturating behaviour that the fpto*i.sat intrinsics require, in cases where the destination width matches the saturation width. Lowering them removes a lot of unnecessary generated code. Only scalar lowerings are supported for now. Differential Revision: https://reviews.llvm.org/D102353	2021-05-17 10:19:19 +01:00
Fraser Cormack	69fc258fc0	[DAGCombiner] Relax an assertion to an early return The select-of-constants transform was asserting that its constant vector inputs did not implicitly truncate their input without that as an explicit precondition to the function. This patch relaxes that assertion into an early return to skip the optimization. Reviewed By: RKSimon Differential Revision: https://reviews.llvm.org/D102393	2021-05-17 09:15:55 +01:00
Chen Zheng	9ccc0ec15a	[PowerPC] add a testcase for reverse memory op; nfc	2021-05-17 03:29:14 -04:00
Hongtao Yu	4a7561c809	[CSSPGO] Update pseudo probe distribution factor based on inline context. With prelink inlining, pseudo probes with same ID can come from different inline contexts. Such probes should not share samples and their factors should be fixed up separately. I'm seeing 0.3% speedup for SPEC2017 overall. Benchmark 631.deepsjeng_s benefits the most, about 4%. Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D102429	2021-05-16 23:11:36 -07:00
Arthur Eubanks	99d811bdc5	Revert "[TargetLowering] Only inspect attributes in the arguments for ArgListEntry" This reverts commit 16748bd2fb1fe10d7d097961f1988327338f3f9f. Causes https://crbug.com/1209013	2021-05-16 22:02:10 -07:00
Arthur Eubanks	630c86e151	Revert "[NFC] Use ArgListEntry indirect types more in ISel lowering" This reverts commit 85af8a8c1b574faa0d5d57d189ae051debdfada8.	2021-05-16 22:00:54 -07:00
Pan, Tao	f9d8052498	[SelectionDAG] Make fast and linearize visible by clang -pre-RA-sched ScheduleDAGFast.cpp is compiled to object file, but the ScheduleDAGFast object file isn't linked into clang executable file as no symbol is referred by outside. Add calling to createXxx of ScheduleDAGFast.cpp, then the ScheduleDAGFast object file will be linked into clang executable file. The static RegisterScheduler will register scheduler fast and linearize at clang boot time. Reviewed By: pengfei Differential Revision: https://reviews.llvm.org/D101601	2021-05-17 11:25:15 +08:00
Ben Shi	796d22443b	[RISCV] Optimize or/xor with immediate in the zbs extension Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D102398	2021-05-17 10:59:52 +08:00
Ben Shi	f38e4dcf6f	[RISCV][test] Add new tests of or/xor in the zbs extension Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D102396	2021-05-17 09:47:23 +08:00
David Blaikie	2cfbc27b4f	Fix some deprecated copy ops in google mock	2021-05-16 15:59:06 -07:00
Craig Topper	05a02f8da8	[RISCV] Replace AddiPair ComplexPattern with a PatLeaf. NFC The ComplexPattern is looking for an immediate in a certain range that has a single use. This can be handled with a PatLeaf since we aren't matching multiple patterns or checking any complicated relationships between nodes. This shrinks the isel table a little bit since tablegen no longer has to generate patterns with commuted operands. With the PatLeaf, tablegen can see we're matching an immediate which should always be on the right hand side of add. Reviewed By: benshi001 Differential Revision: https://reviews.llvm.org/D102510	2021-05-16 12:17:52 -07:00
Fangrui Song	89806dccc8	[test] Improve CodeGen/*/semantic-interposition-asm.ll	2021-05-16 11:17:09 -07:00
Alessandro Decina	770bc060c5	[BPF] add support for 32 bit registers in inline asm Add "w" constraint type which allows selecting 32 bit registers. 32 bit registers were added in https://reviews.llvm.org/rGca31c3bb3ff149850b664838fbbc7d40ce571879. Differential Revision: https://reviews.llvm.org/D102118	2021-05-16 11:01:47 -07:00
Lang Hames	e4c74b057d	[JITLink] Fix symbol comparator in LinkGraph::dump. The existing implementation did not provide a strict weak ordering.	2021-05-16 10:11:58 -07:00
David Green	99f589780c	[CPG][ARM] Optimize towards branch on zero in codegenprepare This adds a simple fold into codegenprepare that converts comparison of branches towards comparison with zero if possible. For example: %c = icmp ult %x, 8 br %c, bla, blb %tc = lshr %x, 3 becomes %tc = lshr %x, 3 %c = icmp eq %tc, 0 br %c, bla, blb As a first order approximation, this can reduce the number of instructions needed to perform the branch as the shift is (often) needed anyway. At the moment this does not affect very much, as llvm tends to prefer the opposite form. But it can protect against regressions from commits like rG9423f78240a2. Simple cases of Add and Sub are added along with Shift, equally as the comparison to zero can often be folded with cpsr flags. Differential Revision: https://reviews.llvm.org/D101778	2021-05-16 17:54:06 +01:00
Philip Reames	9bcc60e5c1	Revert "Do actual DCE in LoopUnroll (try 2)" This reverts commit 653fa0b46ae34c06495b542414b704b30381cd02. Reported to trigger pr50354. Reverting until investigated.	2021-05-16 09:38:36 -07:00
David Green	010e023bd4	[ARM] Extra branch on zero tests. NFC	2021-05-16 17:22:52 +01:00
Kai Luo	f5ce33b9fe	[Utils] Fix indentation error in utils/wciia.py Running this script gives ``` "llvm-project/llvm/./utils/wciia.py", line 56 if word == "N:": TabError: inconsistent use of tabs and spaces in indentation ``` Under emacs' whitespace-mode, it shows ``` for·line·in·code_owners_file:$ ····for·word·in·line.split():$ » if·word·==·"N:":$ » » name·=·line[2:].strip()$ » » if·code_owner:$ » » » process_code_owner(code_owner)$ » » » code_owner·=·{}$ ``` I use `yapf` to format this script directly and it's running correctly.	2021-05-16 22:34:09 +08:00
Nikita Popov	945b8d0f5d	[CaptureTracking] Simplify reachability check (NFCI) This code was re-implementing the same-BB case of isPotentiallyReachable(). Historically, this was done because CaptureTracking used additional caching for local dominance queries. Now that it is no longer needed, the code is effectively the same as isPotentiallyReachable(). The only difference are extra checks for invoke/phis. These are misleading checks related to dominance in the value availability sense that are not relevant for control reachability. The invoke check was correct but redundant in that invokes are always terminators, so `I` could never come before the invoke. The phi check is a matter of interpretation (should an earlier phi node be considered reachable from a later phi node in the same block?) but ultimately doesn't matter because phis don't capture anyway.	2021-05-16 16:04:10 +02:00
Nikita Popov	88e5c8610b	Reapply [CaptureTracking] Do not check domination Reapply after adjusting the synchronized.m test case, where the TODO is now resolved. The pointer is only captured on the exception handling path. ----- For the CapturesBefore tracker, it is sufficient to check that I can not reach BeforeHere. This does not necessarily require that BeforeHere dominates I, it can also occur if the capture happens on an entirely disjoint path. This change was previously accepted in D90688, but had to be reverted due to large compile-time impact in some cases: It increases the number of reachability queries that are performed. After recent changes, the compile-time impact is largely mitigated, so I'm reapplying this patch. The remaining compile-time impact is largely proportional to changes in code-size.	2021-05-16 15:46:31 +02:00
Florian Hahn	f52bf2cd81	[Matrix] Fix some newpm check lines, which fail on some bots. (2)	2021-05-16 14:11:18 +01:00
Simon Pilgrim	c0cbb64a42	[X86][SSE] Pull out combineToHorizontalAddSub helper from inside (F)ADD/SUB combines (REAPPLIED). NFCI. The intention is to be able to run this from additional locations (such as shuffle combining) in the future. Reapplies rGb95a103808ac (after reversion at rGc012a388a15b), with SSE3/SSSE3 typo fix, test added at rG0afb10de1449.	2021-05-16 13:50:58 +01:00
Florian Hahn	ed4cb46d3b	[Matrix] Fix some newpm check lines, which fail on some bots.	2021-05-16 13:47:46 +01:00
Florian Hahn	c5a35017b8	[Matrix] Extend test to also check new PM pipeline.	2021-05-16 13:08:51 +01:00
Florian Hahn	fb2b5fc8db	[PhaseOrdering] Add test for insert/extract code for matrixes.	2021-05-16 12:52:20 +01:00
Florian Hahn	e3158361a1	[VectorCombine] Add tests with and & urem guaranteeing idx is valid.	2021-05-16 12:51:53 +01:00
Roman Lebedev	ecdf27a217	[NFC][X86][Costmodel] Add tests for load/store with i1 element type	2021-05-16 14:29:37 +03:00
Nikita Popov	35da6f5202	Revert "[CaptureTracking] Do not check domination" This reverts commit 6b8b43e7af3074124e3c9e429e1fb08165799be4. This causes clang test to fail (CodeGenObjC/synchronized.m). Revert until I can figure out whether that's an expected change.	2021-05-16 11:04:45 +02:00
Nikita Popov	e9761e11f7	[CaptureTracking] Do not check domination For the CapturesBefore tracker, it is sufficient to check that I can not reach BeforeHere. This does not necessarily require that BeforeHere dominates I, it can also occur if the capture happens on an entirely disjoint path. This change was previously accepted in D90688, but had to be reverted due to large compile-time impact in some cases: It increases the number of reachability queries that are performed. After recent changes, the compile-time impact is largely mitigated, so I'm reapplying this patch. The remaining compile-time impact is largely proportional to changes in code-size.	2021-05-16 10:49:36 +02:00
Nikita Popov	d123c17f13	[MemCpyOpt] Add test for unreachable capture (NFC) This is based on the test from D90688, without the argmemonly attribute. The argmemonly attribute would guarantee no modref by itself and the question of captures would not arise in the first place.	2021-05-16 10:48:52 +02:00
Jinsong Ji	5c2441043a	[AIX][AsmPrinter] Print Global Variable in comments The default AsmPrinter prints GV in comments, AIX should do so too. This also fixes LLVM :: CodeGen/Generic/inline-asm-mem-clobber.ll. Reviewed By: hubert.reinterpretcast Differential Revision: https://reviews.llvm.org/D102534	2021-05-16 03:04:46 +00:00
Tomasz Miąsko	5f48540e55	[Demangle][Rust] Parse trait definitions Part of https://reviews.llvm.org/D102549	2021-05-15 23:52:25 +02:00
Tomasz Miąsko	0529498484	[Demangle][Rust] Parse trait implementations Part of https://reviews.llvm.org/D102549	2021-05-15 23:52:25 +02:00
Tomasz Miąsko	160e922bf8	[Demangle][Rust] Parse inherent implementations Part of https://reviews.llvm.org/D102549	2021-05-15 23:52:25 +02:00
Kuter Dinel	ee05e77f81	[Attributor] Call site specific AAValueSimplification and AAIsDead. This patch makes it possible to do call site specific deductions for AAValueSimplification and AAIsDead. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D84722	2021-05-15 21:39:07 +00:00
Simon Pilgrim	a182c10403	[X86] Remove unused check-prefixes	2021-05-15 22:05:02 +01:00
Nikita Popov	0f909bf1fc	[CaptureTracking] Only check reachability for capture candidates Reachability queries are very expensive, and currently performed for each instruction we look at, even though most of them will not lead to a capture and are thus ultimately irrelevant. It is more efficient to walk a few unnecessary instructions than to perform unnecessary reachability queries. Theoretically, this may produce worse results, because the additional instructions considered may cause us to hit the use count limit earlier. In practice, this does not appear to be a problem, e.g. on test-suite O3 we report only one more captured-before with this change, with no resulting codegen differences. This makes PointerMayBeCapturedBefore() significantly cheaper in practice, hopefully allowing it to be used in more places.	2021-05-15 22:57:56 +02:00
Simon Pilgrim	12eb3255f8	[X86][SSE] Add SSE3 coverage to PHADD/SUB generation tests This would have caught the regression identified in rGc012a388a15b.	2021-05-15 21:25:46 +01:00
Alex Orlov	bd1969f7e2	NFC. Refactored DIPrinter to support embedded source. This patch introduces source loading and pruning functions. It allows using the DWARF embedded source and reusing the same code for JSON printout. No functional changes. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D102539	2021-05-15 23:01:12 +04:00
Pengxuan Zheng	3d401a1bf6	Support GCC's -fstack-usage flag This patch adds support for GCC's -fstack-usage flag. With this flag, a stack usage file (i.e., .su file) is generated for each input source file. The format of the stack usage file is also similar to what is used by GCC. For each function defined in the source file, a line with the following information is produced in the .su file. <source_file>:<line_number>:<function_name> <size_in_byte> <static/dynamic> "Static" means that the function's frame size is static and the size info is an accurate reflection of the frame size. While "dynamic" means the function's frame size can only be determined at run-time because the function manipulates the stack dynamically (e.g., due to variable size objects). The size info only reflects the size of the fixed size frame objects in this case and therefore is not a reliable measure of the total frame size. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D100509	2021-05-15 10:22:49 -07:00
Nico Weber	342cc0a8e8	Revert "[X86][SSE] Pull out combineToHorizontalAddSub helper from inside (F)ADD/SUB combines. NFCI." This reverts commit b95a103808acfd8f33290d3e80e28af434454b28. Makes clang assert very early in a Chromium build. See https://bugs.chromium.org/p/chromium/issues/detail?id=1209490#c1 for a standalone repro.	2021-05-15 12:20:02 -04:00
David Green	1a47831c3d	[ARM] Add an extra memset test showing reverted WLSTP loops. NFC	2021-05-15 16:48:58 +01:00
Simon Pilgrim	67ff1bc63e	[X86] X86OptimizeLEAPass::replaceDebugValue - take a copy of the DebugLoc not a reference as it may be deleted. Fixes msan warning due to rG9ca2c50b3601	2021-05-15 16:28:20 +01:00
Simon Pilgrim	eae2fe148f	[X86] X86CmovConverterPass::convertCmovInstsToBranches - take a copy of the DebugLoc not a reference as it may be deleted. Fixes msan warning due to rG9ca2c50b3601	2021-05-15 16:13:34 +01:00
Simon Pilgrim	919f54daef	Revert rG632668c1c0e7dcf97154d2e377491cdc8cb6963c : "[TableGen] TreePatternNode::isIsomorphicTo - early out for matching leafs. NFCI." Revert premature (and very broken....) experimental commit.	2021-05-15 15:49:14 +01:00
Simon Pilgrim	7e5bfbbe69	[TableGen] TreePatternNode::isIsomorphicTo - early out for matching leafs. NFCI. If the leafs are the same then no need to perform DefInit matching.	2021-05-15 15:34:43 +01:00


216073 Commits