llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Nikita Popov	5386c2c329	[ConstProp] Add test for bitcast to gep fold; NFC	2020-03-04 18:27:20 +01:00
Nikita Popov	9fbee2d627	[InstSimplify] Add additional icmp of gep folding test; NFC	2020-03-04 18:27:01 +01:00
Nikita Popov	971327d627	[InstSimplify] Regenerate compare.ll checks; NFC	2020-03-04 18:26:42 +01:00
Nikita Popov	81d4b8478e	[ConstantFolding] Always return something from ConstantFoldConstant Spin-off from D75407. As described there, ConstantFoldConstant() currently returns null for non-ConstantExpr/ConstantVector inputs, but otherwise always returns non-null, independently of whether any folding has happened or not. This is confusing and makes consumer code more complicated. I would expect either that ConstantFoldConstant() returns only if it actually folded something, or that it always returns non-null. I'm going to the latter possibility here, which appears to be more useful considering existing usage. Differential Revision: https://reviews.llvm.org/D75543	2020-03-04 18:24:47 +01:00
Craig Topper	cc6a0a3760	[X86] Directly form VBROADCAST_LOAD in lowerShuffleAsBroadcast on AVX targets. If we would emit a VBROADCAST node, we can instead directly emit a VBROADCAST_LOAD. This allows us to get rid of the special case to use an f64 load on 32-bit targets for vXi64. I believe there is more cleanup we can do later in this function, but I'll do that in follow ups.	2020-03-04 09:11:57 -08:00
Simon Pilgrim	00fd3c059e	[X86] Add tests showing failure to combine consecutive loads + FSHR into a single load Similar to some of the regressions seen in D75114	2020-03-04 17:07:03 +00:00
Simon Pilgrim	f59127037c	[X86] Add tests showing failure to combine consecutive loads + FSHL into a single load Similar to some of the regressions seen in D75114	2020-03-04 17:07:02 +00:00
Raphael Isemann	bf6b44780d	Fix modules build after MatrixBuilder patch The addition of MatrixBuilder.h broke the modules build: ``` While building module 'LLVM_intrinsic_gen' imported from llvm-project/llvm/lib/IR/AbstractCallSite.cpp:19: While building module 'LLVM_IR' imported from llvm-project/llvm/include/llvm/IR/Argument.h:19: In file included from <module-includes>:6: llvm-project/llvm/include/llvm/IR/MatrixBuilder.h:19:10: fatal error: cyclic dependency in module 'LLVM_intrinsic_gen': LLVM_intrinsic_gen -> LLVM_IR -> LLVM_intrinsic_gen ^ While building module 'LLVM_intrinsic_gen' imported from llvm-project/llvm/lib/IR/AbstractCallSite.cpp:19: In file included from <module-includes>:1: llvm-project/llvm/include/llvm/IR/Argument.h:19:10: fatal error: could not build module 'LLVM_IR' ~~~~~~~~^~~~~~~~~~~~~~~~~ llvm-project/llvm/lib/IR/AbstractCallSite.cpp:19:10: fatal error: could not build module 'LLVM_intrinsic_gen' ~~~~~~~~^~~~~~~~~~~~~~~~~~~~ ```	2020-03-04 09:03:34 -08:00
Sanjay Patel	8f04b72eb3	[PassManager] adjust VectorCombine placement The initial placement of vector-combine in the opt pipeline revealed phase ordering bugs: https://bugs.llvm.org/show_bug.cgi?id=45015 https://bugs.llvm.org/show_bug.cgi?id=42022 This patch contains a few independent changes: 1. Move the pass up in the pipeline, so it happens just after loop-vectorization. This is only to keep vectorization passes together in the pipeline at the moment. I don't have evidence of interaction between these yet. 2. Add an -early-cse pass after -vector-combine to clean up redundant ops. This was partly proposed as far back as rL219644 (which is why it's effectively being moved in the old PM code). This is important because the subsequent -instcombine doesn't work as well without EarlyCSE. With the CSE, -instcombine is able to squash shuffles together in 1 of the tests (because those are simple "select" shuffles). 3. Remove the -vector-combine pass that was running after SLP. We may want to do that eventually, but I don't have a test case to support it yet. Differential Revision: https://reviews.llvm.org/D75145	2020-03-04 11:10:49 -05:00
Sanjay Patel	611aa86281	[SDAG] simplify FP binops to undef As discussed in the commit thread for rGa253a2a and D73978, we can do more undef folding for FP ops. The nnan and ninf fast-math-flags specify that if an operand is the disallowed value, the result is poison, so we can produce an undef result. But this doesn't work as expected (the undef operand cases remain) because of a Flags propagation problem in SelectionDAGBuilder. I've added DAGCombiner calls to enable these for the other cases because we've shown in other patches that (because of the limited way that SDAG iterates), it is possible to miss simplifications like this if they are done only at node creation time. Several potential follow-ups to expand on this patch are possible. Differential Revision: https://reviews.llvm.org/D75576	2020-03-04 10:42:16 -05:00
David Green	a701432f58	[ARM] Change all tests from "thumbv8.1-m.main" to "thumbv8.1m.main". NFC	2020-03-04 13:47:35 +00:00
Evgeniy Brevnov	c9c4c14d22	Lost regression test from commit 5a63813dc7f.	2020-03-04 19:52:42 +07:00
Pavel Labath	4b7edf8a76	Use new DWARFDataExtractor::getInitialLength in DWARFDebugFrame	2020-03-04 13:01:35 +01:00
Pavel Labath	5b780ab14f	Use new DWARFDataExtractor::getInitialLength in DWARFDebugPubTable	2020-03-04 13:01:35 +01:00
Pavel Labath	0777f5891c	Use new DWARFDataExtractor::getInitialLength in DWARFUnit	2020-03-04 13:01:35 +01:00
Pavel Labath	bd8f009db3	Use new DWARFDataExtractor::getInitialLength in DWARFVerifier	2020-03-04 13:01:34 +01:00
Pavel Labath	9ccd43f14c	Use DWARFDataExtractor::getInitialLength in debug_aranges Summary: getInitialLength is a DWARFDataExtractor method so I had to "upgrade" some DataExtractors to be able to make use of it. Reviewers: ikudrin, jhenderson, probinson Subscribers: aprantl, hiraditya, llvm-commits, dblaikie Tags: #llvm Differential Revision: https://reviews.llvm.org/D75535	2020-03-04 13:01:07 +01:00
Pavel Labath	34adeccc0f	Use DWARFDataExtractor::getInitialLength in DWARFDebugAddr Reviewers: ikudrin, jhenderson, probinson Subscribers: hiraditya, dblaikie, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75532	2020-03-04 13:00:15 +01:00
Kerry McLaughlin	50040da475	[AArch64][SVE] Add SVE2 intrinsic for xar Summary: Implements the @llvm.aarch64.sve.xar intrinsic Reviewers: andwar, c-rhodes, dancgr, efriedma, rengolin Reviewed By: andwar Subscribers: tschuett, kristof.beyls, hiraditya, rkruppe, psnobl, cfe-commits, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75160	2020-03-04 11:44:32 +00:00
Evgeniy Brevnov	434024e6e4	[DependenceAnalysis] Dependecies for loads marked with "ivnariant.load" should not be shared with general accesses(PR42151). Summary: This is second attempt to fix the problem with incorrect dependencies reported in presence of invariant load. Initial fix (https://reviews.llvm.org/D64405) was reverted due to a regression reported in https://reviews.llvm.org/D70516. The original fix changed caching behavior for invariant loads. Namely such loads are not put into the second level cache (NonLocalDepInfo). The problem with that fix is the first level cache (CachedNonLocalPointerInfo) still works as if invariant loads were in the second level cache. The solution is in addition to not putting dependence results into the second level cache avoid putting info about invariant loads into the first level cache as well. Reviewers: jdoerfert, reames, hfinkel, efriedma Reviewed By: jdoerfert Subscribers: DaniilSuchkov, hiraditya, bmahjour, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D73027	2020-03-04 18:40:02 +07:00
Simon Pilgrim	bc217d930d	[AMDGPU] performCvtF32UByteNCombine - revisit node after src operand simplification. If SimplifyDemandedBits succeeds in simplifying the byte src, add the CVT_F32_UBYTE node back to the worklist as we might be able to simplify further. Yet another step towards removing SelectionDAG::GetDemandedBits.	2020-03-04 11:25:50 +00:00
Florian Hahn	24ce4bb4a9	[Matrix] Add IR MatrixBuilder. This builder provides a convenient way for targets to lower various matrix operations to LLVM IR, making use of matrix intrinsics where available. Reviewers: anemet, Gerolf, hfinkel, andrew.w.kaylor, LuoYuanke Reviewed By: anemet Differential Revision: https://reviews.llvm.org/D72280	2020-03-04 11:14:20 +00:00
gbreynoo	3b4c521b04	[llvm-ar][test] Add to llvm-ar test coverage - Added handling of thin archives to symtab.test. - Added handling of newlines to response.test. - 62fa3332c9c1af1e66dfecd40f5b4e78882998b2 exposed behaviour regarding the use of -- on the command line. Added double-hyphen.test to cover this. Differential Revision: https://reviews.llvm.org/D73333	2020-03-04 10:56:48 +00:00
Georgii Rymar	3a864f79c9	[Object/ELF] - Fix the offset type used in ELFFile<ELFT>::getEntry(). We use size_t for a file offset what is wrong, because size_t is 32-bit value on 32-bit platforms. I was reported that after my 0b511c23021 "[llvm-readobj] - Report warnings instead of errors for broken relocations." The following error is observed on 32-bit Arch Linux: [100%] Running all regression tests FAIL: LLVM :: tools/llvm-readobj/ELF/relocation-errors.test (52954 of 54768) ****************** TEST 'LLVM :: tools/llvm-readobj/ELF/relocation-errors.test' FAILED * ... llvm-project/llvm/test/tools/llvm-readobj/ELF/relocation-errors.test:9:14:error: LLVM-NEXT: expected string not found in input # LLVM-NEXT: warning: '[[FILE]]': unable to print relocation 1 in section 3: unable to access section [index 6] data at 0x17e7e7e8b0: offset goes past the end of file ^ <stdin>:9:1: note: scanning from here /llvm-project/build/bin/llvm-readobj: warning: 'llvm-project/build/test/tools/llvm-readobj/ELF/Output/relocation-errors.test.tmp64': unable to print relocation 1 in section 3: unable to access section [index 6] data at 0xe7e7e8b0: offset goes past the end of file This patch should fix the issue.	2020-03-04 12:33:10 +03:00
Simon Tatham	0c088632cd	[ARM,MVE] Add the `vshlcq` intrinsics. Summary: The VSHLC instruction performs a left shift of a whole vector register by an immediate shift count up to 32, shifting in new bits at the low end from a GPR and delivering the shifted-out bits from the high end back into the same GPR. Since the instruction produces two outputs (the shifted vector register and the output GPR of shifted-out bits), it has to be instruction-selected in C++ rather than Tablegen. Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75445	2020-03-04 08:49:27 +00:00
Simon Tatham	d0b4a2caa5	[ARM,MVE] Add the `vsbciq` intrinsics. Summary: These are exactly parallel to the existing `vadciq` intrinsics, which we implemented last year as part of the original MVE intrinsics framework setup. Just like VADC/VADCI, the MVE VSBC/VSBCI instructions deliver two outputs, both of which the intrinsic exposes: a modified vector register and a carry flag. So they have to be instruction-selected in C++ rather than Tablegen. However, in this case, that's trivial: the same C++ isel routine we already have for VADC works unchanged, and all we have to do is to pass it a different instruction id. Reviewers: MarkMurrayARM, dmgreen, miyuki, ostannard Reviewed By: miyuki Subscribers: kristof.beyls, hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75444	2020-03-04 08:49:27 +00:00
Craig Topper	f9e1c39f52	[X86] Directly form VBROADCAST_LOAD for BUILD_VECTOR of splat loads in lowerBuildVectorAsBroadcast.	2020-03-03 22:27:34 -08:00
Juneyoung Lee	427b0f52ac	[ValueTracking] Let isGuaranteedNotToBeUndefOrPoison look into branch conditions of dominating blocks' terminators Summary: ``` br i1 c, BB1, BB2: BB1: use1(c) BB2: use2(c) ``` In BB1 and BB2, c is never undef or poison because otherwise the branch would have triggered UB. Checked with Alive2 Reviewers: xbolva00, spatel, lebedev.ri, reames, jdoerfert, nlopes, sanjoy Reviewed By: reames Subscribers: jdoerfert, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75401	2020-03-04 11:43:31 +09:00
Amara Emerson	64e174c923	[GlobalISel][Localizer] Enable intra-block localization of already-local uses. This changes the localizer to attempt intra-block localizer of instructions that have local uses. This is useful because sometimes the entry block itself has many uses of constant-like instructions, which would benefit from shortening live ranges. Previously if an inst had no non-local uses, we wouldn't add it to the list of instructions to attempt further intra-block localization. This gives a 0.7% geomean code size improvement on CTMark. Differential Revision: https://reviews.llvm.org/D75555	2020-03-03 18:14:57 -08:00
Fangrui Song	d9ac4ee31f	[MC][test] Improve some llvm-objdump -t tests Delete two redundant tests.	2020-03-03 17:27:06 -08:00
Fangrui Song	a84de5bb19	[gn build] Fix llvm-gsymutil after D75291	2020-03-03 16:37:52 -08:00
Fangrui Song	a0c1558472	[MCDwarf] Change emitListsTableHeaderStart to use a reference and fold Start/End symbols generation into it Apply @dblaikie's suggestions in a post-commit review for D75375 Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D75568	2020-03-03 16:20:40 -08:00
Lang Hames	af3fbf0d88	[ORC] Skip ST_File symbols in MaterializationUnit interfaces / resolution. ST_File symbols aren't relevant for linking purposes, but can end up shadowing real symbols if they're not filtered. No test case yet: The ideal testcase for this would be an ELF llvm-jitlink test, but llvm-jitlink support for ELF is still under development. We should add a testcase for this once support lands in tree.	2020-03-03 16:15:44 -08:00
Stefanos Baziotis	53ede36a1c	[LoopTerminology][NFC] Fix typo	2020-03-04 02:12:33 +02:00
Greg Clayton	e58a46dc29	Fix buildbots by including MC for StringTableBuilder.	2020-03-03 15:52:51 -08:00
Lang Hames	6ba74a0e2f	[JITLink] Add a -slab-address option to llvm-jitlink. This option can be used to for JITLink to link as-if the target memory slab were allocated at a specific start address. This can be used to both verify that cross-address space linking is working correctly, and to ensure that certain address-sensitive optimizations (e.g. GOT and stub elimination) either do or do not fire, depending on the requirements of the test case. This argument is only valid for testing in conjunction with -noexec -slab-alloc, and will produce an error if used without those arguments.	2020-03-03 14:25:51 -08:00
Matt Arsenault	1e680749e8	LICM: Reorder condition checks Check the fast math flag before the more expensive loop check.	2020-03-03 17:15:57 -05:00
Matt Arsenault	583ac72b46	AMDGPU: Fix computation for getOccupancyWithLocalMemSize The computation here didn't really make sense to me, and reported wildy different results depending on the flat work group size attribute. I think this should really report a range derived from the possible work group size bounds, and only allow an occupancy that is a multiple of the group size.	2020-03-03 17:15:57 -05:00
Brian Gesiak	1388dae917	[Coroutines] Use dbg.declare for frame variables Summary: https://gist.github.com/modocache/ed7c62f6e570766c0f39b35dad675c2f is an example of a small C++ program that uses C++20 coroutines that is difficult to debug, due to the loss of debug info for variables that "spill" across coroutine suspension boundaries. This patch addresses that issue by inserting 'llvm.dbg.declare' intrinsics that point the debugger to the variables' location at an offset to the coroutine frame. With this patch, I confirmed that running the 'frame variable' commands in https://gist.github.com/modocache/ed7c62f6e570766c0f39b35dad675c2f at the specified breakpoints results in the correct values being printed for coroutine frame variables 'i' and 'j' when using an lldb built from trunk, as well as with gdb 8.3 (lldb 9.0.1, however, could not print the values). The added test case also verifies this improved behavior. The existing coro-debug.ll test case is also modified to reflect the locations at which Clang actually places calls to 'dbg.declare', and additional checks are added to ensure this patch works as intended in that example as well. Reviewers: vsk, jmorse, GorNishanov, lewissbaker, wenlei Subscribers: EricWF, aprantl, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75338	2020-03-03 17:13:46 -05:00
Greg Clayton	5516acaf24	Rename "llvm-gsym" to "llvm-gsymutil" and fix dependencies. Summary: This patch renames the "llvm-gsym" tool directory to "llvm-gsymutil". Dependencies are also reduced to the bare minimum for llvm-gsymutil. Reviewers: aprantl, thakis Subscribers: mgorny, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D75291	2020-03-03 14:13:29 -08:00
Amy Huang	84aaa5f928	[DebugInfo] Fix for adding "returns cxx udt" option to functions in CodeView. Summary: This change checks for the return type in the frontend and adds a flag to the DISubroutineType to indicate that the option should be added in CodeViewDebug. Previously function types sometimes appeared twice in the PDB: once with "returns cxx udt" and once without. See https://bugs.llvm.org/show_bug.cgi?id=44785. Reviewers: rnk, asmith Subscribers: hiraditya, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D75215	2020-03-03 14:00:08 -08:00
Lang Hames	efa76b5281	[JITLink] Fix a pointer-to-integer cast in jitlink::InProcessMemoryManager. reinterpret_cast'ing the block base address directly to a uint64_t leaves the high bits in an implementation-defined state, but JITLink expects them to be zero. Switching to pointerToJITTargetAddress for the cast should fix this. This should fix the jitlink test failures that we have seen on some of the 32-bit testers.	2020-03-03 13:53:00 -08:00
Sanjay Patel	deb075f201	[AArch64] add tests for nnan/ninf/undef FP simplifications; NFC	2020-03-03 16:38:58 -05:00
Vedant Kumar	cd3362f222	test: Adjust no-dbg-value-after-terminator.mir to use `not --crash`	2020-03-03 13:30:31 -08:00
Sanjay Patel	c045acf693	[PowerPC] adjust test to avoid getting zapped completely; NFC div-by-0 -> Inf The math ops are 'fast' so 'ninf' applies and the whole thing is undef.	2020-03-03 16:15:48 -05:00
Vedant Kumar	2a9d79a776	[MachineVerifier] Remove placement rule exception for debug entry values There should not be an exception allowing debug entry values to be placed after a terminator. Differential Revision: https://reviews.llvm.org/D75559	2020-03-03 13:02:18 -08:00
Vedant Kumar	4933893b80	[LiveDebugValues] Do not insert DBG_VALUEs after a MBB terminator This fixes a miscompile that happened because a DBG_VALUE interfered with the MachineOutliner's liveness analysis. Inserting a DBG_VALUE after a terminator breaks predicates on MBB such as isReturnBlock(). And the resulting DBG_VALUE cannot be "live". I plan to introduce a MachineVerifier check for this situation in a follow up. rdar://59859175 Testing: check-llvm, LNT build with a stage2 compiler & entry values enabled Differential Revision: https://reviews.llvm.org/D75548	2020-03-03 13:00:52 -08:00
Craig Topper	ed198c45c9	[X86] Match vpmullq latency to uops.info. Correct port usage for 512-bit memory form uops.info says these should be 15 cycle instructions. Uops.info also shows the 512-bit form uses port 0 and 5 for both register and memory. We had memory using 0 and 1. Differential Revision: https://reviews.llvm.org/D75549	2020-03-03 12:16:03 -08:00
Stefan Stipanovic	cc327f9710	Revert "[OpenMP] Adding InaccessibleMemOnly and InaccessibleMemOrArgMemOnly for runtime calls." This reverts commit 9989b859efccafacb0cc1f8d393d8b9fc49f4037.	2020-03-03 20:42:05 +01:00
Stefan Stipanovic	42a7f787c5	[OpenMP] Adding InaccessibleMemOnly and InaccessibleMemOrArgMemOnly for runtime calls. Summary: Attempt to add more attributes for runtime calls. Reviewers: jdoerfertA, ggeorgakoudis, lebedev.ri, dreachem Subscribers:	2020-03-03 20:32:22 +01:00

1 2 3 4 5 ...

192996 Commits