llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-01-31 12:41:49 +01:00

Author	SHA1	Message	Date
Vedant Kumar	59c8fa2921	[DebugInfo] Add support for DWARF5 call site-related attributes DWARF v5 introduces DW_AT_call_all_calls, a subprogram attribute which indicates that all calls (both regular and tail) within the subprogram have call site entries. The information within these call site entries can be used by a debugger to populate backtraces with synthetic tail call frames. Tail calling frames go missing in backtraces because the frame of the caller is reused by the callee. Call site entries allow a debugger to reconstruct a sequence of (tail) calls which led from one function to another. This improves backtrace quality. There are limitations: tail recursion isn't handled, variables within synthetic frames may not survive to be inspected, etc. This approach is not novel, see: https://gcc.gnu.org/wiki/summit2010?action=AttachFile&do=get&target=jelinek.pdf This patch adds an IR-level flag (DIFlagAllCallsDescribed) which lowers to DW_AT_call_all_calls. It adds the minimal amount of DWARF generation support needed to emit standards-compliant call site entries. For easier deployment, when the debugger tuning is LLDB, the DWARF requirement is adjusted to v4. Testing: Apart from check-{llvm, clang}, I built a stage2 RelWithDebInfo clang binary. Its dSYM passed verification and grew by 1.4% compared to the baseline. 151,879 call site entries were added. rdar://42001377 Differential Revision: https://reviews.llvm.org/D49887 llvm-svn: 343883	2018-10-05 20:37:17 +00:00
Sanjay Patel	b48b336682	[x86] make blend tests resistant to demanded elements improvements; NFC Similar to rL343858 - we don't want these tests to lose value with D52912. llvm-svn: 343882	2018-10-05 20:26:54 +00:00
Reid Kleckner	31de553b45	Fix dwarf-no-source-loc.ll path separator on Windows llvm-svn: 343880	2018-10-05 19:46:51 +00:00
Matthias Braun	b0b4d6c795	DwarfDebug: Pick next location in case of missing location at block begin Context: Compiler-generated instructions do not have a debug location assigned to them. However, emitting line-0 records for all of them bloats the line tables for very little benefit, so we usually avoid doing that. Not emitting anything will lead to the previous debug location getting applied to the locationless instructions. This is not desirable for block begin and after labels. Previously we would simply emit line-0 records in this case; this patch changes the behavior to do a forward search for a debug location in these cases before emitting a line-0 record, to further reduce line table bloat. Inspired by the discussion in https://reviews.llvm.org/D52862 llvm-svn: 343874	2018-10-05 18:29:24 +00:00
Alex Bradbury	48db9e2c0c	[RISCV] Regenerate several tests now enableMultipleCopyHints is enabled by default r343851 caused codegen changes in several tests. This patch regenerates them. llvm-svn: 343873	2018-10-05 18:25:55 +00:00
Craig Topper	b2fdfdff54	[X86] Don't promote i16 compares to i32 if the immediate will fit in 8 bits. The comments in this code say we were trying to avoid 16-bit immediates, but if the immediate fits in 8-bits this isn't an issue. This avoids creating a zero extend that probably won't go away. The movmskb related changes are interesting. The movmskb instruction writes a 32-bit result, but fills the upper bits with 0. So the zero_extend we were previously emitting was free, but we turned a -1 immediate that would fit in 8-bits into a 32-bit immediate so it was still bad. llvm-svn: 343871	2018-10-05 18:13:36 +00:00
Simon Pilgrim	f27e2128ee	[X86] Move ReadAfterLd functionality into X86FoldableSchedWrite (PR36957) Currently we hardcode instructions with ReadAfterLd if the register operands don't need to be available until the folded load has completed. This doesn't take into account the different load latencies of different memory operands (PR36957). This patch adds a ReadAfterFold def into X86FoldableSchedWrite to replace ReadAfterLd, allowing us to specify the load latency at a scheduler class level. I've added ReadAfterVec*Ld classes that match the XMM/Scl, XMM and YMM/ZMM WriteVecLoad classes that we currently use, we can tweak these values in future patches once this infrastructure is in place. Differential Revision: https://reviews.llvm.org/D52886 llvm-svn: 343868	2018-10-05 17:57:29 +00:00
Sanjay Patel	01f181a636	[SelectionDAG] allow undefs when matching splat constants And use that to transform fsub with zero constant operands. The integer part isn't used yet, but it is proposed for use in D44548, so adding both enhancements here makes that patch simpler. llvm-svn: 343865	2018-10-05 17:42:19 +00:00
Adrian Prantl	cc5583af00	Format the dwarfdump --statistics version as an integer instead of a string. llvm-svn: 343864	2018-10-05 17:41:30 +00:00
Sanjay Patel	9b272ef9b1	[x86] add test for (X - 0.0) vector with undef elts; NFC llvm-svn: 343863	2018-10-05 17:36:51 +00:00
Simon Pilgrim	82a21c5635	[X86][SSE] Try to make MOVLPS/MOVHPS(+PD) instructions SimplifyDemandedElts proof Fix for D52912, which was simplifying MOVLPS/MOVHPS(+PD) instructions as the tests were only touching one of the vector halves. llvm-svn: 343858	2018-10-05 15:50:18 +00:00
Sanjay Patel	a3245a125f	[x86] regenerate full checks; NFC llvm-svn: 343855	2018-10-05 14:56:14 +00:00
Sanjay Patel	0d1162d526	[x86] add test for fneg matching failure; NFC llvm-svn: 343854	2018-10-05 14:49:20 +00:00
Simon Pilgrim	dc140c3b4d	[X86][AVX] getFauxShuffleMask - add support for INSERT_SUBVECTOR subvector shuffles Decode subvector shuffles from INSERT_SUBVECTOR(SRC0, SHUFFLE(EXTRACT_SUBVECTOR(SRC1)) This was found necessary while investigating PR39161 llvm-svn: 343853	2018-10-05 14:41:00 +00:00
Jonas Paulsson	00b41c999f	[LoopVectorizer] Use TTI.getOperandInfo() Call getOperandInfo() instead of using (near) duplicated code in LoopVectorizationCostModel::getInstructionCost(). This gets the OperandValueKind and OperandValueProperties values for a Value passed as operand to an arithmetic instruction. getOperandInfo() used to be a static method in TargetTransformInfo.cpp, but is now instead a public member. Review: Florian Hahn https://reviews.llvm.org/D52883 llvm-svn: 343852	2018-10-05 14:34:04 +00:00
Jonas Paulsson	d362ca6156	[TargetRegisterInfo] Remove temporary hook enableMultipleCopyHints() Finally all targets are enabling multiple regalloc hints, so the hook to disable this can now be removed. NFC. Review: Simon Pilgrim https://reviews.llvm.org/D52316 llvm-svn: 343851	2018-10-05 14:23:11 +00:00
Neil Henning	c09490e32b	Add missing period to comment to match style of file. This is a test commit to show that my commit access is working. llvm-svn: 343842	2018-10-05 09:39:07 +00:00
Tom Stellard	9272baed4e	AMDGPU/GlobalISel: Add support for G_INTTOPTR Summary: This is a no-op. Reviewers: arsenm Reviewed By: arsenm Subscribers: kzhuravl, jvesely, wdng, nhaehnle, yaxunl, rovka, kristof.beyls, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D52916 llvm-svn: 343839	2018-10-05 04:34:09 +00:00
Thomas Lively	3cb6cbd893	[WebAssembly] Saturating arithmetic intrinsics Summary: Depends on D52805. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52813 llvm-svn: 343833	2018-10-05 00:45:20 +00:00
Shoaib Meenai	c26a80208b	[cmake] Also create lowercase extension WinSDK symlinks Some projects rely on using libraries from the Windows SDK with their original casing, just with a lowercase extension. E.g. the WinSock2 lib is named WS2_32.Lib in the Windows SDK, and we would previously only create a ws2_32.lib symlink for it (i.e. all lowercase). Also create a WS2_32.lib symlink (i.e. original casing with lowercase extension) to cover users of this casing. As a drive-by fix, only create these symlinks when they differ from the original name to reduce the amount of noise in the library symlinks directory. llvm-svn: 343832	2018-10-05 00:08:27 +00:00
Wouter van Oortmerssen	5f74590867	[WebAssembly] Fixed missing "global" symbol type in AsmParser. Summary: These are emitted by the wasm backend for e.g. __stack_pointer@GLOBAL which previously wasn't accepted by the assembler. Reviewers: aheejin, dschuff Subscribers: sbc100, jgravelle-google, llvm-commits, sunfish Differential Revision: https://reviews.llvm.org/D52911 llvm-svn: 343830	2018-10-04 23:48:53 +00:00
Daniel Sanders	9be5c566f7	[globalisel][combine] When placing truncates, handle the case when the BB is empty GlobalISel uses MIR with implicit fallthrough on each basic block. As a result, getFirstNonPhi() can return end(). llvm-svn: 343829	2018-10-04 23:47:37 +00:00
Craig Topper	6dff1dbb58	[SimplifyCFG] Pass AggressiveInsts to DominatesMergePoint by reference. Remove null check. Summary: At some point in the past the recursion in DominatesMergePoint used to pass null for AggressiveInsts as part of the recursion. It no longer does this. So there is no way for AggressiveInsts to be null. This passes it by reference and removes the null check to make this explicit. Reviewers: efriedma, reames Reviewed By: efriedma Subscribers: xbolva00, llvm-commits Differential Revision: https://reviews.llvm.org/D52575 llvm-svn: 343828	2018-10-04 23:40:31 +00:00
Yury Delendik	8483e906cc	[WebAssembly] Ignore DBG_VALUE in WebAssemblyCFGStackify pass when looking for block start Summary: Fixes https://bugs.llvm.org/show_bug.cgi?id=39158 and the regression caused by D49034. It is possible, though, that the problem existed before and was exposed by additional DBG_VALUEs. Reviewers: sunfish, dschuff, aheejin Reviewed By: aheejin Subscribers: sbc100, aheejin, llvm-commits, alexcrichton, jgravelle-google Differential Revision: https://reviews.llvm.org/D52837 llvm-svn: 343827	2018-10-04 23:31:00 +00:00
Matt Davis	3135cf4130	[llvm-mca] Remove unused/stale forward decl. NFC. llvm-svn: 343823	2018-10-04 22:16:39 +00:00
Ana Pazos	c6b3072fd0	[RISCV] Support named operands for CSR instructions. Reviewers: asb, mgrang Reviewed By: asb Subscribers: jocewei, mgorny, jfb, PkmX, MartinMosbeck, brucehoult, the_o, rkruppe, rogfer01, rbar, johnrusso, simoncook, jordy.potman.lists, sabuasal, niosHD, kito-cheng, shiva0217, zzheng, edward-jones Differential Revision: https://reviews.llvm.org/D46759 llvm-svn: 343822	2018-10-04 21:50:54 +00:00
Daniel Sanders	c0ac42c9ba	[globalisel][combine] Fix a rare crash when encountering an instruction whose op0 isn't a reg The simplest instance of this is an intrinsic with no results which will have the intrinsic ID as operand 0. Also fix some benign incorrectness when op0 is a reg but isn't a def that was guarded against by checking for the extension opcodes. llvm-svn: 343821	2018-10-04 21:44:32 +00:00
Sanjay Patel	091a0f0213	[InstCombine] drop poison flags in SimplifyVectorDemandedElts We established the (unfortunately complicated) rules for UB/poison propagation with vector ops in: D48893 D48987 D49047 It's clear from the affected tests that we are potentially creating poison where none existed before the transforms. For add/sub/mul, the answer is simple: just drop the flags because the extra undef vector lanes are generally more valuable for analysis and codegen. llvm-svn: 343819	2018-10-04 21:36:50 +00:00
Craig Topper	c3846919b0	[X86][LegalizeVectorOps] Use MERGE_VALUES to return two results from LowerLoad. Remove special case code in LegalizeVectorOps that allowed us to only return one result. Previously we replaced the chain use ourselves and returned the data result. LegalizeVectorOps then detected that we'd done this and assumed the chain had already been handled. This commit instead returns a MERGE_VALUES node with two results joined from nodes. This allows LegalizeVectorOps to do all the replacements for us without any special casing. The MERGE_VALUES will be removed by DAG combine. llvm-svn: 343817	2018-10-04 21:24:24 +00:00
Craig Topper	f88ea9afaf	[SimplifyCFG] Change recursive calls to llvm::SimplifyCFG to instead use an outer while loop to revisit. Summary: The llvm::SimplifyCFG function creates a SimplifyCFGOpt object and calls run on it. There were numerous places reached from this run function that called back out to llvm::SimplifyCFG, which would create another SimplifyCFGOpt object. This is an inefficient use of stack space at minimum. We are also not passing along the LoopHeaders pointer passed into the outer llvm::SimplifyCFG call. So if it's not null we lose it on the first recursion and get nullptr from there on. This patch adds an outer loop around the main BasicBlock simplifying code and adds a flag to the SimplifyCFGOpt class that can be set to request another iteration. I don't think we can iterate based just on the change flag alone, since some of the simplifications delete a basic block entirely, leaving nothing to iterate on. Reviewers: bogner, eli.friedman, reames Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52760 llvm-svn: 343816	2018-10-04 21:11:52 +00:00
Heejin Ahn	79633d4709	[WebAssembly] Don't modify preds/succs iterators while erasing from them Summary: This caused out-of-bound bugs. Found by `-DLLVM_ENABLE_EXPENSIVE_CHECKS=ON`. Reviewers: dschuff Subscribers: sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D52902 llvm-svn: 343814	2018-10-04 21:03:35 +00:00
Konstantin Zhuravlyov	82b977316e	AMDGPU: Rename isAmdCodeObjectV2 -> isAmdHsaOrMesa The isAmdCodeObjectV2 is a misleading name which actually checks whether the os is amdhsa or mesa. Also add a test to make sure we do not generate old kernel header for code object v3. Differential Revision: https://reviews.llvm.org/D52897 llvm-svn: 343813	2018-10-04 21:02:16 +00:00
Martin Storsjo	27aec8037f	[COFF] [X86] Don't use llvm_unreachable for unsupported relocation types This can happen if assembling a reference to _GLOBAL_OFFSET_TABLE_. While it doesn't make sense to try to assemble that for COFF, the fact that we previously used llvm_unreachable meant that the code had undefined behaviour if something tried to assemble that. The configure script of libgmp would try to assemble such a snippet (which should signal a failure). If llvm is built without assertions, the undefined behaviour meant a (near) infinite loop. Differential Revision: https://reviews.llvm.org/D52903 llvm-svn: 343811	2018-10-04 20:43:38 +00:00
Sanjay Patel	de79e7e9c8	[InstCombine] reduce code duplication in SimplifyDemandedVectorElts; NFCI llvm-svn: 343806	2018-10-04 19:12:07 +00:00
James Y Knight	699b43fe0e	Give same-named members unique timestamps on Darwin in llvm-ar. This change ensures that the (membername,timestamp) tuple uniquely identifies an entry in an archive for format=darwin, in deterministic mode (which is the default). That, then, enables lldb and dsymutil to locate the appropriate object within the archive. Differential Revision: https://reviews.llvm.org/D47659 llvm-svn: 343805	2018-10-04 18:49:21 +00:00
Daniel Sanders	b02a6ea2c1	[globalisel][combine] Improve the truncate placement for the extending-loads combine This brings the extending loads patch back to the original intent but minus the PHI bug and with another small improvement to de-dupe truncates that are inserted into the same block. The truncates are sunk to their uses unless this would require inserting before a phi in which case it sinks to the _beginning_ of the predecessor block for that path (but no earlier than the def). The reason for choosing the beginning of the predecessor is that it makes de-duping multiple truncates in the same block simple, and optimized code is going to run a scheduler at some point which will likely change the position anyway. llvm-svn: 343804	2018-10-04 18:44:58 +00:00
Sanjay Patel	ce7988126e	[x86] add test for SSE sqrtss register dep (PR22206) llvm-svn: 343803	2018-10-04 17:59:30 +00:00
Matthias Braun	1f941370f1	AArch64: Fix XSeqPairs/WSeqPairs problems - Fix spill/reloads of XSeqPairs failing with vregs (only physregs worked correctly) - Add missing spill/reload code for WSeqPairs class Differential Revision: https://reviews.llvm.org/D52761 llvm-svn: 343799	2018-10-04 17:02:53 +00:00
Farhana Aleen	c2106b631d	[AMDGPU] Match signed dot4/8 pattern. Summary: This patch matches signed dot4 and dot8 pattern. Author: FarhanaAleen Reviewed By: msearles Differential Revision: https://reviews.llvm.org/D52520 llvm-svn: 343798	2018-10-04 16:57:37 +00:00
Simon Pilgrim	b7a3435a1e	[llvm-mca][x86] Add PR36951 ReadAfterLd test case llvm-svn: 343795	2018-10-04 16:26:56 +00:00
Sanjay Patel	fd5c125b13	[InstCombine] allow bitcast to/from FP for vector insert/extract transform This is a follow-up to rL343482 / D52439. This was a pattern that initially caused the commit to be reverted because the transform requires a bitcast as shown here. llvm-svn: 343794	2018-10-04 16:25:05 +00:00
Andrea Di Biagio	9c6205b103	[llvm-mca] Move field 'AllowZeroMoveEliminationOnly' to class RegisterFile. NFC. Flag 'AllowZeroMoveEliminationOnly' should have been a property of the PRF, and not set at register granularity. This change also restricts move elimination to writes that update a full physical register. We assume that there is a strong correlation between logical registers that allow move elimination, and how those same registers are allocated to physical registers by the register renamer. This is still a no functional change, because this experimental code path is disabled for now. This is done in preparation for another patch that will add the ability to describe how move elimination works in scheduling models. llvm-svn: 343787	2018-10-04 15:20:56 +00:00
Simon Pilgrim	99377499b4	[X86][AVX] Add PR39161 test case for v4f64 zzww shuffle llvm-svn: 343786	2018-10-04 15:06:09 +00:00
Greg Bedwell	20883bf128	[utils] Ensure that update_mca_test_checks.py writes prefixes in alphabetical order llvm-svn: 343783	2018-10-04 14:42:19 +00:00
Greg Bedwell	698890f4d4	[utils] simple refactor in update_mca_test_checks.py to make intent more readable llvm-svn: 343782	2018-10-04 14:42:06 +00:00
Alex Bradbury	91c5da8408	[RISCV] Remove overzealous is64Bit checks lowerGlobalAddress, lowerBlockAddress, and insertIndirectBranch contain overzealous checks for is64Bit. These functions are all safe as-implemented for RV64. llvm-svn: 343781	2018-10-04 14:30:03 +00:00
David Greene	b3fe8cfb83	[X86] Set correct MMO offset on scalarized load pieces When scalarizing a load, be sure to update the offset in the MachineMemOperand for each scalar load. llvm-svn: 343776	2018-10-04 14:07:59 +00:00
Simon Pilgrim	1c14bbe8a0	[llvm-mca][x86] Add tests demonstrating ReadAfterLd delay llvm-svn: 343773	2018-10-04 13:05:42 +00:00
Fedor Sergeev	3087bc3290	[PassTimingInfo] cleanup on TimingData's Timer handling Replacing Timer* with unique_ptr<Timer> in a pass-to-timer map. That allows us to get rid of the unpretty raw deletes in the PassTimingInfo destructor. Strictly cleanup, not intended to change any visible behavior. llvm-svn: 343772	2018-10-04 12:49:57 +00:00
Guillaume Chatelet	f890b56408	[llvm-exegesis][NFC] Improve parsing of the YAML files Summary: sscanf turns out to be slow for reading floating points. Reviewers: courbet Subscribers: tschuett, llvm-commits, RKSimon Differential Revision: https://reviews.llvm.org/D52866 llvm-svn: 343771	2018-10-04 12:33:46 +00:00
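
The naming rule described in the [cmake] WinSDK symlink commit above (c26a80208b) can be sketched in a few lines. This is only an illustration of the rule as the commit message states it, not the actual CMake code from the patch; the function name `symlink_names` is hypothetical and the sketch only computes names without touching the filesystem.

```python
import os
from typing import List


def symlink_names(original: str) -> List[str]:
    """Return the symlink names to create for one Windows SDK library.

    Hypothetical helper: mirrors the rule in the commit message, not the
    real CMake implementation.
    """
    stem, ext = os.path.splitext(original)
    candidates = [
        original.lower(),    # all lowercase, e.g. ws2_32.lib
        stem + ext.lower(),  # original casing, lowercase extension
    ]
    names: List[str] = []
    for name in candidates:
        # Only create a symlink when it differs from the original name,
        # and never list the same name twice.
        if name != original and name not in names:
            names.append(name)
    return names


if __name__ == "__main__":
    for lib in ("WS2_32.Lib", "Kernel32.lib", "kernel32.lib"):
        print(lib, "->", symlink_names(lib))
```

For `WS2_32.Lib` this yields both `ws2_32.lib` and `WS2_32.lib`, while a name that is already all lowercase yields no symlinks, which matches the drive-by fix of skipping links identical to the original name.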


170020 Commits