llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	0ab6abaf21	[ARM] Regenerate reverse shuffle costs Came about while cleaning up general shuffle costs for PR39368 llvm-svn: 344966	2018-10-22 22:26:00 +00:00
Craig Topper	e7725047a0	Recommit r344877 "[X86] Stop promoting integer loads to vXi64" I've included a fix to DAGCombiner::ForwardStoreValueToDirectLoad that I believe will prevent the previous miscompile. Original commit message: Theoretically this was done to simplify the amount of isel patterns that were needed. But it also meant a substantial number of our isel patterns have to match an explicit bitcast. By making the vXi32/vXi16/vXi8 types legal for loads, DAG combiner should be able to change the load type to rem I had to add some additional plain load instruction patterns and a few other special cases, but overall the isel table has reduced in size by ~12000 bytes. So it looks like this promotion was hurting us more than helping. I still have one crash in vector-trunc.ll that I'm hoping @RKSimon can help with. It seems to relate to using getTargetConstantFromNode on a load that was shrunk due to an extract_subvector combine after the constant pool entry was created. So we end up decoding more mask elements than the lo I'm hoping this patch will simplify the number of patterns needed to remove the and/or/xor promotion. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits, RKSimon Differential Revision: https://reviews.llvm.org/D53306 llvm-svn: 344965	2018-10-22 22:14:05 +00:00
Sanjay Patel	a6149dec85	[Reassociate] add vector tests with undef elements; NFC Also, regenerate checks for these files. We should do better on the vector tests by using the PatternMatch API instead of BinaryOperator::isNot/isNeg. llvm-svn: 344964	2018-10-22 22:04:13 +00:00
Thomas Lively	e3205431fe	[WebAssembly][NFC] Remove WebAssemblyStackifier TableGen backend Summary: Replace its functionality with a TableGen InstrInfo relational instruction mapping. Although arguably more complex than the TableGen backend, the relational mapping is a smaller maintenance burden than a TableGen backend. Reviewers: aardappel, aheejin, dschuff Subscribers: mgorny, sbc100, jgravelle-google, sunfish, llvm-commits Differential Revision: https://reviews.llvm.org/D53307 llvm-svn: 344962	2018-10-22 21:55:26 +00:00
Vedant Kumar	2c21d8815f	[DWARF] Use a function-local offset for AT_call_return_pc Logs provided by @stella.stamenova indicate that on Linux, lldb adds a spurious slide offset to the return PC it loads from AT_call_return_pc attributes (see the list thread: "[PATCH] D50478: Add support for artificial tail call frames"). This patch side-steps the issue by getting rid of the load address calculation in lldb's CallEdge::GetReturnPCAddress. The idea is to have the DWARF writer emit function-local offsets to the instruction after a call. I.e. return-pc = label-after-call-insn - function-entry. LLDB can simply add this offset to the base address of a function to get the return PC. Differential Revision: https://reviews.llvm.org/D53469 llvm-svn: 344960	2018-10-22 21:44:21 +00:00
Sanjay Patel	044bfe78a6	[Reassociate] add 'using namespace' to reduce bloat; NFC llvm-svn: 344959	2018-10-22 21:37:02 +00:00
Lang Hames	998230f60c	[ORC] Guard access to the MemMgrs vector in RTDyldObjectLinkingLayer. Otherwise we can end up with a data-race when linking concurrently. This should fix an intermittent failure in the multiple-compile-threads-basic.ll testcase. llvm-svn: 344956	2018-10-22 21:17:56 +00:00
Sanjay Patel	0fd5d90244	[x86] add test for PR25498 and complete checks; NFC Might as well test the actual codegen instead of just the absence of crashing. llvm-svn: 344955	2018-10-22 21:11:15 +00:00
Tim Northover	acf2757a23	X86: add alias for pushfw/popfw in Intel mode A while ago we changed pushf and popf in Intel mode to generate pushfq and popfq. Unfortunately that left us with no way to get the 16-bit encoding in Intel mode so this patch adds pushfw and popfw as aliases there. llvm-svn: 344949	2018-10-22 20:38:13 +00:00
Justin Bogner	2439a5a3d2	Reapply "[MachineCopyPropagation] Reimplement CopyTracker in terms of register units" Recommits r342942, which was reverted in r343189, with a fix for an issue where we would propagate unsafely if we defined only the upper part of a register. Original message: Change the copy tracker to keep a single map of register units instead of 3 maps of registers. This gives a very significant compile time performance improvement to the pass. I measured a 30-40% decrease in time spent in MCP on x86 and AArch64 and much more significant improvements on out of tree targets with more registers. llvm-svn: 344942	2018-10-22 19:51:31 +00:00
Teresa Johnson	72cf5dc4f0	[hot-cold-split] Add opt remark on success Summary: Emit optimization remark on successful hot cold split. Reviewers: sebpop, hiraditya Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53512 llvm-svn: 344938	2018-10-22 19:06:42 +00:00
Simon Pilgrim	17590d5f71	Revert rL344931 from llvm/trunk: [X86][SSE] getTargetShuffleMaskIndices - allow opt-in support for whole undef shuffle mask elements We can't safely assume that certain RawMask entries are UNDEF as most variable shuffles ignore non-index bits - PSHUFB only works on i8 elts so it'd be safe to use but I'm intending to come up with an alternative approach that works for all. ........ Enable this for PSHUFB constant mask decoding and remove the ConstantPool DecodePSHUFBMask llvm-svn: 344937	2018-10-22 19:01:25 +00:00
Simon Pilgrim	ee8a57df9e	Revert rL344933 from llvm/trunk: [X86][SSE] Tidyup DecodeVPERMILPMask shuffle mask decoding We can't safely assume that certain RawMask entries are UNDEF as most variable shuffles ignore non-index bits. ........ Add support for UNDEF raw mask elements and remove the ConstantPool DecodeVPERMILPMask usage in X86ISelLowering.cpp llvm-svn: 344936	2018-10-22 18:58:32 +00:00
Aaron Ballman	f8047f57bb	Revert r344930 as it broke some of the bots on Windows. http://lab.llvm.org:8011/builders/clang-x64-windows-msvc/builds/739 llvm-svn: 344935	2018-10-22 18:51:29 +00:00
Simon Pilgrim	13eaf41964	[X86][SSE] Tidyup DecodeVPERMILPMask shuffle mask decoding Add support for UNDEF raw mask elements and remove the ConstantPool DecodeVPERMILPMask usage in X86ISelLowering.cpp llvm-svn: 344933	2018-10-22 18:35:13 +00:00
Simon Pilgrim	e6bd6b53c2	[X86][SSE] getTargetShuffleMaskIndices - allow opt-in support for whole undef shuffle mask elements Enable this for PSHUFB constant mask decoding and remove the ConstantPool DecodePSHUFBMask llvm-svn: 344931	2018-10-22 18:09:02 +00:00
Joel E. Denny	92bb73aa3f	[SourceMgr][FileCheck] Obey -color by extending WithColor While this change specifically targets FileCheck, it affects any tool using the same SourceMgr facilities. Previously, -color was documented in FileCheck's -help output, but -color had no effect. Now, -color obeys its documentation: it forces colors to be used in FileCheck diagnostics even when stderr is not a terminal. -color is especially helpful when combined with FileCheck's -v, which can produce a long series of diagnostics that you might wish to pipe to a pager, such as less -R. The WithColor extensions here will also help to clean up color usage in FileCheck's annotated dump of input, which is proposed in D52999. Reviewed By: JDevlieghere Differential Revision: https://reviews.llvm.org/D53419 llvm-svn: 344930	2018-10-22 18:00:49 +00:00
Teresa Johnson	20fd200386	[hot-cold-split] Add missing FileCheck invocations Summary: r344558 added some CHECK statements to split-cold-2.ll, but didn't add any invocations of FileCheck. Add those here. Reviewers: sebpop Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53505 llvm-svn: 344928	2018-10-22 17:57:02 +00:00
Fangrui Song	c853a5dbc9	[llvm-exegesis] Fix name lookup ambiguity in MSVC after 344922 llvm-svn: 344927	2018-10-22 17:52:31 +00:00
Simon Pilgrim	a46c40b9ec	[X86] getTargetConstantBitsFromNode - handle extraction from larger constant pool entries First step towards removing X86ShuffleDecodeConstantPool usage from X86ISelLowering.cpp llvm-svn: 344924	2018-10-22 17:43:33 +00:00
Fangrui Song	edf37d23b8	[llvm-exegesis] Move namespace exegesis inside llvm:: Summary: This allows simplifying references of llvm::foo with foo when the needs come in the future. Reviewers: courbet, gchatelet Reviewed By: gchatelet Subscribers: javed.absar, tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D53455 llvm-svn: 344922	2018-10-22 17:10:47 +00:00
Craig Topper	6f08b710ee	Revert r344877 "[X86] Stop promoting integer loads to vXi64" Sam McCall reported miscompiles in some tensorflow code. Reverting while I try to figure out. llvm-svn: 344921	2018-10-22 16:59:24 +00:00
Vedant Kumar	2347bcfab6	[test] Relax test/Other/opt-hot-cold-split.ll On some ARM bots, 'Target Pass Configuration' does not run after 'Target Transform Info'. Relax this pipeline test to allow that. This is the same fix as in r328167. Bot URL: http://lab.llvm.org:8011/builders/clang-cmake-armv7-quick/builds/4611 llvm-svn: 344919	2018-10-22 16:50:24 +00:00
Andrea Di Biagio	aea440496b	[llvm-mca] Remove a couple of using directives and a bunch of redundant namespace llvm prefixes. NFC llvm-svn: 344916	2018-10-22 16:28:07 +00:00
Matt Arsenault	bc7384fc93	DAG: Change behavior of fminnum/fmaxnum nodes Introduce new versions that follow the IEEE semantics to help with legalization that may need quieted inputs. There are some regressions from inserting unnecessary canonicalizes when these are matched from fast math fcmp + select which should be fixed in a future commit. llvm-svn: 344914	2018-10-22 16:27:27 +00:00
Zachary Turner	66054f015a	Some cleanups to the native pdb plugin [NFC]. This is mostly some cleanup done in the process of implementing some basic support for types. I tried to split up the patch a bit to get some of the NFC portion of the patch out into a separate commit, and this is the result of that. It moves some code around, deletes some spurious namespace qualifications, removes some unnecessary header includes, forward declarations, etc. llvm-svn: 344913	2018-10-22 16:19:07 +00:00
Andrea Di Biagio	05decf8cc3	[llvm-mca] Use llvm::ArrayRef in class SourceMgr. NFCI Class SourceMgr now uses type ArrayRef<MCInst> to reference the sequence of code from a "CodeRegion". llvm-svn: 344911	2018-10-22 15:36:15 +00:00
Simon Pilgrim	02793818d2	[X86][SSE] getTargetShuffleMask - pull out repeated shuffle mask element size. NFCI. llvm-svn: 344910	2018-10-22 15:33:30 +00:00
Aleksandr Urakov	ca7f3c8dc7	Revert "[PDB] Extend IPDBSession's interface to retrieve frame data" This reverts commit b5c7e2f9a4dbb34e3667c4bb4972735eadd3247a. llvm-svn: 344909	2018-10-22 15:30:48 +00:00
Sanjay Patel	30decc46fa	[InstCombine] add tests for shuffle+insert folds; NFC llvm-svn: 344908	2018-10-22 15:26:27 +00:00
Guillaume Chatelet	879f293273	[llvm-exegesis] Crash when assembling invalid Operand llvm-svn: 344907	2018-10-22 15:06:10 +00:00
Guillaume Chatelet	2d70c5ba49	[llvm-exegesis] Mark x86 segment register instructions as unsupported. Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D53499 llvm-svn: 344906	2018-10-22 14:55:43 +00:00
Guillaume Chatelet	dc38d0d574	[llvm-exegesis] Reject x86 instructions that use non uniform memory accesses Reviewers: courbet Subscribers: tschuett, llvm-commits Differential Revision: https://reviews.llvm.org/D53438 llvm-svn: 344905	2018-10-22 14:46:08 +00:00
Roman Lebedev	940854b6ef	[X86] X86DAGToDAGISel: handle BZHI selection too, not just BEXTR. Summary: As discussed in D52304 / IRC, we now have pattern matching for 'bit extract' in two places - tablegen and `X86DAGToDAGISel`. There are 4 patterns. And we will have a problem with `x & (-1 >> (32 - y))` pattern. * If the mask is one-use, then it is always unfolded into `x << (32 - y) >> (32 - y)` first. Thus, the existing test coverage is already broken. * If it is not one-use, then it is not unfolded, and is matched as BZHI. * If it is not one-use, we will not match it as BEXTR. And if it is one-use, it will have been unfolded already. So we will either not handle that pattern for BEXTR, or not have test coverage for it. This is bad. As discussed with @craig.topper, let's unify this matching, and do everything in `X86DAGToDAGISel`. Then we will not have code duplication, and will have proper test coverage. This indeed does not affect any tests, and this is great. It means that for these two patterns, the `X86DAGToDAGISel` is identical to the tablegen version. Please review carefully, i'm not fully sure about that intrinsic change, and introduction of the new `X86ISD` opcode. Reviewers: craig.topper, RKSimon, spatel Reviewed By: craig.topper Subscribers: llvm-commits, craig.topper Differential Revision: https://reviews.llvm.org/D53164 llvm-svn: 344904	2018-10-22 14:12:44 +00:00
David Greene	b00650ec4a	Document bisect-skip-count Provide an example of how to use bisect-skip count to find bugs. Differential revision: https://reviews.llvm.org/D52314 llvm-svn: 344903	2018-10-22 14:04:13 +00:00
Roman Lebedev	377e16cc77	[X86][BMI1]: X86DAGToDAGISel: select BEXTR from x & ((1 << nbits) + (-1)) pattern Summary: Trivial continuation of D52304. While this pattern is not canonical, we do select it in the BZHI case, so this should not be any different. Reviewers: RKSimon, craig.topper, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D52348 llvm-svn: 344902	2018-10-22 13:54:17 +00:00
Petar Avramovic	182b1bf4c9	Test commit: change comment. llvm-svn: 344900	2018-10-22 13:27:50 +00:00
George Rimar	2048c73555	[llvm-dwarfdump] - Fix win10 build bot failture. Bot failed: http://lab.llvm.org:8011/builders/llvm-clang-lld-x86_64-scei-ps4-windows10pro-fast/builds/20877/steps/test/logs/stdio This was broken after the r344895 "[llvm-dwarfdump] - Add the support of parsing .debug_loclists." because of wrong formatting specifiers used. llvm-svn: 344896	2018-10-22 12:18:30 +00:00
George Rimar	4392ace2e7	[llvm-dwarfdump] - Add the support of parsing .debug_loclists. This teaches llvm-dwarfdump to dump the content of .debug_loclists sections. It converts the DWARFDebugLocDWO class to DWARFDebugLoclists, teaches llvm-dwarfdump about .debug_loclists section and adds the implementation for parsing the DW_LLE_offset_pair entries. Differential revision: https://reviews.llvm.org/D53364 llvm-svn: 344895	2018-10-22 11:30:54 +00:00
Nemanja Ivanovic	12522d8b68	[PowerPC][NFC] Fix bugs in r+r to r+i conversion The D-Form VSX loads introduced in ISA 3.0 are not direct D-Form equivalent of the corresponding X-Forms since they only target the Altivec registers. Namely LXSSPX can load into any of the 64 VSX registers whereas LXSSP can only load into the upper 32 VSX registers. Similarly with the remaining affected instructions. There is currently no way that I can see to trigger the bug, but as we add other ways of exploiting these instructions, there may very well be instances that do. This is an NFC patch in practical terms since the changes it introduces can not be triggered without an MIR test. Differential revision: https://reviews.llvm.org/D53323 llvm-svn: 344894	2018-10-22 11:22:59 +00:00
Benjamin Kramer	5f5115e7e2	[CGProfile] Turn constant-size SmallVector into array No functionality change. llvm-svn: 344893	2018-10-22 10:51:34 +00:00
Aleksandr Urakov	3d2b91651d	[PDB] Extend IPDBSession's interface to retrieve frame data Summary: This patch just extends the `IPDBSession` interface to allow retrieving of frame data through it, and adds an implementation over DIA. It is needed for an implementation (for now with DIA) of the conversion from FPO programs to DWARF expressions mentioned in D53086. Reviewers: zturner, asmith, rnk Reviewed By: asmith Subscribers: mgorny, aprantl, JDevlieghere, llvm-commits Differential Revision: https://reviews.llvm.org/D53324 llvm-svn: 344886	2018-10-22 07:18:08 +00:00
Craig Topper	372520a72f	[X86] Add patterns for vector and/or/xor/andn with other types than vXi64. This makes fast isel treat all legal vector types the same way. Previously only vXi64 was in the fast-isel tables. This unfortunately prevents matching of andn by fast-isel for these types since the requires SelectionDAG. But we already had this issue for vXi64. So at least we're consistent now. Interestinly it looks like fast-isel can't handle instructions with constant vector arguments so the the not part of the andn patterns is selected with SelectionDAG. This explains why VPTERNLOG shows up in some of the tests. This is a subset of D53268. As I make progress on that, I will try to reduce the number of lines in the tablegen files. llvm-svn: 344884	2018-10-22 06:30:22 +00:00
Dorit Nuzman	f4cfce2619	[IAI,LV] Avoid creating a scalar epilogue due to gaps in interleave-groups when optimizing for size LV is careful to respect -Os and not to create a scalar epilog in all cases (runtime tests, trip-counts that require a remainder loop) except for peeling due to gaps in interleave-groups. This patch fixes that; -Os will now have us invalidate such interleave-groups and vectorize without an epilog. The patch also removes a related FIXME comment that is now obsolete, and was also inaccurate: "FIXME: return None if loop requiresScalarEpilog(<MaxVF>), or look for a smaller MaxVF that does not require a scalar epilog." (requiresScalarEpilog() has nothing to do with VF). Reviewers: Ayal, hsaito, dcaballe, fhahn Reviewed By: Ayal Differential Revision: https://reviews.llvm.org/D53420 llvm-svn: 344883	2018-10-22 06:17:09 +00:00
Craig Topper	11f01950b1	[X86] Stop promoting integer loads to vXi64 Summary: Theoretically this was done to simplify the amount of isel patterns that were needed. But it also meant a substantial number of our isel patterns have to match an explicit bitcast. By making the vXi32/vXi16/vXi8 types legal for loads, DAG combiner should be able to change the load type to remove the bitcast. I had to add some additional plain load instruction patterns and a few other special cases, but overall the isel table has reduced in size by ~12000 bytes. So it looks like this promotion was hurting us more than helping. I still have one crash in vector-trunc.ll that I'm hoping @RKSimon can help with. It seems to relate to using getTargetConstantFromNode on a load that was shrunk due to an extract_subvector combine after the constant pool entry was created. So we end up decoding more mask elements than the load size. I'm hoping this patch will simplify the number of patterns needed to remove the and/or/xor promotion. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits, RKSimon Differential Revision: https://reviews.llvm.org/D53306 llvm-svn: 344877	2018-10-21 21:30:26 +00:00
Craig Topper	671b85eab5	Revert r344873 "foo" Rebase gone wrong left this in my tree. llvm-svn: 344875	2018-10-21 21:08:37 +00:00
Craig Topper	469f8dae45	[X86] Remove SDIVREM8_SEXT_HREG/UDIVREM8_ZEXT_HREG and their associated DAG combine and target bits support. Use a post isel peephole instead. Summary: These nodes exist to overcome an isel problem where we can generate a zero extend of an AH register followed by an extract subreg, and another zero extend. The first zero extend exists to avoid a partial register update copying the AH register into the low 8-bits. The second zero extend exists if the user wanted the remainder zero extended. To make this work we had a DAG combine to morph the DIVREM opcode to a special opcode that included the extend. But then we had to add the new node to computeKnownBits and computeNumSignBits to process the extension portion. This patch instead removes all of that and adds a late peephole to detect the two extends. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D53449 llvm-svn: 344874	2018-10-21 21:07:27 +00:00
Craig Topper	abdcc12a9b	foo llvm-svn: 344873	2018-10-21 21:07:25 +00:00
Sanjay Patel	be7f6a09b0	[DAGCombiner] reduce insert+bitcast+extract vector ops to truncate (PR39016) This is a late backend subset of the IR transform added with: D52439 We can confirm that the conversion to a 'trunc' is correct by running: $ opt -instcombine -data-layout="e" (assuming the IR transforms are correct; change "e" to "E" for big-endian) As discussed in PR39016: https://bugs.llvm.org/show_bug.cgi?id=39016 ...the pattern may emerge during legalization, so that's we are waiting for an insertelement to become a scalar_to_vector in the pattern matching here. The DAG allows for fun variations that are not possible in IR. Result types for extracts and scalar_to_vector don't necessarily match input types, so that means we have to be a bit more careful in the transform (see code comments). The tests show that we don't handle cases that require a shift (as we did in the IR version). I've left that as a potential follow-up because I'm not sure if that's a real concern at this late stage. Differential Revision: https://reviews.llvm.org/D53201 llvm-svn: 344872	2018-10-21 20:13:29 +00:00
Aditya Kumar	8249a7a474	Schedule Hot Cold Splitting pass after most optimization passes Summary: In the new+old pass manager, hot cold splitting was schedule too early. Thanks to Vedant for pointing this out. Reviewers: sebpop, vsk Reviewed By: sebpop, vsk Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D53437 llvm-svn: 344869	2018-10-21 18:11:56 +00:00

1 2 3 4 5 ...

170691 Commits