llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Reid Kleckner	bdc1c0106b	Move endian constant from Host.h to SwapByteOrder.h, prune include Works on this dependency chain: ArrayRef.h -> Hashing.h -> --CUT-- Host.h -> StringMap.h / StringRef.h ArrayRef is very popular, but Host.h is rarely needed. Move the IsBigEndianHost constant to SwapByteOrder.h. Clients of that header are more likely to need it. llvm-svn: 375316	2019-10-19 00:48:11 +00:00
Reid Kleckner	02a07ff3cc	Prune two MachineInstr.h includes, fix up deps MachineInstr.h included AliasAnalysis.h, which includes a world of IR constructs mostly unneeded in CodeGen. Prune it. Same for DebugInfoMetadata.h. Noticed with -ftime-trace. llvm-svn: 375311	2019-10-19 00:22:07 +00:00
Matt Arsenault	8e7a1a8142	LiveIntervals: Fix handleMoveUp with subreg def moving across a def If a subregister def was moved across another subregister def and another use, the main range was not correctly updated. The end point of the moved interval ended too early and missed the use from the other lanes in the subreg def. llvm-svn: 375300	2019-10-18 23:24:25 +00:00
Peter Collingbourne	8bc0d42879	gn build: Build compiler-rt code with -fvisibility=hidden. This matches the CMake build. Differential Revision: https://reviews.llvm.org/D69202 llvm-svn: 375299	2019-10-18 22:52:17 +00:00
Stanislav Mekhanoshin	18380a59a6	[AMDGPU] move PHI nodes to AGPR class If all uses of a PHI are in AGPR register class we should avoid unneeded copies via VGPRs. Differential Revision: https://reviews.llvm.org/D69200 llvm-svn: 375297	2019-10-18 22:48:45 +00:00
Wei Mi	7cc7328f4b	[SampleFDO] Add profile remapping support for profile on-demand loading used by ExtBinary format profile Profile on-demand loading was added for ExtBinary format profile in rL374233, but currently profile on-demand loading doesn't work well with profile remapping. The patch adds the support. Suppose a function in the current module has outline instance in the profile. The function name in the module is different from the name of the outline instance, but remapper knows the two names are equal. When loading profile on-demand, the outline instance has to be loaded with remapper's help. At the same time SampleProfileReaderItaniumRemapper is changed from a proxy of SampleProfileReader to a helper member in SampleProfileReader. Differential Revision: https://reviews.llvm.org/D68901 llvm-svn: 375295	2019-10-18 22:35:20 +00:00
Jay Foad	6298616030	[AMDGPU] Remove -amdgpu-spill-sgpr-to-smem. Summary: The implementation was never completed and never used except in tests. Reviewers: arsenm, mareko Subscribers: qcolombet, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69163 llvm-svn: 375293	2019-10-18 21:48:22 +00:00
Roman Lebedev	46e0f8f422	[CVP] setDeducedOverflowingFlags(): actually inc per-opcode stats This is really embarrassing. Those are pointers, so that offsets the pointers, not the statistics pointed-by the pointer... llvm-svn: 375290	2019-10-18 21:19:26 +00:00
GN Sync Bot	37bfd16fca	gn build: Merge r375288 llvm-svn: 375289	2019-10-18 21:11:20 +00:00
Vedant Kumar	09341bd5f5	Disable exit-on-SIGPIPE in lldb Occasionally, during test teardown, LLDB writes to a closed pipe. Sometimes the communication is inherently unreliable, so LLDB tries to avoid being killed due to SIGPIPE (it calls `signal(SIGPIPE, SIG_IGN)`). However, LLVM's default SIGPIPE behavior overrides LLDB's, causing it to exit with IO_ERR. Opt LLDB out of the default SIGPIPE behavior. I expect that this will resolve some LLDB test suite flakiness (tests randomly failing with IO_ERR) that we've seen since r344372. rdar://55750240 Differential Revision: https://reviews.llvm.org/D69148 llvm-svn: 375288	2019-10-18 21:05:30 +00:00
Reid Kleckner	e262abe247	[X86] Fix register parsing in .seh_* in Intel syntax Previously, the parser checked for a '%' prefix to indicate a register. In Intel syntax mode, LLVM does not print a '%' prefix on registers, so LLVM could not parse its own assembly output. Instead, require that register numbers be integer literals, or at least start with an integer literal, which is consistent with .cfi_* directive register parsing. llvm-svn: 375287	2019-10-18 21:01:41 +00:00
Roman Lebedev	a02ae49a6b	[NFC][CVP] Some tests for `mul` no-wrap deduction llvm-svn: 375285	2019-10-18 20:36:19 +00:00
Thomas Lively	2769a48f27	[WebAssembly] Allow multivalue signatures in object files Summary: Also changes the wasm YAML format to reflect the possibility of having multiple return types and to put the returns after the params for consistency with the binary encoding. Reviewers: aheejin, sbc100 Subscribers: dschuff, jgravelle-google, hiraditya, sunfish, arphaman, rupprecht, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69156 llvm-svn: 375283	2019-10-18 20:27:30 +00:00
Quentin Colombet	cf9f28ca10	[GISel][CallLowering] Make isIncomingArgumentHandler a pure virtual method The default implementation of isIncomingArgumentHandler could lead to generating incorrect code. Make it a pure virtual method, so that targets know they have to override it to produce correct code. NFC Differential Revision: https://reviews.llvm.org/D69187 llvm-svn: 375277	2019-10-18 20:13:42 +00:00
Roman Lebedev	8217a0ec45	[CVP] After proving that @llvm.with.overflow()/@llvm.sat() don't overflow, also try to prove other no-wrap Summary: CVP, unlike InstCombine, does not run till exhaustion. It only does a single pass. When dealing with those special binops, if we prove that they can safely be demoted into their usual binop form, we do set the no-wrap we deduced. But when dealing with usual binops, we try to deduce both no-wraps. So if we convert e.g. @llvm.uadd.with.overflow() to `add nuw`, we won't attempt to check whether it can be `add nuw nsw`. This patch proposes to call `processBinOp()` on newly-created binop, which is identical to what we do for div/rem already. Reviewers: nikic, spatel, reames Reviewed By: nikic Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69183 llvm-svn: 375273	2019-10-18 19:32:47 +00:00
Lang Hames	cf5bfe3c8f	[examples] Fix some comments in the LLJITWithJITLink example llvm-svn: 375269	2019-10-18 18:35:02 +00:00
Matt Arsenault	c1329a6c94	AMDGPU: Relax 32-bit SGPR register class Mostly use SReg_32 instead of SReg_32_XM0 for arbitrary values. This will allow the register coalescer to do a better job eliminating copies to m0. For GlobalISel, as a terrible hack, use SGPR_32 for things that should use SCC until booleans are solved. llvm-svn: 375267	2019-10-18 18:26:37 +00:00
Lang Hames	058bc378c4	[examples] Add an example of how to use JITLink and small-code-model with LLJIT. JITLink is LLVM's newer jit-linker. It is an alternative to (and hopefully eventually a replacement for) LLVM's older jit-linker, RuntimeDyld. Unlike RuntimeDyld which requires JIT'd code to be compiled with the large code model, JITLink can link code compiled with the small code model, which is the native code model for a number of targets (including all supported MachO targets). This example shows how to: -- Create a JITLink InProcessMemoryManager -- Set the code model to small -- Use a JITLink backed ObjectLinkingLayer as the linking layer for LLJIT (rather than the default RTDyldObjectLinkingLayer). Note: This example will only work on platforms supported by JITLink. As of this commit that's MachO/x86-64 and MachO/arm64. llvm-svn: 375266	2019-10-18 18:25:15 +00:00
Austin Kerbow	c608812ce8	AMDGPU: Fix SMEM WAR hazard for gfx10 readlane Summary: Hazard recognizer fails to see hazard with V_READLANE_B32_gfx10. Reviewers: rampitec Reviewed By: rampitec Subscribers: arsenm, kzhuravl, jvesely, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69172 llvm-svn: 375265	2019-10-18 18:20:30 +00:00
Julian Lettner	227838e7cc	[lit] Reduce value of synthesized timeouts Large timeout values (one year, positive infinity) trip up Python on Windows with "OverflowError: timeout value is too large". One week seems to work and is still large enough in practice. Thanks to Simon Pilgrim for helping me test this. https://reviews.llvm.org/rL375171 llvm-svn: 375264	2019-10-18 17:59:46 +00:00
Julian Lettner	dbaacebca7	[lit] Remove unnecessary tracking of test_index llvm-svn: 375263	2019-10-18 17:31:48 +00:00
Julian Lettner	e55060e8c3	[lit] Only send back test result from worker process Avoid sending back the whole run.Test object (which needs to be pickled) from the worker process when we are only interested in the test result. llvm-svn: 375262	2019-10-18 17:31:45 +00:00
Roman Lebedev	e39305ee5f	[Codegen] Link MIRParser into CodeGenTests to fix MachineSizeOptsTest building llvm-svn: 375261	2019-10-18 17:18:21 +00:00
Roman Lebedev	bb21aab54b	[NFC][CVP] Add @llvm.*.sat tests where we could prove both no-overflows llvm-svn: 375260	2019-10-18 17:18:12 +00:00
GN Sync Bot	0ba2cc2656	gn build: Merge r375254 llvm-svn: 375256	2019-10-18 16:52:12 +00:00
Hiroshi Yamauchi	bdc08cd08e	[PGO][PGSO] SizeOpts changes. Summary: (Split off of D67120) SizeOpts/MachineSizeOpts changes for profile guided size optimization. Reviewers: davidxl Subscribers: mgorny, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69070 llvm-svn: 375254	2019-10-18 16:46:01 +00:00
Simon Pilgrim	6e4fe49961	[X86] combineX86ShufflesRecursively - pull out isTargetShuffleVariableMask. NFCI. llvm-svn: 375253	2019-10-18 16:39:01 +00:00
Jay Foad	496ec33cdb	[IR] Reimplement FPMathOperator::classof as a whitelist. Summary: This makes it much easier to verify that the implementation matches the documentation. It uncovered a bug in the unit tests where we were accidentally setting fast math flags on a load instruction. Reviewers: spatel, wristow, arsenm, hfinkel, aemerson, efriedma, cameron.mcinally, mcberg2017, jmolloy Subscribers: wdng, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69176 llvm-svn: 375252	2019-10-18 16:16:36 +00:00
Jay Foad	ee4f590a23	Update docs for fast-math flags. This adds fneg, phi and select to the list of operations that may use fast-math flags. llvm-svn: 375250	2019-10-18 16:07:09 +00:00
Joseph Tremoulet	87bd66b7f7	Update MinidumpYAML to use minidump::Exception for exception stream Reviewers: labath, jhenderson, clayborg, MaskRay, grimar Reviewed By: grimar Subscribers: lldb-commits, grimar, MaskRay, hiraditya, llvm-commits Tags: #llvm, #lldb Differential Revision: https://reviews.llvm.org/D68657 llvm-svn: 375242	2019-10-18 14:56:19 +00:00
Dmitry Preobrazhensky	3ae7d38102	[AMDGPU][MC][GFX10] Added sdwa/dpp versions of v_cndmask_b32 See https://bugs.llvm.org/show_bug.cgi?id=43608 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D69096 llvm-svn: 375241	2019-10-18 14:49:53 +00:00
James Molloy	703b48d1bc	[DFAPacketizer] Fix large compile-time regression for VLIW targets D68992 / rL375086 refactored the packetizer and removed a bunch of logic. Unfortunately it creates an Automaton object whenever a DFAPacketizer is required. These objects have no longevity, and in particular on a debug build the population of the Automaton's transition map from the underlying table is very slow (because it is called ~10 times per MachineFunction, in the testcase I'm looking at). This patch changes Automaton to wrap its underlying constant data in std::shared_ptr, which allows trivial copy construction. The DFAPacketizer creation function now creates a static archetypical Automaton and copies that whenever a new DFAPacketizer is required. This takes a testcase down from ~20s to ~0.5s in debug mode. llvm-svn: 375240	2019-10-18 14:48:35 +00:00
Joseph Tremoulet	db01c87a41	Add ExceptionStream to llvm::Object::minidump Summary: This will allow updating MinidumpYAML and LLDB to use this common definition. Reviewers: labath, jhenderson, clayborg Reviewed By: labath Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68656 llvm-svn: 375239	2019-10-18 14:43:15 +00:00
Eugene Leviant	1a39c6003e	One more attempt to fix PS4 buildbot after r375219 PS4 buildbot seems to be dropping variable names for some reason llvm-svn: 375237	2019-10-18 14:11:19 +00:00
Eugene Leviant	9c01211772	Attempt to fix PS4 buildbot after r375219 llvm-svn: 375235	2019-10-18 13:52:51 +00:00
Nemanja Ivanovic	e2528e41b4	Revert r375152 as it is causing failures on EXPENSIVE_CHECKS bot llvm-svn: 375233	2019-10-18 13:38:46 +00:00
Victor Campos	e5f0157111	[SCEV] Removing deprecated comment in ScalarEvolutionExpander Removing a comment in the ScalarEvolutionExpander.cpp file that was about the class SCEVSDivExpr, which has been long gone from LLVM. llvm-svn: 375232	2019-10-18 13:33:45 +00:00
Dmitry Preobrazhensky	6bbb7124f8	[AMDGPU][MC][GFX9] Corrected parsing of v_cndmask_b32_sdwa See https://bugs.llvm.org/show_bug.cgi?id=43607 Reviewers: arsenm, rampitec Differential Revision: https://reviews.llvm.org/D69095 llvm-svn: 375231	2019-10-18 13:31:53 +00:00
Roman Lebedev	96193db5d0	[NFC][CVP] Count all the no-wraps we proved Summary: It looks like this is the only missing statistic in the CVP pass. Since we prove NSW and NUW separately i'd think we should count them separately too. Reviewers: nikic, spatel, reames Reviewed By: spatel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68740 llvm-svn: 375230	2019-10-18 13:20:16 +00:00
Victor Campos	3dd3d16f4d	[AArch64] Adding support for PMMIR_EL1 register Summary: The PMMIR_EL1 register is present in Armv8.4 with PMU extension. This patch adds support for it. Reviewers: t.p.northover, dnsampaio Reviewed By: dnsampaio Subscribers: kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D68940 llvm-svn: 375228	2019-10-18 12:40:29 +00:00
Graham Hunter	3a4ae7a786	[AArch64][SVE] Add SPLAT_VECTOR ISD Node Adds a new ISD node to replicate a scalar value across all elements of a vector. This is needed for scalable vectors, since BUILD_VECTOR cannot be used. Fixes up default type legalization for scalable vectors after the new MVT type ranges were introduced. At present I only use this node for scalable vectors. A DAGCombine has been added to transform a BUILD_VECTOR into a SPLAT_VECTOR if all elements are the same, but only if the default operation action of Expand has been overridden by the target. I've only added result promotion legalization for scalable vector i8/i16/i32/i64 types in AArch64 for now. Reviewers: t.p.northover, javed.absar, greened, cameron.mcinally, jmolloy Reviewed By: jmolloy Differential Revision: https://reviews.llvm.org/D47775 llvm-svn: 375222	2019-10-18 11:48:35 +00:00
Eugene Leviant	7787d23a97	[ThinLTOCodeGenerator] Add support for index-based WPD Differential revision: https://reviews.llvm.org/D68950 llvm-svn: 375219	2019-10-18 10:54:14 +00:00
David Green	6baac0ec7f	[AArch64] Don't combine callee-save and local stack adjustment when optimizing for size For arm64, D18619 introduced the ability to combine bumping the stack pointer upfront in case it needs to be bumped for both the callee-save area as well as the local stack area. That diff already remarks that "This change can cause an increase in instructions", but argues that even when that happens, it should still be a performance benefit because the number of micro-ops is reduced. We have observed that this code-size increase can be significant in practice. This diff disables combining stack bumping for methods that are marked as optimize-for-size. Example of a prologue with the behavior before this diff (combining stack bumping when possible): sub sp, sp, #0x40 stp d9, d8, [sp, #0x10] stp x20, x19, [sp, #0x20] stp x29, x30, [sp, #0x30] add x29, sp, #0x30 [... compute x8 somehow ...] stp x0, x8, [sp] And after this diff, if the method is marked as optimize-for-size: stp d9, d8, [sp, #-0x30]! stp x20, x19, [sp, #0x10] stp x29, x30, [sp, #0x20] add x29, sp, #0x20 [... compute x8 somehow ...] stp x0, x8, [sp, #-0x10]! Note that without combining the stack bump there are two auto-decrements, nicely folded into the stp instructions, whereas otherwise there is a single sub sp, ... instruction, but not folded. Patch by Nikolai Tillmann! Differential Revision: https://reviews.llvm.org/D68530 llvm-svn: 375217	2019-10-18 10:35:46 +00:00
Simon Pilgrim	6bc5258a4c	[X86] Regenerate memcmp tests and add X64-AVX512 common prefix Should help make the changes in D69157 clearer llvm-svn: 375215	2019-10-18 09:59:51 +00:00
Simon Pilgrim	ea9ed56729	Fix MSVC "not all control paths return a value" warning. NFCI. llvm-svn: 375214	2019-10-18 09:59:40 +00:00
Simon Pilgrim	5583967ef2	Fix MSVC "result of 32-bit shift implicitly converted to 64 bits" warnings. NFCI. llvm-svn: 375213	2019-10-18 09:59:31 +00:00
David Green	e2e0a9d886	[Codegen] Alter the default promotion for saturating adds and subs The default promotion for the add_sat/sub_sat nodes currently does: ANY_EXTEND iN to iM SHL by M-N [US][ADD\|SUB]SAT L/ASHR by M-N If the promoted add_sat or sub_sat node is not legal, this can produce code that effectively does a lot of shifting (and requiring large constants to be materialised) just to use the overflow flag. It is simpler to just do the saturation manually, using the higher bitwidth addition and a min/max against the saturating bounds. That is what this patch attempts to do. Differential Revision: https://reviews.llvm.org/D68926 llvm-svn: 375211	2019-10-18 09:47:48 +00:00
Kerry McLaughlin	1b1e3237c0	[AArch64][SVE] Implement unpack intrinsics Summary: Implements the following intrinsics: - int_aarch64_sve_sunpkhi - int_aarch64_sve_sunpklo - int_aarch64_sve_uunpkhi - int_aarch64_sve_uunpklo This patch also adds AArch64ISD nodes for UNPK instead of implementing the intrinsics directly, as they are required for a future patch which implements the sign/zero extension of legal vectors. This patch includes tests for the Subdivide2Argument type added by D67549 Reviewers: sdesmalen, SjoerdMeijer, greened, rengolin, rovka Reviewed By: greened Subscribers: tschuett, kristof.beyls, rkruppe, psnobl, cfe-commits, llvm-commits Differential Revision: https://reviews.llvm.org/D67550 llvm-svn: 375210	2019-10-18 09:40:16 +00:00
Bjorn Pettersson	76ef08355b	[InstCombine] Fix miscompile bug in canEvaluateShuffled Summary: Add restrictions in canEvaluateShuffled to prevent that we for example transform %0 = insertelement <2 x i16> undef, i16 %a, i32 0 %1 = srem <2 x i16> %0, <i16 2, i16 1> %2 = shufflevector <2 x i16> %1, <2 x i16> undef, <2 x i32> <i32 undef, i32 0> into %1 = insertelement <2 x i16> undef, i16 %a, i32 1 %2 = srem <2 x i16> %1, <i16 undef, i16 2> as having an undef denominator makes the srem undefined (for all vector elements). Fixes: https://bugs.llvm.org/show_bug.cgi?id=43689 Reviewers: spatel, lebedev.ri Reviewed By: spatel, lebedev.ri Subscribers: lebedev.ri, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D69038 llvm-svn: 375208	2019-10-18 07:42:02 +00:00
Bjorn Pettersson	79aab57cf6	[InstCombine] Pre-commit of test case showing miscompile bug in canEvaluateShuffled Adding the reproducer from https://bugs.llvm.org/show_bug.cgi?id=43689, showing that instcombine is doing a bad transform. It transforms %0 = insertelement <2 x i16> undef, i16 %a, i32 0 %1 = srem <2 x i16> %0, <i16 2, i16 1> %2 = shufflevector <2 x i16> %1, <2 x i16> undef, <2 x i32> <i32 undef, i32 0> into %1 = insertelement <2 x i16> undef, i16 %a, i32 1 %2 = srem <2 x i16> %1, <i16 undef, i16 2> The undef denominator makes the whole srem undefined. llvm-svn: 375207	2019-10-18 07:41:53 +00:00
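
Commit 46e0f8f422 in the log above describes a bug where incrementing a pointer to a statistic moved the pointer instead of bumping the statistic. The sketch below illustrates that class of mistake only; it is not the CVP code. `OverflowStats`, `bumpBuggy`, and `bumpFixed` are hypothetical names invented for the example.

```cpp
#include <cstdint>

// Hypothetical stand-ins for per-opcode statistics counters.
struct OverflowStats {
  std::uint64_t NumAddNW = 0;
  std::uint64_t NumSubNW = 0;
};

// Buggy shape: `++Counter` advances the local pointer copy. The counter it
// pointed at is left untouched, so the statistic never changes.
inline void bumpBuggy(std::uint64_t *Counter) {
  ++Counter;
  (void)Counter; // silence unused-value warnings; nothing was counted
}

// Fixed shape: dereference first, then increment the pointed-to value.
inline void bumpFixed(std::uint64_t *Counter) { ++*Counter; }
```

Both functions compile without diagnostics on most compilers, which is why this kind of slip tends to surface only when someone notices the counters staying at zero.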

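Commit e2e0a9d886 in the log above replaces the shift-based promotion of saturating adds and subs with a wider addition followed by a min/max against the saturating bounds. A minimal scalar sketch of that idea, assuming 8-bit operands promoted to 32 bits, might look like this. The function names `saddSat8` and `uaddSat8` are hypothetical, and this is ordinary C++ arithmetic, not the SelectionDAG legalization code.

```cpp
#include <algorithm>
#include <cstdint>

// Signed saturating add of two 8-bit values, computed in a wider type.
inline std::int8_t saddSat8(std::int8_t A, std::int8_t B) {
  // The 32-bit sum of two 8-bit values cannot overflow.
  std::int32_t Wide = std::int32_t{A} + std::int32_t{B};
  // Clamp against the bounds of the narrow type with max and min.
  Wide = std::max<std::int32_t>(Wide, INT8_MIN);
  Wide = std::min<std::int32_t>(Wide, INT8_MAX);
  return static_cast<std::int8_t>(Wide);
}

// Unsigned saturating add: only the upper bound needs a clamp.
inline std::uint8_t uaddSat8(std::uint8_t A, std::uint8_t B) {
  std::uint32_t Wide = std::uint32_t{A} + std::uint32_t{B};
  return static_cast<std::uint8_t>(std::min<std::uint32_t>(Wide, UINT8_MAX));
}
```

No shifts or large constants are needed here: the bounds are small immediates, which is the simplification the commit message argues for.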

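Commit 703b48d1bc in the log above speeds up DFAPacketizer creation by keeping the Automaton's constant data behind a `std::shared_ptr` and copying a static archetype. The sketch below shows that general pattern under simplifying assumptions. `MiniAutomaton`, `TransitionTable`, and `makeAutomaton` are hypothetical names, and the real LLVM classes differ in detail.

```cpp
#include <map>
#include <memory>
#include <utility>

// Hypothetical transition table: (state, symbol) -> next state.
using TransitionTable = std::map<std::pair<int, int>, int>;

// Miniature automaton whose table is expensive to build but cheap to share.
class MiniAutomaton {
  std::shared_ptr<const TransitionTable> Transitions; // shared, immutable
  int State = 0;                                      // per-copy mutable state

public:
  explicit MiniAutomaton(TransitionTable T)
      : Transitions(std::make_shared<TransitionTable>(std::move(T))) {}

  // Follow one transition; returns false if none exists for this symbol.
  bool step(int Symbol) {
    auto It = Transitions->find({State, Symbol});
    if (It == Transitions->end())
      return false;
    State = It->second;
    return true;
  }

  int state() const { return State; }
  long sharers() const { return Transitions.use_count(); }
};

// Build the table once, then hand out copies that share it.
inline MiniAutomaton makeAutomaton() {
  static const MiniAutomaton Archetype(
      TransitionTable{{{0, 1}, 1}, {{1, 2}, 2}});
  return Archetype; // copies one shared_ptr and one int, not the map
}
```

Each copy keeps its own current state while reading the same immutable table, so the cost of populating the map is paid once per process rather than once per request.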
186765 Commits