llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Jonas Paulsson	55afb21f97	[RegUsageInfoCollector] Bugfix for handling of register aliases. Don't assume the alias of a defined reg is always already in the set. As the test case in https://bugs.llvm.org/show_bug.cgi?id=36587 discovered, it is wrong to assume that all the aliases of the defined register in the current function is already present in the UsedPhysRegsMask. This patch changes this so that any definition in the current function of a phys-reg always results in all its aliases inserted into the set of defined registers. Review: Quentin Colombet https://reviews.llvm.org/D45157 llvm-svn: 331509	2018-05-04 07:50:05 +00:00
Max Kazantsev	7579f254a8	[IRCE] Fix misuse of dyn_cast which leads to UB llvm-svn: 331508	2018-05-04 07:34:35 +00:00
Dean Michael Berris	18fc7875e9	[XRay][compiler-rt+docs] Introduce __xray_log_init_mode(...). Summary: This addresses http://llvm.org/PR36790. The change Deprecates a number of functions and types in `include/xray/xray_log_interface.h` to recommend using string-based configuration of XRay through the __xray_log_init_mode(...) function. In particular, this deprecates the following: - `__xray_set_log_impl(...)` -- users should instead use the `__xray_log_register_mode(...)` and `__xray_log_select_mode(...)` APIs. - `__xray_log_init(...)` -- users should instead use the `__xray_log_init_mode(...)` function, which also requires using the `__xray_log_register_mode(...)` and `__xray_log_select_mode(...)` functionality. - `__xray::FDRLoggingOptions` -- in following patches, we'll be migrating the FDR logging implementations (and tests) to use the string-based configuration. In later stages we'll remove the `__xray::FDRLoggingOptions` type, and ask users to migrate to using the string-based configuration mechanism instead. - `__xray::BasicLoggingOptions` -- same as `__xray::FDRLoggingOptions`, we'll be removing this type later and instead rely exclusively on the string-based configuration API. We also update the documentation to reflect the new advice and remove some of the deprecated notes. Reviewers: eizan, kpw, echristo, pelikan Reviewed By: kpw Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46173 llvm-svn: 331503	2018-05-04 06:01:12 +00:00
Michael Zolotukhin	1769860761	[MachineCSE] Rewrite a loop checking if a block is in a set of blocks without using a set. NFC. Summary: Using a set is unnecessary here an in some cases (see e.g. PR37277) takes significant amount of time to just insert values into it. In this particular case all we need is just to check if we find the block we are looking for or not. Reviewers: davide Subscribers: hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D46411 llvm-svn: 331502	2018-05-04 01:40:05 +00:00
Craig Topper	b8509ccc4e	[LoopIdiomRecognize] Replace more unchecked dyn_casts with cast. Two of these are immediately dereferenced on the next line. The other two are passed immediately to the IRBuilder constructor which can't handle a nullptr. llvm-svn: 331500	2018-05-04 01:04:28 +00:00
Craig Topper	cad41d3462	[LoopIdiomRecognize] Use a regular array instead of a SmallVector and explicit ArrayRef. llvm-svn: 331499	2018-05-04 01:04:26 +00:00
Craig Topper	a8a679e506	[LoopIdiomRecognize] Turn two uncheck dyn_casts into regular casts. These are casts on users of a PHINode to Instruction. I think since PHINode is an Instruction any users would also be Instructions. At least a cast will give us an assertion if its wrong. llvm-svn: 331498	2018-05-04 01:04:24 +00:00
Craig Topper	b74172394c	[LoopIdiomRecognize] Add a test case to show incorrect transformation of an infinite loop with side effets into a countable loop using ctlz. We currently recognize this idiom where x is signed and thus the shift in an ashr. int cnt = 0; while (x) { x >>= 1; // arithmetic shift right ++cnt; } and turn it into (bitwidth - ctlz(x)). And if there is anything else in the loop we will create a new loop that runs that many times. If x is initially negative, the shift result will never be 0 and thus the loop is infinite. If you put something with side effects in the loop, that side effect will now only happen bitwidth times instead of an infinite number of times. So this transform is only safe for logical shift right (which we don't currently recognize) or if we can prove that x cannot be negative before the loop. llvm-svn: 331493	2018-05-03 23:50:29 +00:00
Tom Stellard	86275193d3	AMDGPU: Make getSubRegFromChannel a static member of AMDGPURegisterInfo Summary: This makes is possible to have R600RegisterInfo and SIRegisterInfo not inherit from AMDGPURegisterInfo. Reviewers: arsenm, nhaehnle Reviewed By: arsenm Subscribers: kzhuravl, wdng, yaxunl, dstuttard, tpr, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D46280 llvm-svn: 331490	2018-05-03 22:38:06 +00:00
Simon Pilgrim	575c3c5874	[X86] Add WriteDPPD/WriteDPPS dot product scheduler classes llvm-svn: 331489	2018-05-03 22:31:19 +00:00
Simon Pilgrim	66c6126889	[X86][Znver1] Use SchedAlias to tag microcoded scheduler classes Avoids extra entries in the class tables. Found a typo that missed the MMX_PHSUBSW instruction. llvm-svn: 331488	2018-05-03 22:12:23 +00:00
Justin Bogner	fd9c8be5ef	Fix include of config.h that was incorrectly changed in r331184 The RWMutex implementation depends on config.h macros (specifically HAVE_PTHREAD_H and HAVE_PTHREAD_RWLOCK_INIT), so we need to be including it and not just llvm-config.h here or we fall back to a much slower implementation. llvm-svn: 331487	2018-05-03 21:59:13 +00:00
Sanjay Patel	c32d634c5e	[InstCombine] refine select-of-constants to bitwise ops Add logic for the special case when a cmp+select can clearly be reduced to just a bitwise logic instruction, and remove an over-reaching chunk of general purpose bit magic. The primary goal is to remove cases where we are not improving the IR instruction count when doing these select transforms, and in all cases here that is true. In the motivating 3-way compare tests, there are further improvements because we can combine/propagate select values (not sure if that belongs in instcombine, but it's there for now). DAGCombiner has folds to turn some of these selects into bit magic, so there should be no difference in the end result in those cases. Not all constant combinations are handled there yet, however, so it is possible that some targets will see more cmov/csel codegen with this change in IR canonicalization. Ideally, we'll go further to not turn selects into multiple logic/math ops in instcombine, and we'll canonicalize to selects. But we should make sure that this step does not result in regressions first (and if it does, we should fix those in the backend). The general direction for this change was discussed here: http://lists.llvm.org/pipermail/llvm-dev/2016-September/105373.html http://lists.llvm.org/pipermail/llvm-dev/2017-July/114885.html Alive proofs for the new bit magic: https://rise4fun.com/Alive/XG7 Differential Revision: https://reviews.llvm.org/D46086 llvm-svn: 331486	2018-05-03 21:58:44 +00:00
Tom Stellard	47e2407968	GlobalISel: Use a callback to compute constrained reg class for unallocatble registers Summary: constrainOperandRegClass() currently fails if it tries to constrain the register class of an operand that is defeined with an unallocatable register class. This patch resolves this by adding a target callback to compute register constriants in this case. This is required by the AMDGPU because many of its instructions have source opreands defined with the unallocatable register classe VS_32 which is a union of two allocatable register classes VGPR_32 and SReg_32. Reviewers: dsanders, aditya_nandakumar Reviewed By: aditya_nandakumar Subscribers: rovka, kristof.beyls, tpr, llvm-commits Differential Revision: https://reviews.llvm.org/D45991 llvm-svn: 331485	2018-05-03 21:44:16 +00:00
Teresa Johnson	0c94127f9b	[ThinLTO] Add support for optimization remarks to thinBackend Summary: Support was added to the regular LTO backend, but not thinBackend. This patch adds that support. Reviewers: pcc, davide Subscribers: mehdi_amini, inglorion, llvm-commits Differential Revision: https://reviews.llvm.org/D46376 llvm-svn: 331481	2018-05-03 20:24:12 +00:00
Sanjay Patel	ab02032316	[PowerPC] add more FMF debug output; NFC We can't see all of the problems currently unless we look at debug output when the global 'unsafe' is on. It's a mess. This is another attempt to make sure that D45710 is not making changes unintentionally. llvm-svn: 331476	2018-05-03 18:49:35 +00:00
Simon Pilgrim	54fed1e65a	[X86][AVX512] VPLZCNT instructions match SchedWriteVecIMul scheduling class not SchedWriteVecALU. llvm-svn: 331473	2018-05-03 18:22:49 +00:00
Simon Pilgrim	bbc813836e	[X86] Split WriteVecShift/WriteVarVecShift into MMX, XMM and YMM/ZMM scheduler classes This took a bit of extra work as on Intel targets the old (V)PSLLDrr/(V)PSLLDrm style instructions act differently - I ended up creating WriteVecShiftImm classes for XMM/YMM/ZMM vector shift by immediate and retaining WriteVecShift as the default (used only by MMX) plus WriteVecShiftX/WriteVecShiftY. X86SchedWriteWidths hides most of this thank goodness. llvm-svn: 331472	2018-05-03 17:56:43 +00:00
Sanjay Patel	b9989c7cc8	[PowerPC] add tests for FMF propagation; NFC I'm choosing PPC out of convenience because it does all of the transforms of interest in these tests by default. There are multiple FMF problems shown in the current checks. D45710 is proposing to fix part of that. llvm-svn: 331471	2018-05-03 17:41:37 +00:00
Bjorn Pettersson	67fede018f	[DebugInfo] Correction for an assert in DIExpression::createFragmentExpression Summary: When we create a fragment expression, and there already is an old fragment expression, we assert that the new fragment is within the range for the old fragment. If for example the old fragment expression says that we describe bit 10-16 of a variable (Offset=10, Size=6), and we now want to create a new fragment expression only describing bit 3-6 of the original value, then the resulting fragment expression should have Offset=13, Size=3. The assert is supposed to catch if the resulting fragment expression is outside the range for the old fragment. However, it used to verify that the Offset+Size of the new fragment was smaller or equal than Offset+Size for the old fragment. What we really want to check is that Offset+Size of the new fragment is smaller than the Size of the old fragment. Reviewers: aprantl, vsk Reviewed By: aprantl Subscribers: davide, llvm-commits, JDevlieghere Differential Revision: https://reviews.llvm.org/D46391 llvm-svn: 331465	2018-05-03 17:04:21 +00:00
Bjorn Pettersson	80220423bb	Reapply "[SelectionDAG] Selection of DBG_VALUE using a PHI node result (pt 2)" Summary: This reverts SVN r331441 (reapplies r331337), together with a fix in to handle an already existing fragment expression in the dbg.value that must be fragmented due to a split PHI node. This should solve the problem seen in PR37321, which was the reason for the revert of r331337. The situation in PR37321 is that we have a PHI node like this %u.sroa = phi i80 [ %u.sroa.x, %if.x ], [ %u.sroa.y, %if.y ], [ %u.sroa.z, %if.z ] and a dbg.value like this call void @llvm.dbg.value(metadata i80 %u.sroa, metadata !13, metadata !DIExpression(DW_OP_LLVM_fragment, 0, 80)) The phi node is split into three 32-bit PHI nodes %30:gr32 = PHI %11:gr32, %bb.4, %14:gr32, %bb.5, %27:gr32, %bb.8 %31:gr32 = PHI %12:gr32, %bb.4, %15:gr32, %bb.5, %28:gr32, %bb.8 %32:gr32 = PHI %13:gr32, %bb.4, %16:gr32, %bb.5, %29:gr32, %bb.8 but since the original value only is 80 bits we need to adjust the size of the last fragment expression, and with this patch we get DBG_VALUE debug-use %30:gr32, debug-use $noreg, !"u", !DIExpression(DW_OP_LLVM_fragment, 0, 32) DBG_VALUE debug-use %31:gr32, debug-use $noreg, !"u", !DIExpression(DW_OP_LLVM_fragment, 32, 32) DBG_VALUE debug-use %32:gr32, debug-use $noreg, !"u", !DIExpression(DW_OP_LLVM_fragment, 64, 16) Reviewers: vsk, aprantl, mstorsjo Reviewed By: aprantl Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D46384 llvm-svn: 331464	2018-05-03 17:04:16 +00:00
Nico Weber	56d8831d29	use LLVM's standard CMakeLists.txt layout for llvm-xray llvm-svn: 331455	2018-05-03 14:25:57 +00:00
Roman Lebedev	8cf123ed80	[CodeGen][X86][NFC] Copy two selectcc tests from AArch64. These tests are for DAGCombiner::foldSelectCCToShiftAnd(). Right now, they were only tested for AArch64, but given the upcoming X86 changes to the hasAndNot(), the test coverage needs to be added. These tests originated from D27489 / rL289738 llvm-svn: 331454	2018-05-03 13:33:07 +00:00
Simon Pilgrim	c4c90c5eac	[X86] Split WriteVecALU/WritePHAdd into XMM and YMM/ZMM scheduler classes llvm-svn: 331453	2018-05-03 13:27:10 +00:00
Tim Northover	9ef696c849	ARM: don't try to over-align large vectors as arguments. By default LLVM thinks very large vectors get aligned to their size when passed across functions. Unfortunately no-one told the ARM backend so it doesn't trigger stack realignment and so accesses can cause the usual misalignment issues (e.g. a data abort). This changes the ABI alignment to the stack alignment, which in practice (and as a bonus) also coincides with the alignment "natural" vectors get. llvm-svn: 331451	2018-05-03 12:54:25 +00:00
Piotr Padlewski	997163a54e	perform DSE through launder.invariant.group Summary: Alias Analysis knows that llvm.launder.invariant.group returns pointer that mustalias argument, but this information wasn't used, therefor we didn't DSE through launder.invariant.group Reviewers: chandlerc, dberlin, bogner, hfinkel, efriedma Reviewed By: dberlin Subscribers: amharc, llvm-commits, nlewycky, rsmith Differential Revision: https://reviews.llvm.org/D31581 llvm-svn: 331449	2018-05-03 11:03:53 +00:00
Piotr Padlewski	1e96fe1a21	Rename invariant.group.barrier to launder.invariant.group Summary: This is one of the initial commit of "RFC: Devirtualization v2" proposal: https://docs.google.com/document/d/16GVtCpzK8sIHNc2qZz6RN8amICNBtvjWUod2SujZVEo/edit?usp=sharing Reviewers: rsmith, amharc, kuhar, sanjoy Subscribers: arsenm, nhaehnle, javed.absar, hiraditya, llvm-commits Differential Revision: https://reviews.llvm.org/D45111 llvm-svn: 331448	2018-05-03 11:03:01 +00:00
Simon Pilgrim	4f80b189b7	[X86][AVX512] VPAVG instructions should be tagged as SchedWriteVecALU llvm-svn: 331446	2018-05-03 10:53:17 +00:00
Simon Pilgrim	b7289046cc	[X86] Split WriteVecIMul/WriteVecPMULLD/WriteMPSAD/WritePSADBW into XMM and YMM/ZMM scheduler classes Also retagged VDBPSADBW instructions as SchedWritePSADBW instead of SchedWriteVecIMul which matches the behaviour on SkylakeServer (the only thing that supports it...) llvm-svn: 331445	2018-05-03 10:31:20 +00:00
Simon Pilgrim	61aa16d663	[X86] Update MMX instructions to be tagged with X86SchedWriteWidths types llvm-svn: 331443	2018-05-03 09:11:32 +00:00
Benjamin Kramer	8410bf3c2f	[WebAssembly] MC: Don't litter test directory. llvm-svn: 331442	2018-05-03 08:25:14 +00:00
Martin Storsjo	0e9558e4c6	Revert "[SelectionDAG] Selection of DBG_VALUE using a PHI node result (pt 2)" This reverts SVN r331337, see PR37321 for details on the regression it introduced. llvm-svn: 331441	2018-05-03 07:09:33 +00:00
Clement Courbet	41ab59e992	[TableGen][NFC] Make ResourceCycles definitions more explicit. https://reviews.llvm.org/D46356 llvm-svn: 331439	2018-05-03 06:08:47 +00:00
Craig Topper	711da34d55	[LoopIdiomRecognize] When looking for 'x & (x -1)' for popcnt, make sure the left hand side of the 'and' matches the left hand side of the 'subtract' llvm-svn: 331437	2018-05-03 05:48:49 +00:00
Craig Topper	4ae293e526	[LoopIdiomRecognize] Add a test case showing that we transform to ctpop without fully checking the 'x & (x-1)' part. The code fails to check that the same value is used twice. We only make sure the left hand side of the and is part of the loop recurrence. The 'x' in the subtract can be any value. llvm-svn: 331436	2018-05-03 05:48:48 +00:00
Craig Topper	ab47ea42e6	[LoopIdiomRecognize] Remove unnecessary cast from BinaryOperator to Instruction. NFC BinaryOperator is a sub class of Instruction. We don't need an explicit cast back to Instruction. llvm-svn: 331432	2018-05-03 05:00:18 +00:00
Saleem Abdulrasool	7dafb8329b	lit: flesh out `SubsituteCaptures` further Add overloads for `__len__` and `__getitem__` to allow use of this class on Linux as well as Windows. With these overloads, lit can be used on both hosts for the swift testsuite. llvm-svn: 331431	2018-05-03 04:45:43 +00:00
Max Kazantsev	3dbdfd12d9	Re-enable "[SCEV] Make computeExitLimit more simple and more powerful" This patch was temporarily reverted because it has exposed bug 37229 on PowerPC platform. The bug is unrelated to the patch and was just a general bug in the optimization done for PowerPC platform only. The bug was fixed by the patch rL331410. This patch returns the disabled commit since the bug was fixed. llvm-svn: 331427	2018-05-03 02:37:55 +00:00
Petr Hosek	8259fee6a9	[Support] Support building LLVM for Fuchsia These are necessary changes to support building LLVM for Fuchsia. While these are not sufficient to run on Fuchsia, they are still useful when cross-compiling LLVM libraries and runtimes for Fuchsia. Differential Revision: https://reviews.llvm.org/D46345 llvm-svn: 331423	2018-05-03 01:38:49 +00:00
Shoaib Meenai	22e91e6119	[ObjCARC] Convert an if to an early continue. NFC This reduces nesting and makes the logic slightly easier to follow. Differential Revision: https://reviews.llvm.org/D46371 llvm-svn: 331422	2018-05-03 01:20:36 +00:00
Nemanja Ivanovic	4d80e2a071	Commit r331416 breaks the big-endian PPC bot. On the big endian build, we actually encounter constants wider than 64-bits. Add the guard to prevent tripping the assert. llvm-svn: 331420	2018-05-03 01:04:13 +00:00
Chandler Carruth	6e8ec9c534	[gcov] Switch to an explicit if clunky array to satisfy some compilers on various build bots that are unhappy with using makeArrayRef with an initializer list. llvm-svn: 331418	2018-05-03 00:11:03 +00:00
Michael Berg	dc3d19e5de	MachineInst support mapping SDNode fast math flags for support in Back End code generation Summary: Machine Instruction flags for fast math support and MIR print support Reviewers: spatel, arsenm Reviewed By: arsenm Subscribers: wdng Differential Revision: https://reviews.llvm.org/D45781 llvm-svn: 331417	2018-05-03 00:07:56 +00:00
Nemanja Ivanovic	3c3f64c605	[PowerPC] Implement isMaskAndCmp0FoldingBeneficial Sinking the and closer to a compare against zero is beneficial on PPC as it allows us to emit record-form instructions. In the future, we may expand this to a larger set of operations that feed compares against zero since PPC has lots of record-form instructions. Differential revision: https://reviews.llvm.org/D46060 llvm-svn: 331416	2018-05-02 23:55:23 +00:00
Sam Clegg	10436289e7	[WebAssembly] MC: Create and use first class section symbols Differential Revision: https://reviews.llvm.org/D46335 llvm-svn: 331413	2018-05-02 23:11:38 +00:00
Sam Clegg	38a2e730d3	[MC] Factor MCObjectStreamer::addFragmentAtoms out of MachO streamer. This code previously existed only in MCMachOStreamer but is useful for WebAssembly too. See: D46335 Differential Revision: https://reviews.llvm.org/D46297 llvm-svn: 331412	2018-05-02 23:01:10 +00:00
Nemanja Ivanovic	9dcc8baaaa	[PowerPC] No CTR loop if the candidate exiting block is in a different loop The CTR loops pass will insert the decrementing branch instruction in an exiting block for the loop being transformed. However if that block is part of another loop as well (whether a nested loop or with irreducible CFG), it is not valid to use that exiting block. In fact, if the loop hass irreducible CFG, we don't bother analyzing it and we just bail on the transformation. In practice, this doesn't lead to a noticeable reduction in the number of loops transformed by this pass. Fixes https://bugs.llvm.org/show_bug.cgi?id=37229 Differential Revision: https://reviews.llvm.org/D46162 llvm-svn: 331410	2018-05-02 22:56:04 +00:00
Chandler Carruth	213b57c660	[GCOV] Emit the writeout function as nested loops of global data. Summary: Prior to this change, LLVM would in some cases emit massive writeout functions with many 10s of 1000s of function calls in straight-line code. This is a very wasteful way to represent what are fundamentally loops and creates a number of scalability issues. Among other things, register allocating these calls is extremely expensive. While D46127 makes this less severe, we'll still run into scaling issues with this eventually. If not in the compile time, just from the code size. Now the pass builds up global data structures modeling the inputs to these functions, and simply loops over the data structures calling the relevant functions with those values. This ensures that the code size is a fixed and only data size grows with larger amounts of coverage data. A trivial change to IRBuilder is included to make it easier to build the constants that make up the global data. Reviewers: wmi, echristo Subscribers: sanjoy, mcrosier, llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D46357 llvm-svn: 331407	2018-05-02 22:24:39 +00:00
Martin Storsjo	214c7bcdc6	[llvm-rc] Default to writing the output next to the input, if no output is specified This matches what rc.exe does if no output is specified. Differential Revision: https://reviews.llvm.org/D46239 llvm-svn: 331403	2018-05-02 21:15:24 +00:00
Martin Storsjo	cb354baeb1	[llvm-cvtres] Allow parameters preceded by '-' in addition to '/' The real cvtres.exe also allows parameters in either form. Differential Revision: https://reviews.llvm.org/D46358 llvm-svn: 331402	2018-05-02 21:15:13 +00:00

1 2 3 4 5 ...

163682 Commits