llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 19:52:54 +01:00

Author	SHA1	Message	Date
Konstantin Zhuravlyov	3a760e95ad	AMDGPU: Expand setcc for v2f32 and v4f32 llvm-svn: 314853	2017-10-03 21:45:01 +00:00
Konstantin Zhuravlyov	9814ddbc82	AMDGPU: Expand setcc for v2i32 and v4i32 llvm-svn: 314852	2017-10-03 21:31:24 +00:00
Konstantin Zhuravlyov	7ec1e6fa4f	AMDGPU/Docs: Follow up on review feedback in https://reviews.llvm.org/D38387 llvm-svn: 314848	2017-10-03 21:18:03 +00:00
Jakub Kuderski	2679c6a49f	[Dominators] Make eraseNode invalidate DFS numbers This patch makes DT::eraseNode mark DFSInfo as invalid. Not marking it as invalid leads to DFS numbers getting corrupted and failing VerifyDFSNumbers check. This patch also makes children iterator const (NFC). llvm-svn: 314847	2017-10-03 21:17:48 +00:00
Konstantin Zhuravlyov	dd0f62bde0	AMDGPU: Add ELFOSABI_AMDGPU_MESA3D Differential Revision: https://reviews.llvm.org/D38387 llvm-svn: 314846	2017-10-03 21:14:14 +00:00
Reid Kleckner	17e3a5eb26	[X86] Remove dead declaration convertArgMovsToPushes, NFC This was dead when it landed in r252578. We have this functionality, if not for stack probe calls, but for regular calls in X86CallFrameOptimization.cpp. llvm-svn: 314845	2017-10-03 21:12:18 +00:00
Rafael Espindola	5d5702b4dd	Pre-compute the tail of the archive An archive looks like <header> <symbol table> <tail> The symbol table refers to offsets in the tail. A complication is that we would like to support symbol tables that use 64 bit offsets if it turns out that any of the offsets is too big. This patch changes the archive writer to first compute the tail. We cannot just compute one big StringRef since that would require reading every member upfront, but we can represent it as a series of StringRefs. Having done that it is much easier to compute the symbol table and all offsets are computed before it is written. With this if there is an accounting problem it will show up with a regular symbol table, not just when a 64 bit one is needed. llvm-svn: 314844	2017-10-03 20:59:43 +00:00
Konstantin Zhuravlyov	faac723406	AMDGPU: Add ELFOSABI_AMDGPU_PAL llvm-svn: 314843	2017-10-03 20:54:07 +00:00
Reid Kleckner	ad0810fc5c	Refactor DIBuilder dbg intrinsic insertion, NFC Both dbg.declare and dbg.value insertion had duplicate code for the two overloads with different insertion point conventions. llvm-svn: 314839	2017-10-03 20:36:40 +00:00
Sanjay Patel	ca8280aff1	[InstCombine] add tests for icmp gt/lt (shr X, C1), C2; NFC Surprisingly, we have zero coverage for these patterns. Many of these are handled in InstSimplify, but it's not obvious what the rule for folding each case should be, so I've just stamped out everything. It should be possible to fold every case, but currently, we miss these: int ashr_slt(int x) { return (x >> 1) < 1; } int ashr_sgt(int x) { return (x >> 1) > 0; } https://godbolt.org/g/aB2hLE llvm-svn: 314837	2017-10-03 20:34:20 +00:00
Jessica Paquette	6d75551680	[MachineOutliner] Fix off-by-one in cost model This commit does two things. Firstly, it cleans up some of the benefit calculation wrt outlined functions and candidates. Secondly, it fixes an off-by-one bug in the cost model which was caused by the benefit value of an OutlinedFunction and Candidate differing by 1. It updates the remarks test to reflect this change. llvm-svn: 314836	2017-10-03 20:32:55 +00:00
Stefan Pintilie	fc233e8dcc	[PowerPC] Revert P9 scheduling model to incomplete Partially revert a previous change from commit: https://llvm.org/svn/llvm-project/llvm/trunk@314026 The previous change caused regressions on Power 9. llvm-svn: 314835	2017-10-03 20:27:30 +00:00
Craig Topper	cdf895a6c6	[InstCombine] Use isSignBitCheck to simplify an if statement. Directly create new sign bit compares instead of manipulating the constant. NFCI Since we no longer had the direct constant compares, manipulating the constant seemeded less clear. llvm-svn: 314830	2017-10-03 19:14:23 +00:00
Tim Renouf	f4680f0891	[AMDGPU] implemented pal metadata Summary: For the amdpal OS type: We write an AMDGPU_PAL_METADATA record in the .note section in the ELF (or as an assembler directive). It contains key=value pairs of 32 bit ints. It is a merge of metadata from codegen of the shaders, and metadata provided by the frontend as _amdgpu_pal_metadata IR metadata. Where both sources have a key=value with the same key, the two values are ORed together. This .note record is part of the amdpal ABI and will be documented in docs/AMDGPUUsage.rst in a future commit. Eventually the amdpal OS type will stop generating the .AMDGPU.config section once the frontend has safely moved over to using the .note records above instead of .AMDGPU.config. Reviewers: arsenm, nhaehnle, dstuttard Subscribers: kzhuravl, wdng, yaxunl, llvm-commits, t-tye Differential Revision: https://reviews.llvm.org/D37753 llvm-svn: 314829	2017-10-03 19:03:52 +00:00
Alexander Timofeev	623ce2de41	[AMDGPU] Avoid predicated execution of the basic blocks containing scalar instructions. Differential revision: https://reviews.llvm.org/D38293 llvm-svn: 314828	2017-10-03 18:55:36 +00:00
Hans Wennborg	439468e183	Fix -Wcovered-switch-default warnings from r314821 llvm-svn: 314826	2017-10-03 18:44:12 +00:00
Hans Wennborg	059c2fadb1	Revert r314817 "[dwarfdump] Add -lookup option" The test fails on Linux; see follow-up email on the llvm-commits list. > Add the option to lookup an address in the debug information and print > out the file, function, block and line table details. > > Differential revision: https://reviews.llvm.org/D38409 This also reverts the follow-up r314818: > [test] Fix llvm-dwarfdump/cmdline.test > > Fixes test/tools/llvm-dwarfdump/cmdline.test llvm-svn: 314825	2017-10-03 18:39:13 +00:00
Hans Wennborg	11d67365c5	Revert r314806 "[SLP] Vectorize jumbled memory loads." All the buildbots are red, e.g. http://lab.llvm.org:8011/builders/clang-cmake-aarch64-lld/builds/2436/ > Summary: > This patch tries to vectorize loads of consecutive memory accesses, accessed > in non-consecutive or jumbled way. An earlier attempt was made with patch D26905 > which was reverted back due to some basic issue with representing the 'use mask' of > jumbled accesses. > > This patch fixes the mask representation by recording the 'use mask' in the usertree entry. > > Change-Id: I9fe7f5045f065d84c126fa307ef6ebe0787296df > > Reviewers: mkuper, loladiro, Ayal, zvi, danielcdh > > Reviewed By: Ayal > > Subscribers: hans, mzolotukhin > > Differential Revision: https://reviews.llvm.org/D36130 llvm-svn: 314824	2017-10-03 18:32:29 +00:00
Reid Kleckner	5ae6aa0c7e	Fix expectations in MC wasm init-fini-array test llvm-svn: 314823	2017-10-03 18:30:38 +00:00
Reid Kleckner	b41768c2ea	Implement David Blaikie's suggestion for comparison operators llvm-svn: 314822	2017-10-03 18:30:11 +00:00
Hans Wennborg	abd9b7ecb5	CodeView: Provide a .def file with the register ids The list of register ids was previously written out in a couple of dirrent places. This puts it in a .def file and also adds a few more registers (e.g. the x87 regs) which should lead to more readable dumps, but I didn't include the whole list since that seems unnecessary. X86_MC::initLLVMToSEHAndCVRegMapping is pretty ugly, but at least it's not relying on magic constants anymore. The TODO of using tablegen still stands. Differential revision: https://reviews.llvm.org/D38480 llvm-svn: 314821	2017-10-03 18:27:22 +00:00
Reid Kleckner	261ba47d84	[DebugInfo] Correctly coalesce DBG_VALUEs that mix direct and indirect values Summary: This should fix a regression introduced by r313786, which switched from MachineInstr::isIndirectDebugValue() to checking if operand 1 is an immediate. I didn't have a test case for it until now. A single UserValue, which approximates a user variable, may have many DBG_VALUE instructions that disagree about whether the variable is in memory or in a virtual register. This will become much more common once we have llvm.dbg.addr, but you can construct such a test case manually today with llvm.dbg.value. Before this change, we would get two UserValues: one for direct and one for indirect DBG_VALUE instructions describing the same variable. If we build separate interval maps for direct and indirect locations, we will end up accidentally coalescing identical DBG_VALUE intervals that need to remain separate because they are broken up by intervals of the opposite direct-ness. Reviewers: aprantl Subscribers: llvm-commits, hiraditya Differential Revision: https://reviews.llvm.org/D37932 llvm-svn: 314819	2017-10-03 17:59:02 +00:00
Jonas Devlieghere	af60763a35	[test] Fix llvm-dwarfdump/cmdline.test Fixes test/tools/llvm-dwarfdump/cmdline.test llvm-svn: 314818	2017-10-03 17:28:37 +00:00
Jonas Devlieghere	cce4ff485e	[dwarfdump] Add -lookup option Add the option to lookup an address in the debug information and print out the file, function, block and line table details. Differential revision: https://reviews.llvm.org/D38409 llvm-svn: 314817	2017-10-03 17:10:21 +00:00
Simon Pilgrim	06722d0fa9	[X86] Add non-SSE tests for PR15215 as well llvm-svn: 314815	2017-10-03 17:04:36 +00:00
Geoff Berry	a1a23c617d	Revert "Re-enable "[MachineCopyPropagation] Extend pass to do COPY source forwarding"" This reverts commit r314729. Another bug has been encountered in an out-of-tree target reported by Quentin. llvm-svn: 314814	2017-10-03 16:59:13 +00:00
Simon Pilgrim	a4dbf20bc2	[X86][SSE] Add bool vector extraction test cases from PR15215 llvm-svn: 314813	2017-10-03 16:56:57 +00:00
Rafael Espindola	129f5a2768	Use sched_getaffinity instead of std:🧵:hardware_concurrency. The issue with std:🧵:hardware_concurrency is that it forwards to libc and some implementations (like glibc) don't take thread affinity into consideration. With this change a llvm program that can execute in only 2 cores will use 2 threads, even if the machine has 32 cores. This makes benchmarking a lot easier, but should also help if someone doesn't want to use all cores for compilation for example. llvm-svn: 314809	2017-10-03 16:25:15 +00:00
Dehao Chen	6884ce196d	Revert the change that accidentally went in r314806. llvm-svn: 314807	2017-10-03 15:50:42 +00:00
Mohammad Shahid	9bab937b54	[SLP] Vectorize jumbled memory loads. Summary: This patch tries to vectorize loads of consecutive memory accesses, accessed in non-consecutive or jumbled way. An earlier attempt was made with patch D26905 which was reverted back due to some basic issue with representing the 'use mask' of jumbled accesses. This patch fixes the mask representation by recording the 'use mask' in the usertree entry. Change-Id: I9fe7f5045f065d84c126fa307ef6ebe0787296df Reviewers: mkuper, loladiro, Ayal, zvi, danielcdh Reviewed By: Ayal Subscribers: hans, mzolotukhin Differential Revision: https://reviews.llvm.org/D36130 llvm-svn: 314806	2017-10-03 15:28:48 +00:00
Jakub Kuderski	80a5a5c1e0	[Dominators] Don't use default parameter in lambda ... to make GCC buildbots happy. llvm-svn: 314805	2017-10-03 14:51:31 +00:00
Oliver Stannard	7ca8df0841	[ARM] Use table-gen'd assembly operand diags in ARM asm parser This switches the ARM AsmParser to use assembly operand diagnostics from tablegen, rather than a switch statement on the ARMMatchResultTy. It moves the existing diagnostic strings to tablegen, but adds no new ones, so this is NFC except for one diagnostic string that had an off-by-1 error in the hand-written switch statement. Differential revision: https://reviews.llvm.org/D31607 llvm-svn: 314804	2017-10-03 14:38:52 +00:00
Oliver Stannard	a8cc0cc3a1	[AsmParser] Add DiagnosticString to AsmOperands in tablegen This adds a DiagnosticString member to the AsmOperand tablegen class, so that the diagnostic text to be used when an assembly operand is incorrect can be stored in the tablegen description of the operand, rather than in a separate switch statement in the AsmParser. If DiagnosticString is used for any operands, tablegen will emit a getMatchKindDiag function, to map from diagnostic enums to strings. Differential revision: https://reviews.llvm.org/D31606 llvm-svn: 314803	2017-10-03 14:34:57 +00:00
Jakub Kuderski	38458ea813	[Dominators] Add DFS number verification Summary: This patch teaches the DominatorTree verifier to check DFS In/Out numbers which are used to answer dominance queries. DFS number verification is done in O(nlogn), so it shouldn't add much overhead on top of the O(n^3) sibling property verification. This check should detect errors like the one spotted in PR34466 and related bug reports. The patch also cleans up the DFS calculation a bit, as all constructed trees should have a single root now. I see 2 new test failures when running check-all after this change: ``` Failing Tests (2): Polly :: Isl/CodeGen/OpenMP/reference-argument-from-non-affine-region.ll Polly :: Isl/CodeGen/OpenMP/two-parallel-loops-reference-outer-indvar.ll ``` which seem to happen just after `Create LLVM-IR from SCoPs` -- I XFAILed them in r314800. Reviewers: dberlin, grosser, davide, zhendongsu, bollu Reviewed By: dberlin Subscribers: nandini12396, bollu, Meinersbur, brzycki, llvm-commits Differential Revision: https://reviews.llvm.org/D38331 llvm-svn: 314801	2017-10-03 14:33:41 +00:00
Oliver Stannard	17b7236099	[ARM, Asm] Use correct source location for register tokens tryParseRegister advances the lexer, so we need to take copies of the start and end locations of the register operand before calling it. Previously, the caret in the diagnostic pointer to the comma after the r0 operand in the test, rather than the start of the operand. Differential revision: https://reviews.llvm.org/D31537 llvm-svn: 314799	2017-10-03 14:30:58 +00:00
Simon Dardis	487e504edd	[mips] Enable spilling and reloading of the dsp register set. The dsp register class is an alias of the gpr register class, so we have to define instructions for spilling and reloading. Reviewers: atanasyan Differential Revision: https://reviews.llvm.org/D38038 llvm-svn: 314798	2017-10-03 13:45:49 +00:00
John Brawn	914d52fafc	[CGP] Make optimizeMemoryInst capable of handling multiple AddrModes Currently optimizeMemoryInst requires that all of the AddrModes it sees are identical. This patch makes it capable of tracking multiple AddrModes, so long as they differ in at most one field. This patch does nothing by itself, but later patches will make use of it to insert or reuse phi or select instructions for the differing fields. Differential Revision: https://reviews.llvm.org/D38278 llvm-svn: 314795	2017-10-03 13:08:22 +00:00
John Brawn	238fcba073	[CGP] In optimizeMemoryInst handle select similarly to phi This lets us optimize away selects that perform the same address computation in two different ways and is also the first step towards being able to handle selects between two different, but compatible, address computations. Differential Revision: https://reviews.llvm.org/D38242 llvm-svn: 314794	2017-10-03 13:04:15 +00:00
Oliver Stannard	492f68f0bb	[ARM, Asm] Fix ubsan failure caused by out-of-range enum value In this code, we use ~0U as a sentinel value for any operand class that doesn't have a user-friendly error message, but this value isn't in range of the MatchClassKind enum, so we need to ensure it does not get passed to isSubclass. llvm-svn: 314793	2017-10-03 12:45:18 +00:00
Simon Pilgrim	1aab667385	[X86][SSE] Add support for decoding PACKSS/PACKUS shuffles masks with UNDEF llvm-svn: 314792	2017-10-03 12:41:39 +00:00
Oliver Stannard	af934d6e94	[ARM, Asm] Remove dead code causing MSan failure. r314779 caused ErrorInfo to be red uninitialised, but also made this code dead, so it can just be removed. llvm-svn: 314791	2017-10-03 12:28:28 +00:00
Simon Pilgrim	1ab89602fb	[X86][SSE] Add support for lowering shuffles to PACKSS/PACKUS If the upper bits of a truncation shuffle patterns have at least the minimum number of sign/zero bits on their inputs then we can safely use PACKSS/PACKUS as shuffles. Partial fix for https://bugs.llvm.org/show_bug.cgi?id=34773 Differential Revision: https://reviews.llvm.org/D38472 llvm-svn: 314788	2017-10-03 12:01:31 +00:00
Evgeny Astigeevich	bde6aee838	[InlineCost, NFC] Extract code dealing with inbounds GEPs from visitGetElementPtr into a function The code responsible for analysis of inbounds GEPs is extracted into a separate function: CallAnalyzer::canFoldInboundsGEP. With the patch SROA enabling/disabling code is localized at one place instead of spreading across the code of CallAnalyzer::visitGetElementPtr. Differential Revision: https://reviews.llvm.org/D38233 llvm-svn: 314787	2017-10-03 12:00:40 +00:00
Sam Clegg	cef54868ef	[WebAssembly] MC: Support for init_array and fini_array Differential Revision: https://reviews.llvm.org/D37757 llvm-svn: 314783	2017-10-03 11:20:28 +00:00
Sean Eveson	39aefb7654	[llvm-cov] Hide files with no coverage from the index when filtering by name Differential Revision: https://reviews.llvm.org/D38457 llvm-svn: 314782	2017-10-03 11:05:28 +00:00
Bjorn Pettersson	3734c37713	[DebugInfo] Handle endianness when moving debug info for split integer values (reapplied) Summary: Take the target's endianness into account when splitting the debug information in DAGTypeLegalizer::SetExpandedInteger. This patch fixes so that, for big-endian targets, the fragment expression corresponding to the high part of a split integer value is placed at offset 0, in order to correctly represent the memory address order. I have attached a PPC32 reproducer where the resulting DWARF pieces for a 64-bit integer were incorrectly reversed. Original patch was reverted due to using -stop-after=isel in the test case (but that is only working when AMDGPU target is included in the llc build). The test case has now been updated to use -stop-before=expand-isel-pseudos instead. Patch by: dstenb Reviewers: JDevlieghere, aprantl, dblaikie Reviewed By: JDevlieghere, aprantl, dblaikie Subscribers: nemanjai Differential Revision: https://reviews.llvm.org/D38172 llvm-svn: 314781	2017-10-03 11:03:02 +00:00
Oliver Stannard	dacbc9891d	[ARM] Use new assembler diags for ARM This converts the ARM AsmParser to use the new assembly matcher error reporting mechanism, which allows errors to be reported for multiple instruction encodings when it is ambiguous which one the user intended to use. By itself this doesn't improve many error messages, because we don't have diagnostic text for most operand types, but as we add that then this will allow more of those diagnostic strings to be used when they are relevant. Differential revision: https://reviews.llvm.org/D31530 llvm-svn: 314779	2017-10-03 10:26:11 +00:00
Simon Pilgrim	23af65d712	Remove unused variable. NFCI. llvm-svn: 314778	2017-10-03 10:01:02 +00:00
Simon Pilgrim	3449eec7c7	[X86][SSE] Add support for shuffle combining from PACKSS/PACKUS Mentioned in D38472 llvm-svn: 314777	2017-10-03 09:54:03 +00:00
Simon Pilgrim	7c222043f3	[X86][SSE] Add support for PACKSS/PACKUS constant folding Pulled out of D38472 llvm-svn: 314776	2017-10-03 09:41:00 +00:00

1 2 3 4 5 ...

154959 Commits