llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Simon Pilgrim	9cb8b71cb7	[X86][AVX512] Split AVX512F and AVX512BW bool-vector bitcast tests llvm-svn: 317020	2017-10-31 18:41:48 +00:00
Benjamin Kramer	f423bb4d8a	[ADT] Split optional to only include copy mechanics and dtor for non-trivial types. This makes uses of Optional more transparent to the compiler (and clang-tidy) and generates slightly smaller code. llvm-svn: 317019	2017-10-31 18:35:54 +00:00
Wolfgang Pieb	2c53dfefa1	[Metadata][NFC] Make MDNode::resolve() public in preparation for the fix to PR33930. Reviewers: aprantl llvm-svn: 317018	2017-10-31 18:25:28 +00:00
Daniel Sanders	24856518f6	[globalisel][tablegen] Allow any comment in DebugCommentAction. NFC llvm-svn: 317017	2017-10-31 18:07:03 +00:00
Philip Reames	d06638559c	[IndVarSimplify] Extract wrapper around SE-.isLoopInvariantPredicate [NFC] This an intermediate state, the next patch will re-inline the markLoopInvariantPredicate function to reduce code duplication. llvm-svn: 317016	2017-10-31 18:04:57 +00:00
Rui Ueyama	f0ff305346	[Support] Make the default chunk size of raw_fd_ostream to 1 GiB. Previously, we call write(2) for each 32767 byte chunk. That is not efficient because Linux can handle much larger write requests. This patch changes the chunk size on Linux to 1 GiB. This patch also changes the default chunks size to SSIZE_MAX. I think that doesn't in practice change this function's behavior on any operating system because SSIZE_MAX on 64-bit machine is unrealistically large, and writing 2 GiB (SSIZE_MAX on 32-bit) on a 32-bit machine by a single call of write(2) is also unrealistic, as the userspace is usually limited to 2 GiB. That said, it is in general a good thing to do because a write larger than SSIZE_MAX is implementation-defined in POSIX. Differential Revision: https://reviews.llvm.org/D39444 llvm-svn: 317015	2017-10-31 17:37:20 +00:00
Philip Reames	e87a19c3aa	[IndVarSimplify] Simplify code using a dictionary Possibly very slightly slower, but this code is not performance critical and the readability benefit alone is huge. llvm-svn: 317012	2017-10-31 17:06:32 +00:00
Reid Kleckner	6dce094074	[X86][AsmParser] Treat '%' as the modulo operator under Intel syntax It can't be a register prefix, anyway. This is consistent with the masm docs on MSDN: https://msdn.microsoft.com/en-us/library/t4ax90d2.aspx This is a straight-forward extension of our support for "MOD" implemented in https://reviews.llvm.org/D33876 / r306425 llvm-svn: 317011	2017-10-31 16:47:38 +00:00
Nico Weber	9406190dba	LTOModule::isBitcodeFile() shouldn't assert when returning false. Fixes a bunch of assert-on-invalid-bitcode regressions after 315483. Expected<> calls assertIsChecked() in its dtor, and operator bool() only calls setChecked() if there's no error. So for functions that don't return an error itself, the Expected<> version needs explicit code to disarm the error that the ErrorOr<> code didn't need. https://reviews.llvm.org/D39437 llvm-svn: 317010	2017-10-31 16:39:47 +00:00
Reid Kleckner	c03d168784	[asan] Upgrade private linkage globals to internal linkage on COFF COFF comdats require symbol table entries, which means the comdat leader cannot have private linkage. llvm-svn: 317009	2017-10-31 16:16:08 +00:00
Simon Pilgrim	b90b467bbf	[X86][SSE] Add VSRLI/VSRAI/VSLLI demanded elts support to computeKnownBits/ComputeNumSignBits Mainly a perf improvements as most combines will have occurred before we lower to these instructions llvm-svn: 317005	2017-10-31 16:06:21 +00:00
Benjamin Kramer	d6177e6458	[LoopVectorize] Replace manual VPlan memory management with unique_ptr. No functionality change intended. llvm-svn: 317003	2017-10-31 14:58:22 +00:00
Jonas Devlieghere	30acec2960	[test] Fix dsymutil/cmdline.test This fixes dsymutil/cmdline.test on platforms where the dsymutil binary has an extension. llvm-svn: 317001	2017-10-31 14:19:02 +00:00
Florian Hahn	4a7156f496	[Reassociate] Remove FIXME from looptest.ll (NFC) Summary: The loop invariant add (i+j) is reassoicated, I think the FIXME can be removed, because this is what the test case tries to check (AFAIK). I also changed the test to use FileCheck. Reviewers: mcrosier, davide Reviewed By: mcrosier, davide Subscribers: davide, llvm-commits Differential Revision: https://reviews.llvm.org/D39424 llvm-svn: 317000	2017-10-31 14:06:31 +00:00
Jonas Devlieghere	17ac392c74	[dsymutil] Implement the --threads option This patch adds the --threads option to dsymutil to process architectures in parallel. The feature is already present in the version distributed with Xcode, but was not yet upstreamed. This is NFC as far as the linking behavior is concerned. As threads are used automatically, the current tests cover the change in implementation. Differential revision: https://reviews.llvm.org/D39355 llvm-svn: 316999	2017-10-31 13:54:15 +00:00
Teresa Johnson	f1bd32e79d	[ThinLTO] Double bits of module hash used for renaming Summary: Use 64 instead of 32 bits of the module hash as the suffix when renaming after promotion to reduce the likelihood of a collision (which we observed in a binary when using 32 bits). Reviewers: pcc Subscribers: llvm-commits, inglorion Differential Revision: https://reviews.llvm.org/D39443 llvm-svn: 316996	2017-10-31 12:56:09 +00:00
Matthew Simpson	fa52fc8d71	[InstCombine] Simplify selects that test cmpxchg instructions If a select instruction tests the returned flag of a cmpxchg instruction and selects between the returned value of the cmpxchg instruction and its compare operand, the result of the select will always be equal to its false value. Differential Revision: https://reviews.llvm.org/D39383 llvm-svn: 316994	2017-10-31 12:34:02 +00:00
Ayman Musa	a556ca2ee3	Adding a shufflevector and select LLVM IR instructions fuzz tool Based on similar python tool - utils/shuffle-fuzz.py - this tool extends the ability of it's previous by optionally attaching select instruction to the generated shufflevector instructions. This was mainly developed to perform exhaustive testing of the X86 AVX512 masked shuffle instructions. But yet it can be used for various other targets. The general design of the implementation is much modular than the original shuffle_fuzz.py tool, which makes it easier for anyone to extend it further. Differential Revision: https://reviews.llvm.org/D38031 Change-Id: I0efc2aaa091b61a8a9552311c21cc77916a97111 llvm-svn: 316989	2017-10-31 11:39:31 +00:00
David Green	90105ec5c8	[LoopUnroll] Clean up remarks for unroll remainder The optimisation remarks for loop unrolling with an unrolled remainder looks something like: test.c:7:18: remark: completely unrolled loop with 3 iterations [-Rpass=loop-unroll] C[i] += A[i*N+j]; ^ test.c:6:9: remark: unrolled loop by a factor of 4 with run-time trip count [-Rpass=loop-unroll] for(int j = 0; j < N; j++) ^ This removes the first of the two messages. Differential revision: https://reviews.llvm.org/D38725 llvm-svn: 316986	2017-10-31 10:47:46 +00:00
Michael Zuckerman	8e4ae49177	[AVX512] Adding new patterns for extract_subvector of vXi1 extract subvector of vXi1 from vYi1 is poorly supported by LLVM and most of the time end with an assertion. This patch fixes this issue by adding new patterns to the TD file. Reviewers: 1. guyblank 2. igorb 3. zvi 4. ayman 5. craig.topper Differential Revision: https://reviews.llvm.org/D39292 Change-Id: Ideb4d7e946c8d40cfce2920891f2d89fe64c58f8 llvm-svn: 316981	2017-10-31 10:00:19 +00:00
Serguei Katkov	dad01a379f	[CGP] Fix the detection of trivial case for addressing mode The address can be presented as a bitcast of baseReg. In this case it is still trivial but OriginalValue != baseReg. llvm-svn: 316980	2017-10-31 07:01:35 +00:00
Max Kazantsev	0e30db695e	[IRCE][NFC] Rename fields of InductiveRangeCheck Rename `Offset`, `Scale`, `Length` into `Begin`, `Step`, `End` respectively to make naming of similar entities for Ranges and Range Checks more consistent. Differential Revision: https://reviews.llvm.org/D39414 llvm-svn: 316979	2017-10-31 06:19:05 +00:00
Craig Topper	232eac1fb9	[X86] Make AVX512_512_SET0 XMM16-31 lower to 128-bit XOR when AVX512VL is enabled. Use 128-bit VLX instruction when VLX is enabled. Unfortunately, this weakens our ability to do domain fixing when AVX512DQ is not enabled, but it is consistent with our 256-bit behavior. Maybe we should add custom handling to domain fixing to allow EVEX integer XOR/AND/OR/ANDN to switch to VEX encoded fp instructions if the high registers aren't being used? llvm-svn: 316978	2017-10-31 06:01:04 +00:00
Max Kazantsev	ffcc4669b7	[NFC] Get rid of variables used in assert only llvm-svn: 316977	2017-10-31 05:33:58 +00:00
Philip Reames	eaa9b48ba6	[IndVarSimplify] Simplify code using preheader assumption As noted in the nice block comment, the previous code didn't actually handle multi-entry loops correctly, it just assumed SCEV didn't analyze such loops. Given SCEV has comments to the contrary, that seems a bit suspect. More importantly, the pass actually requires loopsimplify form which ensures a loop-preheader is available. Remove the excessive generaility and shorten the code greatly. Note that we do successfully analyze many multi-entry loops, but we do so by converting them to single entry loops. See the added test case. llvm-svn: 316976	2017-10-31 05:16:46 +00:00
Max Kazantsev	568017d4d5	Reapply "[GVN] Prevent LoadPRE from hoisting across instructions that don't pass control flow to successors" This patch fixes the miscompile that happens when PRE hoists loads across guards and other instructions that don't always pass control flow to their successors. PRE is now prohibited to hoist across such instructions because there is no guarantee that the load standing after such instruction is still valid before such instruction. For example, a load from under a guard may be invalid before the guard in the following case: int array[LEN]; ... guard(0 <= index && index < LEN); use(array[index]); Differential Revision: https://reviews.llvm.org/D37460 llvm-svn: 316975	2017-10-31 05:07:56 +00:00
Philip Reames	03929af30c	[SimplifyIndVar] Extract out invariant expression handling Previously, the code returned early from the function when it couldn't find a free expansion, it should be returning from the transform. I don't have a test case, noticed this via inspection. As a follow up, I'm going to revisit the logic in the extract function. I think that essentially the whole helper routine can be replaced with SCEVExpander, but I wanted to do that in a series of separate commits. llvm-svn: 316974	2017-10-31 04:19:06 +00:00
Craig Topper	2941e144f6	[X86] Clang-format some code. NFC llvm-svn: 316973	2017-10-31 02:34:29 +00:00
Shoaib Meenai	b65a04dcd7	[cmake] Make check_linker_flags operate via linker flags `check_linker_flags` currently sets the compiler flags (via `CMAKE_REQUIRED_FLAGS`), and thus implicitly relies on cmake's default behavior of passing the compiler flags to the linker. This breaks when cmake's build rules have been altered to not pollute the link line with compiler flags (which can be desirable for build cleanliness). Instead, set `CMAKE_EXE_LINKER_FLAGS` explicitly and use `CMP0056` to ensure the linker flags are passed along. Additionally, since we're inside a function, we can just alter the variable directly (as the alteration will be limited to the scope of the function) rather than saving and restoring the old value. Differential Revision: https://reviews.llvm.org/D39431 llvm-svn: 316972	2017-10-31 01:30:46 +00:00
Philip Reames	fa7b7b5937	Undo accidental commit These files shouldn't have been submitted in 316967 llvm-svn: 316968	2017-10-31 00:04:09 +00:00
Philip Reames	96a93d11ec	[CGP] Fix crash on i96 bit multiply Issue found by llvm-isel-fuzzer on OSS fuzz, https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=3725 If anyone actually cares about > 64 bit arithmetic, there's a lot more to do in this area. There's a bunch of obviously wrong code in the same function. I don't have the time to fix all of them and am just using this to understand what the workflow for fixing fuzzer cases might look like. llvm-svn: 316967	2017-10-30 23:59:51 +00:00
Simon Pilgrim	937b242735	Fix unused variable warnings. NFCI. llvm-svn: 316964	2017-10-30 22:38:07 +00:00
Simon Pilgrim	21b08adda6	[SelectionDAG] Tidyup computeKnownBits extension/truncation cases. NFCI. We don't need to extend/truncate the Known structure before calling computeKnownBits - it will reset at the start of the function. llvm-svn: 316962	2017-10-30 22:23:57 +00:00
Javed Absar	9171bb47a2	[AArch64]: range loopify frame-lowering llvm-svn: 316960	2017-10-30 22:00:06 +00:00
Rui Ueyama	8ada8b0c9d	Fix -fuse-ld feature detection error. check_cxx_compiler_flag doesn't seem to try to link a program, so the existing code doesn't correctly detect the availability of a given linker. This patch uses check_cxx_source_compiles instead. I confirmed that cmake now reports this error Host compiler does not support '-fuse-ld=foo' for -DLLVM_USE_LINKER=foo. Differential Revision: https://reviews.llvm.org/D39274 llvm-svn: 316958	2017-10-30 21:19:54 +00:00
Yaxun Liu	8af8ab7b8a	InferAddressSpaces: Fix bug about replacing addrspacecast InferAddressSpaces assumes the pointee type of addrspacecast is the same as the operand, which is not always true and causes invalid IR. This bug cause build failure in HCC. This patch fixes that. Differential Revision: https://reviews.llvm.org/D39432 llvm-svn: 316957	2017-10-30 21:19:41 +00:00
Tim Shen	c8c738dfbf	[CMake] Fix linker detection in AddLLVM.cmake Fix linker not being correctly detected when a custom one is specified through LLVM_USE_LINKER CMake variable. In particular, cmake -DCMAKE_BUILD_TYPE=Release -DLLVM_USE_LINKER=gold ../llvm resulted into Linker detection: GNU ld instead of Linker detection: GNU Gold due to the construction not accounting for such variable. It led to the general confusion and prevented setting linker-specific flags inside functions defined in AddLLVM.cmake. Thanks Oleksii Vilchanskyi for the patch! llvm-svn: 316956	2017-10-30 21:12:14 +00:00
Craig Topper	d005fa9f59	[X86] Add AVX512 support to fast isel's X86ChooseCmpOpcode. llvm-svn: 316955	2017-10-30 21:09:19 +00:00
Davide Italiano	8379c78957	[NewGVN] Stop assuming PHI args ordering when looking at phi-of-ops. It's not guaranteed. There's a bug open to sort them in predecessor order, but it won't happen anytime soon. In the meanwhile, passes will have to do an O(#preds) scan. Such is life. llvm-svn: 316953	2017-10-30 20:20:16 +00:00
Stefan Pintilie	f59ef1bafe	Revert "[PowerPC] Try to simplify a Swap if it feeds a Splat" Revert r316478. A test case has failed. Will recommit this change once we find and fix the failure. This reverts commit 7c330fabaedaba3d02c58bc3cc1198896c895f34. llvm-svn: 316952	2017-10-30 19:55:38 +00:00
Daniel Neilson	0ad57a67a0	Create instruction classes for identifying any atomicity of memory intrinsic. (NFC) Summary: For reference, see: http://lists.llvm.org/pipermail/llvm-dev/2017-August/116589.html This patch fleshes out the instruction class hierarchy with respect to atomic and non-atomic memory intrinsics. With this change, the relevant part of the class hierarchy becomes: IntrinsicInst -> MemIntrinsicBase (methods-only class) -> MemIntrinsic (non-atomic intrinsics) -> MemSetInst -> MemTransferInst -> MemCpyInst -> MemMoveInst -> AtomicMemIntrinsic (atomic intrinsics) -> AtomicMemSetInst -> AtomicMemTransferInst -> AtomicMemCpyInst -> AtomicMemMoveInst -> AnyMemIntrinsic (both atomicities) -> AnyMemSetInst -> AnyMemTransferInst -> AnyMemCpyInst -> AnyMemMoveInst This involves some class renaming: ElementUnorderedAtomicMemCpyInst -> AtomicMemCpyInst ElementUnorderedAtomicMemMoveInst -> AtomicMemMoveInst ElementUnorderedAtomicMemSetInst -> AtomicMemSetInst A script for doing this renaming in downstream trees is included below. An example of where the Any* classes should be used in LLVM is when reasoning about the effects of an instruction (ex: aliasing). --- Script for renaming AtomicMem* classes: PREFIXES="[<,([:space:]]" CLASSES="MemIntrinsic\|MemTransferInst\|MemSetInst\|MemMoveInst\|MemCpyInst" SUFFIXES="[;)>,[:space:]]" REGEX="(${PREFIXES})ElementUnorderedAtomic(${CLASSES})(${SUFFIXES})" REGEX2="visitElementUnorderedAtomic(${CLASSES})" FILES=$( grep -E "(${REGEX}\|${REGEX2})" -r . \| tr ':' ' ' \| awk '{print $1}' \| sort \| uniq ) SED_SCRIPT="s~${REGEX}~\1Atomic\2\3~g" SED_SCRIPT2="s~${REGEX2}~visitAtomic\1~g" for f in $FILES; do echo "Processing: $f" sed -i ".bak" -E "${SED_SCRIPT};${SED_SCRIPT2};${EA_SED_SCRIPT};${EA_SED_SCRIPT2}" $f done Reviewers: sanjoy, deadalnix, apilipenko, anna, skatkov, mkazantsev Reviewed By: sanjoy Subscribers: hfinkel, jholewinski, arsenm, sdardis, nhaehnle, JDevlieghere, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D38419 llvm-svn: 316950	2017-10-30 19:51:48 +00:00
Mandeep Singh Grang	67b94b1157	[GVNHoist] Fix non-deterministic sort order of PHIs for identical instructions Summary: This fixes failure in Transforms/GVNHoist/hoist.ll uncovered by D39245. Reviewers: hiraditya, spop, dberlin Reviewed By: dberlin Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39410 llvm-svn: 316949	2017-10-30 19:42:41 +00:00
Simon Pilgrim	c48c10f794	[SelectionDAG] Add VSELECT demanded elts support to computeKnownBits llvm-svn: 316947	2017-10-30 19:31:08 +00:00
Zvi Rackover	7b921274c2	X86 Tests: Update the variable-index permute tests with FP types. NFC. These cases will be addressed in a future update to D39126. llvm-svn: 316946	2017-10-30 19:29:15 +00:00
Simon Pilgrim	bf4b915360	[X86][SSE] Add another computeKnownBits test showing missing VSELECT demandedelts support llvm-svn: 316945	2017-10-30 19:19:58 +00:00
Simon Pilgrim	e585a22b5f	[SelectionDAG] Add VSELECT support to computeKnownBits llvm-svn: 316944	2017-10-30 19:08:21 +00:00
Simon Pilgrim	30187d57e4	[X86][SSE] computeKnownBits tests showing missing VSELECT demandedelts support llvm-svn: 316940	2017-10-30 18:48:31 +00:00
Simon Pilgrim	025f958bf2	[X86][AVX512] Cleanup scheduler tests - split GENERIC and SKX targets llvm-svn: 316938	2017-10-30 18:37:27 +00:00
Simon Pilgrim	21de22c31b	[SelectionDAG] Add SELECT demanded elts support to ComputeNumSignBits llvm-svn: 316933	2017-10-30 17:53:51 +00:00
Simon Pilgrim	8ec7f73005	[X86][SSE] ComputeNumSignBits tests showing missing VSELECT demandedelts support llvm-svn: 316932	2017-10-30 17:46:50 +00:00

1 2 3 4 5 ...

156175 Commits