llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 03:02:36 +01:00

Author	SHA1	Message	Date
Quentin Colombet	b031d01506	[IRTranslator] Add comments to explain the ordering of the switch. NFC. Group arithmetic operations, bitwise operations, and branch operations. llvm-svn: 276305	2016-07-21 17:26:41 +00:00
Sanjay Patel	4385ad44ac	[InstCombine] break up visitICmpInstWithInstAndIntCst(); NFCI Making smaller pieces out of some of these ~1000 line functions should make it easier to incrementally upgrade them to handle vector types. llvm-svn: 276304	2016-07-21 17:15:49 +00:00
Renato Golin	6248146d05	Adding RELEASE_TESTERS.TXT llvm-svn: 276302	2016-07-21 16:46:44 +00:00
Konstantin Zhuravlyov	b16afe7359	[AMDGPU] Emit read-only data to .rodata for hsa Differential Revision: https://reviews.llvm.org/D22538 llvm-svn: 276298	2016-07-21 15:59:23 +00:00
Quentin Colombet	3a4563a1e2	[IRTranslator] Add G_AND opcode. This commit adds a generic AND opcode to global-isel. llvm-svn: 276297	2016-07-21 15:50:42 +00:00
Konstantin Zhuravlyov	0afe58e18c	AMDGPU/SI: Add support for R_AMDGPU_ABS32 Differential Revision: https://reviews.llvm.org/D21646 llvm-svn: 276294	2016-07-21 15:29:19 +00:00
Geoff Berry	97900e2647	[AArch64] Load/store opt: Don't count transient instructions towards search limits. Summary: This change also changes findMatchingInsn and findMatchingUpdateInsnForward to take DBG_VALUE opcodes into account when tracking register defs and uses, which could potentially inhibit these optimizations in the presence of debug information. Reviewers: mcrosier Subscribers: aemerson, rengolin, mcrosier, llvm-commits Differential Revision: https://reviews.llvm.org/D22582 llvm-svn: 276293	2016-07-21 15:20:25 +00:00
Benjamin Kramer	6d65c1193c	Weaken ThreadSafeRefCountedBase atomics. Doesn't make a difference on x86, but avoids memory barriers on weakly-ordered archs like PowerPC and ARM. llvm-svn: 276291	2016-07-21 15:06:50 +00:00
Simon Pilgrim	4cbc84cfa7	[X86][SSE] Allow folding of store/zext with PEXTRW of 0'th element Under normal circumstances we prefer the higher performance MOVD to extract the 0'th element of a v8i16 vector instead of PEXTRW. But as detailed on PR27265, this prevents the SSE41 implementation of PEXTRW from folding the store of the 0'th element. Additionally it prevents us from making use of the fact that the (SSE2) reg-reg version of PEXTRW implicitly zero-extends the i16 element to the i32/i64 destination register. This patch only preferentially lowers to MOVD if we will not be zero-extending the extracted i16, nor prevent a store from being folded (on SSSE41). Fix for PR27265. Differential Revision: https://reviews.llvm.org/D22509 llvm-svn: 276289	2016-07-21 14:54:17 +00:00
Simon Pilgrim	50c1eac414	Fixed line endings llvm-svn: 276287	2016-07-21 14:36:41 +00:00
Simon Pilgrim	7d25039e14	[X86][SSE] Pull out duplicate EXTRW lowering code. NFCI. As requested on D22509, I've pulled out the v8i16 extraction lowering as the SSE41 and pre-SSE41 implementations are effectively the same. llvm-svn: 276285	2016-07-21 14:30:17 +00:00
Benjamin Kramer	a4f804055a	[profdata] Remove constructor that MSVC 2013 pretends to not understand. No functionality change intended. llvm-svn: 276284	2016-07-21 14:29:11 +00:00
Simon Pilgrim	9b2c75bbd5	[X86][AVX] Added support for lowering to VBROADCASTF128/VBROADCASTI128 As reported on PR26235, we don't currently make use of the VBROADCASTF128/VBROADCASTI128 instructions (or the AVX512 equivalents) to load+splat a 128-bit vector to both lanes of a 256-bit vector. This patch enables lowering from subvector insertion/concatenation patterns and auto-upgrades the llvm.x86.avx.vbroadcastf128.pd.256 / llvm.x86.avx.vbroadcastf128.ps.256 intrinsics to match. We could possibly investigate using VBROADCASTF128/VBROADCASTI128 to load repeated constants as well (similar to how we already do for scalar broadcasts). Differential Revision: https://reviews.llvm.org/D22460 llvm-svn: 276281	2016-07-21 14:10:54 +00:00
Benjamin Kramer	313cc4b45f	[DemandedBits] Reduce number of duplicated DenseMap lookups. No functionality change intended. llvm-svn: 276278	2016-07-21 13:37:55 +00:00
Benjamin Kramer	38c61d7923	[DenseMap] Add a C++17-style try_emplace method. This provides an elegant pattern to solve the "construct if not in map already" problem we have many times in LLVM. Without try_emplace we either have to rely on a sentinel value (nullptr) or do two lookups. llvm-svn: 276277	2016-07-21 13:37:53 +00:00
Benjamin Kramer	750272d02a	Rename StringMap::emplace_second to try_emplace. Coincidentally this function maps to the C++17 try_emplace. Rename it for consistentcy with C++17 std::map. NFC. llvm-svn: 276276	2016-07-21 13:37:48 +00:00
Sam Kolton	7fedc1cd73	[AMDGPU] Some code cleaning in SIRegisterInfo.td Reviewers: tstellarAMD, vpykhtin Subscribers: arsenm, kzhuravl Differential Revision: https://reviews.llvm.org/D22620 llvm-svn: 276274	2016-07-21 13:29:57 +00:00
Marina Yatsina	f70575e4b9	ExecutionDepsFix - Fix bug in clearance calculation The clearance calculation did not take into account registers defined as outputs or clobbers in inline assembly machine instructions because these register defs are implicit. Differential Revision: http://reviews.llvm.org/D22580 llvm-svn: 276266	2016-07-21 12:37:07 +00:00
Benjamin Kramer	a76c1a5347	[GCOV] Remove a layer of indirection. StringMap is designed to hold large values. No functionality change intended. llvm-svn: 276265	2016-07-21 12:06:31 +00:00
Renato Golin	d1ec7af874	[docs] Update release docs llvm-svn: 276264	2016-07-21 12:00:50 +00:00
Matt Arsenault	57e3208d74	AMDGPU: Fix phis from blocks split due to register indexing llvm-svn: 276257	2016-07-21 09:40:57 +00:00
David Majnemer	933021cc63	[GVNHoist] Preserve optimization hints which agree If we have optimization hints with agree with each other along different paths, preserve them. llvm-svn: 276248	2016-07-21 07:16:26 +00:00
David Majnemer	b0412efc3f	[GVNHoist] Don't wrongly preserve TBAA We hoisted loads/stores without taking into account which can cause miscompiles. llvm-svn: 276240	2016-07-21 05:59:53 +00:00
David Majnemer	96b4439f3b	[MergedLoadStoreMotion] Remove out of date comment llvm-svn: 276239	2016-07-21 05:59:51 +00:00
Amaury Sechet	e2c46ac53f	Add missing import to fix the build llvm-svn: 276237	2016-07-21 04:31:38 +00:00
Amaury Sechet	e639831d0c	Expose AttributeSetNode, use it to provide aggregate getter for attribute in the C API. Summary: See D19181 for context. Reviewers: whitequark, Wallbraker, jyknight, echristo, bkramer, void Subscribers: mehdi_amini Differential Revision: http://reviews.llvm.org/D21265 llvm-svn: 276236	2016-07-21 04:25:06 +00:00
Matthias Braun	65b09f0c51	IPRA: Fix RegMask calculation for alias registers This patch fixes a very subtle bug in regmask calculation. Thanks to zan jyu Wong <zyfwong@gmail.com> for bringing this to notice. For example if CL is only clobbered than CH should not be marked clobbered but CX, RCX and ECX should be mark clobbered. Previously for each modified register all of its aliases are marked clobbered by markRegClobbred() in RegUsageInfoCollector.cpp but that is wrong because when CL is clobbered then MRI::isPhysRegModified() will return true for CL, CX, ECX, RCX which is correct behavior but then for CX, EXC, RCX we mark CH also clobbered as CH is aliased to CX,ECX,RCX so markRegClobbred() is not required because isPhysRegModified already take cares of proper aliasing register. A very simple test case has been added to verify this change. Please find relevant bug report here : http://llvm.org/PR28567 Patch by Vivek Pandya <vivekvpandya@gmail.com> Differential Revision: https://reviews.llvm.org/D22400 llvm-svn: 276235	2016-07-21 03:50:39 +00:00
Adam Nemet	a992a8cd3f	[OptDiag] Missed these when making the IR Value a const pointer llvm-svn: 276224	2016-07-21 01:11:12 +00:00
Adam Nemet	377d292ea8	[OptDiag,LV] Add hotness attribute to applied-optimization remarks Test coverage is provided by modifying the function in the FP-math testcase that we are allowed to vectorize. llvm-svn: 276223	2016-07-21 01:07:13 +00:00
Matthias Braun	841fb2d97d	X86InstrInfo: No need for liveness analysis in classifyLEAReg() classifyLEAReg() deals with switching operands from 32bit to 64bit in order to use a LEA64_32 instruction (for three address code goodness). It currently performs a liveness analysis to determine the kill/undef flag for the newly added operand. This should not be necessary: - If the previous operand had a kill flag, then the 32bit part of the register gets killed, this will kill the super register as well. - If the previous operand had an undef flag then we didn't care what value we read, just use the same flag on the new operand. (No matter what an operand with an undef flag won't affect liveness) This makes the code independent of the presence of kill flags because it avoids a call to MachineBasicBlock::computeRegisterLiveness(). Differential Revision: http://reviews.llvm.org/D22283 llvm-svn: 276222	2016-07-21 00:33:38 +00:00
Sanjay Patel	8755396e8d	[InstCombine] LogicOpc (zext X), C --> zext (LogicOpc X, C) (PR28476) The benefits of this change include: 1. Remove DeMorgan-matching code that was added specifically to work-around the missing transform in http://reviews.llvm.org/rL248634. 2. Makes the DeMorgan transform work for vectors too. 3. Fix PR28476: https://llvm.org/bugs/show_bug.cgi?id=28476 Extending this transform to other casts and other associative operators may be useful too. See https://reviews.llvm.org/D22421 for a prerequisite for doing that though. Differential Revision: https://reviews.llvm.org/D22271 llvm-svn: 276221	2016-07-21 00:24:18 +00:00
Adam Nemet	2a94ac8820	[OptDiag,LV] Add hotness attribute to the derived analysis remarks This includes FPCompute and Aliasing. Testcase is based on no_fpmath.ll. llvm-svn: 276211	2016-07-20 23:50:32 +00:00
Sanjay Patel	e9a0321168	[InstSimplify][InstCombine] don't crash when folding vector selects of icmp Differential Revision: https://reviews.llvm.org/D22602 llvm-svn: 276209	2016-07-20 23:40:01 +00:00
George Burgess IV	6ad88d4c09	Make help text more consistent. NFC. llvm-svn: 276205	2016-07-20 23:14:29 +00:00
Tim Northover	e0ea323e71	GlobalISel: Remove explicit enumerator values from .def file. They were all auto-incremented from 0 anyway, and I'm getting really annoying conflicts and runtime failures when different people add more for GlobalISel (and even when I'm refactoring my own patches). NFC. llvm-svn: 276204	2016-07-20 22:58:01 +00:00
Xinliang David Li	4eb4c0a05c	Fix test failure on Win llvm-svn: 276202	2016-07-20 22:53:39 +00:00
George Burgess IV	70f242ff50	[CFLAA] Add offset tracking in CFLGraph. (Also, refactor our constexpr handling to be less insane). This patch lets us track field offsets in the CFL Graph, which is the first step to making CFLAA field/offset sensitive. Woohoo! Note that this patch shouldn't visibly change our behavior (since we make no use of the offsets we're now tracking), so we can't quite add tests for this yet. Patch by Jia Chen. Differential Revision: https://reviews.llvm.org/D22598 llvm-svn: 276201	2016-07-20 22:53:30 +00:00
Vedant Kumar	5b19d0d059	[utils] Add script to check for code coverage regressions Differential Revision: https://reviews.llvm.org/D22544 llvm-svn: 276199	2016-07-20 22:44:16 +00:00
Xinliang David Li	40622c593b	Reapply r276185 Fix the test case that should not depend on dir iteration order. llvm-svn: 276197	2016-07-20 22:24:52 +00:00
Justin Lebar	63ae2eb95c	[NVPTX] Enable the load-store vectorizer on nvptx. Reviewers: tra Subscribers: jholewinski, arsenm, asbirlea Differential Revision: https://reviews.llvm.org/D22592 llvm-svn: 276196	2016-07-20 22:11:36 +00:00
Xinliang David Li	31d2c7e14d	Revert r276185 -- build bot failure llvm-svn: 276194	2016-07-20 21:50:38 +00:00
Geoff Berry	1bd60ccae6	[AArch64] Register AArch64LoadStoreOptimizer so it can be run by llc -run-pass. NFCI. llvm-svn: 276193	2016-07-20 21:45:58 +00:00
Adam Nemet	46bb1fa09e	[OptDiag,LV] Add hotness attribute to analysis remarks The earlier change added hotness attribute to missed-optimization remarks. This follows up with the analysis remarks (the ones explaining the reason for the missed optimization). llvm-svn: 276192	2016-07-20 21:44:26 +00:00
Adam Nemet	c8216345a3	[OptDiag] Take the IR Value as a const pointer This helps because LoopAccessReport is passed around as a const reference and we derive the basic block passed as the Value parameter from the instruction in LoopAccessReport. llvm-svn: 276191	2016-07-20 21:44:22 +00:00
Adam Nemet	45603dc4d7	[OptDiag] Wrap a long line llvm-svn: 276190	2016-07-20 21:44:18 +00:00
Artem Belevich	5fd5640c49	[NVPTX] Renamed NVPTXLowerKernelArgs -> NVPTXLowerArgs. NFC. After r276153 the pass applies to both kernels and regular functions. Differential Revision: https://reviews.llvm.org/D22583 llvm-svn: 276189	2016-07-20 21:44:07 +00:00
Xinliang David Li	a600368d3e	[Profile] support directory reading in profile merging Differential Revision: http://reviews.llvm.org/D22560 llvm-svn: 276185	2016-07-20 21:31:29 +00:00
Tim Northover	8482c3127e	GlobalISel: implement Legalization querying framework. This adds an (incomplete, inefficient) framework for deciding what to do with some operation on a given type. llvm-svn: 276184	2016-07-20 21:13:29 +00:00
Ahmed Bougacha	a7477c993d	[AArch64][FastISel] Select -O0 legal cmpxchg. At -O0, cmpxchg survives AtomicExpand: it's mostly straightforward to select it in fast-isel, and let the pseudo be expanded later. extractvalues on the result are the tricky part: the generic logic only works for legal types (and it would be painful to make it support illegal types), so we can only support i32/i64 cmpxchg. llvm-svn: 276183	2016-07-20 21:12:32 +00:00
Ahmed Bougacha	458f98b251	[AArch64][FastISel] Select atomic stores into STLR. llvm-svn: 276182	2016-07-20 21:12:27 +00:00

1 2 3 4 5 ...

135308 Commits