llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Sanjay Patel	b11050c641	add tests to show missing vector transforms llvm-svn: 271842	2016-06-05 17:32:58 +00:00
Sanjay Patel	b1c8f8ce55	regenerate checks llvm-svn: 271841	2016-06-05 17:29:45 +00:00
Sanjay Patel	605061c514	update test to use FileCheck llvm-svn: 271840	2016-06-05 17:13:09 +00:00
Sanjay Patel	24556cdcec	update test to use FileCheck llvm-svn: 271838	2016-06-05 16:41:20 +00:00
Sanjay Patel	7c1912623f	update test to FileCheck llvm-svn: 271837	2016-06-05 16:29:15 +00:00
Simon Pilgrim	585db6f6c3	[X86][XOP] Added VPERMIL2PD/VPERMIL2PS raw mask decoding for target shuffle combines llvm-svn: 271834	2016-06-05 15:21:30 +00:00
Simon Pilgrim	94a9893eda	[X86][XOP] Added VPERMIL2PD/VPERMIL2PS as a target shuffle type llvm-svn: 271831	2016-06-05 15:01:45 +00:00
Craig Topper	2249826460	[AVX512] Add support for lowering PALIGNR for v64i8. Could do this for other types to, but this is what's needed to replace the instrinsic with native IR in clang. llvm-svn: 271828	2016-06-05 06:29:12 +00:00
Craig Topper	47e5abb616	[AVX512] Split command lines and regenerate a test to prepare for a future commit. llvm-svn: 271827	2016-06-05 06:29:08 +00:00
Craig Topper	d8c697aad5	[AVX512] Fix PANDN combining for v4i32/v8i32 when VLX is enabled. v4i32/v8i32 ANDs aren't promoted to v2i64/v4i64 when VLX is enabled. llvm-svn: 271826	2016-06-05 05:35:11 +00:00
Xinliang David Li	a585afd293	[PM] Port GCOVProfiler pass to the new pass manager llvm-svn: 271823	2016-06-05 05:12:23 +00:00
David Majnemer	3c5fa38a11	[SimplifyCFG] Don't kill empty cleanuppads with multiple uses A basic block could contain: %cp = cleanuppad [] cleanupret from %cp unwind to caller This basic block is empty and is thus a candidate for removal. However, there can be other uses of %cp outside of this basic block. This is only possible in unreachable blocks. Make our transform more correct by checking that the pad has a single user before removing the BB. This fixes PR28005. llvm-svn: 271816	2016-06-04 23:50:03 +00:00
Sanjay Patel	dad6c47e6b	[InstCombine] allow vector constants for cast+icmp fold This is step 1 of unknown towards fixing PR28001: https://llvm.org/bugs/show_bug.cgi?id=28001 llvm-svn: 271810	2016-06-04 22:04:05 +00:00
Simon Pilgrim	2edc73fed4	[X86][XOP] Added VPERMIL2PD/VPERMIL2PS shuffle mask comment decoding llvm-svn: 271809	2016-06-04 21:44:28 +00:00
Sanjay Patel	a7b7945972	[InstCombine] add test for missing vector optimization llvm-svn: 271808	2016-06-04 21:41:25 +00:00
Sanjay Patel	b44cf32fc9	[InstCombine] add test for missing vector optimization llvm-svn: 271806	2016-06-04 21:20:03 +00:00
Sanjay Patel	9fa8579461	[InstCombine] minimize test case and use FileCheck llvm-svn: 271805	2016-06-04 21:04:59 +00:00
Simon Pilgrim	dde840bfa3	[Analysis] Enabled BITREVERSE as a vectorizable intrinsic Allows XOP to vectorize BITREVERSE - other targets will follow as their costmodels improve. llvm-svn: 271803	2016-06-04 20:21:07 +00:00
Saleem Abdulrasool	f999318a81	X86: enable TLS on Windows itanium Windows itanium is nearly identical to windows-msvc (MS ABI for C, itanium for C++). Enable the TLS support for the target similar to the MSVC model. llvm-svn: 271797	2016-06-04 18:27:22 +00:00
Simon Pilgrim	73aea916e6	[X86][AVX2] Fix v16i16 SHL lowering (PR27730) The AVX2 v16i16 shift lowering works by unpacking to 2 x v8i32, performing the shift and then truncating the result. The unpacking is used to place the values in the upper 16-bits so that we can correctly sign-extend for SRA shifts. Unfortunately we weren't ensuring that the lower 16-bits were zero to ensure that SHL correctly shifts in zero bits. llvm-svn: 271796	2016-06-04 16:45:33 +00:00
Simon Pilgrim	c995ee7b75	[InstCombine][MMX] Extend SimplifyDemandedUseBits MOVMSK support to MMX Add the MMX implementation to the SimplifyDemandedUseBits SSE/AVX MOVMSK support added in D19614 Requires a minor tweak as llvm.x86.mmx.pmovmskb takes a x86_mmx argument - so we have to be explicit about the implied v8i8 vector type. llvm-svn: 271789	2016-06-04 13:42:46 +00:00
Chandler Carruth	29e3ce3cdf	[sancov] Revert r271695 which broke all of the PPC bots. Original commit message: [sancov] Run sancov tests on more platforms The only tests that need to be run on Linux are the ones that use C++ demangling. I'm assuming they will fail on Mac, since __cxa_demangle there won't handle the non-double-underscore prefixed mangled names. llvm-svn: 271763	2016-06-04 03:28:27 +00:00
Chandler Carruth	85f0f82387	[llvm-profdata] Revert r271709 and the 3 subsequent commits - the code and/or tests aren't working on Windows currently. There seems to be some problem with quoting the file paths. I don't understand the test structure here or the code well enough to try to come up with a way to correctly handle paths with back slashes in them, and this has caused the Windows builds to be failing for 7 hours now, so I'm reverting the whole thing to bring them back to life. Sorry for the disruption, but a couple of these were bug fixes anyways that can be folded into a fresh commit. Reverts the following patches: r271756: Clean up the way we create the input filenames buffer (NFC) r271748: Fix use-after-free from discarded MemoryBuffer (NFC) r271710: Fix option description (NFC) r271709: Add option to ingest filepaths from a file llvm-svn: 271760	2016-06-04 03:08:01 +00:00
Adrian Prantl	718ab049bc	Testcase cleanup: Remove a redundant test input. llvm-svn: 271753	2016-06-04 00:10:17 +00:00
Matthias Braun	64003711e2	MIR: Support MachineMemOperands without associated value This is allowed (though used rarely) and useful to keep your tests short. llvm-svn: 271752	2016-06-04 00:06:31 +00:00
Easwaran Raman	60d682daa9	Reapply r271728 after adding move cobstructor for ProfileSummaryInfo llvm-svn: 271745	2016-06-03 22:54:26 +00:00
Derek Bruening	a636185af1	[esan\|wset] Optionally assume intra-cache-line accesses Summary: Adds an option -esan-assume-intra-cache-line which causes esan to assume that a single memory access touches just one cache line, even if it is not aligned, for better performance at a potential accuracy cost. Experiments show that the performance difference can be 2x or more, and accuracy loss is typically negligible, so we turn this on by default. This currently applies just to the working set tool. Reviewers: aizatsky Subscribers: vitalybuka, zhaoqin, kcc, eugenis, llvm-commits Differential Revision: http://reviews.llvm.org/D20978 llvm-svn: 271743	2016-06-03 22:29:52 +00:00
Easwaran Raman	0670f91a65	Revert r271728 as it breaks Windows build llvm-svn: 271738	2016-06-03 21:14:26 +00:00
Rui Ueyama	05c45592e0	pdbdump: print out TPI hashes. Differential Revision: http://reviews.llvm.org/D20945 llvm-svn: 271736	2016-06-03 20:48:51 +00:00
Easwaran Raman	553eb9ed8a	Analysis pass to access profile summary info Differential Revision: http://reviews.llvm.org/D20648 llvm-svn: 271728	2016-06-03 20:37:19 +00:00
Reid Kleckner	eb745c2e9c	[Symbolize] Check if the PE file has a PDB and emit an error if we can't load it Summary: Previously we would try to load PDBs for every PE executable we tried to symbolize. If that failed, we would fall back to DWARF. If there wasn't any DWARF, we'd print mostly useless symbol information using the export table. With this change, we only try to load PDBs for executables that claim to have them. If that fails, we can now print an error rather than falling back silently. This should make it a lot easier to diagnose and fix common symbolization issues, such as not having DIA or not having a PDB. Reviewers: zturner, eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D20982 llvm-svn: 271725	2016-06-03 20:25:09 +00:00
Chad Rosier	5059358611	[AArch64] Move tests from r271677 to a more appropriately named file. NFC. llvm-svn: 271718	2016-06-03 20:11:09 +00:00
Chad Rosier	a775bf75c7	[AArch64] Spot SBFX-compatible code expressed with sign_extend. This is very similar to r271677, but for extracts from i32 with the SIGN_EXTEND acting on a arithmetic shift. llvm-svn: 271717	2016-06-03 20:05:49 +00:00
Vedant Kumar	6bbf734940	[llvm-profdata] Add option to ingest filepaths from a file Differential Revision: http://reviews.llvm.org/D20980 llvm-svn: 271709	2016-06-03 19:05:20 +00:00
Derek Schuff	26aab3499f	[WebAssembly] Emit type signatures for declared functions Under emscripten, C code can take the address of a function implemented in Javascript (which is exposed via an import in wasm). Because imports do not have linear memory address in wasm, we need to generate a thunk to be the target of the indirect call; it call the import directly. To make this possible, LLVM needs to emit the type signatures for these functions, because they may not be called directly or referred to other than where the address is taken. This uses s new .s directive (.functype) which specifies the signature. Differential Revision: http://reviews.llvm.org/D20891 Re-apply r271599 but instead of bailing with an error when a declared function has multiple returns, replace it with a pointer argument. Also add the test case I forgot to 'git add' last time around. llvm-svn: 271703	2016-06-03 18:34:36 +00:00
Reid Kleckner	abf01ac457	[sancov] Disable these tests if there is no X86 backend Copied from test/CodeGen/X86 llvm-svn: 271698	2016-06-03 18:07:32 +00:00
Reid Kleckner	b9858d331b	[sancov] Run sancov tests on more platforms The only tests that need to be run on Linux are the ones that use C++ demangling. I'm assuming they will fail on Mac, since __cxa_demangle there won't handle the non-double-underscore prefixed mangled names. llvm-svn: 271695	2016-06-03 17:51:42 +00:00
Chris Bieneman	73cc17e63a	[yaml2obj] Sort MachO LinkEdit write operations based on offset This re-applies r271611, and hopefully the bots won't break this time. Although ld64 always outputs linkedit data in the same order, it isn't actually required to. This change makes yaml2obj resilient if the offsets are in arbitrary order. llvm-svn: 271687	2016-06-03 16:58:05 +00:00
Reid Kleckner	14799a2f9b	[codeview] Add basic record type translation This only translates data members for now. Translating overloaded methods is complicated, so I stopped short of doing that. Reviewers: aaboud Differential Revision: http://reviews.llvm.org/D20924 llvm-svn: 271680	2016-06-03 15:58:20 +00:00
Sjoerd Meijer	3d4cf26cf6	Code size optimisation: do not inline memcpy if this expansion results in more instructions than the libary call. Differential Revision: http://reviews.llvm.org/D20958 llvm-svn: 271678	2016-06-03 15:38:55 +00:00
Chad Rosier	1fa884d6ab	[AArch64] Spot SBFX-compatbile code expressed with sign_extend_inreg. We were assuming all SBFX-like operations would have the shl/asr form, but often when the field being extracted is an i8 or i16, we end up with a SIGN_EXTEND_INREG acting on a shift instead. This is a port of r213754 from ARM to AArch64. llvm-svn: 271677	2016-06-03 15:00:09 +00:00
Sanjay Patel	500fe712ac	[InstCombine] look through bitcasts to find selects There was concern that creating bitcasts for the simpler potential select pattern: define <2 x i64> @vecBitcastOp1(<4 x i1> %cmp, <2 x i64> %a) { %a2 = add <2 x i64> %a, %a %sext = sext <4 x i1> %cmp to <4 x i32> %bc = bitcast <4 x i32> %sext to <2 x i64> %and = and <2 x i64> %a2, %bc ret <2 x i64> %and } might lead to worse code for some targets, so this patch is matching the larger patterns seen in the test cases. The motivating example for this patch is this IR produced via SSE intrinsics in C: define <2 x i64> @gibson(<2 x i64> %a, <2 x i64> %b) { %t0 = bitcast <2 x i64> %a to <4 x i32> %t1 = bitcast <2 x i64> %b to <4 x i32> %cmp = icmp sgt <4 x i32> %t0, %t1 %sext = sext <4 x i1> %cmp to <4 x i32> %t2 = bitcast <4 x i32> %sext to <2 x i64> %and = and <2 x i64> %t2, %a %neg = xor <4 x i32> %sext, <i32 -1, i32 -1, i32 -1, i32 -1> %neg2 = bitcast <4 x i32> %neg to <2 x i64> %and2 = and <2 x i64> %neg2, %b %or = or <2 x i64> %and, %and2 ret <2 x i64> %or } For an AVX target, this is currently: vpcmpgtd %xmm1, %xmm0, %xmm2 vpand %xmm0, %xmm2, %xmm0 vpandn %xmm1, %xmm2, %xmm1 vpor %xmm1, %xmm0, %xmm0 retq With this patch, it becomes: vpmaxsd %xmm1, %xmm0, %xmm0 Differential Revision: http://reviews.llvm.org/D20774 llvm-svn: 271676	2016-06-03 14:42:07 +00:00
Artem Tamazov	baaf0740cf	[test/AMDGPU] Square-braced-syntax for registers: add macro test/example. Test added as per discussion in http://reviews.llvm.org/D20588. The macro is just a demonstration, useless in practice. Coding style fixes. Differential Revision: http://reviews.llvm.org/D20797 llvm-svn: 271675	2016-06-03 14:41:17 +00:00
Zachary Turner	e5a788e902	[pdb] Add string table offsets to check output. llvm-svn: 271674	2016-06-03 14:22:46 +00:00
Simon Pilgrim	d5ca72a493	[X86][AVX512] Fixed 512-bit vector nontemporal load alignment llvm-svn: 271673	2016-06-03 14:12:43 +00:00
Sjoerd Meijer	d7dd48669c	RAS extensions are part of ARMv8.2-A. This change enables them by introducing a new instruction to ARM and AArch64 targets and several system registers. Patch by: Roger Ferrer Ibanez and Oliver Stannard Differential Revision: http://reviews.llvm.org/D20282 llvm-svn: 271670	2016-06-03 14:03:27 +00:00
Simon Pilgrim	096c6479fc	[X86][AVX512] Added 512-bit vector nontemporal load tests llvm-svn: 271668	2016-06-03 13:42:49 +00:00
Sam Kolton	fee452853f	[AMDGPU] Assembler: More tests for SDWA instructions. Fix for SDWA float modifiers. Summary: Depends on D20625 Reviewers: tstellarAMD, vpykhtin, artem.tamazov Subscribers: arsenm, kzhuravl Differential Revision: http://reviews.llvm.org/D20674 llvm-svn: 271662	2016-06-03 11:43:09 +00:00
Simon Pilgrim	42c22dd5cc	[X86][SSE] Added nontemporal load tests These currently all lower to regular loads, generic nontemporal load support will be added in a future patch llvm-svn: 271659	2016-06-03 11:00:55 +00:00
Simon Pilgrim	9ac3c4e1c9	[X86] Added nontemporal scalar store tests llvm-svn: 271656	2016-06-03 10:30:54 +00:00

1 2 3 4 5 ...

37019 Commits