llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2025-02-01 05:01:59 +01:00

Author	SHA1	Message	Date
Ulrich Weigand	461d4b06db	[SystemZ] Fix truncstore + bswap codegen bug SystemZTargetLowering::combineSTORE contains code to transform a combination of STORE + BSWAP into a STRV type instruction. This transformation is correct for regular stores, but not for truncating stores. The routine neglected to check for that case. Fixes a miscompilation of llvm-objcopy with clang, which caused test suite failures in the SystemZ multistage build bot. llvm-svn: 313669	2017-09-19 20:50:05 +00:00
Saleem Abdulrasool	57f42e98ce	Revert "ExecutionEngine: add R_AARCH64_ABS{16,32}" This reverts commit SVN r313654. Seems that it is triggering an assertion on Windows specifically. Revert until I can build on Windows and look into what is happening there. llvm-svn: 313668	2017-09-19 20:35:25 +00:00
Jake Ehrlich	362ec74236	Revert "[llvm-objcopy] Add support for .dynamic, .dynsym, and .dynstr" This reverts commit r313663. Broken because overlapping-sections was reverted. llvm-svn: 313665	2017-09-19 20:00:04 +00:00
Jake Ehrlich	4b0a637bb0	Revert "[llvm-objcopy] Add support for nested and overlapping segments" This reverts commit r313656. Appears to be broken on Windows. llvm-svn: 313664	2017-09-19 19:52:09 +00:00
Jake Ehrlich	8e38021f91	[llvm-objcopy] Add support for .dynamic, .dynsym, and .dynstr This change adds support for sections involved in dynamic loading such as SHT_DYNAMIC, SHT_DYNSYM, and allocated string tables. The two added binaries used for tests can be downloaded [[ https://drive.google.com/file/d/0B3gtIAmiMwZXOXE3T0RobFg4ZTg/view?usp=sharing \| here ]] and [[ https://drive.google.com/file/d/0B3gtIAmiMwZXTFJSQUJZMGxNSXc/view?usp=sharing \| here ]] Differential Revision: https://reviews.llvm.org/D36560 llvm-svn: 313663	2017-09-19 19:21:09 +00:00
David Blaikie	5c7951c24c	Fix test to not depend on another subdirectories Input directory Inputs should be placed local to the test (or possibly in a common parent? I think we do that in some places - but the only common parent between these two directories is 'test' which seems a bit overly broad). llvm-svn: 313662	2017-09-19 19:20:08 +00:00
Jake Ehrlich	de99ea8a55	[llvm-objcopy] Add test to check that architecture specific values are not used on wrong architecture. This change adds a test that checks the an error is produced when a hexagon specific reserved section index is used but e_machine is not EM_HEXAGON. Differential Revision: https://reviews.llvm.org/D38017 llvm-svn: 313661	2017-09-19 19:05:15 +00:00
Dehao Chen	f1f34755de	Handle profile mismatch correctly for SamplePGO. Summary: Fix the bug when promoted call return type mismatches with the promoted function, we should not try to inline it. Otherwise it may lead to compiler crash. Reviewers: davidxl, tejohnson, eraman Reviewed By: tejohnson Subscribers: llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D38018 llvm-svn: 313658	2017-09-19 18:26:54 +00:00
Reid Kleckner	dc40d6f36d	Re-land "Fix Bug 30978 by emitting cv file checksums." This reverts r313431 and brings back r313374 with a fix to write checksums as binary data and not ASCII hex strings. llvm-svn: 313657	2017-09-19 18:14:45 +00:00
Jake Ehrlich	3916a38a25	[llvm-objcopy] Add support for nested and overlapping segments This change adds support for nested and even overlapping segments. This means that PT_PHDR, PT_GNU_RELRO, PT_TLS, and PT_DYNAMIC can be supported properly. Differential Revision: https://reviews.llvm.org/D36558 llvm-svn: 313656	2017-09-19 18:14:03 +00:00
Saleem Abdulrasool	fac72de1b5	ExecutionEngine: add R_AARCH64_ABS{16,32} Add support for the R_AARCH64_ABS{16,32} relocations in the execution engine. This is primarily used for DWARF debug information relocations and needed by the LLVM JIT to support JITing for lldb. Patch by Alex Langford! llvm-svn: 313654	2017-09-19 18:00:50 +00:00
Reid Kleckner	bca8c94590	Re-land r313400 "[DebugInfo] Insert DW_OP_deref when spilling indirect DBG_VALUEs" I forgot to zero out the BitVector when reusing it between UserValues. Later uses of the same location number for a different UserValue would falsely indicate that they were spilled. Usually this would lead to incorrect debug info, but in some cases they would indicate something nonsensical like a memory location based on a vector register (Q8 on ARM). llvm-svn: 313640	2017-09-19 16:32:15 +00:00
Tony Jiang	74078abf1a	[PowerPC Peephole] Constants into a join add, use ADDI over LI/ADD. Two blocks prior to the join each perform an li and the the join block has an add using the initialized register. Optimize each predecessor block to instead use addi and delete the li's and add. Differential Revision: https://reviews.llvm.org/D36734 llvm-svn: 313639	2017-09-19 16:14:37 +00:00
Evandro Menezes	6c8d0d705e	[AArch64] Extend tests of loads and stores of register pairs Include instances of FP register pairs. llvm-svn: 313638	2017-09-19 15:46:35 +00:00
Tony Jiang	74b126e551	[Power9] Add missing Power9 instructions. The following 8 instructions are implemented in this patch. addpcis(subpcis, lnia), darn, maddhd, maddhdu, maddld, setb llvm-svn: 313636	2017-09-19 15:22:36 +00:00
Daniel Sanders	ca6a043f07	[globalisel] Add a G_BSWAP instruction and support bswap using it. llvm-svn: 313633	2017-09-19 14:25:15 +00:00
Simon Pilgrim	c0738c4adc	[X86][SSE] Add 'redundant pand' test case from PR34620 llvm-svn: 313632	2017-09-19 14:02:16 +00:00
Sanjay Patel	42fe16b8fd	[x86] regenerate checks; NFC llvm-svn: 313631	2017-09-19 13:43:09 +00:00
Alexey Bataev	84fb1c0314	[SLP] Reduce test, NFC. llvm-svn: 313630	2017-09-19 13:38:56 +00:00
Daniel Sanders	a591b205f2	[globalisel] Add support for intrinsic_void llvm-svn: 313629	2017-09-19 13:23:01 +00:00
Daniel Sanders	4b1144e7ae	[globalisel] Add support for intrinsic_w_chain. This maps directly to G_INTRINSIC_W_SIDE_EFFECTS. llvm-svn: 313627	2017-09-19 12:56:36 +00:00
Jina Nahias	b0f12aa95c	[x86] Lowering Mask Set1 intrinsics to LLVM IR This patch, together with a matching clang patch (https://reviews.llvm.org/D37668), implements the lowering of X86 mask set1 intrinsics to IR. Differential Revision: https://reviews.llvm.org/D37669 llvm-svn: 313625	2017-09-19 11:03:06 +00:00
Roger Ferrer Ibanez	43e3cefe97	[ARM] Use ADDCARRY / SUBCARRY This is a preparatory step for D34515. This change: - makes nodes ISD::ADDCARRY and ISD::SUBCARRY legal for i32 - lowering is done by first converting the boolean value into the carry flag using (_, C) ← (ARMISD::ADDC R, -1) and converted back to an integer value using (R, _) ← (ARMISD::ADDE 0, 0, C). An ARMISD::ADDE between the two operations does the actual addition. - for subtraction, given that ISD::SUBCARRY second result is actually a borrow, we need to invert the value of the second operand and result before and after using ARMISD::SUBE. We need to invert the carry result of ARMISD::SUBE to preserve the semantics. - given that the generic combiner may lower ISD::ADDCARRY and ISD::SUBCARRYinto ISD::UADDO and ISD::USUBO we need to update their lowering as well otherwise i64 operations now would require branches. This implies updating the corresponding test for unsigned. - add new combiner to remove the redundant conversions from/to carry flags to/from boolean values (ARMISD::ADDC (ARMISD::ADDE 0, 0, C), -1) → C - fixes PR34045 - fixes PR34564 Differential Revision: https://reviews.llvm.org/D35192 llvm-svn: 313618	2017-09-19 09:05:39 +00:00
Andrei Elovikov	1937c83616	Test commit. llvm-svn: 313617	2017-09-19 07:56:20 +00:00
Matt Arsenault	4d69a94a3c	AMDGPU: Run internalize symbols at -O0 The relocations used for externally visible functions aren't supported, so the direct call emitted ends up hitting a linker error. llvm-svn: 313616	2017-09-19 07:40:11 +00:00
Gadi Haber	c6fb224953	[X86][Skylake] Adding the scheduling information for the SkylakeClient target This patch adds the instruction scheduling information for the SkylakeClient (SKL) architecture target by adding the file X86SchedSkylakeClient.td located under the X86 Target. We used the scheduling information retrieved from the Skylake architects in order to create the file. The scheduling information includes latency, number of micro-Ops and used ports by each SKL instruction. The patch continues the scheduling replacement and insertion effort started with the SNB target in r307529 and r310792 and for HSW in r311879. Please expect some performance fluctuations due to code alignment effects. Reviewers: craig.topper, zvi, chandlerc, igorb, aymanmus, RKSimon, delena Differential Revision: https://reviews.llvm.org/D37294 llvm-svn: 313613	2017-09-19 06:19:27 +00:00
Craig Topper	21507d4d8e	[X86] Add VPERMPD/VPERMQ and VPERMPS/VPERMD to the execution domain fixing table. llvm-svn: 313610	2017-09-19 04:39:55 +00:00
Vedant Kumar	019979919f	[llvm-cov] Make report metrics agree with line exec counts, fixes PR34615 Use the same logic as the line-oriented coverage view to determine the number of covered lines in a function. Fixes llvm.org/PR34615. llvm-svn: 313604	2017-09-19 02:00:12 +00:00
Vedant Kumar	65c67f5133	[Coverage] Use gap regions to select better line exec counts After clang started emitting deferred regions (r312818), llvm-cov has had a hard time picking reasonable line execuction counts. There have been one or two generic improvements in this area (e.g r310012), but line counts can still report coverage for whitespace instead of code (llvm.org/PR34612). To fix the problem: * Introduce a new region kind so that frontends can explicitly label gap areas. This is done by changing the encoding of the columnEnd field of MappingRegion. This doesn't substantially increase binary size, and makes it easy to maintain backwards-compatibility. * Don't set the line count to a count from a gap area, unless the count comes from a wrapped segment. * Don't highlight gap areas as uncovered. Fixes llvm.org/PR34612. llvm-svn: 313597	2017-09-18 23:37:28 +00:00
Vedant Kumar	c4b4f71684	[llvm-cov] Repair a test. NFC. The checks with the MARKER prefix were not being run over the right input, because stderr was not redirected properly. llvm-svn: 313596	2017-09-18 23:37:27 +00:00
Yonghong Song	837145b1d0	bpf: add inline-asm support Signed-off-by: Yonghong Song <yhs@fb.com> Acked-by: Alexei Starovoitov <ast@kernel.org> llvm-svn: 313593	2017-09-18 23:29:36 +00:00
Yi Kong	0b6437a892	[ThinLTO/gold] Implement ThinLTO cache pruning support Differential Revision: https://reviews.llvm.org/D37993 llvm-svn: 313592	2017-09-18 23:24:55 +00:00
Hans Wennborg	fd12818ab8	Revert r313400 "[DebugInfo] Insert DW_OP_deref when spilling indirect DBG_VALUEs" This caused asserts in Chromium. See http://crbug.com/766261 > Summary: > This comes up in optimized debug info for C++ programs that pass and > return objects indirectly by address. In these programs, > llvm.dbg.declare survives optimization, which causes us to emit indirect > DBG_VALUE instructions. The fast register allocator knows to insert > DW_OP_deref when spilling indirect DBG_VALUE instructions, but the > LiveDebugVariables did not until this change. > > This fixes part of PR34513. I need to look into why this doesn't work at > -O0 and I'll send follow up patches to handle that. > > Reviewers: aprantl, dblaikie, probinson > > Subscribers: qcolombet, hiraditya, llvm-commits > > Differential Revision: https://reviews.llvm.org/D37911 llvm-svn: 313589	2017-09-18 23:08:42 +00:00
Zachary Turner	12d13d661c	[lit] Update clang and lld to use new config helpers. NFC intended here, this only updates clang and lld's lit configs to use some helper functionality in the lit.llvm submodule. llvm-svn: 313579	2017-09-18 22:26:48 +00:00
Sanjay Patel	458e63ca85	[DAGCombiner] fold assertzexts separated by trunc If we have an AssertZext of a truncated value that has already been AssertZext'ed, we can assert on the wider source op to improve the zext-y knowledge: assert (trunc (assert X, i8) to iN), i1 --> trunc (assert X, i1) to iN This moves a fold from being Mips-specific to general combining, and x86 shows improvements. Differential Revision: https://reviews.llvm.org/D37017 llvm-svn: 313577	2017-09-18 22:05:35 +00:00
Sanjay Patel	fb7cedf534	[InstCombine] auto-generate complete checks; NFC The code responsible for these transforms has the potential to add 2 instructions and break min/max patterns (PR33301). llvm-svn: 313575	2017-09-18 21:57:56 +00:00
Adrian Prantl	87821c19b9	llvm-dwarfdump: add a --show-parents options when selectively dumping DIEs. llvm-svn: 313567	2017-09-18 21:27:44 +00:00
Adrian Prantl	7399052eee	Fix typo in testcase. llvm-svn: 313566	2017-09-18 21:27:42 +00:00
Konstantin Zhuravlyov	407490a8b6	AMDGPU: Start selecting s_xnor_{b32, b64} Differential Revision: https://reviews.llvm.org/D37981 llvm-svn: 313565	2017-09-18 21:22:45 +00:00
Sanjay Patel	7ae3cb976f	[DAG, x86] allow store merging before and after legalization (PR34217) rL310710 allowed store merging to occur after legalization to catch stores that are created late, but this exposes a logic hole seen in PR34217: https://bugs.llvm.org/show_bug.cgi?id=34217 We will miss merging stores if the target lowers vector extracts into target-specific operations. This patch allows store merging to occur both before and after legalization if the target chooses to get maximum merging. I don't think the potential regressions in the other tests are relevant. The tests are for correctness of weird IR constructs rather than perf tests, and I think those are still correct. Differential Revision: https://reviews.llvm.org/D37987 llvm-svn: 313564	2017-09-18 20:54:26 +00:00
Craig Topper	80ff6ad748	[X86] Make sure we still emit zext for GR32 to GR64 when the source of the zext is AssertZext The AssertZext we might see in this case is only giving information about the lower 32 bits. It isn't providing information about the upper 32 bits. So we should emit a zext. This fixes PR28540. Differential Revision: https://reviews.llvm.org/D37729 llvm-svn: 313563	2017-09-18 20:49:13 +00:00
Alexey Bataev	2623a6b0e7	[SLP] Add a test for PR34635, NFC. llvm-svn: 313559	2017-09-18 19:33:30 +00:00
Sanjay Patel	15144d68cd	[x86] add tests for PR34217; NFC llvm-svn: 313548	2017-09-18 18:07:50 +00:00
Simon Pilgrim	f48c836d10	[X86][AVX] Improve (i8 bitcast (v8i1 x)) handling for 256-bit vector compare results. As commented on D37849, AVX1 targets were missing a chance to use vmovmskps for v8f32/v8i32 results for bool vector bitcasts llvm-svn: 313547	2017-09-18 17:58:31 +00:00
Sanjay Patel	f733cdf743	[x86] regenerate checks; NFC llvm-svn: 313545	2017-09-18 17:33:47 +00:00
Manoj Gupta	67f92622b1	[LoopVectorizer] Add more testcases for PR33804. Summary: Add test cases when float <-> pointer types conversion is triggered in presence of load instructions. Reviewers: Ayal, srhines, mkuper, rengolin Reviewed By: rengolin Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D37967 llvm-svn: 313544	2017-09-18 17:28:15 +00:00
Simon Pilgrim	1b4c429367	[SelectionDAG] Add BITCAST handling to ComputeNumSignBits for splatted sign bits. For cases where we are BITCASTing to vectors of smaller elements, then if the entire source was a splatted sign (src's NumSignBits == SrcBitWidth) we can say that the dst's NumSignBit == DstBitWidth, as we're just splitting those sign bits across multiple elements. We could generalize this but at the moment the only use case I have is to peek through bitcasts to vector comparison results. Differential Revision: https://reviews.llvm.org/D37849 llvm-svn: 313543	2017-09-18 16:45:05 +00:00
Craig Topper	0ec35b588c	[X86] Fix two more places to prefer VPERMQ/PD over VPERM2X128 when AVX2 is enabled The shuffle combining and lowerVectorShuffleAsLanePermuteAndBlend were both still trying to use VPERM2XF128 for unary shuffles when AVX2 is enabled. VPERM2X128 takes two inputs meaning when we use it for a unary shuffle one of those inputs is left undefined creating a false dependency on whatever register gets allocated there. If we have VPERMQ/PD we should prefer those since they only have a single input. Differential Revision: https://reviews.llvm.org/D37947 llvm-svn: 313542	2017-09-18 16:39:49 +00:00
Sam Parker	ddd8dfc9d7	[AArch64] Add V8_2aOps feature to Cortex-A55 and 75 Add the missing hardware features the ProcA55 and ProcA75 feature. These are already enabled via the target parser, but I had missed them in the backend. Differential Revision: https://reviews.llvm.org/D37974 llvm-svn: 313535	2017-09-18 14:46:14 +00:00
Sam Parker	d958cf8aa2	[ARM] Implement isTruncateFree Implement the isTruncateFree hooks, lifted from AArch64, that are used by TargetTransformInfo. This allows simplifycfg to reduce the test case into a single basic block. Differential Revision: https://reviews.llvm.org/D37516 llvm-svn: 313533	2017-09-18 14:28:51 +00:00

... 2 3 4 5 6 ...

47727 Commits