llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-23 04:52:54 +02:00

Author	SHA1	Message	Date
Renato Golin	5f6382ce39	[ARM] Try to re-enable MachineBranchProb.ll for ARM/AArch64 Commit r298799 changed code that made the XFAIL on MachineBranchProb.ll irrelevant, but some configurations still failed. I can't reproduce it locally, so I'm hoping that enabling this will tell me if some configurations will really fail or if they were just too slow. llvm-svn: 299558	2017-04-05 16:27:11 +00:00
Sanjay Patel	d474da0792	[InstCombine] add tests for missing icmp fold (PR32524) llvm-svn: 299557	2017-04-05 16:21:38 +00:00
Dmitry Preobrazhensky	37ee3681af	[AMDGPU][MC] Fix for Bug 28167 + LIT tests Corrected src0 for v_writelane_b32: - Enabled inline constants and literals for SI/CI (VOP2) - Enabled inline constants for VI (VOP3) Reviewers: vpykhtin, arsenm https://reviews.llvm.org/D31463 llvm-svn: 299555	2017-04-05 16:08:21 +00:00
Nirav Dave	7ec57298c0	[SystemZ] Prevent Merging Bitcast with non-normal loads Fixes PR32505. Reviewers: uweigand, jonpa Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D31609 llvm-svn: 299552	2017-04-05 15:42:48 +00:00
Davide Italiano	63946788d8	[yaml2obj] Factor out error handling code. llvm-svn: 299551	2017-04-05 15:18:16 +00:00
Davide Italiano	f17960676f	[llvm-ar] Remove unneeded std::, NFCI. This makes it more consistent with other exit() calls in llvm-ar (and the tools in general). llvm-svn: 299549	2017-04-05 15:05:05 +00:00
Davide Italiano	c830c23e31	[llvm-ar] errors go on stderr and not on stdout. llvm-svn: 299548	2017-04-05 14:52:17 +00:00
Jonathan Roelofs	fdb54ce7e7	Respect CMAKE_INSTALL_MANDIR for sphinx generated manpages This is a re-work of r297516, which was reverted in r297545. https://reviews.llvm.org/D30906 llvm-svn: 299547	2017-04-05 14:49:46 +00:00
Davide Italiano	c6eb082245	[yaml2obj] Improve error message when output file cannot be opened. Patch by Sam Clegg! Differential Revision: https://reviews.llvm.org/D31351 llvm-svn: 299546	2017-04-05 14:44:00 +00:00
Matthew Simpson	1512933518	[LV] Make test case more robust This test case depends on the loop being vectorized without forcing the vectorization factor. If the profitability ever changes in the future (due to cost model improvements), the test may no longer work as intended. Instead of checking the resulting IR, we should just check the instruction costs. The costs will be computed regardless if vectorization is profitable. llvm-svn: 299545	2017-04-05 14:34:13 +00:00
Sanjay Patel	bb37f0efa2	[DAGCombiner] add and use TLI hook to convert and-of-seteq / or-of-setne to bitwise logic+setcc (PR32401) This is a generic combine enabled via target hook to reduce icmp logic as discussed in: https://bugs.llvm.org/show_bug.cgi?id=32401 It's likely that other targets will want to enable this hook for scalar transforms, and there are probably other patterns that can use bitwise logic to reduce comparisons. Note that we are missing an IR canonicalization for these patterns, and we will probably prefer the pair-of-compares form in IR (shorter, more likely to fold). Differential Revision: https://reviews.llvm.org/D31483 llvm-svn: 299542	2017-04-05 14:09:39 +00:00
Jonas Paulsson	168f955a23	[DAGCombiner] Don't make a BUILD_VECTOR with operands of illegal type. When DAGCombiner visits a SIGN_EXTEND_INREG of a BUILD_VECTOR with constant operands, a new BUILD_VECTOR node will be created transformed constants. Llvm-stress found a case where the new BUILD_VECTOR had constant operands of an illegal type, because the (legal) element type is in fact not a legal scalar type. This patch changes this so that the new BUILD_VECTOR has the same operand type as the old one. Review: Eli Friedman, Nirav Dave https://bugs.llvm.org//show_bug.cgi?id=32422 llvm-svn: 299540	2017-04-05 13:45:37 +00:00
Sanjay Patel	70ba8f5253	[InstCombine] add tests for missing add canonicalization; NFC llvm-svn: 299539	2017-04-05 13:33:10 +00:00
Daniel Sanders	157f479b66	[globalisel][tablegen] Fix patterns involving multiple ComplexPatterns. Summary: Temporaries are now allocated to operands instead of predicates and this allocation is used to correctly pair up the rendered operands with the matched operands. Previously, ComplexPatterns were allocated temporaries independently in the Src Pattern and Dst Pattern, leading to mismatches. Additionally, the Dst Pattern failed to account for the allocated index and therefore always used temporary 0, 1, ... when it should have used base+0, base+1, ... Thanks to Aditya Nandakumar for noticing the bug. Depends on D30539 Reviewers: ab, t.p.northover, qcolombet, rovka, aditya_nandakumar Reviewed By: rovka Subscribers: igorb, dberris, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D31054 llvm-svn: 299538	2017-04-05 13:14:03 +00:00
Sam Kolton	5469f4b9e9	[AMDGPU] SDWA peephole: enable by default Reviewers: vpykhtin, rampitec, arsenm Subscribers: qcolombet, kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye Differential Revision: https://reviews.llvm.org/D31671 llvm-svn: 299536	2017-04-05 12:00:45 +00:00
Alexander Kornienko	5701b2ee79	Fix WebAssembly after r299529. llvm-svn: 299535	2017-04-05 11:50:43 +00:00
Simon Pilgrim	f6fed8721d	[X86][SSE] Renamed combine to make it clear that it only handles the vector shift by immediate opcodes. NFCI llvm-svn: 299532	2017-04-05 10:44:42 +00:00
James Molloy	2a97944bc0	[AArch64] Crypto requires FP. So if FP is disabled, crypto should also be disabled. llvm-svn: 299531	2017-04-05 10:44:38 +00:00
Alex Bradbury	4a2d4860e6	Add MCContext argument to MCAsmBackend::applyFixup for error reporting A number of backends (AArch64, MIPS, ARM) have been using MCContext::reportError to report issues such as out-of-range fixup values in their TgtAsmBackend. This is great, but because MCContext couldn't easily be threaded through to the adjustFixupValue helper function from its usual callsite (applyFixup), these backends ended up adding an MCContext* argument and adding another call to applyFixup to processFixupValue. Adding an MCContext parameter to applyFixup makes this unnecessary, and even better - applyFixup can take a reference to MCContext rather than a potentially null pointer. Differential Revision: https://reviews.llvm.org/D30264 llvm-svn: 299529	2017-04-05 10:16:14 +00:00
James Molloy	e0107c42cc	[LAA] Correctly return a half-open range in expandBounds This is a latent bug that's been hanging around for a while. For a loop-invariant pointer, expandBounds would return the range {Ptr, Ptr}, but this was interpreted as a half-open range, not a closed range. So we ended up planting incorrect bounds checks. Even worse, they were tautological, so we ended up incorrectly executing the optimized loop. llvm-svn: 299526	2017-04-05 09:24:26 +00:00
Gor Nishanov	0c6ab1f70b	[coroutines] Add syntax coloring to examples in Coroutines.rst Subscribers: EricWF Differential Revision: https://reviews.llvm.org/D31699 llvm-svn: 299517	2017-04-05 05:26:26 +00:00
Akira Hatanaka	60ec474c05	[ObjCArc] Do not dereference an invalidated iterator. Fix a bug in ARC contract pass where an iterator that pointed to a deleted instruction was dereferenced. It appears that tryToContractReleaseIntoStoreStrong was incorrectly assuming that a call to objc_retain would not immediately follow a call to objc_release. rdar://problem/25276306 llvm-svn: 299507	2017-04-05 03:44:09 +00:00
Lang Hames	32959bb1fe	[RuntimeDyld] Remove an unused static member left over from r299449. llvm-svn: 299497	2017-04-05 01:43:59 +00:00
Bob Haarman	c1611a3e18	ThinLTOBitcodeWriter: handle aliases first in filterModule Summary: This change fixes a "local linkage requires default visibility" assert when attempting to build LLVM with ThinLTO on Windows. Reviewers: pcc, tejohnson, mehdi_amini Reviewed By: pcc Subscribers: llvm-commits, Prazek Differential Revision: https://reviews.llvm.org/D31632 llvm-svn: 299491	2017-04-05 00:42:07 +00:00
Ahmed Bougacha	aed61d98c5	[X86] Relax assert in broadcast-of-subvector lowering. Before r294774, there was a problem when lowering broadcasts to use 128-bit subvectors. When we looked through a bitcast to find the broadcast input, we'd keep using the original type, so you'd end up with things like: (v8f32 (broadcast (v4f32 (extract_subvector (v8i32 V), ...)) )) r294774 fixed it to always emit subvectors with the scalar type of the original source. It also introduced some asserts, to check that we use scalars with the same size, and vectors with the same number of elements. The scalar size equality is checked earlier when looking through bitcasts, and is a useful assert. However, the number of elements don't have to be identical: we're always going to extract a 128-bit subvector, and we can have different size inputs if we looked through a concat_vector to find a 256-bit source. Relax the overzealous assert. Replace it with a check of the original source vector being 256 or 512 bits. If it's 128 bits, we can't extract_subvector from it. Fixes PR32371. llvm-svn: 299490	2017-04-05 00:14:39 +00:00
Matt Arsenault	b931aad9bc	Allow targets to opt-in to codegen in SCC order Decouple this setting from EnableIRPA. To support function calls on AMDGPU, it is necessary to report the global register usage throughout the kernel's call graph, so callees need to be handled first. llvm-svn: 299487	2017-04-04 23:44:46 +00:00
Daniel Berlin	8b807559be	Re-apply MemorySSA: Add support for caching clobbering access in stores with some fixes. Summary: This enables us to cache the clobbering access for stores, despite the fact that we can't rewrite the use-def chains themselves. Early testing shows that, after this change, for larger testcases, it will be a significant net positive (memory and time) to remove the walker caching. Reviewers: george.burgess.iv, davide Subscribers: Prazek, llvm-commits Differential Revision: https://reviews.llvm.org/D31567 llvm-svn: 299486	2017-04-04 23:43:10 +00:00
Daniel Berlin	21f9f2721a	Revert "MemorySSA: Add support for caching clobbering access in stores" This reverts revision r299322. llvm-svn: 299485	2017-04-04 23:43:04 +00:00
Petr Hosek	6a9c1cd63d	[MC] Set defaults based on section names and support name suffixes Set correct default flags and section type based on its name for .text, .data, .bss, .init_array, .fini_array, .preinit_array, .tdata, and .tbss and support section name suffixes for .data., .rodata., .text., .bss., .tdata.* and .tbss.* which matches the behavior of GAS. Fixes PR31888. Differential Revision: https://reviews.llvm.org/D30229 llvm-svn: 299484	2017-04-04 23:32:45 +00:00
Ahmed Bougacha	4b7c6d9d15	[AArch64] Avoid partial register deps on insertelt of load into lane 0. This improves upon r246462: that prevented FMOVs from being emitted for the cross-class INSERT_SUBREGs by disabling the formation of INSERT_SUBREGs of LOAD. But the ld1.s that we started selecting caused us to introduce partial dependencies on the vector register. Avoid that by using SCALAR_TO_VECTOR: it's a first-class citizen that is folded away by many patterns, including the scalar LDRS that we want in this case. Credit goes to Adam for finding the issue! llvm-svn: 299482	2017-04-04 22:55:53 +00:00
Evgeniy Stepanov	bbceead2ad	Change section flag character for SHF_LINK_ORDER to "o". GAS uses "m" as a compatibility alias for "M" (SHF_MERGE). "o" is free, except on ia64, where it already means SHF_LINK_ORDER. llvm-svn: 299479	2017-04-04 22:35:08 +00:00
Craig Topper	2f2b6aaa87	[InstCombine] Add test cases for various add/subtracts of constants(scalar, splat, and vector) with phis and selects. Improvements coming in a future commit. llvm-svn: 299476	2017-04-04 22:22:30 +00:00
Rafael Espindola	7287f2f247	[lit] Add a minimum export implementation. llvm-svn: 299475	2017-04-04 22:20:18 +00:00
Sanjay Patel	cdd1a7d5f1	[InstCombine] rename variable for easier reading; NFC We usually give constants a 'C' somewhere in the name... llvm-svn: 299474	2017-04-04 22:06:03 +00:00
Craig Topper	fdbd9d31f7	[InstCombine] Turn subtract of vectors of i1 into xor like we do for scalar i1. Matches what we already do for add. llvm-svn: 299472	2017-04-04 21:44:56 +00:00
Balaram Makam	30b98313bf	[AArch64] Add missing schedinfo, check completeness for Falkor. llvm-svn: 299468	2017-04-04 21:15:53 +00:00
Keno Fischer	1495c2d33b	[ExecutionDepsFix] Don't revisit true dependencies If an instruction has a true dependency, it makes sense for to use that register for any undef read operands in the same instruction (we'll have to wait for that register to become available anyway). This logic was already implemented. However, the code would then still try to revisit that instruction and break the dependency (and always fail, since by definition a true dependency has to be live before the instruction). Avoid revisiting such instructions as a performance optimization. No functional change. Differential Revision: https://reviews.llvm.org/D30173 llvm-svn: 299467	2017-04-04 20:30:47 +00:00
Craig Topper	5fbccd1f57	[InstCombine] Support folding and/or/xor with a constant vector RHS into selects and phis Currently we only fold with ConstantInt RHS. This generalizes to any Constant RHS. Differential Revision: https://reviews.llvm.org/D31610 llvm-svn: 299466	2017-04-04 20:26:25 +00:00
Petr Hosek	1462430d42	[AArch64][Fuchsia] Allow -mcmodel=kernel for --target=aarch64-fuchsia This mode is just like -mcmodel=small except that it moves the thread pointer from TPIDR_EL0 to TPIDR_EL1. Patch by Roland McGrath. Differential Revision: https://reviews.llvm.org/D31624 llvm-svn: 299462	2017-04-04 19:51:53 +00:00
Craig Topper	255d48a84e	[InstCombine] Add test cases for missing combines of phis with and/or/xor with constant argument. NFC llvm-svn: 299460	2017-04-04 19:31:21 +00:00
Yi Kong	5694f0b69c	Implement host CPU detection for AArch64 This shares detection logic with ARM(32), since AArch64 capable CPUs may also run in 32-bit system mode. We observe weird /proc/cpuinfo output for MSM8992 and MSM8994, where they report all CPU cores as one single model, depending on which CPU core the kernel is running on. As a workaround, we hardcode the known CPU part name for these SoCs. For big.LITTLE systems, this patch would only return the part name of the first core (usually the little core). Proper support will be added in a follow-up change. Differential Revision: D31675 llvm-svn: 299458	2017-04-04 19:06:04 +00:00
Matt Arsenault	19cebcd948	Verifier: Check some amdgpu calling convention restrictions llvm-svn: 299457	2017-04-04 18:43:11 +00:00
Balaram Makam	2adb3f319c	[AArch64] Refine Falkor Machine Model - Part 2 llvm-svn: 299456	2017-04-04 18:42:14 +00:00
Coby Tayree	b96fff48ea	[X86][inline-asm] Add support for MS 'EVEN' directive MS assembly syntax provide us with the 'EVEN' directive as a synonymous to at&t '.even'. This patch include the (small, simple) changes need to allow it. Test is provided at the following (clang-side) review: https://reviews.llvm.org/D27418 Differential Revision: https://reviews.llvm.org/D27417 llvm-svn: 299453	2017-04-04 17:57:23 +00:00
Craig Topper	e60bc8b17f	[InstCombine] Add more test cases for missing combines of selects with and/or/xor with constant argument. NFC llvm-svn: 299450	2017-04-04 17:48:08 +00:00
Lang Hames	8e95f041c8	[RuntimeDyld] Make RuntimeDyld honor the ProcessAllSections flag. When the ProcessAllSections flag (introduced in r204398) is set RuntimeDyld is supposed to make a call to the client's memory manager for every section in each object that is loaded. Due to some missing checks, this was not happening in all cases. This patch adds the missing cases, and fixes the Orc unit test that verifies correct behavior for ProcessAllSections (The unit test had been silently bailing out due to an ordering issue: a change in the test order meant that this unit-test was running before the native target was registered. This issue has also been fixed in this patch). This fixes <rdar://problem/22789965> llvm-svn: 299449	2017-04-04 17:03:49 +00:00
Sanjay Patel	6ea8420a3c	[x86] remove dead select-of-constants transform; NFCI https://reviews.llvm.org/D30537 / https://reviews.llvm.org/rL296977 added these transforms and other related transforms to the generic DAGCombiner (with a hook that x86 sets to true), so these patterns should not exist by the time we reach the target-specific combiner hook. llvm-svn: 299448	2017-04-04 16:54:58 +00:00
Rong Xu	2501ed842c	[PGO] Memory intrinsic calls optimization based on profiled size This patch optimizes two memory intrinsic operations: memset and memcpy based on the profiled size of the operation. The high level transformation is like: mem_op(..., size) ==> switch (size) { case s1: mem_op(..., s1); goto merge_bb; case s2: mem_op(..., s2); goto merge_bb; ... default: mem_op(..., size); goto merge_bb; } merge_bb: Differential Revision: http://reviews.llvm.org/D28966 llvm-svn: 299446	2017-04-04 16:42:20 +00:00
Matt Arsenault	494ba6a33a	AMDGPU: Remove legacy export intrinsic llvm-svn: 299444	2017-04-04 16:34:39 +00:00
Matt Arsenault	a646fad79f	AMDGPU: Remove legacy image intrinsics llvm-svn: 299443	2017-04-04 16:34:35 +00:00

... 2 3 4 5 6 ...

147241 Commits