llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 11:42:57 +01:00

Author	SHA1	Message	Date
Balaram Makam	02c1602f50	MachineInstr: Reason locally about some memory objects before going to AA. This addresses a FIXME in MachineInstr::mayAlias. llvm-svn: 310825	2017-08-14 09:41:40 +00:00
Sam Parker	471134db57	[LoopUnroll] Enable option to peel remainder loop On some targets, the penalty of executing runtime unrolling checks and then not the unrolled loop can be significantly detrimental to performance. This results in the need to be more conservative with the unroll count, keeping a trip count of 2 reduces the overhead as well as increasing the chance of the unrolled body being executed. But being conservative leaves performance gains on the table. This patch enables the unrolling of the remainder loop introduced by runtime unrolling. This can help reduce the overhead of misunrolled loops because the cost of non-taken branches is much less than the cost of the backedge that would normally be executed in the remainder loop. This allows larger unroll factors to be used without suffering performance loses with smaller iteration counts. Differential Revision: https://reviews.llvm.org/D36309 llvm-svn: 310824	2017-08-14 09:25:26 +00:00
Sam Parker	c30b2fef41	[AArch64] Remove unused MC function An unused function warning was raised in https://bugs.llvm.org/show_bug.cgi?id=34178. The offending function, in AArch64MCCodeEmitter.cpp, was committed by me last week. Differential Revision: https://reviews.llvm.org/D36665 llvm-svn: 310823	2017-08-14 09:16:13 +00:00
Elad Cohen	7e65878369	Revert "[DAGCombiner] Extending pattern detection for vector shuffle (REAPPLIED)" This reverts commit r310782. llvm-svn: 310822	2017-08-14 09:06:00 +00:00
Chandler Carruth	b246fe2de1	[ValueTracking] Revert r310583 which enabled functionality that still is causing compile time issues. Moreover, the patch deleted the flag in addition to changing the default, and links to a code review that doesn't even discuss the flag and just has an update to a Clang test case. I've followed up on the commit thread to ask for numbers on compile time at this point, leaving the flag in place until things stabilize, and pointing at specific code that seems to exhibit excessive compile time with this patch. Original commit message for r310583: """ [ValueTracking] Enabling ValueTracking patch by default (recommit). Part 2. The original patch was an improvement to IR ValueTracking on non-negative integers. It has been checked in to trunk (D18777, r284022). But was disabled by default due to performance regressions. Perf impact has improved. The patch would be enabled by default. """" llvm-svn: 310816	2017-08-14 07:03:24 +00:00
Craig Topper	fc84bf6223	[AVX-512] Add hasSideEffects = 0 to the 8-bit and 16-bit register broadcasts. llvm-svn: 310813	2017-08-14 05:09:34 +00:00
Craig Topper	aae9fca245	[X86] Remove unused argument from the vextract_for_size multiclass. NFC llvm-svn: 310812	2017-08-14 05:09:33 +00:00
Craig Topper	984bf55654	[AVX512] Remove comment I should have removed in r310808. NFC llvm-svn: 310811	2017-08-14 05:09:31 +00:00
Brian Gesiak	f4edf082e8	[opt-viewer] Listify `dict_items` for Py3 indexing Summary: In Python 2, calling `dict.items()` returns an indexable `list`, whereas on Python 3 it returns a set-like `dict_items` object, which cannot be indexed. Explicitly onvert the `dict_items` object so that it can be indexed when using Python 3. In combination with D36622, D36623, and D36624, this change allows `opt-viewer.py` to exit successfully when run with Python 3.4. Test Plan: Run `opt-viewer.py` using Python 3.4 and confirm it does not encounter a runtime error when when indexing into `dict.items()`. Reviewers: anemet Reviewed By: anemet Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36630 llvm-svn: 310810	2017-08-14 04:16:43 +00:00
Chandler Carruth	e390910302	[PowerPC] Revert r310346 (and followups r310356 & r310424) which introduce a miscompile bug. There appears to be a bug where the generated code to extract the sign bit doesn't work correctly for 32-bit inputs. I've replied to the original commit pointing out the problem. I think I see by inspection (and reading the manual for PPC) how to fix this, but I can't be 100% confident and I also don't know what the best way to test this is. Currently it seems nearly impossible to get the backend to hit this code path, but the patch autohr is likely in a better position to craft such test cases than I am, and based on where the bug is it should be easily done. Original commit message for r310346: """ [PowerPC] Eliminate compares - add i32 sext/zext handling for SETLE/SETGE Adds handling for SETLE/SETGE comparisons on i32 values. Furthermore, it adds the handling for the special case where RHS == 0. Differential Revision: https://reviews.llvm.org/D34048 """ llvm-svn: 310809	2017-08-14 03:41:00 +00:00
Craig Topper	a6234bd44a	[AVX512] Simplify the instruction defintion for VEXTRACT. NFCI The comment about why we couldn't use avx512_maskable appears to have been incorrect. llvm-svn: 310808	2017-08-14 01:53:10 +00:00
Javed Absar	e0e3cf49f9	[ARM] Tidy-up Cortex-A15 DPR-SPR optimizer implementation Modernise the code with range-loops etc Reviewed by: @fhahn, @rovka Differential Revision: https://reviews.llvm.org/D36502 llvm-svn: 310807	2017-08-14 01:38:01 +00:00
Craig Topper	ae34cf0b5a	[InstCombine] Simplify and inline FoldOrWithConstants/FoldXorWithConstants Summary: These functions were overly complicated. The body of this function was rechecking for an And operation to find the constant, but we already knew we were looking at two Ands ORed together and the pieces are in variables. We already had earlier nearby code that checked for ConstantInts. So just inline the remaining parts into the earlier code. Next step is to use m_APInt instead of ConstantInt. Reviewers: spatel, efriedma, davide, majnemer Reviewed By: spatel Subscribers: zzheng, llvm-commits Differential Revision: https://reviews.llvm.org/D36439 llvm-svn: 310806	2017-08-14 00:04:21 +00:00
Simon Pilgrim	5f65eed245	[X86][BMI] Add BEXTR demanded bits test cases (PR34042) llvm-svn: 310802	2017-08-13 20:35:38 +00:00
Craig Topper	929f9894d5	[X86] Fix typo from r310794. Index = 0 should have been Index == 0. llvm-svn: 310801	2017-08-13 20:21:12 +00:00
Craig Topper	377502c5b1	[X86] Remove unused pattern fragment that referenced MVT::i1. NFC llvm-svn: 310799	2017-08-13 20:04:05 +00:00
Martin Storsjo	0a00e5ecac	[COFF, ARM64] Use '//' as comment character in assembly files in GNU environments This allows using semicolons for bundling up more than one statement per line. This is used within the mingw-w64 project in some assembly files that contain code for multiple architectures. Differential Revision: https://reviews.llvm.org/D36366 llvm-svn: 310797	2017-08-13 19:42:05 +00:00
Alex Bradbury	861b2b3298	Remove RISCV from LLVM_ALL_TARGETS in CMakeLists.txt It was mistakenly added to that list in D23560 (committed in rL285712). RISCV is an experimental backend and should never have been in that list, I mistakenly interpreted LLVM_ALL_TARGETS as a list of all targets rather than targets to build by default. Unfortunately, because of this the RISCV backend has been building by default when it shouldn't be. This commet adds a description comment, which should help to avoid such mistakes in the future. See my message to llvm-dev for more information and analysis <http://lists.llvm.org/pipermail/llvm-dev/2017-August/116347.html>. Differential Revision: https://reviews.llvm.org/D36538 llvm-svn: 310796	2017-08-13 18:49:33 +00:00
Craig Topper	0a08ea2b81	[AVX512] Correct isExtractSubvectorCheap so that it will return the correct answers for extracting 128-bits from a 512-bit vector and for mask registers. Previously it would not return true for extracting either of the upper quarters of a 512-bit registers. For mask registers we support extracting anything from index 0. And otherwise we only support extracting the upper half of a register. Differential Revision: https://reviews.llvm.org/D36638 llvm-svn: 310794	2017-08-13 17:40:02 +00:00
Craig Topper	84c8374521	[X86][ARM][TargetLowering] Add SrcVT to isExtractSubvectorCheap Summary: Without the SrcVT its hard to know what is really being asked for. For example if your target has 128, 256, and 512 bit vectors. Maybe extracting 128 from 256 is cheap, but maybe extracting 128 from 512 is not. For x86 we do support extracting a quarter of a 512-bit register. But for i1 vectors we don't have isel patterns for extracting arbitrary pieces. So we need this to have a correct implementation of isExtractSubvectorCheap for mask vectors. Reviewers: RKSimon, zvi, efriedma Reviewed By: RKSimon Subscribers: aemerson, javed.absar, kristof.beyls, llvm-commits Differential Revision: https://reviews.llvm.org/D36649 llvm-svn: 310793	2017-08-13 17:29:07 +00:00
Gadi Haber	f100c89fd8	[X86][SandyBridge] Additional updates to the SNB instructions scheduling information This is a continuation patch for commit r307529 which completely replaces the scheduling information for the SandyBridge architecture target by modifying the file X86SchedSandyBridge.td located under the X86 Target (see also https://reviews.llvm.org/D35019). In this patch we added the scheduling information of additional SNB instructions that were missing from the patch commit r307529, fixed the scheduling of several resource groups that include only port0 instead of port05 (i.e., port0 OR port5) and fixed several incorrect instructions' scheduling in the r307529 commit. The patch also includes the X87 instructions which were missing in previous patch commit r307529 as reported in bugzilla bug 34080. Reviewers: zvi, RKSimon, chandlerc, igorb, m_zuckerman, craig.topper, aymanmus, dim Differential Revision: https://reviews.llvm.org/D36388 llvm-svn: 310792	2017-08-13 13:59:24 +00:00
Simon Pilgrim	1ade4f1663	[X86][AVX512] Added additional shuffle+trunc test case. An existing test should have covered this but a typo caused it to fail. I've kept both as the codegen for the typo case needs addressing as well. llvm-svn: 310791	2017-08-13 12:30:36 +00:00
Simon Pilgrim	5600ace2e7	[X86][TBM] Add tests showing failure to fold RFLAGS result into TBM instructions. And fails to select TBM instructions at all. llvm-svn: 310790	2017-08-13 12:16:00 +00:00
Coby Tayree	78eb041ca7	[X86][AsmParser][AVX512] Error appropriately when K0 is tried as a write-mask K0 isn't expected as a write-mask, so provide a detailed error here, instead of the more generic one (invalid op for insn) Conforms with gas Differential Revision: https://reviews.llvm.org/D36570 llvm-svn: 310789	2017-08-13 12:03:00 +00:00
Simon Pilgrim	60a0895ab5	[X86][TBM] Regenerate bextri intrinsics tests. NFCI. llvm-svn: 310788	2017-08-13 11:56:15 +00:00
Guy Blank	7c35012e05	[X86][AVX512] Add combine for TESTM Add an X86 combine for TESTM when one of the operands is a BUILD_VECTOR(0,0,...). TESTM op0, BUILD_VECTOR(0,0,...) -> BUILD_VECTOR(0,0,...) TESTM BUILD_VECTOR(0,0,...), op1 -> BUILD_VECTOR(0,0,...) Differential Revision: https://reviews.llvm.org/D36536 llvm-svn: 310787	2017-08-13 08:03:37 +00:00
Craig Topper	7a9745b858	[X86] Early out of combineInsertSubvector for mask vectors. The combines here shouldn't be done for mask vectors, but it wasn't clear anything was preventing that. llvm-svn: 310786	2017-08-12 22:33:58 +00:00
Craig Topper	0174550128	[X86] Fix bad comment. NFC llvm-svn: 310785	2017-08-12 22:33:57 +00:00
Craig Topper	4f3b1b1c16	[X86] When handling addcarry intrinsic, create the flag result with the correct type so we don't crash if we use a memory instruction Summary: Previously we were creating the flag result with MVT::Other which is interpretted as a Chain node. If we used a memory form of the instruction we would end up with a copyToReg that consumed the chain result of the adcx instruction instead of the flag result. Pretty sure we should be using MVT::i32 here, that's what we do other places we create these node types. We should probably consider this for 5.0 as well. Reviewers: RKSimon, zvi, spatel Reviewed By: RKSimon Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D36645 llvm-svn: 310784	2017-08-12 20:19:44 +00:00
Simon Pilgrim	21f47bba68	[DAGCombiner] Extending pattern detection for vector shuffle (REAPPLIED) If all the operands of a BUILD_VECTOR extract elements from same vector then split the vector efficiently based on the maximum vector access index. Reapplied with fix to only work with simple value types. Committed on behalf of @jbhateja (Jatin Bhateja) Differential Revision: https://reviews.llvm.org/D35788 llvm-svn: 310782	2017-08-12 17:43:25 +00:00
Florian Hahn	ac5f3d563c	[Triple] Add isThumb and isARM functions. Summary: isThumb returns true for Thumb triples (little and big endian), isARM returns true for ARM triples (little and big endian). There are a few more checks using arm/thumb that are not covered by those functions, e.g. that the architecture is either ARM or Thumb (little endian) or ARM/Thumb little endian only. Reviewers: javed.absar, rengolin, kristof.beyls, t.p.northover Reviewed By: rengolin Subscribers: llvm-commits, aemerson Differential Revision: https://reviews.llvm.org/D34682 llvm-svn: 310781	2017-08-12 17:40:18 +00:00
Simon Pilgrim	4285fd5a57	[X86] Regenerate merge store tests. NFCI. Gives us a much better idea of what is going on than just relying on a few checks. llvm-svn: 310780	2017-08-12 17:27:35 +00:00
Sanjay Patel	9a31ece394	[BDCE] clear poison generators after turning a value into zero (PR33695, PR34037) nsw, nuw, and exact carry implicit assumptions about their operands, so we need to clear those after trivializing a value. We decided there was no danger for llvm.assume or metadata, so there's just a comment about that. This fixes miscompiles as shown in: https://bugs.llvm.org/show_bug.cgi?id=33695 https://bugs.llvm.org/show_bug.cgi?id=34037 Differential Revision: https://reviews.llvm.org/D36592 llvm-svn: 310779	2017-08-12 16:41:08 +00:00
Sylvestre Ledru	cf9dd4dfe6	Fix some minor typos in the llvm XRay exemple llvm-svn: 310777	2017-08-12 15:08:11 +00:00
Richard Smith	03165d9522	D36604: PR34148: Do not assume we can use a copy relocation for an `external_weak` global An `external_weak` global may be intended to resolve as a null pointer if it's not defined, so it doesn't make sense to use a copy relocation for it. Differential Revision: https://reviews.llvm.org/D36604 llvm-svn: 310773	2017-08-11 23:52:28 +00:00
Kostya Serebryany	a2bcec7132	[libFuzzer] experimental support for Clang's coverage (fprofile-instr-generate), Linux-only llvm-svn: 310771	2017-08-11 23:03:22 +00:00
Sanjay Patel	03422abc5b	[x86] add tests for rotate left/right with masked shifter; NFC As noted in the test comment, instcombine now produces the masked shift value even when it's not included in the source, so we should handle this. Although the AMD/Intel docs don't say it explicitly, over-rotating the narrow ops produces the same results. An existence proof that this works as expected on all x86 comes from gcc 4.9 or later: https://godbolt.org/g/K6rc1A llvm-svn: 310770	2017-08-11 22:38:40 +00:00
John Baldwin	bac856de84	[MIPS] Use ABI to determine stack alignment. Summary: The stack alignment depends on the ABI (16 bytes for N32 and N64 and 8 bytes for O32), not the CPU type. Reviewers: sdardis Reviewed By: sdardis Subscribers: atanasyan, arichardson, llvm-commits Differential Revision: https://reviews.llvm.org/D36326 llvm-svn: 310768	2017-08-11 22:07:56 +00:00
Sanjay Patel	1641032341	[x86] regenerate test checks, add 64-bit run; NFC llvm-svn: 310767	2017-08-11 22:05:33 +00:00
Eugene Zelenko	4d66583321	[Analysis] Fix some Clang-tidy modernize-use-using and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 310766	2017-08-11 21:30:02 +00:00
Zachary Turner	d4322f0ace	Fix some broken tests. These were pending in a separate patch but I forgot to squash them before comitting, and this one didn't go through. llvm-svn: 310764	2017-08-11 21:14:01 +00:00
Eli Friedman	28e7964c1c	[OptDiag] Updating Remarks in SampleProfile Updating remark API to newer OptimizationDiagnosticInfo API. This allows remarks to show up in diagnostic yaml file, and enables use of opt-viewer tool. Hotness information for remarks (L505 and L751) do not display hotness information, most likely due to profile information not being propagated yet. Unsure if this is the desired outcome. Patch by Tarun Rajendran. Differential Revision: https://reviews.llvm.org/D36127 llvm-svn: 310763	2017-08-11 21:12:04 +00:00
Craig Topper	3ac638a1ce	[X86] Don't use fsin/fcos/fsincos instructions ever Summary: Previously we would use these instructions if sse was disabled and fastmath was enabled. As mentioned in D28335, this is a bad idea. Reviewers: efriedma, scanon, DavidKreitzer Reviewed By: DavidKreitzer Subscribers: zvi, llvm-commits Differential Revision: https://reviews.llvm.org/D36344 llvm-svn: 310762	2017-08-11 20:55:29 +00:00
Rafael Espindola	17a0e64f99	Fix access to undefined weak symbols in pic code When the access to a weak symbol is not a call, the access has to be able to produce the value 0 at runtime. We were sometimes producing code sequences where that was not possible if the code was leaded more than 4g away from 0. llvm-svn: 310756	2017-08-11 20:49:27 +00:00
Zachary Turner	2d5cfb025a	Output S_SECTION symbols to the Linker module. PDBs need to contain 1 module for each object file/compiland, and a special one synthesized by the linker. This one contains a symbol record for each output section in the executable with its address information. This patch adds such symbols to the linker module. Note that we also are supposed to add an S_COFFGROUP symbol for what appears to be each input section that contributes to each output section, but it's not entirely clear how to generate these yet, so I'm leaving that for a separate patch. llvm-svn: 310754	2017-08-11 20:46:28 +00:00
Matt Arsenault	09562e957d	AMDGPU: Start adding tail call support Handle the sibling call cases. llvm-svn: 310753	2017-08-11 20:42:08 +00:00
Kostya Serebryany	95f9246136	[libFuzzer] recommend Clang Coverage for coverage visualization llvm-svn: 310751	2017-08-11 20:32:47 +00:00
George Karpenkov	dd6730b275	[libFuzzer] Re-enable coverage.test on Darwin. llvm-svn: 310750	2017-08-11 20:30:52 +00:00
Daniel Sanders	aac1c7e1d0	Revert r310716 (and r310735): [globalisel][tablegen] Support zero-instruction emission. Two of the Windows bots are failing test\CodeGen\X86\GlobalISel\select-inc.mir which should not have been affected by the change. Reverting while I investigate. Also reverted r310735 because it builds on r310716. llvm-svn: 310745	2017-08-11 19:19:21 +00:00
Zachary Turner	b24836fd63	Add documentation for llvm-pdbutil. llvm-svn: 310744	2017-08-11 19:00:22 +00:00

... 6 7 8 9 10 ...

153304 Commits