llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Craig Topper	fac7ae20e3	[X86] Allow X86ISD::Wrapper to be folded into the base of gather/scatter address If the base of our gather corresponds to something contained in X86ISD::Wrapper we should be able to fold it into the address. This patch refactors some of the address matching to more fully use the X86ISelAddressMode struct and the getAddressOperands helper. A new helper function matchVectorAddress is added to call matchWrapper or fall back to matchAddressBase. We should also be able to support constant offsets from a wrapper, but I'll look into that in a future patch. We may even be able to completely reuse matchAddress here, but I wanted to start simple and work up to it. Differential Revision: https://reviews.llvm.org/D39927 llvm-svn: 318057	2017-11-13 17:53:59 +00:00
Sanjay Patel	1d05d01aea	[ValueTracking] simplify code in CannotBeNegativeZero() with match(); NFCI llvm-svn: 318055	2017-11-13 17:40:47 +00:00
Sanjay Patel	d6b8e8276a	[Reassociate] add tests with 'reassoc' FMF; NFC llvm-svn: 318053	2017-11-13 17:29:11 +00:00
Jan Vesely	ef07cc017a	AMDGPU: Drop duplicate setOperationAction These are set with other scalar int ops few lines up Differential Revision: https://reviews.llvm.org/D39928 llvm-svn: 318051	2017-11-13 16:46:07 +00:00
Jatin Bhateja	b020a5ea70	[SCEV] Handling for ICmp occuring in the evolution chain. Summary: If a compare instruction is same or inverse of the compare in the branch of the loop latch, then return a constant evolution node. This shall facilitate computations of loop exit counts in cases where compare appears in the evolution chain of induction variables. Will fix PR 34538 Reviewers: sanjoy, hfinkel, junryoungju Reviewed By: sanjoy, junryoungju Subscribers: javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D38494 llvm-svn: 318050	2017-11-13 16:43:24 +00:00
Simon Dardis	c113ee0cce	Revert "[CodeGenPrepare] Check that erased sunken address are not reused" This reverts commit r318032. The test broke some sanitizer bots. llvm-svn: 318049	2017-11-13 16:41:17 +00:00
Diana Picus	47b4fbad40	[ARM GlobalISel] Update legalizer test Make one of the legalizer tests a bit more robust by making sure all values we're interested in are used (either in a store or a return) and by using loads instead of constants for obtaining values on fewer than 32 bits. This should make the test less fragile to changes in the legalize combiner, since those loads are legal (as opposed to the constants, which were being widened and thus produced opportunities for the legalize combiner). llvm-svn: 318047	2017-11-13 16:02:42 +00:00
Bill Seurer	623071d2a0	[PowerPC][msan] Update msan to handle changed memory layouts in newer kernels In more recent Linux kernels (including those with 47 bit VMAs) the layout of virtual memory for powerpc64 changed causing the memory sanitizer to not work properly. This patch adjusts a bit mask in the memory sanitizer to work on the newer kernels while continuing to work on the older ones as well. This is the non-runtime part of the patch and finishes it. ref: r317802 Tested on several 4.x and 3.x kernel releases. llvm-svn: 318045	2017-11-13 15:43:19 +00:00
Omer Paparo Bivas	faa015e9c5	Inserting a base test for X86 performance nops Change-Id: I69da08b617d7fae8024c5aee04720eb465f39b81 llvm-svn: 318041	2017-11-13 15:02:39 +00:00
Uriel Korach	764492eea3	[X86] test/testn intrinsics lowering to IR. llvm part. Remove builtins from llvm and add AutoUpgrade support. Also add fast-isel tests for the TEST and TESTN instructions. Differential Revision: https://reviews.llvm.org/D38736 llvm-svn: 318036	2017-11-13 12:51:18 +00:00
Greg Bedwell	63d0385262	Move the setting of LLVM_BUILD_MODE to a macro so that we can re-use it in compiler-rt Differential Revision: https://reviews.llvm.org/D38470 llvm-svn: 318034	2017-11-13 12:40:05 +00:00
Momchil Velikov	90e4f16c37	[ARM] Place jump table as the first operand in additions When generating table jump code for switch statements, place the jump table label as the first operand in the various addition instructions in order to enable addressing mode selectors to better match index computation and possibly fold them into the addressing mode of the table entry load instruction. Differential revision: https://reviews.llvm.org/D39752 llvm-svn: 318033	2017-11-13 11:56:48 +00:00
Simon Dardis	f823989359	[CodeGenPrepare] Check that erased sunken address are not reused CodeGenPrepare sinks address computations from one basic block to another and attempts to reuse address computations that have already been sunk. If the same address computation appears twice with the first instance as an operand of a load whose result is an operand to a simplifable select, CodeGenPrepare simplifies the select and recursively erases the now dead instructions. CodeGenPrepare then attempts to use the erased address computation for the second load. Fix this by erasing the cached address value if it has zero uses before looking for the address value in the sunken address map. This partially resolves PR35209. Thanks to Alexander Richardson for reporting the issue! Reviewers: john.brawn Differential Revision: https://reviews.llvm.org/D39841 llvm-svn: 318032	2017-11-13 11:47:21 +00:00
Florian Hahn	8cde3a989b	[CodeExtractor] Add missing AllowVarArgs initialization. llvm-svn: 318029	2017-11-13 11:08:47 +00:00
Florian Hahn	f3a8f80975	[PartialInliner] Inline vararg functions that forward varargs. Summary: This patch extends the partial inliner to support inlining parts of vararg functions, if the vararg handling is done in the outlined part. It adds a `ForwardVarArgsTo` argument to InlineFunction. If it is non-null, all varargs passed to the inlined function will be added to all calls to `ForwardVarArgsTo`. The partial inliner takes care to only pass `ForwardVarArgsTo` if the varargs handing is done in the outlined function. It checks that vastart is not part of the function to be inlined. `test/Transforms/CodeExtractor/PartialInlineNoInline.ll` (already part of the repo) checks we do not do partial inlining if vastart is used in a basic block that will be inlined. Reviewers: davide, davidxl, grosser Reviewed By: davide, davidxl, grosser Subscribers: gyiu, grosser, eraman, llvm-commits Differential Revision: https://reviews.llvm.org/D39607 llvm-svn: 318028	2017-11-13 10:35:52 +00:00
Sander de Smalen	5c8677322a	Test commit llvm-svn: 318027	2017-11-13 09:57:20 +00:00
Jina Nahias	6fbbd8d3e0	[x86][AVX512] Lowering shuffle i/f intrinsics to LLVM IR This patch, together with a matching clang patch (https://reviews.llvm.org/D38672), implements the lowering of X86 shuffle i/f intrinsics to IR. Differential Revision: https://reviews.llvm.org/D38671 Change-Id: I1e7d359a74743e995ec356237a85214ce55d3661 llvm-svn: 318026	2017-11-13 09:16:39 +00:00
Gadi Haber	b18b35ba8c	[X86][SKX] Adding scheduling info of non-intrinsic + commutable SKX opcodes. Updated the scheduling information of the SKX subtarget in the file X86SchedSkylakeServer.td under lib/Target/X86 to: 1. add regular opcodes in addition to the suffixed "_Int" opcodes 2. add the (V)MAXCPD/MAXCPS/MAXCSD/MAXCSS/MINCPD/MINCPS/MINCSD/MINCSS instructions that are equivalent to their counterparts without the 'C' as they are part of a hack to make floating point min/max commutable under fast math. Reviewers: zvi, RKSimon, craig.topper Differential Revision: https://reviews.llvm.org/D39833 Change-Id: Ie13702a5ce1b1a08af91ca637a52b6962881e7d6 llvm-svn: 318024	2017-11-13 08:42:07 +00:00
Craig Topper	0cdb5107e2	[X86] Limit NOPs to 7 bytes when 'slm' is spelled 'silvermont'. We support 2 spelling for silvermont and we should accept both here. llvm-svn: 318023	2017-11-13 08:17:30 +00:00
Craig Topper	105eba4796	[X86] Use sse_load_f32/f64 to improve load folding of scalar vfscalefss/sd, vrcp14ss/sd, rsqrt14ss/sd instructions. llvm-svn: 318022	2017-11-13 08:07:33 +00:00
Craig Topper	30f219e692	[X86] Regenerate test. NFC llvm-svn: 318021	2017-11-13 08:07:31 +00:00
Matt Arsenault	d5ebb5f6b0	MI: Print ranges on MMO llvm-svn: 318020	2017-11-13 07:09:20 +00:00
Craig Topper	aeb8a91f35	[X86] Use sse_load_f32/f64 to improve load folding for scalar VFPCLASS intrinsics. llvm-svn: 318019	2017-11-13 06:46:48 +00:00
Craig Topper	df613e52cd	[X86] Add tests for missed opportunities to fold a 128-bit vector load into vfpclassss and vpfpclasssd. llvm-svn: 318018	2017-11-13 06:46:46 +00:00
Matt Arsenault	f1c4439f56	AMDGPU: Preserve nuw in shl add ptr combine llvm-svn: 318017	2017-11-13 05:33:35 +00:00
Craig Topper	64463b58e5	[X86] Fix SQRTSS/SQRTSD/RCPSS/RCPSD intrinsics to use sse_load_f32/sse_load_f64 to increase load folding opportunities. llvm-svn: 318016	2017-11-13 05:25:24 +00:00
Craig Topper	a400aaf166	[X86] Add tests for full vector loads to fold-load-unops.ll. We should be able to fold a full vector load into a scalar intrinsic. Since it's legal to narrow a load. llvm-svn: 318015	2017-11-13 05:25:23 +00:00
Craig Topper	1eb8ba73a1	[X86] Regenerate fold-load-unops.ll and add and avx512f command line. llvm-svn: 318014	2017-11-13 05:25:21 +00:00
Matt Arsenault	1744d88a1d	AMDGPU: Fix multi-use shl/add combine This was using a custom function that didn't handle the addressing modes properly for private. Use isLegalAddressingMode to avoid duplicating this. Additionally, skip the combine if there is only one use since the standard combine will handle it. llvm-svn: 318013	2017-11-13 05:11:54 +00:00
Craig Topper	1924bedd3d	[X86] Attempt to fix signed and unsigned comparison warning. llvm-svn: 318010	2017-11-13 02:19:13 +00:00
Craig Topper	b3f37ebae3	[X86] Use sse_load_f32/f64 in patterns for the memory forms of VRNDSCALESS/SD. llvm-svn: 318009	2017-11-13 02:03:01 +00:00
Craig Topper	a08d3b7667	[X86] Use EVEX encoded VRNDSCALE instructions to implement the legacy round intrinsics. The VRNDSCALE instructions implement a superset of the (V)ROUND instructions. They are equivalent if the upper 4-bits of the immediate are 0. This patch lowers the legacy intrinsics to the VRNDSCALE ISD node and masks the upper bits of the immediate to 0. This allows us to take advantage of the larger register encoding space. We should maybe consider converting VRNDSCALE back to VROUND in the EVEX to VEX pass if the extended registers are not being used. I notice some load folding opportunities being missed for the VRNDSCALESS/SD instructions that I'll try to fix in future patches. llvm-svn: 318008	2017-11-13 02:03:00 +00:00
Craig Topper	bff032a8e3	[X86] Split VRNDSCALE/VREDUCE/VGETMANT/VRANGE ISD nodes into versions with and without the rounding operand. NFCI I want to reuse the VRNDSCALE node for the legacy SSE rounding intrinsics so that those intrinsics can use EVEX instructions. All of these nodes share tablegen multiclasses so I split them all so that they all remain similar in their implementations. llvm-svn: 318007	2017-11-13 02:02:58 +00:00
Matt Arsenault	8743ca93c4	Fix some misc. -enable-var-scope violations llvm-svn: 318006	2017-11-13 01:47:52 +00:00
Matt Arsenault	eaea634604	AMDGPU: Select d16 loads into low component of register llvm-svn: 318005	2017-11-13 00:22:09 +00:00
Matt Arsenault	d9c8715837	AMDGPU: Fix -enable-var-scope violations llvm-svn: 318004	2017-11-12 23:53:44 +00:00
Matt Arsenault	d207bf89ab	AMDGPU: Fix missing gfx9 atomic inc/dec tests The global instructions weren't tested. Plus there were also some -enable-var-scope violations and broken check prefixes. llvm-svn: 318003	2017-11-12 23:40:12 +00:00
Craig Topper	45210b7de6	[X86] Add an X86ISD::RANGES opcode to use for the scalar intrinsics. This fixes a bug where we selected packed instructions for scalar intrinsics. llvm-svn: 317999	2017-11-12 18:51:09 +00:00
Craig Topper	cc2b32a675	[X86] Add test cases and command lines demonstrating how we accidentally select vrangeps/vrangepd from vrangess/vrangesd instrinsics when the rounding mode is CUR_DIRECTION llvm-svn: 317998	2017-11-12 18:51:08 +00:00
Craig Topper	bc3b5e9708	[X86] Remove some no longer needed intrinsic lowering code. llvm-svn: 317997	2017-11-12 18:51:06 +00:00
Mandeep Singh Grang	32947f7b72	[llvm] Remove redundant return [NFC] Reviewers: davidxl, olista01, Eugene.Zelenko Reviewed By: Eugene.Zelenko Subscribers: sdardis, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D39917 llvm-svn: 317995	2017-11-12 03:47:50 +00:00
Craig Topper	6dbd706f89	[InstCombine] Teach visitICmpInst to not break integer absolute value idioms Summary: This patch adds an early out to visitICmpInst if we are looking at a compare as part of an integer absolute value idiom. Similar is already done for min/max. In the particular case I observed in a benchmark we had an absolute value of a load from an indexed global. We simplified the compare using foldCmpLoadFromIndexedGlobal into a magic bit vector, a shift, and an and. But the load result was still used for the select and the negate part of the absolute valute idiom. So we overcomplicated the code and lost the ability to recognize it as an absolute value. I've chosen a simpler case for the test here. Reviewers: spatel, davide, majnemer Reviewed By: spatel Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D39766 llvm-svn: 317994	2017-11-12 02:28:21 +00:00
Craig Topper	d9e3d1df1d	[X86] Use vrndscaleps/pd for 128/256 ffloor/ftrunc/fceil/fnearbyint/frint when avx512vl is enabled. This matches what we do for scalar and 512-bit types. llvm-svn: 317991	2017-11-11 21:44:51 +00:00
Craig Topper	807c3846c6	[X86] Remove avx512-round.ll. The 512-bit rounding tests are now in vec_floor.ll with 128/256 sizes. llvm-svn: 317990	2017-11-11 21:44:50 +00:00
Craig Topper	d9337d8964	[X86] Add avx512vl command line to vec_floor.ll. Add 512-bit test cases. llvm-svn: 317989	2017-11-11 21:44:49 +00:00
Craig Topper	389a205040	[X86] Add avx512f command line to rounding-ops.ll llvm-svn: 317988	2017-11-11 21:44:48 +00:00
Craig Topper	3a3c977330	[X86] Regenerate rounding-ops.ll with update_llc_test_checks.py llvm-svn: 317987	2017-11-11 21:44:47 +00:00
Simon Pilgrim	ca9ad6eb90	[X86] Attempt to match multiple binary reduction ops at once. NFCI matchBinOpReduction currently matches against a single opcode, but we already have a case where we repeat calls to try to match against AND/OR and I'll be shortly adding another case for SMAX/SMIN/UMAX/UMIN (D39729). This NFCI patch alters matchBinOpReduction to try and pattern match against any of the provided list of candidate bin ops at once to save time. Differential Revision: https://reviews.llvm.org/D39726 llvm-svn: 317985	2017-11-11 18:16:55 +00:00
Craig Topper	1561471af3	[X86] Add scalar register class versions of VRNDSCALE instructions and rename the existing versions to _Int. This is consistent with out normal implementation of scalar instructions. While there disable load folding for the patterns with IMPLICIT_DEF unless optimizing for size which is also our standard practice. llvm-svn: 317977	2017-11-11 08:24:15 +00:00
Craig Topper	f495c94083	[X86] Regenerate avx512-round.ll test. llvm-svn: 317976	2017-11-11 08:24:13 +00:00

... 2 3 4 5 6 ...

156715 Commits