llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Chris Bieneman	d7ad967701	[CMake] Feed BUNDLE_PATH through llvm target wrappers This feeds the new llvm_codesign BUNDLE_PATH option through from the llvm target wrapper functions, so that you can specify the BUNDLE_PATH on the target's codesign. llvm-svn: 362248	2019-05-31 17:40:49 +00:00
Puyan Lotfi	884785073a	[MIR-Canon] Don't do vreg skip for independent instructions if there are none. We don't want to create vregs if there is nothing to use them for. That causes verifier errors. Differential Revision: https://reviews.llvm.org/D62740 llvm-svn: 362247	2019-05-31 17:34:25 +00:00
Andrea Di Biagio	d7e6a83c85	[MCA] Refactor class BottleneckAnalysis. NFCI The resource pressure distribution computation is now delegated by class BottleneckAnalysis to an instance of class PressureTracker. Class PressureTracker is also responsible for: - tracking users of processor resource units. - tracking the number of delay cycles caused by increases in backpressure. BottleneckAnalysis internally initializes a dependency graph. Each node represents an instruction in the input code sequence. Edges of the dependency graph are critical register/memory/resource dependencies. Dependencies are only added to the graph if they are seen as critical by backend pressure events. The DependencyGraph is currently unused. It is possible to print the dependency graph (see method DependencyGraph::dump()) for debugging purposes. The long term goal is to use the information stored by the dependency graph in order to do critical path computation. llvm-svn: 362246	2019-05-31 17:18:34 +00:00
Philip Reames	a9fe21addd	[Tests] Add tests for loop predication of loops w/ne latch conditions llvm-svn: 362244	2019-05-31 16:54:38 +00:00
Nikita Popov	2c9d3b2748	[CVP] Simplify non-overflowing saturating add/sub If we can determine that a saturating add/sub will not overflow based on range analysis, convert it into a simple binary operation. This is a sibling transform to the existing with.overflow handling. Differential Revision: https://reviews.llvm.org/D62703 llvm-svn: 362242	2019-05-31 16:46:05 +00:00
Kevin P. Neal	0286c53a28	Revert revert of r362112 with minor SystemZ test file corrections. [FPEnv] Added a special UnrollVectorOp method to deal with the chain on StrictFP opcodes This change creates UnrollVectorOp_StrictFP. The purpose of this is to address a failure that consistently occurs when calling StrictFP functions on vectors whose number of elements is 3 + 2n on most platforms, such as PowerPC or SystemZ. The old UnrollVectorOp method does not expect that the vector that it will unroll will have a chain, so it has an assert that prevents it from running if this is the case. This new StrictFP version of the method deals with the chain while unrolling the vector. With this new function in place during vector widening, llc can run vector-constrained-fp-intrinsics.ll for SystemZ successfully. Submitted by: Drew Wock <drew.wock@sas.com> Reviewed by: Cameron McInally, Kevin P. Neal Approved by: Cameron McInally Differential Revision: https://reviews.llvm.org/D62546 llvm-svn: 362241	2019-05-31 16:32:12 +00:00
Stanislav Mekhanoshin	cfbb42515c	[AMDGPU] Use InliningThresholdMultiplier for inline hint AMDGPU uses multiplier 9 for the inline cost. It is taken into account everywhere except for inline hint threshold. As a result we are penalizing functions with the inline hint making them less probable to be inlined than those without the hint. Defaults are 225 for a normal function and 325 for a function with an inline hint. Currently we have effective threshold 225 * 9 = 2025 for normal functions and just 325 for those with the hint. That is fixed by this patch. Differential Revision: https://reviews.llvm.org/D62707 llvm-svn: 362239	2019-05-31 16:19:26 +00:00
Cameron McInally	f9f799e893	[NFC][InstCombine] Add unary FNeg tests to fabs.ll llvm-svn: 362238	2019-05-31 16:17:04 +00:00
Guozhi Wei	69dec5670b	[PPC] Correctly adjust branch probability in PPCReduceCRLogicals In PPCReduceCRLogicals after splitting the original MBB into 2, the 2 impacted branches still use original branch probability. This is unreasonable. Suppose we have the following code, and the probability of each successor is 50%. condc = conda || condb br condc, label %target, label %fallthrough It can be transformed to the following, br conda, label %target, label %newbb newbb: br condb, label %target, label %fallthrough Since each branch has a probability of 50% to each successor, the total probability to %fallthrough is 25% now, and the total probability to %target is 75%. This actually changed the original profiling data. A more reasonable probability can be set to 70% to the false side for each branch instruction, so the total probability to %fallthrough is close to 50%. This patch assumes the two incoming edges of the branch target have the same edge frequency, computes a new probability for each target, and keeps the total probability to the original targets unchanged. Differential Revision: https://reviews.llvm.org/D62430 llvm-svn: 362237	2019-05-31 16:11:17 +00:00
Cameron McInally	fa2b38d876	[NFC][InstCombine] Add unary FNeg tests to fcmp.ll llvm-svn: 362234	2019-05-31 15:40:03 +00:00
Jinsong Ji	96eb40626f	[MachinePipeliner][NFC] Add some debug log and statistics This is to add some log and statistics for debugging Differential Revision: https://reviews.llvm.org/D62165 llvm-svn: 362233	2019-05-31 15:35:19 +00:00
Cameron McInally	4d7b58d613	[NFC][InstCombine] Add unary FNeg tests to fdiv.ll llvm-svn: 362231	2019-05-31 15:10:34 +00:00
Simon Pilgrim	6c3794e196	[AMDGPU] Regenerate add/sub shrink constant tests for an upcoming patch llvm-svn: 362230	2019-05-31 15:06:51 +00:00
Simon Pilgrim	abc354a507	[AMDGPU] Regenerate CTLZ tests for an upcoming patch llvm-svn: 362229	2019-05-31 15:06:14 +00:00
Simon Pilgrim	828a13e476	[UpdateTestChecks] Add support for -march=r600 to match existing -march=amdgcn support llvm-svn: 362228	2019-05-31 15:05:06 +00:00
Cameron McInally	16ce3100db	[NFC][InstCombine] Add unary FNeg tests to fma.ll llvm-svn: 362227	2019-05-31 14:49:31 +00:00
George Rimar	769939a5dd	[llvm-readobj] - Remove excessive `dynamic.test` dynamic.test is a test that checks dumping of dynamic tags. It uses precompiled objects as inputs and it is completely excessive nowadays: Now we have elf-dynamic-tags-machine-specific.test and elf-dynamic-tags.test. (https://github.com/llvm-mirror/llvm/blob/master/test/tools/llvm-readobj/elf-dynamic-tags-machine-specific.test) (https://github.com/llvm-mirror/llvm/blob/master/test/tools/llvm-readobj/elf-dynamic-tags.test) First is used to check target specific tags and second tests the common flags. These tests use YAML, which is much better than using precompiled binaries. Note that new reviews tend to update the YAML based tests to add new tags, e.g. see D62596. With this patch it became possible to remove dynamic-table-so.aarch64 binary from the inputs folder. (other binaries are still used in other tests). Differential revision: https://reviews.llvm.org/D62728 llvm-svn: 362224	2019-05-31 13:16:21 +00:00
Nico Weber	9f2de13cb7	gn build: Merge r362160 llvm-svn: 362223	2019-05-31 12:07:05 +00:00
Nico Weber	aceb52b7fd	gn build: Merge r362196 llvm-svn: 362222	2019-05-31 11:52:59 +00:00
Nico Weber	cd278170e0	gn build: Merge r362190 llvm-svn: 362221	2019-05-31 11:51:42 +00:00
Russell Gallop	5febc54aae	ftime-trace: Trace loop passes These can take a significant amount of time in some builds. Suggested by Andrea Di Biagio. Differential Revision: https://reviews.llvm.org/D62666 llvm-svn: 362219	2019-05-31 10:14:04 +00:00
Roman Lebedev	1be644abc7	[InstCombine] 'C-(C2-X) --> X+(C-C2)' constant-fold It looks like this fold was already partially happening, indirectly via some other folds, but with a one-use limitation. No other fold here has that restriction. https://rise4fun.com/Alive/ftR llvm-svn: 362217	2019-05-31 09:47:16 +00:00
Roman Lebedev	4fa7f1c212	[InstCombine] 'add (sub C1, X), C2 --> sub (add C1, C2), X' constant-fold https://rise4fun.com/Alive/qJQ llvm-svn: 362216	2019-05-31 09:47:04 +00:00
Cullen Rhodes	6218d235e9	[AArch64][SVE2] Asm: support WHILE instructions Summary: Patch adds support for the following instructions: * WHILEGE, WHILEGT, WHILEHS, WHILEHI, WHILEWR, WHILERW The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62601 llvm-svn: 362215	2019-05-31 09:13:55 +00:00
Cullen Rhodes	832514ac74	[AArch64][SVE2] Asm: support TBL/TBX instructions Summary: A three sources variant of the TBL instruction is added to the existing SVE instruction in SVE2. This is implemented with minor changes to the existing TableGen class. TBX is a new instruction with its own definition. The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62600 llvm-svn: 362214	2019-05-31 09:06:53 +00:00
Cullen Rhodes	36661fd2e0	[AArch64][SVE2] Asm: support SVE2 store instructions Summary: Patch adds support for the following instructions: * STNT1B, STNT1H, STNT1S, STNT1D The specification can be found here: https://developer.arm.com/docs/ddi0602/latest Reviewed By: chill Differential Revision: https://reviews.llvm.org/D62599 llvm-svn: 362213	2019-05-31 08:59:40 +00:00
Petar Avramovic	a0540fcd76	[MIPS GlobalISel] Add detailed tests for lower call Test different operand types of callee and their behavior whether relocation model is pic or not. Possible operand types are: Register (function pointer), External symbol (used for libcalls e.g. __udivdi3 or memcpy), Global address. Global address has different handling depending on relocation model and linkage type. Register and external symbol do not. Differential Revision: https://reviews.llvm.org/D62590 llvm-svn: 362212	2019-05-31 08:40:08 +00:00
Sjoerd Meijer	ba322fceb7	Follow up and fix for rL362064 Fix the misleading indentation introduced in rL362064. This will get rid of the compiler warning, and it was actually a bug. This change will be used and tested in D62669. llvm-svn: 362211	2019-05-31 08:39:34 +00:00
Petar Avramovic	1f5827949f	[MIPS GlobalISel] Handle position independent code Handle position independent code for MIPS32. When callee is global address, lower call will emit callee as G_GLOBAL_VALUE and add target flag if needed. Support $gp in getRegBankFromRegClass(). Select G_GLOBAL_VALUE, specially handle case when there are target flags attached by lowerCall. Differential Revision: https://reviews.llvm.org/D62589 llvm-svn: 362210	2019-05-31 08:27:06 +00:00
Roman Lebedev	86b0868012	[NFC][InstCombine] Copy add/sub constant-folding tests from codegen Last three patterns are missed. llvm-svn: 362209	2019-05-31 08:24:07 +00:00
Roman Lebedev	4a12ec46dd	[NFC][Codegen] Add/sub constant-folding: add scalar tests too Just for completeness. llvm-svn: 362208	2019-05-31 08:23:48 +00:00
Petar Avramovic	7a7f031d27	[mips] Move initGlobalBaseReg to MipsFunctionInfo. NFC Move initGlobalBaseReg from MipsSEDAGToDAGISel to MipsFunctionInfo. This way functions used for handling position independent code during instruction selection, getGlobalBaseReg and initGlobalBaseReg, end up in same class. Differential Revision: https://reviews.llvm.org/D62586 llvm-svn: 362206	2019-05-31 08:15:28 +00:00
Craig Topper	00d48b9054	[InstructionSimplify] Add missing implementation of llvm::SimplifyUnOp. NFC There are no callers currently, but the function is declared so we should at least implement it. llvm-svn: 362205	2019-05-31 08:10:23 +00:00
Petar Avramovic	0bb9750cc1	[MIPS GlobalISel] Lower call for callee that is register Lower call for callee that is register for MIPS32. Register should contain callee function address. Differential Revision: https://reviews.llvm.org/D62585 llvm-svn: 362204	2019-05-31 08:06:17 +00:00
Craig Topper	f7b00565ae	[X86] Remove patterns for X86VSintToFP/X86VUintToFP+loadv4f32 to v2f64. These patterns can incorrectly narrow a volatile load from 128-bits to 64-bits. Similar to PR42079. Switch to using (v4i32 (bitcast (v2i64 (scalar_to_vector (loadi64))))) as the load pattern used in the instructions. This probably still has issues in 32-bit mode where loadi64 isn't legal. Maybe we should use VZMOVL for widened loads even when we don't need the upper bits as zeroes? llvm-svn: 362203	2019-05-31 07:38:26 +00:00
Craig Topper	b05e1785a6	[X86] Add test cases for failure to use 128-bit masked vcvtdq2pd when load starts as v2i32. llvm-svn: 362202	2019-05-31 07:38:22 +00:00
Craig Topper	7b1d0a7815	[X86] Add test cases for a volatile load shrinking bug involving cvtdq2pd. NFC Similar to PR42079 llvm-svn: 362201	2019-05-31 07:38:18 +00:00
Craig Topper	a8b84f0917	[X86] Copy a test case from avx512-cvt.ll to avx512-cvt-widen.ll. NFC llvm-svn: 362200	2019-05-31 07:38:14 +00:00
Craig Topper	7f8a753dec	[X86] Remove avx512 isel patterns for fpextend+load. Prefer to only match fp extloads instead. DAG combine will usually fold fpextend+load to an fp extload anyway. So the 256 and 512 patterns were probably unnecessary. The 128 bit pattern was special in that it looked for a v4f32 load, but then used it in an instruction that only loads 64-bits. This is bad if the load happens to be volatile. We could probably make the patterns volatile aware, but that's more work for something that's probably rare. The peephole pass might kick in and save us anyway. We might also be able to fix this with some additional DAG combines. This also adds patterns for vselect+extload to enable masked vcvtps2pd to be used. Previously we looked for the unlikely vselect+fpextend+load. llvm-svn: 362199	2019-05-31 06:21:53 +00:00
Craig Topper	ea3e43b916	[X86] Add test to show missed opportunity to use masked vcvtps2pd for vselect+extload. llvm-svn: 362198	2019-05-31 06:21:49 +00:00
Craig Topper	6c808eecd2	[X86] Add test case for PR42079. NFC llvm-svn: 362197	2019-05-31 06:21:45 +00:00
Puyan Lotfi	5ae700de13	[MIR-Canon] Skip the first N vreg names lazily. This consolidates the vreg skip code into one function (SkipVRegs()). SkipVRegs() now knows if it should skip as if it is the first initialization or subsequent skips. The first skip is also done the first time createVirtualRegister is called by the cursor instead of by the cursor's constructor. This prevents verifier errors on machine functions that have no vregs (where the verifier will complain that there are vregs when the function uses none). Differential Revision: https://reviews.llvm.org/D62717 llvm-svn: 362195	2019-05-31 06:02:38 +00:00
Craig Topper	c0ec451e29	[X86] Correct the ins operand order for MASKPAIR16STORE to match other store instructions. This makes the 5 address operands come first. And the data operand comes last. This matches the operand order the instruction is created with. It's also the expected order in X86MCInstLower. So everything appeared to work, but the operands didn't match their declared type. Fixes a -verify-machineinstrs failure. Also remove the isel patterns from these instructions since they should only be used for stack spills and reloads. I'm not even sure what types the patterns were looking for to match. llvm-svn: 362193	2019-05-31 05:20:27 +00:00
Puyan Lotfi	eeeb479eef	[MIR-Canon] Hardening propagateLocalCopies. This is almost NFC; it does the following: - If there is no register class for a COPY's src or dst, bail. - Fixes uses iterator invalidation bug. Differential Revision: https://reviews.llvm.org/D62713 llvm-svn: 362191	2019-05-31 04:49:58 +00:00
Richard Trieu	84feefcd69	Fix bad go bindings test. After r362128, the "byval" attribute has a stricter check and will cause an assertion. Remove the "byval" test case for now. llvm-svn: 362189	2019-05-31 03:45:11 +00:00
Pengfei Wang	d1fdadc458	[X86] Add VP2INTERSECT instructions Support Intel AVX512 VP2INTERSECT instructions in llvm Patch by Xiang Zhang (xiangzhangllvm) Differential Revision: https://reviews.llvm.org/D62366 llvm-svn: 362188	2019-05-31 02:50:41 +00:00
Petr Hosek	635759ce04	[CMake] Provide an option to use relative paths in debug info CMake always uses absolute file paths in the generated compiler invocation which results in absolute file paths being embedded in debug info. This is undesirable when building a toolchain e.g. on bots as the debug info may embed the bot source checkout path which is meaningless anywhere else. This change introduces the LLVM_USE_RELATIVE_PATHS_IN_DEBUG_INFO which uses -fdebug-prefix-map (where supported) options to rewrite paths embedded into debug info with relative ones. Additionally, LLVM_SOURCE_PREFIX can be used to override the path to source directory with a different one. Differential Revision: https://reviews.llvm.org/D62622 llvm-svn: 362185	2019-05-31 01:34:51 +00:00
Sam Clegg	1dad184da2	Fix -DBUILD_SHARED_LIBS=ON build after rL362160 Differential Revision: https://reviews.llvm.org/D62709 llvm-svn: 362180	2019-05-31 01:04:00 +00:00
Craig Topper	64b81fed53	[X86] Remove result type constraints from the extloadv2f32/extloadv4f32/extloadv8f32 PatFrags. NFC The result types aren't mentioned in the pattern name so really shouldn't be in the PatFrags. The users of these either have their own type constraint or rely on the type constraint system to realize the only legal extend would be to f64. llvm-svn: 362175	2019-05-30 23:35:24 +00:00
Matt Arsenault	ef4aaaee43	MISched: Fix -misched-regpressure=0 if subreg liveness enabled Test is waiting on fixing several more crashes in the AMDGPU scheduler implementation with this. llvm-svn: 362174	2019-05-30 23:31:36 +00:00
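
The branch probability arithmetic described in the PPCReduceCRLogicals entry (69dec5670b) can be checked with a short sketch. This is only an illustration of the numbers in the commit message, not the code from the patch: the function names are hypothetical, and `split_probabilities` is one reading of the stated assumption that the two edges into %target carry the same frequency while the total probability to each original target stays unchanged.

```python
from fractions import Fraction


def fallthrough_probability(p_false_first, p_false_second):
    """Probability of reaching %fallthrough after the split.

    Both branches must take their false side: conda false, then condb false.
    """
    return p_false_first * p_false_second


def split_probabilities(p_target):
    """Hypothetical helper: per-branch probabilities of jumping to %target.

    Assumption (taken from the commit message, not from the patch itself):
    the two edges into %target carry equal frequency, and the overall
    probability of reaching %target stays at p_target.
    """
    # Each incoming edge carries half of the original target probability.
    first_to_target = p_target / 2
    # The second branch only executes when the first one falls through to
    # %newbb, so its local probability is rescaled by that chance.
    second_to_target = first_to_target / (1 - first_to_target)
    return first_to_target, second_to_target


half = Fraction(1, 2)

# Reusing the original 50% on both branches skews the profile.
naive = fallthrough_probability(half, half)

# The "70% to the false side" approximation from the commit message.
approx = fallthrough_probability(Fraction(7, 10), Fraction(7, 10))

# Equal-edge-frequency split that preserves the original totals.
first, second = split_probabilities(half)
preserved = fallthrough_probability(1 - first, 1 - second)

print(naive, approx, first, second, preserved)
```

Under these assumptions the first branch jumps to %target with probability 1/4 and the second with 1/3, which leaves %fallthrough at exactly 1/2, whereas reusing 50% on both branches drops it to 1/4.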

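The two InstCombine constant folds listed above (1be644abc7 and 4fa7f1c212) are identities in wrapping integer arithmetic. The sketch below checks them exhaustively at a small bit width. It is a sanity check under assumptions rather than a proof like the linked Alive results: the 4-bit width and all helper names are chosen here for illustration and do not come from the commits.

```python
import itertools

BITS = 4  # assumed small width so the exhaustive check stays fast
MASK = (1 << BITS) - 1


def wrap(value):
    """Reduce a Python int to an unsigned BITS-wide value (two's complement wrap)."""
    return value & MASK


def sub_of_sub_fold_holds(c, c2, x):
    """Check C - (C2 - X) == X + (C - C2) at width BITS."""
    return wrap(c - wrap(c2 - x)) == wrap(x + wrap(c - c2))


def add_of_sub_fold_holds(c1, c2, x):
    """Check (C1 - X) + C2 == (C1 + C2) - X at width BITS."""
    return wrap(wrap(c1 - x) + c2) == wrap(wrap(c1 + c2) - x)


def folds_hold_for_all_inputs():
    """Try every combination of two constants and one variable value."""
    values = range(1 << BITS)
    return all(
        sub_of_sub_fold_holds(a, b, x) and add_of_sub_fold_holds(a, b, x)
        for a, b, x in itertools.product(values, repeat=3)
    )


print(folds_hold_for_all_inputs())
```

Because both sides of each fold are built only from wrapping add and sub, the identities hold modulo 2^BITS for any width; the small width just keeps the brute-force loop cheap.
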

179539 Commits