llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 12:02:58 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	424dd446d9	[X86][SSE] Combine v16i8 SHL by constants to multiplies Pre-AVX512 (which can perform a quick extend/shift/truncate), extending to 2 v8i16 for the PMULLW and then truncating is more performant than relying on the generic PBLENDVB vXi8 shift path and uses a similar amount of mask constant pool data. Differential Revision: https://reviews.llvm.org/D48963 llvm-svn: 336513	2018-07-08 12:47:50 +00:00
Simon Pilgrim	d9072865e2	[X86] Set scheduler classes to unsupported. NFCI. While looking at PR36895 I noticed how much of the atom model was still setting schedules for unsupported SSE4+ instructions. llvm-svn: 336512	2018-07-08 10:32:07 +00:00
Roman Lebedev	ef342ad9c4	[X86][Basically NFC] Sched: split WriteBitScan into WriteBSF/WriteBSR. Summary: Motivation: {F6597954} This only does the mechanical splitting, does not actually change any numbers, as the tests added in previous revision show. Reviewers: craig.topper, RKSimon, courbet Reviewed By: craig.topper Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D48998 llvm-svn: 336511	2018-07-08 09:50:25 +00:00
Roman Lebedev	f199fb4bf4	[MCA][X86][NFC] Add BSF/BSR resource tests Reviewers: RKSimon, andreadb, courbet Reviewed By: RKSimon Subscribers: gbedwell, llvm-commits Differential Revision: https://reviews.llvm.org/D48997 llvm-svn: 336510	2018-07-08 09:50:14 +00:00
Craig Topper	1e1bb9959c	[LoopIdiomRecognize] Support for converting loops that use LSHR to CTLZ. In the 'detectCTLZIdiom' function support for loops that use LSHR instruction instead of ASHR has been added. This supports creating ctlz from the following code. int lzcnt(int x) { int count = 0; while (x > 0) { count++; x = x >> 1; } return count; } Patch by Olga Moldovanova Differential Revision: https://reviews.llvm.org/D48354 llvm-svn: 336509	2018-07-08 01:45:47 +00:00
Craig Topper	29c0f94899	[X86] Add back some intrinsic table entries lost in r336506. llvm-svn: 336508	2018-07-08 01:23:49 +00:00
Craig Topper	8a7baa3eef	[X86] Add new scalar fma intrinsics with rounding mode that use f32/f64 types. This allows us to handle masking in a very similar way to the default rounding version that uses llvm.fma. I had to add new rounding mode CodeGenOnly instructions to support isel when we can't find a movss to grab the upper bits from to use the b_Int instruction. Fast-isel tests have been updated to match new clang codegen. We are currently having trouble folding fneg into the new intrinsic. I'm going to correct that in a follow up patch to keep the size of this one down. A future patch will also remove the old intrinsics. llvm-svn: 336506	2018-07-08 01:10:43 +00:00
Craig Topper	bab39baa01	[X86] Use a rounding mode other than 4 in the scalar fma intrinsic fast-isel tests to match clang test cases. llvm-svn: 336505	2018-07-08 00:32:56 +00:00
Simon Pilgrim	d9a7cde08c	[X86] Regenerate PR14088 test. NFCI. llvm-svn: 336496	2018-07-07 20:08:27 +00:00
Simon Pilgrim	8689dcbc38	[SelectionDAG] Split float and integer isKnownNeverZero tests Splits off isKnownNeverZeroFloat to handle +/- 0 float cases. This will make it easier to be more aggressive with the integer isKnownNeverZero tests (similar to ValueTracking), use computeKnownBits etc. Differential Revision: https://reviews.llvm.org/D48969 llvm-svn: 336492	2018-07-07 18:17:14 +00:00
Simon Pilgrim	9224b711c4	Use const APInt& to avoid extra copy. NFCI. As discussed on D48825. llvm-svn: 336491	2018-07-07 17:33:48 +00:00
Simon Pilgrim	a8e606d72b	[DAGCombiner] Add EXTRACT_SUBVECTOR to SimplifyDemandedVectorElts As discussed on PR37989, this patch adds EXTRACT_SUBVECTOR handling to TargetLowering::SimplifyDemandedVectorElts and calls it from DAGCombiner::visitEXTRACT_SUBVECTOR. Differential Revision: https://reviews.llvm.org/D48825 llvm-svn: 336490	2018-07-07 17:30:06 +00:00
Simon Pilgrim	a4f67ce5f1	[CostModel][X86] Add SREM/UREM general and constant costs (PR38056) We penalize general SDIV/UDIV costs but don't do the same for SREM/UREM. This patch makes general vector SREM/UREM x20 as costly as scalar, the same approach as we do for SDIV/UDIV. The patch also extends the existing SDIV/UDIV constant costs for SREM/UREM - at the moment this means the additional cost of a MUL+SUB (see D48975). Differential Revision: https://reviews.llvm.org/D48980 llvm-svn: 336486	2018-07-07 16:53:30 +00:00
Chijun Sima	1f432345d3	Test commit llvm-svn: 336485	2018-07-07 16:22:22 +00:00
Gabor Buella	fe4fd06747	NFC - Typo fixes in X86 flags-copy-lowering.mir test Differential Revision: https://reviews.llvm.org/D48934 llvm-svn: 336484	2018-07-07 16:09:15 +00:00
Yvan Roux	5ab279cab6	[MachineOutliner] Add missing liveness tracking info in MIR test. This should bring the bots back to green state. llvm-svn: 336482	2018-07-07 08:42:31 +00:00
Yvan Roux	27968d5797	[MachineOutliner] Assert that Liveness tracking is accurate (NFC) The checking is done deeper inside MachineBasicBlock, but this will hopefully help to find issues when porting the machine outliner to a target where Liveness tracking is broken (like ARM). Differential Revision: https://reviews.llvm.org/D49023 llvm-svn: 336481	2018-07-07 08:02:19 +00:00
Chandler Carruth	f501d225b8	[Support] Clear errno before calling the function in RetryAfterSignal. For certain APIs, the return value of the function does not distinguish between failure (which populates errno) and other non-error conditions (which do not set errno). For example, `fgets` returns `NULL` both when an error has occurred, or upon EOF. If `errno` is already `EINTR` for whatever reason, then ``` RetryAfterSignal(nullptr, fgets, ...); ``` on a stream that has reached EOF would infinite loop. Fix this by setting `errno` to `0` before each attempt in `RetryAfterSignal`. Patch by Ricky Zhou! Differential Revision: https://reviews.llvm.org/D48755 llvm-svn: 336479	2018-07-07 02:46:12 +00:00
Chandler Carruth	6046248142	[PM/LoopUnswitch] Fix PR37889, producing the correct loop nest structure after trivial unswitching. This PR illustrates that a fundamental analysis update was not performed with the new loop unswitch. This update is also somewhat fundamental to the core idea of the new loop unswitch -- we actually update the CFG based on the unswitching. In order to do that, we need to update the loop nest in addition to the domtree. For some reason, when writing trivial unswitching, I thought that the loop nest structure cannot be changed by the transformation. But the PR helps illustrate that it clearly can. I've expanded this to a number of different test cases that try to cover the different cases of this. When we unswitch, we move an exit edge of a loop out of the loop. If this exit edge changes which loop reached by an exit is the innermost loop, it changes the parent of the loop. Essentially, this transformation may hoist the inner loop up the nest. I've added the simple logic to handle this reliably in the trivial unswitching case. This just requires updating LoopInfo and rebuilding LCSSA on the impacted loops. In the trivial case, we don't even need to handle dedicated exits because we're only hoisting the one loop and we just split its preheader. I've also ported all of these tests to non-trivial unswitching and verified that the logic already there correctly handles the loop nest updates necessary. Differential Revision: https://reviews.llvm.org/D48851 llvm-svn: 336477	2018-07-07 01:12:56 +00:00
Craig Topper	36043eef5b	[X86] Merge INTR_TYPE_3OP_RM with INTR_TYPE_3OP. Remove unused INTR_TYPE_1OP_RM. llvm-svn: 336476	2018-07-07 01:04:22 +00:00
Tim Shen	8dd0f7c995	Revert "[SCEV] Strengthen StrengthenNoWrapFlags (reapply r334428)." This reverts commit r336140. Our tests show that an LSR assert fails with it. llvm-svn: 336473	2018-07-06 23:20:35 +00:00
Benjamin Kramer	29dc1806ee	[PDB] memicmp only exists on Windows, use StringRef::compare_lower instead llvm-svn: 336469	2018-07-06 21:56:57 +00:00
Vedant Kumar	28a93dc229	Fix DIExpression::ExprOperand::appendToVector appendToVector used the wrong overload of SmallVector::append, resulting in it appending the same element to a vector `getSize()` times. This did not cause a problem when initially committed because appendToVector was only used to append 1-element operands. This changes appendToVector to use the correct overload of append(). Testing: ./unittests/IR/IRTests --gtest_filter='DIExpressionTest' llvm-svn: 336466	2018-07-06 21:06:21 +00:00
Vedant Kumar	0bce44c380	Remove a redundant null-check in DIExpression::prepend, NFC Code outside of an `if (Expr)` block dereferenced `Expr`, so the null check was redundant. llvm-svn: 336465	2018-07-06 21:06:20 +00:00
Zachary Turner	9243b281e0	[PDB] One more fix for hashing GSI records. The reference implementation uses a case-insensitive string comparison for strings of equal length. This will cause the string "tEo" to compare less than "VUo". However we were using a case sensitive comparison, which would generate the opposite outcome. Switch to a case insensitive comparison. Also, when one of the strings contains non-ascii characters, fall back to a straight memcmp. The only way to really test this is with a DIA test. Before this patch, the test will fail (but succeed if link.exe is used instead of lld-link). After the patch, it succeeds even with lld-link. llvm-svn: 336464	2018-07-06 21:01:42 +00:00
Vedant Kumar	587a26d422	Use Type::isIntOrPtrTy where possible, NFC It's a bit neater to write T.isIntOrPtrTy() over `T.isIntegerTy() || T.isPointerTy()`. I used Python's re.sub with this regex to update users: r'([\w.\->()]+)isIntegerTy\(\)\s\|\|\s\1isPointerTy\(\)' llvm-svn: 336462	2018-07-06 20:17:42 +00:00
Fangrui Song	02142d5985	[IR] Fix inconsistent declaration parameter name llvm-svn: 336459	2018-07-06 19:26:00 +00:00
Craig Topper	8bb662173b	[X86] Remove patterns for MOVLPD/MOVLPS nodes with integer types. Lowering shouldn't generate these. If we need to use them for integer types, it should use a bitcast. llvm-svn: 336458	2018-07-06 18:47:57 +00:00
Craig Topper	240650cf84	[X86] Add more FMA3 memory folding patterns. Remove patterns that are no longer needed. We've removed the legacy FMA3 intrinsics and are now using llvm.fma and extractelement/insertelement. So we don't need patterns for the nodes that could only be created by the old intrinsics. Those ISD opcodes still exist because we haven't dropped the AVX512 intrinsics yet, but those should go to EVEX instructions. llvm-svn: 336457	2018-07-06 18:47:55 +00:00
Matt Davis	7464e6c913	[llvm-mca] Add HardwareUnit and Context classes. This patch moves the construction of the default backend from llvm-mca.cpp and into mca::Context. The Context class is responsible for holding ownership of the simulated hardware components. These components are subclasses of HardwareUnit. Right now the HardwareUnit is pretty bare-bones, but eventually we might want to add some common functionality across all hardware components, such as isReady() or something similar. I have a feeling this patch will probably need some updates, but it's a start. One thing I am not particularly fond of is the rather large interface for createDefaultPipeline. That convenience routine takes a rather large set of inputs from the llvm-mca driver, where many of those inputs are generated via command line options. One item I think we might want to change is the separating of ownership of hardware components (owned by the context) and the pipeline (which owns Stages). In short, a Pipeline owns Stages, a Context (currently) owns hardware. The Pipeline's Stages make use of the components, and thus there is a lifetime dependency generated. The components must outlive the pipeline. We could solve this by having the Context also own the Pipeline, and not return a unique_ptr<Pipeline>. Now that I think about it, I like that idea more. Differential Revision: https://reviews.llvm.org/D48691 llvm-svn: 336456	2018-07-06 18:03:14 +00:00
Alexander Shaposhnikov	9383a6b0f8	[llvm-objcopy] Add support for static libraries This diff adds support for handling static libraries to llvm-objcopy and llvm-strip. Test plan: make check-all Differential revision: https://reviews.llvm.org/D48413 llvm-svn: 336455	2018-07-06 17:51:03 +00:00
Sanjay Patel	ac69f6b93e	[InstCombine] add more tests for potentially poisonous shifts; NFC llvm-svn: 336454	2018-07-06 17:44:57 +00:00
Nico Weber	1000ec7dd1	Revert 336426 (and follow-ups 428, 440), it very likely caused PR38084. llvm-svn: 336453	2018-07-06 17:37:24 +00:00
Vedant Kumar	c0eb33d15f	[Debugify] Allow unsigned values narrower than their variables Suppress the diagnostic for mis-sized dbg.values when a value operand is narrower than the unsigned variable it describes. Assume that a debugger would implicitly zero-extend these values. llvm-svn: 336452	2018-07-06 17:32:40 +00:00
Vedant Kumar	7e4e253821	[Local] replaceAllDbgUsesWith: Update debug values before RAUW The replaceAllDbgUsesWith utility helps passes preserve debug info when replacing one value with another. This improves upon the existing insertReplacementDbgValues API by: - Updating debug intrinsics in-place, while preventing use-before-def of the replacement value. - Falling back to salvageDebugInfo when a replacement can't be made. - Moving the responsibility for rewriting llvm.dbg.* DIExpressions into common utility code. Along with the API change, this teaches replaceAllDbgUsesWith how to create DIExpressions for three basic integer and pointer conversions: - The no-op conversion. Applies when the values have the same width, or have bit-for-bit compatible pointer representations. - Truncation. Applies when the new value is wider than the old one. - Zero/sign extension. Applies when the new value is narrower than the old one. Testing: - check-llvm, check-clang, a stage2 `-g -O3` build of clang, regression/unit testing. - This resolves a number of mis-sized dbg.value diagnostics from Debugify. Differential Revision: https://reviews.llvm.org/D48676 llvm-svn: 336451	2018-07-06 17:32:39 +00:00
Sanjay Patel	af4db2793f	[InstCombine] add more tests with poison and undef; NFC As discussed in D48987 and D48893, there are many different ways to go wrong depending on the binop (and as shown here we already do go wrong in some cases). llvm-svn: 336450	2018-07-06 17:24:32 +00:00
Tom Stellard	86b1ba6c63	AMDGPU: Fix UBSan error caused by r335942 Summary: Fixes PR38071. Reviewers: arsenm, dstenb Reviewed By: arsenm Subscribers: kzhuravl, wdng, nhaehnle, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D48979 llvm-svn: 336448	2018-07-06 17:16:17 +00:00
Sanjay Patel	75f2c3060c	[Constants] extend getBinOpIdentity(); NFC The enhanced version will be used in D48893 and related patches and an almost identical (fadd is different) version is proposed in D28907, so adding this as a preliminary step. llvm-svn: 336444	2018-07-06 15:18:58 +00:00
Sanjay Patel	8e3ee5bc36	[Constant] add undef element query for vector constants; NFC This is likely to be used in D48987 and similar patches, so adding it as an NFC preliminary step. llvm-svn: 336442	2018-07-06 14:52:36 +00:00
Sjoerd Meijer	5aeee587fd	[ARM] ParallelDSP: added statistics, NFC. Added statistics for the number of SMLAD instructions created, and also renamed the pass name to -arm-parallel-dsp. Differential Revision: https://reviews.llvm.org/D48971 llvm-svn: 336441	2018-07-06 14:47:09 +00:00
Diogo N. Sampaio	56f28af5cd	Commit rL336426 cause buildbot failures http://green.lab.llvm.org/green/job/clang-stage1-cmake-RA-incremental/50537/testReport/junit/LLVM/CodeGen_AArch64/FoldRedundantShiftedMasking_ll/ This removes the comments of the function label causing this error. llvm-svn: 336440	2018-07-06 14:41:09 +00:00
Benjamin Kramer	6f5f12fea0	[LoopSink] Make the enforcement of determinism deterministic. LoopBlockNumber is a DenseMap<BasicBlock, int>, comparing the result of find() will compare a pair<BasicBlock, int>. That's of course depending on pointer ordering which varies from run to run. Reverse iteration doesn't find this because we're copying to a vector first. This bug has been there since 2016 but only recently showed up on clang selfhost with FDO and ThinLTO, which is also why I didn't manage to get a reasonable test case for this. Add an assert that would've caught this. llvm-svn: 336439	2018-07-06 14:20:58 +00:00
Andrea Di Biagio	f4b2508e93	[llvm-mca] A write latency cannot be a negative value. NFC llvm-svn: 336437	2018-07-06 13:46:10 +00:00
Sjoerd Meijer	fc7fc1b734	[AArch64] Armv8.4-A: TLB support This adds: - outer shareable TLB Maintenance instructions, and - TLB range maintenance instructions. llvm-svn: 336434	2018-07-06 13:00:16 +00:00
Jonas Devlieghere	a400ed90dc	[dsymutil] Emit label at the begin of a CU When emitting a CU, store the MCSymbol pointing to the beginning of the CU. We'll need this information later when emitting the .debug_names section (DWARF5 accelerator table). llvm-svn: 336433	2018-07-06 12:49:54 +00:00
Sjoerd Meijer	757ee882e7	Recommit: [AArch64] Armv8.4-A: Flag manipulation instructions Now with the asm operand definition included. llvm-svn: 336432	2018-07-06 12:32:33 +00:00
Diogo N. Sampaio	4c13935522	Added missing semicolon llvm-svn: 336428	2018-07-06 10:09:04 +00:00
Diogo N. Sampaio	b19d6519a8	[SelectionDAG] https://reviews.llvm.org/D48278 D48278 Allow to reduce redundant shift masks. For example: x1 = x & 0xAB00 x2 = (x >> 8) & 0xAB can be reduced to: x1 = x & 0xAB00 x2 = x1 >> 8 It only allows folding when the masks and shift values are constants. llvm-svn: 336426	2018-07-06 09:42:25 +00:00
Sjoerd Meijer	5c16b3f6e6	Revert [AArch64] Armv8.4-A: Flag manipulation instructions It's causing build errors. llvm-svn: 336422	2018-07-06 08:39:43 +00:00
Sjoerd Meijer	9988992bb1	[AArch64] Armv8.4-A: Flag manipulation instructions These instructions are added to AArch64 only. Differential Revision: https://reviews.llvm.org/D48926 llvm-svn: 336421	2018-07-06 08:12:20 +00:00
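
The mask folding described in commit b19d6519a8 (r336426, later reverted by 1000ec7dd1) can be checked with a small sketch. This is only an arithmetic illustration of the example in the commit message, not the SelectionDAG code; the function names `masks_separate` and `masks_folded` are hypothetical and exist only for this example.

```python
# Illustration of the identity from the commit message:
#   x1 = x & 0xAB00
#   x2 = (x >> 8) & 0xAB   can be rewritten as   x2 = x1 >> 8
# Function names are hypothetical; this is not LLVM code.

def masks_separate(x):
    """Compute both values independently, as in the original code."""
    x1 = x & 0xAB00
    x2 = (x >> 8) & 0xAB
    return x1, x2


def masks_folded(x):
    """Reuse x1 to compute x2, as in the reduced form."""
    x1 = x & 0xAB00
    x2 = x1 >> 8
    return x1, x2


def forms_agree(limit=1 << 16):
    """Check that both forms agree for every input below `limit`."""
    return all(masks_separate(x) == masks_folded(x) for x in range(limit))
```

The two forms agree because 0xAB00 is exactly 0xAB shifted left by 8, so masking before or after the shift selects the same bits. As the commit message notes, the fold only applies when the masks and shift amounts are constants.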


166352 Commits
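
The loop quoted in commit 1e1bb9959c (r336509) counts how many right shifts it takes to clear a positive value, which is the bit width minus the number of leading zeros. The sketch below illustrates that equivalence in Python; it assumes a 32-bit `int` as in the commit's C example, and the names `lzcnt_loop` and `width_minus_ctlz` are hypothetical. It does not reproduce the LoopIdiomRecognize implementation.

```python
# Python rendering of the loop from the commit message, plus a closed form
# built from a leading-zero count. Assumes a 32-bit int; names are hypothetical.

def lzcnt_loop(x):
    """Direct translation of the C loop: shift right until x is no longer > 0."""
    count = 0
    while x > 0:
        count += 1
        x = x >> 1
    return count


def width_minus_ctlz(x, width=32):
    """Closed form: bit width minus leading zeros; 0 when the loop never runs."""
    if x <= 0:
        return 0
    ctlz = width - x.bit_length()  # leading zeros of x in a `width`-bit value
    return width - ctlz
```

For example, `lzcnt_loop(8)` runs four iterations (8, 4, 2, 1), and 8 has 28 leading zeros in 32 bits, so the closed form also gives 32 - 28 = 4.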