llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 02:52:53 +02:00

Author	SHA1	Message	Date
Andrea Di Biagio	d71feb8bcf	[MCA][Scheduler] Improved critical memory dependency computation. This fixes a problem where back-pressure increases caused by register dependencies were not correctly notified if execution was also delayed by memory dependencies. llvm-svn: 361740	2019-05-26 19:50:31 +00:00
Simon Pilgrim	595d9c5d8c	[SelectionDAG] GetDemandedBits - cleanup to more closely match SimplifyDemandedBits. NFCI. Prep work before adding demanded elts support. llvm-svn: 361739	2019-05-26 18:58:14 +00:00
Simon Pilgrim	09ad1f3612	[SelectionDAG] MaskedValueIsZero - add demanded elements implementation Will be used in an upcoming patch but I've updated the original implementation to call this to ensure test coverage. llvm-svn: 361738	2019-05-26 18:43:44 +00:00
Andrea Di Biagio	a0ca21f964	[MCA] Refactor the logic that computes the critical memory dependency info. NFCI CriticalRegDep has been renamed CriticalDependency, and it is now used by class Instruction to store information about the critical register dependency and the critical memory dependency. No functional change intended. llvm-svn: 361737	2019-05-26 18:41:35 +00:00
Shawn Landden	fa2f4e26b8	[SimplifyCFG] back out all SwitchInst commits They caused the sanitizer builds to fail. My suspicion is the change to countLeadingZeros(). llvm-svn: 361736	2019-05-26 18:15:51 +00:00
Simon Pilgrim	f0ec124437	[X86][SSE] Add shuffle combining support for ISD::ANY_EXTEND_VECTOR_INREG Reuses what we already have in place for ISD::ZERO_EXTEND_VECTOR_INREG just with a different sentinel llvm-svn: 361734	2019-05-26 16:00:35 +00:00
Shawn Landden	4aa39591db	[SimplifyCFG] NFC, one more fixed test from previous push. The old test was checking for a stupid subtract one that is a transform that makes the code worse. The constant-islands-jump-table.ll test wants the code a specific way, that makes sense, so I will submit code to fix that one. Sorry that I really didn't know how to run the test suite before this. llvm-svn: 361733	2019-05-26 15:29:10 +00:00
Simon Pilgrim	fa68bd7e06	Revert rL361731 : [LLParser] Fix uninitialized variable warnings. NFCI. These 3 variables cause quite a few warnings in the scan-build report on llvm. ........ Revert accidental commit. llvm-svn: 361732	2019-05-26 15:08:45 +00:00
Simon Pilgrim	e0bea1d37f	[LLParser] Fix uninitialized variable warnings. NFCI. These 3 variables cause quite a few warnings in the scan-build report on llvm. llvm-svn: 361731	2019-05-26 15:05:12 +00:00
Shawn Landden	2992304572	[SimplifyCFG] NFC, fix failing tests from last patches. No problems with the transforms. llvm-svn: 361730	2019-05-26 14:44:14 +00:00
Sanjay Patel	78254915c6	[InstCombine] prevent crashing with invalid extractelement index This was found/reduced from a fuzzer report: https://bugs.chromium.org/p/oss-fuzz/issues/detail?id=14956 llvm-svn: 361729	2019-05-26 14:03:50 +00:00
Shawn Landden	a0d036767d	[SimplifyCFG] ReduceSwitchRange: Improve on the case where the SubThreshold doesn't trigger llvm-svn: 361728	2019-05-26 13:55:52 +00:00
Shawn Landden	ec584726a2	[SimplifyCFG] Run ReduceSwitchRange unconditionally, generalize Rather than gating on "isSwitchDense" (resulting in necessarily sparse lookup tables even when they were generated), always run this quite cheap transform. This transform is useful not just for generating tables. LowerSwitch also wants this: read LowerSwitch.cpp:257. Be careful to not generate worse code, by introducing a SubThreshold heuristic. Instead of just sorting by signed, generalize the finding of the best base. And now that it is run unconditionally, do not replicate its functionality in SwitchToLookupTable (which could use a Sub when having a hole is smaller, hence the SubThreshold heuristic located in a single place). This simplifies SwitchToLookupTable, and fixes some ugly corner cases due to the use of signed numbers, such as a table containing i16 32768 and 32769, of which 32769 would be interpreted as -32768, and now the code thinks the table is size 65536. (We still use unconditional subtraction when building a single-register mask, but I think this whole block should go when the more general sparse map is added, which doesn't leave empty holes in the table.) And the reason test4 and test5 did not trigger was documented wrong: it was because they were not considered sufficiently "dense". Also, fix generation of invalid LLVM-IR: shl by bit-width. llvm-svn: 361727	2019-05-26 13:55:14 +00:00
Shawn Landden	0538f8a427	[SimplifyCFG] NFC, remove GCD that was only used for powers of two and replace with an equivalent countTrailingZeros. GCD is much more expensive than this, with repeated division. This depends on D60823 llvm-svn: 361726	2019-05-26 13:54:04 +00:00
Shawn Landden	a25e856525	[SimplifyCFG] NFC, update Switch tests to HEAD so I can see if my changes change anything Also add baseline tests to show effect of later patches. llvm-svn: 361725	2019-05-26 13:52:41 +00:00
Shawn Landden	3c7672be25	[Support] make countLeadingZeros() and countTrailingZeros() return unsigned This matches countLeadingOnes() and countTrailingOnes(), and APInt's countLeadingZeros() and countTrailingZeros(). (as well as __builtin_clzll()) llvm-svn: 361724	2019-05-26 13:49:58 +00:00
Nikita Popov	8d7b143927	[ValueTracking] Base computeOverflowForUnsignedMul() on ConstantRange code; NFCI The implementation in ValueTracking and ConstantRange are equally powerful, reuse the one in ConstantRange, which will make this easier to extend. llvm-svn: 361723	2019-05-26 13:22:01 +00:00
Nico Weber	e2f61e4247	gn build: Merge r361664 llvm-svn: 361722	2019-05-26 13:06:48 +00:00
Nikita Popov	217c17cf26	[InstCombine] Refactor OptimizeOverflowCheck; NFCI Extract method to compute overflow based on binop and signedness, and then make the result handling code generic. This extends the always-overflow handling to signed muls, but has currently no effect, as we don't compute always overflow for them (thus NFC). llvm-svn: 361721	2019-05-26 11:43:37 +00:00
Nikita Popov	b1c5bb3da1	[InstCombine] Remove OverflowCheckFlavor; NFC Instead pass binary op and signedness. The extra enum only makes things more complicated in this case. llvm-svn: 361720	2019-05-26 11:43:31 +00:00
David Green	0532e48bcd	[ARM] Select fp16 fma This adds a pattern for fma, similar to the float and double patterns. Differential Revision: https://reviews.llvm.org/D62330 llvm-svn: 361719	2019-05-26 11:34:30 +00:00
David Green	cf943eac87	[ARM] Select a number of fp16 rounding functions This adds patterns for fp16 round and ceil etc. Same as the float and double patterns. Differential Revision: https://reviews.llvm.org/D62326 llvm-svn: 361718	2019-05-26 11:13:00 +00:00
David Green	8dbf0fcde0	[ARM] Promote various fp16 math intrinsics Promote a number of fp16 math intrinsics to float, so that the relevant float math routines can be used. Copysign is expanded so as to be handled in-place. Differential Revision: https://reviews.llvm.org/D62325 llvm-svn: 361717	2019-05-26 10:59:21 +00:00
Simon Pilgrim	854db1c5c3	[X86][AVX] combineBitcastvxi1 - peek through bitops to determine size of original vector We were only testing for direct SETCC results - this allows us to peek through AND/OR/XOR combinations of the comparison results as well. There's a missing SEXT(PACKSS) fold that I need to investigate for v8i1 cases before I can enable it there as well. llvm-svn: 361716	2019-05-26 10:54:23 +00:00
David Green	799c41b22f	[ARM] Select fp16 fabs This adds a pattern for the fabs intrinsic, the same as float and double. Differential Revision: https://reviews.llvm.org/D62324 llvm-svn: 361715	2019-05-26 10:51:58 +00:00
David Green	ebcd7899ad	[ARM] Select fp16 fsqrt This adds a pattern for the sqrt intrinsic, the same as float and double. Differential Revision: https://reviews.llvm.org/D62322 llvm-svn: 361714	2019-05-26 10:42:24 +00:00
David Green	45adeac74d	[ARM] Promote fp16 frem Promote fp16 frem operations on ARM to floats so they call fmodf. Differential Revision: https://reviews.llvm.org/D62321 llvm-svn: 361713	2019-05-26 10:30:22 +00:00
David Green	dac287a665	[ARM] Add some base fullfp16 tests. NFC llvm-svn: 361712	2019-05-26 10:06:40 +00:00
Fangrui Song	72eb416dc0	[PowerPC] Add missing R_PPC_* relocation types While people mostly care about 64-bit, some systems need basic lib32 support. The plan is to make lld (see PR40888) capable of linking some applications (PR40888). llvm-svn: 361711	2019-05-26 08:31:00 +00:00
David Bolvansky	968bf43d7e	[SimplifyCFG] Added condition assumption for unreachable blocks Summary: PR41688 Reviewers: spatel, efriedma, craig.topper, hfinkel, reames Reviewed By: hfinkel Subscribers: javed.absar, dmgreen, fhahn, hfinkel, reames, nikic, lebedev.ri, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D61409 llvm-svn: 361707	2019-05-25 22:34:27 +00:00
Simon Pilgrim	6512a194aa	[X86] lowerBuildVectorToBitOp - support build_vector(shift()) -> shift(build_vector(),C) Commonly occurs in sign-extension cases llvm-svn: 361706	2019-05-25 18:02:17 +00:00
Robert Widmann	6c02607f00	[LLVM-C] Add Accessor for Mach-O Universal Binary Slices Summary: Allow for retrieving an object file corresponding to an architecture-specific slice in a Mach-O universal binary file. Reviewers: whitequark, deadalnix Reviewed By: whitequark Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D60378 llvm-svn: 361705	2019-05-25 16:47:27 +00:00
Nikita Popov	6a60222875	[X86] Combine fminnum/fmaxnum with non-nan operand to fmin/fmax If we have a known non-nan operand, place it in the second operand of fmin/fmax that is returned if either operand is nan. Differential Revision: https://reviews.llvm.org/D62448 llvm-svn: 361704	2019-05-25 16:44:29 +00:00
Nikita Popov	1c4dfb583f	[LVI][CVP] Add support for saturating add/sub Adds support for the uadd.sat family of intrinsics in LVI, based on ConstantRange methods from D60946. Differential Revision: https://reviews.llvm.org/D62447 llvm-svn: 361703	2019-05-25 16:44:14 +00:00
Simon Pilgrim	c4eff3e026	[X86][SSE] vector-sext - cleanup prefix lists Add X32-SSE common prefix to merge some checks llvm-svn: 361702	2019-05-25 16:33:17 +00:00
Sanjay Patel	b36b4dc57d	[SelectionDAG] define binops as a superset of commutative binops The test diffs show improved vector narrowing for integer min/max opcodes because those were all absent from the list. I'm not sure if we can expose functional diffs for all of the moved/added opcodes though. It seems like we are missing an AVX512 opportunity to use 256-bit ops in place of 512-bit ops on some tests/targets, but I think that can be a follow-up. Preliminary steps to make sure the callers are not misusing these queries: rL361268 rL361547 Differential Revision: https://reviews.llvm.org/D62191 llvm-svn: 361701	2019-05-25 15:28:55 +00:00
Nikita Popov	ab8b1ebeb6	[X86] Add tests for min/maxnum with const operand; NFC llvm-svn: 361700	2019-05-25 15:06:54 +00:00
Nikita Popov	e3d971d263	[LoopVectorize] Fix test by regenerating checks llvm-svn: 361699	2019-05-25 14:33:30 +00:00
Nikita Popov	2301567c31	[CVP] Remove unnecessary checks for empty GNWR; NFC The guaranteed no-wrap region is never empty, it always contains at least zero, so these optimizations don't ever apply. To make this more obviously true, replace the conservative return in makeGNWR with an assertion. llvm-svn: 361698	2019-05-25 14:11:55 +00:00
David Bolvansky	f5a3cdd398	[NFC] Make tests more robust for new optimizations llvm-svn: 361697	2019-05-25 14:10:20 +00:00
Sanjay Patel	67b43d983f	[SelectionDAG] soften assertion when legalizing narrow vector FP ops The test based on PR42010: https://bugs.llvm.org/show_bug.cgi?id=42010 ...may show an inaccuracy for PPC's target defs, but we should not be so aggressive with an assert here. There's no telling what out-of-tree targets look like. llvm-svn: 361696	2019-05-25 13:48:07 +00:00
David Bolvansky	33891f02e2	[NFC] Update test checks llvm-svn: 361695	2019-05-25 13:11:22 +00:00
Nikita Popov	6eb72baa95	[CVP] Add tests for saturating add/sub ranges; NFC llvm-svn: 361694	2019-05-25 09:53:51 +00:00
Nikita Popov	e18ebc4971	[LVI][CVP] Calculate with.overflow result range In LVI, calculate the range of extractvalue(op.with.overflow(%x, %y), 0) as the range of op(%x, %y). This is mainly useful in conjunction with D60650: If the result of the operation is extracted in a branch guarded against overflow, then the value of %x will be appropriately constrained and the result range of the operation will be calculated taking that into account. Differential Revision: https://reviews.llvm.org/D60656 llvm-svn: 361693	2019-05-25 09:53:45 +00:00
Nikita Popov	e405dadc9f	[LVI] Extract helper for binary range calculations; NFC llvm-svn: 361692	2019-05-25 09:53:37 +00:00
Craig Topper	9d88bde411	[X86FixupLEAs] Turn optIncDec into a generic two address LEA optimizer. Support LEA64_32r properly. INC/DEC is really a special case of a more generic issue. We should also turn leas into add reg/reg or add reg/imm regardless of the slow lea flags. This also supports LEA64_32 which has 64 bit input registers and 32 bit output registers. So we need to convert the 64 bit inputs to their 32 bit equivalents to check if they are equal to base reg. One thing to note, the original code preserved the kill flags by adding operands to the new instruction instead of using addReg. But I think tied operands aren't supposed to have the kill flag set. I dropped the kill flags, but I could probably try to preserve it in the add reg/reg case if we think it's important. Not sure which operand it's supposed to go on for the LEA64_32r instruction due to the super reg implicit uses. Though I'm also not sure those are needed since they were probably just created by an INSERT_SUBREG from a 32-bit input. Differential Revision: https://reviews.llvm.org/D61472 llvm-svn: 361691	2019-05-25 06:17:47 +00:00
Craig Topper	5185b1fd3e	[X86] Add zero idioms to the haswell, broadwell, and skylake schedule models. Add 256-bit fp xor to sandybridge zero idioms This copies the Sandy Bridge zero idiom support to later CPUs. Adding the AVX2 and AVX512F/VL instructions as appropriate. Differential Revision: https://reviews.llvm.org/D62360 llvm-svn: 361690	2019-05-25 04:47:49 +00:00
Craig Topper	1b939ae950	[X86][llvm-mca] Add zero idiom tests for Intel CPUs. NFC This pre-commits tests for D62360 llvm-svn: 361689	2019-05-25 04:47:42 +00:00
Peter Collingbourne	855b0e1cfe	Revert r361644, "[AMDGPU] Divergence driven ISel. Assign register class for cross block values according to the divergence." Broke sanitizer bots: http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux/builds/21694/steps/bootstrap%20clang/logs/stdio http://lab.llvm.org:8011/builders/sanitizer-x86_64-linux-fast/builds/32478/steps/check-llvm%20asan/logs/stdio llvm-svn: 361688	2019-05-25 01:52:38 +00:00
Akira Hatanaka	2e1cd6b131	Revert "[Analysis] Link library dependencies to Analysis plugins" This reverts commit r361340. The following builder has been broken for the past few days because of this commit: http://green.lab.llvm.org/green/job/clang-stage2-cmake-RgSan/ Also revert r361399, which was committed to fix r361340. llvm-svn: 361685	2019-05-25 00:50:03 +00:00

179417 Commits