llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 19:12:56 +02:00

Sanjay Patel 050e5a4e3d [InstCombine] canonicalize shifty abs(): ashr+add+xor --> cmp+neg+sel	2017-12-16 16:41:17 +00:00

We want to do this for 2 reasons:
1. Value tracking does not recognize the ashr variant, so it would fail to match for cases like D39766.
2. DAGCombiner does better at producing optimal codegen when we have the cmp+sel pattern.

More detail about what happens in the backend:
1. DAGCombiner has a generic transform for all targets to convert the scalar cmp+sel variant of abs into the shift variant. That is the opposite of this IR canonicalization.
2. DAGCombiner has a generic transform for all targets to convert the vector cmp+sel variant of abs into either an ABS node or the shift variant. That is again the opposite of this IR canonicalization.
3. DAGCombiner has a generic transform for all targets to convert the exact shift variants produced by #1 or #2 into an ISD::ABS node. Note: It would be an efficiency improvement if we had #1 go directly to an ABS node when that's legal/custom.
4. The pattern matching above is incomplete, so it is possible to escape the intended/optimal codegen in a variety of ways.
   a. For #2, the vector path is missing the case for setlt with a '1' constant.
   b. For #3, we are missing a match for commuted versions of the shift variants.
5. Therefore, this IR canonicalization can only help get us to the optimal codegen. The version of cmp+sel produced by this patch will be recognized in the DAG and converted to an ABS node when possible or the shift sequence when not.
6. In the following examples with this patch applied, we may get conditional moves rather than the shift produced by the generic DAGCombiner transforms. The conditional move is created using a target-specific decision for any given target. Whether it is optimal or not for a particular subtarget may be up for debate.

define i32 @abs_shifty(i32 %x) {
  %signbit = ashr i32 %x, 31
  %add = add i32 %signbit, %x
  %abs = xor i32 %signbit, %add
  ret i32 %abs
}

define i32 @abs_cmpsubsel(i32 %x) {
  %cmp = icmp slt i32 %x, zeroinitializer
  %sub = sub i32 zeroinitializer, %x
  %abs = select i1 %cmp, i32 %sub, i32 %x
  ret i32 %abs
}

define <4 x i32> @abs_shifty_vec(<4 x i32> %x) {
  %signbit = ashr <4 x i32> %x, <i32 31, i32 31, i32 31, i32 31>
  %add = add <4 x i32> %signbit, %x
  %abs = xor <4 x i32> %signbit, %add
  ret <4 x i32> %abs
}

define <4 x i32> @abs_cmpsubsel_vec(<4 x i32> %x) {
  %cmp = icmp slt <4 x i32> %x, zeroinitializer
  %sub = sub <4 x i32> zeroinitializer, %x
  %abs = select <4 x i1> %cmp, <4 x i32> %sub, <4 x i32> %x
  ret <4 x i32> %abs
}

> $ ./opt -instcombine shiftyabs.ll -S | ./llc -o - -mtriple=x86_64 -mattr=avx
> abs_shifty:
>     movl %edi, %eax
>     negl %eax
>     cmovll %edi, %eax
>     retq
>
> abs_cmpsubsel:
>     movl %edi, %eax
>     negl %eax
>     cmovll %edi, %eax
>     retq
>
> abs_shifty_vec:
>     vpabsd %xmm0, %xmm0
>     retq
>
> abs_cmpsubsel_vec:
>     vpabsd %xmm0, %xmm0
>     retq
>
> $ ./opt -instcombine shiftyabs.ll -S | ./llc -o - -mtriple=aarch64
> abs_shifty:
>     cmp w0, #0 // =0
>     cneg w0, w0, mi
>     ret
>
> abs_cmpsubsel:
>     cmp w0, #0 // =0
>     cneg w0, w0, mi
>     ret
>
> abs_shifty_vec:
>     abs v0.4s, v0.4s
>     ret
>
> abs_cmpsubsel_vec:
>     abs v0.4s, v0.4s
>     ret
>
> $ ./opt -instcombine shiftyabs.ll -S | ./llc -o - -mtriple=powerpc64le
> abs_shifty:
>     srawi 4, 3, 31
>     add 3, 3, 4
>     xor 3, 3, 4
>     blr
>
> abs_cmpsubsel:
>     srawi 4, 3, 31
>     add 3, 3, 4
>     xor 3, 3, 4
>     blr
>
> abs_shifty_vec:
>     vspltisw 3, -16
>     vspltisw 4, 15
>     vsubuwm 3, 4, 3
>     vsraw 3, 2, 3
>     vadduwm 2, 2, 3
>     xxlxor 34, 34, 35
>     blr
>
> abs_cmpsubsel_vec:
>     vspltisw 3, -16
>     vspltisw 4, 15
>     vsubuwm 3, 4, 3
>     vsraw 3, 2, 3
>     vadduwm 2, 2, 3
>     xxlxor 34, 34, 35
>     blr

Differential Revision: https://reviews.llvm.org/D40984
llvm-svn: 320921
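The commit message treats the ashr+add+xor sequence and the cmp+neg+sel sequence as two spellings of the same abs(). The sketch below is not part of the patch. It is a small Python model, under the assumption that an i32 can be represented as a masked 32-bit pattern, that checks the two scalar forms agree on a few sample inputs. The function names mirror the IR examples; the helper `to_signed` and the sample list are illustrative only.

```python
MASK = 0xFFFFFFFF  # model an i32 as an unsigned 32-bit pattern (assumption of this sketch)


def to_signed(bits):
    """Interpret a 32-bit pattern as a two's-complement signed value."""
    return bits - (1 << 32) if bits & 0x80000000 else bits


def abs_shifty(x):
    """ashr + add + xor form of abs()."""
    bits = x & MASK
    signbit = MASK if bits & 0x80000000 else 0  # ashr i32 %x, 31
    added = (signbit + bits) & MASK             # add i32 %signbit, %x
    return to_signed(signbit ^ added)           # xor i32 %signbit, %add


def abs_cmpsubsel(x):
    """cmp + neg + select form of abs()."""
    bits = x & MASK
    is_negative = to_signed(bits) < 0           # icmp slt i32 %x, 0
    negated = (0 - bits) & MASK                 # sub i32 0, %x
    return to_signed(negated if is_negative else bits)  # select


# Both forms agree, including on the minimum i32 value, where the result wraps.
samples = [0, 1, -1, 5, -5, 2**31 - 1, -(2**31)]
assert all(abs_shifty(v) == abs_cmpsubsel(v) for v in samples)
```

In this model both forms return the input unchanged for the minimum i32 value, since its negation is not representable in 32 bits and wraps around.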
bindings	Update go bindings to use new functions from rL317135.	2017-11-02 10:22:26 +00:00
cmake	[cmake] Fix clang-cl cross-compilation on macOS	2017-12-15 01:05:48 +00:00
docs	[CodeGen] Print MCSymbol operands as <mcsymbol sym> in both MIR and debug output	2017-12-14 10:03:23 +00:00
examples	[CMake] Use PRIVATE in target_link_libraries for executables	2017-12-05 21:49:56 +00:00
include	[X86] Remove GCCBuiltin from kand/kandn/kor/kxor/kxnor/knot intrinsics so clang can implement with native IR.	2017-12-16 08:25:30 +00:00
lib	[InstCombine] canonicalize shifty abs(): ashr+add+xor --> cmp+neg+sel	2017-12-16 16:41:17 +00:00
projects	[cmake] Support moving debuginfo-tests to llvm/projects	2017-12-12 17:06:08 +00:00
resources
runtimes	[runtimes] Add install-*-stripped targets	2017-12-08 19:42:46 +00:00
test	[InstCombine] canonicalize shifty abs(): ashr+add+xor --> cmp+neg+sel	2017-12-16 16:41:17 +00:00
tools	Fixed warning 'function declaration isn’t a prototype [-Werror=strict-prototypes]'	2017-12-16 02:54:17 +00:00
unittests	[SimplifyLibCalls] Inline calls to cabs when it's safe to do so	2017-12-16 01:26:25 +00:00
utils	[TableGen][GlobalISel] Make the different Matcher comparable	2017-12-15 23:24:39 +00:00
.arcconfig	project_id is from another era in phabricator land and does not provide any value.	2016-09-27 15:47:29 +00:00
.clang-format
.clang-tidy
.gitattributes	[MC] Fix regression tests on Windows when git “core.autocrlf” is set to true.	2017-11-17 21:59:43 +00:00
.gitignore	gitignore: Ignore .vs folder (VS2017 config files)	2017-04-08 00:16:58 +00:00
CMakeLists.txt	[cmake] Only attempt to install MSVC system libraries on Windows	2017-12-14 18:41:49 +00:00
CODE_OWNERS.TXT	Update my email addresses, NFC.	2017-10-26 10:16:54 +00:00
configure
CREDITS.TXT	Add myself to CREDITS.txt	2017-09-18 14:33:39 +00:00
LICENSE.TXT	Bump year to 2017 in LICENSE.txt	2017-01-12 18:02:42 +00:00
llvm.spec.in
LLVMBuild.txt
README.txt	Test commit access	2017-08-18 02:39:28 +00:00
RELEASE_TESTERS.TXT	Update my email addresses, NFC.	2017-10-26 10:16:54 +00:00

README.txt

Low Level Virtual Machine (LLVM)
================================

This directory and its subdirectories contain source code for LLVM,
a toolkit for the construction of highly optimized compilers,
optimizers, and runtime environments.

LLVM is open source software. You may freely distribute it under the terms of
the license agreement found in LICENSE.TXT.

Please see the documentation provided in docs/ for further
assistance with LLVM, and in particular docs/GettingStarted.rst for getting
started with LLVM and docs/README.txt for an overview of LLVM's
documentation setup.

If you are writing a package for LLVM, see docs/Packaging.rst for our
suggestions.