llvm-mirror/test/CodeGen/PowerPC/shift-cmp.ll

; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py
; RUN: llc -mtriple powerpc64le-unknown-linux-gnu < %s | FileCheck %s
; RUN: llc -mtriple powerpc64-ibm-aix-xcoff < %s | FileCheck %s

define i1 @and_cmp_variable_power_of_two(i32 %x, i32 %y) {
; CHECK-LABEL: and_cmp_variable_power_of_two:
; CHECK:       # %bb.0:
; CHECK-NEXT:    srw 3, 3, 4
; CHECK-NEXT:    blr
  %shl = shl i32 1, %y
  %and = and i32 %x, %shl
  %cmp = icmp eq i32 %and, %shl
  ret i1 %cmp
}

define i1 @and_cmp_variable_power_of_two_64(i64 %x, i64 %y) {
; CHECK-LABEL: and_cmp_variable_power_of_two_64:
; CHECK:       # %bb.0:
; CHECK-NEXT:    srd 3, 3, 4
; CHECK-NEXT:    blr
  %shl = shl i64 1, %y
  %and = and i64 %x, %shl
  %cmp = icmp eq i64 %and, %shl
  ret i1 %cmp
}

define i1 @and_ncmp_variable_power_of_two(i32 %x, i32 %y) {
; CHECK-LABEL: and_ncmp_variable_power_of_two:
; CHECK:       # %bb.0:
; CHECK-NEXT:    srw 3, 3, 4
; CHECK-NEXT:    xori 3, 3, 1
; CHECK-NEXT:    blr
  %shl = shl i32 1, %y
  %and = and i32 %x, %shl
  %cmp = icmp ne i32 %and, %shl
  ret i1 %cmp
}

define i1 @and_ncmp_variable_power_of_two_64(i64 %x, i64 %y) {
; CHECK-LABEL: and_ncmp_variable_power_of_two_64:
; CHECK:       # %bb.0:
; CHECK-NEXT:    srd 3, 3, 4
; CHECK-NEXT:    xori 3, 3, 1
; CHECK-NEXT:    blr
  %shl = shl i64 1, %y
  %and = and i64 %x, %shl
  %cmp = icmp ne i64 %and, %shl
  ret i1 %cmp
}
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; NOTE: Assertions have been autogenerated by utils/update_llc_test_checks.py`
[PowerPC] [NFC] Add AIX triple to some regression tests As part of the effort to improve AIX support, regression test coverage misses quite a lot for AIX subtarget. This patch adds AIX triple to those don't need extra change, and we can cover more cases in following commits. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D94159 2021-01-18 04:44:00 +01:00			`; RUN: llc -mtriple powerpc64le-unknown-linux-gnu < %s \| FileCheck %s`
			`; RUN: llc -mtriple powerpc64-ibm-aix-xcoff < %s \| FileCheck %s`
[PowerPC] Add a pattern for a runtime bit check Following a suggestion by Sanjay, we should lower: %shl = shl i32 1, %y %and = and i32 %x, %shl %cmp = icmp eq i32 %and, %shl ret i1 %cmp into: subfic r4, r4, 32 rlwnm r3, r3, r4, 31, 31 Add this pattern and some associated patterns for the 64-bit case and the not-equal case. Fixes PR27356. llvm-svn: 280454 2016-09-02 04:34:44 +02:00
			`define i1 @and_cmp_variable_power_of_two(i32 %x, i32 %y) {`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-LABEL: and_cmp_variable_power_of_two:`
			`; CHECK: # %bb.0:`
[Codegen] (X & (C l>>/<< Y)) ==/!= 0 --> ((X <</l>> Y) & C) ==/!= 0 fold Summary: This was originally reported in D62818. https://rise4fun.com/Alive/oPH InstCombine does the opposite fold, in hope that `C l>>/<< Y` expression will be hoisted out of a loop if `Y` is invariant and `X` is not. But as it is seen from the diffs here, if it didn't get hoisted, the produced assembly is almost universally worse. Much like with my recent "hoist add/sub by/from const" patches, we should get almost universal win if we hoist constant, there is almost always an "and/test by imm" instruction, but "shift of imm" not so much, so we may avoid having to materialize the immediate, and thus need one less register. And since we now shift not by constant, but by something else, the live-range of that something else may reduce. Special care needs to be applied not to disturb x86 `BT` / hexagon `tstbit` instruction pattern. And to not get into endless combine loop. Reviewers: RKSimon, efriedma, t.p.northover, craig.topper, spatel, arsenm Reviewed By: spatel Subscribers: hiraditya, MaskRay, wuzish, xbolva00, nikic, nemanjai, jvesely, wdng, nhaehnle, javed.absar, tpr, kristof.beyls, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62871 llvm-svn: 366955 2019-07-25 00:57:22 +02:00			`; CHECK-NEXT: srw 3, 3, 4`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-NEXT: blr`
[PowerPC] Add a pattern for a runtime bit check Following a suggestion by Sanjay, we should lower: %shl = shl i32 1, %y %and = and i32 %x, %shl %cmp = icmp eq i32 %and, %shl ret i1 %cmp into: subfic r4, r4, 32 rlwnm r3, r3, r4, 31, 31 Add this pattern and some associated patterns for the 64-bit case and the not-equal case. Fixes PR27356. llvm-svn: 280454 2016-09-02 04:34:44 +02:00			`%shl = shl i32 1, %y`
			`%and = and i32 %x, %shl`
			`%cmp = icmp eq i32 %and, %shl`
			`ret i1 %cmp`
			`}`

			`define i1 @and_cmp_variable_power_of_two_64(i64 %x, i64 %y) {`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-LABEL: and_cmp_variable_power_of_two_64:`
			`; CHECK: # %bb.0:`
[Codegen] (X & (C l>>/<< Y)) ==/!= 0 --> ((X <</l>> Y) & C) ==/!= 0 fold Summary: This was originally reported in D62818. https://rise4fun.com/Alive/oPH InstCombine does the opposite fold, in hope that `C l>>/<< Y` expression will be hoisted out of a loop if `Y` is invariant and `X` is not. But as it is seen from the diffs here, if it didn't get hoisted, the produced assembly is almost universally worse. Much like with my recent "hoist add/sub by/from const" patches, we should get almost universal win if we hoist constant, there is almost always an "and/test by imm" instruction, but "shift of imm" not so much, so we may avoid having to materialize the immediate, and thus need one less register. And since we now shift not by constant, but by something else, the live-range of that something else may reduce. Special care needs to be applied not to disturb x86 `BT` / hexagon `tstbit` instruction pattern. And to not get into endless combine loop. Reviewers: RKSimon, efriedma, t.p.northover, craig.topper, spatel, arsenm Reviewed By: spatel Subscribers: hiraditya, MaskRay, wuzish, xbolva00, nikic, nemanjai, jvesely, wdng, nhaehnle, javed.absar, tpr, kristof.beyls, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62871 llvm-svn: 366955 2019-07-25 00:57:22 +02:00			`; CHECK-NEXT: srd 3, 3, 4`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-NEXT: blr`
[PowerPC] Add a pattern for a runtime bit check Following a suggestion by Sanjay, we should lower: %shl = shl i32 1, %y %and = and i32 %x, %shl %cmp = icmp eq i32 %and, %shl ret i1 %cmp into: subfic r4, r4, 32 rlwnm r3, r3, r4, 31, 31 Add this pattern and some associated patterns for the 64-bit case and the not-equal case. Fixes PR27356. llvm-svn: 280454 2016-09-02 04:34:44 +02:00			`%shl = shl i64 1, %y`
			`%and = and i64 %x, %shl`
			`%cmp = icmp eq i64 %and, %shl`
			`ret i1 %cmp`
			`}`

			`define i1 @and_ncmp_variable_power_of_two(i32 %x, i32 %y) {`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-LABEL: and_ncmp_variable_power_of_two:`
			`; CHECK: # %bb.0:`
[Codegen] (X & (C l>>/<< Y)) ==/!= 0 --> ((X <</l>> Y) & C) ==/!= 0 fold Summary: This was originally reported in D62818. https://rise4fun.com/Alive/oPH InstCombine does the opposite fold, in hope that `C l>>/<< Y` expression will be hoisted out of a loop if `Y` is invariant and `X` is not. But as it is seen from the diffs here, if it didn't get hoisted, the produced assembly is almost universally worse. Much like with my recent "hoist add/sub by/from const" patches, we should get almost universal win if we hoist constant, there is almost always an "and/test by imm" instruction, but "shift of imm" not so much, so we may avoid having to materialize the immediate, and thus need one less register. And since we now shift not by constant, but by something else, the live-range of that something else may reduce. Special care needs to be applied not to disturb x86 `BT` / hexagon `tstbit` instruction pattern. And to not get into endless combine loop. Reviewers: RKSimon, efriedma, t.p.northover, craig.topper, spatel, arsenm Reviewed By: spatel Subscribers: hiraditya, MaskRay, wuzish, xbolva00, nikic, nemanjai, jvesely, wdng, nhaehnle, javed.absar, tpr, kristof.beyls, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62871 llvm-svn: 366955 2019-07-25 00:57:22 +02:00			`; CHECK-NEXT: srw 3, 3, 4`
			`; CHECK-NEXT: xori 3, 3, 1`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-NEXT: blr`
[PowerPC] Add a pattern for a runtime bit check Following a suggestion by Sanjay, we should lower: %shl = shl i32 1, %y %and = and i32 %x, %shl %cmp = icmp eq i32 %and, %shl ret i1 %cmp into: subfic r4, r4, 32 rlwnm r3, r3, r4, 31, 31 Add this pattern and some associated patterns for the 64-bit case and the not-equal case. Fixes PR27356. llvm-svn: 280454 2016-09-02 04:34:44 +02:00			`%shl = shl i32 1, %y`
			`%and = and i32 %x, %shl`
			`%cmp = icmp ne i32 %and, %shl`
			`ret i1 %cmp`
			`}`

			`define i1 @and_ncmp_variable_power_of_two_64(i64 %x, i64 %y) {`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-LABEL: and_ncmp_variable_power_of_two_64:`
			`; CHECK: # %bb.0:`
[Codegen] (X & (C l>>/<< Y)) ==/!= 0 --> ((X <</l>> Y) & C) ==/!= 0 fold Summary: This was originally reported in D62818. https://rise4fun.com/Alive/oPH InstCombine does the opposite fold, in hope that `C l>>/<< Y` expression will be hoisted out of a loop if `Y` is invariant and `X` is not. But as it is seen from the diffs here, if it didn't get hoisted, the produced assembly is almost universally worse. Much like with my recent "hoist add/sub by/from const" patches, we should get almost universal win if we hoist constant, there is almost always an "and/test by imm" instruction, but "shift of imm" not so much, so we may avoid having to materialize the immediate, and thus need one less register. And since we now shift not by constant, but by something else, the live-range of that something else may reduce. Special care needs to be applied not to disturb x86 `BT` / hexagon `tstbit` instruction pattern. And to not get into endless combine loop. Reviewers: RKSimon, efriedma, t.p.northover, craig.topper, spatel, arsenm Reviewed By: spatel Subscribers: hiraditya, MaskRay, wuzish, xbolva00, nikic, nemanjai, jvesely, wdng, nhaehnle, javed.absar, tpr, kristof.beyls, jsji, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D62871 llvm-svn: 366955 2019-07-25 00:57:22 +02:00			`; CHECK-NEXT: srd 3, 3, 4`
			`; CHECK-NEXT: xori 3, 3, 1`
[NFC][Codegen][PowerPC] Autogenerate shift-cmp.ll test Being affected by upcoming patch llvm-svn: 362529 2019-06-04 19:05:34 +02:00			`; CHECK-NEXT: blr`
[PowerPC] Add a pattern for a runtime bit check Following a suggestion by Sanjay, we should lower: %shl = shl i32 1, %y %and = and i32 %x, %shl %cmp = icmp eq i32 %and, %shl ret i1 %cmp into: subfic r4, r4, 32 rlwnm r3, r3, r4, 31, 31 Add this pattern and some associated patterns for the 64-bit case and the not-equal case. Fixes PR27356. llvm-svn: 280454 2016-09-02 04:34:44 +02:00			`%shl = shl i64 1, %y`
			`%and = and i64 %x, %shl`
			`%cmp = icmp ne i64 %and, %shl`
			`ret i1 %cmp`
			`}`