mirror of
https://github.com/RPCS3/llvm-mirror.git
synced 2024-11-26 04:32:44 +01:00
994e1aad23
1. ReachingDefsAnalysis - Allows to identify for each instruction what is the “closest” reaching def of a certain register. Used by BreakFalseDeps (for clearance calculation) and ExecutionDomainFix (for arbitrating conflicting domains). 2. ExecutionDomainFix - Changes the variant of the instructions in order to minimize domain crossings. 3. BreakFalseDeps - Breaks false dependencies. 4. LoopTraversal - Creatws a traversal order of the basic blocks that is optimal for loops (introduced in revision L293571). Both ExecutionDomainFix and ReachingDefsAnalysis use this to determine the order they will traverse the basic blocks. This also included the following changes to ExcecutionDepsFix original logic: 1. BreakFalseDeps and ReachingDefsAnalysis logic no longer restricted by a register class. 2. ReachingDefsAnalysis tracks liveness of reg units instead of reg indices into a given reg class. Additional changes in affected files: 1. X86 and ARM targets now inherit from ExecutionDomainFix instead of ExecutionDepsFix. BreakFalseDeps also was added to the passes they activate. 2. Comments and references to ExecutionDepsFix replaced with ExecutionDomainFix and BreakFalseDeps, as appropriate. Additional refactoring changes will follow. This commit is (almost) NFC. The only functional change is that now BreakFalseDeps will break dependency for all register classes. Since no additional instructions were added to the list of instructions that have false dependencies, there is no actual change yet. In a future commit several instructions (and tests) will be added. This is the first of multiple patches that fix bugzilla https://bugs.llvm.org/show_bug.cgi?id=33869 Most of the patches are intended at refactoring the existent code. Additional relevant reviews: https://reviews.llvm.org/D40331 https://reviews.llvm.org/D40332 https://reviews.llvm.org/D40333 https://reviews.llvm.org/D40334 Differential Revision: https://reviews.llvm.org/D40330 Change-Id: Icaeb75e014eff96a8f721377783f9a3e6c679275 llvm-svn: 323087
23 lines
746 B
LLVM
23 lines
746 B
LLVM
; RUN: llc < %s -mcpu=cortex-a9 -mattr=+neon,+neonfp -float-abi=hard -mtriple armv7-linux-gnueabi | FileCheck %s
|
|
|
|
;; This test checks that the ExecutionDomainFix pass performs the domain changes
|
|
;; even when some dependencies are propagated through implicit definitions.
|
|
|
|
; CHECK: fun_a
|
|
define <4 x float> @fun_a(<4 x float> %in, <4 x float> %x, float %y) nounwind {
|
|
; CHECK: vext
|
|
; CHECK: vext
|
|
; CHECK: vadd.f32
|
|
%1 = insertelement <4 x float> %in, float %y, i32 0
|
|
%2 = fadd <4 x float> %1, %x
|
|
ret <4 x float> %2
|
|
}
|
|
; CHECK: fun_b
|
|
define <4 x i32> @fun_b(<4 x i32> %in, <4 x i32> %x, i32 %y) nounwind {
|
|
; CHECK: vmov.32
|
|
; CHECK: vadd.i32
|
|
%1 = insertelement <4 x i32> %in, i32 %y, i32 0
|
|
%2 = add <4 x i32> %1, %x
|
|
ret <4 x i32> %2
|
|
}
|