PWR022: Move invariant conditional out of the loop to facilitate vectorization
Issue
Conditional evaluates to the same value for all loop iterations and can be moved outside of the loop to favor vectorization.
Actions
Move the invariant conditional outside of the loop by duplicating the loop body.
Relevance
Classical vectorization requirements do not allow branching inside the loop
body, which would mean no if
and switch
statements inside the loop body are
allowed. However, loop invariant conditionals can be extracted outside of the
loop to facilitate vectorization. Therefore, it is often good to extract
invariant conditional statements out of vectorizable loops to increase
performance. A conditional whose expression evaluates to the same value for all
iterations (i.e., a loop-invariant) can be safely moved outside the loop since
it will always be either true or false.
This optimization is called loop unswitching and the compilers can do it automatically in simple cases. However, in more complex cases, the compiler will omit this optimization and therefore it is beneficial to do it manually.
Code example
C
The following loop contains a condition that is invariant for all its iterations. Not only may this introduce an unnecessary redundant comparison, it may also make the vectorization of the loop more difficult for some compilers:
int example(int *A, int n) {
int total = 0;
for (int i = 0; i < n; ++i) {
if (n < 10) {
total++;
}
A[i] = total;
}
return total;
}
The loop invariant can be extracted out of the loop in a simple way, by duplicating the loop body and removing the condition. The resulting code is as follows:
int example(int *A, int n) {
int total = 0;
if (n < 10) {
for (int i = 0; i < n; ++i) {
A[i] = ++total;
}
} else {
for (int i = 0; i < n; ++i) {
A[i] = total;
}
}
return total;
}
Fortran
The following loop contains a condition that is invariant for all its iterations. Not only may this introduce an unnecessary redundant comparison, it may also make the vectorization of the loop more difficult for some compilers:
pure subroutine example(array)
integer, intent(out) :: array(:)
integer :: i, total
total = 0
do i = 1, size(array, 1)
if (size(array, 1) < 10) then
total = total + 1
end if
array(i) = total
end do
end subroutine example
The loop invariant can be extracted out of the loop in a simple way, by duplicating the loop body and removing the condition. The resulting code is as follows:
pure subroutine example(array)
integer, intent(out) :: array(:)
integer :: i, total
total = 0
if (size(array, 1) < 10) then
do i = 1, size(array, 1)
total = total + 1
array(i) = total
end do
else
do i = 1, size(array, 1)
array(i) = total
end do
end if
end subroutine example