mirror of
https://github.com/RPCS3/llvm-mirror.git
synced 2024-11-23 11:13:28 +01:00
17b2066590
We can often end up with conditional stores that cannot be speculated. They can come from fairly simple, idiomatic code: if (c & flag1) *a = x; if (c & flag2) *a = y; ... There is no dominating or post-dominating store to a, so it is not legal to move the store unconditionally to the end of the sequence and cache the intermediate result in a register, as we would like to. It is, however, legal to merge the stores together and do the store once: tmp = undef; if (c & flag1) tmp = x; if (c & flag2) tmp = y; if (c & flag1 || c & flag2) *a = tmp; The real power in this optimization is that it allows arbitrary length ladders such as these to be completely and trivially if-converted. The typical code I'd expect this to trigger on often uses binary-AND with constants as the condition (as in the above example), which means the ending condition can simply be truncated into a single binary-AND too: 'if (c & (flag1|flag2))'. As in the general case there are bitwise operators here, the ladder can often be optimized further too. This optimization involves potentially increasing register pressure. Even in the simplest case, the lifetime of the first predicate is extended. This can be elided in some cases such as using binary-AND on constants, but not in the general case. Threading 'tmp' through all branches can also increase register pressure. The optimization as in this patch is enabled by default but kept in a very conservative mode. It will only optimize if it thinks the resultant code should be if-convertable, and additionally if it can thread 'tmp' through at least one existing PHI, so it will only ever in the worst case create one more PHI and extend the lifetime of a predicate. This doesn't trigger much in LNT, unfortunately, but it does trigger in a big way in a third party test suite. llvm-svn: 252051
72 lines
2.0 KiB
C++
72 lines
2.0 KiB
C++
//===-- llvm/ADT/SetOperations.h - Generic Set Operations -------*- C++ -*-===//
|
|
//
|
|
// The LLVM Compiler Infrastructure
|
|
//
|
|
// This file is distributed under the University of Illinois Open Source
|
|
// License. See LICENSE.TXT for details.
|
|
//
|
|
//===----------------------------------------------------------------------===//
|
|
//
|
|
// This file defines generic set operations that may be used on set's of
|
|
// different types, and different element types.
|
|
//
|
|
//===----------------------------------------------------------------------===//
|
|
|
|
#ifndef LLVM_ADT_SETOPERATIONS_H
|
|
#define LLVM_ADT_SETOPERATIONS_H
|
|
|
|
namespace llvm {
|
|
|
|
/// set_union(A, B) - Compute A := A u B, return whether A changed.
|
|
///
|
|
template <class S1Ty, class S2Ty>
|
|
bool set_union(S1Ty &S1, const S2Ty &S2) {
|
|
bool Changed = false;
|
|
|
|
for (typename S2Ty::const_iterator SI = S2.begin(), SE = S2.end();
|
|
SI != SE; ++SI)
|
|
if (S1.insert(*SI).second)
|
|
Changed = true;
|
|
|
|
return Changed;
|
|
}
|
|
|
|
/// set_intersect(A, B) - Compute A := A ^ B
|
|
/// Identical to set_intersection, except that it works on set<>'s and
|
|
/// is nicer to use. Functionally, this iterates through S1, removing
|
|
/// elements that are not contained in S2.
|
|
///
|
|
template <class S1Ty, class S2Ty>
|
|
void set_intersect(S1Ty &S1, const S2Ty &S2) {
|
|
for (typename S1Ty::iterator I = S1.begin(); I != S1.end();) {
|
|
const auto &E = *I;
|
|
++I;
|
|
if (!S2.count(E)) S1.erase(E); // Erase element if not in S2
|
|
}
|
|
}
|
|
|
|
/// set_difference(A, B) - Return A - B
|
|
///
|
|
template <class S1Ty, class S2Ty>
|
|
S1Ty set_difference(const S1Ty &S1, const S2Ty &S2) {
|
|
S1Ty Result;
|
|
for (typename S1Ty::const_iterator SI = S1.begin(), SE = S1.end();
|
|
SI != SE; ++SI)
|
|
if (!S2.count(*SI)) // if the element is not in set2
|
|
Result.insert(*SI);
|
|
return Result;
|
|
}
|
|
|
|
/// set_subtract(A, B) - Compute A := A - B
|
|
///
|
|
template <class S1Ty, class S2Ty>
|
|
void set_subtract(S1Ty &S1, const S2Ty &S2) {
|
|
for (typename S2Ty::const_iterator SI = S2.begin(), SE = S2.end();
|
|
SI != SE; ++SI)
|
|
S1.erase(*SI);
|
|
}
|
|
|
|
} // End llvm namespace
|
|
|
|
#endif
|