llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

History

Chandler Carruth 65613836e9 First major step toward addressing PR14059. This teaches SROA to handle cases where we have partial integer loads and stores to an otherwise promotable alloca to widen[1] those loads and stores to cover the entire alloca and bitcast them into the appropriate type such that promotion can proceed. These partial loads and stores stem from an annoying confluence of ARM's calling convention and ABI lowering and the FCA pre-splitting which takes place in SROA. Clang lowers a { double, double } in-register function argument as a [4 x i32] function argument to ensure it is placed into integer 32-bit registers (a really unnerving implicit contract between Clang and the ARM backend I would add). This results in a FCA load of [4 x i32]* from the { double, double } alloca, and SROA decomposes this into a sequence of i32 loads and stores. Inlining proceeds, code gets folded, but at the end of the day, we still have i32 stores to the low and high halves of a double alloca. Widening these to be i64 operations, and bitcasting them to double prior to loading or storing allows promotion to proceed for these allocas. I looked quite a bit changing the IR which Clang produces for this case to be more friendly, but small changes seem unlikely to help. I think the best representation we could use currently would be to pass 4 i32 arguments thereby avoiding any FCAs, but that would still require this fix. It seems like it might eventually be nice to somehow encode the ABI register selection choices outside of the parameter type system so that the parameter can be a { double, double }, but the CC register annotations indicate that this should be passed via 4 integer registers. This patch does not address the second problem in PR14059, which is the reverse: when a struct alloca is loaded as a larger single integer. This patch also does not address some of the code quality issues with the FCA-splitting. Those don't actually impede any optimizations really, but they're on my list to clean up. [1]: Pedantic footnote: for those concerned about memory model issues here, this is safe. For the alloca to be promotable, it cannot escape or have any use of its address that could allow these loads or stores to be racing. Thus, widening is always safe. llvm-svn: 165928		2012-10-15 08:40:30 +00:00
..
ADCE
ArgumentPromotion	Fix the remaining TCL-style quotes found in the testsuite. This is	2012-07-02 19:09:46 +00:00
BBVectorize	Allow BBVectorize to form non-2^n-length vectors.	2012-06-28 05:42:42 +00:00
BlockPlacement
BranchFolding
CodeExtractor
CodeGenPrepare	Move load_to_switch.ll to test/CodeGen/SPARC/	2012-09-19 09:25:03 +00:00
ConstantMerge
ConstProp	Fix the remaining TCL-style quotes found in the testsuite. This is	2012-07-02 19:09:46 +00:00
CorrelatedValuePropagation	CorrelatedPropagation: BasicBlock::removePredecessor can simplify PHI nodes. If the it's the condition of a SwitchInst, reload it.	2012-09-28 10:42:50 +00:00
DeadArgElim	Fix PR14016.	2012-10-09 08:13:15 +00:00
DeadStoreElimination	Make this test check the transforms it's actually doing. Also add a test that it	2012-09-25 18:17:38 +00:00
EarlyCSE	New EarlyCSE tests for CSE-ing across commutativity.	2012-10-09 16:58:13 +00:00
FunctionAttrs	Convert all tests using TCL-style quoting to use shell-style quoting.	2012-07-02 12:47:22 +00:00
GlobalDCE
GlobalOpt	GlobalOpt: non-constexpr bitcasts or GEPs can occur even if the global value is only stored once.	2012-09-28 10:01:27 +00:00
GVN	Make MemoryBuiltins aware of TargetLibraryInfo.	2012-08-29 15:32:21 +00:00
IndVarSimplify	Move TargetData to DataLayout.	2012-10-08 16:39:34 +00:00
Inline	rename test	2012-09-19 09:22:17 +00:00
InstCombine	instcombine: Migrate strcmp and strncmp optimizations	2012-10-15 03:47:37 +00:00
InstSimplify
Internalize
IPConstantProp	Convert all tests using TCL-style quoting to use shell-style quoting.	2012-07-02 12:47:22 +00:00
JumpThreading	JumpThreading: when default destination is the destination of some cases in a	2012-09-05 23:45:58 +00:00
LCSSA	Convert all tests using TCL-style quoting to use shell-style quoting.	2012-07-02 12:47:22 +00:00
LICM	LICM may hoist an instruction with undefined behavior above a trap.	2012-09-04 10:25:04 +00:00
LoopDeletion
LoopIdiom	LoopIdiom: Give up when the loop is not in canonical form.	2012-09-21 17:27:23 +00:00
LoopRotate	LoopRotation: Make the brute force DomTree update more brute force.	2012-09-02 11:57:22 +00:00
LoopSimplify	Convert all tests using TCL-style quoting to use shell-style quoting.	2012-07-02 12:47:22 +00:00
LoopStrengthReduce	LSR critical edge splitting fix for PR13756.	2012-09-18 17:51:33 +00:00
LoopUnroll	Fix tests that didn't test anything.	2012-09-26 09:51:39 +00:00
LoopUnswitch	Fix tests that didn't test anything.	2012-09-26 09:51:39 +00:00
LowerAtomic
LowerExpectIntrinsic
LowerInvoke
LowerSwitch
Mem2Reg
MemCpyOpt	In my recent change to avoid use of underaligned memory I didn't notice that	2012-10-04 13:53:21 +00:00
MergeFunc	Fix the remaining TCL-style quotes found in the testsuite. This is	2012-07-02 19:09:46 +00:00
MetaRenamer	Review feedback from Duncan Sands. Alphabetize includes and simplify	2012-09-14 19:19:57 +00:00
ObjCARC	Detect overflow in the path count computation. rdar://12277446.	2012-09-12 20:45:17 +00:00
PhaseOrdering	Add test case for r164850.	2012-09-29 00:12:08 +00:00
PruneEH	The normal edge of an invoke is not allowed to branch to a block with a	2012-08-10 20:55:20 +00:00
Reassociate	Stop reassociate from looking through expressions of arbitrary complexity. This	2012-07-26 09:26:40 +00:00
ScalarRepl	Port the global copy optimization from the SROA pass to InstCombine.	2012-08-21 08:39:44 +00:00
SCCP	Fix the remaining TCL-style quotes found in the testsuite. This is	2012-07-02 19:09:46 +00:00
SimplifyCFG	SimplifyCFG: Enhance the "remove CFG edge that leads to null pointer dereference" optimization to also handle instructions with multiple uses.	2012-10-04 16:11:49 +00:00
SimplifyLibCalls	instcombine: Migrate strcmp and strncmp optimizations	2012-10-15 03:47:37 +00:00
Sink
SROA	First major step toward addressing PR14059. This teaches SROA to handle	2012-10-15 08:40:30 +00:00
StripSymbols
TailCallElim	Convert the uses of '\|&' to use '2>&1 \|' instead, which works on old	2012-07-02 18:37:59 +00:00
TailDup	Convert the uses of '\|&' to use '2>&1 \|' instead, which works on old	2012-07-02 18:37:59 +00:00