llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 19:23:23 +01:00

History

Chandler Carruth 0fe0191603 [MBP] Fix a really horrible bug in MachineBlockPlacement, but behind a flag for now. First off, thanks to Daniel Jasper for really pointing out the issue here. It's been here forever (at least, I think it was there when I first wrote this code) without getting really noticed or fixed. The key problem is what happens when two reasonably common patterns happen at the same time: we outline multiple cold regions of code, and those regions in turn have diamonds or other CFGs for which we can't just topologically lay them out. Consider some C code that looks like: if (a1()) { if (b1()) c1(); else d1(); f1(); } if (a2()) { if (b2()) c2(); else d2(); f2(); } done(); Now consider the case where a1() and a2() are unlikely to be true. In that case, we might lay out the first part of the function like: a1, a2, done; And then we will be out of successors in which to build the chain. We go to find the best block to continue the chain with, which is perfectly reasonable here, and find "b1" let's say. Laying out successors gets us to: a1, a2, done; b1, c1; At this point, we will refuse to lay out the successor to c1 (f1) because there are still un-placed predecessors of f1 and we want to try to preserve the CFG structure. So we go get the next best block, d1. ... wait for it ... Except that the next best block isn't d1. It is b2! d1 is waaay down inside these conditionals. It is much less important than b2. Except that this is exactly what we didn't want. If we keep going we get the entire set of the rest of the CFG interleaved!!! a1, a2, done; b1, c1; b2, c2; d1, f1; d2, f2; So we clearly need a better strategy here. =] My current favorite strategy is to actually try to place the block whose predecessor is closest. This very simply ensures that we unwind these kinds of CFGs the way that is natural and fitting, and should minimize the number of cache lines instructions are spread across. It also happens to be dead simple. It's like the datastructure was specifically set up for this use case or something. We only push blocks onto the work list when the last predecessor for them is placed into the chain. So the back of the worklist is the nearest next block. Unfortunately, a change like this is going to cause soooo many benchmarks to swing wildly. So for now I'm adding this under a flag so that we and others can validate that this is fixing the problems described, that it seems possible to enable, and hopefully that it fixes more of our problems long term. llvm-svn: 231238		2015-03-04 12:18:08 +00:00
..
Analysis	Make llvm.eh.begincatch use an outparam	2015-03-03 17:41:09 +00:00
Assembler	DebugInfo: Move new hierarchy into place	2015-03-03 17:24:31 +00:00
Bindings	DebugInfo: Move new hierarchy into place	2015-03-03 17:24:31 +00:00
Bitcode	[opaque pointer type] Add textual IR support for explicit type parameter to load instruction	2015-02-27 21:17:42 +00:00
BugPoint	DebugInfo: Move new hierarchy into place	2015-03-03 17:24:31 +00:00
CodeGen	[MBP] Fix a really horrible bug in MachineBlockPlacement, but behind	2015-03-04 12:18:08 +00:00
DebugInfo	[llvm-pdbdump] Display full enum definitions.	2015-03-04 06:09:53 +00:00
ExecutionEngine	[opaque pointer type] Add textual IR support for explicit type parameter to load instruction	2015-02-27 21:17:42 +00:00
Feature	DebugInfo: Move new hierarchy into place	2015-03-03 17:24:31 +00:00
FileCheck	FileCheck: Add CHECK-SAME	2015-02-26 04:53:00 +00:00
Instrumentation	[sanitizer/coverage] Add AFL-style coverage counters (search heuristic for fuzzing).	2015-03-03 23:27:02 +00:00
Integer	[opaque pointer type] Add textual IR support for explicit type parameter to load instruction	2015-02-27 21:17:42 +00:00
JitListener	DebugInfo: Move new hierarchy into place	2015-03-03 17:24:31 +00:00
Linker	DebugInfo: Move new hierarchy into place	2015-03-03 17:24:31 +00:00
LTO	[opaque pointer type] Add textual IR support for explicit type parameter to load instruction	2015-02-27 21:17:42 +00:00
MC	[MC][Target] Implement support for R_X86_64_SIZE{32,64}.	2015-03-04 06:49:39 +00:00
Object	Make llvm/test/Object/archive-format.test CRLF-tolerant.	2015-03-03 15:54:48 +00:00
Other	[opaque pointer type] Add textual IR support for explicit type parameter to load instruction	2015-02-27 21:17:42 +00:00
SymbolRewriter
TableGen
tools	[llvm-pdbdump] Display full enum definitions.	2015-03-04 06:09:53 +00:00
Transforms	[RewriteStatepointsForGC] Fix a relocation bug w.r.t values defined by invoke instructions	2015-03-04 00:13:52 +00:00
Unit
Verifier	Teach the verifier to enforce that the alignment argument of memory intrinsics must be a power of 2.	2015-03-02 09:35:06 +00:00
YAMLParser
.clang-format
CMakeLists.txt	Back out two accidental changes that snuck in with r229245. Sorry these	2015-02-14 09:05:58 +00:00
lit.cfg	Change SystemZ large tests to use the existing long_tests property	2015-03-02 19:34:11 +00:00
lit.site.cfg.in	Remove log statements from config scripts.	2015-02-22 07:31:42 +00:00
Makefile	Attempt to fix the builders.	2015-02-22 07:01:41 +00:00
Makefile.tests
TestRunner.sh