llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Chris Lattner	a106725fc5	Land the long talked about "type system rewrite" patch. This patch brings numerous advantages to LLVM. One way to look at it is through diffstat: 109 files changed, 3005 insertions(+), 5906 deletions(-) Removing almost 3K lines of code is a good thing. Other advantages include: 1. Value::getType() is a simple load that can be CSE'd, not a mutating union-find operation. 2. Types a uniqued and never move once created, defining away PATypeHolder. 3. Structs can be "named" now, and their name is part of the identity that uniques them. This means that the compiler doesn't merge them structurally which makes the IR much less confusing. 4. Now that there is no way to get a cycle in a type graph without a named struct type, "upreferences" go away. 5. Type refinement is completely gone, which should make LTO much MUCH faster in some common cases with C++ code. 6. Types are now generally immutable, so we can use "Type " instead "const Type " everywhere. Downsides of this patch are that it removes some functions from the C API, so people using those will have to upgrade to (not yet added) new API. "LLVM 3.0" is the right time to do this. There are still some cleanups pending after this, this patch is large enough as-is. llvm-svn: 134829	2011-07-09 17:41:24 +00:00
Evan Cheng	c9e252df68	Change createAsmParser to take a MCSubtargetInfo instead of triple, CPU, and feature string. Parsing some asm directives can change subtarget state (e.g. .code 16) and it must be reflected in other modules (e.g. MCCodeEmitter). That is, the MCSubtargetInfo instance must be shared. llvm-svn: 134795	2011-07-09 05:47:46 +00:00
Jakob Stoklund Olesen	aef38c4f35	Oops, didn't mean to commit that. Spills should be hoisted out of loops, but we don't want to hoist them to dominating blocks at the same loop depth. That could cause the spills to be executed more often. llvm-svn: 134782	2011-07-09 01:02:44 +00:00
Jakob Stoklund Olesen	fe41eb3bda	Hoist spills within a basic block. Try to move spills as early as possible in their basic block. This can help eliminate interferences by shortening the live range being spilled. This fixes PR10221. llvm-svn: 134776	2011-07-09 00:25:03 +00:00
Cameron Zwarich	c23366d357	Add an intrinsic and codegen support for fused multiply-accumulate. The intent is to use this for architectures that have a native FMA instruction. llvm-svn: 134742	2011-07-08 21:39:21 +00:00
Jakob Stoklund Olesen	acaf9e9ce1	Be more aggressive about following hints. RAGreedy::tryAssign will now evict interference from the preferred register even when another register is free. To support this, add the EvictionCost struct that counts how many hints are broken by an eviction. We don't want to break one hint just to satisfy another. Rename canEvict to shouldEvict, and add the first bit of eviction policy that doesn't depend on spill weights: Always make room in the preferred register as long as the evictees can be split and aren't already assigned to their preferred register. Also make the CSR avoidance more accurate. When looking for a cheaper register it is OK to use a new volatile register. Only CSR aliases that have never been used before should be avoided. llvm-svn: 134735	2011-07-08 20:46:18 +00:00
Devang Patel	31505d2d78	Refactor. llvm-svn: 134703	2011-07-08 17:09:57 +00:00
Devang Patel	756482ca98	Make provision to have floating point constants in .debug_loc expressions. llvm-svn: 134702	2011-07-08 16:49:43 +00:00
Benjamin Kramer	560b1d3295	Apparently we can't expect a BinaryOperator here. Should fix llvm-gcc selfhost. llvm-svn: 134699	2011-07-08 12:08:24 +00:00
Benjamin Kramer	44c76d239a	Emit a more efficient magic number multiplication for exact sdivs. We have to do this in DAGBuilder instead of DAGCombiner, because the exact bit is lost after building. struct foo { char x[24]; }; long bar(struct foo a, struct foo b) { return a-b; } is now compiled into movl 4(%esp), %eax subl 8(%esp), %eax sarl $3, %eax imull $-1431655765, %eax, %eax instead of movl 4(%esp), %eax subl 8(%esp), %eax movl $715827883, %ecx imull %ecx movl %edx, %eax shrl $31, %eax sarl $2, %edx addl %eax, %edx movl %edx, %eax llvm-svn: 134695	2011-07-08 10:31:30 +00:00
Evan Cheng	50f2d8d304	Eliminate asm parser's dependency on TargetMachine: - Each target asm parser now creates its own MCSubtatgetInfo (if needed). - Changed AssemblerPredicate to take subtarget features which tablegen uses to generate asm matcher subtarget feature queries. e.g. "ModeThumb,FeatureThumb2" is translated to "(Bits & ModeThumb) != 0 && (Bits & FeatureThumb2) != 0". llvm-svn: 134678	2011-07-08 01:53:10 +00:00
Eric Christopher	f3059adb4b	Remove a FIXME. All of the standard ones are in the list. llvm-svn: 134647	2011-07-07 22:29:03 +00:00
Devang Patel	f97af90b4a	Add DEBUG message. llvm-svn: 134643	2011-07-07 21:44:42 +00:00
Devang Patel	ff0a35a206	If known DebugLocs do not match then two DBG_VALUE machine instructions are not identical. For example, DBG_VALUE 3.310000e+02, 0, !"ds"; dbg:sse.stepfft.c:138:18 @[ sse.stepfft.c:32:10 ] DBG_VALUE 3.310000e+02, 0, !"ds"; dbg:sse.stepfft.c:138:18 @[ sse.stepfft.c:31:10 ] These two MIs represent identical value, 3.31..., for one variable, ds, but they are not identical because the represent two separate instances of inlined variable "ds". llvm-svn: 134620	2011-07-07 17:45:33 +00:00
Lang Hames	9e52663aa4	Add functions 'hasPredecessor' and 'hasPredecessorHelper' to SDNode. The hasPredecessorHelper function allows predecessors to be cached to speed up repeated invocations. This fixes PR10186. X.isPredecessorOf(Y) now just calls Y.hasPredecessor(X) Y.hasPredecessor(X) calls Y.hasPredecessorHelper(X, Visited, Worklist) with empty Visited and Worklist sets (i.e. no caching over invocations). Y.hasPredecessorHelper(X, Visited, Worklist) caches search state in Visited and Worklist to speed up repeated calls. The Visited set is searched for X before going to the worklist to further search the DAG if necessary. llvm-svn: 134592	2011-07-07 04:31:51 +00:00
Devang Patel	a30ca05040	Add DEBUG messages. llvm-svn: 134572	2011-07-07 00:14:27 +00:00
Eli Friedman	293141407b	When tail-merging multiple blocks, make sure to correctly update the live-in list on the merged block to correctly account for the live-outs of all the predecessors. They might not be the same in all cases (the testcase I have involves a PHI node where one of the operands is an IMPLICIT_DEF). Unfortunately, the testcase I have is large and confidential, so I don't have a test to commit at the moment; I'll see if I can come up with something smaller where this issue reproduces. <rdar://problem/9716278> llvm-svn: 134565	2011-07-06 23:41:48 +00:00
Devang Patel	94f4eec7e4	Remove dead code. llvm-svn: 134561	2011-07-06 23:26:18 +00:00
Devang Patel	0abf331128	Typo. llvm-svn: 134559	2011-07-06 23:09:51 +00:00
Eric Christopher	91acbb256c	Grammar and 80-col. llvm-svn: 134555	2011-07-06 22:41:18 +00:00
Evan Cheng	dcd3ea7062	createMCInstPrinter doesn't need TargetMachine anymore. llvm-svn: 134525	2011-07-06 19:45:42 +00:00
Jakub Staszak	28bcc8673e	Introduce "expect" intrinsic instructions. llvm-svn: 134516	2011-07-06 18:22:43 +00:00
Dan Gohman	7927fe2250	Remove the ObjC ARC passes from the default optimization list, and add extension points to be used by clang. llvm-svn: 134444	2011-07-05 22:01:44 +00:00
Jakob Stoklund Olesen	57f59c98ed	Break infinite loop when the Hopfield network oscillates. This is impossible in theory, I can prove it. In practice, our near-zero threshold can cause the network to oscillate between equally good solutions. <rdar://problem/9720596> llvm-svn: 134428	2011-07-05 18:46:42 +00:00
Jakob Stoklund Olesen	f95a1068bd	Fix PR10277. Remat during spilling triggers dead code elimination. If a phi-def becomes unused, that may also cause live ranges to split into separate connected components. This type of splitting is different from normal live range splitting. In particular, there may not be a common original interval. When the split range is its own original, make sure that the new siblings are also their own originals. The range being split cannot be used as an original since it doesn't cover the new siblings. llvm-svn: 134413	2011-07-05 15:38:41 +00:00
Jakob Stoklund Olesen	c380e517fd	Tweak comment and debug output. llvm-svn: 134412	2011-07-05 15:38:37 +00:00
Rafael Espindola	29113212a6	Move early tail duplication earlier. This fixes the issue noted in PR10251 where early tail dup of bbs with indirectbr would cause a bb to be duplicated into a loop preheader and then into its predecessors, creating phi nodes with identical operands just before register allocation. This helps with jsinterp.o size (__TEXT goes from 163568 to 126656) and a bit with performance 1.005x faster on sunspider (jits still enabled). The result on webkit with the jit disabled is more significant: 1.021x faster. llvm-svn: 134372	2011-07-04 04:54:22 +00:00
Rafael Espindola	962773db64	Move most of the pre BB code to TailDuplicateAndUpdate. Change the HasIndirectbr variable to be just that. No functionality change. llvm-svn: 134371	2011-07-04 01:21:42 +00:00
Rafael Espindola	ce4f4ff705	Reduce indentation and fix the count of how many PHIs we have inserted. llvm-svn: 134370	2011-07-04 00:13:36 +00:00
Jakob Stoklund Olesen	9950d41b39	Fix PR10244. A split point inserted in a block with a landing pad successor may be hoisted above the call to ensure that it dominates all successors. The code that handles the rest of the basic block must take this into account. I am not including a test case, it would be very fragile. PR10244 comes from building clang with exceptions enabled. llvm-svn: 134369	2011-07-04 00:05:28 +00:00
Rafael Espindola	f04e6b50ca	Fix an easy fixme. llvm-svn: 134364	2011-07-03 05:26:42 +00:00
Rafael Espindola	cf67208057	Use getVNInfoAt. llvm-svn: 134312	2011-07-02 07:50:27 +00:00
Jakob Stoklund Olesen	b94d989634	Better diagnostics when inline asm fails to allocate. asm.c:2:7: error: ran out of registers during register allocation asm(""::"r"(0), "r"(1), "r"(2), "r"(3), "r"(4), "r"(5), "r"(6), "r"(7), "r"(8), "r"(9)); ^ llvm-svn: 134310	2011-07-02 07:17:37 +00:00
Rafael Espindola	a8c92aa8ef	Check the VN of the src register at the two copies, not just the register number. llvm-svn: 134309	2011-07-02 05:34:02 +00:00
Jakob Stoklund Olesen	c19c47697f	Include a source location when complaining about bad inline assembly. Add a MI->emitError() method that the backend can use to report errors related to inline assembly. Call it from X86FloatingPoint.cpp when the constraints are wrong. This enables proper clang diagnostics from the backend: $ clang -c pr30848.c pr30848.c:5:12: error: Inline asm output regs must be last on the x87 stack __asm__ ("" : "=u" (d)); /* { dg-error "output regs" } */ ^ 1 error generated. llvm-svn: 134307	2011-07-02 03:53:34 +00:00
Jakob Stoklund Olesen	60871c3ee0	Use a new strategy for preventing eviction loops in RAGreedy. Every live range is assigned a cascade number the first time it is involved in an eviction. As the evictor, it gets a new cascade number. Every evictee is assigned the same cascade number as the evictor. Eviction is prohibited if the evictor has a lower assigned cascade number than the evictee. This means that assigned cascade numbers are monotonically increasing with every eviction, yet they are bounded by NextCascade which can only be incremented by new live ranges. Thus, infinite loops cannot happen, but eviction cascades can still be triggered by new live ranges as we want. Thanks to Andy for explaining this to me. llvm-svn: 134303	2011-07-02 01:37:09 +00:00
Cameron Zwarich	6ea6623f23	Take a stab at fixing the llvm-x86_64-linux-checks failure. llvm-svn: 134287	2011-07-01 23:45:21 +00:00
Evan Cheng	e7e74a3250	Rename TargetSubtarget to TargetSubtargetInfo for consistency. llvm-svn: 134259	2011-07-01 21:01:15 +00:00
Duncan Sands	cfea0dd707	Disable commit 134216 ("Add 134199 back, but disable the optimization when the second copy is a kill") to see if it fixes the i386 dragonegg buildbot, which is timing out because gcc built with dragonegg is going into an infinite loop. llvm-svn: 134237	2011-07-01 12:01:00 +00:00
Rafael Espindola	ac24a57bdb	Avoid DenseMap lookup. llvm-svn: 134231	2011-07-01 04:15:02 +00:00
Rafael Espindola	0b7dda94fb	Fix off by one error. I misunderstood the comment about killedAt. llvm-svn: 134229	2011-07-01 03:31:29 +00:00
Rafael Espindola	0a0153608f	Check the liveinterval, not the kill flag. llvm-svn: 134228	2011-07-01 02:35:06 +00:00
Jakob Stoklund Olesen	20986ee7bb	Don't inflate register classes used by inline asm. The constraints are represented by the register class of the original virtual register created for the inline asm. If the register class were included in the operand descriptor, we might be able to do this. For now, just give up on regclass inflation when inline asm is involved. No test case, this bug hasn't happened yet. llvm-svn: 134226	2011-07-01 01:24:25 +00:00
Rafael Espindola	c09ce29b8b	Add 134199 back, but disable the optimization when the second copy is a kill. llvm-svn: 134216	2011-07-01 00:16:54 +00:00
Rafael Espindola	6201f80bc0	Revert my previous patch while I debug llvm-gcc bootstrap. llvm-svn: 134201	2011-06-30 22:58:17 +00:00
Rafael Espindola	63769912fc	Don't give up on coalescing A and B when we find A = X B = X Instead, proceed as if we had found A = X B = A llvm-svn: 134199	2011-06-30 22:24:13 +00:00
Rafael Espindola	83789b3b8d	Create a isFullCopy predicate. llvm-svn: 134189	2011-06-30 21:15:52 +00:00
Rafael Espindola	a324c7e6bb	Remove dead code. llvm-svn: 134148	2011-06-30 13:17:24 +00:00
Jakob Stoklund Olesen	58a24d9ecc	Reapply r134047 now that the world is ready for it. This patch will sometimes choose live range split points next to interference instead of always splitting next to a register point. That means spill code can now appear almost anywhere, and it was necessary to fix code that didn't expect that. The difficult places were: - Between a CALL returning a value on the x87 stack and the corresponding FpPOP_RETVAL (was FpGET_ST0). Probably also near x87 inline assembly, but that didn't actually show up in testing. - Between a CALL popping arguments off the stack and the corresponding ADJCALLSTACKUP. Both are fixed now. The only place spill code can't appear is after terminators, see SplitAnalysis::getLastSplitPoint. Original commit message: Rewrite RAGreedy::splitAroundRegion, now with cool ASCII art. This function has to deal with a lot of special cases, and the old version got it wrong sometimes. In particular, it would sometimes leave multiple uses in the stack interval in a single block. That causes bad code with multiple reloads in the same basic block. The new version handles block entry and exit in a single pass. It first eliminates all the easy cases, and then goes on to create a local interval for the blocks with difficult interference. Previously, we would only create the local interval for completely isolated blocks. It can happen that the stack interval becomes completely empty because we could allocate a register in all edge bundles, and the new local intervals deal with the interference. The empty stack interval is harmless, but we need to remove a SplitKit assertion that checks for empty intervals. llvm-svn: 134125	2011-06-30 01:30:39 +00:00
Eric Christopher	40578e7885	Remove getRegClassForInlineAsmConstraint and all dependencies. Fixes rdar://9643582 llvm-svn: 134123	2011-06-30 01:20:03 +00:00

1 2 3 4 5 ...

12080 Commits