llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Benjamin Kramer	ffcbcb72ef	Erase instructions _after_ checking their type. llvm-svn: 132256	2011-05-28 11:48:37 +00:00
John McCall	119a0222f5	Implement and document the llvm.eh.resume intrinsic, which is transformed by the inliner into a branch to the enclosing landing pad (when inlined through an invoke). If not so optimized, it is lowered DWARF EH preparation into a call to _Unwind_Resume (or _Unwind_SjLj_Resume as appropriate). Its chief advantage is that it takes both the exception value and the selector value as arguments, meaning that there is zero effort in recovering these; however, the frontend is required to pass these down, which is not actually particularly difficult. Also document the behavior of landing pads a bit better, and make it clearer that it's okay that personality functions don't always land at landing pads. This is just a fact of life. Don't write optimizations that rely on pushing things over an unwind edge. llvm-svn: 132253	2011-05-28 07:45:59 +00:00
Charles Davis	6702c786ed	When generating code for Win64 EH, emit StartProc and EndProc directives. llvm-svn: 132250	2011-05-28 04:21:04 +00:00
Jakob Stoklund Olesen	56bd697f79	Create two BlockInfo entries when a live range is discontinuous through a block. Delete the Kill and Def markers in BlockInfo. They are no longer necessary when BlockInfo describes a continuous live range. This only affects the relatively rare kind of basic block where a live range looks like this: \|---x o---\| Now live range splitting can pretend that it is looking at two blocks: \|---x o---\| This allows the code to be simplified a bit. llvm-svn: 132245	2011-05-28 02:33:00 +00:00
Jakob Stoklund Olesen	394d198d2e	Add SplitAnalysis::getNumLiveBlocks(). It is important that this function returns the same number of live blocks as countLiveBlocks(CurLI) because live range splitting uses the number of live blocks to ensure it is making progress. This is in preparation of supporting duplicate UseBlock entries for basic blocks that have a virtual register live-in and live-out, but not live-though. llvm-svn: 132244	2011-05-28 02:32:57 +00:00
Devang Patel	93e78e996f	Select DW_AT_const_value size based on global variable size. llvm-svn: 132239	2011-05-28 00:39:18 +00:00
Rafael Espindola	386c4259db	Fix the root cause of the bootstrap failure: There was no way to check if a given register/mode pair was valid. We now return an error code (-2) instead of asserting. If anyone thinks that an assert at this point is really needed, we can autogen a hasValidDwarfRegNum instead. llvm-svn: 132236	2011-05-28 00:13:01 +00:00
Charles Davis	cf8d922dbe	Stub out support for Win64-style exceptions. Note that this is merely using the Win64 EH mechanism to implement GCC-style exceptions. LLVM supports hardly anything else at this point! llvm-svn: 132234	2011-05-27 23:47:32 +00:00
Rafael Espindola	9ce5cebde6	Fix a regression I recently introduced by removing DwarfRegNum of subregisters: When a value is in a subregister, at least report the location as being the superregister. We should extend the .td files to encode the bit range so that we can produce a DW_OP_bit_piece. llvm-svn: 132224	2011-05-27 22:15:01 +00:00
Rafael Espindola	2230168a0f	Make size computation less brittle. llvm-svn: 132222	2011-05-27 22:05:41 +00:00
Charles Davis	cb20ea9935	Add the suffix to the Win64 EH data sections' names if given. Add a test for this. XFAIL'd, because the COFF AsmParser can't handle .section yet. llvm-svn: 132220	2011-05-27 21:38:47 +00:00
Nadav Rotem	531aa71d22	Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. Re-apply 131534 and fix the multi-step promotion of integers. llvm-svn: 132217	2011-05-27 21:03:13 +00:00
Devang Patel	2872ac051d	Keep this simple. Use DIType to get signness and size of a type. Based on size, select appropraite form. llvm-svn: 132206	2011-05-27 19:13:26 +00:00
Devang Patel	c0bffe6366	Handle signed types gracefully. This fixes regressions reported by buildbots as a fallout of r132193. llvm-svn: 132197	2011-05-27 18:15:52 +00:00
Devang Patel	62a7038a9f	Select DW_AT_const_value size based on variable size. llvm-svn: 132193	2011-05-27 16:45:18 +00:00
Cameron Zwarich	a9c418b1c3	Fix PR10029 - VerifyCoalescing failure on patterns_dfa.c of 445.gobmk. llvm-svn: 132181	2011-05-27 05:04:51 +00:00
Devang Patel	177dbe2de1	Add comment. llvm-svn: 132149	2011-05-26 21:49:28 +00:00
Devang Patel	e0b7ab9296	During branch folding avoid inserting redundant DBG_VALUE machine instructions. llvm-svn: 132148	2011-05-26 21:47:59 +00:00
Charles Davis	4becdc727e	Revert r132111. I built Release (without Asserts), so I didn't know about the assert that prevented setting alignment on section creation. llvm-svn: 132113	2011-05-26 05:35:55 +00:00
Charles Davis	44fb280873	Align Win64 EH Table sections to 4 bytes. llvm-svn: 132111	2011-05-26 05:19:54 +00:00
Stuart Hastings	837a958ff6	Reverting 132105: it broke some LLVM-GCC DejaGNU tests. llvm-svn: 132108	2011-05-26 04:09:49 +00:00
Stuart Hastings	e704bfb21e	Correctly handle a one-word struct passed byval on x86_64. rdar://problem/6920088 llvm-svn: 132105	2011-05-26 02:44:56 +00:00
Jakob Stoklund Olesen	6c654330dc	Add a RAGreedy::canEvict function. This doesn't change functionality (much), but it allows for a more fine-grained eviction policy. The current policy only compares spill weights, and that is not always the best thing to do. Spill weights are designed to serve linear scan, and they don't consider live range splitting. Add a mechanism so canEvict() can request that a live range be evicted and split/spilled. This is to avoid infinite eviction loops. llvm-svn: 132101	2011-05-25 23:58:36 +00:00
Eli Friedman	93ffb875ad	Rewrite fast-isel integer cast handling to handle more cases, and to be simpler and more consistent. The practical effects here are that x86-64 fast-isel can now handle trunc from i8 to i1, and ARM fast-isel can handle many more constructs involving integers narrower than 32 bits (including loads, stores, and many integer casts). rdar://9437928 . llvm-svn: 132099	2011-05-25 23:49:02 +00:00
Devang Patel	7e814e1e36	Remove unused statistical counter. llvm-svn: 132087	2011-05-25 21:55:40 +00:00
Rafael Espindola	70213c7c5f	Replace the -unwind-tables option with a per function flag. This is more LTO friendly as we can now correctly merge files compiled with or without -fasynchronous-unwind-tables. llvm-svn: 132033	2011-05-25 03:44:17 +00:00
Devang Patel	0b44360610	Remove dead code. llvm-svn: 131974	2011-05-24 18:27:52 +00:00
Rafael Espindola	8e44a53856	Explain FIXME. llvm-svn: 131952	2011-05-24 03:10:31 +00:00
Rafael Espindola	176fe6a0e0	Fix the defaults for .eh_frame. We were marking it as writable. llvm-svn: 131951	2011-05-24 02:50:20 +00:00
Evan Cheng	b5950697e8	- Teach SelectionDAG::isKnownNeverZero to return true (op x, c) when c is non-zero. - Teach X86 cmov optimization to eliminate the cmov from ctlz, cttz extension when the source of X86ISD::BSR / X86ISD::BSF is proven to be non-zero. rdar://9490949 llvm-svn: 131948	2011-05-24 01:48:22 +00:00
Devang Patel	5bce258c3d	Fix debug info for blocks' variable. llvm-svn: 131940	2011-05-24 00:22:25 +00:00
Devang Patel	8a90970a54	Remove unnecessary comment. llvm-svn: 131936	2011-05-23 23:16:14 +00:00
Devang Patel	e829168e05	Revert 121907 (it causes llc crash) and apply original patch from PR9817. llvm-svn: 131926	2011-05-23 22:04:42 +00:00
Devang Patel	37ab34a49f	Preserve debug info during iSel by keeping DanglingDebugInfoMap live until end of function. Patch by Micah Villmow llvm-svn: 131908	2011-05-23 17:44:13 +00:00
Devang Patel	5920de5c8c	While replacing all uses of a SDValue with another value, do not forget to transfer SDDbgValue. llvm-svn: 131907	2011-05-23 17:35:08 +00:00
Chris Lattner	ca9723d80b	Eliminate some temporary variables, and don't call getByValTypeAlignment when we're just going to throw the result away. No functionality change. llvm-svn: 131880	2011-05-22 23:23:02 +00:00
Chris Lattner	c383dd2c87	eliminate dependence on StandardPasses.h. The code generator's pass pipeline should eventually convert to PMBuilder, but I don't plan to do this. llvm-svn: 131819	2011-05-22 00:13:44 +00:00
Benjamin Kramer	df3070e83d	Implement mulo x, 2 -> addo x, x in DAGCombiner. llvm-svn: 131800	2011-05-21 18:31:55 +00:00
Cameron Zwarich	00ec3d1f9f	Fix PR9962 by properly constraining register classes in RemoveCopyByCommutingDef(). This actually fixes most of the VerifyCoalescing failures in test-suite. llvm-svn: 131768	2011-05-20 23:25:36 +00:00
Charles Davis	445c467ced	Fix typo. When will I learn? llvm-svn: 131765	2011-05-20 22:23:34 +00:00
Charles Davis	989cc73ef3	Add .pdata and .xdata sections to the COFF TLOF implementation. llvm-svn: 131763	2011-05-20 22:13:55 +00:00
Jim Grosbach	909aff492f	No reason not to allow defining the CFA as a reg w/ offset zero. llvm-svn: 131760	2011-05-20 21:50:09 +00:00
Jim Grosbach	c4a65b2613	Fix typo. llvm-svn: 131757	2011-05-20 21:35:39 +00:00
Jim Grosbach	b2b33616c0	Add support for frame info use of the .cfi_def_cfa directive. llvm-svn: 131756	2011-05-20 21:23:17 +00:00
Cameron Zwarich	a487989a73	Fix PR9960 by teaching SimpleRegisterCoalescing::AdjustCopiesBackFrom() to preserve the phikill flag. llvm-svn: 131717	2011-05-20 03:54:04 +00:00
Cameron Zwarich	4e8f708cbb	Fix PR9955 by only attaching load memory operands to load instructions and similarly for stores. Now "make check" passes with the MachineVerifier forced on with the VerifyCoalescing option! llvm-svn: 131705	2011-05-19 23:44:34 +00:00
Stuart Hastings	a68d5a99c0	Update some currently-disabled code, preparing for eventual use. llvm-svn: 131663	2011-05-19 18:48:20 +00:00
Cameron Zwarich	6fbc514611	Revert r128961 because it didn't include a test and causes the verifier to fail on CodeGen/X86/2007-05-07-InvokeSRet.ll. There is probably a bug here that was fixed by r128961, but since there is no test or reference to a source file I have to revert it. llvm-svn: 131618	2011-05-19 01:56:19 +00:00
Duncan Sands	d3292b9f1e	Revert commit 131534 since it seems to have broken several buildbots. Original log entry: Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. llvm-svn: 131536	2011-05-18 14:57:56 +00:00
Nadav Rotem	b7d689c706	Refactor getActionType and getTypeToTransformTo ; place all of the 'decision' code in one place. llvm-svn: 131534	2011-05-18 12:26:38 +00:00
Jakob Stoklund Olesen	2310f43683	Eliminate dead dead code elimination code. llvm-svn: 131524	2011-05-18 04:51:15 +00:00
Jakob Stoklund Olesen	cd6bbe0171	Also use shrinkToUses after AdjustCopiesBackFrom(). The 'last use' may not be in the same basic block, and we still want a correct live range. llvm-svn: 131523	2011-05-18 04:51:12 +00:00
Jakob Stoklund Olesen	504b1bd8ab	Properly shrink live ranges after deleting dead copies. Clean up after all joined copies. LiveInterval::shrinkToUses recomputes the live range from scratch instead of removing snippets. This should avoid the problem with dangling live ranges. Leave physreg identity copies alone. They can be created when joining a virtreg with a physreg. They don't affect register allocation, and they will be removed by the rewriter. llvm-svn: 131521	2011-05-18 04:18:19 +00:00
Eli Friedman	ade6f78867	Make fast-isel miss counting in -stats and -fast-isel-verbose take terminators into account; since there are many fewer isel misses with recent changes, misses caused by terminators are more significant. llvm-svn: 131502	2011-05-17 23:02:10 +00:00
Dan Gohman	834a63625d	Misc. code cleanups. llvm-svn: 131497	2011-05-17 22:22:52 +00:00
Dan Gohman	5d23a937f7	Misc. code cleanups. llvm-svn: 131495	2011-05-17 22:20:36 +00:00
Stuart Hastings	581113d8a0	Revert 131467 due to buildbot complaint. llvm-svn: 131469	2011-05-17 16:59:46 +00:00
Stuart Hastings	a2509a7ec3	Fix an obscure issue in X86_64 parameter passing: if a tiny byval is passed as the fifth parameter, insure it's passed correctly (in R9). rdar://problem/6920088 llvm-svn: 131467	2011-05-17 16:45:55 +00:00
Jakob Stoklund Olesen	6645e60d56	Tweak cross-class coalescing to be more aggressive when the target class is small. The greedy register allocator has live range splitting and register class inflation, so it can actually fully undo this join, including restoring the original register classes. We still don't want to do this for long live ranges, mostly because of the high register pressure of there are many constrained live ranges overlapping. llvm-svn: 131466	2011-05-17 16:38:37 +00:00
Jakob Stoklund Olesen	16f11212fc	Teach LiveInterval::isZeroLength about null SlotIndexes. When instructions are deleted, they leave tombstone SlotIndex entries. The isZeroLength method should ignore these null indexes. This causes RABasic to sometimes spill a callee-saved register in the abi-isel.ll test, so don't run that test with -regalloc=basic. Prioritizing register allocation according to spill weight can cause more registers to be used. llvm-svn: 131436	2011-05-16 23:50:05 +00:00
Dan Gohman	9a55240376	Delete unused variables. llvm-svn: 131430	2011-05-16 22:19:54 +00:00
Dan Gohman	9ea3bcc685	Trim #includes. llvm-svn: 131429	2011-05-16 22:14:50 +00:00
Dan Gohman	2d7dc7849f	Fix whitespace and 80-column violations. llvm-svn: 131428	2011-05-16 22:09:53 +00:00
Jim Grosbach	59a8c1aa5b	Track how many insns fast-isel successfully selects as well as how many it misses. llvm-svn: 131426	2011-05-16 21:51:07 +00:00
Devang Patel	9f871794b6	Preserve debug info for unused zero extended boolean argument. Radar 9422775. llvm-svn: 131422	2011-05-16 21:24:05 +00:00
Eli Friedman	cb60e2293f	Make fast-isel work correctly s/uadd.with.overflow intrinsics. llvm-svn: 131420	2011-05-16 21:06:17 +00:00
Eli Friedman	bd237ec411	Fix silly typo. llvm-svn: 131419	2011-05-16 20:34:53 +00:00
Eli Friedman	5f1b7e4153	Basic fast-isel of extractvalue. Not too helpful on its own, given the IR clang generates for cases like this, but it should become more useful soon. llvm-svn: 131417	2011-05-16 20:27:46 +00:00
Rafael Espindola	113d944ce4	Don't do tail calls in a function that call setjmp. The stack might be corrupted when setjmp returns again. llvm-svn: 131399	2011-05-16 03:05:33 +00:00
Eli Friedman	559f908ca7	Fix a FIXME by moving the fast-isel implementation of the objectsize intrinsic from the x86 code to the generic code. llvm-svn: 131332	2011-05-14 00:47:51 +00:00
Rafael Espindola	95d9ad78ea	Make codegen able to handle values of empty types. This is one way to fix PR9900. I will keep it open until sable is able to comment on it. llvm-svn: 131294	2011-05-13 15:18:06 +00:00
Stuart Hastings	5fb280fd39	Since I can't reproduce the failures from 131261, re-trying with a simplified version. <rdar://problem/9298790> llvm-svn: 131274	2011-05-13 00:51:54 +00:00
Stuart Hastings	b362a9bcc6	Revert 131266 and 131261 due to buildbot complaints. rdar://problem/9298790 llvm-svn: 131269	2011-05-13 00:15:17 +00:00
Stuart Hastings	d106d72681	Non-fast-isel followup to 129634; correctly handle branches controlled by non-CMP expressions. The executable test case (129821) would test this as well, if we had an "-O0 -disable-arm-fast-isel" LLVM-GCC tester. Alas, the ARM assembly would be very difficult to check with FileCheck. The thumb2-cbnz.ll test is affected; it generates larger code (tst.w vs. cmp #0), but I believe the new version is correct. rdar://problem/9298790 llvm-svn: 131261	2011-05-12 23:36:41 +00:00
Evan Cheng	43393670c9	Update comment. llvm-svn: 131258	2011-05-12 22:35:48 +00:00
Devang Patel	dd08ae41c6	Doug convinced me that DW_AT_APPLE_objc_complete_type is more appropriate name. s/DW_AT_APPLE_objc_class_extension/DW_AT_APPLE_objc_complete_type/g llvm-svn: 131244	2011-05-12 21:29:42 +00:00
Evan Cheng	f3eb9e3262	Re-enable branchfolding common code hoisting optimization. Fixed a liveness test bug and also taught it to update liveins. llvm-svn: 131241	2011-05-12 20:30:01 +00:00
Devang Patel	b865dd6a20	Let Objective-C front-end identify class extension, in dwarf output, using an attribute DW_AT_APPLE_objc_class_extension. llvm-svn: 131238	2011-05-12 19:06:16 +00:00
Evan Cheng	2c6e581865	Temporarily disable the transformation. It's breaking 186.crafty in some configuration. llvm-svn: 131235	2011-05-12 18:44:58 +00:00
Evan Cheng	5ff60c7364	Re-commit 131172 with fix. MachineInstr identity checks should check dead markers. In some cases a register def is dead on one path, but not on another. This is passing Clang self-hosting. llvm-svn: 131214	2011-05-12 00:56:58 +00:00
Devang Patel	344808fbe5	Identify end of prologue (and beginning of function body) using DW_LNS_set_prologue_end line table opcode. llvm-svn: 131194	2011-05-11 19:22:19 +00:00
Jakob Stoklund Olesen	0e9c54a422	Avoid hoisting spills when looking at a copy from another register that is also about to be spilled. This can only happen when two extra snippet registers are included in the spill, and there is a copy between them. Hoisting the spill creates problems because the hoist will mark the copy for later dead code elimination, and spilling the second register will turn the copy into a spill. <rdar://problem/9420853> llvm-svn: 131192	2011-05-11 18:25:10 +00:00
Nadav Rotem	57dd315a3b	Fixes a bug in the DAGCombiner. LoadSDNodes have two values (data, chain). If there is a store after the load node, then there is a chain, which means that there is another user. Thus, asking hasOneUser would fail. Instead we ask hasNUsesOfValue on the 'data' value. llvm-svn: 131183	2011-05-11 14:40:50 +00:00
Rafael Espindola	dfc30289f1	Revert 131172 as it is causing clang to miscompile itself. I will try to provide a reduced testcase. llvm-svn: 131176	2011-05-11 03:27:17 +00:00
Bill Wendling	453a924d29	Give the 'eh.sjlj.dispatchsetup' intrinsic call the value coming from the setjmp intrinsic call. This prevents it from being reordered so that it appears before the setjmp intrinsic (thus making it completely useless). <rdar://problem/9409683> llvm-svn: 131174	2011-05-11 01:11:55 +00:00
Evan Cheng	271e0ebf0a	Add a late optimization to BranchFolding that hoist common instruction sequences at the start of basic blocks to their common predecessor. It's actually quite common (e.g. about 50 times in JM/lencod) and has shown to be a nice code size benefit. e.g. pushq %rax testl %edi, %edi jne LBB0_2 ## BB#1: xorb %al, %al popq %rdx ret LBB0_2: xorb %al, %al callq _foo popq %rdx ret => pushq %rax xorb %al, %al testl %edi, %edi je LBB0_2 ## BB#1: callq _foo LBB0_2: popq %rdx ret rdar://9145558 llvm-svn: 131172	2011-05-11 01:03:01 +00:00
Rafael Espindola	26a7112c00	Initialize moveTypeModule. llvm-svn: 131157	2011-05-10 21:54:59 +00:00
Eli Friedman	aecbcc0c8f	Disable my little CopyToReg argument hack with fast-isel. rdar://problem/9413587 . llvm-svn: 131156	2011-05-10 21:50:58 +00:00
Stuart Hastings	e589a29764	Correctly walk through nested and adjacent CALLSEQ_START nodes. No test case; I've only seen this on a release branch, and I can't get it to reproduce on trunk. rdar://problem/7662569 llvm-svn: 131152	2011-05-10 21:20:03 +00:00
Rafael Espindola	46b0ce1b5f	Produce a __debug_frame section on darwin ARM when appropriate. llvm-svn: 131151	2011-05-10 21:04:45 +00:00
Rafael Espindola	9b3e6a9cb4	Rename DwarfRequiresRelocationForStmtList to DwarfRequiresRelocationForSectionOffset as this is not specific to StmtList. llvm-svn: 131148	2011-05-10 20:35:05 +00:00
Rafael Espindola	92dc58fea6	Use .cfi_sections to put the unwind info in .debug_frame when possible. With this clang will use .debug_frame in, for example, clang -g -c -m32 test.c This matches gcc's behaviour. It looks like .debug_frame is a bit bigger than .eh_frame, but has the big advantage of not being allocated. llvm-svn: 131140	2011-05-10 18:39:09 +00:00
Jakob Stoklund Olesen	1dfbef8150	Fix PR9883. Make sure all caches are invalidated when a live range is repaired. The previous invalidation missed the alias interference caches. Also add a stats counter for the number of repaired ranges. llvm-svn: 131133	2011-05-10 17:37:41 +00:00
Devang Patel	38193e7550	Do not ignore InlinedAt while walking up scope chain to find subprogram node. llvm-svn: 131106	2011-05-09 22:14:49 +00:00
Eric Christopher	ba60d3d108	Look through struct wrapped types for inline asm statments. Patch by Evan Cheng. llvm-svn: 131093	2011-05-09 20:04:43 +00:00
Duncan Sands	afe16d8de1	Indent properly, no functionality change. llvm-svn: 131082	2011-05-09 08:03:33 +00:00
Jakob Stoklund Olesen	56a5573e6f	Remove an assertion to fix PR9872. It can happen that a live debug variable is the last use of a sub-register, and the register allocator will pick a larger register class for the virtual register. If the allocated register doesn't support the sub-register index, just use %noreg for the debug variables instead of asserting. In PR9872, a debug variable ends up in the sub_8bit_hi part of a GR32_ABCD register. The register is split and one part is inflated to GR32 and assigned %ESI because there are no more normal uses of sub_8bit_hi. Since %ESI doesn't have that sub-register, substPhysReg asserted. Now it will simply insert a %noreg instead, and the debug variable will be marked unavailable in that range. We don't currently have a way of saying: !"value" is in bits 8-15 of %ESI, I don't know if DWARF even supports that. llvm-svn: 131073	2011-05-08 19:21:08 +00:00
Jakob Stoklund Olesen	bb09bbccb8	Emit a proper error message when register allocators run out of registers. This can't be just an assertion, users can always write impossible inline assembly. Such an assembly statement should be included in the error message. llvm-svn: 131024	2011-05-06 21:58:30 +00:00
Andrew Trick	cb5a80566c	Added an assertion, and updated a comment. llvm-svn: 131022	2011-05-06 21:52:52 +00:00
Evan Cheng	3cda8bc78e	80 col violations. llvm-svn: 131015	2011-05-06 20:52:23 +00:00
Eli Friedman	12e590e760	Make the logic for determining function alignment more explicit. No functionality change. llvm-svn: 131012	2011-05-06 20:34:06 +00:00
Eli Friedman	219b955495	Use array_lengthof. No functional change. llvm-svn: 131008	2011-05-06 19:50:10 +00:00
Jakob Stoklund Olesen	d7896b41a3	Iterate backwards over debug locations when splitting them so they can be safely erased. This should unbreak dragonegg-i386-linux and build-self-4-mingw32. llvm-svn: 131007	2011-05-06 19:31:19 +00:00
Andrew Trick	7e699489f1	Typo: Reviewed by Alistair. llvm-svn: 131001	2011-05-06 18:14:32 +00:00
Jakob Stoklund Olesen	6e6ed39f02	Update LiveDebugVariables after live range splitting. After a virtual register is split, update any debug user variables that resided in the old register. This ensures that the LiveDebugVariables are still correct after register allocation. This may create DBG_VALUE instructions that place a user variable in a register in parts of the function and in a stack slot in other parts. DwarfDebug currently doesn't support that. llvm-svn: 130998	2011-05-06 18:00:02 +00:00
Jakob Stoklund Olesen	27f4581ec6	Use TargetMachine hooks to properly print debug variable locations. llvm-svn: 130997	2011-05-06 17:59:59 +00:00
Jakob Stoklund Olesen	04122feb16	Also count identity copies. llvm-svn: 130996	2011-05-06 17:59:57 +00:00
Andrew Trick	6e2ccb7b00	Post-RA scheduler compile time fix. Quadratic computation of DAG node depth. The post-ra scheduler was explicitly updating the depth of a node's successors after scheduling it, regardless of whether the successor was ready. This is quadratic for DAGs with transitively redundant edges. I simply removed the useless update of depth, which is lazilly computed later. Fixes <rdar://problem/9044332> compiler takes way too long to build TextInput. llvm-svn: 130992	2011-05-06 17:09:08 +00:00
Devang Patel	188fd17fe4	Move CompileUnit::getOrCreateNameSpace() and CompileUnit::addPubType() from DwarfDebug.cpp to DwarfCompileUnit.cpp llvm-svn: 130991	2011-05-06 16:57:54 +00:00
Rafael Espindola	a4de174c00	Nothing else uses this label. llvm-svn: 130989	2011-05-06 15:44:29 +00:00
Rafael Espindola	ef239d2fdc	Yet more dead code. llvm-svn: 130988	2011-05-06 15:31:55 +00:00
Rafael Espindola	4228ece8fd	Update comments. llvm-svn: 130987	2011-05-06 15:28:56 +00:00
Rafael Espindola	9b57a8739e	More dead code elimination. llvm-svn: 130985	2011-05-06 15:22:26 +00:00
Rafael Espindola	59462d8ae3	Dead code elimination. llvm-svn: 130984	2011-05-06 14:56:22 +00:00
Eli Friedman	f7b4d848ae	Re-revert r130877; it's apparently causing a regression on 197.parser, possibly related to cbnz formation. llvm-svn: 130977	2011-05-06 05:23:07 +00:00
Rafael Espindola	37aa9169a8	Remove DwarfTableException. llvm-svn: 130964	2011-05-05 23:19:54 +00:00
Rafael Espindola	f109f01ec2	Remove the DwarfTable enum. llvm-svn: 130959	2011-05-05 22:14:31 +00:00
Devang Patel	99147805e4	Remove little used statistical counter. llvm-svn: 130955	2011-05-05 22:00:08 +00:00
Rafael Espindola	a84abb2226	Implement a really simple DwarfSjLjException. llvm-svn: 130947	2011-05-05 20:48:31 +00:00
Rafael Espindola	a99111bae9	List all exception types in a switch. llvm-svn: 130944	2011-05-05 19:48:34 +00:00
Andrew Trick	cd459f78ef	ARM post RA scheduler compile time fix. BuildSchedGraph was quadratic in the number of calls in the basic block. After this fix, it keeps only a single call at the top of the DefList so compile time doesn't blow up on large blocks. This reduces postRA sched time on an external test case from 81s to 0.3s. Although r130800 (reduced ARM register alias defs) also partially fixes the issue by reducing the constant overhead of checking call interference by an order of magnitude. Fixes <rdar://problem/7662664> very poor compile time with post RA scheduling. llvm-svn: 130943	2011-05-05 19:32:21 +00:00
Andrew Trick	5946376390	whitespace llvm-svn: 130942	2011-05-05 19:24:06 +00:00
Owen Anderson	35f6bae989	Allow FastISel of three-register-operand instructions. llvm-svn: 130934	2011-05-05 17:59:04 +00:00
Devang Patel	3d0c5dc9fd	If debug info for inlined function is missing then handle it gracefully. llvm-svn: 130933	2011-05-05 17:54:26 +00:00
Jakob Stoklund Olesen	fa1e44c83e	Add some statistics to the splitting and spilling frameworks. llvm-svn: 130931	2011-05-05 17:22:53 +00:00
Eli Friedman	09ec41fcde	Avoid extra vreg copies for arguments passed in registers. Specifically, this can make MachineCSE more effective in some cases (especially in small functions). PR8361 / part of rdar://problem/8259436 . llvm-svn: 130928	2011-05-05 16:53:34 +00:00
Eli Friedman	367747bccd	Small syntax cleanup; we don't need to #define constants in C++. No functionality change intended. llvm-svn: 130926	2011-05-05 16:25:23 +00:00
Eli Friedman	9d2cd1cac4	Minor correction to r130877; fixes PR9846 and hopefully the buildbot failures. llvm-svn: 130925	2011-05-05 16:18:11 +00:00
Bill Wendling	7a13f3bde3	Remove a flag that would set the ".eh" symbol as .globl. MachO was the only one who used this flag, and it now emits CFI and doesn't emit this anymore. All other targets left this flag "false". <rdar://problem/8486371> llvm-svn: 130918	2011-05-05 06:49:15 +00:00
Jakob Stoklund Olesen	357867ae35	Disable physical register coalescing by default. Joining physregs is inherently dangerous because it uses a heuristic to avoid creating invalid code. Linear scan had an emergency spilling mechanism to deal with those rare cases. The new greedy allocator does not. The greedy register allocator is much better at taking hints, so this has almost no impact on code size and quality. The few cases where it matters show up as unit tests that now have -join-physregs enabled explicitly. llvm-svn: 130896	2011-05-04 23:59:00 +00:00
Bill Wendling	279e17e523	SjLj EH could produce a machine basic block that legitimately has more than one landing pad as its successor. SjLj exception handling jumps to the correct landing pad via a switch statement that's generated right before code-gen. Loosen the constraint in the machine instruction verifier to allow for this. Note, this isn't the most rigorous check since we cannot determine where that switch statement came from. But it's marginally better than turning this check off when SjLj exceptions are used. <rdar://problem/9187612> llvm-svn: 130881	2011-05-04 22:54:05 +00:00
Eli Friedman	5b78092546	Re-commit r130862 with a minor change to avoid an iterator running off the edge in some cases. Original message: Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130877	2011-05-04 22:10:36 +00:00
Eli Friedman	cc74616be6	Back out r130862; it appears to be breaking bootstrap. llvm-svn: 130867	2011-05-04 20:48:42 +00:00
Eli Friedman	e086e00208	Teach MachineCSE how to do simple cross-block CSE involving physregs. This allows, for example, eliminating duplicate cmpl's on x86. Part of rdar://problem/8259436 . llvm-svn: 130862	2011-05-04 19:54:24 +00:00
Rafael Espindola	8346c59e6a	Producing a DW_FORM_addr for DW_AT_stmt_list is probably correct, but it is both inefficient and unexpected by dwarfdump. Change to a DW_FORM_data4. While in here, change the predicate name to reflect that the position is not really absolute (it is an offset), just that the linker needs a relocation. llvm-svn: 130846	2011-05-04 17:44:06 +00:00
Jakob Stoklund Olesen	372f33d560	Rename -disable-physical-join to -join-physregs and invert it. Physreg joining is still on by default, but I will turn it off shortly. llvm-svn: 130844	2011-05-04 16:45:05 +00:00
Devang Patel	a6e69a5541	Tighten up check for empty (i.e. no meaningful debug info) module. This fixes dwarf-die2.c test case from gcc test suite. llvm-svn: 130842	2011-05-04 16:34:02 +00:00
Devang Patel	3506ad5c7f	Even if the subprogram is going to use AT_specification, emit DW_AT_MIPS_linkage_name. This helps gdb and fixes var-path-expr.exp regression reported by gdb testsuite. llvm-svn: 130794	2011-05-03 21:50:34 +00:00
Jakob Stoklund Olesen	9884b02bef	Gracefully handle invalid live ranges. Fix PR9831. Register coalescing can sometimes create live ranges that end in the middle of a basic block without any killing instruction. When SplitKit detects this, it will repair the live range by shrinking it to its uses. Live range splitting also needs to know about this. When the range shrinks so much that it becomes allocatable, live range splitting fails because it can't find a good split point. It is paranoid about making progress, so an allocatable range is considered an error. The coalescer should really not be creating these bad live ranges. They appear when coalescing dead copies. llvm-svn: 130787	2011-05-03 20:42:13 +00:00
Devang Patel	2d8861a690	If the front end has emitted llvm.dbg.cu and other debug info anchors (clang does it now) then use them directly. This saves one scan of entire module, to collect debug info, which in turns saves few machine cycles at compile time. llvm-svn: 130759	2011-05-03 16:45:22 +00:00
Owen Anderson	285b2ec92d	Other parts of the SelectionDAG framework assume that targets use their pointer type for vector indices. Make the vector unrolling code respect that. llvm-svn: 130733	2011-05-02 22:25:45 +00:00
Jakob Stoklund Olesen	86f7ef57c2	Handle <def,undef> in the second loop as well. llvm-svn: 130718	2011-05-02 20:36:53 +00:00
Jakob Stoklund Olesen	4ad266b02c	Use the PrintReg adaptor to correctly print live-in registers in debug output. llvm-svn: 130715	2011-05-02 20:06:30 +00:00
Jakob Stoklund Olesen	3c707fcb60	Only ignore <undef> use operands, keep the <def,undef> ops. Def operands may also have an <undef> flag, but that just means that a sub-register redef doesn't actually read the super-register. For physical registers, it has no meaning. llvm-svn: 130714	2011-05-02 20:06:28 +00:00
Devang Patel	7d7ac5512a	Emit debug info for global variables first. This works around a limitation in gdb which is reported by following inherit.exp test failures from gdb testsuite. gdb.cp/inherit.exp: print g_vB.vB::vb gdb.cp/inherit.exp: print g_vB.vB::vx gdb.cp/inherit.exp: print g_vC.vC::vc gdb.cp/inherit.exp: print g_vC.vC::vx gdb.cp/inherit.exp: print g_vD.vB::vb ... llvm-svn: 130702	2011-05-02 18:19:17 +00:00
Rafael Espindola	8ef7012675	Only produce the eh_frame section if we have at least one personality function. llvm-svn: 130692	2011-05-02 15:49:52 +00:00
Jakob Stoklund Olesen	281067b1b8	Minimize the slot indexes spanned by register ranges created when splitting. When an interfering live range ends at a dead slot index between two instructions, make sure that the inserted copy instruction gets a slot index after the dead ones. This makes it possible to avoid the interference. Ideally, there shouldn't be interference ending at a deleted instruction, but physical register coalescing can sometimes do that to sub-registers. This fixes PR9823. llvm-svn: 130687	2011-05-02 05:29:58 +00:00
Rafael Espindola	eb5d0cb4f4	GCC uses a different encoding of pointers in the FDE when using -fno-dwarf2-cfi-asm. Implement the same behavior. llvm-svn: 130637	2011-05-01 04:49:54 +00:00
Jakob Stoklund Olesen	ffe1dbc840	When a physreg is live-in and live through a basic block, make sure its live range covers the entire block. The live range can't be terminated at a random instruction. llvm-svn: 130619	2011-04-30 19:12:33 +00:00
Jakob Stoklund Olesen	4ec9e1c33a	Avoid using stale entries form the sibling value map. This could happen when trying to use a value that had been eliminated after dead code elimination and folding loads. llvm-svn: 130597	2011-04-30 06:42:21 +00:00
Jakob Stoklund Olesen	8c578bf1cf	Use hysteresis for local live range splitting as well. llvm-svn: 130596	2011-04-30 05:07:46 +00:00
Rafael Espindola	7901d3790e	Add all the plumbing needed for MC to expand cfi to the old tables in the final assembly. It is the same technique used when targeting assemblers that don't support .loc. llvm-svn: 130587	2011-04-30 03:44:37 +00:00
Jakob Stoklund Olesen	720ec44d13	Update comment. llvm-svn: 130582	2011-04-30 03:13:08 +00:00
Jakob Stoklund Olesen	a397ddfd59	Use a greedy algorithm for allocating registers. llvm-svn: 130568	2011-04-30 01:37:54 +00:00
Bill Wendling	0a7857b4fa	Print out the 'nontemporal' info on a store. llvm-svn: 130562	2011-04-29 23:45:22 +00:00
Eli Friedman	919bf1ca71	Make FastEmit_ri_ try a bit harder to succeed for supported operations; FastEmit_i can fail for non-Thumb2 ARM. Makes ARMSimplifyAddress work correctly, and reduces the number of fast-isel bailouts on non-Thumb ARM. llvm-svn: 130560	2011-04-29 23:34:52 +00:00
Devang Patel	acce3eb6b2	Hoist MCLineEntry construction AsmPrinter so that anyone who derives from AsmPrinter can have line number entries. PR 9810 llvm-svn: 130518	2011-04-29 18:00:54 +00:00
Rafael Espindola	7c632e0001	The last hack for producing bit identical output with cfi on OS X. llvm-svn: 130504	2011-04-29 15:09:53 +00:00
Rafael Espindola	16455286cb	Change DwarfCFIException's member variables to track what it actually emmits: .cfi_personality, .cfi_lsda and the moves. llvm-svn: 130503	2011-04-29 14:48:51 +00:00
Rafael Espindola	16b23d9ff7	Factor some code to needsCFIMoves. Avoid printing moves when we don't have to. llvm-svn: 130501	2011-04-29 14:14:06 +00:00
Devang Patel	900ceb725b	Teach dwarf writer to handle complex address expression for .debug_loc entries. This fixes clang generated blocks' variables' debug info. Radar 9279956. llvm-svn: 130373	2011-04-28 02:22:40 +00:00
Eli Friedman	86181251f3	Fix a silly mistake in r130338. llvm-svn: 130360	2011-04-28 00:42:03 +00:00
Rafael Espindola	36e419b524	Remove unnecessary argument. llvm-svn: 130343	2011-04-27 23:17:57 +00:00
Rafael Espindola	0525497a16	Rename getPersonalityPICSymbol to getCFIPersonalitySymbol, document it, and give it a bit more responsibility. Also implement it for MachO. If hacked to use cfi, 32 bit MachO will produce .cfi_personality 155, L___gxx_personality_v0$non_lazy_ptr and 64 bit will produce .cfi_presonality ___gxx_personality_v0 The general idea is that .cfi_personality gets passed the final symbol. It is up to codegen to produce it if using indirect representation (like 32 bit MachO), but it is up to MC to decide which relocations to create. llvm-svn: 130341	2011-04-27 23:08:15 +00:00
Devang Patel	42736cb058	Simplify handling of variables with complex address (i.e. blocks variables) llvm-svn: 130339	2011-04-27 22:45:24 +00:00
Eli Friedman	c5406cdb50	Make the fast-isel code for literal 0.0 a bit shorter/faster, since 0.0 is common. rdar://problem/9303592 . llvm-svn: 130338	2011-04-27 22:41:55 +00:00
Eli Friedman	fc1152d772	Remove unused function. llvm-svn: 130337	2011-04-27 22:21:02 +00:00
Rafael Espindola	6bf5acb38d	Fix indentation. llvm-svn: 130331	2011-04-27 21:29:52 +00:00
Devang Patel	42f4a7ff92	Revert r130178. It turned out to be not the optimal path to emit complex location expressions. llvm-svn: 130326	2011-04-27 20:29:27 +00:00
Evan Cheng	fa34d31aa4	If converter was being too cute. It look for root BBs (which don't have successors) and use inverse depth first search to traverse the BBs. However that doesn't work when the CFG has infinite loops. Simply do a linear traversal of all BBs work just fine. rdar://9344645 llvm-svn: 130324	2011-04-27 19:32:43 +00:00
Jakob Stoklund Olesen	adb564f3cd	Also add <imp-def> operands for defined and dead super-registers when rewriting. We cannot rely on the <imp-def> operands added by LiveIntervals in all cases as demonstrated by the test case. llvm-svn: 130313	2011-04-27 17:42:31 +00:00
Jakob Stoklund Olesen	2fa051f068	Add a safe-guard against repeated splitting for some rare cases. The number of blocks covered by a live range must be strictly decreasing when splitting, otherwise we can't allow repeated splitting. llvm-svn: 130249	2011-04-26 22:33:12 +00:00
Evan Cheng	dea3347167	Be careful about scheduling nodes above previous calls. It increase usages of more callee-saved registers and introduce copies. Only allows it if scheduling a node above calls would end up lessen register pressure. Call operands also has added ABI restrictions for register allocation, so be extra careful with hoisting them above calls. rdar://9329627 llvm-svn: 130245	2011-04-26 21:31:35 +00:00
Rafael Espindola	e238ffe4ba	Print the label if we will use it in debug_frame. llvm-svn: 130232	2011-04-26 19:26:53 +00:00
Devang Patel	09b1585aac	Refactor code. Keep dwarf register operation selection logic at one place. llvm-svn: 130231	2011-04-26 19:06:18 +00:00
Jakob Stoklund Olesen	c9cf507d93	Use the new TRI->getLargestLegalSuperClass hook to constrain register class inflation. This has two effects: 1. We never inflate to a larger register class than what the sub-target can handle. 2. Completely unconstrained virtual registers get the largest possible register class. llvm-svn: 130229	2011-04-26 18:52:36 +00:00
Dan Gohman	fbb7ade7ae	Fast-isel support for simple inline asms. llvm-svn: 130205	2011-04-26 17:18:34 +00:00
Chris Lattner	37fec9f729	don't emit the symbol name twice for local bss and common symbols. For example, don't emit: .comm _i,4,2 ## @i ## @i instead emit: .comm _i,4,2 ## @i llvm-svn: 130192	2011-04-26 06:14:13 +00:00
Evan Cheng	73a9ae3388	Fix typo llvm-svn: 130190	2011-04-26 04:57:37 +00:00
Rafael Espindola	59c3a084c6	Print all the moves at a given label instead of just the first one. Remove previous DwarfCFI hack. llvm-svn: 130187	2011-04-26 03:58:56 +00:00
Devang Patel	4969322bc4	Let dwarf writer allocate extra space in the debug location expression. This space, if requested, will be used for complex addresses of the Blocks' variables. llvm-svn: 130178	2011-04-26 00:12:46 +00:00
Devang Patel	3da97b7d34	Rename a local variable. llvm-svn: 130171	2011-04-25 23:05:21 +00:00
Devang Patel	e28211b031	Rename a method to match what it really does. s/addVariableAddress/addFrameVariableAddress/g llvm-svn: 130170	2011-04-25 23:02:17 +00:00
Devang Patel	b1b33d6569	Do not drop a variable's complex address if it is not based on frame base. Observed this while reading code, so I do not have a test case handy here. llvm-svn: 130167	2011-04-25 22:52:55 +00:00
Devang Patel	83eac5e134	A dbg.declare may not be in entry block, even if it is referring to an incoming argument. However, It is appropriate to emit DBG_VALUE referring to this incoming argument in entry block in MachineFunction. llvm-svn: 130129	2011-04-25 16:33:52 +00:00
Rafael Espindola	a14f5303dd	Simplify the logic. Noticed by aKor. llvm-svn: 130116	2011-04-24 19:55:34 +00:00
Rafael Espindola	8c824c73b6	Synchronize the conditions for producing a .cfi_startproc and a .cfi_endproc. Fixes PR9787. llvm-svn: 130115	2011-04-24 19:00:34 +00:00
Sebastian Redl	5fea40be23	Give SplitKit.h a header guard. llvm-svn: 130095	2011-04-24 15:46:51 +00:00
Jay Foad	c146569beb	Remove unused STL header includes. llvm-svn: 130068	2011-04-23 19:53:52 +00:00
Owen Anderson	e1b33b92a3	Teach FastISel to deal with instructions that have two immediate operands. llvm-svn: 130033	2011-04-22 23:38:06 +00:00
Devang Patel	929bbb6bf9	Let front-end tie subprogram declaration with subprogram definition directly. llvm-svn: 130028	2011-04-22 23:10:17 +00:00
Jakob Stoklund Olesen	0dcf650f0a	Always compare the cost of region splitting with the cost of per-block splitting. Sometimes it is better to split per block, and we missed those cases. llvm-svn: 130025	2011-04-22 22:47:40 +00:00
Chris Lattner	d9c0db9bd7	Recommit the fix for rdar://9289512 with a couple tweaks to fix bugs exposed by the gcc dejagnu testsuite: 1. The load may actually be used by a dead instruction, which would cause an assert. 2. The load may not be used by the current chain of instructions, and we could move it past a side-effecting instruction. Change how we process uses to define the problem away. llvm-svn: 130018	2011-04-22 21:59:37 +00:00
Benjamin Kramer	f6eab5f86e	DAGCombine: fold "(zext x) == C" into "x == (trunc C)" if the trunc is lossless. On x86 this allows to fold a load into the cmp, greatly reducing register pressure. movzbl (%rdi), %eax cmpl $47, %eax -> cmpb $47, (%rdi) This shaves 8k off gcc.o on i386. I'll leave applying the patch in README.txt to Chris :) llvm-svn: 130005	2011-04-22 18:47:44 +00:00
Devang Patel	6aac901f77	Do not leak argument's DbgVariables. llvm-svn: 130004	2011-04-22 18:09:57 +00:00
Evan Cheng	1f353e2438	Typo llvm-svn: 129970	2011-04-22 01:40:20 +00:00
Bill Wendling	b0df282414	Branch folding is folding a landing pad into a regular BB. An exception is thrown via a call to _cxa_throw, which we don't expect to return. Therefore, the "true" part of the invoke goes to a BB that has 'unreachable' as its only instruction. This is lowered into an empty MachineBB. The landing pad for this invoke, however, is directly after the "true" MBB. When the empty MBB is removed, the landing pad is directly below the BB with the invoke call. The unconditional branch is removed and then the two blocks are merged together. The testcase is too big for a regression test. <rdar://problem/9305728> llvm-svn: 129965	2011-04-22 01:07:09 +00:00
Devang Patel	4f25432e4e	Refactor. llvm-svn: 129938	2011-04-21 21:07:35 +00:00
Matt Beaumont-Gay	eb07568f1b	Don't recycle loop variables. llvm-svn: 129928	2011-04-21 19:46:23 +00:00
Jakob Stoklund Olesen	5053b8795b	Allow allocatable ranges from global live range splitting to be split again. These intervals are allocatable immediately after splitting, but they may be evicted because of later splitting. This is rare, but when it happens they should be split again. The remainder intervals that cannot be allocated after splitting still move directly to spilling. SplitEditor::finish can optionally provide a mapping from new live intervals back to the original interval indexes returned by openIntv(). Each original interval index can map to multiple new intervals after connected components have been separated. Dead code elimination may also add existing intervals to the list. The reverse mapping allows the SplitEditor client to treat the new intervals differently depending on the split region they came from. llvm-svn: 129925	2011-04-21 18:38:15 +00:00
Devang Patel	a31b73427e	Add comment in output stream. llvm-svn: 129921	2011-04-21 17:50:24 +00:00
Daniel Dunbar	3a96439b36	Revert r1296656, "Fix rdar://9289512 - not folding load into compare at -O0...", which broke a couple GCC test suite tests at -O0. llvm-svn: 129914	2011-04-21 16:14:46 +00:00
Jakob Stoklund Olesen	22089a66fb	Add debug output for rematerializable instructions. llvm-svn: 129883	2011-04-20 22:14:20 +00:00
Jakob Stoklund Olesen	266de7e1e1	Permit remat when a virtual register has multiple defs. TII::isTriviallyReMaterializable() shouldn't depend on any properties of the register being defined by the instruction. Rematerialization is going to create a new virtual register anyway. llvm-svn: 129882	2011-04-20 22:14:17 +00:00
Jakob Stoklund Olesen	6501ea2557	Prefer cheap registers for busy live ranges. On the x86-64 and thumb2 targets, some registers are more expensive to encode than others in the same register class. Add a CostPerUse field to the TableGen register description, and make it available from TRI->getCostPerUse. This represents the cost of a REX prefix or a 32-bit instruction encoding required by choosing a high register. Teach the greedy register allocator to prefer cheap registers for busy live ranges (as indicated by spill weight). llvm-svn: 129864	2011-04-20 18:19:48 +00:00
Stuart Hastings	a552942e02	ARM byval support. Will be enabled by another patch to the FE. <rdar://problem/7662569> llvm-svn: 129858	2011-04-20 16:47:52 +00:00
Rafael Espindola	09e9797728	Remove unused arguments. llvm-svn: 129844	2011-04-20 03:08:09 +00:00
Eric Christopher	4c3c7c8211	Rewrite the expander for umulo/smulo to remember to sign extend the input manually and pass all (now) 4 arguments to the mul libcall. Add a new ExpandLibCall for just this (copied gratuitously from type legalization). Fixes rdar://9292577 llvm-svn: 129842	2011-04-20 01:19:45 +00:00
Daniel Dunbar	82a4062a4e	ADT/Triple: Renambe isOSX... methods to isMacOSX for consistency with the OS triple component. llvm-svn: 129838	2011-04-20 00:14:25 +00:00
Daniel Dunbar	140e365c49	CodeGen: Eliminate a use of getDarwinMajorNumber(). - There is a minor semantic change here (evidenced by the test change) for Darwin triples that have no version component. I debated changing the default behavior of isOSVersionLT, but decided it made more sense for triples to be explicit. llvm-svn: 129802	2011-04-19 20:32:39 +00:00
Stuart Hastings	89cb281cf8	Delete unnecessary variable. <rdar://problem/7662569> llvm-svn: 129796	2011-04-19 20:09:38 +00:00
Bob Wilson	886994b683	Avoid write-after-write issue hazards for Cortex-A9. Add a avoidWriteAfterWrite() target hook to identify register classes that suffer from write-after-write hazards. For those register classes, try to avoid writing the same register in two consecutive instructions. This is currently disabled by default. We should not spill to avoid hazards! The command line flag -avoid-waw-hazard can be used to enable waw avoidance. llvm-svn: 129772	2011-04-19 18:11:45 +00:00
Jakob Stoklund Olesen	dceb96c62d	Force the greedy register allocator to be linked alongside linear scan. This means that the new register allocator can be used with 'clang -mllvm -regalloc=greedy'. llvm-svn: 129764	2011-04-19 17:17:58 +00:00
Eli Friedman	bbf7d2ac38	SelectBasicBlock is rather slow even when it doesn't do anything; skip the unnecessary work where possible. llvm-svn: 129763	2011-04-19 17:01:08 +00:00
Stuart Hastings	f838ea4959	Support nested CALLSEQ_BEGIN/END; necessary for ARM byval support. <rdar://problem/7662569> llvm-svn: 129761	2011-04-19 16:16:58 +00:00
Chris Lattner	f15db6c86f	Implement support for x86 fastisel of small fixed-sized memcpys, which are generated en-mass for C++ PODs. On my c++ test file, this cuts the fast isel rejects by 10x and shrinks the generated .s file by 5% llvm-svn: 129755	2011-04-19 05:52:03 +00:00
Eli Friedman	b306371396	Simplify declarations slightly by using typedefs. llvm-svn: 129720	2011-04-18 21:21:37 +00:00
Devang Patel	7220c1a021	Reduce clutter in asm output. Do not emit source location as comment for each instruction. llvm-svn: 129715	2011-04-18 20:26:49 +00:00
Jakob Stoklund Olesen	c2f25578a4	Handle spilling around an instruction that has an early-clobber re-definition of the spilled register. This is quite common on ARM now that some stores have early-clobber defines. llvm-svn: 129714	2011-04-18 20:23:27 +00:00
Eric Christopher	e1103d0a86	Fix a bug where we were counting the alias sets as completely used registers for fast allocation a different way. This has us updating used registers only when we're using that exact register. Fixes rdar://9207598 llvm-svn: 129711	2011-04-18 19:26:25 +00:00
Chris Lattner	f8f4d3c30a	while we're at it, handle 'sdiv exact' of a power of 2 also, this fixes a few rejects on c++ iterator loops. llvm-svn: 129694	2011-04-18 07:00:40 +00:00
Chris Lattner	dd2f1ec77c	fix rdar://9297011 - udiv by power of two causing fast-isel rejects llvm-svn: 129693	2011-04-18 06:55:51 +00:00
Chris Lattner	28eaf6be7f	1. merge fast-isel-shift-imm.ll into fast-isel-x86-64.ll 2. implement rdar://9289501 - fast isel should fold trivial multiplies to shifts 3. teach tblgen to handle shift immediates that are different sizes than the shifted operands, eliminating some code from the X86 fast isel backend. 4. Have FastISel::SelectBinaryOp use (the poorly named) FastEmit_ri_ function instead of FastEmit_ri to simplify code. llvm-svn: 129666	2011-04-17 20:23:29 +00:00
Chris Lattner	f9d9976374	fix an oversight which caused us to compile the testcase (and other less trivial things) into a dummy lea. Before we generated: _test: ## @test movq _G@GOTPCREL(%rip), %rax leaq (%rax), %rax ret now we produce: _test: ## @test movq _G@GOTPCREL(%rip), %rax ret This is part of rdar://9289558 llvm-svn: 129662	2011-04-17 17:12:08 +00:00
Chris Lattner	5e00f501ff	Fix rdar://9289512 - not folding load into compare at -O0 The basic issue here is that bottom-up isel is matching the branch and compare, and was failing to fold the load into the branch/compare combo. Fixing this (by allowing folding into any instruction of a sequence that is selected) allows us to produce things like: cmpb $0, 52(%rax) je LBB4_2 instead of: movb 52(%rax), %cl cmpb $0, %cl je LBB4_2 This makes the generated -O0 code run a bit faster, but also speeds up compile time by putting less pressure on the register allocator and generating less code. This was one of the biggest classes of missing load folding. Implementing this shrinks 176.gcc's c-decl.s (as a random example) by about 4% in (verbose-asm) line count. llvm-svn: 129656	2011-04-17 06:35:44 +00:00
Chris Lattner	1fe5f78b7e	split a complex predicate out to a helper function. Simplify two for loops, which don't need to check for falling off the end of a block and end of phi nodes, since terminators are never phis. llvm-svn: 129655	2011-04-17 06:03:19 +00:00
Chris Lattner	cb194276e0	fix rdar://9289583 - fast isel should handle non-canonical commutative binops allowing us to fold the immediate into the 'and' in this case: int test1(int i) { return 8&i; } llvm-svn: 129653	2011-04-17 01:16:47 +00:00
Eli Friedman	2798137293	PR9055: extend the fix to PR4050 (r70179) to apply to zext and anyext. Returning a new node makes the code try to replace the old node, which in the included testcase is killed by CSE. llvm-svn: 129650	2011-04-16 23:25:34 +00:00
Francois Pichet	1cc1375d03	Unbreak the MSVC 2010 build. For further information on this particular issue see: http://connect.microsoft.com/VisualStudio/feedback/details/520043/error-converting-from-null-to-a-pointer-type-in-std-pair llvm-svn: 129642	2011-04-16 14:20:39 +00:00
Benjamin Kramer	0b3416e2f5	Remove unused variable. llvm-svn: 129639	2011-04-16 10:30:47 +00:00
Rafael Espindola	9e5aaa3b78	Put each personality function in a section. This fixes the gnu ld warning: error in foo.o; no .eh_frame_hdr table will be created. llvm-svn: 129635	2011-04-16 03:51:21 +00:00
Evan Cheng	b720f37282	Fix divmod libcall lowering. Convert to {S\|U}DIVREM first and then expand the node to a libcall. rdar://9280991 llvm-svn: 129633	2011-04-16 03:08:26 +00:00
Devang Patel	eddab1d186	Introduce support to encode Objective-C property information in debugging information generated for an interface. llvm-svn: 129624	2011-04-16 00:11:51 +00:00
Rafael Espindola	694ad2f25c	Some refactoring suggested by Anton Korobeynikov. llvm-svn: 129600	2011-04-15 20:32:03 +00:00
Jakob Stoklund Olesen	bdd6204582	Teach the SplitKit blitter to handle multiply defined values as well. The transferValues() function can now handle both singly and multiply defined values, as long as the resulting live range is known. Only rematerialized values have their live range recomputed by extendRange(). The updateSSA() function can now insert PHI values in bulk across multiple values in multiple target registers in one pass. The list of blocks received from transferValues() is in layout order which seems to work well for the iterative algorithm. Blocks from extendRange() are still in reverse BFS order, but this function is used so rarely now that it doesn't matter. llvm-svn: 129580	2011-04-15 17:24:49 +00:00
Jakob Stoklund Olesen	ea8581b792	Remember to set flag. llvm-svn: 129579	2011-04-15 17:24:46 +00:00
Rafael Espindola	99831068c8	Add 129518 back with a fix for when we are producing eh just because of debug info. Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129571	2011-04-15 15:11:06 +00:00
Chris Lattner	0304b82f80	Fix a ton of comment typos found by codespell. Patch by Luis Felipe Strano Moraes! llvm-svn: 129558	2011-04-15 05:18:47 +00:00
NAKAMURA Takumi	7aed456653	Revert r129518, "Change ELF systems to use CFI for producing the EH tables. This reduces the" It broke several builds. llvm-svn: 129557	2011-04-15 03:35:57 +00:00
Owen Anderson	0ce6c0f86e	Fix another instance of the DAG combiner not using the correct type for the RHS of a shift. llvm-svn: 129522	2011-04-14 17:30:49 +00:00
Rafael Espindola	d5eed657e2	Change ELF systems to use CFI for producing the EH tables. This reduces the size of the clang binary in Debug builds from 690MB to 679MB. llvm-svn: 129518	2011-04-14 15:18:53 +00:00
Andrew Trick	e89c19ab7b	In the pre-RA scheduler, maintain cmp+br proximity. This is done by pushing physical register definitions close to their use, which happens to handle flag definitions if they're not glued to the branch. This seems to be generally a good thing though, so I didn't need to add a target hook yet. The primary motivation is to generate code closer to what people expect and rule out missed opportunity from enabling macro-op fusion. As a side benefit, we get several 2-5% gains on x86 benchmarks. There is one regression: SingleSource/Benchmarks/Shootout/lists slows down be -10%. But this is an independent scheduler bug that will be tracked separately. See rdar://problem/9283108. Incidentally, pre-RA scheduling is only half the solution. Fixing the later passes is tracked by: <rdar://problem/8932804> [pre-RA-sched] on x86, attempt to schedule CMP/TEST adjacent with condition jump Fixes: <rdar://problem/9262453> Scheduler unnecessary break of cmp/jump fusion llvm-svn: 129508	2011-04-14 05:15:06 +00:00
Chris Lattner	d4ba43dc76	sink a call into its only use. llvm-svn: 129503	2011-04-14 04:12:47 +00:00
Owen Anderson	d98929ed6c	During post-legalization DAG combining, be careful to only create shifts where the RHS is of the legal type for the new operation. llvm-svn: 129484	2011-04-13 23:22:23 +00:00
Devang Patel	43cbfe2ba7	Remove extra bytes that were added for gdb. We do not have good poiner to understand actual reason behind this fixme. Spot checking suggest that newer gdb does not need this. llvm-svn: 129461	2011-04-13 19:41:17 +00:00
Jakob Stoklund Olesen	d7db076abc	Stop using dead function. llvm-svn: 129442	2011-04-13 15:00:11 +00:00
Andrew Trick	916e01c917	Recommit r129383. PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. Additional fixes: Do something reasonable for subtargets with generic itineraries by handle node latency the same as for an empty itinerary. Now nodes default to unit latency unless an itinerary explicitly specifies a zero cycle stage or it is a TokenFactor chain. Original fixes: UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make the ndoe latency adjustments work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129421	2011-04-13 00:38:32 +00:00
Eric Christopher	147cad907a	Temporarily revert r129408 to see if it brings the bots back. llvm-svn: 129417	2011-04-13 00:20:59 +00:00
Eric Christopher	c72bd6024f	Fix a bug where we were counting the alias sets as completely used registers for fast allocation. Fixes rdar://9207598 llvm-svn: 129408	2011-04-12 23:23:14 +00:00
Devang Patel	9cceebfde4	I missed this new file in previous commit. llvm-svn: 129407	2011-04-12 23:21:44 +00:00
Devang Patel	5f8111e1ca	Simplify. There is no need to use static variable. llvm-svn: 129406	2011-04-12 23:10:47 +00:00
Devang Patel	f078958e43	Do not reuse parameter name. llvm-svn: 129405	2011-04-12 23:09:06 +00:00
Devang Patel	f288e23b3f	This mechanical patch moves type handling into CompileUnit from DwarfDebug. In case of multiple compile unit in one object file, each compile unit is responsible for its own set of type entries anyway. This refactoring makes this obvious. llvm-svn: 129402	2011-04-12 22:53:02 +00:00
Eric Christopher	553418ccd4	Add more comments... err debug statements to the fast allocator. llvm-svn: 129400	2011-04-12 22:17:44 +00:00
Jakob Stoklund Olesen	7f28263ab0	SparseBitVector is SLOW. Use a Bitvector instead, we didn't need the smaller memory footprint anyway. This makes the greedy register allocator 10% faster. llvm-svn: 129390	2011-04-12 21:30:53 +00:00
Andrew Trick	d83e7b6a5d	Revert 129383. It causes some targets to hit a scheduler assert. llvm-svn: 129385	2011-04-12 20:14:07 +00:00
Andrew Trick	1e0821075d	PreRA scheduler heuristic fixes: VRegCycle, TokenFactor latency. UnitsSharePred was a source of randomness in the scheduler: node priority depended on the queue data structure. I rewrote the recent VRegCycle heuristics to completely replace the old heuristic without any randomness. To make these heuristic adjustments to node latency work, I also needed to do something a little more reasonable with TokenFactor. I gave it zero latency to its consumers and always schedule it as low as possible. llvm-svn: 129383	2011-04-12 19:54:36 +00:00
Jakob Stoklund Olesen	1db776a52e	Create new intervals for isolated blocks during region splitting. This merges the behavior of splitSingleBlocks into splitAroundRegion, so the RS_Region and RS_Block register stages can be coalesced. That means the leftover intervals after region splitting go directly to spilling instead of a second pass of per-block splitting. llvm-svn: 129379	2011-04-12 19:32:53 +00:00
Jakob Stoklund Olesen	33a5706748	Add SplitKit API to query and select the current interval being worked on. This makes it possible to target multiple registers in one pass. llvm-svn: 129374	2011-04-12 18:11:31 +00:00
Jakob Stoklund Olesen	1b4a5fa3e4	Fix a bug in RegAllocBase::addMBBLiveIns() where a basic block could accidentally be skipped. llvm-svn: 129373	2011-04-12 18:11:28 +00:00
Devang Patel	c115961589	Remove dead typedef. llvm-svn: 129368	2011-04-12 17:43:12 +00:00
Devang Patel	6c1785d527	Refactor CompileUnit into a separate header. llvm-svn: 129367	2011-04-12 17:40:32 +00:00
Eric Christopher	72a09952de	Fix typo. llvm-svn: 129334	2011-04-12 00:48:08 +00:00
Jakob Stoklund Olesen	ea0a2b637b	Reuse live interval union between functions. This saves a bit of compile time when compiling many small functions. llvm-svn: 129321	2011-04-11 23:57:14 +00:00
Nick Lewycky	75e67d4dc2	Just because a GlobalVariable's initializer is [N x { i32, void ()* }] doesn't mean that it has to be ConstantArray of ConstantStruct. We might have ConstantAggregateZero, at either level, so don't crash on that. Also, semi-deprecate the sentinal value. The linker isn't aware of sentinals so we end up with the two lists appended, each with their "sentinals" on them. Different parts of LLVM treated sentinals differently, so make them all just ignore the single entry and continue on with the rest of the list. llvm-svn: 129307	2011-04-11 22:11:20 +00:00
Jakob Stoklund Olesen	7796876061	Speed up eviction by stopping collectInterferingVRegs as soon as the spill weight limit has been exceeded. llvm-svn: 129305	2011-04-11 21:47:01 +00:00
Bill Wendling	966775ce8a	The default of the dispatch switch statement was to branch to a BB that executed the 'unwind' instruction. However, later on that instruction was converted into a jump to the basic block it was located in, causing an infinite loop when we get there. It turns out, we get there if the _Unwind_Resume_or_Rethrow call returns (which it's not supposed to do). It returns if it cannot find a place to unwind to. Thus we would get what appears to be a "hang" when in reality it's just that the EH couldn't be propagated further along. Instead of infinitely looping (or calling `unwind', which none of our back-ends support (it's lowered into nothing...)), call the @llvm.trap() intrinsic instead. This may not conform to specific rules of a particular language, but it's rather better than infinitely looping. <rdar://problem/9175843&9233582> llvm-svn: 129302	2011-04-11 21:32:34 +00:00
Evan Cheng	ea0d287a8a	Look pass copies when determining whether hoisting would end up inserting more copies. rdar://9266679 llvm-svn: 129297	2011-04-11 21:09:18 +00:00
Jakob Stoklund Olesen	fceaaa54f5	Use a faster algorithm for computing MBB live-in registers after register allocation. LiveIntervals::findLiveInMBBs has to do a full binary search for each segment. llvm-svn: 129292	2011-04-11 20:01:41 +00:00
Evan Cheng	d575a99d75	Fix a couple of places where changes are made but not tracked. llvm-svn: 129287	2011-04-11 18:47:20 +00:00
Jakob Stoklund Olesen	d224a3530a	Don't add live ranges for sub-registers when clobbering a physical register. Both coalescing and register allocation already check aliases for interference, so these extra segments are only slowing us down. This speeds up both linear scan and the greedy register allocator. llvm-svn: 129283	2011-04-11 18:08:10 +00:00
Jakob Stoklund Olesen	57f2eda288	Speed up LiveIntervalUnion::unify by handling end insertion specially. This particularly helps with the initial transfer of fixed intervals. llvm-svn: 129277	2011-04-11 15:00:44 +00:00
Jakob Stoklund Olesen	97bb6d4c3a	Time the initial seeding of live registers llvm-svn: 129276	2011-04-11 15:00:42 +00:00
Jakob Stoklund Olesen	5d6e68454e	Don't shrink live ranges after dead code elimination unless it is going to help. In particular, don't repeatedly recompute the PIC base live range after rematerialization. llvm-svn: 129275	2011-04-11 15:00:39 +00:00
Jay Foad	0d5ca4cf44	Don't include Operator.h from InstrTypes.h. llvm-svn: 129271	2011-04-11 09:35:34 +00:00
Chris Lattner	e8dfbaef19	Avoid excess precision issues that lead to generating host-compiler-specific code. Switch lowering probably shouldn't be using FP for this. This resolves PR9581. llvm-svn: 129199	2011-04-09 06:57:13 +00:00
Jakob Stoklund Olesen	5add6d16b7	Build the Hopfield network incrementally when splitting global live ranges. It is common for large live ranges to have few basic blocks with register uses and many live-through blocks without any uses. This approach grows the Hopfield network incrementally around the use blocks, completely avoiding checking interference for some through blocks. llvm-svn: 129188	2011-04-09 02:59:09 +00:00
Jakob Stoklund Olesen	b530849e81	Precompute interference for neighbor blocks as long as there is no interference. This doesn't require seeking in the live interval union, so it is very cheap. llvm-svn: 129187	2011-04-09 02:59:05 +00:00
Chris Lattner	badb8ca63c	have dag combine zap "store undef", which can be formed during call lowering with undef arguments. llvm-svn: 129185	2011-04-09 02:32:02 +00:00
Devang Patel	21b6ef4320	Simplify array bound checks and clarify comments. One element array can have same non-zero number as lower bound as well as upper bound. llvm-svn: 129170	2011-04-08 23:39:38 +00:00
Devang Patel	39ac307002	Do not emit DW_AT_upper_bound and DW_AT_lower_bound for unbouded array. If lower bound is more then upper bound then consider it is an unbounded array. An array is unbounded if non-zero lower bound is same as upper bound. If lower bound and upper bound are zero than array has one element. llvm-svn: 129156	2011-04-08 21:55:10 +00:00
Evan Cheng	bc053100af	Change -arm-trap-func= into a non-arm specific option. Now Intrinsic::trap is lowered into a call to the specified trap function at sdisel time. llvm-svn: 129152	2011-04-08 21:37:21 +00:00
Nick Lewycky	ac1fe011df	llvm.global_[cd]tor is defined to be either external, or appending with an array of { i32, void ()* }. Teach the verifier to verify that, deleting copies of checks strewn about. llvm-svn: 129128	2011-04-08 07:30:21 +00:00
Andrew Trick	36a1759769	Added a check in the preRA scheduler for potential interference on a induction variable. The preRA scheduler is unaware of induction vars, so we look for potential "virtual register cycles" instead. Fixes <rdar://problem/8946719> Bad scheduling prevents coalescing llvm-svn: 129100	2011-04-07 19:54:57 +00:00
Jakob Stoklund Olesen	3e349f2950	Recompute hasPHIKill flags when shrinking live intervals. PHI values may be deleted, causing the flags to be wrong. This fixes PR9616. llvm-svn: 129092	2011-04-07 18:43:14 +00:00
Jakob Stoklund Olesen	aace1636b6	Avoid moving iterators when the previous block was just visited. llvm-svn: 129081	2011-04-07 17:27:50 +00:00
Jakob Stoklund Olesen	1791098020	Prefer multiplications to divisions. llvm-svn: 129080	2011-04-07 17:27:48 +00:00
Jakob Stoklund Olesen	402a4daae6	Extract SpillPlacement::addLinks for handling the special transparent blocks. llvm-svn: 129079	2011-04-07 17:27:46 +00:00
Evan Cheng	1d3691e071	Remove dead code. rdar://9221736. llvm-svn: 129044	2011-04-07 00:56:37 +00:00
Jakob Stoklund Olesen	b59d7e2dea	Also account for the spill code that would be inserted in live-through blocks with interference. llvm-svn: 129030	2011-04-06 21:32:41 +00:00
Jakob Stoklund Olesen	7bd327adbc	Abort the constraint calculation early when all positive bias is lost. Without any positive bias, there is nothing for the spill placer to to. It will spill everywhere. llvm-svn: 129029	2011-04-06 21:32:38 +00:00
Jakob Stoklund Olesen	7621fb6c1b	Keep track of the number of positively biased nodes when adding constraints. If there are no positive nodes, the algorithm can be aborted early. llvm-svn: 129021	2011-04-06 19:14:00 +00:00
Jakob Stoklund Olesen	00f622b9b1	Break the spill placement algorithm into three parts: prepare, addConstraints, and finish. This will allow us to abort the algorithm early if it is determined to be futile. llvm-svn: 129020	2011-04-06 19:13:57 +00:00
Jakob Stoklund Olesen	10a362acbd	Oops. Scary. llvm-svn: 128986	2011-04-06 04:07:14 +00:00
Jakob Stoklund Olesen	bb79ab5ba3	Analyze blocks with uses separately from live-through blocks without uses. About 90% of the relevant blocks are live-through without uses, and the only information required about them is their number. This saves memory and enables later optimizations that need to look at only the use-blocks. llvm-svn: 128985	2011-04-06 03:57:00 +00:00
Jakob Stoklund Olesen	50ab0391d7	Sign error llvm-svn: 128963	2011-04-05 23:43:16 +00:00
Jakob Stoklund Olesen	2bba415e6f	Don't crash when a value is defined after the last split point. llvm-svn: 128962	2011-04-05 23:43:14 +00:00
Jakob Stoklund Olesen	88a0367967	Permit blocks to branch directly to a landing pad. Treat the landing pad as a normal successor when that happens. llvm-svn: 128961	2011-04-05 23:43:11 +00:00
Devang Patel	03d0891c10	Add support to encode function's template parameters. llvm-svn: 128947	2011-04-05 22:52:06 +00:00
Jakob Stoklund Olesen	a819faa2f7	Run LiveDebugVariables in RegAllocBasic and RegAllocGreedy. llvm-svn: 128935	2011-04-05 21:40:37 +00:00
Devang Patel	af7f5f4ada	Refactor. llvm-svn: 128929	2011-04-05 21:08:24 +00:00
Bob Wilson	ef86806800	Add an assertion instead of crashing when the scavenger goes past the end of a basic block. llvm-svn: 128925	2011-04-05 20:44:15 +00:00
Jakob Stoklund Olesen	613bcf88be	When dead code elimination removes all but one use, try to fold the single def into the remaining use. Rematerialization can leave single-use loads behind that we might as well fold whenever possible. llvm-svn: 128918	2011-04-05 20:20:26 +00:00
Devang Patel	2be08abc94	Do not emit empty name. llvm-svn: 128914	2011-04-05 20:14:13 +00:00
Jakob Stoklund Olesen	2bef449b52	Ensure all defs referring to a virtual register are marked dead by addRegisterDead(). There can be multiple defs for a single virtual register when they are defining sub-registers. The missing <dead> flag was stopping the inline spiller from eliminating dead code after rematerialization. llvm-svn: 128888	2011-04-05 16:53:50 +00:00
Rafael Espindola	7618e7be93	Print visibility info for external variables. llvm-svn: 128887	2011-04-05 15:51:32 +00:00
Jakob Stoklund Olesen	731b0d77a2	Use std::unique instead of a SmallPtrSet to ensure unique instructions in UseSlots. This allows us to always keep the smaller slot for an instruction which is what we want when a register has early clobber defines. Drop the UsingInstrs set and the UsingBlocks map. They are no longer needed. llvm-svn: 128886	2011-04-05 15:18:18 +00:00
Jakob Stoklund Olesen	6bd6e03755	Stop precomputing last split points, query the SplitAnalysis cache on demand. llvm-svn: 128875	2011-04-05 04:20:29 +00:00
Jakob Stoklund Olesen	65c8f18b8d	Cache the fairly expensive last split point computation and provide a fast inlined path for the common case. Most basic blocks don't contain a call that may throw, so the last split point os simply the first terminator. llvm-svn: 128874	2011-04-05 04:20:27 +00:00
Bill Wendling	a8db395dc1	Revamp the SjLj "dispatch setup" intrinsic. It needed to be moved closer to the setjmp statement, because the code directly after the setjmp needs to know about values that are on the stack. Also, the 'bitcast' of the function context was causing a dead load. This wouldn't be too horrible, except that at -O0 it wasn't optimized out, and because it wasn't using the correct base pointer (if there is a VLA), it would try to access a value from a garbage address. <rdar://problem/9130540> llvm-svn: 128873	2011-04-05 01:37:43 +00:00
Stuart Hastings	1635b37415	Revert 123704; it broke threaded LLVM. llvm-svn: 128868	2011-04-05 00:37:28 +00:00
Jakob Stoklund Olesen	1454095d5e	Allow coalescing with reserved physregs in certain cases: When a virtual register has a single value that is defined as a copy of a reserved register, permit that copy to be joined. These virtual register are usually copies of the stack pointer: %vreg75<def> = COPY %ESP; GR32:%vreg75 MOV32mr %vreg75, 1, %noreg, 0, %noreg, %vreg74<kill> MOV32mi %vreg75, 1, %noreg, 8, %noreg, 0 MOV32mi %vreg75<kill>, 1, %noreg, 4, %noreg, 0 CALLpcrel32 ... Coalescing these virtual registers early decreases register pressure. Previously, they were coalesced by RALinScan::attemptTrivialCoalescing after register allocation was completed. The lower register pressure causes the mcinst-lowering-cmp0.ll test case to fail because it depends on linear scan spilling a particular register. I am deleting 2008-08-05-SpillerBug.ll because it is counting the number of instructions emitted, and its revision history shows the 'correct' count being edited many times. llvm-svn: 128845	2011-04-04 21:00:03 +00:00
Jakob Stoklund Olesen	d5ddbadc69	Extract physreg joining policy to a separate method. llvm-svn: 128844	2011-04-04 20:59:59 +00:00
Jakob Stoklund Olesen	78d65c6632	Stop caching basic block index ranges now that SlotIndexes can keep up. llvm-svn: 128821	2011-04-04 15:32:15 +00:00
Jakob Stoklund Olesen	6092c3d81f	Delete leftover data members. llvm-svn: 128820	2011-04-04 15:32:11 +00:00
Jakob Stoklund Olesen	e5f6956148	Use InterferenceCache in RegAllocGreedy. llvm-svn: 128765	2011-04-02 06:03:38 +00:00
Jakob Stoklund Olesen	f881310607	Add an InterferenceCache class for caching per-block interference ranges. When the greedy register allocator is splitting multiple global live ranges, it tends to look at the same interference data many times. The InterferenceCache class caches queries for unaltered LiveIntervalUnions. llvm-svn: 128764	2011-04-02 06:03:35 +00:00
Jakob Stoklund Olesen	024a1de4ae	Use basic block numbers as indexes when mapping slot index ranges. This is more compact and faster than using DenseMap. llvm-svn: 128763	2011-04-02 06:03:31 +00:00
Cameron Zwarich	2748634089	Add a RemoveFromWorklist method to DCI. This is needed to do some complicated transformations in target-specific DAG combines without causing DAGCombiner to delete the same node twice. If you know of a better way to avoid this (see my next patch for an example), please let me know. llvm-svn: 128758	2011-04-02 02:40:26 +00:00
Evan Cheng	28382f9178	Add comments. llvm-svn: 128730	2011-04-01 19:57:01 +00:00
Evan Cheng	13c73e4836	Assign node order numbers to results of call instruction lowering. This should improve src line debug info when sdisel is used. rdar://9199118 llvm-svn: 128728	2011-04-01 19:42:22 +00:00
Evan Cheng	39574b2766	Issue libcalls __udivmodi4 / __divmodi4 for div / rem pairs. rdar://8911343 llvm-svn: 128696	2011-04-01 00:42:02 +00:00
Jakob Stoklund Olesen	203727c92e	The basic register allocator must also use the inline spiller. It is using a trivial rewriter that doesn't know how to insert spill code requested by the standard spiller. llvm-svn: 128688	2011-03-31 23:02:17 +00:00
Jakob Stoklund Olesen	a935319339	Don't completely eliminate identity copies that also modify super register liveness. Turn them into noop KILL instructions instead. This lets the scavenger know when super-registers are killed and defined. llvm-svn: 128645	2011-03-31 17:55:25 +00:00
Jakob Stoklund Olesen	c0874a65a0	Allow kill flags on two-address instructions. They are harmless. llvm-svn: 128643	2011-03-31 17:52:41 +00:00
Jakob Stoklund Olesen	84bb8092b6	Mark all uses as <undef> when joining a copy. This way, shrinkToUses() will ignore the instruction that is about to be deleted, and we avoid leaving invalid live ranges that SplitKit doesn't like. Fix a misunderstanding in MachineVerifier about <def,undef> operands. The <undef> flag is valid on def operands where it has the same meaning as <undef> on a use operand. It only applies to sub-register defines which also read the full register. llvm-svn: 128642	2011-03-31 17:23:25 +00:00
Devang Patel	eb032aede2	Remove dead code. llvm-svn: 128639	2011-03-31 16:53:49 +00:00
Jakob Stoklund Olesen	03a6cd0433	Fix bug found by valgrind. llvm-svn: 128634	2011-03-31 15:14:11 +00:00
NAKAMURA Takumi	e0a71fb3e0	lib/CodeGen/LiveIntervalAnalysis.cpp: [PR9590] Don't use std::pow(float,float) here. We don't expect the real "powf()" on some hosts (and powf() would be available on other hosts). For consistency, std::pow(double,double) may be called instead. Or, precision issue might attack us, to see unstable regalloc and stack coloring. llvm-svn: 128629	2011-03-31 12:11:33 +00:00
Jakob Stoklund Olesen	e72dfb1c45	Pick a conservative register class when creating a small live range for remat. The rematerialized instruction may require a more constrained register class than the register being spilled. In the test case, the spilled register has been inflated to the DPR register class, but we are rematerializing a load of the ssub_0 sub-register which only exists for DPR_VFP2 registers. The register class is reinflated after spilling, so the conservative choice is only temporary. llvm-svn: 128610	2011-03-31 03:54:44 +00:00
Jakob Stoklund Olesen	30de09d279	Fix evil VirtRegRewriter bug. The rewriter can keep track of multiple stack slots in the same register if they happen to have the same value. When an instruction modifies a stack slot by defining a register that is mapped to a stack slot, other stack slots in that register are no longer valid. This is a very rare problem, and I don't have a simple test case. I get the impression that VirtRegRewriter knows it is about to be deleted, inventing a last opaque problem. <rdar://problem/9204040> llvm-svn: 128562	2011-03-30 18:14:07 +00:00
Jakob Stoklund Olesen	41a7b0951b	Teach VirtRegRewriter about the new virtual register numbers. No functional change. llvm-svn: 128561	2011-03-30 18:14:04 +00:00
Jay Foad	53632b7c03	Remove PHINode::reserveOperandSpace(). Instead, add a parameter to PHINode::Create() giving the (known or expected) number of operands. llvm-svn: 128537	2011-03-30 11:28:46 +00:00
Jay Foad	dc5a008237	(Almost) always call reserveOperandSpace() on newly created PHINodes. llvm-svn: 128535	2011-03-30 11:19:20 +00:00
Jakob Stoklund Olesen	8ce46ee438	Treat clones the same as their origin. When DCE clones a live range because it separates into connected components, make sure that the clones enter the same register allocator stage as the register they were cloned from. For instance, clones may be split even when they where created during spilling. Other registers created during spilling are not candidates for splitting or even (re-)spilling. llvm-svn: 128524	2011-03-30 02:52:39 +00:00
Jim Grosbach	47b87dbc29	Tidy up. 80 columns and trailing whitespace. llvm-svn: 128504	2011-03-29 23:20:22 +00:00
Jakob Stoklund Olesen	a292fa3d1e	Recompute register class and hint for registers created during spilling. The spill weight is not recomputed for an unspillable register - it stays infinite. llvm-svn: 128490	2011-03-29 21:20:19 +00:00
Jakob Stoklund Olesen	229e589bd1	Remember to use the correct register when rematerializing for snippets. llvm-svn: 128469	2011-03-29 17:47:02 +00:00
Jakob Stoklund Olesen	4676323ac8	Run dead code elimination immediately after rematerialization. This may eliminate some uses of the spilled registers, and we don't want to insert reloads for that. llvm-svn: 128468	2011-03-29 17:47:00 +00:00
Bill Wendling	7469ccb3bd	Inline check that's used only once. llvm-svn: 128465	2011-03-29 17:12:55 +00:00
Bill Wendling	47b8e67328	Rework the logic (and removing the bad check for an unreachable block) so that the FailBB dominator is correctly calculated. Believe it or not, there isn't a functionality change here. llvm-svn: 128455	2011-03-29 07:28:52 +00:00
Bill Wendling	ded022ad8b	Don't try to add stack protector logic to a dead basic block. It messes up dominator information. llvm-svn: 128452	2011-03-29 05:15:48 +00:00
Jakob Stoklund Olesen	a92d74e8cb	Handle the special case when all uses follow the last split point. llvm-svn: 128450	2011-03-29 03:12:04 +00:00
Jakob Stoklund Olesen	c209e050dd	Properly enable rematerialization when spilling after live range splitting. The instruction to be rematerialized may not be the one defining the register that is being spilled. The traceSiblingValue() function sees through sibling copies to find the remat candidate. llvm-svn: 128449	2011-03-29 03:12:02 +00:00
Bill Wendling	cb8447ad52	In some cases, the "fail BB dominator" may be null after the BB was split (and becomes reachable when before it wasn't). Check to make sure that it's not null before trying to use it. llvm-svn: 128434	2011-03-28 23:02:18 +00:00
Daniel Dunbar	cec6959c23	Integrated-As: Add support for setting the AllowTemporaryLabels flag via integrated-as. llvm-svn: 128431	2011-03-28 22:49:19 +00:00
Jakob Stoklund Olesen	18eaae730c	Amend debug output. llvm-svn: 128398	2011-03-27 22:49:23 +00:00
Jakob Stoklund Olesen	9b9cae35db	Drop interference reassignment in favor of eviction. The reassignment phase was able to move interference with a higher spill weight, but it didn't happen very often and it was fairly expensive. The existing interference eviction picks up the slack. llvm-svn: 128397	2011-03-27 22:49:21 +00:00
Jakob Stoklund Olesen	25ff895ebe	Use individual register classes when spilling snippets. The main register class may have been inflated by live range splitting, so that register class is not necessarily valid for the snippet instructions. Use the original register class for the stack slot interval. llvm-svn: 128351	2011-03-26 22:16:41 +00:00
Benjamin Kramer	f9e1ba7398	Turn SelectionDAGBuilder::GetRegistersForValue into a local function. It couldn't be used outside of the file because SDISelAsmOperandInfo is local to SelectionDAGBuilder.cpp. Making it a static function avoids a weird linkage dance. llvm-svn: 128342	2011-03-26 16:35:10 +00:00

... 5 6 7 8 9 ...

12148 Commits