llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
David Blaikie	d6a0e067d7	Add '*' to auto variable that is a pointer, as per the coding conventions. llvm-svn: 221033	2014-11-01 01:03:39 +00:00
David Blaikie	bc2a3611cd	Add DwarfCompileUnit::getSkeleton that returns DwarfCompileUnit* to avoid having to cast from DwarfUnit* on every call. llvm-svn: 221031	2014-11-01 00:50:34 +00:00
Duncan P. N. Exon Smith	7004fd9aac	IR: MDNode => Value: Instruction::getMetadata() Change `Instruction::getMetadata()` to return `Value` as part of PR21433. Update most callers to use `Instruction::getMDNode()`, which wraps the result in a `cast_or_null<MDNode>`. llvm-svn: 221024	2014-11-01 00:10:31 +00:00
David Blaikie	1e29b40b41	Sink some of DwarfDebug::collectDeadVariables down into DwarfCompileUnit. llvm-svn: 221010	2014-10-31 22:30:30 +00:00
David Blaikie	da34a7bac5	Sink most of DwarfDebug::constructAbstractSubprogramScopeDIE into DwarfCompileUnit llvm-svn: 221005	2014-10-31 21:57:02 +00:00
Quentin Colombet	06167df4ad	[CodeGenPrepare] Move extractelement close to store if they can be combined. This patch adds an optimization in CodeGenPrepare to move an extractelement right before a store when the target can combine them. The optimization may promote any scalar operations to vector operations in the way to make that possible. Context Some targets use different register files for both vector and scalar operations. This means that transitioning from one domain to another may incur copy from one register file to another. These copies are not coalescable and may be expensive. For example, according to the scheduling model, on cortex-A8 a vector to GPR move is 20 cycles. Motivating Example Let us consider an example: define void @foo(<2 x i32>* %addr1, i32* %dest) { %in1 = load <2 x i32>* %addr1, align 8 %extract = extractelement <2 x i32> %in1, i32 1 %out = or i32 %extract, 1 store i32 %out, i32* %dest, align 4 ret void } As it is, this IR generates the following assembly on armv7: vldr d16, [r0] @vector load vmov.32 r0, d16[1] @ cross-register-file copy: 20 cycles orr r0, r0, #1 @ scalar bitwise or str r0, [r1] @ scalar store bx lr Whereas we could generate much faster code: vldr d16, [r0] @ vector load vorr.i32 d16, #0x1 @ vector bitwise or vst1.32 {d16[1]}, [r1:32] @ vector extract + store bx lr Half of the computation made in the vector is useless, but this allows to get rid of the expensive cross-register-file copy. Proposed Solution To avoid this cross-register-copy penalty, we promote the scalar operations to vector operations. The penalty will be removed if we manage to promote the whole chain of computation in the vector domain. Currently, we do that only when the chain of computation ends by a store and the target is able to combine an extract with a store. Stores are the most likely candidates, because other instructions produce values that would need to be promoted and so, extracted as some point[1]. Moreover, this is customary that targets feature stores that perform a vector extract (see AArch64 and X86 for instance). The proposed implementation relies on the TargetTransformInfo to decide whether or not it is beneficial to promote a chain of computation in the vector domain. Unfortunately, this interface is rather inaccurate for this level of details and although this optimization may be beneficial for X86 and AArch64, the inaccuracy will lead to the optimization being too aggressive. Basically in TargetTransformInfo, everything that is legal has a cost of 1, whereas, even if a vector type is legal, usually a vector operation is slightly more expensive than its scalar counterpart. That will lead to too many promotions that may not be counter balanced by the saving of the cross-register-file copy. For instance, on AArch64 this penalty is just 4 cycles. For now, the optimization is just enabled for ARM prior than v8, since those processors have a larger penalty on cross-register-file copies, and the scope is limited to basic blocks. Because of these two factors, we limit the effects of the inaccuracy. Indeed, I did not want to build up a fancy cost model with block frequency and everything on top of that. [1] We can imagine targets that can combine an extractelement with other instructions than just stores. If we want to go into that direction, the current interfaces must be augmented and, moreover, I think this becomes a global isel problem. Differential Revision: http://reviews.llvm.org/D5921 <rdar://problem/14170854> llvm-svn: 220978	2014-10-31 17:52:53 +00:00
David Blaikie	24cb75d1b9	Correct assert text from r220923 Noticed in post-commit review by Adrian Prantl. llvm-svn: 220967	2014-10-31 16:45:36 +00:00
Hao Liu	6cc87eb119	PR20557: Fix the bug that bogus cpu parameter crashes llc on AArch64 backend. Initial patch by Oleg Ranevskyy. llvm-svn: 220945	2014-10-31 02:35:34 +00:00
Ahmed Bougacha	38c0bf429c	[SelectionDAG] When scalarizing trunc, don't assert for legal operands. r212242 introduced a legalizer hook, originally to let AArch64 widen v1i{32,16,8} rather than scalarize, because the legalizer expected, when scalarizing the result of a conversion operation, to already have scalarized the operands. On AArch64, v1i64 is legal, so that commit ensured operations such as v1i32 = trunc v1i64 wouldn't assert. It did that by choosing to widen v1 types whenever possible. However, v1i1 types, for which there's no legal widened type, would still trigger the assert. This commit fixes that, by only scalarizing a trunc's result when the operand has already been scalarized, and introducing an extract_elt otherwise. This is similar to r205625. Fixes PR20777. llvm-svn: 220937	2014-10-30 23:46:50 +00:00
Louis Gerbarg	6f92b8978d	Fix incorrect invariant check in DAG Combine Earlier this summer I fixed an issue where we were incorrectly combining multiple loads that had different constraints such alignment, invariance, temporality, etc. Apparently in one case I made copt paste error and swapped alignment and invariance. Tests included. rdar://18816719 llvm-svn: 220933	2014-10-30 22:21:03 +00:00
David Blaikie	dc51b0f8dd	PR21408: Workaround the appearance of duplicate variables due to problems when inlining two calls to the same function from the same call site. llvm-svn: 220923	2014-10-30 20:20:11 +00:00
NAKAMURA Takumi	ee3b3d3d09	Whitespace. llvm-svn: 220857	2014-10-29 15:23:11 +00:00
David Blaikie	35385231e3	Minimize the scope of some variables, NFC. llvm-svn: 220759	2014-10-28 02:57:26 +00:00
Lang Hames	77d387a954	[PBQP] Unique allowed-sets for nodes in the PBQP graph and use pairs of these sets as keys into a cache of interference matrice values in the Interference constraint adder. Creating interference matrices was one of the large remaining time-sinks in PBQP. Caching them reduces the total compile time (when using PBQP) on the nightly test suite by ~10%. llvm-svn: 220688	2014-10-27 17:44:25 +00:00
David Blaikie	5a24603108	Remove some unnecessary casts. llvm-svn: 220658	2014-10-26 23:37:04 +00:00
Frederic Riss	1a2ce34071	Sink DwarfUnit::constructImportedEntityDIE into DwarfCompileUnit. So that it has access to getOrCreateGlobalVariableDIE. If we ever support decsribing using directive in C++ classes (thus requiring support in type units), it will certainly use another mechanism anyway. Differential Revision: http://reviews.llvm.org/D5975 llvm-svn: 220594	2014-10-24 21:31:09 +00:00
Matt Arsenault	cef46eb164	Fix copy paste comment llvm-svn: 220581	2014-10-24 18:13:10 +00:00
David Blaikie	21ab861fa1	DebugInfo: Sink DwarfDebug::ScopeVariables down into DwarfFile (part of refactoring to allow subprogram emission in both the skeleton and main units to enable -gmlt-like data to be included in the skeleton for live inlined backtracing purposes) llvm-svn: 220578	2014-10-24 17:57:34 +00:00
David Blaikie	1054961712	Remove DwarfDebug::FirstCU as it has no use It was only being used as a flag to identify the lack of debug info from within endModule - use the section labels for that instead. llvm-svn: 220575	2014-10-24 17:53:38 +00:00
Sanjay Patel	d9b7837012	Use rsqrt (X86) to speed up reciprocal square root calcs This is a first step for generating SSE rsqrt instructions for reciprocal square root calcs when fast-math is allowed. For now, be conservative and only enable this for AMD btver2 where performance improves significantly - for example, 29% on llvm/projects/test-suite/SingleSource/Benchmarks/BenchmarkGame/n-body.c (if we convert the data type to single-precision float). This patch adds a two constant version of the Newton-Raphson refinement algorithm to DAGCombiner that can be selected by any target via a parameter returned by getRsqrtEstimate().. See PR20900 for more details: http://llvm.org/bugs/show_bug.cgi?id=20900 Differential Revision: http://reviews.llvm.org/D5658 llvm-svn: 220570	2014-10-24 17:02:16 +00:00
Marcello Maggioni	835ff8fe13	Added reset of LexicalScope in LiveDebugVariables reset function. llvm-svn: 220545	2014-10-24 02:46:50 +00:00
Timur Iskhodzhanov	04c11d578f	Fix PR21189 -- Emit symbol subsection required to debug LLVM-built binaries with VS2012+ Reviewed at http://reviews.llvm.org/D5772 llvm-svn: 220544	2014-10-24 01:27:45 +00:00
David Blaikie	11c3a3f6f2	DebugInfo: Remove DwarfDebug::addScopeVariable now that it's just a trivial wrapper llvm-svn: 220542	2014-10-24 00:43:47 +00:00
Ahmed Bougacha	a035d19b50	[SelectionDAG] Teach the vector scalarizer about FP conversions. This adds support for legalization of instructions of the form: [fp_conv] <1 x i1> %op to <1 x double> where fp_conv is one of fpto[us]i, [us]itofp. This used to assert because they were simply missing from the vector operand scalarizer. A similar problem arose in r190830, with trunc instead. Fixes PR20778. Differential Revision: http://reviews.llvm.org/D5810 llvm-svn: 220533	2014-10-23 22:49:25 +00:00
Ahmed Bougacha	4e0335a62d	Update comment and fix typos in assert message. (NFC) llvm-svn: 220531	2014-10-23 22:40:34 +00:00
Tim Northover	d882ba8bc5	ScheduleDAG: record PhysReg dependencies represented by CopyFromReg nodes x86's CMPXCHG -> EFLAGS consumer wasn't being recorded as a real EFLAGS dependency because it was represented by a pair of CopyFromReg(EFLAGS) -> CopyToReg(EFLAGS) nodes. ScheduleDAG was expecting the source to be an implicit-def on the instruction, where the result numbers in the DAG and the Uses list in TableGen matched up precisely. The Copy notation seems much more robust, so this patch extends ScheduleDAG rather than refactoring x86. Should fix PR20376. llvm-svn: 220529	2014-10-23 22:31:48 +00:00
David Blaikie	92e415962d	DebugInfo: Remove DwarfDebug::CurrentFnArguments since we have to handle argument ordering of other arguments (abstract arguments) in the same way and already have code for that too. While refactoring this code I was confused by both the name I had introduced (addNonArgumentVariable... but it has all this logic to handle argument numbering and keep things in order?) and by the redundancy. Seems when I fixed the misordered inlined argument handling, I didn't realize it was mostly redundant with the argument ordering code (which I may've also written, I'm not sure). So let's just rely on the more general case. The only oddity in output this produces is that it means when we emit all the variables for the current function, we don't track when we've finished the argument variables and are about to start the local variables and insert DW_AT_unspecified_parameters (for varargs functions) there. Instead it ends up after the local variables, scopes, etc. But this isn't invalid and doesn't cause DWARF consumers problems that I know of... so we'll just go with that because it makes the code nice & simple. (though, let's see what the buildbots have to say about this - crosses fingers) There will be some cleanup commits to follow to remove the now trivial wrappers, etc. llvm-svn: 220527	2014-10-23 22:27:50 +00:00
David Blaikie	c325cd7123	DebugInfo: Sink DwarfDebug::addNonArgumentScopeVariable into DwarfFile. llvm-svn: 220520	2014-10-23 22:04:30 +00:00
David Blaikie	dbed952309	DebugInfo: Remove DwarfDebug::addCurrentFnArgument declaration now that it's moved to DwarfFile. llvm-svn: 220515	2014-10-23 21:53:17 +00:00
David Blaikie	1e1e6fb31c	DebugInfo: Simplify/tidy/correct global variable decl/def emission handling. This fixes a bug (introduced by fixing the IR emitted from Clang where the definition of a static member would be scoped within the class, rather than within its lexical decl context) where the definition of a static variable would be placed inside a class. It also improves source fidelity by scoping static class member definitions inside the lexical decl context in which tehy are written (eg: namespace n { class foo { static int i; } int foo::i; } - the definition of 'i' will be within the namespace 'n' in the DWARF output now). Lastly, and the original goal, this reduces debug info size slightly (and makes debug info easier to read, etc) by placing the definitions of non-member global variables within their namespace, rather than using a separate namespace-scoped declaration along with a definition at global scope. Based on patches and discussion with Frédéric. llvm-svn: 220497	2014-10-23 19:12:43 +00:00
David Blaikie	8eacc975e4	Remove explicit (void) use of DwarfFile::DD that was accidentally left in r220452. Caught in post-commit review by Frédéric. llvm-svn: 220487	2014-10-23 16:12:58 +00:00
David Blaikie	e3b5e8b37a	[DebugInfo] Sink DwarfDebug::addCurrentFnArgument down into DwarfFile. Variable handling will be sunk into DwarfFile so that abstract variables and the like can be shared across multiple CUs (to handle cross-CU inlining, for example). llvm-svn: 220453	2014-10-23 00:16:05 +00:00
David Blaikie	f0eb7b0322	[DebugInfo] Add DwarfDebug& to DwarfFile. Use the DwarfDebug in one function that previously took it as a parameter, and lay the foundation for use this for other operations coming soon. llvm-svn: 220452	2014-10-23 00:16:03 +00:00
David Blaikie	e782ff6864	[DebugInfo] Remove LexicalScopes::isCurrentFunctionScope and CSE a use of LexicalScopes::getCurrentFunctionScope Now that we're sure the only root (non-abstract) scope is the current function scope, there's no need for isCurrentFunctionScope, the property can be tested directly instead. llvm-svn: 220451	2014-10-23 00:06:27 +00:00
Benjamin Kramer	a6c059251d	Strength reduce constant-sized vectors into arrays. No functionality change. llvm-svn: 220412	2014-10-22 19:55:26 +00:00
Matt Arsenault	c95fbccb3f	Fix typo llvm-svn: 220353	2014-10-22 00:28:59 +00:00
Matt Arsenault	2257f6b589	Add minnum / maxnum codegen llvm-svn: 220342	2014-10-21 23:01:01 +00:00
Arnaud A. de Grandmaison	3555931ec7	Pacify bots and simplify r220321 llvm-svn: 220335	2014-10-21 21:50:49 +00:00
Arnaud A. de Grandmaison	73624b6ac4	[PBQP] Teach PassConfig to tell if the default register allocator is used. This enables targets to adapt their pass pipeline to the register allocator in use. For example, with the AArch64 backend, using PBQP with the cortex-a57, the FPLoadBalancing pass is no longer necessary. llvm-svn: 220321	2014-10-21 20:47:22 +00:00
Arnaud A. de Grandmaison	f39e772ae9	[PBQP] Fix coalescing benefits As coalescing registers is a benefit, the cost should be improved (i.e. made smaller) when coalescing is possible. llvm-svn: 220302	2014-10-21 16:24:15 +00:00
Rafael Espindola	6ffbd5bf5d	Fix a bit of confusion about .set and produce more readable assembly. Every target we support has support for assembly that looks like a = b - c .long a What is special about MachO is that the above combination suppresses the production of a relocation. With this change we avoid producing the intermediary labels when they don't add any value. llvm-svn: 220256	2014-10-21 01:17:30 +00:00
Rafael Espindola	72d274d9e2	Make AsmPrinter::EmitLabelOffsetDifference a static helper and simplify. It had exactly one caller in a position where we know hasSetDirective is true. llvm-svn: 220250	2014-10-21 00:25:49 +00:00
Philip Reames	c3e4c79873	Introduce enum values for previously defined metadata types. (NFC) Our metadata scheme lazily assigns IDs to string metadata, but we have a mechanism to preassign them as well. Using a preassigned ID is helpful since we get compile time type checking, and avoid some (minimal) string construction and comparison. This change adds enum value for three existing metadata types: + MD_nontemporal = 9, // "nontemporal" + MD_mem_parallel_loop_access = 10, // "llvm.mem.parallel_loop_access" + MD_nonnull = 11 // "nonnull" I went through an updated various uses as well. I made no attempt to get all uses; I focused on the ones which were easily grepable and easily to translate. For example, there were several items in LoopInfo.cpp I chose not to update. llvm-svn: 220248	2014-10-21 00:13:20 +00:00
Lang Hames	86d69f67b6	[PBQP] Replace the interference-constraints algorithm with a faster version loosely based on linear scan. On x86-64 this is good for a ~2% drop in compile time on the nightly test suite. llvm-svn: 220143	2014-10-18 17:26:07 +00:00
Pete Cooper	bedb6f3c4b	Check for dynamic alloca's when selecting lifetime intrinsics. TL;DR: Indexing maps with [] creates missing entries. The long version: When selecting lifetime intrinsics, we index the static alloca map with the AllocaInst we find for that lifetime. Trouble is, we don't first check to see if this is a dynamic alloca. On the attached example, this causes a dynamic alloca to create an entry in the static map, and returns 0 (the default) as the frame index for that lifetime. 0 was used for the frame index of the stack protector, which given that it now has a lifetime, is coloured, and merged with other stack slots. PEI would later trigger an assert because it expects the stack protector to not be dead. This fix ensures that we only get frame indices for static allocas, ie, those in the map. Dynamic ones are effectively dropped, which is suboptimal, but at least isn't completely broken. rdar://problem/18672951 llvm-svn: 220099	2014-10-17 22:59:33 +00:00
Juergen Ributzka	99ffd17333	[Stackmaps] Enable invoking the patchpoint intrinsic. Patch by Kevin Modzelewski Reviewers: atrick, ributzka Reviewed By: ributzka Subscribers: llvm-commits, reames Differential Revision: http://reviews.llvm.org/D5634 llvm-svn: 220055	2014-10-17 17:39:00 +00:00
Jan Vesely	31f817808d	SelectionDAG: Add sext_inreg optimizations v2: use dyn_cast fixup comments v3: use cast Reviewed-by: Matt Arsenault <arsenm2@gmail.com> Signed-off-by: Jan Vesely <jan.vesely@rutgers.edu> llvm-svn: 220044	2014-10-17 14:45:25 +00:00
Juergen Ributzka	00a783c163	Reduce code duplication between patchpoint and non-patchpoint lowering. NFC. This is in preparation for another patch that makes patchpoints invokable. Reviewers: atrick, ributzka Reviewed By: ributzka Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5657 llvm-svn: 219967	2014-10-16 21:26:35 +00:00
Robin Morisset	8dc41d55aa	Erase fence insertion from SelectionDAGBuilder.cpp (NFC) Summary: Backends can use setInsertFencesForAtomic to signal to the middle-end that montonic is the only memory ordering they can accept for stores/loads/rmws/cmpxchg. The code lowering those accesses with a stronger ordering to fences + monotonic accesses is currently living in SelectionDAGBuilder.cpp. In this patch I propose moving this logic out of it for several reasons: - There is lots of redundancy to avoid: extremely similar logic already exists in AtomicExpand. - The current code in SelectionDAGBuilder does not use any target-hooks, it does the same transformation for every backend that requires it - As a result it is plain unsound, as it was apparently designed for ARM. It happens to mostly work for the other targets because they are extremely conservative, but Power for example had to switch to AtomicExpand to be able to use lwsync safely (see r218331). - Because it produces IR-level fences, it cannot be made sound ! This is noted in the C++11 standard (section 29.3, page 1140): ``` Fences cannot, in general, be used to restore sequential consistency for atomic operations with weaker ordering semantics. ``` It can also be seen by the following example (called IRIW in the litterature): ``` atomic<int> x = y = 0; int r1, r2, r3, r4; Thread 0: x.store(1); Thread 1: y.store(1); Thread 2: r1 = x.load(); r2 = y.load(); Thread 3: r3 = y.load(); r4 = x.load(); ``` r1 = r3 = 1 and r2 = r4 = 0 is impossible as long as the accesses are all seq_cst. But if they are lowered to monotonic accesses, no amount of fences can prevent it.. This patch does three things (I could cut it into parts, but then some of them would not be tested/testable, please tell me if you would prefer that): - it provides a default implementation for emitLeadingFence/emitTrailingFence in terms of IR-level fences, that mimic the original logic of SelectionDAGBuilder. As we saw above, this is unsound, but the best that can be done without knowing the targets well (and there is a comment warning about this risk). - it then switches Mips/Sparc/XCore to use AtomicExpand, relying on this default implementation (that exactly replicates the logic of SelectionDAGBuilder, so no functional change) - it finally erase this logic from SelectionDAGBuilder as it is dead-code. Ideally, each target would define its own override for emitLeading/TrailingFence using target-specific fences, but I do not know the Sparc/Mips/XCore memory model well enough to do this, and they appear to be dealing fine with the ARM-inspired default expansion for now (probably because they are overly conservative, as Power was). If anyone wants to compile fences more agressively on these platforms, the long comment should make it clear why he should first override emitLeading/TrailingFence. Test Plan: make check-all, no functional change Reviewers: jfb, t.p.northover Subscribers: aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D5474 llvm-svn: 219957	2014-10-16 20:34:57 +00:00
Eric Christopher	64e055d96c	Avoid caching the MachineFunction, we don't use it outside of runOnMachineFunction. llvm-svn: 219847	2014-10-15 21:06:25 +00:00
Rafael Espindola	1dba93c519	Simplify handling of --noexecstack by using getNonexecutableStackSection. llvm-svn: 219799	2014-10-15 16:12:52 +00:00
Jingyue Wu	3ee10fc280	[MachineSink] Use the real post dominator tree Summary: Fixes a FIXME in MachineSinking. Instead of using the simple heuristics in isPostDominatedBy, use the real MachinePostDominatorTree and MachineLoopInfo. The old heuristics caused instructions to sink unnecessarily, and might create register pressure. This is the second try of the fix. The first one (D4814) caused a performance regression due to failing to sink instructions out of loops (PR21115). This patch fixes PR21115 by sinking an instruction from a deeper loop to a shallower one regardless of whether the target block post-dominates the source. Thanks Alexey Volkov for reporting PR21115! Test Plan: Added a NVPTX codegen test to verify that our change prevents the backend from over-sinking. It also shows the unnecessary register pressure caused by over-sinking. Added an X86 test to verify we can sink instructions out of loops regardless of the dominance relationship. This test is reduced from Alexey's test in PR21115. Updated an affected test in X86. Also ran SPEC CINT2006 and llvm-test-suite for compilation time and runtime performance. Results are attached separately in the review thread. Reviewers: Jiangning, resistor, hfinkel Reviewed By: hfinkel Subscribers: hfinkel, bruno, volkalexey, llvm-commits, meheff, eliben, jholewinski Differential Revision: http://reviews.llvm.org/D5633 llvm-svn: 219773	2014-10-15 03:27:43 +00:00
Gerolf Hoflehner	fbd25ba142	[AAarch64] Optimize CSINC-branch sequence Peephole optimization that generates a single conditional branch for csinc-branch sequences like in the examples below. This is possible when the csinc sets or clears a register based on a condition code and the branch checks that register. Also the condition code may not be modified between the csinc and the original branch. Examples: 1. Convert csinc w9, wzr, wzr, <CC>;tbnz w9, #0, 0x44 to b.<invCC> 2. Convert csinc w9, wzr, wzr, <CC>; tbz w9, #0, 0x44 to b.<CC> rdar://problem/18506500 llvm-svn: 219742	2014-10-14 23:07:53 +00:00
Rafael Espindola	c98553226f	Remove unused member variable. Fixes pr20904. llvm-svn: 219706	2014-10-14 18:53:16 +00:00
David Blaikie	2044b61ced	DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself. Let me tell you a tale... Originally committed in r211723 after discovering a nasty case of weird scoping due to inlining, this was reverted in r211724 after it fired in ASan/compiler-rt. (minor diversion where I accidentally committed/reverted again in r211871/r211873) After further testing and fixing bugs in ArgumentPromotion (r211872) and Inlining (r212065) it was recommitted in r212085. Reverted in r212089 after the sanitizer buildbots still showed problems. Fixed another bug in ArgumentPromotion (r212128) found by this assertion. Recommitted in r212205, reverted in r212226 after it crashed some more on sanitizer buildbots. Fix clang some more in r212761. Recommitted in r212776, reverted in r212793. ASan failures. Recommitted in r213391, reverted in r213432, trying to reproduce flakey ASan build failure. Fixed bugs in r213805 (ArgPromo + DebugInfo), r213952 (LiveDebugVariables strips dbg_value intrinsics in functions not described by debug info). Recommitted in r214761, reverted in r214999, flakey failure on Windows buildbot. Fixed DeadArgElimination + DebugInfo bug in r219210. Recommitted in r219215, reverted in r219512, failure on ObjC++ atomic properties in the test-suite on Darwin. Fixed ObjC++ atomic properties issue in Clang in r219690. [This commit is provided 'as is' with no hope that this is the last time I commit this change either expressed or implied] llvm-svn: 219702	2014-10-14 18:22:52 +00:00
David Blaikie	c221665891	Revert "Fix stuff... again." Accidental commit. This reverts commit r219693. llvm-svn: 219695	2014-10-14 17:13:09 +00:00
David Blaikie	7ece1f98c4	Revert some parts of r196288 that were confusing and untested. If we figure out why they should be here, let's add some testing of some kind so we can better demonstrate why it's needed. llvm-svn: 219694	2014-10-14 17:12:02 +00:00
David Blaikie	47d8204a49	Fix stuff... again. llvm-svn: 219693	2014-10-14 17:11:59 +00:00
Eric Christopher	45581b7332	Remove unnecessary TargetMachine.h includes. llvm-svn: 219672	2014-10-14 07:22:08 +00:00
Eric Christopher	e4d58538b3	Grab the subtarget and subtarget dependent variables off of MachineFunction rather than TargetMachine. llvm-svn: 219671	2014-10-14 07:22:00 +00:00
Eric Christopher	db23ede8d2	Grab the subtarget and subtarget dependent variables off of MachineFunction rather than TargetMachine. llvm-svn: 219670	2014-10-14 07:17:23 +00:00
Eric Christopher	feee977cf4	Instead of the TargetMachine cache the MachineFunction and TargetRegisterInfo in the peephole optimizer. This makes it easier to grab subtarget dependent variables off of the MachineFunction rather than the TargetMachine. llvm-svn: 219669	2014-10-14 07:17:20 +00:00
Eric Christopher	f0b9de507b	Access subtarget specific variables off of the MachineFunction's cached subtarget and not the TargetMachine. llvm-svn: 219668	2014-10-14 07:00:33 +00:00
Eric Christopher	82bb988565	Access the subtarget off of the MachineFunction via the DAG scheduler or via the SelectionDAG if available. Otherwise grab the subtarget off of the MachineFunction by going up the parent chain. llvm-svn: 219666	2014-10-14 06:56:25 +00:00
Eric Christopher	d5f6f42397	Remove the use and member variable of the TargetMachine from MachineLICM as we can get the same data off of the MachineFunction. llvm-svn: 219663	2014-10-14 06:26:57 +00:00
Eric Christopher	6f2145e3fc	Have MachineInstrBundle use the MachineFunction for subtarget access rather than the TargetMachine. llvm-svn: 219662	2014-10-14 06:26:55 +00:00
Eric Christopher	6f44ab137d	Access the subtarget off of the MachineFunction rather than through the TargetMachine. llvm-svn: 219661	2014-10-14 06:26:53 +00:00
Eric Christopher	15c10d51e5	Remove the TargetMachine from DFAPacketizer since it was only being used to grab subtarget specific things that we can grab from the MachineFunction anyhow. llvm-svn: 219650	2014-10-14 01:03:16 +00:00
Eric Christopher	1c880c715a	Migrate another set of getSubtargetImpl away. llvm-svn: 219636	2014-10-13 21:57:44 +00:00
Adrian Prantl	f90178790d	Add an assertion about the integrity of the iterator. Broken parent scope pointers in inlined DIVariables can cause ensureAbstractVariableIsCreated to insert new abstract scopes, thus invalidating the iterator in this loop and leading to hard-to-debug crashes. Useful when manually reducing IR for testcases. llvm-svn: 219628	2014-10-13 20:44:58 +00:00
Adrian Prantl	cf6b88a8d4	constify the getters in SDNodeDbgValue. llvm-svn: 219627	2014-10-13 20:43:47 +00:00
Chad Rosier	b3aab2d6f3	Refactor debug statement and remove dead argument. NFC. llvm-svn: 219626	2014-10-13 19:46:39 +00:00
Benjamin Kramer	6b288538ef	Modernize old-style static asserts. NFC. llvm-svn: 219588	2014-10-12 17:56:40 +00:00
David Blaikie	b2150c9f0a	Revert "DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself." This invariant is violated (& the assertions fire) on some Objective C++ in the test-suite. Reverting while I investigate. This reverts commit r219215. llvm-svn: 219523	2014-10-10 18:46:21 +00:00
Hal Finkel	8eb3f6d6a0	[MiSched] Fix a logic error in tryPressure() Fixes a logic error in the MachineScheduler found by Steve Montgomery (and confirmed by Andy). This has gone unfixed for months because the fix has been found to introduce some small performance regressions. However, Andy has recommended that, at this point, we fix this to avoid further dependence on the incorrect behavior (and then follow-up separately on any regressions), and I agree. Fixes PR18883. llvm-svn: 219512	2014-10-10 17:06:20 +00:00
David Blaikie	9cbee3a665	Simplify a few uses of DwarfDebug::SPMap llvm-svn: 219510	2014-10-10 16:59:52 +00:00
Timur Iskhodzhanov	c1bf6e5e98	Reorder functions in WinCodeViewLineTables.cpp [NFC] This helps read the comments and understand the code in a natural order llvm-svn: 219508	2014-10-10 16:05:32 +00:00
Benjamin Kramer	f35a067b43	Reduce double set lookups. NFC. llvm-svn: 219505	2014-10-10 15:32:50 +00:00
Timur Iskhodzhanov	6a73d5beb3	Fix a small typo, NFC llvm-svn: 219492	2014-10-10 12:52:58 +00:00
David Blaikie	6a3e82d728	Sink the per-CU part of DwarfDebug::finishSubprogramDefinitions into DwarfCompileUnit. llvm-svn: 219477	2014-10-10 06:39:29 +00:00
David Blaikie	39b2185feb	Sink most of DwarfDebug::constructAbstractSubprogramScopeDIE down into DwarfCompileUnit. llvm-svn: 219476	2014-10-10 06:39:26 +00:00
David Blaikie	ece83cd7c3	Avoid unnecessary map lookup/insertion. llvm-svn: 219466	2014-10-10 03:09:38 +00:00
Sanjay Patel	78e4aafd3f	Improve sqrt estimate algorithm (fast-math) This patch changes the fast-math implementation for calculating sqrt(x) from: y = 1 / (1 / sqrt(x)) to: y = x * (1 / sqrt(x)) This has 2 benefits: less code / faster code and one less estimate instruction that may lose precision. The only target that will be affected (until http://reviews.llvm.org/D5658 is approved) is PPC. The difference in codegen for PPC is 2 less flops for a single-precision sqrtf or vector sqrtf and 4 less flops for a double-precision sqrt. We also eliminate a constant load and extra register usage. Differential Revision: http://reviews.llvm.org/D5682 llvm-svn: 219445	2014-10-09 21:26:35 +00:00
Sanjay Patel	dc8836d89c	delete function names from comments llvm-svn: 219444	2014-10-09 21:24:46 +00:00
David Blaikie	c848d1d43c	Remove unused parameter llvm-svn: 219440	2014-10-09 20:36:27 +00:00
David Blaikie	5cc331ae79	Sink DwarfDebug::createAndAddScopeChildren down into DwarfCompileUnit. llvm-svn: 219437	2014-10-09 20:26:15 +00:00
David Blaikie	6e8ade1095	Sink DwarfDebug::constructSubprogramScopeDIE down into DwarfCompileUnit llvm-svn: 219436	2014-10-09 20:21:36 +00:00
David Blaikie	477f8cb9ba	Sink DwarfDebug::createScopeChildrenDIE down into DwarfCompileUnit. llvm-svn: 219422	2014-10-09 18:24:28 +00:00
Lang Hames	1ac2927c37	[PBQP] Replace PBQPBuilder with composable constraints (PBQPRAConstraint). This patch removes the PBQPBuilder class and its subclasses and replaces them with a composable constraints class: PBQPRAConstraint. This allows constraints that are only required for optimisation (e.g. coalescing, soft pairing) to be mixed and matched. This patch also introduces support for target writers to supply custom constraints for their targets by overriding a TargetSubtargetInfo method: std::unique_ptr<PBQPRAConstraints> getCustomPBQPConstraints() const; This patch should have no effect on allocations. llvm-svn: 219421	2014-10-09 18:20:51 +00:00
David Blaikie	c7aba518d5	Sink DwarfDebug.cpp::constructVariableDIE into DwarfCompileUnit. llvm-svn: 219419	2014-10-09 17:56:39 +00:00
David Blaikie	78c64030db	Move DwarfUnit::constructVariableDIE down to DwarfCompileUnit, since it's only needed there. llvm-svn: 219418	2014-10-09 17:56:36 +00:00
David Blaikie	06c6662b4d	Sink DwarfDebug::constructLexicalScopeDIE into DwarfCompileUnit llvm-svn: 219414	2014-10-09 17:08:42 +00:00
David Blaikie	c49791b401	Missing reformatting llvm-svn: 219413	2014-10-09 17:08:38 +00:00
David Blaikie	1a81b6999c	Sink DwarfDebug::constructInlinedScopeDIE into DwarfCompileUnit This introduces access to the AbstractSPDies map from DwarfDebug so DwarfCompileUnit can access it. Eventually this'll sink down to DwarfFile, but it'll still be generically accessible - not much encapsulation to provide it. (constructInlinedScopeDIE could stay further up, in DwarfFile to avoid exposing this - but I don't think that's particularly better) llvm-svn: 219411	2014-10-09 16:50:53 +00:00
Eric Christopher	78db37f8b8	Remove more calls to getSubtargetImpl from the schedulers and remove cached or unnecessary TargetMachines. llvm-svn: 219387	2014-10-09 06:28:06 +00:00
Eric Christopher	f9e1101078	Remove unused argument to CreateTargetScheduleState and change the TargetMachine to a TargetSubtargetInfo since everything we wanted is off of that. llvm-svn: 219382	2014-10-09 01:59:35 +00:00
Eric Christopher	d0436462af	Remove uses of getSubtargetImpl from ResourcePriorityQueue and replace them with calls off of the MachineFuncton. llvm-svn: 219381	2014-10-09 01:59:31 +00:00
Eric Christopher	864095b456	Remove the uses of getSubtargetImpl from InstrEmitter and remove the now unused TargetMachine variable. llvm-svn: 219379	2014-10-09 01:35:29 +00:00
Eric Christopher	6aec5f9e16	Use the subtarget on the dag to get TargetFrameLowering rather than off the target machine. llvm-svn: 219378	2014-10-09 01:35:27 +00:00
Eric Christopher	065fdace2e	Remove uses of the TargetMachine from FunctionLoweringInfo via caching TargetLowering and using the MachineFunction. llvm-svn: 219375	2014-10-09 00:57:31 +00:00
David Blaikie	4e189e100b	Push DwarfDebug::attachRangesOrLowHighPC down into DwarfCompileUnit llvm-svn: 219372	2014-10-09 00:21:42 +00:00
David Blaikie	f2869de8da	Sink DwarfDebug::addScopeRangeList down into DwarfCompileUnit (& add a few accessors/make a couple of things public for this - it's a bit of a toss-up, but I think I prefer it this way, keeping some more of the meaty code down in DwarfCompileUnit - if only to make for smaller implementation files, etc) I think we could simplify range handling a bit if we removed the range lists from each unit and just put a single range list on DwarfDebug, similar to address pooling. llvm-svn: 219370	2014-10-09 00:11:39 +00:00
Eric Christopher	fb763ff585	Remove unnecessary include. llvm-svn: 219368	2014-10-08 23:38:40 +00:00
Eric Christopher	c47c81ea7a	Use both the cached TLI and the subtarget off of the DAG in the DAG combiner. llvm-svn: 219367	2014-10-08 23:38:39 +00:00
Eric Christopher	2dae75a164	Remove getSubtargetImpl calls from FastISel, we can get it from the MachineFunction where it's already cached. llvm-svn: 219366	2014-10-08 23:38:33 +00:00
David Blaikie	86080e417c	Sink DwarfUnit::addSectionDelta into DwarfCompileUnit, the only place it's needed. llvm-svn: 219364	2014-10-08 23:30:05 +00:00
David Blaikie	f68bfbaaa6	Reformat some stuff I missed in recent previous commits llvm-svn: 219356	2014-10-08 23:09:42 +00:00
David Blaikie	802d562b37	Sink and coalesce DwarfDebug.cpp::addSectionLabel and DwarfUnit::addSectionLabel down into DwarfCompileUnit::addSectionLabel llvm-svn: 219351	2014-10-08 22:46:27 +00:00
Eric Christopher	a1080be376	Remove dead call to getTypeToTransformTo. The result is unused. llvm-svn: 219347	2014-10-08 22:25:45 +00:00
David Blaikie	644527ade8	DebugInfo: The rest of pushing DwarfDebug::constructScopeDIE down into DwarfCompileUnit Funnily enough, I copied it, but didn't actually remove the original in r219345. Let's do that. llvm-svn: 219346	2014-10-08 22:23:10 +00:00
David Blaikie	c48de79313	Push DwarfDebug::constructScopeDIE down into DwarfCompileUnit One of many steps to generalize subprogram emission to both the DWO and non-DWO sections (to emit -gmlt-like data under fission). Once the functions are pushed down into DwarfCompileUnit some of the data structures will be pushed at least into DwarfFile so that they can be unique per-file, allowing emission to both files independently. llvm-svn: 219345	2014-10-08 22:20:02 +00:00
Eric Christopher	860e89dbbd	Remove a bunch of getSubtargetImpl calls since we already have a cached TLI instance. llvm-svn: 219342	2014-10-08 21:08:32 +00:00
Timur Iskhodzhanov	7661ce203f	Fix COFF section index relocation should be 16 bits, not 32 Original patch by Andrey Guskov! http://reviews.llvm.org/D5651 llvm-svn: 219327	2014-10-08 18:01:49 +00:00
Eric Christopher	fdc12cc58d	Use the TargetLowering information we already have on the SelectionDAG in SelectionDAGBuilder rather than going through the TargetMachine for lookup. llvm-svn: 219292	2014-10-08 09:50:54 +00:00
Eric Christopher	fd1a5ab45f	Grab the TargetRegisterInfo off of the subtarget from the MachineFunction rather than a lookup on the TargetMachine to avoid unnecessary lookups. llvm-svn: 219291	2014-10-08 09:50:52 +00:00
Eric Christopher	c561a308cd	Replace calls to get the subtarget and TargetFrameLowering with cached variables and a single call in the constructor. llvm-svn: 219287	2014-10-08 08:46:34 +00:00
Eric Christopher	3d7c1d381d	Use cached subtarget rather than looking it up on the TargetMachine again. llvm-svn: 219285	2014-10-08 07:51:41 +00:00
Eric Christopher	ce3f63df4d	Cache TargetLowering on SelectionDAGISel and update previous calls to getTargetLowering() with the cached variable. llvm-svn: 219284	2014-10-08 07:32:17 +00:00
Eric Christopher	cb92393555	Cache SelectionDAGISel TargetInstrInfo lookups on the class and propagate. Also use the TargetSubtargetInfo and the MachineFunction and move TargetRegisterInfo query closer to uses. llvm-svn: 219273	2014-10-08 01:58:03 +00:00
Eric Christopher	cafbda86bf	Reset the target options and optimization level as the first thing we do inside selection dag. This code needs to be migrated to queries on the function rather than global data, but this organizes things before we start grabbing the subtarget. llvm-svn: 219271	2014-10-08 01:58:01 +00:00
Eric Christopher	8acda9a737	Have the selection dag grab TargetLowering off of the subtarget inside init rather than have it passed in as an argument. llvm-svn: 219270	2014-10-08 01:57:58 +00:00
Eric Christopher	7fc943eeed	Have SelectionDAG's subtarget TargetSelectionDAGInfo be set during init rather than construction time. llvm-svn: 219262	2014-10-08 00:32:59 +00:00
Sanjay Patel	9fc82843b4	typos llvm-svn: 219221	2014-10-07 17:38:33 +00:00
Sanjay Patel	53fd7152c7	typos llvm-svn: 219220	2014-10-07 17:36:50 +00:00
David Blaikie	4c9057a6de	DebugInfo: Ensure that all debug location scope chains from instructions within a function, lead to the function itself. Let me tell you a tale... Originally committed in r211723 after discovering a nasty case of weird scoping due to inlining, this was reverted in r211724 after it fired in ASan/compiler-rt. (minor diversion where I accidentally committed/reverted again in r211871/r211873) After further testing and fixing bugs in ArgumentPromotion (r211872) and Inlining (r212065) it was recommitted in r212085. Reverted in r212089 after the sanitizer buildbots still showed problems. Fixed another bug in ArgumentPromotion (r212128) found by this assertion. Recommitted in r212205, reverted in r212226 after it crashed some more on sanitizer buildbots. Fix clang some more in r212761. Recommitted in r212776, reverted in r212793. ASan failures. Recommitted in r213391, reverted in r213432, trying to reproduce flakey ASan build failure. Fixed bugs in r213805 (ArgPromo + DebugInfo), r213952 (LiveDebugVariables strips dbg_value intrinsics in functions not described by debug info). Recommitted in r214761, reverted in r214999, flakey failure on Windows buildbot. Fixed DeadArgElimination + DebugInfo bug in r219210. Recommitting and hoping that's the last of it. [That one burned down, fell over, then sank into the swamp.] llvm-svn: 219215	2014-10-07 16:56:20 +00:00
Hal Finkel	0d252ab8da	[DAGCombine] Remove SIGN_EXTEND-related inf-loop The patch's author points out that, despite the function's documentation, getSetCCResultType is only used to get the SETCC result type (with one here-removed problematic exception). In one case, getSetCCResultType was being used to get the predicate type to use for a SELECT node, and then SIGN_EXTENDing (or truncating) to get the input predicate to match that type. Unfortunately, this was happening inside visitSIGN_EXTEND, and creating new SIGN_EXTEND nodes was causing an infinite loop. In addition, this behavior was wrong if a target was not using ZeroOrNegativeOneBooleanContent. Lastly, the extension/truncation seems unnecessary here: SELECT is defined as: Select(COND, TRUEVAL, FALSEVAL). If the type of the boolean COND is not i1 then the high bits must conform to getBooleanContents. So here we remove this use of getSetCCResultType and update getSetCCResultType's documentation to reflect its actual uses. Patch by deadal nix! llvm-svn: 219141	2014-10-06 20:19:47 +00:00
Sanjay Patel	7477a9a155	Fast-math fold: x / (y * sqrt(z)) -> x * (rsqrt(z) / y) The motivation is to recognize code such as this from /llvm/projects/test-suite/SingleSource/Benchmarks/BenchmarkGame/n-body.c: float distance = sqrt(dx * dx + dy * dy + dz * dz); float mag = dt / (distance * distance * distance); Without this patch, we don't match the sqrt as a reciprocal sqrt, so for PPC the new testcase in this patch produces: addis 3, 2, .LCPI4_2@toc@ha lfs 4, .LCPI4_2@toc@l(3) addis 3, 2, .LCPI4_1@toc@ha lfs 0, .LCPI4_1@toc@l(3) fcmpu 0, 1, 4 beq 0, .LBB4_2 # BB#1: frsqrtes 4, 1 addis 3, 2, .LCPI4_0@toc@ha lfs 5, .LCPI4_0@toc@l(3) fnmsubs 13, 1, 5, 1 fmuls 6, 4, 4 fmadds 1, 13, 6, 5 fmuls 1, 4, 1 fres 4, 1 <--- reciprocal of reciprocal square root fnmsubs 1, 1, 4, 0 fmadds 4, 4, 1, 4 .LBB4_2: fmuls 1, 4, 2 fres 2, 1 fnmsubs 0, 1, 2, 0 fmadds 0, 2, 0, 2 fmuls 1, 3, 0 blr After the patch, this simplifies to: frsqrtes 0, 1 addis 3, 2, .LCPI4_1@toc@ha fres 5, 2 lfs 4, .LCPI4_1@toc@l(3) addis 3, 2, .LCPI4_0@toc@ha lfs 7, .LCPI4_0@toc@l(3) fnmsubs 13, 1, 4, 1 fmuls 6, 0, 0 fnmsubs 2, 2, 5, 7 fmadds 1, 13, 6, 4 fmadds 2, 5, 2, 5 fmuls 0, 0, 1 fmuls 0, 0, 2 fmuls 1, 3, 0 blr Differential Revision: http://reviews.llvm.org/D5628 llvm-svn: 219139	2014-10-06 19:31:18 +00:00
Benjamin Kramer	581892875a	DbgValueHistoryCalculator: Store modified registers in a BitVector instead of std::set. And iterate over the smaller map instead of the larger set first. Reduces the time spent in calculateDbgValueHistory by 30-40%. llvm-svn: 219123	2014-10-06 15:31:04 +00:00
David Blaikie	16833bbcd2	DebugInfo: Sink constructImportedEntityDIE down into DwarfUnit from DwarfDebug. It was just calling a bunch of DwarfUnit functions anyway, as can be seen by the simplification of removing "TheCU" from all the function calls in the implementation. llvm-svn: 219103	2014-10-06 05:37:24 +00:00
Chandler Carruth	a54972693a	[x86, dag] Teach the DAG combiner to prune inputs toa vector_shuffle that are unused. This allows the combiner to delete math feeding shuffles where the math isn't actually necessary. This improves some of the vperm2x128 tests that regressed when the vector shuffle lowering started actually generating vperm instructions rather than forcibly decomposing them. Sadly, this isn't enough to get this really right because we still form a completely unnecessary permutation. To fix that, we also need to fold shuffles which just rearrange concatenated or inserted subvectors. llvm-svn: 219086	2014-10-05 19:14:34 +00:00
David Blaikie	d13e7a4d3b	Remove unused map This became unnecessary/unused in r208636 llvm-svn: 219085	2014-10-05 16:31:13 +00:00
Benjamin Kramer	860521c88b	Make AAMDNodes ctor and operator bool (!!!) explicit, mop up bugs and weirdness exposed by it. llvm-svn: 219068	2014-10-04 22:44:29 +00:00
Benjamin Kramer	7db3ef45b9	Remove unnecessary copying or replace it with moves in a bunch of places. NFC. llvm-svn: 219061	2014-10-04 16:55:56 +00:00
David Blaikie	0eef1c005d	Sink DwarfDebug::updateSubprogramScopeDIE into DwarfCompileUnit This requires exposing some of the current function state from DwarfDebug. I hope there's not too much of that to expose as I go through all the functions, but it still seems nicer to expose singular data down to multiple consumers, than have consumers expose raw mapping data structures up to DwarfDebug for building subprograms. Part of a series of refactoring to allow subprograms in both the skeleton and dwo CUs under Fission. llvm-svn: 219060	2014-10-04 16:24:00 +00:00
David Blaikie	1f6267c40e	Reformatting accidentally left out of r219057 llvm-svn: 219059	2014-10-04 16:00:26 +00:00
David Blaikie	c691131ef4	Sink DwarfDebug::attachLowHighPC into DwarfCompileUnit One of many things to sink down into DwarfCompileUnit to allow handling of subprograms in both the skeleton and dwo CU under Fission. llvm-svn: 219058	2014-10-04 15:58:47 +00:00
David Blaikie	7bf1cb9a67	Move DwarfCompileUnit from DwarfUnit.h to its own header (DwarfCompileUnit.h) In preparation for sinking all the subprogram emission code down from DwarfDebug into DwarfCompileUnit, this will avoid bloating DwarfUnit.h/cpp greatly and make concerns a bit more clear/isolated. (sinking this handling down is part of the work to handle emitting minimal subprograms for -gmlt-like data into the skeleton CU under fission) llvm-svn: 219057	2014-10-04 15:49:50 +00:00
Duncan P. N. Exon Smith	c1be4794ba	Revert "Revert "DI: Fold constant arguments into a single MDString"" This reverts commit r218918, effectively reapplying r218914 after fixing an Ocaml bindings test and an Asan crash. The root cause of the latter was a tightened-up check in `DILexicalBlock::Verify()`, so I'll file a PR to investigate who requires the loose check (and why). Original commit message follows. -- This patch addresses the first stage of PR17891 by folding constant arguments together into a single MDString. Integers are stringified and a `\0` character is used as a separator. Part of PR17891. Note: I've attached my testcases upgrade scripts to the PR. If I've just broken your out-of-tree testcases, they might help. llvm-svn: 219010	2014-10-03 20:01:09 +00:00
Adam Nemet	3fe531df7b	[ISel] Keep matching state consistent when folding during X86 address match In the X86 backend, matching an address is initiated by the 'addr' complex pattern and its friends. During this process we may reassociate and-of-shift into shift-of-and (FoldMaskedShiftToScaledMask) to allow folding of the shift into the scale of the address. However as demonstrated by the testcase, this can trigger CSE of not only the shift and the AND which the code is prepared for but also the underlying load node. In the testcase this node is sitting in the RecordedNode and MatchScope data structures of the matcher and becomes a deleted node upon CSE. Returning from the complex pattern function, we try to access it again hitting an assert because the node is no longer a load even though this was checked before. Now obviously changing the DAG this late is bending the rules but I think it makes sense somewhat. Outside of addresses we prefer and-of-shift because it may lead to smaller immediates (FoldMaskAndShiftToScale is an even better example because it create a non-canonical node). We currently don't recognize addresses during DAGCombiner where arguably this canonicalization should be performed. On the other hand, having this in the matcher allows us to cover all the cases where an address can be used in an instruction. I've also talked a little bit to Dan Gohman on llvm-dev who added the RAUW for the new shift node in FoldMaskedShiftToScaledMask. This RAUW is responsible for initiating the recursive CSE on users (http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-September/076903.html) but it is not strictly necessary since the shift is hooked into the visited user. Of course it's safer to keep the DAG consistent at all times (e.g. for accurate number of uses, etc.). So rather than changing the fundamentals, I've decided to continue along the previous patches and detect the CSE. This patch installs a very targeted DAGUpdateListener for the duration of a complex-pattern match and updates the matching state accordingly. (Previous patches used HandleSDNode to detect the CSE but that's not practical here). The listener is only installed on X86. I tested that there is no measurable overhead due to this while running through the spec2k BC files with llc. The only thing we pay for is the creation of the listener. The callback never ever triggers in spec2k since this is a corner case. Fixes rdar://problem/18206171 llvm-svn: 219009	2014-10-03 20:00:34 +00:00
Benjamin Kramer	4c9fb3d669	Eliminate some deep std::vector copies. NFC. llvm-svn: 218999	2014-10-03 18:33:16 +00:00
Renato Golin	aea9ec6761	Revert 202433 - Provide a target override for the latest regalloc heuristic That commit was introduced in order to help investigate a problem in ARM codegen breaking from commit 202304 (Add a limit to the heuristic that register allocates instructions in local order). Recent analisys indicated that the problem no longer exists, so I'm reverting this change. See PR18996. llvm-svn: 218981	2014-10-03 12:20:53 +00:00
Chandler Carruth	48b57382a8	Fix the threshold added in r186434 (a re-apply of r185393) and updaated to be a ManagedStatic in r218163 to not be a global variable written and read to from within the innards of SpillPlacement. This will fix a really scary race condition for anyone that has two copies of LLVM running spill placement concurrently. Yikes! This will also fix a really significant compile time hit that r218163 caused because the spill placement threshold read is actually in the very hot path of this code. The memory fence on each read was showing up as huge compile time regressions when spilling is responsible for most of the compile time. For example, optimizing sanitized code showed over 50% compile time regressions here. =/ llvm-svn: 218921	2014-10-02 22:23:14 +00:00
Duncan P. N. Exon Smith	fb6bcc4eb2	Revert "DI: Fold constant arguments into a single MDString" This reverts commit r218914 while I investigate some bots. llvm-svn: 218918	2014-10-02 22:15:31 +00:00
Duncan P. N. Exon Smith	58b6077a79	DI: Fold constant arguments into a single MDString This patch addresses the first stage of PR17891 by folding constant arguments together into a single MDString. Integers are stringified and a `\0` character is used as a separator. Part of PR17891. Note: I've attached my testcases upgrade scripts to the PR. If I've just broken your out-of-tree testcases, they might help. llvm-svn: 218914	2014-10-02 21:56:57 +00:00
Adrian Prantl	2b1df58ebe	Move the complex address expression out of DIVariable and into an extra argument of the llvm.dbg.declare/llvm.dbg.value intrinsics. Previously, DIVariable was a variable-length field that has an optional reference to a Metadata array consisting of a variable number of complex address expressions. In the case of OpPiece expressions this is wasting a lot of storage in IR, because when an aggregate type is, e.g., SROA'd into all of its n individual members, the IR will contain n copies of the DIVariable, all alike, only differing in the complex address reference at the end. By making the complex address into an extra argument of the dbg.value/dbg.declare intrinsics, all of the pieces can reference the same variable and the complex address expressions can be uniqued across the CU, too. Down the road, this will allow us to move other flags, such as "indirection" out of the DIVariable, too. The new intrinsics look like this: declare void @llvm.dbg.declare(metadata %storage, metadata %var, metadata %expr) declare void @llvm.dbg.value(metadata %storage, i64 %offset, metadata %var, metadata %expr) This patch adds a new LLVM-local tag to DIExpressions, so we can detect and pretty-print DIExpression metadata nodes. What this patch doesn't do: This patch does not touch the "Indirect" field in DIVariable; but moving that into the expression would be a natural next step. http://reviews.llvm.org/D4919 rdar://problem/17994491 Thanks to dblaikie and dexonsmith for reviewing this patch! Note: I accidentally committed a bogus older version of this patch previously. llvm-svn: 218787	2014-10-01 18:55:02 +00:00
Adrian Prantl	0959156fa3	Revert r218778 while investigating buldbot breakage. "Move the complex address expression out of DIVariable and into an extra" llvm-svn: 218782	2014-10-01 18:10:54 +00:00
Adrian Prantl	229943585f	Move the complex address expression out of DIVariable and into an extra argument of the llvm.dbg.declare/llvm.dbg.value intrinsics. Previously, DIVariable was a variable-length field that has an optional reference to a Metadata array consisting of a variable number of complex address expressions. In the case of OpPiece expressions this is wasting a lot of storage in IR, because when an aggregate type is, e.g., SROA'd into all of its n individual members, the IR will contain n copies of the DIVariable, all alike, only differing in the complex address reference at the end. By making the complex address into an extra argument of the dbg.value/dbg.declare intrinsics, all of the pieces can reference the same variable and the complex address expressions can be uniqued across the CU, too. Down the road, this will allow us to move other flags, such as "indirection" out of the DIVariable, too. The new intrinsics look like this: declare void @llvm.dbg.declare(metadata %storage, metadata %var, metadata %expr) declare void @llvm.dbg.value(metadata %storage, i64 %offset, metadata %var, metadata %expr) This patch adds a new LLVM-local tag to DIExpressions, so we can detect and pretty-print DIExpression metadata nodes. What this patch doesn't do: This patch does not touch the "Indirect" field in DIVariable; but moving that into the expression would be a natural next step. http://reviews.llvm.org/D4919 rdar://problem/17994491 Thanks to dblaikie and dexonsmith for reviewing this patch! llvm-svn: 218778	2014-10-01 17:55:39 +00:00
Jingyue Wu	784352be06	Revert r216862 due to a performance regression Reported by Alexey Volkov in PR21115 llvm-svn: 218771	2014-10-01 15:22:13 +00:00
David Blaikie	fe64c97aaa	Implement DW_TAG_subrange_type with DW_AT_count rather than DW_AT_upper_bound This allows proper disambiguation of unbounded arrays and arrays of zero bound ("struct foo { int x[]; };" and "struct foo { int x[0]; }"). GCC instead produces an upper bound of -1 in the latter situation, but count seems tidier. This way lower_bound is provided if it's not the language default and count is provided if the count is known, otherwise it's omitted. Simple. If someone wants to look at rdar://problem/12566646 and see if this change is acceptable to that bug/fix, that might be helpful (see the empty-and-one-elem-array.ll test case which cites that radar). llvm-svn: 218726	2014-10-01 00:56:55 +00:00
David Blaikie	5720297fff	Omit DW_AT_inline under -gmlt to save a little more space. llvm-svn: 218719	2014-09-30 23:29:16 +00:00
David Blaikie	0fda7951bb	DebugInfo: Sink the code emitting DW_AT_APPLE_omit_frame_ptr down to a more common spot. No functional change. Pre-emptive refactoring before I start pushing some of this subprogram creation down into DWARFCompileUnit so I can build different subprograms in the skeleton unit from the dwo unit for adding -gmlt-like data to the skeleton. llvm-svn: 218713	2014-09-30 22:32:49 +00:00
David Blaikie	c3f2904ef4	Disable the -gmlt optimization implemented in r218129 under Darwin due to issues with dsymutil. r218129 omits DW_TAG_subprograms which have no inlined subroutines when emitting -gmlt data. This makes -gmlt very low cost for -O0 builds. Darwin's dsymutil reasonably considers a CU empty if it has no subprograms (which occurs with the above optimization in -O0 programs without any force_inline function calls) and drops the line table, CU, and everything in this situation, making backtraces impossible. Until dsymutil is modified to account for this, disable this optimization on Darwin to preserve the desired functionality. (see r218545, which should be reverted after this patch, for other discussion/details) Footnote: In the long term, it doesn't look like this scheme (of simplified debug info to describe inlining to enable backtracing) is tenable, it is far too size inefficient for optimized code (the DW_TAG_inlined_subprograms, even once compressed, are nearly twice as large as the line table itself (also compressed)) and we'll be considering things like Cary's two level line table proposal to encode all this information directly in the line table. llvm-svn: 218702	2014-09-30 21:28:32 +00:00
Sanjay Patel	abcb3acee5	Use the target-specified iteration count to opt out of any further refinement of an estimate. NFC. llvm-svn: 218700	2014-09-30 20:44:23 +00:00
Sanjay Patel	c095454ca2	Split the estimate() interface into separate functions for each type. NFC. It was hacky to use an opcode as a switch because it won't always match (rsqrte != sqrte), and it looks like we'll need to add more special casing per arch than I had hoped for. Eg, x86 will prefer a different NR estimate implementation. ARM will want to use it's 'step' instructions. There also don't appear to be any new estimate instructions in any arch in a long, long time. Altivec vloge and vexpte may have been the first and last in that field... llvm-svn: 218698	2014-09-30 20:28:48 +00:00
Andrea Di Biagio	a5f619dff6	[DAG] Check in advance if a build_vector has a legal type before attempting to convert it into a shuffle. Currently, the DAG Combiner only tries to convert type-legal build_vector nodes into shuffles. This patch simply moves the logic that checks if a build_vector has a legal value type up before we even start analyzing the operands. This allows to early exit immediately from method 'visitBUILD_VECTOR' if the node type is known to be illegal. No functional change intended. llvm-svn: 218677	2014-09-30 15:30:22 +00:00
Matt Arsenault	23f3740054	Add MachineOperand::ChangeToFPImmediate and setFPImm llvm-svn: 218579	2014-09-28 19:24:59 +00:00
James Molloy	3c39c7c1b4	[AArch64] Redundant store instructions should be removed as dead code If there is a store followed by a store with the same value to the same location, then the store is dead/noop. It can be removed. This problem is found in spec2006-197.parser. For example, stur w10, [x11, #-4] stur w10, [x11, #-4] Then one of the two stur instructions can be removed. Patch by David Xu! llvm-svn: 218569	2014-09-27 17:02:54 +00:00
Sanjay Patel	98a98574c5	Refactor reciprocal and reciprocal square root estimate into target-independent functions (part 2). This is purely refactoring. No functional changes intended. PowerPC is the only target that is currently using this interface. The ultimate goal is to allow targets other than PowerPC (certainly X86 and Aarch64) to turn this: z = y / sqrt(x) into: z = y * rsqrte(x) And: z = y / x into: z = y * rcpe(x) using whatever HW magic they can use. See http://llvm.org/bugs/show_bug.cgi?id=20900 . There is one hook in TargetLowering to get the target-specific opcode for an estimate instruction along with the number of refinement steps needed to make the estimate usable. Differential Revision: http://reviews.llvm.org/D5484 llvm-svn: 218553	2014-09-26 23:01:47 +00:00
David Xu	c351ac640d	Revert patch ofr218493 llvm-svn: 218494	2014-09-26 02:28:03 +00:00
David Xu	43a5d5bdc1	Redundant store instructions should be removed as dead code llvm-svn: 218493	2014-09-26 02:02:09 +00:00
Eric Christopher	3b65e1ff31	Move resetTargetOptions from taking a MachineFunction to a Function since we are accessing the TargetMachine that we're a member function of. llvm-svn: 218489	2014-09-26 01:28:10 +00:00
Bruno Cardoso Lopes	d68585f076	[MachineSink+PGO] Teach MachineSink to use BlockFrequencyInfo Machine Sink uses loop depth information to select between successors BBs to sink machine instructions into, where BBs within smaller loop depths are preferable. This patch adds support for choosing between successors by using profile information from BlockFrequencyInfo instead, whenever the information is available. Tested it under SPEC2006 train (average of 30 runs for each program); ~1.5% execution speedup in average on x86-64 darwin. <rdar://problem/18021659> llvm-svn: 218472	2014-09-25 23:14:26 +00:00
Tom Stellard	fbc414f7ff	SelectionDAG: Remove #if NDEBUG from check for a post-isel hook The InstrEmitter will skip the check of MI.hasPostISelHook() before calling AdjustInstrPostInstrSelection() when NDEBUG is not defined. This was added in r140228, and I'm not sure if it is intentional or not, but it is a likely source for bugs, because it means with Release+Asserts builds you can forget to set the hasPostISelHook flag on TableGen definitions and AdjustInstrPostInstrSelection() will still be called. llvm-svn: 218458	2014-09-25 18:59:22 +00:00
Robin Morisset	98b0bed638	Lower idempotent RMWs to fence+load Summary: I originally tried doing this specifically for X86 in the backend in D5091, but it was rather brittle and generally running too late to be general. Furthermore, other targets may want to implement similar optimizations. So I reimplemented it at the IR-level, fitting it into AtomicExpandPass as it interacts with that pass (which could not be cleanly done before at the backend level). This optimization relies on a new target hook, which is only used by X86 for now, as the correctness of the optimization on other targets remains an open question. If it is found correct on other targets, it should be trivial to enable for them. Details of the optimization are discussed in D5091. Test Plan: make check-all + a new test Reviewers: jfb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5422 llvm-svn: 218455	2014-09-25 17:27:43 +00:00
Jiangning Liu	92ab0f5880	Clear PreferredExtendType for in each function-specific state FunctionLoweringInfo. llvm-svn: 218364	2014-09-24 03:22:56 +00:00
Robin Morisset	1d079fe807	[X86] Make wide loads be managed by AtomicExpand Summary: AtomicExpand already had logic for expanding wide loads and stores on LL/SC architectures, and for expanding wide stores on CmpXchg architectures, but not for wide loads on CmpXchg architectures. This patch fills this hole, and makes use of this new feature in the X86 backend. Only one functionnal change: we now lose the SynchScope attribute. It is regrettable, but I have another patch that I will submit soon that will solve this for all of AtomicExpand (it seemed better to split it apart as it is a different concern). Test Plan: make check-all (lots of tests for this functionality already exist) Reviewers: jfb Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5404 llvm-svn: 218332	2014-09-23 20:59:25 +00:00
Robin Morisset	a7da34c778	Add AtomicExpandPass::bracketInstWithFences, and use it whenever getInsertFencesForAtomic would trigger in SelectionDAGBuilder Summary: The goal is to eventually remove all the code related to getInsertFencesForAtomic in SelectionDAGBuilder as it is wrong (designed for ARM, not really portable, works mostly by accident because the backends are overly conservative), and repeats the same logic that goes in emitLeading/TrailingFence. In this patch, I make AtomicExpandPass insert the fences as it knows better where to put them. Because this requires getting the fences and not just passing an IRBuilder around, I had to change the return type of emitLeading/TrailingFence. This code only triggers on ARM for now. Because it is earlier in the pipeline than SelectionDAGBuilder, it triggers and lowers atomic accesses to atomic so SelectionDAGBuilder does not add barriers anymore on ARM. If this patch is accepted I plan to implement emitLeading/TrailingFence for all backends that setInsertFencesForAtomic(true), which will allow both making them less conservative and simplifying SelectionDAGBuilder once they are all using this interface. This should not cause any functionnal change so the existing tests are used and not modified. Test Plan: make check-all, benefits from existing tests of atomics on ARM Reviewers: jfb, t.p.northover Subscribers: aemerson, llvm-commits Differential Revision: http://reviews.llvm.org/D5179 llvm-svn: 218329	2014-09-23 20:31:14 +00:00
Lang Hames	f3ae47a1c7	[MCJIT] Nuke MachineRelocation and MachineCodeEmitter. Now that the old JIT is gone they're no longer needed. llvm-svn: 218320	2014-09-23 18:08:47 +00:00
Sanjay Patel	14a8712c3d	Use SDValue bool operator to reduce code. No functional change. llvm-svn: 218314	2014-09-23 16:24:20 +00:00
David Majnemer	c7ddf568e6	MC: ReadOnlyWithRel section kinds should map to rdata in COFF Don't consider ReadOnlyWithRel as a writable section in COFF, they really belong in .rdata. llvm-svn: 218268	2014-09-22 20:39:23 +00:00
Sanjay Patel	1235f854b3	Refactor reciprocal square root estimate into target-independent function; NFC. This is purely a plumbing patch. No functional changes intended. The ultimate goal is to allow targets other than PowerPC (certainly X86 and Aarch64) to turn this: z = y / sqrt(x) into: z = y * rsqrte(x) using whatever HW magic they can use. See http://llvm.org/bugs/show_bug.cgi?id=20900 . The first step is to add a target hook for RSQRTE, take the already target-independent code selfishly hoarded by PPC, and put it into DAGCombiner. Next steps: The code in DAGCombiner::BuildRSQRTE() should be refactored further; tests that exercise that logic need to be added. Logic in PPCTargetLowering::BuildRSQRTE() should be hoisted into DAGCombiner. X86 and AArch64 overrides for TargetLowering.BuildRSQRTE() should be added. Differential Revision: http://reviews.llvm.org/D5425 llvm-svn: 218219	2014-09-21 15:19:15 +00:00
Sanjay Patel	6ce3689f46	mop up: "Don’t duplicate function or class name at the beginning of the comment." llvm-svn: 218218	2014-09-21 14:48:16 +00:00
Sanjay Patel	48e944fd89	mop up: "Don’t duplicate function or class name at the beginning of the comment." llvm-svn: 218194	2014-09-20 22:39:16 +00:00
David Majnemer	8ffc0f9fcb	MC: Treat ReadOnlyWithRel and ReadOnlyWithRelLocal as ReadOnly for COFF A problem with our old behavior becomes observable under x86-64 COFF when we need a read-only GV which has an initializer which is referenced using a relocation: we would mark the section as writable. Marking the section as writable interferes with section merging. This fixes PR21009. llvm-svn: 218179	2014-09-20 07:31:46 +00:00
Peter Collingbourne	203d8801c7	Fix crash with an insertvalue that produces an empty object. llvm-svn: 218171	2014-09-20 00:10:47 +00:00
Chris Bieneman	a553d90b36	Converting SpillPlacement's BlockFrequency threshold to a ManagedStatic to avoid static constructors and destructors. llvm-svn: 218163	2014-09-19 22:46:28 +00:00
David Blaikie	ab086ab341	Omit DW_TAG_subprograms for subprograms without inlined subroutines when producing -gmlt data To reduce the size of -gmlt data, skip the subprograms without any inlined subroutines. Since we've now got the ability to make these determinations in the backend (funnily enough - we added the flag so we wouldn't produce ranges under -gmlt, but with this change we use the flag, but go back to producing ranges under -gmlt). Instead, just produce CU ranges to inform the consumer which parts of the code are described by this CU's line table. Tools could inspect the line table directly to compute the range, but the CU ranges only seem to be about 0.5% of object/executable size, so I'm not too worried about teaching llvm-symbolizer that trick just yet - it's certainly a possible piece of future work. Update an llvm-symbolizer test just to demonstrate that this schema is acceptable there (if it wasn't, the compiler-rt tests would catch this, but good to have an in-llvm-tree test for llvm-symbolizer's behavior here) Building the clang binary with -gmlt with this patch reduces the total size of object files by 5.1% (5.56% without ranges) without compression and the executable by 4.37% (4.75% without ranges). llvm-svn: 218129	2014-09-19 17:03:16 +00:00
Frederic Riss	f436abc737	Change DwarfCompileUnit::createGlobalVariable to getOrCreateGlobalVariable. Summary: This will allow to request the creation of a forward delacred variable at is point of use (for imported declarations, this will be DwarfDebug::constructImportedEntityDIE) rather than having to put the forward decl in a retention list. Note that getOrCreateGlobalVariable returns the actual definition DIE when the routine creates a declaration and a definition DIE. If you agree this is the right behavior, then I'll have a followup patch that registers the definition in the DIE map instead of the declaration as it is today (this 'breaks' only one test, where we test that the imported entity is the declaration). I'm not sure what's best here, but it's easy enough for a consumer to follow the DW_AT_specification link to get to the declaration, whereas it takes more work to find the actual definition from a declaration DIE. Reviewers: echristo, dblaikie, aprantl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5381 llvm-svn: 218126	2014-09-19 15:12:03 +00:00
Hal Finkel	0c0c256ad7	Optionally enable more-aggressive FMA formation in DAGCombine The heuristic used by DAGCombine to form FMAs checks that the FMUL has only one use, but this is overly-conservative on some systems. Specifically, if the FMA and the FADD have the same latency (and the FMA does not compete for resources with the FMUL any more than the FADD does), there is no need for the restriction, and furthermore, forming the FMA leaving the FMUL can still allow for higher overall throughput and decreased critical-path length. Here we add a new TLI callback, enableAggressiveFMAFusion, false by default, to elide the hasOneUse check. This is enabled for PowerPC by default, as most PowerPC systems will benefit. Patch by Olivier Sallenave, thanks! llvm-svn: 218120	2014-09-19 11:42:56 +00:00
Jiangning Liu	9dd58d584c	Optimize sext/zext insertion algorithm in back-end. With this optimization, we will not always insert zext for values crossing basic blocks, but insert sext if the users of a value crossing basic block has preference of sign predicate. llvm-svn: 218101	2014-09-19 05:30:35 +00:00
David Blaikie	8a66037575	Omit DW_AT_frame_base under -gmlt for size llvm-svn: 218100	2014-09-19 04:55:05 +00:00
David Blaikie	1a16b76ce1	Describe the -gmlt optimization committed in the previous revision. llvm-svn: 218099	2014-09-19 04:47:46 +00:00
David Blaikie	5019d606b7	Omit all the extra static attributes on subprograms in -gmlt This omission will be done in a fancier manner once we're dealing with "put gmlt in the skeleton CUs under fission" - it'll have to be conditional on the kind of CU we're emitting into (skeleton or gmlt). llvm-svn: 218098	2014-09-19 04:30:36 +00:00
Hans Wennborg	0eae0ac98d	Fix an it's vs. its typo. llvm-svn: 218093	2014-09-19 01:14:56 +00:00
Frederic Riss	b1025475ae	Revert part of r218041. The patch moved some logic around in an attempt to generate potentially more DW_AT_declaration attributes. The patch was flawed though and it stopped generating the attribute in some cases. llvm-svn: 218060	2014-09-18 16:41:04 +00:00
Frederic Riss	79b1a240fa	Always emit DW_AT_declaration attribute when the variable isn't a definition. Summary: This doesn't show up today as we don't emit decalration only variables. This will be tested when the followup patches implementing import of forward declared entities lands in clang. Reviewers: echristo, dblaikie, aprantl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5382 llvm-svn: 218041	2014-09-18 09:38:23 +00:00
Eric Christopher	c75fbbac7c	Add a new pass FunctionTargetTransformInfo. This pass serves as a shim between the TargetTransformInfo immutable pass and the Subtarget via the TargetMachine and Function. Migrate a single call from BasicTargetTransformInfo as an example and provide shims where TargetMachine begins taking a Function to determine the subtarget. No functional change. llvm-svn: 218004	2014-09-18 00:34:14 +00:00
Robin Morisset	4c9d292205	[X86] Use the generic AtomicExpandPass instead of X86AtomicExpandPass This required a new hook called hasLoadLinkedStoreConditional to know whether to expand atomics to LL/SC (ARM, AArch64, in a future patch Power) or to CmpXchg (X86). Apart from that, the new code in AtomicExpandPass is mostly moved from X86AtomicExpandPass. The main result of this patch is to get rid of that pass, which had lots of code duplicated with AtomicExpandPass. llvm-svn: 217928	2014-09-17 00:06:58 +00:00
Quentin Colombet	324286d9e6	[CodeGenPrepare][AddressingModeMatcher] The promotion mechanism was expecting instructions when truncate, sext, or zext were created. Fix that. llvm-svn: 217926	2014-09-16 22:36:07 +00:00
Owen Anderson	dded4a7289	Add back a fallback case for targets that do not or cannot implement getNoopForMachoTarget(). llvm-svn: 217899	2014-09-16 20:28:00 +00:00
Hal Finkel	261b9637c7	Fix BasicTTI::getCmpSelInstrCost to deal with illegal vector types The default implementation of getCmpSelInstrCost, which provides the cost of icmp/fcmp/select instructions, did not deal sensibly with illegal vector types that were scalarized. We'd ask for the legalization cost of the vector type, which would return something like (4, f64) given an input of <4 x double>, and we'd then check the TLI status of the ISD opcode on that scalar type. This would result in querying (ISD::VSELECT, f64), for example. Amusingly enough, ISD::VSELECT on scalar types is marked as Legal by default (as with most other operations), and most backends never change this because VSELECT is never generated on scalars. However, seeing the resulting operation as Legal, we'd neglect to add the scalarization cost before returning. The result is that we'd grossly under-estimate the cost of cmps/selects on illegal vector types. Now, if type legalization clearly results in scalarization, we skip the early return and add the scalarization cost. llvm-svn: 217859	2014-09-16 04:35:50 +00:00
David Blaikie	028baef2fb	DebugInfo: Add comment describing the need to disable address pool usage in skeleton units. Post commit review from Eric Christopher. llvm-svn: 217842	2014-09-15 22:41:25 +00:00
Sanjay Patel	e7b8ce1029	Replace repeated null checks with an assert. NFC. Without a vector to hold the created ops, these functions don't have any use. llvm-svn: 217831	2014-09-15 21:52:51 +00:00
Juergen Ributzka	0a4f4becc3	[FastISel] Move optimizeCmpPredicate to FastISel base class. NFC. Make the optimizeCmpPredicate function available to all targets. llvm-svn: 217822	2014-09-15 20:47:13 +00:00
Sanjay Patel	7c6f056447	Replace dead links to "Hacker's Delight" with general references. NFC. llvm-svn: 217814	2014-09-15 19:47:44 +00:00
Rafael Espindola	6e5ce4f5db	Fix a lot of confusion around inserting nops on empty functions. On MachO, and MachO only, we cannot have a truly empty function since that breaks the linker logic for atomizing the section. When we are emitting a frame pointer, the presence of an unreachable will create a cfi instruction pointing past the last instruction. This is perfectly fine. The FDE information encodes the pc range it applies to. If some tool cannot handle this, we should explicitly say which bug we are working around and only work around it when it is actually relevant (not for ELF for example). Given the unreachable we could omit the .cfi_def_cfa_register, but then again, we could also omit the entire function prologue if we wanted to. llvm-svn: 217801	2014-09-15 18:32:58 +00:00
Quentin Colombet	798b42868c	[CodeGenPrepare][AddressingModeMatcher] Fix a think-o for the sext(zext) -> zext promotion introduced in r217629. We were returning the old sext instead of the new zext as the promoted instruction! Thanks Joerg Sonnenberger for the test case. llvm-svn: 217800	2014-09-15 18:26:58 +00:00
Yaron Keren	5f0401446e	In DwarfEHPrepare, after all passes are run, RewindFunction may be a dangling pointer to a dead function. To make sure it's valid, doFinalization nullptrs RewindFunction just like the constructor and so it will be found on next run. llvm-svn: 217737	2014-09-14 20:36:28 +00:00
Owen Anderson	6f86c0daa2	Allow targets to custom legalize vector insertion and extraction. llvm-svn: 217711	2014-09-12 22:16:11 +00:00
Owen Anderson	683e1e4686	Remove an unnecessary restriction. MIsNeedChainEdge() should be checked even when scheduler AliasAnalysis is not enabled. A good chunk of the MIsNeedChainEdge() is logic that is valid and should be applied even for targets that are not using for alias analysis. llvm-svn: 217706	2014-09-12 21:17:55 +00:00
Benjamin Kramer	ed321129a0	Legalizer: Use the scalar bit width when promoting bit counting instrs on vectors. e.g. when promoting ctlz from <2 x i32> to <2 x i64> we have to fixup the result by 32 bits, not 64. PR20917. llvm-svn: 217671	2014-09-12 12:50:27 +00:00
Quentin Colombet	74d3a27ed8	[CodeGenPrepare] Teach the addressing mode matcher how to promote zext. I.e., teach it about 'sext (zext a to ty) to ty2' => zext a to ty2. llvm-svn: 217629	2014-09-11 21:22:14 +00:00
David Blaikie	e260499363	Remove the unused string section symbol parameter from DwarfFile::emitStrings And since it /looked/ like the DwarfStrSectionSym was unused, I tried removing it - but then it turned out that DwarfStringPool was reconstructing the same label (and expecting it to have already been emitted) and uses that. So I kept it around, but wanted to pass it in to users - since it seemed a bit silly for DwarfStringPool to have it passed in and returned but itself have no use for it. The only two users don't handle strings in both .dwo and .o files so they only ever need the one symbol - no need to keep it (and have an unused symbol) in the DwarfStringPool used for fission/.dwo. Refactor a bunch of accelerator table usage to remove duplication so I didn't have to touch 4-5 callers. llvm-svn: 217628	2014-09-11 21:12:48 +00:00
Matt Arsenault	dbc82e3483	Add DAG combine for shl + add of constants. Do (shl (add x, c1), c2) -> (add (shl x, c2), c1 << c2) This is already done for multiplies, but since multiplies by powers of two are turned into shifts, we also need to handle it here. This might want checks for isLegalAddImmediate to avoid transforming an add of a legal immediate with one that isn't. llvm-svn: 217610	2014-09-11 17:34:19 +00:00
Sanjay Patel	099c1958cc	Combine fmul vector FP constants when unsafe math is allowed. This is an extension of the change made with r215820: http://llvm.org/viewvc/llvm-project?view=revision&revision=215820 That patch allowed combining of splatted vector FP constants that are multiplied. This patch allows combining non-uniform vector FP constants too by relaxing the check on the type of vector. Also, canonicalize a vector fmul in the same way that we already do for scalars - if only one operand of the fmul is a constant, make it operand 1. Otherwise, we miss potential folds. This fold is also done by -instcombine, but it's possible that extra fmuls may have been generated during lowering. Differential Revision: http://reviews.llvm.org/D5254 llvm-svn: 217599	2014-09-11 15:45:27 +00:00
David Xu	4d7f013423	Build correct vector filled with undef nodes llvm-svn: 217570	2014-09-11 05:10:28 +00:00
Adrian Prantl	47d8607b25	Cleanup: Use the appropriate API for accessing the DIVariable of a DBG_VALUE intrinsic. llvm-svn: 217533	2014-09-10 18:52:29 +00:00
Sanjay Patel	8030ed3639	Rename getMaximumUnrollFactor -> getMaxInterleaveFactor; also rename option names controlling this variable. "Unroll" is not the appropriate name for this variable. Clang already uses the term "interleave" in pragmas and metadata for this. Differential Revision: http://reviews.llvm.org/D5066 llvm-svn: 217528	2014-09-10 17:58:16 +00:00
Yuri Gorshenin	a4d343a3a2	[asan-assembly-instrumentation] Added CFI directives to the generated instrumentation code. Summary: [asan-assembly-instrumentation] Added CFI directives to the generated instrumentation code. Reviewers: eugenis Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5189 llvm-svn: 217482	2014-09-10 09:45:49 +00:00
David Blaikie	b8169f9e8f	Sink PrevCU updating into DwarfUnit::addRange to ensure consistency So that the two operations in DwarfDebug couldn't get separated (because I accidentally separated them in some work in progress), put them together. While we're here, move DwarfUnit::addRange to DwarfCompileUnit, since it's not relevant to type units. llvm-svn: 217468	2014-09-09 23:13:01 +00:00
David Blaikie	2d0c5d6e9a	Remove DwarfDebug::PrevSection, PrevCU is sufficient for handling address range holes. PrevSection/PrevCU are used to detect holes in the address range of a CU to ensure the DW_AT_ranges does not include those holes. When we see a function with no debug info, though it may be in the same range as the prior and subsequent functions, there should be a gap in the CU's ranges. By setting PrevCU to null in that case, the range would not be extended to cover the gap. llvm-svn: 217466	2014-09-09 22:56:36 +00:00
Patrik Hagglund	7b94ce2090	[MachineSinking] Conservatively clear kill flags after coalescing. This solves the problem of having a kill flag inside a loop with a definition of the register prior to the loop: %vreg368<def> ... Inside loop: %vreg520<def> = COPY %vreg368 %vreg568<def,tied1> = add %vreg341<tied0>, %vreg520<kill> => was coalesced into => %vreg568<def,tied1> = add %vreg341<tied0>, %vreg368<kill> MachineVerifier then complained: * Bad machine code: Virtual register killed in block, but needed live out. * The kill flag for %vreg368 is incorrect, and is cleared by this patch. This is similar to the clearing done at the end of MachineSinking::SinkInstruction(). Patch provided by Jonas Paulsson. Reviewed by Quentin Colombet and Juergen Ributzka. llvm-svn: 217427	2014-09-09 07:47:00 +00:00
Hans Wennborg	6ebf2a8b60	Fast-ISel: Remove dead code after falling back from selecting call instructions (PR20863) Previously, fast-isel would not clean up after failing to select a call instruction, because it would have called flushLocalValueMap() which moves the insertion point, making SavedInsertPt in selectInstruction() invalid. Fixing this by making SavedInsertPt a member variable, and having flushLocalValueMap() update it. This removes some redundant code at -O0, and more importantly fixes PR20863. Differential Revision: http://reviews.llvm.org/D5249 llvm-svn: 217401	2014-09-08 20:24:10 +00:00
Sanjay Patel	b6481eb090	Group unsafe fmul math folds together for easier reading. No functional change. llvm-svn: 217399	2014-09-08 20:16:42 +00:00
Sanjay Patel	3803a37086	Fix the FIXME that was just added in r217390 - remove a bunch of redundant fold permutations. The testcases for these folds already exist in test/CodeGen/X86/fp-fast.ll. llvm-svn: 217393	2014-09-08 18:22:51 +00:00
Sanjay Patel	510f5b3b43	group unsafe math folds together for easier reading Also added a FIXME regarding redundant folds for non-canonicalized constants. llvm-svn: 217390	2014-09-08 17:32:19 +00:00
Chad Rosier	7f50169e13	[AArch64] Improve AA to remove unneeded edges in the AA MI scheduling graph. Patch by Sanjin Sijaric <ssijaric@codeaurora.org>! Phabricator Review: http://reviews.llvm.org/D5103 llvm-svn: 217371	2014-09-08 14:43:48 +00:00
David Blaikie	9f14661b4c	DebugInfo: Do not use DW_FORM_GNU_addr_index in skeleton CUs, GDB 7.8 errors on this. It's probably not a huge deal to not do this - if we could, maybe the address could be reused by a subprogram low_pc and avoid an extra relocation, but it's just one per CU at best. llvm-svn: 217338	2014-09-07 17:31:42 +00:00
Sanjay Patel	5c895f97e3	Allow vector fsub ops with constants to get the same optimizations as scalars. This problem is bigger than just fsub, but this is the minimum fix to solve fneg for PR20556 ( http://llvm.org/bugs/show_bug.cgi?id=20556 ), and we solve zero subtraction with the same change. llvm-svn: 217286	2014-09-05 22:26:22 +00:00
Sanjay Patel	0fe727a669	clean up; NFC llvm-svn: 217278	2014-09-05 20:55:46 +00:00
Rafael Espindola	b6092ac3d7	Revert "Disable the fix for pr20793 because of a gnu ld bug." This reverts commit r217211. Both the bfd ld and gold outputs were valid. They were using a Rela relocation, so the value present in the relocated location was not used, which caused me to misread the output. llvm-svn: 217264	2014-09-05 18:03:38 +00:00
Adrian Prantl	6118051e4d	Set the parent pointer of cloned DBG_VALUE instructions correctly. Fixes PR20523. When spilling variables onto the stack, spillVirtReg() is setting the parent pointer of the cloned DBG_VALUE intrinsic for the stack location to the parent pointer of the original intrinsic. MachineInstr parent pointers should however always point to the parent basic block. MBB is shadowing the MBB member variable. The instruction still ends up being inserted into the right basic block, because it's inserted after MI which serves as the iterator. I failed at constructing a reliable testcase for this, see http://llvm.org/bugs/show_bug.cgi?id=20523 for a large testcases. llvm-svn: 217260	2014-09-05 17:10:10 +00:00
Rafael Espindola	052b6801ba	Disable the fix for pr20793 because of a gnu ld bug. llvm-svn: 217211	2014-09-05 00:14:12 +00:00
Rafael Espindola	a2044bee3f	Refactor to avoid code duplication. NFC. llvm-svn: 217207	2014-09-05 00:02:50 +00:00
Rafael Espindola	6ac7414624	Fix pr20793. With this patch the third field of llvm.global_ctors is also used on ELF. llvm-svn: 217202	2014-09-04 23:03:58 +00:00
Reid Kleckner	b2519b0989	MC Win64: Put unwind info for COMDAT code into the same COMDAT group Summary: This fixes a long standing issue where we would emit many little .text sections and only one .pdata and .xdata section. Now we generate one .pdata / .xdata pair per .text section and associate them correctly. Fixes PR19667. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D5181 llvm-svn: 217176	2014-09-04 17:42:03 +00:00
Juergen Ributzka	f33094f142	Revert r216803 "[MachineSinking] Clear kill flag of all operands at all their uses." This reverts commit r216803, because it might have broken the buildbot. The issue is tracked in PR20842. llvm-svn: 217120	2014-09-04 02:07:36 +00:00
Robin Morisset	c2d1634d13	Refactor AtomicExpandPass and add a generic isAtomic() method to Instruction Summary: Split shouldExpandAtomicInIR() into different versions for Stores/Loads/RMWs/CmpXchgs. Makes runOnFunction cleaner (no more redundant checking/casting), and will help moving the X86 backend to this pass. This requires a way of easily detecting which instructions are atomic. I followed the pattern of mayReadFromMemory, mayWriteOrReadMemory, etc.. in making isAtomic() a method of Instruction implemented by a switch on the opcodes. Test Plan: make check Reviewers: jfb Subscribers: mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D5035 llvm-svn: 217080	2014-09-03 21:29:59 +00:00
Robin Morisset	1932ecdd2a	Use target-dependent emitLeading/TrailingFence instead of the target-independent insertLeading/TrailingFence (in AtomicExpandPass) Fixes two latent bugs: - There was no fence inserted before expanded seq_cst load (unsound on Power) - There was only a fence release before seq_cst stores (again unsound, in particular on Power) It is not even clear if this is correct on ARM swift processors (where release fences are DMB ishst instead of DMB ish). This behaviour is currently preserved on ARM Swift as it is not clear whether it is incorrect. I would love to get documentation stating whether it is correct or not. These two bugs were not triggered because Power is not (yet) using this pass, and these behaviours happen to be (mostly?) working on ARM (although they completely butchered the semantics of the llvm IR). See: http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-August/075821.html for an example of the problems that can be caused by the second of these bugs. I couldn't see a way of fixing these in a completely target-independent way without adding lots of unnecessary fences on ARM, hence the target-dependent parts of this patch. This patch implements the new target-dependent parts only for ARM (the default of not doing anything is enough for AArch64), other architectures will use this infrastructure in later patches. llvm-svn: 217076	2014-09-03 21:01:03 +00:00
Juergen Ributzka	76dd2e3da7	[FastISel][tblgen] Rename tblgen generated FastISel functions. NFC. This is the final round of renaming. This changes tblgen to emit lower-case function names for FastEmitInst_* and FastEmit_*, and updates all its uses in the source code. Reviewed by Eric llvm-svn: 217075	2014-09-03 20:56:59 +00:00
Juergen Ributzka	fa7bc008ce	[FastISel] Rename public visible FastISel functions. NFC. This commit renames the following public FastISel functions: LowerArguments -> lowerArguments SelectInstruction -> selectInstruction TargetSelectInstruction -> fastSelectInstruction FastLowerArguments -> fastLowerArguments FastLowerCall -> fastLowerCall FastLowerIntrinsicCall -> fastLowerIntrinsicCall FastEmitZExtFromI1 -> fastEmitZExtFromI1 FastEmitBranch -> fastEmitBranch UpdateValueMap -> updateValueMap TargetMaterializeConstant -> fastMaterializeConstant TargetMaterializeAlloca -> fastMaterializeAlloca TargetMaterializeFloatZero -> fastMaterializeFloatZero LowerCallTo -> lowerCallTo Reviewed by Eric llvm-svn: 217074	2014-09-03 20:56:52 +00:00
Eric Christopher	e1f21228eb	Remove resetSubtargetFeatures as it is unused. llvm-svn: 217071	2014-09-03 20:36:31 +00:00
Juergen Ributzka	89b8a87a22	[FastISel] Some long overdue spring cleaning of FastISel. Things got a little bit messy over the years and it is time for a little bit spring cleaning. This first commit is focused on the FastISel base class itself. It doxyfies all comments, C++11fies the code where it makes sense, renames internal methods to adhere to the coding standard, and clang-formats the files. Reviewed by Eric llvm-svn: 217060	2014-09-03 18:46:45 +00:00
Eric Christopher	2f6f860aaa	Reinstate "Nuke the old JIT." Approved by Jim Grosbach, Lang Hames, Rafael Espindola. This reinstates commits r215111, 215115, 215116, 215117, 215136. llvm-svn: 216982	2014-09-02 22:28:02 +00:00
Hal Finkel	7cce13e023	Add pass-manager flags to use CFL AA Add -use-cfl-aa (and -use-cfl-aa-in-codegen) to add CFL AA in the default pass managers (for easy testing). llvm-svn: 216978	2014-09-02 22:12:54 +00:00
Juergen Ributzka	254fee990d	[FastISel] Provide the option to skip target-independent instruction selection. NFC. This allows the target to disable target-independent instruction selection and jump directly into the target-dependent instruction selection code. This can be beneficial for targets, such as AArch64, which could emit much better code, but never got a chance to do so, because the target-independent instruction selector was able to find an instruction sequence. llvm-svn: 216947	2014-09-02 21:07:44 +00:00
Matt Arsenault	6635d39450	Fix interference caused by fmul 2, x -> fadd x, x If an fmul was introduced by lowering, it wouldn't be folded into a multiply by a constant since the earlier combine would have replaced the fmul with the fadd. llvm-svn: 216932	2014-09-02 19:02:53 +00:00
Reid Kleckner	1af94055d7	CodeGen: Handle va_start in the entry block Also fix a small copy-paste bug in X86ISelLowering where Chain should have been used in place of DAG.getEntryToken(). Fixes PR20828. llvm-svn: 216929	2014-09-02 18:42:44 +00:00
Matt Arsenault	2e1586e561	Fix comment and unnecessary check for FP build_vectors. This was copy-paste from the integer version, but FP build_vectors don't truncate. llvm-svn: 216928	2014-09-02 18:33:51 +00:00
Pete Cooper	92fc86558d	Change MCSchedModel to be a struct of statically initialized data. This removes static initializers from the backends which generate this data, and also makes this struct match the other Tablegen generated structs in behaviour Reviewed by Andy Trick and Chandler C llvm-svn: 216919	2014-09-02 17:43:54 +00:00
David Blaikie	7c49145b3c	unique_ptrify PBQPBuilder::build llvm-svn: 216918	2014-09-02 17:42:01 +00:00
Hal Finkel	9b47ecfb28	Enable splitting indexing from loads with TargetConstants When I recommitted r208640 (in r216898) I added an exclusion for TargetConstant offsets, as there is no guarantee that a backend can handle them on generic ADDs (even if it generates them during address-mode matching) -- and, specifically, applying this transformation directly with TargetConstants caused a self-hosting failure on PPC64. Ignoring all TargetConstants, however, is less than ideal. Instead, for non-opaque constants, we can convert them into regular constants for use with the generated ADD (or SUB). llvm-svn: 216908	2014-09-02 16:05:23 +00:00
Hal Finkel	afd5700add	Revert "Revert '[DAGCombiner] Split up an indexed load if only the base pointer value is live'" I reverted r208640 in r209747 because r208640 broke self-hosting on PPC64. The underlying cause of the failure is that pre-inc loads with increments represented by ISD::TargetConstants were being transformed into ISD:::ADDs with ISD::TargetConstant operands. PPC doesn't have a pattern for those, and so they were selected as invalid r+r adds. This recommits r208640, rebased and with an exclusion for ISD::TargetConstant increments. This behavior seems correct, although in the future we might want to ask the target to split out the indexing that uses ISD::TargetConstants. Unfortunately, I don't yet have small test case where the relevant invalid 'add' instruction is not itself dead (and thus eliminated by DeadMachineInstructionElim -- sometimes bugpoint is too good at removing things) Original commit message (by Adam Nemet): Right now the load may not get DCE'd because of the side-effect of updating the base pointer. This can happen if we lower a read-modify-write of an illegal larger type (e.g. i48) such that the modification only affects one of the subparts (the lower i32 part but not the higher i16 part). See the testcase. In order to spot the dead load we need to revisit it when SimplifyDemandedBits decided that the value of the load is masked off. This is the CommitTargetLoweringOpt piece. I checked compile time with ARM64 by sending SPEC bitcode files through llc. No measurable change. Fixes <rdar://problem/16031651> llvm-svn: 216898	2014-09-02 06:24:04 +00:00
Saleem Abdulrasool	2ae3d80803	CodeGen: indicate Windows unwind data format The structures for Windows unwinding are shared across multiple platforms. Indicate the encoding to be used for the particular target. Use this to switch the unwind emitter instantiated by the AsmPrinter. llvm-svn: 216895	2014-09-01 23:48:39 +00:00
Saleem Abdulrasool	36d75c8330	CodeGen: split out the Win64Exception emitter Move the Windows unwind information emitter into a separate header. This is not related to DWARF based emission. NFC. llvm-svn: 216894	2014-09-01 23:48:34 +00:00
Patrik Hagglund	58aa72df97	Fix in InlineSpiller to make the rematerilization loop also consider implicit uses of the whole register when a sub register is defined. Now the same iterator is used in the rematerilization loop as in the spill loop later. Patch provided by Mikael Holmen. This fix was proposed and reviewed by Quentin Colombet, http://lists.cs.uiuc.edu/pipermail/llvmdev/2014-August/076135.html. Unfortunately, this error in the rematerilization code has only been seen in a large test case for an out-of-tree target, and is probably hard to reproduce on an in-tree target. Therefore, no testcase is provided. llvm-svn: 216873	2014-09-01 11:04:07 +00:00
Jingyue Wu	2b97db6061	[MachineSink] Use the real post dominator tree Summary: Fixes a FIXME in MachineSinking. Instead of using the simple heuristics in isPostDominatedBy, use the real MachinePostDominatorTree. The old heuristics caused instructions to sink unnecessarily, and might create register pressure. Test Plan: Added a NVPTX codegen test to verify that our change is in effect. It also shows the unnecessary register pressure caused by over-sinking. Updated affected tests in AArch64 and X86. Reviewers: eliben, meheff, Jiangning Reviewed By: Jiangning Subscribers: jholewinski, aemerson, mcrosier, llvm-commits Differential Revision: http://reviews.llvm.org/D4814 llvm-svn: 216862	2014-09-01 03:47:25 +00:00
David Blaikie	836d1e85c7	DebugInfo: Elide lexical scopes which only contain other (inline or lexical) scopes. DW_TAG_lexical_scopes inform debuggers about the instruction range for which a given variable (or imported declaration/module/etc) is valid. If the scope doesn't itself contain any such entities, it's a waste of space and should be omitted. We were correctly doing this for entirely empty leaves, but not for intermediate nodes. Reduces total (not just debug sections) .o file size for a bootstrap -gmlt LLVM by 22% and bootstrap -gmlt clang executable by 13%. The wins for a full -g build will be less as a % (and in absolute terms), but should still be substantial - with some of that win being fewer relocations, thus more substantiall reducing link times than fewer bytes alone would have. llvm-svn: 216861	2014-08-31 21:26:22 +00:00
David Blaikie	2851a73611	DebugInfo: Move argument creation up into the caller that's unambiguously handling the subprogram scope (replacing a conditional with an assertion in the process) llvm-svn: 216845	2014-08-31 18:04:28 +00:00
David Blaikie	2b1ec5518c	Delay adding imported entity DIEs to the lexical scope, streamlining the check for "this scope has nothing in it" This makes the emptiness of the scope with regards to variables and nested scopes is the same as with regards to imported entities. Just check if we had nothing at all before we build the node. llvm-svn: 216840	2014-08-31 05:46:17 +00:00

... 3 4 5 6 7 ...

17573 Commits