llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Rafael Espindola	e6465aeb2d	Split GlobalValue into GlobalValue and GlobalObject. This allows code to statically accept a Function or a GlobalVariable, but not an alias. This is already a cleanup by itself IMHO, but the main reason for it is that it gives a lot more confidence that the refactoring to fix the design of GlobalAlias is correct. That will be a followup patch. llvm-svn: 208716	2014-05-13 18:45:48 +00:00
Joey Gouly	c0f94cb136	[CGP] r205941 changed the logic, so that a cast happens before 'Result' is compared to 'AddrMode.BaseReg'. In the case that 'AddrMode.BaseReg' is nullptr, 'Result' will also be nullptr, so the cast causes an assertion. We should use dyn_cast_or_null here to check 'Result' is not null and it is an instruction. Bug found by Mats Petersson, and I reduced his IR to get a test case. llvm-svn: 208705	2014-05-13 15:42:45 +00:00
David Blaikie	1203463603	Revert "DebugInfo: Include lexical scopes in inlined subroutines." This reverts commit r208506. Some inlined subroutine scopes appear to be missing with this change. Reverting while I investigate. llvm-svn: 208642	2014-05-12 23:53:03 +00:00
Pete Cooper	801ae0ce03	Use a logical not when inverting SetCC. This unfortunately doesn't fire on any targets so I couldn't find a test case to trigger it. The problem occurs when a non-i1 setcc is inverted. For example 'i8 = setcc' will get 'xor 0xff' to invert this. This is clearly wrong when the boolean contents are ZeroOrOne. This patch introduces getLogicalNOT and updates SetCC legalisation to use it. Reviewed by Hal Finkel. llvm-svn: 208641	2014-05-12 23:26:58 +00:00
Adam Nemet	78e81b5109	[DAGCombiner] Split up an indexed load if only the base pointer value is live Right now the load may not get DCE'd because of the side-effect of updating the base pointer. This can happen if we lower a read-modify-write of an illegal larger type (e.g. i48) such that the modification only affects one of the subparts (the lower i32 part but not the higher i16 part). See the testcase. In order to spot the dead load we need to revisit it when SimplifyDemandedBits decided that the value of the load is masked off. This is the CommitTargetLoweringOpt piece. I checked compile time with ARM64 by sending SPEC bitcode files through llc. No measurable change. Fixes <rdar://problem/16031651> llvm-svn: 208640	2014-05-12 23:00:03 +00:00
David Blaikie	32cd8a7919	DebugInfo: Attach DW_AT_inline to inlined subprograms at DIE-construction time rather than as a post-processing step. llvm-svn: 208636	2014-05-12 21:50:44 +00:00
David Blaikie	5ca904146d	DwarfDebug: Avoid an extra map lookup while constructing abstract scope DIEs and reduce nesting/conditionals. One test case had to be updated as it still had the extra indirection for the variable list - removing the extra indirection got it back to passing. llvm-svn: 208608	2014-05-12 18:23:35 +00:00
Matt Arsenault	43171f4aad	Make SimplifyDemandedBits understand BUILD_PAIR llvm-svn: 208598	2014-05-12 17:14:48 +00:00
Saleem Abdulrasool	5f98d2b767	CodeGen: add parenthesis around complex expression Add missing parenthesis suggested by GCC. NFC. llvm-svn: 208519	2014-05-12 06:08:18 +00:00
Hal Finkel	5b038e4cbc	Pass the value type to TLI::getRegisterByName We must validate the value type in TLI::getRegisterByName, because if we don't and the wrong type was used with the IR intrinsic, then we'll assert (because we won't be able to find a valid register class with which to construct the requested copy operation). For PPC64, additionally, the type information is necessary to decide between the 64-bit register and the 32-bit subregister. No functionality change. llvm-svn: 208508	2014-05-11 19:29:07 +00:00
David Blaikie	a56eed1551	DebugInfo: Include lexical scopes in inlined subroutines. llvm-svn: 208506	2014-05-11 18:12:17 +00:00
David Blaikie	c11c7cade1	DwarfUnit: Make explicit a limitation/bug in enumeration constant emission. Filed as PR19712, LLVM fails to detect the right type of an enum constant when a frontend does not provide an underlying type for the enumeration type. llvm-svn: 208502	2014-05-11 17:04:05 +00:00
David Blaikie	31bd82e5d9	DwarfUnit: Pick a winner between isTypeSigned and isUnsignedDIType. And the winner by a nose is isUnsignedDIType, for no particular reason. These two functions were just complements of each other and used in very related code, so refactor callers to just use one of them. llvm-svn: 208500	2014-05-11 16:08:41 +00:00
David Blaikie	5c0fff0a62	DwarfUnit: Factor out calling isUnsignedDIType into a utility function so each caller of emitConstantValue doesn't have to call it separately. llvm-svn: 208496	2014-05-11 15:56:59 +00:00
David Blaikie	32bfd4a974	DwarfUnit: Share common constant value emission between APInts of small (<= 64 bit) and MCOperand immediates. Doesn't seem a good reason to duplicate this code (it was more literally duplicated prior to r208494, and while the dataN code /does/ actually fire in this case, it doesn't seem necessary (and the DWARF standard recommends using udata/sdata pervasively instead of dataN, so as to indicate signedness of the values)) llvm-svn: 208495	2014-05-11 15:47:39 +00:00
David Blaikie	1f94b7156a	DebugInfo: Simplify constant value emission. This code looks to have become dead at some time in the past. I tried to reproduce cases where LLVM would emit constants with dataN, but could not. Upon inspection it seems the code doesn't do that anymore - the only time a size is provided by isTypeSigned is when the type is signed, and in those cases we use sdata. dataN is only used for unsigned types and isTypeSigned doesn't provide a value for sizeInBits in that case. Remove the dead cases/size plumbing. llvm-svn: 208494	2014-05-11 15:06:20 +00:00
Oliver Stannard	2b1166b162	ARM: HFAs must be passed in consecutive registers When using the ARM AAPCS, HFAs (Homogeneous Floating-point Aggregates) must be passed in a block of consecutive floating-point registers, or on the stack. This means that unused floating-point registers cannot be back-filled with part of an HFA, however this can currently happen. This patch, along with the corresponding clang patch (http://reviews.llvm.org/D3083) prevents this. llvm-svn: 208413	2014-05-09 14:01:47 +00:00
Quentin Colombet	112f8aafc2	[TargetInstrInfo] Fix the implementation of commuteInstruction to match the comment of the API. Relaxes the behavior of TargetInstrInfo::commuteInstruction when TargetInstrInfo::findCommutedOpIndices returns false. Previously TargetInstrInfo triggered a fatal error in such situation whereas based on the comment in the API it should just return nullptr. Indeed the only precondition that should be ensured is that the instruction must be commutable. llvm-svn: 208371	2014-05-08 23:12:27 +00:00
David Blaikie	f61e94a80f	Reapply r207876 (Try simplifying LexicalScopes ownership again) including a workaround for an MSVC2012 bug regarding forward_as_tuple (r207876 was reverted in r208131 after seeing some consistent buildbot failure for MSVC 2012. The original commits were in r207724-r207726) Takumi was nice enough to dig into this and locate this Microsoft Connect issue: http://connect.microsoft.com/VisualStudio/feedback/details/814899/forward-as-tuple-debug-implementation-error describing a bug in MSVC2012's forward_as_tuple implementation. Since the parameters in this instance are trivial/small, pass them by value (using make_tuple) instead of perfectly-forwarded tuple of rvalue references (involving the broken forward_as_tuple). Hopefully this will satisfy MSVC2012. llvm-svn: 208364	2014-05-08 22:24:51 +00:00
Hal Finkel	faaba5686b	Fix a spelling error llvm-svn: 208314	2014-05-08 13:42:57 +00:00
Hal Finkel	c52e65b830	Move late partial-unrolling thresholds into the processor definitions The old method used by X86TTI to determine partial-unrolling thresholds was messy (because it worked by testing target features), and also would not correctly identify the target CPU if certain target features were disabled. After some discussions on IRC with Chandler et al., it was decided that the processor scheduling models were the right containers for this information (because it is often tied to special uop dispatch-buffer sizes). This does represent a small functionality change: - For generic x86-64 (which uses the SB model and, thus, will get some unrolling). - For AMD cores (because they still currently use the SB scheduling model) - For Haswell (based on benchmarking by Louis Gerbarg, it was decided to bump the default threshold to 50; we're working on a test case for this). Otherwise, nothing has changed for any other targets. The logic, however, has been moved into BasicTTI, so other targets may now also opt-in to this functionality simply by setting LoopMicroOpBufferSize in their processor model definitions. llvm-svn: 208289	2014-05-08 09:14:44 +00:00
Matt Arsenault	903ece3700	Fix using wrong result type for setcc. When reducing the bitwidth of a comparison against a constant, the original setcc's result type was used, which was incorrect. No test since I don't think any other in tree targets change the bitwidth of the setcc type depending on the bitwidth of the compared type. llvm-svn: 208236	2014-05-07 18:26:58 +00:00
Rafael Espindola	765e5e78cf	Remove the UseCFI option from createAsmStreamer. We were already always passing true, this just removes the option. llvm-svn: 208205	2014-05-07 13:00:43 +00:00
Zinovy Nis	ce225593e1	[BUG][REFACTOR] 1) Fix for printing debug locations for absolute paths. 2) Location printing is moved into public method DebugLoc::print() to avoid re-inventing the wheel. Differential Revision: http://reviews.llvm.org/D3513 llvm-svn: 208177	2014-05-07 09:51:22 +00:00
David Blaikie	7acf842266	Revert "Try simplifying LexicalScopes ownership again." Speculatively reverting due to a suspicious failure on a Windows buildbot. This reverts commit 10c37a012ea11596d44cd9059fe09c959caf30c8. llvm-svn: 208131	2014-05-06 21:07:17 +00:00
Benjamin Kramer	593859517f	TTI: Estimate @llvm.fmuladd cost as fmul + fadd when FMA's aren't legal on the target. llvm-svn: 208115	2014-05-06 18:36:23 +00:00
Renato Golin	8a9a382ab2	Implememting named register intrinsics This patch implements the infrastructure to use named register constructs in programs that need access to specific registers (bare metal, kernels, etc). So far, only the stack pointer is supported as a technology preview, but as it is, the intrinsic can already support all non-allocatable registers from any architecture. llvm-svn: 208104	2014-05-06 16:51:25 +00:00
David Blaikie	5f72874fd2	Try simplifying LexicalScopes ownership again. Committed initially in r207724-r207726 and reverted due to compiler-rt crashes in r207732. Instead, fix this harder with unordered_map and store the LexicalScopes by value in the map. This did necessitate moving the definition of LexicalScope above the definition of LexicalScopes. Let's see how the buildbots/compilers tolerate unordered_map::emplace + std::piecewise_construct + std::forward_as_tuple... llvm-svn: 207876	2014-05-02 22:21:05 +00:00
Benjamin Kramer	bc327c6bd3	Satisfy GCC's urgent need for parentheses around ‘&&’ within ‘\|\|’. llvm-svn: 207871	2014-05-02 21:28:49 +00:00
Tim Northover	4aa1f54c61	DAGCombine: prevent formation of illegal ConstantFP nodes. llvm-svn: 207850	2014-05-02 17:25:02 +00:00
Benjamin Kramer	96dab04f0f	Allow SelectionDAG::FoldConstantArithmetic to work when it's called with a vector VT but scalar values. llvm-svn: 207835	2014-05-02 12:35:22 +00:00
Juergen Ributzka	b036be4ecc	[Stackmaps] Pacify windows buildbot. llvm-svn: 207807	2014-05-01 22:39:26 +00:00
Juergen Ributzka	6bed4e0fc7	[Stackmaps] Add command line option to specify the stackmap version. llvm-svn: 207805	2014-05-01 22:21:30 +00:00
Juergen Ributzka	60208fb4c5	[Stackmaps] Refactor serialization code. No functional change intended. llvm-svn: 207804	2014-05-01 22:21:27 +00:00
Juergen Ributzka	08694158e1	[Stackmaps] Replace the custom ConstantPool class with a MapVector. llvm-svn: 207803	2014-05-01 22:21:24 +00:00
Richard Smith	3e92744bc0	Speculatively roll back r207724-r207726, which are code cleanup changes and appear to be breaking a bootstrapped build of compiler-rt. llvm-svn: 207732	2014-05-01 00:46:58 +00:00
David Blaikie	0128475a9a	LexicalScopes: Use unique_ptr to manage ownership of abstract LexicalScopes. llvm-svn: 207726	2014-04-30 23:46:27 +00:00
David Blaikie	be0292793b	Forgotten reformatting. llvm-svn: 207725	2014-04-30 23:42:04 +00:00
David Blaikie	465a8b04c4	LexicalScopes: use unique_ptr to own LexicalScope objects. Ownership of abstract scopes coming soon. llvm-svn: 207724	2014-04-30 23:40:59 +00:00
Alexey Samsonov	86bd2c034d	Use a single data structure to store all user variables in DwarfDebug Summary: Get rid of UserVariables set, and turn DbgValues into MapVector to get a fixed ordering, as suggested in review for http://reviews.llvm.org/D3573. Test Plan: llvm regression tests Reviewers: dblaikie Reviewed By: dblaikie Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D3579 llvm-svn: 207720	2014-04-30 23:02:40 +00:00
David Blaikie	df8ca29906	Revert "Emit DW_AT_object_pointer once, on the declaration, for each function." Breaks GDB buildbot (http://lab.llvm.org:8011/builders/clang-x86_64-ubuntu-gdb-75/builds/14517) GCC emits DW_AT_object_pointer /everywhere/ (declaration, abstract definition, inlined subroutine), but it looks like GCC relies on it being somewhere other than the declaration, at least. I'll experiment further & can hopefully still remove it from the inlined_subroutine. This reverts commit r207705. llvm-svn: 207719	2014-04-30 22:58:19 +00:00
Joerg Sonnenberger	113e756703	Prepare support of Itanium ABI on ARM as opposed to EHABI by conditionally emitting .fnstart and friends only for EHABI. llvm-svn: 207718	2014-04-30 22:43:13 +00:00
David Blaikie	f12b7ea2d9	DebugInfo: Omit DW_AT_artificial on DW_TAG_formal_parameters in DW_TAG_inlined_subroutines. They just don't need to be there - they're inherited from the abstract definition. In theory I would like them to be inherited from the declaration, but the DWARF standard doesn't quite say that... we can probably do it anyway but I'm less confident about that so I'll leave it for a separate commit. llvm-svn: 207717	2014-04-30 22:41:33 +00:00
Alexey Samsonov	8f5245ce6b	Convert more loops to range-based equivalents llvm-svn: 207714	2014-04-30 22:17:38 +00:00
Alexey Samsonov	db1cffce83	Slightly simplify code in DwarfDebug::beginFunction llvm-svn: 207710	2014-04-30 21:44:17 +00:00
Alexey Samsonov	90773577bb	Move logic for calculating DBG_VALUE history map into separate file/class. Summary: No functionality change. Test Plan: llvm regression test suite. Reviewers: dblaikie Reviewed By: dblaikie Subscribers: echristo, llvm-commits Differential Revision: http://reviews.llvm.org/D3573 llvm-svn: 207708	2014-04-30 21:34:11 +00:00
David Blaikie	5560047d31	Emit DW_AT_object_pointer once, on the declaration, for each function. This effectively reverts r164326, but adds some comments and justification and ensures we /don't/ emit the DW_AT_object_pointer on the (abstract and concrete) definitions. (while still preserving it on standalone definitions involving ObjC Blocks) This does increase the size of member function declarations from 7 to 11 bytes, unfortunately, but still seems like the Right Thing to do so that callers that see only the declaration still have the information about the object pointer. That said, I don't know what, if any, DWARF consumers don't have a heuristic to guess this in the case of normal C++ member functions - perhaps we can remove it entirely. llvm-svn: 207705	2014-04-30 21:29:41 +00:00
Weiming Zhao	3625856a33	[ARM64] Prevent bit extraction to be adjusted by following shift For pattern like ((x >> C1) & Mask) << C2, DAG combiner may convert it into (x >> (C1-C2)) & (Mask << C2), which makes pattern matching of ubfx more difficult. For example: Given %shr = lshr i64 %x, 4 %and = and i64 %shr, 15 %arrayidx = getelementptr inbounds [8 x [64 x i64]]* @arr, i64 0, %i64 2, i64 %and %0 = load i64* %arrayidx With current shift folding, it takes 3 instrs to compute base address: lsr x8, x0, #1 and x8, x8, #0x78 add x8, x9, x8 If using ubfx, it only needs 2 instrs: ubfx x8, x0, #4, #4 add x8, x9, x8, lsl #3 This fixes bug 19589 llvm-svn: 207702	2014-04-30 21:07:24 +00:00
Reid Kleckner	1d2b4b0f8f	Fix the clang-cl self-host build by defining ~DwarfDebug out of line DwarfDebug.h has a SmallVector member containing a unique_ptr of an incomplete type. MSVC doesn't have key functions, so the vtable and dtor are emitted in AsmPrinter.cpp, where DwarfDebug's ctor is called. AsmPrinter.cpp include DwarfUnit.h and doesn't get a complete definition of DwarfTypeUnit. We could fix the problem by including DwarfUnit.h in DwarfDebug.h, but that would increase header bloat. Instead, define ~DwarfDebug out of line. llvm-svn: 207701	2014-04-30 20:34:31 +00:00
Alexey Samsonov	307f1e3874	Convert several loops over MachineFunction basic blocks to range-based loops llvm-svn: 207683	2014-04-30 18:29:51 +00:00
Craig Topper	79b097d66a	Use makeArrayRef insted of calling ArrayRef<T> constructor directly. I introduced most of these recently. llvm-svn: 207616	2014-04-30 07:17:30 +00:00
David Blaikie	b630e434da	Fix some 80 cols violations committed in r207539 Caught by Eric Christopher in post-commit review. llvm-svn: 207595	2014-04-29 23:43:06 +00:00
Benjamin Kramer	4f8fb8ff6c	raw_ostream: Forward declare OpenFlags and include FileSystem.h only where necessary. llvm-svn: 207593	2014-04-29 23:26:49 +00:00
Jim Grosbach	f4913dabb0	Tidy up whitespace. llvm-svn: 207583	2014-04-29 22:41:50 +00:00
David Blaikie	1fbf8e4869	DwarfDebug: Split the initialization of abstract and non-abstract subprogram DIEs. These were called from distinct places and had significant distinct behavior. No need to make that a dynamic check inside the function rather than just having two functions (refactoring some common code into a helper function to be called from the two separate functions). llvm-svn: 207539	2014-04-29 15:58:35 +00:00
Craig Topper	244adfe60a	[C++11] Add 'override' keywords and remove 'virtual'. Additionally add 'final' and leave 'virtual' on some methods that are marked virtual without overriding anything and have no obvious overrides themselves. llvm-svn: 207511	2014-04-29 07:58:41 +00:00
David Blaikie	1f2c2892ef	Remove DwarfUnit::LabelRange since it's unused. Seems at some point the intent was to emit fission ranges_base as unique per CU but the code today emits ranges_base as the start of the ranges section for all CUs being compiled and all the ranges_base relative addresses are relative to that. So removing this dead code and leaving the status quo until there's a reason to change it (perhaps something's faster if it has distinct ranges for each CU). llvm-svn: 207464	2014-04-28 23:36:52 +00:00
David Blaikie	e6804866b6	AddressPool::HasBeenUsed: Add comment explaining the use-case for this flag. Based on code review by Eric Christopher on r207323 llvm-svn: 207460	2014-04-28 22:52:50 +00:00
David Blaikie	f9b06eb99e	DIE: Document some learnings about why the world isn't perfect. llvm-svn: 207458	2014-04-28 22:41:39 +00:00
David Blaikie	93b3fbef79	Satisfy sub-optimal GCC warning. (Clang doesn't warn here because it knows the string is benign - the assert still checks what it's intended to - though putting the correct parens does make clang-format format the code a little better) llvm-svn: 207456	2014-04-28 22:27:26 +00:00
Eric Christopher	0f5adbed56	We already calculate WideVT above, just reuse it. Patch by Jan Vesely <jan.vesely@rutgers.edu>. llvm-svn: 207455	2014-04-28 22:24:57 +00:00
Eli Bendersky	028d8b328f	Add (...) around && clause to appeace gcc 4.8's warning llvm-svn: 207452	2014-04-28 22:19:12 +00:00
David Blaikie	64838d0189	DebugInfo: Just store the DIE by value in the DwarfUnit Since all 4 ctor calls in DwarfDebug just pass in a trivially constructed DIE with the right tag type, sink the tag selection down into the Dwarf*Unit ctors (removing the argument entirely from callers in DwarfDebug) and initialize the DIE member in DwarfUnit. llvm-svn: 207448	2014-04-28 21:14:27 +00:00
David Blaikie	2977225bc4	Pass DIEs to DwarfUnit constructors by unique_ptr. llvm-svn: 207447	2014-04-28 21:04:29 +00:00
Eric Christopher	a5d10a654c	Reformat, 80-col, tab characters, etc. llvm-svn: 207444	2014-04-28 20:42:22 +00:00
David Blaikie	e15490943b	Improve explicit memory ownership of DIEs Now that the subtle constructScopeDIE has been refactored into two functions - one returning memory to take ownership of, one returning a pointer to already owning memory - push unique_ptr through more APIs. I think this completes most of the unique_ptr ownership of DIEs. llvm-svn: 207442	2014-04-28 20:36:45 +00:00
David Blaikie	7b4f7635db	DwarfDebug: Omit DW_AT_object_pointer on inlined_subroutines While refactoring out constructScopeDIE into two functions I realized we were emitting DW_AT_object_pointer in the inlined subroutine when we didn't need to (GCC doesn't, and the abstract subprogram definition has the information already). So here's the refactoring and the bug fix. This is one step of refactoring to remove some subtle memory ownership semantics. It turns out the original constructScopeDIE returned ownership in its return value in some cases and not in others. The split into two functions now separates those two semantics - further cleanup (unique_ptr, etc) will follow. llvm-svn: 207441	2014-04-28 20:27:02 +00:00
Craig Topper	9683cb114b	Convert more SelectionDAG functions to use ArrayRef. llvm-svn: 207397	2014-04-28 05:57:50 +00:00
Craig Topper	b663bffa27	[C++] Use 'nullptr'. llvm-svn: 207394	2014-04-28 04:05:08 +00:00
Craig Topper	aec1381207	Convert AddNodeIDNode and SelectionDAG::getNodeIfExiists to use ArrayRef<SDValue> llvm-svn: 207383	2014-04-27 23:22:43 +00:00
Craig Topper	0b9e8dcc15	Convert SelectionDAGISel::MorphNode to use ArrayRef. llvm-svn: 207379	2014-04-27 19:21:20 +00:00
Craig Topper	1e0e54db16	Convert SelectionDAG::MorphNodeTo to use ArrayRef. llvm-svn: 207378	2014-04-27 19:21:16 +00:00
Craig Topper	1efda44640	Convert SelectionDAG::SelectNodeTo to use ArrayRef. llvm-svn: 207377	2014-04-27 19:21:11 +00:00
Craig Topper	e5c6e7f4ea	Convert one last signature of getNode to take an ArrayRef of SDUse. llvm-svn: 207376	2014-04-27 19:21:06 +00:00
Craig Topper	fcd493c542	Convert SDNode constructor to use ArrayRef. llvm-svn: 207375	2014-04-27 19:21:02 +00:00
Craig Topper	536995c0a7	Convert SelectionDAG::getMergeValues to use ArrayRef. llvm-svn: 207374	2014-04-27 19:20:57 +00:00
Craig Topper	83c26f9284	Const-correct SelectionDAG::getAtomic. llvm-svn: 207373	2014-04-27 19:20:47 +00:00
Adrian Prantl	c6c1b378f8	Clarify the doxygen comment for AsmPrinter::EmitDwarfRegOpPiece and add default arguments to the function. No functional change. llvm-svn: 207372	2014-04-27 18:50:45 +00:00
Benjamin Kramer	764309a6cd	X86TTI: Adjust sdiv cost now that we can lower it on plain SSE2. Includes a fix for a horrible typo that caused all SDIV costs to be slightly off :) llvm-svn: 207371	2014-04-27 18:47:54 +00:00
Adrian Prantl	474f0cfd36	Debug info: Refactor EmitDwarfRegOpPiece to be a member function of AsmPrinter. No functional change. http://reviews.llvm.org/D3373 rdar://problem/15928306 llvm-svn: 207369	2014-04-27 18:25:45 +00:00
Adrian Prantl	33815a6326	Debug Info: Prepare DebugLocEntry to handle more than a single value per entry. This is in preparation for generic DW_OP_piece support. No functional change so far. http://reviews.llvm.org/D3373 rdar://problem/15928306 llvm-svn: 207368	2014-04-27 18:25:40 +00:00
Benjamin Kramer	171a3310a4	Make helper functions static. llvm-svn: 207359	2014-04-27 14:54:59 +00:00
David Blaikie	bcb7340715	Remove redundant explicit default initialization of non-trivially constructed member. llvm-svn: 207357	2014-04-27 14:47:23 +00:00
NAKAMURA Takumi	4b708bedcf	Add the default constructor DwarfAccelTable::DataArray() to initialize (MCSymbol*)StrSym explicitly. It will fix crash in codegen on msvc x64. llvm-svn: 207356	2014-04-27 11:59:44 +00:00
Benjamin Kramer	f9669b910d	SelectionDAG: Aggressively fold shuffles of constant splats. llvm-svn: 207352	2014-04-27 11:41:06 +00:00
Benjamin Kramer	e6a357c0fc	DAGCombiner: Simplify code a bit, make more transforms work with vectors. llvm-svn: 207338	2014-04-26 23:09:49 +00:00
David Blaikie	84f56770da	DwarfDebug: Roll argument into call. llvm-svn: 207334	2014-04-26 22:37:45 +00:00
David Blaikie	b12eecfe0b	DebugInfo: Fix and test a regression caused by r207263 causing the DW_AT_object_pointer to go missing on blocks Noticed by inspection. Test coverage added. llvm-svn: 207333	2014-04-26 22:12:18 +00:00
Craig Topper	e0741a0fcb	Convert getMemIntrinsicNode to take ArrayRef of SDValue instead of pointer and size. llvm-svn: 207329	2014-04-26 19:29:41 +00:00
Craig Topper	1b1f54bcca	Convert SelectionDAG::getNode methods to use ArrayRef<SDValue>. llvm-svn: 207327	2014-04-26 18:35:24 +00:00
Craig Topper	66f68bf6f5	Remove an unused version of getMemIntrinsicNode and getNode. Additionally, these were calling makeVTList with the pointers passed in which would were unlikely to belong to SelectionDAG and likely would have just been stack pointers. llvm-svn: 207326	2014-04-26 18:35:13 +00:00
David Blaikie	25a7c1380d	DWARF Type Units: Avoid emitting type units under fission if the type requires an address. Since there's no way to ensure the type unit in the .dwo and the type unit skeleton in the .o are correlated, this cannot work. This implementation is a bit inefficient for a few reasons, called out in comments. llvm-svn: 207323	2014-04-26 17:27:38 +00:00
David Blaikie	2cfea484cb	DwarfDebug: Minor refactoring around type unit construction Sinking addition of the declaration attribute down to where the signature is added. So that if the signature is not added neither is the declaration attribute (this will come in handy when aborting type unit construction to instead emit the type into the CU directly in some cases) Pull out type unit identifier hashing just to simplify the function a little, it'll be getting longer. llvm-svn: 207321	2014-04-26 16:26:41 +00:00
Benjamin Kramer	89fb3dd5a4	Rip out X86-specific vector SDIV lowering, make the corresponding DAGCombiner transform work on vectors. llvm-svn: 207316	2014-04-26 13:00:53 +00:00
Benjamin Kramer	163df6bc62	DAGCombiner: Turn divs of vector splats into vectorized multiplications. Otherwise the legalizer would just scalarize everything. Support for mulhi in the targets isn't that great yet so on most targets we get exactly the same scalarized output. Add a test for x86 vector udiv. I had to disable the mulhi nodes on ARM because there aren't any patterns for it. As far as I know ARM has instructions for getting the high part of a multiply so this should be fixed. llvm-svn: 207315	2014-04-26 12:06:28 +00:00
Michael Zolotukhin	1c3340dd78	Revert r206749 till a final decision about the intrinsics is made. llvm-svn: 207313	2014-04-26 09:56:41 +00:00
Juergen Ributzka	d783eb73d8	[DAG] During DAG legalization keep opaque constants even after expanding. The included test case would return the incorrect results, because the expansion of an shift with a constant shift amount of 0 would generate undefined behavior. This is because ExpandShiftByConstant assumes that all shifts by constants with a value of 0 have already been optimized away. This doesn't happen for opaque constants and usually this isn't a problem, because opaque constants won't take this code path - they are not supposed to. In the case that the opaque constant has to be expanded by the legalizer, the legalizer would drop the opaque flag. In this case we hit the limitations of ExpandShiftByConstant and create incorrect code. This commit fixes the legalizer by not dropping the opaque flag when expanding opaque constants and adding an assertion to ExpandShiftByConstant to catch this not supported case in the future. This fixes <rdar://problem/16718472> llvm-svn: 207304	2014-04-26 02:58:04 +00:00
Eric Christopher	0914022daa	Make sure that rangelists are also relative to the compile unit low_pc similar to location lists. Fixes PR19563 llvm-svn: 207283	2014-04-25 22:23:54 +00:00
David Blaikie	014910bdcb	DwarfAccelTable: Store the string symbol in the accelerator table to avoid duplicate lookup. This also avoids the need for subtly side-effecting calls to manifest strings in the string table at the point where items are added to the accelerator tables. llvm-svn: 207281	2014-04-25 22:21:35 +00:00
David Blaikie	795c9f381d	Encapsulate the DWARF string pool in a separate type. Pulls out some more code from some of the rather monolithic DWARF classes. Unlike the address table, the string table won't move up into DwarfDebug - each DWARF file has its own string table (but there can be only one address table). llvm-svn: 207277	2014-04-25 21:34:35 +00:00
Adrian Prantl	7566e72bb8	This reapplies r207235 with an additional bugfixes caught by the msan buildbot - do not insert debug intrinsics before phi nodes. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207269	2014-04-25 20:49:25 +00:00
David Blaikie	c5c1e1ec36	DwarfUnit: Remove unused function llvm-svn: 207264	2014-04-25 20:02:24 +00:00
David Blaikie	7ce6b4003f	DIE: Pass ownership of children via std::unique_ptr rather than raw pointer. This should reduce the chance of memory leaks like those fixed in r207240. There's still some unclear ownership of DIEs happening in DwarfDebug. Pushing unique_ptr and references through more APIs should help expose the cases where ownership is a bit fuzzy. llvm-svn: 207263	2014-04-25 20:00:34 +00:00
David Blaikie	e3e1144d71	DIEEntry: Refer to the specified DIE via reference rather than pointer. Makes some more cases (the unit tests, specifically), lexically compatible with a change to unique_ptr. llvm-svn: 207261	2014-04-25 19:33:43 +00:00
David Blaikie	e5d83756ee	DwarfUnit: return by reference from createAndAddDIE Since this doesn't return ownership (the DIE has been added to the specified parent already) nor return null, just return by reference. llvm-svn: 207259	2014-04-25 18:52:29 +00:00
David Blaikie	ad6109fd52	Return DIE by reference instead of pointer from DwarfUnit::getUnitDie llvm-svn: 207255	2014-04-25 18:35:57 +00:00
David Blaikie	ec60d29162	DwarfUnit: Suddently, DIE references, everywhere. This'll make changing to unique_ptr ownership of DIEs easier since the usages will now have '*' on them making them textually compatible between unique_ptr and raw pointer. llvm-svn: 207253	2014-04-25 18:26:14 +00:00
Adrian Prantl	319db7c542	Revert "This reapplies r207130 with an additional testcase+and a missing check for" This reverts commit 207235 to investigate msan buildbot breakage. llvm-svn: 207250	2014-04-25 18:18:09 +00:00
David Blaikie	31047da334	Refactor some common logic in DwarfUnit::constructVariableDIE and pass non-null DIE by reference to DbgVariable::setDIE llvm-svn: 207244	2014-04-25 17:32:19 +00:00
Adrian Prantl	7f9d1e9fd6	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207235	2014-04-25 17:01:00 +00:00
David Blaikie	31058c6eb0	Add missing cpp file header Code review feedback from Paul Robinson on r207022 llvm-svn: 207198	2014-04-25 06:22:32 +00:00
Adrian Prantl	0338f80f17	Revert "This reapplies r207130 with an additional testcase+and a missing check for" Typo in testcase. llvm-svn: 207166	2014-04-25 00:42:50 +00:00
Adrian Prantl	bf019d19e9	This reapplies r207130 with an additional testcase+and a missing check for AllocaInst that was missing in one location. Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine.ll testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207165	2014-04-25 00:38:40 +00:00
Adrian Prantl	0b669e8f79	Revert "Debug info for optimized code: Support variables that are on the stack and" This reverts commit 207130 for buildbot breakage. llvm-svn: 207162	2014-04-25 00:04:49 +00:00
Richard Smith	ca779a185f	Remove C++11ism (specializing a template in a surrounding namespace) to appease the buildbots. llvm-svn: 207136	2014-04-24 18:49:15 +00:00
Richard Smith	a49b5ce5a2	[modules] "Specialize" a function by actually specializing a function template rather than by adding an overload and hoping that it's declared before the code that calls it. (In a modules build, it isn't.) llvm-svn: 207133	2014-04-24 18:27:29 +00:00
Adrian Prantl	807e5d8a9a	Debug info for optimized code: Support variables that are on the stack and described by DBG_VALUEs during their lifetime. Previously, when a variable was at a FrameIndex for any part of its lifetime, this would shadow all other DBG_VALUEs and only a single fbreg location would be emitted, which in fact is only valid for a small range and not the entire lexical scope of the variable. The included dbg-value-const-byref testcase demonstrates this. This patch fixes this by Local - emitting dbg.value intrinsics for allocas that are passed by reference - dropping all dbg.declares (they are now fully lowered to dbg.values) SelectionDAG - renamed constructors for SDDbgValue for better readability. - fix UserValue::match() to handle indirect values correctly - not inserting an MMI table entries for dbg.values that describe allocas. - lowering dbg.values that describe allocas into indirect DBG_VALUEs. CodeGenPrepare - leaving dbg.values for an alloca were they are (see comment) Other - regenerated/updated instcombine-intrinsics testcase and included source rdar://problem/16679879 http://reviews.llvm.org/D3374 llvm-svn: 207130	2014-04-24 17:41:45 +00:00
Craig Topper	c7c3a99ec2	[C++] Use 'nullptr'. llvm-svn: 207083	2014-04-24 06:44:33 +00:00
David Blaikie	052cdbf046	Remove unused parameter llvm-svn: 207061	2014-04-24 01:25:10 +00:00
David Blaikie	7e6a431bb1	Remove the intermediate AccelTypes maps in DWARF units. llvm-svn: 207060	2014-04-24 01:23:49 +00:00
David Blaikie	cba93c7350	Remove the intermediate AccelNamespace maps in DWARF units. llvm-svn: 207059	2014-04-24 01:02:42 +00:00
David Blaikie	00d0daca29	Remove the intermediate AccelObjC maps in DWARF units llvm-svn: 207057	2014-04-24 00:53:32 +00:00
David Blaikie	16ac64caec	And actually use the DwarfDebug::AccelNames to emit the names. Fix for r207049 which would've emitted no accelerated names at all... llvm-svn: 207051	2014-04-23 23:46:25 +00:00
David Blaikie	3a329f9b6e	More formatting... llvm-svn: 207050	2014-04-23 23:38:39 +00:00
David Blaikie	0ea0080644	Remove intermediate accelerator table for names. (similar changes coming for the other accelerator tables) llvm-svn: 207049	2014-04-23 23:37:35 +00:00
David Blaikie	b5a0b53e34	DwarfAccelTable: Remove trivial dtor and simplify construction with an array. llvm-svn: 207044	2014-04-23 23:03:45 +00:00
David Blaikie	f96fd788df	Move the AddressPool from DwarfFile to DwarfDebug. There's only ever one address pool, not one per DWARF output file, so let's just have one. (similar refactoring of the string pool to come soon) llvm-svn: 207026	2014-04-23 21:20:10 +00:00
David Blaikie	ae2c262f6c	clang-format for my previous commit (I keep forgetting... ) llvm-svn: 207025	2014-04-23 21:20:07 +00:00
David Blaikie	ef5afb8970	Separate out the DWARF address pool into its own type/files. llvm-svn: 207022	2014-04-23 21:04:59 +00:00
David Blaikie	aa205a84ff	clang-format r207010 llvm-svn: 207016	2014-04-23 19:44:08 +00:00
David Blaikie	24e0c7bf4c	Split out DwarfFile from DwarfDebug into its own .h/.cpp files. Some of these types (DwarfDebug in particular) are quite large to begin with (and I keep forgetting whether DwarfFile is in DwarfDebug or DwarfUnit... ) so having a few smaller files seems like goodness. llvm-svn: 207010	2014-04-23 18:54:00 +00:00
Evgeniy Stepanov	c242bd4b23	Create MCTargetOptions. For now it contains a single flag, SanitizeAddress, which enables AddressSanitizer instrumentation of inline assembly. Patch by Yuri Gorshenin. llvm-svn: 206971	2014-04-23 11:16:03 +00:00
David Blaikie	4c499bd490	Requisite reformatting for previous commit. llvm-svn: 206927	2014-04-22 23:09:36 +00:00
David Blaikie	4c1ce16ebc	Push memory ownership of DwarfUnits into clients of DwarfFile. This prompted me to push references through most of DwarfDebug. Sorry for the churn. Honestly it's a bit silly that we're passing around units all over the place like that anyway and I think it's mostly due to the DIE attribute adding utility functions being utilities in DwarfUnit. I should have another go at moving them out of DwarfUnit... llvm-svn: 206925	2014-04-22 22:39:41 +00:00
David Blaikie	ef0c701473	Use std::unique_ptr to handle ownership of DwarfUnits in DwarfFile. So Chandler - how about those range algorithms? (would really love a dereferencing range adapter for this sort of stuff) llvm-svn: 206921	2014-04-22 21:27:37 +00:00
David Blaikie	9010d1345e	Simplify address pool index assignment. llvm-svn: 206905	2014-04-22 17:21:40 +00:00
Hao Liu	6daf5ecff4	Fix an infinite loop bug in DAG Combine about keeping transfering between ANY_EXTEND and SIGN_EXTEND. llvm-svn: 206873	2014-04-22 09:57:06 +00:00
David Blaikie	eb8b511a12	Revert "Use value semantics to manage DbgVariables rather than dynamic allocation/pointers." This reverts commit r206780. This commit was regressing gdb.opt/inline-locals.exp in the GDB 7.5 test suite. Reverting until I can fix the issue. llvm-svn: 206867	2014-04-22 05:41:06 +00:00
Chandler Carruth	2361db41db	[Modules] Remove potential ODR violations by sinking the DEBUG_TYPE define below all header includes in the lib/CodeGen/... tree. While the current modules implementation doesn't check for this kind of ODR violation yet, it is likely to grow support for it in the future. It also removes one layer of macro pollution across all the included headers. Other sub-trees will follow. llvm-svn: 206837	2014-04-22 02:02:50 +00:00
Quentin Colombet	df419d429a	[CodeGenPrepare] Use APInt to check the value of the immediate in a and while checking candidate for bit field extract. Otherwise the value may not fit in uint64_t and this will trigger an assertion. This fixes PR19503. llvm-svn: 206834	2014-04-22 01:20:34 +00:00
Chandler Carruth	15c7b91ac2	[Modules] Make Support/Debug.h modular. This requires it to not change behavior based on other files defining DEBUG_TYPE, which means it cannot define DEBUG_TYPE at all. This is actually better IMO as it forces folks to define relevant DEBUG_TYPEs for their files. However, it requires all files that currently use DEBUG(...) to define a DEBUG_TYPE if they don't already. I've updated all such files in LLVM and will do the same for other upstream projects. This still leaves one important change in how LLVM uses the DEBUG_TYPE macro going forward: we need to only define the macro after header files have been #include-ed. Previously, this wasn't possible because Debug.h required the macro to be pre-defined. This commit removes that. By defining DEBUG_TYPE after the includes two things are fixed: - Header files that need to provide a DEBUG_TYPE for some inline code can do so by defining the macro before their inline code and undef-ing it afterward so the macro does not escape. - We no longer have rampant ODR violations due to including headers with different DEBUG_TYPE definitions. This may be mostly an academic violation today, but with modules these types of violations are easy to check for and potentially very relevant. Where necessary to suppor headers with DEBUG_TYPE, I have moved the definitions below the includes in this commit. I plan to move the rest of the DEBUG_TYPE macros in LLVM in subsequent commits; this one is big enough. The comments in Debug.h, which were hilariously out of date already, have been updated to reflect the recommended practice going forward. llvm-svn: 206822	2014-04-21 22:55:11 +00:00
Yi Jiang	284d0fc3fc	Set default value of HasExtractBitsInsn to false llvm-svn: 206803	2014-04-21 22:22:44 +00:00
Hal Finkel	9e0afff216	Remove seemingly-unneeded artificial dependency The rationale for this artificial dependency seems to have been lost to the ravages of time, it is covered by no regression tests, and has no impact on test-suite performance numbers on either x86 or PPC. For the test suite, on both x86 and PPC, I ran the test suite 10 times (both as a baseline and with this change), and found no statistically-significant changes. For PPC, I used a P7 box. For x86, I used an Intel Xeon E5430. Both with -O3 -mcpu=native. This was discussed on-list back in January, but I've not had a chance to run the performance tests until today. llvm-svn: 206795	2014-04-21 21:30:25 +00:00
David Blaikie	11a5acbfd8	Use unique_ptr to handle ownership of UserValues in LiveDebugVariablesImpl llvm-svn: 206785	2014-04-21 20:37:07 +00:00
David Blaikie	3fde01a358	Use unique_ptr to manage objects owned by the ScheduleDAGMI. llvm-svn: 206784	2014-04-21 20:32:32 +00:00
David Blaikie	ac6cce35ed	Use value semantics to manage DbgVariables rather than dynamic allocation/pointers. Requires switching some vectors to lists to maintain pointer validity. These could be changed to forward_lists (singly linked) with a bit more work - I've left comments to that effect. llvm-svn: 206780	2014-04-21 20:13:09 +00:00
Chandler Carruth	ac94fb1460	[Modules] Sink the DEBUG_TYPE macro out of LegalizeTypes.h and into the various .cpp files. This macro is inherently non-modular, and it wasn't even needed in this header file. llvm-svn: 206775	2014-04-21 19:43:07 +00:00
Yi Jiang	b1c450606d	ARM64: Combine shifts and uses from different basic block to bit-extract instruction llvm-svn: 206774	2014-04-21 19:34:27 +00:00
Matt Arsenault	797b7a884c	Fix unnecessary line break llvm-svn: 206772	2014-04-21 18:39:13 +00:00
Duncan P. N. Exon Smith	78dd4cd9af	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206707, reapplying r206704. The preceding commit to CalcSpillWeights should have sorted out the failing buildbots. <rdar://problem/14292693> llvm-svn: 206766	2014-04-21 17:57:07 +00:00
Duncan P. N. Exon Smith	ffa49df7cf	CalcSpillWeights: Hack to prevent x87 nonsense This gross hack forces `hweight` into memory, preventing hidden precision from making `1 > 1` occasionally equal `true`. <rdar://problem/14292693> llvm-svn: 206765	2014-04-21 17:57:01 +00:00
Michael Zolotukhin	c7f992f9a3	Reapply r206732. This time without optimization of branches. llvm-svn: 206749	2014-04-21 12:01:33 +00:00
Chandler Carruth	164ee32140	Revert r206732 which is causing llc to crash on most of the build bots. Original commit message: Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i61, i32, or i64). llvm-svn: 206735	2014-04-21 07:11:15 +00:00
Michael Zolotukhin	f5ebd83e24	Implement builtins for safe division: safe.sdiv.iN, safe.udiv.iN, safe.srem.iN, safe.urem.iN (iN = i8, i16, i32, or i64). llvm-svn: 206732	2014-04-21 05:33:09 +00:00
Duncan P. N. Exon Smith	f65036e329	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206704, as expected. llvm-svn: 206707	2014-04-19 22:46:00 +00:00
Duncan P. N. Exon Smith	707997192f	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206677, reapplying my BlockFrequencyInfo rewrite. I've done a careful audit, added some asserts, and fixed a couple of bugs (unfortunately, they were in unlikely code paths). There's a small chance that this will appease the failing bots [1][2]. (If so, great!) If not, I have a follow-up commit ready that will temporarily add -debug-only=block-freq to the two failing tests, allowing me to compare the code path between what the failing bots and what my machines (and the rest of the bots) are doing. Once I've triggered those builds, I'll revert both commits so the bots go green again. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 <rdar://problem/14292693> llvm-svn: 206704	2014-04-19 22:34:26 +00:00
Yaron Keren	407a465a3d	Patch by Vadim Chugunov Win64 stack unwinder gets confused when execution flow "falls through" after a call to 'noreturn' function. This fixes the "missing epilogue" problem by emitting a trap instruction for IR 'unreachable' on x86_x64-pc-windows. A secondary use for it would be for anyone wanting to make double-sure that 'noreturn' functions, indeed, do not return. llvm-svn: 206684	2014-04-19 13:47:43 +00:00
Duncan P. N. Exon Smith	0ee9548e22	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206666, as planned. Still stumped on why the bots are failing. Sanitizer bots haven't turned anything up. If anyone can help me debug either of the failures (referenced in r206666) I'll owe them a beer. (In the meantime, I'll be auditing my patch for undefined behaviour.) llvm-svn: 206677	2014-04-19 00:42:46 +00:00
Duncan P. N. Exon Smith	66e247e69c	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206628, reapplying r206622 (and r206626). Two tests are failing only on buildbots [1][2]: i.e., I can't reproduce on Darwin, and Chandler can't reproduce on Linux. Asan and valgrind don't tell us anything, but we're hoping the msan bot will catch it. So, I'm applying this again to get more feedback from the bots. I'll leave it in long enough to trigger builds in at least the sanitizer buildbots (it was failing for reasons unrelated to my commit last time it was in), and hopefully a few others.... and then I expect to revert a third time. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 [2]: http://llvm-amd64.freebsd.your.org/b/builders/clang-i386-freebsd/builds/18445 llvm-svn: 206666	2014-04-18 22:30:03 +00:00
Duncan P. N. Exon Smith	80fdbd652d	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" (#2 ) This reverts commit r206622 and the MSVC fixup in r206626. Apparently the remotely failing tests are still failing, despite my attempt to fix the nondeterminism in r206621. llvm-svn: 206628	2014-04-18 17:56:08 +00:00
Andrew Trick	13e85b6249	Better comments to explain buffered/unbuffered processor resources. llvm-svn: 206625	2014-04-18 17:35:08 +00:00
Duncan P. N. Exon Smith	cf746f5ff0	Reapply "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commit r206556, effectively reapplying commit r206548 and its fixups in r206549 and r206550. In an intervening commit I've added target triples to the tests that were failing remotely [1] (but passing locally). I'm hoping the mystery is solved? I'll revert this again if the tests are still failing remotely. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206622	2014-04-18 17:22:25 +00:00
Duncan P. N. Exon Smith	79011f6e40	Revert "blockfreq: Rewrite BlockFrequencyInfoImpl" This reverts commits r206548, r206549 and r206549. There are some unit tests failing that aren't failing locally [1], so reverting until I have time to investigate. [1]: http://bb.pgr.jp/builders/ninja-x64-msvc-RA-centos6/builds/1816 llvm-svn: 206556	2014-04-18 02:17:43 +00:00
Duncan P. N. Exon Smith	78f8766db3	blockfreq: Rewrite BlockFrequencyInfoImpl Rewrite the shared implementation of BlockFrequencyInfo and MachineBlockFrequencyInfo entirely. The old implementation had a fundamental flaw: precision losses from nested loops (or very wide branches) compounded past loop exits (and convergence points). The @nested_loops testcase at the end of test/Analysis/BlockFrequencyAnalysis/basic.ll is motivating. This function has three nested loops, with branch weights in the loop headers of 1:4000 (exit:continue). The old analysis gives non-sensical results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': ---- Block Freqs ---- entry = 1.0 for.cond1.preheader = 1.00103 for.cond4.preheader = 5.5222 for.body6 = 18095.19995 for.inc8 = 4.52264 for.inc11 = 0.00109 for.end13 = 0.0 The new analysis gives correct results: Printing analysis 'Block Frequency Analysis' for function 'nested_loops': block-frequency-info: nested_loops - entry: float = 1.0, int = 8 - for.cond1.preheader: float = 4001.0, int = 32007 - for.cond4.preheader: float = 16008001.0, int = 128064007 - for.body6: float = 64048012001.0, int = 512384096007 - for.inc8: float = 16008001.0, int = 128064007 - for.inc11: float = 4001.0, int = 32007 - for.end13: float = 1.0, int = 8 Most importantly, the frequency leaving each loop matches the frequency entering it. The new algorithm leverages BlockMass and PositiveFloat to maintain precision, separates "probability mass distribution" from "loop scaling", and uses dithering to eliminate probability mass loss. I have unit tests for these types out of tree, but it was decided in the review to make the classes private to BlockFrequencyInfoImpl, and try to shrink them (or remove them entirely) in follow-up commits. The new algorithm should generally have a complexity advantage over the old. The previous algorithm was quadratic in the worst case. The new algorithm is still worst-case quadratic in the presence of irreducible control flow, but it's linear without it. The key difference between the old algorithm and the new is that control flow within a loop is evaluated separately from control flow outside, limiting propagation of precision problems and allowing loop scale to be calculated independently of mass distribution. Loops are visited bottom-up, their loop scales are calculated, and they are replaced by pseudo-nodes. Mass is then distributed through the function, which is now a DAG. Finally, loops are revisited top-down to multiply through the loop scales and the masses distributed to pseudo nodes. There are some remaining flaws. - Irreducible control flow isn't modelled correctly. LoopInfo and MachineLoopInfo ignore irreducible edges, so this algorithm will fail to scale accordingly. There's a note in the class documentation about how to get closer. See also the comments in test/Analysis/BlockFrequencyInfo/irreducible.ll. - Loop scale is limited to 4096 per loop (2^12) to avoid exhausting the 64-bit integer precision used downstream. - The "bias" calculation proposed on llvmdev is not incorporated here. This will be added in a follow-up commit, once comments from this review have been handled. llvm-svn: 206548	2014-04-18 01:57:45 +00:00
Diego Novillo	45811c5ea3	Fix bug 19437 - Only add discriminators for DWARF 4 and above. Summary: This prevents the discriminator generation pass from triggering if the DWARF version being used in the module is prior to 4. Reviewers: echristo, dblaikie CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3413 llvm-svn: 206507	2014-04-17 22:33:50 +00:00
Josh Magee	5d6a41432d	[stack protector] Make the StackProtector pass respect ssp-buffer-size. Previously, SSPBufferSize was assigned the value of the "stack-protector-buffer-size" attribute after all uses of SSPBufferSize. The effect was that the default SSPBufferSize was always used during analysis. I moved the check for the attribute before the analysis; now --param ssp-buffer-size= works correctly again. Differential Revision: http://reviews.llvm.org/D3349 llvm-svn: 206486	2014-04-17 19:08:36 +00:00
Tim Northover	fa11ed01b6	Atomics: promote ARM's IR-based atomics pass to CodeGen. Still only 32-bit ARM using it at this stage, but the promotion allows direct testing via opt and is a reasonably self-contained patch on the way to switching ARM64. At this point, other targets should be able to make use of it without too much difficulty if they want. (See ARM64 commit coming soon for an example). llvm-svn: 206485	2014-04-17 18:22:47 +00:00
Jim Grosbach	63557754ee	[c++11] Tidy up AsmPrinter.cpp. Range'ify loops and tidy up some by-reference handling. No functional change. llvm-svn: 206422	2014-04-16 22:38:02 +00:00
Tim Northover	dcc9d1cb89	DAGCombiner: don't optimise non-existant litpool load This particular DAG combine is designed to kick in when both ConstantFPs will end up being loaded via a litpool, however those nodes have a semi-legal status, dictated by isFPImmLegal so in some cases there wouldn't have been a litpool in the first place. Don't try to be clever in those circumstances. Picked up while merging some AArch64 tests. llvm-svn: 206365	2014-04-16 09:03:09 +00:00
Craig Topper	69e0e91431	Convert SelectionDAG::getVTList to use ArrayRef llvm-svn: 206357	2014-04-16 06:10:51 +00:00
Craig Topper	f803e4fd66	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206356	2014-04-16 04:21:27 +00:00
Akira Hatanaka	d5cce12997	Make FastISel::SelectInstruction return before target specific fast-isel code handles Intrinsic::trap if TargetOptions::TrapFuncName is set. This fixes a bug in which the trap function was not taken into consideration when a program was compiled without optimization (at -O0). <rdar://problem/16291933> llvm-svn: 206323	2014-04-15 21:30:06 +00:00
Robert Lougher	eaacc6b6ba	Revert r191049/r191059 as it can produce wrong code (see PR17975). It has already been reverted on the 3.4 branch in r196521. llvm-svn: 206311	2014-04-15 18:34:24 +00:00
Duncan P. N. Exon Smith	58154f2238	verify-di: Implement DebugInfoVerifier Implement DebugInfoVerifier, which steals verification relying on DebugInfoFinder from Verifier. - Adds LegacyDebugInfoVerifierPassPass, a ModulePass which wraps DebugInfoVerifier. Uses -verify-di command-line flag. - Change verifyModule() to invoke DebugInfoVerifier as well as Verifier. - Add a call to createDebugInfoVerifierPass() wherever there was a call to createVerifierPass(). This implementation as a module pass should sidestep efficiency issues, allowing us to turn debug info verification back on. <rdar://problem/15500563> llvm-svn: 206300	2014-04-15 16:27:38 +00:00
Tim Northover	537e0eb4e2	FastISel: constrain the RegClass of operands when emitting instructions. ARM64 suffered multiple -verify-machineinstr failures (principally over the xsp/xzr issue) because FastISel was completely ignoring which subset of the general-purpose registers each instruction required. More fixes are coming in ARM64 specific FastISel, but this should cover the generic problems. llvm-svn: 206283	2014-04-15 13:59:49 +00:00
Nick Lewycky	82ad9fc7c8	Break PseudoSourceValue out of the Value hierarchy. It is now the root of its own tree containing FixedStackPseudoSourceValue (which you can use isa/dyn_cast on) and MipsCallEntry (which you can't). Anything that needs to use either a PseudoSourceValue* and Value* is strongly encouraged to use a MachinePointerInfo instead. llvm-svn: 206255	2014-04-15 07:22:52 +00:00
David Blaikie	ad3e8f4101	Use unique_ptr to manage TypePromotionActions owned by TypePromotionTransaction. llvm-svn: 206250	2014-04-15 06:17:44 +00:00
David Blaikie	621e20bc78	Use unique_ptr to manage ownership of GCFunctionInfos in GCStrategy llvm-svn: 206249	2014-04-15 06:07:26 +00:00
David Blaikie	ff6e0d4bb1	Use unique_ptr for the result of Registry entries. llvm-svn: 206248	2014-04-15 05:53:26 +00:00
David Blaikie	3d383785b6	Use unique_ptr to manage ownership of GCStrategy objects in GCMetadata llvm-svn: 206246	2014-04-15 05:34:49 +00:00
David Blaikie	7495172545	Use std::unique_ptr for DIE children Got bored, removed some manual memory management. Pushed references (rather than pointers) through a few APIs rather than replacing *x with x.get(). llvm-svn: 206222	2014-04-14 22:45:02 +00:00
Adrian Prantl	ee1b12d3e5	Re-apply r206096 after investigating the gdb buildbot failure. Thanks to dblaikie for updating the testcase! Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions, therefore, their declaration cannot have one DW_AT_linkage_name. The specific instances however can and should have that attribute. This patch reorders the code in DwarfUnit::getOrCreateSubprogramDIE() to emit linkage names for C/Dtors. rdar://problem/16362674. llvm-svn: 206210	2014-04-14 21:16:04 +00:00
Hal Finkel	93c495a063	Don't assert in BasicTTI::getMemoryOpCost for non-simple types BasicTTI::getMemoryOpCost must explicitly check for non-simple types; setting AllowUnknown=true with TLI->getSimpleValueType is not sufficient because, for example, non-power-of-two vector types return non-simple EVTs (not MVT::Other). llvm-svn: 206150	2014-04-14 05:59:09 +00:00
Craig Topper	30281a67fb	[C++11] More 'nullptr' conversion. In some cases just using a boolean check instead of comparing to nullptr. llvm-svn: 206142	2014-04-14 00:51:57 +00:00
Benjamin Kramer	f6c0615b06	Retire llvm::array_endof in favor of non-member std::end. While there make array_lengthof constexpr if we have support for it. llvm-svn: 206112	2014-04-12 16:15:53 +00:00
David Blaikie	7c76f25336	PR13337: Omit DW_TAG_restrict_type when compiling for DWARF2 DWARF3 introduced DW_TAG_restrict_type, so avoid using it in prior versions. llvm-svn: 206105	2014-04-12 05:35:59 +00:00
Adrian Prantl	2d31ce322b	Revert "Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions," This reverts commit 206096 while I investigate why this broke the gdb buildbot. llvm-svn: 206103	2014-04-12 04:25:02 +00:00
David Blaikie	1031309286	Use dwarf::Tag rather than unsigned for DIE::Tag to make debugging easier. Nice to be able to just print out the Tag and have the debugger print dwarf::DW_TAG_subprogram or whatever, rather than an int. It's a bit finicky (for example DIDescriptor::getTag still returns unsigned) because some places still handle real dwarf tags + our fake tags (one day we'll remove the fake tags, hopefully). llvm-svn: 206098	2014-04-12 02:24:04 +00:00
Adrian Prantl	86edc5e96b	Debug info: (bugfix) C++ C/Dtors can be compiled to multiple functions, therefore, their declaration cannot have one DW_AT_linkage_name. The specific instances however can and should have that attribute. This patch reorders the code in DwarfUnit::getOrCreateSubprogramDIE() to emit linkage names for C/Dtors. rdar://problem/16362674. llvm-svn: 206096	2014-04-12 01:44:42 +00:00
Hal Finkel	70fab0cd6d	Reenable use of TBAA during CodeGen We had disabled use of TBAA during CodeGen (even when otherwise using AA) because the ptrtoint/inttoptr used by CGP for address sinking caused BasicAA to miss basic type punning that it should catch (and, thus, we'd fail to override TBAA when we should). However, when AA is in use during CodeGen, CGP now uses normal GEPs and bitcasts, instead of ptrtoint/inttoptr, when doing address sinking. As a result, BasicAA should be able to make us do the right thing in the face of type-punning, and it seems safe to enable use of TBAA again. self-hosting seems fine on PPC64/Linux on the P7, with TBAA enabled and -misched=shuffle. Note: We still don't update TBAA when merging stack slots, although because BasicAA should now catch all such cases, this is no longer a blocking issue. Nevertheless, I plan to commit code to deal with this properly in the near future. llvm-svn: 206093	2014-04-12 01:26:00 +00:00
Hal Finkel	f4336e3866	Add the ability to use GEPs for address sinking in CGP The current memory-instruction optimization logic in CGP, which sinks parts of the address computation that can be adsorbed by the addressing mode, does this by explicitly converting the relevant part of the address computation into IR-level integer operations (making use of ptrtoint and inttoptr). For most targets this is currently not a problem, but for targets wishing to make use of IR-level aliasing analysis during CodeGen, the use of ptrtoint/inttoptr is a problem for two reasons: 1. BasicAA becomes less powerful in the face of the ptrtoint/inttoptr 2. In cases where type-punning was used, and BasicAA was used to override TBAA, BasicAA may no longer do so. (this had forced us to disable all use of TBAA in CodeGen; something which we can now enable again) This (use of GEPs instead of ptrtoint/inttoptr) is not currently enabled by default (except for those targets that use AA during CodeGen), and so aside from some PowerPC subtargets and SystemZ, there should be no change in behavior. We may be able to switch completely away from the ptrtoint/inttoptr sinking on all targets, but further testing is required. I've doubled-up on a number of existing tests that are sensitive to the address sinking behavior (including some store-merging tests that are sensitive to the order of the resulting ADD operations at the SDAG level). llvm-svn: 206092	2014-04-12 00:59:48 +00:00
Duncan P. N. Exon Smith	532f710ed4	blockfreq: Rename BlockFrequencyImpl to BlockFrequencyInfoImpl This is a shared implementation class for BlockFrequencyInfo and MachineBlockFrequencyInfo, not for BlockFrequency, a related (but distinct) class. No functionality change. <rdar://problem/14292693> llvm-svn: 206083	2014-04-11 23:20:58 +00:00
Quentin Colombet	e1542121a5	[RegAllocGreedy][Last Chance Recoloring] Change the name of the exhaustive search option. fexhaustive-register-search => exhaustive-register-search 'f' is a Clang thing! This is related to PR18747. llvm-svn: 206075	2014-04-11 21:51:09 +00:00
Quentin Colombet	4f1ce6e4a2	[RegAllocGreedy][Last Chance Recoloring] Addition of -fexhaustive-register-search option to allow an exhaustive search during last chance recoloring. This is related to PR18747 Patch by MAYUR PANDEY <mayur.p@samsung.com>. llvm-svn: 206072	2014-04-11 21:39:44 +00:00
Quentin Colombet	149298454e	[Register Coalescer] Fix wrong live-range information with rematerialization. When rematerializing an instruction that defines a super register that would be used by a physical subregisters we use the related physical super register for the definition. To keep the live-range information accurate, all the defined subregisters must be marked as dead def, otherwise the register allocation may miss some interferences. Working on a reduced test-case! <rdar://problem/16582185> llvm-svn: 206060	2014-04-11 19:45:07 +00:00
Adrian Prantl	208fc516be	Debug info: Store the DIVariable in DebugLocEntry also for constants, so DwarfDebug::emitDebugLocEntry can emit them with the correct signedness. rdar://problem/15928306 llvm-svn: 206042	2014-04-11 17:49:47 +00:00
Matt Arsenault	65fde80ac6	Move ExtractVectorElements to SelectionDAG. This seems generally useful, and makes sense to go along with SplitVector. llvm-svn: 206041	2014-04-11 17:47:30 +00:00
Tom Stellard	9ea60803c4	SelectionDAG: Use helper function to improve legalization of ISD::MUL The TargetLowering::expandMUL() helper contains lowering code extracted from the DAGTypeLegalizer and allows the SelectionDAGLegalizer to expand more ISD::MUL patterns without having to use a library call. llvm-svn: 206037	2014-04-11 16:12:01 +00:00
Tom Stellard	be787d15d0	SelectionDAG: Factor ISD::MUL lowering code out of DAGTypeLegalizer This code has been moved to a new function in the TargetLowering class called expandMUL(). The purpose of this is to be able to share lowering code between the SelectionDAGLegalize and DAGTypeLegalizer classes. No functionality changed intended. llvm-svn: 206036	2014-04-11 16:11:58 +00:00
David Blaikie	1573e6e09f	Implement depth_first and inverse_depth_first range factory functions. Also updated as many loops as I could find using df_begin/idf_begin - strangely I found no uses of idf_begin. Is that just used out of tree? Also a few places couldn't use df_begin because either they used the member functions of the depth first iterators or had specific ordering constraints (I added a comment in the latter case). Based on a patch by Jim Grosbach. (Jim - you just had iterator_range<T> where you needed iterator_range<idf_iterator<T>>) llvm-svn: 206016	2014-04-11 01:50:01 +00:00
Jim Grosbach	0d0ea8cdb5	[c++11] Range'ify use list loops in InstrEmitter. llvm-svn: 206015	2014-04-11 01:13:16 +00:00
Jim Grosbach	6f9873ee9f	[c++11] Range'ify use list loops in DAGCombiner. llvm-svn: 206014	2014-04-11 01:13:13 +00:00
Reid Kleckner	f99741400f	Move the segmented stack switch to a function attribute This removes the -segmented-stacks command line flag in favor of a per-function "split-stack" attribute. Patch by Luqman Aden and Alex Crichton! llvm-svn: 205997	2014-04-10 22:58:43 +00:00
Adrian Prantl	52b43b7eb6	Debug info: Factor the retrieving of the DIVariable from a MachineInstr into a function. llvm-svn: 205973	2014-04-10 17:39:48 +00:00
Jim Grosbach	8314a2474c	Fix to support properly cleaning up failed address sinking against constants As it turns out the source of the sunkaddr can be a constant, in which case there is not an instruction to delete, causing the cleanup code introduced in r204833 to crash. This patch adds a dynamic check to ensure the deleted value is in fact an instruction and not a constant. Patch by Louis Gerbarg <lgg@apple.com> llvm-svn: 205941	2014-04-10 00:27:45 +00:00
Jim Grosbach	118d4cc5e8	SelectionDAG: Don't constant fold target-specific nodes. FoldConstantArithmetic() only knows how to deal with a few target independent ISD opcodes. Bail early if it sees a target-specific ISD node. These node do funny things with operand types which may break the assumptions of the code that follows, and there's no actual folding that can be done anyway. For example, non-constant 256 bit vector shifts on X86 have a shift-amount operand that's a 128-bit v4i32 vector regardless of what the first operand type is and that breaks the assumption that the operand types must match. rdar://16530923 llvm-svn: 205937	2014-04-09 23:28:11 +00:00
Quentin Colombet	95c5120626	[DAGCombiner] DAG combine does not know how to combine indexed loads with sign/zero/any extensions. However a few places were not checking properly the property of the load and were turning an indexed load into a regular extended load. Therefore the indexed value was lost during the process and this was triggering an assertion. <rdar://problem/16389332> llvm-svn: 205923	2014-04-09 20:03:05 +00:00
David Majnemer	075f09bac5	WinCOFF: Emit common symbols as specified in the COFF spec Summary: Local common symbols were properly inserted into the .bss section. However, putting external common symbols in the .bss section would give them a strong definition. Instead, encode them as undefined, external symbols who's symbol value is equivalent to their size. Reviewers: Bigcheese, rafael, rnk CC: llvm-commits Differential Revision: http://reviews.llvm.org/D3324 llvm-svn: 205811	2014-04-08 22:33:40 +00:00
Matt Arsenault	b8e729fdf2	Bug 19348: Check for legal ExtLoad operation before folding (aext (zextload x)) -> (aext (truncate (*extload x))) Patch by Stanislav Mekhanoshin! llvm-svn: 205805	2014-04-08 21:40:37 +00:00
Duncan P. N. Exon Smith	9e6fa5129b	RegAlloc: Account for a variable entry block frequency Until r197284, the entry frequency was constant -- i.e., set to 2^14. Although current ToT still has a constant entry frequency, since r197284 that has been an implementation detail (which is soon going to change). - r204690 made the wrong assumption for the CSRCost metric. Adjust callee-saved register cost based on entry frequency. - r185393 made the wrong assumption (although it was valid at the time). Update SpillPlacement.cpp::Threshold to be relative to the entry frequency. Since ToT still has 2^14 entry frequency, this should have no observable functionality change. <rdar://problem/14292693> llvm-svn: 205789	2014-04-08 19:18:56 +00:00
Andrew Trick	e122529ee6	Put a limit on ScheduleDAGSDNodes::ClusterNeighboringLoads to avoid blowing up compile time. Fixes PR16365 - Extremely slow compilation in -O1 and -O2. The SD scheduler has a quadratic implementation of load clustering which absolutely blows up compile time for large blocks with constant pool loads. The MI scheduler has a better implementation of load clustering. However, we have not done the work yet to completely eliminate the SD scheduler. Some benchmarks still seem to benefit from early load clustering, although maybe by chance. As an intermediate term fix, I just put a nice limit on the number of DAG users to search before finding a match. With this limit there are no binary differences in the LLVM test suite, and the PR16365 test case does not suffer any compile time impact from this routine. llvm-svn: 205738	2014-04-07 21:29:22 +00:00
Andrew Trick	06df84876b	Minor change to StackMapLiveness DEBUG output. llvm-svn: 205656	2014-04-04 23:49:35 +00:00
Matt Arsenault	7b6a70a9cf	Add DAG parameter to ComputeNumSignBitsForTargetNode This way, you can check the number of sign bits in the operands. The depth parameter it already has is pretty useless without this. llvm-svn: 205649	2014-04-04 20:13:13 +00:00
Tim Northover	9ea26aa436	DAGLegalize: add last-ditch type-legalization for VSELECT. When LLVM sees something like (v1iN (vselect v1i1, v1iN, v1iN)) it can decide that the result is OK (v1i64 is legal on AArch64, for example) but it still need scalarising because of that v1i1. There was no code to do this though. AArch64 and ARM64 have DAG combines to produce efficient code and prevent that occuring in most such situations, but there are edge cases that they miss. This adds a legalization to cope with that. llvm-svn: 205626	2014-04-04 14:49:30 +00:00
Tim Northover	421793ce9a	ARM64: handle v1i1 types arising from setcc properly. There were several overlapping problems here, and this solution is closely inspired by the one adopted in AArch64 in r201381. Firstly, scalarisation of v1i1 setcc operations simply fails if the input types are legal. This is fixed in LegalizeVectorTypes.cpp this time, and allows AArch64 code to be simplified slightly. Second, vselect with such a setcc feeding into it ends up in ScalarizeVectorOperand, where it's not handled. I experimented with an implementation, but found that whatever DAG came out was rather horrific. I think Hao's DAG combine approach is a good one for quality, though there are edge cases it won't catch (to be fixed separately). Should fix PR19335. llvm-svn: 205625	2014-04-04 14:49:21 +00:00
Craig Topper	694437e2ef	Make consistent use of MCPhysReg instead of uint16_t throughout the tree. llvm-svn: 205610	2014-04-04 05:16:06 +00:00
Quentin Colombet	5e74b12918	[RegAllocGreedy][Last Chance Recoloring] Emit diagnostics when last chance recoloring cut-offs are encountered and register allocation failed. This is related to PR18747 Patch by MAYUR PANDEY <mayur.p@samsung.com>. llvm-svn: 205601	2014-04-04 02:05:21 +00:00
Quentin Colombet	419aeb287d	Revert r205599, the commit was not intended to have so many changes llvm-svn: 205600	2014-04-04 02:02:49 +00:00
Quentin Colombet	b4d3858ea5	[RegAllocGreedy][Last Chance Recoloring] Emit diagnostics when last chance recoloring cut-offs are hit. This is related to PR18747. Patch by MAYUR PANDEY <mayur.p@samsung.com> llvm-svn: 205599	2014-04-04 01:58:57 +00:00
Eric Christopher	7bc582d828	Fix for PR 19261: llc doesn't generate nodes for unconditional fall-through branches for targets without FastISel implementation (X86 has it, but can be disabled by "-fast-isel=false") in SelectionDAGBuilder::visitBr(). So for line 4 in the following testcase 1: void foo(int i){ 2: switch(i){ 3: default: 4: break; 5: } 6: return; 7: } there is no corresponding line in .debug_line section, and a debugger cannot set a breakpoint at line 4. Fix this by always emitting a branch when we're not optimizing and add a testcase to ensure that there's code on every line we'd want to break. Patch by Daniil Fukalov. llvm-svn: 205529	2014-04-03 12:11:51 +00:00
David Blaikie	a8caf93b53	DebugInfo: Use a 64 bit type for the subrange While we were encoding 64 bit values (data8) in the subrange itself, using a 32 bit type for the subrange was still confusing the gdb. Oh, and make it unsigned too. As the comment points out, this could be pushed into the frontend so that it would be 32 or 64 bit as appropriate, etc. llvm-svn: 205512	2014-04-03 06:28:20 +00:00
Lang Hames	6348bde938	[CodeGen] Fix peephole optimizer bug introduced in r205481. Fixes PR19318. I should have read that comment a little more carefully. ;) Regression test in the works, committing in the mean time to un-break people. llvm-svn: 205511	2014-04-03 05:03:20 +00:00
Hal Finkel	9364c269c6	Account for scalarization costs in BasicTTI::getMemoryOpCost for extending vector loads When a vector type legalizes to a larger vector type, and the target does not support the associated extending load (or truncating store), then legalization will scalarize the load (or store) resulting in an associated scalarization cost. BasicTTI::getMemoryOpCost needs to account for this. Between this, and r205487, PowerPC on the P7 with VSX enabled shows: MultiSource/Benchmarks/PAQ8p/paq8p: 43% speedup SingleSource/Benchmarks/BenchmarkGame/puzzle: 51% speedup SingleSource/UnitTests/Vectorizer/gcc-loops 28% speedup (some of these are new; some of these, such as PAQ8p, just reverse regressions that VSX support would trigger) llvm-svn: 205495	2014-04-03 00:53:59 +00:00
Hal Finkel	c9d6860443	Fix multi-register costs in BasicTTI::getCastInstrCost For an cast (extension, etc.), the currently logic predicts a low cost if the associated operation (keyed on the destination type) is legal (or promoted). This is not true when the number of values required to legalize the type is changing. For example, <8 x i16> being sign extended by <8 x i32> is not generically cheap on PPC with VSX, even though sign extension to v4i32 is legal, because two output v4i32 values are required compared to the single v8i16 input value, and without custom logic in the target, this conversion will scalarize. llvm-svn: 205487	2014-04-02 23:18:54 +00:00
Lang Hames	7c2eba77fd	[CodeGen] Teach the peephole optimizer to remember (and exploit) all folding opportunities in the current basic block, rather than just the last one seen. <rdar://problem/16478629> llvm-svn: 205481	2014-04-02 22:59:58 +00:00
Juergen Ributzka	5b52581e6c	Add comments and test case for [DAG] Keep the opaque constant flag when performing unary constant folding operations (r204737). llvm-svn: 205474	2014-04-02 22:21:01 +00:00
Jim Grosbach	f349849199	Simplify resolveFrameIndex() signature. Just pass a MachineInstr reference rather than an MBB iterator. Creating a MachineInstr& is the first thing every implementation did anyway. llvm-svn: 205453	2014-04-02 19:28:18 +00:00
Oliver Stannard	e941b27161	ARM: Add support for segmented stacks Patch by Alex Crichton, ILyoan, Luqman Aden and Svetoslav. llvm-svn: 205430	2014-04-02 16:10:33 +00:00
Adrian Prantl	34fbf7eb58	clarify comment llvm-svn: 205429	2014-04-02 15:49:45 +00:00
David Blaikie	f8c0762846	Adjust comments regarding non-relocated abbrev offset in debug_info.dwo I'm not sure the comment in the implementation really adds a lot of value (it's clear that we emit zero when no symbol is provided, but it doesn't explain why we would do that). Happy to iterate. llvm-svn: 205386	2014-04-02 02:04:51 +00:00
David Blaikie	e62bc9ddaa	Split debug_loc and debug_loc.dwo emission into two separate functions Based on code review feedback from Eric Christopher on r204697 llvm-svn: 205385	2014-04-02 01:50:20 +00:00
David Blaikie	f410cc8df5	DebugInfo: Introduce DebugLocList to encapsulate a list of DebugLocEntries and an MC Label to refer to them This removes the magic-number-esque code creating/retrieving the same label for a debug_loc entry from two places and removes the last small piece of reusable logic from emitDebugLoc so that there will be less duplication when refactoring it into two functions (one for debug_loc, the other for debug_loc.dwo). llvm-svn: 205382	2014-04-02 01:43:18 +00:00
Adrian Prantl	cf2248c0ba	Add a doxygen comment to DebugLocEntry::Merge. llvm-svn: 205374	2014-04-01 23:34:45 +00:00
David Blaikie	034f61d1e5	DebugLocEntry: Actually merge the loc entry when returning true. Seems we didn't have any test coverage for merging... awesome. So I added some - but hit an llvm-objdump bug while I was there. I'm choosing not to shave that yak right now. Code review feedback/bug catch by Adrian Prantl in r205360. llvm-svn: 205373	2014-04-01 23:19:23 +00:00
David Blaikie	10e2013985	Fix accidental fallthrough in DebugLocEntry::hasSameValueOrLocation No test case (this would invoke UB by examining uninitialized members, etc, at best - and this code is apparently untested anyway - I'm about to fix that) Code review feedback from Adrian Prantl on r205360. llvm-svn: 205367	2014-04-01 22:25:09 +00:00
David Blaikie	0bcd815436	Remove unused function DebugLocEntry::isEmpty llvm-svn: 205365	2014-04-01 22:06:18 +00:00
David Blaikie	d3e34a8e49	Refactor out the comparison of the location/value in a DebugLocEntry llvm-svn: 205364	2014-04-01 22:04:07 +00:00
David Blaikie	8ffd5df5fe	DebugInfo: Split DebugLocEntry into its own file. It seems big enough that it deserves its own file - but it is header only, so there's no need for another cpp file, etc. llvm-svn: 205360	2014-04-01 21:49:04 +00:00
Adrian Prantl	65a4c6a66e	DwarfDebug: Prevent DebugLocEntry merging from coalescing two different constants into only the first one. rdar://14874886. llvm-svn: 205357	2014-04-01 21:04:18 +00:00
Matt Arsenault	0062eb7871	Make isSetCCEquivalent respect the TargetBooleanContents llvm-svn: 205336	2014-04-01 18:13:26 +00:00
Matt Arsenault	8f25a008a2	Add helpers for checking if a value is a target boolean constant. llvm-svn: 205335	2014-04-01 18:13:22 +00:00
David Blaikie	7ed071eec0	DebugInfo: Factor out common functionality for rendering debug_loc and debug_loc.dwo location list entries In preparation for refactoring this function into two, one for debug_loc, one for debug_loc.dwo. llvm-svn: 205324	2014-04-01 16:17:41 +00:00
David Blaikie	6b5255c1db	Cleanup remaining use of removed variable to fix the build llvm-svn: 205323	2014-04-01 16:13:29 +00:00
David Blaikie	260a196cb8	Simplify debug_loc.dwo handling slightly. llvm-svn: 205322	2014-04-01 16:09:49 +00:00
David Blaikie	99bdb2e6c3	DebugInfo: Avoid creating unnecessary/empty line tables and remove the special case of '0' in DwarfCompileUnit::initStmtList by just always using a label difference This moves one case of raw text checking down into the MCStreamer interfaces in the form of a virtual function, even if we ultimately end up consolidating on the one-or-many line tables issue one day, this is nicer in the interim. This just generally streamlines a bunch of use cases into a common code path. llvm-svn: 205287	2014-04-01 08:07:52 +00:00
Adrian Prantl	860533bd61	LTO type uniquing: store the Decl field of a DIImportedEntity as a DIRef. No other functionality changes, DIBuilder testcase is included in a paired CFE commit. This relaxes the assertion in isScopeRef to also accept subclasses of DIScope. llvm-svn: 205279	2014-04-01 03:41:04 +00:00
Juergen Ributzka	9c6cfb73c8	[Stackmaps] Update the stackmap format to use 64-bit relocations for the function address and properly align all entries. This commit updates the stackmap format to version 1 to indicate the reorganizaion of several fields. This was done in order to align stackmap entries to their natural alignment and to minimize padding. Fixes <rdar://problem/16005902> llvm-svn: 205254	2014-03-31 22:14:04 +00:00
Matt Arsenault	5c7af600db	Change shouldSplitVectorElementType to better match the description. Pass the entire vector type, and not just the element. llvm-svn: 205247	2014-03-31 20:54:58 +00:00
Hal Finkel	25be539bb8	Add an optional ability to expand larger BUILD_VECTORs with shuffles This adds the ability to expand large (meaning with more than two unique defined values) BUILD_VECTOR nodes in terms of SCALAR_TO_VECTOR and (legal) vector shuffles. There is now no limit of the size we are capable of expanding this way, although we don't currently do this for vectors with many unique values because of the default implementation of TLI's shouldExpandBuildVectorWithShuffles function. There is currently no functional change to any existing targets because the new capabilities are not used unless some target overrides the TLI shouldExpandBuildVectorWithShuffles function. As a result, I've not included a test case for the new functionality in this commit, but regression tests will (at least) be added soon when I commit support for the PPC QPX vector instruction set. The benefit of committing this now is that it makes the shouldExpandBuildVectorWithShuffles callback, which had to be added for other reasons regardless, fully functional. I suspect that other targets will also benefit from tuning the heuristic. llvm-svn: 205243	2014-03-31 19:42:55 +00:00
Hal Finkel	5ecd959a9e	Add a TLI hook to control when BUILD_VECTOR might be expanded using shuffles There are two general methods for expanding a BUILD_VECTOR node: 1. Use SCALAR_TO_VECTOR on the defined scalar values and then shuffle them together. 2. Build the vector on the stack and then load it. Currently, we use a fixed heuristic: If there are only one or two unique defined values, then we attempt an expansion in terms of SCALAR_TO_VECTOR and vector shuffles (provided that the required shuffle mask is legal). Otherwise, always expand via the stack. Even when SCALAR_TO_VECTOR is not legal, this can still be a good idea depending on what tricks the target can play when lowering the resulting shuffle. If the target can't do anything special, however, and if SCALAR_TO_VECTOR is expanded via the stack, this heuristic leads to sub-optimal code (two stack loads instead of one). Because only the target knows whether the SCALAR_TO_VECTORs and shuffles for a build vector of a particular type are likely to be optimial, this adds a new TLI function: shouldExpandBuildVectorWithShuffles which takes the vector type and the count of unique defined values. If this function returns true, then method (1) will be used, subject to the constraint that all of the necessary shuffles are legal (as determined by isShuffleMaskLegal). If this function returns false, then method (2) is always used. This commit does not enhance the current code to support expanding a build_vector with more than two unique values using shuffles, but I'll commit an implementation of the more-general case shortly. llvm-svn: 205230	2014-03-31 17:48:10 +00:00

... 3 4 5 6 7 ...

16841 Commits