llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 03:53:04 +02:00

Author	SHA1	Message	Date
Xinliang David Li	26f18dd6bb	[PGO] Add hash to name mapping in InstrProfSymtab Creator and lookup interfaces are added to this symtab class. The new interfaces will be used by InstrProf Readers and writer. A unit test is also added for the new APIs. llvm-svn: 256092	2015-12-19 07:44:57 +00:00
Vedant Kumar	6843b30188	Revert "[IR] Move optional data in llvm::Function into a hungoff uselist" This reverts commit r256090. This broke llvm-clang-lld-x86_64-debian-fast. llvm-svn: 256091	2015-12-19 07:30:44 +00:00
Vedant Kumar	46b3967fa2	[IR] Move optional data in llvm::Function into a hungoff uselist Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256090	2015-12-19 07:08:56 +00:00
James Y Knight	fe15b9a096	Possibly fix MSVC compilation after r256054. I don't have any way to test MSVC compilation, but maybe this will fix the error: llvm/Support/TrailingObjects.h(286) : error C3210: 'TrailingObjectsBase' : access declaration can only be applied to a base class member llvm/Support/TrailingObjects.h(337) : see reference to class template instantiation 'llvm::TrailingObjects<BaseTy,TrailingTys...>' being compiled llvm/Support/TrailingObjects.h(286) : error C2602: 'llvm::trailing_objects_internal::TrailingObjectsBase::OverloadToken' is not a member of a base class of 'llvm::TrailingObjects<BaseTy,TrailingTys...>' llvm/Support/TrailingObjects.h(91) : see declaration of 'llvm::trailing_objects_internal::TrailingObjectsBase::OverloadToken' llvm-svn: 256068	2015-12-19 00:53:22 +00:00
Xinliang David Li	cf9d24ec84	[PGO] Cleanup: Move large member functions out of line (NFC) llvm-svn: 256058	2015-12-18 23:06:37 +00:00
James Y Knight	6a87ac0efb	Rewrite the TrailingObjects template to provide two new features: - Automatic alignment of the base type for the alignment requirements of the trailing types. - Support for an arbitrary numbers of trailing types, instead of only 1 or 2, by using a variadic template implementation. Upcoming commits to clang will take advantage of both of these features. Differential Revision: http://reviews.llvm.org/D12439 llvm-svn: 256054	2015-12-18 22:54:37 +00:00
Rafael Espindola	3db381761e	git-clang-format a region I am about to change. llvm-svn: 256048	2015-12-18 22:23:16 +00:00
Xinliang David Li	c749671813	[PGO] Simplify computehash interface (NFC) llvm-svn: 256047	2015-12-18 22:22:12 +00:00
Alexey Samsonov	e2d62e9cd1	[Symbolize] Improve the ownership of parsed objects. This code changes the way Symbolize handles parsed binaries: now parsed OwningBinary<Binary> is not broken into (binary, memory buffer) pair, and is just stored as-is in a cache. ObjectFile components of Mach-O universal binaries are also stored explicitly in a separate cache. Additionally, this change: * simplifies the code that parses/caches binaries: it's now done in a single place, not three different functions. * makes flush() method behave as expected, and actually clear the cached parsed binaries and objects. * fixes a dangling pointer issue described in http://reviews.llvm.org/D15638 llvm-svn: 256041	2015-12-18 22:02:14 +00:00
Cong Hou	d24cdaa414	Use getEdgeProbability() instead of getEdgeWeight() in BFI and remove getEdgeWeight() interfaces from MBPI. This patch removes all getEdgeWeight() interfaces from CodeGen directory. As getEdgeProbability() is a little more expensive than getEdgeWeight(), I will compose a patch soon in which BPI only stores probabilities instead of edge weights so that getEdgeProbability() will have O(1) time. Differential revision: http://reviews.llvm.org/D15489 llvm-svn: 256039	2015-12-18 21:53:24 +00:00
Rafael Espindola	096a36a6a6	Remove redundant argument. NFC. llvm-svn: 256031	2015-12-18 21:18:57 +00:00
Rafael Espindola	a8daff187a	Drop materializeAllPermanently. This inlines materializeAll into the only caller (materializeAllPermanently) and renames materializeAllPermanently to just materializeAll. llvm-svn: 256024	2015-12-18 20:13:39 +00:00
Rafael Espindola	4fe057fd6e	Drop support for dematerializing. It was only used on lib/Linker and the use was "dead" since it was used on a function the IRMover had just moved. llvm-svn: 256019	2015-12-18 19:57:26 +00:00
Pete Cooper	cda523f460	Revert "Improve DWARFDebugFrame::parse to also handle __eh_frame." This reverts commit r256008. Its breaking multiple buildbots, although works for me locally. llvm-svn: 256013	2015-12-18 19:45:38 +00:00
Teresa Johnson	d80a73c588	Rename variables to reflect linker split (NFC) Renamed variables to be more reflective of whether they are an instance of Linker, IRLinker or ModuleLinker. Also fix a stale comment. llvm-svn: 256011	2015-12-18 19:28:59 +00:00
Eric Christopher	d315447cf7	Convert Arg, ArgList, and Option to dump() to dbgs() rather than errs(). Also add print() functions. Patch by Justin Lebar! llvm-svn: 256010	2015-12-18 18:55:26 +00:00
Eric Christopher	de13aaf4d1	Add a dump method for ArgList. Patch by Justin Lebar! llvm-svn: 256009	2015-12-18 18:55:22 +00:00
Pete Cooper	70c961d67d	Improve DWARFDebugFrame::parse to also handle __eh_frame. LLVM MC has single methods which can handle the output of EH frame and DWARF CIE's and FDE's. This code improves DWARFDebugFrame::parse to do the same for parsing. This also allows llvm-objdump to support the --dwarf=frames option which objdump supports. This option dumps the .eh_frame section using the new code in DWARFDebugFrame::parse. http://reviews.llvm.org/D15535 Reviewed by Rafael Espindola. llvm-svn: 256008	2015-12-18 18:51:08 +00:00
Teresa Johnson	a03b8bd4f9	[ThinLTO/LTO] Don't link in unneeded metadata Summary: Third patch split out from http://reviews.llvm.org/D14752. Only map in needed DISubroutine metadata (imported or otherwise linked in functions and other DISubroutine referenced by inlined instructions). This is supported for ThinLTO, LTO and llvm-link --only-needed, with associated tests for each one. Depends on D14838. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14843 llvm-svn: 256003	2015-12-18 17:51:37 +00:00
Vedant Kumar	80e7021204	Revert "[Option] Introduce Arg::print(raw_ostream&) and use llvm::dbgs" This reverts commit r255977. This is part of http://reviews.llvm.org/D15634. llvm-svn: 255978	2015-12-18 02:30:45 +00:00
Vedant Kumar	5d48dc63b5	[Option] Introduce Arg::print(raw_ostream&) and use llvm::dbgs llvm-svn: 255977	2015-12-18 02:27:52 +00:00
Eric Christopher	359dea2a6b	Reorganize the C API headers to improve build times. Type specific declarations have been moved to Type.h and error handling routines have been moved to ErrorHandling.h. Both are included in Core.h so nothing should change for projects directly including the headers, but transitive dependencies may be affected. llvm-svn: 255965	2015-12-18 01:46:52 +00:00
Cong Hou	72c4ebb415	[BranchProbability] Remove the restriction that known and unknown probabilities cannot coexist when being normalized. The current BranchProbability::normalizeProbabilities() forbids known and unknown probabilities to coexist in the list. This was once used to help capture probability exceptions but has caused some reported build failures (https://llvm.org/bugs/show_bug.cgi?id=25838). This patch removes this restriction by evenly distributing the complement of the sum of all known probabilities to unknown ones. We could still treat this as an abnormal behavior, but it is better to emit warnings in our future profile validator. Differential revision: http://reviews.llvm.org/D15548 llvm-svn: 255934	2015-12-17 22:27:07 +00:00
Sanjoy Das	5292411857	[SCEV] Add and use SCEVConstant::getAPInt; NFCI llvm-svn: 255921	2015-12-17 20:28:46 +00:00
Teresa Johnson	0dce8d436c	[ThinLTO] Metadata linking for imported functions Summary: Second patch split out from http://reviews.llvm.org/D14752. Maps metadata as a post-pass from each module when importing complete, suturing up final metadata to the temporary metadata left on the imported instructions. This entails saving the mapping from bitcode value id to temporary metadata in the importing pass, and from bitcode value id to final metadata during the metadata linking postpass. Depends on D14825. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14838 llvm-svn: 255909	2015-12-17 17:14:09 +00:00
Andy Gibbs	d9ba2c9d06	Revert r254592 (virtual dtor in SCEVPredicate). Clang has better diagnostics in this case. It is not necessary therefore to change the destructor to avoid what is effectively an invalid warning in gcc. Instead, better handle the warning flags given to the compiler. llvm-svn: 255905	2015-12-17 16:43:53 +00:00
Tobias Grosser	d78a561416	DOTGraphTraits: Allow the decision to show a graph to consider the analysis The method processFunction() is called to decide if a graph should be shown for a certain function. To allow DOTGraphTraitViewers to take this decision based on the analysis results for the given function, we forward a reference to the analysis result. This will be used by Polly to only visualize functions where interesting loop regions have been detected. llvm-svn: 255889	2015-12-17 12:48:25 +00:00
Xinliang David Li	7fbf182d27	[PGO] InstrPGO and coverage code refactoring (NFC) Introduce a new class InstrProfSymtab to abstract the PGO symbol table for prof and coverage reader. The symtab is is to lookup function's PGO name using function keys. The first user of the class is CoverageMapping Reader. More will follow. llvm-svn: 255862	2015-12-17 00:53:37 +00:00
Rafael Espindola	4c6d9d0ecf	Use std::unique_ptr. NFC. llvm-svn: 255852	2015-12-16 23:49:14 +00:00
Rafael Espindola	f7a0054c75	Change linkInModule to take a std::unique_ptr. Passing in a std::unique_ptr should help find errors when the module is used after being linked into another module. llvm-svn: 255842	2015-12-16 23:16:33 +00:00
Rafael Espindola	5428b16a54	Drop an unnecessary use of writev. It looks like the code this patch deletes is based on a misunderstanding of what guarantees writev provides. In particular, writev with 1 iovec is not "more atomic" than a write. Testing on OS X shows that both write and writev from multiple processes can be intermixed. llvm-svn: 255837	2015-12-16 22:59:06 +00:00
Rafael Espindola	666f0e4525	Simplify memory management with std::unique_ptr. llvm-svn: 255831	2015-12-16 22:28:34 +00:00
Ahmed Bougacha	34e91bd86a	[CodeGen] Make MachineInstrBuilder::copyImplicitOps const. NFC. This matches the other MIB methods, none of which modify the builder. Without this, we can't chain copyImplicitOps. Also reformat the few users, in PPCEarlyReturn. llvm-svn: 255828	2015-12-16 22:15:30 +00:00
Nathan Slingerland	648ea4f74f	[PGO] Handle and report overflow during profile merge for all types of data Summary: Surface counter overflow when merging profile data. Merging still occurs on overflow but counts saturate to the maximum representable value. Overflow is reported to the user. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15547 llvm-svn: 255825	2015-12-16 21:45:43 +00:00
Krzysztof Parzyszek	8feeb85759	[Hexagon] Update e_flags in the ELF definitions llvm-svn: 255820	2015-12-16 20:57:07 +00:00
Manman Ren	9744c391a9	CXX_FAST_TLS calling convention: target independent portion. Update supportSplitCSR's interface to take machine function instead of the calling convention. Review comments for http://reviews.llvm.org/D15341 llvm-svn: 255818	2015-12-16 20:45:48 +00:00
Paul Robinson	9300845b3d	Set debugger tuning from TargetOptions (NFC) Differential Revision: http://reviews.llvm.org/D15427 llvm-svn: 255810	2015-12-16 19:58:30 +00:00
Justin Bogner	58647df890	LPM: Make callers of LPM.deleteLoopFromQueue update LoopInfo directly. NFC As of r255720, the loop pass manager will DTRT when passes update the loop info for removed loops, so they no longer need to reach into LPPassManager APIs to do this kind of transformation. This change very nearly removes the need for the LPPassManager to even be passed into loop passes - the only remaining pass that uses the LPM argument is LoopUnswitch. llvm-svn: 255797	2015-12-16 18:40:20 +00:00
Tom Stellard	88b1e5fba9	MachineScheduler: Add a target hook for deciding which RegPressure sets to increase Summary: This patch adds a function called getRegPressureSetScore() to TargetRegisterInfo. The MachineScheduler uses this when comparing instruction that increase the register pressure of different sets to determine which set is safer to increase. This hook is useful for GPU targets where the number of registers in the class is not the best metric for determing which presser set is safer to increase. Future work may include adding more parameters to this function, like for example, the current pressure level of the set or the amount that the pressure will be increased/decreased. Reviewers: qcolombet, escha, arsenm, atrick, MatzeB Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14806 llvm-svn: 255795	2015-12-16 18:31:01 +00:00
Krzysztof Parzyszek	063469eb6a	[Packetizer] Add a check whether an instruction should be packetized now Add a function VLIWPacketizerList::shouldAddToPacket, which will allow specific implementations to decide if it is profitable to add given instruction to the current packet. llvm-svn: 255780	2015-12-16 16:38:16 +00:00
Vaivaswatha Nagaraj	a478a7d3d6	Add InaccessibleMemOnly and inaccessibleMemOrArgMemOnly attributes Summary: This patch introduces two new function attributes InaccessibleMemOnly: This attribute indicates that the function may only access memory that is not accessible by the program/IR being compiled. This is a weaker form of ReadNone. inaccessibleMemOrArgMemOnly: This attribute indicates that the function may only access memory that is either not accessible by the program/IR being compiled, or is pointed to by its pointer arguments. This is a weaker form of ArgMemOnly Test cases have been updated. This revision uses this (`d001932f3a`) as reference. Reviewers: jmolloy, hfinkel Subscribers: reames, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D15499 llvm-svn: 255778	2015-12-16 16:16:19 +00:00
Keno Fischer	1df6a9d651	[SectionMemoryManager] Make better use of virtual memory Summary: On Windows, the allocation granularity can be significantly larger than a page (64K), so with many small objects, just clearing the FreeMem list rapidly leaks quite a bit of virtual memory space (if not rss). Fix that by only removing those parts of the FreeMem blocks that overlap pages for which we are applying memory permissions, rather than dropping the FreeMem blocks entirely. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15202 llvm-svn: 255760	2015-12-16 11:13:23 +00:00
Vikram TV	5d75d984d9	Recommit LiveDebugValues pass after fixing a couple of minor issues. llvm-svn: 255759	2015-12-16 11:09:48 +00:00
George Burgess IV	0d80a00b30	Minor cleanup of Attribute code. NFC. llvm-svn: 255751	2015-12-16 05:21:02 +00:00
Justin Bogner	5c61081fe6	LPM: Simplify how passes mark loops for deletion. NFC When a pass removes a loop it currently has to reach up into the LPPassManager's internals to update the state of the iteration over loops. This reverse dependency results in a pretty awkward interplay of the LPPassManager and its Passes. Here, we change this to instead keep track of when a loop has become "unlooped" in the Loop objects themselves, then the LPPassManager can check this and manipulate its own state directly. This opens the door to allow most of the loop passes to work without a backreference to the LPPassManager. I've kept passes calling the LPPassManager::deleteLoopFromQueue API now so I could put an assert in to prove that this is NFC, but a later pass will update passes just to preserve the LoopInfo directly and stop referencing the LPPassManager completely. llvm-svn: 255720	2015-12-16 00:01:02 +00:00
Reid Kleckner	58fdd5df47	[WinEH] Make llvm.x86.seh.recoverfp work on x64 It adjusts from RSP-after-prologue to RBP, which is what SEH filters need to do before they can use llvm.localrecover. Fixes SEH filter captures, which were broken in r250088. Issue reported by Alex Crichton. llvm-svn: 255707	2015-12-15 23:40:58 +00:00
Evgeniy Stepanov	39e538e166	Cross-DSO control flow integrity (LLVM part). An LTO pass that generates a __cfi_check() function that validates a call based on a hash of the call-site-known type and the target pointer. llvm-svn: 255693	2015-12-15 23:00:08 +00:00
Reid Kleckner	ba7cf44fda	Wrap include of <future> in some warning suppression pragmas Eventually we may need to sink this include to the .cpp file or something to suport LLVM_ENABLE_THREADS=OFF, but this solves my immediate problem of fixing the build. llvm-svn: 255682	2015-12-15 22:10:30 +00:00
Eric Christopher	3b41a7181d	Add support for the .debug_macro section of the forthcoming DWARF 5 spec. Patch by B. Sivachandra Reddy! llvm-svn: 255679	2015-12-15 21:50:27 +00:00
Reid Kleckner	f034b3c900	Fix clang-cl self-host with MSVC 2013 STL std::bind implementation llvm-svn: 255678	2015-12-15 21:41:58 +00:00
Reid Kleckner	c8a81bcc44	[WinEH] Remove unused intrinsic llvm.x86.seh.restoreframe We can clean this up now that we have the X86 CATCHRET instruction to restore the FP, SP, and BP. llvm-svn: 255677	2015-12-15 21:41:34 +00:00
David Majnemer	608538dccc	[WinEH] Use operand bundles to describe call sites SimplifyCFG allows tail merging with code which terminates in unreachable which, in turn, makes it possible for an invoke to end up in a funclet which it was not originally part of. Using operand bundles on invokes allows us to determine whether or not an invoke was part of a funclet in the source program. Furthermore, it allows us to unambiguously answer questions about the legality of inlining into call sites which the personality may have trouble with. Differential Revision: http://reviews.llvm.org/D15517 llvm-svn: 255674	2015-12-15 21:27:27 +00:00
Xinliang David Li	4bfffacc8b	Coverage code refactoring /NFC llvm-svn: 255670	2015-12-15 19:44:45 +00:00
Justin Bogner	621a2ef540	LPM: Stop threading `Pass ` through all of the loop utility APIs. NFC A large number of loop utility functions take a `Pass ` and reach into it to find out which analyses to preserve. There are a number of problems with this: - The APIs have access to pretty well any Pass state they want, so it's hard to tell what they may or may not do. - Other APIs have copied these and pass around a `Pass *` even though they don't even use it. Some of these just hand a nullptr to the API since the callers don't even have a pass available. - Passes in the new pass manager don't work like the current ones, so the APIs can't be used as is there. Instead, we should explicitly thread the analysis results that we actually care about through these APIs. This is both simpler and more reusable. llvm-svn: 255669	2015-12-15 19:40:57 +00:00
Nathan Slingerland	b95c2eec31	[llvm-profdata] Add support for weighted merge of profile data (2nd try) Summary: This change adds support for specifying a weight when merging profile data with the llvm-profdata tool. Weights are specified by using the --weighted-input=<weight>,<filename> option. Input files not specified with this option (normal positional list after options) are given a default weight of 1. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: davidxl, dnovillo, bogner, silvas Subscribers: silvas, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D15306 llvm-svn: 255659	2015-12-15 17:37:09 +00:00
Tom Stellard	165a990abf	AMDGPU/SI: Add llvm.amdgcn.mbcnt.* intrinsics Summary: These are meant to be used instead of the llvm.SI.tid intrinsic which will be deprecated at some point. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15475 llvm-svn: 255652	2015-12-15 17:02:52 +00:00
Tom Stellard	8d8cf53f5c	AMDGPU/SI: Add llvm.amdgcn.v.interp.p[12] intrinsics Summary: These are meant to be used instead of the llvm.SI.fs.interp intrinsic which will be deprecated at some point. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15474 llvm-svn: 255651	2015-12-15 17:02:49 +00:00
Michael Kuperstein	5b09c8a71e	Do not try to use i8 and i16 versions of FP_TO_U/SINT soft float library calls It appears that neither compiler-rt nor the gnu soft-float libraries actually implement these conversions. Instead of emitting calls to library functions that don't exist, handle it similarly to the way we handle i8 -> float and i16 -> float conversions: call the i32 library function, and adjust the type. Differential Revision: http://reviews.llvm.org/D15151 llvm-svn: 255643	2015-12-15 12:55:50 +00:00
Elena Demikhovsky	61fc55d5da	Type legalizer for masked gather and scatter intrinsics. Full type legalizer that works with all vectors length - from 2 to 16, (i32, i64, float, double). This intrinsic, for example void @llvm.masked.scatter.v2f32(<2 x float>%data , <2 x float*>%ptrs , i32 align , <2 x i1>%mask ) requires type widening for data and type promotion for mask. Differential Revision: http://reviews.llvm.org/D13633 llvm-svn: 255629	2015-12-15 08:40:41 +00:00
Craig Topper	466a91c686	Use CmpInst::Predicate instead of 'unsigned short' in some places. NFC llvm-svn: 255623	2015-12-15 06:11:33 +00:00
Justin Bogner	c058f3a850	LoopUtils: Remove defaults for arguments that are always specified. NFC llvm-svn: 255620	2015-12-15 05:52:13 +00:00
Teresa Johnson	ff38b7b132	Fix template parameter pack handling in ThreadPool Fixes passing of template parameter pack via std::forward and add unittest. llvm-svn: 255617	2015-12-15 04:44:02 +00:00
Weiming Zhao	348c97685c	Bump up MAX_SUBTARGET_FEATURES Summary: Currently, ARMGenSubtargetInfo (from ARM.td) is reaching the limit of 96: enum : uint64_t { ... XScale = 95 }; We need to bump the maximum value up to accommodate future changes and/or customized subtarget definitions. Reviewers: apazos, t.p.northover Subscribers: llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D15514 llvm-svn: 255616	2015-12-15 04:42:49 +00:00
Vaivaswatha Nagaraj	e6c5ddb0c2	NFC: Fix typo in comment llvm-svn: 255615	2015-12-15 04:41:10 +00:00
Nico Weber	630645d32a	Try to let r255604 have an effect. llvm-svn: 255611	2015-12-15 03:14:19 +00:00
Mehdi Amini	02dd190506	Add specific header for MSVC to be able to build with LLVM_ENABLE_THREADS=OFF Follow-up to the ThreadPool library From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255604	2015-12-15 02:32:03 +00:00
Mehdi Amini	b29b50a9dd	Instcombine: destructor loads of structs that do not contains padding For non padded structs, we can just proceed and deaggregate them. We don't want ot do this when there is padding in the struct as to not lose information about this padding (the subsequents passes would then try hard to preserve the padding, which is undesirable). Also update extractvalue.ll and cast.ll so that they use structs with padding. Remove the FIXME in the extractvalue of laod case as the non padded case is handled when processing the load, and we don't want to do it on the padded case. Patch by: Amaury SECHET <deadalnix@gmail.com> Differential Revision: http://reviews.llvm.org/D14483 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255600	2015-12-15 01:44:07 +00:00
Cong Hou	8389268d7a	Let operator/ with uint32_t rhs operand be a member of BranchProbability and add a new operator /=. NFC. llvm-svn: 255595	2015-12-15 01:21:14 +00:00
Mehdi Amini	54fc055738	Add a C++11 ThreadPool implementation in LLVM This is a very simple implementation of a thread pool using C++11 thread. It accepts any std::function<void()> for asynchronous execution. Individual task can be synchronize using the returned future, or the client can block on the full queue completion. In case LLVM is configured with Threading disabled, it falls back to sequential execution using std::async with launch:deferred. This is intended to support parallelism for ThinLTO processing in linker plugin, but is generic enough for any other uses. This is a recommit of r255444 ; trying to workaround a bug in the MSVC 2013 standard library. I think I was hit by: http://connect.microsoft.com/VisualStudio/feedbackdetail/view/791185/std-packaged-task-t-where-t-is-void-or-a-reference-class-are-not-movable Recommit of r255589, trying to please g++ as well. Differential Revision: http://reviews.llvm.org/D15464 From: mehdi_amini <mehdi_amini@91177308-0d34-0410-b5e6-96231b3b80d8> llvm-svn: 255593	2015-12-15 00:59:19 +00:00
Mehdi Amini	5acecbc1ec	Revert "Add a C++11 ThreadPool implementation in LLVM" This reverts commit r255589. Breaks g++ From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255591	2015-12-15 00:42:44 +00:00
Mehdi Amini	dbdd929681	Add a C++11 ThreadPool implementation in LLVM This is a very simple implementation of a thread pool using C++11 thread. It accepts any std::function<void()> for asynchronous execution. Individual task can be synchronize using the returned future, or the client can block on the full queue completion. In case LLVM is configured with Threading disabled, it falls back to sequential execution using std::async with launch:deferred. This is intended to support parallelism for ThinLTO processing in linker plugin, but is generic enough for any other uses. This is a recommit of r255444 ; trying to workaround a bug in the MSVC 2013 standard library. I think I was hit by: http://connect.microsoft.com/VisualStudio/feedbackdetail/view/791185/std-packaged-task-t-where-t-is-void-or-a-reference-class-are-not-movable Differential Revision: http://reviews.llvm.org/D15464 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255589	2015-12-15 00:38:05 +00:00
Xinliang David Li	7640a69ad0	[PGO] make profile prefix even shorter and more readable llvm-svn: 255586	2015-12-15 00:32:56 +00:00
Rafael Espindola	79cecc5f46	Yet another missing include. llvm-svn: 255579	2015-12-14 23:39:05 +00:00
Rafael Espindola	2d1739bf50	A better attempt to add a missing include llvm-svn: 255578	2015-12-14 23:34:35 +00:00
Xinliang David Li	10c9ed2b0f	[PGO] Shorten profile symbol prefixes Profile symbols have long prefixes which waste space and creating pressure for linker. This patch shortens the prefixes to minimal length without losing verbosity. Differential Revision: http://reviews.llvm.org/D15503 llvm-svn: 255575	2015-12-14 23:26:27 +00:00
Rafael Espindola	5b397256de	Use diagnostic handler in the LLVMContext This patch converts code that has access to a LLVMContext to not take a diagnostic handler. This has a few advantages * It is easier to use a consistent diagnostic handler in a single program. * Less clutter since we are not passing a handler around. It does make it a bit awkward to implement some C APIs that return a diagnostic string. I will propose new versions of these APIs and deprecate the current ones. llvm-svn: 255571	2015-12-14 23:17:03 +00:00
Sanjay Patel	14a74b66f7	add fast-math-flags to 'call' instructions (PR21290) This patch adds optional fast-math-flags (the same that apply to fmul/fadd/fsub/fdiv/frem/fcmp) to call instructions in IR. Follow-up patches would use these flags in LibCallSimplifier, add support to clang, and extend FMF to the DAG for calls. Motivating example: %y = fmul fast float %x, %x %z = tail call float @sqrtf(float %y) We'd like to be able to optimize sqrt(x*x) into fabs(x). We do this today using a function-wide attribute for unsafe-math, but we really want to trigger on the instructions themselves: %z = tail call fast float @sqrtf(float %y) because in an LTO build it's possible that calls with fast semantics have been inlined into a function with non-fast semantics. The code changes and tests are based on the recent commits that added "notail": http://reviews.llvm.org/rL252368 and added FMF to fcmp: http://reviews.llvm.org/rL241901 Differential Revision: http://reviews.llvm.org/D14707 llvm-svn: 255555	2015-12-14 21:59:03 +00:00
Krzysztof Parzyszek	0cb4e2fced	[Packetizer] Add AliasAnalysis as a parameter to the packetizer This will make the depedence graph more accurate if an alias analysis is provided. If nullptr is specified in its place, the behavior will remain as it is currently. llvm-svn: 255540	2015-12-14 20:35:13 +00:00
Pete Cooper	183f136627	Add missing vtable anchor's. The following description is from http://reviews.llvm.org/D15481: ICmpInst, GetElementPtrInst and PHINode have no anchor functions. This causes the vtable and the type info (if RTTI is enabled in user code) to be emitted in multiple translation units. Before 3.7, the destructors were the key functions for these nodes, but they have been removed. There have been discussions about this here: http://lists.llvm.org/pipermail/llvm-dev/2015-August/089010.html and here: http://lists.llvm.org/pipermail/llvm-dev/2015-December/092921.html. Patch by Visoiu Mistrih Francis llvm-svn: 255538	2015-12-14 20:29:16 +00:00
Krzysztof Parzyszek	1c65753742	[Packetizer] Make endPacket virtual This will allow custom handling of packet finalization. The current definition of endPacket will still perform the default finalization. llvm-svn: 255537	2015-12-14 20:12:24 +00:00
Sanjoy Das	46fedab671	Teach haveSameSpecialState about operand bundles llvm-svn: 255527	2015-12-14 19:11:35 +00:00
Krzysztof Parzyszek	73ad4fb9e3	Add "const" to function arguments in DFAPacketizer llvm-svn: 255526	2015-12-14 18:54:44 +00:00
Xinliang David Li	2ab348ba60	[PGO] Value profiling text format reader/writer support This patch adds the missing functionality in parsable text format support for value profiling. Differential Revision: http://reviews.llvm.org/D15212 llvm-svn: 255523	2015-12-14 18:44:01 +00:00
David Majnemer	49dcd13916	[IR] Remove terminatepad It turns out that terminatepad gives little benefit over a cleanuppad which calls the termination function. This is not sufficient to implement fully generic filters but MSVC doesn't support them which makes terminatepad a little over-designed. Depends on D15478. Differential Revision: http://reviews.llvm.org/D15479 llvm-svn: 255522	2015-12-14 18:34:23 +00:00
Paul Robinson	252af42f99	FastISel needs to remove dead code when it bails out. When FastISel fails to translate an instruction it hands off code generation to SelectionDAG. Before it does so, it may have generated local value instructions to feed phi nodes in successor blocks. These instructions will then be generated again by SelectionDAG, causing duplication and less efficient code, including extra spill instructions. Patch by Wolfgang Pieb! Differential Revision: http://reviews.llvm.org/D11768 llvm-svn: 255520	2015-12-14 18:33:18 +00:00
NAKAMURA Takumi	a3ac1e279e	Reformat to untabify. llvm-svn: 255483	2015-12-14 07:58:25 +00:00
Michael Zuckerman	8fdb077443	[X86][inline asm] support even directive The .even directive aligns content to an evan-numbered address. In at&t syntax .even In Microsoft syntax even (without the dot). Differential Revision: http://reviews.llvm.org/D15413 llvm-svn: 255462	2015-12-13 17:07:23 +00:00
Cong Hou	b76a7a8dcb	Normalize MBB's successors' probabilities in several locations. This patch adds some missing calls to MBB::normalizeSuccProbs() in several locations where it should be called. Those places are found by checking if the sum of successors' probabilities is approximate one in MachineBlockPlacement pass with some instrumented code (not in this patch). Differential revision: http://reviews.llvm.org/D15259 llvm-svn: 255455	2015-12-13 09:26:17 +00:00
Nico Weber	c244167f92	Revert r255444. It doesn't build on Windows and broke the Windows LLD and LLDB bots: http://lab.llvm.org:8011/builders/lld-x86_64-win7/builds/27693/steps/build_Lld/logs/stdio http://lab.llvm.org:8011/builders/lldb-x86-windows-msvc/builds/13468/steps/build/logs/stdio llvm-svn: 255446	2015-12-13 04:14:39 +00:00
Mehdi Amini	865fad06c6	Add a C++11 ThreadPool implementation in LLVM This is a very simple implementation of a thread pool using C++11 thread. It accepts any std::function<void()> for asynchronous execution. Individual task can be synchronize using the returned future, or the client can block on the full queue completion. In case LLVM is configured with Threading disabled, it falls back to sequential execution using std::async with launch:deferred. This is intended to support parallelism for ThinLTO processing in linker plugin, but is generic enough for any other uses. Differential Revision: http://reviews.llvm.org/D15464 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255444	2015-12-12 22:55:25 +00:00
David Majnemer	bf189bdcd7	[IR] Reformulate LLVM's EH funclet IR While we have successfully implemented a funclet-oriented EH scheme on top of LLVM IR, our scheme has some notable deficiencies: - catchendpad and cleanupendpad are necessary in the current design but they are difficult to explain to others, even to seasoned LLVM experts. - catchendpad and cleanupendpad are optimization barriers. They cannot be split and force all potentially throwing call-sites to be invokes. This has a noticable effect on the quality of our code generation. - catchpad, while similar in some aspects to invoke, is fairly awkward. It is unsplittable, starts a funclet, and has control flow to other funclets. - The nesting relationship between funclets is currently a property of control flow edges. Because of this, we are forced to carefully analyze the flow graph to see if there might potentially exist illegal nesting among funclets. While we have logic to clone funclets when they are illegally nested, it would be nicer if we had a representation which forbade them upfront. Let's clean this up a bit by doing the following: - Instead, make catchpad more like cleanuppad and landingpad: no control flow, just a bunch of simple operands; catchpad would be splittable. - Introduce catchswitch, a control flow instruction designed to model the constraints of funclet oriented EH. - Make funclet scoping explicit by having funclet instructions consume the token produced by the funclet which contains them. - Remove catchendpad and cleanupendpad. Their presence can be inferred implicitly using coloring information. N.B. The state numbering code for the CLR has been updated but the veracity of it's output cannot be spoken for. An expert should take a look to make sure the results are reasonable. Reviewers: rnk, JosephTremoulet, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D15139 llvm-svn: 255422	2015-12-12 05:38:55 +00:00
Diego Novillo	9bbd13f9a0	SamplePGO - Reduce memory utilization by 10x. DenseMap is the wrong data structure to use for sample records and call sites. The keys are too large, causing massive core memory growth when reading profiles. Before this patch, a 21Mb input profile was causing the compiler to grow to 3Gb in memory. By switching to std::map, the compiler now grows to 300Mb in memory. There still are some opportunities for memory footprint reduction. I'll be looking at those next. llvm-svn: 255389	2015-12-11 23:21:38 +00:00
Hal Finkel	e58db13c29	Revert r248483, r242546, r242545, and r242409 - absdiff intrinsics After much discussion, ending here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151123/315620.html it has been decided that, instead of having the vectorizer directly generate special absdiff and horizontal-add intrinsics, we'll recognize the relevant reduction patterns during CodeGen. Accordingly, these intrinsics are not needed (the operations they represent can be pattern matched, as is already done in some backends). Thus, we're backing these out in favor of the current development work. r248483 - Codegen: Fix llvm.*absdiff semantic. r242546 - [ARM] Use [SU]ABSDIFF nodes instead of intrinsics for VABD/VABA r242545 - [AArch64] Use [SU]ABSDIFF nodes instead of intrinsics for ABD/ABA r242409 - [Codegen] Add intrinsics 'absdiff' and corresponding SDNodes for absolute difference operation llvm-svn: 255387	2015-12-11 23:11:52 +00:00
Xinliang David Li	2cad28318a	[PGO] Revert r255365: solution incomplete, not handling lambda yet llvm-svn: 255369	2015-12-11 20:23:22 +00:00
Xinliang David Li	de46641441	[PGO] Stop using invalid char in instr variable names. Before the patch, -fprofile-instr-generate compile will fail if no integrated-as is specified when the file contains any static functions (the -S output is also invalid). This patch fixed the issue. With the change, the index format version will be bumped up by 1. Backward compatibility is preserved with this change. Differential Revision: http://reviews.llvm.org/D15243 llvm-svn: 255365	2015-12-11 19:53:19 +00:00
Matthias Braun	b8675cada7	CodeGen: Redo analyzePhysRegs() and computeRegisterLiveness() computeRegisterLiveness() was broken in that it reported dead for a register even if a subregister was alive. I assume this was because the results of analayzePhysRegs() are hard to understand with respect to subregisters. This commit: Changes the results of analyzePhysRegs (=struct PhysRegInfo) to be clearly understandable, also renames the fields to avoid silent breakage of third-party code (and improve the grammar). Fix all (two) users of computeRegisterLiveness() in llvm: By reenabling it and removing workarounds for the bug. This fixes http://llvm.org/PR24535 and http://llvm.org/PR25033 Differential Revision: http://reviews.llvm.org/D15320 llvm-svn: 255362	2015-12-11 19:42:09 +00:00
Matt Arsenault	fad94dae85	Start replacing vector_extract/vector_insert with extractelt/insertelt These are redundant pairs of nodes defined for INSERT_VECTOR_ELEMENT/EXTRACT_VECTOR_ELEMENT. insertelement/extractelement are slightly closer to the corresponding C++ node name, and has stricter type checking so prefer it. Update targets to only use these nodes where it is trivial to do so. AArch64, ARM, and Mips all have various type errors on simple replacement, so they will need work to fix. Example from AArch64: def : Pat<(sext_inreg (vector_extract (v16i8 V128:$Rn), VectorIndexB:$idx), i8), (i32 (SMOVvi8to32 V128:$Rn, VectorIndexB:$idx))>; Which is trying to do sext_inreg i8, i8. llvm-svn: 255359	2015-12-11 19:20:16 +00:00
Manman Ren	6d08250fe8	CXX_FAST_TLS calling convention: target independent portion. The access function has a short entry and a short exit, the initialization block is only run the first time. To improve the performance, we want to have a short frame at the entry and exit. We explicitly handle most of the CSRs via copies. Only the CSRs that are not handled via copies will be in CSR_SaveList. Frame lowering and prologue/epilogue insertion will generate a short frame in the entry and exit according to CSR_SaveList. The majority of the CSRs will be handled by register allcoator. Register allocator will try to spill and reload them in the initialization block. We add CSRsViaCopy, it will be explicitly handled during lowering. 1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target supports it for the given calling convention and the function has only return exits). We also call TLI->initializeSplitCSR to perform initialization. 2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to virtual registers at beginning of the entry block and copies from virtual registers to CSRsViaCopy at beginning of the exit blocks. 3> we also need to make sure the explicit copies will not be eliminated. rdar://problem/23557469 Differential Revision: http://reviews.llvm.org/D15340 llvm-svn: 255353	2015-12-11 18:24:30 +00:00
Xinliang David Li	b2d316e534	[PGO] Read VP raw data without depending on the Value field Before this patch, each function's on-disk VP data is 'pointed' to by the Value field of per-function ProfileData structue, and read relies on this field (relocated with ValueDataDelta field) to read the value data. However this means the Value field needs to be updated during runtime before dumping, which creates undesirable data races. With this patch, the reading of VP data no longer depends on Value field. There is no format change. ValueDataDelta header field becomes obsolute but will be kept for compatibility reason (will be removed next time the raw format change is needed). llvm-svn: 255329	2015-12-11 06:53:53 +00:00
Xinliang David Li	7d948a5ebf	Add a forward declaration (NFC) llvm-svn: 255292	2015-12-10 20:13:41 +00:00
Nathan Slingerland	5f9f1eadf1	[ProfileData] Add unit test infrastructure for sample profile reader/writer Summary: Adds support for in-memory round-trip of sample profile data along with basic round trip unit tests. This will also make it easier to include unit tests for future changes to sample profiling. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15211 llvm-svn: 255264	2015-12-10 17:21:42 +00:00
Rafael Espindola	9a2fc3b415	Avoid undefined behavior when vector is empty. Found by ubsan. llvm-svn: 255258	2015-12-10 16:35:06 +00:00
Rafael Espindola	c944447e56	Slit lib/Linker in two. A linker normally has two stages: symbol resolution and "moving stuff". In lib/Linker there is the complication of lazy linking some globals, but it was still far more mixed than it needed to. This splits the linker into a lower level IRMover and the linker proper. The IRMover just takes a list of globals to move and a callback that lets the user control what is lazy linked. The main motivation is that now tools/gold (and soon lld) can use their own symbol resolution to instruct IRMover what to do. llvm-svn: 255254	2015-12-10 14:19:35 +00:00
Amjad Aboud	85f2758759	Macro debug info support in LLVM IR Introduced DIMacro and DIMacroFile debug info metadata in the LLVM IR to support macros. Differential Revision: http://reviews.llvm.org/D14687 llvm-svn: 255245	2015-12-10 12:56:35 +00:00
Jonas Paulsson	34ea6437fb	[PostRA scheduling] Allow a target to do scheduling when it wants post RA. SystemZ needs to do its scheduling after branch relaxation, which can only happen after block placement, and therefore the standard PostRAScheduler point in the pass sequence is too early. TargetMachine::targetSchedulesPostRAScheduling() is a new method that signals on returning true that target will insert the final scheduling pass on its own. Reviewed by Hal Finkel llvm-svn: 255234	2015-12-10 09:10:07 +00:00
Sanjoy Das	d85ded90d0	Add arg_begin() and arg_end() to CallInst and InvokeInst; NFCI - This simplifies the CallSite class, arg_begin / arg_end are now simple wrapper getters. - In several places, we were creating CallSite instances solely to call arg_begin and arg_end. With this change, that's no longer required. llvm-svn: 255226	2015-12-10 06:39:02 +00:00
Matthias Braun	c65dd31462	RegisterPressure: Factor out liveness dead-def detection logic; NFCI Detecting additional dead-defs without a dead flag that are only visible through liveness information should be part of the register operand collection not intertwined with the register pressure update logic. llvm-svn: 255192	2015-12-10 01:04:15 +00:00
Sanjoy Das	f3ba629c4d	Use WeakVH to keep track of calls with operand bundles in CloneCodeInfo `CloneAndPruneIntoFromInst` can DCE instructions after cloning them into the new function, and so an AssertingVH is too strong. This change switches CloneCodeInfo to use a std::vector<WeakVH>. llvm-svn: 255148	2015-12-09 20:33:52 +00:00
Teresa Johnson	83a7df21b2	[ThinLTO] FunctionImport pass can take a const index pointer (NFC) llvm-svn: 255140	2015-12-09 19:39:47 +00:00
Rong Xu	2f995f2098	[PGO] Resubmit "MST based PGO instrumentation infrastructure" (r254021) This new patch fixes a few bugs that exposed in last submit. It also improves the test cases. --Original Commit Message-- This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees minimum number of CFG edges getting instrumented. An addition optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 255132	2015-12-09 18:08:16 +00:00
Nathan Slingerland	3b35892e68	[Support] Change SaturatingAdd()/SaturatingMultiply() to use pointer for returning overflow state Summary: Improve SaturatingAdd()/SaturatingMultiply() to use bool * to optionally return overflow result. This should make it clearer that the value is returned at callsites and reduces the size of the implementation. Reviewers: davidxl, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15219 llvm-svn: 255128	2015-12-09 17:11:28 +00:00
Mehdi Amini	300ed48d90	Change hasUniqueInitializer() to call isStrongDefinitionForLinker() instead of !isWeakForLinker() Summary: Available_externally global variable with initializer were considered "hasInitializer()", while obviously it can't match the description: Whether the global variable has an initializer, and any changes made to the initializer will turn up in the final executable. since modifying the initializer of an externally available variable does not make sense. Reviewers: pcc, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15351 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255123	2015-12-09 16:17:07 +00:00
Silviu Baranga	d19d7b747a	Re-commit r255115, with the PredicatedScalarEvolution class moved to ScalarEvolution.h, in order to avoid cyclic dependencies between the Transform and Analysis modules: [LV][LAA] Add a layer over SCEV to apply run-time checked knowledge on SCEV expressions Summary: This change creates a layer over ScalarEvolution for LAA and LV, and centralizes the usage of SCEV predicates. The SCEVPredicatedLayer takes the statically deduced knowledge by ScalarEvolution and applies the knowledge from the SCEV predicates. The end goal is that both LAA and LV should use this interface everywhere. This also solves a problem involving the result of SCEV expression rewritting when the predicate changes. Suppose we have the expression (sext {a,+,b}) and two predicates P1: {a,+,b} has nsw P2: b = 1. Applying P1 and then P2 gives us {a,+,1}, while applying P2 and the P1 gives us sext({a,+,1}) (the AddRec expression was changed by P2 so P1 no longer applies). The SCEVPredicatedLayer maintains the order of transformations by feeding back the results of previous transformations into new transformations, and therefore avoiding this issue. The SCEVPredicatedLayer maintains a cache to remember the results of previous SCEV rewritting results. This also has the benefit of reducing the overall number of expression rewrites. Reviewers: mzolotukhin, anemet Subscribers: jmolloy, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14296 llvm-svn: 255122	2015-12-09 16:06:28 +00:00
Silviu Baranga	ba0669cbca	Revert r255115 until we figure out how to fix the bot failures. llvm-svn: 255117	2015-12-09 15:25:28 +00:00
Silviu Baranga	f6006f41f7	[LV][LAA] Add a layer over SCEV to apply run-time checked knowledge on SCEV expressions Summary: This change creates a layer over ScalarEvolution for LAA and LV, and centralizes the usage of SCEV predicates. The SCEVPredicatedLayer takes the statically deduced knowledge by ScalarEvolution and applies the knowledge from the SCEV predicates. The end goal is that both LAA and LV should use this interface everywhere. This also solves a problem involving the result of SCEV expression rewritting when the predicate changes. Suppose we have the expression (sext {a,+,b}) and two predicates P1: {a,+,b} has nsw P2: b = 1. Applying P1 and then P2 gives us {a,+,1}, while applying P2 and the P1 gives us sext({a,+,1}) (the AddRec expression was changed by P2 so P1 no longer applies). The SCEVPredicatedLayer maintains the order of transformations by feeding back the results of previous transformations into new transformations, and therefore avoiding this issue. The SCEVPredicatedLayer maintains a cache to remember the results of previous SCEV rewritting results. This also has the benefit of reducing the overall number of expression rewrites. Reviewers: mzolotukhin, anemet Subscribers: jmolloy, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14296 llvm-svn: 255115	2015-12-09 15:03:52 +00:00
Mehdi Amini	65450e86c8	Revert "Implement a new pass - LiveDebugValues - to compute the set of live DEBUG_VALUEs at each basic block and insert them. Reviewed and accepted at: http://reviews.llvm.org/D11933 " This reverts commit r255096. Break the bots: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/16378/ From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255101	2015-12-09 08:17:42 +00:00
Vikram TV	eb98340d4a	Implement a new pass - LiveDebugValues - to compute the set of live DEBUG_VALUEs at each basic block and insert them. Reviewed and accepted at: http://reviews.llvm.org/D11933 llvm-svn: 255096	2015-12-09 05:49:14 +00:00
Steven Wu	1b8163c1b8	Fix the order of destructors in LibLTOCodeGenerator Summary: The order of destructors in LTOCodeGenerator gets changed in r254696. It is possible for LTOCodeGenerator to have a MergedModule created in the OwnedContext, in which case the module must be destructed before the context. Reviewers: rafael, dexonsmith Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D15346 llvm-svn: 255092	2015-12-09 03:37:51 +00:00
Rafael Espindola	f20bc23b7c	Return a std::unique_ptr from CloneModule. NFC. llvm-svn: 255078	2015-12-08 23:57:17 +00:00
Mehdi Amini	ba2c064383	Remove caching in FunctionImport: a Module can't be reused after being linked from The Linker destroys the source module (API change coming to make it explicit) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255064	2015-12-08 22:39:40 +00:00
Mehdi Amini	e4f5a60024	Revert "Add Available Externally linkage type to isWeakForLinker()" This reverts r255043, as per post-review concern were raised on the correctness. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255045	2015-12-08 19:13:31 +00:00
Mehdi Amini	adf4a628c7	Add Available Externally linkage type to isWeakForLinker() Per LangRef: "Globals with available_externally linkage are allowed to be discarded at will, and are otherwise the same as linkonce_odr", since linkonce_odr is in this list it makes sense to have available_externally there as well. Reviewers: rafael Differential Revision: http://reviews.llvm.org/D15323 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255043	2015-12-08 19:01:29 +00:00
Asaf Badouh	73424c7e6b	[x86][avx512] more changes in intrinsics to be align with gcc format Differential Revision: http://reviews.llvm.org/D15329 llvm-svn: 255011	2015-12-08 12:34:34 +00:00
Sanjoy Das	90bb44dfe3	[OperandBundles] Remove unncessary constructor The StringRef constructor is unnecessary (since we're converting to std::string anyway), and having it requires an explicit call to StringRef's or std::string's constructor. llvm-svn: 255000	2015-12-08 03:50:32 +00:00
Sanjoy Das	2f7aca1668	[IndVars] Have getInsertPointForUses preserve LCSSA Summary: Also add a stricter post-condition for IndVarSimplify. Fixes PR25578. Test case by Michael Zolotukhin. Reviewers: hfinkel, atrick, mzolotukhin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15059 llvm-svn: 254977	2015-12-08 00:13:21 +00:00
Sanjoy Das	ec1f59a19a	[SCEVExpander] Have hoistIVInc preserve LCSSA Summary: (Note: the problematic invocation of hoistIVInc that caused PR24804 came from IndVarSimplify, not from SCEVExpander itself) Fixes PR24804. Test case by David Majnemer. Reviewers: hfinkel, majnemer, atrick, mzolotukhin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15058 llvm-svn: 254976	2015-12-08 00:13:17 +00:00
Sanjoy Das	9d4e519ec7	Add Instruction::getFunction; NFC Will be used in a upcoming patch. llvm-svn: 254975	2015-12-08 00:13:12 +00:00
Philip Reames	b28303ef74	[PassManager] Tuning Memory Usage of AnalysisUsage We were using unneccessarily large initial sizes for these SmallVectors. This was wasting around 50kb of memory for the O3 pipeline, even after the uniquing changes. We're still using around 20kb which is a bit much, but it's definitely better. This is about a 6% improvement in total O3 memory usage. Note: The raw data on structure size which were used to pick these thresholds can be found in the review thread. Differential Revision: http://reviews.llvm.org/D15244 llvm-svn: 254974	2015-12-08 00:10:56 +00:00
Justin Bogner	8b4ea7a451	Support: Teach Asan about BumpPtrAllocator Based on patch by Pete Cooper. llvm-svn: 254964	2015-12-07 23:12:26 +00:00
Mehdi Amini	87b7d8abe9	Remove useless hack that avoids calling LLVMLinkInInterpreter() This is supposed to force-link the Interpreter, by inserting a dead call to LLVMLinkInInterpreter(). Since it is actually an empty function, there is no reason for the call to be dead. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254956	2015-12-07 22:27:19 +00:00
Teresa Johnson	335927a217	Fix function return type in declaration (bot errors from r254926). Try to fix bot build errors from r254926 by correcting the function return type. llvm-svn: 254934	2015-12-07 19:53:38 +00:00
Sanjay Patel	51a4d88a44	fix 'the the '; NFC llvm-svn: 254928	2015-12-07 19:21:39 +00:00
Teresa Johnson	1fb89d62fb	[ThinLTO] Support for specifying function index from pass manager Summary: Add a field on the PassManagerBuilder that clang or gold can use to pass down a pointer to the function index in memory to use for importing when the ThinLTO backend is triggered. Add support to supply this to the function import pass. Reviewers: joker.eph, dexonsmith Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D15024 llvm-svn: 254926	2015-12-07 19:21:11 +00:00
Lang Hames	096bda3427	[Orc] Removing traces of takeOwnershipOfBuffers left after r251560. Patch by Joshua Gerrard. Thanks Joshua! llvm-svn: 254919	2015-12-07 17:35:56 +00:00
Teresa Johnson	4789ea589d	[ThinLTO] Support cloning of temporary DILocation metadata This is needed to support linking of module-level metadata as a postpass after function importing, where we will be leaving temporary metadata on imported instructions until the postpass metadata import. Also added unittest. Split from D14838. llvm-svn: 254914	2015-12-07 15:05:44 +00:00
Igor Breger	2e5da39635	AVX-512: implement kunpck intrinsics. Differential Revision: http://reviews.llvm.org/D14821 llvm-svn: 254908	2015-12-07 13:25:18 +00:00
Asaf Badouh	201b9ca305	[avx512] rename gcc intrinsics to be align with gcc format rename the gcc intrinsics suffix : _mask ->_round Differential Revision: http://reviews.llvm.org/D15285 llvm-svn: 254905	2015-12-07 13:14:14 +00:00
Craig Topper	6d9821a6ce	Add uint8_t size to LegalizeTypeAction enum and use the enum type directly to remove some typecasts. NFC llvm-svn: 254895	2015-12-07 06:31:41 +00:00
Craig Topper	2d42dff673	Add uint8_t size to LegalizeAction enum so we can use the enum type directly and remove some casts. NFC llvm-svn: 254893	2015-12-07 06:01:51 +00:00
Lang Hames	dde8f900ec	[Orc] Rename IndirectStubsManagerBase to IndirectStubsManager. No functional change. llvm-svn: 254885	2015-12-06 19:44:45 +00:00
Asaf Badouh	903869d4c1	[X86][AVX512] add vmovss/sd missing encoding Differential Revision: http://reviews.llvm.org/D14701 llvm-svn: 254875	2015-12-06 13:26:56 +00:00
Michael Kuperstein	0fd1d57c01	[X86] Always generate precise CFA adjustments. This removes the code path that generate "synchronous" (only correct at call site) CFA. We will probably want to re-introduce it once we are capable of emitting different .eh_frame and .debug_frame sections. Differential Revision: http://reviews.llvm.org/D14948 llvm-svn: 254874	2015-12-06 13:06:20 +00:00
Craig Topper	55a007dfc9	Use make_range to reduce mentions of iterator type. NFC llvm-svn: 254872	2015-12-06 05:08:07 +00:00
Craig Topper	a9a7f8f04c	Minor formatting fix. NFC llvm-svn: 254871	2015-12-06 05:07:58 +00:00
Sanjoy Das	16ad4f2471	[InstCombine] Call getCmpPredicateForMinMax only with a valid SPF Summary: There are `SelectPatternFlavor`s that don't represent min or max idioms, and we should not be passing those to `getCmpPredicateForMinMax`. Fixes PR25745. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15249 llvm-svn: 254869	2015-12-05 23:44:22 +00:00
Craig Topper	bf1c629225	Replace uint16_t with the MCPhysReg typedef in many places. A lot of physical register arrays already use this typedef. llvm-svn: 254843	2015-12-05 07:13:35 +00:00
Craig Topper	250c981bf4	Use std::fill instead of memset to initialize an array to avoid hardcoded count and a multiply. The outputed code is identical. llvm-svn: 254842	2015-12-05 07:13:25 +00:00
Xinliang David Li	c84cd3b8ff	[PGO] Add version to getPGOFuncName method Different version of indexed format may use different name uniquing schemes for static functions. Pass the version info to the name interface so that different schmes can be picked (for profile lookup). llvm-svn: 254838	2015-12-05 05:16:36 +00:00
David Blaikie	09b741f7b6	[llvm-dwp] Support debug_tu_index llvm-svn: 254827	2015-12-05 03:05:45 +00:00
Lang Hames	20a7c0b0dd	Whitespace. llvm-svn: 254821	2015-12-05 01:44:20 +00:00
Justin Bogner	62356515cc	CodeGen: Let the BumpPtrAllocator free the elements of indexList The indexList's nodes are all allocated on a BumpPtrAllocator, so it's more efficient to let them be freed when it goes away, rather than deleting them directly. This is a follow up to r254794. llvm-svn: 254808	2015-12-05 00:39:14 +00:00
Philip Reames	730b7eb8e3	[EarlyCSE] IsSimple vs IsVolatile naming clarification (NFC) When the notion of target specific memory intrinsics was introduced to EarlyCSE, the commit confused the notions of volatile and simple memory access. Since I'm about to start working on this area, cleanup the naming so that patches aren't horribly confusing. Note that the actual implementation was always bailing if the load or store wasn't simple. Reminder: - "volatile" - C++ volatile, can't remove any memory operations, but in principal unordered - "ordered" - imposes ordering constraints on other nearby memory operations - "atomic" - can't be split or sheared. In LLVM terms, all "ordered" operations are also atomic so the predicate "isAtomic" is often used. - "simple" - a load which is none of the above. These are normal loads and what most of the optimizer works with. llvm-svn: 254805	2015-12-05 00:18:33 +00:00
Philip Reames	a39a875fea	[PassManager] Ensure destructors of cached AnalysisUsage objects are run In 254760, I introduced the usage of a BumpPtrAllocator for the AnalysisUsage instances held by the PassManger. This turns out to have been incorrect since a BumpPtrAllocator does not run the destructors of objects when deallocating memory. Since a few of our SmallVector's had grown beyond their small size, we end up with some leaked memory. We need to use a SpecificBumpPtrAllocator instead. llvm-svn: 254803	2015-12-04 23:48:19 +00:00
Teresa Johnson	f06c8232ba	[ThinLTO] Helper for performing renaming/promotion on a module Creates a module and performs necessary renaming/promotion of locals that may be exported to another module. Split out of D15024. llvm-svn: 254802	2015-12-04 23:40:22 +00:00
Philip Reames	62dd4420a0	Address a memory leak in 254760 The issue appears to have been that the copy constructor of the SmallVector was being invoked and this was somehow leading to leaked memory. This patch avoids the symptom, but likely doesn't address the underlying problem. I'm still investigating the root cause, but wanted to avoid the memory leak in the mean time. Even with the underlying fix, avoiding the redundant allocation is worthwhile. llvm-svn: 254795	2015-12-04 23:06:33 +00:00
Justin Bogner	6c86c50af0	CodeGen: Move the SlotIndexes BumpPtrAllocator before the list it allocates When a `SlotIndexes` is destroyed, `ileAllocator` will currently be destructed before `IndexList`, but all of `IndexList`'s storage has been allocated by `ileAllocator`. This means we'll call destructors on garbage data, which is very bad. This can be avoided by putting the BumpPtrAllocator earlier in the class than anything it allocates. Unfortunately, I don't know how to test this. It depends very much on memory layout, and the only evidence I have that this is actually happening in practice are backtraces that might be explained by this. By inspection though, the code is obviously dangerous/wrong, and this is the right thing to do. I'll follow up later with a patch that calls clearAndLeakNodesUnsafely on the list, since there isn't much point in destructing them when they're allocated in a BPA anyway, but I figured it makes sense to commit the correctness fix separately from that optimization. llvm-svn: 254794	2015-12-04 23:00:54 +00:00
Lang Hames	c8c41976a3	[Orc] Move some code up into the JITCompileCallbackManager base class. NFC. llvm-svn: 254778	2015-12-04 22:09:19 +00:00
Rafael Espindola	2c7e9b8d27	Always pass a diagnostic handler to the linker. Before this patch the diagnostic handler was optional. If it was not passed, the one in the LLVMContext was used. That is probably not a pattern we want to follow. If each area has an optional callback, there is a sea of callbacks and it is hard to follow which one is called. Doing this also found cases where the callback is a nice addition, like testing that no errors or warnings are reported. The other option is to always use the diagnostic handler in the LLVMContext. That has a few problems * To implement the C API we would have to set the diag handler and then set it back to the original value. * Code that creates the context might be far away from code that wants the diagnostics. I do have a patch that implements the second option and will send that as an RFC. llvm-svn: 254777	2015-12-04 22:08:53 +00:00
Sanjoy Das	6dd256008e	[OperandBundles] Allow operand-specific attributes in operand bundles Currently `OperandBundleUse::operandsHaveAttr` computes its result without being given a specific operand. This is problematic because it forces us to say that, e.g., even non-pointer operands in `"deopt"` operand bundles are `readonly`, which doesn't make sense. This commit changes `operandsHaveAttr` to work in the context of a specific operand, so that we can give the operand attributes that make sense for the operands's `llvm::Type`. llvm-svn: 254764	2015-12-04 20:34:37 +00:00
Philip Reames	cce89f4a30	[LegacyPassManager] Reduce memory usage for AnalysisUsage The LegacyPassManager was storing an instance of AnalysisUsage for each instance of each pass. In practice, most instances of a single pass class share the same dependencies. We can't rely on this because passes can (and some do) have dynamic dependencies based on instance options. We can exploit the likely commonality by uniqueing the usage information after querying the pass, but before storing it into the pass manager. This greatly reduces memory consumption by the AnalysisUsage objects. For a long pass pipeline, I measured a decrease in memory consumption for this storage of about 50%. I have not measured on the default O3 pipeline, but I suspect it will see some benefit as well since many passes are repeated (e.g. InstCombine). Differential Revision: http://reviews.llvm.org/D14677 llvm-svn: 254760	2015-12-04 20:05:04 +00:00
Matthias Braun	54aeb74db9	ScheduleDAGInstrs: Move LiveIntervals field to ScheduleDAGMI Now that ScheduleDAGInstrs doesn't need it anymore we can move the field down the class hierarcy to ScheduleDAGMI. llvm-svn: 254759	2015-12-04 19:54:24 +00:00
Sanjay Patel	2b56d370e1	don't repeat function names in comments; NFC llvm-svn: 254740	2015-12-04 17:54:31 +00:00
Manman Ren	107407fcfc	[CXX TLS calling convention] Add CXX TLS calling convention. This commit adds a new target-independent calling convention for C++ TLS access functions. It aims to minimize overhead in the caller by perserving as many registers as possible. The target-specific implementation for X86-64 is defined as following: Arguments are passed as for the default C calling convention The same applies for the return value(s) The callee preserves all GPRs - except RAX and RDI The access function makes C-style TLS function calls in the entry and exit block, C-style TLS functions save a lot more registers than normal calls. The added calling convention ties into the existing implementation of the C-style TLS functions, so we can't simply use existing calling conventions such as preserve_mostcc. rdar://9001553 llvm-svn: 254737	2015-12-04 17:40:13 +00:00
David Blaikie	3fd553d9d8	[llvm-dwp] Retrieve the DWOID from the CU for the cu_index entry llvm-svn: 254731	2015-12-04 17:20:04 +00:00
Rafael Espindola	b4f7892e3a	Modernize the C++ APIs for creating LTO modules. This is a continuation of r253367. These functions return is owned by the caller, so they return std::unique_ptr now. The call can fail, so the return is wrapped in ErrorOr. They have a context where to report diagnostics, so they don't need to take a string out parameter. With this there are no call to getGlobalContext in lib/LTO. llvm-svn: 254721	2015-12-04 16:14:31 +00:00
Rafael Espindola	b0476044d6	Move a call to getGlobalContext out of lib/LTO. llvm-svn: 254696	2015-12-04 02:42:28 +00:00
Lang Hames	84deb31665	[Orc] Rename JITCompileCallbackManagerBase to JITCompileCallbackManager. This class is turning into a useful interface, rather than an implementation detail, so I'm dropping the 'Base' suffix. No functional change. llvm-svn: 254693	2015-12-04 02:15:39 +00:00
Nathan Slingerland	5a941550ef	Revert "[llvm-profdata] Add support for weighted merge of profile data" This reverts commit b7250858d96b8ce567681214273ac0e62713c661. Reverting in order to investigate Windows test failure. llvm-svn: 254687	2015-12-04 02:13:58 +00:00
Matthias Braun	cdae83c48c	ScheduleDAGInstrs: Rework schedule graph builder. Re-comitting with a change that avoids undefined uses getting put into the VRegUses list. The new algorithm remembers the uses encountered while walking backwards until a matching def is found. Contrary to the previous version this: - Works without LiveIntervals being available - Allows to increase the precision to subregisters/lanemasks (not used for now) The changes in the AMDGPU tests are necessary because the R600 scheduler is not stable with respect to the order of nodes in the ready queues. Differential Revision: http://reviews.llvm.org/D9068 llvm-svn: 254683	2015-12-04 01:51:19 +00:00
Matthias Braun	60703e1f97	raw_ostream: << operator for callables with raw_ostream argument This is a revised version of r254655 which uses a Printable wrapper class to avoid ambiguous overload problems. Differential Revision: http://reviews.llvm.org/D14348 llvm-svn: 254681	2015-12-04 01:31:59 +00:00
Xinliang David Li	799fa97d82	[PGO] Unify VP data format between raw and indexed profile (Reader) With the latest refactoring and code sharing patches landed, it is possible to unify the value profile implementation between raw and indexed profile. This is the patch in raw profile reader that uses the common interface. Differential Revision: http://reviews.llvm.org/D15056 llvm-svn: 254677	2015-12-04 01:02:10 +00:00
Cong Hou	cc25d3b7d5	Don't punish vectorized arithmetic instruction whose type will be split to multiple registers Currently in LLVM's cost model, a vectorized arithmetic instruction will have high cost if its type is split into multiple registers. However, this punishment is too heavy and unnecessary. The overhead of the split should not be on arithmetic instructions but instructions that implement the split. Note that during vectorization we have calculated the register pressure, and we only choose proper interleaving factor (and also vectorization factor) so that we don't use more registers than the maximum number. Here is a very simple example: if a vadd has the cost 1, and if we double VF so that we need two registers to perform it, then its cost will become 4 with the current implementation, which will prevent us to use larger VF. Differential revision: http://reviews.llvm.org/D15159 llvm-svn: 254671	2015-12-04 00:36:58 +00:00
Nathan Slingerland	e6205d97d7	[llvm-profdata] Add support for weighted merge of profile data This change adds support for an optional weight when merging profile data with the llvm-profdata tool. Weights are specified by adding an option ':<weight>' suffix to the input file names. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: dnovillo, bogner, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14547 llvm-svn: 254669	2015-12-04 00:00:20 +00:00
Kevin B. Smith	078e5ad0c3	[CodeGen] Minor correction to comment on PhysRegInfo. Differential revision: http://reviews.llvm.org/D15216 llvm-svn: 254668	2015-12-04 00:00:10 +00:00
Rafael Espindola	116a2b8530	Simplify since this function never fails. llvm-svn: 254667	2015-12-03 23:56:42 +00:00
Matthias Braun	a075ebf2eb	Revert "raw_ostream: << operator for callables with raw_stream argument" This commit provoked "error C2593: 'operator <<' is ambiguous" on MSVC. This reverts commit r254655. llvm-svn: 254661	2015-12-03 23:00:28 +00:00
David Majnemer	dc587eeed6	[Analysis] Become aware of MSVC's new/delete functions The compiler can take advantage of the allocation/deallocation function's properties. We knew how to do this for Itanium but had no support for MSVC-style functions. llvm-svn: 254656	2015-12-03 22:45:19 +00:00
Matthias Braun	cf5631d3ff	raw_ostream: << operator for callables with raw_stream argument This allows easier construction of print helpers. Example: Printable PrintLaneMask(unsigned LaneMask) { return Printable([LaneMask](raw_ostream &OS) { OS << format("%08X", LaneMask); }); } // Usage: OS << PrintLaneMask(Mask); Differential Revision: http://reviews.llvm.org/D14348 llvm-svn: 254655	2015-12-03 22:17:26 +00:00
Chih-Hung Hsieh	0fd457f2bc	[X86] Part 1 to fix x86-64 fp128 calling convention. Almost all these changes are conditioned and only apply to the new x86-64 f128 type configuration, which will be enabled in a follow up patch. They are required together to make new f128 work. If there is any error, we should fix or revert them as a whole. These changes should have no impact to current configurations. * Relax type legalization checks to accept new f128 type configuration, whose TypeAction is TypeSoftenFloat, not TypeLegal, but also has TLI.isTypeLegal true. * Relax GetSoftenedFloat to return in some cases f128 type SDValue, which is TLI.isTypeLegal but not "softened" to i128 node. * Allow customized FABS, FNEG, FCOPYSIGN on new f128 type configuration, to generate optimized bitwise operators for libm functions. * Enhance related Lower* functions to handle f128 type. * Enhance DAGTypeLegalizer::run, SoftenFloatResult, and related functions to keep new f128 type in register, and convert f128 operators to library calls. * Fix Combiner, Emitter, Legalizer routines that did not handle f128 type. * Add ExpandConstant to handle i128 constants, ExpandNode to handle ISD::Constant node. * Add one more parameter to getCommonSubClass and firstCommonClass, to guarantee that returned common sub class will contain the specified simple value type. This extra parameter is used by EmitCopyFromReg in InstrEmitter.cpp. * Fix infinite loop in getTypeLegalizationCost when f128 is the value type. * Fix printOperand to handle null operand. * Enhance ISD::BITCAST node to handle f128 constant. * Expand new f128 type for BR_CC, SELECT_CC, SELECT, SETCC nodes. * Enhance X86AsmPrinter to emit f128 values in comments. Differential Revision: http://reviews.llvm.org/D15134 llvm-svn: 254653	2015-12-03 22:02:40 +00:00
Easwaran Raman	b4cc0435ff	Interface to attach maximum function count from PGO to module as module flags. This provides interface to get and set maximum function counts to Module. This would allow things like determination of function hotness. The actual setting of this max function count will have to be done in the frontend. Differential Revision: http://reviews.llvm.org/D15003 llvm-svn: 254647	2015-12-03 20:57:37 +00:00
Rafael Espindola	85d236a563	Delete dead code. llvm-svn: 254609	2015-12-03 14:35:15 +00:00
Andy Gibbs	eb7130cedf	Fix class SCEVPredicate has virtual functions and accessible non-virtual destructor. It is not enough to simply make the destructor virtual since there is a g++ 4.7 issue (see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53613) that throws the error "looser throw specifier for ... overridding ~SCEVPredicate() noexcept". llvm-svn: 254592	2015-12-03 08:20:20 +00:00
Matthias Braun	809ea1300c	Revert "ScheduleDAGInstrs: Rework schedule graph builder." This works mostly fine but breaks some stage 1 builders when compiling compiler-rt on i386. Revert for further investigation as I can't see an obvious cause/fix. This reverts commit r254577. llvm-svn: 254586	2015-12-03 03:01:10 +00:00
Mehdi Amini	94623d2769	Adapt comment and rename variable in ModuleLinker to describe more accurately the actual use. Thanks Sean Silva for the suggestion. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254582	2015-12-03 02:37:30 +00:00
Mehdi Amini	07b85fee55	Remove "ExportingModule" from ThinLTO Index (NFC) There is no real reason the index has to have the concept of an exporting Module. We should be able to have one single unique instance of the Index, and it should be read-only after creation for the whole ThinLTO processing. The linker plugin should be able to process multiple modules (in parallel or in sequence) with the same index. The only reason the ExportingModule was present seems to be to implement hasExportedFunctions() that is used by the Module linker to decide what to do with the current Module. For now I replaced it with a query to the map of Modules path to see if this module was declared in the Index and consider that if it is the case then it is probably exporting function. On the long term the Linker interface needs to evolve and this call should not be needed anymore. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254581	2015-12-03 02:37:23 +00:00
Matthias Braun	c7931c150a	ScheduleDAGInstrs: Rework schedule graph builder. The new algorithm remembers the uses encountered while walking backwards until a matching def is found. Contrary to the previous version this: - Works without LiveIntervals being available - Allows to increase the precision to subregisters/lanemasks (not used for now) The changes in the AMDGPU tests are necessary because the R600 scheduler is not stable with respect to the order of nodes in the ready queues. Differential Revision: http://reviews.llvm.org/D9068 llvm-svn: 254577	2015-12-03 02:05:27 +00:00
David Majnemer	f6b0a5ec47	Rename a header guard to be more appropriate llvm-svn: 254566	2015-12-02 23:28:27 +00:00
David Majnemer	56dee65385	Move EH-specific helper functions to a more appropriate place No functionality change is intended. llvm-svn: 254562	2015-12-02 23:06:39 +00:00
Xinliang David Li	1377f309a0	[PGO] Allow input value node list to be null This is to handle the case when vp node linked list array is laziliy initialized at runtime llvm-svn: 254551	2015-12-02 21:47:43 +00:00
Alexey Samsonov	ddcffafd2c	[PowerPC] Remove wild call to RegScavenger::initRegState(). This call should in fact be made by RegScavenger::enterBasicBlock() called below. The first call does nothing except for triggering UB, indicated by UBSan (passing nullptr to memset()). llvm-svn: 254548	2015-12-02 21:25:28 +00:00
Fiona Glaser	9540086d5a	Scheduler / Regalloc: use unique_ptr[] instead of std::vector vector.resize() is significantly slower than memset in many STLs and the cost of initializing these vectors is significant on targets with many registers. Since we don't need the overhead of a vector, use a simple unique_ptr instead. llvm-svn: 254526	2015-12-02 18:32:59 +00:00
Nathan Slingerland	625b1f4b92	[llvm-profdata] Change instr prof counter overflow to saturate rather than discard Summary: This changes overflow handling during instrumentation profile merge. Rathar than throwing away records that would result in counter overflow, merged counts are instead clamped to the maximum representable value. A warning about counter overflow is still surfaced to the user as before. Reviewers: dnovillo, davidxl, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14893 llvm-svn: 254525	2015-12-02 18:19:24 +00:00
Tim Northover	a44eae5e57	AArch64: use ldxp/stxp pair to implement 128-bit atomic loads. The ARM ARM is clear that 128-bit loads are only guaranteed to have been atomic if there has been a corresponding successful stxp. It's less clear for AArch32, so I'm leaving that alone for now. llvm-svn: 254524	2015-12-02 18:12:57 +00:00
Andy Gibbs	cf3b9350a7	Rollback r254508 and r254511 to fix buildbots llvm-svn: 254513	2015-12-02 14:36:48 +00:00
Andy Gibbs	f2eef0b5af	Fix buildbots broken by r254508 g++ 4.7 does not allow an inline defaulted virtual destructor to be overridden, giving the error "looser throw specifier for ... overridding ~SCEVPredicate() noexcept (true)" (see https://gcc.gnu.org/bugzilla/show_bug.cgi?id=53613). The work-around given in the bug report above has been utilised here. llvm-svn: 254511	2015-12-02 14:22:18 +00:00
Andy Gibbs	302a46438a	Fix class SCEVPredicate has virtual functions and accessible non-virtual destructor llvm-svn: 254508	2015-12-02 13:41:24 +00:00
Christof Douma	b1205456e7	[AArch64]: Add support for Cortex-A35 Adds support for the new Cortex-A35 ARMv8-A core. llvm-svn: 254503	2015-12-02 11:53:44 +00:00
Asaf Badouh	d6d08d5567	[X86][AVX512] add comi with Sae add builtin_ia32_vcomisd and builtin_ia32_vcomisd Differential Revision: http://reviews.llvm.org/D14331 llvm-svn: 254493	2015-12-02 08:17:51 +00:00
Akira Hatanaka	a676a66323	[AttributeSet] Overload AttributeSet::addAttribute to reduce compile time. The new overloaded function is used when an attribute is added to a large number of slots of an AttributeSet (for example, to function parameters). This is much faster than calling AttributeSet::addAttribute once per slot, because AttributeSet::getImpl (which calls FoldingSet::FIndNodeOrInsertPos) is called only once per function instead of once per slot. With this commit, clang compiles a file which used to take over 22 minutes in just 13 seconds. rdar://problem/23581000 Differential Revision: http://reviews.llvm.org/D15085 llvm-svn: 254491	2015-12-02 06:58:49 +00:00
David Blaikie	3719678673	[llvm-dwp] Emit a rather fictional debug_cu_index This is very rudimentary support for debug_cu_index, but it is enough to allow llvm-dwarfdump to find the offsets for contributions and correctly dump debug_info. It will need to actually find the real signature of the unit and build the real hash table with the right number of buckets, as per the DWP specification. It will also need to be expanded to cover the tu_index as well. llvm-svn: 254489	2015-12-02 06:21:34 +00:00
David Blaikie	4fd6a24d41	DebugInfo\DWARF: Privatize some accidentally public members llvm-svn: 254488	2015-12-02 06:21:28 +00:00
Mehdi Amini	34766825ef	Change ModuleLinker to take a set of GlobalValues to import instead of a single one For efficiency reason, when importing multiple functions for the same Module, we can avoid reparsing it every time. Differential Revision: http://reviews.llvm.org/D15102 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254486	2015-12-02 04:34:28 +00:00
Mehdi Amini	1909ee53b2	Modify FunctionImport to take a callback to load modules When linking static archive, there is no individual module files to load. Instead they can be mmap'ed and could be initialized from a buffer directly. The callback provide flexibility to override the scheme for loading module from the summary. Differential Revision: http://reviews.llvm.org/D15101 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254479	2015-12-02 02:00:29 +00:00
Xinliang David Li	c258245ddf	Define member operator delete For the struct with trailing objects, define a member operator delete. Without this, the program will fail when -fsized-deallocation option is used where the wrong size will be passed to the global delete operator. llvm-svn: 254471	2015-12-01 23:05:27 +00:00
Xinliang David Li	5715dafdb5	[PGO] Add support for reading multiple versions of indexed profile format profile data Profile readers using incompatible on-disk hash table format can now share the same implementation and interfaces. Differential Revision: http://reviews.llvm.org/D15100 llvm-svn: 254458	2015-12-01 20:26:26 +00:00
Rafael Espindola	e3fda2ca99	Use references now that it is natural to do so. The linker never takes ownership of a module or changes which module it is refering to, making it natural to use references. llvm-svn: 254449	2015-12-01 19:50:54 +00:00
Rafael Espindola	8c37192246	Delete dead code. llvm-svn: 254436	2015-12-01 18:50:35 +00:00
Rafael Espindola	876cd0cb5e	Use a forwarding constructor instead of an init method. llvm-svn: 254435	2015-12-01 18:46:19 +00:00
Rafael Espindola	49b4ffd0b8	Delete the setModule method from the Linker. It was only used from LTO for a debug feature, and LTO can just create another linker. It is pretty odd to have a method to reset the module in the middle of a link. It would make IdentifiedStructTypes inconsistent with the Module for example. llvm-svn: 254434	2015-12-01 18:41:30 +00:00
Elena Demikhovsky	ce52715df6	Fixed a failure in cost calculation for vector GEP Cost calculation for vector GEP failed with due to invalid cast to GEP index operand. The bug is fixed, added a test. http://reviews.llvm.org/D14976 llvm-svn: 254408	2015-12-01 12:08:36 +00:00
Yury Gribov	8eadab0fdb	Introduce new @llvm.get.dynamic.area.offset.i{32, 64} intrinsics. The @llvm.get.dynamic.area.offset.* intrinsic family is used to get the offset from native stack pointer to the address of the most recent dynamic alloca on the caller's stack. These intrinsics are intendend for use in combination with @llvm.stacksave and @llvm.restore to get a pointer to the most recent dynamic alloca. This is useful, for example, for AddressSanitizer's stack unpoisoning routines. Patch by Max Ostapenko. Differential Revision: http://reviews.llvm.org/D14983 llvm-svn: 254404	2015-12-01 11:40:55 +00:00
Oliver Stannard	c41a38aefa	[ARM] Add ARMv8.2-A to TargetParser Add ARMv8.2-A to TargetParser, so that it can be used by the clang command-line options and the .arch directive. Most testing of this will be done in clang, checking that the command-line options that this enables work. Differential Revision: http://reviews.llvm.org/D15037 llvm-svn: 254400	2015-12-01 10:33:56 +00:00
Sanjoy Das	1a33637ac1	Introduce a range version of std::find, and use in SCEV Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15064 llvm-svn: 254391	2015-12-01 07:49:27 +00:00
Sanjoy Das	e90f281076	Introduce a range version of std::any_of, and use it in SCEV Reviewers: dblaikie, pcc Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15063 llvm-svn: 254390	2015-12-01 07:49:23 +00:00
Cong Hou	5d51a489ae	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. (This is the second attempt to submit this patch. The first caused two assertion failures and was reverted. See https://llvm.org/bugs/show_bug.cgi?id=25687) The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254377	2015-12-01 05:29:22 +00:00
Matthias Braun	c3639e325e	RegisterPressure: Remove support for recede()/advance() at MBB boundaries Nobody was checking the returnvalue of recede()/advance() so we can simply replace this code with asserts. llvm-svn: 254371	2015-12-01 04:20:04 +00:00
Matthias Braun	57f3cb5cfc	RegisterPressure: There is no need to make getCurSlot() public llvm-svn: 254370	2015-12-01 04:20:01 +00:00
Matthias Braun	d05b1a39de	RegisterPressure: There is no need to make discoverLive{In\|Out} public llvm-svn: 254369	2015-12-01 04:19:58 +00:00
Hans Wennborg	6d0b969988	Revert r254348: "Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces." and the follow-up r254356: "Fix a bug in MachineBlockPlacement that may cause assertion failure during BranchProbability construction." Asserts were firing in Chromium builds. See PR25687. llvm-svn: 254366	2015-12-01 03:49:42 +00:00
David Blaikie	0483eddbd1	[llvm-dwp] Initial partial prototype This just concatenates the common DWP sections without doing any of the fancy DWP things like: 1) update str_offsets 2) deduplicating strings 3) merging/creating cu/tu_index Patches for these will follow shortly. (also not sure about target triple/object file type for this tool - do I really need a whole triple just to write an object file that contains purely static/hardcoded bytes in each section? & I guess I should just pick it based on the first input, maybe, rather than hardcoding for now - but we only produce .dwo on ELF platforms with objcopy for now anyway) llvm-svn: 254355	2015-12-01 00:48:39 +00:00
Evgeniy Stepanov	154021c8a2	[safestack] Protect byval function arguments. Detect unsafe byval function arguments and move them to the unsafe stack. llvm-svn: 254353	2015-12-01 00:40:05 +00:00
Cong Hou	b228d0caa6	Replace all weight-based interfaces in MBB with probability-based interfaces, and update all uses of old interfaces. The patch in http://reviews.llvm.org/D13745 is broken into four parts: 1. New interfaces without functional changes (http://reviews.llvm.org/D13908). 2. Use new interfaces in SelectionDAG, while in other passes treat probabilities as weights (http://reviews.llvm.org/D14361). 3. Use new interfaces in all other passes. 4. Remove old interfaces. This patch is 3+4 above. In this patch, MBB won't provide weight-based interfaces any more, which are totally replaced by probability-based ones. The interface addSuccessor() is redesigned so that the default probability is unknown. We allow unknown probabilities but don't allow using it together with known probabilities in successor list. That is to say, we either have a list of successors with all known probabilities, or all unknown probabilities. In the latter case, we assume each successor has 1/N probability where N is the number of successors. An assertion checks if the user is attempting to add a successor with the disallowed mixed use as stated above. This can help us catch many misuses. All uses of weight-based interfaces are now updated to use probability-based ones. Differential revision: http://reviews.llvm.org/D14973 llvm-svn: 254348	2015-12-01 00:02:51 +00:00
Paul Robinson	6a55b6dd41	Have 'optnone' respect the -fast-isel=false option. This is primarily useful for debugging optnone v. ISel issues. Differential Revision: http://reviews.llvm.org/D14792 llvm-svn: 254335	2015-11-30 21:56:16 +00:00
Sanjoy Das	3f2c3a2acc	[ADT] Fix typo in comment llvm-svn: 254278	2015-11-30 01:24:17 +00:00
Sanjoy Das	edaa1479cf	Fix out of bounds access in hasStructRetAttr llvm-svn: 254273	2015-11-29 23:15:43 +00:00
Craig Topper	233dd30406	[X86] int_x86_avx2_permps and X86ISD::VPERMV should take an integer vector for its shuffle indices. llvm-svn: 254269	2015-11-29 22:53:22 +00:00
Davide Italiano	85963c8ad6	[SimplifyLibCalls] Tranform log(pow(x, y)) -> ylog(x). This one is enabled only under -ffast-math. There are cases where the difference between the value computed and the correct value is huge even for ffast-math, e.g. as Steven pointed out: x = -1, y = -4 log(pow(-1), 4) = 0 4log(-1) = NaN I checked what GCC does and apparently they do the same optimization (which result in the dramatic difference). Future work might try to make this (slightly) less worse. Differential Revision: http://reviews.llvm.org/D14400 llvm-svn: 254263	2015-11-29 20:58:04 +00:00
Xinliang David Li	2f7575a9de	Minor code cleanups - Add const keyword - fix code comments - move forward decl to the common file llvm-svn: 254244	2015-11-29 04:52:34 +00:00
Xinliang David Li	1c888b8d73	[PGO] Move value profile format related structures and APIs to common file This is the last step to enable profile runtime to share the same value prof data format and reader/writer code with llvm host tools. The VP related data structures are moved to a section in InstrProfData.inc enabled with macro INSTR_PROF_VALUE_PROF_DATA, and common API implementations are enabled with INSTR_PROF_COMMON_API_IMPL. There should be no functional change. llvm-svn: 254235	2015-11-28 19:07:09 +00:00
Jonas Paulsson	4e06f54193	[Stack realignment] Handling of aligned allocas. This patch implements dynamic realignment of stack objects for targets with a non-realigned stack pointer. Behaviour in FunctionLoweringInfo is changed so that for a target that has StackRealignable set to false, over-aligned static allocas are considered to be variable-sized objects and are handled with DYNAMIC_STACKALLOC nodes. It would be good to group aligned allocas into a single big alloca as an optimization, but this is yet todo. SystemZ benefits from this, due to its stack frame layout. New tests SystemZ/alloca-03.ll for aligned allocas, and SystemZ/alloca-04.ll for "no-realign-stack" attribute on functions. Review and help from Ulrich Weigand and Hal Finkel. llvm-svn: 254227	2015-11-28 11:02:32 +00:00
Xinliang David Li	8167574a24	[PGO] Add return code for vp rt record init routine to indicate error condition llvm-svn: 254220	2015-11-28 05:47:34 +00:00
Xinliang David Li	16f0d8f3a9	[PGO] Allow value profile writer interface to allocated target buffer Raw profile writer needs to write all data of one kind in one continuous block, so the buffer needs to be pre-allocated and passed to the writer method in pieces for function profile data. The change adds the support for raw value data writing. llvm-svn: 254219	2015-11-28 05:37:01 +00:00
Xinliang David Li	c87d1d6b3e	Function name cleanup (NFC) llvm-svn: 254218	2015-11-28 05:06:00 +00:00
Xinliang David Li	7cdfa09575	[PGO] Extract VP data integrity check code into a helper function (NFC) llvm-svn: 254217	2015-11-28 04:56:07 +00:00
Keno Fischer	39e3d1456f	[autoconf] Fix MinGW build This is the autoconf analog of r251201. I realize autoconf is deprecated, but while it's in tree, it should at least be kept working. Also add the deprecation message to configure.ac such that AutoRegen actually picks ip up. llvm-svn: 254215	2015-11-28 00:54:12 +00:00
Rafael Espindola	4a063d8813	Simplify the linking of recursive data. Now the ValueMapper has two callbacks. The first one maps the declaration. The ValueMapper records the mapping and then materializes the body/initializer. llvm-svn: 254209	2015-11-27 20:28:19 +00:00
Charlie Turner	18cf3a8580	[LoopVectorize] Use MapVector rather than DenseMap for MinBWs. The order in which instructions are truncated in truncateToMinimalBitwidths effects code generation. Switch to a map with a determinisic order, since the iteration order over a DenseMap is not defined. This code is not hot, so the difference in container performance isn't interesting. Many thanks to David Blaikie for making me aware of MapVector! Fixes PR25490. Differential Revision: http://reviews.llvm.org/D14981 llvm-svn: 254179	2015-11-26 20:39:51 +00:00
Krzysztof Parzyszek	f494c1982c	[Hexagon] Hexagon V60 HVX intrinsic defintions Author: Ron Lieberman <ronl@codeaurora.org> llvm-svn: 254165	2015-11-26 16:54:33 +00:00
Craig Topper	e404cbe233	[SelectionDAG] Add a SDTCisSameSizeAs type constraint that can be used to ensure vector widths match even if the element size and count don't. llvm-svn: 254138	2015-11-26 07:02:18 +00:00
Craig Topper	4b764921f1	Add type constraints to masked_load/masked_store to ensure the mask vector has the same number of elements as the other vectors. llvm-svn: 254137	2015-11-26 06:30:42 +00:00
Tom Stellard	814127be3e	AMDGPU: Fix typo llvm-svn: 254120	2015-11-26 02:04:11 +00:00
Sanjoy Das	99ca015392	[OperandBundles] Treat "deopt" operand bundles specially Teach LLVM optimize to more precisely in the presence of "deopt" operand bundles. "deopt" operand bundles imply that the call they're attached to is at least `readonly` (i.e. they don't imply clobber semantics), and they don't capture their bundle operands. llvm-svn: 254118	2015-11-26 01:16:05 +00:00
Tom Stellard	eb7e999b29	AMDGPU: Add llvm.amdgcn.dispatch.ptr intrinsic Summary: This returns a pointer to the dispatch packet, which can be used to load information about the kernel dispach. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D14898 llvm-svn: 254116	2015-11-26 00:43:29 +00:00
Xinliang David Li	df875576b5	Fix a typo introduced in previous patches llvm-svn: 254112	2015-11-26 00:02:23 +00:00
Xinliang David Li	739b63a55d	[PGO] Implement ValueProfiling Closure interfaces for runtime value profile data This is one of the many steps to commonize value profiling support between profile runtime and compiler/llvm tools. After this change, profiler runtime now can share the same C APIs to do VP serialization/deseriazation with LLVM host tools (and produces value data in identical format between indexed and raw profile). It is not yet enabled in profiler runtime yet. Also added a unit test case to test runtime profile data serialization/deserialization interfaces implemented using common closure code. llvm-svn: 254110	2015-11-25 23:31:18 +00:00
Artyom Skrobov	3803dae0a6	Expose isXxxConstant() functions from SelectionDAGNodes.h (NFC) Summary: Many target lowerings copy-paste the code to test SDValues for known constants. This code can instead be shared in SelectionDAG.cpp, and reused in the targets. Reviewers: MatzeB, andreadb, tstellarAMD Subscribers: arsenm, jyknight, llvm-commits Differential Revision: http://reviews.llvm.org/D14945 llvm-svn: 254085	2015-11-25 19:41:11 +00:00
Eric Christopher	5f84aed4f6	Fix some places where we were assuming that memory type had been legalized to a simple type when lowering a truncating store of a vector type. In this case for an EVT we'll return Expand as we should in all of the cases anyhow. The testcase triggered at the one in VectorLegalizer::LegalizeOp, inspection found the rest. llvm-svn: 254061	2015-11-25 09:11:53 +00:00
Xinliang David Li	29597bc958	[PGO] Convert InstrProfRecord based serialization methods to use common C methods 1. Convert serialization methods using InstrProfRecord as source into C (impl) interfaces using Closure. 2. Reimplement InstrProfRecord serialization method to use new C interface as dummy wrapper. Now it is ready to implement wrapper for runtime value profile data. (The new code need better source location -- but not changed in this patch to minimize diffs. ) llvm-svn: 254057	2015-11-25 06:23:38 +00:00
Xinliang David Li	960920a165	[PGO] convert a subset of C++ interfaces into C (for sharing) (NFC) llvm-svn: 254056	2015-11-25 04:29:24 +00:00
Xinliang David Li	03ec0b37d7	Add missing documentation. (NFC) llvm-svn: 254051	2015-11-25 01:13:44 +00:00

... 3 4 5 6 7 ...

25322 Commits