llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

Author	SHA1	Message	Date
David Majnemer	22cb4eb850	[OperandBundles] Have DeadArgElim play nice with operand bundles A call site's use of a Value might not correspond to an argument operand but to a bundle operand. llvm-svn: 256326	2015-12-23 09:58:36 +00:00
Philip Reames	e80a72c967	[GC] Make GCStrategy::isGCManagedPointer a type predicate not a value predicate [NFC] Reasons: 1) The existing form was a form of false generality. None of the implemented GCStrategies use anything other than a type. It's becoming more and more clear we're going to need some type of strong GC pointer in the type system and we shouldn't pretend otherwise at this point. 2) The API was awkward when applied to vectors-of-pointers. The old one could have been made to work, but calling isGCManagedPointer(Ty->getScalarType()) is much cleaner than the Value alternatives. 3) The rewriting implementation effectively assumes the type based predicate as well. We should be consistent. llvm-svn: 256312	2015-12-23 01:42:15 +00:00
Akira Hatanaka	6a7dbf68a2	Provide a way to specify inliner's attribute compatibility and merging. This reapplies r256277 with two changes: - In emitFnAttrCompatCheck, change FuncName's type to std::string to fix a use-after-free bug. - Remove an unnecessary install-local target in lib/IR/Makefile. Original commit message for r252949: Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 256304	2015-12-22 23:57:37 +00:00
Dan Gohman	922f754559	Add an OperandNamespace field to Target.td's Operand. For targets to add their own operand types as needed, as advertised in Operand's comment, they need to be able to specify an alternate namespace for OperandType names too. This matches the RegisterOperand class. llvm-svn: 256299	2015-12-22 23:37:37 +00:00
Akira Hatanaka	dfd76e927a	Revert r256277 and r256279. Some of the bots failed again. llvm-svn: 256280	2015-12-22 20:29:09 +00:00
Akira Hatanaka	fa235f0243	Provide a way to specify inliner's attribute compatibility and merging. This reapplies r252990 and r252949. I've added member function getKind to the Attr classes which returns the enum or string of the attribute. Original commit message for r252949: Provide a way to specify inliner's attribute compatibility and merging rules using table-gen. NFC. This commit adds new classes CompatRule and MergeRule to Attributes.td, which are used to generate code to check attribute compatibility and merge attributes of the caller and callee. rdar://problem/19836465 llvm-svn: 256277	2015-12-22 20:00:05 +00:00
Cong Hou	50c405416c	[BPI] Replace weights by probabilities in BPI. This patch removes all weight-related interfaces from BPI and replaces them by probability versions. With this patch, we won't use edge weight anymore in either IR or MC passes. Edge probability is a better representation in terms of CFG update and validation. Differential revision: http://reviews.llvm.org/D15519 llvm-svn: 256263	2015-12-22 18:56:14 +00:00
Manuel Jacob	3a4569b878	Remove deprecated llvm.experimental.gc.result.{int,float,ptr} intrinsics. Summary: These were deprecated 11 months ago when a generic llvm.experimental.gc.result intrinsic, which works for all types, was added. Reviewers: sanjoy, reames Subscribers: sanjoy, chenli, llvm-commits Differential Revision: http://reviews.llvm.org/D15719 llvm-svn: 256262	2015-12-22 18:44:45 +00:00
Vedant Kumar	4a1d86d7e2	[Support] Allow multiple paired calls to {start,stop}Timer() Differential Revision: http://reviews.llvm.org/D15619 Reviewed-by: rafael llvm-svn: 256258	2015-12-22 17:36:17 +00:00
Asaf Badouh	d891bbfe44	[X86][AVX512] Add rcp14 and rsqrt14 intrinsics Differential Revision: http://reviews.llvm.org/D15414 llvm-svn: 256237	2015-12-22 11:40:04 +00:00
David Majnemer	47d3d1e5ef	[MC, COFF] Support link /incremental conditionally Today, we always take into account the possibility that object files produced by MC may be consumed by an incremental linker. This results in us initializing fields which vary with time (TimeDateStamp) which harms hermetic builds (e.g. verifying a self-host went well) and produces sub-optimal code because we cannot assume anything about the relative position of functions within a section (call sites can get redirected through incremental linker thunks). Let's provide an MCTargetOption which controls this behavior so that we can disable this functionality if we know a-priori that the build will not rely on /incremental. llvm-svn: 256203	2015-12-21 22:09:27 +00:00
Amjad Aboud	8197f10787	Implemented Support of IA interrupt and exception handlers: http://lists.llvm.org/pipermail/cfe-dev/2015-September/045171.html Differential Revision: http://reviews.llvm.org/D15567 llvm-svn: 256155	2015-12-21 14:07:14 +00:00
Xinliang David Li	859c330e8f	[PGO] Improve Indexed Profile Reader efficiency With the support of value profiling added, the Indexed prof reader gets less efficient. The prof reader initialization used to be just reading the file header, but with VP support added, initialization needs to walk through all profile keys of ondisk hash table resulting in very poor locality and large memory increase (keys are stored together with the profile data in the mapped profile buffer). Even worse, when the reader is used by the compiler (not llvm-profdata too), the penalty becomes very high as compilation of each single module requires touching profile data buffer for the whole program. In this patch, the icall target values (MD5hash) are no longer eagerly converted back to name strings when the data is read into memory. New interface is added to the profile reader so that InstrProfSymtab can be lazily created for Indexed profile reader on-demand. Creating of the symtab is intended to be used by llvm-profdata tool for symbolic dumping of VP data. It can be used with compiler (for legacy out of tree uses) too but not recommended due to compile time and memory reasons mentioned above. Some other cleanups are also included: Function Addr to md5 map is now consolidated into InstrProfSymtab. InstrProfStringtab is no longer used and eliminated. llvm-svn: 256114	2015-12-20 06:22:13 +00:00
Xinliang David Li	6c1acd793c	Minor clean up -- move large single use method out of header(NFC) llvm-svn: 256113	2015-12-20 05:15:45 +00:00
Sanjoy Das	d080ee893d	Nonnull elements in OperandBundleCallSites are not all Instructions `CloneAndPruneIntoFromInst` sometimes RAUW's dead instructions with `undef` before erasing them (to avoid deleting instructions that still have uses). This changes the `WeakVH` in `OperandBundleCallSites` to hold an `undef`, and we need to guard against this eventuality in `llvm::InlineFunction`. llvm-svn: 256110	2015-12-19 22:40:28 +00:00
Rafael Espindola	2959fcb59a	Delete APIs that have been deprecated since 2010. llvm-svn: 256107	2015-12-19 21:42:07 +00:00
Rafael Espindola	552e7f96b1	Assert that we have all use/users in the getters. An error that is pretty easy to make is to use the lazy bitcode reader and then do something like if (V.use_empty()) The problem is that uses in unmaterialized functions are not accounted for. This patch adds asserts that all uses are known. llvm-svn: 256105	2015-12-19 20:03:23 +00:00
Vedant Kumar	358f3ea995	Re-reapply "[IR] Move optional data in llvm::Function into a hungoff uselist" Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Includes a fix to scrub value subclass data in dropAllReferences. Does not use binary literals. Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256095	2015-12-19 08:52:49 +00:00
Vedant Kumar	2e1a683bae	Revert "Reapply "[IR] Move optional data in llvm::Function into a hungoff uselist"" This reverts commit r256093. This broke lld-x86_64-win7 because of -Werror,-Wc++1y-extensions. llvm-svn: 256094	2015-12-19 08:48:43 +00:00
Vedant Kumar	c33a34516e	Reapply "[IR] Move optional data in llvm::Function into a hungoff uselist" Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Includes a fix to scrub value subclass data in dropAllReferences. Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256093	2015-12-19 08:29:51 +00:00
Xinliang David Li	26f18dd6bb	[PGO] Add hash to name mapping in InstrProfSymtab Creator and lookup interfaces are added to this symtab class. The new interfaces will be used by InstrProf Readers and writer. A unit test is also added for the new APIs. llvm-svn: 256092	2015-12-19 07:44:57 +00:00
Vedant Kumar	6843b30188	Revert "[IR] Move optional data in llvm::Function into a hungoff uselist" This reverts commit r256090. This broke llvm-clang-lld-x86_64-debian-fast. llvm-svn: 256091	2015-12-19 07:30:44 +00:00
Vedant Kumar	46b3967fa2	[IR] Move optional data in llvm::Function into a hungoff uselist Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256090	2015-12-19 07:08:56 +00:00
James Y Knight	fe15b9a096	Possibly fix MSVC compilation after r256054. I don't have any way to test MSVC compilation, but maybe this will fix the error: llvm/Support/TrailingObjects.h(286) : error C3210: 'TrailingObjectsBase' : access declaration can only be applied to a base class member llvm/Support/TrailingObjects.h(337) : see reference to class template instantiation 'llvm::TrailingObjects<BaseTy,TrailingTys...>' being compiled llvm/Support/TrailingObjects.h(286) : error C2602: 'llvm::trailing_objects_internal::TrailingObjectsBase::OverloadToken' is not a member of a base class of 'llvm::TrailingObjects<BaseTy,TrailingTys...>' llvm/Support/TrailingObjects.h(91) : see declaration of 'llvm::trailing_objects_internal::TrailingObjectsBase::OverloadToken' llvm-svn: 256068	2015-12-19 00:53:22 +00:00
Rafael Espindola	e392243a54	Deprecate a few C APIs. This deprecates: * LLVMParseBitcode * LLVMParseBitcodeInContext * LLVMGetBitcodeModuleInContext * LLVMGetBitcodeModule They are replaced with the functions with a 2 suffix which do not record a diagnostic. llvm-svn: 256065	2015-12-18 23:46:42 +00:00
Xinliang David Li	cf9d24ec84	[PGO] Cleanup: Move large member functions out of line (NFC) llvm-svn: 256058	2015-12-18 23:06:37 +00:00
James Y Knight	6a87ac0efb	Rewrite the TrailingObjects template to provide two new features: - Automatic alignment of the base type for the alignment requirements of the trailing types. - Support for an arbitrary number of trailing types, instead of only 1 or 2, by using a variadic template implementation. Upcoming commits to clang will take advantage of both of these features. Differential Revision: http://reviews.llvm.org/D12439 llvm-svn: 256054	2015-12-18 22:54:37 +00:00
Rafael Espindola	3db381761e	git-clang-format a region I am about to change. llvm-svn: 256048	2015-12-18 22:23:16 +00:00
Xinliang David Li	c749671813	[PGO] Simplify computehash interface (NFC) llvm-svn: 256047	2015-12-18 22:22:12 +00:00
Alexey Samsonov	e2d62e9cd1	[Symbolize] Improve the ownership of parsed objects. This code changes the way Symbolize handles parsed binaries: now parsed OwningBinary<Binary> is not broken into (binary, memory buffer) pair, and is just stored as-is in a cache. ObjectFile components of Mach-O universal binaries are also stored explicitly in a separate cache. Additionally, this change: * simplifies the code that parses/caches binaries: it's now done in a single place, not three different functions. * makes flush() method behave as expected, and actually clear the cached parsed binaries and objects. * fixes a dangling pointer issue described in http://reviews.llvm.org/D15638 llvm-svn: 256041	2015-12-18 22:02:14 +00:00
Cong Hou	d24cdaa414	Use getEdgeProbability() instead of getEdgeWeight() in BFI and remove getEdgeWeight() interfaces from MBPI. This patch removes all getEdgeWeight() interfaces from CodeGen directory. As getEdgeProbability() is a little more expensive than getEdgeWeight(), I will compose a patch soon in which BPI only stores probabilities instead of edge weights so that getEdgeProbability() will have O(1) time. Differential revision: http://reviews.llvm.org/D15489 llvm-svn: 256039	2015-12-18 21:53:24 +00:00
Rafael Espindola	096a36a6a6	Remove redundant argument. NFC. llvm-svn: 256031	2015-12-18 21:18:57 +00:00
Rafael Espindola	a8daff187a	Drop materializeAllPermanently. This inlines materializeAll into the only caller (materializeAllPermanently) and renames materializeAllPermanently to just materializeAll. llvm-svn: 256024	2015-12-18 20:13:39 +00:00
Rafael Espindola	4fe057fd6e	Drop support for dematerializing. It was only used on lib/Linker and the use was "dead" since it was used on a function the IRMover had just moved. llvm-svn: 256019	2015-12-18 19:57:26 +00:00
Pete Cooper	cda523f460	Revert "Improve DWARFDebugFrame::parse to also handle __eh_frame." This reverts commit r256008. It's breaking multiple buildbots, although it works for me locally. llvm-svn: 256013	2015-12-18 19:45:38 +00:00
Teresa Johnson	d80a73c588	Rename variables to reflect linker split (NFC) Renamed variables to be more reflective of whether they are an instance of Linker, IRLinker or ModuleLinker. Also fix a stale comment. llvm-svn: 256011	2015-12-18 19:28:59 +00:00
Eric Christopher	d315447cf7	Convert Arg, ArgList, and Option to dump() to dbgs() rather than errs(). Also add print() functions. Patch by Justin Lebar! llvm-svn: 256010	2015-12-18 18:55:26 +00:00
Eric Christopher	de13aaf4d1	Add a dump method for ArgList. Patch by Justin Lebar! llvm-svn: 256009	2015-12-18 18:55:22 +00:00
Pete Cooper	70c961d67d	Improve DWARFDebugFrame::parse to also handle __eh_frame. LLVM MC has single methods which can handle the output of EH frame and DWARF CIE's and FDE's. This code improves DWARFDebugFrame::parse to do the same for parsing. This also allows llvm-objdump to support the --dwarf=frames option which objdump supports. This option dumps the .eh_frame section using the new code in DWARFDebugFrame::parse. http://reviews.llvm.org/D15535 Reviewed by Rafael Espindola. llvm-svn: 256008	2015-12-18 18:51:08 +00:00
Teresa Johnson	a03b8bd4f9	[ThinLTO/LTO] Don't link in unneeded metadata Summary: Third patch split out from http://reviews.llvm.org/D14752. Only map in needed DISubroutine metadata (imported or otherwise linked in functions and other DISubroutine referenced by inlined instructions). This is supported for ThinLTO, LTO and llvm-link --only-needed, with associated tests for each one. Depends on D14838. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14843 llvm-svn: 256003	2015-12-18 17:51:37 +00:00
Rafael Espindola	bd88dc53fc	clang-format to reduce diff in another patch. llvm-svn: 255981	2015-12-18 03:04:52 +00:00
Vedant Kumar	80e7021204	Revert "[Option] Introduce Arg::print(raw_ostream&) and use llvm::dbgs" This reverts commit r255977. This is part of http://reviews.llvm.org/D15634. llvm-svn: 255978	2015-12-18 02:30:45 +00:00
Vedant Kumar	5d48dc63b5	[Option] Introduce Arg::print(raw_ostream&) and use llvm::dbgs llvm-svn: 255977	2015-12-18 02:27:52 +00:00
Eric Christopher	359dea2a6b	Reorganize the C API headers to improve build times. Type specific declarations have been moved to Type.h and error handling routines have been moved to ErrorHandling.h. Both are included in Core.h so nothing should change for projects directly including the headers, but transitive dependencies may be affected. llvm-svn: 255965	2015-12-18 01:46:52 +00:00
Cong Hou	72c4ebb415	[BranchProbability] Remove the restriction that known and unknown probabilities cannot coexist when being normalized. The current BranchProbability::normalizeProbabilities() forbids known and unknown probabilities to coexist in the list. This was once used to help capture probability exceptions but has caused some reported build failures (https://llvm.org/bugs/show_bug.cgi?id=25838). This patch removes this restriction by evenly distributing the complement of the sum of all known probabilities to unknown ones. We could still treat this as an abnormal behavior, but it is better to emit warnings in our future profile validator. Differential revision: http://reviews.llvm.org/D15548 llvm-svn: 255934	2015-12-17 22:27:07 +00:00
Rafael Espindola	a58ac64571	Drop functions that have been deprecated since 2010. These functions were deprecated in r97608. llvm-svn: 255927	2015-12-17 21:16:12 +00:00
Sanjoy Das	5292411857	[SCEV] Add and use SCEVConstant::getAPInt; NFCI llvm-svn: 255921	2015-12-17 20:28:46 +00:00
Teresa Johnson	0dce8d436c	[ThinLTO] Metadata linking for imported functions Summary: Second patch split out from http://reviews.llvm.org/D14752. Maps metadata as a post-pass from each module when importing complete, suturing up final metadata to the temporary metadata left on the imported instructions. This entails saving the mapping from bitcode value id to temporary metadata in the importing pass, and from bitcode value id to final metadata during the metadata linking postpass. Depends on D14825. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14838 llvm-svn: 255909	2015-12-17 17:14:09 +00:00
Andy Gibbs	d9ba2c9d06	Revert r254592 (virtual dtor in SCEVPredicate). Clang has better diagnostics in this case. It is not necessary therefore to change the destructor to avoid what is effectively an invalid warning in gcc. Instead, better handle the warning flags given to the compiler. llvm-svn: 255905	2015-12-17 16:43:53 +00:00
Tobias Grosser	d78a561416	DOTGraphTraits: Allow the decision to show a graph to consider the analysis The method processFunction() is called to decide if a graph should be shown for a certain function. To allow DOTGraphTraitViewers to take this decision based on the analysis results for the given function, we forward a reference to the analysis result. This will be used by Polly to only visualize functions where interesting loop regions have been detected. llvm-svn: 255889	2015-12-17 12:48:25 +00:00
Xinliang David Li	7fbf182d27	[PGO] InstrPGO and coverage code refactoring (NFC) Introduce a new class InstrProfSymtab to abstract the PGO symbol table for prof and coverage reader. The symtab is used to look up a function's PGO name using function keys. The first user of the class is CoverageMapping Reader. More will follow. llvm-svn: 255862	2015-12-17 00:53:37 +00:00
Rafael Espindola	4c6d9d0ecf	Use std::unique_ptr. NFC. llvm-svn: 255852	2015-12-16 23:49:14 +00:00
Rafael Espindola	f7a0054c75	Change linkInModule to take a std::unique_ptr. Passing in a std::unique_ptr should help find errors when the module is used after being linked into another module. llvm-svn: 255842	2015-12-16 23:16:33 +00:00
Rafael Espindola	5428b16a54	Drop an unnecessary use of writev. It looks like the code this patch deletes is based on a misunderstanding of what guarantees writev provides. In particular, writev with 1 iovec is not "more atomic" than a write. Testing on OS X shows that both write and writev from multiple processes can be intermixed. llvm-svn: 255837	2015-12-16 22:59:06 +00:00
Rafael Espindola	666f0e4525	Simplify memory management with std::unique_ptr. llvm-svn: 255831	2015-12-16 22:28:34 +00:00
Ahmed Bougacha	34e91bd86a	[CodeGen] Make MachineInstrBuilder::copyImplicitOps const. NFC. This matches the other MIB methods, none of which modify the builder. Without this, we can't chain copyImplicitOps. Also reformat the few users, in PPCEarlyReturn. llvm-svn: 255828	2015-12-16 22:15:30 +00:00
Nathan Slingerland	648ea4f74f	[PGO] Handle and report overflow during profile merge for all types of data Summary: Surface counter overflow when merging profile data. Merging still occurs on overflow but counts saturate to the maximum representable value. Overflow is reported to the user. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15547 llvm-svn: 255825	2015-12-16 21:45:43 +00:00
Krzysztof Parzyszek	8feeb85759	[Hexagon] Update e_flags in the ELF definitions llvm-svn: 255820	2015-12-16 20:57:07 +00:00
Manman Ren	9744c391a9	CXX_FAST_TLS calling convention: target independent portion. Update supportSplitCSR's interface to take machine function instead of the calling convention. Review comments for http://reviews.llvm.org/D15341 llvm-svn: 255818	2015-12-16 20:45:48 +00:00
Paul Robinson	9300845b3d	Set debugger tuning from TargetOptions (NFC) Differential Revision: http://reviews.llvm.org/D15427 llvm-svn: 255810	2015-12-16 19:58:30 +00:00
Justin Bogner	58647df890	LPM: Make callers of LPM.deleteLoopFromQueue update LoopInfo directly. NFC As of r255720, the loop pass manager will DTRT when passes update the loop info for removed loops, so they no longer need to reach into LPPassManager APIs to do this kind of transformation. This change very nearly removes the need for the LPPassManager to even be passed into loop passes - the only remaining pass that uses the LPM argument is LoopUnswitch. llvm-svn: 255797	2015-12-16 18:40:20 +00:00
Tom Stellard	88b1e5fba9	MachineScheduler: Add a target hook for deciding which RegPressure sets to increase Summary: This patch adds a function called getRegPressureSetScore() to TargetRegisterInfo. The MachineScheduler uses this when comparing instructions that increase the register pressure of different sets to determine which set is safer to increase. This hook is useful for GPU targets where the number of registers in the class is not the best metric for determining which pressure set is safer to increase. Future work may include adding more parameters to this function, like for example, the current pressure level of the set or the amount that the pressure will be increased/decreased. Reviewers: qcolombet, escha, arsenm, atrick, MatzeB Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14806 llvm-svn: 255795	2015-12-16 18:31:01 +00:00
Krzysztof Parzyszek	063469eb6a	[Packetizer] Add a check whether an instruction should be packetized now Add a function VLIWPacketizerList::shouldAddToPacket, which will allow specific implementations to decide if it is profitable to add given instruction to the current packet. llvm-svn: 255780	2015-12-16 16:38:16 +00:00
Vaivaswatha Nagaraj	a478a7d3d6	Add InaccessibleMemOnly and inaccessibleMemOrArgMemOnly attributes Summary: This patch introduces two new function attributes InaccessibleMemOnly: This attribute indicates that the function may only access memory that is not accessible by the program/IR being compiled. This is a weaker form of ReadNone. inaccessibleMemOrArgMemOnly: This attribute indicates that the function may only access memory that is either not accessible by the program/IR being compiled, or is pointed to by its pointer arguments. This is a weaker form of ArgMemOnly Test cases have been updated. This revision uses this (`d001932f3a`) as reference. Reviewers: jmolloy, hfinkel Subscribers: reames, joker.eph, llvm-commits Differential Revision: http://reviews.llvm.org/D15499 llvm-svn: 255778	2015-12-16 16:16:19 +00:00
Keno Fischer	1df6a9d651	[SectionMemoryManager] Make better use of virtual memory Summary: On Windows, the allocation granularity can be significantly larger than a page (64K), so with many small objects, just clearing the FreeMem list rapidly leaks quite a bit of virtual memory space (if not rss). Fix that by only removing those parts of the FreeMem blocks that overlap pages for which we are applying memory permissions, rather than dropping the FreeMem blocks entirely. Reviewers: lhames Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15202 llvm-svn: 255760	2015-12-16 11:13:23 +00:00
Vikram TV	5d75d984d9	Recommit LiveDebugValues pass after fixing a couple of minor issues. llvm-svn: 255759	2015-12-16 11:09:48 +00:00
George Burgess IV	0d80a00b30	Minor cleanup of Attribute code. NFC. llvm-svn: 255751	2015-12-16 05:21:02 +00:00
Justin Bogner	5c61081fe6	LPM: Simplify how passes mark loops for deletion. NFC When a pass removes a loop it currently has to reach up into the LPPassManager's internals to update the state of the iteration over loops. This reverse dependency results in a pretty awkward interplay of the LPPassManager and its Passes. Here, we change this to instead keep track of when a loop has become "unlooped" in the Loop objects themselves, then the LPPassManager can check this and manipulate its own state directly. This opens the door to allow most of the loop passes to work without a backreference to the LPPassManager. I've kept passes calling the LPPassManager::deleteLoopFromQueue API now so I could put an assert in to prove that this is NFC, but a later pass will update passes just to preserve the LoopInfo directly and stop referencing the LPPassManager completely. llvm-svn: 255720	2015-12-16 00:01:02 +00:00
Reid Kleckner	58fdd5df47	[WinEH] Make llvm.x86.seh.recoverfp work on x64 It adjusts from RSP-after-prologue to RBP, which is what SEH filters need to do before they can use llvm.localrecover. Fixes SEH filter captures, which were broken in r250088. Issue reported by Alex Crichton. llvm-svn: 255707	2015-12-15 23:40:58 +00:00
Evgeniy Stepanov	39e538e166	Cross-DSO control flow integrity (LLVM part). An LTO pass that generates a __cfi_check() function that validates a call based on a hash of the call-site-known type and the target pointer. llvm-svn: 255693	2015-12-15 23:00:08 +00:00
Reid Kleckner	ba7cf44fda	Wrap include of <future> in some warning suppression pragmas Eventually we may need to sink this include to the .cpp file or something to support LLVM_ENABLE_THREADS=OFF, but this solves my immediate problem of fixing the build. llvm-svn: 255682	2015-12-15 22:10:30 +00:00
Eric Christopher	3b41a7181d	Add support for the .debug_macro section of the forthcoming DWARF 5 spec. Patch by B. Sivachandra Reddy! llvm-svn: 255679	2015-12-15 21:50:27 +00:00
Reid Kleckner	f034b3c900	Fix clang-cl self-host with MSVC 2013 STL std::bind implementation llvm-svn: 255678	2015-12-15 21:41:58 +00:00
Reid Kleckner	c8a81bcc44	[WinEH] Remove unused intrinsic llvm.x86.seh.restoreframe We can clean this up now that we have the X86 CATCHRET instruction to restore the FP, SP, and BP. llvm-svn: 255677	2015-12-15 21:41:34 +00:00
David Majnemer	608538dccc	[WinEH] Use operand bundles to describe call sites SimplifyCFG allows tail merging with code which terminates in unreachable which, in turn, makes it possible for an invoke to end up in a funclet which it was not originally part of. Using operand bundles on invokes allows us to determine whether or not an invoke was part of a funclet in the source program. Furthermore, it allows us to unambiguously answer questions about the legality of inlining into call sites which the personality may have trouble with. Differential Revision: http://reviews.llvm.org/D15517 llvm-svn: 255674	2015-12-15 21:27:27 +00:00
Xinliang David Li	4bfffacc8b	Coverage code refactoring /NFC llvm-svn: 255670	2015-12-15 19:44:45 +00:00
Justin Bogner	621a2ef540	LPM: Stop threading `Pass *` through all of the loop utility APIs. NFC A large number of loop utility functions take a `Pass *` and reach into it to find out which analyses to preserve. There are a number of problems with this: - The APIs have access to pretty well any Pass state they want, so it's hard to tell what they may or may not do. - Other APIs have copied these and pass around a `Pass *` even though they don't even use it. Some of these just hand a nullptr to the API since the callers don't even have a pass available. - Passes in the new pass manager don't work like the current ones, so the APIs can't be used as is there. Instead, we should explicitly thread the analysis results that we actually care about through these APIs. This is both simpler and more reusable. llvm-svn: 255669	2015-12-15 19:40:57 +00:00
Nathan Slingerland	b95c2eec31	[llvm-profdata] Add support for weighted merge of profile data (2nd try) Summary: This change adds support for specifying a weight when merging profile data with the llvm-profdata tool. Weights are specified by using the --weighted-input=<weight>,<filename> option. Input files not specified with this option (normal positional list after options) are given a default weight of 1. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: davidxl, dnovillo, bogner, silvas Subscribers: silvas, davidxl, llvm-commits Differential Revision: http://reviews.llvm.org/D15306 llvm-svn: 255659	2015-12-15 17:37:09 +00:00
Tom Stellard	165a990abf	AMDGPU/SI: Add llvm.amdgcn.mbcnt.* intrinsics Summary: These are meant to be used instead of the llvm.SI.tid intrinsic which will be deprecated at some point. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15475 llvm-svn: 255652	2015-12-15 17:02:52 +00:00
Tom Stellard	8d8cf53f5c	AMDGPU/SI: Add llvm.amdgcn.v.interp.p[12] intrinsics Summary: These are meant to be used instead of the llvm.SI.fs.interp intrinsic which will be deprecated at some point. Reviewers: arsenm Subscribers: arsenm, llvm-commits Differential Revision: http://reviews.llvm.org/D15474 llvm-svn: 255651	2015-12-15 17:02:49 +00:00
Michael Kuperstein	5b09c8a71e	Do not try to use i8 and i16 versions of FP_TO_U/SINT soft float library calls It appears that neither compiler-rt nor the gnu soft-float libraries actually implement these conversions. Instead of emitting calls to library functions that don't exist, handle it similarly to the way we handle i8 -> float and i16 -> float conversions: call the i32 library function, and adjust the type. Differential Revision: http://reviews.llvm.org/D15151 llvm-svn: 255643	2015-12-15 12:55:50 +00:00
Elena Demikhovsky	61fc55d5da	Type legalizer for masked gather and scatter intrinsics. Full type legalizer that works with all vector lengths - from 2 to 16, (i32, i64, float, double). This intrinsic, for example void @llvm.masked.scatter.v2f32(<2 x float>%data , <2 x float*>%ptrs , i32 align , <2 x i1>%mask ) requires type widening for data and type promotion for mask. Differential Revision: http://reviews.llvm.org/D13633 llvm-svn: 255629	2015-12-15 08:40:41 +00:00
Craig Topper	466a91c686	Use CmpInst::Predicate instead of 'unsigned short' in some places. NFC llvm-svn: 255623	2015-12-15 06:11:33 +00:00
Justin Bogner	c058f3a850	LoopUtils: Remove defaults for arguments that are always specified. NFC llvm-svn: 255620	2015-12-15 05:52:13 +00:00
Teresa Johnson	ff38b7b132	Fix template parameter pack handling in ThreadPool Fixes passing of template parameter pack via std::forward and add unittest. llvm-svn: 255617	2015-12-15 04:44:02 +00:00
Weiming Zhao	348c97685c	Bump up MAX_SUBTARGET_FEATURES Summary: Currently, ARMGenSubtargetInfo (from ARM.td) is reaching the limit of 96: enum : uint64_t { ... XScale = 95 }; We need to bump the maximum value up to accommodate future changes and/or customized subtarget definitions. Reviewers: apazos, t.p.northover Subscribers: llvm-commits, aemerson Differential Revision: http://reviews.llvm.org/D15514 llvm-svn: 255616	2015-12-15 04:42:49 +00:00
Vaivaswatha Nagaraj	e6c5ddb0c2	NFC: Fix typo in comment llvm-svn: 255615	2015-12-15 04:41:10 +00:00
Nico Weber	630645d32a	Try to let r255604 have an effect. llvm-svn: 255611	2015-12-15 03:14:19 +00:00
Mehdi Amini	02dd190506	Add specific header for MSVC to be able to build with LLVM_ENABLE_THREADS=OFF Follow-up to the ThreadPool library From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255604	2015-12-15 02:32:03 +00:00
Mehdi Amini	b29b50a9dd	Instcombine: destructor loads of structs that do not contain padding For non padded structs, we can just proceed and deaggregate them. We don't want to do this when there is padding in the struct so as not to lose information about this padding (the subsequent passes would then try hard to preserve the padding, which is undesirable). Also update extractvalue.ll and cast.ll so that they use structs with padding. Remove the FIXME in the extractvalue of load case as the non padded case is handled when processing the load, and we don't want to do it on the padded case. Patch by: Amaury SECHET <deadalnix@gmail.com> Differential Revision: http://reviews.llvm.org/D14483 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255600	2015-12-15 01:44:07 +00:00
Cong Hou	8389268d7a	Let operator/ with uint32_t rhs operand be a member of BranchProbability and add a new operator /=. NFC. llvm-svn: 255595	2015-12-15 01:21:14 +00:00
Mehdi Amini	54fc055738	Add a C++11 ThreadPool implementation in LLVM This is a very simple implementation of a thread pool using C++11 thread. It accepts any std::function<void()> for asynchronous execution. Individual tasks can be synchronized using the returned future, or the client can block on the full queue completion. In case LLVM is configured with Threading disabled, it falls back to sequential execution using std::async with launch:deferred. This is intended to support parallelism for ThinLTO processing in linker plugin, but is generic enough for any other uses. This is a recommit of r255444 ; trying to workaround a bug in the MSVC 2013 standard library. I think I was hit by: http://connect.microsoft.com/VisualStudio/feedbackdetail/view/791185/std-packaged-task-t-where-t-is-void-or-a-reference-class-are-not-movable Recommit of r255589, trying to please g++ as well. Differential Revision: http://reviews.llvm.org/D15464 From: mehdi_amini <mehdi_amini@91177308-0d34-0410-b5e6-96231b3b80d8> llvm-svn: 255593	2015-12-15 00:59:19 +00:00
Mehdi Amini	5acecbc1ec	Revert "Add a C++11 ThreadPool implementation in LLVM" This reverts commit r255589. Breaks g++ From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255591	2015-12-15 00:42:44 +00:00
Mehdi Amini	dbdd929681	Add a C++11 ThreadPool implementation in LLVM This is a very simple implementation of a thread pool using C++11 thread. It accepts any std::function<void()> for asynchronous execution. Individual tasks can be synchronized using the returned future, or the client can block on the full queue completion. In case LLVM is configured with Threading disabled, it falls back to sequential execution using std::async with launch:deferred. This is intended to support parallelism for ThinLTO processing in linker plugin, but is generic enough for any other uses. This is a recommit of r255444 ; trying to workaround a bug in the MSVC 2013 standard library. I think I was hit by: http://connect.microsoft.com/VisualStudio/feedbackdetail/view/791185/std-packaged-task-t-where-t-is-void-or-a-reference-class-are-not-movable Differential Revision: http://reviews.llvm.org/D15464 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255589	2015-12-15 00:38:05 +00:00
Xinliang David Li	7640a69ad0	[PGO] make profile prefix even shorter and more readable llvm-svn: 255586	2015-12-15 00:32:56 +00:00
Rafael Espindola	79cecc5f46	Yet another missing include. llvm-svn: 255579	2015-12-14 23:39:05 +00:00
Rafael Espindola	2d1739bf50	A better attempt to add a missing include llvm-svn: 255578	2015-12-14 23:34:35 +00:00
Xinliang David Li	10c9ed2b0f	[PGO] Shorten profile symbol prefixes Profile symbols have long prefixes which waste space and create pressure for the linker. This patch shortens the prefixes to minimal length without losing verbosity. Differential Revision: http://reviews.llvm.org/D15503 llvm-svn: 255575	2015-12-14 23:26:27 +00:00
Rafael Espindola	5b397256de	Use diagnostic handler in the LLVMContext This patch converts code that has access to a LLVMContext to not take a diagnostic handler. This has a few advantages * It is easier to use a consistent diagnostic handler in a single program. * Less clutter since we are not passing a handler around. It does make it a bit awkward to implement some C APIs that return a diagnostic string. I will propose new versions of these APIs and deprecate the current ones. llvm-svn: 255571	2015-12-14 23:17:03 +00:00
Sanjay Patel	14a74b66f7	add fast-math-flags to 'call' instructions (PR21290) This patch adds optional fast-math-flags (the same that apply to fmul/fadd/fsub/fdiv/frem/fcmp) to call instructions in IR. Follow-up patches would use these flags in LibCallSimplifier, add support to clang, and extend FMF to the DAG for calls. Motivating example: %y = fmul fast float %x, %x %z = tail call float @sqrtf(float %y) We'd like to be able to optimize sqrt(x*x) into fabs(x). We do this today using a function-wide attribute for unsafe-math, but we really want to trigger on the instructions themselves: %z = tail call fast float @sqrtf(float %y) because in an LTO build it's possible that calls with fast semantics have been inlined into a function with non-fast semantics. The code changes and tests are based on the recent commits that added "notail": http://reviews.llvm.org/rL252368 and added FMF to fcmp: http://reviews.llvm.org/rL241901 Differential Revision: http://reviews.llvm.org/D14707 llvm-svn: 255555	2015-12-14 21:59:03 +00:00
Krzysztof Parzyszek	0cb4e2fced	[Packetizer] Add AliasAnalysis as a parameter to the packetizer This will make the dependence graph more accurate if an alias analysis is provided. If nullptr is specified in its place, the behavior will remain as it is currently. llvm-svn: 255540	2015-12-14 20:35:13 +00:00
Pete Cooper	183f136627	Add missing vtable anchors. The following description is from http://reviews.llvm.org/D15481: ICmpInst, GetElementPtrInst and PHINode have no anchor functions. This causes the vtable and the type info (if RTTI is enabled in user code) to be emitted in multiple translation units. Before 3.7, the destructors were the key functions for these nodes, but they have been removed. There have been discussions about this here: http://lists.llvm.org/pipermail/llvm-dev/2015-August/089010.html and here: http://lists.llvm.org/pipermail/llvm-dev/2015-December/092921.html. Patch by Visoiu Mistrih Francis llvm-svn: 255538	2015-12-14 20:29:16 +00:00
Krzysztof Parzyszek	1c65753742	[Packetizer] Make endPacket virtual This will allow custom handling of packet finalization. The current definition of endPacket will still perform the default finalization. llvm-svn: 255537	2015-12-14 20:12:24 +00:00
Sanjoy Das	46fedab671	Teach haveSameSpecialState about operand bundles llvm-svn: 255527	2015-12-14 19:11:35 +00:00
Krzysztof Parzyszek	73ad4fb9e3	Add "const" to function arguments in DFAPacketizer llvm-svn: 255526	2015-12-14 18:54:44 +00:00
Xinliang David Li	2ab348ba60	[PGO] Value profiling text format reader/writer support This patch adds the missing functionality in parsable text format support for value profiling. Differential Revision: http://reviews.llvm.org/D15212 llvm-svn: 255523	2015-12-14 18:44:01 +00:00
David Majnemer	49dcd13916	[IR] Remove terminatepad It turns out that terminatepad gives little benefit over a cleanuppad which calls the termination function. This is not sufficient to implement fully generic filters but MSVC doesn't support them which makes terminatepad a little over-designed. Depends on D15478. Differential Revision: http://reviews.llvm.org/D15479 llvm-svn: 255522	2015-12-14 18:34:23 +00:00
Paul Robinson	252af42f99	FastISel needs to remove dead code when it bails out. When FastISel fails to translate an instruction it hands off code generation to SelectionDAG. Before it does so, it may have generated local value instructions to feed phi nodes in successor blocks. These instructions will then be generated again by SelectionDAG, causing duplication and less efficient code, including extra spill instructions. Patch by Wolfgang Pieb! Differential Revision: http://reviews.llvm.org/D11768 llvm-svn: 255520	2015-12-14 18:33:18 +00:00
NAKAMURA Takumi	a3ac1e279e	Reformat to untabify. llvm-svn: 255483	2015-12-14 07:58:25 +00:00
Michael Zuckerman	8fdb077443	[X86][inline asm] support even directive The .even directive aligns content to an even-numbered address. In AT&T syntax: .even. In Microsoft syntax: even (without the dot). Differential Revision: http://reviews.llvm.org/D15413 llvm-svn: 255462	2015-12-13 17:07:23 +00:00
Cong Hou	b76a7a8dcb	Normalize MBB's successors' probabilities in several locations. This patch adds some missing calls to MBB::normalizeSuccProbs() in several locations where it should be called. Those places are found by checking if the sum of successors' probabilities is approximate one in MachineBlockPlacement pass with some instrumented code (not in this patch). Differential revision: http://reviews.llvm.org/D15259 llvm-svn: 255455	2015-12-13 09:26:17 +00:00
Nico Weber	c244167f92	Revert r255444. It doesn't build on Windows and broke the Windows LLD and LLDB bots: http://lab.llvm.org:8011/builders/lld-x86_64-win7/builds/27693/steps/build_Lld/logs/stdio http://lab.llvm.org:8011/builders/lldb-x86-windows-msvc/builds/13468/steps/build/logs/stdio llvm-svn: 255446	2015-12-13 04:14:39 +00:00
Mehdi Amini	865fad06c6	Add a C++11 ThreadPool implementation in LLVM This is a very simple implementation of a thread pool using C++11 thread. It accepts any std::function<void()> for asynchronous execution. Individual tasks can be synchronized using the returned future, or the client can block on the full queue completion. In case LLVM is configured with Threading disabled, it falls back to sequential execution using std::async with launch:deferred. This is intended to support parallelism for ThinLTO processing in linker plugin, but is generic enough for any other uses. Differential Revision: http://reviews.llvm.org/D15464 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255444	2015-12-12 22:55:25 +00:00
David Majnemer	bf189bdcd7	[IR] Reformulate LLVM's EH funclet IR While we have successfully implemented a funclet-oriented EH scheme on top of LLVM IR, our scheme has some notable deficiencies: - catchendpad and cleanupendpad are necessary in the current design but they are difficult to explain to others, even to seasoned LLVM experts. - catchendpad and cleanupendpad are optimization barriers. They cannot be split and force all potentially throwing call-sites to be invokes. This has a noticeable effect on the quality of our code generation. - catchpad, while similar in some aspects to invoke, is fairly awkward. It is unsplittable, starts a funclet, and has control flow to other funclets. - The nesting relationship between funclets is currently a property of control flow edges. Because of this, we are forced to carefully analyze the flow graph to see if there might potentially exist illegal nesting among funclets. While we have logic to clone funclets when they are illegally nested, it would be nicer if we had a representation which forbade them upfront. Let's clean this up a bit by doing the following: - Instead, make catchpad more like cleanuppad and landingpad: no control flow, just a bunch of simple operands; catchpad would be splittable. - Introduce catchswitch, a control flow instruction designed to model the constraints of funclet oriented EH. - Make funclet scoping explicit by having funclet instructions consume the token produced by the funclet which contains them. - Remove catchendpad and cleanupendpad. Their presence can be inferred implicitly using coloring information. N.B. The state numbering code for the CLR has been updated but the veracity of its output cannot be spoken for. An expert should take a look to make sure the results are reasonable. Reviewers: rnk, JosephTremoulet, andrew.w.kaylor Differential Revision: http://reviews.llvm.org/D15139 llvm-svn: 255422	2015-12-12 05:38:55 +00:00
Diego Novillo	9bbd13f9a0	SamplePGO - Reduce memory utilization by 10x. DenseMap is the wrong data structure to use for sample records and call sites. The keys are too large, causing massive core memory growth when reading profiles. Before this patch, a 21Mb input profile was causing the compiler to grow to 3Gb in memory. By switching to std::map, the compiler now grows to 300Mb in memory. There still are some opportunities for memory footprint reduction. I'll be looking at those next. llvm-svn: 255389	2015-12-11 23:21:38 +00:00
Hal Finkel	e58db13c29	Revert r248483, r242546, r242545, and r242409 - absdiff intrinsics After much discussion, ending here: http://lists.llvm.org/pipermail/llvm-commits/Week-of-Mon-20151123/315620.html it has been decided that, instead of having the vectorizer directly generate special absdiff and horizontal-add intrinsics, we'll recognize the relevant reduction patterns during CodeGen. Accordingly, these intrinsics are not needed (the operations they represent can be pattern matched, as is already done in some backends). Thus, we're backing these out in favor of the current development work. r248483 - Codegen: Fix llvm.*absdiff semantic. r242546 - [ARM] Use [SU]ABSDIFF nodes instead of intrinsics for VABD/VABA r242545 - [AArch64] Use [SU]ABSDIFF nodes instead of intrinsics for ABD/ABA r242409 - [Codegen] Add intrinsics 'absdiff' and corresponding SDNodes for absolute difference operation llvm-svn: 255387	2015-12-11 23:11:52 +00:00
Xinliang David Li	2cad28318a	[PGO] Revert r255365: solution incomplete, not handling lambda yet llvm-svn: 255369	2015-12-11 20:23:22 +00:00
Xinliang David Li	de46641441	[PGO] Stop using invalid char in instr variable names. Before the patch, -fprofile-instr-generate compile will fail if no integrated-as is specified when the file contains any static functions (the -S output is also invalid). This patch fixed the issue. With the change, the index format version will be bumped up by 1. Backward compatibility is preserved with this change. Differential Revision: http://reviews.llvm.org/D15243 llvm-svn: 255365	2015-12-11 19:53:19 +00:00
Matthias Braun	b8675cada7	CodeGen: Redo analyzePhysRegs() and computeRegisterLiveness() computeRegisterLiveness() was broken in that it reported dead for a register even if a subregister was alive. I assume this was because the results of analyzePhysRegs() are hard to understand with respect to subregisters. This commit: Changes the results of analyzePhysRegs (=struct PhysRegInfo) to be clearly understandable, also renames the fields to avoid silent breakage of third-party code (and improve the grammar). Fix all (two) users of computeRegisterLiveness() in llvm: By reenabling it and removing workarounds for the bug. This fixes http://llvm.org/PR24535 and http://llvm.org/PR25033 Differential Revision: http://reviews.llvm.org/D15320 llvm-svn: 255362	2015-12-11 19:42:09 +00:00
Matt Arsenault	fad94dae85	Start replacing vector_extract/vector_insert with extractelt/insertelt These are redundant pairs of nodes defined for INSERT_VECTOR_ELEMENT/EXTRACT_VECTOR_ELEMENT. insertelement/extractelement are slightly closer to the corresponding C++ node name, and has stricter type checking so prefer it. Update targets to only use these nodes where it is trivial to do so. AArch64, ARM, and Mips all have various type errors on simple replacement, so they will need work to fix. Example from AArch64: def : Pat<(sext_inreg (vector_extract (v16i8 V128:$Rn), VectorIndexB:$idx), i8), (i32 (SMOVvi8to32 V128:$Rn, VectorIndexB:$idx))>; Which is trying to do sext_inreg i8, i8. llvm-svn: 255359	2015-12-11 19:20:16 +00:00
Manman Ren	6d08250fe8	CXX_FAST_TLS calling convention: target independent portion. The access function has a short entry and a short exit, the initialization block is only run the first time. To improve the performance, we want to have a short frame at the entry and exit. We explicitly handle most of the CSRs via copies. Only the CSRs that are not handled via copies will be in CSR_SaveList. Frame lowering and prologue/epilogue insertion will generate a short frame in the entry and exit according to CSR_SaveList. The majority of the CSRs will be handled by register allocator. Register allocator will try to spill and reload them in the initialization block. We add CSRsViaCopy, it will be explicitly handled during lowering. 1> we first set FunctionLoweringInfo->SplitCSR if conditions are met (the target supports it for the given calling convention and the function has only return exits). We also call TLI->initializeSplitCSR to perform initialization. 2> we call TLI->insertCopiesSplitCSR to insert copies from CSRsViaCopy to virtual registers at beginning of the entry block and copies from virtual registers to CSRsViaCopy at beginning of the exit blocks. 3> we also need to make sure the explicit copies will not be eliminated. rdar://problem/23557469 Differential Revision: http://reviews.llvm.org/D15340 llvm-svn: 255353	2015-12-11 18:24:30 +00:00
Xinliang David Li	b2d316e534	[PGO] Read VP raw data without depending on the Value field Before this patch, each function's on-disk VP data is 'pointed' to by the Value field of per-function ProfileData structure, and read relies on this field (relocated with ValueDataDelta field) to read the value data. However this means the Value field needs to be updated during runtime before dumping, which creates undesirable data races. With this patch, the reading of VP data no longer depends on Value field. There is no format change. ValueDataDelta header field becomes obsolete but will be kept for compatibility reason (will be removed next time the raw format change is needed). llvm-svn: 255329	2015-12-11 06:53:53 +00:00
Xinliang David Li	7d948a5ebf	Add a forward declaration (NFC) llvm-svn: 255292	2015-12-10 20:13:41 +00:00
Nathan Slingerland	5f9f1eadf1	[ProfileData] Add unit test infrastructure for sample profile reader/writer Summary: Adds support for in-memory round-trip of sample profile data along with basic round trip unit tests. This will also make it easier to include unit tests for future changes to sample profiling. Reviewers: davidxl, dnovillo, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15211 llvm-svn: 255264	2015-12-10 17:21:42 +00:00
Rafael Espindola	9a2fc3b415	Avoid undefined behavior when vector is empty. Found by ubsan. llvm-svn: 255258	2015-12-10 16:35:06 +00:00
Rafael Espindola	c944447e56	Split lib/Linker in two. A linker normally has two stages: symbol resolution and "moving stuff". In lib/Linker there is the complication of lazy linking some globals, but it was still far more mixed than it needed to be. This splits the linker into a lower level IRMover and the linker proper. The IRMover just takes a list of globals to move and a callback that lets the user control what is lazy linked. The main motivation is that now tools/gold (and soon lld) can use their own symbol resolution to instruct IRMover what to do. llvm-svn: 255254	2015-12-10 14:19:35 +00:00
Amjad Aboud	85f2758759	Macro debug info support in LLVM IR Introduced DIMacro and DIMacroFile debug info metadata in the LLVM IR to support macros. Differential Revision: http://reviews.llvm.org/D14687 llvm-svn: 255245	2015-12-10 12:56:35 +00:00
Jonas Paulsson	34ea6437fb	[PostRA scheduling] Allow a target to do scheduling when it wants post RA. SystemZ needs to do its scheduling after branch relaxation, which can only happen after block placement, and therefore the standard PostRAScheduler point in the pass sequence is too early. TargetMachine::targetSchedulesPostRAScheduling() is a new method that signals on returning true that target will insert the final scheduling pass on its own. Reviewed by Hal Finkel llvm-svn: 255234	2015-12-10 09:10:07 +00:00
Sanjoy Das	d85ded90d0	Add arg_begin() and arg_end() to CallInst and InvokeInst; NFCI - This simplifies the CallSite class, arg_begin / arg_end are now simple wrapper getters. - In several places, we were creating CallSite instances solely to call arg_begin and arg_end. With this change, that's no longer required. llvm-svn: 255226	2015-12-10 06:39:02 +00:00
Matthias Braun	c65dd31462	RegisterPressure: Factor out liveness dead-def detection logic; NFCI Detecting additional dead-defs without a dead flag that are only visible through liveness information should be part of the register operand collection not intertwined with the register pressure update logic. llvm-svn: 255192	2015-12-10 01:04:15 +00:00
Sanjoy Das	f3ba629c4d	Use WeakVH to keep track of calls with operand bundles in CloneCodeInfo `CloneAndPruneIntoFromInst` can DCE instructions after cloning them into the new function, and so an AssertingVH is too strong. This change switches CloneCodeInfo to use a std::vector<WeakVH>. llvm-svn: 255148	2015-12-09 20:33:52 +00:00
Teresa Johnson	83a7df21b2	[ThinLTO] FunctionImport pass can take a const index pointer (NFC) llvm-svn: 255140	2015-12-09 19:39:47 +00:00
Rong Xu	2f995f2098	[PGO] Resubmit "MST based PGO instrumentation infrastructure" (r254021) This new patch fixes a few bugs that were exposed in the last submit. It also improves the test cases. --Original Commit Message-- This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees minimum number of CFG edges getting instrumented. An additional optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 255132	2015-12-09 18:08:16 +00:00
Nathan Slingerland	3b35892e68	[Support] Change SaturatingAdd()/SaturatingMultiply() to use pointer for returning overflow state Summary: Improve SaturatingAdd()/SaturatingMultiply() to use bool * to optionally return overflow result. This should make it clearer that the value is returned at callsites and reduces the size of the implementation. Reviewers: davidxl, silvas Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15219 llvm-svn: 255128	2015-12-09 17:11:28 +00:00
Mehdi Amini	300ed48d90	Change hasUniqueInitializer() to call isStrongDefinitionForLinker() instead of !isWeakForLinker() Summary: Available_externally global variable with initializer were considered "hasInitializer()", while obviously it can't match the description: Whether the global variable has an initializer, and any changes made to the initializer will turn up in the final executable. since modifying the initializer of an externally available variable does not make sense. Reviewers: pcc, rafael Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15351 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255123	2015-12-09 16:17:07 +00:00
Silviu Baranga	d19d7b747a	Re-commit r255115, with the PredicatedScalarEvolution class moved to ScalarEvolution.h, in order to avoid cyclic dependencies between the Transform and Analysis modules: [LV][LAA] Add a layer over SCEV to apply run-time checked knowledge on SCEV expressions Summary: This change creates a layer over ScalarEvolution for LAA and LV, and centralizes the usage of SCEV predicates. The SCEVPredicatedLayer takes the statically deduced knowledge by ScalarEvolution and applies the knowledge from the SCEV predicates. The end goal is that both LAA and LV should use this interface everywhere. This also solves a problem involving the result of SCEV expression rewriting when the predicate changes. Suppose we have the expression (sext {a,+,b}) and two predicates P1: {a,+,b} has nsw P2: b = 1. Applying P1 and then P2 gives us {a,+,1}, while applying P2 and then P1 gives us sext({a,+,1}) (the AddRec expression was changed by P2 so P1 no longer applies). The SCEVPredicatedLayer maintains the order of transformations by feeding back the results of previous transformations into new transformations, and therefore avoiding this issue. The SCEVPredicatedLayer maintains a cache to remember the results of previous SCEV rewriting results. This also has the benefit of reducing the overall number of expression rewrites. Reviewers: mzolotukhin, anemet Subscribers: jmolloy, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14296 llvm-svn: 255122	2015-12-09 16:06:28 +00:00
Silviu Baranga	ba0669cbca	Revert r255115 until we figure out how to fix the bot failures. llvm-svn: 255117	2015-12-09 15:25:28 +00:00
Silviu Baranga	f6006f41f7	[LV][LAA] Add a layer over SCEV to apply run-time checked knowledge on SCEV expressions Summary: This change creates a layer over ScalarEvolution for LAA and LV, and centralizes the usage of SCEV predicates. The SCEVPredicatedLayer takes the statically deduced knowledge by ScalarEvolution and applies the knowledge from the SCEV predicates. The end goal is that both LAA and LV should use this interface everywhere. This also solves a problem involving the result of SCEV expression rewriting when the predicate changes. Suppose we have the expression (sext {a,+,b}) and two predicates P1: {a,+,b} has nsw P2: b = 1. Applying P1 and then P2 gives us {a,+,1}, while applying P2 and then P1 gives us sext({a,+,1}) (the AddRec expression was changed by P2 so P1 no longer applies). The SCEVPredicatedLayer maintains the order of transformations by feeding back the results of previous transformations into new transformations, and therefore avoiding this issue. The SCEVPredicatedLayer maintains a cache to remember the results of previous SCEV rewriting results. This also has the benefit of reducing the overall number of expression rewrites. Reviewers: mzolotukhin, anemet Subscribers: jmolloy, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D14296 llvm-svn: 255115	2015-12-09 15:03:52 +00:00
Mehdi Amini	65450e86c8	Revert "Implement a new pass - LiveDebugValues - to compute the set of live DEBUG_VALUEs at each basic block and insert them. Reviewed and accepted at: http://reviews.llvm.org/D11933 " This reverts commit r255096. Break the bots: http://lab.llvm.org:8080/green/job/clang-stage1-cmake-RA-incremental_check/16378/ From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255101	2015-12-09 08:17:42 +00:00
Vikram TV	eb98340d4a	Implement a new pass - LiveDebugValues - to compute the set of live DEBUG_VALUEs at each basic block and insert them. Reviewed and accepted at: http://reviews.llvm.org/D11933 llvm-svn: 255096	2015-12-09 05:49:14 +00:00
Steven Wu	1b8163c1b8	Fix the order of destructors in LibLTOCodeGenerator Summary: The order of destructors in LTOCodeGenerator gets changed in r254696. It is possible for LTOCodeGenerator to have a MergedModule created in the OwnedContext, in which case the module must be destructed before the context. Reviewers: rafael, dexonsmith Subscribers: llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D15346 llvm-svn: 255092	2015-12-09 03:37:51 +00:00
Rafael Espindola	f20bc23b7c	Return a std::unique_ptr from CloneModule. NFC. llvm-svn: 255078	2015-12-08 23:57:17 +00:00
Mehdi Amini	ba2c064383	Remove caching in FunctionImport: a Module can't be reused after being linked from The Linker destroys the source module (API change coming to make it explicit) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255064	2015-12-08 22:39:40 +00:00
Mehdi Amini	e4f5a60024	Revert "Add Available Externally linkage type to isWeakForLinker()" This reverts r255043, as post-review concerns were raised about its correctness. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255045	2015-12-08 19:13:31 +00:00
Mehdi Amini	adf4a628c7	Add Available Externally linkage type to isWeakForLinker() Per LangRef: "Globals with available_externally linkage are allowed to be discarded at will, and are otherwise the same as linkonce_odr", since linkonce_odr is in this list it makes sense to have available_externally there as well. Reviewers: rafael Differential Revision: http://reviews.llvm.org/D15323 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255043	2015-12-08 19:01:29 +00:00
Asaf Badouh	73424c7e6b	[x86][avx512] more changes in intrinsics to align with gcc format Differential Revision: http://reviews.llvm.org/D15329 llvm-svn: 255011	2015-12-08 12:34:34 +00:00
Sanjoy Das	90bb44dfe3	[OperandBundles] Remove unnecessary constructor The StringRef constructor is unnecessary (since we're converting to std::string anyway), and having it requires an explicit call to StringRef's or std::string's constructor. llvm-svn: 255000	2015-12-08 03:50:32 +00:00
Sanjoy Das	2f7aca1668	[IndVars] Have getInsertPointForUses preserve LCSSA Summary: Also add a stricter post-condition for IndVarSimplify. Fixes PR25578. Test case by Michael Zolotukhin. Reviewers: hfinkel, atrick, mzolotukhin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15059 llvm-svn: 254977	2015-12-08 00:13:21 +00:00
Sanjoy Das	ec1f59a19a	[SCEVExpander] Have hoistIVInc preserve LCSSA Summary: (Note: the problematic invocation of hoistIVInc that caused PR24804 came from IndVarSimplify, not from SCEVExpander itself) Fixes PR24804. Test case by David Majnemer. Reviewers: hfinkel, majnemer, atrick, mzolotukhin Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15058 llvm-svn: 254976	2015-12-08 00:13:17 +00:00
Sanjoy Das	9d4e519ec7	Add Instruction::getFunction; NFC Will be used in an upcoming patch. llvm-svn: 254975	2015-12-08 00:13:12 +00:00
Philip Reames	b28303ef74	[PassManager] Tuning Memory Usage of AnalysisUsage We were using unnecessarily large initial sizes for these SmallVectors. This was wasting around 50kb of memory for the O3 pipeline, even after the uniquing changes. We're still using around 20kb which is a bit much, but it's definitely better. This is about a 6% improvement in total O3 memory usage. Note: The raw data on structure size which were used to pick these thresholds can be found in the review thread. Differential Revision: http://reviews.llvm.org/D15244 llvm-svn: 254974	2015-12-08 00:10:56 +00:00
Justin Bogner	8b4ea7a451	Support: Teach Asan about BumpPtrAllocator Based on patch by Pete Cooper. llvm-svn: 254964	2015-12-07 23:12:26 +00:00
Mehdi Amini	87b7d8abe9	Remove useless hack that avoids calling LLVMLinkInInterpreter() This is supposed to force-link the Interpreter, by inserting a dead call to LLVMLinkInInterpreter(). Since it is actually an empty function, there is no reason for the call to be dead. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254956	2015-12-07 22:27:19 +00:00
Teresa Johnson	335927a217	Fix function return type in declaration (bot errors from r254926). Try to fix bot build errors from r254926 by correcting the function return type. llvm-svn: 254934	2015-12-07 19:53:38 +00:00
Sanjay Patel	51a4d88a44	fix 'the the '; NFC llvm-svn: 254928	2015-12-07 19:21:39 +00:00
Teresa Johnson	1fb89d62fb	[ThinLTO] Support for specifying function index from pass manager Summary: Add a field on the PassManagerBuilder that clang or gold can use to pass down a pointer to the function index in memory to use for importing when the ThinLTO backend is triggered. Add support to supply this to the function import pass. Reviewers: joker.eph, dexonsmith Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D15024 llvm-svn: 254926	2015-12-07 19:21:11 +00:00
Lang Hames	096bda3427	[Orc] Removing traces of takeOwnershipOfBuffers left after r251560. Patch by Joshua Gerrard. Thanks Joshua! llvm-svn: 254919	2015-12-07 17:35:56 +00:00
Teresa Johnson	4789ea589d	[ThinLTO] Support cloning of temporary DILocation metadata This is needed to support linking of module-level metadata as a postpass after function importing, where we will be leaving temporary metadata on imported instructions until the postpass metadata import. Also added unittest. Split from D14838. llvm-svn: 254914	2015-12-07 15:05:44 +00:00
Igor Breger	2e5da39635	AVX-512: implement kunpck intrinsics. Differential Revision: http://reviews.llvm.org/D14821 llvm-svn: 254908	2015-12-07 13:25:18 +00:00
Asaf Badouh	201b9ca305	[avx512] rename gcc intrinsics to be align with gcc format rename the gcc intrinsics suffix : _mask ->_round Differential Revision: http://reviews.llvm.org/D15285 llvm-svn: 254905	2015-12-07 13:14:14 +00:00
Craig Topper	6d9821a6ce	Add uint8_t size to LegalizeTypeAction enum and use the enum type directly to remove some typecasts. NFC llvm-svn: 254895	2015-12-07 06:31:41 +00:00
Craig Topper	2d42dff673	Add uint8_t size to LegalizeAction enum so we can use the enum type directly and remove some casts. NFC llvm-svn: 254893	2015-12-07 06:01:51 +00:00
Lang Hames	dde8f900ec	[Orc] Rename IndirectStubsManagerBase to IndirectStubsManager. No functional change. llvm-svn: 254885	2015-12-06 19:44:45 +00:00
Asaf Badouh	903869d4c1	[X86][AVX512] add vmovss/sd missing encoding Differential Revision: http://reviews.llvm.org/D14701 llvm-svn: 254875	2015-12-06 13:26:56 +00:00
Michael Kuperstein	0fd1d57c01	[X86] Always generate precise CFA adjustments. This removes the code path that generate "synchronous" (only correct at call site) CFA. We will probably want to re-introduce it once we are capable of emitting different .eh_frame and .debug_frame sections. Differential Revision: http://reviews.llvm.org/D14948 llvm-svn: 254874	2015-12-06 13:06:20 +00:00
Craig Topper	55a007dfc9	Use make_range to reduce mentions of iterator type. NFC llvm-svn: 254872	2015-12-06 05:08:07 +00:00
Craig Topper	a9a7f8f04c	Minor formatting fix. NFC llvm-svn: 254871	2015-12-06 05:07:58 +00:00
Sanjoy Das	16ad4f2471	[InstCombine] Call getCmpPredicateForMinMax only with a valid SPF Summary: There are `SelectPatternFlavor`s that don't represent min or max idioms, and we should not be passing those to `getCmpPredicateForMinMax`. Fixes PR25745. Reviewers: majnemer Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D15249 llvm-svn: 254869	2015-12-05 23:44:22 +00:00
Craig Topper	bf1c629225	Replace uint16_t with the MCPhysReg typedef in many places. A lot of physical register arrays already use this typedef. llvm-svn: 254843	2015-12-05 07:13:35 +00:00
Craig Topper	250c981bf4	Use std::fill instead of memset to initialize an array to avoid hardcoded count and a multiply. The outputed code is identical. llvm-svn: 254842	2015-12-05 07:13:25 +00:00
Xinliang David Li	c84cd3b8ff	[PGO] Add version to getPGOFuncName method Different version of indexed format may use different name uniquing schemes for static functions. Pass the version info to the name interface so that different schemes can be picked (for profile lookup). llvm-svn: 254838	2015-12-05 05:16:36 +00:00
David Blaikie	09b741f7b6	[llvm-dwp] Support debug_tu_index llvm-svn: 254827	2015-12-05 03:05:45 +00:00
Lang Hames	20a7c0b0dd	Whitespace. llvm-svn: 254821	2015-12-05 01:44:20 +00:00
Justin Bogner	62356515cc	CodeGen: Let the BumpPtrAllocator free the elements of indexList The indexList's nodes are all allocated on a BumpPtrAllocator, so it's more efficient to let them be freed when it goes away, rather than deleting them directly. This is a follow up to r254794. llvm-svn: 254808	2015-12-05 00:39:14 +00:00
Philip Reames	730b7eb8e3	[EarlyCSE] IsSimple vs IsVolatile naming clarification (NFC) When the notion of target specific memory intrinsics was introduced to EarlyCSE, the commit confused the notions of volatile and simple memory access. Since I'm about to start working on this area, cleanup the naming so that patches aren't horribly confusing. Note that the actual implementation was always bailing if the load or store wasn't simple. Reminder: - "volatile" - C++ volatile, can't remove any memory operations, but in principle unordered - "ordered" - imposes ordering constraints on other nearby memory operations - "atomic" - can't be split or sheared. In LLVM terms, all "ordered" operations are also atomic so the predicate "isAtomic" is often used. - "simple" - a load which is none of the above. These are normal loads and what most of the optimizer works with. llvm-svn: 254805	2015-12-05 00:18:33 +00:00
Philip Reames	a39a875fea	[PassManager] Ensure destructors of cached AnalysisUsage objects are run In 254760, I introduced the usage of a BumpPtrAllocator for the AnalysisUsage instances held by the PassManager. This turns out to have been incorrect since a BumpPtrAllocator does not run the destructors of objects when deallocating memory. Since a few of our SmallVectors had grown beyond their small size, we end up with some leaked memory. We need to use a SpecificBumpPtrAllocator instead. llvm-svn: 254803	2015-12-04 23:48:19 +00:00
Teresa Johnson	f06c8232ba	[ThinLTO] Helper for performing renaming/promotion on a module Creates a module and performs necessary renaming/promotion of locals that may be exported to another module. Split out of D15024. llvm-svn: 254802	2015-12-04 23:40:22 +00:00
Philip Reames	62dd4420a0	Address a memory leak in 254760 The issue appears to have been that the copy constructor of the SmallVector was being invoked and this was somehow leading to leaked memory. This patch avoids the symptom, but likely doesn't address the underlying problem. I'm still investigating the root cause, but wanted to avoid the memory leak in the mean time. Even with the underlying fix, avoiding the redundant allocation is worthwhile. llvm-svn: 254795	2015-12-04 23:06:33 +00:00
Justin Bogner	6c86c50af0	CodeGen: Move the SlotIndexes BumpPtrAllocator before the list it allocates When a `SlotIndexes` is destroyed, `ileAllocator` will currently be destructed before `IndexList`, but all of `IndexList`'s storage has been allocated by `ileAllocator`. This means we'll call destructors on garbage data, which is very bad. This can be avoided by putting the BumpPtrAllocator earlier in the class than anything it allocates. Unfortunately, I don't know how to test this. It depends very much on memory layout, and the only evidence I have that this is actually happening in practice are backtraces that might be explained by this. By inspection though, the code is obviously dangerous/wrong, and this is the right thing to do. I'll follow up later with a patch that calls clearAndLeakNodesUnsafely on the list, since there isn't much point in destructing them when they're allocated in a BPA anyway, but I figured it makes sense to commit the correctness fix separately from that optimization. llvm-svn: 254794	2015-12-04 23:00:54 +00:00
Lang Hames	c8c41976a3	[Orc] Move some code up into the JITCompileCallbackManager base class. NFC. llvm-svn: 254778	2015-12-04 22:09:19 +00:00
Rafael Espindola	2c7e9b8d27	Always pass a diagnostic handler to the linker. Before this patch the diagnostic handler was optional. If it was not passed, the one in the LLVMContext was used. That is probably not a pattern we want to follow. If each area has an optional callback, there is a sea of callbacks and it is hard to follow which one is called. Doing this also found cases where the callback is a nice addition, like testing that no errors or warnings are reported. The other option is to always use the diagnostic handler in the LLVMContext. That has a few problems * To implement the C API we would have to set the diag handler and then set it back to the original value. * Code that creates the context might be far away from code that wants the diagnostics. I do have a patch that implements the second option and will send that as an RFC. llvm-svn: 254777	2015-12-04 22:08:53 +00:00
Sanjoy Das	6dd256008e	[OperandBundles] Allow operand-specific attributes in operand bundles Currently `OperandBundleUse::operandsHaveAttr` computes its result without being given a specific operand. This is problematic because it forces us to say that, e.g., even non-pointer operands in `"deopt"` operand bundles are `readonly`, which doesn't make sense. This commit changes `operandsHaveAttr` to work in the context of a specific operand, so that we can give the operand attributes that make sense for the operands's `llvm::Type`. llvm-svn: 254764	2015-12-04 20:34:37 +00:00
Philip Reames	cce89f4a30	[LegacyPassManager] Reduce memory usage for AnalysisUsage The LegacyPassManager was storing an instance of AnalysisUsage for each instance of each pass. In practice, most instances of a single pass class share the same dependencies. We can't rely on this because passes can (and some do) have dynamic dependencies based on instance options. We can exploit the likely commonality by uniqueing the usage information after querying the pass, but before storing it into the pass manager. This greatly reduces memory consumption by the AnalysisUsage objects. For a long pass pipeline, I measured a decrease in memory consumption for this storage of about 50%. I have not measured on the default O3 pipeline, but I suspect it will see some benefit as well since many passes are repeated (e.g. InstCombine). Differential Revision: http://reviews.llvm.org/D14677 llvm-svn: 254760	2015-12-04 20:05:04 +00:00
Matthias Braun	54aeb74db9	ScheduleDAGInstrs: Move LiveIntervals field to ScheduleDAGMI Now that ScheduleDAGInstrs doesn't need it anymore we can move the field down the class hierarchy to ScheduleDAGMI. llvm-svn: 254759	2015-12-04 19:54:24 +00:00
Sanjay Patel	2b56d370e1	don't repeat function names in comments; NFC llvm-svn: 254740	2015-12-04 17:54:31 +00:00
Manman Ren	107407fcfc	[CXX TLS calling convention] Add CXX TLS calling convention. This commit adds a new target-independent calling convention for C++ TLS access functions. It aims to minimize overhead in the caller by preserving as many registers as possible. The target-specific implementation for X86-64 is defined as following: Arguments are passed as for the default C calling convention The same applies for the return value(s) The callee preserves all GPRs - except RAX and RDI The access function makes C-style TLS function calls in the entry and exit block, C-style TLS functions save a lot more registers than normal calls. The added calling convention ties into the existing implementation of the C-style TLS functions, so we can't simply use existing calling conventions such as preserve_mostcc. rdar://9001553 llvm-svn: 254737	2015-12-04 17:40:13 +00:00
David Blaikie	3fd553d9d8	[llvm-dwp] Retrieve the DWOID from the CU for the cu_index entry llvm-svn: 254731	2015-12-04 17:20:04 +00:00
Rafael Espindola	b4f7892e3a	Modernize the C++ APIs for creating LTO modules. This is a continuation of r253367. These functions return is owned by the caller, so they return std::unique_ptr now. The call can fail, so the return is wrapped in ErrorOr. They have a context where to report diagnostics, so they don't need to take a string out parameter. With this there are no call to getGlobalContext in lib/LTO. llvm-svn: 254721	2015-12-04 16:14:31 +00:00
Rafael Espindola	b0476044d6	Move a call to getGlobalContext out of lib/LTO. llvm-svn: 254696	2015-12-04 02:42:28 +00:00
Lang Hames	84deb31665	[Orc] Rename JITCompileCallbackManagerBase to JITCompileCallbackManager. This class is turning into a useful interface, rather than an implementation detail, so I'm dropping the 'Base' suffix. No functional change. llvm-svn: 254693	2015-12-04 02:15:39 +00:00
Nathan Slingerland	5a941550ef	Revert "[llvm-profdata] Add support for weighted merge of profile data" This reverts commit b7250858d96b8ce567681214273ac0e62713c661. Reverting in order to investigate Windows test failure. llvm-svn: 254687	2015-12-04 02:13:58 +00:00
Matthias Braun	cdae83c48c	ScheduleDAGInstrs: Rework schedule graph builder. Re-committing with a change that avoids undefined uses getting put into the VRegUses list. The new algorithm remembers the uses encountered while walking backwards until a matching def is found. Contrary to the previous version this: - Works without LiveIntervals being available - Allows to increase the precision to subregisters/lanemasks (not used for now) The changes in the AMDGPU tests are necessary because the R600 scheduler is not stable with respect to the order of nodes in the ready queues. Differential Revision: http://reviews.llvm.org/D9068 llvm-svn: 254683	2015-12-04 01:51:19 +00:00
Matthias Braun	60703e1f97	raw_ostream: << operator for callables with raw_ostream argument This is a revised version of r254655 which uses a Printable wrapper class to avoid ambiguous overload problems. Differential Revision: http://reviews.llvm.org/D14348 llvm-svn: 254681	2015-12-04 01:31:59 +00:00
Xinliang David Li	799fa97d82	[PGO] Unify VP data format between raw and indexed profile (Reader) With the latest refactoring and code sharing patches landed, it is possible to unify the value profile implementation between raw and indexed profile. This is the patch in raw profile reader that uses the common interface. Differential Revision: http://reviews.llvm.org/D15056 llvm-svn: 254677	2015-12-04 01:02:10 +00:00
Cong Hou	cc25d3b7d5	Don't punish vectorized arithmetic instruction whose type will be split to multiple registers Currently in LLVM's cost model, a vectorized arithmetic instruction will have high cost if its type is split into multiple registers. However, this punishment is too heavy and unnecessary. The overhead of the split should not be on arithmetic instructions but instructions that implement the split. Note that during vectorization we have calculated the register pressure, and we only choose proper interleaving factor (and also vectorization factor) so that we don't use more registers than the maximum number. Here is a very simple example: if a vadd has the cost 1, and if we double VF so that we need two registers to perform it, then its cost will become 4 with the current implementation, which will prevent us to use larger VF. Differential revision: http://reviews.llvm.org/D15159 llvm-svn: 254671	2015-12-04 00:36:58 +00:00
Nathan Slingerland	e6205d97d7	[llvm-profdata] Add support for weighted merge of profile data This change adds support for an optional weight when merging profile data with the llvm-profdata tool. Weights are specified by adding an option ':<weight>' suffix to the input file names. Adding support for arbitrary weighting of input profile data allows for relative importance to be placed on the input data from multiple training runs. Both sampled and instrumented profiles are supported. Reviewers: dnovillo, bogner, davidxl Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D14547 llvm-svn: 254669	2015-12-04 00:00:20 +00:00
Kevin B. Smith	078e5ad0c3	[CodeGen] Minor correction to comment on PhysRegInfo. Differential revision: http://reviews.llvm.org/D15216 llvm-svn: 254668	2015-12-04 00:00:10 +00:00
Rafael Espindola	116a2b8530	Simplify since this function never fails. llvm-svn: 254667	2015-12-03 23:56:42 +00:00
Matthias Braun	a075ebf2eb	Revert "raw_ostream: << operator for callables with raw_stream argument" This commit provoked "error C2593: 'operator <<' is ambiguous" on MSVC. This reverts commit r254655. llvm-svn: 254661	2015-12-03 23:00:28 +00:00
David Majnemer	dc587eeed6	[Analysis] Become aware of MSVC's new/delete functions The compiler can take advantage of the allocation/deallocation function's properties. We knew how to do this for Itanium but had no support for MSVC-style functions. llvm-svn: 254656	2015-12-03 22:45:19 +00:00

25915 Commits