llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-22 12:33:33 +02:00

Author	SHA1	Message	Date
Vedant Kumar	2e1a683bae	Revert "Reapply "[IR] Move optional data in llvm::Function into a hungoff uselist"" This reverts commit r256093. This broke lld-x86_64-win7 because of -Werror,-Wc++1y-extensions. llvm-svn: 256094	2015-12-19 08:48:43 +00:00
Vedant Kumar	c33a34516e	Reapply "[IR] Move optional data in llvm::Function into a hungoff uselist" Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Includes a fix to scrub value subclass data in dropAllReferences. Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256093	2015-12-19 08:29:51 +00:00
Vedant Kumar	6843b30188	Revert "[IR] Move optional data in llvm::Function into a hungoff uselist" This reverts commit r256090. This broke llvm-clang-lld-x86_64-debian-fast. llvm-svn: 256091	2015-12-19 07:30:44 +00:00
Vedant Kumar	46b3967fa2	[IR] Move optional data in llvm::Function into a hungoff uselist Make personality functions, prefix data, and prologue data hungoff operands of Function. This is based on the email thread "[RFC] Clean up the way we store optional Function data" on llvm-dev. Thanks to sanjoyd, majnemer, rnk, loladiro, and dexonsmith for feedback! Differential Revision: http://reviews.llvm.org/D13829 llvm-svn: 256090	2015-12-19 07:08:56 +00:00
Teresa Johnson	0dce8d436c	[ThinLTO] Metadata linking for imported functions Summary: Second patch split out from http://reviews.llvm.org/D14752. Maps metadata as a post-pass from each module when importing complete, suturing up final metadata to the temporary metadata left on the imported instructions. This entails saving the mapping from bitcode value id to temporary metadata in the importing pass, and from bitcode value id to final metadata during the metadata linking postpass. Depends on D14825. Reviewers: dexonsmith, joker.eph Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D14838 llvm-svn: 255909	2015-12-17 17:14:09 +00:00
Rafael Espindola	f7a0054c75	Change linkInModule to take a std::unique_ptr. Passing in a std::unique_ptr should help find errors when the module is used after being linked into another module. llvm-svn: 255842	2015-12-16 23:16:33 +00:00
Justin Bogner	58647df890	LPM: Make callers of LPM.deleteLoopFromQueue update LoopInfo directly. NFC As of r255720, the loop pass manager will DTRT when passes update the loop info for removed loops, so they no longer need to reach into LPPassManager APIs to do this kind of transformation. This change very nearly removes the need for the LPPassManager to even be passed into loop passes - the only remaining pass that uses the LPM argument is LoopUnswitch. llvm-svn: 255797	2015-12-16 18:40:20 +00:00
Richard Trieu	9af860a927	Remove one of the void casts used to suppress unused variable warning. llvm-svn: 255709	2015-12-15 23:47:17 +00:00
Evgeniy Stepanov	493b24312f	Suppress unused variable warning in the no-asserts build. llvm-svn: 255706	2015-12-15 23:30:29 +00:00
Richard Trieu	bf34638e13	Cast variable to void to resolve unused variable warning in non-asserts builds. llvm-svn: 255704	2015-12-15 23:25:34 +00:00
Evgeniy Stepanov	39e538e166	Cross-DSO control flow integrity (LLVM part). An LTO pass that generates a __cfi_check() function that validates a call based on a hash of the call-site-known type and the target pointer. llvm-svn: 255693	2015-12-15 23:00:08 +00:00
James Molloy	fb86086405	[PassManagerBuilder] Add a few more scalar optimization passes This patch does two things: 1. mem2reg is now run immediately after globalopt. Now that globalopt can localize variables more aggressively, it makes sense to lower them to SSA form earlier rather than later so they can benefit from the full set of optimization passes. 2. More scalar optimizations are run after the loop optimizations in LTO mode. The loop optimizations (especially indvars) can clean up scalar code sufficiently to make it worthwhile running more scalar passes. I've particularly added SCCP here as it isn't run anywhere else in the LTO pass pipeline. Mem2reg is super cheap and shouldn't affect compilation time at all. The rest of the added passes are in the LTO pipeline only, so they don't affect the vast majority of compilations, just the link step. llvm-svn: 255634	2015-12-15 09:24:01 +00:00
Rafael Espindola	2d1739bf50	A better attempt to add a missing include llvm-svn: 255578	2015-12-14 23:34:35 +00:00
Rafael Espindola	5f225be291	Trying to fix the build in a bot. llvm-svn: 255577	2015-12-14 23:31:08 +00:00
Rafael Espindola	5b397256de	Use diagnostic handler in the LLVMContext This patch converts code that has access to a LLVMContext to not take a diagnostic handler. This has a few advantages * It is easier to use a consistent diagnostic handler in a single program. * Less clutter since we are not passing a handler around. It does make it a bit awkward to implement some C APIs that return a diagnostic string. I will propose new versions of these APIs and deprecate the current ones. llvm-svn: 255571	2015-12-14 23:17:03 +00:00
Sanjoy Das	987d70ed26	[MergeFunctions] Use II instead of CI for InvokeInst; NFC Using `CI` is slightly misleading. llvm-svn: 255529	2015-12-14 19:11:45 +00:00
Sanjoy Das	97417780af	Teach MergeFunctions about operand bundles llvm-svn: 255528	2015-12-14 19:11:40 +00:00
Diego Novillo	9bbd13f9a0	SamplePGO - Reduce memory utilization by 10x. DenseMap is the wrong data structure to use for sample records and call sites. The keys are too large, causing massive core memory growth when reading profiles. Before this patch, a 21Mb input profile was causing the compiler to grow to 3Gb in memory. By switching to std::map, the compiler now grows to 300Mb in memory. There still are some opportunities for memory footprint reduction. I'll be looking at those next. llvm-svn: 255389	2015-12-11 23:21:38 +00:00
Artur Pilipenko	8b6635b2f6	PruneEH pass incorrectly reports that a change was made Reviewed By: reames Differential Revision: http://reviews.llvm.org/D14097 llvm-svn: 255343	2015-12-11 16:30:26 +00:00
Teresa Johnson	087bc3b677	[ThinLTO] Debug message cleanup (NFC) Added some missing spaces between the module identifier and the start of the debug message. Also added a ":" after the module identifier to make this look a little nicer. llvm-svn: 255259	2015-12-10 16:39:07 +00:00
Sanjoy Das	d85ded90d0	Add arg_begin() and arg_end() to CallInst and InvokeInst; NFCI - This simplifies the CallSite class, arg_begin / arg_end are now simple wrapper getters. - In several places, we were creating CallSite instances solely to call arg_begin and arg_end. With this change, that's no longer required. llvm-svn: 255226	2015-12-10 06:39:02 +00:00
Rafael Espindola	2e47184a91	Don't assign a temporary string to a StringRef. Should fix the windows debug and asan bots. llvm-svn: 255149	2015-12-09 20:41:10 +00:00
Teresa Johnson	83a7df21b2	[ThinLTO] FunctionImport pass can take a const index pointer (NFC) llvm-svn: 255140	2015-12-09 19:39:47 +00:00
Mehdi Amini	b282e7bd00	The current importing scheme is processing one function at a time, loading the source Module, linking the function in the destination module, and destroying the source Module before repeating with the next function to import (potentially from the same Module). Ideally we would keep the source Module alive and import the next Function needed from this Module. Unfortunately this is not possible because the linker does not leave it in a usable state. However we can do better by first computing the list of all candidates per Module, and only then loading the source Module and importing all the functions we need from it. The trick to process callees is to materialize functions in the source module when building the list of functions to import, and inspect them in their source module, collecting the list of callees for each callee. When we move to the actual import, we will import from each source module exactly once. Each source module is loaded exactly once. The only drawback is that it requires having all the lazy-loaded source Modules in memory at the same time. Currently this patch already considerably improves the link time; a multithreaded link of llvm-dis on my laptop was: real 1m12.175s user 6m32.430s sys 0m10.529s and is now: real 0m40.697s user 2m10.237s sys 0m4.375s Note: this is the full link time (linker+Import+Optimizer+CodeGen) Differential Revision: http://reviews.llvm.org/D15178 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255100	2015-12-09 08:17:35 +00:00
Sanjoy Das	cb770fbcb6	[OperandBundles] Have PruneEH work correctly with operand bundles. For an invoke with operand bundles, the [op_begin(), op_end()-3] range can contain things other than invoke arguments. This change teaches PruneEH to use arg_begin() and arg_end() explicitly. llvm-svn: 255073	2015-12-08 23:16:52 +00:00
Mehdi Amini	5d4cc87b91	Fix/Improve Debug print in FunctionImport pass From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255071	2015-12-08 23:04:19 +00:00
Mehdi Amini	ba2c064383	Remove caching in FunctionImport: a Module can't be reused after being linked from. The Linker destroys the source module (API change coming to make it explicit). From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 255064	2015-12-08 22:39:40 +00:00
Teresa Johnson	1fb89d62fb	[ThinLTO] Support for specifying function index from pass manager Summary: Add a field on the PassManagerBuilder that clang or gold can use to pass down a pointer to the function index in memory to use for importing when the ThinLTO backend is triggered. Add support to supply this to the function import pass. Reviewers: joker.eph, dexonsmith Subscribers: davidxl, llvm-commits, joker.eph Differential Revision: http://reviews.llvm.org/D15024 llvm-svn: 254926	2015-12-07 19:21:11 +00:00
Mehdi Amini	e0e1d33bec	clang-format FunctionImport after refactoring (NFC) From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254585	2015-12-03 02:58:14 +00:00
Mehdi Amini	30d3bd787c	Refactor FunctionImporter::importFunctions with a helper function to process the Worklist (NFC) This precedes some more functional changes to perform bulk imports. From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254583	2015-12-03 02:37:33 +00:00
David Majnemer	56dee65385	Move EH-specific helper functions to a more appropriate place No functionality change is intended. llvm-svn: 254562	2015-12-02 23:06:39 +00:00
Mehdi Amini	34766825ef	Change ModuleLinker to take a set of GlobalValues to import instead of a single one For efficiency reasons, when importing multiple functions from the same Module, we can avoid reparsing it every time. Differential Revision: http://reviews.llvm.org/D15102 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254486	2015-12-02 04:34:28 +00:00
Mehdi Amini	1909ee53b2	Modify FunctionImport to take a callback to load modules When linking a static archive, there are no individual module files to load. Instead they can be mmap'ed and could be initialized from a buffer directly. The callback provides flexibility to override the scheme for loading modules from the summary. Differential Revision: http://reviews.llvm.org/D15101 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 254479	2015-12-02 02:00:29 +00:00
Rafael Espindola	e3fda2ca99	Use references now that it is natural to do so. The linker never takes ownership of a module or changes which module it is referring to, making it natural to use references. llvm-svn: 254449	2015-12-01 19:50:54 +00:00
Teresa Johnson	0bc8d23948	[ThinLTO] Wrap dbgs() output in DEBUG macro Missed in a couple places. llvm-svn: 254422	2015-12-01 17:12:10 +00:00
Teresa Johnson	0e55603ffe	[ThinLTO] Remove stale comment (NFC) Stale as of r254036 which added basic profitability check. llvm-svn: 254421	2015-12-01 16:45:23 +00:00
Diego Novillo	d74fdde81f	SamplePGO - Do not use std::to_string in diagnostics. This fixes buildbots on systems where std::to_string is not present. It also tidies the output of the diagnostic to render doubles a bit better (thanks Ben Kramer for help with string streams and format). llvm-svn: 254261	2015-11-29 18:23:26 +00:00
Diego Novillo	d08de97276	SamplePGO - Add initial support for inliner annotations. This adds two thresholds to the sample profiler to affect inlining decisions: the concept of global hotness and coldness. Functions that have accumulated more than a certain fraction of samples at runtime, are annotated with the InlineHint attribute. Conversely, functions that accumulate less than a certain fraction of samples, are annotated with the Cold attribute. This is very similar to the hints emitted by Clang when using instrumentation profiles. Notice that this is a very blunt instrument. A function may have globally collected a significant fraction of samples, but that does not necessarily mean that every callsite for that function is hot. Ideally, we would annotate each callsite with the samples collected at that callsite. This way, the inliner can incorporate all these weights into its cost model. Once the inliner offers this functionality, we can change the hints emitted here to a more precise per-callsite annotation. For now, this is providing some measure of speedups with our internal benchmarks. I've observed speedups of up to 23% (though the geo mean is about 3%). I expect these numbers to improve as the inliner gets better annotations. llvm-svn: 254212	2015-11-27 23:14:51 +00:00
Diego Novillo	c52a667205	SamplePGO - Fix default threshold for hot callsites. Based on testing of internal benchmarks, I'm lowering this threshold to a value of 0.1%. This means that SamplePGO will respect 99.9% of the original inline decisions when following a profile. The performance difference is noticeable in some tests. With the previous threshold, the speedup over baseline -O2 was about 0.63%. With the new default, the speedups are around 3% on average. The point of this threshold is not to do more aggressive inlining. When an inlined callsite crosses this threshold, SamplePGO will redo the inline decision so that it can better apply the input profile. By respecting most original inline decisions, we can apply more of the input profile because the shape of the code follows the profile more closely. In the next series, I'll be looking at adding some inline hints for the cold callsites and for toplevel functions that are hot/cold as well. llvm-svn: 254211	2015-11-27 23:14:49 +00:00
Rafael Espindola	d215bba299	Disallow aliases to available_externally. They are as much trouble as aliases to declarations. They are requiring the code generator to define a symbol with the same value as another symbol, but the second symbol is undefined. If representing this is important for some optimization, we could add support for available_externally aliases. They would be required to point to a declaration (or available_externally definition). llvm-svn: 254170	2015-11-26 19:22:59 +00:00
Rong Xu	c4f897c441	[PGO] Revert revision r254021,r254028,r254035 Revert the above revision due to multiple issues. llvm-svn: 254040	2015-11-24 23:49:08 +00:00
Teresa Johnson	cbf6e0bf1b	[ThinLTO] Add option to limit importing based on instruction count Add a simple initial heuristic to control importing based on the number of instructions recorded in the function's summary. Add option to control the limit, and test using option. llvm-svn: 254036	2015-11-24 22:55:46 +00:00
Diego Novillo	2b7c3c54ab	SamplePGO - Add test for hot/cold inlined functions. When the original binary is executed and sampled, the resulting profile contains information on the original inline stack. We currently follow the original inline plan if we notice that the inlined callsite has more than 0 samples to it. A better way is to determine whether the callsite is actually worth inlining. If the callsite accumulates a small fraction of the samples spent in the parent function, then we don't want to bother inlining it (as it means that the callsite is actually cold). This patch introduces a threshold expressed in percentage of samples in relation to the parent function. If the callsite uses less than N% of the total samples used by its parent, the original inline decision is not re-applied. I've set the threshold to the very arbitrary value of 5%. I'm yet to do any actual experiments to see what's a good value. I wanted to separate the basic mechanism from the tuning. llvm-svn: 254034	2015-11-24 22:38:37 +00:00
Rong Xu	025bf7be0c	[PGO] MST based PGO instrumentation infrastructure This patch implements a minimum spanning tree (MST) based instrumentation for PGO. The use of MST guarantees a minimum number of CFG edges getting instrumented. An additional optimization is to instrument the less executed edges to further reduce the instrumentation overhead. The patch contains both the instrumentation and the use of the profile to set the branch weights. Differential Revision: http://reviews.llvm.org/D12781 llvm-svn: 254021	2015-11-24 21:31:25 +00:00
Teresa Johnson	697f6bcd05	[ThinLTO] Refactor function body scan during importing into helper (NFC) llvm-svn: 254020	2015-11-24 21:15:19 +00:00
Teresa Johnson	a3214913e6	[ThinLTO] Enable iterative importing in FunctionImport pass Analyze imported function bodies and add any new external calls to the worklist for importing. Currently no controls on the importing so this will end up importing everything possible in the call tree below the importing module. Basic profitability checks coming next. Update test to check for iteratively inlined functions. llvm-svn: 254011	2015-11-24 19:55:04 +00:00
Teresa Johnson	9c0a1779ce	[ThinLTO] Fix FunctionImport alias checking and test Skip imports for weak_any aliases as well. Fix the test to check non-import of weak aliases and functions, and import of normal alias. llvm-svn: 253991	2015-11-24 16:10:43 +00:00
Ismail Donmez	266a7da4e3	Fix build after r253954 llvm-svn: 253969	2015-11-24 09:48:09 +00:00
Mehdi Amini	2fe02188ef	Add a FunctionImporter helper to perform summary-based cross-module function importing Summary: This is a helper to perform cross-module import for ThinLTO. Right now it is naively importing every possible called function. Reviewers: tejohnson Subscribers: dexonsmith, llvm-commits Differential Revision: http://reviews.llvm.org/D14914 From: Mehdi Amini <mehdi.amini@apple.com> llvm-svn: 253954	2015-11-24 06:07:49 +00:00
Diego Novillo	d28d079aa7	SamplePGO - Add coverage tracking for samples. The existing coverage tracker counts the number of records that were used from the input profile. An alternative view of coverage is to check how many available samples were applied. This way, if the profile contains several records with few samples, it doesn't really matter much that they were not applied. The more interesting records to apply are the ones that contribute many samples. llvm-svn: 253912	2015-11-23 20:12:21 +00:00
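
The MST-based instrumentation idea described in commit 025bf7be0c (r254021) can be illustrated with a small sketch. This is not the code from that patch. It is a minimal Python illustration, assuming the CFG is given as a list of (src, dst, weight) tuples where weight is a hypothetical estimate of how often the edge executes, and assuming edges are treated as undirected when building the tree. Edges are considered heaviest first, so frequently executed edges end up on the spanning tree and only the remaining, less executed edges receive counters. The function name edges_to_instrument and the example CFG are made up for illustration.

```python
def edges_to_instrument(num_nodes, edges):
    """Return the CFG edges that are NOT on a maximum-weight spanning tree.

    edges is a list of (src, dst, weight) tuples; nodes are 0..num_nodes-1.
    Only the returned edges would need a counter in this sketch.
    """
    parent = list(range(num_nodes))  # union-find forest, one set per node

    def find(x):
        # Find the set representative, halving the path as we go.
        while parent[x] != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    instrumented = []
    # Kruskal's algorithm, heaviest edges first, so hot edges join the tree.
    for src, dst, weight in sorted(edges, key=lambda e: e[2], reverse=True):
        a, b = find(src), find(dst)
        if a == b:
            # This edge would close a cycle, so it stays off the tree
            # and is the one that gets a counter.
            instrumented.append((src, dst, weight))
        else:
            parent[a] = b  # edge joins the spanning tree

    return instrumented


# Hypothetical diamond-shaped CFG: 0 -> {1, 2} -> 3, with a hot left path.
cfg = [(0, 1, 90), (0, 2, 10), (1, 3, 90), (2, 3, 10)]
print(edges_to_instrument(4, cfg))
```

For a connected graph with V nodes and E edges, this leaves E - (V - 1) edges to instrument, which is the "minimum number of CFG edges" property the commit message refers to. In this sketch the counts for tree edges are assumed to be recoverable afterwards from the instrumented counts by flow conservation.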


2324 Commits