llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-23 11:13:28 +01:00

Author	SHA1	Message	Date
Sanjoy Das	ce6d998be5	[OperandBundles] Refactor; NFCI Intended to make later changes simpler. Exposes `getBundleOperandsStartIndex` and `getBundleOperandsEndIndex`, and uses them for the computation in `getNumTotalBundleOperands`. llvm-svn: 252037	2015-11-04 04:31:06 +00:00
Alexey Samsonov	4cb73711bb	[LLVMSymbolize] Reduce indentation by using helper function. NFC. llvm-svn: 252022	2015-11-04 00:30:26 +00:00
Alexey Samsonov	27ddee3db7	[LLVMSymbolize] Properly propagate object parsing errors from the library. llvm-svn: 252021	2015-11-04 00:30:24 +00:00
Adam Nemet	933537bc6b	LLE 6/6: Add LoopLoadElimination pass Summary: The goal of this pass is to perform store-to-load forwarding across the backedge of a loop. E.g.: for (i) A[i + 1] = A[i] + B[i] => T = A[0] for (i) T = T + B[i] A[i + 1] = T The pass relies on loop dependence analysis via LoopAccessAnalisys to find opportunities of loop-carried dependences with a distance of one between a store and a load. Since it's using LoopAccessAnalysis, it was easy to also add support for versioning away may-aliasing intervening stores that would otherwise prevent this transformation. This optimization is also performed by Load-PRE in GVN without the option of multi-versioning. As was discussed with Daniel Berlin in http://reviews.llvm.org/D9548, this is inferior to a more loop-aware solution applied here. Hopefully, we will be able to remove some complexity from GVN/MemorySSA as a consequence. In the long run, we may want to extend this pass (or create a new one if there is little overlap) to also eliminate loop-indepedent redundant loads and store that require versioning due to may-aliasing intervening stores/loads. I have some motivating cases for store elimination. My plan right now is to wait for MemorySSA to come online first rather than using memdep for this. The main motiviation for this pass is the 456.hmmer loop in SPECint2006 where after distributing the original loop and vectorizing the top part, we are left with the critical path exposed in the bottom loop. Being able to promote the memory dependence into a register depedence (even though the HW does perform store-to-load fowarding as well) results in a major gain (~20%). This gain also transfers over to x86: it's around 8-10%. Right now the pass is off by default and can be enabled with -enable-loop-load-elim. On the LNT testsuite, there are two performance changes (negative number -> improvement): 1. -28% in Polybench/linear-algebra/solvers/dynprog: the length of the critical paths is reduced 2. +2% in Polybench/stencils/adi: Unfortunately, I couldn't reproduce this outside of LNT The pass is scheduled after the loop vectorizer (which is after loop distribution). The rational is to try to reuse LAA state, rather than recomputing it. The order between LV and LLE is not critical because normally LV does not touch scalar st->ld forwarding cases where vectorizing would inhibit the CPU's st->ld forwarding to kick in. LoopLoadElimination requires LAA to provide the full set of dependences (including forward dependences). LAA is known to omit loop-independent dependences in certain situations. The big comment before removeDependencesFromMultipleStores explains why this should not occur for the cases that we're interested in. Reviewers: dberlin, hfinkel Subscribers: junbuml, dberlin, mssimpso, rengolin, sanjoy, llvm-commits Differential Revision: http://reviews.llvm.org/D13259 llvm-svn: 252017	2015-11-03 23:50:08 +00:00
Adam Nemet	6d8e7bca0f	[LAA] LLE 5/6: Add predicate functions Dependence::isForward/isBackward, NFC Summary: Will be used by the LoopLoadElimination pass. Reviewers: hfinkel Subscribers: llvm-commits Differential Revision: http://reviews.llvm.org/D13258 llvm-svn: 252016	2015-11-03 23:50:03 +00:00
Adam Nemet	ea9a067ee3	[LAA] LLE 4/6: APIs to access the dependent instructions for a dependence, NFC Summary: The functions use LAI and MemoryDepChecker classes so they need to be defined after those definitions outside of the Dependence class. Will be used by the LoopLoadElimination pass. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13257 llvm-svn: 252015	2015-11-03 23:49:58 +00:00
Peter Collingbourne	284b8079b3	CodeGen, Target: Move Mach-O-specific symbol name logic to Mach-O lowering. A profile of an LTO link of Chrome revealed that we were spending some ~30-50% of execution time in the function Constant::getRelocationInfo(), which is called from TargetLoweringObjectFile::getKindForGlobal() and in turn from TargetMachine::getNameWithPrefix(). It turns out that we only need the result of getKindForGlobal() when targeting Mach-O, so this change moves the relevant part of the logic to TargetLoweringObjectFileMachO. NFCI. Differential Revision: http://reviews.llvm.org/D14168 llvm-svn: 252014	2015-11-03 23:40:03 +00:00
Alexey Samsonov	37b571b778	[LLVMSymbolize] Factor out the logic for printing structs from DIContext. NFC. Introduce DIPrinter which takes care of rendering DILineInfo and friends. This allows LLVMSymbolizer class to return a structured data instead of plain std::strings. llvm-svn: 251989	2015-11-03 22:20:52 +00:00
Adam Nemet	8ce9fb467e	[LAA] LLE 3/6: Rename InterestingDependence to Dependences, NFC Summary: We now collect all types of dependences including lexically forward deps not just "interesting" ones. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13256 llvm-svn: 251985	2015-11-03 21:39:52 +00:00
Alexey Samsonov	785313f5bc	[LLVMSymbolize] Move demangling away from printing routines. NFC. Make printDILineInfo and friends responsible for just rendering the contents of the structures, demangling should actually be performed earlier, when we have the information about the originating SymbolizableModule at hand. llvm-svn: 251981	2015-11-03 21:36:13 +00:00
Adam Nemet	b9c59b29d9	[LAA] LLE 2/6: Fix a NoDep case that should be a Forward dependence Summary: When the dependence distance in zero then we have a loop-independent dependence from the earlier to the later access. No current client of LAA uses forward dependences so other than potentially hitting the MaxDependences threshold earlier, this change shouldn't affect anything right now. This and the previous patch were tested together for compile-time regression. None found in LNT/SPEC. Reviewers: hfinkel Subscribers: rengolin, llvm-commits Differential Revision: http://reviews.llvm.org/D13255 llvm-svn: 251973	2015-11-03 20:13:43 +00:00
Rafael Espindola	5583e816fe	Delete dead code. llvm-svn: 251960	2015-11-03 18:55:58 +00:00
Rafael Espindola	7953bd5d91	Simplify local common output. We now create them as they are found and use higher level APIs. This is a step in avoiding creating unnecessary sections. llvm-svn: 251958	2015-11-03 18:50:51 +00:00
Rafael Espindola	7ec60e8686	Revert "Revert "[Orc] Directly emit machine code for the x86 resolver block and trampolines."" This reverts commit r251937. The test was updated to the new API, bring the API back. llvm-svn: 251944	2015-11-03 16:40:37 +00:00
Rafael Espindola	c04851c1d4	Revert "[Orc] Directly emit machine code for the x86 resolver block and trampolines." This reverts commit r251933. It broke the build of examples/Kaleidoscope/Orc/fully_lazy/toy.cpp. llvm-svn: 251937	2015-11-03 16:25:20 +00:00
Lang Hames	f8e78a5c05	[Orc] Directly emit machine code for the x86 resolver block and trampolines. Bypassing LLVM for this has a number of benefits: 1) Laziness support becomes asm-syntax agnostic (previously lazy jitting didn't work on Windows as the resolver block was in Darwin asm). 2) For cross-process JITs, it allows resolver blocks and trampolines to be emitted directly in the target process, reducing cross process traffic. 3) It should be marginally faster. llvm-svn: 251933	2015-11-03 16:10:18 +00:00
Michael Kuperstein	5991145dd5	[X86] Generate .cfi_adjust_cfa_offset correctly when pushing arguments When push instructions are being used to pass function arguments on the stack, and either EH or debugging are enabled, we need to generate .cfi_adjust_cfa_offset directives appropriately. For (synch) EH, it is enough for the CFA offset to be correct at every call site, while for debugging we want to be correct after every push. Darwin does not support this well, so don't use pushes whenever it would be required. Differential Revision: http://reviews.llvm.org/D13767 llvm-svn: 251904	2015-11-03 08:17:25 +00:00
Matthias Braun	a75cf73a70	ScheduleDAGInstrs: Remove IsPostRA flag; NFC ScheduleDAGInstrs doesn't behave differently before or after register allocation. It was only used in a method of MachineSchedulerBase which behaved differently in MachineScheduler/PostMachineScheduler. Change this to let MachineScheduler/PostMachineScheduler just pass in a parameter to that function. The order of the LiveIntervals* and bool RemoveKillFlags paramters have been switched to make out-of-tree code fail instead of unintentionally passing a value intended for the IsPostRA flag to the (previously following and default initialized) RemoveKillFlags. Differential Revision: http://reviews.llvm.org/D14245 llvm-svn: 251883	2015-11-03 01:53:29 +00:00
Rafael Espindola	703dd2fb10	This never returns end(), simplify to use Child instead of iterator. NFC. llvm-svn: 251876	2015-11-03 01:20:44 +00:00
Teresa Johnson	90b0eac682	Restore "Support for ThinLTO function importing and symbol linking." This restores commit r251837, with the new library dependence added to llvm-link/Makefile to address bot failures. llvm-svn: 251866	2015-11-03 00:14:15 +00:00
David Blaikie	0eb4964368	Fix the build I just broke llvm-svn: 251854	2015-11-02 23:10:52 +00:00
David Blaikie	aec8d8648f	Orc: Drop some else-after-return, reflow a few spots, and avoid use of pointee types llvm-svn: 251853	2015-11-02 23:09:38 +00:00
Teresa Johnson	077216c4b1	Revert "Support for ThinLTO function importing and symbol linking." This reverts commit r251837, due to a number of bot failures of the form: /home/grosser/buildslave/perf-x86_64-penryn-O3-polly-fast/llvm.obj/tools/llvm-link/Release+Asserts/llvm-link.o:llvm-link.cpp:function loadIndex(llvm::LLVMContext&, llvm::Module const): error: undefined reference to 'llvm::object::FunctionIndexObjectFile::create(llvm::MemoryBufferRef, llvm::LLVMContext&, llvm::Module const, bool)' /home/grosser/buildslave/perf-x86_64-penryn-O3-polly-fast/llvm.obj/tools/llvm-link/Release+Asserts/llvm-link.o:llvm-link.cpp:function loadIndex(llvm::LLVMContext&, llvm::Module const*): error: undefined reference to 'llvm::object::FunctionIndexObjectFile::takeIndex()' I'm not sure why these are happening - I added Object to the requred libraries in tools/llvm-link/LLVMBuild.txt and the LLVM_LINK_COMPONENTS in tools/llvm-link/CMakeLists.txt. Confirmed for my build that these symbols come out of libLLVMObject.a. What am I missing? llvm-svn: 251841	2015-11-02 22:17:32 +00:00
Teresa Johnson	30954fb923	Support for ThinLTO function importing and symbol linking. Summary: Support for necessary linkage changes and symbol renaming during ThinLTO function importing. Also includes llvm-link support for manually importing functions and associated llvm-link based tests. Note that this does not include support for intelligently importing metadata, which is currently imported duplicate times. That support will be in the follow-on patch, and currently is ignored by the tests. Reviewers: dexonsmith, joker.eph, davidxl Subscribers: tobiasvk, tejohnson, llvm-commits Differential Revision: http://reviews.llvm.org/D13515 llvm-svn: 251837	2015-11-02 21:39:10 +00:00
Reid Kleckner	84ff637db7	[Support] Assert that reported key+data lenghts match reality This found a bug in Clang's PTH implementation. llvm-svn: 251829	2015-11-02 20:49:29 +00:00
David Blaikie	a98b9cfa54	StringRef-ify DiagnosticInfoSampleProfile::Filename llvm-svn: 251823	2015-11-02 20:01:13 +00:00
Rafael Espindola	12412f318b	ELF can handle some relocations of the form -sym + constant. Remove code that was assuming that this would never work. Thanks to Colin LeMahie for finding and diagnosing the bug. llvm-svn: 251818	2015-11-02 19:13:59 +00:00
Teresa Johnson	1cf56803da	Clang format a few prior patches (NFC) I had clang formatted my earlier patches using the wrong style. Reformatted with the LLVM style. llvm-svn: 251812	2015-11-02 18:02:11 +00:00
Artur Pilipenko	0dd1f670a9	Preserve load alignment and dereferenceable metadata during some transformations Reviewed By: hfinkel Differential Revision: http://reviews.llvm.org/D13953 llvm-svn: 251809	2015-11-02 17:53:51 +00:00
Silviu Baranga	439f20d646	Add missing override statements in ScalarEvolution.h. NFC llvm-svn: 251805	2015-11-02 15:29:49 +00:00
Silviu Baranga	ab8c77a47d	[SCEV][LV] Add SCEV Predicates and use them to re-implement stride versioning Summary: SCEV Predicates represent conditions that typically cannot be derived from static analysis, but can be used to reduce SCEV expressions to forms which are usable for different optimizers. ScalarEvolution now has the rewriteUsingPredicate method which can simplify a SCEV expression using a SCEVPredicateSet. The normal workflow of a pass using SCEVPredicates would be to hold a SCEVPredicateSet and every time assumptions need to be made a new SCEV Predicate would be created and added to the set. Each time after calling getSCEV, the user will call the rewriteUsingPredicate method. We add two types of predicates SCEVPredicateSet - implements a set of predicates SCEVEqualPredicate - tests for equality between two SCEV expressions We use the SCEVEqualPredicate to re-implement stride versioning. Every time we version a stride, we will add a SCEVEqualPredicate to the context. Instead of adding specific stride checks, LoopVectorize now adds a more generic SCEV check. We only need to add support for this in the LoopVectorizer since this is the only pass that will do stride versioning. Reviewers: mzolotukhin, anemet, hfinkel, sanjoy Subscribers: sanjoy, hfinkel, rengolin, jmolloy, llvm-commits Differential Revision: http://reviews.llvm.org/D13595 llvm-svn: 251800	2015-11-02 14:41:02 +00:00
James Molloy	bcb542b66c	[PatternMatch] Switch to use ValueTracking::matchSelectPattern Instead of rolling our own min/max matching code (which is notoriously hard to get completely right), use ValueTracking's instead. llvm-svn: 251785	2015-11-02 09:54:00 +00:00
Pawel Bylica	669b84345b	[Support] Extend sys::path with user_cache_directory function. Summary: The new function sys::path::user_cache_directory tries to discover a directory suitable for cache storage for current system user. On Windows and Darwin it returns a path to system-specific user cache directory. On Linux it follows XDG Base Directory Specification, what is: - use non-empty $XDG_CACHE_HOME env var, - use $HOME/.cache. Reviewers: chapuni, aaron.ballman, rafael Subscribers: rafael, aaron.ballman, llvm-commits Differential Revision: http://reviews.llvm.org/D13801 llvm-svn: 251784	2015-11-02 09:49:17 +00:00
Igor Breger	dd070c17bb	AVX512: Implemented encoding and intrinsics for VBROADCASTI32x2 and VBROADCASTF32x2 instructions. Differential Revision: http://reviews.llvm.org/D14216 llvm-svn: 251781	2015-11-02 07:39:36 +00:00
Craig Topper	4f55fa704b	Fix a -Wpessimizing-move warning. llvm-svn: 251773	2015-11-02 05:24:28 +00:00
Xinliang David Li	1ef7153d55	[PGO] Value profiling (index format) code cleanup and testing 1. Added a set of public interfaces in InstrProfRecord class to access (read/write) value profile data. 2. Changed IndexedProfile reader and writer code to use the newly defined interfaces and hide implementation details. 3. Added a couple of unittests for value profiling: - Test new interfaces to get and set value profile data - Test value profile data merging with various scenarios. No functional change is expected. The new interfaces will also make it possible to change on-disk format of value prof data to be more compact (to be submitted). llvm-svn: 251771	2015-11-02 05:08:23 +00:00
Rafael Espindola	38d8e625e7	Use Child instead of child_iterator in the archive writer. We never need to pass end(). This will also remove some complication once we start adding error checking. llvm-svn: 251758	2015-11-01 00:10:37 +00:00
Rafael Espindola	52f24bb108	Don't store a Child to the first regular member. This is a bit ugly, but has a few advantages: * Archive is now easy to copy since there is no Archive -> Child -> Archive loop. * It makes it clear that we already checked for errors when finding the Child data. llvm-svn: 251750	2015-10-31 21:44:42 +00:00
Rafael Espindola	d268dd1895	Delete dead code. llvm-svn: 251749	2015-10-31 21:16:01 +00:00
Rafael Espindola	5f017c0eef	Simplify handling of archive Symbol tables. We only need to store a StringRef. llvm-svn: 251748	2015-10-31 21:03:29 +00:00
Rafael Espindola	f0f9bc5a37	Simplify the handling of the archive string table. We only need to store a StringRef llvm-svn: 251746	2015-10-31 20:06:13 +00:00
Lang Hames	885f548700	Add a sys::OwningMemoryBlock class, which is a sys::MemoryBlock that owns its underlying memory, and will automatically release it on destruction. Use this to tidy up the orc::IndirectStubsInfo class. llvm-svn: 251731	2015-10-31 00:55:32 +00:00
Justin Bogner	4a77d84be2	[PM] Port StripDeadPrototypes to the new pass manager This is a really straightforward port. Also adds a test for the pass, since it only seemed to be tested tangentially before. llvm-svn: 251726	2015-10-30 23:28:12 +00:00
Justin Bogner	cc016f8933	[PM] Port ADCE to the new pass manager llvm-svn: 251725	2015-10-30 23:13:18 +00:00
Justin Bogner	7d746e1b5b	Whitespace. NFC llvm-svn: 251724	2015-10-30 23:02:38 +00:00
Justin Bogner	5c07f819b9	PM: Print the IR unit's name in debug output. NFC llvm-svn: 251723	2015-10-30 22:58:15 +00:00
Lang Hames	d06d3863b2	[Orc] Expose the compile callback API through the C bindings. llvm-svn: 251683	2015-10-30 03:20:21 +00:00
Davide Italiano	48bdd477c4	[MC] Make another header NDEBUG-free. llvm-svn: 251679	2015-10-30 01:25:50 +00:00
Alexey Samsonov	0b64fb3077	Let the users of LLVMSymbolizer decide whether they want to symbolize inlined frames. Introduce LLVMSymbolizer::symbolizeInlinedCode() instead of switching on PrintInlining option passed to the constructor. This will be needed once we retrun structured data (instead of std::string) from LLVMSymbolizer and move printing logic out. llvm-svn: 251675	2015-10-30 00:40:20 +00:00
NAKAMURA Takumi	09526908d4	llvm/ExecutionEngine/Orc/LogicalDylib.h: Satisfy Modules. llvm-svn: 251674	2015-10-30 00:38:01 +00:00

1 2 3 4 5 ...

25284 Commits