llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

History

Hao Liu d39edfda46 [LoopVectorize] Teach Loop Vectorizor about interleaved memory accesses. Interleaved memory accesses are grouped and vectorized into vector load/store and shufflevector. E.g. for (i = 0; i < N; i+=2) { a = A[i]; // load of even element b = A[i+1]; // load of odd element ... // operations on a, b, c, d A[i] = c; // store of even element A[i+1] = d; // store of odd element } The loads of even and odd elements are identified as an interleave load group, which will be transfered into vectorized IRs like: %wide.vec = load <8 x i32>, <8 x i32>* %ptr %vec.even = shufflevector <8 x i32> %wide.vec, <8 x i32> undef, <4 x i32> <i32 0, i32 2, i32 4, i32 6> %vec.odd = shufflevector <8 x i32> %wide.vec, <8 x i32> undef, <4 x i32> <i32 1, i32 3, i32 5, i32 7> The stores of even and odd elements are identified as an interleave store group, which will be transfered into vectorized IRs like: %interleaved.vec = shufflevector <4 x i32> %vec.even, %vec.odd, <8 x i32> <i32 0, i32 4, i32 1, i32 5, i32 2, i32 6, i32 3, i32 7> store <8 x i32> %interleaved.vec, <8 x i32>* %ptr This optimization is currently disabled by defaut. To try it by adding '-enable-interleaved-mem-accesses=true'. llvm-svn: 239291		2015-06-08 06:39:56 +00:00
..
ADT	[bpf] rename triple names bpf_be -> bpfeb	2015-06-05 16:11:14 +00:00
Analysis	[LoopVectorize] Teach Loop Vectorizor about interleaved memory accesses.	2015-06-08 06:39:56 +00:00
AsmParser	Pass a MemoryBufferRef when we can avoid taking ownership.	2014-08-26 21:49:01 +00:00
Bitcode	Replace push_back(Constructor(foo)) with emplace_back(foo) for non-trivial types	2015-05-29 19:43:39 +00:00
CodeGen	[LoopVectorize] Teach Loop Vectorizor about interleaved memory accesses.	2015-06-08 06:39:56 +00:00
Config	[omp] Add a configuration variable for the default OpenMP runtime.	2015-05-28 01:47:22 +00:00
DebugInfo	Re-unique_ptrify LoadedObjectInfo::clone after it was reverted due to some other changes that broke on GCC around the same time	2015-06-04 20:54:32 +00:00
ExecutionEngine	Re-unique_ptrify LoadedObjectInfo::clone after it was reverted due to some other changes that broke on GCC around the same time	2015-06-04 20:54:32 +00:00
IR	[Statepoints] Mark statepoint intrinsic with Throws attribute	2015-06-03 16:18:58 +00:00
IRReader	Pass a MemoryBufferRef when we can avoid taking ownership.	2014-08-26 21:49:01 +00:00
LineEditor	Use 'override/final' instead of 'virtual' for overridden methods	2015-04-11 02:11:45 +00:00
Linker	Linker: Add flag to override linkage rules	2015-04-22 04:11:00 +00:00
LTO	Make the C++ LTO API easier to use from C++ clients.	2015-06-01 20:08:30 +00:00
MC	[MC] Function naming NFC.	2015-06-07 20:29:37 +00:00
Object	llvm-ar: Move archive writer to Object.	2015-06-08 02:32:01 +00:00
Option	Use 'override/final' instead of 'virtual' for overridden methods	2015-04-11 02:11:45 +00:00
Passes	[PM] Create a separate library for high-level pass management code.	2015-03-07 09:02:36 +00:00
ProfileData	Tidy comments in SampleProfile header. NFC.	2015-05-12 22:03:00 +00:00
Support	TargetParser: Fix comments in enum(s) introduced in r239150. [-Wdocumentation]	2015-06-06 01:41:35 +00:00
TableGen	[TableGen] Use the SMLoc header file instead of SourceMgr header file in a couple places. NFC	2015-06-08 01:35:40 +00:00
Target	Add isLegalAddressingMode address space argument to TTI	2015-06-07 20:12:03 +00:00
Transforms	[GlobalMerge] Take into account minsize on Global users' parents.	2015-06-04 20:39:23 +00:00
CMakeLists.txt	Remove llvm_headers_do_not_build for the benefit of XCode and Visual Studio users.	2014-08-14 00:51:47 +00:00
InitializePasses.h	Resubmit r237954 (MIR Serialization: print and parse LLVM IR using MIR format).	2015-05-27 18:02:19 +00:00
LinkAllIR.h
LinkAllPasses.h	Add a speculative execution pass	2015-05-15 17:54:48 +00:00
module.modulemap	Fix modules build post-r235612.	2015-04-23 23:22:26 +00:00
module.modulemap.build
Pass.h	Use 'override/final' instead of 'virtual' for overridden methods	2015-04-11 02:11:45 +00:00
PassAnalysisSupport.h	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition.	2015-02-15 22:54:22 +00:00
PassInfo.h	Removing LLVM_DELETED_FUNCTION, as MSVC 2012 was the last reason for requiring the macro. NFC; LLVM edition.	2015-02-15 22:54:22 +00:00
PassRegistry.h	Revert r231276 (including r231277): Add a lock() function in PassRegistry to speed up multi-thread synchronization.	2015-03-05 17:53:00 +00:00
PassSupport.h	Defining a new API for debug options that doesn't rely on static global cl::opts.	2014-10-15 21:54:35 +00:00