llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-26 06:22:56 +02:00

Author	SHA1	Message	Date
Tim Northover	ca6435867c	GlobalISel: add generic conditional branch. Just the basic equivalent to DAG's condbr for now, we'll get to things like br_cc when we start doing more legalization. llvm-svn: 277184	2016-07-29 17:58:00 +00:00
Krzysztof Parzyszek	3adce2360c	[Hexagon] Testcase for not merging stores into a misaligned store The DAG combiner will try to merge consecutive stores into a bigger store, unless the resulting store is not fast. Misaligned vector stores are allowed on Hexagon, but are not fast. Add a testcase to make sure this type of merging does not occur. Patch by Pranav Bhandarkar. llvm-svn: 277182	2016-07-29 17:55:37 +00:00
Krzysztof Parzyszek	3a5bc2df22	Revert r277178, the actual change had already been applied Will submit another patch with the testcase only. llvm-svn: 277180	2016-07-29 17:50:47 +00:00
Krzysztof Parzyszek	f5f51e9c74	[Hexagon] Misaligned loads and stores are not fast The DAG combiner tries to merge stores to adjacent vector wide memory locations by creating stores which are integral multiples of the vector width. Discourage this by informing it that this is slow. This should not affect legalization passes, because all of them ignore the "Fast" argument. Patch by Pranav Bhandarkar. llvm-svn: 277178	2016-07-29 17:45:16 +00:00
Kevin Enderby	46ecdee89c	The next step along the way to getting good error messages for bad archives. As mentioned in commit log for r276686 this next step is adding a new method in the ArchiveMemberHeader class to get the full name that does proper error checking, and can be use for error messages. To do this the name of ArchiveMemberHeader::getName() is changed to ArchiveMemberHeader::getRawName() to be consistent with Archive::Child::getRawName(). Then the “new” method is the addition of a new implementation of ArchiveMemberHeader::getName() which gets the full name and provides proper error checking. Which is mostly a rewrite of what was Archive::Child::getName() and cleaning up incorrect uses of llvm_unreachable() in the code which were actually just cases of errors in the input Archives. Then Archive::Child::getName() is changed to return Expected<> and use the new implementation of ArchiveMemberHeader::getName() . Also needed to change Archive::getMemoryBufferRef() with these changes to return Expected<> as well to propagate Errors up. As well as changing Archive::isThinMember() to return Expected<> . llvm-svn: 277177	2016-07-29 17:44:13 +00:00
Tim Northover	2f025adfb6	CodeGen: improve MachineInstrBuilder & MachineIRBuilder interface For MachineInstrBuilder, having to manually use RegState::Define is ugly and makes register definitions clunkier than they need to be, so this adds two convenience functions: addDef and addUse. For MachineIRBuilder, we want to avoid BuildMI's first-reg-is-def rule because it's hidden away and causes bugs. So this patch switches buildInstr to returning a MachineInstrBuilder and adding all operands via addDef/addUse. NFC. llvm-svn: 277176	2016-07-29 17:43:52 +00:00
Ahmed Bougacha	999a09ced0	[AArch64][GlobalISel] Select G_XOR. llvm-svn: 277173	2016-07-29 16:56:25 +00:00
Ahmed Bougacha	78ac7a57a6	[GlobalISel] Add G_XOR. llvm-svn: 277172	2016-07-29 16:56:20 +00:00
Ahmed Bougacha	bedaf830ba	[AArch64][GlobalISel] Select G_LOAD/G_STORE. Mostly straightforward as we ignore addressing modes and just use the base + unsigned immediate offset (always 0) variants. This currently fails to select extloads because we have yet to agree on a representation. llvm-svn: 277171	2016-07-29 16:56:16 +00:00
Ahmed Bougacha	04e36d2a3b	[GlobalISel] Add LLT raw_ostream operator<< overload. Helpful when debugging; will be used in the following commit. llvm-svn: 277170	2016-07-29 16:56:12 +00:00
Brendon Cahoon	e37295579e	MachinePipeliner pass that implements Swing Modulo Scheduling Software pipelining is an optimization for improving ILP by overlapping loop iterations. Swing Modulo Scheduling (SMS) is an implementation of software pipelining that attempts to reduce register pressure and generate efficient pipelines with a low compile-time cost. This implementaion of SMS is a target-independent back-end pass. When enabled, the pass should run just prior to the register allocation pass, while the machine IR is in SSA form. If the pass is successful, then the original loop is replaced by the optimized loop. The optimized loop contains one or more prolog blocks, the pipelined kernel, and one or more epilog blocks. This pass is enabled for Hexagon only. To enable for other targets, a couple of target specific hooks must be implemented, and the pass needs to be called from the target's TargetMachine implementation. Differential Review: http://reviews.llvm.org/D16829 llvm-svn: 277169	2016-07-29 16:44:44 +00:00
Krzysztof Parzyszek	ce9680792b	[Hexagon] Custom lower VECTOR_SHUFFLE and EXTRACT_SUBVECTOR for HVX If the mask of a vector shuffle has alternating odd or even numbers starting with 1 or 0 respectively up to the largest possible index for the given type in the given HVX mode (single of double) we can generate vpacko or vpacke instruction respectively. E.g. %42 = shufflevector <32 x i16> %37, <32 x i16> %41, <32 x i32> <i32 1, i32 3, ..., i32 63> is %42.h = vpacko(%41.w, %37.w) Patch by Pranav Bhandarkar. llvm-svn: 277168	2016-07-29 16:44:27 +00:00
Matt Masten	6edcc04bd0	Initial support for vectorization using svml (short vector math library). Differential Revision: https://reviews.llvm.org/D19544 llvm-svn: 277166	2016-07-29 16:42:44 +00:00
Ahmed Bougacha	16e33791e3	[GlobalISel] Auto-brief LowLevelType. NFC. llvm-svn: 277163	2016-07-29 16:11:06 +00:00
Ahmed Bougacha	9e6a1992f0	[GlobalISel] Add LLT::operator!=(). llvm-svn: 277162	2016-07-29 16:11:04 +00:00
Ahmed Bougacha	478610d92e	[GlobalISel] Fix LLT::unsized to match LLT(LabelTy). When coming from an IR label type, we set a 0 NumElements, but not when constructing an LLT using unsized(), causing comparisons to fail. Pick one variant and fix the other. llvm-svn: 277161	2016-07-29 16:11:02 +00:00
Ahmed Bougacha	d3a23757cf	[GlobalISel] Add unittests for LowLevelType. llvm-svn: 277160	2016-07-29 16:10:57 +00:00
Paul Robinson	7b5e62b34b	Reinstate optnone test for GVN Hoisting, removed in r276479. llvm-svn: 277158	2016-07-29 16:05:50 +00:00
Nirav Dave	34b6eb75b8	Remove inline-comment-2.ll until I can debug why it fails on some builds llvm-svn: 277152	2016-07-29 15:24:06 +00:00
Krzysztof Parzyszek	4ca53a9c57	[Hexagon] Improve balancing of address calculation Rebalances address calculation trees and applies Hexagon-specific optimizations to the trees to improve instruction selection. Patch by Tobias Edler von Koch. llvm-svn: 277151	2016-07-29 15:15:35 +00:00
Nirav Dave	c5d7dc7ef7	Fix inline-comment-2.ll triple llvm-svn: 277149	2016-07-29 15:12:00 +00:00
David L Kreitzer	717b5d713d	Avoid unnecessary 32-bit to 64-bit zero extensions following 32-bit CMOV instructions on x86_64. The 32-bit CMOV implicitly zero extends. Differential Revision: https://reviews.llvm.org/D22941 llvm-svn: 277148	2016-07-29 15:09:54 +00:00
Nirav Dave	29e4234fc7	[MC] When emitting output hash comments always use standard line comment seperator llvm-svn: 277146	2016-07-29 14:42:00 +00:00
Krzysztof Parzyszek	f6c7e61c5f	Fix license information in the file header llvm-svn: 277145	2016-07-29 14:04:17 +00:00
Krzysztof Parzyszek	6ac252b6e2	Add missing files to r277143 llvm-svn: 277144	2016-07-29 13:59:55 +00:00
Krzysztof Parzyszek	ef4e9bde37	[Hexagon] Implement DFA based hazard recognizer The post register allocator scheduler can generate poor schedules because the scoreboard hazard recognizer is unable to identify hazards for Hexagon precisely. Instead, Hexagon should use a DFA based hazard recognizer. Patch by Brendon Cahoon. llvm-svn: 277143	2016-07-29 13:59:09 +00:00
Daniel Sanders	92f45597fa	Re-commit: [mips][fastisel] Handle 0-4 arguments without SelectionDAG. Summary: Implements fastLowerArguments() to avoid the need to fall back on SelectionDAG for 0-4 argument functions that don't do tricky things like passing double in a pair of i32's. This allows us to move all except one test to -fast-isel-abort=3. The remaining one has function prototypes of the form 'i32 (i32, double, double)' which requires floats to be passed in GPR's. The previous commit had an uninitialized variable that caused the incoming argument region to have undefined size. This has been fixed. Reviewers: sdardis Subscribers: dsanders, llvm-commits, sdardis Differential Revision: https://reviews.llvm.org/D22680 llvm-svn: 277136	2016-07-29 12:27:28 +00:00
Nirav Dave	2b8176fede	Cleanup TransferDbgValues [DAG] Check debug values for invalidation before transferring and mark old debug values invalid when transferring to another SDValue. This fixes PR28613. Reviewers: jyknight, hans, dblaikie, echristo Subscribers: yaron.keren, ismail, llvm-commits Differential Revision: https://reviews.llvm.org/D22858 llvm-svn: 277135	2016-07-29 11:49:32 +00:00
Simon Pilgrim	511d5dc0be	[X86][SSE] Optimize the truncation of vector comparison results with PACKSS We currently default to using either generic shuffles or MASK+PACKUS/PACKSS to truncate all integer vectors. For vector comparisons, we know that the result will be either all or zero bits in every element, which can be efficiently truncated by directly using PACKSS to repeatedly halve the size of each element. Due to the limited input values (-1 or 0) we don't need to account for vector element size, so for simplicity we just use the PACKSS(vXi16,vXi16) implementation in all cases. Additionally for AVX2 PACKSS of 256bit data we must perform a PERMQ shuffle to reorder the data into the correct order. I did investigate performing a single shuffle after all the PACKSS calls but the need to cross 128bit lanes makes this difficult to achieve efficiently. We avoid performing this on AVX512 as it should have better alternative truncation instructions. Differential Revision: https://reviews.llvm.org/D22814 llvm-svn: 277132	2016-07-29 10:23:10 +00:00
Simon Pilgrim	8fa33ce6ce	Fixed MSVC out of range shift warning llvm-svn: 277130	2016-07-29 10:03:39 +00:00
Sjoerd Meijer	fd7049c574	Fix for commit rL277126 that broke a build. llvm-svn: 277129	2016-07-29 09:57:37 +00:00
Prakhar Bahuguna	01b47c1da5	[Thumb] Emit Thumb move in both Thumb modes for struct_byval predicates Summary: The MOV/MOVT instructions being chosen for struct_byval predicates was conditional only on Thumb2, resulting in an ARM MOV/MOVT instruction being incorrectly emitted in Thumb1 mode. This is especially apparent with v8-m.base targets. This patch ensures that Thumb instructions are emitted in both Thumb modes. Reviewers: rengolin, t.p.northover Subscribers: llvm-commits, aemerson, rengolin Differential Revision: https://reviews.llvm.org/D22865 llvm-svn: 277128	2016-07-29 09:16:46 +00:00
Jacques Pienaar	08d08634c4	[lanai] Update for Target API (TargetRegistry::RegisterMCAsmBackend) change llvm-svn: 277127	2016-07-29 08:50:23 +00:00
Sjoerd Meijer	f6deb69730	TargetInstrInfo: add virtual function getInstSizeInBytes This adds a target hook getInstSizeInBytes to TargetInstrInfo that a lot of subclasses already implement. Differential Revision: https://reviews.llvm.org/D22885 llvm-svn: 277126	2016-07-29 08:16:16 +00:00
Craig Topper	72bf22eca3	[AVX512] Mark EVEX VMOVSSrm and VMOVSDrm as canFoldAsLoad and isReMaterializable. llvm-svn: 277120	2016-07-29 06:06:04 +00:00
Craig Topper	e3f3eaac43	[AVX512] Copy the patterns that recognize scalar arimetic operations inserting into the lower element of a packed vector from AVX/SSE so that we can use EVEX encoded instructions. llvm-svn: 277119	2016-07-29 06:06:00 +00:00
Craig Topper	3b064848ca	[AVX512] Add AVX512 run lines to some tests for scalar fma/add/sub/mul/div and regenerate. Follow up commits will bring AVX512 code up to the same quality as AVX/SSE. llvm-svn: 277118	2016-07-29 06:05:58 +00:00
David Majnemer	b674c77582	[EarlyCSE] Correctly handle simplified, but live, instructions Some instructions may have their uses replaced with a symbolic constant. However, the instruction may still have side effects which percludes it from being removed from the function. EarlyCSE treated such an instruction as if it were removed, resulting in PR28763. llvm-svn: 277114	2016-07-29 05:39:21 +00:00
David Majnemer	341f5528df	[ConstantFolding] Fold bitcasts of vectors w/ undef elements An undef vector element can be treated as if it had any value. Folding such a vector element to 0 in a bitcast can open up further folding opportunities. llvm-svn: 277104	2016-07-29 04:06:09 +00:00
David Majnemer	48ead01c49	[ConstantFolding] Remove an unused ConstantFoldInstOperands overload No functional change is intended. llvm-svn: 277101	2016-07-29 03:27:33 +00:00
David Majnemer	54cf54dfdf	[ConstantFolding] Use ConstantExpr::getWithOperands ConstantExpr::getWithOperands does much of the hard work that ConstantFoldInstOperandsImpl tries to do but more completely. This lets us fold ExtractValue/InsertValue expressions. llvm-svn: 277100	2016-07-29 03:27:31 +00:00
David Majnemer	93c48d55ce	[ConstnatFolding] Teach the folder how to fold ConstantVector A ConstantVector can have ConstantExpr operands and vice versa. However, the folder had no ability to fold ConstantVectors which, in some cases, was an optimization barrier. Instead, rephrase the folder in terms of Constants instead of ConstantExprs and teach callers how to deal with failure. llvm-svn: 277099	2016-07-29 03:27:26 +00:00
Craig Topper	c6646be4b6	[AVX512] Remove the intrinsic forms of VMOVSS/VMOVSD. We don't need two different forms of 'rr' and 'rm'. This matches SSE/AVX. I'm not convinced the patterns for the rm_Int was correct anyway. It had a tied source that should't exist for the unmasked version. The load form of MOVSS always zeros the most significant bits. I've left the patterns off the masked load instructions as I'm not sure what the correct pattern should be and we don't have any tests currently. Nor do we implement masked scalar load intrinsics in clang currently. llvm-svn: 277098	2016-07-29 02:49:08 +00:00
George Burgess IV	1421f18abe	[CFLAA] Check for pointer types in more places. This patch fixes an assertion that fires when we try to add non-pointer Values to the CFLGraph. Centralizing the check for whether something is/isn't a pointer type isn't completely trivial (and, in some cases, would end up being entirely redundant), but it may be beneficial to do so if this trips us up more in the future. Patch by Jia Chen. Differential Revision: https://reviews.llvm.org/D22947 llvm-svn: 277096	2016-07-29 01:23:45 +00:00
Eugene Zelenko	aef9df0c15	Add LLVM_ENABLE_LLD option to use LLD as C/C++ linker. Differential revision: https://reviews.llvm.org/D22896 llvm-svn: 277093	2016-07-29 00:46:13 +00:00
Teresa Johnson	e25d88d3d4	Capture stderr when checking for gold version On MacOS the ld version is emitted to stderr, resulting in lots of messages in the ninja check output. llvm-svn: 277092	2016-07-29 00:39:56 +00:00
Piotr Padlewski	3b158bab46	Fixed comment llvm-svn: 277091	2016-07-29 00:30:07 +00:00
Piotr Padlewski	bbeb94e057	Added ThinLTO inlining statistics Summary: copypasta doc of ImportedFunctionsInliningStatistics class \brief Calculate and dump ThinLTO specific inliner stats. The main statistics are: (1) Number of inlined imported functions, (2) Number of imported functions inlined into importing module (indirect), (3) Number of non imported functions inlined into importing module (indirect). The difference between first and the second is that first stat counts all performed inlines on imported functions, but the second one only the functions that have been eventually inlined to a function in the importing module (by a chain of inlines). Because llvm uses bottom-up inliner, it is possible to e.g. import function `A`, `B` and then inline `B` to `A`, and after this `A` might be too big to be inlined into some other function that calls it. It calculates this statistic by building graph, where the nodes are functions, and edges are performed inlines and then by marking the edges starting from not imported function. If `Verbose` is set to true, then it also dumps statistics per each inlined function, sorted by the greatest inlines count like - number of performed inlines - number of performed inlines to importing module Reviewers: eraman, tejohnson, mehdi_amini Subscribers: mehdi_amini, llvm-commits Differential Revision: https://reviews.llvm.org/D22491 llvm-svn: 277089	2016-07-29 00:27:16 +00:00
Justin Lebar	b1ec783712	Revert "Don't invoke getName() from Function::isIntrinsic().", rL276942. This broke some out-of-tree AMDGPU tests that relied on the old behavior wherein isIntrinsic() would return true for any function that starts with "llvm.". And in general that change will not play nicely with out-of-tree backends. llvm-svn: 277087	2016-07-28 23:58:15 +00:00
Evgeniy Stepanov	48430d1527	[sanitizer] Simplify and future-proof maybeMarkSanitizerLibraryCallNoBuiltin(). Sanitizers set nobuiltin attribute on certain library functions to avoid a situation where such function is neither instrumented nor intercepted. At the moment the list of interesting functions is hardcoded. This change replaces it with logic based on TargetLibraryInfo::hasOptimizedCodegen and the presense of readnone function attribute (sanitizers are generally interested in memory behavior of library functions). This is expected to be a no-op change: the new logic matches exactly the same set of functions. r276771 (currently reverted) added mempcpy() to the list, breaking MSan tests. With this change, r276771 can be safely re-landed. llvm-svn: 277086	2016-07-28 23:45:15 +00:00

1 2 3 4 5 ...

135738 Commits