llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 04:02:41 +01:00

Author	SHA1	Message	Date
Wolfgang Pieb	83867e3150	[DWARF] Fix formatting bug with r321295. This fixes a MIPS buildbot failure. llvm-svn: 321330	2017-12-22 01:12:24 +00:00
Craig Topper	a892001119	[X86] Use SIGN_EXTEND rather than ZERO_EXTEND for lowering extract_vector_elt from vXi1 with a non-const index. We have a better range of instructions we can use if we can fill with the value i1 value rather than zeroing. llvm-svn: 321315	2017-12-21 22:08:23 +00:00
Alina Sbirlea	e57faad0f4	[ModRefInfo] Add must alias info to ModRefInfo. Summary: Add an additional bit to ModRefInfo, ModRefInfo::Must, to be cleared for known must aliases. Shift existing Mod/Ref/ModRef values to include an additional most significant bit. Update wrappers that modify ModRefInfo values to reflect the change. Notes: * ModRefInfo::Must is almost entirely cleared in the AAResults methods, the remaining changes are trying to preserve it. * Only some small changes to make custom AA passes set ModRefInfo::Must (BasicAA). * GlobalsModRef already declares a bit, who's meaning overlaps with the most significant bit in ModRefInfo (MayReadAnyGlobal). No changes to shift the value of MayReadAnyGlobal (see AlignedMap). FunctionInfo.getModRef() ajusts most significant bit so correctness is preserved, but the Must info is lost. * There are cases where the ModRefInfo::Must is not set, e.g. 2 calls that only read will return ModRefInfo::NoModRef, though they may read from exactly the same location. Reviewers: dberlin, hfinkel, george.burgess.iv Subscribers: llvm-commits, sanjoy Differential Revision: https://reviews.llvm.org/D38862 llvm-svn: 321309	2017-12-21 21:41:53 +00:00
Craig Topper	c7955a43df	[X86] When lowering truncates to vXi1, don't sign extend i16/i8 types to 512-bit if we have VLX. This should only affect what we do for v8i16. Previously we went to v8i64, but if we have VLX we only need v8i32. This prevents an unnecessary zmm usage. llvm-svn: 321303	2017-12-21 20:45:13 +00:00
Wolfgang Pieb	1fcda452d6	[DWARF v5] Rework of string offsets table reader Reorganizes the DWARF consumer to derive the string offsets table contribution's format from the contribution header instead of (incorrectly) from the unit's format. Reviewers: JDevliegehere, aprantl Differential Revision: https://reviews.llvm.org/D41146 llvm-svn: 321295	2017-12-21 19:38:13 +00:00
Craig Topper	d4b7af1b65	[X86] Promote v8i1 shuffles to v8i32 instead of v8i64 if we have VLX. We should have equally good shuffle options for v8i32 with VLX. This was spotted during my attempts to remove 512-bit vectors from SKX. We still use 512-bits for v16i1, v32i1, and v64i1. I'm less sure we can handle those well with narrower vectors. i32 and i64 element sizes get the best shuffle support. llvm-svn: 321291	2017-12-21 18:44:06 +00:00
Simon Pilgrim	e823f6a711	[X86][SSE] Split large PAVGB/PAVGW vectors to legal widths Patch to allow detectAVGPattern handle vectors larger than the legal size (128 SSE2, 256 AVX2, 512 AVX512BW), splitting the vectors accordingly. Differential Revision: https://reviews.llvm.org/D41440 llvm-svn: 321288	2017-12-21 18:12:31 +00:00
Francis Visoiu Mistrih	8ce86acbc0	[YAML] Refactor escaping unittests llvm-svn: 321284	2017-12-21 17:14:13 +00:00
Francis Visoiu Mistrih	3aecf492c6	[YAML] Fix UTF-8 handling Previous YAML quoting patches broke UTF-8 printing in YAML: see https://reviews.llvm.org/D41290#961801. Differential Revision: https://reviews.llvm.org/D41490 llvm-svn: 321283	2017-12-21 17:14:09 +00:00
Krzysztof Parzyszek	743a82a560	[TableGen] Print more helpful information in case of type contradiction Dump the failing TreePattern. llvm-svn: 321282	2017-12-21 17:12:43 +00:00
Simon Pilgrim	8b909e05ff	[DAGCombiner] Remove (xor (xor x, c1), c2) -> (xor x, (xor c1, c2)) fold. NFCI. More general cases are already handled by constant canonicalization and then the ReassociateOps call at line 5327 llvm-svn: 321280	2017-12-21 16:54:03 +00:00
Simon Pilgrim	f5ea116056	[DAGCombiner] Generalize (or (and X, c1), c2) -> (and (or X, c2), c1\|c2) combine to work on non-splat vectors The knownbits_mask_or_shuffle_uitofp change is interesting - shuffle combines manage to kick in, removing the AND constant mask load. For targets with fast-variable-shuffle this should reduce further to VPOR+VPSHUFB+VCVTDQ2PS. llvm-svn: 321279	2017-12-21 16:34:46 +00:00
Simon Pilgrim	e439ef2067	[X86] Add (or (and X, c1), c2) -> (and (or X, c2), c1\|c2) non-splat vector test llvm-svn: 321278	2017-12-21 16:08:41 +00:00
Tony Jiang	d407a2a892	[PowerPC] Fix parest build failure in SPEC2017. The build failure was caused by an assertion in pre-legalization DAGCombine: Combining: t6: ppcf128 = uint_to_fp t5 ... into: t20: f32 = PPCISD::FCFIDUS t19 which is clearly wrong since ppcf128 are definitely different type with f32 and we cannot change the node value type when do DAGCombine. The fix is don't handle ppc_fp128 or i1 conversions in PPCTargetLowering::combineFPToIntToFP and leave it to downstream to legalize it and expand it to small legal types. Differential Revision: https://reviews.llvm.org/D41411 llvm-svn: 321276	2017-12-21 15:42:50 +00:00
Simon Pilgrim	4299abea6b	[DAGCombiner] Generalize (and (or x, C), D) -> D iff (C & D) == D combine to work on non-splat vectors llvm-svn: 321275	2017-12-21 15:17:29 +00:00
Simon Dardis	bd317d0d04	[mips] Fix the invalid EVA test During the review of D40362 I spotted that this test wasn't actually testing the eva instructions due to '-mattr==eva', rather than '-mattr=+eva', which resulted in test having no effect. llvm-svn: 321273	2017-12-21 15:14:07 +00:00
Simon Pilgrim	dc611378a2	[X86] Add (and (or x, C), D) -> D iff (C & D) == D non-splat vector test llvm-svn: 321268	2017-12-21 14:33:40 +00:00
Simon Pilgrim	2fb173e1e1	[X86] Add v48i8 AVG test case, based on discussion on D41440 llvm-svn: 321261	2017-12-21 13:18:19 +00:00
Sam Parker	27411c6167	[DAGCombine] Improve ReduceLoadWidth for SRL If the SRL node is only used by an AND, we may be able to set the ExtVT to the width of the mask, making the AND redundant. To support this, another check has been added in isLegalNarrowLoad which queries whether the load is valid. Differential Revision: https://reviews.llvm.org/D41350 llvm-svn: 321259	2017-12-21 12:55:04 +00:00
Pavel Labath	428cae6649	[Support] Remove MemoryBuffer::getNewUninitMemBuffer There is nothing useful that can be done with a read-only uninitialized buffer without const_casting its contents to initialize it. A better solution is to obtain a writable buffer (WritableMemoryBuffer::getNewUninitMemBuffer), and then convert it to a read-only buffer after initialization. All callers of this function have already been updated to do this, so this function is now unused. llvm-svn: 321257	2017-12-21 11:27:21 +00:00
Sam Parker	2018a87d0b	[ARM] Armv8-R DFB instruction Implement MC support for the Armv8-R 'Data Full Barrier' instruction. Differential Revision: https://reviews.llvm.org/D41430 llvm-svn: 321256	2017-12-21 11:17:49 +00:00
Simon Atanasyan	84005977e5	[llvm-readobj] Fix ambiguous call to the `printNumber` llvm-svn: 321254	2017-12-21 10:46:20 +00:00
Simon Atanasyan	17ed6d3801	[llvm-readobj] Support 'GNU' style for MIPS GOT/PLT dumping This change adds `printMipsGOT` and `printMipsPLT` methods to the `DumpStyle` class and overrides them in the `GNUStyle` and `LLVMStyle` descendants. To pass information about GOT/PLT layout into these methods, the `MipsGOTParser` class has been extended to hold all necessary data. llvm-svn: 321253	2017-12-21 10:26:02 +00:00
Craig Topper	e1bff090c9	[X86] Use PSHUFB for v32i16 shuffles before falling back to VPERMW/VPERMI2W. PSHUFB has the ability to implicitly 0 elements which VPERMI2W can't do. So give a chance to use it first. llvm-svn: 321251	2017-12-21 08:22:51 +00:00
Craig Topper	b98d778751	[X86] Use VPERMI2B for v16i8 shuffles if we have VBMI+VLX and would have otherwise used two PSHUFBs ORed together. llvm-svn: 321249	2017-12-21 07:31:30 +00:00
Craig Topper	15ddabcd33	[X86] Use VPERMB/VPERMI2B for v32i8 shuffle lowering if VBMI and VLX are supported. llvm-svn: 321248	2017-12-21 05:58:31 +00:00
Craig Topper	9523be48b2	[X86] Add avx512vbmi command lines to vector-shuffle-256-v32.ll llvm-svn: 321247	2017-12-21 03:58:31 +00:00
Sam Clegg	70c40e44a2	[WebAssembly] Remove unneeded sub-directory This is the only wasm def (and likely likely will be for the foreseeable) file so no need for a sub-directory Differential Revision: https://reviews.llvm.org/D41476 llvm-svn: 321246	2017-12-21 03:16:34 +00:00
Sanjoy Das	259bcf37bc	Revert "Expose a TargetMachine::getTargetTransformInfo function" This reverts commit r321234. It breaks the -DBUILD_SHARED_LIBS=ON build. llvm-svn: 321243	2017-12-21 02:34:39 +00:00
Sam Clegg	dc00fca5b5	[WebAssembly] Fix local references to weak aliases When weak aliases are used with in same translation unit we need to be able to directly reference to alias and not just the thing it is aliases. We do this by defining both a wasm import and a wasm export in this case that result in a single Symbol. This change is a partial revert of rL314245. A corresponding lld change address the previous issues we had with this. See: https://github.com/WebAssembly/tool-conventions/issues/34 Differential Revision: https://reviews.llvm.org/D41472 llvm-svn: 321242	2017-12-21 02:30:38 +00:00
Michael Zolotukhin	728dc93610	[SimplifyCFG] Avoid quadratic on a predecessors number behavior in instruction sinking. If a block has N predecessors, then the current algorithm will try to sink common code to this block N times (whenever we visit a predecessor). Every attempt to sink the common code includes going through all predecessors, so the complexity of the algorithm becomes O(N^2). With this patch we try to sink common code only when we visit the block itself. With this, the complexity goes down to O(N). As a side effect, the moment the code is sunk is slightly different than before (the order of simplifications has been changed), that's why I had to adjust two tests (note that neither of the tests is supposed to test SimplifyCFG): * test/CodeGen/AArch64/arm64-jumptable.ll - changes in this test mimic the changes that previous implementation of SimplifyCFG would do. * test/CodeGen/ARM/avoid-cpsr-rmw.ll - in this test I disabled common code sinking by a command line flag. llvm-svn: 321236	2017-12-21 01:22:13 +00:00
Sanjoy Das	90359bc4d6	Expose a TargetMachine::getTargetTransformInfo function Summary: This makes the TargetMachine interface a bit simpler. We still need the std::function in TargetIRAnalysis to avoid having to add a dependency from Analysis to Target. See discussion: http://lists.llvm.org/pipermail/llvm-dev/2017-December/119749.html I avoided adding all of the backend owners to this review since the change is simple, but let me know if you feel differently about this. Reviewers: echristo, MatzeB, hfinkel Reviewed By: hfinkel Subscribers: jholewinski, jfb, arsenm, dschuff, mcrosier, sdardis, nemanjai, nhaehnle, javed.absar, sbc100, jgravelle-google, aheejin, kbarton, llvm-commits Differential Revision: https://reviews.llvm.org/D41464 llvm-svn: 321234	2017-12-21 01:06:58 +00:00
Reid Kleckner	d234f6d0a8	Attempt to pacify 4.8.5 with makeArrayRef llvm-svn: 321233	2017-12-21 00:28:34 +00:00
Simon Dardis	b648972d95	[orc][cmake] Check if 8 byte atomics require libatomic for unittest rL319838 introduced SymbolStringPool which uses 8 byte atomics for reference counters. On systems which do not support such atomics natively such as MIPS32, explicitly add libatomic as one of the libraries for SymbolStringPool's unittest. Reviewers: lhames, beanz Differential Revision: https://reviews.llvm.org/D41010 llvm-svn: 321225	2017-12-20 22:26:41 +00:00
Joel Galenson	eebcb74d92	[ARM] Optimize {s,u}{add,sub}.with.overflow. The AArch64 backend contains code to optimize {s,u}{add,sub}.with.overflow during SelectionDAG. This commit ports that code to the ARM backend. Differential revision: https://reviews.llvm.org/D35635 llvm-svn: 321224	2017-12-20 22:25:39 +00:00
Krzysztof Parzyszek	abba1d8bb8	[Hexagon] Use ArrayRef member functions instead of custom ones llvm-svn: 321221	2017-12-20 20:54:13 +00:00
Krzysztof Parzyszek	b90ded8ab6	[Hexagon] Allow construction of HVX vector predicates Handle BUILD_VECTOR of boolean values. llvm-svn: 321220	2017-12-20 20:49:43 +00:00
Krzysztof Parzyszek	cda8213c1e	[Hexagon] Legalize vector elements to i32 in buildVector32/64 llvm-svn: 321218	2017-12-20 20:33:49 +00:00
Aaron Ballman	0d26e8d7ab	Do not generate an empty switch statement as it causes MSVC to issue diagnostics about switch statements without case or default labels. llvm-svn: 321217	2017-12-20 20:09:30 +00:00
Yonghong Song	615374a840	bpf: add support for objdump -print-imm-hex Add support for 'objdump -print-imm-hex' for imm64, operand imm and branch target. If user programs encode immediate values as hex numbers, such an option will make it easy to correlate asm insns with source code. This option also makes it easy to correlate imm values with insn encoding. There is one changed behavior in this patch. In old way, we print the 64bit imm as u64: O << (uint64_t)Op.getImm(); and the new way is: O << formatImm(Op.getImm()); The formatImm is defined in llvm/MC/MCInstPrinter.h as format_object<int64_t> formatImm(int64_t Value) So the new way to print 64bit imm is i64 type. If a 64bit value has the highest bit set, the old way will print the value as a positive value and the new way will print as a negative value. The new way is consistent with x86_64. For the code (see the test program): ... if (a == 0xABCDABCDabcdabcdULL) ... x86_64 objdump, with and without -print-imm-hex, looks like: 48 b8 cd ab cd ab cd ab cd ab movabsq $-6067004223159161907, %rax 48 b8 cd ab cd ab cd ab cd ab movabsq $-0x5432543254325433, %rax Signed-off-by: Yonghong Song <yhs@fb.com> llvm-svn: 321215	2017-12-20 19:39:58 +00:00
David Blaikie	a5f0240633	PR35705: Fix Chapter 9 example code for API changes to DIBuilder llvm-svn: 321214	2017-12-20 19:36:54 +00:00
Craig Topper	211591bb42	[X86] Refactor DomainReassignment pass to make the Closure class not stores references to the main data structures of the pass itself Multiple Closure objects can be created and stored for a single function. It's not a good idea to devote so many fields of it to storing pointers and references to global data structures of the pass. The closure class should only store the things needed to represent the closure itself. This patch refactors many of the methods of Closure to belong to the pass object and to pass around a reference to the current Closure. The Closure class gains a few simple methods to add instructions and edges, and to return iterators to edges and instructions Differential Revision: https://reviews.llvm.org/D41327 llvm-svn: 321213	2017-12-20 19:36:43 +00:00
Matt Arsenault	422c27e8aa	TableGen: Allow setting SDNodeProperties on intrinsics Allows preserving MachineMemOperands on intrinsics through selection. For reasons I don't understand, this is a static property of the pattern and the selector deliberately goes out of its way to drop if not present. Intrinsics already inherit from SDPatternOperator allowing them to be used directly in instruction patterns. SDPatternOperator has a list of SDNodeProperty, but you currently can't set them on the intrinsic. Without SDNPMemOperand, when the node is selected any memory operands are always dropped. Allowing setting this on the intrinsics avoids needing to introduce another equivalent target node just to have SDNPMemOperand set. llvm-svn: 321212	2017-12-20 19:36:28 +00:00
Matthew Simpson	8e9e490391	[ICP] Expose unconditional call promotion interface This patch modifies the indirect call promotion utilities by exposing and using an unconditional call promotion interface. The unconditional promotion interface (i.e., call promotion without creating an if-then-else) can be used if it's known that an indirect call has only one possible callee. The existing conditional promotion interface uses this unconditional interface to promote an indirect call after it has been versioned and placed within the "then" block. A consequence of unconditional promotion is that the fix-up operations for phi nodes in the normal destination of invoke instructions are changed. This is necessary because the existing implementation assumed that an invoke had been versioned, creating a "merge" block where a return value bitcast could be placed. In the new implementation, the edge between a promoted invoke's parent block and its normal destination is split if needed to add a bitcast for the return value. If the invoke is also versioned, the phi node merging the return value of the promoted and original invoke instructions is placed in the "merge" block. Differential Revision: https://reviews.llvm.org/D40751 llvm-svn: 321210	2017-12-20 19:26:37 +00:00
Craig Topper	d3d31e8b77	[X86] Remove zext from vXi32 to vXi64 on indices of gather/scatter instructions if we can prove the pre-extended value is positive. Gather/scatter can implicitly sign extend from i32->i64 on indices. So if we know the sign bit of the input to a zext is 0 we can use the implicit extension. llvm-svn: 321209	2017-12-20 19:25:33 +00:00
Matt Arsenault	6e134847bf	DAG: Tolerate non-MemSDNodes for OPC_RecordMemRef When intrinsics are allowed to have mem operands, there are two ways this can happen. First is an intrinsic that is marked has having a mem operand, but is not handled by getTgtMemIntrinsic. The second way can occur even for intrinsics which do not have a mem operand. It seems the selector table does some kind of sorting based on the opcode, and the mem ref recording can happen in the same scope for intrinsics that both do and do not have mem refs. I haven't been able to figure out exactly why this happens (although it happens even with the matcher optimizations disabled). I'm not sure if it's worth trying to avoid hitting this for these nodes since I think it's still reasonable to handle this in case getTgtMemIntrinic is not implemented. llvm-svn: 321208	2017-12-20 19:11:59 +00:00
Warren Ristow	8ec115e5d5	Improve the test for r320216. NFC. Patch by Matthew Voss! llvm-svn: 321207	2017-12-20 19:11:31 +00:00
Adam Nemet	eeb462931d	[opt-viewer] Also demangle indirect-call promotion targets llvm-svn: 321206	2017-12-20 19:08:12 +00:00
Stefan Pintilie	bc52dd01f2	[PowerPC] Added an assert to make sure that the MBBI iterator is valid. The function createTailCallBranchInstr assumes that the iterator MBBI is valid. However, only one use of MBBI is guarded in the function. Fix this by adding an assert. Differential Revision: https://reviews.llvm.org/D41358 llvm-svn: 321205	2017-12-20 19:07:44 +00:00
Nirav Dave	bb6eab85ac	[DAG] Fix condition on overlapping store check. Prevent overlapping store elision when overlapping store is pre-inc/dec as analysis is wrong in these cases. llvm-svn: 321204	2017-12-20 19:06:47 +00:00

1 2 3 4 5 ...

158296 Commits