llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-20 03:23:01 +02:00

Author	SHA1	Message	Date
Simon Pilgrim	ca9b9365c9	[SelectionDAG] computeKnownBits - use ashrInPlace on known bits of ISD::SRA input. NFCI. llvm-svn: 317087	2017-11-01 13:16:48 +00:00
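The SRA case in ca9b9365c9 can be pictured with a toy model. The sketch below is not the LLVM code: `KnownBits8`, `ashr8`, and `knownBitsForSRA` are hypothetical names, and an 8-bit integer stands in for `APInt`. It only illustrates why arithmetic-shifting both the known-zero and known-one masks gives the known bits of an arithmetic shift right: whichever mask has the sign bit set gets that bit replicated into the vacated positions.

```cpp
#include <cassert>
#include <cstdint>

// Toy 8-bit stand-in for a known-bits record (hypothetical type):
// a set bit in Zero means "known 0", a set bit in One means "known 1".
struct KnownBits8 {
  uint8_t Zero;
  uint8_t One;
};

// Arithmetic shift right on an 8-bit pattern: vacated high bits copy bit 7.
inline uint8_t ashr8(uint8_t V, unsigned Amt) {
  assert(Amt < 8 && "shift amount out of range");
  uint8_t Shifted = static_cast<uint8_t>(V >> Amt);
  if (V & 0x80)
    Shifted |= static_cast<uint8_t>(0xFFu << (8 - Amt));
  return Shifted;
}

// Known bits of (sra X, Amt), given the known bits of X.
inline KnownBits8 knownBitsForSRA(KnownBits8 Input, unsigned Amt) {
  return {ashr8(Input.Zero, Amt), ashr8(Input.One, Amt)};
}
```

For an input of the form 0xxxxx10 shifted right by 2, the result is 000xxxxx: the top three bits are known zero and nothing is known one.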
Craig Topper	0253374c05	[DAGCombiner] Fix typos in comments. NFC llvm-svn: 317072	2017-11-01 03:30:52 +00:00
Simon Pilgrim	937b242735	Fix unused variable warnings. NFCI. llvm-svn: 316964	2017-10-30 22:38:07 +00:00
Simon Pilgrim	21b08adda6	[SelectionDAG] Tidyup computeKnownBits extension/truncation cases. NFCI. We don't need to extend/truncate the Known structure before calling computeKnownBits - it will reset at the start of the function. llvm-svn: 316962	2017-10-30 22:23:57 +00:00
Daniel Neilson	0ad57a67a0	Create instruction classes for identifying any atomicity of memory intrinsic. (NFC) Summary: For reference, see: http://lists.llvm.org/pipermail/llvm-dev/2017-August/116589.html This patch fleshes out the instruction class hierarchy with respect to atomic and non-atomic memory intrinsics. With this change, the relevant part of the class hierarchy becomes: IntrinsicInst -> MemIntrinsicBase (methods-only class) -> MemIntrinsic (non-atomic intrinsics) -> MemSetInst -> MemTransferInst -> MemCpyInst -> MemMoveInst -> AtomicMemIntrinsic (atomic intrinsics) -> AtomicMemSetInst -> AtomicMemTransferInst -> AtomicMemCpyInst -> AtomicMemMoveInst -> AnyMemIntrinsic (both atomicities) -> AnyMemSetInst -> AnyMemTransferInst -> AnyMemCpyInst -> AnyMemMoveInst This involves some class renaming: ElementUnorderedAtomicMemCpyInst -> AtomicMemCpyInst ElementUnorderedAtomicMemMoveInst -> AtomicMemMoveInst ElementUnorderedAtomicMemSetInst -> AtomicMemSetInst A script for doing this renaming in downstream trees is included below. An example of where the Any* classes should be used in LLVM is when reasoning about the effects of an instruction (ex: aliasing). --- Script for renaming AtomicMem* classes: PREFIXES="[<,([:space:]]" CLASSES="MemIntrinsic|MemTransferInst|MemSetInst|MemMoveInst|MemCpyInst" SUFFIXES="[;)>,[:space:]]" REGEX="(${PREFIXES})ElementUnorderedAtomic(${CLASSES})(${SUFFIXES})" REGEX2="visitElementUnorderedAtomic(${CLASSES})" FILES=$( grep -E "(${REGEX}|${REGEX2})" -r . | tr ':' ' ' | awk '{print $1}' | sort | uniq ) SED_SCRIPT="s~${REGEX}~\1Atomic\2\3~g" SED_SCRIPT2="s~${REGEX2}~visitAtomic\1~g" for f in $FILES; do echo "Processing: $f" sed -i ".bak" -E "${SED_SCRIPT};${SED_SCRIPT2};${EA_SED_SCRIPT};${EA_SED_SCRIPT2}" $f done Reviewers: sanjoy, deadalnix, apilipenko, anna, skatkov, mkazantsev Reviewed By: sanjoy Subscribers: hfinkel, jholewinski, arsenm, sdardis, nhaehnle, JDevlieghere, javed.absar, llvm-commits Differential Revision: https://reviews.llvm.org/D38419 llvm-svn: 316950	2017-10-30 19:51:48 +00:00
Simon Pilgrim	c48c10f794	[SelectionDAG] Add VSELECT demanded elts support to computeKnownBits llvm-svn: 316947	2017-10-30 19:31:08 +00:00
Simon Pilgrim	e585a22b5f	[SelectionDAG] Add VSELECT support to computeKnownBits llvm-svn: 316944	2017-10-30 19:08:21 +00:00
Simon Pilgrim	21de22c31b	[SelectionDAG] Add SELECT demanded elts support to ComputeNumSignBits llvm-svn: 316933	2017-10-30 17:53:51 +00:00
Simon Pilgrim	9020d12789	[SelectionDAG] Add SEXT/AND/XOR/OR demanded elts support to ComputeNumSignBits llvm-svn: 316875	2017-10-29 22:03:37 +00:00
Simon Pilgrim	630e28612c	[SelectionDAG] Add SRA/SHL demanded elts support to ComputeNumSignBits Introduce a isConstOrDemandedConstSplat helper function that can recognise a constant splat build vector for at least the demanded elts we care about. llvm-svn: 316866	2017-10-29 18:19:37 +00:00
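The idea behind the helper named in 630e28612c can be sketched outside of LLVM. The code below is an illustration only, assuming C++17: `Element` and `demandedConstSplat` are hypothetical names, `std::nullopt` stands in for a non-constant or undef element, and a `std::vector<bool>` stands in for the demanded-elements mask. It returns the splat value when every demanded element is the same constant, ignoring whatever sits in the lanes nobody asked about.

```cpp
#include <cstddef>
#include <cstdint>
#include <optional>
#include <vector>

// nullopt models an element that is not a known constant (hypothetical model).
using Element = std::optional<int64_t>;

// Returns the common constant of all demanded elements, or nullopt if a
// demanded element is non-constant, the demanded constants differ, or no
// element is demanded. Elts and Demanded must have the same length.
std::optional<int64_t>
demandedConstSplat(const std::vector<Element> &Elts,
                   const std::vector<bool> &Demanded) {
  std::optional<int64_t> Splat;
  for (std::size_t I = 0; I < Elts.size(); ++I) {
    if (!Demanded[I])
      continue; // Lanes that are not demanded may hold anything.
    if (!Elts[I])
      return std::nullopt;
    if (Splat && *Splat != *Elts[I])
      return std::nullopt;
    Splat = Elts[I];
  }
  return Splat;
}
```

With elements {3, non-constant, 3, 7} and only lanes 0 and 2 demanded, the sketch reports a splat of 3; demanding all four lanes reports no splat.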
Simon Pilgrim	612223546a	[SelectionDAG] Add support for INSERT_SUBVECTOR to computeKnownBits llvm-svn: 316847	2017-10-28 22:10:40 +00:00
Simon Pilgrim	a120e99e55	[SelectionDAG] Support 'bit preserving' floating points bitcasts on computeKnownBits/ComputeNumSignBits For cases where we know the floating point representations match the bitcasted integer equivalent, allow bitcasting to these types. This is especially useful for the X86 floating point compare results which return all/zero bits but as a floating point type. Differential Revision: https://reviews.llvm.org/D39289 llvm-svn: 316831	2017-10-28 14:27:53 +00:00
Guozhi Wei	bb55a29e2c	[DAGCombine] Don't combine sext with extload if sextload is not supported and extload has multiple users In function DAGCombiner::visitSIGN_EXTEND_INREG, sext can be combined with extload even if sextload is not supported by the target. If sext is the only user of extload, there is no big difference: no harm, no benefit. If extload has more than one user, the combined sextload may block extload from combining with other zexts, causing extra zext instructions to be generated, as demonstrated by the attached test case. This patch adds the constraint that when sextload is not supported by the target, sext can only be combined with extload if it is the only user of extload. Differential Revision: https://reviews.llvm.org/D39108 llvm-svn: 316802	2017-10-27 21:54:24 +00:00
Matt Arsenault	f9af63b514	DAG: Fold fma (fneg x), K, y -> fma x, -K, y llvm-svn: 316753	2017-10-27 09:06:07 +00:00
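The fold in f9af63b514 rests on a simple identity: negating either factor of the product gives the same exact product, so the single rounding done by a fused multiply-add yields the same result either way. A small numeric sketch using the standard `std::fma` follows; the function names are hypothetical and this is not the DAG combine itself.

```cpp
#include <cmath>

// Original form: fma (fneg x), K, y.
inline double fmaNegOperand(double X, double K, double Y) {
  return std::fma(-X, K, Y);
}

// Folded form: fma x, -K, y. The negation has moved onto the constant,
// where it can be applied once at compile time.
inline double fmaNegConstant(double X, double K, double Y) {
  return std::fma(X, -K, Y);
}
```

Both forms compute -(x * K) + y with one rounding, so for x = 1.5, K = 3.0, y = 0.25 each returns -4.25.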
Sean Fertile	76045150cf	Add subclass data to the FoldingSetNode for MemIntrinsicSDNodes. Not having the subclass data on a MemIntrinsicSDNode meant it was possible to try to fold two nodes with the same operands but differing MMO flags. This would trip an assertion when trying to refine the alignment between the two MachineMemOperands. Differential Revision: https://reviews.llvm.org/D38898 llvm-svn: 316737	2017-10-27 04:02:51 +00:00
Matt Arsenault	d2d2e6f167	DAG: Fix creating select with wrong condition type This code added in r297930 assumed that it could create a select with a condition type that is just an integer bitcast of the selected type. For AMDGPU any vselect is going to be scalarized (although the vector types are legal), and all select conditions must be i1 (the same as getSetCCResultType). This logic doesn't really make sense to me, but there's never really been a consistent policy in what the select condition mask type is supposed to be. Try to extend the logic for skipping the transform for condition types that aren't setccs. It doesn't seem quite right to me though, but checking conditions that seem more sensible (like whether the vselect is going to be expanded) doesn't work since this seems to depend on that also. llvm-svn: 316554	2017-10-25 07:14:07 +00:00
Adrian Prantl	74a7e5f6f8	Implement salvageDebugInfo functionality for SelectionDAG. Similar to how llvm::salvageDebugInfo hooks into InstCombine, this adds a hook that can be invoked before an SDNode that is associated with an SDDbgValue is erased to capture the effect of the deleted node in a DIExpression. The motivating example is an SDDbgValue attached to an ADD operation that gets folded into a LOAD+OFFSET operation. rdar://problem/32121503 llvm-svn: 316525	2017-10-24 22:55:12 +00:00
Adrian Prantl	8a5ebfc618	Use range-based for loop. NFC llvm-svn: 316496	2017-10-24 20:38:00 +00:00
Adrian Prantl	1cde06b6be	Use range-based-for. NFC llvm-svn: 316485	2017-10-24 19:32:59 +00:00
Adrian Prantl	3c5379a357	Doxygenify comments. llvm-svn: 316466	2017-10-24 17:23:40 +00:00
Simon Pilgrim	12498e9376	[SelectionDAG] Add VSELECT support to ComputeNumSignBits llvm-svn: 316457	2017-10-24 16:38:38 +00:00
George Burgess IV	72a4675c99	Fix buildbot breakage SP is only used in an assert. Caused by r316374. llvm-svn: 316377	2017-10-23 21:08:02 +00:00
George Burgess IV	d60307ede1	Don't crash when we see unallocatable registers in clobbers This fixes a bug where we'd crash given code like the test-case from https://bugs.llvm.org/show_bug.cgi?id=30792 . Instead, we let the offending clobber silently slide through. This doesn't fully fix said bug, since the assembler will still complain the moment it sees a crypto/fp/vector op, and we still don't diagnose calls that require vector regs. Differential Revision: https://reviews.llvm.org/D39030 llvm-svn: 316374	2017-10-23 20:46:36 +00:00
Simon Pilgrim	93f91e166d	[DAGCombine] Permit combining of shuffles of equivalent splat BUILD_VECTORs combineShuffleOfScalars is very conservative about shuffled BUILD_VECTORs that can be combined together. This patch adds one additional case - if both BUILD_VECTORs represent splats of the same scalar value but with different UNDEF elements, then we should create a single splat BUILD_VECTOR, sharing only the UNDEF elements defined by the shuffle mask. Differential Revision: https://reviews.llvm.org/D38696 llvm-svn: 316331	2017-10-23 15:48:08 +00:00
Florian Hahn	541038ee51	[SelectionDAG] Use dyn_cast without cast. llvm-svn: 316258	2017-10-21 05:37:10 +00:00
Florian Hahn	d87d8e53bf	[SelectionDAG] Use isa to silence unused variable warning (NFC). llvm-svn: 316257	2017-10-21 04:57:03 +00:00
Craig Topper	722fe7b374	[SelectionDAG] Don't subject ConstantSDNodes to the depth limit in computeKnownBits and ComputeNumSignBits. We don't need to do any additional recursion, we just need to analyze the APInt stored in the node. This matches what the ValueTracking versions do for IR. llvm-svn: 316256	2017-10-21 03:22:13 +00:00
Craig Topper	bd7165ec1f	[SelectionDAG] Don't subject ISD::Constant to the depth limit in TargetLowering::SimplifyDemandedBits. Summary: We shouldn't recurse any further but it doesn't mean we shouldn't be able to give the known bits for a constant. The caller would probably like that we always return the right answer for a constant RHS. This matches what InstCombine does in this case. I don't have a test case because this showed up while trying to revive D31724. Reviewers: RKSimon, spatel Reviewed By: RKSimon Subscribers: arsenm, llvm-commits Differential Revision: https://reviews.llvm.org/D38967 llvm-svn: 316255	2017-10-21 02:27:19 +00:00
Craig Topper	3781c30a61	[SelectionDAG] Add a check to getVectorShuffle to ensure that the only negative index we allow is -1. llvm-svn: 316183	2017-10-19 20:59:41 +00:00
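The check described in 3781c30a61 amounts to a one-line predicate over the shuffle mask, where -1 is the sentinel for an undef lane. The sketch below is a standalone illustration with a hypothetical function name and a plain `std::vector<int>` mask; the actual check lives inside getVectorShuffle and may be written differently.

```cpp
#include <algorithm>
#include <vector>

// True if every mask entry is either a lane index (>= 0) or the undef
// sentinel -1. Any other negative value is rejected.
inline bool onlyUndefIsNegative(const std::vector<int> &Mask) {
  return std::all_of(Mask.begin(), Mask.end(),
                     [](int M) { return M >= -1; });
}
```

A mask such as {0, -1, 2, 3} passes, while {0, -2, 2, 3} does not.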
NAKAMURA Takumi	c59f1ab92f	Untabify. llvm-svn: 316079	2017-10-18 13:31:28 +00:00
Simon Pilgrim	cc5051307b	[DAGCombine] Add SCALAR_TO_VECTOR undef handling to simplifyShuffleMask. This allows us to simplify later visitVECTOR_SHUFFLE optimizations such as combineShuffleOfScalars. Noticed whilst working on D38696 llvm-svn: 316017	2017-10-17 18:14:48 +00:00
Mark Searles	67e40dbe69	Use the return value of UpdateNodeOperands(); in some cases, UpdateNodeOperands() modifies the node in-place and using the return value isn’t strictly necessary. However, it does not necessarily modify the node, but may return a resultant node if it already exists in the DAG. See comments in UpdateNodeOperands(). In that case, the return value must be used to avoid such scenarios as an infinite loop (node is assumed to have been updated, so added back to the worklist, and re-processed; however, node hasn’t changed so it is once again passed to UpdateNodeOperands(), assumed modified, added back to worklist; cycle infinitely repeats). Differential Revision: https://reviews.llvm.org/D38466 llvm-svn: 315957	2017-10-16 23:38:53 +00:00
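The hazard described in 67e40dbe69 can be reproduced with a toy uniquing table. The sketch below is not SelectionDAG: `Node`, `ToyDAG`, `getNode`, and `updateOperand` are hypothetical names, and each node has just one integer operand. It shows the two outcomes the message distinguishes: the update either mutates the node in place, or leaves it untouched and returns an equivalent node that already exists. A caller that ignores the return value in the second case keeps working with a node that never changed.

```cpp
#include <map>
#include <memory>
#include <utility>
#include <vector>

struct Node {
  int Opcode;
  int Operand;
};

class ToyDAG {
  std::vector<std::unique_ptr<Node>> Storage;
  std::map<std::pair<int, int>, Node *> CSEMap; // (opcode, operand) -> node

public:
  // Returns the unique node for (Opcode, Operand), creating it if needed.
  Node *getNode(int Opcode, int Operand) {
    auto Key = std::make_pair(Opcode, Operand);
    auto It = CSEMap.find(Key);
    if (It != CSEMap.end())
      return It->second;
    Storage.push_back(std::make_unique<Node>(Node{Opcode, Operand}));
    Node *N = Storage.back().get();
    CSEMap[Key] = N;
    return N;
  }

  // If a node equal to the updated form already exists, N is left unchanged
  // and the existing node is returned. Otherwise N is mutated in place.
  Node *updateOperand(Node *N, int NewOperand) {
    auto NewKey = std::make_pair(N->Opcode, NewOperand);
    auto It = CSEMap.find(NewKey);
    if (It != CSEMap.end())
      return It->second;
    CSEMap.erase(std::make_pair(N->Opcode, N->Operand));
    N->Operand = NewOperand;
    CSEMap[NewKey] = N;
    return N;
  }
};
```

In a worklist-driven combiner, re-queuing `N` after the first kind of return would process the same unchanged node again, which is the infinite loop the commit message describes.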
Krzysztof Parzyszek	7791aebbb3	Add iterator range MachineRegisterInfo::liveins(), adopt users, NFC llvm-svn: 315927	2017-10-16 19:08:41 +00:00
Sjoerd Meijer	f56933ea80	ISel type legalizer: debug messages. NFC. Minor addition and follow up of r314773 and r311533: this adds more debug messages to the type legalizer. For each node, it dumps legalization info for results and operands nodes, rather than just the final legalized node. Differential Revision: https://reviews.llvm.org/D38726 llvm-svn: 315904	2017-10-16 14:07:30 +00:00
Aaron Ballman	1dbcb12601	Reverting r315590; it did not include changes for llvm-tblgen, which is causing link errors for several people. Error LNK2019 unresolved external symbol "public: void __cdecl `anonymous namespace'::MatchableInfo::dump(void)const " (?dump@MatchableInfo@?A0xf4f1c304@@QEBAXXZ) referenced in function "public: void __cdecl `anonymous namespace'::AsmMatcherEmitter::run(class llvm::raw_ostream &)" (?run@AsmMatcherEmitter@?A0xf4f1c304@@QEAAXAEAVraw_ostream@llvm@@@Z) llvm-tblgen D:\llvm\2017\utils\TableGen\AsmMatcherEmitter.obj 1 llvm-svn: 315854	2017-10-15 14:32:27 +00:00
Matt Arsenault	1086f79305	DAG: Add opcode and source type to isFPExtFree This is only currently used for mad/fma transforms. This is the only case where it should be used for AMDGPU, so add an opcode to be sure. llvm-svn: 315740	2017-10-13 19:55:45 +00:00
Matt Arsenault	5307ef32f4	DAG: Add flags to dumps llvm-svn: 315690	2017-10-13 15:41:40 +00:00
Craig Topper	b0f8cf7ff3	[SelectionDAG] Cleanup the SIGN_EXTEND_INREG handling in computeKnownBits. NFCI Use less temporary APInts. Use bit counting more. Don't call getScalarSizeInBits so many places, just capture it once. llvm-svn: 315671	2017-10-13 05:35:35 +00:00
Craig Topper	f75b98319a	[SelectionDAG] Fix typo in comment. NFC llvm-svn: 315670	2017-10-13 05:35:34 +00:00
Craig Topper	fbbd815f67	[SelectionDAG] Correct the early out in SelectionDAG::getZeroExtendInReg to work properly for vector types. I don't know if we ever hit this case or not. Turning it into an assert only fired on expanding some atomic operation in a SystemZ lit test. llvm-svn: 315648	2017-10-13 00:18:58 +00:00
Craig Topper	bbfe102642	[SelectionDAG] Const-correct the DemandedMask argument to one of the overloads of SimplifyDemandedBits. NFC llvm-svn: 315641	2017-10-12 23:46:05 +00:00
Craig Topper	a196f15138	[SelectionDAG] Simplify the ISD::SIGN_EXTEND/ZERO_EXTEND handling to use less temporary APInts by counting bits instead. NFCI llvm-svn: 315628	2017-10-12 21:58:25 +00:00
Wei Ding	725c541dbe	Implement custom lowering for ISD::CTTZ_ZERO_UNDEF and ISD::CTTZ. Differential Revision: http://reviews.llvm.org/D37348 llvm-svn: 315610	2017-10-12 19:37:14 +00:00
Don Hinton	16622c817e	[dump] Remove NDEBUG from test to enable dump methods [NFC] Summary: Add LLVM_FORCE_ENABLE_DUMP cmake option, and use it along with LLVM_ENABLE_ASSERTIONS to set LLVM_ENABLE_DUMP. Remove NDEBUG and only use LLVM_ENABLE_DUMP to enable dump methods. Move definition of LLVM_ENABLE_DUMP from config.h to llvm-config.h so it'll be picked up by public headers. Differential Revision: https://reviews.llvm.org/D38406 llvm-svn: 315590	2017-10-12 16:16:06 +00:00
Wei Mi	f44870d1d1	Revert r307036 because of PR34919. llvm-svn: 315540	2017-10-12 00:24:52 +00:00
Sanjay Patel	0d50eae372	[DAGCombiner] convert insertelement of bitcasted vector into shuffle Eg: insert v4i32 V, (v2i16 X), 2 --> shuffle v8i16 V', X', {0,1,2,3,8,9,6,7} This is a generalization of the IR fold in D38316 to handle insertion into a non-undef vector. We may want to abandon that one if we can't find value in squashing the more specific pattern sooner. We're using the existing legal shuffle target hook to avoid AVX512 horror with vXi1 shuffles. There may be room for improvement in the shuffle lowering here, but that would be follow-up work. Differential Revision: https://reviews.llvm.org/D38388 llvm-svn: 315460	2017-10-11 14:12:16 +00:00
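The mask in the 0d50eae372 example can be derived mechanically. The sketch below uses a hypothetical helper, `insertAsShuffleMask`, and assumes the inserted value X has been widened so that its elements occupy the first lanes of the second shuffle operand. The mask starts as the identity over the narrow-element view of V and then redirects the lanes covered by the inserted wide element to the second operand.

```cpp
#include <vector>

// NumWideElts: element count of the wide vector type (4 for v4i32).
// Ratio:       narrow elements per wide element (2 for i32 over i16).
// InsertIdx:   index of the wide element being replaced.
std::vector<int> insertAsShuffleMask(unsigned NumWideElts, unsigned Ratio,
                                     unsigned InsertIdx) {
  unsigned NumNarrowElts = NumWideElts * Ratio;
  std::vector<int> Mask(NumNarrowElts);
  // Identity: every lane initially comes from the first operand (V').
  for (unsigned I = 0; I < NumNarrowElts; ++I)
    Mask[I] = static_cast<int>(I);
  // Lanes of the replaced wide element come from the second operand (X'),
  // whose indices start at NumNarrowElts.
  unsigned Start = InsertIdx * Ratio;
  for (unsigned J = 0; J < Ratio; ++J)
    Mask[Start + J] = static_cast<int>(NumNarrowElts + J);
  return Mask;
}
```

Calling `insertAsShuffleMask(4, 2, 2)` yields {0,1,2,3,8,9,6,7}, matching the mask quoted in the commit message.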
Alex Bradbury	d4ed8e7017	[TargetLowering] Correctly track NumFixedArgs field of CallLoweringInfo The NumFixedArgs field of CallLoweringInfo is used by TargetLowering::LowerCallTo to determine whether a given argument is passed using the vararg calling convention or not (specifically, to set IsFixed for each ISD::OutputArg). Firstly, CallLoweringInfo::setLibCallee and CallLoweringInfo::setCallee both incorrectly set NumFixedArgs based on the _previous_ args list. Secondly, TargetLowering::LowerCallTo failed to increment NumFixedArgs when modifying the argument list so a pointer is passed for the return value. If your backend uses the IsFixed property or directly accesses NumFixedArgs, it is _possible_ this change could result in codegen changes (although the previous behaviour would have been incorrect). No such cases have been identified during code review for any in-tree architecture. Differential Revision: https://reviews.llvm.org/D37898 llvm-svn: 315457	2017-10-11 13:48:45 +00:00
Eugene Zelenko	a9f4ca477c	[CodeGen] Fix some Clang-tidy modernize and Include What You Use warnings; other minor fixes (NFC). llvm-svn: 315380	2017-10-10 22:33:29 +00:00
David Stuttard	ed11cfd166	[DAGCombine] Fix for shuffle to vector extend for non power 2 vectors Summary: See https://llvm.org/PR33743 for more details. It seems that for non-power-of-2 vector sizes, the algorithm can produce non-matching sizes for input and result, causing an assert. This usually isn't a problem as the isAnyExtend check will weed these out, but in some cases (most often with lots of undefined values for the mask indices) it can pass this check for non-power-of-2 vectors. This adds an extra check that ensures the bit size will match for the result and input (as required). Subscribers: nhaehnle Differential Revision: https://reviews.llvm.org/D35241 llvm-svn: 315307	2017-10-10 12:45:45 +00:00
Adam Nemet	ec7409d86a	Rename OptimizationDiagnosticInfo.* to OptimizationRemarkEmitter.* Sync it up with the name of the class actually defined here. This has been bothering me for a while... llvm-svn: 315249	2017-10-09 23:19:02 +00:00

8564 Commits