llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-24 03:33:20 +01:00

Author	SHA1	Message	Date
Florian Hahn	424d2206a7	[LAA] Add test for PR47751, which currently uses wrong bounds.	2020-10-07 11:22:22 +01:00
Jay Foad	c63e985f6f	[SDag] SimplifyDemandedBits: simplify to FP constant if all bits known We were already doing this for integer constants. This patch implements the same thing for floating point constants. Differential Revision: https://reviews.llvm.org/D88570	2020-10-07 09:24:38 +01:00
Max Kazantsev	fe21571576	[Test] Add one more test where we can avoid creating trunc	2020-10-07 15:06:38 +07:00
Rainer Orth	5116ffc6e0	[Support][unittests] Enforce alignment in ConvertUTFTest `LLVM-Unit :: Support/./SupportTests/ConvertUTFTest.ConvertUTF16LittleEndianToUTF8String` `FAIL`s on Solaris/sparcv9: In `llvm/lib/Support/ConvertUTFWrapper.cpp` (`convertUTF16ToUTF8String`) the `SrcBytes` arg is reinterpreted/accessed as `UTF16` (`unsigned short`, which requires 2-byte alignment on strict-alignment targets like Sparc) without anything guaranteeing the alignment, so the access yields a `SIGBUS`. This patch avoids this by enforcing the required alignment in the callers. Tested on `sparcv9-sun-solaris2.11`. Differential Revision: https://reviews.llvm.org/D88824	2020-10-07 09:08:41 +02:00
Max Kazantsev	5a5039d1f8	[NFC] Use getZero instead of getConstant(0)	2020-10-07 13:53:36 +07:00
Roman Lebedev	034e32765e	[SROA] rewritePartition()/findCommonType(): if uses have conflicting type, try getTypePartition() before falling back to largest integral use type (PR47592) And another step towards transformss not introducing inttoptr and/or ptrtoint casts that weren't there already. In this case, when load/store uses have conflicting types, instead of falling back to the iN, we can try to use allocated sub-type. As disscussed, this isn't the best idea overall (we shouldn't rely on allocated type), but it works fine as a temporary measure. I've measured, and @ `-O3` as of vanilla llvm test-suite + RawSpeed, this results in +0.05% more bitcasts, -5.51% less inttoptr and -1.05% less ptrtoint (at the end of middle-end opt pipeline) See https://bugs.llvm.org/show_bug.cgi?id=47592 Reviewed By: efriedma Differential Revision: https://reviews.llvm.org/D88788	2020-10-07 09:20:19 +03:00
Yonghong Song	a6506984a7	BPF: avoid duplicated globals for CORE relocations This patch fixed two issues related with relocation globals. In LLVM, if a global, e.g. with name "g", is created and conflict with another global with the same name, LLVM will rename the global, e.g., with a new name "g.2". Since relocation global name has special meaning, we do not want llvm to change it, so internally we have logic to check whether duplication happens or not. If happens, just reuse the previous global. The first bug is related to non-btf-id relocation (BPFAbstractMemberAccess.cpp). Commit 54d9f743c8b0 ("BPF: move AbstractMemberAccess and PreserveDIType passes to EP_EarlyAsPossible") changed ModulePass to FunctionPass, i.e., handling each function at a time. But still just one BPFAbstractMemberAccess object is created so module level de-duplication still possible. Commit 40251fee0084 ("[BPF][NewPM] Make BPFTargetMachine properly adjust NPM optimizer pipeline") made a change to create a BPFAbstractMemberAccess object per function so module level de-duplication is not possible any more without going through all module globals. This patch simply changed the map which holds reloc globals as class static, so it will be available to all BPFAbstractMemberAccess objects for different functions. The second bug is related to btf-id relocation (BPFPreserveDIType.cpp). Before Commit 54d9f743c8b0, the pass is a ModulePass, so we have a local variable, incremented for each instance, and works fine. But after Commit 54d9f743c8b0, the pass becomes a FunctionPass. Local variable won't work properly since different functions will start with the same initial value. Fix the issue by change the local count variable as static, so it will be truely unique across the whole module compilation. Differential Revision: https://reviews.llvm.org/D88942	2020-10-06 22:37:49 -07:00
Max Kazantsev	725c655687	[Test] Add test showing that we can avoid inserting trunc/zext	2020-10-07 12:19:01 +07:00
Chen Zheng	937f27d268	[MachineInstr] exclude call instruction in mayAlias we now get noAlias result for a call instruction and other load/store/call instructions if we query mayAlias. This is not right as call instruction is not with mayloadorstore, but it may alter the memory. This patch fixes this wrong alias query. Differential Revision: https://reviews.llvm.org/D87490	2020-10-07 00:12:21 -04:00
Chen Zheng	a20254fe88	[PowerPC] implement target hook getTgtMemIntrinsic This patch can make pass recognize Powerpc related memory intrinsics. Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D88373	2020-10-07 00:02:44 -04:00
Chen Zheng	60fa7c7101	[PowerPC] add more builtins for PPCTargetLowering::getTgtMemIntrinsic Reviewed By: steven.zhang Differential Revision: https://reviews.llvm.org/D88374	2020-10-06 23:48:33 -04:00
Bill Wendling	0be97dba3f	[CodeGen][TailDuplicator] Don't duplicate blocks with INLINEASM_BR Tail duplication of a block with an INLINEASM_BR may result in a PHI node on the indirect branch. This is okay, but it also introduces a copy for that PHI node after the INLINEASM_BR, which is not okay. See: https://github.com/ClangBuiltLinux/linux/issues/1125 Differential Revision: https://reviews.llvm.org/D88823	2020-10-06 18:44:59 -07:00
Valentin Clement	865511fe51	[flang][openacc] Fix device_num and device_type clauses for init directive This patch fix the device_num and device_type clauses used in the init clause. device_num was not spelled correctly in the parser and was to restrictive with scalarIntConstantExpr instead of scalarIntExpr. device_type is now taking a list of ScalarIntExpr. Reviewed By: kiranchandramohan Differential Revision: https://reviews.llvm.org/D88571	2020-10-06 21:27:01 -04:00
Johannes Doerfert	b71edafce2	[Attributor] Use smarter way to determine alignment of GEPs Use same logic existing in other places to deal with base case GEPs. Add the original Attributor talk example.	2020-10-06 19:31:08 -05:00
Johannes Doerfert	66f0aafaf0	[Attributor] Ignore read accesses to constant memory The old function attribute deduction pass ignores reads of constant memory and we need to copy this behavior to replace the pass completely. First step are constant globals. TBAA can also describe constant accesses and there are other possibilities. We might want to consider asking the alias analyses that are available but for now this is simpler and cheaper.	2020-10-06 19:31:07 -05:00
Johannes Doerfert	cfcde5122b	[Attributor] Give up early on AANoReturn::initialize If the function is not assumed `noreturn` we should not wait for an update to mark the call site as "may-return". This has two kinds of consequences: - We have less iterations in many tests. - We have less deductions based on "known information" (since we ask earlier, point 1, and therefore assumed information is not "known" yet). The latter is an artifact that we might want to tackle properly at some point but which is not easily fixable right now.	2020-10-06 19:31:07 -05:00
Nico Weber	2e9cf00e3d	[gn build] manually port 5e4409f308177	2020-10-06 18:43:49 -04:00
Dave Airlie	fd4efdea0f	Fix out-of-tree clang build due to sysexits change The sysexists change broke clang building out of tree against llvm. https://reviews.llvm.org/D88467	2020-10-06 18:21:17 -04:00
Lang Hames	c9187b8c21	[RuntimeDyld][COFF] Report fatal error on error, rather than emiting diagnostic. Report a fatal error if an IMAGE_REL_AMD64_ADDR32NB cannot be applied due to an out-of-range target. Previously we emitted a diagnostic to llvm::errs and continued. Patch by Dale Martin. Thanks Dale!	2020-10-06 15:16:29 -07:00
Duncan P. N. Exon Smith	8fbfc2b8c5	docs: Emphasize ArrayRef over SmallVectorImpl The section on SmallVector has a note about preferring SmallVectorImpl for APIs but doesn't mention ArrayRef. Although ArrayRef is discussed elsewhere, let's re-emphasize here. Differential Revision: https://reviews.llvm.org/D49881	2020-10-06 18:13:52 -04:00
Alexandre Ganea	8eda5d6321	Revert [lit] Support running tests on Windows without GnuWin32 This reverts b3418cb4eb1456c41606f4621dcfa362fe54183c and d12ae042e17b27ebc8d2b5ae3d8dd5f88384d093 This breaks some external bots, see discussion in https://reviews.llvm.org/D84380 In the meanwhile, please use `cmake -DLLVM_LIT_TOOLS_DIR="C:/Program Files/Git/usr/bin"` or add it to %PATH%.	2020-10-06 15:38:18 -04:00
Mircea Trofin	c33f12cf26	[NFC][MC] Type uses of MCRegUnitIterator as MCRegister This is one of many subsequent similar changes. Note that we're ok with the parameter being typed as MCPhysReg, as MCPhysReg -> MCRegister is a correct conversion; Register -> MCRegister assumes the former is indeed physical, so we stop relying on the implicit conversion and use the explicit, value-asserting asMCReg(). Differential Revision: https://reviews.llvm.org/D88862	2020-10-06 12:09:56 -07:00
Scott Linder	709b3d8e67	[AMDGPU] Fix remaining kernel descriptor test Follow up on e4a9e4ef554a to fix a test I missed in the original patch. Committed as obvious.	2020-10-06 18:45:04 +00:00
Scott Linder	265f34d2aa	[AMDGPU] Emit correct kernel descriptor on big-endian hosts Previously we wrote multi-byte values out as-is from host memory. Use the `emitIntN` helpers in `MCStreamer` to produce a valid descriptor irrespective of the host endianness. Reviewed By: arsenm, rochauha Differential Revision: https://reviews.llvm.org/D88858	2020-10-06 17:29:38 +00:00
Stanislav Mekhanoshin	1aaa1eb0d8	[AMDGPU] Create isGFX9Plus utility function Introduce a utility function to make it more convenient to write code that is the same on the GFX9 and GFX10 subtargets. Use isGFX9Plus in the AsmParser for AMDGPU. Authored By: Joe_Nash Differential Revision: https://reviews.llvm.org/D88908	2020-10-06 10:18:43 -07:00
Simon Pilgrim	c076cb5935	[X86][SSE] combineX86ShuffleChain add 'CanonicalizeShuffleInput' helper. NFCI. As part of PR45974, we're getting closer to not creating 'padded' vectors on-the-fly in combineX86ShufflesRecursively, and only pad the source inputs if we have a definite match inside combineX86ShuffleChain. At the moment combineX86ShuffleChain just has to bitcast an input to the correct shuffle type, but eventually we'll need to pad them as well. So, move the bitcast into a 'CanonicalizeShuffleInput helper for now, making the diff for future padding support a lot smaller.	2020-10-06 17:47:24 +01:00
Sebastian Neubauer	2d9c389a77	[AMDGPU] Remove SIInstrInfo::calculateLDSSpillAddress This function does not seem to be used anymore. Differential Revision: https://reviews.llvm.org/D88904	2020-10-06 18:45:22 +02:00
Nikita Popov	a7e1d4a9e3	[MemCpyOpt] Use dereferenceable pointer helper The call slot optimization has some home-grown code for checking whether the destination is dereferenceable. Replace this with the generic isDereferenceableAndAlignedPointer() helper. I'm not checking alignment here, because that is currently handled separately and may be an enforced alignment for allocas. The clean way of integrating that part would probably be to accept a callback in isDereferenceableAndAlignedPointer() for the actual isAligned check, which would then have a chance to use an enforced alignment instead. This allows the destination to be a GEP (among other things), though the two open TODOs may prevent it from working in practice. Differential Revision: https://reviews.llvm.org/D88805	2020-10-06 18:41:19 +02:00
Nikita Popov	c845089f3a	[MemCpyOpt] Check for throwing calls during call slot optimization When performing call slot optimization for a non-local destination, we need to check whether there may be throwing calls between the call and the copy. Otherwise, the early write to the destination may be observable by the caller. This was already done for call slot optimization of load/store, but not for memcpys. For the sake of clarity, I'm moving this check into the common optimization function, even if that does need an additional instruction scan for the load/store case. As efriedma pointed out, this check is not sufficient due to potential accesses from another thread. This case is left as a TODO. Differential Revision: https://reviews.llvm.org/D88799	2020-10-06 18:24:40 +02:00
Nikita Popov	c882ea3914	[MemCpyOpt] Add separate statistic for call slot optimization (NFC)	2020-10-06 18:14:10 +02:00
Simon Pilgrim	03d833fcca	[APIntTest] Extend extractBits to check 'lshr+trunc' pattern for each case as well. Noticed while triaging PR47731 that we don't have great coverage for such patterns.	2020-10-06 16:32:40 +01:00
Fangrui Song	4b618b1f5f	[X86] .code16: temporarily set Mode32Bit when matching an instruction with the data32 prefix PR47632 This allows MC to match `data32 ...` as one instruction instead of two (data32 without insn + insn). The compatibility with GNU as improves: `data32 ljmp` will be matched as ljmpl. `data32 lgdt 4(%eax)` will be matched as `lgdtl` (prefixes: 0x67 0x66, instead of 0x66 0x67). GNU as supports many other `data32 w` as `l`. We currently just hard code `data32 callw` and `data32 ljmpw`. Generalizing the suffix replacement is tricky and requires a think about the "bwlq" appending suffix rules in MatchAndEmitATTInstruction. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D88772	2020-10-06 08:32:03 -07:00
Dávid Bolvanský	b959205f6d	[SimplifyLibCalls] Optimize mempcpy_chk to mempcpy	2020-10-06 17:08:46 +02:00
LLVM GN Syncbot	045458557c	[gn build] Port aa2b593f149	2020-10-06 14:49:44 +00:00
Arthur Eubanks	b093181fde	[BPF][NewPM] Make BPFTargetMachine properly adjust NPM optimizer pipeline This involves porting BPFAbstractMemberAccess and BPFPreserveDIType to NPM, then adding them BPFTargetMachine::registerPassBuilderCallbacks (the NPM equivalent of adjustPassManager()). Reviewed By: yonghong-song, asbirlea Differential Revision: https://reviews.llvm.org/D88855	2020-10-06 07:42:32 -07:00
Arthur Eubanks	edc441cd86	[test][InstCombine][NewPM] Fix InstCombine tests under NPM Some of these depended on analyses being present that aren't provided automatically in NPM. early_dce_clobbers_callgraph.ll was previously inlining a noinline function? cast-call-combine.ll relied on the legacy always-inline pass being a CGSCC pass and getting rerun. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D88187	2020-10-06 07:39:00 -07:00
Arthur Eubanks	846976cef1	[test][NewPM] Make dead-uses.ll work under NPM This one is weird... globals-aa needs to be already computed at licm, or else a function pass can't run a module analysis and won't have access to globals-aa. But the globals-aa result is impacted by instcombine in a way that affects what the test is expecting. If globals-aa is computed before instcombine, it is cached and globals-aa used in licm won't contain the necessary info provided by instcombine. Another catch is that if we don't invalidate AAManager, it will use the cached AAManager that instcombine requested, which may not contain globals-aa. So we have to invalidate<aa> so that licm can recompute an AAManager with the globals-aa created by the require<globals-aa>. This is essentially the problem described in https://reviews.llvm.org/D84259. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D88118	2020-10-06 07:33:02 -07:00
Johannes Doerfert	3e4d5e877b	[Attributor][FIX] Move assertion to make it not trivially fail The idea of this assertion was to check the simplified value before we assign it, not after, which caused this to trivially fail all the time.	2020-10-06 09:32:18 -05:00
Johannes Doerfert	c5111678c6	[Attributor][FIX] Dead return values are not `noundef` When we assume a return value is dead we might still visit return instructions via `Attributor::checkForAllReturnedValuesAndReturnInsts(..)`. When we do so the "returned value" is potentially simplified to `undef` as it is the assumed "returned value". This is a problem if there was a preexisting `noundef` attribute that will only be removed as we manifest the `undef` return value. We should not use this combination to derive `unreachable` though. Two test cases fixed.	2020-10-06 09:32:18 -05:00
Johannes Doerfert	3d88a1b2b2	[Attributor][NFC] Ignore benign uses in AAMemoryBehaviorFloating In AAMemoryBehaviorFloating we used to track benign uses in a SetVector. With this change we look through benign uses eagerly to reduce the number of elements (=Uses) we look at during an update. The test does actually not fail prior to this commit but I already wrote it so I kept it.	2020-10-06 09:32:18 -05:00
Dmitri Gribenko	6c75e7bd17	Silence -Wunused-variable in NDEBUG mode	2020-10-06 16:02:17 +02:00
Simon Pilgrim	9a98ab03d1	[InstCombine] canRewriteGEPAsOffset - don't dereference a dyn_cast<>. NFCI. We know V is a IntToPtrInst or PtrToIntInst type so we know its a CastInst - so use cast<> directly. Prevents clang static analyzer warning that we could deference a null pointer.	2020-10-06 14:48:34 +01:00
Simon Pilgrim	1305d04969	[InstCombine] FoldShiftByConstant - consistently use ConstantExpr in logicalshift(trunc(shift(x,c1)),c2) fold. NFCI. This still only gets used for scalar types but now always uses ConstantExpr in preparation for vector support - it was using APInt methods in some places.	2020-10-06 14:48:34 +01:00
Sam Tebbs	8c5ecf8e27	[ARM] Fold select_cc(vecreduce_[u\|s][min\|max], x) into VMINV or VMAXV This folds a select_cc or select(set_cc) of a max or min vector reduction with a scalar value into a VMAXV or VMINV. Differential Revision: https://reviews.llvm.org/D87836	2020-10-06 14:44:58 +01:00
Dmitry Preobrazhensky	7883b0e0f8	[AMDGPU][MC] Added detection of unsupported instructions Implemented identification of unsupported instructions; improved errors reporting. See bug 42590. Reviewers: rampitec Differential Revision: https://reviews.llvm.org/D88211	2020-10-06 16:44:27 +03:00
Jonas Paulsson	3490945bcb	[SystemZAsmParser] Treat VR128 separately in ParseDirectiveInsn(). This patch makes the parser - reject higher vector registers (>=16) in operands where they should not be accepted. - accept higher integers (>=16) in vector register operands. Review: Ulrich Weigand Differential Revision: https://reviews.llvm.org/D88888	2020-10-06 14:42:40 +02:00
Simon Pilgrim	9f91f08925	[InstCombine] FoldShiftByConstant - use PatternMatch for logicalshift(trunc(shift(x,c1)),c2) fold. NFCI.	2020-10-06 13:13:08 +01:00
Simon Pilgrim	0a9d5521b3	[InstCombine] FoldShiftByConstant - remove unnecessary cast<>. NFC. Op1 is already a Constant*	2020-10-06 13:13:08 +01:00
Alexey Lapshin	660205ea0e	[llvm-objcopy][NFC] fix style issues reported by clang-format.	2020-10-06 15:06:25 +03:00
LLVM GN Syncbot	f024187bee	[gn build] Port d6c9dc3c17e	2020-10-06 12:02:07 +00:00

... 2 3 4 5 6 ...

204892 Commits