llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 04:32:44 +01:00

Author	SHA1	Message	Date
Arthur Eubanks	266a9a84be	[OpaquePtr] Use ArgListEntry::IndirectType for lowering ABI attributes Consolidate PreallocatedType and ByValType into IndirectType, and use that for inalloca.	2021-07-07 14:58:38 -07:00
Jinsong Ji	8dfa5bc073	[PowerPC] Add P7 RUN line for load and splat test	2021-07-07 21:43:46 +00:00
Arthur Eubanks	b3ffc2a93b	[OpaquePtr] Remove checking pointee type for byval/preallocated type These currently always require a type parameter. The bitcode reader already upgrades old bitcode without the type parameter to use the pointee type. In cases where the caller does not have byval but the callee does, we need to follow CallBase::paramHasAttr() and also look at the callee for the byval type so that CallBase::isByValArgument() and CallBase::getParamByValType() are in sync. Do the same for preallocated. While we're here add a corresponding version for inalloca since we'll need it soon. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D104663	2021-07-07 14:28:55 -07:00
George Burgess IV	6032fd6068	utils: add a revert checker Chrome OS and Android have found it useful to have an automated revert checker. It was requested to upstream it, since other folks in the LLVM community may also find value in it. The tests depend on having a full (non-shallow) checkout of LLVM. This seems reasonable to me, since: - the tests should only be run if the user is developing on this script - it's kind of hard to develop on this script without local git history :) If people really want, the tests' dependency on LLVM's history can be removed. It's mostly just effort/complexity that doesn't seem necessary. Differential Revision: https://reviews.llvm.org/D105578	2021-07-07 14:20:01 -07:00
Patrick Holland	18254b5fab	[MCA] [AMDGPU] Adding an implementation to AMDGPUCustomBehaviour for handling s_waitcnt instructions. This commit also makes some slight changes to the scheduling model for AMDGPU to set the RetireOOO flag for all scheduling classes. This flag is only used by llvm-mca and allows instructions to retire out of order. See the differential link below for a deeper explanation of everything. Differential Revision: https://reviews.llvm.org/D104730	2021-07-07 14:17:54 -07:00
David Green	cfbbe8e54f	[ARM] Add some opaque pointer gather/scatter tests. NFC They seem to work OK. Some other test cleanups at the same time.	2021-07-07 22:03:53 +01:00
Nikita Popov	02090c5cdd	[AsmWriter] Simplify type attribute printing (NFC) Avoid enumerating all supported type attributes, instead fetch their name from the attribute kind.	2021-07-07 22:47:33 +02:00
Nikita Popov	5c3c9d947d	[IR] Simplify Attribute::getAsString() (NFC) Avoid enumerating all attributes here and instead use getNameFromAttrKind(), which is based on the tablegen data. This only leaves us with custom handling for int attributes, which don't have uniform printing.	2021-07-07 22:43:17 +02:00
Fangrui Song	3586ecfc1a	[llvm-nm] Switch command line parsing from llvm::cl to OptTable Part of https://lists.llvm.org/pipermail/llvm-dev/2021-July/151622.html "Binary utilities: switch command line parsing from llvm::cl to OptTable" Users should generally observe no difference as long as they only use intended option forms. Behavior changes: * `-t=d` is removed. Use `-t d` instead. * `--demangle=0` cannot be used. Omit the option or use `--no-demangle` instead. * `--help-list` is removed. This is a `cl::` specific option. Note: * `-t` diagnostic gets improved. * This patch avoids cl::opt collision if we decide to support multiplexing for binary utilities * One-dash long options are still supported. * The `-s` collision (`-s segment section` for Mach-O) is unfortunate. `-s` means `--print-armap` in GNU nm. * This patch removes the last `cl::multi_val` use case from the `llvm/lib/Support/CommandLine.cpp` library `-M` (`--print-armap`), `-U` (`--defined-only`), and `-W` (`--no-weak`) are now deprecated. They could conflict with future GNU nm options. (--print-armap has an existing alias -s, so GNU will unlikely add a new one. --no-weak (not in GNU nm) is rarely used anyway.) `--just-symbol-name` is now deprecated in favor of `--format=just-symbols` and `-j`. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D105330	2021-07-07 13:34:33 -07:00
Nikita Popov	aec8b8bed1	[IR] Make some pointer element type accesses explicit (NFC) Explicitly fetch the pointer element type in various deprecated methods, so we can hopefully remove support from this from the base GEP constructor.	2021-07-07 22:05:30 +02:00
Adrian Prantl	017a58dbae	GlobalISel/AArch64: don't optimize away redundant branches at -O0 This patch prevents GlobalISel from optimizing out redundant branch instructions when compiling without optimizations. The motivating example is code like the following common pattern in Swift, where users expect to be able to set a breakpoint on the early exit: public func f(b: Bool) { guard b else { return // I would like to set a breakpoint here. } ... } The patch modifies two places in GlobalISel: The first one is in IRTranslator.cpp where the removal of redundant branches is made conditional on the optimization level. The second one is in AArch64InstructionSelector.cpp where an -O0 only optimization is being removed. Disabling these optimizations increases code size at -O0 by ~8%. However, doing so improves debuggability, and debug builds are the primary reason why developers compile without optimizations. We thus concluded that this is the right trade-off. rdar://79515454 Differential Revision: https://reviews.llvm.org/D105238	2021-07-07 12:51:55 -07:00
Eli Friedman	468a596bcd	[AArch64] Simplify sve-breakdown-scalable-vectortype.ll. Fix the calling convention so we don't spill every SVE register.	2021-07-07 12:32:17 -07:00
Martin Storsjö	cb69439440	[CodeView] Add missing cases for new enum values This fixes warnings while building llvm-pdbutil after d20b013b490e0603ba21b5ccff966d7e11025775.	2021-07-07 22:15:33 +03:00
Martin Storsjö	5f3a753cf4	[COFF] [CodeView] Add a few new enum values These are undocumented, but are visible in the SDK headers since some versions ago. Differential Revision: https://reviews.llvm.org/D105513	2021-07-07 22:00:18 +03:00
Nemanja Ivanovic	df463561de	[PowerPC] Disable permuted SCALAR_TO_VECTOR on LE without direct moves There are some patterns involving the permuted scalar to vector node for which we don't have patterns without direct moves on little endian subtargets. This causes selection errors. While we can of course add the missing patterns, any additional effort to make this work is not useful since there is no support for any CPU that can run in little endian mode and does not support direct moves.	2021-07-07 13:50:49 -05:00
Nikita Popov	79ce45f3e7	[SCEVExpander] Support opaque pointers This adds support for opaque pointers to expandAddToGEP() by always generating an i8 GEP for opaque pointers. After looking at some other cases (constexpr GEP folding, SROA GEP generation), I've come around to the idea that we should use i8 GEPs for opaque pointers, because the alternative would be to guess a GEP type from surrounding code, which will not be reliable. Ultimately, i8 GEPs is where we want to end up anyway, and opaque pointers just make that the natural choice. There are a couple of other places in SCEVExpander that check pointer element types, I plan to update those when I run across usable test coverage that doesn't assert elsewhere. Differential Revision: https://reviews.llvm.org/D105398	2021-07-07 20:47:59 +02:00
Sanjay Patel	b2ee89efb3	[SLP] rename variable to not be misleading; NFC The reduction matching was probably only dealing with binops when it was written, but we have now generalized it to handle select and intrinsics too, so assert on that too.	2021-07-07 14:40:21 -04:00
Eli Friedman	7117a662d6	[ScalarEvolution] Make sure getMinusSCEV doesn't negate pointers. Add a function removePointerBase that returns, essentially, S - getPointerBase(S). Use it in getMinusSCEV instead of actually subtracting pointers. Differential Revision: https://reviews.llvm.org/D105503	2021-07-07 10:27:10 -07:00
Simon Pilgrim	5668a526d1	[X86][Atom] Fix vector fp<->int resource/throughputs Match what's documented in the Intel AOM - almost all the conversion instructions require BOTH ports (apart from the MMX cvtpi2ps/cvtpi2ps instructions which we already override) - this was being incorrectly modelled as EITHER port. Now that we can use in-order models in llvm-mca, the atom model is a good "worst case scenario" analysis for x86.	2021-07-07 16:52:34 +01:00
Arnold Schwaighofer	37a06f783f	[coro async] Cap the alignment of spilled values (vs. allocas) at the max frame alignment Before this patch we would normally use the ABI alignment, which can be too high for the context alignment. For spilled values we don't need ABI alignment, since the frame entry's address is not escaped. rdar://79664965 Differential Revision: https://reviews.llvm.org/D105288	2021-07-07 08:06:25 -07:00
Philip Reames	5f5aa15bce	[LV] Unconditionally branch from middle to scalar preheader if the scalar loop must execute (try 4) Resubmit after the following changes: * Fix a latent bug related to unrolling with required epilogue (see e49d65f). I believe this is the cause of the prior PPC buildbot failure. * Disable non-latch exits for epilogue vectorization to be safe (9ffa90d) * Split out assert movement (600624a) to reduce churn if this gets reverted again. Previous commit message (try 3) Resubmit after fixing test/Transforms/LoopVectorize/ARM/mve-gather-scatter-tailpred.ll Previous commit message... This is a resubmit of 3e5ce4 (which was reverted by 7fe41ac). The original commit caused a PPC build bot failure we never really got to the bottom of. I can't reproduce the issue, and the bot owner was non-responsive. In the meantime, we stumbled across an issue which seems possibly related, and worked around a latent bug in 80e8025. My best guess is that the original patch exposed that latent issue at higher frequency, but it really is just a guess. Original commit message follows... If we know that the scalar epilogue is required to run, modify the CFG to end the middle block with an unconditional branch to scalar preheader. This is instead of a conditional branch to either the preheader or the exit block. The motivation to do this is to support multiple exit blocks. Specifically, the current structure forces us to identify immediate dominators and which exit block to branch from in the middle terminator. For the multiple exit case - where we know require scalar will hold - these questions are ill formed. This is the last change needed to support multiple exit loops, but since the diffs are already large enough, I'm going to land this, and then enable separately. You can think of this as being NFCIish prep work, but the changes are a bit too involved for me to feel comfortable tagging the review that way. Differential Revision: https://reviews.llvm.org/D94892	2021-07-07 07:44:35 -07:00
Irina Dobrescu	2f32eb71fa	[AArch64][GlobalISel] Lower vector types for min/max Differential Revision: https://reviews.llvm.org/D105433	2021-07-07 15:34:03 +01:00
Dylan Fleming	df10fbb5b5	[SVE] Fix ShuffleVector cast<FixedVectorType> in truncateToMinimalBitwidths Depends on D104239 Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D105341	2021-07-07 15:30:10 +01:00
Tom Stellard	e3b57c8a21	unittests: Fix library dependency name Corrects the library name of LLVMAsmParser added in 2e4ec3e5d6a3bf7f61bea6898286cba64be7b764.	2021-07-07 06:56:15 -07:00
Tom Stellard	d0248bfa5e	unittests: Fix build with LLVM_LINK_LLVM_DYLIB=ON The build system was linking the PluginsTests unittest against libLLVM.so and LLVMAsmParser which was causing the test to fail with this error: LLVM ERROR: inconsistency in registered CommandLine options We need to add llvm libraries to LLVM_LINK_COMPONENTS so that they are dropped from the linker arguments when linking with LLVM_LINK_LLVM_DYLIB=ON Reviewed By: aeubanks Differential Revision: https://reviews.llvm.org/D105523	2021-07-07 06:50:45 -07:00
Sander de Smalen	3bbfdfb241	[CostModel] Express cost(urem) as cost(div+mul+sub) when set to Expand. The Legalizer expands the operations of urem/srem into a div+mul+sub or divrem when those are legal/custom. This patch changes the cost-model to reflect that cost. Since there is no 'divrem' Instruction in LLVM IR, the cost of divrem is assumed to be the same as div+mul+sub since the three operations will need to be executed at runtime regardless. Patch co-authored by David Sherwood (@david-arm) Reviewed By: RKSimon, paulwalker-arm Differential Revision: https://reviews.llvm.org/D103799	2021-07-07 14:40:28 +01:00
Arnold Schwaighofer	337a4ae273	[coro async] Move code to proper switch While upstreaming patches this code somehow was applied to the wrong switch statement. Differential Revision: https://reviews.llvm.org/D105504	2021-07-07 06:19:08 -07:00
Zarko Todorovski	ecca945191	[AIX] Use VSSRC/VSFRC Register classes for f32/f64 callee arguments on P8 and above Adding usage of VSSRC and VSFRC when adding the live in registers on AIX. This matches the behaviour of the rest of PPC Subtargets. Reviewed By: nemanjai, #powerpc Differential Revision: https://reviews.llvm.org/D104396	2021-07-07 09:18:20 -04:00
Simon Pilgrim	e8227ef0a0	[IndVarSimplify][X86] Regenerate loop-invariant-conditions.ll test checks	2021-07-07 13:58:28 +01:00
Simon Pilgrim	8dde5bc762	[CostModel][X86] Adjust sext/zext SSE/AVX legalized costs based on llvm-mca reports. Update costs based on the worst case costs from the script in D103695. Move to using legalized types wherever possible, which allows us to prune the cost tables.	2021-07-07 13:58:27 +01:00
Simon Pilgrim	929bb9374e	[CostModel][X86] Adjust sitofp/uitofp SSE/AVX legalized costs based on llvm-mca reports. Update (mainly) vXi8/vXi16 -> vXf32/vXf64 sitofp/uitofp costs based on the worst case costs from the script in D103695. Move to using legalized types wherever possible, which allows us to prune the cost tables.	2021-07-07 12:03:45 +01:00
LLVM GN Syncbot	a67e94b8ca	[gn build] Port 6829db727e9e	2021-07-07 10:14:59 +00:00
Max Kazantsev	9407ff0a51	[NFC] Remove duplicate function calls Removed repeated call of L->getHeader(). Now using previously stored return value. Patch by Dmitry Makogon! Differential Revision: https://reviews.llvm.org/D105535 Reviewed By: mkazantsev	2021-07-07 17:02:36 +07:00
Dylan Fleming	6cb338b059	[SVE] Fixed cast<FixedVectorType> on scalable vector in SelectionDAGBuilder::getUniformBase Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D105350	2021-07-07 10:48:17 +01:00
David Green	13a750213e	[DAG] Reassociate Add with Or We already have reassociation code for Adds and Ors separately in DAG combiner, this adds it for the combination of the two where Ors act like Adds. It reassociates (add (or x, c), y) -> (add (add x, y), c) where we know that the Or's operands have no common bits set, and the Or has one use. Differential Revision: https://reviews.llvm.org/D104765	2021-07-07 10:21:07 +01:00
Dylan Fleming	5e3e34426a	[SVE] Fix cast<FixedVectorType> in truncateToMinimalBitwidths Reviewed By: sdesmalen Differential Revision: https://reviews.llvm.org/D104239	2021-07-07 09:58:05 +01:00
Jay Foad	8bb25dfcf8	[AMDGPU] Simplify tablegen files. NFC. There is no need to cast records to strings before comparing them.	2021-07-07 09:19:23 +01:00
Fangrui Song	a394c72e7f	[llvm-readobj][test] Improve grouped option test	2021-07-06 23:02:33 -07:00
Stanislav Mekhanoshin	3efbb63d53	[AMDGPU] isPassEnabled() helper to check cl::opt and OptLevel We have several checks for both cl::opt and OptLevel over our pass config, although these checks do not work properly if the default value of a cl::opt is false. Create a helper to use instead and properly handle it. NFC for now. Differential Revision: https://reviews.llvm.org/D105517	2021-07-06 21:53:35 -07:00
Johannes Doerfert	2f34f28211	[Attributor][FIX] Replace uses first, then values Before, we replaced values by registering all their uses. However, as we replace a value, old uses become stale. We now replace values explicitly and keep track of "new values" when doing so to avoid replacing only uses in stale/old values but not their replacements.	2021-07-06 22:43:51 -05:00
Johannes Doerfert	f2839d8f48	[Attributor][NFC] Precommit heap-2-stack test case	2021-07-06 22:41:23 -05:00
Johannes Doerfert	4f0b565d46	[Attributor] Introduce a helper function to deal with undef + none We often need to deal with the value lattice that contains none and undef as special values. A simple helper makes this much nicer. Differential Revision: https://reviews.llvm.org/D103857	2021-07-06 22:41:21 -05:00
Johannes Doerfert	13dc82700d	[Attributor] Simplify operands inside of simplification AAs first When we do simplification via AAPotentialValues or AAValueConstantRange we need to simplify the operands of an instruction we deconstruct first. This does not only improve the result, see for example range.ll, but is required as we allow outside AAs to provide simplification rules via callbacks. If we do ignore the simplification rules and base other simplifications on the IR instead we can create an inconsistent state.	2021-07-06 22:41:18 -05:00
Tom Stellard	edc8227c9b	tests/CodeGen: Use %python lit substitution when invoking python This will use the python that LLVM was configured to use rather than python from PATH. Reviewed By: serge-sans-paille Differential Revision: https://reviews.llvm.org/D105224	2021-07-06 18:46:36 -07:00
Nemanja Ivanovic	11ea47a03f	[PowerPC] Re-enable combine for i64 BSWAP on targets without LDBRX The combine was disabled in 4e22c7265d86 as it caused failures in the ppc64be-multistage (bootstrap) bot. It turns out that the combine did not correctly update the MMO for the high load which caused aliased stores to be reported as unaliased. This patch fixes that problem and re-enables the combine.	2021-07-06 20:42:01 -05:00
Eli Friedman	0fd04c425c	[AArch64] Add more tests related to vselect with constant condition. Not a complete set of tests, but a starting point if anyone wants to look at improving this.	2021-07-06 17:06:22 -07:00
Eli Friedman	292c76b03f	[AArch64] Sync isDef32 to the current x86 version. We should probably come up with some better way to do this, but let's make sure to catch known issues for now.	2021-07-06 17:05:01 -07:00
Stanislav Mekhanoshin	9e3245ac4b	[AMDGPU] Move atomic expand past infer address spaces There are cases where infer address spaces pass cannot yet infer an address space in the opt pipeline and then in the llc pipeline it runs too late for atomic expand pass to benefit from a specific address space. Move atomic expand pass past the infer address spaces. Fixes: SWDEV-293410 Differential Revision: https://reviews.llvm.org/D105511	2021-07-06 15:53:32 -07:00
Stanislav Mekhanoshin	fdcf893dae	[AMDGPU] Do not run IR optimizations at -O0 Differential Revision: https://reviews.llvm.org/D105515	2021-07-06 15:29:52 -07:00
Stanislav Mekhanoshin	0364d8e371	[AMDGPU] Fix pass name of AMDGPULowerKernelAttributes. NFC. This was obviously copy-pasted.	2021-07-06 15:03:31 -07:00
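
Two of the commits listed above rest on small arithmetic identities: "[DAG] Reassociate Add with Or" relies on an Or behaving like an Add when its operands share no set bits, and "[CostModel] Express cost(urem) as cost(div+mul+sub)" relies on a remainder being computable as a division, a multiplication and a subtraction. The sketch below only checks those identities on 32-bit unsigned values. It is not LLVM code, and every function name in it is hypothetical.

```python
# Illustrative only: these helpers are hypothetical and are not LLVM APIs.
MASK32 = 0xFFFFFFFF  # model 32-bit wrapping unsigned arithmetic


def add32(a, b):
    """32-bit wrapping addition."""
    return (a + b) & MASK32


def reassoc_before(x, c, y):
    """The pattern before reassociation: (add (or x, c), y)."""
    return add32(x | c, y)


def reassoc_after(x, c, y):
    """The pattern after reassociation: (add (add x, y), c)."""
    return add32(add32(x, y), c)


def no_common_bits(x, c):
    """Precondition from the commit message: the Or operands share no set bits."""
    return (x & c) == 0


def urem_expanded(x, y):
    """Unsigned remainder written as div + mul + sub: x - (x / y) * y."""
    quotient = x // y
    return (x - ((quotient * y) & MASK32)) & MASK32


if __name__ == "__main__":
    x, c, y = 0b10100000, 0b0101, 7
    # With disjoint bits, x | c equals x + c, so both forms agree.
    print(no_common_bits(x, c), reassoc_before(x, c, y), reassoc_after(x, c, y))
    # With overlapping bits the rewrite would be wrong, hence the precondition.
    print(no_common_bits(5, 5), reassoc_before(5, 5, 0), reassoc_after(5, 5, 0))
    print(urem_expanded(17, 5), 17 % 5)
```

The overlapping-bits case shows why the commit message restricts the rewrite to Ors whose operands have no common bits set: without that condition the two forms compute different values.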


218142 Commits