llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-21 20:12:56 +02:00

Author	SHA1	Message	Date
Tim Northover	4f6226ff36	OpaquePtr: print byval types containing anonymous types correctly. Attribute::getAsString doesn't have enough information to print anonymous Module-level types correctly, so they come back as "%type 0xabcd". This results in broken IR when printing as text. Instead, print type-attributes (currently just byval) using the TypePrinting infrastructure available in AsmWriter. This only applies to function argument attributes.	2020-01-07 15:11:43 +00:00
Matt Arsenault	2346ede1ad	llc: Change behavior of -mcpu with existing attribute Don't overwrite existing target-cpu attributes. I've often found the replacement behavior annoying, and this is inconsistent with how the fast math command line flags interact with the function attributes. Does not yet change target-features, since I think that should behave as a concatenation.	2020-01-07 10:10:25 -05:00
Matt Arsenault	a26ffcf2f6	AMDGPU/GlobalISel: Partially fix llvm.amdgcn.kill pattern import Tests deferred since the existing DAG test depends on some other operations, but isn't far from working as-is.	2020-01-07 10:09:59 -05:00
Hans Wennborg	c7ebd85525	[docs] NFC: Fix typos in documents "the the" -> "the" "an" -> "a" Patch by Kazuaki Ishizaki <ishizaki@jp.ibm.com>! Differential revision: https://reviews.llvm.org/D72091	2020-01-07 16:06:14 +01:00
Sam Parker	e8c8bfd37b	[TypePromotion] Use SetVectors instead of PtrSets Remove the chance of non-deterministic insertion of zexts of the sources by using a SetVector instead of SmallPtrSet. Do the same for sinks for consistency and to negate the small issue from possibly happening. The SafeWrap instructions are now also stored in a SmallVector. The IRPromoter members of these structures have been changed to references. Differential Revision: https://reviews.llvm.org/D72322	2020-01-07 14:51:54 +00:00
Sanjay Patel	e06518d1cc	[DAGCombiner] reduce shuffle of concat of same vector This is possibly a small part towards solving PR42024: https://bugs.llvm.org/show_bug.cgi?id=42024 The vectorizer is creating shuffles of concat like this: %63 = shufflevector <4 x i64> %x, <4 x i64> undef, <8 x i32> <i32 0, i32 1, i32 2, i32 3, i32 0, i32 1, i32 2, i32 3> %64 = shufflevector <8 x i64> %63, <8 x i64> undef, <8 x i32> <i32 0, i32 4, i32 1, i32 5, i32 2, i32 6, i32 3, i32 7> That might be fixable in the vectorizers, but we're not allowed to fold that into a single shuffle in instcombine, so we should have a backend backstop to convert that into the likely simpler form: %64 = shufflevector <4 x i64> %x, <4 x i64> undef, <8 x i32> <i32 0, i32 0, i32 1, i32 1, i32 2, i32 2, i32 3, i32 3> Differential Revision: https://reviews.llvm.org/D72300	2020-01-07 09:48:59 -05:00
Sjoerd Meijer	038287e275	[ARM][MVE] VPT Blocks: findVCMPToFoldIntoVPS This is a recommit of D71330, but with a few things fixed and changed: 1) ReachingDefAnalysis: this was not running with optnone as it was checking skipFunction(), which other analysis passes don't do. I guess this is a copy-paste from a codegen pass. 2) VPTBlockPass: here I've added skipFunction(), because like most/all optimisations, we don't want to run this with optnone. This fixes the issues with the initial/previous commit: the VPTBlockPass was running with optnone, but ReachingDefAnalysis wasn't, and so VPTBlockPass was crashing querying ReachingDefAnalysis. I've added test case mve-vpt-block-optnone.mir to check that we don't run VPTBlock with optnone. Differential Revision: https://reviews.llvm.org/D71470	2020-01-07 13:54:47 +00:00
Simon Pilgrim	71c2db1510	[X86] Standardize shuffle match/lowering function names. NFC. We mainly use lowerShuffle/matchShuffle - replace the (few) lowerVectorShuffle/matchVectorShuffle cases to be consistent.	2020-01-07 13:41:52 +00:00
Victor Campos	3c58da7fc9	[ARM] Improve codegen of volatile load/store of i64 Summary: Instead of generating two i32 instructions for each load or store of a volatile i64 value (two LDRs or STRs), now emit LDRD/STRD. These improvements cover architectures implementing ARMv5TE or Thumb-2. Reviewers: dmgreen, efriedma, john.brawn, nickdesaulniers Reviewed By: efriedma, nickdesaulniers Subscribers: nickdesaulniers, vvereschaka, kristof.beyls, hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D70072	2020-01-07 13:16:18 +00:00
Simon Pilgrim	2583e8dbb5	Fix "use of uninitialized variable" static analyzer warning. NFCI.	2020-01-07 12:06:54 +00:00
Ulrich Weigand	784552367c	[SystemZ] Extend fp-strict-alias test case Explicitly add test for fpexcept.maytrap intrinsics.	2020-01-07 12:44:51 +01:00
LLVM GN Syncbot	6bd74663bc	[gn build] Port c69ae835d0e	2020-01-07 11:41:46 +00:00
Luís Marques	9c9c4dfafa	[RISCV][Docs] Add RISC-V asm template argument modifiers Adds the RISC-V asm template argument modifiers currently supported by LLVM. Additional ones supported by GCC will be added to the documentation when we start supporting them.	2020-01-07 11:06:46 +00:00
Simon Pilgrim	1e0278fea9	Fix Wdocumentation warnings. NFCI.	2020-01-07 10:55:38 +00:00
Simon Pilgrim	dd43b4ee29	Fix "use of uninitialized variable" static analyzer warnings. NFCI.	2020-01-07 10:55:38 +00:00
Simon Pilgrim	39a6803ba2	Fix "use of uninitialized variable" static analyzer warnings. NFCI.	2020-01-07 10:55:37 +00:00
James Henderson	9361ef9442	[DebugInfo] Fix infinite loop caused by reading past debug_line end If the claimed unit length of a debug line program is such that the line table would finish past the end of the .debug_line section, an infinite loop occurs because the data extractor will continue to "read" zeroes without changing the offset. This previously didn't hit an error because the line table program handles a series of zeroes as a bad extended opcode. This patch fixes the inifinite loop and adds a warning if the program doesn't fit in the available data. Reviewed by: JDevlieghere Differential Revision: https://reviews.llvm.org/D72279	2020-01-07 10:22:35 +00:00
Jim Lin	dbb448542d	[NFC] Use isX86() instead of getArch() Summary: This is a clean up for https://reviews.llvm.org/D72247. Reviewers: MaskRay, craig.topper, jhenderson Reviewed By: MaskRay Subscribers: hiraditya, rupprecht, cfe-commits, llvm-commits Tags: #clang, #llvm Differential Revision: https://reviews.llvm.org/D72320	2020-01-07 17:35:44 +08:00
Ulrich Weigand	cf9231a7a8	[SystemZ] Fix python failure in test case With recent Python the Large/spill-02.py test failed with an error: TypeError: can't multiply sequence by non-int of type 'float'	2020-01-07 10:26:37 +01:00
Ehud Katz	9e236592f9	[APFloat] Fix out of scope usage of a pointer to local variable	2020-01-07 11:24:18 +02:00
serge-sans-paille	756cc832bb	Fix compiler extension example cmake integration - Do not add it to the Export file - Update install target Differential Revision: https://reviews.llvm.org/D72255	2020-01-07 09:27:08 +01:00
Ehud Katz	094f071eba	[APFloat] Fix fusedMultiplyAdd when `this` equals to `Addend` Up until now, the arguments to `fusedMultiplyAdd` are passed by reference. We must save the `Addend` value on the beginning of the function, before we modify `this`, as they may be the same reference. To fix this, we now pass the `addend` parameter of `multiplySignificand` by value (instead of by-ref), and have a default value of zero. Fix PR44051. Differential Revision: https://reviews.llvm.org/D70422	2020-01-07 08:45:18 +02:00
Juneyoung Lee	95972f8c89	Let PassBuilder Expose PassInstrumentationCallbacks Summary: This is an effort to allowing external libraries register their own pass instrumentation during their llvmGetPassPluginInfo() calls. By exposing this through the added getPIC(), now a pass writer can do something like this: ``` extern "C" ::llvm::PassPluginLibraryInfo LLVM_ATTRIBUTE_WEAK llvmGetPassPluginInfo() { return { .., [](llvm::PassBuilder &PB) { PB.getPIC()->registerAfterPassCallback(move(f)); } }; } ``` Reviewers: chandlerc, philip.pfaffe, fedor.sergeev Reviewed By: fedor.sergeev Subscribers: llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71086	2020-01-07 14:10:37 +09:00
Fangrui Song	0523ceb321	[MC] Add parameter `Address` to MCInstrPrinter::printInstruction Follow-up of D72172. Reviewed By: jhenderson, rnk Differential Revision: https://reviews.llvm.org/D72180	2020-01-06 20:44:14 -08:00
Fangrui Song	00f5c66666	[MC] Add parameter `Address` to MCInstPrinter::printInst printInst prints a branch/call instruction as `b offset` (there are many variants on various targets) instead of `b address`. It is a convention to use address instead of offset in most external symbolizers/disassemblers. This difference makes `llvm-objdump -d` output unsatisfactory. Add `uint64_t Address` to printInst(), so that it can pass the argument to printInstruction(). `raw_ostream &OS` is moved to the last to be consistent with other print* methods. The next step is to pass `Address` to printInstruction() (generated by tablegen from the instruction set description). We can gradually migrate targets to print addresses instead of offsets. In any case, downstream projects which don't know `Address` can pass 0 as the argument. Reviewed By: jhenderson Differential Revision: https://reviews.llvm.org/D72172	2020-01-06 20:42:22 -08:00
Matt Arsenault	1d11ecf97b	AMDGPU/GlobalISel: Fix unused variable warning in release	2020-01-06 22:31:33 -05:00
QingShan Zhang	480b8252a0	[NFC][Test] Add a test to verify the DAGCombine of fma	2020-01-07 03:13:39 +00:00
Matt Arsenault	7e93ac83b8	AMDGPU: Add run line to int_to_fp tests This wasn't catching a regression on targets with legal i16 triggered in a future commit.	2020-01-06 21:38:50 -05:00
Matt Arsenault	af8a75766e	AMDGPU: Select llvm.amdgcn.interp.p2.f16 directly This will enable automatic GlobalISel support in a future commit.	2020-01-06 20:34:21 -05:00
Matt Arsenault	fa52fc3593	AMDGPU: Use default operands for clamp/omod We have a lot of complex pattern variants that just set the source modifiers that are really handled, and then set the output modifiers to 0. We're unlikely to ever match output modifiers from the use instruction side, and we already match clamp/omod in a separate pass.	2020-01-06 20:22:13 -05:00
Heejin Ahn	ec0f06a33e	[WebAssembly] Fix landingpad-only case in Emscripten EH Summary: Previously we didn't set `Changed` to true when there are only landing pads but not invokes. This fixes it and we set `Changed` to true whenever we have landing pads. (There can't be invokes without landing pads, so that case is covered too) The test case for this has to be a separate file because this pass is a `ModulePass` and `Changed` is computed based on the whole module. Reviewers: tlively Subscribers: dschuff, sbc100, jgravelle-google, hiraditya, sunfish, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72308	2020-01-06 17:02:32 -08:00
Matt Arsenault	85ec6f9561	AMDGPU/GlobalISel: Legalize G_READCYCLECOUNTER	2020-01-06 19:16:32 -05:00
Mark de Wever	0c578f69c5	[NFC] Fixes -Wrange-loop-analysis warnings This avoids new warnings due to D68912 adds -Wrange-loop-analysis to -Wall.	2020-01-07 00:51:41 +01:00
Fangrui Song	ce9eae4112	Add Triple::isX86() Reviewed By: craig.topper, skan Differential Revision: https://reviews.llvm.org/D72247	2020-01-06 15:51:02 -08:00
Matt Arsenault	7fdc97cfea	AMDGPU/GlobalISel: Select G_UADDE/G_USUBE	2020-01-06 18:27:52 -05:00
Matt Arsenault	1a53e6403d	AMDGPU/GlobalISel: Replace handling of boolean values This solves selection failures with generated selection patterns, which would fail due to inferring the SGPR reg bank for virtual registers with a set register class instead of VCC bank. Use instruction selection would constrain the virtual register to a specific class, so when the def was selected later the bank no longer was set to VCC. Remove the SCC reg bank. SCC isn't directly addressable, so it requires copying from SCC to an allocatable 32-bit register during selection, so these might as well be treated as 32-bit SGPR values. Now any scalar boolean value that will produce an outupt in SCC should be widened during RegBankSelect to s32. Any s1 value should be a vector boolean during selection. This makes the vcc register bank unambiguous with a normal SGPR during selection. Summary of how this should now work: - G_TRUNC is always a no-op, and never should use a vcc bank result. - SALU boolean operations should be promoted to s32 in RegBankSelect apply mapping - An s1 value means vcc bank at selection. The exception is for legalization artifacts that use s1, which are never VCC. All other contexts should infer the VCC register classes for s1 typed registers. The LLT for the register is now needed to infer the correct register class. Extensions with vcc sources should be legalized to a select of constants during RegBankSelect. - Copy from non-vcc to vcc ensures high bits of the input value are cleared during selection. - SALU boolean inputs should ensure the inputs are 0/1. This includes select, conditional branches, and carry-ins. There are a few somewhat dirty details. One is that G_TRUNC/G_*EXT selection ignores the usual register-bank from register class functions, and can't handle truncates with VCC result banks. I think this is OK, since the artifacts are specially treated anyway. This does require some care to avoid producing cases with vcc. There will also be no 100% reliable way to verify this rule is followed in selection in case of register classes, and violations manifests themselves as invalid copy instructions much later. Standard phi handling also only considers the bank of the result register, and doesn't insert copies to make the source banks match. This doesn't work for vcc, so we have to manually correct phi inputs in this case. We should add a verifier check to make sure there are no phis with mixed vcc and non-vcc register bank inputs. There's also some duplication with the LegalizerHelper, and some code which should live in the helper. I don't see a good way to share special knowledge about what types to use for intermediate operations depending on the bank for example. Using the helper to replace extensions with selects also seems somewhat awkward to me. Another issue is there are some contexts calling getRegBankFromRegClass that apparently don't have the LLT type for the register, but I haven't yet run into a real issue from this. This also introduces new unnecessary instructions in most cases, since we don't yet try to optimize out the zext when the source is known to come from a compare.	2020-01-06 18:26:42 -05:00
Matt Arsenault	8defb1c20e	TableGen/GlobalISel: Handle default operands that are used Copy the logic from the existing handling in the DAG matcher emittter. This will enable some AMDGPU pattern cleanups without breaking GlobalISel tests, and eventually handle importing more patterns. The test is a bit annoying since the sections seem to randomly sort themselves if anything else is added in the future.	2020-01-06 18:26:42 -05:00
Matt Arsenault	0b40741b65	GlobalISel: Implement lower for G_INTRINSIC_ROUND Mostly copied from AMDGPU lowering implementation, except used G_SITOFP instead of directly creating a select on -1.0, 0.0.	2020-01-06 18:26:42 -05:00
Philip Reames	78e60220db	[X86] Move an enum definition into a header to simplify future patches [NFC]	2020-01-06 15:14:42 -08:00
Petr Hosek	2a966f8371	[CMake] Pass symlink dependency to add_llvm_install_targets explicitly The install-${name}-stripped targets don't strip when ${name} is being symlinked, e.g. llvm-ar or llvm-objcopy. The problem is that llvm_install_symlink passes install-${dest} as a dependency of install-${name}, e.g. install-llvm-ar becomes a dependency of both install-llvm-ranlib and install-llvm-ranlib-stripped. What this means is that when installing a distribution that contains both llvm-ar and llvm-ranlib is that first the stripped version of llvm-ar is installed (by the install-llvm-ar-stripped target) and then it's overwritten by an unstripped version of llvm-ar bnecause install-llvm-ranlib-stripped has install-llvm-ranlib as a dependency as mentioned earlier. To avoid this issue, rather than passing the install-${dest} as dependency, we introduce a new argument to add_llvm_install_targets for symlink target which expands it into an appropriate dependency, i.e. install-${dest} for install-${name} target and install-${dest}-stripped for install-${name}-stripped. Differential Revision: https://reviews.llvm.org/D71951	2020-01-06 14:51:32 -08:00
Bill Wendling	42ec33434a	Don't rely on 'l'(ell) modifiers to indicate a label reference Summary: It's not necessary to use an 'l'(ell) modifier when referencing a label. Treat block addresses and MBB references as if the modifier is used anyway. This prevents us from generating references to ficticious labels. Reviewers: jyknight, nickdesaulniers, hfinkel Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D71849	2020-01-06 14:44:03 -08:00
Thomas Preud'homme	e03f28d2ef	[FileCheck] Remove FileCheck prefix in API Summary: When FileCheck was made a library, types in the public API were renamed to add a FileCheck prefix, such as Pattern to FileCheckPattern. Many types were moved into a private interface and thus don't need this prefix anymore. This commit removes those unneeded prefixes. Reviewers: jhenderson, jdenny, probinson, grimar, arichardson, rnk Reviewed By: jhenderson Subscribers: hiraditya, llvm-commits Tags: #llvm Differential Revision: https://reviews.llvm.org/D72186	2020-01-06 22:28:23 +00:00
Jinsong Ji	687a7c63f2	[PowerPC][NFC] Rename record instructions to use _rec suffix instead of o We use o suffix to indicate record form instuctions, (as it is similar to dot '.' in mne?) This was fine before, as we did not support XO-form. However, with https://reviews.llvm.org/D66902, we now have XO-form support. It becomes confusing now to still use 'o' for record form, and it is weird to have something like 'Oo' . This patch rename all 'o' instructions to use '_rec' instead. Also rename `isDot` to `isRecordForm`. Reviewed By: #powerpc, hfinkel, nemanjai, steven.zhang, lkail Differential Revision: https://reviews.llvm.org/D70758	2020-01-06 22:27:07 +00:00
Matt Arsenault	450f31a6e3	GlobalISel: Fix unsupported legalize action This would complain about invalid legalizer rules otherwise. Mark some operations as unsupported for AMDGPU. This currently seems to produce the same legalize error as when no rules are defined, but eventually this should produce a proper user facing error.	2020-01-06 17:21:51 -05:00
Matt Arsenault	08da10f080	GlobalISel: Correct result type for G_FCMP in lowerFPTOUI Using the final result type doesn't make any sense. Use the natural default boolean type for the select condition.	2020-01-06 17:21:51 -05:00
Matt Arsenault	546a3ee122	GlobalISel: Start adding computeNumSignBits to GISelKnownBits	2020-01-06 17:21:51 -05:00
Matt Arsenault	5365e52310	AMDGPU: Fix legalizing f16 fpow The existing test only covered one case for r600. The use of mul_legacy also looks suspicious to me, but leave it for now. The patterns are also not making use of source modifiers.	2020-01-06 17:21:51 -05:00
Matt Arsenault	d483a5285e	AMDGPU: Use ImmLeaf This solves one GlobalISel importer error, but the pattern still fails for another reason.	2020-01-06 17:21:51 -05:00
Matt Arsenault	e299148752	AMDGPU: Use ImmLeaf for inline immediate predicates	2020-01-06 17:21:51 -05:00
Matt Arsenault	51e4379d7b	llc/MIR: Fix setFunctionAttributes for MIR functions A random set of attributes are implemented by llc/opt forcing the string attributes on the IR functions before processing anything. This would not happen for MIR functions, which have not yet been created at this point. Use a callback in the MIR parser, purely to avoid dealing with the ugliness that the command line flags are in a .inc file, and would require allowing access to these flags from multiple places (either from the MIR parser directly, or a new utility pass to implement these flags). It would probably be better to cleanup the flag handling into a separate library. This is in preparation for treating more command line flags with a corresponding function attribute in a more uniform way. The fast math flags in particular have a messy system where the command line flag sets the behavior from a function attribute if present, and otherwise the command line flag. This means if any other pass tries to inspect the function attributes directly, it will be inconsistent with the intended behavior. This is also inconsistent with the current behavior of -mcpu and -mattr, which overwrites any pre-existing function attributes. I would like to move this to consistenly have the command line flags not overwrite any pre-existing attributes, and to always ensure the command line flags are consistent with the function attributes.	2020-01-06 17:21:51 -05:00

... 3 4 5 6 7 ...

189892 Commits