llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 12:12:47 +01:00

Author	SHA1	Message	Date
Jorge Gorbe Moya	ac434d7750	Fix crash after looking up dwo_id=0 in CU index. In the current state, if getFromHash(0) is called and there's no CU with dwo_id=0, the lookup will stop at an empty slot, then the check `Rows[H].getSignature() != S` won't cause the lookup to fail and return a nullptr (as it should), because the empty slot has a 0 in the signature field, and a pointer to the empty slot will be incorrectly returned. This patch fixes this by using the index field in the hash entry to check for empty slots: signature = 0 can match a valid hash but according to the spec the index for an occupied slot will always be non-zero. Differential Revision: https://reviews.llvm.org/D91670	2020-11-19 11:15:01 -08:00
Fraser Cormack	9b94da0a79	[RISCV] Add test cases for missed grevi/greviw opportunities. NFC	2020-11-19 18:43:39 +00:00
Leonard Chan	c24d9d2b01	[llvm][IR] Add dso_local_equivalent Constant The `dso_local_equivalent` constant is a wrapper for functions that represents a value which is functionally equivalent to the global passed to this. That is, if this accepts a function, calling this constant should have the same effects as calling the function directly. This could be a direct reference to the function, the `@plt` modifier on X86/AArch64, a thunk, or anything that's equivalent to the resolved function as a call target. When lowered, the returned address must have a constant offset at link time from some other symbol defined within the same binary. The address of this value is also insignificant. The name is leveraged from `dso_local` where use of a function or variable is resolved to a symbol in the same linkage unit. In this patch: - Addition of `dso_local_equivalent` and handling it - Update Constant::needsRelocation() to strip constant inbound GEPs and take advantage of `dso_local_equivalent` for relative references This is useful for the [Relative VTables C++ ABI](https://reviews.llvm.org/D72959) which makes vtables readonly. This works by replacing the dynamic relocations for function pointers in them with static relocations that represent the offset between the vtable and virtual functions. If a function is externally defined, `dso_local_equivalent` can be used as a generic wrapper for the function to still allow for this static offset calculation to be done. See [RFC](http://lists.llvm.org/pipermail/llvm-dev/2020-August/144469.html) for more details. Differential Revision: https://reviews.llvm.org/D77248	2020-11-19 10:26:17 -08:00
Fraser Cormack	82b4130087	[RISCV] Lower GREVI and GORCI as custom nodes This moves the recognition of GREVI and GORCI from TableGen patterns into a DAGCombine. This is done primarily to match "deeper" patterns in the future, like (grevi (grevi x, 1) 2) -> (grevi x, 3). TableGen is not best suited to matching patterns such as these as the compile time of the DAG matchers quickly gets out of hand due to the expansion of commutative permutations. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D91259	2020-11-19 18:11:42 +00:00
Adhemerval Zanella	3f1ee6c0a7	[AArch64] Lower fptrunc/fpext from/to FP128t to/from FP16 The compiler-rt part which adds the emitted symbols is handled in a subsequent patch. Differential Revision: https://reviews.llvm.org/D91731	2020-11-19 15:14:50 -03:00
Sander de Smalen	8270667b76	[LoopVectorize] NFC: Fix unused variable warning for MaxSafeDepDist rGf571fe6df585127d8b045f8e8f5b4e59da9bbb73 led to a warning of an unused variable for MaxSafeDepDist (written but not used). It seems this variable and assignment can be safely removed.	2020-11-19 17:41:35 +00:00
Sam Tebbs	e397da3290	[ARM][LowOverheadLoops] Convert intermediate vpr use assertion to condition This converts the intermediate VPR use assertion to a condition in the if-statement to protect against assertion failures in case behaviour is changed. This is a follow-up to https://reviews.llvm.org/D90935 and implements the post-approval comments. Reviewed By: dmgreen Differential Revision: https://reviews.llvm.org/D91790	2020-11-19 17:15:45 +00:00
Joseph Huber	c0b5fdf2f0	[OpenMP] Add Location Fields to Libomptarget Runtime for Debugging Summary: Add support for passing source locations to libomptarget runtime functions using the ident_t struct present in the rest of the libomp API. This will allow the runtime system to give much more insightful error messages and debugging values. Reviewers: jdoerfert grokos Differential Revision: https://reviews.llvm.org/D87946	2020-11-19 12:01:53 -05:00
Roman Lebedev	1a9500be6d	[NFC][PhaseOrdering] Add a test showing the need to run IndVars after LoopIdiom	2020-11-19 19:43:33 +03:00
Mircea Trofin	e1027a640d	[FileCheck] Disallow unused prefixes in llvm/test/Analysis This is achieved through a substitution of FileCheck in lit.cfg.py, where we explicitly set -allow-unused-prefixes to false. We also introduce a %FileCheckWithUnusedPrefixes% substitution that can be used in those cases where we want to allow unused prefixes, even if the folder policy is to disallow them. Differential Revision: https://reviews.llvm.org/D91275	2020-11-19 07:56:35 -08:00
diggerlin	b63aeb246f	[AIX][XCOFF][Patch2] decode vector information and extent long table of the traceback table of the xcoff. SUMMARY: 1. decode the Vector extension if has_vec is set 2. decode long table fields, if longtbtable is set. There is a conflict on the bit order of HasVectorInfoMask and HasExtensionTableMask between the AIX OS header and the IBM AIX compiler XLC. /usr/include/sys/debug.h defines static constexpr uint32_t HasVectorInfoMask = 0x0040'0000; static constexpr uint32_t HasExtensionTableMask = 0x0080'0000; but XLC defines them as static constexpr uint32_t HasVectorInfoMask = 0x0080'0000; static constexpr uint32_t HasExtensionTableMask = 0x0040'0000; we follow the definition of the IBM AIX compiler XLC here. Reviewer: Jason Liu Differential Revision: https://reviews.llvm.org/D86461	2020-11-19 10:23:43 -05:00
David Green	954eb80175	[ARM] Add a WLS tail predication test. NFC	2020-11-19 14:52:46 +00:00
Sven van Haastregt	a30a7b4581	Fix Wundef warnings for Support/Compiler.h Support/Compiler.h is included by c files (e.g. regcomp.c) where __cplusplus is not defined at all. Avoid evaluating the undefined macro for such files.	2020-11-19 14:10:55 +00:00
Florian Hahn	78dba29355	[ConstraintElimination] Add GEP test case with variable offset.	2020-11-19 14:06:50 +00:00
Simon Pilgrim	56198bfdfa	[RISCV] Extend 32-bit test coverage of neg-abs tests for D91120	2020-11-19 13:59:18 +00:00
Simon Pilgrim	877bcd2557	[ValueTracking] computeKnownBitsFromShiftOperator - move shift amount analysis to top of the function. NFCI. These are all lightweight to compute and helps avoid issues with Known being used to hold both the shift amount and then the shifted result. Minor cleanup for D90479.	2020-11-19 13:50:49 +00:00
David Green	a8dd5e49ed	[ARM] Deliberately prevent inline asm in low overhead loops. NFC This was already something that was handled by one of the "else" branches in maybeLoweredToCall, so this patch is an NFC but makes it explicit and adds a test. We may in the future want to support this under certain situations but for the moment just don't try and create low overhead loops with inline asm in them. Differential Revision: https://reviews.llvm.org/D91257	2020-11-19 13:28:21 +00:00
Simon Pilgrim	1bd2f183ce	[X86][AVX] Only share broadcasts of different widths from the same SDValue of the same SDNode (PR48215) D57663 allowed us to reuse broadcasts of the same scalar value by extracting low subvectors from the widest type. Unfortunately we weren't ensuring the broadcasts were from the same SDValue, just the same SDNode - which failed on multiple-value nodes like ISD::SDIVREM FYI: I intend to request this be merged into the 11.x release branch. Differential Revision: https://reviews.llvm.org/D91709	2020-11-19 12:15:18 +00:00
Simon Moll	fd4dd67127	[LV][NFC-ish] Allow vector widths over 256 elements The assertion that vector widths are <= 256 elements was hard wired in the LV code. E.g., VE allows for vectors up to 512 elements. Test against the TTI vector register bit width instead - this is an NFC for non-asserting builds. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D91518	2020-11-19 10:58:29 +01:00
Florian Hahn	9328b6a7fd	[SelDAGBuilder] Do not require simple VTs for constraints. In some cases, the values passed to `asm sideeffect` calls cannot be mapped directly to simple MVTs. Currently, we crash in the backend if that happens. An example can be found in the @test_vector_too_large_r_m test case, where we pass <9 x float> vectors. In practice, this can happen in cases like the simple C example below. using vec = float __attribute__((ext_vector_type(9))); void f1 (vec m) { asm volatile("" : "+r,m"(m) : : "memory"); } One case that uses "+r,m" constraints for arbitrary data types in practice is google-benchmark's DoNotOptimize. This patch updates visitInlineAsm so that it uses MVT::Other for constraints with complex VTs. It looks like the rest of the backend correctly deals with that and properly legalizes the type. And we still report an error if there are no registers to satisfy the constraint. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D91710	2020-11-19 09:31:54 +00:00
Max Kazantsev	b6c52319b7	[NFC] Remove comment (committed ahead of time by mistake)	2020-11-19 16:28:34 +07:00
Max Kazantsev	2e3d07d447	[NFC] Move code earlier as preparation for further changes	2020-11-19 16:27:23 +07:00
Simon Moll	8a479552ad	[VE] VEC_BROADCAST, lowering and isel This defines the vec_broadcast SDNode along with lowering and isel code. We also remove unused type mappings for the vector register classes (all vector MVTs that are not used in the ISA go). We will implement support for short vectors later by intercepting nodes with illegal vector EVTs before LLVM has had a chance to widen them. Reviewed By: kaz7 Differential Revision: https://reviews.llvm.org/D91646	2020-11-19 09:44:56 +01:00
Sam Clegg	1e4ea2938d	[WebAssembly] Add support for named globals in the object format. Differential Revision: https://reviews.llvm.org/D91769	2020-11-19 00:17:22 -08:00
Andrew Wei	d024603f9e	[IndVarSimplify] Notify top most loop to drop cached exit counts Some nested loops may share the same ExitingBB, so after we finish FoldExit, we need to notify OuterLoop and SCEV to drop any stored trip count. Patched by: guopeilin Reviewed By: mkazantsev Differential Revision: https://reviews.llvm.org/D91325	2020-11-19 15:37:54 +08:00
Mircea Trofin	818fa7a683	[NFC][TFUtils] also include output specs lookup logic in loadOutputSpecs The lookup logic is also reusable. Also refactored the API to return the loaded vector - this makes it more clear what state it is in in the case of error (as it won't be returned). Differential Revision: https://reviews.llvm.org/D91759	2020-11-18 21:20:21 -08:00
Kazu Hirata	0fce5c3fd1	[Transforms] Use llvm::is_contained (NFC)	2020-11-18 20:42:22 -08:00
Mircea Trofin	504ced25f2	[NFC][TFUtils] Extract out the output spec loader It's generic for the 'development mode', not specific to the inliner case. Differential Revision: https://reviews.llvm.org/D91751	2020-11-18 20:03:20 -08:00
Craig Topper	4038ba0242	[RISCV] Add MemOperand to the instruction created by storeRegToStackSlot/loadRegFromStackSlot Differential Revision: https://reviews.llvm.org/D91730	2020-11-18 19:20:03 -08:00
Kai Luo	213dbf28b4	[X86][AArch64][RISCV] Pre-commit negated abs test case. NFC.	2020-11-19 02:31:45 +00:00
Duncan P. N. Exon Smith	c727b8693b	Support: Avoid SmallVector::assign with a range from to-be-replaced vector in Windows GetExecutableName This code wasn't valid, and 5abf76fbe37380874a88cc9aa02164800e4e10f3 started asserting. This is a speculative fix since I don't have a Windows machine handy.	2020-11-18 17:55:49 -08:00
Duncan P. N. Exon Smith	b6b630c8ab	ADT: Add assertions to SmallVector::insert, etc., for reference invalidation 2c196bbc6bd897b3dcc1d87a3baac28e1e88df41 asserted that `SmallVector::push_back` doesn't invalidate the parameter when it needs to grow. Do the same for `resize`, `append`, `assign`, `insert`, and `emplace_back`. Differential Revision: https://reviews.llvm.org/D91744	2020-11-18 17:36:28 -08:00
snek	a27ae8bd58	[WebAssembly] Support fp reg class in r constraint Patch by snek Reviewed By: aheejin Differential Revision: https://reviews.llvm.org/D90978	2020-11-18 17:05:58 -08:00
Moritz Sichert	0fadcfd431	Added GDB pretty printer for StringMap Reviewed By: csigg, dblaikie Differential Revision: https://reviews.llvm.org/D91183	2020-11-18 16:33:34 -08:00
Arthur Eubanks	dd58cefb43	[NPM] Remove -enable-npm-optnone flag It has been on by default for a couple months without complaint. Reviewed By: asbirlea Differential Revision: https://reviews.llvm.org/D91743	2020-11-18 15:49:16 -08:00
Scott Linder	a62e1e8765	[YAMLIO] Support non-null-terminated inputs In some places the parser guards against dereferencing `End`, while in others it relies on the presence of a trailing `'\0'` to elide checks. Add the remaining guards needed to ensure the parser never attempts to dereference `End`, making it safe to not require a null-terminated input buffer. Update the parser fuzzer harness so that it tests with buffers that are guaranteed to be non-null-terminated, null-terminated, and 1-terminated, additionally ensuring the result of the parse is the same in each case. Some of the regression tests were written by inspection, and some are cases caught by the fuzzer which required additional fixes in the parser. Differential Revision: https://reviews.llvm.org/D84050	2020-11-18 23:06:03 +00:00
Scott Linder	4173cf968c	[YAMLIO] Add a generic YAML fuzzer harness This is essentially a clone of the existing fuzzer added in D50839, but for the whole parser Streamer, and currently only testing for sanitizer violations. Differential Revision: https://reviews.llvm.org/D91573	2020-11-18 23:06:03 +00:00
Kazushi (Jam) Marukawa	c8d618c3ce	[VE] Add vmv intrinsic instructions Add vmv intrinsic instructions and regression tests. Reviewed By: simoll Differential Revision: https://reviews.llvm.org/D91700	2020-11-19 08:05:35 +09:00
Nico Weber	a63dbe29d6	[gn build] (manually) merge f0785c1f7ac	2020-11-18 17:17:01 -05:00
Hsiangkai Wang	cb69018868	[RISCV] Use register class VR for V instruction operands directly. @tangxingxin1008 found a bug where `vadd.vv v1, v3, a0` is regarded as a valid V instruction. We should remove the VRegAsmOperand operand class and use the VR register class directly. Patched by: tangxingxin1008, Hsiangkai Differential Revision: https://reviews.llvm.org/D91712	2020-11-19 05:59:46 +08:00
Fangrui Song	0b1b3b7f09	MCExpr::evaluateAsRelocatableImpl : allow evaluation of non-VK_None MCSymbolRefExpr when MCAsmLayout is available https://sourceware.org/git/gitweb.cgi?p=binutils-gdb.git;h=4acf8c78e659833be8be047ba2f8561386a11d4b (1994) introduced this behavior: if a fixup symbol is equated to an expression with an undefined symbol, convert the fixup to be against the target symbol. glibc relies on this behavior to perform assembly level indirection ``` asm("memcpy = __GI_memcpy"); // from sysdeps/generic/symbol-hacks.h ... // call memcpy@PLT // The relocation references __GI_memcpy in GNU as, but memcpy in MC (without the patch) memcpy (...); ``` (1) It complements `extern __typeof(memcpy) memcpy asm("__GI_memcpy");` The frontend asm label does not redirect synthesized memcpy in the middle-end. (See D88712 for details) (2) `asm("memcpy = __GI_memcpy");` is in every translation unit, but the memcpy declaration may not be visible in the translation unit where memcpy is synthesized. MC already redirects `memcpy = __GI_memcpy; call memcpy` but not `memcpy = __GI_memcpy; call memcpy@plt`. This patch fixes the latter by allowing MCExpr::evaluateAsRelocatableImpl to evaluate a non-VK_None MCSymbolRefExpr, which is only done after the layout is available. GNU as allows `memcpy = __GI_memcpy+1; call memcpy@PLT` which seems nonsensical, so we don't allow it. `MC/PowerPC/pr38945.s` `NUMBER = 0x6ffffff9; cmpwi 8,NUMBER@l` requires the `symbol@l` form in AsmMatcher, so evaluation needs to be deferred. This is the place where future simplification may be possible. Note, if we suppress the VK_None evaluation when MCAsmLayout is nullptr, we may lose the `invalid reassignment of non-absolute variable` diagnostic (`ARM/thumb_set-diagnostics.s` and `MC/AsmParser/variables-invalid.s`). We know that this diagnostic is troublesome in some cases (https://github.com/ClangBuiltLinux/linux/issues/1008), so we can consider making simplification in the future. Reviewed By: jyknight Differential Revision: https://reviews.llvm.org/D88625	2020-11-18 13:52:33 -08:00
Arthur Eubanks	d33f9517fa	[NPM] Add implicit basic-aa before other AA This matches the legacy AA infra and fixes llvm/test/Analysis/TypeBasedAliasAnalysis/precedence.ll under NPM.	2020-11-18 13:48:24 -08:00
Jamie Schmeiser	91459b7237	Revert "Revert "Revert "Expand existing loopsink testing to also test loopsinking using new pass manager and fix LICM bug.""" This reverts commit e29292969b92aa15afba734d4f6863fc405f087c. This apparently causes a regression in compile time (ie, it slows down).	2020-11-18 16:07:16 -05:00
Baptiste Saleil	fdfa32d644	[PowerPC] Add peephole to remove redundant accumulator prime/unprime instructions In some situations, the compiler may insert an accumulator prime instruction and an accumulator unprime instruction with no use of that accumulator between the two. That's for example the case when we store an accumulator after assembling it or restoring it. This patch adds a peephole to remove these prime and unprime instructions. Differential Revision: https://reviews.llvm.org/D91386	2020-11-18 15:01:07 -06:00
Roman Lebedev	78115bb465	[NFC][Reassociate] Delay checking isLoadCombineCandidate() until after ShouldConvertOrWithNoCommonBitsToAdd() but before haveNoCommonBitsSet() This appears to improve -O3 compile-time performance somewhat: https://llvm-compile-time-tracker.com/compare.php?from=87369c626114ae17f4c637635c119e6de0856a9a&to=c04b8271e1609b0dfb20609b40844b0c4324517e&stat=instructions It doesn't look like delaying it until after haveNoCommonBitsSet() is better: https://llvm-compile-time-tracker.com/compare.php?from=c04b8271e1609b0dfb20609b40844b0c4324517e&to=b2943d450eaf41b5f76d2dc7350f0a279f64cd99&stat=instructions	2020-11-18 23:57:12 +03:00
Scott Linder	23061187c4	[NFC][AMDGPU] Remove some generic pointers in memory-legalizer tests These tests implicitly depend on the target supporting generic pointers, so to prepare for testing them on GFX6 (which lacks FLAT) remove the dependency where possible. Reviewed By: rampitec Differential Revision: https://reviews.llvm.org/D91666	2020-11-18 20:52:18 +00:00
Nikita Popov	86fecd5fbf	[BasicAA] Generalize base offset modulus handling The GEP aliasing implementation currently has two pieces of code that solve two different subsets of the same basic problem: If you have GEPs with offsets 4x + 0 and 4y + 1 (assuming access size 1), then they do not alias regardless of whether x and y are the same. One implementation is in aliasSameBasePointerGEPs(), which looks at this in a limited structural way. It requires both GEP base pointers to be exactly the same, then (optionally) a number of equal indexes, then an unknown index, then a non-equal index into a struct. This set of limitations works, but it's overly restrictive and hides the core property we're trying to exploit. The second implementation is part of aliasGEP() itself and tries to find a common modulus in the scales, so it can then check that the constant offset doesn't overlap under modular arithmetic. The second implementation has the right idea of what the general problem is, but effectively only considers power of two factors in the scales (while aliasSameBasePointerGEPs also works with non-pow2 struct sizes.) What this patch does is to adjust the aliasGEP() implementation to instead find the largest common factor in all the scales (i.e. the GCD) and use that as the modulus. Differential Revision: https://reviews.llvm.org/D91027	2020-11-18 21:48:49 +01:00
Arthur Eubanks	cae85631c4	[test] Fix eliminate-callsite-inline.ll under NPM The NPM inliner does not remove arbitrary unused internal functions, and that is not the point of this test.	2020-11-18 12:39:39 -08:00
Arthur Eubanks	90c9229761	[test] Fix remaining GVN tests under NPM Some used analyses are optional but were relied upon in tests, the legacy PM runs them in a different order than the NPM.	2020-11-18 12:35:32 -08:00
Jamie Schmeiser	c75ab9daaf	Revert "Revert "Expand existing loopsink testing to also test loopsinking using new pass manager and fix LICM bug."" This reverts commit 562addba652e8bdabe49f9123fd92c21b7a0d640. Reverted change too quickly, the failing test cases passed on the next build. So reverting revert (to include the changes).	2020-11-18 15:33:02 -05:00
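
Note on ac434d7750 (the dwo_id=0 lookup fix): the message describes an open-addressed hash lookup where an empty slot and a real entry can both carry signature 0, so emptiness has to be decided by the index field instead. The following is only a minimal sketch of that idea, not the LLVM code. `Slot` and `lookup` are hypothetical names, and the probe sequence follows my reading of the DWARF package index description. It assumes the table size is a power of two and that at least one slot is empty.

```cpp
#include <cassert>
#include <cstdint>
#include <vector>

// Hypothetical slot layout: Signature is 0 in an empty slot, but 0 may also
// be a valid dwo_id. Index is 0 only when the slot is empty.
struct Slot {
  uint64_t Signature = 0;
  uint32_t Index = 0;
};

// Returns the occupied slot whose signature equals S, or nullptr.
// Assumes Table.size() is a power of two and at least one slot is empty,
// otherwise the probe loop may not terminate for an absent key.
const Slot *lookup(const std::vector<Slot> &Table, uint64_t S) {
  assert(!Table.empty() && (Table.size() & (Table.size() - 1)) == 0);
  const uint64_t Mask = Table.size() - 1;
  uint64_t H = S & Mask;
  // Odd step, so the probe sequence visits every slot of the table.
  const uint64_t Step = ((S >> 32) & Mask) | 1;
  // Stop on an empty slot (Index == 0) or on a matching signature.
  while (Table[H].Index != 0 && Table[H].Signature != S)
    H = (H + Step) & Mask;
  // Decide emptiness by Index: comparing signatures alone would wrongly
  // accept an empty slot when S == 0.
  if (Table[H].Index == 0)
    return nullptr;
  return &Table[H];
}
```

With an all-empty table, `lookup(Table, 0)` returns `nullptr` here, whereas a check based only on `Signature != S` would return a pointer to the empty slot.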


207063 Commits
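
Note on 86fecd5fbf (BasicAA base offset modulus): the message states that GEP offsets 4x + 0 and 4y + 1 with access size 1 cannot alias, and that the patch uses the GCD of all scales as the modulus. The following is a rough sketch of that arithmetic only, under the assumption that one pointer equals the other plus a sum of scale-times-variable terms plus a constant offset. `gcd64` and `disjointModuloGCD` are hypothetical helpers, not LLVM APIs, and the real implementation handles many more cases.

```cpp
#include <cstdint>
#include <vector>

// Euclid's algorithm; gcd64(0, B) == B.
uint64_t gcd64(uint64_t A, uint64_t B) {
  while (B != 0) {
    uint64_t T = A % B;
    A = B;
    B = T;
  }
  return A;
}

// Models P1 = P2 + sum(Scales[i] * Var[i]) + Offset with unknown Var[i].
// Size1 and Size2 are the access sizes at P1 and P2 in bytes.
// Returns true only when the two accesses can never overlap.
bool disjointModuloGCD(const std::vector<int64_t> &Scales, int64_t Offset,
                       uint64_t Size1, uint64_t Size2) {
  uint64_t G = 0;
  for (int64_t S : Scales)
    G = gcd64(G, S < 0 ? 0 - static_cast<uint64_t>(S)
                       : static_cast<uint64_t>(S));
  // No variable part, or a modulus too large to reason about here:
  // be conservative and claim nothing.
  if (G == 0 || G > static_cast<uint64_t>(INT64_MAX))
    return false;
  // Reduce the constant offset to the range [0, G).
  int64_t M = Offset % static_cast<int64_t>(G);
  if (M < 0)
    M += static_cast<int64_t>(G);
  const uint64_t Mod = static_cast<uint64_t>(M);
  // P2 covers [0, Size2) and P1 covers [Mod, Mod + Size1), both modulo G.
  return Size2 <= Mod && Size1 <= G - Mod;
}
```

For the example from the commit message, `disjointModuloGCD({4, 4}, 1, 1, 1)` is true. Scales such as `{6, 9}` give a modulus of 3, which shows the non-power-of-two case the message mentions.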