llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-18 18:42:46 +02:00

Author	SHA1	Message	Date
Arthur Eubanks	1debc51397	[GVN] Properly invalidate ICF cache when we simplify a value This fixes a "Cached first special instruction is wrong!" assert. The assert fires because replacing a value with another can cause an instruction to no longer be "special" to ICF. In this case, devirtualization happened, turning an indirect call to a call to a willreturn function which is no longer special. Reviewed By: nikic, rnk Differential Revision: https://reviews.llvm.org/D99977	2021-04-08 14:01:57 -07:00
Thomas Preud'homme	7a43ae0368	[FileCheck, test] Rename checkWildcardRegexCharMatchFailure Proposed edit https://reviews.llvm.org/D97845#inline-922769 in D97845 suggests the checkWildcardRegexCharMatchFailure function name is misleading because it is not clear it checks for a match failure on each character in the string parameter. This commit renames it to an hopefully clearer name. Reviewed By: jdenny Differential Revision: https://reviews.llvm.org/D98343	2021-04-08 21:57:55 +01:00
Konstantin Zhuravlyov	ec8823f099	AMDGPU: Add gfx90c support to code object v2 for backwards compatibility Differential Revision: https://reviews.llvm.org/D100126	2021-04-08 16:42:43 -04:00
Stanislav Mekhanoshin	68533b388b	[AMDGPU] Check for all meta instrs in GCNRegBankReassign It used to work correctly even with a KILL, but there is no reason to consider meta instructions since they do not create real HW uses. Differential Revision: https://reviews.llvm.org/D100135	2021-04-08 13:41:10 -07:00
Nikita Popov	27fef01ae2	[LoopRotate] Don't split loop pass manager After D99249 we use three different loop pass managers for LICM, LoopRotate and LICM+LoopUnswitch. This happens because LazyBFI and LazyBPI are not preserved by LoopRotate (note that D74640 is no longer needed). Avoid this by marking them as preserved. My understanding of D86156 is that it is okay to simply preserve them (which LoopUnswitch already does for the same reason) and rely on callbacks to deal with deleted blocks. Differential Revision: https://reviews.llvm.org/D99843	2021-04-08 22:05:18 +02:00
Stanislav Mekhanoshin	f3dbd48063	[AMDGPU] Allow -amdgpu-unsafe-fp-atomics to ignore denorm mode Fixes: SWDEV-274276 Differential Revision: https://reviews.llvm.org/D100072	2021-04-08 12:46:36 -07:00
Wouter van Oortmerssen	cb0db55dcc	[WebAssembly] Fix for PIC external symbol ISEL wasm64 was missing DAG ISEL patterns for external symbol based global.get, but simply adding these analogous to the existing 32-bit versions doesn't work. This is because we are conflating the 32-bit global index with the pointer represented by the external symbol, which for wasm32 happened to work. The simplest fix is to pretend we have a 64-bit global index. This sounds incorrect, but is immaterial since once this index is stored as a MachineOperand it becomes 64-bit anyway (and has been all along). As such, the EmitInstrWithCustomInserter based implementation I experimented with become a no-op and no further changes in the C++ code are required. Differential Revision: https://reviews.llvm.org/D99904	2021-04-08 12:07:38 -07:00
Congzhe Cao	41b67178f7	[LoopInterchange] Fix transformation bugs in loop interchange After loop interchange, the (old) outer loop header should not jump to the `LoopExit`. Note that the old outer loop becomes the new inner loop after interchange. If we branched to `LoopExit` then after interchange we would jump directly from the (new) inner loop header to `LoopExit` without executing the rest of outer loop. This patch modifies adjustLoopBranches() such that the old outer loop header (which becomes the new inner loop header) jumps to the old inner loop latch which becomes the new outer loop latch after interchange. Reviewed By: bmahjour Differential Revision: https://reviews.llvm.org/D98475	2021-04-08 14:58:13 -04:00
Levy Hsu	c78dea0987	[RISCV] Add InstAlias for Zbb Zbp and Zbs extension Add InstAlias that allows the last operand to be an imm for following instructions: 1. Zbb or Zbp: - ror - rorw (RV64 Only) 2. Zbs - best - bclr - binv - bext Reviewed By: craig.topper, jrtc27 Differential Revision: https://reviews.llvm.org/D100083	2021-04-08 11:51:31 -07:00
Sanjay Patel	b9646d4353	[InstCombine] fold min/max intrinsic with negated operand to abs The smax case shows up in https://llvm.org/PR49885 . The others seem unlikely, but we might as well try for uniformity (although that could mean an extra instruction to create "nabs"). smax -- https://alive2.llvm.org/ce/z/8yYaGy smin -- https://alive2.llvm.org/ce/z/0_7zc_ umax -- https://alive2.llvm.org/ce/z/EcsZWs umin -- https://alive2.llvm.org/ce/z/Xw6WvB	2021-04-08 14:37:39 -04:00
Sanjay Patel	0ddf8ead81	[InstCombine] add tests for min/max with negated operand; NFC	2021-04-08 14:37:39 -04:00
Stanislav Mekhanoshin	3071fb3566	Set IgnoreLLVMUsed to false in CallGraph::addToCallGraph() clang++ uses llvm.compiler.used in certain cases to preserve symbol which is fully inlined. D96087 has resulted in undefined symbols in such cases. Set it to false by default to preserve old behavior but keep the option for specific uses where we want to ignore these (e.g. to detect a potential indirect call to a function). Differential Revision: https://reviews.llvm.org/D99897	2021-04-08 11:14:09 -07:00
Paul C. Anagnostopoulos	16f0e00ff4	Revert "[TableGen] Add support for the 'assert' statement in multiclasses" This reverts commit 3b9a15d910a8c748b1444333a4a3905a996528bc.	2021-04-08 13:58:58 -04:00
Stephen Tozer	577c530811	Revert "[DebugInfo] Correctly track SDNode dependencies for list debug values" Reverted due to failure on the sanitizer-x86_64-linux-fast bot. This reverts commit e10493eb5012a2c313471489646bde9595ea06c0.	2021-04-08 17:55:45 +01:00
Yuanfang Chen	2ff261f721	abtest.py: support bisection based on a response file Also makes LINK_TEST customizable from commandline with `--test` option.	2021-04-08 09:46:01 -07:00
Andrew Savonichev	9c918327f6	[MCA] Add tests for IPC on Cortex-A55 The tests compare IPC statistics that MCA provides with IPC values measured on Cortex-A55 hardware. For hardware tests, each snippet is run in a loop unrolled by 1000, and IPC is measured by linux-perf. Several tests do not match the hardware: the skewed ALU is not supported, LDR seem to be missing a forwarding path. Differential Revision: https://reviews.llvm.org/D98174	2021-04-08 19:37:07 +03:00
Stephen Tozer	24e848bf91	[DebugInfo] Correctly track SDNode dependencies for list debug values During SelectionDAG, we must track the SDNodes that each SDDbgValue depends on to compute its value. These are ultimately derived from the location operands to the SDDbgValue, but were stored in a separate vector prior to this patch. This resulted in cases where one of the lists was updated incorrectly, resulting in crashes during compilation. This patch fixes the issue by directly recomputing the dependency list from the SDDbgOperands in getDependencies(). Differential Revision: https://reviews.llvm.org/D99423	2021-04-08 17:01:45 +01:00
Jay Foad	1bd3ba1ff8	[AMDGPU] Add some implicit uses to tests. NFC. This is just to stop a future patch from optimizing away the things that we actually want to check for.	2021-04-08 16:37:48 +01:00
Jay Foad	7ed2ce6b42	[AMDGPU] SIFoldOperands: remove an unneeded isReg check. NFC.	2021-04-08 16:37:43 +01:00
Jay Foad	f43cbd4ee9	[AMDGPU] SIFoldOperands: make use of emplace_back. NFC.	2021-04-08 14:34:10 +01:00
Jay Foad	50867895fb	[AMDGPU] SIFoldOperands: remove an unneeded make_early_inc_range. NFC.	2021-04-08 14:32:36 +01:00
Jay Foad	337240fd51	[AMDGPU] SIFoldOperands: try harder to fold cndmask instructions Look through copies to find more cases where the two values being selected are identical. The motivation for this is just to be able to remove the weird special case where tryFoldCndMask was called from foldInstOperand, part way through folding a move-immediate into its users, without regressing any lit tests.	2021-04-08 14:26:12 +01:00
Florian Hahn	3be13814be	[LV] Pass VPWidenPHIRecipe to widenPHIInstruction (NFC). Instead of passing the start value and the defined value to widenPHIInstruction, pass the VPWidenPHIRecipe directly, which can be used to get both (and more in future patches).	2021-04-08 14:25:10 +01:00
Joseph Tremoulet	c47e6e9e06	Support: mapped_file_region: Pass MAP_NORESERVE to mmap This allows mapping larger files, delaying OOM failures until too many pages of them are accessed. This is makes the behavior of the mapped_file_region in this regard consistent between its "Unix" and "Windows" implementations. Guard the code witih #if defined(MAP_NORESERVE), consistent with other uses of MAP_NORESERVE in llvm-project, because some FreeBSD versions do not provide this flag. Reviewed By: clayborg Differential Revision: https://reviews.llvm.org/D96626	2021-04-08 09:07:25 -04:00
Jay Foad	0533f9d3ae	[AMDGPU] SIFoldOperands: make tryFoldCndMask a member function. NFC.	2021-04-08 14:05:29 +01:00
Sebastian Neubauer	e0e28835d1	[AMDGPU] Fix computing live registers in prolog ScratchExecCopy needs to be marked as live, we cannot use that register while EXEC is stored in there. Marking SGPRForFPSaveRestoreCopy and SGPRForBPSaveRestoreCopy as available is unnecessary, they should not be live at that point anway. Differential Revision: https://reviews.llvm.org/D100098	2021-04-08 14:52:50 +02:00
Sanjay Patel	4786c07012	[InstCombine] add icmp with no-wrap add tests; NFC Goes with D100095	2021-04-08 08:40:04 -04:00
Paul C. Anagnostopoulos	d8201f454e	[TableGen] Make behavior of list slice suffix consistent across all values Differential Revision: https://reviews.llvm.org/D99883	2021-04-08 08:38:44 -04:00
Paul C. Anagnostopoulos	282eb5170a	[TableGen] Add support for the 'assert' statement in multiclasses	2021-04-08 08:36:03 -04:00
David Sherwood	5def792b7d	[CodeGen][AArch64] Fix isel crash for truncating FP stores When attempting to truncate a FP vector and store the result out to memory we crashed because we had no pattern for truncating FP stores. In fact, we don't support these types of stores and the correct fix is to stop marking these truncating stores as legal. Tests have been added here: CodeGen/AArch64/sve-fptrunc-store.ll Differential Revision: https://reviews.llvm.org/D100025	2021-04-08 13:21:29 +01:00
Jay Foad	01d1da52ff	[AMDGPU] SIFoldOperands: refactor tryFoldCndMask with early-outs. NFC.	2021-04-08 13:16:07 +01:00
Stephen Tozer	0261173904	[DebugInfo] Prevent invalid debug info being produced during LoopStrengthReduce During LoopStrengthReduce, some of the SSA values that are used by debug values may be lost and/or salvaged. After LSR we attempt to recover any undef debug values, including any that were salvaged but then lost their values afterwards, by replacing the lost values with any live equal values (plus a possible constant offset) that have been gathered prior to running LSR. When we do this we restore the debug value's original DIExpression, to undo any salvaging (as we have gone back to using the original debug value). This process can currently produce invalid debug info if the number of operands has changed by salvaging during LSR. Replacing old values during the applyEqualValues step does not change the number of location operands, which means that when we restore the old DIExpression we may have a mismatch between the number of operands used by the debug value and the number of operands referenced by the DIExpression. This patch fixes this by restoring the full original location metadata at the start of the applyEqualValues step, so that there is no mismatch in operand count between the debug value and its DIExpression. Differential Revision: https://reviews.llvm.org/D98644	2021-04-08 13:04:48 +01:00
Roman Lebedev	b825875812	[NFC][X86][CostModel] Add some load/store tests w/ non-power-of-two elt cnt vectors For example the cost to load <48 x i16> should likely be 3, because that's just 3x load i256.	2021-04-08 15:00:28 +03:00
Mikael Holmen	dda9735107	[NVPTX] Fix compiler warning in NDEBUG build [NFC] Without the fix we get ../lib/Target/NVPTX/NVPTXLowerArgs.cpp:236:24: error: lambda capture 'Arg' is not used [-Werror,-Wunused-lambda-capture] auto IsALoadChain = [Arg](Value *Start) { ^~~ 1 error generated.	2021-04-08 13:21:21 +02:00
Serguei Katkov	efbc7a658c	[GreedyRA ORE] Add function level spill/reloads stats Reviewers: reames, MatzeB, anemet, thegameg Reviewed By: reames, thegameg Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D100014	2021-04-08 16:55:52 +07:00
David Green	154e9ac3d0	[LV] Logical and/or select costs D99674 stopped the folding of certain select operations into and/or, due to incorrect folding in the presence of poison. D97360 added some costs to attempt to account for the change, but only worked at the getUserCost level, not the getCmpSelInstrCost that the vectorizer will use directly. This adds similar logic into the vectorizer to handle these logical and/or selects, treating them like and/or directly. This fixes 60% performance regressions from code like the attached test case. Differential Revision: https://reviews.llvm.org/D99884	2021-04-08 10:39:47 +01:00
David Green	1c65cc8e30	[LV] Add a logical and/or select cost test. NFC	2021-04-08 10:27:06 +01:00
Fraser Cormack	cefb1ed047	[RISCV] Support OR/XOR/AND reductions on vector masks This patch adds RVV codegen support for OR/XOR/AND reductions for both scalable- and fixed-length vector types. There are a few possible codegen strategies for each -- vmfirst.m, vmsbf.m, and vmsif.m could be used to some extent -- but the vpopc.m instruction was chosen since it produces the scalar result in one instruction, after which scalar instructions can finish off the computation. The reductions are lowered identically for both scalable- and fixed-length vectors, although some alternate strategies may be more optimal on fixed-length vectors since it's cheaper to get the length of those types. Other reduction types were not deemed to be relevant for mask vectors. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D100030	2021-04-08 09:46:38 +01:00
Thomas Preud'homme	a168a0cb31	[AMDGPU, test] Fix use of undef FileCheck var Test CodeGen/AMDGPU/amdgpu.private-memory.ll and CodeGen/AMDGPU/private-memory-r600.ll have a block of CHECK directives whose prefix is inconsistent: R600-CHECK Vs R600. This leads to a R600-NOT directive using an undefined CHAN variable due to R600-CHECK directives never being considered by FileCheck. Fixing the prefix leads to the testcase failing. As per https://reviews.llvm.org/D99865#2675235 this commit removes the directives instead since it is not possible to write a reliable check. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D99865	2021-04-08 09:42:59 +01:00
LemonBoy	29ba9d08ef	[AsmParser] Recognize more escaped characters between single quotes The GNU AS manual states the following about single-character constants enclosed within single quotes: > Some backslash escapes apply to characters, \b, \f, \n, \r, \t, and \" with the same meaning as for strings, plus \' for a single quote. Add two more characters to the switch handling this case to match GAS behaviour, plus a test to make sure nothing regresses. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D99609	2021-04-08 09:59:37 +02:00
Serguei Katkov	c36162f4d9	[GreedyRA ORE] Extract computeNumberOfSplillsReloads to use in different places. NFC. Extract one basic block handling to introduce stat computation for method scope. Reviewers: reames, MatzeB, anemet, thegameg Reviewed By: reames, thegameg Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D100013	2021-04-08 14:40:45 +07:00
Serguei Katkov	075fe72e46	[GreedyRA ORE] Extract stats in RAGreedyStats struct. NFC. Combine all collected stats into separate struct RAGreedyStats with add and report methods. The motivation is to extend the number of statistics to capture and instead of adding new parameters, just combine all of them into one structure. Additionally I plan to use report from different places in future to report data for function as well. Reviewers: reames, MatzeB, anemet, thegameg Reviewed By: thegameg Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D100012	2021-04-08 14:27:37 +07:00
Serguei Katkov	e21829da6d	[GreedyRA ORE] Compute ORE stats if extra analysis is enabled To save compile time, avoid computation of stats if ORE will not emit it. The motivation is to add more stats and compute it only if it will dumped. Reviewers: reames, MatzeB, anemet, thegameg Reviewed By: reames Subscribers: llvm-commits Differential Revision: https://reviews.llvm.org/D100010	2021-04-08 14:24:18 +07:00
Esme-Yi	353dabc855	[Debug-Info] Use inlined strings in .dwinfo section by default for DBX. Summary: Set the default DwarfInlinedStrings as inlined strings for DBX, due to DBX does not support .dwstr section for now. Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D99933	2021-04-08 07:20:22 +00:00
Hsiangkai Wang	f26eeade90	[RISCV] Add scalable offset under very large stack size. If the stack size is larger than 12 bits, we have to use a scratch register to store the stack size. Before we introduce the scalable stack offset, we could simplify %0 = ADDI %stack.0, 0 => %scratch = ... # sequence of instructions to move the offset into %%scratch %0 = ADD %fp, %scratch However, if the offset contains scalable part, we need to consider it. %0 = ADDI %stack.0, 0 => %scratch = ... # sequence of instructions to move the offset into %%scratch %scratch = ADD %fp, %scratch %scalable_offset = ... # sequence of instructions for vscaled-offset. %0 = ADD/SUB %scratch, %scalable_offset Differential Revision: https://reviews.llvm.org/D100035	2021-04-08 14:46:05 +08:00
Hsiangkai Wang	a750e1eab2	[NFC][RISCV] Add test for scalable offset under large stack size. This test case shows that we access wrong stack slots when the frame object has scalable offset under large stack size. Differential Revision: https://reviews.llvm.org/D100084	2021-04-08 14:46:05 +08:00
Juneyoung Lee	dda330f547	[Constant] Remove unused variable	2021-04-08 15:44:42 +09:00
Juneyoung Lee	01aeaffdfd	[Constant] ConstantStruct/Array should not lower poison to undef This is a (late) follow-up patch of 8871a4b4cab8a56fd6ff12fd024002c3c79128b3 and c95f39891a282ebf36199c73b705d4a2c78a46ce to make ConstantStruct::get/ConstantArray::getImpl correctly return PoisonValue if all elements are poison. This was found while discussing about the elements of a vector-typed UndefValue (D99853)	2021-04-08 15:23:12 +09:00
Hongtao Yu	12f5193bf4	[CSSPGO] Move pseudo probes to the beginning of a block to unblock SelectionDAG combine. Pseudo probes, when scattered in a block, can be chained dependencies of other regular DAG nodes and block DAG combine optimizations. To fix this, scattered probes in a block are grouped and placed at the beginning of the block. This shouldn't affect the profile quality. Test Plan: Reviewed By: wenlei, wmi Differential Revision: https://reviews.llvm.org/D100002	2021-04-07 22:45:35 -07:00
Philip Reames	b81ddb9786	[docs] Document our norms around reverts This has come up a few times recently, and I was surprised to notice that we don't have anything in the docs. This patch deliberately sticks to stuff that is uncontroversial in the community. Everything herein is thought to be widely agreed to by a large majority of the community. A few things were noted and removed in review which failed this standard, if you spot anything else, please point it out. Differential Revision: https://reviews.llvm.org/D99305	2021-04-07 21:02:19 -07:00

1 2 3 4 5 ...

213875 Commits