llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-26 12:43:36 +01:00

Author	SHA1	Message	Date
Fangrui Song	b331e4f4c1	Revert D103717 "[InstrProfiling] Make __profd_ unconditionally private for ELF" This reverts commit 76d0747e0807307780ba84cbd7e5c80b20c26bd7. If a group has `__llvm_prf_vals` due to static value profiler counters (`NS!=0`), we cannot make `__llvm_prf_data` private, because a prevailing text section may reference `__llvm_prf_data` and will cause a `relocation refers to a discarded section` linker error. Note: while a `__profc_` group is non-prevailing, it may be referenced by a prevailing text section due to inlining. ``` group section [ 66] `.group' [__profc__ZN5clang20EmitClangDeclContextERN4llvm12RecordKeeperERNS0_11raw_ostreamE] contains 4 sections: [Index] Name [ 67] __llvm_prf_cnts [ 68] __llvm_prf_vals [ 69] __llvm_prf_data [ 70] .rela__llvm_prf_data ```	2021-06-17 23:38:17 -07:00
Johannes Doerfert	363cd23ad2	[Attributor][FIX] Arguments of unknown functions can be undef This should fix PR50683. The wrong assumption was that we could always know what the callee is when we replace a call site argument with undef. We wanted to know that to remove the `noundef` that might be attached to the argument. Since no callee means we did the propagation on the caller site, there is no need to remove an attribute. It is only needed if we replace all uses and therefore pass `undef` instead of the value that was passed in otherwise.	2021-06-18 01:07:53 -05:00
Johannes Doerfert	4b0523df3a	[Attributor] Allow to skip the initial update for a new AA Users might want to run initialize for a set of AAs without an intermediate update step. Running update eagerly is not a requirement anyway so we make it optional.	2021-06-18 01:07:53 -05:00
Johannes Doerfert	c50c80812f	[Attributor] Use a centralized value simplification interface To allow outside AAs that simplify values we need to ensure all value simplification goes through the Attributor, not AAValueSimplify (or any of the other AAs we have already like AAPotentialValues). This patch also introduces an interface for the outside AAs to register simplification callbacks for an IRPosition. To make this work as expected we have to pass IRPositions instead of Values in AAValueSimplify, which makes sense by itself.	2021-06-18 01:07:53 -05:00
Johannes Doerfert	73f1949d65	[Attributor] Introduce a helper do deal with constant type mismatches If we simplify values we sometimes end up with type mismatches. If the value is a constant we can often cast it though to still allow propagation. The logic is now put into a helper and it replaces some ad hoc things we did before. This also introduces the AA namespace for abstract attribute related functions and types. Differential Revision: https://reviews.llvm.org/D103856	2021-06-18 01:07:52 -05:00
Johannes Doerfert	99cca18714	[Attributor] Make sure Heap2Stack works properly on a GPU target If the target stack is not accessible between different running "threads" we have to make sure not to create allocas for mallocs that might be used by multiple "threads". The "use check" is sufficient to prevent this but if we apply the "free check" we have to make sure the pointer is not communicated to others before the free is reached. Differential Revision: https://reviews.llvm.org/D98608	2021-06-18 01:07:52 -05:00
Johannes Doerfert	1d462a0452	[OpenMP][NFC] Expose AAExecutionDomain and rename its getter The initial use for AAExecutionDomain was to determine if a single thread executes a block. While this is sometimes informative most of the time, and for other reasons, we actually want to know if it is the "initial thread". Thus, the thread that started execution on the current device. The deduction needs to be adjusted in a follow up as the methods we use right not are looking for the OpenMP thread id which is resets whenever a thread enters a parallel region. What we basically want is to look for `llvm.nvvm.read.ptx.sreg.ntid.x` and equivalent functions.	2021-06-18 01:07:52 -05:00
Johannes Doerfert	4c4da5cf4e	[Attributor][NFC] Add test from PR49606 It is not clear to me how we fixed this, I reverted a few candidates but I couldn't make the test fail. Still worth having it in our regression suite.	2021-06-18 01:07:52 -05:00
Johannes Doerfert	b3f7c318e7	[Attributor][NFC] Precommit a set of test cases for load simplification	2021-06-18 01:07:51 -05:00
Johannes Doerfert	ffbe2b6b3c	[Attributor][NFC] AAReachability is currently stateless, don't invalidate it We invalidated AAReachabilityImpl directly which is not helpful and confusing as we still used it regardless. We now avoid invalidating it (not needed anyway) and add checks for the state. This has by itself no actual effect but prepares for later extensions.	2021-06-18 01:07:51 -05:00
George Balatsouras	d2b759a121	[dfsan] Replace dfs$ prefix with .dfsan suffix The current naming scheme adds the `dfs$` prefix to all DFSan-instrumented functions. This breaks mangling and prevents stack trace printers and other tools from automatically demangling function names. This new naming scheme is mangling-compatible, with the `.dfsan` suffix being a vendor-specific suffix: https://itanium-cxx-abi.github.io/cxx-abi/abi.html#mangling-structure With this fix, demangling utils would work out-of-the-box. Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D104494	2021-06-17 22:42:47 -07:00
Luke	1271774638	[RISCV] Don't enable Interleaved Access Vectorization The patch https://reviews.llvm.org/D101469 is intended to enable loop unrolling, not interleaved access vectorization. The method bool enableInterleavedAccessVectorization() should not be implemented.	2021-06-18 12:32:30 +08:00
Daniil Seredkin	981cf746b0	[InstCombine][NFC] Added tests for mul with zext/sext operands Baseline tests for D104193	2021-06-18 11:14:50 +07:00
Igor Kudrin	ca414c6429	[objdump][ARM] Fix evaluating the target address of a Thumb BLX(i) The instruction can be 16-bit aligned while targeting 32-bit aligned code. To calculate the target address correctly, the address of the instruction has to be adjusted. Differential Revision: https://reviews.llvm.org/D104446	2021-06-18 10:40:55 +07:00
Carl Ritson	b219ce9cea	[AMDGPU] Remove duplicate setOperationAction for v4i16/v4f16 (NFC)	2021-06-18 12:38:54 +09:00
Heejin Ahn	f7b0205560	[WebAssembly] Rename event to tag We recently decided to change 'event' to 'tag', and 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104423	2021-06-17 20:34:19 -07:00
Xun Li	2576f49320	[Coroutine] Properly deal with byval and noalias parameters This patch is to address https://bugs.llvm.org/show_bug.cgi?id=48857. Previous attempts can be found in D104007 and D101980. A lot of discussions can be found in those two patches. To summarize the bug: When Clang emits IR for coroutines, the first thing it does is to make a copy of every argument to the local stack, so that uses of the arguments in the function will all refer to the local copies instead of the arguments directly. However, in some cases we find that arguments are still directly used: When Clang emits IR for a function that has pass-by-value arguments, sometimes it emits an argument with byval attribute. A byval attribute is considered to be local to the function (just like alloca) and hence it can be easily determined that it does not alias other values. If in the IR there exists a memcpy from a byval argument to a local alloca, and then from that local alloca to another alloca, MemCpyOpt will optimize out the first memcpy because byval argument's content will not change. This causes issues because after a coroutine suspension, the byval argument may die outside of the function, and latter uses will lead to memory use-after-free. This is only a problem for arguments with either byval attribute or noalias attribute, because only these two kinds are considered local. Arguments without these two attributes will be considered to alias coro_suspend and hence we won't have this problem. So we need to be able to deal with these two attributes in coroutines properly. For noalias arguments, since coro_suspend may potentially change the value of any argument outside of the function, we simply shouldn't mark any argument in a coroutiune as noalias. This can be taken care of in CoroEarly pass. For byval arguments, if such an argument needs to live across suspensions, we will have to copy their value content to the frame, not just the pointer. Differential Revision: https://reviews.llvm.org/D104184	2021-06-17 19:06:10 -07:00
Jim Lin	c8807d8093	[M68k][NFC] Fix indentation in M68kInstrArithmetic.td Merely fix indentation Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D104434	2021-06-18 09:49:04 +08:00
Kuter Dinel	319a05fb4c	[FIX][Attributor] Fix broken build due to missing virtual deconstructors. The lack some virtual deconstructors where causing some builds bots to fail. This patch fixes that. Problematic commit: https://reviews.llvm.org/rGeaf1b6810ce0f40008b2b1d902750eafa3e198d3 Build bot: https://lab.llvm.org/buildbot/#/builders/18/builds/1741	2021-06-18 07:32:51 +03:00
Roman Lebedev	1b972a014c	[NFC][SimpleLoopUnswitch] unswitchTrivialBranch(): add debug output explaining unswitching failure It's not prohibitively verbose, and allows easier understanding why certain unswitching ultimately wasn't performed.	2021-06-18 00:46:04 +03:00
Kuter Dinel	5e7d306b6b	[Attributor] Derive AACallEdges attribute This attribute computes the optimistic live call edges using the attributor liveness information. This attribute will be used for deriving a inter-procedural function reachability attribute. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104059	2021-06-18 03:29:22 +03:00
Andrew Browne	b1d7a90c73	Revert "[DFSan] Cleanup code for platforms other than Linux x86_64." This reverts commit 8441b993bdba29437b296bad6a37464669eef35e. Buildbot failures.	2021-06-17 14:19:18 -07:00
Fangrui Song	9eab4a359b	[InstrProfiling] Make __profd_ unconditionally private for ELF For ELF, since all counters/data are in a section group (either `comdat any` or `comdat noduplicates`), and the signature for `comdat any` is `__profc_`, the D1003372 optimization prerequisite (linker GC cannot discard data variables while the text section is retained) is always satisified, we can make __profd_ unconditionally private. Reviewed By: davidxl, rnk Differential Revision: https://reviews.llvm.org/D103717	2021-06-17 14:16:54 -07:00
Craig Topper	f70ebc20e2	[PartiallyInlineLibCalls] Disable sqrt expansion for strictfp. This pass emits a floating point compare and a conditional branch, but if strictfp is enabled we don't emit a constrained compare intrinsic. The backend also won't expand the readonly sqrt call this pass inserts to a sqrt instruction under strictfp. So we end up with 2 libcalls as seen here. https://godbolt.org/z/oax5zMEWd Fix these things by disabling the pass. Differential Revision: https://reviews.llvm.org/D104479	2021-06-17 14:15:12 -07:00
Andrew Browne	a68fbc2c8a	[DFSan] Cleanup code for platforms other than Linux x86_64. These other platforms are unsupported and untested. They could be re-added later based on MSan code. Reviewed By: gbalats, stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D104481	2021-06-17 14:08:40 -07:00
Eli Friedman	e4884552df	[ScalarEvolution] Fix pointer/int type handling converting select/phi to min/max. The old version of this code would blindly perform arithmetic without paying attention to whether the types involved were pointers or integers. This could lead to weird expressions like negating a pointer. Explicitly handle simple cases involving pointers, like "x < y ? x : y". In all other cases, coerce the operands of the comparison to integer types. This avoids the weird cases, while handling most of the interesting cases. Differential Revision: https://reviews.llvm.org/D103660	2021-06-17 14:05:12 -07:00
Saleem Abdulrasool	0c385efcae	RISCV: clean up target expression handling The target specific expression handling was slightly regressed by bbea64250f65480d787e1c5ff45c4de3ec2dcda8. This restores the proper sub-expression evaluation to allow for constant folding within the expression. We explicitly discard the layout and assembler when evaluating the expression to avoid any symbolic computation and instead using the `evaluateAsRelocatable` to canonicalise and constant fold only. We can also simplify the expression handling - none of the target variants support symbolic difference. This simplifies the logic for that and adds additional tests to ensure that we do not accidentally regress here in the future. Reviewed By: maskray Differential Revision: https://reviews.llvm.org/D104473	2021-06-17 13:35:32 -07:00
Jon Roelofs	921bd72ae7	[GISel] Eliminate redundant bitmasking This was a GISel vs SDAG regression that showed up at -Os on arm64 in: SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding.test https://llvm.godbolt.org/z/aecjodsjG Differential revision: https://reviews.llvm.org/D103334	2021-06-17 12:53:00 -07:00
Jon Roelofs	10c5d815eb	[AArch64][GISel] and+or+shl => bfi This fixes a GISEL vs SDAG regression that showed up at -Os in 256.bzip2 In `_getAndMoveToFrontDecode`: gisel: ``` and w9, w0, #0xff orr w9, w9, w8, lsl #8 ``` sdag: ``` bfi w0, w8, #8, #24 ``` Differential revision: https://reviews.llvm.org/D103291	2021-06-17 12:52:59 -07:00
Jorge Gorbe Moya	70dce4754b	Revert "[NFC] Remove checking pointee type for byval/preallocated type" This reverts commit 738abfdbea21acd2597d83ad3390daf5696b6d07.	2021-06-17 12:29:23 -07:00
Nikita Popov	81cd78135c	[LoopUnroll] Fold all exits based on known trip count/multiple Fold all exits based on known trip count/multiple information from SCEV. Previously only the latch exit or the single exit were folded. This doesn't yet eliminate ULO.TripCount and ULO.TripMultiple entirely: They're still used to a) decide whether runtime unrolling should be performed and b) for ORE remarks. However, the core unrolling logic is independent of them now. Differential Revision: https://reviews.llvm.org/D104203	2021-06-17 20:58:34 +02:00
Patrick Holland	5d008e4862	[MCA] [RegisterFile] Allow for skipping Defs with RegID of 0 (rather than assert(RegID) like we do before this patch). This patch will allow developers to remove unwanted instruction Defs (most likely from within a target specific InstrPostProcess) by setting that Def's RegisterID to 0. Differential Revision: https://reviews.llvm.org/D104433	2021-06-17 11:52:43 -07:00
Andrew Ng	6e97d78450	[llvm-symbolizer][docs] Update example for --verbose in the guide Differential Revision: https://reviews.llvm.org/D104128	2021-06-17 19:12:44 +01:00
Roman Lebedev	2b8fbae5a3	[X86] AMD Zen 3: don't confuse shift and shuffle, NFC These proc res groups occupy the exact same pipes, so this doesn't affect the modelling, but it's confusing nontheless.	2021-06-17 21:07:35 +03:00
Roman Lebedev	5f87b00dc8	[NFC] LoopVectorizationCostModel::getMaximizedVFForTarget(): clarify debug msg This really isn't talking about vectors in general, but only about either fixed or scalable vectors, and it's pretty confusing to see it state that there aren't any vectors :)	2021-06-17 21:07:34 +03:00
LLVM GN Syncbot	55cf55cf08	[gn build] Port f27e4548fc42	2021-06-17 17:09:43 +00:00
Saleem Abdulrasool	bb616e9e7a	test: clean up some of the RISCV tests (NFC) This addresses some post-commit comments from jrtc27 to make the tests easier to process.	2021-06-17 09:51:09 -07:00
Haojian Wu	e76e730b8f	fix an -Wunused-variable warning in release built, NFC	2021-06-17 18:48:47 +02:00
Sanjay Patel	25656e8196	[InstSimplify] add tests for computeKnownBits of shift-with-bitcast op; NFC	2021-06-17 12:39:16 -04:00
Sanjay Patel	48440d6126	[InstCombine][x86] add tests for complex vector shift value tracking; NFC https://llvm.org/PR50123	2021-06-17 12:39:16 -04:00
Saleem Abdulrasool	f56e4f6d3d	RISCV: adjust handling of relocation emission for RISCV This re-architects the RISCV relocation handling to bring the implementation closer in line with the implementation in binutils. We would previously aggressively resolve the relocation. With this restructuring, we always will emit a paired relocation for any symbolic difference of the type of S±T[±C] where S and T are labels and C is a constant. GAS has a special target hook controlled by `RELOC_EXPANSION_POSSIBLE` which indicates that a fixup may be expanded into multiple relocations. This is used by the RISCV backend to always emit a paired relocation - either ADD[WIDTH] + SUB[WIDTH] for text relocations or SET[WIDTH] + SUB[WIDTH] for a debug info relocation. Irrespective of whether linker relaxation support is enabled, symbolic difference is always emitted as a paired relocation. This change also sinks the target specific behaviour down into the target specific area rather than exposing it to the shared relocation handling. In the process, we also sink the "special" handling for debug information down into the RISCV target. Although this improves the path for the other targets, this is not necessarily entirely ideal either. The changes in the debug info emission could be done through another type of hook as this functionality would be required by any other target which wishes to do linker relaxation. However, as there are no other targets in LLVM which currently do this, this is a reasonable thing to do until such time as the code needs to be shared. Improve the handling of the relocation (and add a reduced test case from the Linux kernel) to ensure that we handle complex expressions for symbolic difference. This ensures that we correct relocate symbols with the adddends normalized and associated with the addition portion of the paired relocation. This change also addresses some review comments from Alex Bradbury about the relocations meant for use in the DWARF CFA being named incorrectly (using ADD6 instead of SET6) in the original change which introduced the relocation type. This resolves the issues with the symbolic difference emission sufficiently to enable building the Linux kernel with clang+IAS+lld (without linker relaxation). Resolves PR50153, PR50156! Fixes: ClangBuiltLinux/linux#1023, ClangBuiltLinux/linux#1143 Reviewed By: nickdesaulniers, maskray Differential Revision: https://reviews.llvm.org/D103539	2021-06-17 08:20:02 -07:00
Stephen Tozer	84970078e4	Reapply "[DebugInfo] Prevent non-determinism when updating DIArgList users of a value" Reapply the commit which previously caused build failures due to the mismatched template arguments between the return type and the returned SmallVector. This reverts commit e8991caea8690ec2d17b0b7e1c29bf0da6609076.	2021-06-17 16:16:55 +01:00
Kevin P. Neal	abe1b0a2fe	[FPEnv][InstSimplify] Precommit tests for D103169. In D103169 I'm adding to InstSimplify support for NaN to constrained intrinsics that have a regular FP IR instruction counterpart. Precommit the tests for clarity when that ticket lands.	2021-06-17 10:34:39 -04:00
Guillaume Chatelet	913b337ddc	[llvm] fix typo in comment	2021-06-17 14:30:52 +00:00
Stephen Tozer	2242a1aa55	Revert "[DebugInfo] Prevent non-determinism when updating DIArgList users of a value" Commit caused build errors on buildbots with [-Werror,-Wreturn-std-move] enabled. This reverts commit fa1de88f81e9c6db5255ca7c4d0fd25606c5a054.	2021-06-17 15:20:59 +01:00
Stephen Tozer	3173d57109	[DebugInfo] Prevent non-determinism when updating DIArgList users of a value This patch fixes an issue where builds of programs with multiple dbg.values with DIArgList locations could have non-deterministic output. This issue was caused by ReplaceableMetadataImpl::getAllArgListUsers, which returned DIArgList pointers in a random order; the output of this function would later be used to insert dbg.values, causing the order of insertion to be non-deterministic. This patch changes getAllArgListUsers to return pointers in a fixed order. Differential Revision: https://reviews.llvm.org/D104105	2021-06-17 15:09:27 +01:00
Sjoerd Meijer	523d263530	[FuncSpec] Precommit test: don't specialise funcs with NoDuplicate instrs. NFC.	2021-06-17 14:13:25 +01:00
Simon Pilgrim	1bc07d1287	[X86] combineSelect - refactor MIN/MAX detection code to make it easier to add additional select(setcc,x,y) folds. NFCI. I need to add some additional handling to address some of the regressions from D101074	2021-06-17 13:50:59 +01:00
Florian Hahn	19b14a93b0	[X86] Check using default in test added in 0bd5bbb31e0345ae. Make sure llvm-mc is invariant with respect to debug locations in the test (checks update to use the -x86-pad-for-align default value)	2021-06-17 13:19:43 +01:00
Florian Hahn	3e8d4a4389	[X86] Add test showing binary differences with -x86-pad-for-align. This patch adds a test case showing how a single extra .loc can cause binary differences when using -x86-pad-for-align=true. The issue has been discussed in D94542, PR42138, PR48742.	2021-06-17 12:27:17 +01:00

... 2 3 4 5 6 ...

217481 Commits