llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-22 18:54:02 +01:00

Author	SHA1	Message	Date
Lang Hames	d2647ecc04	[ORC][C-bindings] Re-order object transform function arguments. ObjInOut is an in-out parameter not a return value argument, so by convention it should come after the context value (Ctx).	2021-06-18 22:12:39 +10:00
Lang Hames	eb023bd323	[ORC] Use uint8_t rather than char for RPC wrapper-function calls. This partially reverts 838490de7ed, which broke some Solaris bots. Apparently Solaris defines int8_t as char rather than signed char, which made the SerializationTypeName<char> specialization a redefinition. This partial revert isolates use of uint8_t buffers to ORC-RPC handling of wrapper functions only. The TargetProcessControl::runWrapper method will continue to use char buffers.	2021-06-18 21:56:09 +10:00
Lang Hames	04c9dcdc5d	[ORC] Add support for dumping objects to the C API. Provides ObjectTransformLayer APIs, a getter to access the ObjectTransformLayer member of LLJIT, and the DumpObjects utility to make construction of a dump-to-disk transform easy. An example showing how the new APIs can be used has been added in llvm/examples/OrcV2Examples/OrcV2CBindingsDumpObjects.	2021-06-18 20:56:45 +10:00
Liqiang Tao	1584f0233e	Revert D104028 "[llvm][Inliner] Add an optional PriorityInlineOrder"	2021-06-18 18:52:00 +08:00
Max Kazantsev	6b59cab001	[LoopDeletion] Break backedge if we can prove that the loop is exited on 1st iteration (try 3) This patch handles one particular case of one-iteration loops for which SCEV cannot straightforwardly prove BECount = 1. The idea of the optimization is to symbolically execute conditional branches on the 1st iteration, moving in topoligical order, and only visiting blocks that may be reached on the first iteration. If we find out that we never reach header via the latch, then the backedge can be broken. This implementation uses InstSimplify. SCEV version was rejected due to high compile time impact. Differential Revision: https://reviews.llvm.org/D102615 Reviewed By: nikic	2021-06-18 17:31:57 +07:00
Haojian Wu	2192be141f	[Attributor] Fix UB behavior on uninitalized bool variables. Found by ASAN.	2021-06-18 11:49:42 +02:00
Jay Foad	58f1e523e1	[AMDGPU] Update generated checks. NFC.	2021-06-18 10:49:02 +01:00
Daniil Seredkin	ce9c5da2dc	[InstCombine] Fold (sext bool X) * (sext bool X) to zext (and X, X) InstCombine didn't perform (sext bool X) * (sext bool X) --> zext (and X, X) which can result in just (zext X). The patch adds regression tests to check this transformation and adds a check for equality of mul's operands for that case. Differential Revision: https://reviews.llvm.org/D104193	2021-06-18 16:28:06 +07:00
Max Kazantsev	bc696a57a2	[Test] Add XFAIL unit test for PR50765	2021-06-18 16:25:42 +07:00
Liqiang Tao	921bbbb45e	[llvm][Inliner] Add an optional PriorityInlineOrder This patch adds an optional PriorityInlineOrder, which uses the heap to order inlining. The callsite which size is smaller would have a higher priority. Reviewed By: mtrofin Differential Revision: https://reviews.llvm.org/D104028	2021-06-18 16:55:38 +08:00
Max Kazantsev	7e6d9eb18f	[NFC] Assert non-zero factor before division This is to ensure that zero denominator leads to controlled assertion failure rather than UB.	2021-06-18 15:50:50 +07:00
Haojian Wu	a8d4c2a73b	[Attributor] Don't print the call-graph in a hard-coded file. This looks like not a practical pattern in our codebase (it could fail in some sandbox environement). Instead we print it via standard output, and it is controled by the -attributor-print-call-graph, this follows a similiar pattern of attributor-print-dep.	2021-06-18 09:38:07 +02:00
Tomasz Miąsko	da9f3ec14e	[Demangle][Rust] Parse dot suffix Allow mangled names to include an arbitrary dot suffix, akin to vendor specific suffix in Itanium mangling. Primary motivation is a support for symbols renamed during ThinLTO import / promotion (ThinLTO is the default configuration for optimized builds in rustc). Reviewed By: dblaikie Differential Revision: https://reviews.llvm.org/D104358	2021-06-18 09:29:45 +02:00
Daniil Seredkin	2f87452581	Revert "[InstCombine] Fold (sext bool X) * (sext bool X) to zext (and X, X)" This reverts commit 31053338c97b36ffb582f9c04a74cec69cce3e70.	2021-06-18 14:21:02 +07:00
Daniil Seredkin	acf8b9a690	[InstCombine] Fold (sext bool X) * (sext bool X) to zext (and X, X) InstCombine didn't perform (sext bool X) * (sext bool X) --> zext (and X, X) which can result in just (zext X). The patch adds regression tests to check this transformation and adds a check for equality of mul's operands for that case. Differential Revision: https://reviews.llvm.org/D104193	2021-06-18 14:12:00 +07:00
Fangrui Song	b331e4f4c1	Revert D103717 "[InstrProfiling] Make __profd_ unconditionally private for ELF" This reverts commit 76d0747e0807307780ba84cbd7e5c80b20c26bd7. If a group has `__llvm_prf_vals` due to static value profiler counters (`NS!=0`), we cannot make `__llvm_prf_data` private, because a prevailing text section may reference `__llvm_prf_data` and will cause a `relocation refers to a discarded section` linker error. Note: while a `__profc_` group is non-prevailing, it may be referenced by a prevailing text section due to inlining. ``` group section [ 66] `.group' [__profc__ZN5clang20EmitClangDeclContextERN4llvm12RecordKeeperERNS0_11raw_ostreamE] contains 4 sections: [Index] Name [ 67] __llvm_prf_cnts [ 68] __llvm_prf_vals [ 69] __llvm_prf_data [ 70] .rela__llvm_prf_data ```	2021-06-17 23:38:17 -07:00
Johannes Doerfert	363cd23ad2	[Attributor][FIX] Arguments of unknown functions can be undef This should fix PR50683. The wrong assumption was that we could always know what the callee is when we replace a call site argument with undef. We wanted to know that to remove the `noundef` that might be attached to the argument. Since no callee means we did the propagation on the caller site, there is no need to remove an attribute. It is only needed if we replace all uses and therefore pass `undef` instead of the value that was passed in otherwise.	2021-06-18 01:07:53 -05:00
Johannes Doerfert	4b0523df3a	[Attributor] Allow to skip the initial update for a new AA Users might want to run initialize for a set of AAs without an intermediate update step. Running update eagerly is not a requirement anyway so we make it optional.	2021-06-18 01:07:53 -05:00
Johannes Doerfert	c50c80812f	[Attributor] Use a centralized value simplification interface To allow outside AAs that simplify values we need to ensure all value simplification goes through the Attributor, not AAValueSimplify (or any of the other AAs we have already like AAPotentialValues). This patch also introduces an interface for the outside AAs to register simplification callbacks for an IRPosition. To make this work as expected we have to pass IRPositions instead of Values in AAValueSimplify, which makes sense by itself.	2021-06-18 01:07:53 -05:00
Johannes Doerfert	73f1949d65	[Attributor] Introduce a helper do deal with constant type mismatches If we simplify values we sometimes end up with type mismatches. If the value is a constant we can often cast it though to still allow propagation. The logic is now put into a helper and it replaces some ad hoc things we did before. This also introduces the AA namespace for abstract attribute related functions and types. Differential Revision: https://reviews.llvm.org/D103856	2021-06-18 01:07:52 -05:00
Johannes Doerfert	99cca18714	[Attributor] Make sure Heap2Stack works properly on a GPU target If the target stack is not accessible between different running "threads" we have to make sure not to create allocas for mallocs that might be used by multiple "threads". The "use check" is sufficient to prevent this but if we apply the "free check" we have to make sure the pointer is not communicated to others before the free is reached. Differential Revision: https://reviews.llvm.org/D98608	2021-06-18 01:07:52 -05:00
Johannes Doerfert	1d462a0452	[OpenMP][NFC] Expose AAExecutionDomain and rename its getter The initial use for AAExecutionDomain was to determine if a single thread executes a block. While this is sometimes informative most of the time, and for other reasons, we actually want to know if it is the "initial thread". Thus, the thread that started execution on the current device. The deduction needs to be adjusted in a follow up as the methods we use right not are looking for the OpenMP thread id which is resets whenever a thread enters a parallel region. What we basically want is to look for `llvm.nvvm.read.ptx.sreg.ntid.x` and equivalent functions.	2021-06-18 01:07:52 -05:00
Johannes Doerfert	4c4da5cf4e	[Attributor][NFC] Add test from PR49606 It is not clear to me how we fixed this, I reverted a few candidates but I couldn't make the test fail. Still worth having it in our regression suite.	2021-06-18 01:07:52 -05:00
Johannes Doerfert	b3f7c318e7	[Attributor][NFC] Precommit a set of test cases for load simplification	2021-06-18 01:07:51 -05:00
Johannes Doerfert	ffbe2b6b3c	[Attributor][NFC] AAReachability is currently stateless, don't invalidate it We invalidated AAReachabilityImpl directly which is not helpful and confusing as we still used it regardless. We now avoid invalidating it (not needed anyway) and add checks for the state. This has by itself no actual effect but prepares for later extensions.	2021-06-18 01:07:51 -05:00
George Balatsouras	d2b759a121	[dfsan] Replace dfs$ prefix with .dfsan suffix The current naming scheme adds the `dfs$` prefix to all DFSan-instrumented functions. This breaks mangling and prevents stack trace printers and other tools from automatically demangling function names. This new naming scheme is mangling-compatible, with the `.dfsan` suffix being a vendor-specific suffix: https://itanium-cxx-abi.github.io/cxx-abi/abi.html#mangling-structure With this fix, demangling utils would work out-of-the-box. Reviewed By: stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D104494	2021-06-17 22:42:47 -07:00
Luke	1271774638	[RISCV] Don't enable Interleaved Access Vectorization The patch https://reviews.llvm.org/D101469 is intended to enable loop unrolling, not interleaved access vectorization. The method bool enableInterleavedAccessVectorization() should not be implemented.	2021-06-18 12:32:30 +08:00
Daniil Seredkin	981cf746b0	[InstCombine][NFC] Added tests for mul with zext/sext operands Baseline tests for D104193	2021-06-18 11:14:50 +07:00
Igor Kudrin	ca414c6429	[objdump][ARM] Fix evaluating the target address of a Thumb BLX(i) The instruction can be 16-bit aligned while targeting 32-bit aligned code. To calculate the target address correctly, the address of the instruction has to be adjusted. Differential Revision: https://reviews.llvm.org/D104446	2021-06-18 10:40:55 +07:00
Carl Ritson	b219ce9cea	[AMDGPU] Remove duplicate setOperationAction for v4i16/v4f16 (NFC)	2021-06-18 12:38:54 +09:00
Heejin Ahn	f7b0205560	[WebAssembly] Rename event to tag We recently decided to change 'event' to 'tag', and 'event section' to 'tag section', out of the rationale that the section contains a generalized tag that references a type, which may be used for something other than exceptions, and the name 'event' can be confusing in the web context. See - https://github.com/WebAssembly/exception-handling/issues/159#issuecomment-857910130 - https://github.com/WebAssembly/exception-handling/pull/161 Reviewed By: tlively Differential Revision: https://reviews.llvm.org/D104423	2021-06-17 20:34:19 -07:00
Xun Li	2576f49320	[Coroutine] Properly deal with byval and noalias parameters This patch is to address https://bugs.llvm.org/show_bug.cgi?id=48857. Previous attempts can be found in D104007 and D101980. A lot of discussions can be found in those two patches. To summarize the bug: When Clang emits IR for coroutines, the first thing it does is to make a copy of every argument to the local stack, so that uses of the arguments in the function will all refer to the local copies instead of the arguments directly. However, in some cases we find that arguments are still directly used: When Clang emits IR for a function that has pass-by-value arguments, sometimes it emits an argument with byval attribute. A byval attribute is considered to be local to the function (just like alloca) and hence it can be easily determined that it does not alias other values. If in the IR there exists a memcpy from a byval argument to a local alloca, and then from that local alloca to another alloca, MemCpyOpt will optimize out the first memcpy because byval argument's content will not change. This causes issues because after a coroutine suspension, the byval argument may die outside of the function, and latter uses will lead to memory use-after-free. This is only a problem for arguments with either byval attribute or noalias attribute, because only these two kinds are considered local. Arguments without these two attributes will be considered to alias coro_suspend and hence we won't have this problem. So we need to be able to deal with these two attributes in coroutines properly. For noalias arguments, since coro_suspend may potentially change the value of any argument outside of the function, we simply shouldn't mark any argument in a coroutiune as noalias. This can be taken care of in CoroEarly pass. For byval arguments, if such an argument needs to live across suspensions, we will have to copy their value content to the frame, not just the pointer. Differential Revision: https://reviews.llvm.org/D104184	2021-06-17 19:06:10 -07:00
Jim Lin	c8807d8093	[M68k][NFC] Fix indentation in M68kInstrArithmetic.td Merely fix indentation Reviewed By: myhsu Differential Revision: https://reviews.llvm.org/D104434	2021-06-18 09:49:04 +08:00
Kuter Dinel	319a05fb4c	[FIX][Attributor] Fix broken build due to missing virtual deconstructors. The lack some virtual deconstructors where causing some builds bots to fail. This patch fixes that. Problematic commit: https://reviews.llvm.org/rGeaf1b6810ce0f40008b2b1d902750eafa3e198d3 Build bot: https://lab.llvm.org/buildbot/#/builders/18/builds/1741	2021-06-18 07:32:51 +03:00
Roman Lebedev	1b972a014c	[NFC][SimpleLoopUnswitch] unswitchTrivialBranch(): add debug output explaining unswitching failure It's not prohibitively verbose, and allows easier understanding why certain unswitching ultimately wasn't performed.	2021-06-18 00:46:04 +03:00
Kuter Dinel	5e7d306b6b	[Attributor] Derive AACallEdges attribute This attribute computes the optimistic live call edges using the attributor liveness information. This attribute will be used for deriving a inter-procedural function reachability attribute. Reviewed By: jdoerfert Differential Revision: https://reviews.llvm.org/D104059	2021-06-18 03:29:22 +03:00
Andrew Browne	b1d7a90c73	Revert "[DFSan] Cleanup code for platforms other than Linux x86_64." This reverts commit 8441b993bdba29437b296bad6a37464669eef35e. Buildbot failures.	2021-06-17 14:19:18 -07:00
Fangrui Song	9eab4a359b	[InstrProfiling] Make __profd_ unconditionally private for ELF For ELF, since all counters/data are in a section group (either `comdat any` or `comdat noduplicates`), and the signature for `comdat any` is `__profc_`, the D1003372 optimization prerequisite (linker GC cannot discard data variables while the text section is retained) is always satisified, we can make __profd_ unconditionally private. Reviewed By: davidxl, rnk Differential Revision: https://reviews.llvm.org/D103717	2021-06-17 14:16:54 -07:00
Craig Topper	f70ebc20e2	[PartiallyInlineLibCalls] Disable sqrt expansion for strictfp. This pass emits a floating point compare and a conditional branch, but if strictfp is enabled we don't emit a constrained compare intrinsic. The backend also won't expand the readonly sqrt call this pass inserts to a sqrt instruction under strictfp. So we end up with 2 libcalls as seen here. https://godbolt.org/z/oax5zMEWd Fix these things by disabling the pass. Differential Revision: https://reviews.llvm.org/D104479	2021-06-17 14:15:12 -07:00
Andrew Browne	a68fbc2c8a	[DFSan] Cleanup code for platforms other than Linux x86_64. These other platforms are unsupported and untested. They could be re-added later based on MSan code. Reviewed By: gbalats, stephan.yichao.zhao Differential Revision: https://reviews.llvm.org/D104481	2021-06-17 14:08:40 -07:00
Eli Friedman	e4884552df	[ScalarEvolution] Fix pointer/int type handling converting select/phi to min/max. The old version of this code would blindly perform arithmetic without paying attention to whether the types involved were pointers or integers. This could lead to weird expressions like negating a pointer. Explicitly handle simple cases involving pointers, like "x < y ? x : y". In all other cases, coerce the operands of the comparison to integer types. This avoids the weird cases, while handling most of the interesting cases. Differential Revision: https://reviews.llvm.org/D103660	2021-06-17 14:05:12 -07:00
Saleem Abdulrasool	0c385efcae	RISCV: clean up target expression handling The target specific expression handling was slightly regressed by bbea64250f65480d787e1c5ff45c4de3ec2dcda8. This restores the proper sub-expression evaluation to allow for constant folding within the expression. We explicitly discard the layout and assembler when evaluating the expression to avoid any symbolic computation and instead using the `evaluateAsRelocatable` to canonicalise and constant fold only. We can also simplify the expression handling - none of the target variants support symbolic difference. This simplifies the logic for that and adds additional tests to ensure that we do not accidentally regress here in the future. Reviewed By: maskray Differential Revision: https://reviews.llvm.org/D104473	2021-06-17 13:35:32 -07:00
Jon Roelofs	921bd72ae7	[GISel] Eliminate redundant bitmasking This was a GISel vs SDAG regression that showed up at -Os on arm64 in: SingleSource/Benchmarks/Adobe-C++/simple_types_constant_folding.test https://llvm.godbolt.org/z/aecjodsjG Differential revision: https://reviews.llvm.org/D103334	2021-06-17 12:53:00 -07:00
Jon Roelofs	10c5d815eb	[AArch64][GISel] and+or+shl => bfi This fixes a GISEL vs SDAG regression that showed up at -Os in 256.bzip2 In `_getAndMoveToFrontDecode`: gisel: ``` and w9, w0, #0xff orr w9, w9, w8, lsl #8 ``` sdag: ``` bfi w0, w8, #8, #24 ``` Differential revision: https://reviews.llvm.org/D103291	2021-06-17 12:52:59 -07:00
Jorge Gorbe Moya	70dce4754b	Revert "[NFC] Remove checking pointee type for byval/preallocated type" This reverts commit 738abfdbea21acd2597d83ad3390daf5696b6d07.	2021-06-17 12:29:23 -07:00
Nikita Popov	81cd78135c	[LoopUnroll] Fold all exits based on known trip count/multiple Fold all exits based on known trip count/multiple information from SCEV. Previously only the latch exit or the single exit were folded. This doesn't yet eliminate ULO.TripCount and ULO.TripMultiple entirely: They're still used to a) decide whether runtime unrolling should be performed and b) for ORE remarks. However, the core unrolling logic is independent of them now. Differential Revision: https://reviews.llvm.org/D104203	2021-06-17 20:58:34 +02:00
Patrick Holland	5d008e4862	[MCA] [RegisterFile] Allow for skipping Defs with RegID of 0 (rather than assert(RegID) like we do before this patch). This patch will allow developers to remove unwanted instruction Defs (most likely from within a target specific InstrPostProcess) by setting that Def's RegisterID to 0. Differential Revision: https://reviews.llvm.org/D104433	2021-06-17 11:52:43 -07:00
Andrew Ng	6e97d78450	[llvm-symbolizer][docs] Update example for --verbose in the guide Differential Revision: https://reviews.llvm.org/D104128	2021-06-17 19:12:44 +01:00
Roman Lebedev	2b8fbae5a3	[X86] AMD Zen 3: don't confuse shift and shuffle, NFC These proc res groups occupy the exact same pipes, so this doesn't affect the modelling, but it's confusing nontheless.	2021-06-17 21:07:35 +03:00
Roman Lebedev	5f87b00dc8	[NFC] LoopVectorizationCostModel::getMaximizedVFForTarget(): clarify debug msg This really isn't talking about vectors in general, but only about either fixed or scalable vectors, and it's pretty confusing to see it state that there aren't any vectors :)	2021-06-17 21:07:34 +03:00

1 2 3 4 5 ...

217346 Commits