llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-11-25 20:23:11 +01:00

Author	SHA1	Message	Date
Serguei Katkov	a0cc2dd0ab	Revert "Mark gc.relocate and gc.result as readnone" As readnone function they become movable and LICM can hoist them out of a loop. As a result in LCSSA form phi node of type token is created. No one is ready that GCRelocate first operand is phi node but expects to be token. GVN test were also updated, it seems it does not do what is expected. Test for LICM is also added. This reverts commit f352463ade6e49c3b0275f296d9190d828b7630b.	2021-03-12 16:59:17 +07:00
Fraser Cormack	fa5bc9bba6	[RISCV] Optimize INSERT_VECTOR_ELT sequences This patch optimizes the codegen for INSERT_VECTOR_ELT in various ways. Primarily, it removes the use of vslidedown during lowering, and the vector element is inserted entirely using vslideup with a custom VL and slide index. Additionally, lowering of i64-element vectors on RV32 has been optimized in several ways. When the 64-bit value to insert is the same as the sign-extension of the lower 32-bits, the codegen can follow the regular path. When this is not possible, a new sequence of two i32 vslide1up instructions is used to get the vector element into a vector. This sequence was suggested by @craig.topper. From there, the value is slid into the final position for more consistent lowering across RV32 and RV64. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D98250	2021-03-12 09:13:38 +00:00
Fraser Cormack	8e6f056201	[RISCV] Fix up stale VECREDUCE comments. NFC. Reviewed By: craig.topper Differential Revision: https://reviews.llvm.org/D98399	2021-03-12 08:49:46 +00:00
Bjorn Pettersson	bcace0dff8	[InstSimplify] Simplify smul.fix and smul.fix.sat Add simplification of smul.fix and smul.fix.sat according to X * 0 -> 0 X * undef -> 0 X * (1 << scale) -> X This includes the commuted patterns and splatted vectors. Reviewed By: nikic Differential Revision: https://reviews.llvm.org/D98299	2021-03-12 09:09:58 +01:00
Bjorn Pettersson	5d0c366826	[ConstantFold] Handle undef/poison when constant folding smul_fix/smul_fix_sat Do constant folding according to posion * C -> poison C * poison -> poison undef * C -> 0 C * undef -> 0 for smul_fix and smul_fix_sat intrinsics (for any scale). Reviewed By: nikic, aqjune, nagisa Differential Revision: https://reviews.llvm.org/D98410	2021-03-12 09:09:58 +01:00
Johannes Doerfert	ce96365607	Revert "[OpenMP] Do not propagate match extensions to nested contexts" Two tests failed for some reason, need to investigate: https://lab.llvm.org/buildbot/#/builders/109/builds/10399 This reverts commit ad9e98b8efa0138559eb640023695dab54967a8d.	2021-03-11 23:48:36 -06:00
Johannes Doerfert	0058a98cec	Revert "[OpenMP] Introduce the `disable_selector_propagation` variant selector trait" Need to revert ad9e98b8efa0138559eb640023695dab54967a8d which this commit depends on. This reverts commit f771ef7b5f0ed260d00931cd50e6fe462edbacaf.	2021-03-11 23:48:35 -06:00
Johannes Doerfert	24773fbc70	[Attributor] Derive `willreturn` based on `mustprogress` Since D86233 we have `mustprogress` which, in combination with `readonly`, implies `willreturn`. The idea is that every side-effect has to be modeled as a "write". Consequently, `readonly` means there is no side-effect, and `mustprogress` guarantees that we cannot "loop" forever without side-effect. Reviewed By: fhahn Differential Revision: https://reviews.llvm.org/D94125	2021-03-11 23:31:44 -06:00
Johannes Doerfert	0f695356f4	[Attributor][NFC] Update tests after D94741 The update_test_checks script can now check for global symbols and is able to handle them properly when they differ across prefixes, e.g., attribute #0 might be different in different runs. This patch simply updates all the Attributor tests with the new script. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D97906	2021-03-11 23:31:39 -06:00
Johannes Doerfert	e3c1fc67b1	[OpenMP] Introduce the `disable_selector_propagation` variant selector trait Nested `omp [begin\|end] declare variant` inherit the selectors from surrounding `omp (begin\|end) declare variant` constructs. To stop such propagation the user can add the `disable_selector_propagation` to the `extension` set in the `implementation` selector. Reviewed By: tianshilei1992 Differential Revision: https://reviews.llvm.org/D95765	2021-03-11 23:31:25 -06:00
Johannes Doerfert	274a713a36	[OpenMP] Do not propagate match extensions to nested contexts If we have nested declare variant context, it doesn't make sense to inherit the match extension from the parent. Instead, just skip it. Reviewed By: JonChesterfield Differential Revision: https://reviews.llvm.org/D95764	2021-03-11 23:31:21 -06:00
Johannes Doerfert	6e9991eb47	[Utils] Check for more global information in update_test_checks This allows to check for various globals (metadata/attributes/...) and also resolves problems with globals (metadata/attributes/...) being reused across different prefixes. Reviewed By: sstefan1 Differential Revision: https://reviews.llvm.org/D94741	2021-03-11 23:31:16 -06:00
Johannes Doerfert	6c444365cb	[FIX] Allow non-constant assume operand bundle operands. Fixes PR49545 Reviewed By: zequanwu, fhahn, lebedev.ri Differential Revision: https://reviews.llvm.org/D98444	2021-03-11 23:31:09 -06:00
Esme-Yi	018bbe154e	[Debug-Info] Add names for the debug line prologue. Summary: This is a minor patch to add names for the debug line prologue, as a follow-up of D95998. Reviewed By: dblaikie, ikudrin, shchenz Differential Revision: https://reviews.llvm.org/D98383	2021-03-12 04:45:08 +00:00
Craig Topper	07d94ffb28	[RISCV] Return false from isShuffleMaskLegal except for splats. We don't support any other shuffles currently. This changes the bswap/bitreverse tests that check for this in their expansion code. Previously we expanded a byte swapping shuffle through memory. Now we're scalarizing and doing bit operations on scalars to swap bytes. In the future we can probably use vrgather.vx to do a byte swap shuffle.	2021-03-11 20:02:49 -08:00
Yonghong Song	dae54112ca	BPF: provide better error message for unsupported atomic operations Currently, BPF backend does not support all variants of atomic_load_{add,and,or,xor}, atomic_swap and atomic_cmp_swap For example, it only supports 32bit (with alu32 mode) and 64bit operations for atomic_load_{and,or,xor}, atomic_swap and atomic_cmp_swap. Due to historical reason, atomic_load_add is always supported with 32bit and 64bit. If user used an unsupported atomic operation, currently, codegen selectiondag cannot find bpf support and will issue a fatal error. This is not user friendly as user may mistakenly think this is a compiler bug. This patch added Custom rule for unsupported atomic operations and will emit better error message during ReplaceNodeResults() callback. The following is an example output. $ cat t.c short sync(short *p) { return __sync_val_compare_and_swap (p, 2, 3); } $ clang -target bpf -O2 -g -c t.c t.c:2:11: error: Unsupported atomic operations, please use 64 bit version return __sync_val_compare_and_swap (p, 2, 3); ^ fatal error: error in backend: Cannot select: t19: i64,ch = AtomicCmpSwap<(load store seq_cst seq_cst 2 on %ir.p)> t0, t2, Constant:i64<2>, Constant:i64<3>, t.c:2:11 t2: i64,ch = CopyFromReg t0, Register:i64 %0 t1: i64 = Register %0 t11: i64 = Constant<2> t10: i64 = Constant<3> In function: sync PLEASE submit a bug report ... Fatal error will still happen since we did not really do proper lowering for these unsupported atomic operations. But we do get a much better error message. Differential Revision: https://reviews.llvm.org/D98471	2021-03-11 19:19:00 -08:00
Carl Ritson	b3653dfbd2	[AMDGPU] Do not annotate an else branch if there is a kill As llvm.amdgcn.kill is lowered to a terminator it can cause else branch annotations to end up in the wrong block. Do not annotate conditionals as else branches where there is a kill to avoid this. Reviewed By: arsenm Differential Revision: https://reviews.llvm.org/D97427	2021-03-12 11:52:08 +09:00
Mircea Trofin	87d29d8dee	Revert "[NPM][CGSCC] FunctionAnalysisManagerCGSCCProxy: do not clear immutable function passes" This reverts commit 5eaeb0fa67e57391f5584a3f67fdb131e93afda6. It appears there are analyses that assume clearing - example: https://lab.llvm.org/buildbot#builders/36/builds/5964	2021-03-11 18:31:19 -08:00
Mircea Trofin	c85bdf8369	[NPM][CGSCC] FunctionAnalysisManagerCGSCCProxy: do not clear immutable function passes Check with the analysis result by calling invalidate instead of clear on the analysis manager. Differential Revision: https://reviews.llvm.org/D98440	2021-03-11 18:15:28 -08:00
David Blaikie	d97ffcdf36	Move (llvm-original-di-preservation) test example output into the Inputs directory (since it's an input to the test execution) The "Inputs" subdirectory is used for all files read by the test, not only those used as input to the execution - so even though this file is used as a golden reference for the output of the test, it's still an input to the test execution (it is read in the process of executing the test).	2021-03-11 17:36:33 -08:00
Maksim Panchenko	3442ee8292	[RuntimeDyld] Speedup resolution of relocations to external symbols From what I can tell, the loop inside applyExternalSymbolRelocations() used to call getSymbolAddress(). After the JITSymbolResolver interface redesign, the functionality has changed, and the loop should no longer trigger repopulation of ExternalSymbolRelocations. If that's the case, there is no need to update the loop iterator manually, and ExternalSymbolRelocations can be cleared at once. This way, when there are many external symbols in the program, the function runs much faster. Reviewed By: lhames Differential Revision: https://reviews.llvm.org/D97531	2021-03-11 16:58:49 -08:00
Carl Ritson	df6c503796	[AMDGPU] Restrict image_msaa_load to MSAA dimension types This instruction is only valid on 2D MSAA and 2D MSAA Array surfaces. Remove intrinsic support for other dimension types, and block assembly for unsupported dimensions. Reviewed By: foad Differential Revision: https://reviews.llvm.org/D98397	2021-03-12 09:47:24 +09:00
Ruiling Song	27bf846d0e	[AMDGPU] Don't check hasStackObjects() when reserving VGPR We have amdgpu_gfx functions that have high register pressure. If we do not reserve VGPR for SGPR spill, we will fall into the path to spill the SGPR to memory, which does not only have correctness issue, but also have really bad performance. I don't know why there is the check for hasStackObjects(), in our case, we don't have stack objects at the time of finalizeLowering(). So just remove the check that we always reserve a VGPR for possible SGPR spill in non-entry functions. Reviewed by: arsenm Differential Revision: https://reviews.llvm.org/D98345	2021-03-12 08:11:14 +08:00
Ruiling Song	782055450f	[AMDGPU] Free reserved VGPR if no SGPR spill I met some code generation behavior change when I tried to remove the hasStackObject() check when reserving VGPR for SGPR spill. For example, the function `callee_no_stack_no_fp_elim_all` in the lit test file `callee-frame-setup.ll`. The generated code changed from: ``` s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) s_mov_b32 s4, s33 s_mov_b32 s33, s32 s_mov_b32 s33, s4 s_setpc_b64 s[30:31] ``` into something like: ``` s_waitcnt vmcnt(0) expcnt(0) lgkmcnt(0) v_writelane_b32 v63, s33, 0 s_mov_b32 s33, s32 v_readlane_b32 s33, v63, 0 s_setpc_b64 s[30:31] ``` I think we still prefer the old version where only scalar instructions are needed. The idea here is free the reserved VGPR if no SGPR spills. So we will very likely to use a free SGPR for fp/sp spill. Reviewed by: arsenm Differential Revision: https://reviews.llvm.org/D98344	2021-03-12 08:11:14 +08:00
Craig Topper	31b93a5021	[RISCV] Add test cases for fixed vector bitreverse, bswap, ctlz, cttz, and ctpop. Codegen needs to be improved, but I wanted to check for crashes.	2021-03-11 15:56:32 -08:00
David Blaikie	964aca873b	Fix use of deprecated IRBuilder::CreateLoad in Kaleidoscope	2021-03-11 15:44:02 -08:00
Florian Hahn	648fd413ce	[Matrix] Add support for matrix-by-scalar division. This patch extends the matrix spec to allow matrix-by-scalar division. Originally support for `/` was left out to avoid ambiguity for the matrix-matrix version of `/`, which could either be elementwise or specified as matrix multiplication M1 * (1/M2). For the matrix-scalar version, no ambiguity exists; `*` is also an elementwise operation in that case. Matrix-by-scalar division is commonly supported by systems including Matlab, Mathematica or NumPy. Reviewed By: rjmccall Differential Revision: https://reviews.llvm.org/D97857	2021-03-11 22:21:23 +00:00
Stanislav Mekhanoshin	68c47b952f	[AMDGPU] Remove dead MTBUF patterns These patterns are obviously dead, they are using format operand which is not selected and we have no corresponding SelectMUBUF() function. Differential Revision: https://reviews.llvm.org/D98451	2021-03-11 14:13:00 -08:00
Nikita Popov	29852402a8	[Attributor] Don't access pointer elem type in constructPointer (NFC) Splitting this out as the change is non-trivial: The way this code handled pointer types doesn't really make sense, as GEPs can only apply an offset to the outermost pointer, but can't drill down into interior pointer types (which would require dereferencing memory). Instead give special treatment to the first (pointer) index. I've hardcoded it to zero as that's the only way the function is used right now, but handling non-zero indexes would be straightforward. The original goal here was to have an element type for CreateGEP.	2021-03-11 21:36:40 +01:00
Petr Hosek	83705adcd9	[InstrProfiling] Generate runtime hook for ELF platforms When using -fprofile-list to selectively apply instrumentation only to certain files or functions, we may end up with a binary that doesn't have any counters in the case where no files were selected. However, because on Linux and Fuchsia, we pass -u__llvm_profile_runtime, the runtime would still be pulled in and incur some non-trivial overhead, especially in the case when the continuous or runtime counter relocation mode is being used. A better way would be to pull in the profile runtime only when needed by declaring the __llvm_profile_runtime symbol in the translation unit only when needed. This approach was already used prior to 9a041a75221ca, but we changed it to always generate the __llvm_profile_runtime due to a TAPI limitation. Since TAPI is only used on Mach-O platforms, we could use the early emission of __llvm_profile_runtime there, and on other platforms we could change back to the earlier approach where the symbol is generated later only when needed. We can stop passing -u__llvm_profile_runtime to the linker on Linux and Fuchsia since the generated undefined symbol in each translation unit that needed it serves the same purpose. Differential Revision: https://reviews.llvm.org/D98061	2021-03-11 12:29:01 -08:00
Valery N Dmitriev	1cb62c35f6	[SLP] Fix crash when matching associative reduction for integer min/max. Associative reduction matcher in SLP begins with select instruction but when it reached call to llvm.umax (or alike) via def-use chain the latter also matched as UMax kind. The routine's later code assumes matched instruction to be a select and thus it merely died on the first encountered cast that did not fit. Differential Revision: https://reviews.llvm.org/D98432	2021-03-11 11:52:57 -08:00
Wei Mi	702028822a	[IndirectCallPromotion] Don't strip ".__uniq." suffix when it strips ".llvm." suffix. Currently IndirectCallPromotion simply strip everything after the first "." in LTO mode, in order to match the symbol name and the name with ".llvm." suffix in the value profile. However, if -funique-internal-linkage-names and thinlto are both enabled, the name may have both ".__uniq." suffix and ".llvm." suffix, and the current mechanism will strip them both, which is unexpected. The patch fixes the problem. Differential Revision: https://reviews.llvm.org/D98389	2021-03-11 11:08:47 -08:00
LemonBoy	6291746c16	[SelectionDAG] Improve scalarization of irregular vector types Use a more general strategy when splitting a vector into scalar parts (and vice-versa) to correctly handle vector types whose element size is not a power of 2 (and a multiple of 8). Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D98273	2021-03-11 19:57:13 +01:00
LemonBoy	31e713306f	[MIPS] Fix lowering of irregular vector arguments The code deciding how to split the vector in register-sized integers used the integer division operator, thus rounding down the result. Correct the computation for irregularly-sized types (non-power-of-two, non multiple of 8) by rounding the division result upwards. Reviewed By: atanasyan Differential Revision: https://reviews.llvm.org/D98189	2021-03-11 19:56:04 +01:00
LLVM GN Syncbot	ff7a1a2d1e	[gn build] Port 5433a79176a3	2021-03-11 18:35:32 +00:00
Wenlei He	fbf8284cc0	[SamplePGO] Skip inlinee profile scaling for sample loader inlining For CGSCC inline, we need to scale down a function's branch weights and entry counts when thee it's inlined at a callsite. This is done through updateCallProfile. Additionally, we also scale the weigths for the inlined clone based on call site count in updateCallerBFI. Neither is needed for inlining during sample profile loader as it's using context profile that is separated from inlinee's own profile. This change skip the inlinee profile scaling for sample loader inlining. Differential Revision: https://reviews.llvm.org/D98187	2021-03-11 10:18:26 -08:00
Nikita Popov	30e03f0628	[UnitTests] Remove uses of deprecated CreateLoad() API Missed this usage inside OpenMPIRBuilderTest.	2021-03-11 19:05:53 +01:00
Craig Topper	dd58d36de0	[RISCV] Support fixed vector copysign. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D98394	2021-03-11 09:57:24 -08:00
David Green	9b84945f7a	[ARM] Move t2DoLoopStart reg alloc hint This adjusts the place that the t2DoLoopStart reg allocation hint is inserted, adding it in the ARMTPAndVPTOptimizaionPass in a similar place as other tail predicated loop optimizations. This removes the need for doing so in a custom inserter, and should make the hint more accurate, only adding it where we expect to create a DLS (not DLSTP or WLS).	2021-03-11 17:56:19 +00:00
David Green	18fc27f084	[ARM] Improve WLS lowering Recently we improved the lowering of low overhead loops and tail predicated loops, but concentrated first on the DLS do style loops. This extends those improvements over to the WLS while loops, improving the chance of lowering them successfully. To do this the lowering has to change a little as the instructions are terminators that produce a value - something that needs to be treated carefully. Lowering starts at the Hardware Loop pass, inserting a new llvm.test.start.loop.iterations that produces both an i1 to control the loop entry and an i32 similar to the llvm.start.loop.iterations intrinsic added for do loops. This feeds into the loop phi, properly gluing the values together: %wls = call { i32, i1 } @llvm.test.start.loop.iterations.i32(i32 %div) %wls0 = extractvalue { i32, i1 } %wls, 0 %wls1 = extractvalue { i32, i1 } %wls, 1 br i1 %wls1, label %loop.ph, label %loop.exit ... loop: %lsr.iv = phi i32 [ %wls0, %loop.ph ], [ %iv.next, %loop ] .. %iv.next = call i32 @llvm.loop.decrement.reg.i32(i32 %lsr.iv, i32 1) %cmp = icmp ne i32 %iv.next, 0 br i1 %cmp, label %loop, label %loop.exit The llvm.test.start.loop.iterations need to be lowered through ISel lowering as a pair of WLS and WLSSETUP nodes, which each get converted to t2WhileLoopSetup and t2WhileLoopStart Pseudos. This helps prevent t2WhileLoopStart from being a terminator that produces a value, something difficult to control at that stage in the pipeline. Instead the t2WhileLoopSetup produces the value of LR (essentially acting as a lr = subs rn, 0), t2WhileLoopStart consumes that lr value (the Bcc). These are then converted into a single t2WhileLoopStartLR at the same point as t2DoLoopStartTP and t2LoopEndDec. Otherwise we revert the loop to prevent them from progressing further in the pipeline. The t2WhileLoopStartLR is a single instruction that takes a GPR and produces LR, similar to the WLS instruction. %1:gprlr = t2WhileLoopStartLR %0:rgpr, %bb.3 t2B %bb.1 ... bb.2.loop: %2:gprlr = PHI %1:gprlr, %bb.1, %3:gprlr, %bb.2 ... %3:gprlr = t2LoopEndDec %2:gprlr, %bb.2 t2B %bb.3 The t2WhileLoopStartLR can then be treated similar to the other low overhead loop pseudos, eventually being lowered to a WLS providing the branches are within range. Differential Revision: https://reviews.llvm.org/D97729	2021-03-11 17:56:19 +00:00
Simon Wallis	fb37b7e70a	[AArch64] Fix -Wunused-but-set-variable in GCC non-debug build [AArch64] Fix -Wunused-but-set-variable in GCC -DLLVM_ENABLE_ASSERTIONS=off non-debug build. Reviewed By: MaskRay Differential Revision: https://reviews.llvm.org/D98431	2021-03-11 17:54:05 +00:00
Hiroshi Yamauchi	1295ae3ffe	[PGO] Fix two issues in PGOMemOPSizeOpt. 1. PGOMemOPSizeOpt grabs only the first, up to five (by default) entries from the value profile metadata and preserves the remaining entries for the fallback memop call site. If there are more than five entries, the rest of the entries would get dropped. This is fine for PGOMemOPSizeOpt itself as it only promotes up to 3 (by default) values, but potentially not for other downstream passes that may use the value profile metadata. 2. PGOMemOPSizeOpt originally assumed that only values 0 through 8 are kept track of. When the range buckets were introduced, it was changed to skip the range buckets, but since it does not grab all entries (only five), if some range buckets exist in the first five entries, it could potentially cause fewer promotion opportunities (eg. if 4 out of 5 were range buckets, it may be able to promote up to one non-range bucket, as opposed to 3.) Also, combined with 1, it means that wrong entries may be preserved, as it didn't correctly keep track of which were entries were skipped. To fix this, PGOMemOPSizeOpt now grabs all the entries (up to the maximum number of value profile buckets), keeps track of which entries were skipped, and preserves all the remaining entries. Differential Revision: https://reviews.llvm.org/D97592	2021-03-11 09:53:05 -08:00
Nikita Popov	332a4fa84c	[IRBuilder] Deprecate CreateLoad APIs with implicit type These APIs are not compatible with opaque pointers. Deprecate them to avoid the introduction of further uses.	2021-03-11 18:46:45 +01:00
Craig Topper	c201bc69e6	[RISCV] Handle vmv.x.s intrinsic for i64 vectors on RV32. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D98372	2021-03-11 09:39:50 -08:00
Craig Topper	2632043921	[RISCV] Support extract_vector_elt for fixed and scalable masked registers. This uses a really simple approach of converting to an i8 vector and extracting. This is probably not the best approach especially if you know the index is constant. Other ideas: -Store to stack temporary using vse1, load as scalar and shift. -Sort of bitcast the vector to a vector of i8, slide down the appropriate 8 bit element, copy to scalar, shift down the correct bit within the 8 bits we extracted. Not exactly sure how to describe such a bitcast from i1 vector to i8 vector within the type system for elements less than 8. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D98310	2021-03-11 09:26:44 -08:00
Craig Topper	0a46c560b5	[ValueTypes][RISCV] Add MVT for v1f16. RISCV makes all fixed vector MVTs with size less than or equal to a command line option legal. This didn't include v1f16 because it was missing but did include v1f32 and v1f64. One test is affected where we did test this type, but it is a horizontal reduction so it is non-sensical. Perhaps we should canonicalize that away somewhere. I'm not sure if we should be making v1 types legal, but this will at least make RISCV consistent across all types. Reviewed By: frasercrmck Differential Revision: https://reviews.llvm.org/D98365	2021-03-11 09:23:18 -08:00
Matt Arsenault	8d21dc7b08	AMDGPU/GlobalISel: Improve private addressing mode matching This enables the look-through-copy to hack around not correctly regbankselecting constants to match the use bank.	2021-03-11 10:23:35 -05:00
Matt Arsenault	d9bf8e78b9	GlobalISel: Fix off by one in finding explicit byval alignment For attribute sets, the return index is at 0, and arguments start at 1. getParamAlignment adds the offset of 1, so we need to convert from attribute index back to IR index.	2021-03-11 10:23:08 -05:00
Matt Arsenault	0d5c3bafbb	AMDGPU/GlobalISel: Add more tests for byval arguments	2021-03-11 10:23:08 -05:00
Joseph Huber	ff5095970f	[OpenMP] Restore backwards compatibility for libomptarget Summary: The changes introduced in D87946 changed the API for libomptarget functions. `__kmpc_push_target_tripcount` was a function in Clang 11.x but was not given a backward-compatible interface. This change will require people using Clang 13.x or 12.x to recompile their offloading programs. Reviewed By: jdoerfert cchen Differential Revision: https://reviews.llvm.org/D98358	2021-03-11 09:52:11 -05:00

1 2 3 4 5 ...

212548 Commits