Mirror of https://github.com/RPCS3/llvm-mirror.git, synced 2024-11-23 11:13:28 +01:00
ffc35aabd8
In non-SPMD mode we create state-machine-like code to identify the parallel region that the GPU worker threads should execute next. The identification uses the parallel region's function pointer because that works even if the kernel (i.e., the target region) and the parallel region are in separate translation units (TUs). However, taking the address of a function has various downsides. This patch identifies the most common situation and replaces the function pointer use with a dummy global symbol that serves identification purposes only: if the parallel region is only called from a single target region (or kernel), it is identified by a new global symbol rather than by its function pointer. Fixes PR46450. Reviewed By: JonChesterfield. Differential Revision: https://reviews.llvm.org/D83271 |
||
---|---|---|
add_attributes.ll | ||
dead_use.ll | ||
deduplication_remarks.ll | ||
deduplication.ll | ||
gpu_kernel_detection_remarks.ll | ||
gpu_state_machine_function_ptr_replacement.ll | ||
icv_remarks.ll | ||
icv_tracking.ll | ||
parallel_deletion_remarks.ll | ||
parallel_deletion.ll | ||
rtf_type_checking.ll |
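
The identification scheme described in the commit message can be illustrated with a small host-side sketch. This is not the code the compiler generates and not the device runtime's API; every name here (`parallel_region_a`, `parallel_region_a_id`, `worker_step`, `last_executed`) is hypothetical and chosen only for illustration. The sketch assumes each parallel region is reached from a single kernel, so a dummy global's address can stand in for the region's function pointer when the worker decides what to run.

```cpp
#include <cassert>

// Records which region ran last; used only to observe the dispatch.
int last_executed = -1;

// Hypothetical outlined parallel regions.
void parallel_region_a() { last_executed = 0; }
void parallel_region_b() { last_executed = 1; }

// Dummy globals that exist only to provide unique addresses.
// Their values are never read; only their identity matters.
char parallel_region_a_id;
char parallel_region_b_id;

// One step of a simplified worker state machine.
// The worker receives an opaque ID and compares it against the known
// dummy globals instead of against function addresses, then calls the
// matching region directly. Returns false if the ID is unknown; a real
// implementation would need some fallback (for example an indirect
// call), which this sketch leaves out.
bool worker_step(const void *work_id) {
  if (work_id == &parallel_region_a_id) {
    parallel_region_a();
    return true;
  }
  if (work_id == &parallel_region_b_id) {
    parallel_region_b();
    return true;
  }
  return false;
}
```

Calling `worker_step(&parallel_region_a_id)` runs `parallel_region_a` through a direct call, and no function address is taken anywhere in the dispatch, which is the property the patch aims for in the single-kernel case.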