mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00
llvm-mirror/lib/Target/AMDGPU
Stanislav Mekhanoshin 919b6f1d38 [AMDGPU] Fix high occupancy calculation and print it
We had a couple of places which still returned 10 as the maximum
occupancy. Fixed.

Also print a comment about occupancy as the compiler sees it.

Differential Revision: https://reviews.llvm.org/D65423

llvm-svn: 367381
2019-07-31 01:07:10 +00:00
..
AsmParser [AMDGPU][MC][GFX10] Enabled GFX10 assembly with arbitrary wavesize assumed by the code 2019-07-24 16:50:17 +00:00
Disassembler [AMDGPU] gfx908 mAI instructions, MC part 2019-07-09 21:43:09 +00:00
MCTargetDesc [AMDGPU] Increase kernel padding 2019-07-24 19:40:13 +00:00
TargetInfo Revert CMake: Make most target symbols hidden by default 2019-06-11 03:21:13 +00:00
Utils [AMDGPU] Fixed occupancy calculation for gfx10 2019-07-19 21:29:51 +00:00
AMDGPU.h AMDGPU: Add pass to lower SGPR spills 2019-07-03 23:32:29 +00:00
AMDGPU.td [AMDGPU] gfx908 v_pk_fmac_f16 support 2019-07-09 22:42:24 +00:00
AMDGPUAliasAnalysis.cpp AMDGPU: Improve alias analysis for GDS 2019-07-17 11:22:19 +00:00
AMDGPUAliasAnalysis.h [AliasAnalysis] Second prototype to cache BasicAA / anyAA state. 2019-03-22 17:22:19 +00:00
AMDGPUAlwaysInlinePass.cpp
AMDGPUAnnotateKernelFeatures.cpp AMDGPU: Handle "uniform-work-group-size" attribute (fix for RADV) 2019-03-07 00:54:04 +00:00
AMDGPUAnnotateUniformValues.cpp
AMDGPUArgumentUsageInfo.cpp [AMDGPU] Packed thread ids in function call ABI 2019-06-28 01:52:13 +00:00
AMDGPUArgumentUsageInfo.h AMDGPU: Convert some places to Register 2019-07-01 13:44:46 +00:00
AMDGPUAsmPrinter.cpp [AMDGPU] Fix high occupancy calculation and print it 2019-07-31 01:07:10 +00:00
AMDGPUAsmPrinter.h [AMDGPU] Fixed +DumpCode 2019-05-14 16:17:14 +00:00
AMDGPUAtomicOptimizer.cpp [DivergenceAnalysis] Add methods for querying divergence at use 2019-07-29 10:22:09 +00:00
AMDGPUCallingConv.td AMDGPU: Decompose all values to 32-bit pieces for calling conventions 2019-07-19 13:57:44 +00:00
AMDGPUCallLowering.cpp [AMDGPU] Fix typo. 2019-07-26 17:13:59 +00:00
AMDGPUCallLowering.h AMDGPU/GlobalISel: Handle most function return types 2019-07-26 02:36:05 +00:00
AMDGPUCodeGenPrepare.cpp AMDGPU: Add 24-bit mul intrinsics 2019-07-15 17:50:31 +00:00
AMDGPUFeatures.td AMDGPU: Fix names for generation features 2019-04-03 00:01:03 +00:00
AMDGPUFixFunctionBitcasts.cpp
AMDGPUFrameLowering.cpp
AMDGPUFrameLowering.h
AMDGPUGenRegisterBankInfo.def AMDGPU/GlobalISel: Add support for wide loads >= 256-bits 2019-07-10 00:22:41 +00:00
AMDGPUGISel.td AMDGPU/GlobalISel: Select G_ASHR 2019-07-16 20:31:25 +00:00
AMDGPUHSAMetadataStreamer.cpp [AMDGPU] Added a new metadata for multi grid sync implicit argument 2019-07-05 16:05:17 +00:00
AMDGPUHSAMetadataStreamer.h [AMDGPU] Switched HSA metadata to use MsgPackDocument 2019-03-13 18:55:50 +00:00
AMDGPUInline.cpp [AMDGPU] Tune inlining parameters for AMDGPU target 2019-07-17 16:51:29 +00:00
AMDGPUInstrInfo.cpp
AMDGPUInstrInfo.h
AMDGPUInstrInfo.td [AMDGPU] gfx1010 core wave32 changes 2019-06-20 15:08:34 +00:00
AMDGPUInstructions.td TableGen: Add MinAlignment predicate 2019-07-31 00:14:43 +00:00
AMDGPUInstructionSelector.cpp AMDGPU/GlobalISel: Don't assume instruction can be erased when selecting exts 2019-07-24 16:05:53 +00:00
AMDGPUInstructionSelector.h AMDGPU/GlobalISel: Select private loads 2019-07-16 19:22:21 +00:00
AMDGPUISelDAGToDAG.cpp [AMDGPU] Move WQM/WWM intrinsic instruction selection to AMDGPUISelDAGToDAG 2019-07-26 13:11:44 +00:00
AMDGPUISelLowering.cpp Fix MSVC warning about extending a uint32_t shift result to uint64_t. NFCI. 2019-07-23 14:04:54 +00:00
AMDGPUISelLowering.h [AMDGPU] gfx908 atomic fadd and atomic pk_fadd 2019-07-11 00:10:17 +00:00
AMDGPULegalizerInfo.cpp [AMDGPU/GlobalISel] Add llvm.amdgcn.fdiv.fast legalization. 2019-07-30 18:49:16 +00:00
AMDGPULegalizerInfo.h [AMDGPU/GlobalISel] Add llvm.amdgcn.fdiv.fast legalization. 2019-07-30 18:49:16 +00:00
AMDGPULibCalls.cpp Fix missing use of defined() in include guard 2019-07-12 20:12:15 +00:00
AMDGPULibFunc.cpp Delay initialization of three static global maps, NFC 2019-03-28 17:33:41 +00:00
AMDGPULibFunc.h [opaque pointer types] Add a FunctionCallee wrapper type, and use it. 2019-02-01 02:28:03 +00:00
AMDGPULowerIntrinsics.cpp
AMDGPULowerKernelArguments.cpp AMDGPU: Consolidate some getGeneration checks 2019-06-19 23:54:58 +00:00
AMDGPULowerKernelAttributes.cpp
AMDGPUMachineCFGStructurizer.cpp
AMDGPUMachineFunction.cpp AMDGPU: Make AMDGPUPerfHintAnalysis an SCC pass 2019-07-05 20:26:13 +00:00
AMDGPUMachineFunction.h
AMDGPUMachineModuleInfo.cpp AMDGPU: Add support for cross address space synchronization scopes 2019-03-25 20:50:21 +00:00
AMDGPUMachineModuleInfo.h AMDGPU: Add support for cross address space synchronization scopes 2019-03-25 20:50:21 +00:00
AMDGPUMacroFusion.cpp
AMDGPUMacroFusion.h
AMDGPUMCInstLower.cpp AMDGPU: Prepare for explicit absolute relocations in code generation 2019-06-16 17:43:37 +00:00
AMDGPUOpenCLEnqueuedBlockLowering.cpp Fix parameter name comments using clang-tidy. NFC. 2019-07-16 04:46:31 +00:00
AMDGPUPerfHintAnalysis.cpp AMDGPU: Fix assert in clang test 2019-07-05 21:09:53 +00:00
AMDGPUPerfHintAnalysis.h AMDGPU: Make AMDGPUPerfHintAnalysis an SCC pass 2019-07-05 20:26:13 +00:00
AMDGPUPromoteAlloca.cpp AMDGPU: Fix iterator crash in AMDGPUPromoteAlloca 2019-06-18 12:23:44 +00:00
AMDGPUPropagateAttributes.cpp AMDGPU: Move DEBUG_TYPE definition below includes 2019-07-08 18:48:39 +00:00
AMDGPUPTNote.h
AMDGPURegisterBankInfo.cpp [AMDGPU/GlobalISel] Add llvm.amdgcn.fdiv.fast legalization. 2019-07-30 18:49:16 +00:00
AMDGPURegisterBankInfo.h AMDGPU/GlobalISel: Add support for wide loads >= 256-bits 2019-07-10 00:22:41 +00:00
AMDGPURegisterBanks.td AMDGPU: Select G_SEXT/G_ZEXT/G_ANYEXT 2019-06-25 13:18:11 +00:00
AMDGPURegisterInfo.cpp [AMDGPU] gfx908 mAI instructions, MC part 2019-07-09 21:43:09 +00:00
AMDGPURegisterInfo.h
AMDGPURegisterInfo.td [AMDGPU] gfx908 register file changes 2019-07-09 19:41:51 +00:00
AMDGPURewriteOutArguments.cpp
AMDGPUSearchableTables.td [AMDGPU] gfx908 mfma support 2019-07-11 21:19:33 +00:00
AMDGPUSubtarget.cpp [AMDGPU] Fix high occupancy calculation and print it 2019-07-31 01:07:10 +00:00
AMDGPUSubtarget.h [AMDGPU] Fix high occupancy calculation and print it 2019-07-31 01:07:10 +00:00
AMDGPUTargetMachine.cpp [AMDGPU] Run unreachable-mbb-elimination after isel to clean up PHIs. 2019-07-25 14:50:18 +00:00
AMDGPUTargetMachine.h MIR: Allow targets to serialize MachineFunctionInfo 2019-03-14 22:54:43 +00:00
AMDGPUTargetObjectFile.cpp
AMDGPUTargetObjectFile.h
AMDGPUTargetTransformInfo.cpp AMDGPU: Support GDS atomics 2019-07-01 17:17:45 +00:00
AMDGPUTargetTransformInfo.h [AMDGPU] Tune inlining parameters for AMDGPU target 2019-07-17 16:51:29 +00:00
AMDGPUUnifyDivergentExitNodes.cpp Update phis in AMDGPUUnifyDivergentExitNodes 2019-06-25 18:55:16 +00:00
AMDGPUUnifyMetadata.cpp
AMDILCFGStructurizer.cpp
AMDKernelCodeT.h [AMDGPU] gfx1010 wave32 metadata 2019-06-17 16:48:56 +00:00
BUFInstructions.td AMDGPU/GlobalISel: Fix selection of private stores 2019-07-16 19:27:44 +00:00
CaymanInstructions.td
CMakeLists.txt [AMDGPU] Autogenerate register asm names 2019-07-16 23:44:21 +00:00
DSInstructions.td AMDGPU: Don't use SDNodeXForm for DS offset output 2019-07-22 21:38:11 +00:00
EvergreenInstructions.td AMDGPU: Avoid code predicates for extload PatFrags 2019-07-16 02:46:05 +00:00
FLATInstructions.td AMDGPU: Add register classes to flat store patterns 2019-07-16 18:26:42 +00:00
GCNDPPCombine.cpp [AMDGPU] Fix DPP combiner check for exec modification 2019-07-12 15:59:40 +00:00
GCNHazardRecognizer.cpp AMDGPU/GFX10: Apply the VMEM-to-scalar-write hazard also to writes to EXEC 2019-07-17 11:22:57 +00:00
GCNHazardRecognizer.h [AMDGPU] gfx908 hazard recognizer 2019-07-11 21:30:34 +00:00
GCNILPSched.cpp
GCNIterativeScheduler.cpp
GCNIterativeScheduler.h
GCNMinRegStrategy.cpp
GCNNSAReassign.cpp AMDGPU: Check MRI for callee saved regs instead of TRI 2019-06-26 13:39:29 +00:00
GCNProcessors.td [AMDGPU] gfx908 target 2019-07-09 18:10:06 +00:00
GCNRegBankReassign.cpp [AMDGPU] gfx908 mfma support 2019-07-11 21:19:33 +00:00
GCNRegPressure.cpp [AMDGPU] Print register pressure for agpr and vgpr separately 2019-07-30 20:45:15 +00:00
GCNRegPressure.h [AMDGPU] gfx908 mfma support 2019-07-11 21:19:33 +00:00
GCNSchedStrategy.cpp [AMDGPU] Speed up live-in virtual register set computation in GCNScheduleDAGMILive. 2019-06-18 11:43:17 +00:00
GCNSchedStrategy.h [AMDGPU] Speed up live-in virtual register set computation in GCNScheduleDAGMILive. 2019-06-18 11:43:17 +00:00
LLVMBuild.txt [AMDGPU] Move InstPrinter files to MCTargetDesc. NFC 2019-05-11 00:03:35 +00:00
MIMGInstructions.td [AMDGPU] Extend MIMG opcode to 8 bits 2019-07-12 18:38:06 +00:00
R600.td
R600AsmPrinter.cpp
R600AsmPrinter.h
R600ClauseMergePass.cpp
R600ControlFlowFinalizer.cpp
R600Defines.h
R600EmitClauseMarkers.cpp
R600ExpandSpecialInstrs.cpp
R600FrameLowering.cpp
R600FrameLowering.h
R600InstrFormats.td
R600InstrInfo.cpp R600InstrInfo.cpp - Add getTransSwizzle assert for the swizzle op index. NFCI. 2019-05-08 10:39:56 +00:00
R600InstrInfo.h
R600Instructions.td AMDGPU: Redefine load PatFrags 2019-07-16 17:38:50 +00:00
R600ISelLowering.cpp [TargetLowering] Add MachineMemOperand::Flags to allowsMemoryAccess tests (PR42123) 2019-06-12 17:14:03 +00:00
R600ISelLowering.h [TargetLowering] Add MachineMemOperand::Flags to allowsMemoryAccess tests (PR42123) 2019-06-12 17:14:03 +00:00
R600MachineFunctionInfo.cpp
R600MachineFunctionInfo.h
R600MachineScheduler.cpp
R600MachineScheduler.h
R600OpenCLImageTypeLoweringPass.cpp
R600OptimizeVectorRegisters.cpp R600: Fix unconditional return in loop 2019-05-20 16:22:11 +00:00
R600Packetizer.cpp CodeGen: Introduce a class for registers 2019-06-24 15:50:29 +00:00
R600Processors.td AMDGPU: Fix names for generation features 2019-04-03 00:01:03 +00:00
R600RegisterInfo.cpp CodeGen: Introduce a class for registers 2019-06-24 15:50:29 +00:00
R600RegisterInfo.h CodeGen: Introduce a class for registers 2019-06-24 15:50:29 +00:00
R600RegisterInfo.td
R600Schedule.td
R700Instructions.td
SIAddIMGInit.cpp
SIAnnotateControlFlow.cpp [AMDGPU] gfx1010 wave32 icmp/fcmp intrinsic changes for wave32 2019-06-13 23:47:36 +00:00
SIDefines.h [AMDGPU][MC][GFX9][GFX10] Added support of GET_DOORBELL message 2019-07-15 15:12:16 +00:00
SIFixSGPRCopies.cpp [AMDGPU] Add llvm.amdgcn.softwqm intrinsic 2019-07-26 09:54:12 +00:00
SIFixupVectorISel.cpp [AMDGPU] gfx1010 VMEM and SMEM implementation 2019-04-30 22:08:23 +00:00
SIFixVGPRCopies.cpp
SIFoldOperands.cpp [AMDGPU] Fix DPP combiner check for exec modification 2019-07-12 15:59:40 +00:00
SIFormMemoryClauses.cpp [AMDGPU] Added target-specific attribute amdgpu-max-memory-clause 2019-05-30 18:46:34 +00:00
SIFrameLowering.cpp [AMDGPU] Add the adjusted FP as a livein register. 2019-07-16 15:57:12 +00:00
SIFrameLowering.h [AMDGPU] Add the adjusted FP as a livein register. 2019-07-16 15:57:12 +00:00
SIInsertSkips.cpp [AMDGPU] gfx10 conditional registers handling 2019-06-16 17:13:09 +00:00
SIInsertWaitcnts.cpp [AMDGPU] gfx908 mfma support 2019-07-11 21:19:33 +00:00
SIInstrFormats.td [AMDGPU] Extend MIMG opcode to 8 bits 2019-07-12 18:38:06 +00:00
SIInstrInfo.cpp [AMDGPU] Fix typo in error message 2019-07-29 16:17:13 +00:00
SIInstrInfo.h AMDGPU/GlobalISel: Select flat loads 2019-07-16 18:05:29 +00:00
SIInstrInfo.td TableGen: Add MinAlignment predicate 2019-07-31 00:14:43 +00:00
SIInstructions.td [AMDGPU] Add llvm.amdgcn.softwqm intrinsic 2019-07-26 09:54:12 +00:00
SIISelLowering.cpp [AMDGPU] Reserve all AGPRs on targets which do not have them 2019-07-30 19:29:33 +00:00
SIISelLowering.h [AMDGPU] Enable v4f16 and above for v_pk_fma instructions 2019-07-29 08:15:10 +00:00
SILoadStoreOptimizer.cpp AMDGPU/LoadStoreOptimizer: combine MMOs when merging instructions 2019-07-29 16:40:58 +00:00
SILowerControlFlow.cpp CodeGen: Introduce a class for registers 2019-06-24 15:50:29 +00:00
SILowerI1Copies.cpp AMDGPU: Make fixing i1 copies robust against re-ordering 2019-06-27 16:56:44 +00:00
SILowerSGPRSpills.cpp Remove set but unused variable. 2019-07-15 06:35:28 +00:00
SIMachineFunctionInfo.cpp [AMDGPU] Fix high occupancy calculation and print it 2019-07-31 01:07:10 +00:00
SIMachineFunctionInfo.h [AMDGPU] gfx908 agpr spilling 2019-07-11 21:54:13 +00:00
SIMachineScheduler.cpp [CodeGen] Add "const" to MachineInstr::mayAlias 2019-04-19 09:08:38 +00:00
SIMachineScheduler.h
SIMemoryLegalizer.cpp Delete dead stores 2019-07-12 14:58:15 +00:00
SIModeRegister.cpp [SIMode] Fix typo in Status constructor 2019-05-08 10:24:22 +00:00
SIOptimizeExecMasking.cpp [AMDGPU] gfx908 mfma support 2019-07-11 21:19:33 +00:00
SIOptimizeExecMaskingPreRA.cpp [AMDGPU] gfx10 conditional registers handling 2019-06-16 17:13:09 +00:00
SIPeepholeSDWA.cpp [AMDGPU] gfx10 conditional registers handling 2019-06-16 17:13:09 +00:00
SIPreAllocateWWMRegs.cpp [AMDGPU] Pre-allocate WWM registers to reduce VGPR pressure. 2019-04-01 15:19:52 +00:00
SIProgramInfo.h [AMDGPU] Fix high occupancy calculation and print it 2019-07-31 01:07:10 +00:00
SIRegisterInfo.cpp [AMDGPU] Reserve all AGPRs on targets which do not have them 2019-07-30 19:29:33 +00:00
SIRegisterInfo.h [AMDGPU] gfx908 mfma support 2019-07-11 21:19:33 +00:00
SIRegisterInfo.td [AMDGPU] Autogenerate register sequences in tuples 2019-07-19 21:43:42 +00:00
SISchedule.td [AMDGPU] gfx908 scheduling 2019-07-11 21:25:00 +00:00
SIShrinkInstructions.cpp AMDGPU: Write LDS objects out as global symbols in code generation 2019-06-25 11:52:30 +00:00
SIWholeQuadMode.cpp [AMDGPU] Add llvm.amdgcn.softwqm intrinsic 2019-07-26 09:54:12 +00:00
SMInstructions.td [AMDGPU] Always use s_memtime for readcyclecounter 2019-07-09 03:10:18 +00:00
SOPInstructions.td AMDGPU: Redefine setcc condition PatLeafs 2019-07-19 20:24:40 +00:00
VIInstrFormats.td
VIInstructions.td
VOP1Instructions.td [AMDGPU] Allow any value in unused src0 field in v_nop 2019-06-24 17:35:20 +00:00
VOP2Instructions.td AMDGPU/GlobalISel: Select G_ASHR 2019-07-16 20:31:25 +00:00
VOP3Instructions.td AMDGPU/GlobalISel: Select G_ASHR 2019-07-16 20:31:25 +00:00
VOP3PInstructions.td [AMDGPU] gfx908 mAI instructions, MC part 2019-07-09 21:43:09 +00:00
VOPCInstructions.td AMDGPU: Redefine setcc condition PatLeafs 2019-07-19 20:24:40 +00:00
VOPInstructions.td [AMDGPU] gfx908 mAI instructions, MC part 2019-07-09 21:43:09 +00:00