llvm-mirror

mirror of https://github.com/RPCS3/llvm-mirror.git synced 2024-10-19 11:02:59 +02:00

History

Nicolai Haehnle bfa5e9829a AMDGPU: Fix various issues around the VirtReg2Value mapping Summary: The VirtReg2Value mapping is crucial for getting consistently reliable divergence information into the SelectionDAG. This patch fixes a bunch of issues that lead to incorrect divergence info and introduces tight assertions to ensure we don't regress: 1. VirtReg2Value is generated lazily; there were some cases where a lookup was performed before all relevant virtual registers were created, leading to an out-of-sync mapping. Those cases were: - Complex code to lower formal arguments that generated CopyFromReg nodes from live-in registers (fixed by never querying the mapping for live-in registers). - Code that generates CopyToReg for formal arguments that are used outside the entry basic block (fixed by never querying the mapping for Register nodes, which don't need the divergence info anyway). 2. For complex values that are lowered to a sequence of registers, all registers must be reflected in the VirtReg2Value mapping. I am not adding any new tests, since I'm not actually aware of any bugs that these problems are causing with trunk as-is. However, I recently added a test case (in r346423) which fails when D53283 is applied without this change. Also, the new assertions should provide most of the effective test coverage. There is one test change in sdwa-peephole.ll. The underlying issue is that since the divergence info is now correct, the DAGISel will select V_OR_B32 directly instead of S_OR_B32. This leads to an extra COPY which affects the behavior of MachineLICM in a way that ends up with the S_MOV_B32 with the constant in a different basic block than the V_OR_B32, which is presumably what defeats the peephole. Reviewers: alex-t, arsenm, rampitec Subscribers: kzhuravl, jvesely, wdng, yaxunl, dstuttard, tpr, t-tye, llvm-commits Differential Revision: https://reviews.llvm.org/D54340 llvm-svn: 348049		2018-11-30 22:55:29 +00:00
..
AsmParser	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
Disassembler	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
InstPrinter	AMDGPU: Re-apply r341982 after fixing the layering issue	2018-09-12 18:50:47 +00:00
MCTargetDesc	AMDGPU: Add sram-ecc feature	2018-11-05 22:44:19 +00:00
TargetInfo
Utils	Revert r347871 "Fix: Add support for TFE/LWE in image intrinsic"	2018-11-29 20:14:17 +00:00
AMDGPU.h	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
AMDGPU.td	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
AMDGPUAliasAnalysis.cpp	Allow subclassing ExternalAA	2018-11-07 20:26:42 +00:00
AMDGPUAliasAnalysis.h	Allow subclassing ExternalAA	2018-11-07 20:26:42 +00:00
AMDGPUAlwaysInlinePass.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUAnnotateKernelFeatures.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUAnnotateUniformValues.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUArgumentUsageInfo.cpp	AMDGPU/AMDHSA: Remove GridWorkGroupCountX/Y/Z	2018-06-21 18:36:04 +00:00
AMDGPUArgumentUsageInfo.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
AMDGPUAsmPrinter.cpp	[AMDGPU] Derive GCNSubtarget from MF to get overridden target features	2018-11-19 15:44:20 +00:00
AMDGPUAsmPrinter.h	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUAtomicOptimizer.cpp	[AMDGPU] Fix the new atomic optimizer in pixel shaders.	2018-11-05 12:04:48 +00:00
AMDGPUCallingConv.td	AMDGPU: Partially fix handling of packed amdgpu_ps arguments	2018-08-01 19:57:34 +00:00
AMDGPUCallLowering.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUCallLowering.h	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUCodeGenPrepare.cpp	[IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle.	2018-10-08 10:32:33 +00:00
AMDGPUFeatures.td	AMDGPU: Allow fp32-denormals feature for r600 targets	2018-08-01 15:04:36 +00:00
AMDGPUFixFunctionBitcasts.cpp	[AMDGPU] Add a pass to promote bitcast calls	2018-10-26 13:18:36 +00:00
AMDGPUFrameLowering.cpp
AMDGPUFrameLowering.h
AMDGPUGenRegisterBankInfo.def	AMDGPU/GlobalISel: Fix crash in regbankselect on non-power-of-2 types	2018-07-27 06:04:40 +00:00
AMDGPUGISel.td	AMDGPU/GlobalISel: Select amdgcn.cvt.pkrtz to 64-bit instructions	2018-10-08 17:49:29 +00:00
AMDGPUHSAMetadataStreamer.cpp	AMDGPU: Don't abort on unknown addrspace argument	2018-09-10 02:23:30 +00:00
AMDGPUHSAMetadataStreamer.h	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUInline.cpp	fix typos aggressively; NFC	2018-11-07 14:35:36 +00:00
AMDGPUInstrInfo.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
AMDGPUInstrInfo.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
AMDGPUInstrInfo.td	AMDGPU: Remove PHI loop condition optimization	2018-10-31 13:26:48 +00:00
AMDGPUInstructions.td	[AMDGPU] Add and update scalar instructions	2018-11-29 16:05:38 +00:00
AMDGPUInstructionSelector.cpp	Revert "AMDGPU/GlobalISel: Implement select for G_INSERT"	2018-10-11 23:36:46 +00:00
AMDGPUInstructionSelector.h	Revert "AMDGPU/GlobalISel: Implement select for G_INSERT"	2018-10-11 23:36:46 +00:00
AMDGPUIntrinsicInfo.cpp	[AMDGPU] Update includes for intrinsic changes :(	2018-06-23 03:05:39 +00:00
AMDGPUIntrinsicInfo.h	[AMDGPU] Update includes for intrinsic changes :(	2018-06-23 03:05:39 +00:00
AMDGPUIntrinsics.td	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
AMDGPUISelDAGToDAG.cpp	AMDGPU: Avoid selecting ds_{read,write}2_b32 on SI	2018-10-17 15:37:48 +00:00
AMDGPUISelLowering.cpp	[x86] allow vector load narrowing with multi-use values	2018-11-10 20:05:31 +00:00
AMDGPUISelLowering.h	DAG: Change behavior of fminnum/fmaxnum nodes	2018-10-22 16:27:27 +00:00
AMDGPULegalizerInfo.cpp	AMDGPU/GlobalISel: Add support for G_INTTOPTR	2018-10-05 04:34:09 +00:00
AMDGPULegalizerInfo.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
AMDGPULibCalls.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPULibFunc.cpp	[cxx2a] Fix warning triggered by r343285	2018-09-29 02:17:12 +00:00
AMDGPULibFunc.h	AMDGPU: Fix missing C++ mode comment	2018-06-20 19:45:40 +00:00
AMDGPULowerIntrinsics.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
AMDGPULowerKernelArguments.cpp	[IRBuilder] Fixup CreateIntrinsic to allow specifying Types to Mangle.	2018-10-08 10:32:33 +00:00
AMDGPULowerKernelAttributes.cpp	AMDGPU: Add pass to optimize reqd_work_group_size	2018-05-18 21:35:00 +00:00
AMDGPUMachineCFGStructurizer.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
AMDGPUMachineFunction.cpp	Reapply "AMDGPU: Fix handling of alignment padding in DAG argument lowering"	2018-07-20 09:05:08 +00:00
AMDGPUMachineFunction.h	Reapply "AMDGPU: Fix handling of alignment padding in DAG argument lowering"	2018-07-20 09:05:08 +00:00
AMDGPUMachineModuleInfo.cpp
AMDGPUMachineModuleInfo.h
AMDGPUMacroFusion.cpp	[AMDGPU] Always pass TRI into findRegister[Use/Def]OperandIdx	2018-11-09 17:58:59 +00:00
AMDGPUMacroFusion.h
AMDGPUMCInstLower.cpp	AMDGPU: Fix getInstSizeInBytes	2018-08-29 07:46:09 +00:00
AMDGPUOpenCLEnqueuedBlockLowering.cpp	[AMDGPU] Change enqueue kernel handle type	2018-06-13 17:31:51 +00:00
AMDGPUPerfHintAnalysis.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUPerfHintAnalysis.h	Fix -Winconsistent-missing-overrides in AMDGPU code	2018-05-25 17:46:24 +00:00
AMDGPUPromoteAlloca.cpp	[AMDGPU] Extend promote alloca vectorization	2018-11-08 00:16:23 +00:00
AMDGPUPTNote.h
AMDGPURegAsmNames.inc.cpp
AMDGPURegisterBankInfo.cpp	[AMDGPU] NFC Test commit	2018-11-16 00:46:51 +00:00
AMDGPURegisterBankInfo.h
AMDGPURegisterBanks.td
AMDGPURegisterInfo.cpp	AMDGPU: Remove #include "MCTargetDesc/AMDGPUMCTargetDesc.h" from common headers	2018-05-22 02:03:23 +00:00
AMDGPURegisterInfo.h	[TargetRegisterInfo] Remove temporary hook enableMultipleCopyHints()	2018-10-05 14:23:11 +00:00
AMDGPURegisterInfo.td	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
AMDGPURewriteOutArguments.cpp	Rename DEBUG macro to LLVM_DEBUG.	2018-05-14 12:53:11 +00:00
AMDGPUSearchableTables.td	AMDGPU: Remove old-style image intrinsics	2018-06-21 13:37:45 +00:00
AMDGPUSubtarget.cpp	Revert r347871 "Fix: Add support for TFE/LWE in image intrinsic"	2018-11-29 20:14:17 +00:00
AMDGPUSubtarget.h	Revert r347871 "Fix: Add support for TFE/LWE in image intrinsic"	2018-11-29 20:14:17 +00:00
AMDGPUTargetMachine.cpp	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
AMDGPUTargetMachine.h	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUTargetObjectFile.cpp
AMDGPUTargetObjectFile.h
AMDGPUTargetTransformInfo.cpp	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUTargetTransformInfo.h	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
AMDGPUUnifyDivergentExitNodes.cpp	[NFC] Rename the DivergenceAnalysis to LegacyDivergenceAnalysis	2018-08-30 14:21:36 +00:00
AMDGPUUnifyMetadata.cpp
AMDILCFGStructurizer.cpp	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
AMDKernelCodeT.h
BUFInstructions.td	AMDGPU/NFC: Split MUBUF_Pseudo_Atomics into RTN/NO_RTN multiclasses	2018-11-07 21:21:32 +00:00
CaymanInstructions.td
CMakeLists.txt	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
DSInstructions.td	AMDGPU: Avoid selecting ds_{read,write}2_b32 on SI	2018-10-17 15:37:48 +00:00
EvergreenInstructions.td	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
FLATInstructions.td	[AMDGPU] Add FixupVectorISel pass, currently Supports SREGs in GLOBAL LD/ST	2018-11-16 01:13:34 +00:00
GCNDPPCombine.cpp	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
GCNHazardRecognizer.cpp	[AMDGPU] Prevent sequences of non-instructions disrupting GCNHazardRecognizer wait state counting	2018-09-10 10:14:48 +00:00
GCNHazardRecognizer.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
GCNILPSched.cpp	ScheduleDAG: Cleanup dumping code; NFC	2018-09-19 00:23:35 +00:00
GCNIterativeScheduler.cpp	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...)	2018-09-27 02:13:45 +00:00
GCNIterativeScheduler.h
GCNMinRegStrategy.cpp	ScheduleDAG: Cleanup dumping code; NFC	2018-09-19 00:23:35 +00:00
GCNProcessors.td	[AMDGPU] Defined gfx909 Raven Ridge 2	2018-10-24 08:14:07 +00:00
GCNRegPressure.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
GCNRegPressure.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
GCNSchedStrategy.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
GCNSchedStrategy.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
LLVMBuild.txt
MIMGInstructions.td	Revert r347871 "Fix: Add support for TFE/LWE in image intrinsic"	2018-11-29 20:14:17 +00:00
R600.td	Reapply "AMDGPU: Fix handling of alignment padding in DAG argument lowering"	2018-07-20 09:05:08 +00:00
R600AsmPrinter.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
R600AsmPrinter.h	AMDGPU: Split R600 AsmPrinter code into its own class	2018-05-24 20:02:01 +00:00
R600ClauseMergePass.cpp	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600ControlFlowFinalizer.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
R600Defines.h
R600EmitClauseMarkers.cpp	[AMDGPU] Always pass TRI into findRegister[Use/Def]OperandIdx	2018-11-09 17:58:59 +00:00
R600ExpandSpecialInstrs.cpp	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600FrameLowering.cpp
R600FrameLowering.h
R600InstrFormats.td	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
R600InstrInfo.cpp	[AMDGPU] Always pass TRI into findRegister[Use/Def]OperandIdx	2018-11-09 17:58:59 +00:00
R600InstrInfo.h	[PSV] Update API to be able to use TargetCustom without UB.	2018-08-20 19:23:45 +00:00
R600Instructions.td	AMDGPU: Remove remnants of old address space mapping	2018-08-31 05:49:54 +00:00
R600ISelLowering.cpp	[SelectionDAG] Add FoldBUILD_VECTOR to simplify new BUILD_VECTOR nodes	2018-10-30 10:32:11 +00:00
R600ISelLowering.h	AMDGPU/R600: Convert kernel param loads to use PARAM_I_ADDRESS	2018-08-01 18:36:07 +00:00
R600MachineFunctionInfo.cpp
R600MachineFunctionInfo.h
R600MachineScheduler.cpp	Fix clang -Wimplicit-fallthrough warnings across llvm, NFC	2018-11-01 19:54:45 +00:00
R600MachineScheduler.h
R600OpenCLImageTypeLoweringPass.cpp
R600OptimizeVectorRegisters.cpp	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600Packetizer.cpp	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600Processors.td	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600RegisterInfo.cpp	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600RegisterInfo.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
R600RegisterInfo.td	AMDGPU: Separate R600 and GCN TableGen files	2018-06-28 23:47:12 +00:00
R600Schedule.td
R700Instructions.td	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIAnnotateControlFlow.cpp	AMDGPU: Remove PHI loop condition optimization	2018-10-31 13:26:48 +00:00
SIDebuggerInsertNops.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIDefines.h	AMDGPU: Turn D16 for MIMG instructions into a regular operand	2018-06-21 13:36:01 +00:00
SIFixSGPRCopies.cpp	AMDGPU: Rewrite SILowerI1Copies to always stay on SALU	2018-10-31 13:27:08 +00:00
SIFixupVectorISel.cpp	[AMDGPU] Disable SReg Global LD/ST, perf regression	2018-11-30 18:29:17 +00:00
SIFixVGPRCopies.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIFixWWMLiveness.cpp	[AMDGPU] Reworked SIFixWWMLiveness	2018-08-02 23:31:32 +00:00
SIFoldOperands.cpp	[AMDGPU] Fold copy (copy vgpr)	2018-09-27 18:55:20 +00:00
SIFormMemoryClauses.cpp	llvm::sort(C.begin(), C.end(), ...) -> llvm::sort(C, ...)	2018-09-27 02:13:45 +00:00
SIFrameLowering.cpp	AMDGPU: Rename isAmdCodeObjectV2 -> isAmdHsaOrMesa	2018-10-04 21:02:16 +00:00
SIFrameLowering.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIInsertSkips.cpp	AMDGPU: Fix analyzeBranch failing with pseudoterminators	2018-11-16 05:03:02 +00:00
SIInsertWaitcnts.cpp	AMDGPU/InsertWaitcnts: Remove the dependence on MachineLoopInfo	2018-11-29 11:06:26 +00:00
SIInstrFormats.td	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIInstrInfo.cpp	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
SIInstrInfo.h	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
SIInstrInfo.td	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
SIInstructions.td	[AMDGPU] Restored selection of scalar_to_vector (v2x16)	2018-11-19 19:58:13 +00:00
SIIntrinsics.td
SIISelLowering.cpp	AMDGPU: Fix various issues around the VirtReg2Value mapping	2018-11-30 22:55:29 +00:00
SIISelLowering.h	[AMDGPU] Convert insert_vector_elt into set of selects	2018-11-19 17:39:20 +00:00
SILoadStoreOptimizer.cpp	[AMDGPU] Fix ds combine with subregs	2018-09-25 23:33:18 +00:00
SILowerControlFlow.cpp	AMDGPU: Remove PHI loop condition optimization	2018-10-31 13:26:48 +00:00
SILowerI1Copies.cpp	AMDGPU: Rewrite SILowerI1Copies to always stay on SALU	2018-10-31 13:27:08 +00:00
SIMachineFunctionInfo.cpp	[AMDGPU] Remove FeatureVGPRSpilling	2018-10-31 18:54:06 +00:00
SIMachineFunctionInfo.h	AMDGPU/AMDHSA: Remove GridWorkGroupCountX/Y/Z	2018-06-21 18:36:04 +00:00
SIMachineScheduler.cpp	[CodeGen][NFC] Make `TII::getMemOpBaseImmOfs` return a base operand	2018-11-28 12:00:20 +00:00
SIMachineScheduler.h
SIMemoryLegalizer.cpp	AMDGPU: Re-apply r341982 after fixing the layering issue	2018-09-12 18:50:47 +00:00
SIOptimizeExecMasking.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIOptimizeExecMaskingPreRA.cpp	AMDGPU: Don't delete instructions if S_ENDPGM has implicit uses	2018-08-28 18:55:55 +00:00
SIPeepholeSDWA.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIProgramInfo.h	[AMDGPU] Refactor HSAMetadataStream::emitKernel (NFC)	2018-07-10 17:31:32 +00:00
SIRegisterInfo.cpp	AMDGPU: Only add implicit super-reg def for first subreg	2018-11-26 17:02:01 +00:00
SIRegisterInfo.h	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SIRegisterInfo.td	AMDGPU: Make v4i16/v4f16 legal	2018-06-15 15:15:46 +00:00
SISchedule.td
SIShrinkInstructions.cpp	[AMDGPU] Fixed return value causing warning and regression	2018-10-29 17:53:23 +00:00
SIWholeQuadMode.cpp	AMDGPU: Refactor Subtarget classes	2018-07-11 20:59:01 +00:00
SMInstructions.td	AMDGPU: Consolidate SMRD TableGen patterns	2018-10-06 03:32:43 +00:00
SOPInstructions.td	[AMDGPU] Add and update scalar instructions	2018-11-29 16:05:38 +00:00
VIInstrFormats.td
VIInstructions.td
VOP1Instructions.td	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
VOP2Instructions.td	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00
VOP3Instructions.td	AMDGPU: Fix V_FMA_F16 selection on GFX9	2018-11-19 21:10:16 +00:00
VOP3PInstructions.td	[AMDGPU] Handle the idot8 pattern generated by FE.	2018-11-01 22:48:19 +00:00
VOPCInstructions.td	AMDGPU: Implement llvm.amdgcn.icmp/fcmp for i16/f16	2018-08-15 21:25:20 +00:00
VOPInstructions.td	[AMDGPU] Combine DPP mov with use instructions (VOP1/2/3)	2018-11-30 14:21:56 +00:00