Mila 0.13.48
Deep Neural Network Library
Loading...
Searching...
No Matches
Compute.CudaLinearOp Module Reference

Classes

class  Mila::Dnn::Compute::Cuda::Linear::CudaLinearOp< TComputePrecision, TWeightQuant >
 CUDA Linear operation with compile-time weight quantization policy dispatch. More...
class  Mila::Dnn::Compute::Cuda::Linear::CudaLinearOpRegistrar

Files

file  /__w/Mila/Mila/Mila/Src/Dnn/Compute/Devices/Cuda/Operations/Linear/CudaLinearOp.ixx
 CUDA implementation of Linear operation with two-phase cuBLASLt optimization.
file  /__w/Mila/Mila/Mila/Src/Dnn/Compute/Devices/Cuda/Operations/Linear/CudaLinearOp.Dispatch.ixx
file  /__w/Mila/Mila/Mila/Src/Dnn/Compute/Devices/Cuda/Operations/Linear/CudaLinearOp.Plans.ixx
 cuBLASLt plan builders for CudaLinearOp forward and backward passes.
file  /__w/Mila/Mila/Mila/Src/Dnn/Compute/Devices/Cuda/Operations/Linear/CudaLinearOp.Quantize.ixx
 Quantize partition of CudaLinearOp.