|
Mila
Deep Neural Network Library
|
Implementation of the CUDA-based Fully Connected operation for neural networks. More...
#include <cublasLt.h>#include <cuda_fp16.h>#include <vector>#include <memory>#include <string>#include <stdexcept>#include <exception>#include "Kernels/CudaOps.h"#include <type_traits>import Utils.Logger;import Compute.CublasLtMatMulBias;import Compute.CudaDevice;import Compute.CudaMemoryResource;import Dnn.Modules.Linear;import Dnn.Tensor;import Compute.UnaryOperation;import Dnn.TensorTraits;import Dnn.ComponentConfig;import Compute.DeviceContext;import Compute.MemoryResource;import Compute.Precision;import Compute.OperationType;import Compute.OperationRegistry;import Compute.OperationBase;import Compute.DeviceType;import Compute.OperationAttributes;Classes | |
| struct | Mila::Dnn::Compute::Detail::cuda_matmul_impl< float > |
| struct | Mila::Dnn::Compute::Detail::cuda_matmul_impl< half > |
| class | Mila::Dnn::Compute::CudaLinearOp< TInput, TOutput > |
| CUDA implementation of the Fully Connected operation for neural networks. More... | |
| class | Mila::Dnn::Compute::CudaLinearOpRegistrar |
| Class responsible for registering the CudaLinearOp operation. More... | |
Namespaces | |
| namespace | Mila |
| namespace | Mila::Dnn |
| namespace | Mila::Dnn::Compute |
| namespace | Mila::Dnn::Compute::Detail |
| Namespace for CUDA layer normalization implementation details. | |
Implementation of the CUDA-based Fully Connected operation for neural networks.