Mila
Deep Neural Network Library
CudaLinearOp.ixx File Reference

Implementation of the CUDA-based Fully Connected operation for neural networks.

#include <cublasLt.h>
#include <cuda_fp16.h>
#include <vector>
#include <memory>
#include <string>
#include <stdexcept>
#include <exception>
#include "Kernels/CudaOps.h"
#include <type_traits>
import Utils.Logger;
import Compute.CublasLtMatMulBias;
import Compute.CudaDevice;
import Compute.CudaMemoryResource;
import Dnn.Modules.Linear;
import Dnn.Tensor;
import Compute.UnaryOperation;
import Dnn.TensorTraits;
import Dnn.ComponentConfig;
import Compute.DeviceContext;
import Compute.MemoryResource;
import Compute.Precision;
import Compute.OperationType;
import Compute.OperationRegistry;
import Compute.OperationBase;
import Compute.DeviceType;
import Compute.OperationAttributes;

Classes

struct  Mila::Dnn::Compute::Detail::cuda_matmul_impl< float >
 
struct  Mila::Dnn::Compute::Detail::cuda_matmul_impl< half >
 
class  Mila::Dnn::Compute::CudaLinearOp< TInput, TOutput >
 CUDA implementation of the Fully Connected operation for neural networks.
 
class  Mila::Dnn::Compute::CudaLinearOpRegistrar
 Class responsible for registering the CudaLinearOp operation.
 

Namespaces

namespace  Mila
 
namespace  Mila::Dnn
 
namespace  Mila::Dnn::Compute
 
namespace  Mila::Dnn::Compute::Detail
 Namespace for CUDA operation implementation details.
 

Detailed Description

Implementation of the CUDA-based Fully Connected operation for neural networks.
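
For orientation, a fully connected (linear) operation conventionally computes Y = X * W^T + b. The sketch below is a plain CPU reference for that arithmetic, which may be useful for checking GPU results on small inputs. It is not taken from Mila: the function name `linear_forward_reference`, the row-major layouts, and the `[out_features, in_features]` weight shape are assumptions made for illustration, and the actual CudaLinearOp may use different layouts and dispatches to cuBLASLt or custom kernels instead.

```cpp
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical CPU reference helper (not part of Mila).
// Computes Y = X * W^T + b with assumed row-major layouts:
//   X: [rows, in_features]
//   W: [out_features, in_features]
//   b: [out_features], or empty to skip the bias
//   Y: [rows, out_features]
std::vector<float> linear_forward_reference(
    const std::vector<float>& X,
    const std::vector<float>& W,
    const std::vector<float>& b,
    std::size_t rows,
    std::size_t in_features,
    std::size_t out_features)
{
    if (X.size() != rows * in_features ||
        W.size() != out_features * in_features) {
        throw std::invalid_argument("input or weight size does not match shape");
    }
    if (!b.empty() && b.size() != out_features) {
        throw std::invalid_argument("bias size does not match out_features");
    }

    std::vector<float> Y(rows * out_features, 0.0f);
    for (std::size_t r = 0; r < rows; ++r) {
        for (std::size_t o = 0; o < out_features; ++o) {
            // Start from the bias (or zero) and accumulate the dot product
            // of input row r with weight row o.
            float acc = b.empty() ? 0.0f : b[o];
            for (std::size_t i = 0; i < in_features; ++i) {
                acc += X[r * in_features + i] * W[o * in_features + i];
            }
            Y[r * out_features + o] = acc;
        }
    }
    return Y;
}
```

Comparing a GPU result against a reference like this generally needs a tolerance rather than exact equality, especially for the `half` specialization, since accumulation order and precision differ.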