|
Mila 0.13.48
Deep Neural Network Library
|
CUDA implementation of the residual (y = x + F(x)) binary operation. More...
#include "Kernels/Residual.cuh"#include <memory>#include <cuda_fp16.h>#include <stdexcept>#include <cstdint>#include <cassert>#include <string>import Compute.OperationRegistrarHelpers;import Compute.CudaTensorDataType;import Compute.CudaResidualOp:Dispatch;import Compute.BinaryOperation;import Dnn.Components.ResidualConfig;import Compute.OperationType;import Compute.ExecutionContext;import Dnn.Component;import Compute.CudaDeviceMemoryResource;import Dnn.ComponentConfig;import Compute.OperationBase;import Dnn.ITensor;import Compute.OperationRegistry;import Dnn.TensorDataType;import Compute.DeviceType;Classes | |
| class | Mila::Dnn::Compute::Cuda::Residual::CudaResidualOp< TInputA, TInputB, TPrecision > |
| CUDA Residual operation implementing the BinaryOperation interface. More... | |
| class | Mila::Dnn::Compute::Cuda::Residual::CudaResidualOpRegistrar |
Namespaces | |
| namespace | Mila |
| Mila main API namespace. | |
| namespace | Mila::Dnn |
| namespace | Mila::Dnn::Compute |
| namespace | Mila::Dnn::Compute::Cuda |
| namespace | Mila::Dnn::Compute::Cuda::Residual |
CUDA implementation of the residual (y = x + F(x)) binary operation.
Device- and precision-specific implementation of the Residual operation for CUDA devices. Implements the device-agnostic BinaryOperation interface and dispatches to device kernels in the Detail namespace based on the native compute type (float / half).