Device-dispatched TensorOps interface template. More...

Detailed Description

template<Compute::DeviceType TDevice>
struct Mila::Dnn::TensorOps< TDevice >

Device-dispatched TensorOps interface template.

Specialize TensorOps<TDevice> for each supported Compute::DeviceType to provide backend implementations of tensor operations (elementwise, reductions, copy, fill, etc.).

Requirements for specializations:

Provide the operations used by the framework (static or instance methods), matching the signatures expected by TensorOps callers.
Use the device's memory resource and execution context types to access device-specific APIs and streams.
Respect host/device accessibility guarantees: CPU specializations must operate on host-accessible memory, CUDA specializations on device memory.

Usage example:

template<>
struct TensorOps<Compute::DeviceType::Cpu>
{
    static void copy(const ITensor& src, ITensor& dst);
    // ...
};

Template Parameters

TDevice Compute device type to specialize for (DeviceType::Cpu, DeviceType::Cuda, ...)

The documentation for this struct was generated from the following file:

/__w/Mila/Mila/Mila/Src/Dnn/Tensors/Operations/TensorOps-Base.ixx