|
Mila 0.13.48
Deep Neural Network Library
|

Files | |
| BinaryOperation.ixx | |
| Abstract device-agnostic binary operation interface. | |
| GqaOpTypeMap.ixx | |
| GqaOpTypeMap.Template.ixx | |
| GqaState.ixx | |
| Non-owning transient scratch state for CudaGqaOp inference paths. | |
| IKVCacheLifecycle.ixx | |
| Interface for operations that own and manage a KV cache. | |
| IKvInference.ixx | |
| KV-cache compute interface for modern attention backends (GQA and beyond). | |
| IPackedKvInference.ixx | |
| IPositionalDecode.ixx | |
| IPositionalPairedOp.ixx | |
| Interface for paired operations whose output depends on absolute token position. | |
| LinearOpTypeMap.ixx | |
| LinearOpTypeMap.Template.ixx | |
| Primary compile-time dispatch template mapping (DeviceType, TPrecision, TWeightQuant) to a concrete LinearOp type. | |
| OperationBase.ixx | |
| Core abstraction for neural network operations in the Mila framework. | |
| OperationRegistrarHelpers.ixx | |
| Helpers to standardize registration of unary/binary/paired ops. | |
| OperationRegistry.ixx | |
| Central registry for creating and discovering compute operations. | |
| OperationRegistryHelpers.ixx | |
| Compile-time templated helpers for querying the OperationRegistry. | |
| OperationsRegistrar.ixx | |
| OperationTraits.ixx | |
| Aggregator for the unified operation traits dispatch table. | |
| OperationTraits.Template.ixx | |
| Unified compile-time dispatch template mapping (OperationType, DeviceType, TPrecision, TPolicy) to a concrete operation type. | |
| OperationType.ixx | |
| Defines the operation types supported by the compute framework. | |
| PairedOperation.ixx | |
| Abstract device-agnostic paired operation interface. | |
| UnaryOperation.ixx | |
| Device-agnostic unary operation interface using abstract tensor data types. | |