Mila 0.13.48
Deep Neural Network Library
OperationTraits specializations for all CPU operation backends.
import Dnn.Quantization.Weight.Policies;
import Compute.CpuSoftmaxOp;
import Compute.CpuResidualOp;
import Compute.CpuEncoderOp;
import Compute.CpuGeluOp;
import Compute.CpuLinearOp;
import Compute.CpuAttention;
import Compute.OperationTraits.Template;

Namespaces
| namespace | Mila |
| Mila main API namespace. | |
| namespace | Mila::Dnn |
| namespace | Mila::Dnn::Compute |
This partition module is the single registration point for every (OperationType, Cpu, TPrecision, TPolicy) -> concrete op mapping.
CPU ops are currently concrete (non-templated) FP32-only implementations. BF16 CPU paths are not a current Mila target.
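The mapping described above can be illustrated with a minimal, self-contained sketch. It is not Mila's actual declaration: the enum names, the use of float as TPrecision, the empty op structs, and the CpuOpFor alias are all hypothetical stand-ins chosen for illustration. Only the general pattern, one explicit specialization per (OperationType, Cpu, TPrecision, TPolicy) combination resolving to a concrete op type, is taken from the description.

```cpp
#include <type_traits>

// Hypothetical stand-ins for illustration only; the real declarations
// live in the imported Mila modules and may look different.
enum class OperationType { Linear, Gelu };
enum class DeviceType { Cpu, Cuda };

struct NoWeightQuant {};   // policy tag (name taken from the migration notes)
struct CpuLinearOp {};     // concrete, non-templated FP32 op (placeholder body)
struct CpuGeluOp {};       // concrete, non-templated FP32 op (placeholder body)

// Primary template is declared but never defined, so asking for an
// unregistered combination is a compile-time error.
template <OperationType Op, DeviceType Device, typename TPrecision, typename TPolicy>
struct OperationTraits;

// One explicit specialization per registered combination.
template <>
struct OperationTraits<OperationType::Linear, DeviceType::Cpu, float, NoWeightQuant> {
    using type = CpuLinearOp;
};

template <>
struct OperationTraits<OperationType::Gelu, DeviceType::Cpu, float, NoWeightQuant> {
    using type = CpuGeluOp;
};

// Hypothetical convenience alias: look up the CPU op for an operation type.
template <OperationType Op, typename TPrecision = float, typename TPolicy = NoWeightQuant>
using CpuOpFor = typename OperationTraits<Op, DeviceType::Cpu, TPrecision, TPolicy>::type;

static_assert(std::is_same_v<CpuOpFor<OperationType::Linear>, CpuLinearOp>,
              "Linear on CPU should resolve to CpuLinearOp");
```

Under these assumptions, a combination with no specialization (for example a quantized policy on CPU) fails to compile instead of silently selecting a fallback, which is one reason to keep all registrations in a single place.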
Migration status:
- LinearOp: complete (NoWeightQuant; quantized policies are CUDA-only)
- GeluOp: complete
- ResidualOp: complete
- SoftmaxOp: complete
- MultiHeadAttentionOp: complete
- LpeOp: complete
- CrossEntropyOp: pending (CpuSoftmaxCrossEntropyOp not yet wired into CMake)
- SamplingOp: pending