Mila 0.13.48
Deep Neural Network Library
IKvInference.ixx File Reference

KV-cache compute interface for modern attention backends (GQA and beyond).

Classes

struct  Mila::Dnn::Compute::IKvInference
 Compute interface for attention operations that maintain a KV cache.

Namespaces

namespace  Mila
 Mila main API namespace.
namespace  Mila::Dnn
namespace  Mila::Dnn::Compute

Detailed Description

KV-cache compute interface for modern attention backends (GQA and beyond).
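
To illustrate the kind of state a KV-cache attention backend has to manage, the following is a minimal sketch in plain C++. It is not the declaration of Mila::Dnn::Compute::IKvInference: the type KvCacheSketch and its members (append, kvHeadFor, reset) are hypothetical names chosen for illustration only. The sketch assumes grouped-query attention (GQA), where several query heads share one key/value head, and a fixed-capacity cache laid out as [position, kv_head, head_dim].

```cpp
#include <algorithm>
#include <cassert>
#include <cstddef>
#include <stdexcept>
#include <vector>

// Hypothetical illustration only; none of these names come from Mila.
struct KvCacheSketch {
    std::size_t num_q_heads;   // number of query heads
    std::size_t num_kv_heads;  // number of key/value heads (divides num_q_heads in GQA)
    std::size_t head_dim;      // elements per head
    std::size_t max_seq_len;   // cache capacity in tokens
    std::size_t length = 0;    // tokens currently cached
    std::vector<float> keys;   // layout: [max_seq_len, num_kv_heads, head_dim]
    std::vector<float> values; // same layout as keys

    KvCacheSketch(std::size_t q_heads, std::size_t kv_heads,
                  std::size_t dim, std::size_t max_len)
        : num_q_heads(q_heads), num_kv_heads(kv_heads),
          head_dim(dim), max_seq_len(max_len),
          keys(max_len * kv_heads * dim, 0.0f),
          values(max_len * kv_heads * dim, 0.0f) {
        if (kv_heads == 0 || q_heads % kv_heads != 0) {
            throw std::invalid_argument("q_heads must be a multiple of kv_heads");
        }
    }

    // Elements stored per token across all KV heads.
    std::size_t tokenStride() const { return num_kv_heads * head_dim; }

    // Append the key/value vectors of one new token (decode step).
    void append(const std::vector<float>& k, const std::vector<float>& v) {
        if (k.size() != tokenStride() || v.size() != tokenStride()) {
            throw std::invalid_argument("unexpected key/value size");
        }
        if (length >= max_seq_len) {
            throw std::length_error("KV cache is full");
        }
        std::copy(k.begin(), k.end(), keys.begin() + length * tokenStride());
        std::copy(v.begin(), v.end(), values.begin() + length * tokenStride());
        ++length;
    }

    // Map a query head to the KV head it shares under GQA.
    std::size_t kvHeadFor(std::size_t q_head) const {
        assert(q_head < num_q_heads);
        return q_head / (num_q_heads / num_kv_heads);
    }

    // Forget all cached tokens, e.g. when a new sequence starts.
    void reset() { length = 0; }
};
```

With 8 query heads and 2 KV heads, query heads 0 to 3 read KV head 0 and query heads 4 to 7 read KV head 1. A real backend would typically keep the cache in device memory and expose separate prefill and decode entry points; this sketch covers neither.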