|
Mila 0.13.48
Deep Neural Network Library
|
Multi-Head Attention module (concatenated QKV input). More...
#include <memory>#include <vector>#include <string>#include <sstream>#include <type_traits>#include <stdexcept>#include <cstdint>#include <optional>import Serialization.Mode;import Serialization.ModelArchive;import Compute.IKvCacheLifecycle;import Compute.MemoryResource;import Compute.OperationTraits;import Compute.ExecutionContextFactory;import Dnn.Components.MultiHeadAttentionConfig;import Dnn.TensorDataType;import Dnn.TensorDataTypeTraits;import Compute.IPackedKvInference;import Dnn.Component;import Dnn.ComponentType;import Dnn.Tensor;import Compute.UnaryOperation;import Compute.DeviceType;import Compute.CpuMemoryResource;import Dnn.ITensor;import Dnn.TensorTypes;import Compute.Device;import Compute.DeviceTypeTraits;import Compute.DeviceId;import Compute.ExecutionContext;Classes | |
| class | Mila::Dnn::MultiHeadAttention< TDeviceType, TPrecision > |
| Multi-Head Attention module that accepts concatenated QKV input. More... | |
Namespaces | |
| namespace | Mila |
| Mila main API namespace. | |
| namespace | Mila::Dnn |
Multi-Head Attention module (concatenated QKV input).