Mila 0.13.48
Deep Neural Network Library
Configuration for the SwiGLU activation component.
#include <stdexcept>
#include <string>
#include <string_view>
#include <sstream>
import Serialization.Metadata;
import Dnn.ApproximationMethod;
import Dnn.ComponentConfig;
import Dnn.Component;

Classes
class Mila::Dnn::SwigluConfig

Namespaces
namespace Mila
    Mila main API namespace.
namespace Mila::Dnn
Configuration for the SwiGLU activation component.
Design principle (Mila-wide):
SwigluConfig has no structurally required parameters: hidden_dim is determined from the input tensor shape at build time, so a default-constructed SwigluConfig is already a valid configuration.
Optional (fluent): inner_gelu_method (default: Tanh).
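The pattern described above (a default constructor plus an optional fluent setter) can be sketched in standalone C++ as follows. This is only an illustration under stated assumptions: SwigluConfigSketch, ApproximationMethodSketch, its enumerators other than Tanh, and the setter name withInnerGeluMethod are hypothetical stand-ins, not Mila's actual declarations.

```cpp
// Hypothetical stand-in for Dnn.ApproximationMethod; only Tanh is named in the docs,
// the other enumerator is an assumption made for illustration.
enum class ApproximationMethodSketch { Exact, Tanh };

// Hypothetical stand-in for Mila::Dnn::SwigluConfig (not the library's real class).
class SwigluConfigSketch {
public:
    // No structurally required parameters, so default construction is sufficient.
    SwigluConfigSketch() = default;

    // Fluent setter (name assumed): returns *this so that calls can be chained.
    SwigluConfigSketch& withInnerGeluMethod(ApproximationMethodSketch method) {
        inner_gelu_method_ = method;
        return *this;
    }

    ApproximationMethodSketch innerGeluMethod() const { return inner_gelu_method_; }

private:
    // Documented default: Tanh.
    ApproximationMethodSketch inner_gelu_method_ = ApproximationMethodSketch::Tanh;
};

// Builds one default configuration and one with the optional parameter overridden.
inline SwigluConfigSketch makeExactConfig() {
    SwigluConfigSketch defaults;  // keeps inner_gelu_method at Tanh
    (void)defaults;               // shown only to illustrate default construction
    return SwigluConfigSketch().withInnerGeluMethod(ApproximationMethodSketch::Exact);
}
```

Note that hidden_dim does not appear in the sketch at all: per the description above, it is derived from the input tensor shape at build time rather than stored in the configuration.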
Typical usage: