|
Mila 0.13.48
Deep Neural Network Library
|
Character-level tokenizer implementing the Tokenizer API. More...
#include <string>#include <string_view>#include <vector>#include <span>#include <memory>#include <optional>#include <filesystem>import Data.TokenizerVocabulary;import Data.Tokenizer;import Data.CharVocabulary;Classes | |
| class | Mila::Data::CharTokenizer |
| Character-level tokenizer. More... | |
Namespaces | |
| namespace | Mila |
| Mila main API namespace. | |
| namespace | Mila::Data |
Typedefs | |
| using | Mila::Data::TokenId |
Character-level tokenizer implementing the Tokenizer API.
Provides a simple byte/char tokenizer that maps single-byte characters to token ids via a TokenizerVocabulary.