BERTInitialTokenizer.ConfigΒΆ

Component: BERTInitialTokenizer

class BERTInitialTokenizer.Config[source]

Bases: Tokenizer.Config

Config for this class.

All Attributes (including base classes)

split_regex: str = '\\s+'
lowercase: bool = True

Default JSON

{
    "split_regex": "\\s+",
    "lowercase": true
}