Data.ConfigΒΆ

Component: Data

class Data.Config[source]

Bases: Component.Config

All Attributes (including base classes)

source: DataSource.Config = TSVDataSource.Config()
Specify where training/test/eval data come from. The default value will not provide any data.
batcher: Batcher.Config = PoolingBatcher.Config()
How training examples are split into batches for the optimizer.
sort_key: Optional[str] = None
in_memory: Optional[bool] = True
cache numberized result in memory, turn off when CPU memory bound.
Subclasses
  • PackedLMData.Config

Default JSON

{
    "source": {
        "TSVDataSource": {
            "column_mapping": {},
            "train_filename": null,
            "test_filename": null,
            "eval_filename": null,
            "field_names": null,
            "delimiter": "\t",
            "quoted": false,
            "drop_incomplete_rows": false
        }
    },
    "batcher": {
        "PoolingBatcher": {
            "train_batch_size": 16,
            "eval_batch_size": 16,
            "test_batch_size": 16,
            "pool_num_batches": 1000,
            "num_shuffled_pools": 1
        }
    },
    "sort_key": null,
    "in_memory": true
}