DisjointMultitaskDataHandler.Config

Component: DisjointMultitaskDataHandler

class DisjointMultitaskDataHandler.Config[source]

Bases: DataHandler.Config

Configuration class for DisjointMultitaskDataHandler.

upsample

If upsample, keep cycling over each iterator in round-robin. Iterators with less batches will get more passes. If False, we do single pass over each iterator, the ones which run out will sit idle. This is used for evaluation. Default True.

Type:bool

All Attributes (including base classes)

columns_to_read: list[str] = []
shuffle: bool = True
sort_within_batch: bool = True
train_path: str = 'train.tsv'
eval_path: str = 'eval.tsv'
test_path: str = 'test.tsv'
train_batch_size: int = 128
eval_batch_size: int = 128
test_batch_size: int = 128
column_mapping: dict[str, str] = {}
upsample: bool = True

Default JSON

{
    "columns_to_read": [],
    "shuffle": true,
    "sort_within_batch": true,
    "train_path": "train.tsv",
    "eval_path": "eval.tsv",
    "test_path": "test.tsv",
    "train_batch_size": 128,
    "eval_batch_size": 128,
    "test_batch_size": 128,
    "column_mapping": {},
    "upsample": true
}