dataloader

sorix.utils.data.dataloader

DataLoader

DataLoader(
    dataset, batch_size=16, shuffle=False, collate_fn=None
)

Data iterator that provides batches of data from a Dataset. Inspired by PyTorch's DataLoader.

Parameters:

  • dataset (Any) –

    The dataset to load data from.

  • batch_size (int, default: 16 ) –

    How many samples per batch to load.

  • shuffle (bool, default: False ) –

    Set to True to have the data reshuffled at every epoch.

  • collate_fn (Optional[Callable], default: None ) –

    Merges a list of samples to form a mini-batch of Tensors. When None, the built-in default (_default_collate) is used, which converts nested lists/arrays to sorix.tensors.
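
The parameters above can be illustrated with a small pure-Python stand-in. This is only a sketch of the batching behaviour the descriptions imply, not sorix's implementation: SketchLoader and identity_collate are hypothetical names, batches are plain lists rather than sorix tensors, and keeping the final, shorter batch is an assumption.

```python
import random
from typing import Any, Callable, Iterator, List, Optional, Sequence


def identity_collate(samples: List[Any]) -> List[Any]:
    """Hypothetical default collate: return the list of samples unchanged."""
    return samples


class SketchLoader:
    """Illustrative stand-in for a batching iterator (NOT sorix's DataLoader)."""

    def __init__(
        self,
        dataset: Sequence[Any],
        batch_size: int = 16,
        shuffle: bool = False,
        collate_fn: Optional[Callable] = None,
    ):
        self.dataset = dataset
        self.batch_size = batch_size
        self.shuffle = shuffle
        self.collate_fn = collate_fn or identity_collate

    def __iter__(self) -> Iterator[Any]:
        indices = list(range(len(self.dataset)))
        if self.shuffle:
            # A fresh order is drawn each time iteration starts (one "epoch").
            random.shuffle(indices)
        # Walk the index list in steps of batch_size.
        # Assumption: the last batch may be shorter and is still yielded.
        for start in range(0, len(indices), self.batch_size):
            chunk = indices[start:start + self.batch_size]
            yield self.collate_fn([self.dataset[i] for i in chunk])


data = list(range(10))
batches = list(SketchLoader(data, batch_size=4))
print(batches)  # [[0, 1, 2, 3], [4, 5, 6, 7], [8, 9]]

shuffled = list(SketchLoader(data, batch_size=4, shuffle=True))
```

With shuffle=True the batch sizes stay the same, but the samples are drawn in a random order that changes on each pass over the loader.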

Source code in sorix/utils/data/dataloader.py
def __init__(
    self, 
    dataset: Any, 
    batch_size: int = 16, 
    shuffle: bool = False,
    collate_fn: Optional[Callable] = None
):
    self.dataset = dataset
    self.batch_size = batch_size
    self.shuffle = shuffle
    self.collate_fn = collate_fn or self._default_collate
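
The assignment self.collate_fn = collate_fn or self._default_collate means that any callable passed as collate_fn replaces the default. Below is a minimal sketch of such a callable, assuming each sample is a (features, label) pair; pair_collate is a hypothetical name, and it returns plain Python lists, leaving any tensor conversion to the caller.

```python
from typing import Any, List, Sequence, Tuple


def pair_collate(
    samples: Sequence[Tuple[Any, Any]],
) -> Tuple[List[Any], List[Any]]:
    """Hypothetical collate function: split (features, label) pairs
    into one list of features and one list of labels."""
    features = [sample[0] for sample in samples]
    labels = [sample[1] for sample in samples]
    return features, labels


# Three samples, as a loader might hand them to collate_fn for one batch.
samples = [([1.0, 2.0], 0), ([3.0, 4.0], 1), ([5.0, 6.0], 0)]
features, labels = pair_collate(samples)
print(features)  # [[1.0, 2.0], [3.0, 4.0], [5.0, 6.0]]
print(labels)    # [0, 1, 0]
```

Following the signature documented above, a function like this would be supplied as DataLoader(dataset, batch_size=16, collate_fn=pair_collate).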