Data FlowΒΆ

This document describes the overall data flow of the data module.

  • Raw Data

    Original open source dataset. For each supported original data set, we provide scripts to convert it into atomic files.

  • Atomic Files

    Basic input elements for different traffic prediction tasks.

  • Dataset

    DifferentDataset classes are developed for each type of traffic prediction task, which are responsible for reading atomic files and performing some data preprocessing operations. See here for detail.

  • DataLoader

    The Dataloader class responsible for loading data, using the native torch.utils.data.DataLoader of Pytorch, it is responsible for returning the data to the model in the form of the internal data representation structure Batch class .