DistributedReadingService¶
- class torchdata.dataloader2.DistributedReadingService(timeout: int = 1800)¶
DistributedReadingSerivcehandles distributed sharding on the graph ofDataPipeand guarantee the randomness by sharing the same seed across the distributed processes.- Parameters:
timeout – Timeout for operations executed against the process group in seconds. Default value equals 30 minutes.
- finalize() None¶
Clean up the distributed process group.
- initialize(datapipe: IterDataPipe | MapDataPipe) IterDataPipe | MapDataPipe¶
Launches the
gloo-backend distributed process group. Carries out distributed sharding on the graph ofDataPipeand returnes the graph attached with aFullSyncIterDataPipeat the end.
- initialize_iteration(seed_generator: SeedGenerator) None¶
Shares the same seed from rank 0 to other ranks across the distributed processes and apply the random seed to the
DataPipegraph.