WebDataset
These include text, machine-generated data like web logs or sensor data, images, and so forth.