Fri Nov 18, 2022 9:49 pm
Data used by the Bittensor network ideally comes from two sources:

• Individual nodes and entities
• A dataset provided by the network

Since the network is still in its early days, a dataset that is already parsed and ready for use is provided. It is sourced from The Pile, which is made available for public use by an open-source community called EleutherAI. This dataset is used for creating the incentive landscape that the miners (Servers)

Note: The Pile, as described on its website, is an 825 GiB diverse, open-source language modelling dataset that consists of 22 smaller, high-quality datasets combined together.
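
To make the "22 smaller datasets combined together" idea more concrete, here is a rough sketch of counting documents per component dataset. It assumes the records are JSON Lines with a "text" field and a "meta" object carrying a "pile_set_name" label, which is how I understand the public release to be laid out. The sample records and the function names are made up for illustration, and this is not how Bittensor itself loads the data.

```python
import io
import json
from collections import Counter

# Hypothetical sample records, invented for illustration only.
# Assumed layout: one JSON object per line with "text" and "meta.pile_set_name".
SAMPLE_JSONL = (
    '{"text": "def add(a, b): return a + b", "meta": {"pile_set_name": "Github"}}\n'
    '{"text": "We study language models.", "meta": {"pile_set_name": "ArXiv"}}\n'
    '{"text": "import os", "meta": {"pile_set_name": "Github"}}\n'
)


def iter_records(stream):
    """Yield one parsed JSON object per non-empty line of the stream."""
    for line in stream:
        line = line.strip()
        if line:
            yield json.loads(line)


def count_by_subset(stream):
    """Count documents per component dataset label."""
    counts = Counter()
    for record in iter_records(stream):
        # Fall back to "unknown" if a record has no subset label.
        name = record.get("meta", {}).get("pile_set_name", "unknown")
        counts[name] += 1
    return counts


if __name__ == "__main__":
    print(count_by_subset(io.StringIO(SAMPLE_JSONL)))
```

The same functions should work on any text stream, so a decompressed file opened with open() could be passed in place of the in-memory sample.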
