-
Notifications
You must be signed in to change notification settings - Fork 28
Closed
Description
It would be very useful to have the training data returned from generate_data_parallel.py script available to download, for both the pile and packed cases.
I appreciate this may be a large amount of memory, and therefore difficult to host, so there is no expectation of course!
But it would avoid people needing to run the costly data generation process locally in order to experiment with the training.
Metadata
Metadata
Assignees
Labels
No labels