the-pile
the-pile copied to clipboard
Public website to explore dataset
Hi there, I'm testing GPT-J-6B and want to inspect the training data.Is there any public service/website where that I can explore the data without downloading all the dataset? Example: Search documents by keyword (Maybe indexing data using ElasticSearch...). I think it would be very valuable for many folks here. Thanks.
Agree!