Find the right datasets for you
Suggested filters
Dataset title | Domain | Type | Locale | Amount | Latin books 15K Public Domain books, scanned and digitized, both fiction and non-fiction. | la, la-latn | 15K | European Spanish books 40K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | es-ES | 40.5K | European French books 177K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | fr-FR, FR | 177.7K | German books 183K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | de-DE, DE | 183.4K | Greek books 15K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | EL | 15K | English books 211K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | EN | 211.8K | Italian books 42K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | it-IT | 42.6K | Korean Question-Answer pairs 1,250,480 Question-Answer pairs in Korean. | KO, ko-KR | 1.3M | English books 92K+ Public Domain books, scanned and digitized, both fiction and non-fiction. | General General | EN | 92.7K | Filipino accented English Podcasts 10000 hours of Filipino accented English live, non-simulated podcasts, recorded by real podcasters in our partner network. | General General | EN | 10K hours |
|---|
Showing 10 of 152 datasets
...
Datasets per page
Latin books
Amount:
15K
Locale:
European Spanish books
Amount:
40.5K
Locale:
European French books
Amount:
177.7K
Locale:
German books
Amount:
183.4K
Locale:
Greek books
Amount:
15K
Locale:
English books
Amount:
211.8K
Locale:
Italian books
Amount:
42.6K
Locale:
Korean Question-Answer pairs
Amount:
1.3M
Locale:
English books
Domain:
Amount:
92.7K
Locale:
Filipino accented English Podcasts
Domain:
Amount:
10K hours
Locale:
Showing 10 of 152 datasets
1/16