Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Basque
Spanish
French
English
Portuguese
Catalan
German
Italian
Russian
Arabic
Indonesian
Dutch
Danish
Hungarian
Romanian
Vietnamese
Chinese
Bengali
Swedish
Ukrainian
Japanese
Greek
Turkish
Finnish
Polish
Czech
Hindi
Korean
Bulgarian
Tamil
Slovak
Galician
Persian
Hebrew
Thai
Estonian
Slovenian
Esperanto
Lithuanian
Urdu
Malayalam
Serbian
Marathi
Armenian
Telugu
Afrikaans
Icelandic
Welsh
Georgian
Belarusian
Croatian
Latvian
Macedonian
Kazakh
Irish
Kannada
Maltese
Gujarati
Panjabi
Azerbaijani
Nepali
Burmese
Swahili
Kyrgyz
Albanian
Amharic
Uzbek
Yoruba
Sinhala
Tatar
Assamese
Mongolian
Malay
Tagalog
Breton
Khmer
Uyghur
Norwegian
Igbo
Latin
Lao
Norwegian Nynorsk
Western Frisian
Scottish Gaelic
Hausa
Pashto
Turkmen
Occitan
Bosnian
Javanese
+ 1592 languages
Languages with no match
Tagalog
rna
Baikeno
Akha
Manambu
Arosi
Awa (Papua New Guinea)
Akoose
Kenyang
Kwaio
Lahu
Eastern Apurímac Quechua
Mitla Zapotec
Agusan Manobo
Haryanvi
Alamblak
Western Bru
Kimaragang
Masaaba
Northern Thai
Ambai
Guanano
Kisar
Paranan
Tboli
Ghomálá'
btk
Brao
Batak
Upper Kinabatangan
Western Dani
Lü
Krung
Eastern Lawa
Mandar
Makasae
Nauete
Ruching Palaung
Panao Huánuco Quechua
Semai
Surigaonon
Sangil
Tukudede
Tampuan
Wambon
Yetfa
Denya
Arapaho
Blagar
Siksika
Southern Carrier
Chipaya
Cavineña
Dogrib
Eastern Bontok
Wipi
Eastern Bolivian Guaraní
Gunwinggu
Golin
Ignaciano
Kaingang
Lacandon
Macushi
Maca
Coatlán Mixe
Martu Wangka
Mwera (Chimwera)
Nggem
Chumburung
Nyangumarta
Southeastern Puebla Nahuatl
Pogolo
Safwa
Epena
Takia
Tiruray
Tacana
Tuyuca
Iduna
Wapishana
Vwanji
Wik-Mungkan
Walmajarri
Xavánte
Cajonos Zapotec
Zaramo
Yatzachi Zapotec
Zigula
Zoogocho Zapotec
Bunak
+ 6401 languages
Apply filters
Datasets
209
Full-text search
Edit filters
Sort: Trending
Active filters:
eu
Clear all
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
314k
•
287
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
recursal/SuperWikiImage-7M
Updated
3 days ago
•
1
•
3
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16
•
13M
•
52.1k
•
150
statmt/cc100
Updated
Mar 5
•
763
•
71
legacy-datasets/common_voice
Updated
Aug 22
•
143
•
132
cis-lmu/m_lama
Updated
Jan 18
•
20
•
6
Helsinki-NLP/open_subtitles
Updated
Jan 18
•
294
•
60
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
google/xtreme
Viewer
•
Updated
Feb 22
•
2.77M
•
2.72k
•
88
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
265
•
36
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
facebook/flores
Updated
Jan 18
•
97.6k
•
62
OpenAssistant/oasst1
Viewer
•
Updated
May 2, 2023
•
88.8k
•
3.2k
•
1.26k
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
MichaelR207/MultiSim
Updated
Nov 14, 2023
•
196
•
5
facebook/belebele
Viewer
•
Updated
Aug 12
•
110k
•
67.3k
•
94
saillab/taco-datasets
Viewer
•
Updated
Dec 1, 2023
•
3.2M
•
1.97k
•
15
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
507
•
9
mozilla-foundation/common_voice_16_0
Viewer
•
Updated
Dec 21, 2023
•
8.2M
•
2.38k
•
65
OpenAssistant/oasst2
Viewer
•
Updated
Jan 11
•
135k
•
4.15k
•
206
alexandrainst/m_arc
Viewer
•
Updated
Jan 15
•
87.4k
•
126k
•
4
alexandrainst/m_mmlu
Viewer
•
Updated
Mar 11
•
488k
•
326k
•
13
gttsehu/basque_parliament_1
Updated
Jul 12
•
7
•
1
Previous
1
2
3
...
7
Next