Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Welsh
English
French
Portuguese
Turkish
Russian
Arabic
Indonesian
Spanish
Japanese
German
Italian
Dutch
Korean
Tamil
Chinese
Catalan
Slovenian
Vietnamese
Hindi
Polish
Romanian
Estonian
Ukrainian
Greek
Czech
Hungarian
Swedish
Bengali
Danish
Finnish
Thai
Urdu
Persian
Bulgarian
Lithuanian
Afrikaans
Basque
Latvian
Belarusian
Slovak
Icelandic
Georgian
Macedonian
Serbian
Hebrew
Telugu
Armenian
Malayalam
Marathi
Galician
Azerbaijani
Kazakh
Amharic
Esperanto
Kyrgyz
Panjabi
Irish
Maltese
Uzbek
Swahili
Albanian
Croatian
Burmese
Mongolian
Kannada
Gujarati
Tatar
Yoruba
Nepali
Sinhala
Assamese
Breton
Khmer
Pashto
Tagalog
Scottish Gaelic
Hausa
Lao
Malay
Somali
Uyghur
Western Frisian
Igbo
Latin
Norwegian Nynorsk
Norwegian
Tajik
Turkmen
Sindhi
+ 1585 languages
Languages with no match
Tagalog
rna
Baikeno
Akha
Manambu
Arosi
Awa (Papua New Guinea)
Akoose
Kenyang
Kwaio
Lahu
Eastern Apurímac Quechua
Mitla Zapotec
Agusan Manobo
Haryanvi
Akawaio
Alamblak
Western Bru
Kimaragang
Masaaba
Northern Thai
Ambai
Guanano
Kisar
Paranan
Tboli
Ghomálá'
btk
Barasana-Eduria
Quiotepec Chinantec
Brao
Batak
Upper Kinabatangan
Western Dani
Lü
Krung
Eastern Lawa
Mandar
Makasae
Nauete
Ruching Palaung
Panao Huánuco Quechua
Semai
Surigaonon
Sangil
Tukudede
Tampuan
Wambon
Yetfa
Denya
Arapaho
Blagar
Siksika
Southern Carrier
Chipaya
Cavineña
Dogrib
Eastern Bontok
Wipi
Eastern Bolivian Guaraní
Gunwinggu
Golin
Ignaciano
Kaingang
Lacandon
Macushi
Maca
Coatlán Mixe
Martu Wangka
Mwera (Chimwera)
Nggem
Chumburung
Nyangumarta
Southeastern Puebla Nahuatl
Pogolo
Safwa
Epena
Takia
Tiruray
Tacana
Tuyuca
Iduna
Wapishana
Vwanji
Wik-Mungkan
Walmajarri
Xavánte
Cajonos Zapotec
Zaramo
Yatzachi Zapotec
+ 6408 languages
Apply filters
Datasets
173
Full-text search
Edit filters
Sort: Trending
Active filters:
cy
Clear all
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
314k
•
287
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
recursal/SuperWikiImage-7M
Updated
3 days ago
•
1
•
3
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16
•
13M
•
52.1k
•
150
statmt/cc100
Updated
Mar 5
•
763
•
71
legacy-datasets/common_voice
Updated
Aug 22
•
143
•
132
cis-lmu/m_lama
Updated
Jan 18
•
20
•
6
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
csebuetnlp/xlsum
Viewer
•
Updated
Apr 18, 2023
•
1.35M
•
7.37k
•
106
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
265
•
36
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
mteb/amazon_massive_intent
Viewer
•
Updated
May 7
•
1.69M
•
24.1k
•
20
facebook/flores
Updated
Jan 18
•
97.6k
•
62
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
saillab/taco-datasets
Viewer
•
Updated
Dec 1, 2023
•
3.2M
•
1.97k
•
15
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
507
•
9
mozilla-foundation/common_voice_16_0
Viewer
•
Updated
Dec 21, 2023
•
8.2M
•
2.38k
•
65
TaiMingLu/Multilingual-Benchmark
Viewer
•
Updated
15 days ago
•
1.05M
•
1
stanford-oval/ccnews
Viewer
•
Updated
Aug 31
•
893M
•
416
•
3
fsicoli/common_voice_18_0
Updated
Aug 15
•
27k
•
4
Nikity/Pornhub
Updated
Aug 26
•
10
•
17
ahelk/ccaligned_multilingual
Updated
Jan 18
•
11
•
5
speechbrain/common_language
Updated
Jun 12, 2023
•
301
•
27
facebook/covost2
Updated
Jan 18
•
263
•
22
Previous
1
2
3
...
6
Next