Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Tatar
Russian
German
French
Italian
Dutch
Portuguese
Romanian
Turkish
Arabic
Czech
Indonesian
Catalan
Greek
Spanish
Japanese
Polish
Slovenian
Tamil
Hindi
Ukrainian
Bulgarian
Estonian
Basque
Lithuanian
Urdu
Chinese
Bengali
Welsh
English
Finnish
Hungarian
Georgian
Korean
Swedish
Vietnamese
Belarusian
Kazakh
Macedonian
Danish
Esperanto
Icelandic
Maltese
Hebrew
Malayalam
Afrikaans
Kyrgyz
Thai
Galician
Slovak
Serbian
Persian
Armenian
Latvian
Irish
Amharic
Azerbaijani
Uyghur
Uzbek
Telugu
Marathi
Panjabi
Turkmen
Albanian
Mongolian
Assamese
Swahili
Yoruba
Croatian
Burmese
Norwegian Nynorsk
Kannada
Occitan
Sinhala
Breton
Lao
Khmer
Tajik
Bashkir
Guaraní
Gujarati
Kinyarwanda
Bosnian
Hausa
Igbo
Luxembourgish
Tagalog
Western Frisian
Scottish Gaelic
Malay
+ 1573 languages
Languages with no match
Tagalog
multilingual
rna
Baikeno
Akha
Manambu
Arosi
Awa (Papua New Guinea)
Akoose
Kenyang
Kwaio
Lahu
Eastern Apurímac Quechua
Mitla Zapotec
Agusan Manobo
Haryanvi
Akawaio
Apurinã
Mbyá Guaraní
Alamblak
Western Bru
Kimaragang
Masaaba
Northern Thai
Ambai
Guanano
Kisar
Paranan
Tboli
Ghomálá'
btk
Barasana-Eduria
Quiotepec Chinantec
Brao
Batak
Upper Kinabatangan
Western Dani
Lü
Krung
Eastern Lawa
Mandar
Makasae
Nauete
Ruching Palaung
Panao Huánuco Quechua
Semai
Surigaonon
Sangil
Tukudede
Tampuan
Wambon
Yetfa
Denya
Arapaho
Blagar
Siksika
Southern Carrier
Chipaya
Cavineña
Dogrib
Eastern Bontok
Wipi
Eastern Bolivian Guaraní
Gunwinggu
Golin
Ignaciano
Kaingang
Lacandon
Macushi
Maca
Coatlán Mixe
Martu Wangka
Mwera (Chimwera)
Nggem
Chumburung
Nyangumarta
Southeastern Puebla Nahuatl
Pogolo
Safwa
Epena
Takia
Tiruray
Tacana
Tuyuca
Iduna
Wapishana
Vwanji
Wik-Mungkan
Walmajarri
Xavánte
+ 6420 languages
Apply filters
Datasets
93
Full-text search
Edit filters
Sort: Trending
Active filters:
tt
Clear all
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16
•
13M
•
52.1k
•
150
legacy-datasets/common_voice
Updated
Aug 22
•
143
•
132
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
facebook/flores
Updated
Jan 18
•
97.6k
•
62
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
saillab/taco-datasets
Viewer
•
Updated
Dec 1, 2023
•
3.2M
•
1.97k
•
15
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
507
•
9
mozilla-foundation/common_voice_16_0
Viewer
•
Updated
Dec 21, 2023
•
8.2M
•
2.38k
•
65
fsicoli/common_voice_18_0
Updated
Aug 15
•
27k
•
4
ahelk/ccaligned_multilingual
Updated
Jan 18
•
11
•
5
speechbrain/common_language
Updated
Jun 12, 2023
•
301
•
27
microsoft/ms_terms
Updated
Jan 18
•
12
•
5
Helsinki-NLP/opus_gnome
Viewer
•
Updated
Feb 22
•
59.5k
•
35
•
1
Helsinki-NLP/opus_ubuntu
Viewer
•
Updated
Feb 22
•
37.4k
•
36
•
3
Helsinki-NLP/qed_amara
Updated
Jan 18
•
23
•
5
Helsinki-NLP/tanzil
Updated
Jan 18
•
12
•
4
community-datasets/tapaco
Viewer
•
Updated
Jun 26
•
3.85M
•
468
•
41
IWSLT/ted_talks_iwslt
Updated
Jan 18
•
16
•
18
community-datasets/udhr
Updated
Jan 18
•
16
•
2
unimorph/universal_morphologies
Updated
Jun 8, 2023
•
5
•
18
MartinThoma/wili_2018
Viewer
•
Updated
Aug 8
•
235k
•
44
•
4
Helsinki-NLP/tatoeba_mt
Updated
1 day ago
•
4.51k
•
52
Previous
1
2
3
4
Next