Hugging Face
Datasets: 149
Active filters: mt
FBK-MT/mosel • Viewer • Updated 8 days ago • 51.1M • 235 • 50
allenai/c4 • Viewer • Updated Jan 9 • 10.4B • 314k • 287
wikimedia/wikipedia • Viewer • Updated Jan 9 • 61.6M • 51.1k • 546
unimelb-nlp/wikiann • Viewer • Updated Feb 22 • 2M • 807 • 96
uonlp/CulturaX • Viewer • Updated Jul 23 • 7.18B • 12k • 469
mozilla-foundation/common_voice_17_0 • Viewer • Updated Jun 16 • 13M • 52.1k • 150
legacy-datasets/common_voice • Updated Aug 22 • 143 • 132
Helsinki-NLP/opus-100 • Viewer • Updated Feb 28 • 55.1M • 3.14k • 140
oscar-corpus/oscar • Updated Mar 21 • 399 • 173
Helsinki-NLP/tatoeba • Updated Jan 18 • 35 • 37
legacy-datasets/wikipedia • Updated Mar 11 • 976 • 548
oscar-corpus/OSCAR-2201 • Updated May 30, 2023 • 173 • 113
facebook/flores • Updated Jan 18 • 97.6k • 62
joelniklaus/mapa • Viewer • Updated Oct 25, 2022 • 41.8k • 7 • 5
joelniklaus/lextreme • Viewer • Updated Apr 29, 2023 • 1.65M • 29 • 18
joelniklaus/Multi_Legal_Pile • Updated Jan 12 • 392 • 42
joelniklaus/eurlex_resources • Updated May 10, 2023 • 2 • 7
joelniklaus/legal-mc4 • Viewer • Updated Aug 6, 2023 • 4.74M • 6 • 11
CohereForAI/xP3x • Updated Apr 10 • 1.08k • 67
facebook/belebele • Viewer • Updated Aug 12 • 110k • 67.3k • 94
saillab/taco-datasets • Viewer • Updated Dec 1, 2023 • 3.2M • 1.97k • 15
ayymen/Pontoon-Translations • Viewer • Updated Jan 19 • 3.56M • 507 • 9
mozilla-foundation/common_voice_16_0 • Viewer • Updated Dec 21, 2023 • 8.2M • 2.38k • 65
darmanin-matt/smnli_mt • Viewer • Updated Jan 17 • 972k • 6 • 1
fsicoli/common_voice_18_0 • Updated Aug 15 • 27k • 4
ahelk/ccaligned_multilingual • Updated Jan 18 • 11 • 5
speechbrain/common_language • Updated Jun 12, 2023 • 301 • 27
Helsinki-NLP/ecb • Updated Jan 18 • 24
Helsinki-NLP/emea • Updated Jan 18 • 36 • 1
community-datasets/europa_eac_tm • Viewer • Updated Jun 24 • 12.8k • 25 • 3