Hugging Face
Datasets: 91
Active filters: bs
Sort: Trending
wikimedia/wikipedia • Viewer • Updated Jan 9 • 61.6M • 51.1k • 546
unimelb-nlp/wikiann • Viewer • Updated Feb 22 • 2M • 807 • 96
uonlp/CulturaX • Viewer • Updated Jul 23 • 7.18B • 12k • 469
statmt/cc100 • Updated Mar 5 • 763 • 71
Helsinki-NLP/open_subtitles • Updated Jan 18 • 294 • 60
Helsinki-NLP/opus-100 • Viewer • Updated Feb 28 • 55.1M • 3.14k • 140
oscar-corpus/oscar • Updated Mar 21 • 399 • 173
Helsinki-NLP/tatoeba • Updated Jan 18 • 35 • 37
legacy-datasets/wikipedia • Updated Mar 11 • 976 • 548
oscar-corpus/OSCAR-2201 • Updated May 30, 2023 • 173 • 113
wikimedia/wit_base • Viewer • Updated Nov 4, 2022 • 108k • 433 • 51
facebook/flores • Updated Jan 18 • 97.6k • 62
CohereForAI/xP3x • Updated Apr 10 • 1.08k • 67
classla/ParlaSent • Viewer • Updated Sep 28, 2023 • 18.2k • 110 • 5
saillab/taco-datasets • Viewer • Updated Dec 1, 2023 • 3.2M • 1.97k • 15
ayymen/Pontoon-Translations • Viewer • Updated Jan 19 • 3.56M • 507 • 9
stanford-oval/ccnews • Viewer • Updated Aug 31 • 893M • 416 • 3
borderlines/bordirlines • Updated about 2 hours ago • 721 • 4
community-datasets/bswac • Updated Jan 11 • 25
ahelk/ccaligned_multilingual • Updated Jan 18 • 11 • 5
microsoft/ms_terms • Updated Jan 18 • 12 • 5
Helsinki-NLP/opus_gnome • Viewer • Updated Feb 22 • 59.5k • 35 • 1
Helsinki-NLP/opus_ubuntu • Viewer • Updated Feb 22 • 37.4k • 36 • 3
Helsinki-NLP/qed_amara • Updated Jan 18 • 23 • 5
senti-lex/senti_lex • Updated Jun 8, 2023 • 14 • 7
community-datasets/setimes • Viewer • Updated Jun 26 • 8.8M • 111 • 2
Helsinki-NLP/tanzil • Updated Jan 18 • 12 • 4
IWSLT/ted_talks_iwslt • Updated Jan 18 • 16 • 18
community-datasets/udhr • Updated Jan 18 • 16 • 2
MartinThoma/wili_2018 • Viewer • Updated Aug 8 • 235k • 44 • 4