Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Breton
French
Russian
Spanish
German
Indonesian
Italian
Czech
Japanese
Dutch
Ukrainian
Arabic
Portuguese
Romanian
Catalan
Greek
Esperanto
Polish
Chinese
English
Basque
Slovenian
Turkish
Vietnamese
Belarusian
Bulgarian
Welsh
Estonian
Persian
Lithuanian
Finnish
Hungarian
Serbian
Swedish
Tamil
Hebrew
Hindi
Korean
Danish
Galician
Latvian
Georgian
Macedonian
Afrikaans
Bengali
Icelandic
Marathi
Urdu
Azerbaijani
Irish
Kazakh
Slovak
Malayalam
Thai
Armenian
Western Frisian
Albanian
Croatian
Mongolian
Maltese
Uzbek
Amharic
Kyrgyz
Tatar
Assamese
Uyghur
Swahili
Telugu
Latin
Oriya
Occitan
Yiddish
Panjabi
Interlingua
Turkmen
Chuvash
Malay
Pashto
Tagalog
Yoruba
Norwegian Nynorsk
Norwegian
Sinhala
Guaraní
Khmer
Lao
Nepali
Asturian
Central Kurdish
Scottish Gaelic
+ 1841 languages
Languages with no match
rna
Akha
Lahu
Haryanvi
Western Bru
Kimaragang
Masaaba
Northern Thai
Ghomálá'
btk
Brao
Batak
Upper Kinabatangan
Western Dani
Lü
Krung
Eastern Lawa
Mandar
Makasae
Nauete
Ruching Palaung
Semai
Surigaonon
Sangil
Tukudede
Tampuan
Wambon
Yetfa
Bunak
En
Fali
Hre
Haroi
Idaté
Mandaya
South Nuaulu
Nyoro
Kayan
Tontemboan
Tombulu
Ambala Ayta
Tuki
Kom (Cameroon)
Tiéyaxo Bozo
Mokpwe
Jenaama Bozo
Chipewyan
Lowland Oaxaca Chontal
Lotud
Huba
Kamwe
Western Juxtlahuaca Mixtec
Limbum
Laarim
Naskapi
Koonzime
Peranakan Indonesian
Samburu
Arammba
Tiang
Tharaka
Turkana
Turka
Mundari
Vili
Wanukaka
Yemba
Yakha
Riang Lai
Bhadrawahi
Mahasu Pahari
Egyptian (Ancient)
Braj
Ghotuo
Adangbe
Esimbi
Awing
Bambili-Bambui
Babanki
Iceve-Maci
Bafut
Mmen
Bangandu
Baka (Cameroon)
Bakoko
Aweer
Ntcham
Terei
Bafaw-Balong
Bu-Nao Bunu
+ 6152 languages
Apply filters
Datasets
88
Full-text search
Edit filters
Sort: Trending
Active filters:
br
Clear all
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16
•
13M
•
52.1k
•
150
statmt/cc100
Updated
Mar 5
•
763
•
71
legacy-datasets/common_voice
Updated
Aug 22
•
143
•
132
Helsinki-NLP/open_subtitles
Updated
Jan 18
•
294
•
60
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
265
•
36
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
507
•
9
mozilla-foundation/common_voice_16_0
Viewer
•
Updated
Dec 21, 2023
•
8.2M
•
2.38k
•
65
afaji/cvqa
Viewer
•
Updated
14 days ago
•
10.4k
•
235
•
17
stanford-oval/ccnews
Viewer
•
Updated
Aug 31
•
893M
•
416
•
3
fsicoli/common_voice_18_0
Updated
Aug 15
•
27k
•
4
ahelk/ccaligned_multilingual
Updated
Jan 18
•
11
•
5
speechbrain/common_language
Updated
Jun 12, 2023
•
301
•
27
Helsinki-NLP/kde4
Updated
Jan 18
•
21
•
18
Helsinki-NLP/ofis_publik
Updated
Jan 18
•
14
•
1
Helsinki-NLP/opus_gnome
Viewer
•
Updated
Feb 22
•
59.5k
•
35
•
1
Helsinki-NLP/opus_ubuntu
Viewer
•
Updated
Feb 22
•
37.4k
•
36
•
3
Helsinki-NLP/qed_amara
Updated
Jan 18
•
23
•
5
senti-lex/senti_lex
Updated
Jun 8, 2023
•
14
•
7
community-datasets/tapaco
Viewer
•
Updated
Jun 26
•
3.85M
•
468
•
41
community-datasets/udhr
Updated
Jan 18
•
16
•
2
Previous
1
2
3
Next