Hugging Face
Datasets: 24
Sort: Trending
Active filters: apc (Levantine Arabic)
CohereForAI/aya_dataset • Viewer • Updated Jun 28 • 206k • 1.34k • 266
Helsinki-NLP/tatoeba • Updated Jan 18 • 13 • 37
facebook/flores • Updated Jan 18 • 68.7k • 62
CohereForAI/xP3x • Updated Apr 10 • 357 • 67
Davlan/sib200 • Viewer • Updated Feb 19 • 206k • 13.6k • 7
ramybaly/arsentd_lev • Updated Jan 18 • 10 • 3
Muennighoff/flores200 • Updated Jan 7 • 20.8k • 11
vpermilp/nllb-200-distilled-600M-rust • Updated Mar 4, 2023
vpermilp/nllb-200-1.3B-rust • Updated Mar 4, 2023 • 2
visheratin/laion-coco-nllb • Viewer • Updated Apr 11 • 894k • 16 • 38
Muennighoff/xP3x-sample • Viewer • Updated Sep 18, 2023 • 28.4k • 3
cis-lmu/Glot500 • Viewer • Updated Jun 17 • 1.23B • 18 • 31
lbourdois/language_tags • Viewer • Updated Jan 21 • 27.3k • 5
ayymen/Weblate-Translations • Viewer • Updated Apr 2 • 11.7M • 49 • 7
lbourdois/panlex • Viewer • Updated Feb 3 • 24.6M • 4 • 7
Felladrin/ChatML-aya_dataset • Viewer • Updated Feb 17 • 202k
jaygala24/xsimplusplus • Viewer • Updated May 1 • 816k • 24
mteb/sib200 • Viewer • Updated May 7 • 397k • 5.23k • 1
cis-lmu/GlotCC-V1 • Viewer • Updated Jul 11 • 1.28B • 15 • 43
gentaiscool/bitext_sib200_miners • Viewer • Updated Jun 18 • 287k • 60 • 2
Svngoku/xP3x-Kongo • Viewer • Updated Jun 24 • 1.22M • 1 • 2
espnet/mms_ulab_v2 • Viewer • Updated Jul 2 • 201k • 9 • 11
robinhad/long_flores • Updated Aug 29 • 8
openlanguagedata/flores_plus • Viewer • Updated 5 days ago • 419k