Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Cebuano
Iloko
Waray (Philippines)
German
Russian
Spanish
French
Indonesian
Vietnamese
Italian
Dutch
Portuguese
English
Japanese
Central Kurdish
Hindi
Serbian
Swedish
Tamil
Chinese
Asturian
Czech
Indonesian
Korean
Minangkabau
Belarusian
Malayalam
Burmese
Romanian
Ukrainian
Afrikaans
Bengali
Danish
Hebrew
Urdu
Bulgarian
Greek
Finnish
Croatian
Hungarian
Slovak
Thai
Thai
Turkish
Vietnamese
Amharic
Arabic
Esperanto
Khmer
Kannada
Lithuanian
Burmese
Polish
Slovenian
Catalan
Irish
Galician
Gujarati
Icelandic
Somali
Estonian
Armenian
Georgian
Macedonian
Panjabi
Tajik
Achinese
Kazakh
Kyrgyz
Luxembourgish
Pedi
Tagalog
Welsh
Basque
Lao
Sinhala
Telugu
Marathi
Sindhi
Banjar
Javanese
Maithili
Maltese
Egyptian Arabic
Persian
Pangasinan
Haitian
Javanese
Latin
Swahili
+ 8064 languages
Languages with no match
rna
btk
zht
ma
py
fre
oy
du
chi
day
fst
oto
cif
tib
at
ger
xx
myn
cze
Apply filters
Datasets
135
Full-text search
Edit filters
Sort: Trending
Active filters:
ceb
Clear all
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
314k
•
287
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
CohereForAI/aya_collection
Viewer
•
Updated
Jun 28
•
514M
•
350
•
196
cis-lmu/m_lama
Updated
Jan 18
•
20
•
6
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
google/xtreme_s
Updated
29 days ago
•
498
•
54
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/fleurs
Updated
Aug 25
•
31.3k
•
243
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
sil-ai/bloom-speech
Updated
Feb 15, 2023
•
86
•
21
facebook/flores
Updated
Jan 18
•
97.6k
•
62
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
cis-lmu/udhr-lid
Viewer
•
Updated
Jul 20
•
27.8k
•
41
•
6
saillab/taco-datasets
Viewer
•
Updated
Dec 1, 2023
•
3.2M
•
1.97k
•
15
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
507
•
9
CohereForAI/aya_dataset
Viewer
•
Updated
Jun 28
•
206k
•
1.31k
•
264
CohereForAI/aya_collection_language_split
Viewer
•
Updated
Jun 28
•
514M
•
1.95k
•
74
jhu-clsp/kreyol-mt
Viewer
•
Updated
Jun 4
•
1.88M
•
95
•
7
cis-lmu/Taxi1500-RawData
Viewer
•
Updated
Jun 5
•
15.6M
•
1.36k
•
2
SEACrowd/flores200
Updated
Jun 24
•
10
•
1
akoksal/muri-it
Viewer
•
Updated
20 days ago
•
2.23M
•
2
Helsinki-NLP/bible_para
Updated
Jan 18
•
36
•
14
ahelk/ccaligned_multilingual
Updated
Jan 18
•
11
•
5
legacy-datasets/mc4
Updated
Mar 5
•
16
•
145
Helsinki-NLP/opus_ubuntu
Viewer
•
Updated
Feb 22
•
37.4k
•
36
•
3
Helsinki-NLP/qed_amara
Updated
Jan 18
•
23
•
5
Previous
1
2
3
...
5
Next