Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Romanian
German
French
Italian
Spanish
Portuguese
English
Dutch
Polish
Czech
Hungarian
Greek
Swedish
Russian
Danish
Finnish
Bulgarian
Turkish
Arabic
Slovak
Chinese
Indonesian
Slovenian
Ukrainian
Japanese
Lithuanian
Korean
Estonian
Vietnamese
Hindi
Catalan
Croatian
Latvian
Persian
Hebrew
Thai
Bengali
Serbian
Tamil
Basque
Urdu
Galician
Maltese
Marathi
Irish
Afrikaans
Macedonian
Malayalam
Georgian
Icelandic
Armenian
Esperanto
Telugu
Welsh
Kazakh
Albanian
Belarusian
Gujarati
Norwegian
Swahili
Azerbaijani
Nepali
Burmese
Amharic
Kannada
Panjabi
Kyrgyz
Mongolian
Malay
Uzbek
Khmer
Sinhala
Tagalog
Tatar
Somali
Hausa
Yoruba
Lao
Pashto
Breton
Uyghur
Assamese
Bosnian
Igbo
Tajik
Latin
Western Frisian
Scottish Gaelic
Norwegian Nynorsk
Kurdish
+ 1781 languages
Languages with no match
Tagalog
rna
Agusan Manobo
Haryanvi
Ambai
Guanano
Kisar
Paranan
Tboli
btk
Denya
Arapaho
Blagar
Siksika
Southern Carrier
Chipaya
Cavineña
Dogrib
Eastern Bontok
Wipi
Eastern Bolivian Guaraní
Gunwinggu
Golin
Ignaciano
Kaingang
Lacandon
Macushi
Maca
Coatlán Mixe
Martu Wangka
Mwera (Chimwera)
Nggem
Chumburung
Nyangumarta
Southeastern Puebla Nahuatl
Pogolo
Safwa
Epena
Takia
Tiruray
Tacana
Tuyuca
Iduna
Wapishana
Vwanji
Wik-Mungkan
Walmajarri
Xavánte
Cajonos Zapotec
Zaramo
Yatzachi Zapotec
Zigula
Zoogocho Zapotec
Arifama-Miniafia
Angor
Amanab
Amo
Bumbita Arapesh
Apinayé
Western Apache
Western Arrarnta
Waimaha
Kaluli
Beaver
Beami
Bughotu
Barok
Bedjond
Baruga
Bakairí
Ghayavi
Muinane
Buamu
Baeggu
Qaqet
Mapos Buang
Carapana
Cacua
Comaltepec Chinantec
Eastern Khumi Chin
Tabasco Chontal
Ashéninka Pajonal
Lealao Chinantec
Cerma
Lalana Chinantec
Tepetotutla Chinantec
Palantla Chinantec
Sochiapam Chinantec
Western Highland Chatino
Usila Chinantec
+ 6212 languages
Apply filters
Datasets
341
Full-text search
Edit filters
Sort: Trending
Active filters:
ro
Clear all
FBK-MT/mosel
Viewer
•
Updated
8 days ago
•
51.1M
•
235
•
50
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
314k
•
287
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
recursal/SuperWikiImage-7M
Updated
3 days ago
•
1
•
3
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16
•
13M
•
52.1k
•
150
haoranxu/X-ALMA-Preference
Viewer
•
Updated
3 days ago
•
772k
•
4
•
2
statmt/cc100
Updated
Mar 5
•
763
•
71
legacy-datasets/common_voice
Updated
Aug 22
•
143
•
132
Helsinki-NLP/europarl
Viewer
•
Updated
Feb 27
•
186M
•
2.32k
•
17
cis-lmu/m_lama
Updated
Jan 18
•
20
•
6
Helsinki-NLP/open_subtitles
Updated
Jan 18
•
294
•
60
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
265
•
36
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
facebook/voxpopuli
Viewer
•
Updated
Oct 14, 2022
•
169k
•
6.17k
•
83
mteb/amazon_massive_intent
Viewer
•
Updated
May 7
•
1.69M
•
24.1k
•
20
facebook/flores
Updated
Jan 18
•
97.6k
•
62
joelniklaus/mapa
Viewer
•
Updated
Oct 25, 2022
•
41.8k
•
7
•
5
joelniklaus/lextreme
Viewer
•
Updated
Apr 29, 2023
•
1.65M
•
29
•
18
joelniklaus/Multi_Legal_Pile
Updated
Jan 12
•
392
•
42
joelniklaus/eurlex_resources
Updated
May 10, 2023
•
2
•
7
joelniklaus/legal-mc4
Viewer
•
Updated
Aug 6, 2023
•
4.74M
•
6
•
11
OpenAssistant/oasst1
Viewer
•
Updated
May 2, 2023
•
88.8k
•
3.2k
•
1.26k
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
Previous
1
2
3
...
12
Next