Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Bulgarian
German
French
Spanish
Italian
Portuguese
English
Dutch
Greek
Polish
Romanian
Hungarian
Czech
Danish
Finnish
Swedish
Slovak
Russian
Lithuanian
Slovenian
Estonian
Turkish
Arabic
Korean
Japanese
Chinese
Ukrainian
Vietnamese
Indonesian
Latvian
Catalan
Thai
Hebrew
Croatian
Hindi
Persian
Serbian
Bengali
Urdu
Basque
Galician
Maltese
Macedonian
Tamil
Afrikaans
Irish
Icelandic
Esperanto
Marathi
Belarusian
Georgian
Albanian
Kazakh
Malayalam
Welsh
Armenian
Swahili
Telugu
Azerbaijani
Amharic
Norwegian
Malay
Burmese
Panjabi
Tagalog
Uzbek
Gujarati
Kyrgyz
Nepali
Kannada
Sinhala
Mongolian
Yoruba
Khmer
Tatar
Uyghur
Bosnian
Hausa
Somali
Lao
Assamese
Breton
Igbo
Latin
Pashto
Scottish Gaelic
Norwegian Nynorsk
Sindhi
Tajik
Javanese
+ 1593 languages
Languages with no match
Tagalog
rna
Baikeno
Akha
Manambu
Arosi
Awa (Papua New Guinea)
Akoose
Kenyang
Kwaio
Lahu
Eastern Apurímac Quechua
Mitla Zapotec
Agusan Manobo
Haryanvi
Alamblak
Western Bru
Kimaragang
Masaaba
Northern Thai
Ambai
Guanano
Kisar
Paranan
Tboli
btk
Brao
Batak
Upper Kinabatangan
Western Dani
Lü
Krung
Eastern Lawa
Mandar
Makasae
Nauete
Ruching Palaung
Panao Huánuco Quechua
Semai
Surigaonon
Sangil
Tukudede
Tampuan
Wambon
Yetfa
Denya
Arapaho
Blagar
Siksika
Southern Carrier
Chipaya
Cavineña
Dogrib
Eastern Bontok
Wipi
Eastern Bolivian Guaraní
Gunwinggu
Golin
Ignaciano
Kaingang
Lacandon
Macushi
Maca
Coatlán Mixe
Martu Wangka
Mwera (Chimwera)
Nggem
Chumburung
Nyangumarta
Southeastern Puebla Nahuatl
Pogolo
Safwa
Epena
Takia
Tiruray
Tacana
Tuyuca
Iduna
Wapishana
Vwanji
Wik-Mungkan
Walmajarri
Xavánte
Cajonos Zapotec
Zaramo
Yatzachi Zapotec
Zigula
Zoogocho Zapotec
Bunak
En
+ 6400 languages
Apply filters
Datasets
238
Full-text search
Edit filters
Sort: Trending
Active filters:
bg
Clear all
FBK-MT/mosel
Viewer
•
Updated
8 days ago
•
51.1M
•
235
•
50
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
314k
•
287
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
recursal/SuperWikiImage-7M
Updated
3 days ago
•
1
•
3
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
mozilla-foundation/common_voice_17_0
Viewer
•
Updated
Jun 16
•
13M
•
52.1k
•
150
haoranxu/X-ALMA-Preference
Viewer
•
Updated
3 days ago
•
772k
•
4
•
2
statmt/cc100
Updated
Mar 5
•
763
•
71
Helsinki-NLP/europarl
Viewer
•
Updated
Feb 27
•
186M
•
2.32k
•
17
mhardalov/exams
Viewer
•
Updated
Feb 6
•
136k
•
6.59k
•
29
cis-lmu/m_lama
Updated
Jan 18
•
20
•
6
Helsinki-NLP/open_subtitles
Updated
Jan 18
•
294
•
60
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
Helsinki-NLP/tatoeba
Updated
Jan 18
•
35
•
37
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
facebook/xnli
Viewer
•
Updated
Jan 5
•
6.4M
•
3.8k
•
49
google/xtreme
Viewer
•
Updated
Feb 22
•
2.77M
•
2.72k
•
88
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
265
•
36
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
facebook/flores
Updated
Jan 18
•
97.6k
•
62
joelniklaus/mapa
Viewer
•
Updated
Oct 25, 2022
•
41.8k
•
7
•
5
joelniklaus/lextreme
Viewer
•
Updated
Apr 29, 2023
•
1.65M
•
29
•
18
joelniklaus/Multi_Legal_Pile
Updated
Jan 12
•
392
•
42
joelniklaus/eurlex_resources
Updated
May 10, 2023
•
2
•
7
joelniklaus/legal-mc4
Viewer
•
Updated
Aug 6, 2023
•
4.74M
•
6
•
11
OpenAssistant/oasst1
Viewer
•
Updated
May 2, 2023
•
88.8k
•
3.2k
•
1.26k
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
Previous
1
2
3
...
8
Next