Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Norwegian
English
Danish
German
Swedish
French
Spanish
Portuguese
Italian
Dutch
Finnish
Russian
Polish
Turkish
Romanian
Hungarian
Arabic
Czech
Korean
Greek
Japanese
Indonesian
Chinese
Ukrainian
Hindi
Bulgarian
Vietnamese
Catalan
Estonian
Lithuanian
Serbian
Persian
Slovenian
Thai
Hebrew
Croatian
Slovak
Tamil
Bengali
Latvian
Icelandic
Marathi
Urdu
Afrikaans
Telugu
Malayalam
Basque
Albanian
Macedonian
Malay
Swahili
Armenian
Azerbaijani
Welsh
Galician
Belarusian
Kazakh
Georgian
Nepali
Esperanto
Gujarati
Tagalog
Uzbek
Amharic
Irish
Kannada
Latin
Panjabi
Burmese
Norwegian Nynorsk
Sinhala
Khmer
Maltese
Bosnian
Western Frisian
Kurdish
Somali
Breton
Scottish Gaelic
Javanese
Kyrgyz
Mongolian
Sindhi
Yiddish
Malagasy
Tajik
Yoruba
Norwegian Bokmål
Pashto
Uyghur
+ 503 languages
Languages with no match
Indonesian
code
Vietnamese
Thai
Javanese
Burmese
Sundanese
Tagalog
Khmer
Malay (individual language)
Hindi
Lao
Portuguese
Russian
Spanish
Hausa
German
Malayalam
Urdu
Yoruba
Gujarati
Kannada
Dutch
Korean
Panjabi
Amharic
Romanian
Telugu
Ganda
Afrikaans
Kinyarwanda
Xhosa
Igbo
Turkish
Ukrainian
Nyanja
Somali
Lithuanian
Serbian
Zulu
Czech
Hebrew
Finnish
Shona
Tajik
Modern Greek (1453-)
Wolof
Southern Sotho
Belarusian
Croatian
Kirghiz
Arabic
Assamese
Welsh
Irish
Slovak
Haitian
Lingala
Slovenian
Galician
Macedonian
Sanskrit
Armenian
Georgian
Maltese
Persian
Kazakh
Luxembourgish
Maori
Sindhi
Esperanto
Malay (macrolanguage)
Twi
Tswana
Occitan (post 1500)
Tsonga
Standard Arabic
Scottish Gaelic
Basque
Kikuyu
Uzbek
Ewe
Makasar
Akan
Swahili (macrolanguage)
Latin
Uighur
Albanian
Sinhala
Bambara
+ 7490 languages
Apply filters
Datasets
201
Full-text search
Edit filters
Sort: Trending
Active filters:
no
Clear all
allenai/c4
Viewer
•
Updated
Jan 9
•
10.4B
•
314k
•
287
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
51.1k
•
546
recursal/SuperWikiImage-7M
Updated
3 days ago
•
1
•
3
apple/mkqa
Updated
Jan 18
•
2.08k
•
36
unimelb-nlp/wikiann
Viewer
•
Updated
Feb 22
•
2M
•
807
•
96
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
haoranxu/X-ALMA-Preference
Viewer
•
Updated
3 days ago
•
772k
•
4
•
2
statmt/cc100
Updated
Mar 5
•
763
•
71
Helsinki-NLP/open_subtitles
Updated
Jan 18
•
294
•
60
Helsinki-NLP/opus-100
Viewer
•
Updated
Feb 28
•
55.1M
•
3.14k
•
140
Helsinki-NLP/opus_books
Viewer
•
Updated
Mar 29
•
1.25M
•
789
•
47
oscar-corpus/oscar
Updated
Mar 21
•
399
•
173
legacy-datasets/wikipedia
Updated
Mar 11
•
976
•
548
NbAiLab/norwegian_parliament
Viewer
•
Updated
Apr 16
•
6k
•
92
•
3
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
173
•
113
google/wit
Viewer
•
Updated
Jul 4, 2022
•
2.66M
•
265
•
36
wikimedia/wit_base
Viewer
•
Updated
Nov 4, 2022
•
108k
•
433
•
51
CohereForAI/xP3x
Updated
Apr 10
•
1.08k
•
67
tollefj/massive-en-no-shorter-transfer
Viewer
•
Updated
Aug 23, 2023
•
758k
•
2
•
1
facebook/belebele
Viewer
•
Updated
Aug 12
•
110k
•
67.3k
•
94
saillab/taco-datasets
Viewer
•
Updated
Dec 1, 2023
•
3.2M
•
1.97k
•
15
alexandrainst/m_arc
Viewer
•
Updated
Jan 15
•
87.4k
•
126k
•
4
alexandrainst/m_mmlu
Viewer
•
Updated
Mar 11
•
488k
•
326k
•
13
afaji/cvqa
Viewer
•
Updated
14 days ago
•
10.4k
•
235
•
17
stanford-oval/ccnews
Viewer
•
Updated
Aug 31
•
893M
•
416
•
3
Nikity/Pornhub
Updated
Aug 26
•
10
•
17
Helsinki-NLP/bible_para
Updated
Jan 18
•
36
•
14
ahelk/ccaligned_multilingual
Updated
Jan 18
•
11
•
5
community-datasets/europa_eac_tm
Viewer
•
Updated
Jun 24
•
12.8k
•
25
•
3
community-datasets/europa_ecdc_tm
Viewer
•
Updated
Jun 24
•
7.67k
•
26
•
2
Previous
1
2
3
...
7
Next