Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
1
Licenses
Other
Reset Languages
Goan Konkani
Maithili
Assamese
Gujarati
Hindi
Kannada
Malayalam
Marathi
Panjabi
Tamil
Urdu
Bengali
Oriya
Sanskrit
Telugu
Santali
Cebuano
Central Kurdish
Iloko
Nepali
Sindhi
Asturian
Karachay-Balkar
Yakut
Sicilian
Waray (Philippines)
Egyptian Arabic
South Azerbaijani
Russia Buriat
English
Upper Sorbian
Lojban
Lombard
Eastern Mari
Minangkabau
Low German
Venetian
Wu Chinese
Tosk Albanian
Tibetan
Bishnupriya
Lower Sorbian
Mazanderani
Newari
Piemontese
Western Panjabi
Pashto
Tuvinian
Kalmyk
Bavarian
Lezghian
Erzya
Pampanga
Mingrelian
Western Mari
Neapolitan
Afrikaans
Amharic
Aragonese
Bashkir
Central Bikol
Belarusian
Bulgarian
Breton
Bosnian
Catalan
Crimean Tatar
Czech
Chuvash
Welsh
Danish
German
Dimli (individual language)
Divehi
Greek
Esperanto
Spanish
Basque
Finnish
French
Northern Frisian
Western Frisian
Irish
Scottish Gaelic
Galician
Hebrew
Croatian
Hungarian
Armenian
Interlingua
+ 8031 languages
Languages with no match
Avestan
rna
iw
btk
Ojibwe
zht
cn
jp
ns
gr
ma
py
qaa
me
in
kar
sgn
cz
ji
fre
us
ua
bp
dog
oy
du
chi
day
fst
oto
crp
iro
roa
cif
tib
at
ger
xx
myn
bd
by
eg
fu
gb
lk
np
pk
pr
pu
qt
sai
cze
Apply filters
Datasets
37
Full-text search
Edit filters
Sort: Trending
Active filters:
gom
Clear all
wikimedia/wikipedia
Viewer
•
Updated
Jan 9
•
61.6M
•
53.4k
•
550
legacy-datasets/wikipedia
Updated
Mar 11
•
565
•
550
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
6.45k
•
472
oscar-corpus/oscar
Updated
Mar 21
•
1.23k
•
174
Helsinki-NLP/tatoeba
Updated
Jan 18
•
13
•
37
oscar-corpus/OSCAR-2201
Updated
May 30, 2023
•
1.51k
•
113
ayymen/Pontoon-Translations
Viewer
•
Updated
Jan 19
•
3.56M
•
16
•
9
google/IndicGenBench_flores_in
Updated
May 4
•
809
•
5
google/IndicGenBench_crosssum_in
Updated
May 4
•
138
•
4
oscar-corpus/OSCAR-2109
Updated
Nov 8, 2022
•
34
•
38
ai4bharat/IndicCOPA
Updated
Dec 15, 2022
•
221
•
3
olm/wikipedia
Updated
Jan 23
•
290
•
32
reyoung/wikipedia
Updated
Jan 13, 2023
•
1
livinNector/wikipedia
Updated
Mar 28, 2023
•
1
graelo/wikipedia
Viewer
•
Updated
Sep 10, 2023
•
105M
•
1.67k
•
64
satpalsr/indicCorpv2
Updated
Jul 31, 2023
•
2
•
2
baoanhtran/guanaco-llama2-200
Updated
Sep 24, 2023
•
5
cyanic-selkie/wikianc
Viewer
•
Updated
Sep 5, 2023
•
269M
•
5
ai4bharat/IN22-Gen
Updated
Dec 20, 2023
•
472
•
3
ai4bharat/IN22-Conv
Updated
Dec 20, 2023
•
4
•
6
openskyml/wikipedia
Updated
Oct 8, 2023
•
3
cis-lmu/Glot500
Viewer
•
Updated
Jun 17
•
1.23B
•
18
•
31
lbourdois/language_tags
Viewer
•
Updated
Jan 21
•
27.3k
•
5
NeuML/wikipedia
Updated
Jan 11
•
2
lbourdois/panlex
Viewer
•
Updated
Feb 3
•
24.6M
•
4
•
7
cointegrated/panlex-meanings
Viewer
•
Updated
Mar 24
•
78.2M
•
9
mwalol/wikipapa
Updated
Apr 2
nthakur/indic-swim-ir-cross-lingual
Viewer
•
Updated
Apr 28
•
93k
•
2
google/IndicGenBench_xorqa_in
Updated
May 4
•
273
•
2
mteb/IN22-Conv
Viewer
•
Updated
May 14
•
1.5k
•
15
Previous
1
2
Next