Hugging Face
Datasets: 27
Sort: Trending
Active filters: xmf
wikimedia/wikipedia • Viewer • Updated Jan 9 • 61.6M • 53.4k • 550
legacy-datasets/wikipedia • Updated Mar 11 • 565 • 550
uonlp/CulturaX • Viewer • Updated Jul 23 • 7.18B • 6.45k • 472
unimelb-nlp/wikiann • Viewer • Updated Feb 22 • 2M • 866 • 98
oscar-corpus/oscar • Updated Mar 21 • 1.23k • 174
oscar-corpus/OSCAR-2201 • Updated May 30, 2023 • 1.51k • 113
wikimedia/wit_base • Viewer • Updated Nov 4, 2022 • 108k • 518 • 52
MartinThoma/wili_2018 • Viewer • Updated Aug 8 • 235k • 37 • 4
oscar-corpus/OSCAR-2109 • Updated Nov 8, 2022 • 34 • 38
tner/wikiann • Updated Sep 27, 2022 • 136 • 5
olm/wikipedia • Updated Jan 23 • 290 • 32
reyoung/wikipedia • Updated Jan 13, 2023 • 1
livinNector/wikipedia • Updated Mar 28, 2023 • 1
graelo/wikipedia • Viewer • Updated Sep 10, 2023 • 105M • 1.67k • 64
baoanhtran/guanaco-llama2-200 • Updated Sep 24, 2023 • 5
cyanic-selkie/wikianc • Viewer • Updated Sep 5, 2023 • 269M • 5
openskyml/wikipedia • Updated Oct 8, 2023 • 3
cis-lmu/Glot500 • Viewer • Updated Jun 17 • 1.23B • 18 • 31
lbourdois/language_tags • Viewer • Updated Jan 21 • 27.3k • 5
NeuML/wikipedia • Updated Jan 11 • 2
lbourdois/panlex • Viewer • Updated Feb 3 • 24.6M • 4 • 7
mwalol/wikipapa • Updated Apr 2
bltlab/ParaNames • Viewer • Updated May 16 • 140M • 3
cis-lmu/GlotCC-V1 • Viewer • Updated Jul 11 • 1.28B • 15 • 43
espnet/mms_ulab_v2 • Viewer • Updated Jul 2 • 201k • 9 • 11
khulnasoft/banglawiki • Updated Jul 10
devngho/culturax-mini-nonshuffled • Viewer • Updated 24 days ago • 71.8M • 36