Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
Edit Datasets filters
Main
Tasks
Libraries
Languages
Licenses
Other
Modalities
Reset Modalities
3D
Audio
Geospatial
Image
Tabular
Text
Time-series
Video
Size (rows)
Reset Size
1B
10B
Format
Reset Format
json
csv
parquet
imagefolder
soundfolder
webdataset
text
arrow
Apply filters
Datasets
103
Full-text search
Edit filters
Sort: Trending
Active filters:
1B<n<10B
Clear all
HuggingFaceFW/fineweb-edu
Viewer
•
Updated
Aug 25
•
3B
•
64.1k
•
502
bigcode/the-stack-v2
Viewer
•
Updated
Apr 23
•
5.45B
•
303
•
270
laion/relaion2B-en-research-safe
Viewer
•
Updated
Jul 2
•
2.1B
•
310
•
180
laion/aesthetics_v2_4.5
Viewer
•
Updated
Sep 1
•
1.33B
•
31
•
17
fddemarco/pushshift-reddit-comments
Viewer
•
Updated
May 14, 2023
•
1.85B
•
7
•
8
uonlp/CulturaX
Viewer
•
Updated
Jul 23
•
7.18B
•
12k
•
469
rasoul-nikbakht/TSpec-LLM
Updated
Jun 14
•
2
•
27
MaLA-LM/mala-monolingual-integration
Viewer
•
Updated
12 days ago
•
1.14B
•
25
•
2
Salesforce/fineweb_deduplicated
Viewer
•
Updated
25 days ago
•
6.43B
•
208
•
24
GAIR/MathPile
Preview
•
Updated
Jun 13
•
19
•
178
GAIR/MathPile_Commercial
Preview
•
Updated
Mar 25
•
131
•
26
bigcode/the-stack-v2-dedup
Viewer
•
Updated
Apr 23
•
2.3B
•
1.01k
•
56
mercari-us/merrec
Viewer
•
Updated
Mar 9
•
1.27B
•
5
•
1
Zyphra/Zyda
Viewer
•
Updated
Jun 19
•
4.35B
•
86
•
64
FraunhoferIOSB/Synset-Boulevard
Updated
May 27
•
1
•
1
laion/relaion2B-en-research
Viewer
•
Updated
Jun 5
•
2.16B
•
65
•
22
MaLA-LM/mala-monolingual-filter
Viewer
•
Updated
12 days ago
•
1.42B
•
2
mlfoundations/dclm-baseline-1.0-parquet
Viewer
•
Updated
Jul 19
•
2.73B
•
499
•
23
bigdata-pw/Flickr
Viewer
•
Updated
10 days ago
•
5.15B
•
520
•
28
community-datasets/hrwac
Updated
Jan 18
•
18
laion/relaion2B-multi-research-safe
Viewer
•
Updated
Jul 3
•
2.06B
•
6
•
39
laion/relaion1b-nolang-research-safe
Viewer
•
Updated
Jul 3
•
1.19B
•
1
•
5
openclimatefix/uk_pv
Updated
1 day ago
•
1
•
10
carolina-c4ai/corpus-carolina
Updated
about 8 hours ago
•
748
•
19
LHF/escorpius-mr
Viewer
•
Updated
May 11, 2023
•
1.42B
•
1
•
3
maykcaldas/dbSelfiesWL
Viewer
•
Updated
Aug 24, 2022
•
1.01B
•
2
TheGreatRambler/mm2_level_played
Viewer
•
Updated
Nov 11, 2022
•
1.07B
•
2
•
1
laion/laion2B-multi-joined-translated-to-en
Viewer
•
Updated
Jul 14
•
2.06B
•
12
•
5
laion/laion1B-nolang-joined-translated-to-en
Viewer
•
Updated
Jul 14
•
1.18B
•
1
•
1
jhu-clsp/bernice-pretrain-data
Updated
Jan 3, 2023
•
1
•
5
Previous
1
2
3
4
Next