The fastest way to high-quality training data

A free, continuously updated index of open-source AI datasets. Search by domain, modality, size, license and more, all transparently linked to the original source.

Trending Now

Most viewed datasets this week

Browse by Category

Explore datasets organized by domain and use case

Recently Added

Newly discovered datasets from the past week

Featured Datasets

Handpicked high-quality datasets from across the web

Academic Datasets

Recently released datasets from research papers with EDA and analytics

Infinity-Chat

26k+ open-ended prompts designed to expose 'mode collapse' in LLM creativity.

nlp, instruct..., creative...

Analytics Summary
15,400
3,200

WorldModelBench

350+ physics-constrained scenarios to test video generation logic.

computer..., video-ge..., world-mo...

Analytics Summary
8,900
1,100

LiveBench

Monthly updated questions from arXiv, math competitions, and news.

llm-benc..., reasonin..., coding

Analytics Summary
42,000
5,600

STSBench (STSnu)

43 diverse driving scenarios with 971 verified spatial reasoning QA pairs.

computer..., autonomo..., vlm-reas...

Analytics Summary
6,500
890

WildBench

Real-world user queries evaluated using automated pairwise comparison.

llm-eval..., rlhf, user-ali...

Analytics Summary
21,000
2,800

OS-Marathon

Benchmarks agents on long-horizon, repetitive desktop tasks to test stability.

agents, computer..., robustne...

Analytics Summary
3,200
450

XLRS-Bench

Ultra-high-resolution (8.5k x 8.5k) remote sensing for MLLMs.

remote-s..., multimod..., satellit...

Analytics Summary
5,600
1,200

MuVR

Retrieving specific moments from long, untrimmed video streams.

video-re..., computer..., temporal...

Analytics Summary
4,100
670

AODRaw

7,700+ RAW sensor images for detection in adverse weather.

autonomo..., object-d..., low-ligh...

Analytics Summary
2,800
530

Request Datasets

Didn't find what you were looking for? Submit a request and we'll help you find or create the dataset you need.
