SceneSplat-49K Dataset
We present SceneSplat-49K, a large-scale 3D Gaussian Splatting dataset comprising approximately
49K raw scenes and 46K curated 3DGS scenes aggregated from SceneSplat-7K, DL3DV-10K, HoliCity,
Aria Synthetic Environments, and newly collected crowdsourced data. The corpus spans diverse indoor
and outdoor environments, from rooms and apartments to streets. To support 3DGS scene understanding,
12K scenes are further enriched with per-primitive vision-language embeddings extracted using
state-of-the-art vision-language models.
Appearance, Geometry, and Scale Statistics of the SceneSplat-49K Dataset. Distributions of photometric (PSNR, SSIM, LPIPS) and geometric (depth ℓ1) reconstruction errors show consistently high-quality renders across scenes, while the wide spread in total Gaussian number and indoor/outdoor scene floor area demonstrates the dataset’s diversity. The curves are convolved from the bucket values and vertical dotted lines mark the mean of each metric.