DL3DV-10K: A large-scale real-world dataset for deep learning 3D vision research

AI image generation | AI data mining | #Dataset #3D Vision #NeRF #Visual Language #Virtual Reality #Augmented Reality | Standard Picks | Open Source

Overview:

DL3DV-10K is a large-scale real-world dataset containing over 10,000 high-quality videos. Each video is manually annotated with scene key points and a complexity rating, and comes with camera poses, NeRF-estimated depth, point clouds, and 3D meshes. The dataset can be used for general NeRF research, scene consistency tracking, vision-language models, and other computer vision studies.

Target Users:

General NeRF Model Research

Scene-level Consistency Tracking

Visual Language Model Research

3D Reconstruction

Virtual Reality

Augmented Reality

Autonomous Driving Visual Perception

Total Visits: 359

Top Region: US (100.00%)

Website Views: 55.2K

Use Cases

Optimizing NeRF model performance using the DL3DV-10K dataset

Training a vision-language model on the DL3DV-10K dataset

Developing SLAM systems using the camera poses and scene information from DL3DV-10K

Features

Provides over 10,000 high-quality videos

Manually annotated scene key points and environmental complexity

Includes camera poses, NeRF-estimated depth, point clouds, and 3D meshes

Supports research on advanced algorithms such as NeRF and vision-language models
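
As a rough illustration of how the per-video camera poses might be consumed, the sketch below reads camera positions from a pose file and measures the length of the camera path. The file layout is an assumption: it follows a nerfstudio-style `transforms.json` with a `frames` list and a 4x4 camera-to-world `transform_matrix` per frame. The sample data, file paths, and the helper names `camera_centers` and `path_length` are hypothetical, so the actual DL3DV-10K keys and conventions should be checked against the dataset's own documentation.

```python
import json
import math

# Hypothetical sample in a nerfstudio-style layout; the real DL3DV-10K
# file names and keys may differ, so check the dataset documentation.
SAMPLE_POSES = json.loads("""
{
  "frames": [
    {"file_path": "images/frame_00001.png",
     "transform_matrix": [[1, 0, 0, 0.0], [0, 1, 0, 0.0],
                          [0, 0, 1, 0.0], [0, 0, 0, 1]]},
    {"file_path": "images/frame_00002.png",
     "transform_matrix": [[1, 0, 0, 3.0], [0, 1, 0, 4.0],
                          [0, 0, 1, 0.0], [0, 0, 0, 1]]}
  ]
}
""")


def camera_centers(poses):
    """Return the (x, y, z) camera position of each frame.

    Assumes each transform_matrix is a 4x4 camera-to-world matrix, so the
    camera center is the translation column (last entry of the first 3 rows).
    """
    return [tuple(float(row[3]) for row in frame["transform_matrix"][:3])
            for frame in poses["frames"]]


def path_length(centers):
    """Sum of straight-line distances between consecutive camera centers."""
    return sum(math.dist(a, b) for a, b in zip(centers, centers[1:]))


centers = camera_centers(SAMPLE_POSES)
print(centers)
print(path_length(centers))
```

With real data, `SAMPLE_POSES` would be replaced by `json.load` on the pose file of one video. The same camera centers could then serve as a sanity check before NeRF training or SLAM evaluation, for example to spot videos with almost no camera motion.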