DL3DV-10K
D
DL3DV 10K
Overview :
DL3DV-10K is a large-scale real-world dataset containing over 10,000 high-quality videos. Each video is manually annotated with key scene points and complexity, and also provides camera pose, NeRF depth estimation, point clouds, and 3D meshes. The dataset can be used for general NeRF research, scene consistency tracking, visual language models, and other computer vision studies.
Target Users :
["General NeRF Model Research","Scene-level Consistency Tracking","Visual Language Model Research","3D Reconstruction","Virtual Reality","Augmented Reality","Autonomous Driving Visual Perception"]
Total Visits: 359
Top Region: US(100.00%)
Website Views : 55.2K
Use Cases
Optimizing NeRF model performance using the DL3DV-10K dataset
Training a visual language model based on the DL3DV-10K dataset
Developing SLAM systems using viewing angles and scene information from DL3DV-10K
Features
Provide over 10,000 high-quality videos
Manually annotated scene key points and environmental complexity
Supplied with camera pose, NeRF depth, and other data
Supports research on advanced algorithms such as NeRF and visual language models
AIbase
Empowering the Future, Your AI Solution Knowledge Base
© 2025AIbase