

DL3DV 10K
Overview :
DL3DV-10K is a large-scale real-world dataset containing over 10,000 high-quality videos. Each video is manually annotated with key scene points and complexity, and also provides camera pose, NeRF depth estimation, point clouds, and 3D meshes. The dataset can be used for general NeRF research, scene consistency tracking, visual language models, and other computer vision studies.
Target Users :
["General NeRF Model Research","Scene-level Consistency Tracking","Visual Language Model Research","3D Reconstruction","Virtual Reality","Augmented Reality","Autonomous Driving Visual Perception"]
Use Cases
Optimizing NeRF model performance using the DL3DV-10K dataset
Training a visual language model based on the DL3DV-10K dataset
Developing SLAM systems using viewing angles and scene information from DL3DV-10K
Features
Provide over 10,000 high-quality videos
Manually annotated scene key points and environmental complexity
Supplied with camera pose, NeRF depth, and other data
Supports research on advanced algorithms such as NeRF and visual language models
Featured AI Tools
Chinese Picks

Capcut Dreamina
CapCut Dreamina is an AIGC tool under Douyin. Users can generate creative images based on text content, supporting image resizing, aspect ratio adjustment, and template type selection. It will be used for content creation in Douyin's text or short videos in the future to enrich Douyin's AI creation content library.
AI image generation
9.0M

Outfit Anyone
Outfit Anyone is an ultra-high quality virtual try-on product that allows users to try different fashion styles without physically trying on clothes. Using a two-stream conditional diffusion model, Outfit Anyone can flexibly handle clothing deformation, generating more realistic results. It boasts extensibility, allowing adjustments for poses and body shapes, making it suitable for images ranging from anime characters to real people. Outfit Anyone's performance across various scenarios highlights its practicality and readiness for real-world applications.
AI image generation
5.3M