Miradata : Large-scale long-form video dataset with structured subtitles

Miradata

AI video generation AI datasets #Video dataset #Long video #Structured subtitles #GPT-4V Fresh Picks Open Source

Overview :

MiraData is a large-scale video dataset that focuses on long-form video clips, with an average duration of 72 seconds, providing structured subtitles with an average length of 318 words, enhancing the description of video content. Through the use of technologies such as GPT-4V, MiraData demonstrates high accuracy and semantic coherence in the fields of video understanding and subtitle generation.

Target Users :

MiraData is designed for researchers and developers who require large-scale long-form video datasets and high-quality subtitles, particularly in the fields of video understanding and generation, and machine learning model training.

Total Visits： 474.6M

Top Region： US(19.34%)

Website Views ： 48.3K

How to Use

1. Download MiraData's metadata files from Google Drive or HuggingFace Dataset.

2. Use the provided script to download video samples.

3. Split and process video samples as needed.

4. Generate video subtitles using tools like GPT-4V.

5. Utilize MiraBench to assess the quality of generated videos.

6. Follow the license agreement to reasonably use the dataset in research or development.