

FlowVid
Overview:
FlowVid is an optical-flow-guided video synthesis model. It uses the spatial and temporal information in optical flow to keep video frames temporally consistent. It works with existing image synthesis models to support edits such as stylization, object swapping, and local editing. Generation is fast: a 4-second, 30 FPS, 512×512 video takes about 1.5 minutes, which is 3.1x, 7.2x, and 10.5x faster than CoDeF, Rerender, and TokenFlow, respectively. In user evaluations, FlowVid was preferred 45.7% of the time, ahead of CoDeF (3.5%), Rerender (10.2%), and TokenFlow (40.4%).
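The speed figures above can be turned into per-frame numbers with a little arithmetic. The sketch below assumes the 3.1x, 7.2x, and 10.5x multipliers are runtime ratios for the same clip; the baseline runtimes it prints are derived from that assumption and are not figures reported on this page.

```python
# Figures quoted in the overview.
duration_s = 4            # clip length in seconds
fps = 30                  # frames per second
flowvid_minutes = 1.5     # reported FlowVid generation time
speedups = {"CoDeF": 3.1, "Rerender": 7.2, "TokenFlow": 10.5}

frames = duration_s * fps                          # 120 frames
seconds_per_frame = flowvid_minutes * 60 / frames  # 0.75 s per frame

# Assumption: each multiplier is (baseline runtime) / (FlowVid runtime).
implied_minutes = {
    name: round(flowvid_minutes * factor, 2) for name, factor in speedups.items()
}

print(f"{frames} frames, {seconds_per_frame:.2f} s per frame")
for name, minutes in implied_minutes.items():
    print(f"{name}: about {minutes} min for the same clip (derived)")
```

Under that reading, the baselines would need roughly 4.65, 10.8, and 15.75 minutes for the same 120-frame clip.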
Target Users:
FlowVid suits video synthesis work that needs temporally consistent output. Stylization, object swapping, and local editing are all driven by editing the first frame of the video.
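To give a feel for how optical flow can carry a first-frame edit to later frames, here is a minimal NumPy sketch of backward warping. It is a generic illustration of flow-based warping, not FlowVid's actual code: the function name `warp_with_flow`, the `(dx, dy)` flow layout, and the nearest-neighbour sampling are all assumptions made for this example.

```python
import numpy as np

def warp_with_flow(frame: np.ndarray, flow: np.ndarray) -> np.ndarray:
    """Backward-warp `frame` (H, W, C) using `flow` (H, W, 2).

    Assumed convention: flow[y, x] = (dx, dy) means output pixel (x, y)
    takes its value from source pixel (x + dx, y + dy). Sampling is
    nearest-neighbour, and out-of-range coordinates are clamped to the border.
    """
    h, w = frame.shape[:2]
    ys, xs = np.mgrid[0:h, 0:w]  # pixel coordinate grids
    src_x = np.clip(np.rint(xs + flow[..., 0]).astype(int), 0, w - 1)
    src_y = np.clip(np.rint(ys + flow[..., 1]).astype(int), 0, h - 1)
    return frame[src_y, src_x]

# Toy example: a 4x4 single-channel "edited first frame".
first_frame = np.arange(16, dtype=np.float32).reshape(4, 4, 1)

# Flow that samples one pixel to the right everywhere (dx = 1, dy = 0).
flow = np.zeros((4, 4, 2), dtype=np.float32)
flow[..., 0] = 1.0

warped = warp_with_flow(first_frame, flow)
```

Real flow estimates are imperfect, so a warped frame like this is best treated as a guide for the synthesis model rather than as the final output.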
Use Cases
Transform the style of a video into a cartoon style.
Replace the characters in a video with other people.
Perform local edits on specific areas within a video.
Features
Optical Flow Guided Video Synthesis
Supports stylization, object swapping, and local editing
Fast generation speed
Featured AI Tools

Open-Sora-Plan
Open-Sora-Plan is an open-source project that aims to reproduce OpenAI's Sora (a text-to-video model) and to build up knowledge of Video-VQVAE (VideoGPT) + DiT. It was initiated by the Peking University-Tuzhan (Rabbitpre) AIGC Joint Lab. The project currently has limited resources and seeks contributions from the open-source community. It provides training code and welcomes pull requests.
AI Video Generation

FunClip
FunClip is a fully open-source, locally deployed tool for automated video clipping. It uses the open-source FunASR Paraformer models from Alibaba's Tongyi Lab to run speech recognition on a video. Users then select text segments or speakers from the recognition results and click the clip button to get the corresponding video segments. FunClip integrates Alibaba's open-source, industrial-grade Paraformer-Large model, one of the best-performing open-source Chinese ASR models currently available, which also predicts accurate timestamps as part of the same model.
AI Video Editing