

PRISMA
Overview
PRISMA is a computational photography pipeline that runs a set of inferences on an image or video. Just as a prism refracts light into separate wavelengths, the pipeline expands an image into bands of data that can be used for 3D reconstruction or real-time post-processing. It integrates several algorithms and open-source pretrained models, including monocular depth (MiDaS v3.1, ZoeDepth, Marigold, PatchFusion), optical flow (RAFT), segmentation masks (mmdet), and camera pose estimation (COLMAP). Results are stored in a folder with the same name as the input file, with each band saved as a separate .png or .mp4 file. For videos, the final step attempts a sparse reconstruction, which can be used for training NeRFs (such as NVIDIA's Instant-NGP) or Gaussian Splatting. By default, the inferred depth is exported as a heatmap that can be decoded in real time with LYGIA's heatmap GLSL/HLSL sampler, and the optical flow is encoded as hue (angle) and saturation, which can be decoded in real time with LYGIA's optical flow GLSL/HLSL sampler.
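The hue/saturation encoding of optical flow mentioned above can be illustrated with a small round trip. This is a minimal sketch, not PRISMA's or LYGIA's actual code: the function names, the `MAX_FLOW` normalisation constant, the choice to store magnitude in saturation, and the hue origin are all assumptions made for illustration.

```python
import colorsys
import math

# Assumed normalisation constant (in pixels); not taken from PRISMA.
MAX_FLOW = 16.0


def encode_flow(dx, dy, max_flow=MAX_FLOW):
    """Map a 2D flow vector to an (r, g, b) triple with components in [0, 1].

    Direction goes into hue, magnitude (clamped) into saturation.
    """
    angle = math.atan2(dy, dx)                      # direction in [-pi, pi]
    hue = (angle / (2.0 * math.pi)) % 1.0           # wrapped into [0, 1]
    sat = min(math.hypot(dx, dy) / max_flow, 1.0)   # clamped magnitude
    return colorsys.hsv_to_rgb(hue, sat, 1.0)


def decode_flow(r, g, b, max_flow=MAX_FLOW):
    """Recover an approximate (dx, dy) vector from an encoded colour."""
    hue, sat, _ = colorsys.rgb_to_hsv(r, g, b)
    angle = hue * 2.0 * math.pi
    magnitude = sat * max_flow
    return magnitude * math.cos(angle), magnitude * math.sin(angle)


if __name__ == "__main__":
    colour = encode_flow(3.0, -4.0)
    print(colour, decode_flow(*colour))
```

With floating-point colours the round trip is close to exact; once the colour is quantised to 8 bits per channel in a .png or .mp4, the recovered vector is only an approximation, and any flow larger than the normalisation constant is clamped.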
Target Users
3D reconstruction
Image/video post-processing
Generating NeRF training data
Use Cases
Extracting multiple band information from images for analysis
Capturing depth/optical flow information from videos to create 3D effects
Serving as a data source for training NeRF networks
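For the analysis use case, the per-band output files can be collected with a few lines of Python. The sketch below assumes the output folder sits next to the input file and carries the input's name without its extension; the `list_bands` helper is hypothetical, and the actual band file names depend on which PRISMA steps were run.

```python
from pathlib import Path

# File types the pipeline is described as writing, one file per band.
BAND_SUFFIXES = {".png", ".mp4"}


def list_bands(input_file):
    """Return {band_name: path} for the output folder of `input_file`.

    Assumption: the folder is the input path with its extension removed.
    """
    out_dir = Path(input_file).with_suffix("")
    if not out_dir.is_dir():
        return {}
    return {
        p.stem: p
        for p in sorted(out_dir.iterdir())
        if p.is_file() and p.suffix.lower() in BAND_SUFFIXES
    }


if __name__ == "__main__":
    # "clip.mov" is a placeholder input name.
    for name, path in list_bands("clip.mov").items():
        print(name, "->", path)
```

Returning a dictionary keyed by band name keeps later steps, such as loading a depth band alongside a flow band, independent of the exact file extension.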
Features
Monocular depth inference
Optical flow estimation
Image segmentation
Camera pose estimation
Sparse 3D reconstruction