JASCO : Music generation model that combines text and audio conditioning.

JASCO

Music Generation AI Model #Music Generation #Text-to-Music #Symbolic Conditioning #Audio Conditioning Standard Picks Paid

Overview :

JASCO is a text-to-music generation model that combines symbolic and audio-based conditioning. It can generate high-quality music samples based on global text descriptions and fine-grained local controls. Built upon the stream matching modeling paradigm and a novel conditioning method, JASCO allows music generation to be controlled simultaneously by both local (e.g., chord) and global (text description) cues. By utilizing information bottleneck layers and temporal blurring, it extracts information relevant to specific controls, enabling the combination of symbolic and audio-based conditioning within the same text-to-music model.

Target Users :

JASCO is suitable for music creators, music theorists, and anyone interested in music generation technology. It can help users generate music that conforms to specific styles and emotions through text descriptions, providing new tools and inspiration sources for music composition.

Total Visits： 0

Website Views ： 58.0K

Use Cases

Music creators use JASCO to generate music in specific styles based on text descriptions.

Music theorists utilize JASCO to explore the impact of different text descriptions on music generation.

Educators use JASCO as a teaching tool to help students understand the relationship between music and text.

Features

Supports global text descriptions and fine-grained local controls.

Based on the stream matching modeling paradigm and novel conditioning methods.

Applies information bottleneck layers and temporal blurring techniques.

Can combine symbolic and audio-based conditioning.

Evaluates generation quality and conditioning adherence through objective metrics and human studies.