AI

Redefining Music AI: The Power of Sony’s SoniDo as a Versatile Foundation Model

A Foundation It refers to a pre -trained model developed on comprehensive data collections, designed to be multi -use and adaptable to a set of clinic tasks. These models have received widespread attention and are increasingly combined into daily applications. However, the field of music production lacks a strong foundation model capable of treating various music tasks.

In a new paper Music Foundation Model as a general supporter of music tasksSony research team SondoThe leading music institution model (MFM). Sonido is designed to extract hierarchical features from targeted music samples, providing a strong framework for improving the effectiveness of music treatment and access to it.

Sonido employs a gym based on a multi -level converter associated with a hierarchical cod Through precision pre -processing, its intermediate representations are used as features of the mission models across various music tasks, which are enhanced by data enlargement techniques.

The design of the encoding is derived from the inspiration from the JukeBox, but it distinguishes itself by combining the hierarchical structure. Using a framework called Vae (HQ-Vae) quantitySondo imposes a fine adaptation mechanism to another within its representations. Then the multi-level automatic model based on transformers is used for the modeling of HQ-VAE. To extract the features, the input sound is coded in the symbols, processed by the adapter, and intermediate outputs are used from specific layers.

By taking advantage of the hierarchical intermediate features, Sonido effectively controls the details, allowing the superior performance in a wide range of estuary tasks. These tasks include understanding, such as signs of music and copying, and obstetric tasks, such as the source of the source and mixing.

Experimental assessments show that the extracted Sonido features greatly enhance the training of models, and achieve modern performance through multiple tasks. These results emphasize the capabilities of the Music Foundation models such as Sonido to serve as strong reinforcements for estuary applications.

Besides improving the current mission models, Sonido also addresses the challenges in scenarios with limited data, providing a transformational solution to music processing. This innovation paves the way for the most efficient and accessible tools in the field of music production.

Paper Music Foundation Model as a general supporter of music tasks It is on Arxiv.


author: HECate is editor: Chang series


2024-12-05 20:17:00

Related Articles

Back to top button