AI

Phonetic Memorization Attacks in Music and Video Generation

Watch the PDF file entitled Bob’s Confetti: Happy Heats Inteing music and Video, by Jachul Roh and 5 other authors

PDF HTML (experimental) view

a summary:The celebration in the obstetric models extends beyond the proliferation of the text literally-it is manifested through non-local patterns, semantic bonds, and from amazing, through methods in air-conditioned generation tasks such as the lyrics of songs to the song (L2S) and the text of the text to Fidoyo (T2V). We reveal a new category of memorization via media, as models were trained in these tasks, leakage of copyright, through the indirect, invisible vocal tracks of the traditional midterm analysis. In this work, we offer a hostile vocal claim (APT), an attack that replaces iconic phrases with homogeneous alternatives-EG, “mom pasta” becomes “Bob Sweets”-maintaining the audio shape with a large change of semantic content. We explain that the models can be pushed to renew the preserved songs using similar words, but not related. Despite the semantic erosion, black box models such as Suno and Models Open Source, such as Yue, generate outputs that are remarkably similar to original songs-largely, rhythmic and high-ranking high degrees on Audiogudge, CLAP and Coverid. These effects continue through species and languages. The most surprising thing, we find that audio claims alone can lead to visual reservation in models from the text to the video: When words changed from Lose Yourself are given, VEO 3 creates scenes that reflect the original music video-through a convincing rapper and low urban settings-there are no explicit visual signals in the claim. This intersecting leak is an unprecedented threat: models keep deep structural patterns that go beyond their training method, which makes traditional safety measures such as copyright filters ineffective. The results we have reached a basic vulnerability in the air -conditioned gym models and raises urgent concerns about copyright, enacted, and spreading a safe publishing of multi -media generation systems.

The application date

From: Jaecul Roh [view email]
[v1]

Wed, July 23, 2025 21:11:47 UTC (5,220 KB)
[v2]

Wed, Aug 6 2025 16:06:47 UTC (13,994 KB)

Don’t miss more hot News like this! Click here to discover the latest in AI news!

2025-08-07 04:00:00

Related Articles

Back to top button