A comprehensive research trajectory exploring the security landscape of music generation models—from understanding vulnerabilities through adversarial attacks, to developing defense mechanisms, to analyzing privacy implications through membership inference.
Securing Music Generation Models: From Understanding Vulnerabilities to Defense and Privacy
My PhD research establishes a comprehensive framework for understanding and addressing security challenges in music generation models. The work progresses through three interconnected phases, each building upon insights from the previous stage to form a complete security lifecycle for music AI systems.
This research represents the first systematic investigation of adversarial robustness and privacy in the music domain, contributing novel attack methodologies, defense mechanisms, and inference techniques that bridge computer security and music information retrieval.
Adversarial attacks targeting specific segments of music generation models through selective inpainting, revealing critical vulnerabilities in regional model behaviors.
Perceptual evaluation framework for assessing similarity in adversarial music generation, bridging the gap between automated attack detection and human auditory perception.
Membership inference attack on audio diffusion models using latent space perturbation stability, revealing privacy vulnerabilities in waveform-based music generation.
Structural pattern analysis for membership inference in symbolic music generation models, exploiting time-series characteristics unique to ABC notation.
MAIA's discovery of regional vulnerabilities directly motivated the development of perceptual evaluation models. The attack insights informed defense strategy design.
Understanding attack-defense dynamics revealed the need to analyze training data privacy. Defense mechanisms inspired membership inference methodologies.
Privacy investigation extended across both waveform and symbolic domains, demonstrating that security principles apply universally across music representations.
This research establishes the first comprehensive security framework for music generation models, covering:
Together, these contributions form a cohesive PhD thesis that advances both the security and music AI communities, providing theoretical foundations and practical tools for securing next-generation music generation systems.