Why existing MIA methods fail for audio diffusion models
Audio waveforms are high-dimensional (16kHz × 10s = 160k samples). Traditional loss-based MIA methods struggle with such complex data.
Diffusion models have complex denoising trajectories. Simple likelihood comparisons fail to capture membership signals across timesteps.
Members exhibit higher stability in latent space under adversarial perturbations. This is our key insight for improved detection.
Understanding how we probe membership through latent perturbations
Add Gaussian noise to clean audio x₀ to get noisy latent xₜ at timestep t
Denoise xₜ back to clean reconstruction x̂₀ using the reverse operator Rₜ
Inject time-normalized perturbation δₜ = σₜδ̃ at xₜ, then denoise to get degraded x̂₀^δ
Interactive visualization of how membership inference works through forward diffusion and reverse denoising. Compare reconstruction quality to detect if a sample was in the training set.
Interpretation: When Δ (reconstruction error) is below the threshold τ, the sample is more likely a member (model learned it well). When Δ exceeds τ, it's more likely a non-member (model struggles to reconstruct unseen data).
Measuring latent stability through adaptive perturbation
We measure adversarial cost \(C_{\text{adv}}\): the minimum perturbation budget needed to degrade reconstruction quality below a threshold. Members require higher budgets (more stable) than non-members.
Members reside in more stable regions of the generative manifold, requiring larger perturbation budgets to reach the same degradation threshold
We use perceptual audio quality metrics instead of simple MSE to measure degradation because they better align with human perception of audio quality changes.
Cognitive Model of Perceptual Audio Quality - captures psychoacoustic phenomena like masking and frequency sensitivity
Multi-Resolution STFT distance - measures spectral differences across multiple time-frequency resolutions
Mean squared error on log-mel spectrograms - captures timbral and spectral characteristics
Direct waveform comparison - serves as a simple baseline but less perceptually aligned
Different diffusion timesteps \(t\) reveal different membership signals. Mid-trajectory (t ≈ 0.6T) shows best separability between members and non-members.
Explore how adversarial cost distributions change across diffusion timesteps
Comprehensive evaluation across models and datasets
TPR@1%FPR vs. maximum budget \(\eta_{\text{max}}\)
Performance across different degradation metrics
Paper, code, and citation
@inproceedings{liu2026lsaprobe,
title={LSA-Probe: Membership Inference via Latent Stability Analysis for Music Diffusion Models},
author={Liu, Yuxuan and Zhang, Peizhuo and Sang, Ruiqi and Li, Zhiyong and Tan, Yan and Cai, Yi and Li, Sheng},
booktitle={ICASSP 2026-2026 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
year={2026},
organization={IEEE}
}