Major alignment safety

VOID: Defeating Unauthorized Mimicry in Latent Diffusion Models

Chunlin Qiu, Ang Li, Tianxiao Huang, Ruilin Gan, Yunjie Ge, Shenyi Zhang

Published
Jun 10, 2026 — 16:08 UTC

Problem
The paper addresses the vulnerability of Latent Diffusion Models (LDMs) to unauthorized mimicry, a growing concern in visual synthesis applications. Existing defenses rely on injecting deceptive perturbations to mislead the model, but these methods are based on the flawed assumption that such perturbations remain effective throughout the LDM’s generation process. The authors highlight that the model’s inherent restoration capabilities often negate these perturbations, allowing individual identities to re-emerge in generated images. This work is a preprint and has not undergone peer review.

Method
The authors propose VOID, a defense framework that manipulates the stochastic nature of LDMs to prevent unauthorized mimicry. VOID employs two innovative strategies: (1) it amplifies latent encoding errors to disrupt the semantic structure of images, effectively corrupting the content, and (2) it counteracts target guidance signals to diminish the model’s ability to restore original identities. This dual approach ensures that the perturbations remain confined to human-imperceptible regions, preserving the visual utility of the images while enhancing security.

Results
The effectiveness of VOID is demonstrated through a comprehensive evaluation against 24 state-of-the-art defenses across 10 mimicry attack scenarios on 5 datasets. The results show a significant increase in the average Frechet Inception Distance (FID) from 113 to 365, representing a 223% improvement over the best existing defense. This substantial enhancement in FID indicates a marked increase in the semantic corruption of generated images, thereby thwarting unauthorized mimicry attempts.

Limitations
The authors acknowledge that while VOID significantly improves defense against mimicry, it may introduce challenges in specific contexts where visual fidelity is paramount. Additionally, the reliance on stochastic manipulation may not generalize across all LDM architectures or datasets. The paper does not address potential computational overhead introduced by the perturbation strategies, which could impact real-time applications.

Why it matters
The implications of this work are significant for the field of generative models, particularly in applications where identity protection is critical, such as in media and entertainment. By providing a robust defense mechanism against unauthorized mimicry, VOID enhances the security of LDMs, paving the way for safer deployment in sensitive contexts. This research contributes to the ongoing discourse on the ethical use of AI in visual synthesis, as discussed in related works on adversarial robustness and model security, as published in arXiv cs.CV.

Turing Wire

By Turing Wire editorial staff · Jun 10, 2026 · Editorial standards →

Source: arXiv cs.CV