Low Resource Reconstruction Attacks Through Benign Prompts

Journal: arXiv

Published Date: Jul 10, 2025

Abstract

The recent advances in generative models such as diffusion models have raised several risks and concerns related to privacy, copyright infringements and data stewardship. To better understand and control the risks, various researchers have created techniques, experiments and attacks that reconstruct images, or part of images, from the training set. While these techniques already establish that data from the training set can be reconstructed, they often rely on high-resources, excess to the training set as well as well-engineered and designed prompts. In this work, we devise a new attack that requires low resources, assumes little to no access to the actual training set, and identifies, seemingly, benign prompts that lead to potentially-risky image reconstruction. This highlights the risk that images might even be reconstructed by an uninformed user and unintentionally. For example, we identified that, with regard to one existing model, the prompt ``blue Unisex T-Shirt'' can generate the face of a real-life human model. Our method builds on an intuition from previous works which leverages domain knowledge and identifies a fundamental vulnerability that stems from the use of scraped data from e-commerce platforms, where templated layouts and images are tied to pattern-like prompts.

Authors

Sol Yarkoni
Roi Livni

External Resources

View on arXiv arXiv (http://arxiv.org/abs/2507.07947v1)

Low Resource Reconstruction Attacks Through Benign Prompts

Abstract

Authors

Categories

External Resources

Popular Topics

Recent Journals

Low Resource Reconstruction Attacks Through Benign Prompts

Abstract

Authors

Categories

External Resources

Don't Miss the Future of Medicine

Popular Topics

Recent Journals