Can Foundation Models Generalise the Presentation Attack Detection Capabilities on ID Cards?
Journal:
arXiv
Published Date:
Jun 5, 2025
Abstract
Nowadays, one of the main challenges in presentation attack detection (PAD)
on ID cards is obtaining generalisation capabilities for a diversity of
countries that are issuing ID cards. Most PAD systems are trained on one, two,
or three ID documents because of privacy protection concerns. As a result, they
do not obtain competitive results for commercial purposes when tested in an
unknown new ID card country. In this scenario, Foundation Models (FM) trained
on huge datasets can help to improve generalisation capabilities. This work
intends to improve and benchmark the capabilities of FM and how to use them to
adapt the generalisation on PAD of ID Documents. Different test protocols were
used, considering zero-shot and fine-tuning and two different ID card datasets.
One private dataset based on Chilean IDs and one open-set based on three ID
countries: Finland, Spain, and Slovakia. Our findings indicate that bona fide
images are the key to generalisation.