CelebCaption: A Benchmark Dataset for Identity-Sensitive Unlearning in Image Captioning
Citations

SCOPUS

0

초록

Machine unlearning seeks to remove the influence of selected training examples without retraining the model from scratch. Recent work has extended this goal to vision-language models, yet existing datasets are not suited for judging whether a sample's influence has truly been erased from learned image-text pairs. Current algorithms often introduce false information into sentences generated after unlearning, which compromises utility. We first establish three criteria that an image caption unlearning method should meet: Specificity Reduction, Identity Removal, and Performance Preservation. Guided by these criteria, we present CelebCaption, an image-text dataset of 15,000 photographs covering 150 well-known individuals, each linked to four captions that vary in detail (detailed versus summary) and in the presence of the subject's name. This design enables controlled, quantitative assessment of the proposed unlearning objectives. We benchmark several representative unlearning algorithms on CelebCaption using both caption quality scores and membership inference attack accuracy as quantitative unlearning metrics, and observe that current methods fail to achieve their privacy objectives. Our unlearning criteria and dataset provide a focused, reproducible testbed for advancing privacy-aware image captioning. The CelebCaption dataset is publicly available at https://github.com/DASH-Lab/CelebCaption.

키워드

benchmarkdatasetimage captioningmachine unlearningprivacy-preserving aivision-language models
제목
CelebCaption: A Benchmark Dataset for Identity-Sensitive Unlearning in Image Captioning
저자
Moon, HakjunWoo, Simon S.
DOI
10.1145/3773966.3779359
발행일
2026
유형
Conference Paper
저널명
WSDM 2026 - Proceedings of the 19th ACM International Conference on Web Search and Data Mining
페이지
1205 ~ 1210