HiDF: A Human-Indistinguishable Deepfake Dataset
Citations

WEB OF SCIENCE

1
Citations

SCOPUS

1

초록

The rapid development and prevalence of generative AI have made it easy for people to create high-quality deepfake images and videos, but their abuses have also increased exponentially. To mitigate potential social disruption, it is crucial to quickly detect the authenticity of each deepfake content hidden in a sea of information. While researchers have worked on developing deep learning-based methods, the deepfake datasets utilized in these studies are far from the real world in terms of their qualities; most popular deepfake datasets are human-distinguishable. To address this problem, we present a novel deepfake dataset, HiDF, a high-quality and human-indistinguishable deepfake dataset consisting of 62K images and 8K videos. HiDF is a meticulously curated dataset that includes diverse subjects that have undergone rigorous quality checks. A comparison of the quality between HiDF and existing deepfake datasets demonstrates that HiDF is human-indistinguishable. Hence, it can be a valuable benchmark dataset for deepfake detection tasks. Data and code (https://github.com/DSAIL-SKKU/HiDF) are publicly available for future deepfake detection research.

키워드

aideep-learningdeepfakehuman-indistinguishablemultimodal
제목
HiDF: A Human-Indistinguishable Deepfake Dataset
저자
Kang, ChaewonJeong, SeoyoonLee, JonghyunChoi, DaejinWoo, Simon S.Han, Jinyoung
DOI
10.1145/3711896.3737399
발행일
2025
유형
Proceedings Paper
저널명
Proceedings of the ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
2
페이지
5527 ~ 5538