A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection

  • Jangwon Seo
  • , Hyo Seok Hwang
  • , Jiyoung Lee
  • , Minhyeok Lee*
  • , Wonsuk Kim*
  • , Junhee Seok*
  • *Corresponding author for this work

Research output: Chapter in Book/Report/Conference proceedingConference contribution

Abstract

The advancement of multimedia and artificial intelligence (AI) technologies has dismantled the barriers of information sharing, yet it has also ushered in a double-edged sword: a surge in the spread of fake information. In this context, there is a growing need for research on the detection of ‘cheapfakes,’ which are low-cost fake media, known for their ease of creation. This paper proposes a multi-stage deep learning process designed to effectively detect the diverse and rapidly evolving nature of cheapfakes. A single-step deep learning model faces limitations in distinguishing various types of cheapfakes, necessitating the application of a complex deep learning model approach to detect subtle Out-of-Context (OOC) phenomena. This study employs models based on Bidirectional Encoder Representations from Transformers (BERT) and stable diffusion technologies to approach cheapfake detection. Through the ACM ICMR 2024 challenge, the performance of this model was evaluated on a real dataset, achieving an accuracy of 71.9% in Task 1, an improvement of 7% over previous methods, and an accuracy of 55.7% in Task 2. These results are expected to make a significant contribution to the development of strategies for creating and countering cheapfakes. Additionally, this research aims to contribute to the detection of OOC media misuse through this challenge.

Original languageEnglish
Title of host publicationICMR 2024-Proceedings of the 14th Annual ACM International Conference on Multimedia Retrieval
PublisherAssociation for Computing Machinery, Inc
Pages1312-1316
Number of pages5
ISBN (Electronic)9798400706028
DOIs
Publication statusPublished - 2024 May 30
Event2024 International Conference on Multimedia Retrieval, ICMR 2024 - Phuket, Thailand
Duration: 2024 Jun 102024 Jun 14

Publication series

NameICMR 2024 - Proceedings of the 2024 International Conference on Multimedia Retrieval

Conference

Conference2024 International Conference on Multimedia Retrieval, ICMR 2024
Country/TerritoryThailand
CityPhuket
Period24/6/1024/6/14

Bibliographical note

Publisher Copyright:
© 2024 Copyright is held by the owner/author(s).

Keywords

  • BERT
  • Cheapfakes
  • Ground Image Captioning
  • Misinformation
  • Out-of-context
  • Semantic Textual Similarity
  • Stable Diffusion

ASJC Scopus subject areas

  • Computer Graphics and Computer-Aided Design
  • Human-Computer Interaction
  • Software

Fingerprint

Dive into the research topics of 'A Multi-Stage Deep Learning Approach Incorporating Text-Image and Image-Image Comparisons for Cheapfake Detection'. Together they form a unique fingerprint.

Cite this