Where and What: Contextual Dynamics-Aware Anomaly Detection in Surveillance Videos

  • Deok Hyun Ahn
  • Yong Jin Jo
  • Dong Bum Kim
  • Gi Pyo Nam
  • Jae Ho Han
  • Haksub Kim*

*Corresponding author for this work

Research output: Contribution to journal, Article, peer-review

Abstract

In surveillance environments, detecting anomalies requires understanding the contextual dynamics of the environment, human behaviors, and movements within a scene. Effective anomaly detection must address both the where and the what of events, but existing approaches, such as unimodal action-based methods or LLM-integrated multimodal frameworks, have limitations. These methods either rely on implicit scene information, making it difficult to localize where anomalies occur, or fail to adapt to surveillance-specific challenges such as view changes, subtle actions, low-light conditions, and crowded scenes, which hinders accurate detection of what occurs. To overcome these limitations, our system uses features from a lightweight scene classification model to discern where an event occurs, acquiring explicit location-based context. To identify what events occur, it focuses on atomic actions, which remain underexplored in this field and are better suited to interpreting intricate abnormal behaviors than conventional abstract action features. To achieve robust anomaly detection, the proposed Temporal-Semantic Relationship Network (TSRN) models spatio-temporal relationships among multimodal features and employs a Segment-selective Focal Margin loss (SFML) to address class imbalance, outperforming conventional MIL-based methods. Experimental results on public datasets demonstrate that the proposed system reduces false alarms while maintaining robustness and practicality for real-world surveillance applications.

Original language: English
Pages (from-to): 6993-7007
Number of pages: 15
Journal: IEEE Transactions on Image Processing
Volume: 34
Publication status: Published - 2025

Bibliographical note

Publisher Copyright:
© 1992-2012 IEEE.

Keywords

  • Surveillance videos
  • contextual dynamics
  • weakly-supervised video anomaly detection

ASJC Scopus subject areas

  • Software
  • Computer Graphics and Computer-Aided Design
