Abstract
In this paper, a multi-modal classification is proposed for recognizing vandalism against Automatic Teller Machines (ATMs). The visual and textual information base model is developed here to identify external threats on ATMs. The model discriminates threatening behaviors from those that are benign in the image. It provides a level of confidence in the threat recognition by visual object classification coupled with word vector distance measure. To achieve our goal, real-time object detection based on a Region Convolutional Neural Network (R-CNN) first detects objects in the scene and word embedding technique allows to measure distance between the detected object label with predefined tools assumed to be used for vandalizing ATMs. Similarity measure from word embedding not only determines whether the scene may lead to any nefarious activities, but also would provide the level of confidence in occurrence of such incidents. From the experimental evaluation, it is shown that the method is effective and delivers a quantitative measure on decisions it makes.
Original language | English |
---|---|
Title of host publication | Proceedings of AVSS 2018 - 2018 15th IEEE International Conference on Advanced Video and Signal-Based Surveillance |
Publisher | Institute of Electrical and Electronics Engineers Inc. |
ISBN (Electronic) | 9781538692943 |
DOIs | |
Publication status | Published - 2018 Jul 2 |
Event | 15th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2018 - Auckland, New Zealand Duration: 2018 Nov 27 → 2018 Nov 30 |
Publication series
Name | Proceedings of AVSS 2018 - 2018 15th IEEE International Conference on Advanced Video and Signal-Based Surveillance |
---|
Conference
Conference | 15th IEEE International Conference on Advanced Video and Signal-Based Surveillance, AVSS 2018 |
---|---|
Country/Territory | New Zealand |
City | Auckland |
Period | 18/11/27 → 18/11/30 |
Bibliographical note
Funding Information:Authors of Korea University were supported by the National Research Foundation (NRF) grant funded by the MSIP of Korea (No. 2017R1A2B4012720). David Han’s contribution was supported by the US Army Research Laboratory.
Publisher Copyright:
© 2018 IEEE.
ASJC Scopus subject areas
- Signal Processing
- Computer Vision and Pattern Recognition
- Hardware and Architecture
- Media Technology