Robust visual voice activity detection using local variance histogram in vehicular environments

  • Kyungsun Lee
  • , Taeyup Song
  • , Sungsoo Kim
  • , David K. Han
  • , Hanseok Ko

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    Abstract

    In this paper, a Vision based Voice Activity Detection (VVAD) algorithm is proposed using Local Variance Histogram (LVH). In conventional VVAD algorithm, the motion measure such as optical flow and intensity histogram are widely used. However, this approach is unstable under varying illumination and global motion changes which frequently occur in moving vehicular environment. To mitigate this problem, an appropriate framework based on LVH feature is developed. Comparison with two other conventional visual voice activity detectors shows the proposed method to be consistently more accurate and yields a substantial improvement in terms of detection probability and false alarm rate.

    Original languageEnglish
    Title of host publication2015 IEEE International Conference on Consumer Electronics, ICCE 2015
    PublisherInstitute of Electrical and Electronics Engineers Inc.
    Pages449-450
    Number of pages2
    ISBN (Electronic)9781479975426
    DOIs
    Publication statusPublished - 2015 Mar 23
    Event2015 IEEE International Conference on Consumer Electronics, ICCE 2015 - Las Vegas, United States
    Duration: 2015 Jan 92015 Jan 12

    Publication series

    Name2015 IEEE International Conference on Consumer Electronics, ICCE 2015

    Other

    Other2015 IEEE International Conference on Consumer Electronics, ICCE 2015
    Country/TerritoryUnited States
    CityLas Vegas
    Period15/1/915/1/12

    ASJC Scopus subject areas

    • Computer Networks and Communications
    • Electrical and Electronic Engineering
    • Industrial and Manufacturing Engineering

    Fingerprint

    Dive into the research topics of 'Robust visual voice activity detection using local variance histogram in vehicular environments'. Together they form a unique fingerprint.

    Cite this