Fast text caption localization on video using visual rhythm

Seong Soo Chun, Hyeokman Kim, Jung Rim Kim, Sangwook Oh, Sanghoon Sull

    Research output: Chapter in Book/Report/Conference proceeding › Conference contribution

    11 Citations (Scopus)

    Abstract

    In this paper, a fast DCT-based algorithm is proposed to efficiently locate text captions embedded in specific areas of a video sequence through visual rhythm, which can be constructed quickly by sampling certain portions of a DC image sequence and accumulating the samples over time. The proposed approach is based on the observation that text captions carrying important information suitable for indexing often appear in specific areas of video frames, from which sampling strategies for a visual rhythm are derived. The method then uses a combination of contrast and temporal coherence information on the visual rhythm to detect text frames such that each detected text frame represents consecutive frames containing identical text strings, thus significantly reducing the number of text frames that need to be examined for text localization in a video sequence. It then uses several important properties of text captions to locate the text caption in the detected frames.
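
The idea in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes the DC images are already available as a NumPy array of shape (frames, height, width), samples a single horizontal pixel row per frame (one possible sampling strategy), and groups frames only by temporal coherence, leaving out the contrast test. The function names `visual_rhythm` and `coherent_runs` and the parameters `row`, `tol`, and `min_len` are hypothetical.

```python
import numpy as np

def visual_rhythm(dc_frames, row):
    """Stack one sampled pixel row per frame into a (time, width) image."""
    # dc_frames: array-like of shape (T, H, W) holding grayscale DC images (assumed input)
    return np.asarray(dc_frames, dtype=float)[:, row, :]

def coherent_runs(rhythm, tol=2.0, min_len=3):
    """Return (start, end) frame ranges, end exclusive, where the sampled row barely changes."""
    # Mean absolute difference between the sampled rows of consecutive frames.
    diffs = np.abs(np.diff(rhythm, axis=0)).mean(axis=1)
    runs, start = [], 0
    for t, d in enumerate(diffs, start=1):
        if d > tol:                      # content changed between frame t-1 and frame t
            if t - start >= min_len:     # keep only runs that are long enough
                runs.append((start, t))
            start = t
    if len(rhythm) - start >= min_len:   # close the final run
        runs.append((start, len(rhythm)))
    return runs

if __name__ == "__main__":
    # Synthetic example: 4 frames with one pattern, then 6 frames with another.
    frames = np.zeros((10, 8, 16))
    frames[:4, 6, ::2] = 200.0
    frames[4:, 6, 1::2] = 200.0
    print(coherent_runs(visual_rhythm(frames, row=6)))
```

Under these assumptions, each returned run could be represented by a single frame, which mirrors the abstract's point that one detected text frame stands in for consecutive frames showing the same caption.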

    Original language: English
    Title of host publication: Recent Advances in Visual Information Systems - 5th International Conference, VISUAL 2002, Proceedings
    Editors: Shi-Kuo Chang, Zen Chen, Suh-Yin Lee
    Publisher: Springer Verlag
    Pages: 259-268
    Number of pages: 10
    ISBN (Print): 3540433589
    Publication status: Published - 2002
    Event: 5th International Conference on Visual Information Systems, VISUAL 2002 - Hsin Chu, Taiwan, Province of China
    Duration: 2002 Mar 11 to 2002 Mar 13

    Publication series

    Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume: 2314
    ISSN (Print): 0302-9743
    ISSN (Electronic): 1611-3349

    Other

    Other: 5th International Conference on Visual Information Systems, VISUAL 2002
    Country/Territory: Taiwan, Province of China
    City: Hsin Chu
    Period: 2002 Mar 11 to 2002 Mar 13

    Bibliographical note

    Publisher Copyright:
    © Springer-Verlag Berlin Heidelberg 2002.

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science
