Viseme recognition experiment using context dependent hidden Markov models

    Research output: Chapter in Book/Report/Conference proceedingConference contribution

    5 Citations (Scopus)

    Abstract

    Visual images synchronized with audio signals can provide user-friendly interface for man machine interactions. The visual speech can be represented as a sequence of visemes, which are the generic face images corresponding to particular sounds. We use HMMs (hidden Markov models) to convert audio signals to a sequence of visemes. In this paper, we compare two approaches in using HMMs. In the first approach, an HMM is trained for each triviseme which is a viseme with its left and right context, and the audio signals are directly recognized as a sequence of trivisemes. In the second approach, each triphone is modeled with an HMM, and a general triphone recognizer is used to produce a triphone sequence from the audio signals. The triviseme or triphone sequence is then converted to a viseme sequence. The performances of the two viseme recognition systems are evaluated on the TIMIT speech corpus.

    Original languageEnglish
    Title of host publicationIntelligent Data Engineering and Automated Learning - IDEAL 2002 - 3rd International Conference, Proceedings
    EditorsHujun Yin, Nigel Allinson, Richard Freeman, John Keane, Simon Hubbard
    PublisherSpringer Verlag
    Pages557-561
    Number of pages5
    ISBN (Print)9783540440253
    DOIs
    Publication statusPublished - 2002
    Event3rd International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2002 - Manchester, United Kingdom
    Duration: 2002 Aug 122002 Aug 14

    Publication series

    NameLecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
    Volume2412
    ISSN (Print)0302-9743
    ISSN (Electronic)1611-3349

    Other

    Other3rd International Conference on Intelligent Data Engineering and Automated Learning, IDEAL 2002
    Country/TerritoryUnited Kingdom
    CityManchester
    Period02/8/1202/8/14

    Bibliographical note

    Publisher Copyright:
    © Springer-Verlag Berlin Heidelberg 2002.

    ASJC Scopus subject areas

    • Theoretical Computer Science
    • General Computer Science

    Fingerprint

    Dive into the research topics of 'Viseme recognition experiment using context dependent hidden Markov models'. Together they form a unique fingerprint.

    Cite this