TY - JOUR
T1 - Sign language spotting with a threshold model based on conditional random fields
AU - Yang, Hee Deok
AU - Sclaroff, Stan
AU - Lee, Seong Whan
N1 - Funding Information:
This research was supported by World Class University Project funded by the Ministry of Education, Science and Technology, Republic of Korea (R31-2008-000-10008-0). This work was also supported by the IT R&D program of MKE/IITA (2008-F-038-01, Development of Context Adaptive Cognition Technology). Stan Sclaroff was supported in part by the US National Science Foundation under Grant 0329009 and Grant 0705749. The authors would like to thank the National Center for Sign Language and Gesture Resources, Boston University for providing the SignStream database.
PY - 2009
Y1 - 2009
N2 - Sign language spotting is the task of detecting and recognizing signs in a signed utterance, in a set vocabulary. The difficulty of sign language spotting is that instances of signs vary in both motion and appearance. Moreover, signs appear within a continuous gesture stream, interspersed with transitional movements between signs in a vocabulary and nonsign patterns (which include out-of-vocabulary signs, epentheses, and other movements that do not correspond to signs). In this paper, a novel method for designing threshold models in a conditional random field (CRF) model is proposed which performs an adaptive threshold for distinguishing between signs in a vocabulary and nonsign patterns. A short-sign detector, a hand appearance-based sign verification method, and a subsign reasoning method are included to further improve sign language spotting accuracy. Experiments demonstrate that our system can spot signs from continuous data with an 87.0 percent spotting rate and can recognize signs from isolated data with a 93.5 percent recognition rate versus 73.5 percent and 85.4 percent, respectively, for CRFs without a threshold model, short-sign detection, subsign reasoning, and hand appearance-based sign verification. Our system can also achieve a 15.0 percent sign error rate (SER) from continuous data and a 6.4 percent SER from isolated data versus 76.2 percent and 14.5 percent, respectively, for conventional CRFs.
AB - Sign language spotting is the task of detecting and recognizing signs in a signed utterance, in a set vocabulary. The difficulty of sign language spotting is that instances of signs vary in both motion and appearance. Moreover, signs appear within a continuous gesture stream, interspersed with transitional movements between signs in a vocabulary and nonsign patterns (which include out-of-vocabulary signs, epentheses, and other movements that do not correspond to signs). In this paper, a novel method for designing threshold models in a conditional random field (CRF) model is proposed which performs an adaptive threshold for distinguishing between signs in a vocabulary and nonsign patterns. A short-sign detector, a hand appearance-based sign verification method, and a subsign reasoning method are included to further improve sign language spotting accuracy. Experiments demonstrate that our system can spot signs from continuous data with an 87.0 percent spotting rate and can recognize signs from isolated data with a 93.5 percent recognition rate versus 73.5 percent and 85.4 percent, respectively, for CRFs without a threshold model, short-sign detection, subsign reasoning, and hand appearance-based sign verification. Our system can also achieve a 15.0 percent sign error rate (SER) from continuous data and a 6.4 percent SER from isolated data versus 76.2 percent and 14.5 percent, respectively, for conventional CRFs.
KW - Conditional random field
KW - Sign language recognition
KW - Sign language spotting
KW - Threshold model
UR - http://www.scopus.com/inward/record.url?scp=67349252145&partnerID=8YFLogxK
U2 - 10.1109/TPAMI.2008.172
DO - 10.1109/TPAMI.2008.172
M3 - Article
C2 - 19443924
AN - SCOPUS:67349252145
SN - 0162-8828
VL - 31
SP - 1264
EP - 1277
JO - IEEE Transactions on Pattern Analysis and Machine Intelligence
JF - IEEE Transactions on Pattern Analysis and Machine Intelligence
IS - 7
ER -