Human action recognition using shape and CLG-motion flow from multi-view image sequences

Mohiuddin Ahmad, Seong Whan Lee

Research output: Contribution to journal › Article › peer-review

122 Citations (Scopus)


In this paper, we present a method for human action recognition from multi-view image sequences that combines motion flow and shape flow information while accounting for their variability. A combined local-global (CLG) optic flow is used to extract the motion flow feature, and invariant moments with flow deviations are used to extract the global shape flow feature from the image sequences. In our approach, a human action is represented as a set of multidimensional CLG optic flow and shape flow feature vectors within the spatial-temporal action boundary. Actions are modeled with a set of multidimensional hidden Markov models (HMMs) for multiple views using the combined features, which supports robust view-invariant operation. Using a maximum likelihood estimation approach, we successfully recognize different daily-life human actions in both indoor and outdoor environments. The results suggest that the proposed method is robust with respect to multi-view action recognition, scale and phase variations, and invariant analysis of silhouettes.
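The recognition step described in the abstract, scoring an observed sequence against one HMM per action and choosing the most likely model, can be illustrated with a minimal sketch. This is not the authors' implementation: it assumes a plain discrete-observation HMM instead of the multidimensional HMMs over CLG flow and shape features used in the paper, and the action names, parameters, and function names (`log_likelihood`, `classify`, `MODELS`) are hypothetical toy values chosen only for illustration.

```python
import math

NEG_INF = float("-inf")


def _log(p):
    """Natural log that maps zero probability to -inf instead of raising."""
    return math.log(p) if p > 0 else NEG_INF


def _log_sum_exp(values):
    """Numerically stable log(sum(exp(v))) over a list of log-values."""
    m = max(values)
    if m == NEG_INF:
        return NEG_INF
    return m + math.log(sum(math.exp(v - m) for v in values))


def log_likelihood(obs, start, trans, emit):
    """Forward algorithm in log-space: log P(obs | model) for a discrete HMM.

    obs   -- list of observation symbol indices
    start -- start[i] is the probability of beginning in state i
    trans -- trans[i][j] is the probability of moving from state i to j
    emit  -- emit[i][k] is the probability of state i emitting symbol k
    """
    n = len(start)
    alpha = [_log(start[i]) + _log(emit[i][obs[0]]) for i in range(n)]
    for symbol in obs[1:]:
        alpha = [
            _log_sum_exp([alpha[i] + _log(trans[i][j]) for i in range(n)])
            + _log(emit[j][symbol])
            for j in range(n)
        ]
    return _log_sum_exp(alpha)


def classify(obs, models):
    """Return the name of the model with the highest likelihood for obs."""
    return max(models, key=lambda name: log_likelihood(obs, *models[name]))


# Hypothetical toy models (assumption): two actions, two hidden states,
# two quantized feature symbols. Real models would be trained from data.
MODELS = {
    "walk": ([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]], [[0.9, 0.1], [0.8, 0.2]]),
    "wave": ([0.6, 0.4], [[0.7, 0.3], [0.4, 0.6]], [[0.1, 0.9], [0.2, 0.8]]),
}

if __name__ == "__main__":
    print(classify([0, 0, 1, 0], MODELS))
```

In a multi-view setting, one possible extension under the same assumptions is to keep one model per (action, view) pair and take the maximum over all of them, so that the winning action does not depend on knowing the camera view in advance.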

Original language: English
Pages (from-to): 2237-2252
Number of pages: 16
Journal: Pattern Recognition
Issue number: 7
Publication status: Published - 2008 Jul

Bibliographical note

Funding Information:
This research was supported by the Intelligent Robotics Development Program, one of the 21st Century Frontier R&D Programs funded by the Ministry of Commerce, Industry and Energy of Korea.


Keywords

  • Action matrix
  • Action recognition
  • Combined local-global (CLG) optic flow
  • Invariant Zernike moments
  • Multi-view image sequence
  • Multidimensional hidden Markov model (MDHMM)

ASJC Scopus subject areas

  • Software
  • Signal Processing
  • Computer Vision and Pattern Recognition
  • Artificial Intelligence

