Abstract
We introduce a multi-stage framework that uses local geometry changes on a hand surface and focuses on learning interaction between a primary and assistive hand/object for hand action recognition in videos from a egocentric view RGB camera. Our method does not require 3D information of objects such as the 6D object pose which is difficult to annotate or the depth of the image requires additional a depth sensor for learning an objects’ behavior while it interacts with hands. Instead, the proposed method learns the changes within the surface of the hand, the hand type which is positively correlated with the hand action and the location of objects and hands in the 2D image space. The framework synthesizes the mean curvature of the primary hand mesh model to encode the hand surface geometry. Also, we introduce a feature pooling layer to handle diverse scenarios: having one hand, two hands, one hand with one object, and two hands with two objects. Our method outperforms the state-of-the-art hand action recognition methods that use 6D object poses of objects or a depth sensor.
| Original language | English |
|---|---|
| Title of host publication | Pattern Recognition and Artificial Intelligence - 4th International Conference, ICPRAI 2024, Proceedings |
| Editors | Christian Wallraven, Cheng-Lin Liu, Arun Ross |
| Publisher | Springer Science and Business Media Deutschland GmbH |
| Pages | 201-215 |
| Number of pages | 15 |
| ISBN (Print) | 9789819787012 |
| DOIs | |
| Publication status | Published - 2025 |
| Event | 4th International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2024 - Jeju Island, Korea, Republic of Duration: 2024 Jul 3 → 2024 Jul 6 |
Publication series
| Name | Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics) |
|---|---|
| Volume | 14892 LNCS |
| ISSN (Print) | 0302-9743 |
| ISSN (Electronic) | 1611-3349 |
Conference
| Conference | 4th International Conference on Pattern Recognition and Artificial Intelligence, ICPRAI 2024 |
|---|---|
| Country/Territory | Korea, Republic of |
| City | Jeju Island |
| Period | 24/7/3 → 24/7/6 |
Bibliographical note
Publisher Copyright:© The Editor(s) (if applicable) and The Author(s), under exclusive license to Springer Nature Singapore Pte Ltd. 2025.
Keywords
- Machine perception
- deep learning
- hand action recognition
- surface modality
ASJC Scopus subject areas
- Theoretical Computer Science
- General Computer Science