An Action-tuned Neural Network Architecture for Hand Pose Estimation / Tessitore, Giovanni; Donnarumma, Francesco; Prevete, Roberto. - PRINT. - (2010), pp. 358-363. (Paper presented at the International Joint Conference on Computational Intelligence (IJCCI 2010), held in Valencia, Spain, October 24-26, 2010).
An Action-tuned Neural Network Architecture for Hand Pose Estimation
Tessitore, Giovanni; Donnarumma, Francesco; Prevete, Roberto
2010
Abstract
There is growing interest in developing computational models of grasping action recognition, motivated by a wide range of applications in robotics, neuroscience, HCI, motion capture, and other research areas. In many of these settings, a vision-based approach to grasping action recognition appears particularly promising; in HCI and robotic applications, for example, it often allows for simpler and more natural interaction. However, vision-based grasping action recognition is a challenging problem: the large number of hand self-occlusions makes the mapping from hand visual appearance to hand pose an ill-posed inverse problem. The approach proposed here builds on the work of Santello and co-workers, which demonstrates a reduction in hand variability within a given class of grasping actions. The proposed neural network architecture introduces specialized modules for each class of grasping actions and viewpoints, allowing for more robust hand pose estimation. A quantitative analysis of the proposed architecture, obtained on a synthetic data set, is presented and discussed as a basis for further work.
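The architecture described in the abstract — specialized modules for each class of grasping actions and viewpoints, with pose estimation routed to the matching module — can be sketched as follows. This is a minimal illustrative sketch only: the module form (linear regressors), the feature and pose dimensions, and the example class and viewpoint names are all assumptions, not the paper's actual implementation.

```python
# Sketch of the action-tuned idea: one specialized pose-regression module
# per (grasp class, viewpoint) pair, selected at estimation time.
import numpy as np

rng = np.random.default_rng(0)

GRASP_CLASSES = ["precision", "power"]  # hypothetical grasp classes
VIEWPOINTS = ["frontal", "lateral"]     # hypothetical viewpoints
FEATURE_DIM = 16                        # assumed visual feature vector size
POSE_DIM = 20                           # assumed hand joint-angle DOFs

# One linear regressor per (class, viewpoint): pose = W @ features + b.
# Weights are random placeholders standing in for trained parameters.
modules = {
    (c, v): (rng.standard_normal((POSE_DIM, FEATURE_DIM)) * 0.1,
             rng.standard_normal(POSE_DIM) * 0.1)
    for c in GRASP_CLASSES for v in VIEWPOINTS
}

def estimate_pose(features, grasp_class, viewpoint):
    """Route visual features to the specialized module and regress the pose."""
    W, b = modules[(grasp_class, viewpoint)]
    return W @ features + b

features = rng.standard_normal(FEATURE_DIM)
pose = estimate_pose(features, "precision", "frontal")
print(pose.shape)  # (20,)
```

Restricting each module to a single grasp class exploits the reduced hand variability within that class noted in the abstract, which is what makes the per-class mapping better conditioned than a single global regressor.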
File: actionTuned.pdf (not available)
Description: Main article
Type: Pre-print document
License: Private/restricted access
Size: 1.35 MB
Format: Adobe PDF
Documents in IRIS are protected by copyright and all rights are reserved, unless otherwise indicated.