Journal article
A bag of words approach to subject specific 3D human pose interaction classification with random decision forests
Graphical Models, Volume: 76, Issue: 3, Pages: 162 - 171
Swansea University Authors: Jingjing Deng, Xianghua Xie
DOI (Published version): 10.1016/j.gmod.2013.10.006
In this work, we investigate whether it is possible to distinguish conversational interactions from observing human motion alone, in particular subject specific gestures in 3D. We adopt Kinect sensors to obtain 3D displacement and velocity measurements, followed by wavelet decomposition to extract low level temporal features. These features are then generalized to form a visual vocabulary, which can be further generalized to a set of topics from the temporal distributions of the visual vocabulary. A subject specific supervised learning approach based on Random Forests is used to classify the testing sequences into seven different conversational scenarios. The conversational scenarios considered in this work have rather subtle differences among them. Unlike typical action or event recognition, each interaction in our case contains many instances of primitive motions and actions, many of which are shared among different conversational scenarios. That is, the interactions we are concerned with are not micro or instant events, such as hugging and high-five, but rather interactions over a period of time that consist of rather similar individual motions, micro actions and interactions. We believe this is among the first works devoted to subject specific conversational interaction classification using 3D pose features, and that shows this task is indeed possible.
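The feature stages named in the abstract (velocity measurement, wavelet decomposition, visual vocabulary) can be illustrated with a minimal sketch. It is not the authors' implementation. The Haar wavelet, the window length of 4 frames, the two hand-picked codewords and the toy joint track are all assumptions made for illustration; the abstract does not specify the wavelet family or how the vocabulary is built. The topic generalization and the Random Forest classifier are omitted.

```python
import math

def velocity(positions):
    """Frame-to-frame difference of one joint coordinate over time."""
    return [b - a for a, b in zip(positions, positions[1:])]

def haar_step(signal):
    """One level of the Haar wavelet transform on an even-length signal."""
    s = math.sqrt(2.0)
    approx = [(signal[i] + signal[i + 1]) / s for i in range(0, len(signal), 2)]
    detail = [(signal[i] - signal[i + 1]) / s for i in range(0, len(signal), 2)]
    return approx, detail

def haar_features(window, levels=2):
    """Detail coefficients of each level, followed by the final approximation."""
    feats = []
    approx = list(window)
    for _ in range(levels):
        approx, detail = haar_step(approx)
        feats.extend(detail)
    return feats + approx

def nearest_word(feature, codebook):
    """Index of the closest codeword by squared Euclidean distance."""
    dists = [sum((f - c) ** 2 for f, c in zip(feature, word)) for word in codebook]
    return dists.index(min(dists))

def bag_of_words(track, codebook, window=4):
    """Normalized histogram of codeword counts over non-overlapping windows."""
    counts = [0] * len(codebook)
    for start in range(0, len(track) - window + 1, window):
        feat = haar_features(track[start:start + window])
        counts[nearest_word(feat, codebook)] += 1
    total = sum(counts) or 1
    return [c / total for c in counts]

# Hypothetical 1D joint coordinate: still, moving, then still again.
track = [0.0, 0.0, 0.0, 0.0, 0.0, 1.0, 2.0, 3.0, 4.0, 4.0, 4.0, 4.0, 4.0]
vel = velocity(track)

# Hand-picked codebook (assumption): one "still" word and one "steady motion" word.
codebook = [haar_features([0.0] * 4), haar_features([1.0] * 4)]
hist = bag_of_words(vel, codebook)
print(hist)
```

In a fuller pipeline the codebook would be learned from training data, for example by clustering, and histograms such as `hist`, or topic distributions derived from them, would serve as the input to a per-subject Random Forest classifier.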
Human interaction, Action recognition, Human pose, Random forests, Bag of words
Faculty of Science and Engineering