Deep CNN, Body Pose, and Body-Object Interaction Features for Drivers' Activity Monitoring

IEEE transactions on intelligent transportation systems(2022)

Cited 22|Views44
No score
Abstract
Automatic recognition and prediction of in-vehicle human activities has a significant impact on the next generation of driver assistance and intelligent autonomous vehicles. In this article, we present a novel single image driver action recognition algorithm inspired by human perception that often focuses selectively on parts of the images to acquire information at specific places which are distinct to a given task. Unlike existing approaches, we argue that human activity is a combination of pose and semantic contextual cues. In detail, we model this by considering the configuration of body joints, their interaction with objects being represented as a pairwise relation to capture the structural information. Our body-pose and body-object interaction representation is built to be semantically rich and meaningful, which is highly discriminative even though it is coupled with a basic linear SVM classifier. We also propose a Multi-stream Deep Fusion Network (MDFN) for combining high-level semantics with CNN features. Our experimental results demonstrate that the proposed approach significantly improves the drivers’ action recognition accuracy on two exacting datasets.
More
Translated text
Key words
Vehicles,Feature extraction,Semantics,Computational modeling,Activity recognition,Monitoring,Image recognition,Transfer learning,intelligent vehicles,in-vehicle activity monitoring,deep learning,body pose and contextual descriptor,neural network-based fusion
AI Read Science
Must-Reading Tree
Example
Generate MRT to find the research sequence of this paper
Chat Paper
Summary is being generated by the instructions you defined