PERCEPTION

HOIsim: Synthesizing Realistic 3D Human-Object Interaction Data for Human Activity Recognition

Marsil Zakour, Alaeddine Mellouli, Rahul Chaudhari

Year of publication
2021
Citations
8

Abstract

Correct understanding of human activities is critical for meaningful assistance by robots in daily life. The development of perception algorithms and Deep Learning models of human activity requires large-scale sensor datasets. Good real-world activity data is, however, difficult and time-consuming to acquire. Several precisely calibrated and time-synchronized sensors are required, and the annotation and labeling of the collected sensor data is extremely labor-intensive.

To address these challenges, we present a 3D activity simulator, "HOIsim", focusing on Human-Object Interactions (HOIs). Using HOIsim, we provide a procedurally generated synthetic dataset of two sample daily life activities, "lunch" and "breakfast". The dataset contains out-of-the-box ground truth annotations in the form of human and object poses, as well as ground truth activity labels. Furthermore, we introduce methods to meaningfully randomize activity flows and the environment topology. This allows us to generate a large number of random variants of these activities in very little time.

Based on an abstraction of the low-level pose data in the form of spatiotemporal graphs of HOIs, we evaluate only the generated Lunch dataset, using two Deep Learning models for activity recognition. The first model, based on recurrent neural networks, achieves an accuracy of 87%, whereas the other, based on transformers, achieves an accuracy of 94.7%.

Keywords

Computer science, Activity recognition, Ground truth, Artificial intelligence, Abstraction, Object (grammar), Deep learning, Machine learning, Perception, Robot
