Enhancing Data-Driven Algorithms for Human Pose Estimation and Action Recognition Through Simulation
Autor: | Cristóbal Curio, Thomas Gulde, Dennis Ludl |
---|---|
Rok vydání: | 2020 |
Předmět: |
050210 logistics & transportation
Artificial neural network business.industry Computer science Mechanical Engineering 05 social sciences Machine learning computer.software_genre Motion capture Computer Science Applications Data modeling Recurrent neural network Component (UML) 0502 economics and business Automotive Engineering Benchmark (computing) Artificial intelligence business Intelligent transportation system Pose computer |
Zdroj: | IEEE Transactions on Intelligent Transportation Systems. 21:3990-3999 |
ISSN: | 1558-0016 1524-9050 |
DOI: | 10.1109/tits.2020.2988504 |
Popis: | Recognizing human actions, reliably inferring their meaning and being able to potentially exchange mutual social information are core challenges for autonomous systems when they directly share the same space with humans. Intelligent transport systems in particular face this challenge, as interactions with people are often required. The development and testing of technical perception solutions is done mostly on standard vision benchmark datasets for which manual labelling of sensory ground truth has been a tedious but necessary task. Furthermore, rarely occurring human activities are underrepresented in these datasets, leading to algorithms not recognizing such activities. For this purpose, we introduce a modular simulation framework, which offers to train and validate algorithms on various human-centred scenarios. We describe the usage of simulation data to train a state-of-the-art human pose estimation algorithm to recognize unusual human activities in urban areas. Since the recognition of human actions can be an important component of intelligent transport systems, we investigated how simulations can be applied for his purpose. Laboratory experiments show that we can train a recurrent neural network with only simulated data based on motion capture data and 3D avatars, which achieves an almost perfect performance in the classification of those human actions on real data. |
Databáze: | OpenAIRE |
Externí odkaz: |