Every Moment Counts: Dense Detailed Labeling of Actions in Complex Videos
Autor: | Ning Jin, Olga Russakovsky, Li Fei-Fei, Mykhaylo Andriluka, Greg Mori, Serena Yeung |
---|---|
Rok vydání: | 2017 |
Předmět: |
business.industry
Computer science Frame (networking) 020206 networking & telecommunications Ranging 02 engineering and technology Machine learning computer.software_genre Multiple input Moment (mathematics) Action (philosophy) Artificial Intelligence Pattern recognition (psychology) 0202 electrical engineering electronic engineering information engineering Action recognition 020201 artificial intelligence & image processing The Internet Computer Vision and Pattern Recognition Artificial intelligence business computer Software |
Zdroj: | International Journal of Computer Vision. 126:375-389 |
ISSN: | 1573-1405 0920-5691 |
DOI: | 10.1007/s11263-017-1013-y |
Popis: | Every moment counts in action recognition. A comprehensive understanding of human activity in video requires labeling every frame according to the actions occurring, placing multiple labels densely over a video sequence. To study this problem we extend the existing THUMOS dataset and introduce MultiTHUMOS, a new dataset of dense labels over unconstrained internet videos. Modeling multiple, dense labels benefits from temporal relations within and across classes. We define a novel variant of long short-term memory deep networks for modeling these temporal relations via multiple input and output connections. We show that this model improves action labeling accuracy and further enables deeper understanding tasks ranging from structured retrieval to action prediction. |
Databáze: | OpenAIRE |
Externí odkaz: |