Force from Motion: Decoding Physical Sensation in a First Person Video

Autor:	Hyun Soo Park, Jyh-Jing Hwang, Jianbo Shi
Rok vydání:	2016
Předmět:	Inertial frame of reference business.industry 0206 medical engineering Optical flow 02 engineering and technology Rigid body dynamics 020601 biomedical engineering Centripetal force Banked turn Inverse dynamics Motion estimation 0202 electrical engineering electronic engineering information engineering Structure from motion 020201 artificial intelligence & image processing Computer vision Artificial intelligence business
Zdroj:	CVPR
Popis:	A first-person video can generate powerful physical sensations of action in an observer. In this paper, we focus on a problem of Force from Motion—decoding the sensation of 1) passive forces such as the gravity, 2) the physical scale of the motion (speed) and space, and 3) active forces exerted by the observer such as pedaling a bike or banking on a ski turn. The sensation of gravity can be observed in a natural image. We learn this image cue for predicting a gravity direction in a 2D image and integrate the prediction across images to estimate the 3D gravity direction using structure from motion. The sense of physical scale is revealed to us when the body is in a dynamically balanced state. We compute the unknown physical scale of 3D reconstructed camera motion by leveraging the torque equilibrium at a banked turn that relates the centripetal force, gravity, and the body leaning angle. The active force and torque governs 3D egomotion through the physics of rigid body dynamics. Using an inverse dynamics optimization, we directly minimize 2D reprojection error (in video) with respect to 3D world structure, active forces, and additional passive forces such as air drag and friction force. We use structure from motion with the physical scale and gravity direction as an initialization of our bundle adjustment for force estimation. Our method shows quantitatively equivalent reconstruction comparing to IMU measurements in terms of gravity and scale recovery and outperforms method based on 2D optical flow for an active action recognition task. We apply our method to first person videos of mountain biking, urban bike racing, skiing, speedflying with parachute, and wingsuit flying where inertial measurements are not accessible.
Databáze:	OpenAIRE
Externí odkaz:	https://explore.openaire.eu/search/publication?articleId=doi_________::a1b665f5d2cdc1c06754a8b8db731e5c https://doi.org/10.1109/cvpr.2016.416 Zobrazit plný text záznamu