Locally Aware Piecewise Transformation Fields for 3D Human Mesh Registration
Author: Shaofei Wang, Siyu Tang, Andreas Geiger
Year: 2021
Subject: FOS: Computer and information sciences; Artificial neural network; Computer science; Computer Vision and Pattern Recognition (cs.CV); Computer Science - Computer Vision and Pattern Recognition; Initialization; Translation (geometry); Transformation (function); Parametric model; Piecewise; Point (geometry); Computer vision; Artificial intelligence; Feature learning
Source: 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR)
DOI: 10.1109/cvpr46437.2021.00755
Description: Registering point clouds of dressed humans to parametric human models is a challenging task in computer vision. Traditional approaches often rely on heavily engineered pipelines that require accurate manual initialization of human poses and tedious post-processing. More recently, learning-based methods have been proposed in the hope of automating this process. We observe that pose initialization is key to accurate registration, but existing methods often fail to provide accurate pose initialization. One major obstacle is that, despite recent efforts on rotation representation learning in neural networks, regressing joint rotations from point clouds or images of humans is still very challenging. To this end, we propose novel piecewise transformation fields (PTF), a set of functions that learn 3D translation vectors to map any query point in posed space to its corresponding position in rest-pose space. We combine PTF with multi-class occupancy networks, obtaining a novel learning-based framework that learns to simultaneously predict shape and per-point correspondences between the posed space and the canonical space for clothed humans. Our key insight is that the translation vector for each query point can be effectively estimated using point-aligned local features; consequently, per-bone rigid transformations and joint rotations can be obtained efficiently via least-squares fitting given the estimated point correspondences, circumventing the challenging task of directly regressing joint rotations with neural networks. Furthermore, the proposed PTF facilitates canonicalized occupancy estimation, which greatly improves generalization capability and results in more accurate surface reconstruction with only half the parameters of the state-of-the-art. Both qualitative and quantitative studies show that fitting parametric models with poses initialized by our network results in much better registration quality, especially for extreme poses.
ISBN: 978-1-6654-4509-2; 978-1-6654-4510-8
Database: OpenAIRE
External link:
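The description notes that, once per-point correspondences between posed space and rest-pose space are predicted, rigid per-bone transformations can be recovered by least-squares fitting rather than by regressing joint rotations directly. The sketch below is a minimal illustration of that fitting step only, not the authors' code: it solves for a single rigid transform from point correspondences with the standard Kabsch/Procrustes solution in numpy; the function name and the toy data are assumptions for demonstration.

```python
import numpy as np

def fit_rigid_transform(src, dst):
    """Least-squares rigid fit (Kabsch/Procrustes).

    Finds R (3x3 rotation) and t (3,) translation minimizing
    sum_i || R @ src[i] + t - dst[i] ||^2 over corresponding points,
    e.g. rest-pose points and their posed-space counterparts.
    """
    # Center both point sets.
    src_mean = src.mean(axis=0)
    dst_mean = dst.mean(axis=0)
    src_c = src - src_mean
    dst_c = dst - dst_mean

    # Cross-covariance and SVD.
    H = src_c.T @ dst_c
    U, _, Vt = np.linalg.svd(H)

    # Build the rotation, correcting a possible reflection.
    d = np.sign(np.linalg.det(Vt.T @ U.T))
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T

    t = dst_mean - R @ src_mean
    return R, t


if __name__ == "__main__":
    # Toy check: recover a known rotation/translation from noisy correspondences.
    rng = np.random.default_rng(0)
    angle = np.deg2rad(30.0)
    R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                       [np.sin(angle),  np.cos(angle), 0.0],
                       [0.0,            0.0,           1.0]])
    t_true = np.array([0.1, -0.2, 0.3])

    canonical = rng.normal(size=(500, 3))          # hypothetical rest-pose points
    posed = canonical @ R_true.T + t_true          # corresponding posed points
    posed += 0.001 * rng.normal(size=posed.shape)  # small noise

    R_est, t_est = fit_rigid_transform(canonical, posed)
    print(np.allclose(R_est, R_true, atol=1e-2),
          np.allclose(t_est, t_true, atol=1e-2))
```

In the paper's setting this kind of fit would be applied per bone, using the subset of points assigned to that bone; the single-transform version above is only meant to show why regressing correspondences plus a closed-form least-squares solve can sidestep direct rotation regression.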