Rational 3D object placement based on deep learning based plane detection.

Autor: Yang, Wei-Jong, Lu, Li-Yu, Chan, Din-Yuen
Předmět:
Zdroj: Multimedia Tools & Applications; Nov2023, Vol. 82 Issue 28, p44555-44576, 22p
Abstrakt: Effective acquisition of 3D planar features from a 2D image for immersive AR applications is challenging without any 3D depth information. In this paper, we present a novel planar object placement (POP) system for rational insertion of 3D objects on plane surfaces and fitting their plane orientations in a given 2D image. The POP system is composed of an adaptive edge extractor (AEE), a vanishing point detector (VPD), a rational fitting transformer (RFT) and a plane normal detection network (PNDnet), which performs the estimation of planar features. With the inputs of the color image and the guided edge map retrieved by the AEE, the PNDnet based on a gating weighted U-Net, whose gating weights are controlled by global cross-level connection (GCC) blocks, can properly extract 3D information including planar pixel-wise normal map, semantic plane-segmented map and plane-wise normal vectors. Simultaneously, with plane-wise normal vectors and the VPD-estimated vanishing points, the RFT can easily induce the best placement position for projecting the AR object on a destination plane with better coherent 3D perspectives. Besides, the POP system allows the users to make the subtle tunes of 3D orientation to the inserted object for personal preferences. Experimental results show that the proposed POP system, which successfully guides the 3D-inserted objects to match up the surfaces and orientations of the planes in the target image, will become an effective AR tool for the insertion of 3D objects in target images and videos in the future. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index