Sim2Real: Generative AI to Enhance Photorealism through Domain Transfer with GAN and Seven-Chanel-360°-Paired-Images Dataset.

Authors: Bresson M, Xing Y, Guo W (all: School of Aerospace, Transport and Manufacturing, Cranfield University, Bedfordshire MK43 0AL, UK)
Language: English
Source: Sensors (Basel, Switzerland) [Sensors (Basel)] 2023 Dec 23; Vol. 24 (1). Date of Electronic Publication: 2023 Dec 23.
DOI: 10.3390/s24010094
Abstract: This work aims to provide a solution to data scarcity by allowing end users to generate new images while carefully controlling building shapes and environments. While Generative Adversarial Networks (GANs) are the most common network type for image generation tasks, recent studies have focused only on RGB-to-RGB domain transfer tasks. This study utilises a state-of-the-art GAN for domain transfer that effectively transforms a multi-channel image from a 3D scene into a photorealistic image. It relies on a custom dataset that pairs 360° images from a simulated domain with corresponding 360° street views. The simulated domain includes depth, segmentation map, and surface normal (stored in seven-channel images), while the target domain is composed of photos from Paris. Samples come in pairs thanks to careful virtual camera positioning. To enhance the simulated images into photorealistic views, the generator is designed to preserve semantic information throughout the layers. The study concludes with generated photorealistic samples from the city of Paris, along with strategies to further refine model performance. The output samples are realistic enough to be used to train and improve future AI models.
Database: MEDLINE
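
Note: The abstract states that depth, segmentation map, and surface normal are stored together in seven-channel images, but it does not give the channel layout. The following is a minimal sketch of how such an input could be assembled, assuming one depth channel, a three-channel colour-coded segmentation map, and three surface-normal components (1 + 3 + 3 = 7). The function name `stack_seven_channels`, the channel order, and the toy frame size are hypothetical and are not taken from the paper.

```python
import numpy as np


def stack_seven_channels(depth, segmentation_rgb, normals):
    """Stack simulated-domain maps into one (H, W, 7) float32 array.

    Assumed layout (NOT confirmed by the abstract):
      channel 0    -> depth
      channels 1-3 -> colour-coded segmentation map
      channels 4-6 -> surface normal (x, y, z)
    """
    depth = np.asarray(depth, dtype=np.float32)
    segmentation_rgb = np.asarray(segmentation_rgb, dtype=np.float32)
    normals = np.asarray(normals, dtype=np.float32)

    if depth.ndim != 2:
        raise ValueError("depth must have shape (H, W)")
    h, w = depth.shape
    if segmentation_rgb.shape != (h, w, 3):
        raise ValueError("segmentation_rgb must have shape (H, W, 3)")
    if normals.shape != (h, w, 3):
        raise ValueError("normals must have shape (H, W, 3)")

    # Give depth a trailing channel axis, then join along the channel axis.
    return np.concatenate([depth[..., None], segmentation_rgb, normals], axis=-1)


# Toy 360-degree (equirectangular) frame; the 4 x 8 size is illustrative only.
H, W = 4, 8
depth = np.ones((H, W))
segmentation = np.zeros((H, W, 3))
normals = np.zeros((H, W, 3))
normals[..., 2] = 1.0  # every normal points along +z in this toy example

sample = stack_seven_channels(depth, segmentation, normals)
print(sample.shape)
```

An array of this shape would be the generator input in a paired setup, with the matching street-view photo as the three-channel target; how the authors actually encode and normalise each channel is described in the full paper, not in this record.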