What Do I Hear? Generating Sounds for Visuals with ChatGPT
Autor: | Lin, David Chuan-En, Martelaro, Nikolas |
---|---|
Rok vydání: | 2023 |
Předmět: | |
Druh dokumentu: | Working Paper |
Popis: | This short paper introduces a workflow for generating realistic soundscapes for visual media. In contrast to prior work, which primarily focus on matching sounds for on-screen visuals, our approach extends to suggesting sounds that may not be immediately visible but are essential to crafting a convincing and immersive auditory environment. Our key insight is leveraging the reasoning capabilities of language models, such as ChatGPT. In this paper, we describe our workflow, which includes creating a scene context, brainstorming sounds, and generating the sounds. Comment: Demo: http://soundify.cc |
Databáze: | arXiv |
Externí odkaz: |