Search results - "MARKOVIC, Dejan"

Report

Modeling and Driving Human Body Soundfields through Acoustic Primitives

Authors: Huang, Chao; Markovic, Dejan; Xu, Chenliang; Richard, Alexander

While rendering and animation of photorealistic 3D human body models have matured and reached an impressive quality over the past years, modeling the spatial audio associated with such full body models has been largely ignored so far. In this work, w

External link: http://arxiv.org/abs/2407.13083

Report

ScoreDec: A Phase-preserving High-Fidelity Audio Codec with A Generalized Score-based Diffusion Post-filter

Authors: Wu, Yi-Chiao; Marković, Dejan; Krenn, Steven; Gebru, Israel D.; Richard, Alexander

Although recent mainstream waveform-domain end-to-end (E2E) neural audio codecs achieve impressive coded audio quality with a very low bitrate, the quality gap between the coded and natural audio is still significant. A generative adversarial network

External link: http://arxiv.org/abs/2401.12160

Report

Sounding Bodies: Modeling 3D Spatial Sound of Humans Using Body Pose and Audio

Authors: Xu, Xudong; Markovic, Dejan; Sandakly, Jacob; Keebler, Todd; Krenn, Steven; Richard, Alexander

While 3D human body modeling has received much attention in computer vision, modeling the acoustic equivalent, i.e. modeling 3D spatial audio produced by body motion and speech, has fallen short in the community. To close this gap, we present a model

External link: http://arxiv.org/abs/2311.06285

Report

AudioDec: An Open-source Streaming High-fidelity Neural Audio Codec

Authors: Wu, Yi-Chiao; Gebru, Israel D.; Marković, Dejan; Richard, Alexander

A good audio codec for live applications such as telecommunication is characterized by three key properties: (1) compression, i.e. the bitrate that is required to transmit the signal should be as low as possible; (2) latency, i.e. encoding and deco

External link: http://arxiv.org/abs/2305.16608

Report

Reconstructing the Dynamic Directivity of Unconstrained Speech

Authors: Noufi, Camille; Markovic, Dejan; Dodds, Peter

This article presents a method for estimating and reconstructing the spatial energy distribution pattern of natural speech, which is crucial for achieving realistic vocal presence in virtual communication settings. The method comprises two stages. Fi

External link: http://arxiv.org/abs/2209.04473

Report

End-to-End Binaural Speech Synthesis

Authors: Huang, Wen Chin; Markovic, Dejan; Richard, Alexander; Gebru, Israel Dejene; Menon, Anjali

In this work, we present an end-to-end binaural speech synthesis system that combines a low-bitrate audio codec with a powerful binaural decoder that is capable of accurate speech binauralization while faithfully reconstructing environmental factors

External link: http://arxiv.org/abs/2207.03697

Report

Implicit Neural Spatial Filtering for Multichannel Source Separation in the Waveform Domain

Authors: Markovic, Dejan; Defossez, Alexandre; Richard, Alexander

We present a single-stage causal waveform-to-waveform multichannel model that can separate moving sound sources based on their broad spatial locations in a dynamic acoustic scene. We divide the scene into two spatial regions containing, respectively,

External link: http://arxiv.org/abs/2206.15423

Academic article

Geraniol in vitro and geraniol-based emulsion ex vivo potential against four-species Streptococcus spp. biofilm relevant for dentistry

Authors: Nemoda, Milica; Veljković, Filip; Nikolić, Biljana; Brkić, Snežana; Marković, Dejan; Momčilović, Miloš; Lal, Mohan; Živković, Lada; Marinković, Jelena

Published in: Industrial Crops & Products, 15 December 2024, 222, Part 3

Report

Audio-Visual Speech Codecs: Rethinking Audio-Visual Speech Enhancement by Re-Synthesis

Authors: Yang, Karren; Markovic, Dejan; Krenn, Steven; Agrawal, Vasu; Richard, Alexander

Since facial actions such as lip movements contain significant information about speech content, it is not surprising that audio-visual speech enhancement methods are more accurate than their audio-only counterparts. Yet, state-of-the-art approaches

External link: http://arxiv.org/abs/2203.17263
