Search results - "Mysore, Gautham J."

Report

Emotion Embedding Spaces for Matching Music to Stories

Authors: Won, Minz; Salamon, Justin; Bryan, Nicholas J.; Mysore, Gautham J.; Serra, Xavier

Content creators often use music to enhance their stories, as it can be a powerful tool to convey emotion. In this paper, our goal is to help creators find music to match the emotion of their story. We focus on text-based stories that can be auralize

External link: http://arxiv.org/abs/2111.13468

Report

Controllable Neural Prosody Synthesis

Authors: Morrison, Max; Jin, Zeyu; Salamon, Justin; Bryan, Nicholas J.; Mysore, Gautham J.

Speech synthesis has recently seen significant improvements in fidelity, driven by the advent of neural vocoders and neural prosody generators. However, these systems lack intuitive user controls over prosody, making them unable to rectify prosody er

External link: http://arxiv.org/abs/2008.03388

Report

F0-consistent many-to-many non-parallel voice conversion via conditional autoencoder

Authors: Qian, Kaizhi; Jin, Zeyu; Hasegawa-Johnson, Mark; Mysore, Gautham J.

Non-parallel many-to-many voice conversion remains an interesting but challenging speech processing task. Many style-transfer-inspired methods such as generative adversarial networks (GANs) and variational autoencoders (VAEs) have been proposed. Rece

External link: http://arxiv.org/abs/2004.07370

Report

A Differentiable Perceptual Audio Metric Learned from Just Noticeable Differences

Authors: Manocha, Pranay; Finkelstein, Adam; Zhang, Richard; Bryan, Nicholas J.; Mysore, Gautham J.; Jin, Zeyu

Many audio processing tasks require perceptual assessment. The "gold standard" of obtaining human judgments is time-consuming, expensive, and cannot be used as an optimization criterion. On the other hand, automated metrics are efficient to compute

External link: http://arxiv.org/abs/2001.04460

Report

B-Script: Transcript-based B-roll Video Editing with Recommendations

Authors: Huber, Bernd; Shin, Hijung Valentina; Russell, Bryan; Wang, Oliver; Mysore, Gautham J.

In video production, inserting B-roll is a widely used technique to enrich the story and make a video more engaging. However, determining the right content and positions of B-roll and actually inserting it within the main footage can be challenging,

External link: http://arxiv.org/abs/1902.11216

Report

A Generative Product-of-Filters Model of Audio

Authors: Liang, Dawen; Hoffman, Matthew D.; Mysore, Gautham J.

We propose the product-of-filters (PoF) model, a generative model that decomposes audio spectra as sparse linear combinations of "filters" in the log-spectral domain. PoF makes similar assumptions to those used in the classic homomorphic filtering ap

External link: http://arxiv.org/abs/1312.5857

Periodical

Eulerian Video Magnification and Analysis.

Authors: Wadhwa, Neal; Hao-Yu Wu; Davis, Abe; Rubinstein, Michael; Shih, Eugene; Mysore, Gautham J.; Chen, Justin G.; Buyukozturk, Oral; Guttag, John V.; Freeman, William T.; Durand, Frédo

Published in: Communications of the ACM; Jan 2017, Vol. 60 Issue 1, p87-95, 9p, 6 Diagrams, 4 Graphs

Conference

Re-visiting the Music Segmentation Problem with Crowdsourcing.

Authors: Cheng-i Wang; Mysore, Gautham J.; Dubnov, Shlomo

Published in: International Society for Music Information Retrieval Conference Proceedings; 2017, p738-744, 7p
