Zobrazeno 1 - 10
of 52
pro vyhledávání: '"Mysore, Gautham J."'
Content creators often use music to enhance their stories, as it can be a powerful tool to convey emotion. In this paper, our goal is to help creators find music to match the emotion of their story. We focus on text-based stories that can be auralize
Externí odkaz:
http://arxiv.org/abs/2111.13468
Speech synthesis has recently seen significant improvements in fidelity, driven by the advent of neural vocoders and neural prosody generators. However, these systems lack intuitive user controls over prosody, making them unable to rectify prosody er
Externí odkaz:
http://arxiv.org/abs/2008.03388
Non-parallel many-to-many voice conversion remains an interesting but challenging speech processing task. Many style-transfer-inspired methods such as generative adversarial networks (GANs) and variational autoencoders (VAEs) have been proposed. Rece
Externí odkaz:
http://arxiv.org/abs/2004.07370
Autor:
Manocha, Pranay, Finkelstein, Adam, Zhang, Richard, Bryan, Nicholas J., Mysore, Gautham J., Jin, Zeyu
Many audio processing tasks require perceptual assessment. The ``gold standard`` of obtaining human judgments is time-consuming, expensive, and cannot be used as an optimization criterion. On the other hand, automated metrics are efficient to compute
Externí odkaz:
http://arxiv.org/abs/2001.04460
In video production, inserting B-roll is a widely used technique to enrich the story and make a video more engaging. However, determining the right content and positions of B-roll and actually inserting it within the main footage can be challenging,
Externí odkaz:
http://arxiv.org/abs/1902.11216
We propose the product-of-filters (PoF) model, a generative model that decomposes audio spectra as sparse linear combinations of "filters" in the log-spectral domain. PoF makes similar assumptions to those used in the classic homomorphic filtering ap
Externí odkaz:
http://arxiv.org/abs/1312.5857
Autor:
Wadhwa, Neal, Hao-Yu Wu, Davis, Abe, Rubinstein, Michael, Shih, Eugene, Mysore, Gautham J., Chen, Justin G., Buyukozturk, Oral, Guttag, John V., Freeman, William T., Durand, Frédo
Publikováno v:
Communications of the ACM; Jan2017, Vol. 60 Issue 1, p87-95, 9p, 6 Diagrams, 4 Graphs
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Publikováno v:
International Society for Music Information Retrieval Conference Proceedings; 2017, p738-744, 7p