Zobrazeno 1 - 10
of 218
pro vyhledávání: '"Goto Masataka"'
Lyric translation plays a pivotal role in amplifying the global resonance of music, bridging cultural divides, and fostering universal connections. Translating lyrics, unlike conventional translation tasks, requires a delicate balance between singabi
Externí odkaz:
http://arxiv.org/abs/2308.13715
Autor:
Yakura, Hiromu, Goto, Masataka
Recent text-to-audio generation techniques have the potential to allow novice users to freely generate music audio. Even if they do not have musical knowledge, such as about chord progressions and instruments, users can try various text prompts to ge
Externí odkaz:
http://arxiv.org/abs/2307.13005
CatAlyst uses generative models to help workers' progress by influencing their task engagement instead of directly contributing to their task outputs. It prompts distracted workers to resume their tasks by generating a continuation of their work and
Externí odkaz:
http://arxiv.org/abs/2302.05678
Current deep learning techniques for style transfer would not be optimal for design support since their "one-shot" transfer does not fit exploratory design processes. To overcome this gap, we propose parametric transcription, which transcribes an end
Externí odkaz:
http://arxiv.org/abs/2105.09207
Deep generative models allow even novice composers to generate various melodies by sampling latent vectors. However, finding the desired melody is challenging since the latent space is unintuitive and high-dimensional. In this work, we present an int
Externí odkaz:
http://arxiv.org/abs/2010.03190
We attempt to recognize and track lyric words in lyric videos. Lyric video is a music video showing the lyric words of a song. The main characteristic of lyric videos is that the lyric words are shown at frames synchronously with the music. The diffi
Externí odkaz:
http://arxiv.org/abs/2006.11933
Publikováno v:
ACM Trans. Graph. 39, 4 (July 2020), pp.88:1-88:12
Visual design tasks often involve tuning many design parameters. For example, color grading of a photograph involves many parameters, some of which non-expert users might be unfamiliar with. We propose a novel user-in-the-loop optimization method tha
Externí odkaz:
http://arxiv.org/abs/2005.04107
Autor:
Nakatsuka, Takayuki, Yoshii, Kazuyoshi, Koyama, Yuki, Fukayama, Satoru, Goto, Masataka, Morishima, Shigeo
This paper proposes a statistical approach to 2D pose estimation from human images. The main problems with the standard supervised approach, which is based on a deep recognition (image-to-pose) model, are that it often yields anatomically implausible
Externí odkaz:
http://arxiv.org/abs/2004.03811
Publikováno v:
EURASIP Journal on Advances in Signal Processing, Vol 2010, Iss 1, p 172961 (2010)
We describe a novel query-by-example (QBE) approach in music information retrieval that allows a user to customize query examples by directly modifying the volume of different instrument parts. The underlying hypothesis of this approach is that the m
Externí odkaz:
https://doaj.org/article/bae2f696cb0944c190d69baaefc095f9
Publikováno v:
EURASIP Journal on Advances in Signal Processing, Vol 2007, Iss 1, p 086874 (2007)
Externí odkaz:
https://doaj.org/article/0410472d156c45759f5875a339cdee55