Showing 1 - 10 of 37 for search: '"Iashin, Vladimir"'
Our objective is audio-visual synchronization with a focus on 'in-the-wild' videos, such as those on YouTube, where synchronization cues can be sparse. Our contributions include a novel audio-visual synchronization model, and training that decouples
External link:
http://arxiv.org/abs/2401.16423
The objective of this paper is audio-visual synchronisation of general videos 'in the wild'. For such videos, the events that may be harnessed for synchronisation cues may be spatially small and may occur only infrequently during a many seconds-long
External link:
http://arxiv.org/abs/2210.07055
Author:
Iashin, Vladimir, Rahtu, Esa
Recent advances in visually-induced audio generation are based on sampling short, low-fidelity, and one-class sounds. Moreover, sampling 1 second of audio from the state-of-the-art model takes minutes on a high-end GPU. In this work, we propose a sin
External link:
http://arxiv.org/abs/2110.08791
Author:
Xompero, Alessio, Donaher, Santiago, Iashin, Vladimir, Palermo, Francesca, Solak, Gökhan, Coppola, Claudio, Ishikawa, Reina, Nagao, Yuichi, Hachiuma, Ryo, Liu, Qi, Feng, Fan, Lan, Chuanlin, Chan, Rosa H. M., Christmann, Guilherme, Song, Jyun-Ting, Neeharika, Gonuguntla, Reddy, Chinnakotla Krishna Teja, Jain, Dinesh, Rehman, Bakhtawar Ur, Cavallaro, Andrea
Published in:
IEEE Access, vol. 10, 2022, 1-15
The contactless estimation of the weight of a container and the amount of its content manipulated by a person are key pre-requisites for safe human-to-robot handovers. However, opaqueness and transparencies of the container and the content, and varia
External link:
http://arxiv.org/abs/2107.12719
Human-robot object handover is a key skill for the future of human-robot collaboration. CORSMAL 2020 Challenge focuses on the perception part of this problem: the robot needs to estimate the filling mass of a container held by a human. Although there
External link:
http://arxiv.org/abs/2012.01311
Author:
Iashin, Vladimir, Rahtu, Esa
Dense video captioning aims to localize and describe important events in untrimmed videos. Existing methods mainly tackle this task by exploiting only visual features, while completely neglecting the audio track. Only a few prior works have utilized
External link:
http://arxiv.org/abs/2005.08271
Author:
Iashin, Vladimir, Rahtu, Esa
Dense video captioning is a task of localizing interesting events from an untrimmed video and producing textual description (captions) for each localized event. Most of the previous works in dense video captioning are solely based on visual informati
External link:
http://arxiv.org/abs/2003.07758
Academic article
Author:
Cruz, Cristina D., Wrigstedt, Pauli, Moslova, Karina, Iashin, Vladimir, Mäkkylä, Heidi, Ghemtio, Léo, Heikkinen, Sami, Tammela, Päivi, Perea-Buceta, Jesus E.
Published in:
European Journal of Medicinal Chemistry, 5 February 2021, 211
Author:
Iashin, Vladimir
Video is an important format of information. Humans use videos for a variety of purposes such as entertainment, education, communication, information sharing, and capturing memories. To this date, humankind accumulated a colossal amount of video mate
External link:
https://explore.openaire.eu/search/publication?articleId=od______4853::0da8c52b2e53cf0a48dc7be1edfb0e9e
https://trepo.tuni.fi/handle/10024/147432