Zobrazeno 1 - 10
of 118
pro vyhledávání: '"Chiori Hori"'
Publikováno v:
EURASIP Journal on Advances in Signal Processing, Vol 2003, Iss 2, Pp 128-139 (2003)
This paper proposes a statistical approach to automatic speech summarization. In our method, a set of words maximizing a summarization score indicating the appropriateness of summarization is extracted from automatically transcribed speech and then c
Externí odkaz:
https://doaj.org/article/198a5a6f96044b85859fcb0c35b5aff5
Publikováno v:
Interspeech 2022.
Autor:
Srinivas Sunkara, Luis A. Lastras, Jonathan K. Kummerfeld, Hannes Schulz, Walter S. Lasecki, Anoop Cherian, Adam Atkinson, Seokhwan Kim, Chiori Hori, Xiaoxue Zang, Jinchao Li, Sungjin Lee, Minlie Huang, R. Chulaka Gunasekara, Michel Galley, Tim K. Marks, Raghav Gupta, Mahmoud Adada, Baolin Peng, Abhinav Rastogi, Jianfeng Gao
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29:2529-2540
This paper introduces the Eighth Dialog System Technology Challenge. In line with recent challenges, the eighth edition focuses on applying end-to-end dialog technologies in a pragmatic way for multi-domain task-completion, noetic response selection,
Autor:
Ankit Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori
In previous work, we have proposed the Audio-Visual Scene-Aware Dialog (AVSD) task, collected an AVSD dataset, developed AVSD technologies, and hosted an AVSD challenge track at both the 7th and 8th Dialog System Technology Challenges (DSTC7, DSTC8).
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::94c3921cb48a2437233496ab59372571
http://arxiv.org/abs/2110.06894
http://arxiv.org/abs/2110.06894
Autor:
Hannes Schulz, Luis Fernando D'Haro, Seokhwan Kim, Abhinav Rastogi, Chiori Hori, R. Chulaka Gunasekara
Publikováno v:
IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29:2434-2436
The 11 papers in this special section that were part of the Dialog System Technology Challenge. Research competitions have been a long-standing and valuable tradition in the speech and language community. They accelerate the development of new techno
Publikováno v:
Interspeech 2021.
This paper addresses end-to-end automatic speech recognition (ASR) for long audio recordings such as lecture and conversational speeches. Most end-to-end ASR models are designed to recognize independent utterances, but contextual information (e.g., s
Publikováno v:
Interspeech 2021.
Video captioning is an essential technology to understand scenes and describe events in natural language. To apply it to real-time monitoring, a system needs not only to describe events accurately but also to produce the captions as soon as possible.
Autor:
Chiori Hori
slides for CVPR 2021 tutorial
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::417ef995b51c9b7b984be6fe16f5b994
Autor:
Guy Barash, Mauricio Castillo-Effen, Niyati Chhaya, Peter Clark, Huáscar Espinoza, Eitan Farchi, Christopher Geib, Odd Erik Gundersen, Seán HÉigeartaigh, José Hernández-Orallo, Chiori Hori, Xiaowei Huang, Kokil Jaidka, Pavan Kapanipathi, Sarah Keren, Seokhwan Kim, Marc Lanctot, Danny Lange, Julian McAuley, David Martinez, Marwan Mattar, null Mausam, Martin Michalowski, Reuth Mirsky, Roozbeh Mottaghi, Joseph Osborn, Julien Perolat, Martin Schmid, Arash Shaban-Nejad, Onn Shehory, Biplav Srivastava, William Streilein, Kartik Talamadupula, Julian Togelius, Koichiro Yoshino, Quanshi Zhang, Imed Zitouni
Publikováno v:
AI Magazine. 40:67-78
The workshop program of the Association for the Advancement of Artificial Intelligence’s 33rd Conference on Artificial Intelligence (AAAI-19) was held in Honolulu, Hawaii, on Sunday and Monday, January 27–28, 2019. There were fifteen workshops in
Publikováno v:
Computer Speech And Language, ISSN 0885-2308, 2019-05, Vol. 55
Archivo Digital UPM
Universidad Politécnica de Madrid
Archivo Digital UPM
Universidad Politécnica de Madrid
End-to-end dialog systems are gaining interest due to the recent advances of deep neural networks and the availability of large human–human dialog corpora. However, in spite of being of fundamental importance to systematically improve the performan