Výsledky vyhledávání

Akademický článek

A Statistical Approach to Automatic Speech Summarization

Autor: Chiori Hori, Sadaoki Furui, Rob Malkin, Alex Waibel, Hua Yu

Publikováno v: EURASIP Journal on Advances in Signal Processing, Vol 2003, Iss 2, Pp 128-139 (2003)

This paper proposes a statistical approach to automatic speech summarization. In our method, a set of words maximizing a summarization score indicating the appropriateness of summarization is extracted from automatically transcribed speech and then c

Externí odkaz: https://doaj.org/article/198a5a6f96044b85859fcb0c35b5aff5

Zobrazit plný text záznamu

Low-Latency Online Streaming VideoQA Using Audio-Visual Transformers

Autor: Chiori Hori, Takaaki Hori, Jonathan Le Roux

Publikováno v: Interspeech 2022.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::1421ad37a1dfa37fc219c400e5eac23d
https://doi.org/10.21437/interspeech.2022-10891

Zobrazit plný text záznamu

Overview of the Eighth Dialog System Technology Challenge: DSTC8

Publikováno v: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29:2529-2540

This paper introduces the Eighth Dialog System Technology Challenge. In line with recent challenges, the eighth edition focuses on applying end-to-end dialog technologies in a pragmatic way for multi-domain task-completion, noetic response selection,

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::8d8fa80d824fc445e0d86d2a58bd61c5
https://doi.org/10.1109/taslp.2021.3078368

Zobrazit plný text záznamu

Audio-Visual Scene-Aware Dialog and Reasoning using Audio-Visual Transformers with Joint Student-Teacher Learning

Autor: Ankit Shah, Shijie Geng, Peng Gao, Anoop Cherian, Takaaki Hori, Tim K. Marks, Jonathan Le Roux, Chiori Hori

In previous work, we have proposed the Audio-Visual Scene-Aware Dialog (AVSD) task, collected an AVSD dataset, developed AVSD technologies, and hosted an AVSD challenge track at both the 7th and 8th Dialog System Technology Challenges (DSTC7, DSTC8).

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::94c3921cb48a2437233496ab59372571
http://arxiv.org/abs/2110.06894

Zobrazit plný text záznamu

Editorial: Special Issue on the Eighth Dialog System Technology Challenge

Autor: Hannes Schulz, Luis Fernando D'Haro, Seokhwan Kim, Abhinav Rastogi, Chiori Hori, R. Chulaka Gunasekara

Publikováno v: IEEE/ACM Transactions on Audio, Speech, and Language Processing. 29:2434-2436

The 11 papers in this special section that were part of the Dialog System Technology Challenge. Research competitions have been a long-standing and valuable tradition in the speech and language community. They accelerate the development of new techno

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::f7ff55bfdc0d2144840b266779c8a3ad
https://doi.org/10.1109/taslp.2021.3097842

Zobrazit plný text záznamu

Advanced Long-Context End-to-End Speech Recognition Using Context-Expanded Transformers

Autor: Takaaki Hori, Niko Moritz, Chiori Hori, Jonathan Le Roux

Publikováno v: Interspeech 2021.

This paper addresses end-to-end automatic speech recognition (ASR) for long audio recordings such as lecture and conversational speeches. Most end-to-end ASR models are designed to recognize independent utterances, but contextual information (e.g., s

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::1ff5ce8d0304be8749cbed6ffd5ad0a4
https://doi.org/10.21437/interspeech.2021-1643

Zobrazit plný text záznamu

Optimizing Latency for Online Video Captioning Using Audio-Visual Transformers

Autor: Chiori Hori, Jonathan Le Roux, Takaaki Hori

Publikováno v: Interspeech 2021.

Video captioning is an essential technology to understand scenes and describe events in natural language. To apply it to real-time monitoring, a system needs not only to describe events accurately but also to produce the captions as soon as possible.

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::67536e9dde7dd66ea7ddf985847a0e2e
https://doi.org/10.21437/interspeech.2021-1975

Zobrazit plný text záznamu

human perspective scene understanding

Autor: Chiori Hori

slides for CVPR 2021 tutorial

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::417ef995b51c9b7b984be6fe16f5b994

Zobrazit plný text záznamu

Reports of the Workshops Held at the 2019 AAAI Conference on Artificial Intelligence

Publikováno v: AI Magazine. 40:67-78

The workshop program of the Association for the Advancement of Artificial Intelligence’s 33rd Conference on Artificial Intelligence (AAAI-19) was held in Honolulu, Hawaii, on Sunday and Monday, January 27–28, 2019. There were fifteen workshops in

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_________::b067cb4267e2efecb2f3f7d031938f87
https://doi.org/10.1609/aimag.v40i3.4981

Zobrazit plný text záznamu

Automatic evaluation of end-to-end dialog systems with adequacy-fluency metrics

Autor: Rafael E. Banchs, Luis Fernando D'Haro, Chiori Hori, Haizhou Li

Publikováno v: Computer Speech And Language, ISSN 0885-2308, 2019-05, Vol. 55
Archivo Digital UPM
Universidad Politécnica de Madrid

End-to-end dialog systems are gaining interest due to the recent advances of deep neural networks and the availability of large human–human dialog corpora. However, in spite of being of fundamental importance to systematically improve the performan

Externí odkaz: https://explore.openaire.eu/search/publication?articleId=doi_dedup___::224072189bbef9bf799aeab69dd2ceca
https://doi.org/10.1016/j.csl.2018.12.004

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání