Zobrazeno 1 - 10
of 12
pro vyhledávání: '"Dmitriy Genzel"'
Autor:
Yao-Yuan Yang, Moto Hira, Zhaoheng Ni, Artyom Astafurov, Caroline Chen, Christian Puhrsch, David Pollack, Dmitriy Genzel, Donny Greenberg, Edward Z. Yang, Jason Lian, Jeff Hwang, Ji Chen, Peter Goldsborough, Sean Narenthiran, Shinji Watanabe, Soumith Chintala, Vincent Quenneville-Belair
Publikováno v:
ICASSP 2022 - 2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).
Publikováno v:
ACL/IJCNLP (1)
Pretraining and multitask learning are widely used to improve the speech to text translation performance. In this study, we are interested in training a speech to text translation model along with an auxiliary text to text translation task. We conduc
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::2e36e98898d076880b00ba209a7218ac
http://arxiv.org/abs/2107.05782
http://arxiv.org/abs/2107.05782
Publikováno v:
ICASSP
Attention-based sequence-to-sequence modeling provides a powerful and elegant solution for applications that need to map one sequence to a different sequence. Its success heavily relies on the availability of large amounts of training data. This pres
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::7617c00fd6219a83bc7d148cac9185ea
Autor:
Macduff Hughes, Alessandro Cattelan, Graham Neubig, Mengmeng Niu, Antonios Anastasopoulos, Philipp Koehn, Grace Tang, Eric Paquin, Rosie Lazar, Junjie Hu, Francisco Guzmán, William Lewis, Dmitriy Genzel, Christian Federmann, Marcello Federico, Zi-Yi Dou, Alp Öktem, Sylwia Tur
Publikováno v:
NLP4COVID@EMNLP
The COVID-19 pandemic is the worst pandemic to strike the world in over a century. Crucial to stemming the tide of the SARS-CoV-2 virus is communicating to vulnerable populations the means by which they can protect themselves. To this end, the collab
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::bd00aa49d30fc8e625fe3e28538a4a94
Publikováno v:
HIP@ICDAR
This paper describes an approach to estimating the unknown publication date for printed historical documents from their scanned page images, using Convolutional Neural Networks (CNN). The method primarily harnesses visual features from small image pa
Publikováno v:
ICDAR
Hidden Markov Model (HMM)-based classifiers have been successfully used for sequential labeling problems such as speech recognition and optical character recognition for decades. They have been especially successful in the domains where the segmentat
Publikováno v:
MOCR@ICDAR
While current OCR systems are able to recognize text in an increasing number of scripts and languages, typically they still need to be told in advance what those scripts and languages are. We propose an approach that repurposes the same HMM-based sys
Autor:
Andrew W. Senior, Frank Yung-Fong Tang, Ashok C. Popat, Eugene Ie, Nemanja Spasojevic, Dmitriy Genzel, Michael Edward Jahr
Publikováno v:
ICDAR
Optical character recognition is carried out using techniques borrowed from statistical machine translation. In particular, the use of multiple simple feature functions in linear combination, along with minimum-error-rate training, integrated decodin
Autor:
Dmitriy Genzel
Publikováno v:
Computational Linguistics and Intelligent Text Processing ISBN: 9783540245230
CICLing
CICLing
We propose and motivate a novel task: paragraph segmentation. We discuss and compare this task with text segmentation and discourse parsing. We present a system that performs the task with high accuracy. A variety of features is proposed and examined
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::0f9301da16ce1bf0ef191e22c292578b
https://doi.org/10.1007/978-3-540-30586-6_92
https://doi.org/10.1007/978-3-540-30586-6_92
Autor:
Dmitriy Genzel
Publikováno v:
HLT/EMNLP
Dictionaries and word translation models are used by a variety of systems, especially in machine translation. We build a multilingual dictionary induction system for a family of related resource-poor languages. We assume only the presence of a single