Zobrazeno 1 - 10
of 4 863
pro vyhledávání: '"A. Büthe"'
Autor:
Büthe, Jan
Reducing the bandwidth of speech is common practice in resource constrained environments like low-bandwidth speech transmission or low-complexity vocoding. We propose a lightweight and robust method for extending the bandwidth of wideband speech sign
Externí odkaz:
http://arxiv.org/abs/2412.11392
Neural vocoders are now being used in a wide range of speech processing applications. In many of those applications, the vocoder can be the most complex component, so finding lower complexity algorithms can lead to significant practical benefits. In
Externí odkaz:
http://arxiv.org/abs/2405.21069
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Autor:
Büthe, Lea1 (AUTHOR), Westhofen, Gina1 (AUTHOR), Hille, Andrea2 (AUTHOR), Büntzel, Judith1 (AUTHOR) judith.buentzel@med.uni-goettingen.de
Publikováno v:
Current Oncology. Dec2024, Vol. 31 Issue 12, p7663-7685. 23p.
Speech codec enhancement methods are designed to remove distortions added by speech codecs. While classical methods are very low in complexity and add zero delay, their effectiveness is rather limited. Compared to that, DNN-based methods deliver high
Externí odkaz:
http://arxiv.org/abs/2309.14521
Pitch estimation is an essential step of many speech processing algorithms, including speech coding, synthesis, and enhancement. Recently, pitch estimators based on deep neural networks (DNNs) have have been outperforming well-established DSP-based t
Externí odkaz:
http://arxiv.org/abs/2309.14507
Akademický článek
Tento výsledek nelze pro nepřihlášené uživatele zobrazit.
K zobrazení výsledku je třeba se přihlásit.
K zobrazení výsledku je třeba se přihlásit.
Classical speech coding uses low-complexity postfilters with zero lookahead to enhance the quality of coded speech, but their effectiveness is limited by their simplicity. Deep Neural Networks (DNNs) can be much more effective, but require high compl
Externí odkaz:
http://arxiv.org/abs/2307.06610
GAN vocoders are currently one of the state-of-the-art methods for building high-quality neural waveform generative models. However, most of their architectures require dozens of billion floating-point operations per second (GFLOPS) to generate speec
Externí odkaz:
http://arxiv.org/abs/2212.04532
Despite recent advancements in packet loss concealment (PLC) using deep learning techniques, packet loss remains a significant challenge in real-time speech communication. Redundancy has been used in the past to recover the missing information during
Externí odkaz:
http://arxiv.org/abs/2212.04453