Zobrazeno 1 - 10
of 50
pro vyhledávání: '"Om D. Deshmukh"'
Publikováno v:
The Journal of the Acoustical Society of America. 143:2289-2300
The principles of the existing pitch estimation techniques are often different and complementary in nature. In this work, a frame selective dynamic programming (FSDP) method is proposed which exploits the complementary characteristics of two existing
Publikováno v:
The Journal of the Acoustical Society of America. 146(3)
Speech (syllable) rate estimation typically involves computing a feature contour based on sub-band energies having strong local maxima/peaks at syllable nuclei, which are detected with the help of voicing decisions (VDs). While such a two-stage schem
Publikováno v:
IUI
One of the challenges that is holding back wide spread consumption of educational videos on mobile devices is the lack of mobile interfaces which can provide efficient video navigation capabilities. In this paper, we utilize multi-modal data analysis
Publikováno v:
ICASSP
Automatic syllable stress detection is useful in assessing and diagnosing the quality of the pronunciation of second language (L2) learners in an automated way. Typically, the syllable stress depends on three prominence measures - intensity level, du
Publikováno v:
ICPR
Real-world datasets consist of data representations (views) from different sources which often provide information complementary to each other. Multi-view learning algorithms aim at exploiting the complementary information present in different views
Publikováno v:
W4A
Recently, Mobile Instant Messaging Services (MIMs) such as WhatsApp have shown tremendous potential in enabling communication among diverse set of people. Such services have an even more critical role to play in developing regions. Due to the digital
Publikováno v:
IUI Companion
The amount of instructional videos available online, already in tens of thousands of hours, is growing steadily. A major bottleneck in their wide spread usage is the lack of tools for easy consumption of these videos. In this demonstration, we presen
Autor:
Ankit Gandhi, Arijit Biswas, Om D. Deshmukh, Saurabh Srivastava, Kuldeep Yadav, Kundan Shrivastava
Publikováno v:
IUI
Instructional videos are one of the most popular ways of teaching and learning in an online setting. However, navigation in videos is linear as compared to other instructional resources such as textbooks, where a table of topics and a multi-faceted i
Autor:
Om D. Deshmukh
Publikováno v:
Speech in Mobile and Pervasive Environments
This chapter contains sections titled: Automatic speech recognition Mathematical formulation Acoustic parameterization Acoustic modeling Language modeling Modifications for embedded speech recognition Applications Text‐to‐speech synthesis Text to
Autor:
Om D. Deshmukh, Ashish Verma
Publikováno v:
Speech Communication. 51:1224-1233
This paper presents a word-independent technique for classifying the syllable stress of spoken English words. The proposed technique improves upon the existing word-independent techniques by utilizing the acoustic differences of various syllable nucl