Image Captioning Generator Text-to-Speech.

Autor: Sharma, Tripti, Anand, Neetu, Gaurav, Kumar, Kapur, Rohit
Předmět:
Zdroj: International Journal of Next-Generation Computing; Oct2022, Vol. 13 Issue 3, p449-458, 10p
Abstrakt: With the rapid growth of artificial intelligence in recent years, image caption has increasingly grabbed the attention of many artificial intelligence researchers and has become a fascinating and challenging task. In this research work a model is created for blind people that can guide and support them while traveling on the highways just with the help of a smartphone application. This can be accomplished by first converting the scene in front of the user into text and then converting text into voice output. The method for the generation of image legends based on deep neural networks. With an image as an entry, the method can display an English sentence describing the contents of the image. The user first provides a voice command, then a quick snapshot is captured by the camera or webcam. This image is then fed as input to the image caption generator template that generates a caption for the image. Next, this caption text is converted to speech, which gives rise to a voice message on the description of the image. The objective of the research work is to develop a model that can help the blind people while travelling with the help of smartphone application. [ABSTRACT FROM AUTHOR]
Databáze: Complementary Index