Zobrazeno 1 - 10
of 2 666
pro vyhledávání: '"Image Captioning"'
Publikováno v:
Journal of Enabling Technologies, 2024, Vol. 18, Issue 4, pp. 248-264.
Externí odkaz:
http://www.emeraldinsight.com/doi/10.1108/JET-03-2024-0024
Autor:
P. Steffy Sherly, P. Velvizhy
Publikováno v:
Heritage Science, Vol 12, Iss 1, Pp 1-21 (2024)
Abstract This work aims to provide an innovative solution to enhance the accessibility of images by an innovative image to text to speech system. It is applied to Hindu and Christian divine images. The method is applicable, among others, to enhance c
Externí odkaz:
https://doaj.org/article/71b42a97f6d24cbe9dbf178ba71ed7d7
Publikováno v:
Scientific Reports, Vol 14, Iss 1, Pp 1-15 (2024)
Abstract Captioning an image involves using a combination of vision and language models to describe the image in an expressive and concise sentence. Successful captioning task requires extracting as much information as possible from the corresponding
Externí odkaz:
https://doaj.org/article/ac6465a1d1404988a66830515c6f9ae8
Publikováno v:
IET Image Processing, Vol 18, Iss 11, Pp 3055-3069 (2024)
Abstract Automatic generation of image captions is essentially a cross‐modal conversion from image to text. Owing to the differences in linguistic characteristics between Chinese and English, quite a few Chinese image captioning methods have recent
Externí odkaz:
https://doaj.org/article/2a633e09323540198cc20722ae1e9e61
Publikováno v:
Computational and Structural Biotechnology Journal, Vol 24, Iss , Pp 434-450 (2024)
A medical data integration center integrates a large volume of medical images from clinical departments, including X-rays, CT scans, and MRI scans. Ideally, all images should be indexed appropriately with standard clinical terms. However, some images
Externí odkaz:
https://doaj.org/article/6400153cf19049d7ae9e5786313089b0
Publikováno v:
Visual Computing for Industry, Biomedicine, and Art, Vol 7, Iss 1, Pp 1-17 (2024)
Abstract Large language models (LLMs), such as ChatGPT, have demonstrated impressive capabilities in various tasks and attracted increasing interest as a natural language interface across many domains. Recently, large vision-language models (VLMs) th
Externí odkaz:
https://doaj.org/article/4aa0d6e659b242fa83c5441409b25181
Publikováno v:
Frontiers in Physics, Vol 12 (2024)
Incorporating medical text annotations compensates for the quality deficiencies of image data, effectively overcoming the limitations of medical image segmentation. Many existing approaches achieve high-quality segmentation results by integrating tex
Externí odkaz:
https://doaj.org/article/40f985854597431396fb4b5eeb9dba77
Publikováno v:
Heliyon, Vol 10, Iss 17, Pp e36272- (2024)
Image captioning, the process of generating natural language descriptions based on image content, has garnered attention in AI research for its implications in scene understanding and human-computer interaction. While much prior research has focused
Externí odkaz:
https://doaj.org/article/00396d46f6044518bb1b43e908280e93
Autor:
Alaa Thobhani, Beiji Zou, Xiaoyan Kui, Asma A. Al-Shargabi, Zaid Derea, Amr Abdussalam, Mohammed A. Asham
Publikováno v:
Journal of King Saud University: Computer and Information Sciences, Vol 36, Iss 7, Pp 102127- (2024)
Image captioning, the task of generating descriptive sentences for images, has seen significant advancements by incorporating semantic information. However, previous studies employed semantic attribute detectors to extract predetermined attributes co
Externí odkaz:
https://doaj.org/article/38a232b2025749ed99f010a6eb2be787
Publikováno v:
International Journal of Digital Earth, Vol 17, Iss 1 (2024)
Remote sensing image acquisition is an essential way to obtain information. However, research on remote sensing images mainly focuses on object detection or image classification. The emergence of remote sensing image captioning (RSIC) has enabled und
Externí odkaz:
https://doaj.org/article/a8e25ddac9f84cfb8cefaeed3a542395