Výsledky vyhledávání - "Image Captioning"

Akademický článek

Instance-level medical image classification for text-based retrieval in a medical data integration center

Autor: Ka Yung Cheng, Markus Lange-Hegermann, Jan-Bernd Hövener, Björn Schreiweis

Publikováno v: Computational and Structural Biotechnology Journal, Vol 24, Iss , Pp 434-450 (2024)

A medical data integration center integrates a large volume of medical images from clinical departments, including X-rays, CT scans, and MRI scans. Ideally, all images should be indexed appropriately with standard clinical terms. However, some images

Externí odkaz: https://doaj.org/article/6400153cf19049d7ae9e5786313089b0

Zobrazit plný text záznamu

Akademický článek

Thangka image captioning model with Salient Attention and Local Interaction Aggregator

Autor: Wenjin Hu, Fujun Zhang, Yinqiu Zhao

Publikováno v: Heritage Science, Vol 12, Iss 1, Pp 1-21 (2024)

Abstract Thangka image captioning aims to automatically generate accurate and complete sentences that describe the main content of Thangka images. However, existing methods fall short in capturing the features of the core deity regions and the surrou

Externí odkaz: https://doaj.org/article/45485e3d909f4899b4b0638ffcab7d85

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

'Idol talks!' AI-driven image to text to speech: illustrated by an application to images of deities

Autor: P. Steffy Sherly, P. Velvizhy

Publikováno v: Heritage Science, Vol 12, Iss 1, Pp 1-21 (2024)

Abstract This work aims to provide an innovative solution to enhance the accessibility of images by an innovative image to text to speech system. It is applied to Hindu and Christian divine images. The method is applicable, among others, to enhance c

Externí odkaz: https://doaj.org/article/71b42a97f6d24cbe9dbf178ba71ed7d7

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Novel concept-based image captioning models using LSTM and multi-encoder transformer architecture

Autor: Asmaa A. E. Osman, Mohamed A. Wahby Shalaby, Mona M. Soliman, Khaled M. Elsayed

Publikováno v: Scientific Reports, Vol 14, Iss 1, Pp 1-15 (2024)

Abstract Captioning an image involves using a combination of vision and language models to describe the image in an expressive and concise sentence. Successful captioning task requires extracting as much information as possible from the corresponding

Externí odkaz: https://doaj.org/article/ac6465a1d1404988a66830515c6f9ae8

Zobrazit plný text záznamu

Plný text ve formátu HTML

Akademický článek

Chinese image captioning with fusion encoder and visual keyword search

Autor: Yang Zou, Shiyu Liao, Qifei Wang

Publikováno v: IET Image Processing, Vol 18, Iss 11, Pp 3055-3069 (2024)

Abstract Automatic generation of image captions is essentially a cross‐modal conversion from image to text. Owing to the differences in linguistic characteristics between Chinese and English, quite a few Chinese image captioning methods have recent

Externí odkaz: https://doaj.org/article/2a633e09323540198cc20722ae1e9e61

Zobrazit plný text záznamu

Akademický článek

IQAGPT: computed tomography image quality assessment with vision-language and ChatGPT models

Autor: Zhihao Chen, Bin Hu, Chuang Niu, Tao Chen, Yuxin Li, Hongming Shan, Ge Wang

Publikováno v: Visual Computing for Industry, Biomedicine, and Art, Vol 7, Iss 1, Pp 1-17 (2024)

Abstract Large language models (LLMs), such as ChatGPT, have demonstrated impressive capabilities in various tasks and attracted increasing interest as a natural language interface across many domains. Recently, large vision-language models (VLMs) th

Externí odkaz: https://doaj.org/article/4aa0d6e659b242fa83c5441409b25181

Zobrazit plný text záznamu

Akademický článek

Region-guided transformer for remote sensing image captioning

Autor: Kai Zhao, Wei Xiong

Publikováno v: International Journal of Digital Earth, Vol 17, Iss 1 (2024)

Remote sensing image acquisition is an essential way to obtain information. However, research on remote sensing images mainly focuses on object detection or image classification. The emergence of remote sensing image captioning (RSIC) has enabled und

Externí odkaz: https://doaj.org/article/a8e25ddac9f84cfb8cefaeed3a542395

Zobrazit plný text záznamu

Akademický článek

FRIC: a framework for few-shot remote sensing image captioning

Autor: Haonan Zhou, Lurui Xia, Xiaoping Du, Sen Li

Publikováno v: International Journal of Digital Earth, Vol 17, Iss 1 (2024)

ABSTRACTThe training of image captioning (IC) models requires a large number of caption-labeled samples, which is usually difficult to satisfy in the actual remote sensing scenarios. The performance of the models will be damaged due to the few-shot p

Externí odkaz: https://doaj.org/article/b22f0d2726014ff098a5807b6dcb0be1

Zobrazit plný text záznamu

Akademický článek

Cap2Seg: leveraging caption generation for enhanced segmentation of COVID-19 medical images

Autor: Wanlong Zhao, Fan Li, Yueqin Diao, Puyin Fan, Zhu Chen

Publikováno v: Frontiers in Physics, Vol 12 (2024)

Incorporating medical text annotations compensates for the quality deficiencies of image data, effectively overcoming the limitations of medical image segmentation. Many existing approaches achieve high-quality segmentation results by integrating tex

Externí odkaz: https://doaj.org/article/40f985854597431396fb4b5eeb9dba77

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání