Search results - "Laila Bashmal"

Academic article

RS-LLaVA: A Large Vision-Language Model for Joint Captioning and Question Answering in Remote Sensing Imagery

Authors: Yakoub Bazi, Laila Bashmal, Mohamad Mahmoud Al Rahhal, Riccardo Ricci, Farid Melgani

Published in: Remote Sensing, Vol 16, Iss 9, p 1477 (2024)

In this paper, we delve into the innovative application of large language models (LLMs) and their extension, large vision-language models (LVLMs), in the field of remote sensing (RS) image analysis. We particularly emphasize their multi-tasking poten

External link: https://doaj.org/article/bbac3265f861430b919250d8f8f2e523

Academic article

Visual Question Generation From Remote Sensing Images

Authors: Laila Bashmal, Yakoub Bazi, Farid Melgani, Riccardo Ricci, Mohamad M. Al Rahhal, Mansour Zuair

Published in: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol 16, Pp 3279-3293 (2023)

Visual question generation (VQG) is a fundamental task in vision-language understanding that aims to generate relevant questions about the given input image. In this article, we propose a paragraph-based VQG approach for generating intelligent questi

External link: https://doaj.org/article/b49cde89939c4b998ccfc39786ac1561

Academic article

Multilanguage Transformer for Improved Text to Remote Sensing Image Retrieval

Authors: Mohamad M. Al Rahhal, Yakoub Bazi, Norah A. Alsharif, Laila Bashmal, Naif Alajlan, Farid Melgani

Published in: IEEE Journal of Selected Topics in Applied Earth Observations and Remote Sensing, Vol 15, Pp 9115-9126 (2022)

Cross-modal text-image retrieval in remote sensing (RS) provides a flexible retrieval experience for mining useful information from RS repositories. However, existing methods are designed to accept queries formulated in the English language only, whi

External link: https://doaj.org/article/6f63226a433d4cf98d007509067e18aa

Academic article

CapERA: Captioning Events in Aerial Videos

Authors: Laila Bashmal, Yakoub Bazi, Mohamad Mahmoud Al Rahhal, Mansour Zuair, Farid Melgani

Published in: Remote Sensing, Vol 15, Iss 8, p 2139 (2023)

In this paper, we introduce the CapERA dataset, which upgrades the Event Recognition in Aerial Videos (ERA) dataset to aerial video captioning. The newly proposed dataset aims to advance visual–language-understanding tasks for UAV videos by providi

External link: https://doaj.org/article/e9af47099bf741cebe2c5b9f99ee938b

Academic article

Vision–Language Model for Visual Question Answering in Medical Imagery

Authors: Yakoub Bazi, Mohamad Mahmoud Al Rahhal, Laila Bashmal, Mansour Zuair

Published in: Bioengineering, Vol 10, Iss 3, p 380 (2023)

In the clinical and healthcare domains, medical images play a critical role. A mature medical visual question answering system (VQA) can improve diagnosis by answering clinical questions presented with a medical image. Despite its enormous potential

External link: https://doaj.org/article/043e97016bc343d78b593a5b451e4eb3

Academic article

UAV Image Multi-Labeling with Data-Efficient Transformers

Authors: Laila Bashmal, Yakoub Bazi, Mohamad Mahmoud Al Rahhal, Haikel Alhichri, Naif Al Ajlan

Published in: Applied Sciences, Vol 11, Iss 9, p 3974 (2021)

In this paper, we present an approach for the multi-label classification of remote sensing images based on data-efficient transformers. During the training phase, we generated a second view for each image from the training set using data augmentation

External link: https://doaj.org/article/d8c45fb6d39f40d38e6ad4400a21d159

Academic article

Vision Transformers for Remote Sensing Image Classification

Authors: Yakoub Bazi, Laila Bashmal, Mohamad M. Al Rahhal, Reham Al Dayil, Naif Al Ajlan

Published in: Remote Sensing, Vol 13, Iss 3, p 516 (2021)

In this paper, we propose a remote-sensing scene-classification method based on vision transformers. These types of networks, which are now recognized as state-of-the-art models in natural language processing, do not rely on convolution layers as in

External link: https://doaj.org/article/aa8eacb888bb4e66a99f84c0a67ceba6

Academic article

Siamese-GAN: Learning Invariant Representations for Aerial Vehicle Image Categorization

Authors: Laila Bashmal, Yakoub Bazi, Haikel AlHichri, Mohamad M. AlRahhal, Nassim Ammour, Naif Alajlan

Published in: Remote Sensing, Vol 10, Iss 2, p 351 (2018)

In this paper, we present a new algorithm for cross-domain classification in aerial vehicle images based on generative adversarial networks (GANs). The proposed method, called Siamese-GAN, learns invariant feature representations for both labeled and

External link: https://doaj.org/article/1e9b8e2c41214b118f2dbbf593775637

Space Time Attention Transformer for Non-Event Detection in UAV Videos

Authors: Laila Bashmal, Yakoub Bazi, Naif Alajlan

Published in: IGARSS 2022 - 2022 IEEE International Geoscience and Remote Sensing Symposium.

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::f01bb16abccc3fc2bd78eb815f3f0f83
https://doi.org/10.1109/igarss46834.2022.9884661

Deep Vision Transformers for Remote Sensing Scene Classification

Authors: Yakoub Bazi, Laila Bashmal, Mohamad Mahmoud Al Rahhal

Published in: IGARSS

In this paper, we present a scene classification method based on vision transformers. These types of networks, which are now the standard models in natural language processing (NLP), do not rely on convolution blocks as in convolutional neural networks

External link: https://explore.openaire.eu/search/publication?articleId=doi_________::e33fb207cf4d6c80f16a21d6ca3c0a93
https://doi.org/10.1109/igarss47720.2021.9553684
