Search results - "Khan, Zaid A."

Report

DataEnvGym: Data Generation Agents in Teacher Environments with Student Feedback

Authors: Khan, Zaid; Stengel-Eskin, Elias; Cho, Jaemin; Bansal, Mohit

The process of creating training data to teach models is currently driven by humans, who manually analyze model weaknesses and plan how to create data that improves a student model. Recent approaches using LLMs as annotators reduce human effort, but

External link: http://arxiv.org/abs/2410.06215

Report

Consistency and Uncertainty: Identifying Unreliable Responses From Black-Box Vision-Language Models for Selective Visual Question Answering

Authors: Khan, Zaid; Fu, Yun

The goal of selective prediction is to allow a model to abstain when it may not be able to deliver a reliable prediction, which is important in safety-critical contexts. Existing approaches to selective prediction typically require access to the i

External link: http://arxiv.org/abs/2404.10193

Report

Self-Training Large Language Models for Improved Visual Program Synthesis With Visual Reinforcement

Authors: Khan, Zaid; BG, Vijay Kumar; Schulter, Samuel; Fu, Yun; Chandraker, Manmohan

Visual program synthesis is a promising approach to exploit the reasoning abilities of large language models for compositional computer vision tasks. Previous work has used few-shot prompting with frozen LLMs to synthesize visual programs. Training a

External link: http://arxiv.org/abs/2404.04627

Report

Exploring Question Decomposition for Zero-Shot VQA

Authors: Khan, Zaid; BG, Vijay Kumar; Schulter, Samuel; Chandraker, Manmohan; Fu, Yun

Visual question answering (VQA) has traditionally been treated as a single-step task where each question receives the same amount of effort, unlike natural human question-answering strategies. We explore a question decomposition strategy for VQA to o

External link: http://arxiv.org/abs/2310.17050

Report

Q: How to Specialize Large Vision-Language Models to Data-Scarce VQA Tasks? A: Self-Train on Unlabeled Images!

Authors: Khan, Zaid; BG, Vijay Kumar; Schulter, Samuel; Yu, Xiang; Fu, Yun; Chandraker, Manmohan

Finetuning a large vision language model (VLM) on a target dataset after large scale pretraining is a dominant paradigm in visual question answering (VQA). Datasets for specialized tasks such as knowledge-based VQA or VQA in non natural-image domains

External link: http://arxiv.org/abs/2306.03932

Report

Contrastive Alignment of Vision to Language Through Parameter-Efficient Transfer Learning

Authors: Khan, Zaid; Fu, Yun

Contrastive vision-language models (e.g. CLIP) are typically created by updating all the parameters of a vision model and language model through contrastive training. Can such models be created by a small number of parameter updates to an already-tra

External link: http://arxiv.org/abs/2303.11866

Dissertation/Thesis

Data augmentation for attack detection on IoT Telehealth Systems

Author: Khan, Zaid A.

Telehealth is an online health care system that is extensively used in the current pandemic situation. Our proposed technique is considered a fog computing-based attack detection architecture to protect IoT Telehealth Networks. As for IoT Telehealth

External link: http://hdl.handle.net/1828/13798

Report

Single-Stream Multi-Level Alignment for Vision-Language Pretraining

Authors: Khan, Zaid; BG, Vijay Kumar; Yu, Xiang; Schulter, Samuel; Chandraker, Manmohan; Fu, Yun

Self-supervised vision-language pretraining from pure images and text with a contrastive loss is effective, but ignores fine-grained alignment due to a dual-stream architecture that aligns image and text representations only on a global level. Earlie

External link: http://arxiv.org/abs/2203.14395

Report

Application of Modular Vehicle Technology to Mitigate Bus Bunching

Authors: Khan, Zaid Saeed; He, Weili; Menendez, Monica

The stochastic nature of public transport systems leads to headway variability and bus bunching, causing both operator and passenger cost to increase significantly. Traditional strategies to counter bus bunching, including bus-holding, stop-skipping,

External link: http://arxiv.org/abs/2202.06039
