Search results

Report

Agri-LLaVA: Knowledge-Infused Large Multimodal Assistant on Agricultural Pests and Diseases

Authors: Wang, Liqiong; Jin, Teng; Yang, Jinyu; Leonardis, Ales; Wang, Fangyi; Zheng, Feng

In the general domain, large multimodal models (LMMs) have achieved significant advancements, yet challenges persist in applying them to specific fields, especially agriculture. As the backbone of the global economy, agriculture confronts numerous ch

External link: http://arxiv.org/abs/2412.02158

Report

PlantCamo: Plant Camouflage Detection

Authors: Yang, Jinyu; Wang, Qingwei; Zheng, Feng; Chen, Peng; Leonardis, Aleš; Fan, Deng-Ping

Camouflaged Object Detection (COD) aims to detect objects with camouflaged properties. Although previous studies have focused on natural (animals and insects) and unnatural (artistic and synthetic) camouflage detection, plant camouflage has been negl

External link: http://arxiv.org/abs/2410.17598

Report

Towards Unconstrained Collision Injury Protection Data Sets: Initial Surrogate Experiments for the Human Hand

Authors: Kirschner, Robin Jeanne; Yang, Jinyu; Elshani, Edonis; Micheler, Carina M.; Leibbrand, Tobias; Müller, Dirk; Glowalla, Claudio; Rajaei, Nader; Burgkart, Rainer; Haddadin, Sami

Safety for physical human-robot interaction (pHRI) is a major concern for all application domains. While current standardization for industrial robot applications provide safety constraints that address the onset of pain in blunt impacts, these impac

External link: http://arxiv.org/abs/2408.06175

Report

X-Former: Unifying Contrastive and Reconstruction Learning for MLLMs

Authors: Swetha, Sirnam; Yang, Jinyu; Neiman, Tal; Rizve, Mamshad Nayeem; Tran, Son; Yao, Benjamin; Chilimbi, Trishul; Shah, Mubarak

Recent advancements in Multimodal Large Language Models (MLLMs) have revolutionized the field of vision-language understanding by integrating visual perception capabilities into Large Language Models (LLMs). The prevailing trend in this field involve

External link: http://arxiv.org/abs/2407.13851

Report

PVUW 2024 Challenge on Complex Video Understanding: Methods and Results

Pixel-level Video Understanding in the Wild Challenge (PVUW) focus on complex video understanding. In this CVPR 2024 workshop, we add two new tracks, Complex Video Object Segmentation Track based on MOSE dataset and Motion Expression guided Video Seg

External link: http://arxiv.org/abs/2406.17005

Report

1st Place Solution for MeViS Track in CVPR 2024 PVUW Workshop: Motion Expression guided Video Segmentation

Authors: Gao, Mingqi; Luo, Jingnan; Yang, Jinyu; Han, Jungong; Zheng, Feng

Motion Expression guided Video Segmentation (MeViS), as an emerging task, poses many new challenges to the field of referring video object segmentation (RVOS). In this technical report, we investigated and validated the effectiveness of static-domina

External link: http://arxiv.org/abs/2406.07043

Report

Place Anything into Any Video

Authors: Liu, Ziling; Yang, Jinyu; Gao, Mingqi; Zheng, Feng

Controllable video editing has demonstrated remarkable potential across diverse applications, particularly in scenarios where capturing or re-capturing real-world videos is either impractical or costly. This paper introduces a novel and efficient sys

External link: http://arxiv.org/abs/2402.14316

Report

On the impact of robot personalization on human-robot interaction: A review

Authors: Yang, Jinyu; Vindolet, Camille; Olvera, Julio Rogelio Guadarrama; Cheng, Gordon

This study reviews the impact of personalization on human-robot interaction. Firstly, the various strategies used to achieve personalization are briefly described. Secondly, the effects of personalization known to date are discussed. They are present

External link: http://arxiv.org/abs/2401.11776

Report

Track Anything: Segment Anything Meets Videos

Authors: Yang, Jinyu; Gao, Mingqi; Li, Zhe; Gao, Shang; Wang, Fangjing; Zheng, Feng

Recently, the Segment Anything Model (SAM) gains lots of attention rapidly due to its impressive segmentation performance on images. Regarding its strong ability on image segmentation and high interactivity with different prompts, we found that it pe

External link: http://arxiv.org/abs/2304.11968

Academic article

Radiation preparation of nano-oxide@microcrystalline cellulose and its adsorption and removal of trichloroacetic acid

Authors: FU Lili, WANG Zhijun, LIU Kun, TANG Dongxu, YANG Jinyu, CHEN Huangqin, LI Yuesheng

Published in: Fushe yanjiu yu fushe gongyi xuebao, Vol 42, Iss 4, Pp 43-51 (2024)

Trichloroacetic acid is a common nonvolatile byproduct of drinking water disinfection and poses carcinogenic risks to the human body. In this study, four types of nano-oxide@microcrystalline-cellulose-based adsorbents (P25@microcrystalline cellulose,

External link: https://doaj.org/article/d401d46742f24bea96c66a058bb001fb
