Výsledky vyhledávání

Akademický článek

Ultra-compact thin-film-lithium-niobate photonic chip for dispersion compensation

Autor: Liu Shujun, Ma Ruitao, Wang Weihan, Yu Zejie, Dai Daoxin

Publikováno v: Nanophotonics, Vol 13, Iss 26, Pp 4723-4731 (2024)

Thin-film-lithium-niobate (TFLN) photonics has attracted intensive attention and become very popular in recent years. Here, an ultra-compact TFLN on-chip dispersion compensator is proposed and realized to provide a promising solution for dispersion c

Externí odkaz: https://doaj.org/article/e76c231589fa40ac9d3d52ce2cdd75f2

Zobrazit plný text záznamu

Akademický článek

Research progress in fiber reinforced polymer damping composites

Autor: WANG Tingting, GAO Junpeng, ZHANG Baoyan, MA Zhaodan, WANG Weihan, XU Hu

Publikováno v: Cailiao gongcheng, Vol 52, Iss 11, Pp 1-11 (2024)

Fiber reinforced polymer composites are widely used in aerospace， automotive， shipbuilding， rail transportation， and other fields. With the development of lightweight high-speed aerospace and precision instrument automation， vibration probl

Externí odkaz: https://doaj.org/article/ea391861d24842d7bb656ee3f8185047

Zobrazit plný text záznamu

Akademický článek

Applications and Challenges of AI Technology in Financial Portfolio Management

Autor: Wang Weihan

Publikováno v: SHS Web of Conferences, Vol 200, p 01002 (2024)

With the rapid development of artificial intelligence technology, financial portfolio management has entered a new era characterized by data-driven and intelligent decision-making. Based on the customer- centered service philosophy, managers propose

Externí odkaz: https://doaj.org/article/8c645fddb1fe4f64b39f561b83860f25

Zobrazit plný text záznamu

Report

VisScience: An Extensive Benchmark for Evaluating K12 Educational Multi-modal Scientific Reasoning

Autor: Jiang, Zhihuan, Yang, Zhen, Chen, Jinhao, Du, Zhengxiao, Wang, Weihan, Xu, Bin, Tang, Jie

Multi-modal large language models (MLLMs) have demonstrated promising capabilities across various tasks by integrating textual and visual information to achieve visual understanding in complex scenarios. Despite the availability of several benchmarks

Externí odkaz: http://arxiv.org/abs/2409.13730

Zobrazit plný text záznamu

Report

MathGLM-Vision: Solving Mathematical Problems with Multi-Modal Large Language Model

Autor: Yang, Zhen, Chen, Jinhao, Du, Zhengxiao, Yu, Wenmeng, Wang, Weihan, Hong, Wenyi, Jiang, Zhihuan, Xu, Bin, Tang, Jie

Large language models (LLMs) have demonstrated significant capabilities in mathematical reasoning, particularly with text-based mathematical problems. However, current multi-modal large language models (MLLMs), especially those specialized in mathema

Externí odkaz: http://arxiv.org/abs/2409.13729

Zobrazit plný text záznamu

Report

CogVLM2: Visual Language Models for Image and Video Understanding

Beginning with VisualGLM and CogVLM, we are continuously exploring VLMs in pursuit of enhanced vision-language fusion, efficient higher-resolution architecture, and broader modalities and applications. Here we propose the CogVLM2 family, a new genera

Externí odkaz: http://arxiv.org/abs/2408.16500

Zobrazit plný text záznamu

Report

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Autor: Yang, Zhuoyi, Teng, Jiayan, Zheng, Wendi, Ding, Ming, Huang, Shiyu, Xu, Jiazheng, Yang, Yuanming, Hong, Wenyi, Zhang, Xiaohan, Feng, Guanyu, Yin, Da, Gu, Xiaotao, Zhang, Yuxuan, Wang, Weihan, Cheng, Yean, Liu, Ting, Xu, Bin, Dong, Yuxiao, Tang, Jie

We present CogVideoX, a large-scale text-to-video generation model based on diffusion transformer, which can generate 10-second continuous videos aligned with text prompt, with a frame rate of 16 fps and resolution of 768 * 1360 pixels. Previous vide

Externí odkaz: http://arxiv.org/abs/2408.06072

Zobrazit plný text záznamu

Report

VIPeR: Visual Incremental Place Recognition with Adaptive Mining and Lifelong Learning

Autor: Ming, Yuhang, Xu, Minyang, Yang, Xingrui, Ye, Weicai, Wang, Weihan, Peng, Yong, Dai, Weichen, Kong, Wanzeng

Visual place recognition (VPR) is an essential component of many autonomous and augmented/virtual reality systems. It enables the systems to robustly localize themselves in large-scale environments. Existing VPR methods demonstrate attractive perform

Externí odkaz: http://arxiv.org/abs/2407.21416

Zobrazit plný text záznamu

Report

LVBench: An Extreme Long Video Understanding Benchmark

Autor: Wang, Weihan, He, Zehai, Hong, Wenyi, Cheng, Yean, Zhang, Xiaohan, Qi, Ji, Gu, Xiaotao, Huang, Shiyu, Xu, Bin, Dong, Yuxiao, Ding, Ming, Tang, Jie

Recent progress in multimodal large language models has markedly enhanced the understanding of short videos (typically under one minute), and several evaluation datasets have emerged accordingly. However, these advancements fall short of meeting the

Externí odkaz: http://arxiv.org/abs/2406.08035

Zobrazit plný text záznamu

Report

Benchmarking Neural Radiance Fields for Autonomous Robots: An Overview

Autor: Ming, Yuhang, Yang, Xingrui, Wang, Weihan, Chen, Zheng, Feng, Jinglun, Xing, Yifan, Zhang, Guofeng

Publikováno v: Engineering Applications of Artificial Intelligence, Volume 140, 15 January 2025, 109685

Neural Radiance Fields (NeRF) have emerged as a powerful paradigm for 3D scene representation, offering high-fidelity renderings and reconstructions from a set of sparse and unstructured sensor data. In the context of autonomous robotics, where perce

Externí odkaz: http://arxiv.org/abs/2405.05526

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání