Výsledky vyhledávání

Report

From Principles to Practice: A Deep Dive into AI Ethics and Regulations

Autor: Sun, Nan, Miao, Yuantian, Jiang, Hao, Ding, Ming, Zhang, Jun

In the rapidly evolving domain of Artificial Intelligence (AI), the complex interaction between innovation and regulation has become an emerging focus of our society. Despite tremendous advancements in AI's capabilities to excel in specific tasks and

Externí odkaz: http://arxiv.org/abs/2412.04683

Zobrazit plný text záznamu

Report

The temporal and spatial variations of lithium abundance in the Galactic disc

Autor: Sun, Tiancheng, Bi, Shaolan, Chen, Xunzhou, Yuxi, Lu, Chen, Yuqin, Ding, Ming-Yi, Shi, Jianrong, Yan, Hongliang, Ge, Zhishuai

This study investigates the temporal and spatial variations in lithium abundance within the Milky Way using a sample of 22,034 main-sequence turn-off (MSTO) stars and subgiants, characterised by precise stellar ages, 3D NLTE (non-local thermodynamic

Externí odkaz: http://arxiv.org/abs/2411.13011

Zobrazit plný text záznamu

Report

Face De-identification: State-of-the-art Methods and Comparative Studies

Autor: Cao, Jingyi, Chen, Xiangyi, Liu, Bo, Ding, Ming, Xie, Rong, Song, Li, Li, Zhu, Zhang, Wenjun

The widespread use of image acquisition technologies, along with advances in facial recognition, has raised serious privacy concerns. Face de-identification usually refers to the process of concealing or replacing personal identifiers, which is regar

Externí odkaz: http://arxiv.org/abs/2411.09863

Zobrazit plný text záznamu

Report

DreamPolish: Domain Score Distillation With Progressive Geometry Generation

Autor: Cheng, Yean, Cai, Ziqi, Ding, Ming, Zheng, Wendi, Huang, Shiyu, Dong, Yuxiao, Tang, Jie, Shi, Boxin

We introduce DreamPolish, a text-to-3D generation model that excels in producing refined geometry and high-quality textures. In the geometry construction phase, our approach leverages multiple neural representations to enhance the stability of the sy

Externí odkaz: http://arxiv.org/abs/2411.01602

Zobrazit plný text záznamu

Report

From 5G to 6G: A Survey on Security, Privacy, and Standardization Pathways

Autor: Yang, Mengmeng, Qu, Youyang, Ranbaduge, Thilina, Thapa, Chandra, Sultan, Nazatul, Ding, Ming, Suzuki, Hajime, Ni, Wei, Abuadbba, Sharif, Smith, David, Tyler, Paul, Pieprzyk, Josef, Rakotoarivelo, Thierry, Guan, Xinlong, M'rabet, Sirine

The vision for 6G aims to enhance network capabilities with faster data rates, near-zero latency, and higher capacity, supporting more connected devices and seamless experiences within an intelligent digital ecosystem where artificial intelligence (A

Externí odkaz: http://arxiv.org/abs/2410.21986

Zobrazit plný text záznamu

Report

MulCPred: Learning Multi-modal Concepts for Explainable Pedestrian Action Prediction

Autor: Feng, Yan, Carballo, Alexander, Fujii, Keisuke, Karlsson, Robin, Ding, Ming, Takeda, Kazuya

Pedestrian action prediction is of great significance for many applications such as autonomous driving. However, state-of-the-art methods lack explainability to make trustworthy predictions. In this paper, a novel framework called MulCPred is propose

Externí odkaz: http://arxiv.org/abs/2409.09446

Zobrazit plný text záznamu

Report

CogVLM2: Visual Language Models for Image and Video Understanding

Beginning with VisualGLM and CogVLM, we are continuously exploring VLMs in pursuit of enhanced vision-language fusion, efficient higher-resolution architecture, and broader modalities and applications. Here we propose the CogVLM2 family, a new genera

Externí odkaz: http://arxiv.org/abs/2408.16500

Zobrazit plný text záznamu

Report

Determining internal topological structures and running cost of mean field games with partial boundary measurement

Autor: Ding, Ming-Hui, Liu, Hongyu, Zheng, Guang-Hui

This paper investigates the simultaneous reconstruction of the running cost function and the internal topological structure within the mean-field games (MFG) system utilizing partial boundary data. The inverse problem is notably challenging due to fa

Externí odkaz: http://arxiv.org/abs/2408.08911

Zobrazit plný text záznamu

Report

VisualAgentBench: Towards Large Multimodal Models as Visual Foundation Agents

Large Multimodal Models (LMMs) have ushered in a new era in artificial intelligence, merging capabilities in both language and vision to form highly capable Visual Foundation Agents. These agents are postulated to excel across a myriad of tasks, pote

Externí odkaz: http://arxiv.org/abs/2408.06327

Zobrazit plný text záznamu

Report

CogVideoX: Text-to-Video Diffusion Models with An Expert Transformer

Autor: Yang, Zhuoyi, Teng, Jiayan, Zheng, Wendi, Ding, Ming, Huang, Shiyu, Xu, Jiazheng, Yang, Yuanming, Hong, Wenyi, Zhang, Xiaohan, Feng, Guanyu, Yin, Da, Gu, Xiaotao, Zhang, Yuxuan, Wang, Weihan, Cheng, Yean, Liu, Ting, Xu, Bin, Dong, Yuxiao, Tang, Jie

We present CogVideoX, a large-scale text-to-video generation model based on diffusion transformer, which can generate 10-second continuous videos aligned with text prompt, with a frame rate of 16 fps and resolution of 768 * 1360 pixels. Previous vide

Externí odkaz: http://arxiv.org/abs/2408.06072

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání