Výsledky vyhledávání

Report

M^3:Manipulation Mask Manufacturer for Arbitrary-Scale Super-Resolution Mask

Autor: Yang, Xinyu, Ma, Xiaochen, Zhu, Xuekang, Du, Bo, Su, Lei, Tong, Bingkui, Lei, Zeyu, Zhou, Jizhe

In the field of image manipulation localization (IML), the small quantity and poor quality of existing datasets have always been major issues. A dataset containing various types of manipulations will greatly help improve the accuracy of IML models. I

Externí odkaz: http://arxiv.org/abs/2407.03695

Zobrazit plný text záznamu

Report

VcLLM: Video Codecs are Secretly Tensor Codecs

Autor: Xu, Ceyu, Wu, Yongji, Yang, Xinyu, Chen, Beidi, Lentz, Matthew, Zhuo, Danyang, Wills, Lisa Wu

As the parameter size of large language models (LLMs) continues to expand, the need for a large memory footprint and high communication bandwidth have become significant bottlenecks for the training and inference of LLMs. To mitigate these bottleneck

Externí odkaz: http://arxiv.org/abs/2407.00467

Zobrazit plný text záznamu

Report

IMDL-BenCo: A Comprehensive Benchmark and Codebase for Image Manipulation Detection & Localization

Autor: Ma, Xiaochen, Zhu, Xuekang, Su, Lei, Du, Bo, Jiang, Zhuohang, Tong, Bingkui, Lei, Zeyu, Yang, Xinyu, Pun, Chi-Man, Lv, Jiancheng, Zhou, Jizhe

A comprehensive benchmark is yet to be established in the Image Manipulation Detection \& Localization (IMDL) field. The absence of such a benchmark leads to insufficient and misleading model evaluations, severely undermining the development of this

Externí odkaz: http://arxiv.org/abs/2406.10580

Zobrazit plný text záznamu

Report

It Takes Two: On the Seamlessness between Reward and Policy Model in RLHF

Autor: Lu, Taiming, Shen, Lingfeng, Yang, Xinyu, Tan, Weiting, Chen, Beidi, Yao, Huaxiu

Reinforcement Learning from Human Feedback (RLHF) involves training policy models (PMs) and reward models (RMs) to align language models with human preferences. Instead of focusing solely on PMs and RMs independently, we propose to examine their inte

Externí odkaz: http://arxiv.org/abs/2406.07971

Zobrazit plný text záznamu

Report

Zeroth-Order Fine-Tuning of LLMs with Extreme Sparsity

Autor: Guo, Wentao, Long, Jikai, Zeng, Yimeng, Liu, Zirui, Yang, Xinyu, Ran, Yide, Gardner, Jacob R., Bastani, Osbert, De Sa, Christopher, Yu, Xiaodong, Chen, Beidi, Xu, Zhaozhuo

Zeroth-order optimization (ZO) is a memory-efficient strategy for fine-tuning Large Language Models using only forward passes. However, the application of ZO fine-tuning in memory-constrained settings such as mobile phones and laptops is still challe

Externí odkaz: http://arxiv.org/abs/2406.02913

Zobrazit plný text záznamu

Report

FlashRAG: A Modular Toolkit for Efficient Retrieval-Augmented Generation Research

Autor: Jin, Jiajie, Zhu, Yutao, Yang, Xinyu, Zhang, Chenghao, Dou, Zhicheng

With the advent of Large Language Models (LLMs), the potential of Retrieval Augmented Generation (RAG) techniques have garnered considerable research attention. Numerous novel algorithms and models have been introduced to enhance various aspects of R

Externí odkaz: http://arxiv.org/abs/2405.13576

Zobrazit plný text záznamu

Report

Quantifying Emergence in Large Language Models

Autor: Chen, Hang, Yang, Xinyu, Zhu, Jiaying, Wang, Wenya

Emergence, broadly conceptualized as the ``intelligent'' behaviors of LLMs, has recently been studied and proved challenging to quantify due to the lack of a measurable definition. Most commonly, it has been estimated statistically through model perf

Externí odkaz: http://arxiv.org/abs/2405.12617

Zobrazit plný text záznamu

Report

Bridging the Gap: Protocol Towards Fair and Consistent Affect Analysis

Autor: Hu, Guanyu, Papadopoulou, Eleni, Kollias, Dimitrios, Tzouveli, Paraskevi, Wei, Jie, Yang, Xinyu

The increasing integration of machine learning algorithms in daily life underscores the critical need for fairness and equity in their deployment. As these technologies play a pivotal role in decision-making, addressing biases across diverse subpopul

Externí odkaz: http://arxiv.org/abs/2405.06841

Zobrazit plný text záznamu

Report

DeepSeek-V2: A Strong, Economical, and Efficient Mixture-of-Experts Language Model

Autor: DeepSeek-AI, Liu, Aixin, Feng, Bei, Wang, Bin, Wang, Bingxuan, Liu, Bo, Zhao, Chenggang, Dengr, Chengqi, Ruan, Chong, Dai, Damai, Guo, Daya, Yang, Dejian, Chen, Deli, Ji, Dongjie, Li, Erhang, Lin, Fangyun, Luo, Fuli, Hao, Guangbo, Chen, Guanting, Li, Guowei, Zhang, H., Xu, Hanwei, Yang, Hao, Zhang, Haowei, Ding, Honghui, Xin, Huajian, Gao, Huazuo, Li, Hui, Qu, Hui, Cai, J. L., Liang, Jian, Guo, Jianzhong, Ni, Jiaqi, Li, Jiashi, Chen, Jin, Yuan, Jingyang, Qiu, Junjie, Song, Junxiao, Dong, Kai, Gao, Kaige, Guan, Kang, Wang, Lean, Zhang, Lecong, Xu, Lei, Xia, Leyi, Zhao, Liang, Zhang, Liyue, Li, Meng, Wang, Miaojun, Zhang, Mingchuan, Zhang, Minghua, Tang, Minghui, Li, Mingming, Tian, Ning, Huang, Panpan, Wang, Peiyi, Zhang, Peng, Zhu, Qihao, Chen, Qinyu, Du, Qiushi, Chen, R. J., Jin, R. L., Ge, Ruiqi, Pan, Ruizhe, Xu, Runxin, Chen, Ruyi, Li, S. S., Lu, Shanghao, Zhou, Shangyan, Chen, Shanhuang, Wu, Shaoqing, Ye, Shengfeng, Ma, Shirong, Wang, Shiyu, Zhou, Shuang, Yu, Shuiping, Zhou, Shunfeng, Zheng, Size, Wang, T., Pei, Tian, Yuan, Tian, Sun, Tianyu, Xiao, W. L., Zeng, Wangding, An, Wei, Liu, Wen, Liang, Wenfeng, Gao, Wenjun, Zhang, Wentao, Li, X. Q., Jin, Xiangyue, Wang, Xianzu, Bi, Xiao, Liu, Xiaodong, Wang, Xiaohan, Shen, Xiaojin, Chen, Xiaokang, Chen, Xiaosha, Nie, Xiaotao, Sun, Xiaowen, Wang, Xiaoxiang, Liu, Xin, Xie, Xin, Yu, Xingkai, Song, Xinnan, Zhou, Xinyi, Yang, Xinyu, Lu, Xuan, Su, Xuecheng, Wu, Y., Li, Y. K., Wei, Y. X., Zhu, Y. X., Xu, Yanhong, Huang, Yanping, Li, Yao, Zhao, Yao, Sun, Yaofeng, Li, Yaohui, Wang, Yaohui, Zheng, Yi, Zhang, Yichao, Xiong, Yiliang, Zhao, Yilong, He, Ying, Tang, Ying, Piao, Yishi, Dong, Yixin, Tan, Yixuan, Liu, Yiyuan, Wang, Yongji, Guo, Yongqiang, Zhu, Yuchen, Wang, Yuduan, Zou, Yuheng, Zha, Yukun, Ma, Yunxian, Yan, Yuting, You, Yuxiang, Liu, Yuxuan, Ren, Z. Z., Ren, Zehui, Sha, Zhangli, Fu, Zhe, Huang, Zhen, Zhang, Zhen, Xie, Zhenda, Hao, Zhewen, Shao, Zhihong, Wen, Zhiniu, Xu, Zhipeng, Zhang, Zhongyu, Li, Zhuoshu, Wang, Zihan, Gu, Zihui, Li, Zilin, Xie, Ziwei

We present DeepSeek-V2, a strong Mixture-of-Experts (MoE) language model characterized by economical training and efficient inference. It comprises 236B total parameters, of which 21B are activated for each token, and supports a context length of 128

Externí odkaz: http://arxiv.org/abs/2405.04434

Zobrazit plný text záznamu

Report

Separate in the Speech Chain: Cross-Modal Conditional Audio-Visual Target Speech Extraction

Autor: Mu, Zhaoxi, Yang, Xinyu

The integration of visual cues has revitalized the performance of the target speech extraction task, elevating it to the forefront of the field. Nevertheless, this multi-modal learning paradigm often encounters the challenge of modality imbalance. In

Externí odkaz: http://arxiv.org/abs/2404.12725

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání