Výsledky vyhledávání

Report

LlamaFusion: Adapting Pretrained Language Models for Multimodal Generation

Autor: Shi, Weijia, Han, Xiaochuang, Zhou, Chunting, Liang, Weixin, Lin, Xi Victoria, Zettlemoyer, Luke, Yu, Lili

We present LlamaFusion, a framework for empowering pretrained text-only large language models (LLMs) with multimodal generative capabilities, enabling them to understand and generate both text and images in arbitrary sequences. LlamaFusion leverages

Externí odkaz: http://arxiv.org/abs/2412.15188

Zobrazit plný text záznamu

Report

Byte Latent Transformer: Patches Scale Better Than Tokens

Autor: Pagnoni, Artidoro, Pasunuru, Ram, Rodriguez, Pedro, Nguyen, John, Muller, Benjamin, Li, Margaret, Zhou, Chunting, Yu, Lili, Weston, Jason, Zettlemoyer, Luke, Ghosh, Gargi, Lewis, Mike, Holtzman, Ari, Iyer, Srinivasan

We introduce the Byte Latent Transformer (BLT), a new byte-level LLM architecture that, for the first time, matches tokenization-based LLM performance at scale with significant improvements in inference efficiency and robustness. BLT encodes bytes in

Externí odkaz: http://arxiv.org/abs/2412.09871

Zobrazit plný text záznamu

Report

Mixture-of-Transformers: A Sparse and Scalable Architecture for Multi-Modal Foundation Models

Autor: Liang, Weixin, Yu, Lili, Luo, Liang, Iyer, Srinivasan, Dong, Ning, Zhou, Chunting, Ghosh, Gargi, Lewis, Mike, Yih, Wen-tau, Zettlemoyer, Luke, Lin, Xi Victoria

The development of large language models (LLMs) has expanded to multi-modal systems capable of processing text, images, and speech within a unified framework. Training these models demands significantly larger datasets and computational resources com

Externí odkaz: http://arxiv.org/abs/2411.04996

Zobrazit plný text záznamu

Report

Bayesian estimation of transmission networks for infectious diseases

Autor: Xu, Jianing, Hu, Huimin, Ellison, Gregory, Yu, Lili, Whalen, Christopher, Liu, Liang

Reconstructing transmission networks is essential for identifying key factors like superspreaders and high-risk locations, which are critical for developing effective pandemic prevention strategies. In this study, we developed a Bayesian framework th

Externí odkaz: http://arxiv.org/abs/2409.05245

Zobrazit plný text záznamu

Report

Transfusion: Predict the Next Token and Diffuse Images with One Multi-Modal Model

Autor: Zhou, Chunting, Yu, Lili, Babu, Arun, Tirumala, Kushal, Yasunaga, Michihiro, Shamis, Leonid, Kahn, Jacob, Ma, Xuezhe, Zettlemoyer, Luke, Levy, Omer

We introduce Transfusion, a recipe for training a multi-modal model over discrete and continuous data. Transfusion combines the language modeling loss function (next token prediction) with diffusion to train a single transformer over mixed-modality s

Externí odkaz: http://arxiv.org/abs/2408.11039

Zobrazit plný text záznamu

Report

Megalodon: Efficient LLM Pretraining and Inference with Unlimited Context Length

Autor: Ma, Xuezhe, Yang, Xiaomeng, Xiong, Wenhan, Chen, Beidi, Yu, Lili, Zhang, Hao, May, Jonathan, Zettlemoyer, Luke, Levy, Omer, Zhou, Chunting

The quadratic complexity and weak length extrapolation of Transformers limits their ability to scale to long sequences, and while sub-quadratic solutions like linear attention and state space models exist, they empirically underperform Transformers i

Externí odkaz: http://arxiv.org/abs/2404.08801

Zobrazit plný text záznamu

Akademický článek

Benefits of Mobile Apps for Cancer Pain Management: Systematic Review

Autor: Zheng, Caiyun, Chen, Xu, Weng, Lizhu, Guo, Ling, Xu, Haiting, Lin, Meimei, Xue, Yan, Lin, Xiuqin, Yang, Aiqin, Yu, Lili, Xue, Zenggui, Yang, Jing

Publikováno v: JMIR mHealth and uHealth, Vol 8, Iss 1, p e17055 (2020)

BackgroundPain ratings reported by patients with cancer continue to increase, and numerous computer and phone apps for managing cancer-related pain have been developed recently; however, whether these apps effectively alleviate patients’ pain remai

Externí odkaz: https://doaj.org/article/16b51f78c2bd448296b9d37e6a5a61ce

Zobrazit plný text záznamu

Report

Jointly Training Large Autoregressive Multimodal Models

Autor: Aiello, Emanuele, Yu, Lili, Nie, Yixin, Aghajanyan, Armen, Oguz, Barlas

In recent years, advances in the large-scale pretraining of language and text-to-image models have revolutionized the field of machine learning. Yet, integrating these two modalities into a single, robust model capable of generating seamless multimod

Externí odkaz: http://arxiv.org/abs/2309.15564

Zobrazit plný text záznamu

Report

Scaling Autoregressive Multi-Modal Models: Pretraining and Instruction Tuning

We present CM3Leon (pronounced "Chameleon"), a retrieval-augmented, token-based, decoder-only multi-modal language model capable of generating and infilling both text and images. CM3Leon uses the CM3 multi-modal architecture but additionally shows th

Externí odkaz: http://arxiv.org/abs/2309.02591

Zobrazit plný text záznamu

Report

LIMA: Less Is More for Alignment

Autor: Zhou, Chunting, Liu, Pengfei, Xu, Puxin, Iyer, Srini, Sun, Jiao, Mao, Yuning, Ma, Xuezhe, Efrat, Avia, Yu, Ping, Yu, Lili, Zhang, Susan, Ghosh, Gargi, Lewis, Mike, Zettlemoyer, Luke, Levy, Omer

Large language models are trained in two stages: (1) unsupervised pretraining from raw text, to learn general-purpose representations, and (2) large scale instruction tuning and reinforcement learning, to better align to end tasks and user preference

Externí odkaz: http://arxiv.org/abs/2305.11206

Zobrazit plný text záznamu

Vyhledávací nástroje:

Upřesnit hledání