Showing 1 - 6 of 6
for search: '"large language models pre-training"'
A Review of the Challenges with Massive Web-mined Corpora Used in Large Language Models Pre-Training
Author:
Perełkiewicz, Michał, Poświata, Rafał
This article presents a comprehensive review of the challenges associated with using massive web-mined corpora for the pre-training of large language models (LLMs). The review identifies key challenges in this domain, such as no…
External link:
http://arxiv.org/abs/2407.07630
The recent rapid progress in pre-training Large Language Models has relied on using self-supervised language modeling objectives like next token prediction or span corruption. On the other hand, Machine Translation Systems are mostly trained using cr…
External link:
http://arxiv.org/abs/2305.11778
Book
This result cannot be displayed to unauthenticated users; you must log in to view it.
Published in:
Guangtongxin yanjiu, pp. 24004901-24004910 (2024)
[Objective] Compared to Electronic Packet Switching (EPS), Optical Circuit Switching (OCS) demonstrates advantages in latency, power consumption, cost, and stability. This study aims to explore feasible applications of OCS in the networking of tra…
External link:
https://doaj.org/article/d414340228524bdbb24e122a09a287cb
This book constitutes the refereed proceedings of the 20th International Conference on Knowledge Management and Acquisition for Intelligent Systems, PKAW 2024, held in Kyoto, Japan, during November 18–19, 2024. The 15 full papers and 9 short papers…
Author:
Sinan Ozdemir
The Practical, Step-by-Step Guide to Using LLMs at Scale in Projects and Products. Large Language Models (LLMs) like Llama 3, Claude 3, and the GPT family are demonstrating breathtaking capabilities, but their size and complexity have deterred many pr…