Large language models and their applications in bioinformatics

Autor: Oluwafemi A. Sarumi, Dominik Heider
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Computational and Structural Biotechnology Journal, Vol 23, Iss , Pp 3498-3505 (2024)
Druh dokumentu: article
ISSN: 2001-0370
DOI: 10.1016/j.csbj.2024.09.031
Popis: Recent advancements in Natural Language Processing (NLP) have been significantly driven by the development of Large Language Models (LLMs), representing a substantial leap in language-based technology capabilities. These models, built on sophisticated deep learning architectures, typically transformers, are characterized by billions of parameters and extensive training data, enabling them to achieve high accuracy across various tasks. The transformer architecture of LLMs allows them to effectively handle context and sequential information, which is crucial for understanding and generating human language. Beyond traditional NLP applications, LLMs have shown significant promise in bioinformatics, transforming the field by addressing challenges associated with large and complex biological datasets. In genomics, proteomics, and personalized medicine, LLMs facilitate identifying patterns, predicting protein structures, or understanding genetic variations. This capability is crucial, e.g., for advancing drug discovery, where accurate prediction of molecular interactions is essential. This review discusses the current trends in LLMs research and their potential to revolutionize the field of bioinformatics and accelerate novel discoveries in the life sciences.
Databáze: Directory of Open Access Journals