Transipedia.org: k-mer-based exploration of large RNA sequencing datasets and application to cancer data

Autor: Chloé Bessière, Haoliang Xue, Benoit Guibert, Anthony Boureux, Florence Rufflé, Julien Viot, Rayan Chikhi, Mikaël Salson, Camille Marchet, Thérèse Commes, Daniel Gautheret
Jazyk: angličtina
Rok vydání: 2024
Předmět:
Zdroj: Genome Biology, Vol 25, Iss 1, Pp 1-16 (2024)
Druh dokumentu: article
ISSN: 1474-760X
DOI: 10.1186/s13059-024-03413-5
Popis: Abstract Indexing techniques relying on k-mers have proven effective in searching for RNA sequences across thousands of RNA-seq libraries, but without enabling direct RNA quantification. We show here that arbitrary RNA sequences can be quantified in seconds through their decomposition into k-mers, with a precision akin to that of conventional RNA quantification methods. Using an index of the Cancer Cell Line Encyclopedia (CCLE) collection consisting of 1019 RNA-seq samples, we show that k-mer indexing offers a powerful means to reveal non-reference sequences, and variant RNAs induced by specific gene alterations, for instance in splicing factors.
Databáze: Directory of Open Access Journals