Zobrazeno 1 - 8
of 8
pro vyhledávání: '"Tavor Z. Baharav"'
Publikováno v:
Patterns, Vol 1, Iss 6, Pp 100081- (2020)
Summary: Pairwise sequence alignment is often a computational bottleneck in genomic analysis pipelines, particularly in the context of third-generation sequencing technologies. To speed up this process, the pairwise k-mer Jaccard similarity is someti
Externí odkaz:
https://doaj.org/article/963367a73c234b308aecaac8afd7f6b3
Contingency tables, data represented as counts matrices, are ubiquitous across quantitative research and data-science applications. Existing statistical tests are insufficient however, as none are simultaneously computationally efficient and statisti
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::9bdf5f47fe6c8689f0508e3a7825ca10
https://doi.org/10.1101/2023.03.16.533008
https://doi.org/10.1101/2023.03.16.533008
A statistical reference-free genomic algorithm subsumes common workflows and enables novel discovery
Publikováno v:
bioRxiv : the preprint server for biology.
We introduce a probabilistic model that enables study of myriad, disparate and fundamental problems in genome science and expands the scope of inference currently possible. Our model formulates an unrecognized unifying goal of many biological studies
Autor:
Kaitlin Chaung, Tavor Z. Baharav, George Henderson, Peter Wang, Ivan N. Zheludev, Julia Salzman
SummaryWe show that myriad, disparate mechanisms that diversify genomes and transcriptomes can be captured by a unifying principle: sample-dependent sequence variation. This variation occurs in both RNA and DNA and functions to regulate transcript ex
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_________::765c774b2549d63abe67b1dcfa553060
https://doi.org/10.1101/2022.06.24.497555
https://doi.org/10.1101/2022.06.24.497555
Publikováno v:
CIKM
Extreme multi-label classification (XMC) aims to learn a model that can tag data points with a subset of relevant labels from an extremely large label set. Real world e-commerce applications like personalized recommendations and product advertising c
Publikováno v:
Patterns
Patterns, Vol 1, Iss 6, Pp 100081-(2020)
Patterns, Vol 1, Iss 6, Pp 100081-(2020)
Summary Pairwise sequence alignment is often a computational bottleneck in genomic analysis pipelines, particularly in the context of third-generation sequencing technologies. To speed up this process, the pairwise k-mer Jaccard similarity is sometim
Publikováno v:
ISIT
Distributed computing allows for large-scale computation and machine learning tasks by enabling parallel computing at massive scale. A critical challenge to speeding up distributed computing comes from stragglers, a crippling bottleneck to system per
The celebrated Monte Carlo method estimates an expensive-to-compute quantity by random sampling. Bandit-based Monte Carlo optimization is a general technique for computing the minimum of many such expensive-to-compute quantities by adaptive random sa
Externí odkaz:
https://explore.openaire.eu/search/publication?articleId=doi_dedup___::96d3f3f594569b83e3ea848a2677ccfe
http://arxiv.org/abs/1805.08321
http://arxiv.org/abs/1805.08321