Zobrazeno 1 - 10
of 151
pro vyhledávání: '"Dianne, Cook"'
Autor:
Nicholas Tierney, Dianne Cook
Publikováno v:
Journal of Statistical Software, Vol 105, Pp 1-31 (2023)
Despite the large body of research on missing value distributions and imputation, there is comparatively little literature with a focus on how to make it easy to handle, explore, and impute missing values in data. This paper addresses this gap. The n
Externí odkaz:
https://doaj.org/article/b51e13ad39a8410c90413e746452a1f7
Publikováno v:
Journal of Statistics and Data Science Education, Vol 30, Iss 3, Pp 289-303 (2022)
AbstractTextbook data is essential for teaching statistics and data science methods because it is clean, allowing the instructor to focus on methodology. Ideally textbook datasets are refreshed regularly, especially when they are subsets taken from a
Externí odkaz:
https://doaj.org/article/a5624756704343409826e98575302242
Autor:
Julia Polak, Dianne Cook
Publikováno v:
Journal of Statistics and Data Science Education, Vol 29, Iss 1, Pp 63-70 (2021)
Kaggle is a data modeling competition service, where participants compete to build a model with lower predictive error than other participants. Several years ago they released a simplified service that is ideal for instructors to run competitions in
Externí odkaz:
https://doaj.org/article/e7576c71173b4bd5886b4eb7b8571213
Publikováno v:
PLoS Computational Biology, Vol 17, Iss 10 (2021)
A key benefit of long-read nanopore sequencing technology is the ability to detect modified DNA bases, such as 5-methylcytosine. The lack of R/Bioconductor tools for the effective visualization of nanopore methylation profiles between samples from di
Externí odkaz:
https://doaj.org/article/5f17621a4ffd4711b2156b5a8f90d7c0
Publikováno v:
BMC Bioinformatics, Vol 20, Iss 1, Pp 1-31 (2019)
Abstract Background Despite the availability of many ready-made testing software, reliable detection of differentially expressed genes in RNA-seq data is not a trivial task. Even though the data collection is considered high-throughput, data analysis
Externí odkaz:
https://doaj.org/article/4aa808d575a44f399dc7deb18f28aba1
Publikováno v:
Journal of Statistical Software, Vol 89, Iss 1, Pp 1-31 (2019)
This paper introduces ggenealogy (Rutter, Vanderplas, and Cook 2019), a developing R software package that provides tools for searching through genealogical data, generating basic statistics on their graphical structures using parent and child connec
Externí odkaz:
https://doaj.org/article/3e240f72af3d4cbcb1afd8d6d1f05437
Autor:
Lindsay Rutter, Jimena Carrillo-Tripp, Bryony C. Bonning, Dianne Cook, Amy L. Toth, Adam G. Dolezal
Publikováno v:
BMC Genomics, Vol 20, Iss 1, Pp 1-20 (2019)
Abstract Background Parts of Europe and the United States have witnessed dramatic losses in commercially managed honey bees over the past decade to what is considered an unsustainable extent. The large-scale loss of bees has considerable implications
Externí odkaz:
https://doaj.org/article/0a398c520edc4e419871de3aed55622e
Publikováno v:
Harvard Data Science Review (2021)
Externí odkaz:
https://doaj.org/article/1c3adf581ad848648063164dff5e87d5
Autor:
Lindsay Rutter, Dianne Cook
Publikováno v:
PLoS Computational Biology, Vol 16, Iss 6, p e1007912 (2020)
Interactive data visualization is imperative in the biological sciences. The development of independent layers of interactivity has been in pursuit in the visualization community. We developed bigPint, a data visualization package available on Biocon
Externí odkaz:
https://doaj.org/article/7a80d36f6468420f9d05b2a469af3499
Publikováno v:
Genome Biology, Vol 20, Iss 1, Pp 1-10 (2019)
Abstract Bioconductor is a widely used R-based platform for genomics, but its host of complex genomic data structures places a cognitive burden on the user. For most tasks, the GRanges object would suffice, but there are gaps in the API that prevent
Externí odkaz:
https://doaj.org/article/87652990b80e4c1ca2afb2db49020943