Exploring the loblolly pine (Pinus taeda L.) genome by BAC sequencing and Cot analysis.

Autor: Perera D; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA., Magbanua ZV; National Institute of Molecular Biology & Biotechnology, National Science Complex, College of Science, University of the Philippines, Diliman, Quezon City, Philippines., Thummasuwan S; Department of Agricultural Sciences, Naresuan University, Phitsanulok, Thailand. Electronic address: supaphant@nu.ac.th., Mukherjee D; Department of Food Science, Nutrition, & Health Promotion, Mississippi State University, Mississippi State, MS 39762, USA. Electronic address: dm259@msstate.edu., Arick M 2nd; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA. Electronic address: maa146@igbb.msstate.edu., Chouvarine P; Texas Children's Cancer Center, Baylor College of Medicine, Houston, TX 77030, USA., Nairn CJ; Warnell School of Forest Resources, University of Georgia, Athens, GA 30602, USA. Electronic address: nairn@uga.edu., Schmutz J; US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA; HudsonAlpha Institute for Biotechnology, 601 Genome Way, Huntsville, AL 35801, USA. Electronic address: jschmutz@hudsonalpha.org., Grimwood J; US Department of Energy Joint Genome Institute, Walnut Creek, CA 94598, USA; HudsonAlpha Institute for Biotechnology, 601 Genome Way, Huntsville, AL 35801, USA. Electronic address: jgrimwood@hudsonalpha.org., Dean JFD; Department of Biochemistry, Molecular Biology, Entomology & Plant Pathology, Mississippi State University, Mississippi State, MS 39762, USA. Electronic address: jeffdean@bch.msstate.edu., Peterson DG; Institute for Genomics, Biocomputing & Biotechnology, Mississippi State University, Mississippi State, MS 39762, USA; Department of Plant & Soil Sciences, Mississippi State University, Mississippi State, MS 39762, USA. Electronic address: dp127@msstate.edu.
Jazyk: angličtina
Zdroj: Gene [Gene] 2018 Jul 15; Vol. 663, pp. 165-177. Date of Electronic Publication: 2018 Apr 12.
DOI: 10.1016/j.gene.2018.04.024
Abstrakt: Loblolly pine (LP; Pinus taeda L.) is an economically and ecologically important tree in the southeastern U.S. To advance understanding of the loblolly pine (LP; Pinus taeda L.) genome, we sequenced and analyzed 100 BAC clones and performed a Cot analysis. The Cot analysis indicates that the genome is composed of 57, 24, and 10% highly-repetitive, moderately-repetitive, and single/low-copy sequences, respectively (the remaining 9% of the genome is a combination of fold back and damaged DNA). Although single/low-copy DNA only accounts for 10% of the LP genome, the amount of single/low-copy DNA in LP is still 14 times the size of the Arabidopsis genome. Since gene numbers in LP are similar to those in Arabidopsis, much of the single/low-copy DNA of LP would appear to be composed of DNA that is both gene- and repeat-poor. Macroarrays prepared from a LP bacterial artificial chromosome (BAC) library were hybridized with probes designed from cell wall synthesis/wood development cDNAs, and 50 of the "targeted" clones were selected for further analysis. An additional 25 clones were selected because they contained few repeats, while 25 more clones were selected at random. The 100 BAC clones were Sanger sequenced and assembled. Of the targeted BACs, 80% contained all or part of the cDNA used to target them. One targeted BAC was found to contain fungal DNA and was eliminated from further analysis. Combinations of similarity-based and ab initio gene prediction approaches were utilized to identify and characterize potential coding regions in the 99 BACs containing LP DNA. From this analysis, we identified 154 gene models (GMs) representing both putative protein-coding genes and likely pseudogenes. Ten of the GMs (all of which were specifically targeted) had enough support to be classified as intact genes. Interestingly, the 154 GMs had statistically indistinguishable (α = 0.05) distributions in the targeted and random BAC clones (15.18 and 12.61 GM/Mb, respectively), whereas the low-repeat BACs contained significantly fewer GMs (7.08 GM/Mb). However, when GM length was considered, the targeted BACs had a significantly greater percentage of their length in GMs (3.26%) when compared to random (1.63%) and low-repeat (0.62%) BACs. The results of our study provide insight into LP evolution and inform ongoing efforts to produce a reference genome sequence for LP, while characterization of genes involved in cell wall production highlights carbon metabolism pathways that can be leveraged for increasing wood production.
(Copyright © 2018 The Authors. Published by Elsevier B.V. All rights reserved.)
Databáze: MEDLINE