Popis: |
BackgroundLarvae of Trichoptera produce silk to build various underwater structures allowing them to exploit a wide range of aquatic environments. Theheavy chain fibroin(h-fibroin) gene encodes the primary protein component of their silk. Studies on this long (>20 kbp) and highly repetitive gene have been limited by difficulties in its sequence assembly. Recently, high-quality long-read sequencing techniques have been successfully applied to obtain the full-lengthh-fibroinsequence.ResultsWe used three new and five previously published genomes to identify eight full-length h-fibroin gene and protein sequences of h-fibroin across the order Trichoptera covering various endpoints of the diversity of ecological silk use in caddisflies. We analyze these together with four existing high-quality h-fibroin sequences. Across the order, we observed conserved patterns in h-fibroin (high similarity of amino(n)-/carboxyl(c)-termini, presence of characteristic repeating structural modules). However, the sequence, number, and arrangement of these repeating modules varied across clades with increasing structural complexity of h-fibroin in fixed retreat and tube-case builders compared to cocoon-builders. We also found a higher percentage of proline in fixed-retreat makers.ConclusionThis study provides characterizations of the primary structure of h-fibroin from a diverse set of caddisflies. The interplay of conserved termini and basic motif structure with high variation in repeating modules as well as the variation in the percentage of proline might be linked to differences in mechanical properties (i.e., tensile strength, toughness) related to the different silk usage. This sets a starting point for future studies to screen and correlate amino acid motifs and other sequence features with quantifiable silk properties. |