Popis: |
A fundamental challenge of protein engineering is the ability to consistently design proteins that will spontaneously fold into `native-like’ structures. Combinatorial de novo design offers a solution, by limiting the sequence space and helping in understanding sequence-stability relationships. One model system for combinatorial design is Rop, a 63-residue homodimer, whose regulatory function in E. coli is to control plasmid copy number. Rop consists of layers of heptad repeats a-g, with hydrophobic residues occupying a and d positions, forming the hydrophobic core. Interestingly, a break in the pattern results in the surface exposure of an Arg residue at position d and burial of a Phe residue at position e. Understanding the heptad discontinuities and the amino acid preferences at each of the heptad positions in four-helix bundles can aid in the understanding of sequence-structure relationships. We aim to study the conformational preference by means of combinatorial and statistical approaches, and further biophysical characterization.To understand the role of heptad discontinuity in Rop, we designed combinatorial libraries by randomizing Arg55 and Arg55/Phe56 with degenerate codons. A functional cell-based fluorescent screen and conformational stability analysis from thermal and chemical denaturation revealed the role of an electrostatic interaction between the Arg and an Asp present across the dimer interface. Surprisingly, a non-conservative substitution of Arg to Leu is found to increase the stability of the protein by having indistinguishable enthalpy of unfolding as wild type, revealed by Gibbs-Helmholtz analysis. We also found that the R55F56 variants, which have bulky hydrophobes at both positions, are more stable compared to under-packed ones or charged variants, which again have huge enthalpic and entropic penalties. Variants with hydrophobic residues at both 55 and 56 positions are able to compensate for the loss of stability by presumably replacing the electrostatic interaction by increased hydrophobic contacts. Another surprising result is the enhanced stability of a variant (Asp32Arg) that is expected to have repulsive charges at the interaction interface. HSQC data suggest the possibility of altered conformations, and non-canonical heptads for some of these variants revealing the structural plasticity associated with the sequence. To analyze the roles of surface and edge residues in the heptad layers of Rop, we made libraries randomizing each of these heptad positions with degenerate NNK codons. Analysis from the cell based activity screen of these libraries indicate that the surface positional libraries b, c, and f have an overall enrichment of charged residues, with a preference of lysines over arginines on the RNA binding helix 1. The helix 2 could accommodate large aromatic or polar residues for library f, hydrophobic residues for library c, while still charged or polar residues were preferred for library b. The edge positional libraries e and g had an overall enrichment of hydrophobic residues. High throughput thermal scanning and stability studies on these libraries were used to monitor the relative Tm and sequencing those variants resulted in an understanding of conformational preferences at these positions. Further, a statistical analysis of four-helix bundles available in databases was used to construct a sequence fitness landscape of four-helix bundles. Finally, we introduced a method to monitor the sequence-stability landscape by combining combinatorial experiments with a cell free expression method. These results collectively aid in our understanding of sequence-stability relationship in Rop and subsequently improve our knowledge of combinatorial protein design. |