Monday, November 5, 2007 - 1:50 PM
104d

Engineering Transcription Factors with Novel DNA-Bind Specificity

Tasha Desai, Chemical and Biomolecular Engineering, University of Illinois, 600 South Matthews Ave., 211 Roger Adams Laboratory, Box C-3, Urbana, IL 61801 and Christopher V. Rao, Department of Chemical Engineering, University of Illinois, 600 South Matthews Ave., 211 Roger Adams Laboratory, Box C-3, Urbana, IL 61801.

Transcription factors are proteins that bind to specific sequences of DNA and regulate the expression of neighboring genes. By changing the DNA-binding specificity of these proteins, we can potentially change how genes are regulated in cells. The challenge is that we do not yet know how different combinations of amino acids in the DNA-binding domain of the protein map to specific nucleic acid sequences. Identifying these rules will be useful for engineering novel transcription factors with applications in metabolic engineering, gene therapy, and synthetic biology. To solve this problem, we propose the use of comparative genomics for transcription factor engineering.

High-throughput sequencing has revealed numerous patterns and variations in the genomic architecture of diverse species of bacteria. By observing the natural variation of specific families of transcription factors in divergent species, we can identify specificity-determining amino acids and their cognate binding sequences. We have tested this approach using the cyclic AMP receptor (CRP) and fumarate and nitrate reduction regulatory (FNR) family of winged helix-turn-helix proteins. For this family of proteins, we were able to computationally identify eight sets of amino acid combinations that are predicted to yield novel DNA-binding specificity. These computational predictions were tested experimentally using the CRP protein from Escherichia coli.

Of the eight computational predictions, four were shown experimentally to have novel DNA-binding specificity in vivo. We tested these designs using both the native CRP promoter and a tetracycline-inducible one. In the process, we were able to identify how expression levels determine the activity of these engineered proteins. Collectively, these results demonstrate the utility of comparative genomics for computationally-directed protein engineering.