Toward the structural genomics of complexes: Crystal structure of a PE/PPE protein complex from Mycobacterium tuberculosis.
Strong, M., Sawaya, M.R., Wang, S., Phillips, M., Cascio, D., Eisenberg, D.(2006) Proc Natl Acad Sci U S A 103: 8060-8065
- PubMed: 16690741 
- DOI: https://doi.org/10.1073/pnas.0602606103
- Primary Citation of Related Structures:  
2G38 - PubMed Abstract: 
The developing science called structural genomics has focused to date mainly on high-throughput expression of individual proteins, followed by their purification and structure determination. In contrast, the term structural biology is used to denote the determination of structures, often complexes of several macromolecules, that illuminate aspects of biological function. Here we bridge structural genomics to structural biology with a procedure for determining protein complexes of previously unknown function from any organism with a sequenced genome. From computational genomic analysis, we identify functionally linked proteins and verify their interaction in vitro by coexpression/copurification. We illustrate this procedure by the structural determination of a previously unknown complex between a PE and PPE protein from the Mycobacterium tuberculosis genome, members of protein families that constitute approximately 10% of the coding capacity of this genome. The predicted complex was readily expressed, purified, and crystallized, although we had previously failed in expressing individual PE and PPE proteins on their own. The reason for the failure is clear from the structure, which shows that the PE and PPE proteins mate along an extended apolar interface to form a four-alpha-helical bundle, where two of the alpha-helices are contributed by the PE protein and two by the PPE protein. Our entire procedure for the identification, characterization, and structural determination of protein complexes can be scaled to a genome-wide level.
Organizational Affiliation: 
Howard Hughes Medical Institute, UCLA-Department of Energy Institute of Genomics and Proteomics, University of California, Los Angeles, CA 90095, USA.