8VHH

Engineered holo tryptophan synthase (Tm9D8*) derived from T. maritima TrpB


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.15 Å
  • R-Value Free: 0.237 
  • R-Value Work: 0.214 
  • R-Value Observed: 0.215 

Starting Model: experimental
View more details

wwPDB Validation   3D Report Full Report


Ligand Structure Quality Assessment 


This is version 1.0 of the entry. See complete history


Literature

A combinatorially complete epistatic fitness landscape in an enzyme active site.

Johnston, K.E.Almhjell, P.J.Watkins-Dulaney, E.J.Liu, G.Porter, N.J.Yang, J.Arnold, F.H.

(2024) Proc Natl Acad Sci U S A 121: e2400439121-e2400439121

  • DOI: https://doi.org/10.1073/pnas.2400439121
  • Primary Citation of Related Structures:  
    8VHH

  • PubMed Abstract: 

    Protein engineering often targets binding pockets or active sites which are enriched in epistasis-nonadditive interactions between amino acid substitutions-and where the combined effects of multiple single substitutions are difficult to predict. Few existing sequence-fitness datasets capture epistasis at large scale, especially for enzyme catalysis, limiting the development and assessment of model-guided enzyme engineering approaches. We present here a combinatorially complete, 160,000-variant fitness landscape across four residues in the active site of an enzyme. Assaying the native reaction of a thermostable β-subunit of tryptophan synthase (TrpB) in a nonnative environment yielded a landscape characterized by significant epistasis and many local optima. These effects prevent simulated directed evolution approaches from efficiently reaching the global optimum. There is nonetheless wide variability in the effectiveness of different directed evolution approaches, which together provide experimental benchmarks for computational and machine learning workflows. The most-fit TrpB variants contain a substitution that is nearly absent in natural TrpB sequences-a result that conservation-based predictions would not capture. Thus, although fitness prediction using evolutionary data can enrich in more-active variants, these approaches struggle to identify and differentiate among the most-active variants, even for this near-native function. Overall, this work presents a large-scale testing ground for model-guided enzyme engineering and suggests that efficient navigation of epistatic fitness landscapes can be improved by advances in both machine learning and physical modeling.


  • Organizational Affiliation

    Division of Biology and Bioengineering, California Institute of Technology, Pasadena, CA 91125.


Macromolecules
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
Tryptophan synthase beta chain 1
A, B
397Thermotoga maritimaMutation(s): 10 
Gene Names: trpB1trpBTM_0138
EC: 4.2.1.20
UniProt
Find proteins for P50909 (Thermotoga maritima (strain ATCC 43589 / DSM 3109 / JCM 10099 / NBRC 100826 / MSB8))
Explore P50909 
Go to UniProtKB:  P50909
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupP50909
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.15 Å
  • R-Value Free: 0.237 
  • R-Value Work: 0.214 
  • R-Value Observed: 0.215 
  • Space Group: I 4
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 165.698α = 90
b = 165.698β = 90
c = 83.061γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
XDSdata reduction
Aimlessdata scaling
PHASERphasing

Structure Validation

View Full Validation Report



Ligand Structure Quality Assessment 


Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
Department of Energy (DOE, United States)United StatesDE-SC0022218

Revision History  (Full details and data files)

  • Version 1.0: 2024-08-14
    Type: Initial release