8XYW

De novo designed protein Trx-3


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.41 Å
  • R-Value Free: 0.250 
  • R-Value Work: 0.222 
  • R-Value Observed: 0.223 

Starting Model: in silico
View more details

wwPDB Validation   3D Report Full Report


This is version 1.0 of the entry. See complete history


Literature

All-Atom Protein Sequence Design Based on Geometric Deep Learning.

Liu, J.Guo, Z.You, H.Zhang, C.Lai, L.

(2024) Angew Chem Int Ed Engl : e202411461-e202411461

  • DOI: https://doi.org/10.1002/anie.202411461
  • Primary Citation of Related Structures:  
    8XYR, 8XYS, 8XYT, 8XYU, 8XYV, 8XYW

  • PubMed Abstract: 

    Designing sequences for specific protein backbones is a key step in creating new functional proteins. Here, we introduce GeoSeqBuilder, a deep learning framework that integrates protein sequence generation with side chain conformation prediction to produce the complete all-atom structures for designed sequences. GeoSeqBuilder uses spatial geometric features from protein backbones and explicitly includes three-body interactions of neighboring residues. GeoSeqBuilder achieves native residue type recovery rate of 51.6%, comparable to ProteinMPNN and  other leading methods, while accurately predicting side chain conformations. We first used GeoSeqBuilder to design sequences for thioredoxin and a hallucinated three-helical bundle protein. All the 15 tested sequences expressed as soluble monomeric proteins with high thermal stability, and the 2 high-resolution crystal structures solved closely match the designed models. The generated protein sequences exhibit low similarity (minimum 23%) to the original sequences, with significantly altered hydrophobic cores. We further redesigned the hydrophobic core of glutathione peroxidase 4, and 3 of the 5 designs showed improved enzyme activity. Although further testing is needed, the high experimental success rate in our testing demonstrates that GeoSeqBuilder is a powerful tool for designing novel sequences for predefined protein structures with atomic details. GeoSeqBuilder is available at https://github.com/PKUliujl/GeoSeqBuilder.


  • Organizational Affiliation

    Peking University, Center for Life Sciences, 5 Yiheyuan Road, 100871, Beijing, CHINA.


Macromolecules
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
De novo designed Trx-3116synthetic constructMutation(s): 0 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
Sequence Annotations
Expand
  • Reference Sequence
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.41 Å
  • R-Value Free: 0.250 
  • R-Value Work: 0.222 
  • R-Value Observed: 0.223 
  • Space Group: C 1 2 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 64.487α = 90
b = 64.511β = 109.498
c = 26.794γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
autoPROCdata reduction
Aimlessdata scaling
PHASERphasing

Structure Validation

View Full Validation Report



Entry History & Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Natural Science Foundation of China (NSFC)China--

Revision History  (Full details and data files)

  • Version 1.0: 2024-10-02
    Type: Initial release