7DGW

De novo designed protein H4A2S


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.35 Å
  • R-Value Free: 0.227 
  • R-Value Work: 0.191 
  • R-Value Observed: 0.194 

wwPDB Validation   3D Report Full Report


This is version 1.3 of the entry. See complete history


Literature

A backbone-centred energy function of neural networks for protein design.

Huang, B.Xu, Y.Hu, X.Liu, Y.Liao, S.Zhang, J.Huang, C.Hong, J.Chen, Q.Liu, H.

(2022) Nature 602: 523-528

  • DOI: https://doi.org/10.1038/s41586-021-04383-5
  • Primary Citation of Related Structures:  
    7DGU, 7DGW, 7DGY, 7DKK, 7DKO, 7DMF, 7FBB, 7FBC, 7FBD

  • PubMed Abstract: 

    A protein backbone structure is designable if a substantial number of amino acid sequences exist that autonomously fold into it 1,2 . It has been suggested that the designability of backbones is governed mainly by side chain-independent or side chain type-insensitive molecular interactions 3-5 , indicating an approach for designing new backbones (ready for amino acid selection) based on continuous sampling and optimization of the backbone-centred energy surface. However, a sufficiently comprehensive and precise energy function has yet to be established for this purpose. Here we show that this goal is met by a statistical model named SCUBA (for Side Chain-Unknown Backbone Arrangement) that uses neural network-form energy terms. These terms are learned with a two-step approach that comprises kernel density estimation followed by neural network training and can analytically represent multidimensional, high-order correlations in known protein structures. We report the crystal structures of nine de novo proteins whose backbones were designed to high precision using SCUBA, four of which have novel, non-natural overall architectures. By eschewing use of fragments from existing protein structures, SCUBA-driven structure design facilitates far-reaching exploration of the designable backbone space, thus extending the novelty and diversity of the proteins amenable to de novo design.


  • Organizational Affiliation

    MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, Hefei National Laboratory for Physical Sciences at the Microscale, School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China.


Macromolecules
Find similar proteins by:  (by identity cutoff)  |  3D Structure
Entity ID: 1
MoleculeChains Sequence LengthOrganismDetailsImage
de novo designed protein H4A2S101Escherichia coli 'BL21-Gold(DE3)pLysS AGMutation(s): 0 
Entity Groups  
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
Sequence Annotations
Expand
  • Reference Sequence
Small Molecules
Ligands 1 Unique
IDChains Name / Formula / InChI Key2D Diagram3D Interactions
EPE
Query on EPE

Download Ideal Coordinates CCD File 
B [auth A]4-(2-HYDROXYETHYL)-1-PIPERAZINE ETHANESULFONIC ACID
C8 H18 N2 O4 S
JKMHFZQWWAIEOD-UHFFFAOYSA-N
Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.35 Å
  • R-Value Free: 0.227 
  • R-Value Work: 0.191 
  • R-Value Observed: 0.194 
  • Space Group: P 21 21 2
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 46.526α = 90
b = 53.801β = 90
c = 30.549γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
PDB_EXTRACTdata extraction
XDSdata reduction
XDSdata scaling
PHENIXphasing

Structure Validation

View Full Validation Report



Entry History 

Deposition Data

Revision History  (Full details and data files)

  • Version 1.0: 2021-11-24
    Type: Initial release
  • Version 1.1: 2022-02-16
    Changes: Database references
  • Version 1.2: 2022-02-23
    Changes: Database references, Structure summary
  • Version 1.3: 2022-03-02
    Changes: Database references