7DGW | pdb_00007dgw

De novo designed protein H4A2S


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.35 Å
  • R-Value Free: 
    0.227 (Depositor), 0.222 (DCC) 
  • R-Value Work: 
    0.191 (Depositor), 0.197 (DCC) 
  • R-Value Observed: 
    0.194 (Depositor) 

wwPDB Validation 3D Report Full Report

Validation slider image for 7DGW

This is version 1.4 of the entry. See complete history

Literature

A backbone-centred energy function of neural networks for protein design.

Huang, B.Xu, Y.Hu, X.Liu, Y.Liao, S.Zhang, J.Huang, C.Hong, J.Chen, Q.Liu, H.

(2022) Nature 602: 523-528

  • DOI: https://doi.org/10.1038/s41586-021-04383-5
  • Primary Citation Related Structures: 
    7DGU, 7DGW, 7DGY, 7DKK, 7DKO, 7DMF, 7FBB, 7FBC, 7FBD

  • PubMed Abstract: 

    A protein backbone structure is designable if a substantial number of amino acid sequences exist that autonomously fold into it 1,2 . It has been suggested that the designability of backbones is governed mainly by side chain-independent or side chain type-insensitive molecular interactions 3-5 , indicating an approach for designing new backbones (ready for amino acid selection) based on continuous sampling and optimization of the backbone-centred energy surface. However, a sufficiently comprehensive and precise energy function has yet to be established for this purpose. Here we show that this goal is met by a statistical model named SCUBA (for Side Chain-Unknown Backbone Arrangement) that uses neural network-form energy terms. These terms are learned with a two-step approach that comprises kernel density estimation followed by neural network training and can analytically represent multidimensional, high-order correlations in known protein structures. We report the crystal structures of nine de novo proteins whose backbones were designed to high precision using SCUBA, four of which have novel, non-natural overall architectures. By eschewing use of fragments from existing protein structures, SCUBA-driven structure design facilitates far-reaching exploration of the designable backbone space, thus extending the novelty and diversity of the proteins amenable to de novo design.


  • Organizational Affiliation
    • MOE Key Laboratory for Membraneless Organelles and Cellular Dynamics, Hefei National Laboratory for Physical Sciences at the Microscale, School of Life Sciences, Division of Life Sciences and Medicine, University of Science and Technology of China, Hefei, China.

Macromolecule Content 

  • Total Structure Weight: 11.59 kDa 
  • Atom Count: 807 
  • Modeled Residue Count: 94 
  • Deposited Residue Count: 101 
  • Unique protein chains: 1

Macromolecules

Find similar proteins by:|  3D Structure
Entity ID: 1
MoleculeChains  Sequence LengthOrganismDetailsImage
de novo designed protein H4A2S101Escherichia coli 'BL21-Gold(DE3)pLysS AGMutation(s): 0 

Small Molecules

Ligands 1 Unique
IDChains Name / Formula / InChI Key2D Diagram3D Interactions
EPE

Query on EPE



Download:Ideal Coordinates CCD File
B [auth A]4-(2-HYDROXYETHYL)-1-PIPERAZINE ETHANESULFONIC ACID
C8 H18 N2 O4 S
JKMHFZQWWAIEOD-UHFFFAOYSA-N

Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 1.35 Å
  • R-Value Free:  0.227 (Depositor), 0.222 (DCC) 
  • R-Value Work:  0.191 (Depositor), 0.197 (DCC) 
  • R-Value Observed: 0.194 (Depositor) 
Space Group: P 21 21 2
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 46.526α = 90
b = 53.801β = 90
c = 30.549γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
PDB_EXTRACTdata extraction
XDSdata reduction
XDSdata scaling
PHENIXphasing

Structure Validation

View Full Validation Report



Entry History 

Deposition Data

Revision History  (Full details and data files)

  • Version 1.0: 2021-11-24
    Type: Initial release
  • Version 1.1: 2022-02-16
    Changes: Database references
  • Version 1.2: 2022-02-23
    Changes: Database references, Structure summary
  • Version 1.3: 2022-03-02
    Changes: Database references
  • Version 1.4: 2024-05-29
    Changes: Data collection