5UP5 | pdb_00005up5

Solution structure of the de novo mini protein EHEE_rd1_0284


Experimental Data Snapshot

  • Method: SOLUTION NMR
  • Conformers Calculated: 100 
  • Conformers Submitted: 20 
  • Selection Criteria: structures with the lowest energy 

wwPDB Validation 3D Report Full Report

Validation slider image for 5UP5

This is version 1.3 of the entry. See complete history

Literature

Global analysis of protein folding using massively parallel design, synthesis, and testing.

Rocklin, G.J.Chidyausiku, T.M.Goreshnik, I.Ford, A.Houliston, S.Lemak, A.Carter, L.Ravichandran, R.Mulligan, V.K.Chevalier, A.Arrowsmith, C.H.Baker, D.

(2017) Science 357: 168-175

  • DOI: https://doi.org/10.1126/science.aan0693
  • Primary Citation Related Structures: 
    5UOI, 5UP1, 5UP5, 5UYO

  • PubMed Abstract: 

    Proteins fold into unique native structures stabilized by thousands of weak interactions that collectively overcome the entropic cost of folding. Although these forces are "encoded" in the thousands of known protein structures, "decoding" them is challenging because of the complexity of natural proteins that have evolved for function, not stability. We combined computational protein design, next-generation gene synthesis, and a high-throughput protease susceptibility assay to measure folding and stability for more than 15,000 de novo designed miniproteins, 1000 natural proteins, 10,000 point mutants, and 30,000 negative control sequences. This analysis identified more than 2500 stable designed proteins in four basic folds-a number sufficient to enable us to systematically examine how sequence determines folding and stability in uncharted protein space. Iteration between design and experiment increased the design success rate from 6% to 47%, produced stable proteins unlike those found in nature for topologies where design was initially unsuccessful, and revealed subtle contributions to stability as designs became increasingly optimized. Our approach achieves the long-standing goal of a tight feedback cycle between computation and experiment and has the potential to transform computational protein design into a data-driven science.


  • Organizational Affiliation
    • Department of Biochemistry and Institute for Protein Design, University of Washington, Seattle, WA 98195, USA.

Macromolecule Content 

  • Total Structure Weight: 4.97 kDa 
  • Atom Count: 348 
  • Modeled Residue Count: 40 
  • Deposited Residue Count: 40 
  • Unique protein chains: 1

Macromolecules

Find similar proteins by:|  3D Structure
Entity ID: 1
MoleculeChains  Sequence LengthOrganismDetailsImage
EHEE_rd1_028440Escherichia coliMutation(s): 0 

Experimental Data & Validation

Experimental Data

  • Method: SOLUTION NMR
  • Conformers Calculated: 100 
  • Conformers Submitted: 20 
  • Selection Criteria: structures with the lowest energy 

Structure Validation

View Full Validation Report



Entry History 

& Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
Natural Sciences and Engineering Research Council (NSERC, Canada)Canada--

Revision History  (Full details and data files)

  • Version 1.0: 2017-07-26
    Type: Initial release
  • Version 1.1: 2020-01-08
    Changes: Author supporting evidence, Data collection
  • Version 1.2: 2023-06-14
    Changes: Database references, Other
  • Version 1.3: 2024-05-15
    Changes: Data collection, Database references