7EH0 | pdb_00007eh0

Thermus thermophilus RNA polymerase transcription initiation complex containing a template-strand purine at position TSS-2, UpA RNA primer and CMPcPP


Experimental Data Snapshot

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.81 Å
  • R-Value Free: 
    0.243 (Depositor), 0.242 (DCC) 
  • R-Value Work: 
    0.203 (Depositor), 0.203 (DCC) 
  • R-Value Observed: 
    0.204 (Depositor) 

Starting Model: experimental
View more details

wwPDB Validation 3D Report Full Report

Validation slider image for 7EH0

Ligand Structure Quality Assessment 


This is version 1.1 of the entry. See complete history

Literature

Promoter-sequence determinants and structural basis of primer-dependent transcription initiation in Escherichia coli .

Skalenko, K.S.Li, L.Zhang, Y.Vvedenskaya, I.O.Winkelman, J.T.Cope, A.L.Taylor, D.M.Shah, P.Ebright, R.H.Kinney, J.B.Zhang, Y.Nickels, B.E.

(2021) Proc Natl Acad Sci U S A 118

  • DOI: https://doi.org/10.1073/pnas.2106388118
  • Primary Citation Related Structures: 
    7EH0, 7EH1, 7EH2

  • PubMed Abstract: 

    Chemical modifications of RNA 5'-ends enable "epitranscriptomic" regulation, influencing multiple aspects of RNA fate. In transcription initiation, a large inventory of substrates compete with nucleoside triphosphates for use as initiating entities, providing an ab initio mechanism for altering the RNA 5'-end. In Escherichia coli cells, RNAs with a 5'-end hydroxyl are generated by use of dinucleotide RNAs as primers for transcription initiation, "primer-dependent initiation." Here, we use massively systematic transcript end readout (MASTER) to detect and quantify RNA 5'-ends generated by primer-dependent initiation for ∼4 10 (∼1,000,000) promoter sequences in E. coli The results show primer-dependent initiation in E. coli involves any of the 16 possible dinucleotide primers and depends on promoter sequences in, upstream, and downstream of the primer binding site. The results yield a consensus sequence for primer-dependent initiation, Y TSS-2 N TSS-1 N TSS W TSS+1 , where TSS is the transcription start site, N TSS-1 N TSS is the primer binding site, Y is pyrimidine, and W is A or T. Biochemical and structure-determination studies show that the base pair (nontemplate-strand base:template-strand base) immediately upstream of the primer binding site (Y:R TSS-2 , where R is purine) exerts its effect through the base on the DNA template strand (R TSS-2 ) through interchain base stacking with the RNA primer. Results from analysis of a large set of natural, chromosomally encoded E coli promoters support the conclusions from MASTER. Our findings provide a mechanistic and structural description of how TSS-region sequence hard-codes not only the TSS position but also the potential for epitranscriptomic regulation through primer-dependent transcription initiation.


  • Organizational Affiliation
    • Department of Genetics, Rutgers University, Piscataway, NJ 08854.

Macromolecule Content 

  • Total Structure Weight: 444.42 kDa 
  • Atom Count: 28,596 
  • Modeled Residue Count: 3,537 
  • Deposited Residue Count: 3,863 
  • Unique protein chains: 5
  • Unique nucleic acid chains: 3

Macromolecules


Find similar proteins by:|  3D Structure
Entity ID: 1
MoleculeChains  Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit alpha
A, B
315Thermus thermophilus HB8Mutation(s): 0 
EC: 2.7.7.6
UniProt
Find proteins for Q5SHR6 (Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8))
Explore Q5SHR6 
Go to UniProtKB:  Q5SHR6
Entity Groups
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ5SHR6
Sequence Annotations
Expand
Reference Sequence
Find similar proteins by:|  3D Structure
Entity ID: 2
MoleculeChains  Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit beta1,119Thermus thermophilus HB8Mutation(s): 0 
EC: 2.7.7.6
UniProt
Find proteins for Q8RQE9 (Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8))
Explore Q8RQE9 
Go to UniProtKB:  Q8RQE9
Entity Groups
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ8RQE9
Sequence Annotations
Expand
Reference Sequence
Find similar proteins by:|  3D Structure
Entity ID: 3
MoleculeChains  Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit beta'1,524Thermus thermophilus HB8Mutation(s): 0 
EC: 2.7.7.6
UniProt
Find proteins for Q8RQE8 (Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8))
Explore Q8RQE8 
Go to UniProtKB:  Q8RQE8
Entity Groups
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ8RQE8
Sequence Annotations
Expand
Reference Sequence
Find similar proteins by:|  3D Structure
Entity ID: 4
MoleculeChains  Sequence LengthOrganismDetailsImage
DNA-directed RNA polymerase subunit omega99Thermus thermophilus HB8Mutation(s): 0 
EC: 2.7.7.6
UniProt
Find proteins for Q8RQE7 (Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8))
Explore Q8RQE7 
Go to UniProtKB:  Q8RQE7
Entity Groups
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ8RQE7
Sequence Annotations
Expand
Reference Sequence
Find similar proteins by:|  3D Structure
Entity ID: 5
MoleculeChains  Sequence LengthOrganismDetailsImage
RNA polymerase sigma factor SigA443Thermus thermophilus HB8Mutation(s): 0 
Gene Names: sigATTHA0532
UniProt
Find proteins for Q5SKW1 (Thermus thermophilus (strain ATCC 27634 / DSM 579 / HB8))
Explore Q5SKW1 
Go to UniProtKB:  Q5SKW1
Entity Groups
Sequence Clusters30% Identity50% Identity70% Identity90% Identity95% Identity100% Identity
UniProt GroupQ5SKW1
Sequence Annotations
Expand
Reference Sequence
Find similar nucleic acids by:  (by identity cutoff) 
Entity ID: 6
MoleculeChains LengthOrganismImage
DNA (27-MER)27synthetic construct
Sequence Annotations
Expand
Reference Sequence
Find similar nucleic acids by:  Sequence
Entity ID: 7
MoleculeChains LengthOrganismImage
DNA (5'-D(*CP*C*TP*GP*CP*AP*TP*CP*CP*GP*TP*GP*AP*GP*TP*AP*AP*AP*G)-3')19synthetic construct
Sequence Annotations
Expand
Reference Sequence
Find similar nucleic acids by:  Sequence
Entity ID: 8
MoleculeChains LengthOrganismImage
RNA (5'-R(*UP*A)-3')2synthetic construct
Sequence Annotations
Expand
Reference Sequence

Small Molecules

Ligands 3 Unique
IDChains Name / Formula / InChI Key2D Diagram3D Interactions
2TM

Query on 2TM



Download:Ideal Coordinates CCD File
P [auth D]5'-O-[(S)-hydroxy{[(S)-hydroxy(phosphonooxy)phosphoryl]methyl}phosphoryl]cytidine
C10 H18 N3 O13 P3
STGUOVSTMBLHFT-ZOQUXTDFSA-N
ZN

Query on ZN



Download:Ideal Coordinates CCD File
K [auth D],
L [auth D]
ZINC ION
Zn
PTFCDOFLOPIGGS-UHFFFAOYSA-N
MG

Query on MG



Download:Ideal Coordinates CCD File
J [auth B],
M [auth D],
N [auth D],
O [auth D],
Q [auth F]
MAGNESIUM ION
Mg
JLVVSXFLKOJNIY-UHFFFAOYSA-N

Experimental Data & Validation

Experimental Data

  • Method: X-RAY DIFFRACTION
  • Resolution: 2.81 Å
  • R-Value Free:  0.243 (Depositor), 0.242 (DCC) 
  • R-Value Work:  0.203 (Depositor), 0.203 (DCC) 
  • R-Value Observed: 0.204 (Depositor) 
Space Group: C 1 2 1
Unit Cell:
Length ( Å )Angle ( ˚ )
a = 183.784α = 90
b = 103.433β = 98.95
c = 295.626γ = 90
Software Package:
Software NamePurpose
PHENIXrefinement
HKL-2000data scaling
PDB_EXTRACTdata extraction
HKL-2000data reduction
PHENIXphasing

Structure Validation

View Full Validation Report



Ligand Structure Quality Assessment 


Entry History 

& Funding Information

Deposition Data


Funding OrganizationLocationGrant Number
National Natural Science Foundation of China (NSFC)China31822001

Revision History  (Full details and data files)

  • Version 1.0: 2021-07-14
    Type: Initial release
  • Version 1.1: 2023-11-29
    Changes: Data collection, Database references, Refinement description