8Y7E

Cryo-EM Structure of the human minor pre-B complex (pre-precatalytic spliceosome) U12 snRNP part


Domain Annotation: SCOP2 Classification

Chains | Type | Family Name | Domain Identifier | Family Identifier | Provenance Source (Version)
D [auth 3] | SCOP2B Superfamily | WD40 repeat-like | 8052534 | 3001694 | SCOP2B (2021-05-27)
D [auth 3] | SCOP2B Superfamily | WD40 repeat-like | 8052534 | 3001694 | SCOP2B (2021-05-27)
D [auth 3] | SCOP2B Superfamily | WD40 repeat-like | 8052535 | 3001694 | SCOP2B (2021-05-27)
D [auth 3] | SCOP2B Superfamily | WD40 repeat-like | 8051197 | 3001694 | SCOP2B (2021-05-27)
E [auth 4] | SCOP2B Superfamily | RNA-binding domain RBD | 8090834 | 3000110 | SCOP2B (2021-05-27)
F [auth 5] | SCOP2B Superfamily | RNA-binding domain RBD | 8040695 | 3000110 | SCOP2B (2021-05-27)
G [auth 6] | SCOP2B Superfamily | Triquetra zinc finger motif | 8051848 | 3002040 | SCOP2B (2021-05-27)
J [auth h] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8041751 | 3000419 | SCOP2B (2021-05-27)
K [auth i] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8041747 | 3000419 | SCOP2B (2021-05-27)
L [auth j] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8041748 | 3000419 | SCOP2B (2021-05-27)
M [auth k] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8041749 | 3000419 | SCOP2B (2021-05-27)
N [auth l] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8063476 | 3000419 | SCOP2B (2021-05-27)
O [auth m] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8063452 | 3000419 | SCOP2B (2021-05-27)
P [auth n] | SCOP2B Superfamily | Sm-like ribonucleoproteins | 8063468 | 3000419 | SCOP2B (2021-05-27)

Protein Family Annotation: Pfam

Chains | Accession | Name | Description | Comments | Source
B [auth 1] | PF22646 | PPP2R1A-like HEAT repeat (PPP2R1A-like_HEAT) | PPP2R1A-like HEAT repeat | - | Repeat
C [auth 2] | PF04037 | Domain of unknown function (DUF382) (DUF382) | Domain of unknown function (DUF382) | - | Family
C [auth 2] | PF04046 | PSP (PSP) | PSP | - | Family
D [auth 3] | PF10433 | Mono-functional DNA-alkylating methyl methanesulfonate N-term (MMS1_N) | Mono-functional DNA-alkylating methyl methanesulfonate N-term | - | Repeat
D [auth 3] | PF03178 | CPSF A subunit region (CPSF_A) | CPSF A subunit region | - | Repeat
E [auth 4] | PF00076 | RNA recognition motif (RRM_1) | RNA recognition motif | The RRM motif (a.k.a. RRM, RBD, or RNP domain) is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases. The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment. The LA proteins (Swiss:P05455) have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteristic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins (Swiss:P05455) are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease. | Domain
F [auth 5] | PF00076 | RNA recognition motif (RRM_1) | RNA recognition motif | The RRM motif (a.k.a. RRM, RBD, or RNP domain) is probably diagnostic of an RNA binding protein. RRMs are found in a variety of RNA binding proteins, including various hnRNP proteins, proteins implicated in regulation of alternative splicing, and protein components of snRNPs. The motif also appears in a few single stranded DNA binding proteins. The RRM structure consists of four strands and two helices arranged in an alpha/beta sandwich, with a third helix present during RNA binding in some cases. The C-terminal beta strand (4th strand) and final helix are hard to align and have been omitted in the SEED alignment. The LA proteins (Swiss:P05455) have an N terminal rrm which is included in the seed. There is a second region towards the C terminus that has some features characteristic of a rrm but does not appear to have the important structural core of a rrm. The LA proteins (Swiss:P05455) are one of the main autoantigens in Systemic lupus erythematosus (SLE), an autoimmune disease. | Domain
H [auth 7] | PF07189 | Splicing factor 3B subunit 10 (SF3b10) (SF3b10) | Splicing factor 3B subunit 10 (SF3b10) | - | Family
I [auth H] | PF01423 | LSM domain (LSM) | LSM domain | The LSM domain contains Sm proteins as well as other related LSM (Like Sm) proteins. The U1, U2, U4/U6, and U5 small nuclear ribonucleoprotein particles (snRNPs) involved in pre-mRNA splicing contain seven Sm proteins (B/B', D1, D2, D3, E, F and G) in common, which assemble around the Sm site present in four of the major spliceosomal small nuclear RNAs. The U6 snRNP binds to the LSM (Like Sm) proteins [3]. Sm proteins are also found in archaebacteria, which do not have any splicing apparatus suggesting a more general role for Sm proteins. All Sm proteins contain a common sequence motif in two segments, Sm1 and Sm2, separated by a short variable linker. This family also includes the bacterial Hfq (host factor Q) proteins. Hfq are also RNA-binding proteins that form hexameric rings. | Domain

Gene Ontology: Gene Product Annotation

Chains | Polymer | Molecular Function | Biological Process | Cellular Component
 | pre-mRNA | - | - | -
B [auth 1] | Splicing factor 3B subunit 1
C [auth 2] | Splicing factor 3B subunit 2
D [auth 3] | Splicing factor 3B subunit 3
E [auth 4] | Splicing factor 3B subunit 4
F [auth 5] | Splicing factor 3B subunit 6
G [auth 6] | PHD finger-like domain-containing protein 5A
H [auth 7] | Splicing factor 3B subunit 5
I [auth H] | U12 snRNA | - | - | -
J [auth h] | Small nuclear ribonucleoprotein Sm D3
K [auth i] | Small nuclear ribonucleoprotein-associated proteins B and B'
L [auth j] | Small nuclear ribonucleoprotein Sm D1
M [auth k] | Small nuclear ribonucleoprotein Sm D2
N [auth l] | Small nuclear ribonucleoprotein E
O [auth m] | Small nuclear ribonucleoprotein F
P [auth n] | Small nuclear ribonucleoprotein G
Q [auth v] | Sodium channel modifier 1

InterPro: Protein Family Classification

Chains | Accession | Name | Type
B [auth 1] | IPR016024 | Armadillo-type fold | Homologous Superfamily
B [auth 1] | IPR011989 | Armadillo-like helical | Homologous Superfamily
B [auth 1] | IPR015016 | Splicing factor 3B subunit 1 | Domain
B [auth 1] | IPR038737 | Splicing factor 3B subunit 1-like | Family
C [auth 2] | IPR006568 | PSP, proline-rich | Domain
C [auth 2] | IPR007180 | Domain of unknown function DUF382 | Domain
C [auth 2] | IPR003034 | SAP domain | Domain
D [auth 3] | IPR018846 | Cleavage/polyadenylation specificity factor, A subunit, N-terminal | Domain
D [auth 3] | IPR036322 | WD40-repeat-containing domain superfamily | Homologous Superfamily
D [auth 3] | IPR004871 | Cleavage/polyadenylation specificity factor, A subunit, C-terminal | Domain
D [auth 3] | IPR015943 | WD40/YVTN repeat-like-containing domain superfamily | Homologous Superfamily
E [auth 4] | IPR034159 | SF3B4, RNA recognition motif 2 | Domain
E [auth 4] | IPR000504 | RNA recognition motif domain | Domain
E [auth 4] | IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | Homologous Superfamily
E [auth 4] | IPR035979 | RNA-binding domain superfamily | Homologous Superfamily
E [auth 4] | IPR034158 | SF3B4, RNA recognition motif 1 | Domain
F [auth 5] | IPR000504 | RNA recognition motif domain | Domain
F [auth 5] | IPR012677 | Nucleotide-binding alpha-beta plait domain superfamily | Homologous Superfamily
F [auth 5] | IPR034150 | SF3B6, RNA recognition motif | Domain
F [auth 5] | IPR035979 | RNA-binding domain superfamily | Homologous Superfamily
G [auth 6] | IPR005345 | PHF5-like | Family
H [auth 7] | IPR017089 | Splicing factor 3B, subunit 5 | Family
H [auth 7] | IPR009846 | Splicing factor 3B subunit 5/RDS3 complex subunit 10 | Family
J [auth h] | IPR027141 | Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3 | Family
J [auth h] | IPR034099 | Small nuclear ribonucleoprotein Sm D3 | Family
J [auth h] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
J [auth h] | IPR047575 | Sm domain | Domain
J [auth h] | IPR010920 | LSM domain superfamily | Homologous Superfamily
K [auth i] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
K [auth i] | IPR017131 | Small ribonucleoprotein associated, SmB/SmN | Family
K [auth i] | IPR047575 | Sm domain | Domain
K [auth i] | IPR010920 | LSM domain superfamily | Homologous Superfamily
L [auth j] | IPR027141 | Like-Sm (LSM) domain containing protein, LSm4/SmD1/SmD3 | Family
L [auth j] | IPR034102 | Small nuclear ribonucleoprotein D1 | Domain
L [auth j] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
L [auth j] | IPR047575 | Sm domain | Domain
L [auth j] | IPR010920 | LSM domain superfamily | Homologous Superfamily
M [auth k] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
M [auth k] | IPR047575 | Sm domain | Domain
M [auth k] | IPR027248 | Small nuclear ribonucleoprotein Sm D2 | Family
M [auth k] | IPR010920 | LSM domain superfamily | Homologous Superfamily
N [auth l] | IPR027078 | Small nuclear ribonucleoprotein E | Family
N [auth l] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
N [auth l] | IPR047575 | Sm domain | Domain
N [auth l] | IPR010920 | LSM domain superfamily | Homologous Superfamily
O [auth m] | IPR016487 | Sm-like protein Lsm6/SmF | Family
O [auth m] | IPR034100 | Small nuclear ribonucleoprotein F | Family
O [auth m] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
O [auth m] | IPR047575 | Sm domain | Domain
O [auth m] | IPR010920 | LSM domain superfamily | Homologous Superfamily
P [auth n] | IPR044641 | Sm-like protein Lsm7/SmG | Family
P [auth n] | IPR034098 | Small nuclear ribonucleoprotein G | Family
P [auth n] | IPR001163 | Sm domain, eukaryotic/archaea-type | Domain
P [auth n] | IPR047575 | Sm domain | Domain
P [auth n] | IPR010920 | LSM domain superfamily | Homologous Superfamily
Q [auth v] | IPR031625 | Sodium channel modifier 1, acidic C-terminal domain | Domain
Q [auth v] | IPR031622 | Sodium channel modifier 1, zinc-finger | Domain
Q [auth v] | IPR033570 | Sodium channel modifier 1 | Family

Pharos: Disease Associations

Chains | Drug Target | Associated Disease
C [auth 2] | Pharos Q13435
D [auth 3] | Pharos Q15393
E [auth 4] | Pharos Q15427
F [auth 5] | Pharos Q9Y3B4
G [auth 6] | Pharos Q7RTV0
H [auth 7] | Pharos Q9BWJ5
J [auth h] | Pharos P62318
M [auth k] | Pharos P62316
N [auth l] | Pharos P62304
O [auth m] | Pharos P62306
P [auth n] | Pharos P62308
Q [auth v] | Pharos Q9BWG6

Protein Modification Annotation

Modified Residue(s)
Chain | Residue(s) | Description
B [auth 1] | SEP | Parent Component: SER; RESID: AA0037; PSI-MOD: O-phospho-L-serine MOD:00046
D [auth 3] | TPO | Parent Component: THR; RESID: AA0037, AA0038; PSI-MOD: O-phospho-L-serine MOD:00046, O-phospho-L-threonine MOD:00047