New!
Explore similar structures
New!
Find similar folds in PDB and AFDB with Foldseek

SET and MYND domain-containing protein 4

Reviewed
Reference proteome

AF-Q8IYR2-F1-v4

DownloadPDB file mmCIF file Predicted aligned error

Share your feedback on structure with Google DeepMind Looks great Could be improved

Information

Structure viewer
 Very high (pLDDT > 90)
 High (90 > pLDDT > 70)
 Low (70 > pLDDT > 50)
 Very low (pLDDT < 50)
AlphaFold produces a per-residue model confidence score (pLDDT) between 0 and 100. Some regions below 50 pLDDT may be unstructured in isolation.

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

SequenceNo structure available
Structure Tools
  • 19:58:19
    Mol* Plugin 4.5.0 [8/22/2024, 2:35:37 PM]
 Very high (pLDDT > 90)
 High (90 > pLDDT > 70)
 Low (70 > pLDDT > 50)
 Very low (pLDDT < 50)
AlphaFold produces a per-residue model confidence score (pLDDT) between 0 and 100. Some regions below 50 pLDDT may be unstructured in isolation.

Hide colour legend

Scored residueAligned residue
01002003004005006007008000100200300400500600700800
  • 0
  • 5
  • 10
  • 15
  • 20
  • 25
  • 30
Expected position error (Ångströms)

Predicted aligned error (PAE)

Click and drag a box on the PAE viewer to select regions of the structure and highlight them on the 3D viewer.

PAE data is useful for assessing inter-domain accuracy – go to Help section below for more information.

The heatmap is unavailable for mobile devices. Please use a larger screen to see the heatmap.

1804100200300400500600700800200400600800
100200300400500600700800GAVLISTCMDNEQRKHFYWPGAVLISTCMDNEQRKHFYWPAlternative amino acidsResidue sequence number
BenignUncertainPathogenic Reference
Amino acid present in the reference UniProt protein sequence, based on HG38
  • 0.0
  • 0.2
  • 0.3
  • 0.4
  • 0.5
  • 0.6
  • 0.7
  • 0.8
  • 1.0
  • -
Filter by category
0
0.34
0.00.34
0.34
0.56
0.340.564
0.56
1
0.5641
Similar structures Discover similar structures from the Protein Data Bank (PDB) and the AlphaFold Database clustered to 50% sequence identity (AFDB50) using Foldseek.
PDB ID
and chain
DescriptionSpecies Residue
range
E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
Seq.
identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
Res.
(Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
Align
in 3D
Align in 3D
PDB ID and
chain
5hq8_A_1
Description

Histone-lysine N-methyltransferase SMYD3

Histone-lysine N-methyltran...

Histone-lysine N-methyltransferase SMYD3
Species

Homo sapiens

Homo sapiens

Residue range

1 - 428Q9H7B4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
3.37e-18
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
18.1%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
1.72
Align in 3D
PDB ID and
chain
3oxf_A_1
Description

Histone-lysine N-methyltransferase SMYD3

Histone-lysine N-methyltran...

Histone-lysine N-methyltransferase SMYD3
Species

Homo sapiens

Homo sapiens

Residue range

1 - 428Q9H7B4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
4.56e-18
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
17.6%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.82
Align in 3D
PDB ID and
chain
6mon_A_1
Description

N-lysine methyltransferase SMYD2

N-lysine methyltransferase ...

N-lysine methyltransferase SMYD2
Species

Homo sapiens

Homo sapiens

Residue range

5 - 433Q9NRG4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
5.41e-18
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
17.1%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.71
Align in 3D
PDB ID and
chain
4o6f_A_1
Description

N-lysine methyltransferase SMYD2

N-lysine methyltransferase ...

N-lysine methyltransferase SMYD2
Species

Homo sapiens

Homo sapiens

Residue range

1 - 433Q9NRG4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
5.64e-18
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
16.2%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.82
Align in 3D
PDB ID and
chain
6mon_B_1
Description

N-lysine methyltransferase SMYD2

N-lysine methyltransferase ...

N-lysine methyltransferase SMYD2
Species

Homo sapiens

Homo sapiens

Residue range

5 - 433Q9NRG4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
7.62e-18
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
17.2%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.71
Align in 3D
PDB ID and
chain
6cby_A_1
Description

N-lysine methyltransferase SMYD2

N-lysine methyltransferase ...

N-lysine methyltransferase SMYD2
Species

Homo sapiens

Homo sapiens

Residue range

1 - 433Q9NRG4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
1.33e-17
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
16.4%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.55
Align in 3D
PDB ID and
chain
3ru0_A_1
Description

Histone-lysine N-methyltransferase SMYD3

Histone-lysine N-methyltran...

Histone-lysine N-methyltransferase SMYD3
Species

Homo sapiens

Homo sapiens

Residue range

2 - 428Q9H7B4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
1.45e-17
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
18.4%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
1.85
Align in 3D
PDB ID and
chain
3ru0_B_1
Description

Histone-lysine N-methyltransferase SMYD3

Histone-lysine N-methyltran...

Histone-lysine N-methyltransferase SMYD3
Species

Homo sapiens

Homo sapiens

Residue range

2 - 428Q9H7B4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
1.52e-17
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
16.5%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
1.85
Align in 3D
PDB ID and
chain
3s7f_A_1
Description

N-lysine methyltransferase SMYD2

N-lysine methyltransferase ...

N-lysine methyltransferase SMYD2
Species

Homo sapiens

Homo sapiens

Residue range

1 - 433Q9NRG4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
1.58e-17
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
17.7%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.85
Align in 3D
PDB ID and
chain
6cby_B_1
Description

N-lysine methyltransferase SMYD2

N-lysine methyltransferase ...

N-lysine methyltransferase SMYD2
Species

Homo sapiens

Homo sapiens

Residue range

1 - 433Q9NRG4

E-value
Likelihood of a match between
the query and target sequence
in a structural alignment.
The lower the E-value, the more
significant the alignment.
1.88e-17
Seq. identity
Percentage of identical
residues between aligned
sequences over the
aligned length.
16.6%
Resolution (Å)
Resolution. Indicates the level
of detail present in the 3D
structure. Smaller value means
finer details of the structure
and higher quality.
2.55
Items per page:
1 – 10 of 112
AF-Q8IYR2-F1
Current entry

WebGL does not seem to be available.

This can be caused by an outdated browser, graphics card driver issue, or bad weather. Sometimes, just restarting the browser helps. Also, make sure hardware acceleration is enabled in your browser.

For a list of supported browsers, refer to http://caniuse.com/#feat=webgl.

Aligned in 3D
Rotate and zoom structures in the 3D
viewer. Structures are aligned using alpha
carbon atoms as reference points. Once
aligned, RMSD values will appear in the
list below; lower values indicate greater
similarity between the two structures.
Toggle structures in both PDB and AFDB tables to view and align them with the entry structure in the 3D viewer. You can also download the aligned coordinates of the toggled structures and the entry structure.
Structure similarity clusterPredicted structures in the AlphaFold Protein Structure Database clustered using MMseqs2 and Foldseek. This data is provided by the AFDB Clusters.

AlphaFold database protein sequences clustered by the MMseqs2 algorithm (Steinegger M. and Soeding J., Nat. Commun. 9, 2018). Each cluster is comprised of sequences that fulfil two criteria: maintaining a maximum sequence identity of 50% and achieving a 90% bi-directional sequence overlap with the longest sequence of the cluster representative.

AFDB accession DescriptionSpecies
Sequence length
Average pLDDT
AFDB accessionAF-A0A6P4EBJ9-F1
Unreviewed
Reference proteome
Description SET and MYND domain-containing protein 4

SET and MYND domain-containing protein 4 ...

SET and MYND domain-containing protein 4
SpeciesDrosophila rhopaloa
Drosophila rhopaloa
Sequence length 738 Average pLDDT 92.62
AFDB accessionAF-B3ME54-F1
Unreviewed
Reference proteome
Description SET domain-containing protein
SET domain-containing protein
SpeciesDrosophila ananassae
Drosophila ananassae
Sequence length 751 Average pLDDT 92.56
AFDB accessionAF-A0A6P4L7S8-F1
Unreviewed
Reference proteome
Description SET and MYND domain-containing protein 4

SET and MYND domain-containing protein 4 ...

SET and MYND domain-containing protein 4
SpeciesDrosophila bipectinata
Drosophila bipectinata
Sequence length 752 Average pLDDT 92.38
AFDB accessionAF-B4QHA6-F1
Unreviewed
Reference proteome
Description GD10068
GD10068
SpeciesDrosophila simulans
Drosophila simulans
Sequence length 751 Average pLDDT 92.31
AFDB accessionAF-A0A6P4IWB4-F1
Unreviewed
Reference proteome
Description SET and MYND domain-containing protein 4

SET and MYND domain-containing protein 4 ...

SET and MYND domain-containing protein 4
SpeciesDrosophila kikkawai
Drosophila kikkawai
Sequence length 752 Average pLDDT 92
AFDB accessionAF-A0A6P8JCN6-F1
Unreviewed
Reference proteome
Description SET and MYND domain-containing protein 4

SET and MYND domain-containing protein 4 ...

SET and MYND domain-containing protein 4
SpeciesDrosophila mauritiana
Drosophila mauritiana
Sequence length 751 Average pLDDT 92
AFDB accessionAF-A0A6P4GYI8-F1
Unreviewed
Reference proteome
Description uncharacterized protein LOC108061527

uncharacterized protein LOC108061527 ...

uncharacterized protein LOC108061527
SpeciesDrosophila takahashii
Drosophila takahashii
Sequence length 756 Average pLDDT 91.81
AFDB accessionAF-A0A6P8Y238-F1
Unreviewed
Reference proteome
Description SET and MYND domain-containing protein 4

SET and MYND domain-containing protein 4 ...

SET and MYND domain-containing protein 4
SpeciesDrosophila albomicans
Drosophila albomicans
Sequence length 741 Average pLDDT 91.81
AFDB accessionAF-B3N790-F1
Unreviewed
Description SET domain-containing protein
SET domain-containing protein
SpeciesDrosophila erecta
Drosophila erecta
Sequence length 750 Average pLDDT 91.81
AFDB accessionAF-B4HT10-F1
Unreviewed
Reference proteome
Description GM20595
GM20595
SpeciesDrosophila sechellia
Drosophila sechellia
Sequence length 751 Average pLDDT 91.81
Items per page:
1 – 10 of 811

Help

Visit our online training course

How to interpret the Predicted Aligned Error

 

The Predicted Aligned Error (PAE) measures the confidence in the relative position of two residues within the predicted structure, providing insight into the reliability of relative position and orientations of different domains. Consider the human protein encoded by the gene GNE (Q9Y223).

GNE has two distinct domains according to experimentally determined structures in the Protein Data Bank (PDBe-KB). Does AlphaFold confidently predict their relative positions? We can use the interactive Predicted Aligned Error (PAE) plot to answer this question.

The PAE plot is not an inter-residue distance map or a contact map. Instead, the shade of green indicates the expected distance error in Ångströms (Å), ranging from 0 Å to an arbitrary cut-off of 31 Å. The colour at (x, y) corresponds to the expected distance error in the residue x’s position when the predicted and the true structures are aligned on residue y.

A dark green tile corresponds to a good prediction (low error), whereas a light green tile indicates poor prediction (high error). For example, when aligning on residue 300:

  • We’re confident in the relative position of residue 200
  • We’re not confident in the relative position of residue 600

The two low-error, dark green squares correspond to the two domains. By clicking and dragging, you can highlight these squares on the structure. If you want to remove the highlighting, click the cross icon.

When selecting an off-diagonal region, the plot visually represents the relationship between the selected ranges on the sequence and structure. The x range corresponds to the selection for scored residues, highlighted in orange, while the y range of aligned residues is highlighted in emerald green.

The high PAE values across the whole inter-domain region indicate that for this particular protein, AlphaFold does not reliably predict the relative position of the domains.

Let’s consider another inter-domain example, the human protein encoded by DIP2B (Q9P265).

In this case, we have confidence in the relative position of scored residues around 1450 when aligned with residues around 850, suggesting a packing between the small central domains.

Note that the PAE scores are asymmetrical, meaning there might be variations in PAE values between (x,y) and (y,x) positions. This is particularly relevant for loop regions with highly uncertain orientations, as seen on the DNA topoisomerase 3 (Q8T2T7).

 


Last updated

Last updated in AlphaFold DB version 2022-11-01, created with the AlphaFold Monomer v2.0 pipeline.

Licence and attribution

Data is available for academic and commercial use, under a CC-BY-4.0 licence.

EMBL-EBI expects attribution (e.g. in publications, services or products) for any of its online services, databases or software in accordance with good scientific practice.

If you make use of an AlphaFold prediction, please cite the following papers:
Jumper, J et al. Highly accurate protein structure prediction with AlphaFold. Nature (2021).
Varadi, M et al. AlphaFold Protein Structure Database in 2024: providing structure coverage for over 214 million protein sequences. Nucleic Acids Research (2024).
If you use data from AlphaMissense in your work, please cite the following paper:
Cheng, J et al. Accurate proteome-wide missense variant effect prediction with AlphaMissense. Science (2023).

AlphaFold Data Copyright (2022) DeepMind Technologies Limited.
AlphaMissense Copyright (2023) DeepMind Technologies Limited.

Feedback and questions

If you want to share feedback on an AlphaFold structure prediction, consider using the feedback buttons at the top of each structure page. If you have any questions that are not covered in the FAQs, please contact alphafold@deepmind.com. If you have feedback on the website or experience any bugs please contact afdbhelp@ebi.ac.uk.

Let us know how the AlphaFold Protein Structure Database has been useful in your research at alphafold@deepmind.com.

Disclaimer

The AlphaFold and AlphaMissense Data and other information provided on this site contain predictions with varying levels of confidence, is for theoretical modelling only and caution should be exercised in its use. It is provided 'as-is' without any warranty of any kind, whether expressed or implied. For clarity, no warranty is given that use of the information shall not infringe the rights of any third party. The information is not intended to be a substitute for professional medical advice, diagnosis, or treatment, and does not constitute medical or other professional advice. The AlphaFold and AlphaMissense Data have not been validated for, and are not approved for, any clinical use.

Use of the AlphaFold Protein Structure Database is subject to EMBL-EBI Terms of Use.