[Home] [Back to server]

D-I-TASSER results for example

[Click result.zip to download all results on this page]



Input Sequence in FASTA format
>test (69 residues)
AGCNAGQLTVCTGAIAGGARPTAACCSSLRAQQGCFCQFAKDPRYGRYVNSPNARKAVSSCGIALPTCH

Predicted Secondary Structure
Sequence 20 40 60 | | | AGCNAGQLTVCTGAIAGGARPTAACCSSLRAQQGCFCQFAKDPRYGRYVNSPNARKAVSSCGIALPTCH
PredictionCCCCHHHCCCCHHHHHCCCCCCHHHHHHHHHCCCCHHHHHCCCCHHHHCCCHHHHHHHHHHCCCCCCCC
Confidence977733312136776379999599999999615501111119537753688889999997199999889
H:Helix; S:Strand; C:Coil

Predicted Solvent Accessibility
Sequence 20 40 60 | | | AGCNAGQLTVCTGAIAGGARPTAACCSSLRAQQGCFCQFAKDPRYGRYVNSPNARKAVSSCGIALPTCH
Prediction761545414423433346652355015404744412041253441451253640431052163334628
Values range from 0 (buried residue) to 8 (highly exposed residue)

Predicted Contact and Distance Map Used in D-I-TASSER simulation
Contact Map
Distance Map
Hydrogen bond networks
D-I-TASSER simulation is guided by the consensus contact map (left figure), distance map (middle figure) and Hydrogen bond network (right figure) derived based on confidence scores of AttentionPotential. In the contact, distance map and hydrogen bond networks, the axes mark the residue index along the sequence. For the contact map, each dot represents a residue pair with predicted contact, while for the distance map and hydrogen bond network, a color scale represents a distance of 1-20+ angstroms or a angle of 0-180 degree.

Top 10 threading templates used by D-I-TASSER
Rank PDB
hit
ID1ID2CovNorm.
Zscore
Download
alignment
  20 40 60 | | |
SS
Seq
CCCCHHHCCCCHHHHHCCCCCCHHHHHHHHHCCCCHHHHHCCCCHHHHCCCHHHHHHHHHHCCCCCCCC
AGCNAGQLTVCTGAIAGGARPTAACCSSLRAQQGCFCQFAKDPRYGRYVNSPNARKAVSSCGIALPTCH
11tukA 0.56 0.54 15.43 1.33DEthreader ACQASQ-LAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQ-YRSPHARDTLTSCGLAVPHC-
21tukA 0.60 0.58 16.62 3.23HHpred -ACQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQYIRSPHARDTLTSCGLAVPHC-
31siyA 0.21 0.20 6.43 1.48DisCoVER MTCVQGNLAQCIGFLQKGGVVPPSCCTGVKNILAVCSCLKAA--AVRGINPNNAEALPGKCGVYKIST-
41tukA 0.60 0.58 16.62 3.00SPARKS-K -ACQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQYIRSPHARDTLTSCGLAVPHC-
51tukA 0.61 0.58 16.61 1.32MapAlign --CQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQYIRSPHARDTLTSCGLAVPHC-
61tukA 0.60 0.58 16.62 1.11FFAS-3D -ACQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQYIRSPHARDTLTSCGLAVPHC-
71tukA 0.59 0.57 16.22 1.17Hybrid-CEthreader -ACQASQLAVCASAILSGAKPSGECCGNLRAQQGCFCQYAKDPTYGQ-YRSPHARDTLTSCGLAVPHC-
82ljoA 0.23 0.22 6.83 1.17DEthreader ICAVTSDLSPCLTYLTGGPGPSPQCCGGVKKLLAA-CNL-KS-AAGSILNTNNAAALPGKCGVNIYKTN
91l6hA 1.00 1.00 28.00 3.02HHpred AGCNAGQLTVCTGAIAGGARPTAACCSSLRAQQGCFCQFAKDPRYGRYVNSPNARKAVSSCGIALPTCH
102rknA 0.25 0.25 7.65 2.08SPARKS-K DLCSQDELNECKPAVSKETSPSQPCCTALQHDFACLCGYKNSPWLGFGVDPELASALPKQCGLANAPTC
(a)ID1 is the number of template residues identical to query divided by number of aligned residues.
(b)ID2 is the number of template residues identical to query divided by query sequence length.
(c)Cov is equal the number of aligned template residues divided by query sequence length.
(d)Norm. Zscore is the normalized Z-score of the threading alignments. A Normalized Z-score >1 means a good alignment and is highlighted in bold.
(e)Download alignment lists the threading program used to identify the template, and provide the 3D structure of aligned regions of threading templates (threading[1-10].pdb.gz).
(f)Template residues identical to query sequence are highlighted in color.

Top 1 final models from D-I-TASSER

Click
to view
RankaDownload PDBDownload ModelCIFC-scoreb Estimated TM-scorecEstimated RMSDc
1 model1.pdb.gz model1.cif -0.19 0.77±0.12 3.6±2.5Å
(a) D-I-TASSER simulations generate a large ensemble of structural conformations, i.e. decoys. These decoys are clustered by SPICKER based on pairwise structure similarity to report up to five final models from the five largest clusters. Models are ranked in descending order of cluster size. If the simulations converge well, it is possible to have less than 5 models generated, which is usually an indication of good model quality.
(b) The model confidence is quantitatified by C-score, calculated based on significance of threading template alignments, contact map satisfaction rate, mean absolute error between distance of model and distance of AttentionPotential, and convergence of D-I-TASSER simulations. C-score is typically in the range of [-5, 2], with higher C-score signifies higher model confidence.
(c) Model TM-score and RMSD are estimated based on C-score and protein length for the first model.

Proteins with similar structure
Top 10 structural analogs in PDB (as identified by TM-align)

Click
to view
RankPDB
Hit
TM-scoreRMSDaIDENaCov.Download
Alignment
11tukA0.901.100.5820.971model1_1tukA.pdb.gz
22ljoA0.712.210.2270.957model1_2ljoA.pdb.gz
37ksbA0.702.360.2690.971model1_7ksbA.pdb.gz
45tviV0.692.340.2880.957model1_5tviV.pdb.gz
54xuwA0.692.290.2580.957model1_4xuwA.pdb.gz
66frrA0.692.220.2000.942model1_6frrA.pdb.gz
72rknA0.682.370.2420.957model1_2rknA.pdb.gz
81siyA0.672.590.2240.971model1_1siyA.pdb.gz
97kscA0.652.770.2031.000model1_7kscA.pdb.gz
101bv2A0.642.640.2790.986model1_1bv2A.pdb.gz

(a)Query structure is shown in cartoon, while the structural analog is displayed using backbone trace.
(b)Ranking of proteins is based on TM-score of the structural alignment between the query structure and known structures in the PDB library.
(c)RMSDa is the RMSD between residues that are structurally aligned by TM-align.
(d)IDENa is the percentage sequence identity in the structurally aligned region.
(e)Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein.

Predicted Gene Ontology (GO) Terms
GOsearchresult_final_MF.svg
Molecular Function (MF)
GO termCscoreGOName
GO:00082890.89lipid binding
GO:00431670.77ion binding
GO:00468720.75metal ion binding
GO:00431680.74anion binding
GO:00436210.72protein self-association
GO:00082700.72zinc ion binding
GO:00055040.72fatty acid binding

Download full result of the above consensus prediction.

Click the graph to show a high resolution version.
(a)CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
(b)The graph shows the predicted terms within the Gene Ontology hierachy for Molecular Function. Confidently predicted terms are color coded by CscoreGO:
[0.4,0.5)[0.5,0.6)[0.6,0.7)[0.7,0.8)[0.8,0.9)[0.9,1.0]
GOsearchresult_final_BP.svg
Biological Process (BP)
GO termCscoreGOName
GO:00098620.72systemic acquired resistance, salicylic acid mediated signaling pathway
GO:00717020.60organic substance transport

Download full result of the above consensus prediction.

Click the graph to show a high resolution version.
(a)CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
(b)The graph shows the predicted terms within the Gene Ontology hierachy for Biological Process. Confidently predicted terms are color coded by CscoreGO:
[0.4,0.5)[0.5,0.6)[0.6,0.7)[0.7,0.8)[0.8,0.9)[0.9,1.0]
GOsearchresult_final_CC.svg
Cellular Component (CC)
GO termCscoreGOName
GO:00444641.00cell part
GO:00432290.75intracellular organelle
GO:00432310.63intracellular membrane-bounded organelle
GO:00312250.60anchored component of membrane

Download full result of the above consensus prediction.

Click the graph to show a high resolution version.
(a)CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
(b)The graph shows the predicted terms within the Gene Ontology hierachy for Cellular Component. Confidently predicted terms are color coded by CscoreGO:
[0.4,0.5)[0.5,0.6)[0.6,0.7)[0.7,0.8)[0.8,0.9)[0.9,1.0]

Predicted Enzyme Commission (EC) Numbers
Top 5 enzyme homologs in PDB

Click
to view
RankCscoreECPDB
Hit
TM-scoreRMSDaIDENaCov.EC NumberPredicted Active Site Residues
10.0601ac5A0.5093.660.0470.9133.4.16.6 NA
20.0603kaoA0.5073.540.0940.8414.1.2.40 NA
30.0601bx9A0.5083.390.0310.9272.5.1.18 NA
40.0603a1kA0.4932.910.0660.8553.5.1.4 NA
50.0601clcA0.5083.430.0160.8843.2.1.4 NA

 Click on the radio buttons to visualize predicted active site residues.
(a)CscoreEC is the confidence score for the Enzyme Commission (EC) number prediction. CscoreEC values range in between [0-1]; where a higher score indicates a more reliable EC number prediction.
(b)TM-score is a measure of global structural similarity between query and template protein.
(c)RMSDa is the RMSD between residues that are structurally aligned by TM-align.
(d)IDENa is the percentage sequence identity in the structurally aligned region.
(e)Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.

Predicted Ligand Binding Sites
Template proteins with similar binding site:

Click
to view
RankCscoreLBPDB
Hit
TM-scoreRMSDaIDENaCov.BS-scoreLig. NameDownload
Complex
Predicted binding site residues
10.031jtbA0.5913.150.2170.9860.50UUUcomplex1.pdb.gz4,5,8,29,43,54,57,58,63,65

  Click on the radio buttons to visualize predicted binding site and residues.
(a)CscoreLB is the confidence score of predicted binding site. CscoreLB values range in between [0-1]; where a higher score indicates a more reliable ligand-binding site prediction.
(b)BS-score is a measure of local similarity (sequence & structure) between template binding site and predicted binding site in the query structure. Based on large scale benchmarking analysis, we have observed that a BS-score >1 reflects a significant local match between the predicted and template binding site.
(c)TM-score is a measure of global structural similarity between query and template protein.
(d)RMSDa the RMSD between residues that are structurally aligned by TM-align.
(e)IDENa is the percentage sequence identity in the structurally aligned region.
(f)Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.

[Back to server]
Reference:
  • Wei Zheng, Qiqige Wuyun, Yang Li, Quancheng Liu, Xiaogen Zhou, Chunxiang Peng, Yiheng Zhu, Lydia Freddolino, Yang Zhang. Deep-learning-based single-domain and multidomain protein structure prediction with D-I-TASSER. Nature Biotechnology, 2025. doi:10.1038/s41587-025-02654-4.
  • Wei Zheng, Qiqige Wuyun, Lydia Freddolino, Yang Zhang. Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15. Proteins, 2023. doi:10.1002/prot.26585.
  • Contact WeiZheng