[Home] [Back to DMFold server] [Help]

DMFold results for example

[Click result.zip to download all results on this page]



Input Sequence In FASTA Format

example ( 1221 residues )
>example-A
EIMPLTLKVNGKTEQLEVDTRTTLLDTLRENLHLIGTKKGCDHGQCGACTVLVNGRRLNACLTLAVMHQGAEITTIEGLGSPDNLHPMQAAFIKHDGFQCGYCTSGQICSSVAVLKEIQDGIPSHVTVDLVSAPETTADEIRERMSGNICRCGAYANILAAIEDAAGEIKS

>example-B
MKAFTYERVNTPAEAALSAQRVPGAKFIAGGTNLLDLMKLEIETPTHLIDVNGLGLDKIEVTDAGGLRIGALVRNTDLAAHERVRRDYAVLSRALLAGASGQLRNQATTAGNLLQRTRCPYFYDTNQPCNKRLPGSGCAALEGFSRQHAVVGVSEACIATHPSDMAVAMRLLDAVVETITPEGKTRSITLADFYHPPGKTPHIETALLPGELIVAVTLPPPLGGKHIYRKVRDRASYAFALVSVAAIIQPDGSGRVALGGVAHKPWRIEAADAQLSQGAQAVYDTLFASAHPTAENTFKLLLAKRTLASVLAEARAQA

>example-C
MKFDKPAGENPIDQLKVVGRPHDRIDGPLKTTGTARYAYEWHEEAPNAAYGYIVGSAIAKGRLTALDTDAAQKAPGVLAVITASNAGALGKGDKNTARLLGGPTIEHYHQAIALVVAETFEQARAAASLVQAHYRRNKGAYSLADEKQAVNQPPEDTPDKNVGDFDGAFTSAAVKIDATYTTPDQSHMAMEPHASMAVWDGNKLTLWTSNQMIDWCRTDLAKTLKVPVENVRIISPYIGGGFGGKLFLRSDALLAALAARAVKRPVKVMLPRPSIPNNTTHRPATLQHLRIGADQSGKITAISHESWSGNLPGGTPETAVQQSELLYAGANRHTGLRLATLDLPEGNAMRAPGEAPGLMALEIAIDELAEKAGIDPVEFRILNDTQVDPADPTRCFSRRQLIECLRTGADKFGWKQRNATPGQVRDGEWLVGHGVAAGFRNNLLEKSGARVHLEQNGTVTVETDMTDIGTGSYTILAQTAAEMLGVPLEQVAVHLGDSSFPVSAGSGGQWGANTSTSGVYAACMKLREMIASAVGFDPEQSQFADGKITNGTRSATLHEATAGGRLTAEESIEFGTLSKEYQQSTFAGHFVEVGVHSATGEVRVRRMLAVCAAGRILNPKTARSQVIGAMTMGMGAALMEELAVDDRLGYFVNHDMAGYEVPVHADIPKQEVIFLDDTDPISSPMKAKGVGELGLCGVSAAIANAVYNATGIRVRDYPITLDKLLDKLPDVV


  Multiple Sequence Alignments

Alignment depth: Nf=123.769, number of sequences=4849


  • Download multiple sequence alignment in aln format.
  • Download multiple sequence alignment in a3m format.

  •  Predicted Contact And Distance Map

    Contact Map
    Distance Map

     Individual Chain Modeling Results

    ID. of chain chain Length of chain pLDDT Models of Chain Sequence
    A example-model1-A 171 0.97 click to view EIMPLTLKVNGKTEQLEVDTRTTLLDTLRENLHLIGTKKGCDHGQCGACTVLVNGRRLNACLTLAVMHQGAEITTIEGLGSPDNLHPMQAAFIKHDGFQCGYCTSGQICSSVAVLKEIQDGIPSHVTVDLVSAPETTADEIRERMSGNICRCGAYANILAAIEDAAGEIKS
    B example-model1-B 318 0.97 click to view MKAFTYERVNTPAEAALSAQRVPGAKFIAGGTNLLDLMKLEIETPTHLIDVNGLGLDKIEVTDAGGLRIGALVRNTDLAAHERVRRDYAVLSRALLAGASGQLRNQATTAGNLLQRTRCPYFYDTNQPCNKRLPGSGCAALEGFSRQHAVVGVSEACIATHPSDMAVAMRLLDAVVETITPEGKTRSITLADFYHPPGKTPHIETALLPGELIVAVTLPPPLGGKHIYRKVRDRASYAFALVSVAAIIQPDGSGRVALGGVAHKPWRIEAADAQLSQGAQAVYDTLFASAHPTAENTFKLLLAKRTLASVLAEARAQA
    C example-model1-C 732 0.99 click to view MKFDKPAGENPIDQLKVVGRPHDRIDGPLKTTGTARYAYEWHEEAPNAAYGYIVGSAIAKGRLTALDTDAAQKAPGVLAVITASNAGALGKGDKNTARLLGGPTIEHYHQAIALVVAETFEQARAAASLVQAHYRRNKGAYSLADEKQAVNQPPEDTPDKNVGDFDGAFTSAAVKIDATYTTPDQSHMAMEPHASMAVWDGNKLTLWTSNQMIDWCRTDLAKTLKVPVENVRIISPYIGGGFGGKLFLRSDALLAALAARAVKRPVKVMLPRPSIPNNTTHRPATLQHLRIGADQSGKITAISHESWSGNLPGGTPETAVQQSELLYAGANRHTGLRLATLDLPEGNAMRAPGEAPGLMALEIAIDELAEKAGIDPVEFRILNDTQVDPADPTRCFSRRQLIECLRTGADKFGWKQRNATPGQVRDGEWLVGHGVAAGFRNNLLEKSGARVHLEQNGTVTVETDMTDIGTGSYTILAQTAAEMLGVPLEQVAVHLGDSSFPVSAGSGGQWGANTSTSGVYAACMKLREMIASAVGFDPEQSQFADGKITNGTRSATLHEATAGGRLTAEESIEFGTLSKEYQQSTFAGHFVEVGVHSATGEVRVRRMLAVCAAGRILNPKTARSQVIGAMTMGMGAALMEELAVDDRLGYFVNHDMAGYEVPVHADIPKQEVIFLDDTDPISSPMKAKGVGELGLCGVSAAIANAVYNATGIRVRDYPITLDKLLDKLPDVV
    Click the Chain column or image, you can go into the individual page to check the individual Chain modeling results.

     Top 5 Final Models From DMFold

    Click
    to view
    RankDownload PDBpTM-score
    1 model1.pdb.gz 0.97
    2 model2.pdb.gz 0.97
    3 model3.pdb.gz 0.97
    4 model4.pdb.gz 0.97
    5 model5.pdb.gz 0.97

    Note: If the JSmol model is not visible, please refresh the page or click the radio buttons

    (a) DMFold generates a large set of structural models by different MSAs as inputs. These models are ranked by predicted TM-score (pTM-score for multimer) or predicted LDDT (pLDDT for monomer) and top 5 models are selected with the highest predicted scores.

     Residue-level Modeling Quality


    pLDDT of chain A

    pLDDT of chain B

    pLDDT of chain C


     Proteins With Similar Structure

    Top 10 structural analogs in PDB (as identified by US-align)

    Click
    to view
    RankPDB
    Hit
    TM-scoreRMSDaIDENaCov.Download
    Alignment
    15g5g0.990.331.000.99Download
    25y6q0.942.250.420.97Download
    31rm60.912.720.280.95Download
    41sb30.912.760.290.95Download
    51t3q0.902.580.260.93Download
    61n620.892.780.250.93Download
    71n600.892.780.250.93Download
    81n610.892.830.250.93Download
    91n630.892.830.250.93Download
    101zxi0.892.850.250.93Download

    Note: If the JSmol model is not visible, please refresh the page or click the radio buttons

    (a)Query structure is shown in cartoon, while the structural analog is displayed using backbone trace.
    (b)Ranking of proteins is based on TM-score of the structural alignment between the query structure and known structures in the PDB library.
    (c)RMSDa is the RMSD between residues that are structurally aligned by US-align.
    (d)IDENa is the percentage sequence identity in the structurally aligned region.
    (e)Cov. represents the coverage of the alignment by TM-align and is equal to the number of structurally aligned residues divided by length of the query protein.

    Predicted Gene Ontology (GO) Terms


    GOsearchresult_final_MF.svg
    Molecular Function (MF)
    GO termFreqGOChain IDName
    GO:00164911.00A,B,Coxidoreductase activity
    GO:19013631.00A,B,Cheterocyclic compound binding
    GO:00971591.00A,B,Corganic cyclic compound binding
    GO:00169031.00A,B,Coxidoreductase activity, acting on the aldehyde or oxo group of donors
    GO:00506601.00A,B,Cflavin adenine dinucleotide binding
    GO:00167260.67A,Coxidoreductase activity, acting on CH or CH2 groups, NAD or NADP as acceptor
    GO:00048540.67A,Cxanthine dehydrogenase activity
    GO:00001660.33Bnucleotide binding
    GO:00506620.33Ccoenzyme binding

    Download full result of the above freq.

    Click the graph to show a high resolution version.
    (a)CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
    (b)FreqGO is the frequency of predicted GO terms with a CscoreGO greater than 0.5 among all terms within each chain of a complex protein. The FreqGO values range between [0-1], where a higher value indicates a greater proportion of high-confidence predictions (CscoreGO > 0.5) in the function annotation of the protein chain.
    (c)Chain ID indicates the specific protein chain in the complex that contains the GO term with a CscoreGO greater than 0.5. This identifier helps in associating high-confidence GO terms with their respective chains.
    (d)The graph shows the predicted terms within the Gene Ontology hierachy for Molecular Function.
    [0.4,0.5)[0.5,0.6)[0.6,0.7)[0.7,0.8)[0.8,0.9)[0.9,1.0]

    GOsearchresult_final_BP.svg
    Biological Process (BP)
    GO termFreqGOChain IDName
    GO:00447101.00A,B,Csingle-organism metabolic process
    GO:00442371.00A,B,Ccellular metabolic process
    GO:00447631.00A,B,Csingle-organism cellular process
    GO:19015641.00A,B,Corganonitrogen compound metabolic process
    GO:00442811.00A,B,Csmall molecule metabolic process
    GO:00061391.00A,B,Cnucleobase-containing compound metabolic process
    GO:00725211.00A,B,Cpurine-containing compound metabolic process
    GO:19015651.00A,B,Corganonitrogen compound catabolic process
    GO:19013611.00A,B,Corganic cyclic compound catabolic process
    GO:00447121.00A,B,Csingle-organism catabolic process
    GO:00091151.00A,B,Cxanthine catabolic process
    GO:00550860.67A,Bnucleobase-containing small molecule metabolic process
    GO:00467000.67A,Bheterocycle catabolic process
    GO:00442700.67A,Bcellular nitrogen compound catabolic process
    GO:00194390.67A,Caromatic compound catabolic process
    GO:00091120.67A,Cnucleobase metabolic process
    GO:00061450.67A,Bpurine nucleobase catabolic process
    GO:00551140.67A,Boxidation-reduction process
    GO:00099870.33Bcellular process
    GO:00081520.33Bmetabolic process
    GO:00446990.33Bsingle-organism process
    GO:00717040.33Borganic substance metabolic process
    GO:19013600.33Borganic cyclic compound metabolic process
    GO:00464830.33Bheterocycle metabolic process
    GO:00442380.33Bprimary metabolic process
    GO:00068070.33Bnitrogen compound metabolic process
    GO:00442480.33Bcellular catabolic process
    GO:00061440.33Bpurine nucleobase metabolic process
    GO:00725230.33Bpurine-containing compound catabolic process

    Download full result of the above freq.

    Click the graph to show a high resolution version.
    (a)CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
    (b)FreqGO is the frequency of predicted GO terms with a CscoreGO greater than 0.5 among all terms within each chain of a complex protein. The FreqGO values range between [0-1], where a higher value indicates a greater proportion of high-confidence predictions (CscoreGO > 0.5) in the function annotation of the protein chain.
    (c)Chain ID indicates the specific protein chain in the complex that contains the GO term with a CscoreGO greater than 0.5. This identifier helps in associating high-confidence GO terms with their respective chains.
    (d)The graph shows the predicted terms within the Gene Ontology hierachy for Biological Process.
    [0.4,0.5)[0.5,0.6)[0.6,0.7)[0.7,0.8)[0.8,0.9)[0.9,1.0]

    GOsearchresult_final_CC.svg
    Cellular Component (CC)
    GO termFreqGOChain IDName
    GO:00444640.67A,Bcell part
    GO:00444240.67A,Cintracellular part
    GO:00058290.67A,Ccytosol
    GO:00425970.33Bperiplasmic space
    GO:00160200.33Bmembrane

    Download full result of the above freq.

    Click the graph to show a high resolution version.
    (a)CscoreGO is the confidence score of predicted GO terms. CscoreGO values range in between [0-1]; where a higher value indicates a better confidence in predicting the function using the template.
    (b)FreqGO is the frequency of predicted GO terms with a CscoreGO greater than 0.5 among all terms within each chain of a complex protein. The FreqGO values range between [0-1], where a higher value indicates a greater proportion of high-confidence predictions (CscoreGO > 0.5) in the function annotation of the protein chain.
    (c)Chain ID indicates the specific protein chain in the complex that contains the GO term with a CscoreGO greater than 0.5. This identifier helps in associating high-confidence GO terms with their respective chains.
    (d)The graph shows the predicted terms within the Gene Ontology hierachy for Cellular Component.
    [0.4,0.5)[0.5,0.6)[0.6,0.7)[0.7,0.8)[0.8,0.9)[0.9,1.0]


     Predicted Enzyme Commission (EC) Numbers

    Top 5 EC Number

    RankFreqECChain IDEC NumberEC Name
    11.00A,B,C1.2.99.2 Carbon-monoxide dehydrogenase (acceptor)
    21.00A,B,C1.17.3.2 Xanthine oxidase
    31.00A,B,C1.17.1.4 Xanthine dehydrogenase
    40.67B,C1.1.1.204 Transferred entry: 1.17.1.4
    50.33A1.3.99.1 Succinate dehydrogenase

    (a)FreqEC is the frequency of predicted Enzyme Commission (EC) number among all terms within each chain of a complex protein, considering only those terms with a TM-score greater than 0.4 for PDB-hits. FreqEC values range in between [0-1];
    (b)Chain ID indicates the specific protein chain in the complex that contains the Enzyme Commission (EC) number.

     Predicted Ligand Binding Sites

    Template proteins with similar binding site:

    Click
    to view
    RankPDB
    Hit
    TM-scoreRMSDaIDENaCov.Download
    Complex
    Lig. Name
    1 5g5g 0.993 0.33 1.000 0.994 Download ACT C L CSD FAD FES GOL IOD MCN MOS SF4
    2 5y6q 0.937 2.25 0.420 0.967 Download FAD FES GOL IPA MCN MOS SF4 SO4
    3 1rm6 0.908 2.72 0.280 0.952 Download C L EPE FAD FES N A PCD SF4
    4 1sb3 0.907 2.76 0.290 0.952 Download EPE FAD FES PCD SF4
    5 1t3q 0.896 2.58 0.260 0.934 Download FAD FES GOL MCN SMO SO4
    6 1n62 0.888 2.78 0.250 0.931 Download CUB FAD FES MCN PO4
    7 1n60 0.888 2.78 0.250 0.931 Download FAD FES MCN OMO PO4
    8 1n61 0.887 2.83 0.250 0.931 Download CUN FAD FES MCN PO4
    9 1n63 0.886 2.83 0.250 0.931 Download CUN FAD FES MCN PO4
    10 1zxi 0.886 2.85 0.250 0.931 Download C U CUM FAD FES MCN PO4

    Note: If the JSmol model is not visible, please refresh the page or click the radio buttons

      Click on the PDB hit radio buttons to visualize all predicted binding ligands.
      Click on the Lig Name radio buttons to visualize selected binding ligand and residues.
    (a)TM-score is a measure of global structural similarity between query and template protein.
    (b)RMSDa the RMSD between residues that are structurally aligned by US-align.
    (c)IDENa is the percentage sequence identity in the structurally aligned region.
    (d)Cov. represents the coverage of global structural alignment and is equal to the number of structurally aligned residues divided by length of the query protein.
    Download full result of the above ligand binding prediction.


    [Back to server]
    Reference:
  • Wei Zheng, Qiqige Wuyun, Yang Li, Chengxin Zhang, P Lydia Freddolino, Yang Zhang. Improving deep learning protein monomer and complex structure prediction using DeepMSA2 with huge metagenomics data. Nature Methods, (2024). https://doi.org/10.1038/s41592-023-02130-4.
  • Wei Zheng, Quancheng Liu, Qiqige Wuyun, P. Lydia Freddolino, Yang Zhang. DMFold: A deep learning platform for protein complex structure and function predictions based on DeepMSA2. In preparation.
  • Wei Zheng, Qiqige Wuyun, Peter L Freddolino, Yang Zhang. Integrating deep learning, threading alignments, and a multi-MSA strategy for high-quality protein monomer and complex structure prediction in CASP15. 1-20. Proteins. (2023). doi:10.1002/prot.26585.
  • yangzhanglabumich.edu | (734) 647-1549 | 100 Washtenaw Avenue, Ann Arbor, MI 48109-2218