Dear zhangyang laboratory:
Hello! I am a student who is new to bioinformatics, and I am studying the thesis of ResPRE recently. ResPRE uses the coupling precision matrix to achieve highaccuracy protein contact prediction. It has achieved excellent results. Is an outstanding work.I sincerely hope that some questions about this paper can be answered.
Question 1:
The model in ResPRE is trained through the Pytorch framework. So in the process of using Pytorch to train, how are the two functions __getitem__() and __len__() implemented in the DataLoader class when loading the data set? Can you provide the source code to learn?
Question 2:
In the training process, when loading data, is a sequence as input data for training or residue pairs as input data?
I have some questions about ResPRE
Moderator: robpearc
I have some questions about ResPRE
 Attachments

 2224.png (231.56 KiB) Viewed 1363 times

 115.png (17.9 KiB) Viewed 1363 times
Re: I have some questions about ResPRE
Hello,
Thank you for your interest in ResPRE!
1) The len() function will internally call __len__ to return the length of the object.
Our implementation is:
def __len__(self):
return self.length
The __getitem__ function will vary depending on your data loader implementation. We have not released our ResPRE training script; however there should be many examples online that you can find to build your own implementation of a data loader. Essentially, the function will load the true contact map and any precomputed features such as the precision matrices and Potts Model for a training protein, which are stored in a python dictionary. The function returns the dictionary of features and ground truth contact map.
2) The sequence is used along with other features, including the pairwise features such as the Potts model and precision matrices.
Thank you for your interest in ResPRE!
1) The len() function will internally call __len__ to return the length of the object.
Our implementation is:
def __len__(self):
return self.length
The __getitem__ function will vary depending on your data loader implementation. We have not released our ResPRE training script; however there should be many examples online that you can find to build your own implementation of a data loader. Essentially, the function will load the true contact map and any precomputed features such as the precision matrices and Potts Model for a training protein, which are stored in a python dictionary. The function returns the dictionary of features and ground truth contact map.
2) The sequence is used along with other features, including the pairwise features such as the Potts model and precision matrices.
Re: I have some questions about ResPRE
Sincerely thank you for your detailed answer. Thank you very much!
In addition, I have another question.
Q：Does the process of training ResPRE use supervised learning or unsupervised learning? If it is supervised learning, the sequence needs to be labeled before training. So what is this label?
In addition, I have another question.
Q：Does the process of training ResPRE use supervised learning or unsupervised learning? If it is supervised learning, the sequence needs to be labeled before training. So what is this label?
Re: I have some questions about ResPRE
We used supervised machine learning to train ResPRE models for contact map predictioon. The supervised information should be the contact maps (binary matrix, 1 for being in contact) from pdb structures.
Re: I have some questions about ResPRE
Thank you very much! I figured out the problem.
Re: I have some questions about ResPRE
The answer seems very much helpful for me as well. Thanks a million.robpearc wrote: ↑Tue May 25, 2021 11:48 am Hello,
Thank you for your interest in ResPRE!
1) The len() function will internally call __len__ to return the length of the object.
Our implementation is:
def __len__(self):
return self.length
The __getitem__ function will vary depending on your data loader implementation. We have not released our ResPRE training script; however there should be many examples online that you can find to build your own implementation of a data loader. Essentially, the function will load the true contact map and any precomputed features such as the precision matrices and Potts Model for a training protein, which are stored in a python dictionary. The function returns the dictionary of features and ground truth contact map.
2) The sequence is used along with other features, including the pairwise features such as the Potts model and precision matrices.