Git Product home page Git Product logo

Comments (4)

Roche78 avatar Roche78 commented on May 27, 2024

We ensured that all competing methods are run with default parameter setting according to their published papers. For DMPfold, we used the target sequences to generate final models as done in the original paper. Not sure of your benchmark, but we have made all input and output data openly available to fully reproduce DConStruct results. And one can always download the competing methods from their published papers to run in their original form with default parameter setting for reproducibility.

from dconstruct.

kad-ecoli avatar kad-ecoli commented on May 27, 2024

Based on what you describe, different MSAs were used for DMPfold and for DConStruct. DMPfold benchmark uses its default MSA (hhblits+uniclust30). DConStruct benchmark uses DeepMSA. This is not clearly explained in the manuscript, which explicitly states that DMPfold was run with DeepMSA ("We install and run DMPfold locally to predict distance histogram maps directly from the multiple sequence alignments (MSA) [34]").

If that is the case, the performance advantage of DConStruct compared to DMPFold shown in the manuscript is likely caused by better MSA used by DConStruct. This does not indicate that compare to CNS (used by DMPFold), DConStruct has a better ability to fold the protein given the same distance prediction.

from dconstruct.

Roche78 avatar Roche78 commented on May 27, 2024

We never "explicitly" state that "DMPfold was run with DeepMSA" in our manuscript. It is interesting to note your incorrect presumptions and the resulting unfounded conclusions, indicating an apparent bias for DeepMSA [1].

Reference:

[1] Chengxin Zhang, Wei Zheng, S. M. Mortuza, Yang Li, Yang Zhang. β€œDeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins.” Bioinformatics, 36 (7): 2105-2112 (2020).

from dconstruct.

kad-ecoli avatar kad-ecoli commented on May 27, 2024

The sentence I quoted is from section "DConStruct: hybridized distance- and contact-based hierarchical structure modeling" of the DConStruct manuscript.

We install and run DMPfold locally to predict distance histogram maps directly from the multiple sequence alignments (MSA) [34]

[34] Zhang C, Zheng W, Mortuza SM, Li Y, Zhang Y. DeepMSA: constructing deep multiple sequence alignment to improve contact prediction and fold-recognition for distant-homology proteins. Bioinformatics. 2020;36: 2105–2112. doi:10.1093/bioinformatics/btz863

A similar sentence is from section "Test datasets and programs to compare" of the DConsStruct manuscript.

We feed the multiple sequence alignments (MSAs) [34] to DMPfold and obtain the predicted initial distance histograms (rawdispred.current files) containing 20 distance bins with their associated likelihoods

In any case, even if my understanding is incorrect, could you explain the following questions:

  1. What is the MSA used by "DConsStruct" method in Table 3? Please kindly provide the command line, which is not included in the github repo of DConsStruct.
  2. What is the MSA used by "DMPFold" method in Table 3? Again, please kindly provide the command line, as DMPfold can accept arbitrary user input MSA despite having its built-in MSA generation pipeline.
  3. Could you provide the link to the MSA files used herein? If the files are too big for all dataset, you can just provide the MSAs for the 40 CASP FM targets.

from dconstruct.

Related Issues (2)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.