valdeanda commented on August 16, 2024

Hi Greg,
Yes, thanks for asking.
We are working on version 1.2, which will have the -custom option.
With this option, MEBS will download the Pfam database so that you can add any Pfams you want to the mapping file.

For example, if you want to analyze only AmoA from archaea, I recommend modifying the pfam2kegg.tab file in the custom directory as follows:

PFAM KO PATHWAY PATHWAY NAME
PF12942 1 Ammonia monoxygenase Archaea AmoABC
PF04744 1 Ammonia monoxygenase Archaea AmoABC
PF04896 1 Ammonia monoxygenase Archaea AmoABC
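
As a minimal sketch, a mapping file like the one above could be generated with a few lines of Python. This is an illustration only, not part of MEBS: the function name `write_custom_mapping` is hypothetical, and the layout (tab-separated, four columns, KO left empty) is an assumption inferred from the header row shown here. Check it against the pfam2kegg.tab shipped with your MEBS version before relying on it.

```python
import csv

# Column names copied from the header row quoted in this thread.
HEADER = ["PFAM", "KO", "PATHWAY", "PATHWAY NAME"]

# Pfam accessions listed above for archaeal AmoABC.
ARCHAEAL_AMO_PFAMS = ["PF12942", "PF04744", "PF04896"]


def write_custom_mapping(path, pfam_ids, pathway_id, pathway_name, ko=""):
    """Write one mapping row per Pfam accession to a tab-separated file.

    Assumption: the file is tab-delimited with the four columns in HEADER,
    and the KO column may be left empty.
    """
    with open(path, "w", newline="") as handle:
        writer = csv.writer(handle, delimiter="\t", lineterminator="\n")
        writer.writerow(HEADER)
        for pfam in pfam_ids:
            writer.writerow([pfam, ko, pathway_id, pathway_name])


if __name__ == "__main__":
    # Hypothetical output location; adjust to your custom directory.
    write_custom_mapping(
        "pfam2kegg.tab",
        ARCHAEAL_AMO_PFAMS,
        1,
        "Ammonia monoxygenase Archaea AmoABC",
    )
```

Generating the file this way keeps real tabs between the columns, which are easy to lose when copying the rows from a web page.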

However, the nitrogen cycle already has the Archaea AmoABC as pathway 26: https://github.com/eead-csic-compbio/metagenome_Pfam_score/blob/master/cycles/nitrogen/pfam2kegg.tab

Be aware that the custom option will be useful for computing the completeness of those pathways but not the score; the score has to be computed using the advanced mode.

As soon as the -custom option is implemented I will let you know. Meanwhile, you can try to focus only on N pathway 26 and see if that works for you.
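
One way to focus on pathway 26 in the meantime is to pull only its rows out of the nitrogen mapping file. The sketch below is a hedged example, not a MEBS feature: `filter_pathway` is a hypothetical helper, and it assumes the file is tab-separated, starts with a header line, and holds the pathway number in the third column. Verify those assumptions against the actual pfam2kegg.tab first.

```python
import csv


def filter_pathway(src, dst, pathway_id, pathway_col=2):
    """Copy the header plus the rows of one pathway from src to dst.

    Assumptions: tab-delimited input, first line is a header, and the
    pathway number sits in column index `pathway_col` (third column).
    Returns the number of data rows kept.
    """
    kept = 0
    with open(src, newline="") as fin, open(dst, "w", newline="") as fout:
        reader = csv.reader(fin, delimiter="\t")
        writer = csv.writer(fout, delimiter="\t", lineterminator="\n")
        header = next(reader, None)
        if header is not None:
            writer.writerow(header)
        for row in reader:
            # Skip short or blank rows instead of failing on them.
            if len(row) > pathway_col and row[pathway_col].strip() == str(pathway_id):
                writer.writerow(row)
                kept += 1
    return kept


if __name__ == "__main__":
    # Hypothetical file names; point src at your copy of the nitrogen mapping.
    n = filter_pathway("pfam2kegg.tab", "pfam2kegg.pathway26.tab", 26)
    print(f"kept {n} rows for pathway 26")
```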
Thanks
Val

from metagenome_pfam_score.

michoug commented on August 16, 2024

Hi Val,
Thank you for the answer; that will indeed be very useful.
My issue for now is that the amoA gene that you chose for archaea (I found only one) doesn't appear to be a BLAST match to one of the main taxonomic groups possessing this gene in the archaeal domain, namely Nitrosopumilus.
Best
Greg

valdeanda commented on August 16, 2024

michoug commented on August 16, 2024

Hi Val,
So it's probably a confusion on my part: the my_Pfam.nitrogen.hmm file contains the ones that I'm looking for. However, in nitrogen.fasta the amoA gene for archaea (tr|A0A023Q3R5|A0A023Q3R5_9ARCH Ammonia monooxygenase (Fragment) OS=uncultured archaeon GN=amoA PE=4 SV=1) doesn't BLAST to the main Nitrosopumilus that I'm looking for, hence my confusion.
Would it be possible to clarify the role of nitrogen.fasta in the analysis, if any? It's not so clear to me.
Thanks for your help
Best
Greg

valdeanda commented on August 16, 2024

Hi Greg,
Which protein family exactly are you looking for? The FASTA file of each cycle contains representative sequences, which are ultimately used to obtain the protein families (Pfams) and then to compute the relative entropy and the score. If you are not interested in the score, use the custom option with the protein family that you want to analyze. It doesn't matter if it is not in the FASTA file, because MEBS will search all the protein families in the Pfam database and display only those in your mapping file.
Let me know if that was helpful.
P.S. Stage 1 of the MEBS paper describes the annotation of the sulfur genes; the paper for the rest of the cycles is not ready yet. :S https://academic.oup.com/gigascience/article/6/11/gix096/4561660
I can give you more information if needed.

Best
Val

vrou1995 commented on August 16, 2024

Hi Val,

I was looking for the v1.2 version to install so that I could use the custom option but I've had no luck so far. Could you point me in the right direction?

Many thanks,

Vincent

valdeanda commented on August 16, 2024
