Git Product home page Git Product logo

Comments (4)

hyphaltip avatar hyphaltip commented on September 27, 2024

Dan it would be great to see some advancement here on the Fungi - we can collect the sets of examples and use the HMMs from the alignments we have but I am not sure if need to focus on clade-specific sets and iterate or if can be a single HMM sufficient. Would be happy to talk more about how we are approaching this - there are also some efforts from JGI To extract these regions through their pipelines so would be good to connect the dots.

from barrnap.

danwiththeplan avatar danwiththeplan commented on September 27, 2024

Hi, much appreciated for the response. Yes, I am still progressing this in collaboration with a colleague. I'd be keen to talk sometime about the specifics of what exactly you'd need to progress this. We have plenty of fungal genomes we can mine :)

from barrnap.

broganmcgreal avatar broganmcgreal commented on September 27, 2024

Hi, I'm Brogan McGreal a researcher at Plant and Food Research. Our team focuses on molecular plant pathogen interactions and I work with @danwiththeplan. We are currently looking to produce a fungal gene annotation pipeline using open source tools. We'd be happy to discuss what is required to get things working for fungi. I'm not sure what type of inputs you need and in what format i.e. do you need fungal genomes & aligned rna seq reads etc. As @danwiththeplan mentioned, we have plenty of fungal genomic and transcriptomic data.

from barrnap.

hyphaltip avatar hyphaltip commented on September 27, 2024

Hi if you are interested in fungal genome annotation I would first point to our project which has been successfully used for hundreds of genomes I've annotated and deposited and many others - https://github.com/nextgenusfs/funannotate/ - Jon has done an incredible job putting together the pipelines and several others are working on embedding it in snakemake or nextflow pipelines to enable more efficient annotation pipelines.

We also have generated and work on several thousand fungal genomes in our projects so it would be useful to explore what is helpful, my experience is the ribosomal loci are often missing from non-long read assemblies that do not account for the high coverage of the regions, so a combination of strategies are needed to actually get the ribosomal loci in an assembly depending on the ratio of their coverage to the nuclear genome and the parameters used.

I think mainly if the issue in this project is to better provide the HMMs to support the ribosomal locus and associated gene extractions it would make sense to match things up with what ITSx does to use the HMMs to find the boundaries and orient the ITS1 and ITS2 https://microbiology.se/software/itsx/.

from barrnap.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.