Git Product home page Git Product logo

Comments (2)

DerrickWood avatar DerrickWood commented on July 29, 2024

Multi-fasta files weren't originally supported because Kraken used to use a file that mapped files to taxa. (This made sense when working only with completed microbial genomes, but has since been shown to be problematic.) One of the component programs, set_lcas, would then open each file one at a time, and use the map information to assign correct taxonomic information to the file's k-mers. The set_lcas program actually supported multi-fasta files for quite a while, but the scripts and changes to other programs to support those files weren't present until now.

As of the most recent commit, adding multi-fasta files to a DB should no longer require many, many files in the filesystem, and also will not utilize the clumsy .ffn hack that just confused matters and ate up disk space unreasonably.

from kraken.

cjfields avatar cjfields commented on July 29, 2024

Thanks @DerrickWood . That helps tremendously.

from kraken.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.