
Comments (5)

drpatelh commented on July 28, 2024 1

Good question, @sruthipsuresh! Yes, Preseq hasn't been updated in quite a while now and I doubt it will be anytime soon...

Does it fail for both single-end and paired-end data? Given the issues you mentioned, I think it would be good enough to get it passing with single-end data only. We may need to change the test data for this.

from modules.

edmundmiller commented on July 28, 2024 1

I think that's a good option. I couldn't find any test data in their repo last night, so good job!

Ah, this was just what I was looking for too! https://www.nextflow.io/docs/edge/script.html#http-ftp-files

You should be able to use something like https://github.com/smithlabcode/preseq/raw/master/data/SRR1003759_5M_subset.mr for each of the input files.

So:

mr = file('https://github.com/smithlabcode/preseq/raw/master/data/SRR1003759_5M_subset.mr')
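
A slightly fuller sketch of the same idea, assuming the HTTP support described in the Nextflow docs linked above. The `println` line is only an illustrative check and is not part of the module's actual test:

```nextflow
// URL taken from the comment above. Nextflow treats the remote file
// like a local path and handles the download when the file is used.
mr = file('https://github.com/smithlabcode/preseq/raw/master/data/SRR1003759_5M_subset.mr')

// Illustrative only: show which input the test would use.
println "Test input: ${mr}"
```

This keeps the test data in the preseq repo instead of copying it into the modules repo.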


sruthipsuresh commented on July 28, 2024 1

I'll do that instead of adding the file to the input folder directly! Thank you!


sruthipsuresh commented on July 28, 2024

For the preseq module, would we need a different input file?
When testing it, preseq fails with the test files we have in both BED and BAM format, as they have low read counts.

From the user manual:
preseq lc_extrap can fail to estimate the curve if there are not enough reads in the
aligned bam file for the calculation. The error reported is:
ERROR: max count before zero is less than min required count (4), sample not sufficiently deep or duplicates removed

Additionally, there's an error in the module: the $bam file is not preceded by the -bam flag, which is needed because preseq expects a BED file by default.
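
A minimal sketch of what the fixed command could look like inside a process. The process name, the output file name, and the simplified inputs are hypothetical and are not copied from the actual module:

```nextflow
// Hypothetical process, for illustration only.
process PRESEQ_LCEXTRAP_SKETCH {
    input:
    path bam

    output:
    path '*.ccurve.txt'

    script:
    // -bam tells preseq the input is BAM instead of the default BED.
    """
    preseq lc_extrap -bam -o ${bam.baseName}.ccurve.txt ${bam}
    """
}
```

Note that this only fixes the input format. It would still hit the "max count before zero" error on the current low-depth test files.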


sruthipsuresh commented on July 28, 2024

It fails for both types of data, unfortunately. I also tried creating a sorted BED file from the test BAM files that we already have (as described here), but that fails as well.

The single-end test does work with this sample file from the preseq repo. Should this be used as the test data instead?
