Comments (8)

moonwatcher commented on May 28, 2024

I'll need more details, but is it possible that one read fails decoding and either filter outgoing qc fail or filter incoming qc fail is on?

from pheniqs.

hukai916 commented on May 28, 2024

What info do you need? I am attaching my test data and code here:
https://www.dropbox.com/sh/6idqunobck72q5j/AAClfpkwhusMVDPNx6HU7wSYa?dl=0

Note that bam2 is missing the last record, whereas bam1 and bam3 are correct.
Let me know if you need other info. Thanks!

moonwatcher commented on May 28, 2024

@hukai916 So all the BAM files in the Dropbox folder are outputs from the CRAM file? Can you please post the JSON config file you use and tell me which version of pheniqs you are using?

log_decode2.txt seems to be the one showing a missing record: 450000 are reported in incoming and only 449999 are reported in sample. That it is the last record does point to some thread synchronization issue, but obviously an extremely rare one, since I have never encountered it. How many cores is this running on?
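
One way to cross-check the counts independently of the log is to count the alignment records in each output file. The following is a minimal sketch, assuming the outputs are plain-text SAM; the function names are hypothetical and not part of pheniqs. It counts SAM lines, so the numbers may be in different units than the read counts in the log (for example when each read has several segments).

```python
import sys


def count_sam_records(path):
    """Count alignment records in a plain-text SAM file.

    Header lines start with '@'; every other non-empty line is one record.
    """
    count = 0
    with open(path, "r", encoding="utf-8", errors="replace") as handle:
        for line in handle:
            if line.strip() and not line.startswith("@"):
                count += 1
    return count


def compare_outputs(paths):
    """Return per-file record counts and the files that fall short of the maximum."""
    counts = {path: count_sam_records(path) for path in paths}
    expected = max(counts.values())
    short = {path: n for path, n in counts.items() if n < expected}
    return counts, short


if __name__ == "__main__":
    # Usage: python count_records.py FILE [FILE ...]
    all_counts, short_files = compare_outputs(sys.argv[1:])
    for name, n in all_counts.items():
        flag = "  <- fewer records than the largest output" if name in short_files else ""
        print(f"{name}\t{n}{flag}")
```

Running it over the outputs of several repeated runs would flag any file that has fewer records than the largest one.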

hukai916 commented on May 28, 2024

Hi, all bams are created from the same cram using the same pheniqs command. The missing-last-record cases occur occasionally, roughly 10%-20%, no matter how many cores I use. I will give more info after the holidays.

moonwatcher commented on May 28, 2024

Can you please post pbmc_500_5p_3_aa.json? I will try to replicate it.

L.

hukai916 commented on May 28, 2024

Hi L.,
I have uploaded the json file to the same folder.
The Pheniqs I used is: pheniqs version 2.1.0 @moonwatcher

--Kai

moonwatcher commented on May 28, 2024

Hello Kai

Just a quick update: I have been trying to reproduce this but have so far been unsuccessful. I tried both with a build of pheniqs 2.1.0 and with head.

A side comment: with an extension of ".bam1", pheniqs does not detect that you actually wanted BAM encoding, so it reverts to the default SAM output. This is why the files you provided are bigger than I expected: they are actually uncompressed, plain text in SAM format. You can override the default output format with --format bam.
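
A quick way to confirm what encoding a file really has, regardless of its extension, is to look at its first bytes. This is a minimal sketch using only the Python standard library; the function name is hypothetical. It relies on BAM being a gzip-compatible (BGZF) stream whose decompressed data starts with `BAM\1`, CRAM files starting with `CRAM`, and SAM headers starting with `@`.

```python
import gzip
import sys


def sniff_alignment_format(path):
    """Guess the encoding of an alignment file from its leading bytes."""
    with open(path, "rb") as handle:
        head = handle.read(4)
    if head[:2] == b"\x1f\x8b":
        # gzip/BGZF container: a BAM stream decompresses to b"BAM\x01..."
        with gzip.open(path, "rb") as handle:
            magic = handle.read(4)
        return "bam" if magic == b"BAM\x01" else "gzip (not BAM)"
    if head == b"CRAM":
        return "cram"
    if head[:1] == b"@":
        return "sam (plain text with header)"
    return "unknown (possibly headerless SAM)"


if __name__ == "__main__":
    # Usage: python sniff_format.py FILE [FILE ...]
    for name in sys.argv[1:]:
        print(f"{name}\t{sniff_alignment_format(name)}")
```

A file named with a ".bam1" extension that reports as plain-text SAM here would be consistent with the fallback to the default output format described above.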

Are you observing this on macOS or Linux? What exact platform? It might also be useful to know which dependencies are installed. Did you build the binary yourself or install it from Conda?

It is possible this is related to one of the dependencies. Since this is so tricky to replicate, would you be willing to test on a custom build? I can provide instructions if you are interested.

hukai916 commented on May 28, 2024

Thanks L.,

I installed Pheniqs with conda in an Ubuntu Xenial container (Docker). The container contains solely miniconda, Pheniqs, pysam, and SAMtools. The container can be pulled with:

docker pull hukai916/pheniqs_xenial:0.2

I can perform more tests; please share instructions. Thanks!

--Kai
