Comments (5)
Hi Jesse,
The logical flow of all those checks is confusing but I think this is working as intended: without a bed file (whether or not you have a sites file), you'll need to specify a window size. Otherwise, there will be no way for pixy to know the intervals over which to calculate your summary stats. So, if you want a window size of 1, you should specify --window_size 1 (as you did!).
So then on to the next problem, your pandas error. I can't reproduce that on my end, can you post your chr4_gene_locations.txt? That error might also be from your populations.txt file. Have a look at those two files, make sure they are valid tab-separated files etc. (or post them here). You could also try rerunning with the --debug flag to get a traceback of the pandas error.
Let me know how it goes!
from pixy.
Hi,
I checked the populations file and sites_file and verified that they were tab delimited. I'm including them here at this link: https://drive.google.com/drive/folders/1ex8zMsylNyIfuIHuX3Uuq0ORUnxI8Oj7?usp=sharing . Thanks for your help!
from pixy.
Hi Jesse,
Thanks for sending me your data! Interestingly, I wasn't able to reproduce your error. The calculations were slow (single-site mode is still very slow), but they did complete (let me know if you'd like the output file). While I was at it, I added some new optimizations that will speed this type of analysis up in the future.
Re: your error, a few questions:
- What OS are you running pixy on?
- There is an extra tab at the end of the second line of your populations.txt file. It didn't seem to affect anything for me, but I wonder if this might be connected to the problem.
- Can you re-run your analysis with the --debug flag? and paste the output here?
from pixy.
Just following up here, once your input file issue is resolved, it would probably be worth updating to the new version 1.2.5.beta1 on conda. The single sites + sites file combination you are doing is much faster in the new version.
from pixy.
Updating my pixy to the latest on conda seemed to fix everything! I can now use the sites_file argument.
from pixy.
Related Issues (20)
- Hello, While running the pixy software after converting vcf file with invariant sites using bcftools, I am getting the following error:UnicodeDecodeError: 'utf-8' codec cant decode byte 0x8b in position 1: invalid start byte; Could you please help me to resolve this error? HOT 4
- attribute error HOT 4
- Support for New Missing Data Formatting from GATK
- ability to handle sex chromosomes HOT 2
- Attribute Error During Help Menu Call HOT 5
- No variable sites HOT 3
- Version of numpy and htslib doesn't work HOT 2
- Why are the results so different when using pixy and vcftools HOT 5
- pixy (apparently) not using all requested cores
- Provide convenient sample data in a package, such as sample data CHrX in sample scripts
- Error during install from conda HOT 1
- ValueError: invalid literal for int() with base 10: '' HOT 2
- Overestimation of number differences? HOT 1
- "the provided VCF appears to contain no invariant sites" HOT 2
- installation error HOT 4
- The value of Fst is negative HOT 2
- Dxy values interpretation? HOT 1
- tabix: error while loading shared libraries: libcrypto.so.1.0.0 HOT 5
- conda install issues with numpy and samtools HOT 1
- Installation issue with pixy HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pixy.