Comments (3)
I was facing the same problem, it's hardcoded that there's a "|" separation in reference names and that the 4th field when you do line.split("|")[3]
returns the accession number.
You have two choices, change your database sequence names to fit that criterion, or change line 106 so it returns the accession number
from vapid.
Hi! Sorry for missing the first post and thanks for the solution posted above!
You can also use the —f flag to specify a specific reference file that you would like to use. So in this case you could download NC_045512.2.gbf once and then use the —f flag to always annotate off that file. This completely skips the blast search and is a good option once you know the reference you’re using works well for your files.
How exactly did you create your reference database? Could you upload it? I’d like to take a look and ensure that people can easily make their own reference databases and use them without modifying the code. I’m glad this tool seems like it would be useful and I want to make sure it works for everyone!
R
from vapid.
Hey, thanks for the response.
I think the problem is easy to fix, it's just that in the database you have, the references' names have a |
separation and the 3rd field (0 index) is the accession number, however, if someone builds their own database (Like I did where I replaced all the NCBI viral reference already in all_viruses with a newer version and added covid, I downloaded the viral.1.1.genomic from NCBI, and the names didn't have |
separation anymore and the code breaks), so the problem is just in the naming.
However, I did encounter some minor problems in the code and fixed them. I will try to fix a couple of things and send you a pull request, then if you think the fixes make sense, feel free to merge then :)
I'll try to do it in the near future :)
from vapid.
Related Issues (14)
- VAPiD not processing final sequence (of 5) and no sequences yield .sqn files HOT 5
- Is VAPiD suitble for metatranscriptome virus? HOT 1
- Spliced genes + ncRNAs HOT 1
- Please help, I have problems HOT 4
- running VAPiD throws IndexError: list index out of range HOT 3
- nboundLocalError: local variable 'name_of_virus' referenced before assignment for Insect virus HOT 3
- VAPID run error HOT 2
- Issues with final genbank-format file HOT 4
- running VAPiD throws IndexError: list index out of range - v2
- Create config.yaml for easy conda install
- adjust() does not work properly HOT 1
- UnboundLocalError: local variable 'ref_seq_gb' referenced before assignment HOT 1
- How similar do inputs need to be? HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vapid.