Comments (7)
We've already checked whether blastn is working properly. Long runtimes seems to be more related to strategy itself, so the issue is likely heading towards a software/strategy alternative, treating reads concatenation as a metagenome.
from genome-assembly-of-the-copepod-leptodiaptomus.
You can try using aligners for big sequence data such DIAMOND or LAST.
For more information on working with long-reads:
Huson, D. H., Albrecht, B., Bağcı, C., Bessarab, I., Gorska, A., Jolic, D., & Williams, R. B. (2018). MEGAN-LR: new algorithms allow accurate binning and easy interactive exploration of metagenomic long reads and contigs. Biology direct, 13(1), 6.
Bağcı, C., Beier, S., Górska, A., & Huson, D. H. (2019). Introduction to the analysis of environmental sequences: metagenomics with MEGAN. In Evolutionary Genomics (pp. 591-604). Humana, New York, NY.
from genome-assembly-of-the-copepod-leptodiaptomus.
Try Alignment-free methods:
https://genomebiology.biomedcentral.com/track/pdf/10.1186/s13059-017-1319-7 <- Tables 1 & 2
https://sourceforge.net/projects/rafts3/ <- Fast Blast alternative
from genome-assembly-of-the-copepod-leptodiaptomus.
For a review on how using metagenomic tools and construct metagenome-assembled genomes (MAGs):
Chen, L. X., Anantharaman, K., Shaiber, A., Eren, A. M., & Banfield, J. F. (2020). Accurate and complete genomes from metagenomes. Genome research, 30(3), 315-333.
from genome-assembly-of-the-copepod-leptodiaptomus.
Also you can give a look to this article:
Benchmarking the Minion: evaluating long reads for microbial profiling
DOI: 10.1038/s41598-020-61989-x
from genome-assembly-of-the-copepod-leptodiaptomus.
In this blog the author documents the steps he follows to Remove Microbial Contamination in Reads (bacteria, viral, fungi, protozoa, and archaea) in either short and long reads. He explains that for PacBio long reads, he didn’t find any tools specialized for that, so he tries several tools and concludes that minimap2 best met his needs.
https://yiweiniu.github.io/blog/2018/07/Remove-Contamination-of-Pokaryotic-Organisms-in-Reads/#more
Hope this could be useful.
from genome-assembly-of-the-copepod-leptodiaptomus.
It was concluded that the blast is very slow due to the large number of reads of my data. Unfortunately from my local computer I can't do it any faster. But my colleagues suggested different alternatives: (1) use alignments with reference genomes and extract the reads of interest, (2) treat the samples as metagenomes to identify the sequences of microorganisms and (3) extract the sequences of mitochondria to assemble and compare them with other species.
Due to the computational resources that I have now, I decided to focus on alternatives 1 and 3, in the first one I will probably lose coverage for assembly de novo but I hope to complete later with more sequences. And in the third I had not thought, but it will help me answer some of my biological questions and due to the amount of data (sequences), I can work better for now.
from genome-assembly-of-the-copepod-leptodiaptomus.
Related Issues (6)
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from genome-assembly-of-the-copepod-leptodiaptomus.