Comments (5)
Unfortunately, it is more complex, and the results depends on the rest of the sequences.
A counter-example: If the target/database sequence in the example had some extra non-matching sequence at the 3' end, the first match would be chosen. This is because these are global alignments and when a gap has to be opened at the 3' end anyway, it would rather extend that gap than opening an additional gap in 5' end, because the score would be better.
from vsearch.
tests added (frederic-mahe/vsearch-tests@b7beb67)
from vsearch.
@torognes it seems to me that the rule is to return the first perfect match found during backtracking? (so the rightmost one on the target sequence). If it is a permanent feature, I would like to mention it in the documentation. Is that ok for you?
Side question: I've been testing the same idea, but with matches on the minus strand of the target sequence. I've observed the same pattern (see frederic-mahe/vsearch-tests@70a6808), which suggests that when searching on the opposite strand, the query sequence is reverse-complemented and the target sequence is left unchanged. Is that correct?
from vsearch.
When aligning and backtracking, the code will always prefer to match the sequences than opening a gap, given that the scores are equal. It starts backtracking at the 3' ends of the sequences, and in the examples it will always start by matching the sequences at the 3' end.
from vsearch.
Thanks @torognes I've updated our test-suite (frederic-mahe/vsearch-tests@bae529a) with counter-examples. This is a complex behavior, and I don't see an easy way to describe it in the manpage, at least for now. That issue is complete and can be closed.
from vsearch.
Related Issues (20)
- compilation warning with ar: 'u' modifier ignored since 'D' is the default HOT 2
- sintax output is sometimes 4 columns and other times 5 columns HOT 3
- fastq_stripleft when the resulting length is null?
- forward read trimming and filtering (Minardi et al. 2021) HOT 1
- control of 2 separate randseed events in sintax HOT 4
- from fasta files to an OTU table HOT 1
- --uchime_denovo takes abundance information into account HOT 1
- how to detect matches containing many ambiguous symbols? HOT 1
- more compile-time checks HOT 2
- Issue encountered when using vsearch --usearch_global to generate OTU frequency table HOT 3
- clean-up stale branches HOT 1
- --makeudb_usearch truncates fasta headers HOT 3
- maxseqlength is not supported by uchime_denovo command HOT 6
- vsearch --usearch_global not showing "full alignment" instead only the segment pair HOT 3
- vsearch --top_hits_only --maxaccepts 1 returns sometimes 2 values HOT 6
- Issue related to usearch_global match HOT 4
- missing userfields options
- Consequences of using vsearch on NovaSeq data HOT 4
- Fix warnings reported by Lintian HOT 2
- Obtaining the expected error for each read HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from vsearch.