Comments (4)
Hi Antonio,
You can do something like this:
mmseqs cluster db clu tmp
mmseqs align db db clu aln -a
mmseqs createtsv db db aln aln.tsv
The -a
parameter computes the whole backtrace, you can also use --alignment-mode 3
, but that computes the backtrace anyway since it needs it for the seq. id.
--alignment-mode
1 and 2 only approximate the sequence identity.
Thank you for the feedback! We will add this to the FAQ and also discuss if it make sense to provide the same output also for Uniclust in future releases.
from mmseqs2.
Thank you very much! I will give it a try tonight and be back to you.
We will add this to the FAQ and also discuss if it make sense to provide the same output also for Uniclust in future releases.
In our case, this information will help us to evaluate the quality of the clusters.
Cheers
Antonio
from mmseqs2.
One more possible problem: the default --max-seqs
is too low for this task. You should set it to some very large number (--max-seqs 2147483647
). So you will get every sequence identity of every cluster member.
from mmseqs2.
Hi Milot,
I had a quick look to the results and seems that it work perfectly. I will have a more thorough look to the results in case we are missing any pair.
Many thanks
Cheers
Antonio
from mmseqs2.
Related Issues (20)
- .
- Error: Prefilter died HOT 1
- Difference in evalue when doijng search against swissprot_human and swissprot HOT 2
- Unable to find exact match 20 bp sequence HOT 1
- convertails getting killed due to the memory HOT 1
- MMseqs cluster : Invalid database read ... Error: Offset step died HOT 1
- Is it reasonable to use LINCLUST with --Kmer-Per-Seq 2000? HOT 4
- Null bite '^@' in convertalis results
- ARM64 installation issue - M3 chip MacBook Pro HOT 5
- Trivial case of clustering one sequence is too slow
- mmseqs as a replacement for cd-hit-est HOT 2
- Sequence Similarity Network Generation
- Prefilter (second round) dies during taxonomic classification with UniRef90 DB HOT 6
- Can mmseqs easy-search ouput all query-target pairs? HOT 2
- Does input FASTA file have to be aligned? HOT 2
- MMseqs search not finding exact and close-exact hits HOT 1
- Use MMSeqs to Output full MSA HOT 1
- .dbtype already exists error when clustering using profiles HOT 1
- SAM file formatting HOT 7
- result2msa output change HOT 4
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mmseqs2.