Comments (5)
@felixpetschko, since Francesca mentioned you have been playing around with metrics as well, maybe this is something you would find interesting. In practice, the TCRdist metric is more relevant than Levenshtein distance. It's somewhat comparable to the alignment distance that Tobias has been working on, but might be even faster because it simplifies the alignment problem.
The first step would be to get the code from nb_metrics.py to work inside scirpy. As a next step, it would be interesting to see whether this works on GPU using numba's GPU features (or any other framework, as you like).
I'll provide feedback on the clonotype clustering later -- which is also still the priority right now. If we can't speed that up, speeding up the distance metrics is in vain.
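To illustrate what "simplifies the alignment problem" can mean, here is a minimal sketch of a single-gap distance in plain Python. It is not the code from nb_metrics.py and not scirpy's implementation: the function name `single_gap_distance`, the flat `mismatch_cost`, and the `gap_penalty` default are assumptions made for illustration. The real metric scores substitutions with a matrix rather than a flat cost. The idea shown is that, instead of a full dynamic-programming alignment, one contiguous gap is tried at each position of the shorter sequence and the cheapest placement wins.

```python
def single_gap_distance(seq1, seq2, mismatch_cost=4, gap_penalty=4):
    """Toy single-gap distance between two sequences (illustrative only).

    Assumptions: a flat mismatch cost stands in for a substitution matrix,
    and every gap position is tried without trimming the sequence ends.
    """
    short, long_ = sorted((seq1, seq2), key=len)
    len_diff = len(long_) - len(short)

    best = None
    for gappos in range(len(short) + 1):
        # Residues before the gap are compared from the left end.
        cost = sum(
            mismatch_cost
            for a, b in zip(short[:gappos], long_[:gappos])
            if a != b
        )
        # Residues after the gap are compared from the right end.
        n_right = len(short) - gappos
        cost += sum(
            mismatch_cost
            for a, b in zip(short[gappos:], long_[len(long_) - n_right:])
            if a != b
        )
        if best is None or cost < best:
            best = cost

    # Each missing residue in the shorter sequence is charged once.
    return best + len_diff * gap_penalty


if __name__ == "__main__":
    print(single_gap_distance("CASSL", "CASSL"))
    print(single_gap_distance("CASSL", "CASRL"))
    print(single_gap_distance("CASSL", "CASSGL"))
```

For equal-length sequences this reduces to a weighted Hamming distance. The loop over gap positions is linear in the sequence length, which is what makes this kind of metric a candidate for numba or GPU acceleration.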
from scirpy.
Hi @grst, is anyone working on implementing this at the moment? If not, I would be interested in having a crack at it.
I think @felixpetschko has this on his list -- Felix, is that still the plan?
@grst @ShihanL Yes, I am working on this :)
Closed in #502.
This is in the main branch and will be part of the next release.
@ShihanL, if you want to give it a try already, you can install the development version using
pip install git+https://github.com/scverse/scirpy.git@main