Comments (4)
Hi Yuta,
Thanks, very nice of you to say!
I had indeed seen tidytcells, which looks great. I'm open to seeing how this could integrate - maybe I'll reach out to discuss how best this could work. Either way I'm very happy pointing that out in the stitchr docs, as I've definitely had people hit stitchr errors due to formatting errors in their input allele names!
from stitchr.
Amazing!
In terms of integration I had just thought that when the input species was either human or mouse, that tidytcells could check for IMGT compliance automatically, and either attempt auto-correction, or notify the user if the input symbol is not recoverable. But I also appreciate that having a dependency on a small project like tidytcells is a point of failure which you might feel weary of.
In either case, maybe a mention in the docs could potentially help out some future users with messy data that they want to put through stitchr. 💪🏼 If you're open to it, I can make some time in the near future to add a tiny mention in the docs and submit a pull request, if that makes life easier for you?
from stitchr.
I'm actually working on some minor tweaks at the moment, including the docs, so I can add in the suggestion no problem.
The only thing that makes me a little nervous about instituting an automatic check is that it might not cope so well with odd/uncommon/non-natural sequences (which are things that I often put through myself, and which did upset things in earlier test versions). For example I've recently been playing with some funny ORF alleles, like the human TRBV7-3*03, which uses Arg instead of the usual conserved second Cys, which I think tidytcells
might break the sequences of. Hence generally I've mostly made stitchr
to work under the assumption that users are best placed to get their own data sorted upstream, and just instead raise warnings if it thinks something might have gone awry.
However I can certainly imagine that many users won't care about those edge cases, especially when trying to adapt large datasets. I think maybe it could be included as an optional check, triggered via a flag when running. Perhaps even separate flags for tidytcells.tr.standardize
and tidytcells.junction.standardize
, to allow users a bit more flexibility.
from stitchr.
Thanks for adding a note in the docs! I don't know if my email reply to you came through but I see what you mean about edge cases and that perhaps best practice is to have standardisation and stitching be two separate steps. Anyways I hope our tools combined can have a synergistic effect and help some TCR people out there :).
from stitchr.
Related Issues (20)
- Simplifying output / silent mode HOT 1
- J/C region broken in certain genes in skip/extra gene mode HOT 1
- Lower case CDR3 amino acid characters cause an error HOT 1
- TRDV2 error HOT 2
- Possible to use custom species (non-human/non-mouse)? HOT 1
- Alternative non-templated codon usage?
- Improve % of perfectly replicated CDR3s when using a NT CDR3
- Add -cite option to stitchr
- thimble function gives an error for LEADER sequence when stitching TCR HOT 3
- Compatibility of stitchr with Windows HOT 5
- CDR1/2 HOT 2
- Wild card usage in Thimble ignores extra genes (-xg/additional-genes.fasta)
- First example at https://jamieheather.github.io/stitchr/installation.html not working as expected HOT 2
- Importing stitchr for use in other scripts - obtain stitched aa sequence without C region information HOT 5
- Specify databases location HOT 5
- Add option to read FASTA automatically into additional-genes.fasta
- Add more details to docs about different error/warning messages HOT 1
- TRBD gene HOT 2
- A issue about stitching immunoglobulins HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from stitchr.