Comments (10)
Hello,
No sorry. I gave up trying to fix this. When needed, I do the refitting on 2013 signatures by creating a mutational catalog myself instead of using a vcf.
Please, let me know if you find a solution.
from sigprofilersinglesample.
The check_rules
option controls the application of the biological-based rules in the signature assignment process (as described in Extended Data Fig. 8b from Alexandrov et al. 2020 Nature). These rules are based on the COSMIC v3 reference mutational signatures, described in the same study and used as default by SigProfilerSingleSample. If you use a custom set of reference signatures, these rules cannot be used. As I mentioned, we are working on a major upgrade of the tool that will take this into account.
Happy to help and please reopen the issue if you have any other problems. Thanks!
from sigprofilersinglesample.
I seem to be having the same issue did you manage to fix this ?
from sigprofilersinglesample.
Hi Marozi2, I just started working with SigprofilerExtractor, I was wondering how to create a mutational catalogue of my samples? Could you help me out please.
Thanks in advance.
from sigprofilersinglesample.
Hi Sonia-Bedi, I'm not sure there is a public tool to convert a vcf in mutational catalog. I did it myself.
I start with the output vcf from my SNPs caller. I first isolate the SNPs and get the trinucleotide context for each SNPs. Then I make a file with 3 columns: Trinucleotide REF ALT. From that file, I remove indels then convert purines bases in pyrimidines because the tool take care only of the pyrimidines (well explained here). After that, I compute the number of each SNPs in each trinucleotide context and then it's a matter of formatting the file to have a proper mutational catalog for the tool.
from sigprofilersinglesample.
Thank you @Marozi2 for your reply.
This sounds like a big task. I am still in search of a way to transform my vcf file either into the acceptable format which SigProfilerExtractor accepts or transform it into mutational catalogue. If I dont find any solution, will just follow what you did.
from sigprofilersinglesample.
Hi @Marozi2 and @Sonia-Bedi,
You can generate a mutational catalog using your preferred reference genome by using SigProfilerMatrixGenerator. You can find the details in the corresponding README, as well as in this useful Wiki page.
On the other hand, regarding the refitting analysis with custom signature files that you mentioned @Marozi2, I would like to have a closer look into it. Could you please share the signatures file that you are using? Also, is this bug happening using both vcfs and a pandas dataframe for the input mutational catalog matrix?
Thanks a lot for your interest in our tool and sorry for the late reply.
Best,
Marcos
from sigprofilersinglesample.
Hello,
Sorry for the late response, I was away last two weeks.
Here is the file (signatures_forSigProfiler.txt) I use for refitting on 2013 signatures. I made it from this file (signatures.txt) I took from here ftp://ftp.sanger.ac.uk/pub/cancer/AlexandrovEtAl.
The bug happens when I give a vcf as input but it does not with a mutational catalog.
signatures_forSigProfiler.txt
signatures.txt
from sigprofilersinglesample.
Hello @Marozi2,
Please use the option check_rules=False
. That should solve your issue for now. Please let me know if that's not the case or if you have any other questions.
We are currently working on a major upgrade of the tool, where we will address this issue for sure. Thanks for letting us know.
By the way, you should download all the different versions of COSMIC signatures always from https://cancer.sanger.ac.uk/signatures/downloads/. Your signatures.txt
matrix can lead to some issues since the rows have been reordered in the newest SigProfiler versions. Please check carefully your results in this regard.
Hope that helps and thanks again for your interest!
from sigprofilersinglesample.
Hello @marcos-diazg
Indeed it works with the option check_rules=False
. Could you explain what does this option implies?
Yes I noticed that signatures.txt
was not correctly formatted, that's why I created signatures_ForSigProfiler.txt
.
Thank you for your help!
from sigprofilersinglesample.
Related Issues (20)
- Input differs than output of SigProfilerExtractor HOT 11
- minimum threshold for reported signatures? HOT 1
- New signatures on COSMIC website HOT 7
- Missing Subfolders HOT 1
- S should be uppercase for filename "signatures.txt" in spss.py script HOT 8
- ID decomposition support? HOT 2
- implementation for mutational catalogue as input HOT 4
- gzip VCF files HOT 1
- DBS and Indels signature missing
- Can this tool be used with a different genome? HOT 1
- Assignment of mutational signatures with SigProfilerSingleSample
- Error when running SigProfilerSingleSample. HOT 4
- SigProfilerSingleSample not used by CNV signature HOT 1
- IndexError when run with GRCh38 HOT 2
- Sigprofiler single sample decomposition profile doesn't match match COSMIC signature database? HOT 1
- Can ID signatures be run on vcfs from build 38? HOT 1
- KeyError: '7_pvalue' HOT 1
- index out of range HOT 3
- copy number signature HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from sigprofilersinglesample.