ylab-hi / scanneo2 Goto Github PK

Snakemake-based computational workflow for neoantigen prediction from diverse sources

License: MIT License

Python 95.88% Dockerfile 3.88% Shell 0.24%

epitope exitron gene-fusion indels neoantigens neoepitope peptide snakemake snakemake-workflow splicing

scanneo2's Issues

Request TESLA*fastq.gz

From the .tests/integration/config_basic/config.yaml
Do you know how I can get these files to test it?
TESLA_9_2.fastq.gz
TESLA_10_2.fastq.gz
TESLA_11_2.fastq.gz

nthreads cannot be larger than environment variable

The workflow/envs/optitype.yml environment is limited to NUMEXPR_MAX_THREADS" (64). This applies only when setting the threads number higher than 64.

Error.  nthreads cannot be larger than environment variable "NUMEXPR_MAX_THREADS" (64)

Needs to be either increased or limited to 64 (hla typing)

Where I can download the required argument peptides

I want to use the compile.py module.
However, in the repo, I can not found the description that illustrate where I can download the peptide.fasta
https://github.com/ylab-hi/ScanNeo2/blob/52a3818ec3189af502f18eba6b6a1a69b9b3a8c3/workflow/rules/prioritization.smk#L58C4-L58C43

Add routine to catch missing input data on filesystem

Missing input data causes:

MissingInputException in rule fastqc_forward in file /projects/b1171/sej9799/GBM_analysis/ScanNeo2/workflow/rules/preproc.smk, line 35:
Missing input files for rule fastqc_forward:
output: results/SRR8281248/rnaseq/qualitycontrol/rna_tumor_R1_fastqc_raw.html, results/SRR8281248/rnaseq/qualitycontrol/rna_tumor_R1_fastqc_raw.zip
wildcards: sample=SRR8281248, seqtype=rnaseq, group=rna_tumor
affected files:
../GBM/SRR8281248_1.fastq

Should be caught differently

Aligned option using star

In ran on the star 2.7.1a star aligner. The option --chimOutType is not set correctly when I ran it showed me the conflict options.
https://github.com/ylab-hi/ScanNeo2/blob/main/workflow/rules/align.smk#L33C1-L33C34

EXITING because of fatal PARAMETERS error: --chimMultimapNmax > 0 (new chimeric detection) presently only works with --chimOutType Junctions
SOLUTION: re-run with --chimOutType Junctions

However, the pipeline already sets with --chimOutType WithinBAM HardClip.
Currently, I replace the option as recommended automatically by the software.
Can you check on the current smk align rule. If it is not set correctly, please update the pipeline.

cleave peptide for fusion gene

I tested on the test dataset of nextneopi (https://github.com/icbi-lab/nextNEOpi). I has a similar session for using the arriba to get the fusion genes. From those fusion genes, it can get the peptides that are possible to be the neoantigens.

There peptides with 8 amino acids:
PTEN - AC063965.1(21548),MED6P1(31892) MFSGGTCm FSGGTCmg SGGTCmgr GGTCmgrc GTCmgrcm TCmgrcmq Cmgrcmqt mgrcmqty grcmqtyp rcmqtypk cmqtypkv mqtypkvq qtypkvqg typkvqgs#Fusion-out-of-frame#high#yes#chr10:87952259#chr10:88016243#11#1#0#.#.

Your peptides with 8 amino acids:
MFSGGTCm

Is there anything wrong related to my test. Or your pipeline is focused on getting only this peptide rather than getting too much peptides sequence to achieve 37/38 active neoantigens on TELSA dataset?

Best,

MHC-I genotyping on Split BAMs

BAMfiles probably need to be QNAME sorted (rather than coordinate sorted) when splitting them

Calling the long non indel from both DNAseq and RNAseq bam file

I reviewed the publication:
https://academic.oup.com/bioinformatics/article/39/11/btad659/7330407
I think that the pipeline used both bam files from RNAseq and DNAseq to call for the long indel. The bam file from the figure shows me it is only from the RNAseq bam file. If there is anything wrong, please let's me know.

Extracting RG for DNAseq data when input is BAM required

A similar routine as for RNA-seq data required for BWA alignment for the DNA-seq path (when input is BAM file)

ylab-hi / scanneo2 Goto Github PK

scanneo2's Issues

Request TESLA*fastq.gz

nthreads cannot be larger than environment variable

Where I can download the required argument peptides

Add routine to catch missing input data on filesystem

Aligned option using star

cleave peptide for fusion gene

Paired-end Bam files causes error exception in featureCounts

Can I run the pipeline with slurm on HPC?

add cpus to spladder

Slow FTP download of VEP Cache

GATK Speedup Gather&Scatter

Prevent trigger that re-runs certain rules (/tmp)

can i use targeted sequencing? rather than WES..

MHC-I genotyping on Split BAMs

Calling the long non indel from both DNAseq and RNAseq bam file

Extracting RG for DNAseq data when input is BAM required

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent