
DBS-Pro Analysis

About

This pipeline analyses sequencing data from DBS-Pro experiments for protein and PrEST quantification. The DBS-Pro method uses barcoded antibodies for surface protein quantification in droplets, for example to study single exosomes.

DBS-Pro pipeline overview

Overview of DBS-Pro pipeline run on three samples.

The pipeline takes single-end FASTQs as input, with a construct such as those specified in Standard constructs. For each sample the DBS is extracted (extract_dbs) and clustered (dbs_cluster) to enable error correction of the DBS sequences (correct_dbs). In parallel, the ABC and UMI are extracted from the same read (extract_abc_umi) and the UMIs are demultiplexed based on their ABC (demultiplex_abc). For each ABC the UMIs are grouped by DBS and then clustered to correct errors (umi_cluster). Finally, the corrected sequences are combined into a read-specific DBS, ABC and UMI combination, and these combinations are tallied to create the final output in the form of a TSV (integrate). If there are multiple samples these are also merged to generate a combined TSV (merge_data). A final report is generated to enable basic QC of the data. Also see the demo for a step-by-step walkthrough of a typical workflow.

DBS: Droplet Barcode Sequence. Reads sharing this sequence originate from the same droplet.
ABC: Antibody Barcode Sequence. Identifies which antibody was present in the droplet.
UMI: Unique Molecular Identifier. Identifies how many antibodies with a particular ABC were present in the droplet.
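The final integrate step described above boils down to tallying unique DBS, ABC and UMI combinations. A minimal sketch, using hypothetical, already error-corrected sequences:

```python
from collections import Counter

# Hypothetical corrected (DBS, ABC, UMI) triples; in the real pipeline these
# come from the error-corrected FASTQs produced by the clustering steps.
reads = [
    ("ACGTACGTACGTACGTACGT", "ABC01", "AATTCC"),
    ("ACGTACGTACGTACGTACGT", "ABC01", "AATTCC"),
    ("ACGTACGTACGTACGTACGT", "ABC02", "GGCCTT"),
]

# Tally each unique combination, analogous to the integrate step's TSV output.
counts = Counter(reads)
for (dbs, abc, umi), n in counts.items():
    print(f"{dbs}\t{abc}\t{umi}\t{n}")
```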

Setup

First, make sure conda is installed on your system.

  1. Clone the git repository.

    git clone https://github.com/FrickTobias/DBS-Pro
    
  2. Move into the git folder and install all dependencies in a conda environment.

    cd DBS-Pro
    

    For reproducibility the *.lock files are used.

    2.1. For OSX use:

    conda create --name dbspro --file environment.osx-64.lock
    

    2.2. For LINUX use:

    conda create --name dbspro --file environment.linux-64.lock
    

    2.3. Using flexible dependencies (not recommended)

    conda env create --name dbspro --file environment.yml
    

    This option will likely introduce newer versions of the software and dependencies which have not yet been tested.

  3. Activate the conda environment.

    conda activate dbspro
    
  4. Install the dbspro package.

    pip install .
    

    For development, please use pip install -e .[dev].

Usage

Prepare a FASTA with each of the antibody barcodes used in your experiment. The entry names will be used to define the targets. Also make sure that each sequence is prepended with ^; this is used for demultiplexing. See the example FASTA below:

>ABC01
^ATGCTG
>ABC02
^GTAGAT
>ABC03
^CTAGCA
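A quick sanity check of this format can be sketched in Python; the validate_abc_fasta helper below is hypothetical, not part of the dbspro package:

```python
# Minimal check that an ABC FASTA follows the expected layout: header lines
# start with '>' and each sequence is anchored with a leading '^' (required
# for demultiplexing).
def validate_abc_fasta(lines):
    entries = {}
    name = None
    for line in lines:
        line = line.strip()
        if line.startswith(">"):
            name = line[1:]
        elif name is not None:
            assert line.startswith("^"), f"{name}: sequence must start with '^'"
            entries[name] = line[1:]
            name = None
    return entries

print(validate_abc_fasta([">ABC01", "^ATGCTG", ">ABC02", "^GTAGAT"]))
```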

Use dbspro init to create an analysis folder. Provide the FASTA with the antibody barcodes (here named ABCs.fasta), a directory name and one or more FASTQs for the samples.

dbspro init --abc ABCs.fasta <output-folder> <sample1.fastq>

If you have several samples you can also provide a CSV file with lines in the format </path/to/sample.fastq>,<sample_name>. This lets you name your samples as you wish. With a CSV the initialization is as follows:

dbspro init --abc ABCs.fasta --sample-csv samples.csv <output-folder>
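Such a CSV can also be generated programmatically. The write_sample_csv helper below is a hypothetical sketch, assuming the sample name is simply the FASTQ file name without its extension:

```python
import csv
import tempfile
from pathlib import Path

# Sketch: build a samples CSV in the "<path>,<sample_name>" line format from
# a directory of FASTQ files (directory layout and naming are assumptions).
def write_sample_csv(fastq_dir, out_csv):
    with open(out_csv, "w", newline="") as fh:
        writer = csv.writer(fh)
        for path in sorted(Path(fastq_dir).glob("*.fastq")):
            writer.writerow([str(path), path.stem])

# Demo on a temporary directory with two empty FASTQs.
tmp = Path(tempfile.mkdtemp())
for name in ("sample1.fastq", "sample2.fastq"):
    (tmp / name).touch()
write_sample_csv(tmp, tmp / "samples.csv")
print((tmp / "samples.csv").read_text())
```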

Once the directory has been successfully initialized, move into the directory

cd <output-folder>

and check the current (default) configs using

dbspro config

Any changes to the configs should primarily be made through the dbspro config command, which validates the parameters. You can check the construct layout by running dbspro config --print-construct. Some standard constructs are also defined, see Standard constructs. Once the configs are updated you are ready to run the full analysis using this command.

dbspro run

For more information on how to run use dbspro run -h.

Output files

The main output is a TSV file data.tsv.gz with the following columns:

Column name  Description
Barcode      The DBS sequence
Target       Target name (acquired from the ABC FASTA headers)
UMI          The UMI sequence
ReadCount    Number of reads with this DBS, Target and UMI combination
Sample       Sample name
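From this TSV a droplet-by-target count matrix can be built, for example by counting unique UMIs per Barcode and Target with pandas. The inline DataFrame below is a stand-in for a real data.tsv.gz:

```python
import pandas as pd

# Toy stand-in for the pipeline output; for real data use:
#   df = pd.read_csv("data.tsv.gz", sep="\t")
df = pd.DataFrame({
    "Barcode": ["D1", "D1", "D2"],
    "Target": ["ABC01", "ABC02", "ABC01"],
    "UMI": ["U1", "U2", "U3"],
    "ReadCount": [5, 3, 7],
    "Sample": ["s1"] * 3,
})

# Count unique UMIs per (Barcode, Target) and pivot into a count matrix.
matrix = (
    df.groupby(["Barcode", "Target"])["UMI"]
    .nunique()
    .unstack(fill_value=0)
)
print(matrix)
```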

For convenience, AnnData h5ad files with count matrices are also generated for each sample. These can be used for downstream analysis with Scanpy. To import the data, use the following code:

import scanpy as sc
adata = sc.read_h5ad("mysample.h5ad")
adata

The pipeline also generates a report report.html with some basic QC metrics.

Standard constructs

The most common constructs are included as presets which can be initialized using the -c/--construct parameter in dbspro config. Currently available constructs include:

dbspro_v1

Sequence: 5'-CGATGCTAATCAGATCA BDVHBDVHBDVHBDVHBDVH AAGAGTCAATAGACCATCTAACAGGATTCAGGTA XXXXX NNNNNN TTATATCACGACAAGAG-3'
Name:        |       H1      | |       DBS        | |               H2               | |ABC| |UMI | |       H3      |
Size (bp):   |       17      | |        20        | |               34               | | 5 | | 6  | |       17      |

This is the DBS-Pro construct used in the publication Stiller et al. 2019.
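The layout above can be turned into a matching pattern using IUPAC codes (B = C/G/T, D = A/G/T, H = A/C/T, V = A/C/G). This is a minimal sketch, not the pipeline's actual matching code:

```python
import re

# Map IUPAC codes to regex character classes; X (ABC base) and N (UMI base)
# are treated as any base here.
IUPAC = {"A": "A", "C": "C", "G": "G", "T": "T",
         "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
         "N": "[ACGT]", "X": "[ACGT]"}

def to_regex(construct):
    return "".join(IUPAC[base] for base in construct.replace(" ", ""))

# dbspro_v1 handle H1 followed by the 20 bp DBS (BDVH repeated five times).
h1 = "CGATGCTAATCAGATCA"
dbs = "BDVH" * 5
pattern = re.compile(to_regex(h1) + f"(?P<dbs>{to_regex(dbs)})")

# Synthetic read: H1 + a DBS + the start of H2.
read = h1 + "CGCACGCACGCACGCACGCA" + "AAGAGTCA"
m = pattern.match(read)
print(m.group("dbs"))
```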

dbspro_v2

Sequence: 5'-CAGTCTGAGCGGTTCAACAGG BDVHBDVHBDVHBDVHBDVH GCGGTCGTGCTGTATTGTCTCCCACCATGACTAACGCGCTTG XXXXX NNNNNN CACCTGACGCACTGAATACGC-3'
Name:        |         H1        | |       DBS        | |                   H2                   | |ABC| |UMI | |         H3        |
Size (bp):   |         21        | |        20        | |                   42                   | | 5 | | 6  | |         21        |

This is the DBS-Pro construct used in the publication Banijamali et al. 2022.

pba

Sequence: 5'-NNNNNNNNNNNNNNN ACCTGAGACATCATAATAGCA XXXXX NNNNNN CATTACTAGGAATCACACGCAGAT-3'
Name:        |     DBS     | |         H2        | |ABC| |UMI | |          H3          |
Size (bp):   |      15     | |         21        | | 5 | | 6  | |          24          |

This is the construct used in the article Wu et al. 2019 which introduces the Proximity Barcoding Assay (PBA).

Demo

A short demonstration of the pipeline and some downstream analysis is available in the following Jupyter Notebook. This can also be used to test that the conda environment is properly set up.

Development

For notes on development see doc/development.

Publications

Check out version v0.1 for the pipeline used in:

Stiller, C., Aghelpasand, H., Frick, T., Westerlund, K., Ahmadian, A., & Eriksson Karlström, A. (2019). Fast and efficient Fc-specific photoaffinity labelling to produce antibody-DNA-conjugates. Bioconjugate chemistry.

Version v0.3 was used in:

Banijamali, M., Höjer, P., Nagy, A., Hååg, P., Gomero, E. P., Stiller, C., Kaminskyy, V. O., Ekman, S., Lewensohn, R., Karlström, A. E., Viktorsson, K., & Ahmadian, A. (2022). Characterizing Single Extracellular Vesicles by Droplet Barcode Sequencing for Protein Analysis. Journal of Extracellular Vesicles, e12277.

dbs-pro's Issues

Implement pytest

Use pytest for the test setup and support running tests from remote directories.

Outdated dependency file

These modules are currently not included in the environment file

  • pandas
  • seaborn
  • dnaio
  • snakemake
  • umi_tools

weird construct file path input

Currently it seems the path to the construct file must be given relative to the output directory rather than the working directory.

Jupyter notebook example run

Provide an example of how to edit and run the pipeline in a linked, publicly available Google Colab Jupyter notebook.

Change pipeline order

Just an idea I had about how we might want to change the order of our pipeline.

I have found the following issue: we cluster UMIs for each ABC target but do not separate them by DBS. This could mean that we are merging UMIs that should in fact be separate. My proposal would be to separate all UMIs by both ABC and DBS before clustering. This would better represent the actual conditions in the experiment.

I am however unsure about the benefits in the end; possibly this would only be a lot of work for nothing, but I wanted to raise the idea anyway to see what you think.

Current pipeline

START. Input = Fastq file

  1. Separate for DBS
    1.1. Extract DBS
    1.2. Cluster DBS
    1.3. Correct DBS FASTQ

  2. Separate for ABCs
    2.1. Extract ABC-UMI
    2.2. Split ABC-UMI by ABC
    2.3. Cluster ABCs independently
    2.4. Correct ABC FASTQs

  3. Analysis of corrected DBS and ABC files.

END.

Proposed pipeline outline

START. Fastq file

  1. Extract DBS
  2. Extract ABC-UMI
  3. Cluster DBS
  4. Correct DBS fastq
  5. Split/Tag ABC-UMI by DBS // This represents separated droplets
  6. Split/Tag ABC-UMI by ABC // This represents splitting within droplets for different targets
  7. Cluster each DBS-ABC pair independently
  8. Correct DBS-ABC pairs
  9. Analysis

END.
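Steps 5-7 of the proposed outline amount to grouping UMIs per (DBS, ABC) pair before clustering. A sketch of that grouping, with a hypothetical helper and input tuples:

```python
from collections import defaultdict

# Collect UMIs per (DBS, ABC) pair so each pair can be clustered
# independently, as the proposal suggests. Input tuples are hypothetical.
def group_umis(reads):
    groups = defaultdict(list)
    for dbs, abc, umi in reads:
        groups[(dbs, abc)].append(umi)
    return groups

reads = [("D1", "ABC01", "U1"), ("D1", "ABC01", "U2"), ("D2", "ABC01", "U3")]
groups = group_umis(reads)
print(groups[("D1", "ABC01")])  # ['U1', 'U2']
```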

Modular ABC file system

The ABC file system is currently not that adaptable, and it would be nice to be able to have several ABC-sequence files for different setups.

I'd suggest adding functionality for adding a new construct file using dbspro set, and a separate command for changing which one is used, by adding a dbspro config command (or something like that).

Check for & filter chimeric reads

The UMI sequences can be used to identify chimeric sequences by looking for UMIs linked to several different ABC or DBS sequences.
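One possible sketch of such a filter, flagging UMIs linked to more than one DBS within the same ABC (the helper and input tuples are hypothetical):

```python
from collections import defaultdict

# For each (ABC, UMI) pair, record the DBS sequences it appears with; a UMI
# seen with multiple DBSs for the same ABC is a chimera candidate.
def chimeric_umis(reads):
    seen = defaultdict(set)
    for dbs, abc, umi in reads:
        seen[(abc, umi)].add(dbs)
    return {key for key, dbs_set in seen.items() if len(dbs_set) > 1}

reads = [("D1", "ABC01", "U1"), ("D2", "ABC01", "U1"), ("D1", "ABC02", "U2")]
print(chimeric_umis(reads))  # {('ABC01', 'U1')}
```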
