mpusp / snakemake-crispr-guides Goto Github PK
View Code? Open in Web Editor NEWA Snakemake workflow for the design of small guide RNAs (sgRNAs) for CRISPR applications.
License: MIT License
A Snakemake workflow for the design of small guide RNAs (sgRNAs) for CRISPR applications.
License: MIT License
utilities.R
can fail with changed wd --> more robust path from snakemake
object to source other scriptslegend.position = 0
fails creation of report.htmlREADME.md
in config/
with description how to run the wf, will be section 3 on renderd usage pageNC.XXXX.[12]
, typical RefSeq numbersID
is missing in gff
fileslocus_tag
BSgenome
R packageget_genome
complaining about not finding output filesWhen running the pipeline for Synechococcus PCC 11901, the number of CDS considered by the pipeline was less than the number of CDS annotated in the genome.
This was due to some CDS annotations using GeneMarkS-2+ rather than RefSeq or protein homology. By including GeneMarkS-2+ in the gff_source_type dictionary in the get_genome script, I was able to include the previously missing CDS.
Are there any other methods of annotating CDS when assembling genomes that should be included here?
When using "GCF_005577135.1", species name is parsed as "Picosynechococcus sp. PCC 11901 chromosome". The correct and valid name for the organism's name would be “Synechococcus sp. PCC 11901” or "Picosynechococcus sp. PCC 11901" (https://www.ncbi.nlm.nih.gov/Taxonomy/Browser/wwwtax.cgi?id=2579791).
This causes (in create_bsgenome.R)
txdb <- quiet_txdb( file = genome_gff, organism = genome_name, chrominfo = seqinfo_genome )$result
to break, which can be fixed by hardcoding the taxonomyId:
txdb <- quiet_txdb( file = genome_gff, organism = genome_name, chrominfo = seqinfo_genome, taxonomyId = 2579791 )$result
The same happens in design_guides.R in:
txdb <- makeTxDbFromGFF( file = genome_gff, organism = unname(genome(seqinfo_genome)[1]), chrominfo = seqinfo_genome )
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.