Comments (2)
Hi @RvV1979,
thanks for the nice feedback, good to hear that the pipeline is useful!
As of now, that type of functionality is not a built-in capability of grenepipe. I've had several users ask similar questions, but each with a slightly different requirement. Trying to implement some mechanism that would allow that within grenepipe would be rather cumbersome, both for me and for users, as it would need to be rather flexible and powerful to allow all these types of starting the pipeline somewhere in the middle, and hence require some kind of complicated way of specifying exactly which files at which step to take from where, etc...
That being said, I guess that "take the called
files and continue from there" might be a case that multiple people might find useful. Not sure if I'll get to implement that as a special case... not planned for now, but I'll put it on my list.
Anyway, so, with currently no way of grenepipe offering that intrinsicaly, there is still another way, albeit also rather complicated. That is, you could try to trick Snakemake into doing what you want. You'd need to put all your samples into a big samples table, and provide the called
files. Then, start snakemake in a dry run with --reason
, and start investigating which files it wants in each step. You might be able to get snakemake to think that all files for the subsequent steps (genotyping, filtering, etc) are there, and that it can hence start from there. Be aware that snakemake uses time stamps of the files to determine which ones need to be re-computed (as its inputs have changed) - so, you'd definitely need to learn some snakemake internals to pull this off.
Honestly, I'm not sure that I would even want to bother with that - unless computational resources and compute time on your cluster are the limiting factor on which you really need to save. Then it might be worth tinkering around with that. Otherwise, the waaaaay simpler approach is to just create a big samples table with all your fastq files, and run the whole thing in one large run. Up to you!
Let me know what you think!
Cheers and so long
Lucas
from grenepipe.
Hi Lucas,
Based on your descriptions I would indeed not want to bother with tricking snakemake into doing something it was not designed to do. In any case, your advice has saved me a lot of time trying to figure out the near impossible so many thanks for your quick reply.
from grenepipe.
Related Issues (20)
- bwa-mem2 "{tmp}.0000.bam": File exists HOT 5
- threads for bwa-mem2 via slurm HOT 2
- Error running toy example HOT 6
- MissingRuleException HOT 13
- PID error HOT 9
- java.lang.OutOfMemoryError: Java heap space HOT 2
- GRENEPIPE v12.1 HOT 5
- Make "trimming-tool" optional HOT 4
- restrict-regions and short contigs HOT 2
- ModuleNotFoundError: No module named 'chardet' HOT 2
- Write full executed command for each step to log files for reproducibility HOT 3
- mamba is difficult to install in grenepipe environment HOT 6
- Feature Request: Download reference genome and known variation HOT 2
- config file HOT 5
- greenepipe run error HOT 5
- problem with dedup HOT 4
- a new type of error HOT 2
- a new type of error HOT 1
- another type of error HOT 11
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from grenepipe.