Git Product home page Git Product logo

mgf-s3-prototype's Introduction

mgf-s3-prototype

The repositories that will hold the MFG outputs are still under investigation, so while the data in here are all valid, they may be moved to another location and another organisation later

Prerequisites

conda install -c conda-forge dvc dvc-s3
conda install -c conda-forge s5cmd  # optional install in case you want direct access to the s3 bucket

Use case 1: initializing a new analysis-results-cluster

Before running the following commands, it is assumed that you already ran git init or git cloned an existing repository.

dvc init
git commit -m "initialize dvc"
dvc remote add -d myremote s3://emobon-lfs-demo
dvc remote modify myremote endpointurl https://s3.mesocentre.uca.fr
dvc remote modify myremote profile "eosc-fairease1" # run this if you make use of a profile in your s3 config
git add .dvc/config
git commit -m "update dvc config"

Use case 2: pushing large files

Copy the new files you want to add into the repository on your local machine and start by adding the large files to DVC.

For example:

dvc add mgf-crate-0/results/DBH_AAAGOSDA_1_1_HMNJKDSX3.UDI200_clean.fastq.trimmed.fasta
dvc add mgf-crate-0/results/DBH_AAAGOSDA_1_2_HMNJKDSX3.UDI200_clean.fastq.trimmed.fasta
dvc add mgf-crate-0/results/DBH.merged_CDS.faa
...

Once all large files have been added to DVC, the .dvc placeholder files and .gitignore can be committed and the large files can be pushed to the S3 bucket.

git add mgf-crate-0/results/*.dvc mgf-crate-0/results/.gitignore
git commit -m "add large files"
dvc push

The remaining small files can then be committed directly.

git add .
git commit -m "add small files"
git push origin

Use case 3: pulling large files

A single command is sufficient to retrieve all large files in one go.

dvc pull

mgf-s3-prototype's People

Contributors

fmestre1 avatar laurianvm avatar bulricht avatar cymon avatar kmexter avatar

Watchers

 avatar Marc Portier avatar Ioulia Santi avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.