Git Product home page Git Product logo

lcms2struct_exp's Introduction

Experiments and result analysis scripts for the LC-MS²Struct

This repository contains the scripts to reproduce the experiments and analyse the results for the manuscript:

"Joint structural annotation of small molecules using liquid chromatography retention order and tandem mass spectrometry data",

Eric Bach, Emma L. Schymanski and Juho Rousu, 2022

Reproducibility

If you wish to re-produce our results please follow the instructions given below. All our experiments where performed using Linux as operating system and Python (version 3.8 and 3.9 are supported). Other operating systems are not officially supported. Detailed software requirements are given alongside the reproducibility instructions.

The first step for any of the following reproducibility tasks is to clone this repository:

clone https://github.com/aalto-ics-kepaco/lcms2struct_exp/
cd lcms2struct_exp

Manuscript figures

The raw outputs of the LC-MS²Struct for the experiments presented in the manuscript are available on Zenodo. Download both tar-files and unpack them in the repository's root directory.

Detailed instructions how to re-produce the figures of the manuscript can be found here.

Re-run the experiments

If you wish to re-run our experiments, you will have to install the LC-MS²Struct library, which provides an implementation of the presented Structured Support Vector Machine (SSVM). Furthermore, you will have to download our MassBank database containing all features, candidate sets, etc. from Zenodo. Detailed instructions are given here.

Re-building the MassBank DB

If you wish to re-build the MassBank DB used in our experiments, please follow the instructions given here

Repository structure

The repository is organized in different subdirectories those content is described in the following.

  • misc_scripts: Contains a script to extract different MassBank database statistics
  • run_scripts: Contains all scripts needed to re-run our experiments
  • results_raw: Contains the "raw" result files as produces by our experimental scripts (download from Zenodo)
  • results_processed: Contains the result files as used to produce the figures in our manuscript, e.g. aggregated max-margins, comparison methods, ... (download from Zenodo)
  • ssvm_evaluation: Script generating the figures from the data (library)

Citation information

Use the citation information of you want to reference our work:

@article {Bach2022,
  author = {Bach, Eric and Schymanski, Emma L. and Rousu, Juho},
  title = {Joint structural annotation of small molecules using liquid chromatography retention order and tandem mass spectrometry data},
  elocation-id = {2022.02.11.480137},
  year = {2022},
  doi = {10.1101/2022.02.11.480137}, 
  publisher = {Cold Spring Harbor Laboratory},
  URL = {https://www.biorxiv.org/content/early/2022/04/27/2022.02.11.480137},
  eprint = {https://www.biorxiv.org/content/early/2022/04/27/2022.02.11.480137.full.pdf},
  journal = {bioRxiv}
}

Software citation:

DOI

lcms2struct_exp's People

Contributors

bachi55 avatar

Stargazers

 avatar  avatar  avatar

Watchers

 avatar

Forkers

mattoslmp

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.