Git Product home page Git Product logo

biologicaldatads2020's Introduction

BiologicalDataDS2020

Repository for Biological Data course project, Master Degree in Data Science at University of Padua.

Image

Requirements

All the required Python packages can be install executing the code

pip install -r requirements.txt

while inside the folder of the project.

All the remaining operations were executed using a Linux x64 machine, launching the bash files inside the data folder.

All the databases needed to execute the code were not included in the repository due to their size, and are hosted in this OneDrive folder. After downloading them, place them inside the data/part_2/original_datasets folder.

Since the computation of all the metrics for all the models is quite time consuming, we computed them just the first time, saved all the results on .csv files, and just read from them in the Notebook. To recompute from scratch all the metrics to test all the computations, just delete all the data in the parsed subfolders in data/part_1/HMMs and data/part_1/PSSMs.

Structure of the Project

The main file of the project is Project.ipynb (available here), in here all the steps we have done can be followed and executed again.

report.pdf contains an in-depth explanation of what was done during the project and in there will be the interpretations of our results.

In code can be found all the Python script used in the Jupyter notebook.

In data can be found all the files and bash script used and saved from the Jupyter notebook.

biologicaldatads2020's People

Contributors

alessandromanente avatar rmazzier avatar gianmarcocr avatar

Stargazers

Benedetta Mariani avatar Michele Lambertucci avatar  avatar Alberto Rossetto avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.