SeqRep

Scientific framework for representation in sequential data

Description

This package aims to simplify the workflow of evaluating machine learning models, with a primary focus on sequential data. It helps with:

  • labeling data,
  • splitting data,
  • feature extraction,
  • feature reduction (i.e. selection or transformation),
  • running the pipeline,
  • evaluation of results.

It also allows you to visualize each step.

The framework is designed for easy customization and extension of its functionality.
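As a rough illustration of what an extension could look like, here is a minimal sketch of a custom feature-extraction step. It assumes that pipeline steps only need to follow the scikit-learn transformer interface (fit and transform), which the Usage example suggests because it builds a plain sklearn Pipeline. The DifferenceExtractor class is hypothetical and not part of seqrep, and the sketch works on NumPy arrays for brevity; real steps may need to handle whatever data type seqrep passes between steps.

```python
import numpy as np
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler


class DifferenceExtractor(TransformerMixin, BaseEstimator):
    """Hypothetical custom step: append first differences of each column."""

    def fit(self, X, y=None):
        # Stateless transformer: there is nothing to learn from the data.
        return self

    def transform(self, X):
        X = np.asarray(X, dtype=float)
        # Difference to the previous row; the first row is compared
        # with itself, so its difference is 0.
        diffs = np.diff(X, axis=0, prepend=X[:1])
        # Keep the original columns and append the new features.
        return np.hstack([X, diffs])


# Toy sequential data: 5 time steps, 2 columns.
X = np.array([[1.0, 10.0],
              [2.0, 12.0],
              [4.0, 11.0],
              [7.0, 15.0],
              [11.0, 14.0]])

pipe = Pipeline([('fext_diff', DifferenceExtractor()),
                 ('scale', MinMaxScaler()),
                 ])
features = pipe.fit_transform(X)
print(features.shape)
```

Because the custom class only relies on the standard transformer interface, it can sit next to the built-in extractors in the same Pipeline, provided the assumption above holds.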

Installation

python -m pip install git+https://github.com/MIR-MU/seqrep
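To confirm that the installation worked, one option is to check whether Python can locate the package. The sketch below uses only the standard library; the helper name is_installed is made up for this example, and the module name seqrep is taken from the imports in the Usage section.

```python
import importlib.util


def is_installed(module_name: str) -> bool:
    """Return True if a top-level module can be found by the import system."""
    return importlib.util.find_spec(module_name) is not None


# Reports whether the package is importable in the current environment.
print("seqrep installed:", is_installed("seqrep"))
```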

Features

See the README in the seqrep folder.

Usage

Using the package is straightforward. After importing it, there are three steps:

  1. Create your pipeline (which you want to evaluate);
  2. Create PipelineEvaluator (according to how you want to evaluate);
  3. Run the evaluation.

from sklearn.pipeline import Pipeline
from sklearn.preprocessing import MinMaxScaler
from sklearn.svm import SVC

from seqrep.feature_engineering import PreviousValuesExtractor, TimeFeaturesExtractor
from seqrep.labeling import NextColorLabeler
from seqrep.splitting import TrainTestSplitter
from seqrep.scaling import UniversalScaler
from seqrep.evaluation import ClassificationEvaluator
from seqrep.pipeline_evaluation import PipelineEvaluator

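# Step 1: create the pipeline you want to evaluate
pipe = Pipeline([('fext_prev', PreviousValuesExtractor()),
                 ('fext_time', TimeFeaturesExtractor()),
                 ('scale_u', UniversalScaler(scaler=MinMaxScaler())),
                 ])

# Step 2: create the PipelineEvaluator, which defines how the pipeline is evaluated
pipe_eval = PipelineEvaluator(labeler=NextColorLabeler(),
                              splitter=TrainTestSplitter(),
                              pipeline=pipe,
                              model=SVC(),
                              evaluator=ClassificationEvaluator(),
                              )

# Step 3: run the evaluation
# (`data` is your input dataset; it must be loaded before this call)
result = pipe_eval.run(data=data)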

See the examples folder for more details.
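For readers new to sequential data, the following plain-Python sketch illustrates the general idea behind a labeler and a splitter. It does not use seqrep and is not how the package implements these steps; the function names label_next_direction and chronological_split and the toy values are assumptions made for illustration only.

```python
from typing import List, Sequence, Tuple


def label_next_direction(values: Sequence[float]) -> List[int]:
    """Label each point 1 if the next value is higher, else 0.

    The last point has no successor, so it receives no label.
    """
    return [int(nxt > cur) for cur, nxt in zip(values, values[1:])]


def chronological_split(items: Sequence, test_ratio: float = 0.25) -> Tuple[list, list]:
    """Split without shuffling, so the test set lies after the train set in time."""
    if not 0.0 < test_ratio < 1.0:
        raise ValueError("test_ratio must be between 0 and 1")
    cut = int(len(items) * (1 - test_ratio))
    return list(items[:cut]), list(items[cut:])


prices = [10.0, 11.0, 10.5, 10.7, 12.0, 11.8, 12.5, 12.4, 13.0]
labels = label_next_direction(prices)

# Pair each value with its label; the final value is dropped because it is unlabeled.
samples = list(zip(prices[:-1], labels))
train, test = chronological_split(samples, test_ratio=0.25)

print(labels)
print(len(train), len(test))
```

The split keeps the original order on purpose: shuffling sequential data before splitting would let information from the future leak into the training set.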

License

This package is licensed under the MIT license, so it is open source. Feel free to use it!

Acknowledgement

Many thanks to my supervisor Michal Stefanik for the huge support! I am also grateful to all members of the MIR-MU group. Finally, thanks go to the Faculty of Informatics of Masaryk University for supporting this project as a dean's project.

Contributors

jakrys, stefanik12, witiko
