Git Product home page Git Product logo

best-of-ml-python's Introduction

Best-of Machine Learning with Python

🏆  A ranked list of awesome machine learning Python libraries. Updated weekly.

This curated list contains 830 awesome open-source projects with a total of 2.6M stars grouped into 32 categories. All projects are ranked by a project-quality score, which is calculated based on various metrics automatically collected from GitHub and different package managers. If you like to add or update projects, feel free to open an issue, submit a pull request, or directly edit the projects.yaml. Contributions are very welcome!


🧙‍♂️  Discover other best-of lists or create your own.
📫  Subscribe to our newsletter for updates and trending projects.


Contents

Explanation

  • 🥇🥈🥉  Combined project-quality score
  • ⭐️  Star count from GitHub
  • 🐣  New project (less than 6 months old)
  • 💤  Inactive project (6 months no activity)
  • 💀  Dead project (12 months no activity)
  • 📈📉  Project is trending up or down
  • ➕  Project was recently added
  • ❗️  Warning (e.g. missing/risky license)
  • 👨‍💻  Contributors count from GitHub
  • 🔀  Fork count from GitHub
  • 📋  Issue count from GitHub
  • ⏱️  Last update timestamp on package manager
  • 📥  Download count from package manager
  • 📦  Number of dependent projects
  •   Tensorflow related project
  •   Sklearn related project
  •   PyTorch related project
  •   MxNet related project
  •   Apache Spark related project
  •   Jupyter related project
  •   PaddlePaddle related project
  •   Pandas related project

Machine Learning Frameworks

Back to top

General-purpose machine learning and deep learning frameworks.

Tensorflow (🥇44 · ⭐ 150K) - An Open Source Machine Learning Framework for Everyone. Apache-2
  • GitHub (👨‍💻 3.5K · 🔀 84K · 📦 120K · 📋 30K - 14% open · ⏱️ 17.01.2021):

     git clone https://github.com/tensorflow/tensorflow
    
  • PyPi (📥 4.7M / month · 📦 23K · ⏱️ 14.12.2020):

     pip install tensorflow
    
  • Conda (📥 2.3M · ⏱️ 15.07.2020):

     conda install -c conda-forge tensorflow
    
  • Docker Hub (📥 47M · ⭐ 1.8K · ⏱️ 17.01.2021):

     docker pull tensorflow/tensorflow
    
scikit-learn (🥇41 · ⭐ 44K) - scikit-learn: machine learning in Python. BSD-3
  • GitHub (👨‍💻 2.1K · 🔀 21K · 📥 630 · 📦 180K · 📋 8.9K - 26% open · ⏱️ 16.01.2021):

     git clone https://github.com/scikit-learn/scikit-learn
    
  • PyPi (📥 9.6M / month · 📦 38K · ⏱️ 22.12.2020):

     pip install scikit-learn
    
  • Conda (📥 6.3M · ⏱️ 22.12.2020):

     conda install -c conda-forge scikit-learn
    
PyTorch (🥇39 · ⭐ 46K) - Tensors and Dynamic neural networks in Python with strong GPU.. BSD-3
  • GitHub (👨‍💻 2.5K · 🔀 12K · 📦 59K · 📋 20K - 36% open · ⏱️ 17.01.2021):

     git clone https://github.com/pytorch/pytorch
    
  • PyPi (📥 1.6M / month · 📦 6.7K · ⏱️ 10.12.2020):

     pip install torch
    
  • Conda (📥 9.8M · ⏱️ 10.12.2020):

     conda install -c pytorch pytorch
    
PySpark (🥇38 · ⭐ 29K) - Apache Spark Python API. Apache-2
  • GitHub (👨‍💻 2.4K · 🔀 23K · 📦 540 · ⏱️ 16.01.2021):

     git clone https://github.com/apache/spark
    
  • PyPi (📥 6.4M / month · 📦 760 · ⏱️ 07.09.2020):

     pip install pyspark
    
  • Conda (📥 870K · ⏱️ 07.09.2020):

     conda install -c conda-forge pyspark
    
StatsModels (🥇36 · ⭐ 5.9K) - Statsmodels: statistical modeling and econometrics in Python. BSD-3
  • GitHub (👨‍💻 300 · 🔀 2.1K · 📥 25 · 📦 36K · 📋 4.3K - 47% open · ⏱️ 12.01.2021):

     git clone https://github.com/statsmodels/statsmodels
    
  • PyPi (📥 1.8M / month · 📦 6.7K · ⏱️ 29.10.2020):

     pip install statsmodels
    
  • Conda (📥 3.1M · ⏱️ 12.01.2021):

     conda install -c conda-forge statsmodels
    
Keras (🥇35 · ⭐ 51K) - Deep Learning for humans. MIT
  • GitHub (👨‍💻 900 · 🔀 19K · 📋 10K - 30% open · ⏱️ 15.01.2021):

     git clone https://github.com/keras-team/keras
    
  • PyPi (📥 1.7M / month · 📦 15K · ⏱️ 24.06.2020):

     pip install keras
    
  • Conda (📥 1.4M · ⏱️ 25.06.2020):

     conda install -c conda-forge keras
    
XGBoost (🥇35 · ⭐ 20K) - Scalable, Portable and Distributed Gradient Boosting (GBDT, GBRT or.. Apache-2
  • GitHub (👨‍💻 500 · 🔀 7.9K · 📥 1.9K · 📦 13K · 📋 3.9K - 6% open · ⏱️ 15.01.2021):

     git clone https://github.com/dmlc/xgboost
    
  • PyPi (📥 2.4M / month · 📦 1.6K · ⏱️ 09.12.2020):

     pip install xgboost
    
  • Conda (📥 1.3M · ⏱️ 10.12.2020):

     conda install -c conda-forge xgboost
    
LightGBM (🥇35 · ⭐ 12K) - A fast, distributed, high performance gradient boosting (GBT, GBDT, GBRT,.. MIT
  • GitHub (👨‍💻 200 · 🔀 3.2K · 📥 87K · 📦 5.3K · 📋 2K - 4% open · ⏱️ 16.01.2021):

     git clone https://github.com/microsoft/LightGBM
    
  • PyPi (📥 1.3M / month · 📦 560 · ⏱️ 08.12.2020):

     pip install lightgbm
    
  • Conda (📥 470K · ⏱️ 15.01.2021):

     conda install -c conda-forge lightgbm
    
MXNet (🥈34 · ⭐ 19K) - Lightweight, Portable, Flexible Distributed/Mobile Deep Learning.. Apache-2
  • GitHub (👨‍💻 950 · 🔀 6.8K · 📥 23K · 📦 1.7K · 📋 9.4K - 19% open · ⏱️ 16.01.2021):

     git clone https://github.com/apache/incubator-mxnet
    
  • PyPi (📥 140K / month · 📦 440 · ⏱️ 28.08.2020):

     pip install mxnet
    
  • Conda (📥 5.7K · ⏱️ 29.02.2020):

     conda install -c anaconda mxnet
    
Theano (🥈34 · ⭐ 9.3K) - Theano is a Python library that allows you to define, optimize, and.. BSD-3
  • GitHub (👨‍💻 380 · 🔀 2.5K · 📦 10K · 📋 2.8K - 24% open · ⏱️ 05.09.2020):

     git clone https://github.com/Theano/Theano
    
  • PyPi (📥 190K / month · 📦 5.5K · ⏱️ 27.07.2020):

     pip install theano
    
  • Conda (📥 1.3M · ⏱️ 01.11.2020):

     conda install -c conda-forge theano
    
pytorch-lightning (🥈33 · ⭐ 11K) - The lightweight PyTorch wrapper for high-performance.. Apache-2
  • GitHub (👨‍💻 360 · 🔀 1.3K · 📥 34 · 📦 1.7K · 📋 2.8K - 12% open · ⏱️ 16.01.2021):

     git clone https://github.com/PyTorchLightning/pytorch-lightning
    
  • PyPi (📥 90K / month · 📦 14 · ⏱️ 12.01.2021):

     pip install pytorch-lightning
    
  • Conda (📥 14K · ⏱️ 12.01.2021):

     conda install -c conda-forge pytorch-lightning
    
Fastai (🥈32 · ⭐ 20K) - The fastai deep learning library. Apache-2
  • GitHub (👨‍💻 470 · 🔀 6.9K · 📋 1.4K - 4% open · ⏱️ 13.01.2021):

     git clone https://github.com/fastai/fastai
    
  • PyPi (📥 130K / month · 📦 370 · ⏱️ 13.01.2021):

     pip install fastai
    
jax (🥈32 · ⭐ 11K) - Composable transformations of Python+NumPy programs: differentiate,.. Apache-2
  • GitHub (👨‍💻 240 · 🔀 940 · 📦 1K · 📋 2.1K - 34% open · ⏱️ 16.01.2021):

     git clone https://github.com/google/jax
    
  • PyPi (📥 70K / month · 📦 46 · ⏱️ 12.01.2021):

     pip install jax
    
  • Conda (📥 68K · ⏱️ 14.10.2020):

     conda install -c conda-forge jaxlib
    
Thinc (🥈32 · ⭐ 2.2K) - A refreshing functional take on deep learning, compatible with your favorite.. MIT
  • GitHub (👨‍💻 36 · 🔀 200 · 📦 11K · 📋 98 - 17% open · ⏱️ 05.01.2021):

     git clone https://github.com/explosion/thinc
    
  • PyPi (📥 790K / month · 📦 1.1K · ⏱️ 16.12.2020):

     pip install thinc
    
  • Conda (📥 850K · ⏱️ 18.12.2020):

     conda install -c conda-forge thinc
    
Catboost (🥈31 · ⭐ 5.6K) - A fast, scalable, high performance Gradient Boosting on Decision.. Apache-2
  • GitHub (👨‍💻 720 · 🔀 860 · 📥 47K · 📋 1.3K - 22% open · ⏱️ 17.01.2021):

     git clone https://github.com/catboost/catboost
    
  • PyPi (📥 710K / month · 📦 160 · ⏱️ 27.12.2020):

     pip install catboost
    
  • Conda (📥 550K · ⏱️ 29.12.2020):

     conda install -c conda-forge catboost
    
Chainer (🥈31 · ⭐ 5.5K) - A flexible framework of neural networks for deep learning. MIT
  • GitHub (👨‍💻 320 · 🔀 1.4K · 📦 1.8K · 📋 2K - 0% open · ⏱️ 23.12.2020):

     git clone https://github.com/chainer/chainer
    
  • PyPi (📥 18K / month · 📦 740 · ⏱️ 13.01.2021):

     pip install chainer
    
TFlearn (🥈30 · ⭐ 9.5K) - Deep learning library featuring a higher-level API for TensorFlow. MIT
  • GitHub (👨‍💻 130 · 🔀 2.4K · 📦 2.9K · 📋 930 - 61% open · ⏱️ 30.11.2020):

     git clone https://github.com/tflearn/tflearn
    
  • PyPi (📥 12K / month · 📦 480 · ⏱️ 11.11.2020):

     pip install tflearn
    
Vowpal Wabbit (🥈30 · ⭐ 7.4K) - Vowpal Wabbit is a machine learning system which pushes the.. BSD-3
  • GitHub (👨‍💻 300 · 🔀 1.7K · 📦 93 · 📋 1K - 15% open · ⏱️ 15.01.2021):

     git clone https://github.com/VowpalWabbit/vowpal_wabbit
    
  • PyPi (📥 18K / month · 📦 16 · ⏱️ 12.11.2020):

     pip install vowpalwabbit
    
PaddlePaddle (🥈29 · ⭐ 14K) - PArallel Distributed Deep LEarning: Machine Learning.. Apache-2
  • GitHub (👨‍💻 490 · 🔀 3.4K · 📥 15K · 📦 26 · 📋 13K - 18% open · ⏱️ 17.01.2021):

     git clone https://github.com/PaddlePaddle/Paddle
    
  • PyPi (📥 6.2K / month · 📦 16 · ⏱️ 14.12.2020):

     pip install paddlepaddle
    
Turi Create (🥈29 · ⭐ 10K) - Turi Create simplifies the development of custom machine learning.. BSD-3
  • GitHub (👨‍💻 81 · 🔀 1.1K · 📥 3.6K · 📦 220 · 📋 1.7K - 27% open · ⏱️ 13.10.2020):

     git clone https://github.com/apple/turicreate
    
  • PyPi (📥 12K / month · 📦 34 · ⏱️ 30.09.2020):

     pip install turicreate
    
tensorpack (🥈29 · ⭐ 5.9K) - A Neural Net Training Interface on TensorFlow, with focus.. Apache-2
  • GitHub (👨‍💻 55 · 🔀 1.7K · 📥 97 · 📦 670 · 📋 1.3K - 0% open · ⏱️ 16.01.2021):

     git clone https://github.com/tensorpack/tensorpack
    
  • PyPi (📥 92K / month · 📦 76 · ⏱️ 24.04.2020):

     pip install tensorpack
    
Sonnet (🥉28 · ⭐ 8.7K) - TensorFlow-based neural network library. Apache-2
  • GitHub (👨‍💻 48 · 🔀 1.2K · 📦 480 · 📋 150 - 10% open · ⏱️ 08.10.2020):

     git clone https://github.com/deepmind/sonnet
    
  • PyPi (📥 53K / month · 📦 82 · ⏱️ 27.03.2020):

     pip install dm-sonnet
    
  • Conda (📥 6.9K · ⏱️ 14.11.2020):

     conda install -c conda-forge sonnet
    
CNTK (🥉27 · ⭐ 17K · 💤) - Microsoft Cognitive Toolkit (CNTK), an open source deep-learning toolkit. MIT
  • GitHub (👨‍💻 260 · 🔀 4.4K · 📥 14K · 📋 3.3K - 24% open · ⏱️ 31.03.2020):

     git clone https://github.com/microsoft/CNTK
    
  • PyPi (📥 1.1K / month · 📦 17 · ⏱️ 09.12.2020):

     pip install cntk
    
skorch (🥉27 · ⭐ 3.7K) - A scikit-learn compatible neural network library that wraps.. BSD-3
  • GitHub (👨‍💻 39 · 🔀 270 · 📦 250 · 📋 360 - 13% open · ⏱️ 25.12.2020):

     git clone https://github.com/skorch-dev/skorch
    
  • PyPi (📥 5.8K / month · 📦 13 · ⏱️ 30.08.2020):

     pip install skorch
    
  • Conda (📥 160K · ⏱️ 19.09.2020):

     conda install -c conda-forge skorch
    
Ignite (🥉27 · ⭐ 3.2K) - High-level library to help with training and evaluating neural.. BSD-3
  • GitHub (👨‍💻 120 · 🔀 400 · 📦 700 · 📋 690 - 14% open · ⏱️ 13.01.2021):

     git clone https://github.com/pytorch/ignite
    
  • PyPi (📥 35K / month · 📦 62 · ⏱️ 17.01.2021):

     pip install pytorch-ignite
    
  • Conda (📥 52K · ⏱️ 20.09.2020):

     conda install -c pytorch ignite
    
dyNET (🥉27 · ⭐ 3.2K) - DyNet: The Dynamic Neural Network Toolkit. Apache-2
  • GitHub (👨‍💻 160 · 🔀 690 · 📥 1.4K · 📦 160 · 📋 920 - 27% open · ⏱️ 22.10.2020):

     git clone https://github.com/clab/dynet
    
  • PyPi (📥 4.6K / month · 📦 38 · ⏱️ 21.10.2020):

     pip install dyNET
    
Flax (🥉27 · ⭐ 1.4K · 📉) - Flax is a neural network ecosystem for JAX that is designed for.. Apache-2 jax
  • GitHub (👨‍💻 69 · 🔀 150 · 📥 20 · 📦 87 · 📋 230 - 50% open · ⏱️ 15.01.2021):

     git clone https://github.com/google/flax
    
  • PyPi (📥 54K / month · 📦 3 · ⏱️ 08.12.2020):

     pip install flax
    
mlpack (🥉26 · ⭐ 3.5K) - mlpack: a scalable C++ machine learning library --. BSD-3
  • GitHub (👨‍💻 260 · 🔀 1.3K · 📋 1.3K - 11% open · ⏱️ 16.01.2021):

     git clone https://github.com/mlpack/mlpack
    
  • PyPi (📥 180 / month · ⏱️ 28.10.2020):

     pip install mlpack
    
  • Conda (📥 62K · ⏱️ 29.10.2020):

     conda install -c conda-forge mlpack
    
Ludwig (🥉25 · ⭐ 7.4K · 📉) - Ludwig is a toolbox that allows to train and evaluate deep.. Apache-2
  • GitHub (👨‍💻 86 · 🔀 880 · 📦 74 · 📋 520 - 21% open · ⏱️ 30.12.2020):

     git clone https://github.com/ludwig-ai/ludwig
    
  • PyPi (📥 1.1K / month · 📦 12 · ⏱️ 16.11.2020):

     pip install ludwig
    
Neural Network Libraries (🥉25 · ⭐ 2.4K) - Neural Network Libraries. Apache-2
  • GitHub (👨‍💻 55 · 🔀 300 · 📥 520 · 📋 53 - 66% open · ⏱️ 15.01.2021):

     git clone https://github.com/sony/nnabla
    
  • PyPi (📥 2.5K / month · 📦 34 · ⏱️ 24.12.2020):

     pip install nnabla
    
tensorflow-upstream (🥉25 · ⭐ 530) - TensorFlow ROCm port. Apache-2
  • GitHub (👨‍💻 3.5K · 🔀 64 · 📥 16 · 📋 280 - 19% open · ⏱️ 14.01.2021):

     git clone https://github.com/ROCmSoftwarePlatform/tensorflow-upstream
    
  • PyPi (📥 2.1K / month · ⏱️ 22.12.2020):

     pip install tensorflow-rocm
    
einops (🥉24 · ⭐ 2.2K) - Deep learning operations reinvented (for pytorch, tensorflow, jax and.. MIT
  • GitHub (👨‍💻 10 · 🔀 72 · 📦 190 · 📋 61 - 36% open · ⏱️ 09.01.2021):

     git clone https://github.com/arogozhnikov/einops
    
  • PyPi (📥 14K / month · 📦 10 · ⏱️ 08.09.2020):

     pip install einops
    
  • Conda (📥 3.5K · ⏱️ 15.10.2020):

     conda install -c conda-forge einops
    
ktrain (🥉24 · ⭐ 720) - ktrain is a Python library that makes deep learning and AI more.. Apache-2
  • GitHub (👨‍💻 12 · 🔀 170 · 📦 110 · 📋 290 - 3% open · ⏱️ 26.12.2020):

     git clone https://github.com/amaiya/ktrain
    
  • PyPi (📥 5.4K / month · 📦 1 · ⏱️ 05.12.2020):

     pip install ktrain
    
xLearn (🥉23 · ⭐ 2.8K · 💤) - High performance, easy-to-use, and scalable machine learning (ML).. Apache-2
  • GitHub (👨‍💻 30 · 🔀 480 · 📥 2.2K · 📦 39 · 📋 280 - 62% open · ⏱️ 03.03.2020):

     git clone https://github.com/aksnzhy/xlearn
    
  • PyPi (📥 6.5K / month · 📦 1 · ⏱️ 04.12.2018):

     pip install xlearn
    
SHOGUN (🥉23 · ⭐ 2.8K) - Unified and efficient Machine Learning. BSD-3
  • GitHub (👨‍💻 250 · 🔀 1K · 📋 1.6K - 33% open · ⏱️ 08.12.2020):

     git clone https://github.com/shogun-toolbox/shogun
    
  • Conda (📥 90K · ⏱️ 25.06.2018):

     conda install -c conda-forge shogun
    
  • Docker Hub (📥 1.4K · ⭐ 1 · ⏱️ 31.01.2019):

     docker pull shogun/shogun
    
fklearn (🥉22 · ⭐ 1.3K) - fklearn: Functional Machine Learning. Apache-2
  • GitHub (👨‍💻 35 · 🔀 150 · 📦 9 · 📋 38 - 50% open · ⏱️ 20.11.2020):

     git clone https://github.com/nubank/fklearn
    
  • PyPi (📥 5.2K / month · ⏱️ 02.10.2020):

     pip install fklearn
    
mace (🥉21 · ⭐ 4.3K) - MACE is a deep learning inference framework optimized for mobile.. Apache-2
  • GitHub (👨‍💻 54 · 🔀 760 · 📥 1.3K · 📋 620 - 6% open · ⏱️ 10.01.2021):

     git clone https://github.com/XiaoMi/mace
    
Neural Tangents (🥉21 · ⭐ 1.3K) - Fast and Easy Infinite Neural Networks in Python. Apache-2
  • GitHub (👨‍💻 16 · 🔀 140 · 📥 79 · 📦 12 · 📋 74 - 33% open · ⏱️ 06.01.2021):

     git clone https://github.com/google/neural-tangents
    
  • PyPi (📥 470 / month · ⏱️ 18.11.2020):

     pip install neural-tangents
    
ThunderSVM (🥉20 · ⭐ 1.3K) - ThunderSVM: A Fast SVM Library on GPUs and CPUs. Apache-2
  • GitHub (👨‍💻 33 · 🔀 160 · 📥 2.1K · 📋 190 - 22% open · ⏱️ 14.12.2020):

     git clone https://github.com/Xtra-Computing/thundersvm
    
  • PyPi (📥 350 / month · ⏱️ 13.03.2020):

     pip install thundersvm
    
Haiku (🥉20 · ⭐ 900) - JAX-based neural network library. Apache-2
  • GitHub (👨‍💻 34 · 🔀 63 · 📦 56 · 📋 66 - 27% open · ⏱️ 16.01.2021):

     git clone https://github.com/deepmind/dm-haiku
    
Torchbearer (🥉18 · ⭐ 580 · 💤) - torchbearer: A model fitting library for PyTorch. MIT
  • GitHub (👨‍💻 13 · 🔀 62 · 📦 37 · 📋 240 - 3% open · ⏱️ 17.02.2020):

     git clone https://github.com/pytorchbearer/torchbearer
    
  • PyPi (📥 190 / month · 📦 3 · ⏱️ 31.01.2020):

     pip install torchbearer
    
ThunderGBM (🥉16 · ⭐ 580) - ThunderGBM: Fast GBDTs and Random Forests on GPUs. Apache-2
  • GitHub (👨‍💻 10 · 🔀 71 · 📋 46 - 43% open · ⏱️ 05.01.2021):

     git clone https://github.com/Xtra-Computing/thundergbm
    
  • PyPi (📥 100 / month · ⏱️ 01.05.2020):

     pip install thundergbm
    
NeoML (🥉13 · ⭐ 560) - Machine learning framework for both deep learning and traditional.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 78 · 📋 29 - 55% open · ⏱️ 15.01.2021):

     git clone https://github.com/neoml-lib/neoml
    
Show 8 hidden projects...
  • dlib (🥈33 · ⭐ 9.8K) - A toolkit for making real world machine learning and data analysis.. ❗️BSL-1.0
  • NuPIC (🥉25 · ⭐ 6.2K · 💀) - Numenta Platform for Intelligent Computing is an implementation.. ❗️AGPL-3.0
  • Lasagne (🥉24 · ⭐ 3.8K · 💀) - Lightweight library to build and train neural networks in Theano. MIT
  • neon (🥉22 · ⭐ 3.9K · 💀) - Intel Nervana reference deep learning framework committed to best.. Apache-2
  • MindsDB (🥉20 · ⭐ 3.1K) - Predictive AI layer for existing databases. ❗️GPL-3.0
  • NeuPy (🥉20 · ⭐ 660 · 💀) - NeuPy is a Tensorflow based python library for prototyping and building.. MIT
  • elegy (🥉16 · ⭐ 150) - Elegy is a Neural Networks framework based on Jax and inspired.. Apache-2 jax
  • StarSpace (🥉13 · ⭐ 3.5K · 💀) - Learning embeddings for classification, retrieval and ranking. MIT

Data Visualization

Back to top

General-purpose and task-specific data visualization libraries.

Matplotlib (🥇41 · ⭐ 13K) - matplotlib: plotting with Python. Python-2.0
  • GitHub (👨‍💻 1.2K · 🔀 5.6K · 📦 310K · 📋 7.6K - 21% open · ⏱️ 17.01.2021):

     git clone https://github.com/matplotlib/matplotlib
    
  • PyPi (📥 8.7M / month · 📦 79K · ⏱️ 12.11.2020):

     pip install matplotlib
    
  • Conda (📥 7.7M · ⏱️ 14.01.2021):

     conda install -c conda-forge matplotlib
    
Plotly (🥇35 · ⭐ 8.7K) - The interactive graphing library for Python (includes Plotly Express). MIT
  • GitHub (👨‍💻 160 · 🔀 1.7K · 📦 5 · 📋 1.8K - 42% open · ⏱️ 14.01.2021):

     git clone https://github.com/plotly/plotly.py
    
  • PyPi (📥 2.5M / month · 📦 5K · ⏱️ 12.01.2021):

     pip install plotly
    
  • Conda (📥 1.2M · ⏱️ 12.01.2021):

     conda install -c conda-forge plotly
    
  • NPM (📥 63K / month · 📦 4 · ⏱️ 12.01.2021):

     npm install plotlywidget
    
Seaborn (🥇35 · ⭐ 8K) - Statistical data visualization using matplotlib. BSD-3
  • GitHub (👨‍💻 150 · 🔀 1.4K · 📥 110 · 📦 78K · 📋 1.8K - 4% open · ⏱️ 16.01.2021):

     git clone https://github.com/mwaskom/seaborn
    
  • PyPi (📥 1.7M / month · 📦 13K · ⏱️ 20.12.2020):

     pip install seaborn
    
  • Conda (📥 1.9M · ⏱️ 21.12.2020):

     conda install -c conda-forge seaborn
    
dash (🥇34 · ⭐ 14K) - Analytical Web Apps for Python, R, Julia, and Jupyter. No JavaScript Required. MIT
  • GitHub (👨‍💻 72 · 🔀 1.4K · 📦 17K · 📋 960 - 41% open · ⏱️ 17.01.2021):

     git clone https://github.com/plotly/dash
    
  • PyPi (📥 210K / month · 📦 1.6K · ⏱️ 09.12.2020):

     pip install dash
    
  • Conda (📥 210K · ⏱️ 11.12.2020):

     conda install -c conda-forge dash
    
Bokeh (🥇33 · ⭐ 15K) - Interactive Data Visualization in the browser, from Python. BSD-3
  • GitHub (👨‍💻 540 · 🔀 3.6K · 📦 30K · 📋 6.3K - 9% open · ⏱️ 17.01.2021):

     git clone https://github.com/bokeh/bokeh
    
  • PyPi (📥 990K / month · 📦 5.9K · ⏱️ 11.01.2021):

     pip install bokeh
    
  • Conda (📥 3.6M · ⏱️ 13.01.2021):

     conda install -c conda-forge bokeh
    
pyecharts (🥈31 · ⭐ 10K) - Python Echarts Plotting Library. MIT
  • GitHub (👨‍💻 29 · 🔀 2.3K · 📦 1.4K · 📋 1.4K - 1% open · ⏱️ 23.12.2020):

     git clone https://github.com/pyecharts/pyecharts
    
  • PyPi (📥 32K / month · 📦 280 · ⏱️ 29.10.2020):

     pip install pyecharts
    
wordcloud (🥈31 · ⭐ 7.8K) - A little word cloud generator in Python. MIT
  • GitHub (👨‍💻 58 · 🔀 2K · 📦 8.4K · 📋 440 - 21% open · ⏱️ 11.11.2020):

     git clone https://github.com/amueller/word_cloud
    
  • PyPi (📥 220K / month · 📦 1.1K · ⏱️ 11.11.2020):

     pip install wordcloud
    
  • Conda (📥 180K · ⏱️ 14.01.2021):

     conda install -c conda-forge wordcloud
    
Altair (🥈31 · ⭐ 6.3K) - Declarative statistical visualization library for Python. BSD-3
  • GitHub (👨‍💻 120 · 🔀 570 · 📦 7.3K · 📋 1.5K - 19% open · ⏱️ 12.01.2021):

     git clone https://github.com/altair-viz/altair
    
  • PyPi (📥 630K / month · 📦 370 · ⏱️ 01.04.2020):

     pip install altair
    
  • Conda (📥 580K · ⏱️ 01.04.2020):

     conda install -c conda-forge altair
    
bqplot (🥈30 · ⭐ 3K) - Plotting library for IPython/Jupyter notebooks. Apache-2
  • GitHub (👨‍💻 51 · 🔀 400 · 📦 1.2K · 📋 500 - 37% open · ⏱️ 14.01.2021):

     git clone https://github.com/bqplot/bqplot
    
  • PyPi (📥 11K / month · 📦 110 · ⏱️ 14.01.2021):

     pip install bqplot
    
  • Conda (📥 450K · ⏱️ 14.01.2021):

     conda install -c conda-forge bqplot
    
  • NPM (📥 110K / month · 📦 10 · ⏱️ 14.01.2021):

     npm install bqplot
    
pandas-profiling (🥈29 · ⭐ 6.6K) - Create HTML profiling reports from pandas DataFrame.. MIT
  • GitHub (👨‍💻 65 · 🔀 980 · 📦 2.9K · 📋 410 - 13% open · ⏱️ 12.01.2021):

     git clone https://github.com/pandas-profiling/pandas-profiling
    
  • PyPi (📥 190K / month · 📦 160 · ⏱️ 03.09.2020):

     pip install pandas-profiling
    
  • Conda (📥 97K · ⏱️ 09.01.2021):

     conda install -c conda-forge pandas-profiling
    
UMAP (🥈29 · ⭐ 4.4K) - Uniform Manifold Approximation and Projection. BSD-3
  • GitHub (👨‍💻 69 · 🔀 480 · 📋 420 - 48% open · ⏱️ 11.01.2021):

     git clone https://github.com/lmcinnes/umap
    
  • PyPi (📥 130K / month · 📦 230 · ⏱️ 11.01.2021):

     pip install umap-learn
    
PyQtGraph (🥈29 · ⭐ 2.3K) - Fast data visualization and GUI tools for scientific / engineering.. MIT
  • GitHub (👨‍💻 180 · 🔀 810 · 📋 720 - 39% open · ⏱️ 13.01.2021):

     git clone https://github.com/pyqtgraph/pyqtgraph
    
  • PyPi (📥 25K / month · 📦 890 · ⏱️ 20.12.2020):

     pip install pyqtgraph
    
  • Conda (📥 160K · ⏱️ 20.12.2020):

     conda install -c conda-forge pyqtgraph
    
HoloViews (🥈29 · ⭐ 1.8K) - With Holoviews, your data visualizes itself. BSD-3
  • GitHub (👨‍💻 100 · 🔀 300 · 📋 2.5K - 27% open · ⏱️ 14.01.2021):

     git clone https://github.com/holoviz/holoviews
    
  • PyPi (📥 74K / month · 📦 170 · ⏱️ 27.12.2020):

     pip install holoviews
    
  • Conda (📥 400K · ⏱️ 28.12.2020):

     conda install -c conda-forge holoviews
    
  • NPM (📥 6.2K / month · ⏱️ 24.05.2020):

     npm install @pyviz/jupyterlab_pyviz
    
Graphviz (🥈29 · ⭐ 900) - Simple Python interface for Graphviz. MIT
  • GitHub (👨‍💻 15 · 🔀 140 · 📦 17K · 📋 100 - 4% open · ⏱️ 05.01.2021):

     git clone https://github.com/xflr6/graphviz
    
  • PyPi (📥 2.4M / month · 📦 4.4K · ⏱️ 24.12.2020):

     pip install graphviz
    
VisPy (🥈28 · ⭐ 2.6K) - High-performance interactive 2D/3D data visualization library. BSD-3
  • GitHub (👨‍💻 140 · 🔀 540 · 📦 440 · 📋 1.1K - 31% open · ⏱️ 14.01.2021):

     git clone https://github.com/vispy/vispy
    
  • PyPi (📥 13K / month · 📦 120 · ⏱️ 28.11.2020):

     pip install vispy
    
  • Conda (📥 120K · ⏱️ 13.01.2021):

     conda install -c conda-forge vispy
    
  • NPM (📥 130 / month · ⏱️ 15.03.2020):

     npm install vispy
    
datashader (🥈28 · ⭐ 2.4K) - Quickly and accurately render even the largest data. BSD-3
  • GitHub (👨‍💻 43 · 🔀 310 · 📦 560 · 📋 460 - 31% open · ⏱️ 07.01.2021):

     git clone https://github.com/holoviz/datashader
    
  • PyPi (📥 10K / month · 📦 70 · ⏱️ 07.01.2021):

     pip install datashader
    
  • Conda (📥 130K · ⏱️ 08.01.2021):

     conda install -c conda-forge datashader
    
missingno (🥈27 · ⭐ 2.6K) - Missing data visualization module for Python. MIT
  • GitHub (👨‍💻 15 · 🔀 330 · 📦 2.7K · 📋 100 - 14% open · ⏱️ 28.12.2020):

     git clone https://github.com/ResidentMario/missingno
    
  • PyPi (📥 160K / month · 📦 76 · ⏱️ 29.06.2018):

     pip install missingno
    
  • Conda (📥 68K · ⏱️ 15.02.2020):

     conda install -c conda-forge missingno
    
data-validation (🥈27 · ⭐ 500) - Library for exploring and validating machine learning.. Apache-2
  • GitHub (👨‍💻 20 · 🔀 94 · 📥 200 · 📦 220 · 📋 110 - 34% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/data-validation
    
  • PyPi (📥 6.8M / month · 📦 18 · ⏱️ 17.12.2020):

     pip install tensorflow-data-validation
    
Perspective (🥉26 · ⭐ 3.1K) - Streaming pivot visualization via WebAssembly. Apache-2
  • GitHub (👨‍💻 61 · 🔀 340 · 📦 160 · 📋 370 - 19% open · ⏱️ 17.01.2021):

     git clone https://github.com/finos/perspective
    
  • PyPi (📥 510 / month · 📦 4 · ⏱️ 14.01.2021):

     pip install perspective-python
    
  • NPM (📥 920 / month · ⏱️ 08.01.2021):

     npm install @finos/perspective-jupyterlab
    
Cufflinks (🥉26 · ⭐ 2.1K) - Productivity Tools for Plotly + Pandas. MIT
  • GitHub (👨‍💻 37 · 🔀 530 · 📦 2.5K · 📋 200 - 40% open · ⏱️ 07.10.2020):

     git clone https://github.com/santosjorge/cufflinks
    
  • PyPi (📥 47K / month · 📦 230 · ⏱️ 01.03.2020):

     pip install cufflinks
    
PyVista (🥉26 · ⭐ 660) - 3D plotting and mesh analysis through a streamlined interface for the.. MIT
  • GitHub (👨‍💻 49 · 🔀 130 · 📥 38 · 📦 240 · 📋 380 - 30% open · ⏱️ 17.01.2021):

     git clone https://github.com/pyvista/pyvista
    
  • PyPi (📥 6.8K / month · 📦 26 · ⏱️ 10.12.2020):

     pip install pyvista
    
  • Conda (📥 54K · ⏱️ 10.12.2020):

     conda install -c conda-forge pyvista
    
HyperTools (🥉25 · ⭐ 1.6K) - A Python toolbox for gaining geometric insights into high-dimensional.. MIT
  • GitHub (👨‍💻 21 · 🔀 150 · 📦 96 · 📋 190 - 36% open · ⏱️ 06.01.2021):

     git clone https://github.com/ContextLab/hypertools
    
  • PyPi (📥 180 / month · 📦 9 · ⏱️ 02.10.2020):

     pip install hypertools
    
hvPlot (🥉25 · ⭐ 330) - A high-level plotting API for pandas, dask, xarray, and networkx built on.. BSD-3
  • GitHub (👨‍💻 22 · 🔀 49 · 📦 430 · 📋 320 - 30% open · ⏱️ 14.01.2021):

     git clone https://github.com/holoviz/hvplot
    
  • PyPi (📥 32K / month · 📦 15 · ⏱️ 02.06.2020):

     pip install hvplot
    
  • Conda (📥 58K · ⏱️ 06.01.2021):

     conda install -c conda-forge hvplot
    
Chartify (🥉24 · ⭐ 2.8K) - Python library that makes it easy for data scientists to create.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 250 · 📦 52 · 📋 70 - 57% open · ⏱️ 02.11.2020):

     git clone https://github.com/spotify/chartify
    
  • PyPi (📥 4.6K / month · 📦 5 · ⏱️ 02.11.2020):

     pip install chartify
    
  • Conda (📥 12K · ⏱️ 07.11.2020):

     conda install -c conda-forge chartify
    
pythreejs (🥉24 · ⭐ 690) - A Jupyter - Three.js bridge. BSD-3
  • GitHub (👨‍💻 24 · 🔀 160 · 📦 15 · 📋 200 - 30% open · ⏱️ 09.10.2020):

     git clone https://github.com/jupyter-widgets/pythreejs
    
  • PyPi (📥 5.4K / month · 📦 13 · ⏱️ 09.10.2020):

     pip install pythreejs
    
  • Conda (📥 260K · ⏱️ 12.10.2020):

     conda install -c conda-forge pythreejs
    
  • NPM (📥 4.2K / month · 📦 8 · ⏱️ 19.03.2020):

     npm install jupyter-threejs
    
Facets Overview (🥉23 · ⭐ 6.5K) - Visualizations for machine learning datasets. Apache-2
  • GitHub (👨‍💻 27 · 🔀 790 · 📦 37 · 📋 150 - 48% open · ⏱️ 27.10.2020):

     git clone https://github.com/pair-code/facets
    
  • PyPi (📥 29K / month · ⏱️ 24.07.2019):

     pip install facets-overview
    
Multicore-TSNE (🥉23 · ⭐ 1.5K) - Parallel t-SNE implementation with Python and Torch.. BSD-3
  • GitHub (👨‍💻 15 · 🔀 190 · 📦 200 · 📋 53 - 62% open · ⏱️ 19.08.2020):

     git clone https://github.com/DmitryUlyanov/Multicore-TSNE
    
  • PyPi (📥 2.6K / month · 📦 14 · ⏱️ 08.11.2017):

     pip install MulticoreTSNE
    
  • Conda (📥 5.8K · ⏱️ 12.11.2018):

     conda install -c conda-forge multicore-tsne
    
openTSNE (🥉23 · ⭐ 740) - Extensible, parallel implementations of t-SNE. BSD-3
  • GitHub (👨‍💻 10 · 🔀 82 · 📦 170 · 📋 71 - 4% open · ⏱️ 08.01.2021):

     git clone https://github.com/pavlin-policar/openTSNE
    
  • PyPi (📥 8K / month · 📦 4 · ⏱️ 08.01.2021):

     pip install opentsne
    
  • Conda (📥 73K · ⏱️ 08.01.2021):

     conda install -c conda-forge opentsne
    
PandasGUI (🥉22 · ⭐ 2K) - A GUI for Pandas DataFrames. MIT
  • GitHub (👨‍💻 7 · 🔀 110 · 📦 29 · 📋 65 - 20% open · ⏱️ 11.01.2021):

     git clone https://github.com/adamerose/pandasgui
    
  • PyPi (📥 1.6K / month · 📦 1 · ⏱️ 10.11.2020):

     pip install pandasgui
    
Pandas-Bokeh (🥉22 · ⭐ 600) - Bokeh Plotting Backend for Pandas and GeoPandas. MIT
  • GitHub (👨‍💻 11 · 🔀 75 · 📦 120 · 📋 72 - 29% open · ⏱️ 27.11.2020):

     git clone https://github.com/PatrikHlobil/Pandas-Bokeh
    
  • PyPi (📥 5.6K / month · 📦 10 · ⏱️ 27.11.2020):

     pip install pandas-bokeh
    
python-ternary (🥉22 · ⭐ 380) - Ternary plotting library for python with matplotlib. MIT
  • GitHub (👨‍💻 25 · 🔀 110 · 📥 14 · 📦 50 · 📋 100 - 23% open · ⏱️ 05.01.2021):

     git clone https://github.com/marcharper/python-ternary
    
  • PyPi (📥 970 / month · 📦 10 · ⏱️ 10.05.2020):

     pip install python-ternary
    
  • Conda (📥 48K · ⏱️ 10.05.2020):

     conda install -c conda-forge python-ternary
    
vega (🥉22 · ⭐ 290) - IPython/Jupyter notebook module for Vega and Vega-Lite. BSD-3
  • GitHub (👨‍💻 9 · 🔀 46 · 📋 87 - 9% open · ⏱️ 11.01.2021):

     git clone https://github.com/vega/ipyvega
    
  • PyPi (📥 8.7K / month · 📦 150 · ⏱️ 15.05.2020):

     pip install vega
    
  • Conda (📥 370K · ⏱️ 10.12.2020):

     conda install -c conda-forge vega
    
joypy (🥉21 · ⭐ 310) - Joyplots in Python with matplotlib & pandas. MIT
  • GitHub (👨‍💻 5 · 🔀 32 · 📦 62 · 📋 38 - 10% open · ⏱️ 28.12.2020):

     git clone https://github.com/sbebo/joypy
    
  • PyPi (📥 2.7K / month · 📦 6 · ⏱️ 28.12.2020):

     pip install joypy
    
  • Conda (📥 7.8K · ⏱️ 28.12.2020):

     conda install -c conda-forge joypy
    
HiPlot (🥉19 · ⭐ 1.9K) - HiPlot makes understanding high dimensional data easy. MIT
  • GitHub (👨‍💻 6 · 🔀 89 · 📦 2 · 📋 47 - 12% open · ⏱️ 11.01.2021):

     git clone https://github.com/facebookresearch/hiplot
    
  • PyPi (📥 1.9K / month · ⏱️ 23.12.2020):

     pip install hiplot
    
  • Conda (📥 41K · ⏱️ 23.12.2020):

     conda install -c conda-forge hiplot
    
Sweetviz (🥉19 · ⭐ 1.2K) - Visualize and compare datasets, target values and associations, with one.. MIT
  • GitHub (👨‍💻 6 · 🔀 130 · 📋 64 - 32% open · ⏱️ 10.12.2020):

     git clone https://github.com/fbdesignpro/sweetviz
    
  • PyPi (📥 7.2K / month · ⏱️ 10.12.2020):

     pip install sweetviz
    
lets-plot (🥉19 · ⭐ 480) - An open-source plotting library for statistical data. MIT
  • GitHub (👨‍💻 14 · 🔀 16 · 📋 110 - 36% open · ⏱️ 15.01.2021):

     git clone https://github.com/JetBrains/lets-plot
    
  • PyPi (📥 5.1K / month · ⏱️ 23.12.2020):

     pip install lets-plot
    
animatplot (🥉19 · ⭐ 350) - A python package for animating plots build on matplotlib. MIT
  • GitHub (👨‍💻 7 · 🔀 33 · 📦 14 · 📋 31 - 51% open · ⏱️ 05.10.2020):

     git clone https://github.com/t-makaro/animatplot
    
  • PyPi (📥 120 / month · 📦 1 · ⏱️ 05.10.2020):

     pip install animatplot
    
  • Conda (📥 5.1K · ⏱️ 06.10.2020):

     conda install -c conda-forge animatplot
    
AutoViz (🥉19 · ⭐ 290) - Automatically Visualize any dataset, any size with a single line of.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 54 · 📦 26 · 📋 21 - 23% open · ⏱️ 21.12.2020):

     git clone https://github.com/AutoViML/AutoViz
    
  • PyPi (📥 910 / month · ⏱️ 10.12.2020):

     pip install autoviz
    
PyWaffle (🥉18 · ⭐ 380) - Make Waffle Charts in Python. MIT
  • GitHub (👨‍💻 6 · 🔀 61 · 📦 39 · 📋 12 - 16% open · ⏱️ 26.07.2020):

     git clone https://github.com/gyli/PyWaffle
    
  • PyPi (📥 740 / month · ⏱️ 26.07.2020):

     pip install pywaffle
    
FiftyOne (🥉18 · ⭐ 180 · ➕) - Visualize, create, and debug image and video datasets.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 15 · 📋 290 - 38% open · ⏱️ 16.01.2021):

     git clone https://github.com/voxel51/fiftyone
    
  • PyPi (📥 570 / month · ⏱️ 16.01.2021):

     pip install fiftyone
    
nx-altair (🥉13 · ⭐ 150 · 💤) - Draw interactive NetworkX graphs with Altair. MIT
  • GitHub (👨‍💻 3 · 🔀 19 · 📋 8 - 50% open · ⏱️ 02.06.2020):

     git clone https://github.com/Zsailer/nx_altair
    
  • PyPi (📥 130 / month · 📦 2 · ⏱️ 02.06.2020):

     pip install nx-altair
    
Show 6 hidden projects...
  • plotnine (🥈28 · ⭐ 2.5K) - A grammar of graphics for Python. ❗️GPL-2.0
  • PDPbox (🥉23 · ⭐ 520 · 💀) - python partial dependence plot toolbox. MIT
  • pivottablejs (🥉19 · ⭐ 410 · 💀) - Dragndrop Pivot Tables and Charts for Jupyter/IPython.. MIT
  • ivis (🥉18 · ⭐ 220) - Dimensionality reduction in very large datasets using Siamese.. ❗️GPL-2.0
  • pdvega (🥉16 · ⭐ 340 · 💀) - Interactive plotting for Pandas using Vega-Lite. MIT
  • nptsne (🥉14 · ⭐ 24) - nptsne is a numpy compatible python binary package that offers a number.. Apache-2

Text Data & NLP

Back to top

Libraries for processing, cleaning, manipulating, and analyzing text data as well as libraries for NLP tasks such as language detection, fuzzy matching, classification, seq2seq learning, conversational AI, keyword extraction, and translation.

spaCy (🥇37 · ⭐ 18K) - Industrial-strength Natural Language Processing (NLP) with Python and Cython. MIT
  • GitHub (👨‍💻 540 · 🔀 3.2K · 📥 2.9K · 📦 20K · 📋 4.2K - 2% open · ⏱️ 17.01.2021):

     git clone https://github.com/explosion/spaCy
    
  • PyPi (📥 840K / month · 📦 3.1K · ⏱️ 11.12.2020):

     pip install spacy
    
  • Conda (📥 1.4M · ⏱️ 18.12.2020):

     conda install -c conda-forge spacy
    
transformers (🥇36 · ⭐ 40K) - Transformers: State-of-the-art Natural Language.. Apache-2
  • GitHub (👨‍💻 750 · 🔀 9.7K · 📥 1.2K · 📦 7K · 📋 5.7K - 9% open · ⏱️ 15.01.2021):

     git clone https://github.com/huggingface/transformers
    
  • PyPi (📥 640K / month · 📦 130 · ⏱️ 14.01.2021):

     pip install transformers
    
  • Conda (📥 15K · ⏱️ 14.01.2021):

     conda install -c conda-forge transformers
    
nltk (🥇34 · ⭐ 9.6K) - Suite of libraries and programs for symbolic and statistical natural.. Apache-2
  • GitHub (👨‍💻 390 · 🔀 2.4K · 📦 86K · 📋 1.5K - 16% open · ⏱️ 02.01.2021):

     git clone https://github.com/nltk/nltk
    
  • PyPi (📥 4.6M / month · 📦 21K · ⏱️ 12.04.2020):

     pip install nltk
    
  • Conda (📥 610K · ⏱️ 08.08.2019):

     conda install -c conda-forge nltk
    
Rasa (🥇32 · ⭐ 11K) - Open source machine learning framework to automate text- and voice-.. Apache-2
  • GitHub (👨‍💻 450 · 🔀 3.2K · 📋 4.9K - 11% open · ⏱️ 14.01.2021):

     git clone https://github.com/RasaHQ/rasa
    
  • PyPi (📥 69K / month · 📦 64 · ⏱️ 12.01.2021):

     pip install rasa
    
fairseq (🥇31 · ⭐ 11K) - Facebook AI Research Sequence-to-Sequence Toolkit written in Python. MIT
  • GitHub (👨‍💻 280 · 🔀 2.7K · 📥 57 · 📦 270 · 📋 2.4K - 25% open · ⏱️ 17.01.2021):

     git clone https://github.com/pytorch/fairseq
    
  • PyPi (📥 8.4K / month · 📦 20 · ⏱️ 05.01.2021):

     pip install fairseq
    
ChatterBot (🥇31 · ⭐ 11K) - ChatterBot is a machine learning, conversational dialog engine for.. BSD-3
  • GitHub (👨‍💻 100 · 🔀 3.7K · 📦 3.1K · 📋 1.4K - 15% open · ⏱️ 20.08.2020):

     git clone https://github.com/gunthercox/ChatterBot
    
  • PyPi (📥 24K / month · 📦 680 · ⏱️ 22.08.2020):

     pip install chatterbot
    
sentencepiece (🥇31 · ⭐ 4.7K) - Unsupervised text tokenizer for Neural Network-based text.. Apache-2
  • GitHub (👨‍💻 48 · 🔀 630 · 📥 9.6K · 📦 5.4K · 📋 410 - 5% open · ⏱️ 12.01.2021):

     git clone https://github.com/google/sentencepiece
    
  • PyPi (📥 980K / month · 📦 240 · ⏱️ 10.01.2021):

     pip install sentencepiece
    
  • Conda (📥 23K · ⏱️ 08.01.2021):

     conda install -c conda-forge sentencepiece
    
fastText (🥇30 · ⭐ 22K · 📈) - Library for fast text representation and classification. MIT
  • GitHub (👨‍💻 58 · 🔀 4.2K · 📦 1.4K · 📋 990 - 40% open · ⏱️ 18.07.2020):

     git clone https://github.com/facebookresearch/fastText
    
  • PyPi (📥 110K / month · 📦 190 · ⏱️ 28.04.2020):

     pip install fasttext
    
  • Conda (📥 17K · ⏱️ 12.10.2020):

     conda install -c conda-forge fasttext
    
flair (🥇30 · ⭐ 9.8K) - A very simple framework for state-of-the-art Natural Language.. MIT
  • GitHub (👨‍💻 160 · 🔀 1.4K · 📦 580 · 📋 1.4K - 7% open · ⏱️ 13.01.2021):

     git clone https://github.com/flairNLP/flair
    
  • PyPi (📥 37K / month · 📦 48 · ⏱️ 01.12.2020):

     pip install flair
    
torchtext (🥇30 · ⭐ 2.6K) - Data loaders and abstractions for text and NLP. BSD-3
  • GitHub (👨‍💻 100 · 🔀 600 · 📦 3.6K · 📋 570 - 45% open · ⏱️ 15.01.2021):

     git clone https://github.com/pytorch/text
    
  • PyPi (📥 78K / month · 📦 680 · ⏱️ 10.12.2020):

     pip install torchtext
    
AllenNLP (🥈29 · ⭐ 9.6K) - An open-source NLP research library, built on PyTorch. Apache-2
  • GitHub (👨‍💻 230 · 🔀 2K · 📥 36 · 📦 1.4K · 📋 2.2K - 4% open · ⏱️ 14.01.2021):

     git clone https://github.com/allenai/allennlp
    
  • PyPi (📥 16K / month · 📦 230 · ⏱️ 17.11.2020):

     pip install allennlp
    
TextBlob (🥈29 · ⭐ 7.5K) - Simple, Pythonic, text processing--Sentiment analysis, part-of-speech.. MIT
  • GitHub (👨‍💻 33 · 🔀 950 · 📥 88 · 📦 9.8K · 📋 220 - 31% open · ⏱️ 11.01.2021):

     git clone https://github.com/sloria/TextBlob
    
  • PyPi (📥 240K / month · 📦 2.5K · ⏱️ 24.02.2019):

     pip install textblob
    
  • Conda (📥 110K · ⏱️ 24.02.2019):

     conda install -c conda-forge textblob
    
snowballstemmer (🥈29 · ⭐ 470) - Snowball compiler and stemming algorithms. BSD-3
  • GitHub (👨‍💻 24 · 🔀 130 · 📦 42K · 📋 61 - 31% open · ⏱️ 23.11.2020):

     git clone https://github.com/snowballstem/snowball
    
  • PyPi (📥 2.1M / month · 📦 13K · ⏱️ 03.10.2019):

     pip install snowballstemmer
    
  • Conda (📥 2M · ⏱️ 03.10.2019):

     conda install -c conda-forge snowballstemmer
    
Tokenizers (🥈28 · ⭐ 4.2K · 📈) - Fast State-of-the-Art Tokenizers optimized for Research and.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 310 · 📦 24 · 📋 370 - 21% open · ⏱️ 12.01.2021):

     git clone https://github.com/huggingface/tokenizers
    
  • PyPi (📥 840K / month · ⏱️ 12.01.2021):

     pip install tokenizers
    
  • Conda (📥 18K · ⏱️ 19.11.2020):

     conda install -c conda-forge tokenizers
    
sentence-transformers (🥈28 · ⭐ 3.9K · ➕) - Multilingual Sentence Embeddings using BERT / RoBERTa.. Apache-2 bert xlnet
  • GitHub (👨‍💻 38 · 🔀 740 · 📦 560 · 📋 640 - 46% open · ⏱️ 15.01.2021):

     git clone https://github.com/UKPLab/sentence-transformers
    
  • PyPi (📥 76K / month · ⏱️ 04.01.2021):

     pip install sentence-transformers
    
Dedupe (🥈28 · ⭐ 2.9K) - A python library for accurate and scalable fuzzy matching, record.. MIT
  • GitHub (👨‍💻 55 · 🔀 400 · 📦 180 · 📋 620 - 8% open · ⏱️ 15.01.2021):

     git clone https://github.com/dedupeio/dedupe
    
  • PyPi (📥 47K / month · 📦 78 · ⏱️ 06.09.2020):

     pip install dedupe
    
phonenumbers (🥈28 · ⭐ 2.6K) - Python port of Google's libphonenumber. Apache-2
  • GitHub (👨‍💻 22 · 🔀 330 · 📋 110 - 2% open · ⏱️ 12.01.2021):

     git clone https://github.com/daviddrysdale/python-phonenumbers
    
  • PyPi (📥 670K / month · 📦 2.3K · ⏱️ 12.01.2021):

     pip install phonenumbers
    
  • Conda (📥 370K · ⏱️ 04.08.2019):

     conda install -c conda-forge phonenumbers
    
inflect (🥈28 · ⭐ 470) - Correctly generate plurals, ordinals, indefinite articles; convert numbers.. MIT
  • GitHub (👨‍💻 25 · 🔀 57 · 📋 71 - 18% open · ⏱️ 15.11.2020):

     git clone https://github.com/jaraco/inflect
    
  • PyPi (📥 860K / month · 📦 1.4K · ⏱️ 15.11.2020):

     pip install inflect
    
  • Conda (📥 120K · ⏱️ 08.01.2021):

     conda install -c conda-forge inflect
    
OpenNMT (🥈27 · ⭐ 4.8K) - Open Source Neural Machine Translation in PyTorch. MIT
  • GitHub (👨‍💻 160 · 🔀 1.8K · 📦 65 · 📋 1.2K - 7% open · ⏱️ 13.01.2021):

     git clone https://github.com/OpenNMT/OpenNMT-py
    
  • PyPi (📥 4K / month · 📦 4 · ⏱️ 10.11.2020):

     pip install OpenNMT-py
    
GluonNLP (🥈27 · ⭐ 2.2K) - Toolkit that enables easy text preprocessing, datasets loading.. Apache-2
  • GitHub (👨‍💻 78 · 🔀 490 · 📦 370 · 📋 510 - 44% open · ⏱️ 16.01.2021):

     git clone https://github.com/dmlc/gluon-nlp
    
  • PyPi (📥 31K / month · 📦 11 · ⏱️ 13.08.2020):

     pip install gluonnlp
    
Jina (🥈27 · ⭐ 1.9K · 📈) - An easier way to build neural search in the cloud. Apache-2
  • GitHub (👨‍💻 81 · 🔀 320 · 📦 45 · 📋 590 - 7% open · ⏱️ 17.01.2021):

     git clone https://github.com/jina-ai/jina
    
  • PyPi (📥 1.9K / month · ⏱️ 17.01.2021):

     pip install jina
    
  • Docker Hub (📥 74K · ⏱️ 17.01.2021):

     docker pull jinaai/jina
    
textacy (🥈27 · ⭐ 1.6K) - NLP, before and after spaCy. Apache-2
  • GitHub (👨‍💻 29 · 🔀 210 · 📋 220 - 13% open · ⏱️ 09.01.2021):

     git clone https://github.com/chartbeat-labs/textacy
    
  • PyPi (📥 19K / month · 📦 77 · ⏱️ 29.08.2020):

     pip install textacy
    
  • Conda (📥 74K · ⏱️ 19.11.2020):

     conda install -c conda-forge textacy
    
DeepPavlov (🥈26 · ⭐ 4.9K) - An open source library for deep learning end-to-end dialog.. Apache-2
  • GitHub (👨‍💻 61 · 🔀 890 · 📦 170 · 📋 520 - 15% open · ⏱️ 24.12.2020):

     git clone https://github.com/deepmipt/DeepPavlov
    
  • PyPi (📥 4.3K / month · 📦 4 · ⏱️ 24.12.2020):

     pip install deeppavlov
    
TensorFlow Text (🥈26 · ⭐ 680) - Making text a first-class citizen in TensorFlow. Apache-2
  • GitHub (👨‍💻 31 · 🔀 85 · 📦 420 · 📋 120 - 31% open · ⏱️ 16.01.2021):

     git clone https://github.com/tensorflow/text
    
  • PyPi (📥 150K / month · 📦 4 · ⏱️ 13.01.2021):

     pip install tensorflow-text
    
ftfy (🥈25 · ⭐ 2.9K) - Fixes mojibake and other glitches in Unicode text, after the fact. MIT
  • GitHub (👨‍💻 17 · 🔀 98 · 📦 2.6K · 📋 110 - 14% open · ⏱️ 17.07.2020):

     git clone https://github.com/LuminosoInsight/python-ftfy
    
  • PyPi (📥 250K / month · 📦 760 · ⏱️ 20.07.2020):

     pip install ftfy
    
  • Conda (📥 94K · ⏱️ 03.02.2019):

     conda install -c conda-forge ftfy
    
vaderSentiment (🥈25 · ⭐ 2.7K · 💤) - VADER Sentiment Analysis. VADER (Valence Aware Dictionary.. MIT
  • GitHub (👨‍💻 10 · 🔀 720 · 📦 2.2K · 📋 98 - 24% open · ⏱️ 22.05.2020):

     git clone https://github.com/cjhutto/vaderSentiment
    
  • PyPi (📥 100K / month · 📦 290 · ⏱️ 22.05.2020):

     pip install vadersentiment
    
TextDistance (🥈25 · ⭐ 1.9K · 💤) - Compute distance between sequences. 30+ algorithms, pure.. MIT
  • GitHub (👨‍💻 7 · 🔀 160 · 📥 96 · 📦 300 · ⏱️ 13.04.2020):

     git clone https://github.com/life4/textdistance
    
  • PyPi (📥 92K / month · 📦 28 · ⏱️ 13.04.2020):

     pip install textdistance
    
  • Conda (📥 17K · ⏱️ 10.11.2020):

     conda install -c conda-forge textdistance
    
spark-nlp (🥈25 · ⭐ 1.8K) - State of the Art Natural Language Processing. Apache-2
  • GitHub (👨‍💻 73 · 🔀 390 · 📋 400 - 17% open · ⏱️ 15.01.2021):

     git clone https://github.com/JohnSnowLabs/spark-nlp
    
  • PyPi (📥 370K / month · ⏱️ 04.01.2021):

     pip install spark-nlp
    
jellyfish (🥈25 · ⭐ 1.4K) - a python library for doing approximate and phonetic matching of.. BSD-2
  • GitHub (👨‍💻 20 · 🔀 120 · 📦 2K · 📋 95 - 9% open · ⏱️ 30.12.2020):

     git clone https://github.com/jamesturk/jellyfish
    
  • PyPi (📥 740K / month · 📦 650 · ⏱️ 21.05.2020):

     pip install jellyfish
    
  • Conda (📥 110K · ⏱️ 08.01.2021):

     conda install -c conda-forge jellyfish
    
haystack (🥈25 · ⭐ 1.3K) - Transformers at scale for question answering & neural search. Using.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 220 · 📦 25 · 📋 420 - 13% open · ⏱️ 14.01.2021):

     git clone https://github.com/deepset-ai/haystack
    
  • PyPi (📥 2.3K / month · 📦 74 · ⏱️ 03.07.2017):

     pip install haystack
    
ParlAI (🥉24 · ⭐ 6.9K) - A framework for training and evaluating AI models on a variety of.. MIT
  • GitHub (👨‍💻 140 · 🔀 1.4K · 📦 5 · 📋 940 - 11% open · ⏱️ 17.01.2021):

     git clone https://github.com/facebookresearch/ParlAI
    
  • PyPi (📥 670 / month · ⏱️ 16.12.2020):

     pip install parlai
    
PyText (🥉24 · ⭐ 6.1K) - A natural language modeling framework based on PyTorch. BSD-3
  • GitHub (👨‍💻 150 · 🔀 760 · 📥 210 · 📦 58 · 📋 190 - 60% open · ⏱️ 14.01.2021):

     git clone https://github.com/facebookresearch/pytext
    
  • PyPi (📥 220 / month · 📦 1 · ⏱️ 27.04.2020):

     pip install pytext-nlp
    
stanza (🥉24 · ⭐ 5.1K) - Official Stanford NLP Python Library for Many Human Languages. Apache-2
  • GitHub (👨‍💻 28 · 🔀 630 · 📋 430 - 12% open · ⏱️ 12.01.2021):

     git clone https://github.com/stanfordnlp/stanza
    
  • PyPi (📥 12K / month · 📦 2 · ⏱️ 13.08.2020):

     pip install stanza
    
  • Conda (📥 3K · ⏱️ 13.08.2020):

     conda install -c stanfordnlp stanza
    
textgenrnn (🥉24 · ⭐ 4.2K) - Easily train your own text-generating neural network of any size.. MIT
  • GitHub (👨‍💻 19 · 🔀 650 · 📥 390 · 📦 580 · 📋 190 - 57% open · ⏱️ 14.07.2020):

     git clone https://github.com/minimaxir/textgenrnn
    
  • PyPi (📥 1.6K / month · 📦 14 · ⏱️ 02.02.2020):

     pip install textgenrnn
    
T5 (🥉24 · ⭐ 3.1K) - Code for the paper Exploring the Limits of Transfer Learning with a.. Apache-2
  • GitHub (👨‍💻 28 · 🔀 420 · 📦 20 · 📋 320 - 11% open · ⏱️ 15.01.2021):

     git clone https://github.com/google-research/text-to-text-transfer-transformer
    
  • PyPi (📥 5.7K / month · ⏱️ 18.12.2020):

     pip install t5
    
Sumy (🥉24 · ⭐ 2.5K) - Module for automatic summarization of text documents and HTML pages. Apache-2
  • GitHub (👨‍💻 20 · 🔀 430 · 📦 630 · 📋 88 - 11% open · ⏱️ 21.11.2020):

     git clone https://github.com/miso-belica/sumy
    
  • PyPi (📥 5.2K / month · 📦 190 · ⏱️ 19.05.2019):

     pip install sumy
    
fastNLP (🥉24 · ⭐ 2K) - fastNLP: A Modularized and Extensible NLP Framework. Currently still.. Apache-2
  • GitHub (👨‍💻 48 · 🔀 350 · 📥 54 · 📦 21 · 📋 140 - 11% open · ⏱️ 23.11.2020):

     git clone https://github.com/fastnlp/fastNLP
    
  • PyPi (📥 930 / month · 📦 1 · ⏱️ 04.02.2019):

     pip install fastnlp
    
PyTextRank (🥉24 · ⭐ 1.4K) - Python implementation of TextRank for phrase extraction and.. MIT
  • GitHub (👨‍💻 10 · 🔀 290 · 📦 120 · 📋 54 - 31% open · ⏱️ 21.09.2020):

     git clone https://github.com/DerwenAI/pytextrank
    
  • PyPi (📥 11K / month · 📦 6 · ⏱️ 15.09.2020):

     pip install pytextrank
    
CLTK (🥉24 · ⭐ 630) - The Classical Language Toolkit. MIT
  • GitHub (👨‍💻 110 · 🔀 300 · 📥 22 · 📦 150 · 📋 480 - 25% open · ⏱️ 09.01.2021):

     git clone https://github.com/cltk/cltk
    
  • PyPi (📥 1.1K / month · 📦 68 · ⏱️ 01.01.2021):

     pip install cltk
    
pyahocorasick (🥉24 · ⭐ 570) - Python module (C extension and plain python) implementing Aho-.. BSD-3
  • GitHub (👨‍💻 20 · 🔀 86 · 📦 470 · 📋 96 - 32% open · ⏱️ 12.01.2021):

     git clone https://github.com/WojciechMula/pyahocorasick
    
  • PyPi (📥 89K / month · 📦 64 · ⏱️ 14.01.2019):

     pip install pyahocorasick
    
  • Conda (📥 110K · ⏱️ 13.10.2020):

     conda install -c conda-forge pyahocorasick
    
Ciphey (🥉23 · ⭐ 6.1K) - Automatically decrypt encryptions without knowing the key or cipher,.. MIT
  • GitHub (👨‍💻 38 · 🔀 340 · 📋 220 - 21% open · ⏱️ 02.01.2021):

     git clone https://github.com/Ciphey/Ciphey
    
  • PyPi (📥 4.1K / month · ⏱️ 02.12.2020):

     pip install ciphey
    
  • Docker Hub (📥 7.3K · ⭐ 1 · ⏱️ 17.12.2020):

     docker pull remnux/ciphey
    
flashtext (🥉23 · ⭐ 4.6K · 💤) - Extract Keywords from sentence or Replace keywords in sentences. MIT
  • GitHub (👨‍💻 8 · 🔀 520 · 📦 340 · 📋 97 - 49% open · ⏱️ 03.05.2020):

     git clone https://github.com/vi3k6i5/flashtext
    
  • PyPi (📥 220K / month · 📦 32 · ⏱️ 16.02.2018):

     pip install flashtext
    
neuralcoref (🥉23 · ⭐ 2.2K) - Fast Coreference Resolution in spaCy with Neural Networks. MIT
  • GitHub (👨‍💻 20 · 🔀 380 · 📥 160 · 📦 280 · 📋 260 - 16% open · ⏱️ 29.12.2020):

     git clone https://github.com/huggingface/neuralcoref
    
  • PyPi (📥 2.8K / month · 📦 18 · ⏱️ 08.04.2019):

     pip install neuralcoref
    
  • Conda (📥 5.7K · ⏱️ 21.02.2020):

     conda install -c conda-forge neuralcoref
    
sense2vec (🥉23 · ⭐ 1.1K · 💤) - Contextually-keyed word vectors. MIT
  • GitHub (👨‍💻 14 · 🔀 200 · 📥 12K · 📦 48 · 📋 93 - 16% open · ⏱️ 29.05.2020):

     git clone https://github.com/explosion/sense2vec
    
  • PyPi (📥 2.4K / month · 📦 6 · ⏱️ 22.11.2019):

     pip install sense2vec
    
  • Conda (📥 14K · ⏱️ 16.03.2020):

     conda install -c conda-forge sense2vec
    
spacy-transformers (🥉23 · ⭐ 870) - Use pretrained transformers like BERT, XLNet and GPT-2.. MIT spacy
  • GitHub (👨‍💻 14 · 🔀 99 · 📦 72 · 📋 180 - 26% open · ⏱️ 14.01.2021):

     git clone https://github.com/explosion/spacy-transformers
    
  • PyPi (📥 8.8K / month · 📦 3 · ⏱️ 14.10.2020):

     pip install spacy-transformers
    
SciSpacy (🥉23 · ⭐ 780) - A full spaCy pipeline and models for scientific/biomedical documents. Apache-2
  • GitHub (👨‍💻 15 · 🔀 120 · 📦 200 · 📋 150 - 11% open · ⏱️ 12.01.2021):

     git clone https://github.com/allenai/scispacy
    
  • PyPi (📥 7.2K / month · 📦 12 · ⏱️ 16.10.2020):

     pip install scispacy
    
pySBD (🥉23 · ⭐ 260) - pySBD (Python Sentence Boundary Disambiguation) is a rule-based sentence.. MIT
  • GitHub (👨‍💻 6 · 🔀 25 · 📦 97 · 📋 47 - 8% open · ⏱️ 01.12.2020):

     git clone https://github.com/nipunsadvilkar/pySBD
    
  • PyPi (📥 9K / month · 📦 2 · ⏱️ 08.10.2020):

     pip install pysbd
    
Snips NLU (🥉22 · ⭐ 3.4K · 💤) - Snips Python library to extract meaning from text. Apache-2
  • GitHub (👨‍💻 21 · 🔀 460 · 📋 250 - 20% open · ⏱️ 08.02.2020):

     git clone https://github.com/snipsco/snips-nlu
    
  • PyPi (📥 990 / month · 📦 8 · ⏱️ 15.01.2020):

     pip install snips-nlu
    
pytorch-nlp (🥉22 · ⭐ 1.8K) - Basic Utilities for PyTorch Natural Language Processing (NLP). BSD-3
  • GitHub (👨‍💻 17 · 🔀 220 · 📦 170 · 📋 63 - 22% open · ⏱️ 11.10.2020):

     git clone https://github.com/PetrochukM/PyTorch-NLP
    
  • PyPi (📥 2.7K / month · 📦 18 · ⏱️ 04.11.2019):

     pip install pytorch-nlp
    
scattertext (🥉22 · ⭐ 1.5K) - Beautiful visualizations of how language differs among document.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 200 · 📦 140 · 📋 70 - 24% open · ⏱️ 18.12.2020):

     git clone https://github.com/JasonKessler/scattertext
    
  • PyPi (📥 2.3K / month · 📦 8 · ⏱️ 14.12.2020):

     pip install scattertext
    
  • Conda (📥 43K · ⏱️ 18.12.2020):

     conda install -c conda-forge scattertext
    
fast-bert (🥉22 · ⭐ 1.5K) - Super easy library for BERT based NLP models. Apache-2
  • GitHub (👨‍💻 34 · 🔀 300 · 📋 230 - 60% open · ⏱️ 17.12.2020):

     git clone https://github.com/kaushaltrivedi/fast-bert
    
  • PyPi (📥 1.8K / month · 📦 2 · ⏱️ 09.12.2020):

     pip install fast-bert
    
NLP Architect (🥉21 · ⭐ 2.6K) - A model library for exploring state-of-the-art deep learning.. Apache-2
  • GitHub (👨‍💻 36 · 🔀 390 · 📦 7 · 📋 120 - 13% open · ⏱️ 14.01.2021):

     git clone https://github.com/IntelLabs/nlp-architect
    
  • PyPi (📥 180 / month · ⏱️ 12.04.2020):

     pip install nlp-architect
    
Texar (🥉21 · ⭐ 2.1K) - Toolkit for Machine Learning, Natural Language Processing, and.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 330 · 📦 12 · 📋 160 - 20% open · ⏱️ 29.07.2020):

     git clone https://github.com/asyml/texar
    
  • PyPi (📥 81 / month · 📦 2 · ⏱️ 19.11.2019):

     pip install texar
    
DeepMatcher (🥉20 · ⭐ 3.4K · 💤) - Python package for performing Entity and Text Matching using.. BSD-3
  • GitHub (👨‍💻 7 · 🔀 1.3K · 📦 9 · 📋 62 - 75% open · ⏱️ 20.04.2020):

     git clone https://github.com/anhaidgroup/deepmatcher
    
  • PyPi (📥 250 / month · ⏱️ 30.12.2019):

     pip install deepmatcher
    
gpt-2-simple (🥉20 · ⭐ 2.4K · 💤) - Python package to easily retrain OpenAI's GPT-2 text-.. MIT
  • GitHub (👨‍💻 16 · 🔀 490 · 📥 190 · 📋 220 - 58% open · ⏱️ 24.04.2020):

     git clone https://github.com/minimaxir/gpt-2-simple
    
  • PyPi (📥 5.3K / month · 📦 3 · ⏱️ 28.12.2019):

     pip install gpt-2-simple
    
NeMo (🥉20 · ⭐ 2.3K) - NeMo: a toolkit for conversational AI. Apache-2
  • GitHub (👨‍💻 68 · 🔀 360 · 📥 400 · 📋 410 - 39% open · ⏱️ 15.01.2021):

     git clone https://github.com/NVIDIA/NeMo
    
  • PyPi (📥 1.7K / month · 📦 4 · ⏱️ 11.12.2020):

     pip install nemo-toolkit
    
Texthero (🥉20 · ⭐ 2.1K) - Text preprocessing, representation and visualization from zero to hero. MIT
  • GitHub (👨‍💻 18 · 🔀 180 · 📥 61 · 📋 120 - 60% open · ⏱️ 03.11.2020):

     git clone https://github.com/jbesomi/texthero
    
  • PyPi (📥 2.2K / month · ⏱️ 06.07.2020):

     pip install texthero
    
DELTA (🥉20 · ⭐ 1.4K) - DELTA is a deep learning based natural language and speech.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 270 · 📋 75 - 12% open · ⏱️ 17.12.2020):

     git clone https://github.com/Delta-ML/delta
    
  • PyPi (📥 8 / month · ⏱️ 27.03.2020):

     pip install delta-nlp
    
  • Docker Hub (📥 12K · ⏱️ 16.01.2021):

     docker pull zh794390558/delta
    
FARM (🥉20 · ⭐ 1.1K) - Fast & easy transfer learning for NLP. Harvesting language models.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 150 · 📋 320 - 10% open · ⏱️ 12.01.2021):

     git clone https://github.com/deepset-ai/FARM
    
  • PyPi (📥 3.6K / month · 📦 2 · ⏱️ 12.01.2021):

     pip install farm
    
Sockeye (🥉20 · ⭐ 980) - Sequence-to-sequence framework with a focus on Neural Machine.. Apache-2
  • GitHub (👨‍💻 52 · 🔀 290 · 📋 240 - 8% open · ⏱️ 08.01.2021):

     git clone https://github.com/awslabs/sockeye
    
  • PyPi (📥 560 / month · 📦 2 · ⏱️ 08.01.2021):

     pip install sockeye
    
finetune (🥉20 · ⭐ 630) - Scikit-learn style model finetuning for NLP. MPL-2.0
  • GitHub (👨‍💻 18 · 🔀 67 · 📦 9 · 📋 130 - 13% open · ⏱️ 06.01.2021):

     git clone https://github.com/IndicoDataSolutions/finetune
    
  • PyPi (📥 150 / month · 📦 2 · ⏱️ 10.01.2020):

     pip install finetune
    
Kashgari (🥉19 · ⭐ 2K) - Kashgari is a production-level NLP Transfer learning framework.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 380 · 📦 19 · 📋 330 - 9% open · ⏱️ 29.10.2020):

     git clone https://github.com/BrikerMan/Kashgari
    
  • PyPi (📥 120 / month · 📦 1 · ⏱️ 26.06.2019):

     pip install kashgari-tf
    
YouTokenToMe (🥉19 · ⭐ 710 · 💤) - Unsupervised text tokenizer focused on computational efficiency. MIT
  • GitHub (👨‍💻 6 · 🔀 42 · 📦 94 · 📋 41 - 51% open · ⏱️ 13.02.2020):

     git clone https://github.com/vkcom/youtokentome
    
  • PyPi (📥 5.4K / month · 📦 6 · ⏱️ 12.02.2020):

     pip install youtokentome
    
textpipe (🥉19 · ⭐ 260) - Textpipe: clean and extract metadata from text. MIT
  • GitHub (👨‍💻 27 · 🔀 17 · 📦 7 · 📋 44 - 45% open · ⏱️ 12.10.2020):

     git clone https://github.com/textpipe/textpipe
    
  • PyPi (📥 500 / month · 📦 1 · ⏱️ 12.10.2020):

     pip install textpipe
    
skift (🥉18 · ⭐ 210) - scikit-learn wrappers for Python fastText. MIT
  • GitHub (👨‍💻 6 · 🔀 22 · 📦 8 · 📋 10 - 20% open · ⏱️ 07.09.2020):

     git clone https://github.com/shaypal5/skift
    
  • PyPi (📥 1.2K / month · ⏱️ 27.08.2020):

     pip install skift
    
Camphr (🥉17 · ⭐ 320) - spaCy plugin for Transformers , Udify, ELmo, etc. Apache-2 spacy
  • GitHub (👨‍💻 7 · 🔀 15 · 📋 27 - 48% open · ⏱️ 15.01.2021):

     git clone https://github.com/PKSHATechnology-Research/camphr
    
  • PyPi (📥 370 / month · ⏱️ 25.12.2020):

     pip install camphr
    
VizSeq (🥉16 · ⭐ 300) - An Analysis Toolkit for Natural Language Generation (Translation,.. MIT
  • GitHub (👨‍💻 3 · 🔀 39 · 📦 1 · 📋 15 - 46% open · ⏱️ 13.12.2020):

     git clone https://github.com/facebookresearch/vizseq
    
  • PyPi (📥 44 / month · ⏱️ 07.08.2020):

     pip install vizseq
    
Translate (🥉15 · ⭐ 680) - Translate - a PyTorch Language Library. BSD-3
  • GitHub (👨‍💻 84 · 🔀 160 · 📋 93 - 70% open · ⏱️ 29.12.2020):

     git clone https://github.com/pytorch/translate
    
  • PyPi (📥 3 / month · ⏱️ 01.05.2018):

     pip install pytorch-translate
    
Headliner (🥉15 · ⭐ 220 · 💤) - Easy training and deployment of seq2seq models. MIT
  • GitHub (👨‍💻 2 · 🔀 37 · 📦 3 · 📋 14 - 7% open · ⏱️ 14.02.2020):

     git clone https://github.com/as-ideas/headliner
    
  • PyPi (📥 110 / month · 📦 1 · ⏱️ 24.01.2020):

     pip install headliner
    
NeuralQA (🥉15 · ⭐ 180) - NeuralQA: A Usable Library for Question Answering on Large Datasets with.. MIT
  • GitHub (👨‍💻 3 · 🔀 23 · 📦 2 · 📋 26 - 69% open · ⏱️ 16.12.2020):

     git clone https://github.com/victordibia/neuralqa
    
  • PyPi (📥 34 / month · ⏱️ 29.07.2020):

     pip install neuralqa
    
OpenNRE (🥉14 · ⭐ 2.9K) - An Open-Source Package for Neural Relation Extraction (NRE). MIT
  • GitHub (👨‍💻 9 · 🔀 830 · 📋 300 - 5% open · ⏱️ 24.11.2020):

     git clone https://github.com/thunlp/OpenNRE
    
TransferNLP (🥉14 · ⭐ 280 · 💤) - NLP library designed for reproducible experimentation.. MIT
  • GitHub (👨‍💻 7 · 🔀 18 · 📋 23 - 17% open · ⏱️ 28.05.2020):

     git clone https://github.com/feedly/transfer-nlp
    
  • PyPi (📥 39 / month · ⏱️ 28.05.2020):

     pip install transfer-nlp
    
textvec (🥉14 · ⭐ 150) - Text vectorization tool to outperform TFIDF for classification tasks. MIT
  • GitHub (👨‍💻 5 · 🔀 21 · 📦 2 · 📋 8 - 25% open · ⏱️ 03.12.2020):

     git clone https://github.com/textvec/textvec
    
  • PyPi (📥 84 / month · ⏱️ 03.12.2020):

     pip install textvec
    
Show 10 hidden projects...
  • gensim (🥇35 · ⭐ 12K) - Topic Modelling for Humans. ❗️LGPL-2.1
  • fuzzywuzzy (🥈29 · ⭐ 7.8K · 💤) - Fuzzy String Matching in Python. ❗️GPL-2.0
  • langid (🥈26 · ⭐ 1.7K · 💀) - Stand-alone language identification system. BSD-3
  • polyglot (🥉24 · ⭐ 1.7K) - Multilingual text (NLP) processing toolkit. ❗️GPL-3.0
  • anaGo (🥉22 · ⭐ 1.4K · 💀) - Bidirectional LSTM-CRF and ELMo for Named-Entity Recognition,.. MIT
  • MatchZoo (🥉21 · ⭐ 3.3K · 💀) - Facilitating the design, comparison and sharing of deep.. Apache-2
  • stop-words (🥉20 · ⭐ 120 · 💀) - Get list of common stop words in various languages in Python. BSD-3
  • pyfasttext (🥉19 · ⭐ 230 · 💀) - Yet another Python binding for fastText. ❗️GPL-3.0
  • NeuroNER (🥉18 · ⭐ 1.5K · 💀) - Named-entity recognition using neural networks. Easy-to-use and.. MIT
  • ONNX-T5 (🥉11 · ⭐ 130 · 🐣) - Summarization, translation, sentiment-analysis, text-generation.. Apache-2

Image Data

Back to top

Libraries for image & video processing, manipulation, and augmentation as well as libraries for computer vision tasks such as facial recognition, object detection, and classification.

Pillow (🥇39 · ⭐ 8.1K · 📈) - The friendly PIL fork (Python Imaging Library). ❗️PIL
  • GitHub (👨‍💻 340 · 🔀 1.6K · 📦 380K · 📋 2.1K - 11% open · ⏱️ 16.01.2021):

     git clone https://github.com/python-pillow/Pillow
    
  • PyPi (📥 13M / month · 📦 110K · ⏱️ 02.01.2021):

     pip install Pillow
    
  • Conda (📥 6.7M · ⏱️ 11.01.2021):

     conda install -c conda-forge pillow
    
torchvision (🥇36 · ⭐ 8.2K) - Datasets, Transforms and Models specific to Computer Vision. BSD-3
  • GitHub (👨‍💻 350 · 🔀 4.2K · 📦 39K · 📋 1.5K - 29% open · ⏱️ 15.01.2021):

     git clone https://github.com/pytorch/vision
    
  • PyPi (📥 700K / month · 📦 4.6K · ⏱️ 10.12.2020):

     pip install torchvision
    
  • Conda (📥 34K · ⏱️ 14.10.2018):

     conda install -c conda-forge torchvision
    
scikit-image (🥇36 · ⭐ 4.1K) - Image processing in Python. BSD-2
  • GitHub (👨‍💻 480 · 🔀 1.7K · 📦 59K · 📋 2.1K - 30% open · ⏱️ 17.01.2021):

     git clone https://github.com/scikit-image/scikit-image
    
  • PyPi (📥 1.7M / month · 📦 15K · ⏱️ 23.12.2020):

     pip install scikit-image
    
  • Conda (📥 2.1M · ⏱️ 23.12.2020):

     conda install -c conda-forge scikit-image
    
imgaug (🥇31 · ⭐ 11K · 💤) - Image augmentation for machine learning experiments. MIT
  • GitHub (👨‍💻 36 · 🔀 2K · 📦 5K · 📋 420 - 51% open · ⏱️ 01.06.2020):

     git clone https://github.com/aleju/imgaug
    
  • PyPi (📥 120K / month · 📦 280 · ⏱️ 05.02.2020):

     pip install imgaug
    
  • Conda (📥 31K · ⏱️ 14.02.2020):

     conda install -c conda-forge imgaug
    
imageio (🥇31 · ⭐ 820) - Python library for reading and writing image data. BSD-2
  • GitHub (👨‍💻 71 · 🔀 160 · 📦 35K · 📋 330 - 18% open · ⏱️ 08.01.2021):

     git clone https://github.com/imageio/imageio
    
  • PyPi (📥 1.8M / month · 📦 3.8K · ⏱️ 06.07.2020):

     pip install imageio
    
  • Conda (📥 1.6M · ⏱️ 06.07.2020):

     conda install -c conda-forge imageio
    
opencv-python (🥈30 · ⭐ 1.7K) - Automated CI toolchain to produce precompiled opencv-python,.. MIT
  • GitHub (👨‍💻 27 · 🔀 310 · 📋 390 - 4% open · ⏱️ 28.12.2020):

     git clone https://github.com/skvark/opencv-python
    
  • PyPi (📥 2.1M / month · 📦 13K · ⏱️ 02.01.2021):

     pip install opencv-python
    
Wand (🥈30 · ⭐ 1K) - The ctypes-based simple ImageMagick binding for Python. MIT
  • GitHub (👨‍💻 94 · 🔀 180 · 📥 5.2K · 📦 3.1K · 📋 330 - 3% open · ⏱️ 08.12.2020):

     git clone https://github.com/emcconville/wand
    
  • PyPi (📥 110K / month · 📦 1.2K · ⏱️ 29.11.2020):

     pip install wand
    
Face Recognition (🥈29 · ⭐ 38K) - The world's simplest facial recognition api for Python.. MIT
  • GitHub (👨‍💻 45 · 🔀 11K · 📥 440 · 📋 1.1K - 51% open · ⏱️ 26.09.2020):

     git clone https://github.com/ageitgey/face_recognition
    
  • PyPi (📥 28K / month · 📦 410 · ⏱️ 21.08.2018):

     pip install face_recognition
    
MoviePy (🥈29 · ⭐ 7.2K) - Video editing with Python. MIT
  • GitHub (👨‍💻 140 · 🔀 1K · 📦 5.7K · 📋 1K - 33% open · ⏱️ 17.01.2021):

     git clone https://github.com/Zulko/moviepy
    
  • PyPi (📥 110K / month · 📦 1.1K · ⏱️ 05.10.2020):

     pip install moviepy
    
  • Conda (📥 72K · ⏱️ 23.02.2020):

     conda install -c conda-forge moviepy
    
Albumentations (🥈28 · ⭐ 7.1K) - Fast image augmentation library and easy to use wrapper.. MIT
  • GitHub (👨‍💻 72 · 🔀 900 · 📦 2.5K · 📋 400 - 42% open · ⏱️ 15.01.2021):

     git clone https://github.com/albumentations-team/albumentations
    
  • PyPi (📥 58K / month · 📦 130 · ⏱️ 29.11.2020):

     pip install albumentations
    
  • Conda (📥 14K · ⏱️ 29.11.2020):

     conda install -c conda-forge albumentations
    
GluonCV (🥈28 · ⭐ 4.5K) - Gluon CV Toolkit. Apache-2
  • GitHub (👨‍💻 110 · 🔀 1K · 📦 400 · 📋 740 - 49% open · ⏱️ 15.01.2021):

     git clone https://github.com/dmlc/gluon-cv
    
  • PyPi (📥 66K / month · 📦 30 · ⏱️ 15.01.2021):

     pip install gluoncv
    
Kornia (🥈28 · ⭐ 3.5K) - Open Source Differentiable Computer Vision Library for PyTorch. Apache-2
  • GitHub (👨‍💻 82 · 🔀 340 · 📦 230 · 📋 340 - 29% open · ⏱️ 17.01.2021):

     git clone https://github.com/kornia/kornia
    
  • PyPi (📥 14K / month · ⏱️ 20.10.2020):

     pip install kornia
    
imutils (🥈28 · ⭐ 3.5K · 📈) - A series of convenience functions to make basic image processing.. MIT
  • GitHub (👨‍💻 20 · 🔀 870 · 📦 12K · 📋 200 - 63% open · ⏱️ 15.01.2021):

     git clone https://github.com/jrosebr1/imutils
    
  • PyPi (📥 150K / month · 📦 1.2K · ⏱️ 15.01.2021):

     pip install imutils
    
  • Conda (📥 34K · ⏱️ 15.01.2021):

     conda install -c conda-forge imutils
    
ImageHash (🥈28 · ⭐ 1.8K) - A Python Perceptual Image Hashing Module. BSD-2
  • GitHub (👨‍💻 17 · 🔀 250 · 📦 1.9K · 📋 87 - 19% open · ⏱️ 03.01.2021):

     git clone https://github.com/JohannesBuchner/imagehash
    
  • PyPi (📥 330K / month · 📦 530 · ⏱️ 19.11.2020):

     pip install ImageHash
    
  • Conda (📥 97K · ⏱️ 19.11.2020):

     conda install -c conda-forge imagehash
    
PyTorch Image Models (🥈27 · ⭐ 6.6K) - PyTorch image models, scripts, pretrained weights --.. Apache-2
  • GitHub (👨‍💻 26 · 🔀 920 · 📥 180K · 📦 250 · 📋 230 - 9% open · ⏱️ 06.01.2021):

     git clone https://github.com/rwightman/pytorch-image-models
    
imageai (🥈27 · ⭐ 5.8K) - A python library built to empower developers to build applications and.. MIT
  • GitHub (👨‍💻 13 · 🔀 1.6K · 📥 520K · 📦 650 · 📋 590 - 31% open · ⏱️ 05.01.2021):

     git clone https://github.com/OlafenwaMoses/ImageAI
    
  • PyPi (📥 5.3K / month · 📦 12 · ⏱️ 05.01.2021):

     pip install imageai
    
detectron2 (🥈26 · ⭐ 15K) - Detectron2 is FAIR's next-generation platform for object.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 3.7K · 📦 150 · 📋 2.2K - 4% open · ⏱️ 16.01.2021):

     git clone https://github.com/facebookresearch/detectron2
    
  • Conda (📥 12K · ⏱️ 05.08.2020):

     conda install -c conda-forge detectron2
    
InsightFace (🥈26 · ⭐ 8.4K) - Face Analysis Project on MXNet. MIT
  • GitHub (👨‍💻 14 · 🔀 2.9K · 📦 50 · 📋 1.3K - 52% open · ⏱️ 13.01.2021):

     git clone https://github.com/deepinsight/insightface
    
  • PyPi (📥 4.4K / month · 📦 1 · ⏱️ 07.09.2019):

     pip install insightface
    
MMDetection (🥈25 · ⭐ 13K) - OpenMMLab Detection Toolbox and Benchmark. Apache-2
  • GitHub (👨‍💻 180 · 🔀 4.5K · 📦 18 · 📋 3.3K - 10% open · ⏱️ 14.01.2021):

     git clone https://github.com/open-mmlab/mmdetection
    
Augmentor (🥈25 · ⭐ 4.3K · 💤) - Image augmentation library in Python for machine learning. MIT
  • GitHub (👨‍💻 22 · 🔀 780 · 📦 270 · 📋 180 - 63% open · ⏱️ 09.03.2020):

     git clone https://github.com/mdbloice/Augmentor
    
  • PyPi (📥 2.7K / month · 📦 42 · ⏱️ 17.01.2020):

     pip install Augmentor
    
PyTorch3D (🥈25 · ⭐ 4.2K) - PyTorch3D is FAIR's library of reusable components for deep.. MIT
  • GitHub (👨‍💻 50 · 🔀 460 · 📦 44 · 📋 460 - 12% open · ⏱️ 11.01.2021):

     git clone https://github.com/facebookresearch/pytorch3d
    
  • PyPi (📥 4.2K / month · ⏱️ 12.11.2020):

     pip install pytorch3d
    
  • Conda (📥 5.7K · ⏱️ 12.11.2020):

     conda install -c pytorch3d pytorch3d
    
facenet-pytorch (🥈25 · ⭐ 1.8K) - Pretrained Pytorch face detection (MTCNN) and recognition.. MIT
  • GitHub (👨‍💻 11 · 🔀 370 · 📥 53K · 📦 250 · 📋 110 - 23% open · ⏱️ 19.11.2020):

     git clone https://github.com/timesler/facenet-pytorch
    
  • PyPi (📥 3.9K / month · 📦 3 · ⏱️ 19.11.2020):

     pip install facenet-pytorch
    
chainercv (🥈25 · ⭐ 1.4K · 💤) - ChainerCV: a Library for Deep Learning in Computer Vision. MIT
  • GitHub (👨‍💻 39 · 🔀 320 · 📦 220 · 📋 220 - 24% open · ⏱️ 07.01.2020):

     git clone https://github.com/chainer/chainercv
    
  • PyPi (📥 3.3K / month · 📦 24 · ⏱️ 31.05.2019):

     pip install chainercv
    
mtcnn (🥈25 · ⭐ 1.4K) - MTCNN face detection implementation for TensorFlow, as a PIP package. MIT
  • GitHub (👨‍💻 14 · 🔀 370 · 📦 820 · 📋 89 - 59% open · ⏱️ 13.01.2021):

     git clone https://github.com/ipazc/mtcnn
    
  • PyPi (📥 12K / month · 📦 42 · ⏱️ 14.11.2019):

     pip install mtcnn
    
mahotas (🥈25 · ⭐ 660) - Computer Vision in Python. MIT
  • GitHub (👨‍💻 30 · 🔀 120 · 📦 550 · 📋 69 - 18% open · ⏱️ 13.01.2021):

     git clone https://github.com/luispedro/mahotas
    
  • PyPi (📥 8.3K / month · 📦 190 · ⏱️ 16.08.2020):

     pip install mahotas
    
  • Conda (📥 250K · ⏱️ 01.11.2020):

     conda install -c conda-forge mahotas
    
Face Alignment (🥉23 · ⭐ 4.6K) - 2D and 3D Face alignment library build using pytorch. BSD-3
  • GitHub (👨‍💻 21 · 🔀 1K · 📋 220 - 13% open · ⏱️ 30.12.2020):

     git clone https://github.com/1adrianb/face-alignment
    
  • PyPi (📥 4.7K / month · 📦 6 · ⏱️ 22.12.2020):

     pip install face-alignment
    
segmentation_models (🥉23 · ⭐ 2.9K · 💤) - Segmentation models with pretrained backbones. Keras.. MIT
  • GitHub (👨‍💻 14 · 🔀 660 · 📋 380 - 37% open · ⏱️ 17.04.2020):

     git clone https://github.com/qubvel/segmentation_models
    
  • PyPi (📥 13K / month · 📦 12 · ⏱️ 10.01.2020):

     pip install segmentation_models
    
vidgear (🥉23 · ⭐ 1.6K) - High-performance cross-platform Video Processing Python framework.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 110 · 📥 300 · 📦 77 · 📋 130 - 3% open · ⏱️ 01.01.2021):

     git clone https://github.com/abhiTronix/vidgear
    
  • PyPi (📥 1.4K / month · 📦 2 · ⏱️ 01.01.2021):

     pip install vidgear
    
Image Deduplicator (🥉22 · ⭐ 3.3K) - Finding duplicate images made easy!. Apache-2
  • GitHub (👨‍💻 9 · 🔀 280 · 📦 8 · 📋 69 - 24% open · ⏱️ 23.11.2020):

     git clone https://github.com/idealo/imagededup
    
  • PyPi (📥 1.5K / month · 📦 1 · ⏱️ 22.11.2020):

     pip install imagededup
    
CellProfiler (🥉22 · ⭐ 530) - An open-source application for biological image analysis. BSD-3
  • GitHub (👨‍💻 120 · 🔀 270 · 📥 520 · 📦 3 · 📋 2.9K - 5% open · ⏱️ 11.01.2021):

     git clone https://github.com/CellProfiler/CellProfiler
    
  • PyPi (📥 110 / month · ⏱️ 04.09.2017):

     pip install cellprofiler
    
pyvips (🥉22 · ⭐ 280) - python binding for libvips using cffi. MIT
  • GitHub (👨‍💻 10 · 🔀 25 · 📦 130 · 📋 200 - 33% open · ⏱️ 27.12.2020):

     git clone https://github.com/libvips/pyvips
    
  • PyPi (📥 3.1K / month · 📦 22 · ⏱️ 18.12.2020):

     pip install pyvips
    
  • Conda (📥 6K · ⏱️ 14.10.2020):

     conda install -c conda-forge pyvips
    
Image Super-Resolution (🥉21 · ⭐ 2.4K) - Super-scale your images and run experiments with.. Apache-2
  • GitHub (👨‍💻 9 · 🔀 460 · 📦 38 · 📋 150 - 33% open · ⏱️ 11.11.2020):

     git clone https://github.com/idealo/image-super-resolution
    
  • PyPi (📥 2.7K / month · 📦 4 · ⏱️ 08.01.2020):

     pip install ISR
    
  • Docker Hub (📥 120 · ⏱️ 01.04.2019):

     docker pull idealo/image-super-resolution-gpu
    
tensorflow-graphics (🥉21 · ⭐ 2.4K) - TensorFlow Graphics: Differentiable Graphics Layers.. Apache-2
  • GitHub (👨‍💻 30 · 🔀 270 · 📋 170 - 51% open · ⏱️ 30.10.2020):

     git clone https://github.com/tensorflow/graphics
    
  • PyPi (📥 910 / month · 📦 3 · ⏱️ 20.05.2020):

     pip install tensorflow-graphics
    
Luminoth (🥉21 · ⭐ 2.3K · 💤) - Deep Learning toolkit for Computer Vision. BSD-3
  • GitHub (👨‍💻 15 · 🔀 400 · 📥 11K · 📦 27 · 📋 180 - 28% open · ⏱️ 07.01.2020):

     git clone https://github.com/tryolabs/luminoth
    
  • PyPi (📥 500 / month · ⏱️ 09.11.2018):

     pip install luminoth
    
Caer (🥉21 · ⭐ 310 · 🐣) - A lightweight Computer Vision library. Scale your models, not boilerplate. MIT
  • GitHub (👨‍💻 2 · 🔀 39 · 📥 14 · 📋 9 - 33% open · ⏱️ 15.01.2021):

     git clone https://github.com/jasmcaus/caer
    
  • PyPi (📥 680K / month · ⏱️ 15.01.2021):

     pip install caer
    
MMF (🥉20 · ⭐ 4K) - A modular framework for vision & language multimodal research from.. BSD-3
  • GitHub (👨‍💻 50 · 🔀 590 · 📦 6 · 📋 410 - 21% open · ⏱️ 12.01.2021):

     git clone https://github.com/facebookresearch/mmf
    
  • PyPi (📥 140 / month · ⏱️ 11.06.2020):

     pip install mmf
    
image-match (🥉20 · ⭐ 2.5K) - Quickly search over billions of images. Apache-2
  • GitHub (👨‍💻 17 · 🔀 350 · 📋 94 - 50% open · ⏱️ 04.01.2021):

     git clone https://github.com/EdjoLabs/image-match
    
  • PyPi (📥 290 / month · 📦 3 · ⏱️ 13.02.2017):

     pip install image_match
    
Classy Vision (🥉20 · ⭐ 1K) - An end-to-end PyTorch framework for image and video.. MIT
  • GitHub (👨‍💻 53 · 🔀 190 · 📋 89 - 51% open · ⏱️ 06.01.2021):

     git clone https://github.com/facebookresearch/ClassyVision
    
  • PyPi (📥 140 / month · ⏱️ 20.11.2020):

     pip install classy_vision
    
  • Conda (📥 5.5K · ⏱️ 11.12.2020):

     conda install -c conda-forge classy_vision
    
nude.py (🥉20 · ⭐ 790) - Nudity detection with Python. MIT
  • GitHub (👨‍💻 12 · 🔀 130 · 📦 54 · 📋 8 - 75% open · ⏱️ 23.11.2020):

     git clone https://github.com/hhatto/nude.py
    
  • PyPi (📥 580 / month · 📦 9 · ⏱️ 23.11.2020):

     pip install nudepy
    
vit-pytorch (🥉19 · ⭐ 2.2K · 🐣) - Implementation of Vision Transformer, a simple way to.. MIT
  • GitHub (👨‍💻 4 · 🔀 190 · 📦 4 · 📋 59 - 37% open · ⏱️ 12.01.2021):

     git clone https://github.com/lucidrains/vit-pytorch
    
  • PyPi (📥 1.2K / month · ⏱️ 11.01.2021):

     pip install vit-pytorch
    
Torch Points 3D (🥉19 · ⭐ 1K) - Pytorch framework for doing deep learning on point clouds. BSD-3
  • GitHub (👨‍💻 19 · 🔀 150 · 📦 2 · 📋 190 - 21% open · ⏱️ 15.01.2021):

     git clone https://github.com/nicolas-chaulet/torch-points3d
    
  • PyPi (📥 270 / month · ⏱️ 18.12.2020):

     pip install torch-points3d
    
Norfair (🥉18 · ⭐ 770 · 🐣) - Lightweight Python library for adding real-time 2D object tracking.. BSD-3
  • GitHub (👨‍💻 7 · 🔀 43 · 📋 17 - 29% open · ⏱️ 22.12.2020):

     git clone https://github.com/tryolabs/norfair
    
  • PyPi (📥 240 / month · ⏱️ 07.12.2020):

     pip install norfair
    
PaddleDetection (🥉17 · ⭐ 2.2K) - Object detection and instance segmentation toolkit.. Apache-2
  • GitHub (👨‍💻 43 · 🔀 610 · 📋 1.1K - 25% open · ⏱️ 17.01.2021):

     git clone https://github.com/PaddlePaddle/PaddleDetection
    
lightly (🥉17 · ⭐ 390 · 🐣) - A python library for self-supervised learning. MIT
  • GitHub (👨‍💻 4 · 🔀 21 · 📋 38 - 52% open · ⏱️ 13.01.2021):

     git clone https://github.com/lightly-ai/lightly
    
  • PyPi (📥 180 / month · ⏱️ 11.01.2021):

     pip install lightly
    
DE⫶TR (🥉14 · ⭐ 5.9K) - End-to-End Object Detection with Transformers. Apache-2
  • GitHub (👨‍💻 19 · 🔀 850 · 📋 260 - 21% open · ⏱️ 15.11.2020):

     git clone https://github.com/facebookresearch/detr
    
PySlowFast (🥉14 · ⭐ 3.3K) - PySlowFast: video understanding codebase from FAIR for.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 620 · 📦 2 · 📋 340 - 46% open · ⏱️ 07.01.2021):

     git clone https://github.com/facebookresearch/SlowFast
    
pycls (🥉13 · ⭐ 1.4K) - Codebase for Image Classification Research, written in PyTorch. MIT
  • GitHub (👨‍💻 9 · 🔀 150 · 📦 2 · 📋 54 - 18% open · ⏱️ 14.01.2021):

     git clone https://github.com/facebookresearch/pycls
    
Show 2 hidden projects...
  • glfw (🥈29 · ⭐ 7.1K) - A multi-platform library for OpenGL, OpenGL ES, Vulkan, window and input. ❗️Zlib
  • Pillow-SIMD (🥉23 · ⭐ 1.5K · 💤) - The friendly PIL fork. ❗️PIL

Graph Data

Back to top

Libraries for graph processing, clustering, embedding, and machine learning tasks.

networkx (🥇37 · ⭐ 8.5K) - Network Analysis in Python. BSD-3
  • GitHub (👨‍💻 490 · 🔀 2.2K · 📥 51 · 📦 65K · 📋 2.6K - 10% open · ⏱️ 16.01.2021):

     git clone https://github.com/networkx/networkx
    
  • PyPi (📥 5M / month · 📦 21K · ⏱️ 22.08.2020):

     pip install networkx
    
  • Conda (📥 2.9M · ⏱️ 23.08.2020):

     conda install -c conda-forge networkx
    
PyTorch Geometric (🥇28 · ⭐ 10K) - Geometric Deep Learning Extension Library for PyTorch. MIT
  • GitHub (👨‍💻 140 · 🔀 1.7K · 📋 1.7K - 37% open · ⏱️ 15.01.2021):

     git clone https://github.com/rusty1s/pytorch_geometric
    
  • PyPi (📥 12K / month · 📦 24 · ⏱️ 27.11.2020):

     pip install torch-geometric
    
dgl (🥇27 · ⭐ 6.5K) - Python package built to ease deep learning on graph, on top of existing.. Apache-2
  • GitHub (👨‍💻 100 · 🔀 1.3K · 📋 900 - 24% open · ⏱️ 17.01.2021):

     git clone https://github.com/dmlc/dgl
    
  • PyPi (📥 12K / month · 📦 30 · ⏱️ 17.01.2021):

     pip install dgl
    
StellarGraph (🥈25 · ⭐ 1.7K) - StellarGraph - Machine Learning on Graphs. Apache-2
  • GitHub (👨‍💻 28 · 🔀 250 · 📦 43 · 📋 910 - 23% open · ⏱️ 15.01.2021):

     git clone https://github.com/stellargraph/stellargraph
    
  • PyPi (📥 6.2K / month · 📦 2 · ⏱️ 25.06.2020):

     pip install stellargraph
    
Spektral (🥈23 · ⭐ 1.6K) - Graph Neural Networks with Keras and Tensorflow 2. MIT
  • GitHub (👨‍💻 16 · 🔀 190 · 📦 26 · 📋 110 - 19% open · ⏱️ 13.01.2021):

     git clone https://github.com/danielegrattarola/spektral
    
  • PyPi (📥 1.7K / month · ⏱️ 07.01.2021):

     pip install spektral
    
Node2Vec (🥈22 · ⭐ 620) - Implementation of the node2vec algorithm. MIT
  • GitHub (👨‍💻 7 · 🔀 160 · 📦 110 · ⏱️ 09.01.2021):

     git clone https://github.com/eliorc/node2vec
    
  • PyPi (📥 4.4K / month · 📦 10 · ⏱️ 09.01.2021):

     pip install node2vec
    
  • Conda (📥 15K · ⏱️ 25.04.2020):

     conda install -c conda-forge node2vec
    
ogb (🥈21 · ⭐ 710) - Benchmark datasets, data loaders, and evaluators for graph machine learning. MIT
  • GitHub (👨‍💻 12 · 🔀 97 · 📦 41 · ⏱️ 11.01.2021):

     git clone https://github.com/snap-stanford/ogb
    
  • PyPi (📥 1.7K / month · ⏱️ 29.12.2020):

     pip install ogb
    
torch-cluster (🥈21 · ⭐ 320) - PyTorch Extension Library of Optimized Graph Cluster.. MIT
  • GitHub (👨‍💻 17 · 🔀 67 · 📋 72 - 48% open · ⏱️ 03.01.2021):

     git clone https://github.com/rusty1s/pytorch_cluster
    
  • PyPi (📥 2.8K / month · 📦 32 · ⏱️ 31.10.2020):

     pip install torch-cluster
    
AmpliGraph (🥉20 · ⭐ 1.4K) - Python library for Representation Learning on Knowledge.. Apache-2
  • GitHub (👨‍💻 14 · 🔀 150 · 📦 9 · 📋 170 - 10% open · ⏱️ 25.08.2020):

     git clone https://github.com/Accenture/AmpliGraph
    
  • PyPi (📥 510 / month · ⏱️ 25.08.2020):

     pip install ampligraph
    
graph-nets (🥉19 · ⭐ 4.7K) - Build Graph Nets in Tensorflow. Apache-2
  • GitHub (👨‍💻 10 · 🔀 720 · 📋 110 - 8% open · ⏱️ 04.12.2020):

     git clone https://github.com/deepmind/graph_nets
    
  • PyPi (📥 720 / month · 📦 7 · ⏱️ 29.01.2020):

     pip install graph-nets
    
PyTorch-BigGraph (🥉19 · ⭐ 2.6K) - Generate embeddings from large-scale graph-structured.. BSD-3
  • GitHub (👨‍💻 19 · 🔀 360 · 📥 100 · 📋 130 - 23% open · ⏱️ 18.12.2020):

     git clone https://github.com/facebookresearch/PyTorch-BigGraph
    
  • PyPi (📥 230 / month · 📦 4 · ⏱️ 01.05.2019):

     pip install torchbiggraph
    
Paddle Graph Learning (🥉19 · ⭐ 870) - Paddle Graph Learning (PGL) is an efficient and.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 120 · 📦 9 · 📋 42 - 33% open · ⏱️ 14.12.2020):

     git clone https://github.com/PaddlePaddle/PGL
    
  • PyPi (📥 530 / month · 📦 2 · ⏱️ 20.11.2020):

     pip install pgl
    
PyKEEN (🥉19 · ⭐ 280) - A Python library for learning and evaluating knowledge graph embeddings. MIT
  • GitHub (👨‍💻 17 · 🔀 43 · 📥 73 · 📋 96 - 37% open · ⏱️ 15.01.2021):

     git clone https://github.com/pykeen/pykeen
    
  • PyPi (📥 230 / month · 📦 1 · ⏱️ 25.08.2020):

     pip install pykeen
    
DeepGraph (🥉18 · ⭐ 230) - Analyze Data with Pandas-based Networks. Documentation:. BSD-3
  • GitHub (👨‍💻 2 · 🔀 33 · 📦 1 · 📋 12 - 58% open · ⏱️ 01.10.2020):

     git clone https://github.com/deepgraph/deepgraph
    
  • PyPi (📥 240 / month · ⏱️ 01.10.2020):

     pip install deepgraph
    
  • Conda (📥 76K · ⏱️ 15.01.2021):

     conda install -c conda-forge deepgraph
    
kglib (🥉17 · ⭐ 380) - Grakn Knowledge Graph Library (ML R&D). Apache-2
  • GitHub (👨‍💻 7 · 🔀 70 · 📥 210 · 📋 59 - 16% open · ⏱️ 15.01.2021):

     git clone https://github.com/graknlabs/kglib
    
  • PyPi (📥 55 / month · ⏱️ 19.08.2020):

     pip install grakn-kglib
    
pytorch_geometric_temporal (🥉16 · ⭐ 310) - A Temporal Extension Library for PyTorch Geometric. MIT
  • GitHub (👨‍💻 3 · 🔀 31 · 📋 16 - 6% open · ⏱️ 14.01.2021):

     git clone https://github.com/benedekrozemberczki/pytorch_geometric_temporal
    
  • PyPi (📥 89 / month · ⏱️ 14.01.2021):

     pip install torch-geometric-temporal
    
AutoGL (🥉15 · ⭐ 560 · 🐣) - An autoML framework & toolkit for machine learning on graphs. MIT
  • GitHub (👨‍💻 4 · 🔀 49 · 📋 5 - 60% open · ⏱️ 29.12.2020):

     git clone https://github.com/THUMNLab/AutoGL
    
  • PyPi (📥 170 / month · ⏱️ 23.12.2020):

     pip install auto-graph-learning
    
Euler (🥉14 · ⭐ 2.5K) - A distributed graph deep learning framework. Apache-2
  • GitHub (👨‍💻 5 · 🔀 500 · 📋 300 - 67% open · ⏱️ 29.07.2020):

     git clone https://github.com/alibaba/euler
    
  • PyPi (📥 7 / month · ⏱️ 10.04.2019):

     pip install euler-gl
    
GraphEmbedding (🥉14 · ⭐ 1.7K) - Implementation and experiments of graph embedding algorithms. MIT
  • GitHub (👨‍💻 6 · 🔀 540 · 📦 7 · 📋 40 - 67% open · ⏱️ 18.10.2020):

     git clone https://github.com/shenweichen/GraphEmbedding
    
OpenKE (🥉13 · ⭐ 2.3K · 💤) - An Open-Source Package for Knowledge Embedding (KE). MIT
  • GitHub (👨‍💻 10 · 🔀 720 · 📋 270 - 17% open · ⏱️ 08.04.2020):

     git clone https://github.com/thunlp/OpenKE
    
GraphVite (🥉13 · ⭐ 830) - GraphVite: A General and High-performance Graph Embedding System. Apache-2
  • GitHub (👨‍💻 1 · 🔀 120 · 📋 77 - 31% open · ⏱️ 14.01.2021):

     git clone https://github.com/DeepGraphLearning/graphvite
    
  • Conda (📥 3.4K · ⏱️ 19.03.2020):

     conda install -c milagraph graphvite
    
Show 7 hidden projects...
  • igraph (🥇27 · ⭐ 760) - Python interface for igraph. ❗️GPL-2.0
  • pygal (🥈26 · ⭐ 2.3K) - PYthon svg GrAph plotting Library. ❗️LGPL-3.0
  • Karate Club (🥈21 · ⭐ 1.1K) - Karate Club: An API Oriented Open-source Python Framework for.. ❗️GPL-3.0
  • DeepWalk (🥉19 · ⭐ 2.2K · 💤) - DeepWalk - Deep Learning for Graphs. ❗️GPL-3.0
  • Sematch (🥉17 · ⭐ 340 · 💀) - semantic similarity framework for knowledge graph. Apache-2
  • GraphSAGE (🥉14 · ⭐ 2.1K · 💀) - Representation learning on large graphs using stochastic.. MIT
  • OpenNE (🥉14 · ⭐ 1.4K · 💀) - An Open-Source Package for Network Embedding (NE). MIT

Audio Data

Back to top

Libraries for audio analysis, manipulation, transformation, and extraction, as well as speech recognition and music generation tasks.

DeepSpeech (🥇31 · ⭐ 16K) - DeepSpeech is an open source embedded (offline, on-device).. MPL-2.0
  • GitHub (👨‍💻 150 · 🔀 3K · 📥 400K · 📦 380 · 📋 1.9K - 4% open · ⏱️ 03.01.2021):

     git clone https://github.com/mozilla/DeepSpeech
    
  • PyPi (📥 12K / month · 📦 29 · ⏱️ 19.12.2020):

     pip install deepspeech
    
Magenta (🥇29 · ⭐ 16K) - Magenta: Music and Art Generation with Machine Intelligence. Apache-2
  • GitHub (👨‍💻 140 · 🔀 3.3K · 📦 260 · 📋 830 - 35% open · ⏱️ 13.01.2021):

     git clone https://github.com/magenta/magenta
    
  • PyPi (📥 4.7K / month · 📦 35 · ⏱️ 12.11.2020):

     pip install magenta
    
Pydub (🥇29 · ⭐ 5K) - Manipulate audio with a simple and easy high level interface. MIT
  • GitHub (👨‍💻 79 · 🔀 670 · 📦 5.6K · 📋 400 - 41% open · ⏱️ 14.12.2020):

     git clone https://github.com/jiaaro/pydub
    
  • PyPi (📥 110K / month · 📦 1.4K · ⏱️ 03.06.2020):

     pip install pydub
    
  • Conda (📥 12K · ⏱️ 02.02.2019):

     conda install -c conda-forge pydub
    
torchaudio (🥇29 · ⭐ 1.2K) - Data manipulation and transformation for audio signal.. BSD-2
  • GitHub (👨‍💻 85 · 🔀 260 · 📦 1.2K · 📋 400 - 38% open · ⏱️ 15.01.2021):

     git clone https://github.com/pytorch/audio
    
  • PyPi (📥 68K / month · 📦 9 · ⏱️ 27.10.2020):

     pip install torchaudio
    
audioread (🥈27 · ⭐ 360) - cross-library (GStreamer + Core Audio + MAD + FFmpeg) audio decoding.. MIT
  • GitHub (👨‍💻 20 · 🔀 83 · 📦 3.9K · 📋 75 - 41% open · ⏱️ 20.10.2020):

     git clone https://github.com/beetbox/audioread
    
  • PyPi (📥 270K / month · 📦 590 · ⏱️ 20.10.2020):

     pip install audioread
    
  • Conda (📥 190K · ⏱️ 08.12.2020):

     conda install -c conda-forge audioread
    
librosa (🥈26 · ⭐ 4.2K) - Python library for audio and music analysis. ISC
  • GitHub (👨‍💻 80 · 🔀 680 · 📦 8.9K · 📋 830 - 6% open · ⏱️ 11.09.2020):

     git clone https://github.com/librosa/librosa
    
  • PyPi (📥 290K / month · 📦 1.8K · ⏱️ 22.07.2020):

     pip install librosa
    
  • Conda (📥 240K · ⏱️ 22.07.2020):

     conda install -c conda-forge librosa
    
spleeter (🥈25 · ⭐ 15K) - Deezer source separation library including pretrained models. MIT
  • GitHub (👨‍💻 17 · 🔀 1.5K · 📥 890K · 📋 500 - 18% open · ⏱️ 15.01.2021):

     git clone https://github.com/deezer/spleeter
    
  • PyPi (📥 5.7K / month · ⏱️ 08.01.2021):

     pip install spleeter
    
  • Conda (📥 41K · ⏱️ 30.06.2020):

     conda install -c conda-forge spleeter
    
pyAudioAnalysis (🥈25 · ⭐ 3.7K) - Python Audio Analysis Library: Feature Extraction,.. Apache-2
  • GitHub (👨‍💻 23 · 🔀 960 · 📦 180 · 📋 270 - 61% open · ⏱️ 04.01.2021):

     git clone https://github.com/tyiannak/pyAudioAnalysis
    
  • PyPi (📥 5.4K / month · 📦 11 · ⏱️ 29.09.2020):

     pip install pyAudioAnalysis
    
espnet (🥈25 · ⭐ 3.3K) - End-to-End Speech Processing Toolkit. Apache-2
  • GitHub (👨‍💻 140 · 🔀 1K · 📥 59 · 📦 4 · 📋 1.2K - 10% open · ⏱️ 15.01.2021):

     git clone https://github.com/espnet/espnet
    
  • PyPi (📥 1.1K / month · ⏱️ 15.01.2021):

     pip install espnet
    
python-soundfile (🥈25 · ⭐ 350) - SoundFile is an audio library based on libsndfile, CFFI, and.. BSD-3
  • GitHub (👨‍💻 19 · 🔀 41 · 📥 2K · 📦 5.3K · 📋 140 - 40% open · ⏱️ 01.12.2020):

     git clone https://github.com/bastibe/python-soundfile
    
  • PyPi (📥 350K / month · 📦 740 · ⏱️ 19.04.2018):

     pip install soundfile
    
python_speech_features (🥉24 · ⭐ 1.8K) - This library provides common speech features for ASR.. MIT
  • GitHub (👨‍💻 18 · 🔀 540 · 📋 72 - 31% open · ⏱️ 31.12.2020):

     git clone https://github.com/jameslyons/python_speech_features
    
  • PyPi (📥 52K / month · 📦 300 · ⏱️ 16.08.2017):

     pip install python_speech_features
    
tinytag (🥉23 · ⭐ 430) - Read music meta data and length of MP3, OGG, OPUS, MP4, M4A, FLAC, WMA and.. MIT
  • GitHub (👨‍💻 18 · 🔀 78 · 📦 300 · 📋 74 - 18% open · ⏱️ 11.11.2020):

     git clone https://github.com/devsnd/tinytag
    
  • PyPi (📥 2.8K / month · 📦 96 · ⏱️ 05.11.2020):

     pip install tinytag
    
DDSP (🥉22 · ⭐ 1.7K) - DDSP: Differentiable Digital Signal Processing. Apache-2
  • GitHub (👨‍💻 24 · 🔀 160 · 📦 5 · 📋 99 - 13% open · ⏱️ 15.01.2021):

     git clone https://github.com/magenta/ddsp
    
  • PyPi (📥 970 / month · ⏱️ 14.01.2021):

     pip install ddsp
    
kapre (🥉22 · ⭐ 700) - kapre: Keras Audio Preprocessors. MIT
  • GitHub (👨‍💻 13 · 🔀 130 · 📥 7 · 📦 440 · 📋 85 - 5% open · ⏱️ 16.11.2020):

     git clone https://github.com/keunwoochoi/kapre
    
  • PyPi (📥 860 / month · 📦 11 · ⏱️ 29.09.2020):

     pip install kapre
    
Porcupine (🥉21 · ⭐ 2.3K) - On-device wake word detection powered by deep learning. Apache-2
  • GitHub (👨‍💻 25 · 🔀 330 · 📦 4 · 📋 280 - 3% open · ⏱️ 14.01.2021):

     git clone https://github.com/Picovoice/Porcupine
    
  • PyPi (📥 830 / month · ⏱️ 02.10.2020):

     pip install pvporcupine
    
Dejavu (🥉20 · ⭐ 5.3K · 💤) - Audio fingerprinting and recognition in Python. MIT
  • GitHub (👨‍💻 23 · 🔀 1.2K · 📦 16 · 📋 210 - 39% open · ⏱️ 03.06.2020):

     git clone https://github.com/worldveil/dejavu
    
  • PyPi (📥 90 / month · 📦 1 · ⏱️ 19.04.2015):

     pip install PyDejavu
    
TTS (🥉17 · ⭐ 3.1K) - Deep learning for Text to Speech (Discussion forum:.. MPL-2.0
  • GitHub (👨‍💻 48 · 🔀 630 · 📋 440 - 5% open · ⏱️ 13.01.2021):

     git clone https://github.com/mozilla/TTS
    
Muda (🥉16 · ⭐ 170) - A library for augmenting annotated audio data. ISC
  • GitHub (👨‍💻 6 · 🔀 30 · 📦 7 · 📋 49 - 10% open · ⏱️ 20.07.2020):

     git clone https://github.com/bmcfee/muda
    
  • PyPi (📥 71 / month · 📦 1 · ⏱️ 15.11.2019):

     pip install muda
    
Julius (🥉13 · ⭐ 160 · 🐣) - Fast PyTorch based DSP for audio and 1D signals. MIT
  • GitHub (👨‍💻 2 · 🔀 6 · 📦 2 · ⏱️ 10.12.2020):

     git clone https://github.com/adefossez/julius
    
  • PyPi (📥 160 / month · ⏱️ 25.11.2020):

     pip install julius
    
Show 4 hidden projects...
  • SpeechRecognition (🥇30 · ⭐ 5.4K · 💀) - Speech recognition module for Python, supporting.. BSD-3
  • aubio (🥈26 · ⭐ 2K) - a library for audio and music analysis. ❗️GPL-3.0
  • Essentia (🥉23 · ⭐ 1.7K) - C++ library for audio and music analysis, description and.. ❗️AGPL-3.0
  • Madmom (🥉20 · ⭐ 700 · 💀) - Python audio and music signal processing library. BSD-3

Geospatial Data

Back to top

Libraries to load, process, analyze, and write geographic data as well as libraries for spatial analysis, map visualization, and geocoding.

geopy (🥇33 · ⭐ 3.2K) - Geocoding library for Python. MIT
  • GitHub (👨‍💻 120 · 🔀 520 · 📦 16K · 📋 240 - 10% open · ⏱️ 27.12.2020):

     git clone https://github.com/geopy/geopy
    
  • PyPi (📥 3.8M / month · 📦 7.2K · ⏱️ 27.12.2020):

     pip install geopy
    
  • Conda (📥 500K · ⏱️ 27.12.2020):

     conda install -c conda-forge geopy
    
Shapely (🥇33 · ⭐ 2.1K) - Manipulation and analysis of geometric objects. BSD-3
  • GitHub (👨‍💻 100 · 🔀 370 · 📦 16K · 📋 700 - 19% open · ⏱️ 15.12.2020):

     git clone https://github.com/Toblerity/Shapely
    
  • PyPi (📥 1.9M / month · 📦 5.5K · ⏱️ 20.08.2020):

     pip install shapely
    
  • Conda (📥 2M · ⏱️ 13.10.2020):

     conda install -c conda-forge shapely
    
folium (🥇32 · ⭐ 5.1K) - Python Data. Leaflet.js Maps. MIT
  • GitHub (👨‍💻 120 · 🔀 1.9K · 📦 8.1K · 📋 830 - 17% open · ⏱️ 15.01.2021):

     git clone https://github.com/python-visualization/folium
    
  • PyPi (📥 200K / month · 📦 970 · ⏱️ 06.01.2021):

     pip install folium
    
  • Conda (📥 310K · ⏱️ 06.01.2021):

     conda install -c conda-forge folium
    
GeoPandas (🥈30 · ⭐ 2.4K) - Python tools for geographic data. BSD-3
  • GitHub (👨‍💻 130 · 🔀 530 · 📥 840 · 📦 6.9K · 📋 950 - 30% open · ⏱️ 14.01.2021):

     git clone https://github.com/geopandas/geopandas
    
  • PyPi (📥 380K / month · 📦 1.2K · ⏱️ 24.06.2020):

     pip install geopandas
    
  • Conda (📥 810K · ⏱️ 16.07.2020):

     conda install -c conda-forge geopandas
    
Rasterio (🥈30 · ⭐ 1.4K) - Rasterio reads and writes geospatial raster datasets. BSD-3
  • GitHub (👨‍💻 110 · 🔀 390 · 📥 700 · 📦 2.6K · 📋 1.3K - 10% open · ⏱️ 12.01.2021):

     git clone https://github.com/mapbox/rasterio
    
  • PyPi (📥 120K / month · 📦 850 · ⏱️ 12.01.2021):

     pip install rasterio
    
  • Conda (📥 840K · ⏱️ 30.10.2020):

     conda install -c conda-forge rasterio
    
Fiona (🥈30 · ⭐ 770) - Fiona reads and writes geographic data files. BSD-3
  • GitHub (👨‍💻 65 · 🔀 160 · 📦 4.8K · 📋 620 - 11% open · ⏱️ 30.11.2020):

     git clone https://github.com/Toblerity/Fiona
    
  • PyPi (📥 440K / month · 📦 1.2K · ⏱️ 17.11.2020):

     pip install fiona
    
  • Conda (📥 1.6M · ⏱️ 17.11.2020):

     conda install -c conda-forge fiona
    
pyproj (🥈29 · ⭐ 560) - Python interface to PROJ (cartographic projections and coordinate.. MIT
  • GitHub (👨‍💻 39 · 🔀 150 · 📦 8.2K · 📋 380 - 1% open · ⏱️ 08.01.2021):

     git clone https://github.com/pyproj4/pyproj
    
  • PyPi (📥 1.1M / month · 📦 2.5K · ⏱️ 05.11.2020):

     pip install pyproj
    
  • Conda (📥 1.9M · ⏱️ 06.11.2020):

     conda install -c conda-forge pyproj
    
ipyleaflet (🥈27 · ⭐ 1.1K) - A Jupyter - Leaflet.js bridge. MIT
  • GitHub (👨‍💻 63 · 🔀 280 · 📦 670 · 📋 380 - 34% open · ⏱️ 05.01.2021):

     git clone https://github.com/jupyter-widgets/ipyleaflet
    
  • PyPi (📥 12K / month · 📦 98 · ⏱️ 05.01.2021):

     pip install ipyleaflet
    
  • Conda (📥 580K · ⏱️ 16.01.2021):

     conda install -c conda-forge ipyleaflet
    
  • NPM (📥 120K / month · 📦 2 · ⏱️ 05.01.2021):

     npm install jupyter-leaflet
    
geojson (🥈27 · ⭐ 590) - Python bindings and utilities for GeoJSON. BSD-3
  • GitHub (👨‍💻 44 · 🔀 78 · 📦 5.6K · 📋 68 - 26% open · ⏱️ 25.11.2020):

     git clone https://github.com/jazzband/geojson
    
  • PyPi (📥 390K / month · 📦 1.6K · ⏱️ 09.08.2019):

     pip install geojson
    
  • Conda (📥 360K · ⏱️ 11.08.2019):

     conda install -c conda-forge geojson
    
ArcGIS API (🥉25 · ⭐ 940) - Documentation and samples for ArcGIS API for Python. Apache-2
  • GitHub (👨‍💻 60 · 🔀 680 · 📋 320 - 39% open · ⏱️ 16.01.2021):

     git clone https://github.com/Esri/arcgis-python-api
    
  • PyPi (📥 19K / month · 📦 10 · ⏱️ 30.11.2020):

     pip install arcgis
    
  • Docker Hub (📥 3.7K · ⭐ 29 · ⏱️ 06.03.2020):

     docker pull esridocker/arcgis-api-python-notebook
    
PySAL (🥉24 · ⭐ 810) - PySAL: Python Spatial Analysis Library Meta-Package. BSD-3
  • GitHub (👨‍💻 70 · 🔀 240 · 📋 630 - 9% open · ⏱️ 12.01.2021):

     git clone https://github.com/pysal/pysal
    
  • PyPi (📥 9.9K / month · 📦 18 · ⏱️ 30.07.2020):

     pip install pysal
    
  • Conda (📥 400K · ⏱️ 30.07.2020):

     conda install -c conda-forge pysal
    
GeoViews (🥉24 · ⭐ 330) - Simple, concise geographical visualization in Python. BSD-3
  • GitHub (👨‍💻 21 · 🔀 60 · 📦 190 · 📋 250 - 31% open · ⏱️ 21.09.2020):

     git clone https://github.com/holoviz/geoviews
    
  • PyPi (📥 1.2K / month · 📦 10 · ⏱️ 30.03.2020):

     pip install geoviews
    
  • Conda (📥 54K · ⏱️ 23.09.2020):

     conda install -c conda-forge geoviews
    
EarthPy (🥉22 · ⭐ 220) - A package built to support working with spatial data using open source.. BSD-3
  • GitHub (👨‍💻 38 · 🔀 96 · 📦 72 · 📋 220 - 11% open · ⏱️ 03.12.2020):

     git clone https://github.com/earthlab/earthpy
    
  • PyPi (📥 1.6K / month · 📦 6 · ⏱️ 18.06.2020):

     pip install earthpy
    
  • Conda (📥 26K · ⏱️ 19.06.2020):

     conda install -c conda-forge earthpy
    
pymap3d (🥉21 · ⭐ 170) - pure-Python (Numpy optional) 3D coordinate conversions for geospace ecef.. BSD-2
  • GitHub (👨‍💻 8 · 🔀 55 · 📋 24 - 8% open · ⏱️ 23.12.2020):

     git clone https://github.com/geospace-code/pymap3d
    
  • PyPi (📥 8.7K / month · 📦 3 · ⏱️ 21.09.2020):

     pip install pymap3d
    
  • Conda (📥 5.9K · ⏱️ 24.09.2020):

     conda install -c conda-forge pymap3d
    
Show 7 hidden projects...
  • Geocoder (🥈29 · ⭐ 1.3K · 💀) - Python Geocoder. MIT
  • Cartopy (🥈27 · ⭐ 1.4K) - Rasterio reads and writes geospatial raster datasets. ❗️LGPL-3.0
  • Satpy (🥉25 · ⭐ 660) - Python package for earth-observing satellite data processing. ❗️GPL-3.0
  • gmaps (🥉21 · ⭐ 700 · 💀) - Google maps for Jupyter notebooks. BSD-3
  • Sentinelsat (🥉21 · ⭐ 540) - Search and download Copernicus Sentinel satellite images. ❗️GPL-3.0
  • Mapbox GL (🥉20 · ⭐ 550 · 💀) - Use Mapbox GL JS to visualize data in a Python Jupyter notebook. MIT
  • geoplotlib (🥉19 · ⭐ 890 · 💀) - python toolbox for visualizing geographical data and making maps. MIT

Financial Data

Back to top

Libraries for algorithmic stock/crypto trading, risk analytics, backtesting, technical analysis, and other tasks on financial data.

zipline (🥇30 · ⭐ 13K) - Zipline, a Pythonic Algorithmic Trading Library. Apache-2
  • GitHub (👨‍💻 150 · 🔀 3.7K · 📦 670 · 📋 970 - 33% open · ⏱️ 14.10.2020):

     git clone https://github.com/quantopian/zipline
    
  • PyPi (📥 4.6K / month · 📦 160 · ⏱️ 05.10.2020):

     pip install zipline
    
yfinance (🥇29 · ⭐ 3.9K) - Yahoo! Finance market data downloader (+faster Pandas Datareader). Apache-2
  • GitHub (👨‍💻 27 · 🔀 990 · 📦 2.7K · 📋 470 - 63% open · ⏱️ 11.01.2021):

     git clone https://github.com/ranaroussi/yfinance
    
  • PyPi (📥 100K / month · 📦 26 · ⏱️ 05.10.2020):

     pip install yfinance
    
  • Conda (📥 31K · ⏱️ 27.12.2019):

     conda install -c ranaroussi yfinance
    
Alpha Vantage (🥇27 · ⭐ 3K) - A python wrapper for Alpha Vantage API for financial data. MIT
  • GitHub (👨‍💻 36 · 🔀 540 · 📋 220 - 8% open · ⏱️ 21.12.2020):

     git clone https://github.com/RomelTorres/alpha_vantage
    
  • PyPi (📥 21K / month · 📦 200 · ⏱️ 26.08.2018):

     pip install alpha_vantage
    
ta (🥇27 · ⭐ 1.7K) - Technical Analysis Library using Pandas and Numpy. MIT
  • GitHub (👨‍💻 23 · 🔀 450 · 📦 400 · 📋 140 - 42% open · ⏱️ 29.11.2020):

     git clone https://github.com/bukosabino/ta
    
  • PyPi (📥 30K / month · 📦 6 · ⏱️ 29.11.2020):

     pip install ta
    
pyfolio (🥈25 · ⭐ 3.4K) - Portfolio and risk analytics in Python. Apache-2
  • GitHub (👨‍💻 55 · 🔀 1.1K · 📦 190 · 📋 390 - 31% open · ⏱️ 15.07.2020):

     git clone https://github.com/quantopian/pyfolio
    
  • PyPi (📥 6.4K / month · 📦 54 · ⏱️ 15.04.2019):

     pip install pyfolio
    
  • Conda (📥 5.4K · ⏱️ 16.05.2020):

     conda install -c conda-forge pyfolio
    
empyrical (🥈25 · ⭐ 690) - Common financial risk and performance metrics. Used by zipline and.. Apache-2
  • GitHub (👨‍💻 22 · 🔀 220 · 📦 500 · 📋 53 - 50% open · ⏱️ 14.10.2020):

     git clone https://github.com/quantopian/empyrical
    
  • PyPi (📥 27K / month · 📦 220 · ⏱️ 13.10.2020):

     pip install empyrical
    
  • Conda (📥 9.1K · ⏱️ 14.10.2020):

     conda install -c conda-forge empyrical
    
Alphalens (🥈24 · ⭐ 1.7K · 💤) - Performance analysis of predictive (alpha) stock factors. Apache-2
  • GitHub (👨‍💻 25 · 🔀 630 · 📦 350 · 📋 180 - 20% open · ⏱️ 27.04.2020):

     git clone https://github.com/quantopian/alphalens
    
  • PyPi (📥 2K / month · 📦 14 · ⏱️ 27.04.2020):

     pip install alphalens
    
  • Conda (📥 10K · ⏱️ 16.05.2020):

     conda install -c conda-forge alphalens
    
IB-insync (🥈24 · ⭐ 1.2K) - Python sync/async framework for Interactive Brokers API. BSD-2
  • GitHub (👨‍💻 25 · 🔀 380 · 📋 280 - 1% open · ⏱️ 12.01.2021):

     git clone https://github.com/erdewit/ib_insync
    
  • PyPi (📥 2.9K / month · 📦 12 · ⏱️ 03.11.2020):

     pip install ib_insync
    
  • Conda (📥 7.1K · ⏱️ 07.11.2020):

     conda install -c conda-forge ib-insync
    
ffn (🥈24 · ⭐ 740) - ffn - a financial function library for Python. MIT
  • GitHub (👨‍💻 23 · 🔀 170 · 📦 96 · 📋 77 - 19% open · ⏱️ 16.01.2021):

     git clone https://github.com/pmorissette/ffn
    
  • PyPi (📥 11K / month · 📦 16 · ⏱️ 28.05.2018):

     pip install ffn
    
Enigma Catalyst (🥉23 · ⭐ 1.9K) - An Algorithmic Trading Library for Crypto-Assets in Python. Apache-2
  • GitHub (👨‍💻 140 · 🔀 590 · 📦 20 · 📋 500 - 26% open · ⏱️ 28.12.2020):

     git clone https://github.com/enigmampc/catalyst
    
  • PyPi (📥 340 / month · ⏱️ 11.11.2018):

     pip install enigma-catalyst
    
stockstats (🥉23 · ⭐ 690) - Supply a wrapper ``StockDataFrame`` based on the.. BSD-3
  • GitHub (👨‍💻 7 · 🔀 190 · 📦 160 · 📋 58 - 50% open · ⏱️ 17.10.2020):

     git clone https://github.com/jealous/stockstats
    
  • PyPi (📥 26K / month · 📦 24 · ⏱️ 17.10.2020):

     pip install stockstats
    
bt (🥉22 · ⭐ 930) - bt - flexible backtesting for Python. MIT
  • GitHub (👨‍💻 20 · 🔀 250 · 📦 42 · 📋 220 - 24% open · ⏱️ 16.01.2021):

     git clone https://github.com/pmorissette/bt
    
  • PyPi (📥 3.5K / month · 📦 20 · ⏱️ 28.05.2018):

     pip install bt
    
TensorTrade (🥉21 · ⭐ 2.2K) - An open source reinforcement learning framework for training,.. Apache-2
  • GitHub (👨‍💻 46 · 🔀 580 · 📦 10 · 📋 150 - 21% open · ⏱️ 06.01.2021):

     git clone https://github.com/tensortrade-org/tensortrade
    
  • PyPi (📥 960 / month · 📦 1 · ⏱️ 24.08.2020):

     pip install tensortrade
    
finmarketpy (🥉20 · ⭐ 2.4K) - Python library for backtesting trading strategies & analyzing.. Apache-2
  • GitHub (👨‍💻 14 · 🔀 410 · 📥 39 · 📦 3 · 📋 24 - 87% open · ⏱️ 16.01.2021):

     git clone https://github.com/cuemacro/finmarketpy
    
  • PyPi (📥 71 / month · ⏱️ 20.10.2020):

     pip install finmarketpy
    
Qlib (🥉19 · ⭐ 3.7K · 🐣) - Qlib is an AI-oriented quantitative investment platform, which aims.. MIT
  • GitHub (👨‍💻 27 · 🔀 560 · 📥 190 · 📋 100 - 53% open · ⏱️ 14.01.2021):

     git clone https://github.com/microsoft/qlib
    
  • PyPi (📥 1.9K / month · ⏱️ 12.12.2020):

     pip install pyqlib
    
tf-quant-finance (🥉19 · ⭐ 1.4K) - High-performance TensorFlow library for quantitative.. Apache-2
  • GitHub (👨‍💻 29 · 🔀 250 · 📦 4 · 📋 19 - 47% open · ⏱️ 15.01.2021):

     git clone https://github.com/google/tf-quant-finance
    
  • PyPi (📥 100 / month · ⏱️ 18.09.2020):

     pip install tf-quant-finance
    
Crypto Signals (🥉18 · ⭐ 2.4K) - Github.com/CryptoSignal - #1 Quant Trading & Technical Analysis.. MIT
  • GitHub (👨‍💻 25 · 🔀 640 · 📋 230 - 18% open · ⏱️ 03.09.2020):

     git clone https://github.com/CryptoSignal/crypto-signal
    
  • Docker Hub (📥 41K · ⭐ 8 · ⏱️ 03.09.2020):

     docker pull shadowreaver/crypto-signal
    
Show 6 hidden projects...

Time Series Data

Back to top

Libraries for forecasting, anomaly detection, feature extraction, and machine learning on time-series and sequential data.

Prophet (🥇30 · ⭐ 12K) - Tool for producing high quality forecasts for time series data that has.. MIT
  • GitHub (👨‍💻 120 · 🔀 3.4K · 📥 530 · 📋 1.5K - 4% open · ⏱️ 14.01.2021):

     git clone https://github.com/facebook/prophet
    
  • PyPi (📥 590K / month · 📦 150 · ⏱️ 05.09.2020):

     pip install fbprophet
    
tsfresh (🥇27 · ⭐ 5.3K) - Automatic extraction of relevant features from time series:. MIT
  • GitHub (👨‍💻 71 · 🔀 840 · 📋 430 - 7% open · ⏱️ 12.01.2021):

     git clone https://github.com/blue-yonder/tsfresh
    
  • PyPi (📥 170K / month · 📦 66 · ⏱️ 09.09.2020):

     pip install tsfresh
    
  • Conda (📥 26K · ⏱️ 10.09.2020):

     conda install -c conda-forge tsfresh
    
tslearn (🥇27 · ⭐ 1.4K) - A machine learning toolkit dedicated to time-series data. BSD-2
  • GitHub (👨‍💻 28 · 🔀 220 · 📦 180 · 📋 220 - 26% open · ⏱️ 08.01.2021):

     git clone https://github.com/tslearn-team/tslearn
    
  • PyPi (📥 67K / month · 📦 11 · ⏱️ 18.06.2020):

     pip install tslearn
    
  • Conda (📥 180K · ⏱️ 19.06.2020):

     conda install -c conda-forge tslearn
    
sktime (🥈26 · ⭐ 3.4K) - A unified framework for machine learning with time series. BSD-3
  • GitHub (👨‍💻 71 · 🔀 420 · 📥 41 · 📦 91 · 📋 320 - 31% open · ⏱️ 15.01.2021):

     git clone https://github.com/alan-turing-institute/sktime
    
  • PyPi (📥 7.2K / month · ⏱️ 29.12.2020):

     pip install sktime
    
pmdarima (🥈26 · ⭐ 800) - A statistical library designed to fill the void in Python's time series.. MIT
  • GitHub (👨‍💻 16 · 🔀 150 · 📦 680 · 📋 220 - 7% open · ⏱️ 02.12.2020):

     git clone https://github.com/alkaline-ml/pmdarima
    
  • PyPi (📥 190K / month · 📦 12 · ⏱️ 02.12.2020):

     pip install pmdarima
    
Streamz (🥈24 · ⭐ 900) - Real-time stream processing for python. BSD-3
  • GitHub (👨‍💻 38 · 🔀 110 · 📦 180 · 📋 220 - 41% open · ⏱️ 14.01.2021):

     git clone https://github.com/python-streamz/streamz
    
  • PyPi (📥 1.9K / month · 📦 16 · ⏱️ 02.11.2020):

     pip install streamz
    
  • Conda (📥 110K · ⏱️ 15.01.2021):

     conda install -c conda-forge streamz
    
GluonTS (🥈23 · ⭐ 1.7K) - Probabilistic time series modeling in Python. Apache-2
  • GitHub (👨‍💻 67 · 🔀 360 · 📋 520 - 34% open · ⏱️ 16.01.2021):

     git clone https://github.com/awslabs/gluon-ts
    
  • PyPi (📥 21K / month · ⏱️ 08.12.2020):

     pip install gluonts
    
STUMPY (🥉22 · ⭐ 1.6K) - STUMPY is a powerful and scalable Python library for computing a Matrix.. BSD-3
  • GitHub (👨‍💻 20 · 🔀 160 · 📋 200 - 13% open · ⏱️ 17.01.2021):

     git clone https://github.com/TDAmeritrade/stumpy
    
  • PyPi (📥 16K / month · ⏱️ 31.12.2020):

     pip install stumpy
    
  • Conda (📥 15K · ⏱️ 31.12.2020):

     conda install -c conda-forge stumpy
    
Darts (🥉21 · ⭐ 700) - A python library for easy manipulation and forecasting of time series. Apache-2
  • GitHub (👨‍💻 23 · 🔀 85 · 📦 11 · 📋 57 - 28% open · ⏱️ 09.11.2020):

     git clone https://github.com/unit8co/darts
    
  • PyPi (📥 1K / month · ⏱️ 09.11.2020):

     pip install u8darts
    
  • Docker Hub (📥 75 · ⏱️ 06.10.2020):

     docker pull unit8/darts
    
pyts (🥉20 · ⭐ 850 · 💤) - A Python package for time series classification. BSD-3
  • GitHub (👨‍💻 7 · 🔀 93 · 📦 79 · 📋 41 - 56% open · ⏱️ 30.04.2020):

     git clone https://github.com/johannfaouzi/pyts
    
  • PyPi (📥 2K / month · 📦 2 · ⏱️ 21.03.2020):

     pip install pyts
    
  • Conda (📥 5.9K · ⏱️ 21.03.2020):

     conda install -c conda-forge pyts
    
pytorch-forecasting (🥉19 · ⭐ 630 · 🐣) - Time series forecasting with PyTorch. MIT
  • GitHub (👨‍💻 11 · 🔀 65 · 📋 92 - 29% open · ⏱️ 13.01.2021):

     git clone https://github.com/jdb78/pytorch-forecasting
    
  • PyPi (📥 2.9K / month · ⏱️ 12.01.2021):

     pip install pytorch-forecasting
    
matrixprofile-ts (🥉19 · ⭐ 600 · 💤) - A Python library for detecting patterns and anomalies.. Apache-2
  • GitHub (👨‍💻 15 · 🔀 90 · 📦 11 · 📋 52 - 36% open · ⏱️ 25.04.2020):

     git clone https://github.com/target/matrixprofile-ts
    
  • PyPi (📥 2.5K / month · ⏱️ 08.08.2019):

     pip install matrixprofile-ts
    
seglearn (🥉19 · ⭐ 410) - Python module for machine learning time series:. BSD-3
  • GitHub (👨‍💻 13 · 🔀 47 · 📦 6 · 📋 25 - 24% open · ⏱️ 16.01.2021):

     git clone https://github.com/dmbee/seglearn
    
  • PyPi (📥 1.8K / month · 📦 1 · ⏱️ 02.04.2020):

     pip install seglearn
    
ADTK (🥉17 · ⭐ 580 · 💤) - A Python toolkit for rule-based/unsupervised anomaly detection in time.. MPL-2.0
  • GitHub (👨‍💻 11 · 🔀 73 · 📋 50 - 40% open · ⏱️ 17.04.2020):

     git clone https://github.com/arundo/adtk
    
  • PyPi (📥 29K / month · ⏱️ 17.04.2020):

     pip install adtk
    
atspy (🥉17 · ⭐ 330) - AtsPy: Automated Time Series Models in Python (by @firmai). MIT
  • GitHub (👨‍💻 5 · 🔀 63 · 📦 3 · 📋 18 - 88% open · ⏱️ 12.11.2020):

     git clone https://github.com/firmai/atspy
    
  • PyPi (📥 220 / month · ⏱️ 24.04.2020):

     pip install atspy
    
tick (🥉17 · ⭐ 320 · 💤) - Module for statistical learning, with a particular emphasis on time-.. BSD-3
  • GitHub (👨‍💻 16 · 🔀 74 · 📥 160 · 📦 29 · 📋 210 - 25% open · ⏱️ 15.06.2020):

     git clone https://github.com/X-DataInitiative/tick
    
  • PyPi (📥 420 / month · 📦 1 · ⏱️ 24.05.2020):

     pip install tick
    
Show 4 hidden projects...
  • luminol (🥈23 · ⭐ 860 · 💀) - Anomaly Detection and Correlation library. Apache-2
  • PyFlux (🥉22 · ⭐ 1.8K · 💀) - Open source time series library for Python. BSD-3
  • pydlm (🥉20 · ⭐ 350 · 💀) - A python library for Bayesian time series modeling. BSD-3
  • Auto TS (🥉17 · ⭐ 150) - Automatically build ARIMA, SARIMAX, VAR, FB Prophet and ML Models on.. Apache-2

Medical Data

Back to top

Libraries for processing and analyzing medical data such as MRIs, EEGs, genomic data, and other medical imaging formats.

MNE (🥇31 · ⭐ 1.5K) - MNE: Magnetoencephalography (MEG) and Electroencephalography (EEG) in Python. BSD-3
  • GitHub (👨‍💻 230 · 🔀 840 · 📦 790 · 📋 3.4K - 8% open · ⏱️ 15.01.2021):

     git clone https://github.com/mne-tools/mne-python
    
  • PyPi (📥 18K / month · 📦 200 · ⏱️ 17.12.2020):

     pip install mne
    
  • Conda (📥 97K · ⏱️ 20.12.2020):

     conda install -c conda-forge mne
    
Nilearn (🥇30 · ⭐ 690) - Machine learning for NeuroImaging in Python. BSD-3
  • GitHub (👨‍💻 160 · 🔀 370 · 📦 860 · 📋 1.5K - 25% open · ⏱️ 15.01.2021):

     git clone https://github.com/nilearn/nilearn
    
  • PyPi (📥 8.4K / month · 📦 300 · ⏱️ 12.11.2020):

     pip install nilearn
    
  • Conda (📥 76K · ⏱️ 12.11.2020):

     conda install -c conda-forge nilearn
    
Lifelines (🥈29 · ⭐ 1.5K) - Survival analysis in Python. MIT
  • GitHub (👨‍💻 90 · 🔀 410 · 📦 480 · 📋 780 - 24% open · ⏱️ 05.01.2021):

     git clone https://github.com/CamDavidsonPilon/lifelines
    
  • PyPi (📥 97K / month · 📦 130 · ⏱️ 09.12.2020):

     pip install lifelines
    
  • Conda (📥 120K · ⏱️ 10.12.2020):

     conda install -c conda-forge lifelines
    
NIPYPE (🥈29 · ⭐ 540) - Workflows and interfaces for neuroimaging packages. Apache-2
  • GitHub (👨‍💻 210 · 🔀 440 · 📦 490 · 📋 1.2K - 26% open · ⏱️ 08.12.2020):

     git clone https://github.com/nipy/nipype
    
  • PyPi (📥 10K / month · 📦 190 · ⏱️ 16.08.2020):

     pip install nipype
    
  • Conda (📥 350K · ⏱️ 28.11.2020):

     conda install -c conda-forge nipype
    
NiBabel (🥈29 · ⭐ 380) - Python package to access a cacophony of neuro-imaging file formats. MIT
  • GitHub (👨‍💻 89 · 🔀 210 · 📦 3.5K · 📋 390 - 27% open · ⏱️ 13.01.2021):

     git clone https://github.com/nipy/nibabel
    
  • PyPi (📥 44K / month · 📦 1.3K · ⏱️ 28.11.2020):

     pip install nibabel
    
  • Conda (📥 290K · ⏱️ 29.11.2020):

     conda install -c conda-forge nibabel
    
DIPY (🥈28 · ⭐ 380) - DIPY is the paragon 3D/4D+ imaging library in Python. Contains generic.. BSD-3
  • GitHub (👨‍💻 120 · 🔀 310 · 📦 320 · 📋 730 - 21% open · ⏱️ 10.12.2020):

     git clone https://github.com/dipy/dipy
    
  • PyPi (📥 6.2K / month · 📦 94 · ⏱️ 05.11.2020):

     pip install dipy
    
  • Conda (📥 170K · ⏱️ 14.11.2020):

     conda install -c conda-forge dipy
    
Hail (🥈23 · ⭐ 690) - Scalable genomic data analysis. MIT
  • GitHub (👨‍💻 69 · 🔀 180 · 📦 28 · 📋 2.3K - 14% open · ⏱️ 15.01.2021):

     git clone https://github.com/hail-is/hail
    
  • PyPi (📥 910 / month · ⏱️ 03.12.2020):

     pip install hail
    
MONAI (🥈22 · ⭐ 1.7K) - AI Toolkit for Healthcare Imaging. Apache-2
  • GitHub (👨‍💻 52 · 🔀 300 · 📋 660 - 11% open · ⏱️ 15.01.2021):

     git clone https://github.com/Project-MONAI/MONAI
    
  • PyPi (📥 3.5K / month · ⏱️ 15.12.2020):

     pip install monai
    
NiftyNet (🥈22 · ⭐ 1.3K · 💤) - [unmaintained] An open-source convolutional neural.. Apache-2
  • GitHub (👨‍💻 58 · 🔀 370 · 📦 33 · 📋 320 - 30% open · ⏱️ 21.04.2020):

     git clone https://github.com/NifTK/NiftyNet
    
  • PyPi (📥 160 / month · ⏱️ 10.10.2019):

     pip install niftynet
    
DeepVariant (🥉21 · ⭐ 2.2K) - DeepVariant is an analysis pipeline that uses a deep neural.. BSD-3
  • GitHub (👨‍💻 19 · 🔀 520 · 📥 3K · 📋 370 - 0% open · ⏱️ 14.12.2020):

     git clone https://github.com/google/deepvariant
    
  • Conda (📥 26K · ⏱️ 05.09.2020):

     conda install -c bioconda deepvariant
    
Brainiak (🥉18 · ⭐ 230) - Brain Imaging Analysis Kit. Apache-2
  • GitHub (👨‍💻 32 · 🔀 110 · 📦 12 · 📋 180 - 35% open · ⏱️ 15.10.2020):

     git clone https://github.com/brainiak/brainiak
    
  • PyPi (📥 98 / month · 📦 1 · ⏱️ 15.10.2020):

     pip install brainiak
    
  • Docker Hub (📥 460 · ⭐ 1 · ⏱️ 15.10.2020):

     docker pull brainiak/brainiak
    
Glow (🥉18 · ⭐ 150) - An open-source toolkit for large-scale genomic analysis. Apache-2
  • GitHub (👨‍💻 11 · 🔀 40 · 📋 63 - 34% open · ⏱️ 13.01.2021):

     git clone https://github.com/projectglow/glow
    
  • PyPi (📥 1K / month · ⏱️ 10.09.2020):

     pip install glow.py
    
Medical Detection Toolkit (🥉12 · ⭐ 890 · 💤) - The Medical Detection Toolkit contains 2D + 3D.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 230 · 📋 110 - 24% open · ⏱️ 18.04.2020):

     git clone https://github.com/MIC-DKFZ/medicaldetectiontoolkit
    
MedicalNet (🥉11 · ⭐ 1K) - Many studies have shown that the performance on deep learning is.. MIT
  • GitHub (👨‍💻 1 · 🔀 280 · 📋 56 - 75% open · ⏱️ 27.08.2020):

     git clone https://github.com/Tencent/MedicalNet
    
Show 5 hidden projects...
  • NIPY (🥉21 · ⭐ 290) - Neuroimaging in Python FMRI analysis package. ❗️DSDP
  • MedPy (🥉20 · ⭐ 310 · 💤) - Medical image processing in Python. ❗️GPL-3.0
  • DLTK (🥉19 · ⭐ 1.2K · 💀) - Deep Learning Toolkit for Medical Image Analysis. Apache-2
  • MedicalTorch (🥉15 · ⭐ 700 · 💀) - A medical imaging framework for Pytorch. Apache-2
  • DeepNeuro (🥉14 · ⭐ 98 · 💤) - A deep learning python package for neuroimaging data. Made by:. MIT

Optical Character Recognition

Back to top

Libraries for optical character recognition (OCR) and text extraction from images or videos.

Tesseract (🥇29 · ⭐ 3.4K · 📉) - Python-tesseract is an optical character recognition (OCR).. Apache-2
  • GitHub (👨‍💻 36 · 🔀 500 · 📋 240 - 2% open · ⏱️ 04.01.2021):

     git clone https://github.com/madmaze/pytesseract
    
  • PyPi (📥 430K / month · 📦 1.4K · ⏱️ 15.12.2020):

     pip install pytesseract
    
  • Conda (📥 38K · ⏱️ 20.11.2020):

     conda install -c conda-forge pytesseract
    
EasyOCR (🥈27 · ⭐ 10K) - Ready-to-use OCR with 80+ supported languages and all popular writing.. Apache-2
  • GitHub (👨‍💻 73 · 🔀 970 · 📥 99K · 📦 100 · 📋 240 - 33% open · ⏱️ 14.01.2021):

     git clone https://github.com/JaidedAI/EasyOCR
    
  • PyPi (📥 9.9K / month · ⏱️ 05.01.2021):

     pip install easyocr
    
OCRmyPDF (🥈27 · ⭐ 3.7K) - OCRmyPDF adds an OCR text layer to scanned PDF files, allowing them.. MPL-2.0
  • GitHub (👨‍💻 49 · 🔀 420 · 📦 110 · 📋 640 - 13% open · ⏱️ 10.01.2021):

     git clone https://github.com/jbarlow83/OCRmyPDF
    
  • PyPi (📥 13K / month · 📦 12 · ⏱️ 10.01.2021):

     pip install ocrmypdf
    
tesserocr (🥈26 · ⭐ 1.4K) - A Python wrapper for the tesseract-ocr API. MIT
  • GitHub (👨‍💻 23 · 🔀 180 · 📦 450 · 📋 200 - 29% open · ⏱️ 17.11.2020):

     git clone https://github.com/sirfz/tesserocr
    
  • PyPi (📥 24K / month · 📦 50 · ⏱️ 17.03.2020):

     pip install tesserocr
    
  • Conda (📥 32K · ⏱️ 13.01.2021):

     conda install -c conda-forge tesserocr
    
PaddleOCR (🥉24 · ⭐ 8.4K) - Awesome multilingual OCR toolkits based on PaddlePaddle.. Apache-2
  • GitHub (👨‍💻 41 · 🔀 1.6K · 📦 19 · 📋 1.2K - 30% open · ⏱️ 12.01.2021):

     git clone https://github.com/PaddlePaddle/PaddleOCR
    
  • PyPi (📥 1.6K / month · ⏱️ 18.12.2020):

     pip install paddleocr
    
attention-ocr (🥉20 · ⭐ 820) - A Tensorflow model for text recognition (CNN + seq2seq with.. MIT
  • GitHub (👨‍💻 27 · 🔀 230 · 📦 15 · 📋 140 - 11% open · ⏱️ 31.10.2020):

     git clone https://github.com/emedvedev/attention-ocr
    
  • PyPi (📥 150 / month · ⏱️ 19.04.2019):

     pip install aocr
    
keras-ocr (🥉20 · ⭐ 740) - A packaged and flexible version of the CRAFT text detector and.. MIT
  • GitHub (👨‍💻 9 · 🔀 170 · 📥 14K · 📋 120 - 22% open · ⏱️ 26.11.2020):

     git clone https://github.com/faustomorales/keras-ocr
    
  • PyPi (📥 1.5K / month · 📦 1 · ⏱️ 26.11.2020):

     pip install keras-ocr
    
doc2text (🥉19 · ⭐ 1.2K) - Detect text blocks and OCR poorly scanned PDFs in bulk. Python module.. MIT
  • GitHub (👨‍💻 5 · 🔀 92 · 📦 32 · 📋 21 - 57% open · ⏱️ 01.12.2020):

     git clone https://github.com/jlsutherland/doc2text
    
  • PyPi (📥 230 / month · ⏱️ 06.09.2016):

     pip install doc2text
    
calamari (🥉18 · ⭐ 760) - Line based ATR Engine based on OCRopy. Apache-2
  • GitHub (👨‍💻 17 · 🔀 160 · 📋 150 - 13% open · ⏱️ 14.01.2021):

     git clone https://github.com/Calamari-OCR/calamari
    
  • PyPi (📥 350 / month · 📦 2 · ⏱️ 13.11.2018):

     pip install calamari_ocr
    
Show 1 hidden projects...
  • pdftabextract (🥉20 · ⭐ 1.9K · 💀) - A set of tools for extracting tables from PDF files.. Apache-2

Data Containers & Structures

Back to top

General-purpose data containers & structures as well as utilities & extensions for pandas.

pandas (🥇42 · ⭐ 28K · 📉) - Flexible and powerful data analysis / manipulation library for.. BSD-3
  • GitHub (👨‍💻 2.6K · 🔀 12K · 📥 93K · 📦 370K · 📋 20K - 17% open · ⏱️ 17.01.2021):

     git clone https://github.com/pandas-dev/pandas
    
  • PyPi (📥 23M / month · 📦 77K · ⏱️ 26.12.2020):

     pip install pandas
    
  • Conda (📥 13M · ⏱️ 14.01.2021):

     conda install -c conda-forge pandas
    
numpy (🥇42 · ⭐ 16K) - The fundamental package for scientific computing with Python. BSD-3
  • GitHub (👨‍💻 1.2K · 🔀 5.2K · 📥 300K · 📦 600K · 📋 9.5K - 23% open · ⏱️ 16.01.2021):

     git clone https://github.com/numpy/numpy
    
  • PyPi (📥 37M / month · 📦 170K · ⏱️ 05.01.2021):

     pip install numpy
    
  • Conda (📥 15M · ⏱️ 11.01.2021):

     conda install -c conda-forge numpy
    
h5py (🥇36 · ⭐ 1.5K) - HDF5 for Python -- The h5py package is a Pythonic interface to the HDF5.. BSD-3
  • GitHub (👨‍💻 160 · 🔀 390 · 📥 780 · 📦 99K · 📋 1.1K - 16% open · ⏱️ 15.01.2021):

     git clone https://github.com/h5py/h5py
    
  • PyPi (📥 4.9M / month · 📦 23K · ⏱️ 06.11.2020):

     pip install h5py
    
  • Conda (📥 4.4M · ⏱️ 01.01.2021):

     conda install -c conda-forge h5py
    
Arrow (🥈35 · ⭐ 6.9K) - Apache Arrow is a cross-language development platform for in-memory.. Apache-2
  • GitHub (👨‍💻 610 · 🔀 1.7K · 📦 29 · 📋 650 - 16% open · ⏱️ 17.01.2021):

     git clone https://github.com/apache/arrow
    
  • PyPi (📥 14M / month · 📦 990 · ⏱️ 19.10.2020):

     pip install pyarrow
    
  • Conda (📥 450K · ⏱️ 12.01.2021):

     conda install -c conda-forge arrow
    
xarray (🥈32 · ⭐ 1.9K) - N-D labeled arrays and datasets in Python. Apache-2
  • GitHub (👨‍💻 290 · 🔀 620 · 📦 4.9K · 📋 2.7K - 29% open · ⏱️ 15.01.2021):

     git clone https://github.com/pydata/xarray
    
  • PyPi (📥 260K / month · 📦 1.1K · ⏱️ 30.11.2020):

     pip install xarray
    
  • Conda (📥 2.3M · ⏱️ 01.12.2020):

     conda install -c conda-forge xarray
    
numexpr (🥈31 · ⭐ 1.5K) - Fast numerical array expression evaluator for Python, NumPy, PyTables,.. MIT
  • GitHub (👨‍💻 55 · 🔀 160 · 📋 300 - 17% open · ⏱️ 04.01.2021):

     git clone https://github.com/pydata/numexpr
    
  • PyPi (📥 660K / month · 📦 5.5K · ⏱️ 05.01.2020):

     pip install numexpr
    
  • Conda (📥 2.1M · ⏱️ 13.01.2021):

     conda install -c conda-forge numexpr
    
Modin (🥈29 · ⭐ 5.6K) - Modin: Speed up your Pandas workflows by changing a single line of.. Apache-2
  • GitHub (👨‍💻 66 · 🔀 380 · 📥 200K · 📦 280 · 📋 1.5K - 28% open · ⏱️ 15.01.2021):

     git clone https://github.com/modin-project/modin
    
  • PyPi (📥 74K / month · 📦 18 · ⏱️ 12.01.2021):

     pip install modin
    
TinyDB (🥈29 · ⭐ 4K) - TinyDB is a lightweight document oriented database optimized for your.. MIT
  • GitHub (👨‍💻 62 · 🔀 350 · 📋 250 - 5% open · ⏱️ 04.01.2021):

     git clone https://github.com/msiemens/tinydb
    
  • PyPi (📥 130K / month · 📦 1.1K · ⏱️ 14.11.2020):

     pip install tinydb
    
  • Conda (📥 99K · ⏱️ 14.11.2020):

     conda install -c conda-forge tinydb
    
Koalas (🥈29 · ⭐ 2.6K) - Koalas: pandas API on Apache Spark. Apache-2
  • GitHub (👨‍💻 47 · 🔀 290 · 📥 1K · 📦 70 · 📋 500 - 16% open · ⏱️ 17.01.2021):

     git clone https://github.com/databricks/koalas
    
  • PyPi (📥 820K / month · 📦 1 · ⏱️ 11.12.2020):

     pip install koalas
    
  • Conda (📥 75K · ⏱️ 11.12.2020):

     conda install -c conda-forge koalas
    
PyTables (🥈28 · ⭐ 1K) - A Python package to manage extremely large amounts of data. BSD-3
  • GitHub (👨‍💻 96 · 🔀 180 · 📥 120 · 📋 590 - 25% open · ⏱️ 10.01.2021):

     git clone https://github.com/PyTables/PyTables
    
  • PyPi (📥 350K / month · 📦 3.8K · ⏱️ 30.10.2019):

     pip install tables
    
  • Conda (📥 2.1M · ⏱️ 14.01.2021):

     conda install -c conda-forge pytables
    
Bottleneck (🥈28 · ⭐ 570) - Fast NumPy array functions written in C. BSD-2
  • GitHub (👨‍💻 20 · 🔀 62 · 📦 18K · 📋 200 - 11% open · ⏱️ 25.11.2020):

     git clone https://github.com/pydata/bottleneck
    
  • PyPi (📥 200K / month · 📦 2.9K · ⏱️ 21.02.2020):

     pip install Bottleneck
    
  • Conda (📥 1.4M · ⏱️ 13.01.2021):

     conda install -c conda-forge bottleneck
    
datasketch (🥉27 · ⭐ 1.4K) - MinHash, LSH, LSH Forest, Weighted MinHash, HyperLogLog,.. MIT
  • GitHub (👨‍💻 16 · 🔀 200 · 📥 16 · 📦 240 · 📋 110 - 17% open · ⏱️ 23.12.2020):

     git clone https://github.com/ekzhu/datasketch
    
  • PyPi (📥 180K / month · 📦 86 · ⏱️ 15.12.2020):

     pip install datasketch
    
zarr (🥉27 · ⭐ 620 · 📈) - An implementation of chunked, compressed, N-dimensional arrays for Python. MIT
  • GitHub (👨‍💻 35 · 🔀 110 · 📦 520 · 📋 410 - 43% open · ⏱️ 04.01.2021):

     git clone https://github.com/zarr-developers/zarr-python
    
  • PyPi (📥 61K / month · 📦 72 · ⏱️ 02.12.2020):

     pip install zarr
    
  • Conda (📥 520K · ⏱️ 03.12.2020):

     conda install -c conda-forge zarr
    
swifter (🥉25 · ⭐ 1.5K) - A package which efficiently applies any function to a pandas.. MIT
  • GitHub (👨‍💻 14 · 🔀 69 · 📦 270 · 📋 90 - 15% open · ⏱️ 19.12.2020):

     git clone https://github.com/jmcarpenter2/swifter
    
  • PyPi (📥 50K / month · 📦 16 · ⏱️ 11.10.2020):

     pip install swifter
    
  • Conda (📥 74K · ⏱️ 20.09.2020):

     conda install -c conda-forge swifter
    
bcolz (🥉25 · ⭐ 910) - A columnar data container that can be compressed. BSD-3
  • GitHub (👨‍💻 33 · 🔀 120 · 📦 1.5K · 📋 250 - 51% open · ⏱️ 10.09.2020):

     git clone https://github.com/Blosc/bcolz
    
  • PyPi (📥 19K / month · 📦 970 · ⏱️ 13.04.2018):

     pip install bcolz
    
  • Conda (📥 210K · ⏱️ 05.11.2019):

     conda install -c conda-forge bcolz
    
Pandaral·lel (🥉23 · ⭐ 1.3K) - A simple and efficient tool to parallelize Pandas.. BSD-3
  • GitHub (👨‍💻 13 · 🔀 92 · 📦 140 · 📋 110 - 53% open · ⏱️ 16.12.2020):

     git clone https://github.com/nalepae/pandarallel
    
  • PyPi (📥 39K / month · ⏱️ 25.08.2020):

     pip install pandarallel
    
Vaex (🥉22 · ⭐ 5.6K) - Out-of-Core DataFrames for Python, ML, visualize and explore big tabular data.. MIT
  • GitHub (👨‍💻 37 · 🔀 430 · 📥 200 · 📋 660 - 41% open · ⏱️ 15.01.2021):

     git clone https://github.com/vaexio/vaex
    
  • PyPi (📥 3.2K / month · 📦 2 · ⏱️ 13.01.2021):

     pip install vaex
    
  • Conda (📥 94K · ⏱️ 01.06.2020):

     conda install -c conda-forge vaex
    
datatable (🥉20 · ⭐ 1.1K) - A Python package for manipulating 2-dimensional tabular data.. MPL-2.0
  • GitHub (👨‍💻 28 · 🔀 95 · 📥 740 · 📋 1.2K - 9% open · ⏱️ 13.01.2021):

     git clone https://github.com/h2oai/datatable
    
  • PyPi (📥 15K / month · 📦 6 · ⏱️ 10.12.2020):

     pip install datatable
    
fletcher (🥉20 · ⭐ 210) - Pandas ExtensionDType/Array backed by Apache Arrow. MIT
  • GitHub (👨‍💻 23 · 🔀 32 · 📥 12 · 📦 3 · 📋 72 - 45% open · ⏱️ 29.12.2020):

     git clone https://github.com/xhochy/fletcher
    
  • PyPi (📥 270 / month · ⏱️ 07.12.2020):

     pip install fletcher
    
  • Conda (📥 18K · ⏱️ 29.12.2020):

     conda install -c conda-forge fletcher
    
StaticFrame (🥉20 · ⭐ 210) - The StaticFrame library defines the Series and Frame, immutable data.. MIT
  • GitHub (👨‍💻 14 · 🔀 20 · 📦 5 · 📋 270 - 12% open · ⏱️ 16.01.2021):

     git clone https://github.com/InvestmentSystems/static-frame
    
  • PyPi (📥 1.1K / month · ⏱️ 12.01.2021):

     pip install static-frame
    
  • Conda (📥 60K · ⏱️ 12.01.2021):

     conda install -c conda-forge static-frame
    
Bounter (🥉18 · ⭐ 890) - Efficient Counter that uses a limited (bounded) amount of memory.. MIT
  • GitHub (👨‍💻 8 · 🔀 48 · 📦 20 · 📋 20 - 55% open · ⏱️ 16.08.2020):

     git clone https://github.com/RaRe-Technologies/bounter
    
  • PyPi (📥 270 / month · 📦 14 · ⏱️ 17.08.2020):

     pip install bounter
    
PandaPy (🥉15 · ⭐ 470) - PandaPy has the speed of NumPy and the usability of Pandas 10x to 50x.. MIT
  • GitHub (👨‍💻 3 · 🔀 44 · 📦 1 · 📋 3 - 66% open · ⏱️ 12.11.2020):

     git clone https://github.com/firmai/pandapy
    
  • PyPi (📥 56 / month · ⏱️ 25.01.2020):

     pip install pandapy
    
Show 6 hidden projects...
  • Blaze (🥈28 · ⭐ 2.9K · 💀) - NumPy and Pandas interface to Big Data. BSD-3
  • sklearn-pandas (🥈28 · ⭐ 2.3K) - Pandas integration with sklearn. ❗️Zlib
  • Arctic (🥉24 · ⭐ 2.1K) - Arctic is a high performance datastore for numeric data. ❗️LGPL-2.1
  • pandasql (🥉22 · ⭐ 930 · 💀) - sqldf for pandas. MIT
  • pickleDB (🥉21 · ⭐ 530 · 💀) - pickleDB is an open source key-value store using Python's json.. BSD-3
  • Pandas Summary (🥉21 · ⭐ 360 · 💀) - An extension to pandas dataframes describe function. MIT

Data Loading & Extraction

Back to top

Libraries for loading, collecting, and extracting data from a variety of data sources and formats.

Faker (🥇36 · ⭐ 12K) - Faker is a Python package that generates fake data for you. MIT
  • GitHub (👨‍💻 380 · 🔀 1.4K · 📦 22K · 📋 490 - 28% open · ⏱️ 15.01.2021):

     git clone https://github.com/joke2k/faker
    
  • PyPi (📥 1.8M / month · 📦 4.7K · ⏱️ 15.01.2021):

     pip install Faker
    
  • Conda (📥 360K · ⏱️ 15.01.2021):

     conda install -c conda-forge faker
    
xlrd (🥇33 · ⭐ 1.8K) - Please use openpyxl where you can... BSD-3
  • GitHub (👨‍💻 50 · 🔀 410 · 📦 62K · ⏱️ 12.12.2020):

     git clone https://github.com/python-excel/xlrd
    
  • PyPi (📥 5M / month · 📦 14K · ⏱️ 11.12.2020):

     pip install xlrd
    
  • Conda (📥 1.1M · ⏱️ 09.01.2021):

     conda install -c conda-forge xlrd
    
xmltodict (🥇32 · ⭐ 4.2K · 💤) - Python module that makes working with XML feel like you are.. MIT
  • GitHub (👨‍💻 41 · 🔀 400 · 📦 20K · 📋 200 - 31% open · ⏱️ 26.04.2020):

     git clone https://github.com/martinblech/xmltodict
    
  • PyPi (📥 3.8M / month · 📦 8.2K · ⏱️ 11.02.2019):

     pip install xmltodict
    
  • Conda (📥 600K · ⏱️ 11.02.2019):

     conda install -c conda-forge xmltodict
    
Tablib (🥇32 · ⭐ 3.8K) - Python Module for Tabular Datasets in XLS, CSV, JSON, YAML, &c. MIT
  • GitHub (👨‍💻 110 · 🔀 560 · 📦 9.1K · 📋 230 - 14% open · ⏱️ 11.12.2020):

     git clone https://github.com/jazzband/tablib
    
  • PyPi (📥 520K / month · 📦 2.4K · ⏱️ 05.12.2020):

     pip install tablib
    
  • Conda (📥 56K · ⏱️ 05.12.2020):

     conda install -c conda-forge tablib
    
TensorFlow Datasets (🥇32 · ⭐ 2.6K) - TFDS is a collection of datasets ready to use with.. Apache-2
  • GitHub (👨‍💻 180 · 🔀 940 · 📦 3.4K · 📋 920 - 43% open · ⏱️ 14.01.2021):

     git clone https://github.com/tensorflow/datasets
    
  • PyPi (📥 520K / month · 📦 100 · ⏱️ 07.01.2021):

     pip install tensorflow-datasets
    
python-magic (🥈31 · ⭐ 1.8K · 📈) - A python wrapper for libmagic. MIT
  • GitHub (👨‍💻 49 · 🔀 220 · 📦 11K · 📋 150 - 17% open · ⏱️ 16.01.2021):

     git clone https://github.com/ahupp/python-magic
    
  • PyPi (📥 1.3M / month · 📦 5.1K · ⏱️ 16.01.2021):

     pip install python-magic
    
  • Conda (📥 74K · ⏱️ 24.12.2020):

     conda install -c conda-forge python-magic
    
smart-open (🥈30 · ⭐ 1.9K) - Utils for streaming large files (S3, HDFS, gzip, bz2...). MIT
  • GitHub (👨‍💻 71 · 🔀 270 · 📋 300 - 22% open · ⏱️ 17.01.2021):

     git clone https://github.com/RaRe-Technologies/smart_open
    
  • PyPi (📥 8.3M / month · 📦 640 · ⏱️ 30.12.2020):

     pip install smart-open
    
Datasets (🥈29 · ⭐ 6.5K) - The largest hub of ready-to-use NLP datasets for ML models with.. Apache-2
  • GitHub (👨‍💻 200 · 🔀 670 · 📦 260 · 📋 500 - 37% open · ⏱️ 15.01.2021):

     git clone https://github.com/huggingface/datasets
    
  • PyPi (📥 22K / month · 📦 18 · ⏱️ 04.01.2021):

     pip install datasets
    
pandas-datareader (🥈29 · ⭐ 1.8K) - Extract data from a wide range of Internet sources.. BSD-3
  • GitHub (👨‍💻 77 · 🔀 490 · 📦 7.5K · 📋 440 - 15% open · ⏱️ 31.12.2020):

     git clone https://github.com/pydata/pandas-datareader
    
  • PyPi (📥 120K / month · 📦 1.4K · ⏱️ 10.07.2020):

     pip install pandas-datareader
    
  • Conda (📥 86K · ⏱️ 20.11.2019):

     conda install -c conda-forge pandas-datareader
    
csvkit (🥉27 · ⭐ 4.4K) - A suite of utilities for converting to and working with CSV, the king of.. MIT
  • GitHub (👨‍💻 91 · 🔀 540 · 📦 800 · 📋 800 - 7% open · ⏱️ 30.10.2020):

     git clone https://github.com/wireservice/csvkit
    
  • PyPi (📥 25K / month · 📦 700 · ⏱️ 03.03.2020):

     pip install csvkit
    
  • Conda (📥 45K · ⏱️ 28.05.2019):

     conda install -c conda-forge csvkit
    
snorkel (🥉27 · ⭐ 4.4K) - A system for quickly generating training data with weak supervision. Apache-2
  • GitHub (👨‍💻 62 · 🔀 710 · 📥 490 · 📦 64 · 📋 940 - 3% open · ⏱️ 05.09.2020):

     git clone https://github.com/snorkel-team/snorkel
    
  • PyPi (📥 64K / month · 📦 4 · ⏱️ 07.04.2020):

     pip install snorkel
    
  • Conda (📥 15K · ⏱️ 10.04.2020):

     conda install -c conda-forge snorkel
    
PDFMiner (🥉26 · ⭐ 4.5K · 💤) - Python PDF Parser (Not actively maintained). Check out pdfminer.six. MIT
  • GitHub (👨‍💻 28 · 🔀 1K · 📦 2K · 📋 260 - 85% open · ⏱️ 18.01.2020):

     git clone https://github.com/euske/pdfminer
    
  • PyPi (📥 170K / month · 📦 1.5K · ⏱️ 25.11.2019):

     pip install pdfminer
    
  • Conda (📥 13K · ⏱️ 03.11.2019):

     conda install -c conda-forge pdfminer
    
tabulator-py (🥉26 · ⭐ 200) - Python library for reading and writing tabular data via streams. MIT
  • GitHub (👨‍💻 24 · 🔀 40 · 📦 460 · ⏱️ 30.11.2020):

     git clone https://github.com/frictionlessdata/tabulator-py
    
  • PyPi (📥 88K / month · 📦 100 · ⏱️ 30.11.2020):

     pip install tabulator
    
  • Conda (📥 37K · ⏱️ 24.07.2018):

     conda install -c conda-forge tabulator-py
    
Intake (🥉24 · ⭐ 520) - Intake is a lightweight package for finding, investigating, loading and.. BSD-2
  • GitHub (👨‍💻 51 · 🔀 91 · 📦 220 · 📋 260 - 28% open · ⏱️ 15.01.2021):

     git clone https://github.com/intake/intake
    
  • PyPi (📥 2.4K / month · 📦 74 · ⏱️ 03.06.2020):

     pip install intake
    
  • Conda (📥 55K · ⏱️ 03.06.2020):

     conda install -c conda-forge intake
    
SDV (🥉21 · ⭐ 310) - Synthetic Data Generation for tabular, relational and time series data. MIT
  • GitHub (👨‍💻 33 · 🔀 54 · 📦 13 · 📋 170 - 24% open · ⏱️ 14.01.2021):

     git clone https://github.com/sdv-dev/SDV
    
  • PyPi (📥 1K / month · ⏱️ 31.12.2020):

     pip install sdv
    
datatest (🥉21 · ⭐ 240) - Tools for test driven data-wrangling and data validation. Apache-2
  • GitHub (👨‍💻 6 · 🔀 13 · 📦 28 · 📋 51 - 17% open · ⏱️ 17.01.2021):

     git clone https://github.com/shawnbrown/datatest
    
  • PyPi (📥 2.2K / month · 📦 2 · ⏱️ 04.01.2021):

     pip install datatest
    
Show 6 hidden projects...
  • textract (🥉26 · ⭐ 2.9K · 💀) - extract text from any document. no muss. no fuss. MIT
  • Camelot (🥉23 · ⭐ 3K · 💀) - Camelot: PDF Table Extraction for Humans. MIT
  • Singer (🥉23 · ⭐ 670 · 📉) - Standard for moving data between databases, web APIs, files,.. ❗️AGPL-3.0
  • messytables (🥉23 · ⭐ 360 · 💀) - Tools for parsing messy tabular data. This is now superseded by.. MIT
  • pyexcel-xlsx (🥉23 · ⭐ 83) - A wrapper library to read, manipulate and write data in xlsx and.. BSD-3
  • rows (🥉20 · ⭐ 730) - A common, beautiful interface to tabular data, no matter the format. ❗️LGPL-3.0

Web Scraping & Crawling

Back to top

Libraries for web scraping, crawling, downloading, and mining as well as libraries.

🔗 best-of-web-python - Web Scraping ( ⭐ 400) - Collection of web-scraping and crawling libraries.


Data Pipelines & Streaming

Back to top

Libraries for data batch- and stream-processing, workflow automation, job scheduling, and other data pipeline tasks.

Celery (🥇37 · ⭐ 17K) - Asynchronous task queue/job queue based on distributed message passing. BSD-3
  • GitHub (👨‍💻 1.1K · 🔀 3.9K · 📦 48K · 📋 4.4K - 11% open · ⏱️ 15.01.2021):

     git clone https://github.com/celery/celery
    
  • PyPi (📥 1.9M / month · 📦 28K · ⏱️ 16.12.2020):

     pip install celery
    
  • Conda (📥 360K · ⏱️ 11.11.2020):

     conda install -c conda-forge celery
    
Airflow (🥇35 · ⭐ 20K) - Platform to programmatically author, schedule, and monitor workflows. Apache-2
  • GitHub (👨‍💻 1.7K · 🔀 7.8K · 📥 77K · 📋 2.8K - 34% open · ⏱️ 17.01.2021):

     git clone https://github.com/apache/airflow
    
  • PyPi (📥 520K / month · 📦 290 · ⏱️ 14.12.2020):

     pip install apache-airflow
    
  • Conda (📥 260K · ⏱️ 26.11.2020):

     conda install -c conda-forge airflow
    
  • Docker Hub (📥 4.4M · ⭐ 200 · ⏱️ 16.01.2021):

     docker pull apache/airflow
    
joblib (🥇35 · ⭐ 2.3K) - Computing with Python functions. BSD-3
  • GitHub (👨‍💻 98 · 🔀 290 · 📦 77K · 📋 640 - 45% open · ⏱️ 28.12.2020):

     git clone https://github.com/joblib/joblib
    
  • PyPi (📥 16M / month · 📦 6.6K · ⏱️ 14.12.2020):

     pip install joblib
    
  • Conda (📥 2.8M · ⏱️ 14.12.2020):

     conda install -c conda-forge joblib
    
luigi (🥇33 · ⭐ 14K) - Luigi is a Python module that helps you build complex pipelines of batch.. Apache-2
  • GitHub (👨‍💻 550 · 🔀 2.2K · 📦 1.3K · 📋 900 - 7% open · ⏱️ 14.01.2021):

     git clone https://github.com/spotify/luigi
    
  • PyPi (📥 350K / month · 📦 680 · ⏱️ 23.09.2020):

     pip install luigi
    
  • Conda (📥 6.3K · ⏱️ 21.07.2020):

     conda install -c anaconda luigi
    
rq (🥇33 · ⭐ 7.5K) - Simple job queues for Python. BSD-3
  • GitHub (👨‍💻 220 · 🔀 1.2K · 📦 7.1K · 📋 810 - 17% open · ⏱️ 14.01.2021):

     git clone https://github.com/rq/rq
    
  • PyPi (📥 220K / month · 📦 3.3K · ⏱️ 29.11.2020):

     pip install rq
    
  • Conda (📥 43K · ⏱️ 29.11.2020):

     conda install -c conda-forge rq
    
Beam (🥈32 · ⭐ 4.5K) - Unified programming model to define and execute data processing.. Apache-2
  • GitHub (👨‍💻 1K · 🔀 2.8K · ⏱️ 16.01.2021):

     git clone https://github.com/apache/beam
    
  • PyPi (📥 1.4M / month · 📦 190 · ⏱️ 08.01.2021):

     pip install apache-beam
    
Prefect (🥈30 · ⭐ 5.7K) - The easiest way to automate your data. Apache-2
  • GitHub (👨‍💻 170 · 🔀 500 · 📦 240 · 📋 1.5K - 14% open · ⏱️ 16.01.2021):

     git clone https://github.com/PrefectHQ/prefect
    
  • PyPi (📥 44K / month · 📦 2 · ⏱️ 13.01.2021):

     pip install prefect
    
  • Conda (📥 65K · ⏱️ 13.01.2021):

     conda install -c conda-forge prefect
    
mrjob (🥈30 · ⭐ 2.5K) - Run MapReduce jobs on Hadoop or Amazon Web Services. Apache-2
  • GitHub (👨‍💻 140 · 🔀 590 · 📦 650 · 📋 1.3K - 15% open · ⏱️ 16.11.2020):

     git clone https://github.com/Yelp/mrjob
    
  • PyPi (📥 100K / month · 📦 110 · ⏱️ 17.09.2020):

     pip install mrjob
    
  • Conda (📥 310K · ⏱️ 24.12.2020):

     conda install -c conda-forge mrjob
    
Kedro (🥈28 · ⭐ 3.3K) - A Python framework for creating reproducible, maintainable and modular.. Apache-2
  • GitHub (👨‍💻 93 · 🔀 380 · 📦 360 · 📋 430 - 20% open · ⏱️ 14.01.2021):

     git clone https://github.com/quantumblacklabs/kedro
    
  • PyPi (📥 48K / month · 📦 16 · ⏱️ 17.12.2020):

     pip install kedro
    
dbt (🥈28 · ⭐ 2.4K) - dbt (data build tool) enables data analysts and engineers to transform.. Apache-2
  • GitHub (👨‍💻 130 · 🔀 470 · 📦 170 · 📋 1.7K - 15% open · ⏱️ 14.01.2021):

     git clone https://github.com/fishtown-analytics/dbt
    
  • PyPi (📥 94K / month · 📦 18 · ⏱️ 14.01.2021):

     pip install dbt
    
  • Conda (📥 150K · ⏱️ 14.05.2020):

     conda install -c conda-forge dbt
    
faust (🥈27 · ⭐ 5.2K) - Python Stream Processing. BSD-3
  • GitHub (👨‍💻 91 · 🔀 420 · 📦 590 · 📋 440 - 50% open · ⏱️ 09.10.2020):

     git clone https://github.com/robinhood/faust
    
  • PyPi (📥 51K / month · 📦 28 · ⏱️ 25.02.2020):

     pip install faust
    
Dagster (🥈27 · ⭐ 2.8K · 📈) - A data orchestrator for machine learning, analytics, and ETL. Apache-2
  • GitHub (👨‍💻 110 · 🔀 270 · 📦 130 · 📋 2.4K - 24% open · ⏱️ 15.01.2021):

     git clone https://github.com/dagster-io/dagster
    
  • PyPi (📥 24K / month · 📦 4 · ⏱️ 15.01.2021):

     pip install dagster
    
  • Conda (📥 120K · ⏱️ 04.12.2020):

     conda install -c conda-forge dagster
    
petl (🥈27 · ⭐ 810) - Python Extract Transform and Load Tables of Data. MIT
  • GitHub (👨‍💻 44 · 🔀 150 · 📦 310 · 📋 410 - 16% open · ⏱️ 14.01.2021):

     git clone https://github.com/petl-developers/petl
    
  • PyPi (📥 12K / month · 📦 110 · ⏱️ 06.10.2020):

     pip install petl
    
  • Conda (📥 19K · ⏱️ 29.12.2020):

     conda install -c conda-forge petl
    
PyFunctional (🥈26 · ⭐ 1.8K) - Python library for creating data pipelines with chain functional.. MIT
  • GitHub (👨‍💻 23 · 🔀 99 · 📦 250 · 📋 120 - 1% open · ⏱️ 12.01.2021):

     git clone https://github.com/EntilZha/PyFunctional
    
  • PyPi (📥 29K / month · ⏱️ 12.01.2021):

     pip install pyfunctional
    
TFX (🥈26 · ⭐ 1.3K) - TFX is an end-to-end platform for deploying production ML pipelines. Apache-2
  • GitHub (👨‍💻 97 · 🔀 370 · 📋 500 - 39% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/tfx
    
  • PyPi (📥 110K / month · 📦 4 · ⏱️ 06.01.2021):

     pip install tfx
    
streamparse (🥉25 · ⭐ 1.4K) - Run Python in Apache Storm topologies. Pythonic API, CLI.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 220 · 📦 47 · 📋 330 - 20% open · ⏱️ 18.12.2020):

     git clone https://github.com/Parsely/streamparse
    
  • PyPi (📥 1.5K / month · 📦 26 · ⏱️ 07.10.2020):

     pip install streamparse
    
Great Expectations (🥉23 · ⭐ 3.4K) - Always know what to expect from your data. Apache-2
  • GitHub (👨‍💻 140 · 🔀 380 · 📋 750 - 21% open · ⏱️ 16.01.2021):

     git clone https://github.com/great-expectations/great_expectations
    
  • PyPi (📥 340K / month · 📦 6 · ⏱️ 23.12.2020):

     pip install great_expectations
    
bonobo (🥉23 · ⭐ 1.4K) - Extract Transform Load for Python 3.5+. Apache-2
  • GitHub (👨‍💻 36 · 🔀 110 · 📦 100 · 📋 170 - 36% open · ⏱️ 01.01.2021):

     git clone https://github.com/python-bonobo/bonobo
    
  • PyPi (📥 2.1K / month · 📦 24 · ⏱️ 20.07.2019):

     pip install bonobo
    
Optimus (🥉23 · ⭐ 970) - Agile Data Preparation Workflows madeeasy with dask, cudf,.. Apache-2
  • GitHub (👨‍💻 21 · 🔀 190 · 📦 11 · 📋 300 - 45% open · ⏱️ 15.01.2021):

     git clone https://github.com/ironmussa/Optimus
    
  • PyPi (📥 5.5K / month · ⏱️ 30.05.2019):

     pip install optimuspyspark
    
pysparkling (🥉23 · ⭐ 230) - A pure Python implementation of Apache Spark's RDD and DStream.. MIT
  • GitHub (👨‍💻 9 · 🔀 43 · 📦 56 · 📋 21 - 4% open · ⏱️ 10.01.2021):

     git clone https://github.com/svenkreiss/pysparkling
    
  • PyPi (📥 4.4K / month · 📦 14 · ⏱️ 10.01.2021):

     pip install pysparkling
    
Pypeline (🥉21 · ⭐ 1.2K) - Concurrent data pipelines in Python . MIT
  • GitHub (👨‍💻 9 · 🔀 66 · 📋 47 - 25% open · ⏱️ 05.01.2021):

     git clone https://github.com/cgarciae/pypeln
    
  • PyPi (📥 2K / month · 📦 5 · ⏱️ 05.01.2021):

     pip install pypeln
    
dpark (🥉20 · ⭐ 2.6K) - Python clone of Spark, a MapReduce alike framework in Python. BSD-3
  • GitHub (👨‍💻 35 · 🔀 550 · 📦 3 · ⏱️ 25.12.2020):

     git clone https://github.com/douban/dpark
    
  • PyPi (📥 11 / month · 📦 1 · ⏱️ 27.07.2018):

     pip install dpark
    
mrq (🥉20 · ⭐ 830) - Mr. Queue - A distributed worker task queue in Python using Redis & gevent. MIT
  • GitHub (👨‍💻 37 · 🔀 110 · 📦 23 · 📋 180 - 32% open · ⏱️ 13.12.2020):

     git clone https://github.com/pricingassistant/mrq
    
  • PyPi (📥 240 / month · 📦 12 · ⏱️ 31.08.2018):

     pip install mrq
    
pdpipe (🥉20 · ⭐ 580) - Easy pipelines for pandas DataFrames. MIT
  • GitHub (👨‍💻 8 · 🔀 29 · 📦 23 · 📋 20 - 30% open · ⏱️ 17.11.2020):

     git clone https://github.com/pdpipe/pdpipe
    
  • PyPi (📥 800 / month · ⏱️ 09.11.2020):

     pip install pdpipe
    
TaskTiger (🥉19 · ⭐ 1K) - Python task queue using Redis. MIT
  • GitHub (👨‍💻 19 · 🔀 58 · 📦 19 · 📋 61 - 49% open · ⏱️ 17.12.2020):

     git clone https://github.com/closeio/tasktiger
    
  • PyPi (📥 330 / month · 📦 9 · ⏱️ 18.06.2020):

     pip install tasktiger
    
Databolt Flow (🥉19 · ⭐ 890) - Python library for building highly effective data science workflows. MIT
  • GitHub (👨‍💻 10 · 🔀 64 · 📦 12 · 📋 16 - 31% open · ⏱️ 19.12.2020):

     git clone https://github.com/d6t/d6tflow
    
  • PyPi (📥 160 / month · ⏱️ 19.12.2020):

     pip install d6tflow
    
spark-deep-learning (🥉18 · ⭐ 1.8K) - Deep Learning Pipelines for Apache Spark. Apache-2
  • GitHub (👨‍💻 15 · 🔀 440 · 📦 15 · 📋 99 - 73% open · ⏱️ 16.01.2021):

     git clone https://github.com/databricks/spark-deep-learning
    
riko (🥉18 · ⭐ 1.5K) - A Python stream processing engine modeled after Yahoo! Pipes. MIT
  • GitHub (👨‍💻 18 · 🔀 70 · 📋 30 - 73% open · ⏱️ 14.08.2020):

     git clone https://github.com/nerevu/riko
    
  • PyPi (📥 130 / month · 📦 1 · ⏱️ 14.08.2020):

     pip install riko
    
flupy (🥉18 · ⭐ 150) - Fluent data pipelines for python and your shell. MIT
  • GitHub (👨‍💻 4 · 🔀 9 · ⏱️ 06.01.2021):

     git clone https://github.com/olirice/flupy
    
  • PyPi (📥 1.8K / month · ⏱️ 06.01.2021):

     pip install flupy
    
Mara Pipelines (🥉17 · ⭐ 1.6K) - A lightweight opinionated ETL framework, halfway between plain.. MIT
  • GitHub (👨‍💻 15 · 🔀 76 · 📦 7 · 📋 17 - 35% open · ⏱️ 31.07.2020):

     git clone https://github.com/mara/mara-pipelines
    
  • PyPi (📥 22 / month · ⏱️ 31.07.2020):

     pip install mara-pipelines
    
BatchFlow (🥉17 · ⭐ 150) - BatchFlow helps you conveniently work with random or sequential.. Apache-2
  • GitHub (👨‍💻 28 · 🔀 33 · 📋 100 - 49% open · ⏱️ 08.12.2020):

     git clone https://github.com/analysiscenter/batchflow
    
  • PyPi (📥 12 / month · ⏱️ 23.08.2019):

     pip install batchflow
    
zenml (🥉15 · ⭐ 380 · 🐣) - ZenML: Bring Zen to your ML with reproducible pipelines. Apache-2
  • GitHub (👨‍💻 7 · 🔀 23 · 📦 1 · 📋 4 - 25% open · ⏱️ 16.01.2021):

     git clone https://github.com/maiot-io/zenml
    
  • PyPi (📥 230 / month · ⏱️ 08.01.2021):

     pip install zenml
    
Show 2 hidden projects...
  • ploomber (🥉18 · ⭐ 110) - A convention over configuration workflow orchestrator. Develop.. Apache-2
  • Botflow (🥉15 · ⭐ 1.2K · 💀) - Python Fast Dataflow programming framework for Data pipeline work(.. BSD-3

Distributed Machine Learning

Back to top

Libraries that provide capabilities to distribute and parallelize machine learning tasks across large-scale compute infrastructure.

dask (🥇35 · ⭐ 7.8K) - Parallel computing with task scheduling. BSD-3
  • GitHub (👨‍💻 420 · 🔀 1.2K · 📦 25K · 📋 3.5K - 20% open · ⏱️ 15.01.2021):

     git clone https://github.com/dask/dask
    
  • PyPi (📥 1.2M / month · 📦 3.9K · ⏱️ 15.01.2021):

     pip install dask
    
  • Conda (📥 2.8M · ⏱️ 16.01.2021):

     conda install -c conda-forge dask
    
dask.distributed (🥇34 · ⭐ 1.1K) - A distributed task scheduler for Dask. BSD-3
  • GitHub (👨‍💻 230 · 🔀 500 · 📦 16K · 📋 2K - 36% open · ⏱️ 15.01.2021):

     git clone https://github.com/dask/distributed
    
  • PyPi (📥 740K / month · 📦 1.8K · ⏱️ 15.01.2021):

     pip install distributed
    
  • Conda (📥 3.5M · ⏱️ 16.01.2021):

     conda install -c conda-forge distributed
    
Ray (🥇33 · ⭐ 14K) - An open source framework that provides a simple, universal API for.. Apache-2
  • GitHub (👨‍💻 420 · 🔀 2.3K · 📦 1.5K · 📋 5.7K - 21% open · ⏱️ 17.01.2021):

     git clone https://github.com/ray-project/ray
    
  • PyPi (📥 150K / month · 📦 160 · ⏱️ 24.12.2020):

     pip install ray
    
horovod (🥈30 · ⭐ 11K) - Distributed training framework for TensorFlow, Keras, PyTorch, and.. Apache-2
  • GitHub (👨‍💻 110 · 🔀 1.7K · 📦 310 · 📋 1.7K - 16% open · ⏱️ 06.01.2021):

     git clone https://github.com/horovod/horovod
    
  • PyPi (📥 110K / month · 📦 40 · ⏱️ 07.01.2021):

     pip install horovod
    
ipyparallel (🥈29 · ⭐ 1.9K) - Interactive Parallel Computing in Python. BSD-3
  • GitHub (👨‍💻 94 · 🔀 730 · 📦 1.4K · 📋 250 - 56% open · ⏱️ 24.08.2020):

     git clone https://github.com/ipython/ipyparallel
    
  • PyPi (📥 68K / month · 📦 490 · ⏱️ 05.05.2020):

     pip install ipyparallel
    
  • Conda (📥 360K · ⏱️ 14.12.2020):

     conda install -c conda-forge ipyparallel
    
dask-ml (🥈26 · ⭐ 680) - Scalable Machine Learning with Dask. BSD-3
  • GitHub (👨‍💻 61 · 🔀 180 · 📦 330 · 📋 370 - 46% open · ⏱️ 05.01.2021):

     git clone https://github.com/dask/dask-ml
    
  • PyPi (📥 30K / month · 📦 42 · ⏱️ 24.09.2020):

     pip install dask-ml
    
  • Conda (📥 190K · ⏱️ 24.09.2020):

     conda install -c conda-forge dask-ml
    
Mesh (🥈26 · ⭐ 670) - Mesh TensorFlow: Model Parallelism Made Easier. Apache-2
  • GitHub (👨‍💻 38 · 🔀 120 · 📦 480 · 📋 84 - 85% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/mesh
    
  • PyPi (📥 91K / month · 📦 28 · ⏱️ 17.12.2020):

     pip install mesh-tensorflow
    
BigDL (🥈25 · ⭐ 3.7K) - BigDL: Distributed Deep Learning Framework for Apache Spark. Apache-2
  • GitHub (👨‍💻 71 · 🔀 900 · 📦 21 · 📋 900 - 19% open · ⏱️ 23.12.2020):

     git clone https://github.com/intel-analytics/BigDL
    
  • PyPi (📥 1.2K / month · 📦 6 · ⏱️ 29.12.2020):

     pip install bigdl
    
  • Maven (⏱️ 05.12.2020):

     <dependency>
     	<groupId>com.intel.analytics.bigdl</groupId>
     	<artifactId>bigdl-SPARK_2.4</artifactId>
     	<version>[VERSION]</version>
     </dependency>
    
TensorFlowOnSpark (🥈25 · ⭐ 3.6K) - TensorFlowOnSpark brings TensorFlow programs to.. Apache-2
  • GitHub (👨‍💻 33 · 🔀 930 · 📋 340 - 1% open · ⏱️ 21.12.2020):

     git clone https://github.com/yahoo/TensorFlowOnSpark
    
  • PyPi (📥 62K / month · 📦 5 · ⏱️ 18.12.2020):

     pip install tensorflowonspark
    
mpi4py (🥈25 · ⭐ 370) - Python bindings for MPI. BSD-3
  • GitHub (👨‍💻 13 · 🔀 61 · 📥 660 · 📋 4 - 25% open · ⏱️ 17.01.2021):

     git clone https://github.com/mpi4py/mpi4py
    
  • PyPi (📥 140K / month · 📦 700 · ⏱️ 04.11.2019):

     pip install mpi4py
    
  • Conda (📥 500K · ⏱️ 08.01.2021):

     conda install -c conda-forge mpi4py
    
Elephas (🥉24 · ⭐ 1.4K) - Distributed Deep learning with Keras & Spark. MIT keras
  • GitHub (👨‍💻 26 · 🔀 290 · 📦 36 · 📋 140 - 28% open · ⏱️ 15.01.2021):

     git clone https://github.com/maxpumperla/elephas
    
  • PyPi (📥 7.5K / month · 📦 3 · ⏱️ 14.01.2021):

     pip install elephas
    
petastorm (🥉24 · ⭐ 1K) - Petastorm library enables single machine or distributed training.. Apache-2
  • GitHub (👨‍💻 37 · 🔀 180 · 📥 290 · 📦 24 · 📋 230 - 52% open · ⏱️ 15.12.2020):

     git clone https://github.com/uber/petastorm
    
  • PyPi (📥 25K / month · ⏱️ 15.12.2020):

     pip install petastorm
    
DeepSpeed (🥉23 · ⭐ 3.9K) - DeepSpeed is a deep learning optimization library that makes.. MIT
  • GitHub (👨‍💻 38 · 🔀 360 · 📦 8 · 📋 270 - 43% open · ⏱️ 15.01.2021):

     git clone https://github.com/microsoft/DeepSpeed
    
  • PyPi (📥 1.4K / month · ⏱️ 08.01.2021):

     pip install deepspeed
    
  • Docker Hub (📥 6.7K · ⭐ 2 · ⏱️ 20.11.2020):

     docker pull deepspeed/deepspeed
    
MMLSpark (🥉23 · ⭐ 2.2K) - Microsoft Machine Learning for Apache Spark. MIT
  • GitHub (👨‍💻 61 · 🔀 470 · 📋 380 - 44% open · ⏱️ 09.01.2021):

     git clone https://github.com/Azure/mmlspark
    
  • PyPi (📥 26K / month · ⏱️ 18.03.2020):

     pip install mmlspark
    
analytics-zoo (🥉22 · ⭐ 2.2K) - Distributed Tensorflow, Keras and PyTorch on Apache.. Apache-2
  • GitHub (👨‍💻 81 · 🔀 630 · 📦 2 · 📋 920 - 47% open · ⏱️ 17.01.2021):

     git clone https://github.com/intel-analytics/analytics-zoo
    
  • PyPi (📥 630 / month · ⏱️ 10.01.2021):

     pip install analytics-zoo
    
Submit it (🥉20 · ⭐ 290) - Python 3.6+ toolbox for submitting jobs to Slurm. MIT
  • GitHub (👨‍💻 9 · 🔀 17 · 📦 75 · 📋 23 - 39% open · ⏱️ 15.01.2021):

     git clone https://github.com/facebookincubator/submitit
    
  • PyPi (📥 2.1K / month · ⏱️ 01.12.2020):

     pip install submitit
    
  • Conda (📥 790 · ⏱️ 19.11.2020):

     conda install -c conda-forge submitit
    
BytePS (🥉19 · ⭐ 2.7K) - A high performance and generic framework for distributed DNN training. Apache-2
  • GitHub (👨‍💻 16 · 🔀 360 · 📋 210 - 33% open · ⏱️ 10.01.2021):

     git clone https://github.com/bytedance/byteps
    
  • PyPi (📥 52 / month · ⏱️ 04.11.2020):

     pip install byteps
    
  • Docker Hub (📥 960 · ⏱️ 03.03.2020):

     docker pull bytepsimage/tensorflow
    
Apache Singa (🥉19 · ⭐ 2.2K) - a distributed deep learning platform. Apache-2
  • GitHub (👨‍💻 70 · 🔀 570 · 📋 81 - 60% open · ⏱️ 15.01.2021):

     git clone https://github.com/apache/singa
    
  • Conda (📥 250 · ⏱️ 15.01.2021):

     conda install -c nusdbsystem singa
    
  • Docker Hub (📥 160 · ⭐ 2 · ⏱️ 04.06.2019):

     docker pull apache/singa
    
FairScale (🥉19 · ⭐ 640 · 🐣) - PyTorch extensions for high performance and large scale.. BSD-3
  • GitHub (👨‍💻 18 · 🔀 43 · 📦 2 · 📋 86 - 34% open · ⏱️ 16.01.2021):

     git clone https://github.com/facebookresearch/fairscale
    
  • PyPi (📥 1.2K / month · ⏱️ 07.01.2021):

     pip install fairscale
    
sk-dist (🥉18 · ⭐ 250) - Distributed scikit-learn meta-estimators in PySpark. Apache-2
  • GitHub (👨‍💻 6 · 🔀 42 · 📦 6 · 📋 16 - 43% open · ⏱️ 07.01.2021):

     git clone https://github.com/Ibotta/sk-dist
    
  • PyPi (📥 3.4K / month · 📦 2 · ⏱️ 14.05.2020):

     pip install sk-dist
    
Fiber (🥉17 · ⭐ 840) - Distributed Computing for AI Made Simple. Apache-2
  • GitHub (👨‍💻 5 · 🔀 89 · 📦 17 · 📋 18 - 72% open · ⏱️ 06.10.2020):

     git clone https://github.com/uber/fiber
    
  • PyPi (📥 960 / month · 📦 1 · ⏱️ 09.07.2020):

     pip install fiber
    
somoclu (🥉17 · ⭐ 220) - Massively parallel self-organizing maps: accelerate training on multicore.. MIT
  • GitHub (👨‍💻 17 · 🔀 55 · 📥 1.4K · 📋 130 - 19% open · ⏱️ 24.07.2020):

     git clone https://github.com/peterwittek/somoclu
    
  • PyPi (📥 1.2K / month · 📦 2 · ⏱️ 25.04.2020):

     pip install somoclu
    
  • Conda (📥 39K · ⏱️ 13.10.2020):

     conda install -c conda-forge somoclu
    
Hivemind (🥉16 · ⭐ 640) - Decentralized deep learning in PyTorch. Built to train models on.. MIT
  • GitHub (👨‍💻 8 · 🔀 29 · 📦 1 · 📋 32 - 46% open · ⏱️ 16.01.2021):

     git clone https://github.com/learning-at-home/hivemind
    
  • PyPi (📥 12 / month · 📦 1 · ⏱️ 28.08.2020):

     pip install hivemind
    
Show 3 hidden projects...

Hyperparameter Optimization & AutoML

Back to top

Libraries for hyperparameter optimization, automl and neural architecture search.

Optuna (🥇31 · ⭐ 3.9K) - A hyperparameter optimization framework. MIT
  • GitHub (👨‍💻 120 · 🔀 440 · 📦 860 · 📋 660 - 29% open · ⏱️ 15.01.2021):

     git clone https://github.com/optuna/optuna
    
  • PyPi (📥 150K / month · 📦 52 · ⏱️ 04.11.2020):

     pip install optuna
    
  • Conda (📥 17K · ⏱️ 13.01.2021):

     conda install -c conda-forge optuna
    
scikit-optimize (🥇31 · ⭐ 2K) - Sequential model-based optimization with a `scipy.optimize`.. BSD-3
  • GitHub (👨‍💻 68 · 🔀 370 · 📦 1.3K · 📋 520 - 31% open · ⏱️ 31.12.2020):

     git clone https://github.com/scikit-optimize/scikit-optimize
    
  • PyPi (📥 460K / month · 📦 160 · ⏱️ 04.09.2020):

     pip install scikit-optimize
    
  • Conda (📥 180K · ⏱️ 04.09.2020):

     conda install -c conda-forge scikit-optimize
    
Hyperopt (🥇30 · ⭐ 5.3K) - Distributed Asynchronous Hyperparameter Optimization in Python. BSD-3
  • GitHub (👨‍💻 84 · 🔀 840 · 📦 2.5K · 📋 550 - 60% open · ⏱️ 24.12.2020):

     git clone https://github.com/hyperopt/hyperopt
    
  • PyPi (📥 380K / month · 📦 500 · ⏱️ 07.10.2020):

     pip install hyperopt
    
  • Conda (📥 160K · ⏱️ 14.10.2020):

     conda install -c conda-forge hyperopt
    
featuretools (🥇28 · ⭐ 5.3K) - An open source python library for automated feature engineering. BSD-3
  • GitHub (👨‍💻 49 · 🔀 690 · 📦 650 · 📋 500 - 22% open · ⏱️ 13.01.2021):

     git clone https://github.com/alteryx/featuretools
    
  • PyPi (📥 59K / month · 📦 70 · ⏱️ 31.12.2020):

     pip install featuretools
    
  • Conda (📥 43K · ⏱️ 05.01.2021):

     conda install -c conda-forge featuretools
    
Keras Tuner (🥇28 · ⭐ 2.2K) - Hyperparameter tuning for humans. Apache-2
  • GitHub (👨‍💻 28 · 🔀 260 · 📦 360 · 📋 280 - 51% open · ⏱️ 09.12.2020):

     git clone https://github.com/keras-team/keras-tuner
    
  • PyPi (📥 200K / month · 📦 2 · ⏱️ 20.11.2020):

     pip install keras-tuner
    
NNI (🥈27 · ⭐ 8.7K) - An open source AutoML toolkit for automate machine learning lifecycle,.. MIT
  • GitHub (👨‍💻 110 · 🔀 1.1K · 📦 78 · 📋 1.1K - 21% open · ⏱️ 14.01.2021):

     git clone https://github.com/microsoft/nni
    
  • PyPi (📥 1.6K / month · 📦 16 · ⏱️ 14.01.2021):

     pip install nni
    
AutoKeras (🥈27 · ⭐ 7.7K) - AutoML library for deep learning. Apache-2
  • GitHub (👨‍💻 130 · 🔀 1.2K · 📦 140 · 📋 720 - 8% open · ⏱️ 04.01.2021):

     git clone https://github.com/keras-team/autokeras
    
  • PyPi (📥 4.8K / month · 📦 7 · ⏱️ 30.11.2020):

     pip install autokeras
    
auto-sklearn (🥈26 · ⭐ 5.2K) - Automated Machine Learning with scikit-learn. BSD-3
  • GitHub (👨‍💻 63 · 🔀 970 · 📦 130 · 📋 670 - 17% open · ⏱️ 04.01.2021):

     git clone https://github.com/automl/auto-sklearn
    
  • PyPi (📥 6.3K / month · 📦 30 · ⏱️ 04.01.2021):

     pip install auto-sklearn
    
Bayesian Optimization (🥈26 · ⭐ 4.8K) - A Python implementation of global optimization with.. MIT
  • GitHub (👨‍💻 27 · 🔀 1K · 📥 46 · 📦 570 · 📋 210 - 20% open · ⏱️ 19.12.2020):

     git clone https://github.com/fmfn/BayesianOptimization
    
  • PyPi (📥 47K / month · 📦 92 · ⏱️ 16.05.2020):

     pip install bayesian-optimization
    
AutoGluon (🥈26 · ⭐ 2.9K) - AutoGluon: AutoML for Text, Image, and Tabular Data. Apache-2
  • GitHub (👨‍💻 50 · 🔀 380 · 📦 44 · 📋 370 - 40% open · ⏱️ 16.01.2021):

     git clone https://github.com/awslabs/autogluon
    
  • PyPi (📥 19K / month · ⏱️ 17.01.2021):

     pip install autogluon
    
BoTorch (🥈26 · ⭐ 1.8K) - Bayesian optimization in PyTorch. MIT
  • GitHub (👨‍💻 50 · 🔀 170 · 📦 74 · 📋 150 - 26% open · ⏱️ 13.01.2021):

     git clone https://github.com/pytorch/botorch
    
  • PyPi (📥 25K / month · ⏱️ 08.12.2020):

     pip install botorch
    
SMAC3 (🥈26 · ⭐ 550) - Sequential Model-based Algorithm Configuration. BSD-3
  • GitHub (👨‍💻 30 · 🔀 150 · 📋 330 - 23% open · ⏱️ 29.10.2020):

     git clone https://github.com/automl/SMAC3
    
  • PyPi (📥 8.5K / month · 📦 58 · ⏱️ 29.10.2020):

     pip install smac
    
nevergrad (🥈25 · ⭐ 2.8K) - A Python toolbox for performing gradient-free optimization. MIT
  • GitHub (👨‍💻 42 · 🔀 270 · 📦 110 · 📋 190 - 39% open · ⏱️ 15.01.2021):

     git clone https://github.com/facebookresearch/nevergrad
    
  • PyPi (📥 7.2K / month · 📦 14 · ⏱️ 10.12.2020):

     pip install nevergrad
    
  • Conda (📥 5.6K · ⏱️ 14.12.2020):

     conda install -c conda-forge nevergrad
    
Ax (🥈25 · ⭐ 1.4K) - Adaptive Experimentation Platform. MIT
  • GitHub (👨‍💻 91 · 🔀 140 · 📦 120 · 📋 210 - 16% open · ⏱️ 15.01.2021):

     git clone https://github.com/facebook/Ax
    
  • PyPi (📥 15K / month · 📦 4 · ⏱️ 08.12.2020):

     pip install ax-platform
    
Hyperas (🥈24 · ⭐ 2.1K) - Keras + Hyperopt: A very simple wrapper for convenient.. MIT
  • GitHub (👨‍💻 21 · 🔀 300 · 📋 250 - 35% open · ⏱️ 22.12.2020):

     git clone https://github.com/maxpumperla/hyperas
    
  • PyPi (📥 12K / month · 📦 21 · ⏱️ 28.02.2019):

     pip install hyperas
    
GPyOpt (🥈24 · ⭐ 700) - Gaussian Process Optimization using GPy. BSD-3
  • GitHub (👨‍💻 49 · 🔀 220 · 📦 160 · 📋 290 - 34% open · ⏱️ 05.11.2020):

     git clone https://github.com/SheffieldML/GPyOpt
    
  • PyPi (📥 3.9K / month · 📦 36 · ⏱️ 19.03.2020):

     pip install gpyopt
    
Talos (🥉22 · ⭐ 1.4K) - Hyperparameter Optimization for TensorFlow, Keras and PyTorch. MIT
  • GitHub (👨‍💻 19 · 🔀 220 · 📦 110 · 📋 380 - 10% open · ⏱️ 21.11.2020):

     git clone https://github.com/autonomio/talos
    
  • PyPi (📥 900 / month · 📦 6 · ⏱️ 09.11.2020):

     pip install talos
    
Orion (🥉22 · ⭐ 180) - Asynchronous Distributed Hyperparameter Optimization. BSD-3
  • GitHub (👨‍💻 18 · 🔀 35 · 📦 39 · 📋 120 - 16% open · ⏱️ 15.01.2021):

     git clone https://github.com/Epistimio/orion
    
  • PyPi (📥 950 / month · 📦 8 · ⏱️ 04.12.2020):

     pip install orion
    
AdaNet (🥉21 · ⭐ 3.2K) - Fast and flexible AutoML with learning guarantees. Apache-2
  • GitHub (👨‍💻 26 · 🔀 500 · 📦 33 · 📋 110 - 57% open · ⏱️ 09.07.2020):

     git clone https://github.com/tensorflow/adanet
    
  • PyPi (📥 680 / month · 📦 2 · ⏱️ 09.07.2020):

     pip install adanet
    
optunity (🥉21 · ⭐ 360 · 💤) - optimization routines for hyperparameter tuning. BSD-3
  • GitHub (👨‍💻 9 · 🔀 72 · 📥 62 · 📦 58 · 📋 93 - 48% open · ⏱️ 11.05.2020):

     git clone https://github.com/claesenm/optunity
    
  • PyPi (📥 11K / month · 📦 7 · ⏱️ 30.09.2015):

     pip install optunity
    
Neuraxle (🥉21 · ⭐ 350) - A Sklearn-like Framework for Hyperparameter Tuning and AutoML in.. Apache-2
  • GitHub (👨‍💻 7 · 🔀 39 · 📦 17 · 📋 250 - 47% open · ⏱️ 12.01.2021):

     git clone https://github.com/Neuraxio/Neuraxle
    
  • PyPi (📥 280 / month · ⏱️ 20.09.2020):

     pip install neuraxle
    
mljar-supervised (🥉20 · ⭐ 740) - Automates Machine Learning Pipeline with Feature Engineering.. MIT
  • GitHub (👨‍💻 7 · 🔀 97 · 📦 9 · 📋 270 - 18% open · ⏱️ 15.01.2021):

     git clone https://github.com/mljar/mljar-supervised
    
  • PyPi (📥 2.9K / month · ⏱️ 14.01.2021):

     pip install mljar-supervised
    
Auto ViML (🥉20 · ⭐ 200) - Automatically Build Multiple ML Models with a Single Line of Code... Apache-2
  • GitHub (👨‍💻 6 · 🔀 52 · 📦 13 · 📋 16 - 18% open · ⏱️ 15.01.2021):

     git clone https://github.com/AutoViML/Auto_ViML
    
  • PyPi (📥 520 / month · ⏱️ 15.01.2021):

     pip install autoviml
    
Test Tube (🥉19 · ⭐ 650 · 💤) - Python library to easily log experiments and parallelize.. MIT
  • GitHub (👨‍💻 16 · 🔀 59 · 📥 7 · 📋 44 - 52% open · ⏱️ 17.03.2020):

     git clone https://github.com/williamFalcon/test-tube
    
  • PyPi (📥 9K / month · 📦 5 · ⏱️ 12.12.2018):

     pip install test_tube
    
lazypredict (🥉18 · ⭐ 330) - Lazy Predict help build a lot of basic models without much code.. MIT
  • GitHub (👨‍💻 8 · 🔀 74 · 📦 15 · 📋 64 - 65% open · ⏱️ 13.01.2021):

     git clone https://github.com/shankarpandala/lazypredict
    
  • PyPi (📥 1K / month · ⏱️ 09.07.2020):

     pip install lazypredict
    
Dragonfly (🥉17 · ⭐ 560) - An open source python library for scalable Bayesian optimisation. MIT
  • GitHub (👨‍💻 12 · 🔀 71 · 📋 38 - 55% open · ⏱️ 03.07.2020):

     git clone https://github.com/dragonfly/dragonfly
    
  • PyPi (📥 2.9K / month · 📦 2 · ⏱️ 03.07.2020):

     pip install dragonfly-opt
    
HyperparameterHunter (🥉16 · ⭐ 630) - Easy hyperparameter optimization and automatic result.. MIT
  • GitHub (👨‍💻 3 · 🔀 79 · 📥 270 · 📋 120 - 30% open · ⏱️ 21.10.2020):

     git clone https://github.com/HunterMcGushion/hyperparameter_hunter
    
  • PyPi (📥 59 / month · 📦 1 · ⏱️ 06.08.2019):

     pip install hyperparameter-hunter
    
AlphaPy (🥉16 · ⭐ 540) - Automated Machine Learning [AutoML] with Python, scikit-learn, Keras,.. Apache-2
  • GitHub (👨‍💻 3 · 🔀 110 · 📦 2 · 📋 39 - 25% open · ⏱️ 06.09.2020):

     git clone https://github.com/ScottfreeLLC/AlphaPy
    
  • PyPi (📥 73 / month · ⏱️ 25.08.2020):

     pip install alphapy
    
Auto Tune Models (🥉16 · ⭐ 500 · 💤) - Auto Tune Models - A multi-tenant, multi-data system for.. MIT
  • GitHub (👨‍💻 16 · 🔀 120 · 📦 6 · 📋 88 - 19% open · ⏱️ 21.02.2020):

     git clone https://github.com/HDI-Project/ATM
    
  • PyPi (📥 38 / month · ⏱️ 24.06.2019):

     pip install atm
    
Parfit (🥉15 · ⭐ 200 · 💤) - A package for parallelizing the fit and flexibly scoring of.. MIT
  • GitHub (👨‍💻 2 · 🔀 25 · 📦 7 · 📋 10 - 50% open · ⏱️ 04.04.2020):

     git clone https://github.com/jmcarpenter2/parfit
    
  • PyPi (📥 3.9K / month · ⏱️ 11.10.2018):

     pip install parfit
    
ENAS (🥉14 · ⭐ 2.4K · 💤) - PyTorch implementation of Efficient Neural Architecture Search via.. Apache-2
  • GitHub (👨‍💻 6 · 🔀 440 · 📋 44 - 84% open · ⏱️ 16.06.2020):

     git clone https://github.com/carpedm20/ENAS-pytorch
    
Devol (🥉11 · ⭐ 920) - Genetic neural architecture search with Keras. MIT
  • GitHub (👨‍💻 18 · 🔀 110 · 📋 27 - 25% open · ⏱️ 05.07.2020):

     git clone https://github.com/joeddav/devol
    
Show 13 hidden projects...
  • TPOT (🥇30 · ⭐ 7.7K) - A Python Automated Machine Learning tool that optimizes machine.. ❗️LGPL-3.0
  • MLBox (🥈23 · ⭐ 1.2K) - MLBox is a powerful Automated Machine Learning python library. ❗️BSD-1-Clause
  • auto_ml (🥉21 · ⭐ 1.5K · 💀) - [UNMAINTAINED] Automated machine learning for analytics & production. MIT
  • HpBandSter (🥉19 · ⭐ 440 · 💀) - a distributed Hyperband implementation on Steroids. BSD-3
  • Advisor (🥉17 · ⭐ 1.3K · 💀) - Open-source implementation of Google Vizier for hyper parameters.. Apache-2
  • sklearn-deap (🥉17 · ⭐ 620 · 💀) - Use evolutionary algorithms instead of gridsearch in.. MIT
  • Sherpa (🥉17 · ⭐ 280) - Hyperparameter optimization that enables researchers to experiment,.. ❗️GPL-3.0
  • automl-gs (🥉16 · ⭐ 1.7K · 💀) - Provide an input CSV and a target field to predict, generate a.. MIT
  • Xcessiv (🥉16 · ⭐ 1.3K · 💀) - A web-based application for quick, scalable, and automated.. Apache-2
  • Auptimizer (🥉13 · ⭐ 160) - An automatic ML model optimization tool. ❗️GPL-3.0
  • Hypermax (🥉13 · ⭐ 95) - Better, faster hyper-parameter optimization. BSD-3
  • featurewiz (🥉12 · ⭐ 17 · 🐣) - Use advanced feature engineering strategies and select the.. Apache-2
  • Hypertunity (🥉10 · ⭐ 120 · 💤) - A toolset for black-box hyperparameter optimisation. Apache-2

Reinforcement Learning

Back to top

Libraries for building and evaluating reinforcement learning & agent-based systems.

OpenAI Gym (🥇35 · ⭐ 23K) - A toolkit for developing and comparing reinforcement learning.. MIT
  • GitHub (👨‍💻 270 · 🔀 6.6K · 📦 17K · 📋 1.3K - 18% open · ⏱️ 11.01.2021):

     git clone https://github.com/openai/gym
    
  • PyPi (📥 290K / month · 📦 3.6K · ⏱️ 19.12.2020):

     pip install gym
    
baselines (🥇28 · ⭐ 11K · 💤) - OpenAI Baselines: high-quality implementations of reinforcement.. MIT
  • GitHub (👨‍💻 110 · 🔀 3.8K · 📦 280 · 📋 870 - 51% open · ⏱️ 31.01.2020):

     git clone https://github.com/openai/baselines
    
  • PyPi (📥 28K / month · 📦 37 · ⏱️ 26.02.2018):

     pip install baselines
    
TensorLayer (🥈27 · ⭐ 6.5K) - Deep Learning and Reinforcement Learning Library for.. Apache-2
  • GitHub (👨‍💻 130 · 🔀 1.4K · 📥 1.2K · 📋 480 - 9% open · ⏱️ 06.01.2021):

     git clone https://github.com/tensorlayer/tensorlayer
    
  • PyPi (📥 2.2K / month · 📦 68 · ⏱️ 19.06.2020):

     pip install tensorlayer
    
TF-Agents (🥈27 · ⭐ 1.8K) - TF-Agents is a library for Reinforcement Learning in.. Apache-2
  • GitHub (👨‍💻 84 · 🔀 450 · 📦 350 · 📋 400 - 24% open · ⏱️ 14.01.2021):

     git clone https://github.com/tensorflow/agents
    
  • PyPi (📥 7.3K / month · 📦 12 · ⏱️ 05.01.2021):

     pip install tf-agents
    
Dopamine (🥈26 · ⭐ 9.3K · 📉) - Dopamine is a research framework for fast prototyping of.. Apache-2
  • GitHub (👨‍💻 13 · 🔀 1.2K · 📋 140 - 51% open · ⏱️ 24.11.2020):

     git clone https://github.com/google/dopamine
    
  • PyPi (📥 59K / month · 📦 31 · ⏱️ 16.10.2020):

     pip install dopamine-rl
    
ViZDoom (🥈25 · ⭐ 1.2K) - Doom-based AI Research Platform for Reinforcement Learning from Raw.. MIT
  • GitHub (👨‍💻 43 · 🔀 280 · 📥 10K · 📦 100 · 📋 420 - 21% open · ⏱️ 16.01.2021):

     git clone https://github.com/mwydmuch/ViZDoom
    
  • PyPi (📥 390 / month · 📦 12 · ⏱️ 29.10.2020):

     pip install vizdoom
    
TensorForce (🥈24 · ⭐ 2.8K) - Tensorforce: a TensorFlow library for applied.. Apache-2
  • GitHub (👨‍💻 76 · 🔀 470 · 📋 550 - 1% open · ⏱️ 30.12.2020):

     git clone https://github.com/tensorforce/tensorforce
    
  • PyPi (📥 760 / month · 📦 23 · ⏱️ 07.09.2019):

     pip install tensorforce
    
Stable Baselines (🥈24 · ⭐ 2.8K · 📉) - A fork of OpenAI Baselines, implementations of.. MIT
  • GitHub (👨‍💻 110 · 🔀 560 · 📋 830 - 14% open · ⏱️ 23.12.2020):

     git clone https://github.com/hill-a/stable-baselines
    
  • PyPi (📥 7.9K / month · 📦 25 · ⏱️ 05.08.2020):

     pip install stable-baselines
    
Acme (🥉23 · ⭐ 1.9K) - A library of reinforcement learning components and agents. Apache-2
  • GitHub (👨‍💻 29 · 🔀 180 · 📦 17 · 📋 81 - 45% open · ⏱️ 14.01.2021):

     git clone https://github.com/deepmind/acme
    
  • PyPi (📥 500 / month · ⏱️ 24.07.2020):

     pip install dm-acme
    
garage (🥉23 · ⭐ 1K) - A toolkit for reproducible reinforcement learning research. MIT
  • GitHub (👨‍💻 73 · 🔀 180 · 📦 13 · 📋 950 - 19% open · ⏱️ 14.01.2021):

     git clone https://github.com/rlworkgroup/garage
    
  • PyPi (📥 200 / month · ⏱️ 02.10.2020):

     pip install garage
    
ChainerRL (🥉23 · ⭐ 920) - ChainerRL is a deep reinforcement learning library built on top of.. MIT
  • GitHub (👨‍💻 29 · 🔀 200 · 📦 88 · 📋 220 - 33% open · ⏱️ 09.12.2020):

     git clone https://github.com/chainer/chainerrl
    
  • PyPi (📥 530 / month · 📦 14 · ⏱️ 14.02.2020):

     pip install chainerrl
    
PARL (🥉21 · ⭐ 1.8K) - A high-performance distributed training framework for Reinforcement.. Apache-2
  • GitHub (👨‍💻 21 · 🔀 410 · 📦 65 · 📋 200 - 26% open · ⏱️ 14.01.2021):

     git clone https://github.com/PaddlePaddle/PARL
    
  • PyPi (📥 330 / month · ⏱️ 08.01.2021):

     pip install parl
    
TRFL (🥉20 · ⭐ 3.1K · 💤) - TensorFlow Reinforcement Learning. Apache-2
  • GitHub (👨‍💻 12 · 🔀 370 · 📦 36 · 📋 22 - 27% open · ⏱️ 20.04.2020):

     git clone https://github.com/deepmind/trfl
    
  • PyPi (📥 610 / month · 📦 2 · ⏱️ 20.04.2020):

     pip install trfl
    
Coach (🥉19 · ⭐ 1.9K) - Reinforcement Learning Coach by Intel AI Lab enables easy.. Apache-2
  • GitHub (👨‍💻 34 · 🔀 370 · 📋 260 - 30% open · ⏱️ 15.12.2020):

     git clone https://github.com/IntelLabs/coach
    
  • PyPi (📥 120 / month · 📦 1 · ⏱️ 10.10.2019):

     pip install rl_coach
    
PFRL (🥉19 · ⭐ 490) - PFRL: a PyTorch-based deep reinforcement learning library. MIT
  • GitHub (👨‍💻 10 · 🔀 60 · 📦 4 · 📋 43 - 51% open · ⏱️ 06.01.2021):

     git clone https://github.com/pfnet/pfrl
    
  • PyPi (📥 440 / month · ⏱️ 17.12.2020):

     pip install pfrl
    
ReAgent (🥉16 · ⭐ 2.8K) - A platform for Reasoning systems (Reinforcement Learning,.. BSD-3
  • GitHub (👨‍💻 86 · 🔀 370 · 📋 94 - 21% open · ⏱️ 17.01.2021):

     git clone https://github.com/facebookresearch/ReAgent
    
RLax (🥉16 · ⭐ 540) - A library of reinforcement learning building blocks in JAX. Apache-2 jax
  • GitHub (👨‍💻 10 · 🔀 40 · 📦 9 · 📋 6 - 50% open · ⏱️ 13.01.2021):

     git clone https://github.com/deepmind/rlax
    
  • PyPi (📥 120 / month · ⏱️ 19.02.2020):

     pip install rlax
    
Show 2 hidden projects...
  • keras-rl (🥈26 · ⭐ 4.9K · 💀) - Deep Reinforcement Learning for Keras. MIT
  • DeepMind Lab (🥉17 · ⭐ 6.4K) - A customisable 3D platform for agent-based AI research. ❗️GPL-2.0

Recommender Systems

Back to top

Libraries for building and evaluating recommendation systems.

implicit (🥇28 · ⭐ 2.2K) - Fast Python Collaborative Filtering for Implicit Feedback Datasets. MIT
  • GitHub (👨‍💻 28 · 🔀 450 · 📦 350 · 📋 330 - 23% open · ⏱️ 15.11.2020):

     git clone https://github.com/benfred/implicit
    
  • PyPi (📥 93K / month · 📦 22 · ⏱️ 15.09.2020):

     pip install implicit
    
  • Conda (📥 180K · ⏱️ 24.11.2020):

     conda install -c conda-forge implicit
    
lightfm (🥇27 · ⭐ 3.5K) - A Python implementation of LightFM, a hybrid recommendation algorithm. Apache-2
  • GitHub (👨‍💻 42 · 🔀 560 · 📦 380 · 📋 400 - 31% open · ⏱️ 27.11.2020):

     git clone https://github.com/lyst/lightfm
    
  • PyPi (📥 89K / month · 📦 28 · ⏱️ 27.11.2020):

     pip install lightfm
    
  • Conda (📥 72K · ⏱️ 07.12.2020):

     conda install -c conda-forge lightfm
    
scikit-surprise (🥈26 · ⭐ 4.6K) - A Python scikit for building and analyzing recommender.. BSD-3
  • GitHub (👨‍💻 38 · 🔀 820 · 📦 860 · 📋 320 - 10% open · ⏱️ 05.08.2020):

     git clone https://github.com/NicolasHug/Surprise
    
  • PyPi (📥 37K / month · 📦 24 · ⏱️ 19.07.2020):

     pip install scikit-surprise
    
  • Conda (📥 150K · ⏱️ 13.10.2020):

     conda install -c conda-forge scikit-surprise
    
Cornac (🥈22 · ⭐ 300 · ➕) - A Comparative Framework for Multimodal Recommender Systems. Apache-2
  • GitHub (👨‍💻 11 · 🔀 55 · 📦 26 · 📋 29 - 3% open · ⏱️ 04.01.2021):

     git clone https://github.com/PreferredAI/cornac
    
  • PyPi (📥 700 / month · 📦 1 · ⏱️ 28.12.2020):

     pip install cornac
    
  • Conda (📥 98K · ⏱️ 04.01.2021):

     conda install -c conda-forge cornac
    
Recommenders (🥈21 · ⭐ 9K) - Best Practices on Recommendation Systems. MIT
  • GitHub (👨‍💻 88 · 🔀 1.6K · 📦 1 · 📋 540 - 20% open · ⏱️ 04.01.2021):

     git clone https://github.com/microsoft/recommenders
    
TF Ranking (🥈21 · ⭐ 2K) - Learning to Rank in TensorFlow. Apache-2
  • GitHub (👨‍💻 17 · 🔀 350 · 📋 220 - 10% open · ⏱️ 19.08.2020):

     git clone https://github.com/tensorflow/ranking
    
  • PyPi (📥 19K / month · 📦 1 · ⏱️ 19.08.2020):

     pip install tensorflow_ranking
    
tensorrec (🥉20 · ⭐ 1.1K · 💤) - A TensorFlow recommendation algorithm and framework in.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 200 · 📦 23 · 📋 120 - 26% open · ⏱️ 04.02.2020):

     git clone https://github.com/jfkirk/tensorrec
    
  • PyPi (📥 920 / month · 📦 1 · ⏱️ 02.04.2019):

     pip install tensorrec
    
fastFM (🥉20 · ⭐ 890 · 💤) - fastFM: A Library for Factorization Machines. BSD-3
  • GitHub (👨‍💻 19 · 🔀 190 · 📥 370 · 📦 72 · 📋 100 - 43% open · ⏱️ 05.03.2020):

     git clone https://github.com/ibayer/fastFM
    
  • PyPi (📥 460 / month · 📦 4 · ⏱️ 23.11.2017):

     pip install fastfm
    
RecBole (🥉20 · ⭐ 700) - A unified, comprehensive and efficient recommendation library. MIT
  • GitHub (👨‍💻 27 · 🔀 98 · 📋 71 - 33% open · ⏱️ 17.01.2021):

     git clone https://github.com/RUCAIBox/RecBole
    
  • PyPi (📥 100 / month · ⏱️ 14.01.2021):

     pip install recbole
    
  • Conda (📥 180 · ⏱️ 14.01.2021):

     conda install -c aibox recbole
    
TF Recommenders (🥉19 · ⭐ 690) - TensorFlow Recommenders is a library for building.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 71 · 📦 10 · 📋 79 - 45% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/recommenders
    
  • PyPi (📥 3.3K / month · ⏱️ 22.12.2020):

     pip install tensorflow-recommenders
    
recmetrics (🥉19 · ⭐ 230) - A library of metrics for evaluating recommender systems. MIT
  • GitHub (👨‍💻 12 · 🔀 53 · 📦 16 · 📋 14 - 50% open · ⏱️ 03.12.2020):

     git clone https://github.com/statisticianinstilettos/recmetrics
    
  • PyPi (📥 660 / month · ⏱️ 10.02.2019):

     pip install recmetrics
    
Spotlight (🥉18 · ⭐ 2.4K · 💤) - Deep recommender models using PyTorch. MIT
  • GitHub (👨‍💻 11 · 🔀 370 · 📋 110 - 57% open · ⏱️ 09.02.2020):

     git clone https://github.com/maciejkula/spotlight
    
  • Conda (📥 5.9K · ⏱️ 27.05.2018):

     conda install -c maciejkula spotlight
    
OpenRec (🥉16 · ⭐ 350 · 💤) - OpenRec is an open-source and modular library for neural network-.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 79 · 📦 1 · 📋 16 - 25% open · ⏱️ 19.02.2020):

     git clone https://github.com/ylongqi/openrec
    
  • PyPi (📥 28 / month · 📦 1 · ⏱️ 11.09.2019):

     pip install openrec
    
Case Recommender (🥉16 · ⭐ 310 · 💤) - Case Recommender: A Flexible and Extensible Python.. MIT
  • GitHub (👨‍💻 10 · 🔀 66 · 📦 5 · 📋 22 - 27% open · ⏱️ 26.05.2020):

     git clone https://github.com/caserec/CaseRecommender
    
  • PyPi (📥 37 / month · ⏱️ 20.08.2019):

     pip install caserecommender
    

Privacy Machine Learning

Back to top

Libraries for encrypted and privacy-preserving machine learning using methods like federated learning & differential privacy.

PySyft (🥇26 · ⭐ 6.8K) - A library for answering questions using data you cannot see. MIT
  • GitHub (👨‍💻 330 · 🔀 1.5K · 📋 2.6K - 6% open · ⏱️ 15.01.2021):

     git clone https://github.com/OpenMined/PySyft
    
  • PyPi (📥 1.6K / month · 📦 3 · ⏱️ 14.09.2020):

     pip install syft
    
TensorFlow Privacy (🥈21 · ⭐ 1.3K) - Library for training machine learning models with.. Apache-2
  • GitHub (👨‍💻 35 · 🔀 260 · 📥 41 · 📋 120 - 33% open · ⏱️ 13.01.2021):

     git clone https://github.com/tensorflow/privacy
    
  • PyPi (📥 4K / month · 📦 1 · ⏱️ 03.09.2020):

     pip install tensorflow-privacy
    
TFEncrypted (🥈21 · ⭐ 810) - A Framework for Encrypted Machine Learning in TensorFlow. Apache-2
  • GitHub (👨‍💻 28 · 🔀 130 · 📦 46 · 📋 390 - 41% open · ⏱️ 19.08.2020):

     git clone https://github.com/tf-encrypted/tf-encrypted
    
  • PyPi (📥 320 / month · 📦 8 · ⏱️ 21.10.2019):

     pip install tf-encrypted
    
Opacus (🥈21 · ⭐ 710) - Training PyTorch models with differential privacy. Apache-2
  • GitHub (👨‍💻 23 · 🔀 90 · 📥 30 · 📦 10 · 📋 54 - 9% open · ⏱️ 13.01.2021):

     git clone https://github.com/pytorch/opacus
    
  • PyPi (📥 810 / month · ⏱️ 17.12.2020):

     pip install opacus
    
FATE (🥉20 · ⭐ 2.7K) - An Industrial Grade Federated Learning Framework. Apache-2
  • GitHub (👨‍💻 50 · 🔀 800 · 📋 800 - 34% open · ⏱️ 17.01.2021):

     git clone https://github.com/FederatedAI/FATE
    
CrypTen (🥉16 · ⭐ 690) - A framework for Privacy Preserving Machine Learning. MIT
  • GitHub (👨‍💻 23 · 🔀 89 · 📦 5 · 📋 46 - 32% open · ⏱️ 23.12.2020):

     git clone https://github.com/facebookresearch/CrypTen
    
  • PyPi (📥 70 / month · ⏱️ 05.02.2020):

     pip install crypten
    

Workflow & Experiment Tracking

Back to top

Libraries to organize, track, and visualize machine learning experiments.

Tensorboard (🥇36 · ⭐ 5.1K) - TensorFlow's Visualization Toolkit. Apache-2
  • GitHub (👨‍💻 250 · 🔀 1.3K · 📦 51K · 📋 1.5K - 37% open · ⏱️ 16.01.2021):

     git clone https://github.com/tensorflow/tensorboard
    
  • PyPi (📥 4.9M / month · 📦 3.6K · ⏱️ 12.11.2020):

     pip install tensorboard
    
  • Conda (📥 1.5M · ⏱️ 15.01.2021):

     conda install -c conda-forge tensorboard
    
mlflow (🥇34 · ⭐ 8.2K) - Open source platform for the machine learning lifecycle. Apache-2
  • GitHub (👨‍💻 270 · 🔀 1.9K · 📦 1.8K · 📋 1.7K - 39% open · ⏱️ 15.01.2021):

     git clone https://github.com/mlflow/mlflow
    
  • PyPi (📥 2.4M / month · 📦 150 · ⏱️ 31.12.2020):

     pip install mlflow
    
  • Conda (📥 200K · ⏱️ 11.01.2021):

     conda install -c conda-forge mlflow
    
DVC (🥇30 · ⭐ 7.1K) - Data Version Control | Git for Data & Models. Apache-2
  • GitHub (👨‍💻 200 · 🔀 660 · 📥 17K · 📦 540 · 📋 2.6K - 18% open · ⏱️ 15.01.2021):

     git clone https://github.com/iterative/dvc
    
  • PyPi (📥 47K / month · 📦 46 · ⏱️ 05.01.2021):

     pip install dvc
    
  • Conda (📥 440K · ⏱️ 05.01.2021):

     conda install -c conda-forge dvc
    
wandb client (🥇30 · ⭐ 2.6K) - A tool for visualizing and tracking your machine learning.. MIT
  • GitHub (👨‍💻 73 · 🔀 180 · 📦 1.8K · 📋 990 - 34% open · ⏱️ 17.01.2021):

     git clone https://github.com/wandb/client
    
  • PyPi (📥 180K / month · 📦 78 · ⏱️ 16.01.2021):

     pip install wandb
    
kaggle (🥈29 · ⭐ 3.8K) - Official Kaggle API. Apache-2
  • GitHub (👨‍💻 35 · 🔀 770 · 📦 4.3K · 📋 270 - 60% open · ⏱️ 30.11.2020):

     git clone https://github.com/Kaggle/kaggle-api
    
  • PyPi (📥 240K / month · 📦 560 · ⏱️ 30.11.2020):

     pip install kaggle
    
  • Conda (📥 40K · ⏱️ 30.11.2020):

     conda install -c conda-forge kaggle
    
sacred (🥈29 · ⭐ 3.3K) - Sacred is a tool to help you configure, organize, log and reproduce.. MIT
  • GitHub (👨‍💻 91 · 🔀 310 · 📦 770 · 📋 490 - 18% open · ⏱️ 12.01.2021):

     git clone https://github.com/IDSIA/sacred
    
  • PyPi (📥 10K / month · 📦 150 · ⏱️ 14.12.2020):

     pip install sacred
    
SageMaker SDK (🥈29 · ⭐ 1.3K · 📉) - A library for training and deploying machine learning.. Apache-2
  • GitHub (👨‍💻 160 · 🔀 560 · 📦 560 · 📋 720 - 28% open · ⏱️ 15.01.2021):

     git clone https://github.com/aws/sagemaker-python-sdk
    
  • PyPi (📥 410K / month · 📦 28 · ⏱️ 14.01.2021):

     pip install sagemaker
    
snakemake (🥈29 · ⭐ 810) - This is the development home of the workflow management system.. MIT
  • GitHub (👨‍💻 190 · 🔀 180 · 📦 720 · 📋 520 - 60% open · ⏱️ 15.01.2021):

     git clone https://github.com/snakemake/snakemake
    
  • PyPi (📥 8.1K / month · 📦 290 · ⏱️ 15.01.2021):

     pip install snakemake
    
  • Conda (📥 270K · ⏱️ 22.12.2020):

     conda install -c bioconda snakemake
    
PyCaret (🥈28 · ⭐ 2.8K) - An open-source, low-code machine learning library in Python. MIT
  • GitHub (👨‍💻 43 · 🔀 600 · 📥 350 · 📦 600 · 📋 700 - 23% open · ⏱️ 17.01.2021):

     git clone https://github.com/pycaret/pycaret
    
  • PyPi (📥 23K / month · ⏱️ 22.12.2020):

     pip install pycaret
    
AzureML SDK (🥈28 · ⭐ 2.1K) - Python notebooks with ML and deep learning examples with Azure.. MIT
  • GitHub (👨‍💻 52 · 🔀 1.5K · 📥 390 · 📋 920 - 7% open · ⏱️ 11.01.2021):

     git clone https://github.com/Azure/MachineLearningNotebooks
    
  • PyPi (📥 270K / month · 📦 60 · ⏱️ 11.01.2021):

     pip install azureml-sdk
    
tensorboardX (🥈27 · ⭐ 6.8K) - tensorboard for pytorch (and chainer, mxnet, numpy, ...). MIT
  • GitHub (👨‍💻 64 · 🔀 770 · 📥 290 · 📦 9.8K · 📋 410 - 17% open · ⏱️ 05.07.2020):

     git clone https://github.com/lanpa/tensorboardX
    
  • PyPi (📥 290K / month · 📦 1.3K · ⏱️ 31.12.2019):

     pip install tensorboardX
    
  • Conda (📥 260K · ⏱️ 06.07.2020):

     conda install -c conda-forge tensorboardx
    
Metaflow (🥈26 · ⭐ 4K) - Build and manage real-life data science projects with ease. Apache-2
  • GitHub (👨‍💻 28 · 🔀 310 · 📦 100 · 📋 250 - 43% open · ⏱️ 11.01.2021):

     git clone https://github.com/Netflix/metaflow
    
  • PyPi (📥 37K / month · 📦 1 · ⏱️ 29.10.2020):

     pip install metaflow
    
  • Conda (📥 10K · ⏱️ 12.11.2020):

     conda install -c conda-forge metaflow
    
Catalyst (🥈26 · ⭐ 2.4K) - Accelerated deep learning R&D. Apache-2
  • GitHub (👨‍💻 80 · 🔀 270 · 📦 270 · 📋 280 - 6% open · ⏱️ 14.01.2021):

     git clone https://github.com/catalyst-team/catalyst
    
  • PyPi (📥 6.6K / month · 📦 38 · ⏱️ 20.12.2020):

     pip install catalyst
    
TNT (🥈25 · ⭐ 1.3K) - Simple tools for logging and visualizing, loading and training. BSD-3
  • GitHub (👨‍💻 35 · 🔀 180 · 📦 510 · 📋 64 - 46% open · ⏱️ 05.01.2021):

     git clone https://github.com/pytorch/tnt
    
  • PyPi (📥 10K / month · 📦 50 · ⏱️ 29.07.2018):

     pip install torchnet
    
Hub (🥈24 · ⭐ 610) - The fastest way to access and manage datasets for PyTorch and TensorFlow... MPL-2.0
  • GitHub (👨‍💻 49 · 🔀 110 · 📦 110 · 📋 180 - 47% open · ⏱️ 17.01.2021):

     git clone https://github.com/activeloopai/Hub
    
  • PyPi (📥 1.3K / month · 📦 52 · ⏱️ 15.01.2021):

     pip install hub
    
  • Conda (📥 93K · ⏱️ 22.04.2020):

     conda install -c conda-forge hub
    
ml-metadata (🥈24 · ⭐ 260) - For recording and retrieving metadata associated with ML.. Apache-2
  • GitHub (👨‍💻 11 · 🔀 56 · 📥 1.2K · 📦 63 · 📋 54 - 27% open · ⏱️ 14.01.2021):

     git clone https://github.com/google/ml-metadata
    
  • PyPi (📥 230K / month · ⏱️ 15.12.2020):

     pip install ml-metadata
    
VisualDL (🥉23 · ⭐ 3.2K) - Deep Learning Visualization Toolkit. Apache-2
  • GitHub (👨‍💻 28 · 🔀 440 · 📥 65 · 📦 140 · 📋 340 - 6% open · ⏱️ 15.01.2021):

     git clone https://github.com/PaddlePaddle/VisualDL
    
  • PyPi (📥 4.8K / month · 📦 10 · ⏱️ 29.12.2020):

     pip install visualdl
    
TensorWatch (🥉23 · ⭐ 3K) - Debugging, monitoring and visualization for Python Machine Learning.. MIT
  • GitHub (👨‍💻 13 · 🔀 310 · 📦 45 · 📋 63 - 77% open · ⏱️ 15.01.2021):

     git clone https://github.com/microsoft/tensorwatch
    
  • PyPi (📥 9.8K / month · 📦 6 · ⏱️ 04.03.2020):

     pip install tensorwatch
    
ClearML (🥉23 · ⭐ 2.1K) - ClearML - Auto-Magical Suite of tools to streamline your ML.. Apache-2
  • GitHub (👨‍💻 24 · 🔀 300 · 📥 260 · 📦 9 · 📋 280 - 33% open · ⏱️ 13.01.2021):

     git clone https://github.com/allegroai/clearml
    
  • PyPi (📥 1.2K / month · ⏱️ 13.01.2021):

     pip install clearml
    
  • Docker Hub (📥 30K · ⏱️ 05.10.2020):

     docker pull allegroai/trains
    
livelossplot (🥉23 · ⭐ 1K) - Live training loss plot in Jupyter Notebook for Keras, PyTorch.. MIT
  • GitHub (👨‍💻 15 · 🔀 130 · 📦 420 · 📋 72 - 6% open · ⏱️ 16.11.2020):

     git clone https://github.com/stared/livelossplot
    
  • PyPi (📥 11K / month · 📦 5 · ⏱️ 24.07.2020):

     pip install livelossplot
    
knockknock (🥉22 · ⭐ 1.9K · 💤) - Knock Knock: Get notified when your training ends with only two.. MIT
  • GitHub (👨‍💻 18 · 🔀 160 · 📦 130 · 📋 33 - 36% open · ⏱️ 16.03.2020):

     git clone https://github.com/huggingface/knockknock
    
  • PyPi (📥 1.2K / month · 📦 3 · ⏱️ 16.03.2020):

     pip install knockknock
    
  • Conda (📥 5.3K · ⏱️ 17.03.2020):

     conda install -c conda-forge knockknock
    
Guild AI (🥉22 · ⭐ 520) - Experiment tracking, ML developer tools. Apache-2
  • GitHub (👨‍💻 16 · 🔀 42 · 📦 25 · 📋 240 - 39% open · ⏱️ 14.01.2021):

     git clone https://github.com/guildai/guildai
    
  • PyPi (📥 1.9K / month · ⏱️ 14.01.2021):

     pip install guildai
    
lore (🥉21 · ⭐ 1.5K · 💤) - Lore makes machine learning approachable for Software Engineers and.. MIT
  • GitHub (👨‍💻 22 · 🔀 120 · 📦 16 · 📋 34 - 47% open · ⏱️ 11.05.2020):

     git clone https://github.com/instacart/lore
    
  • PyPi (📥 650 / month · 📦 1 · ⏱️ 11.05.2020):

     pip install lore
    
Studio.ml (🥉21 · ⭐ 370) - Studio: Simplify and expedite model building process. Apache-2
  • GitHub (👨‍💻 21 · 🔀 48 · 📦 3 · 📋 240 - 23% open · ⏱️ 12.01.2021):

     git clone https://github.com/studioml/studio
    
  • PyPi (📥 950 / month · ⏱️ 16.01.2021):

     pip install studioml
    
gokart (🥉21 · ⭐ 160) - A wrapper of the data pipeline library luigi. MIT
  • GitHub (👨‍💻 24 · 🔀 30 · 📦 28 · 📋 29 - 31% open · ⏱️ 12.01.2021):

     git clone https://github.com/m3dev/gokart
    
  • PyPi (📥 470 / month · 📦 2 · ⏱️ 25.02.2020):

     pip install gokart
    
hiddenlayer (🥉20 · ⭐ 1.4K · 💤) - Neural network graphs and training metrics for.. MIT
  • GitHub (👨‍💻 6 · 🔀 190 · 📦 52 · 📋 75 - 57% open · ⏱️ 24.04.2020):

     git clone https://github.com/waleedka/hiddenlayer
    
  • PyPi (📥 1.1K / month · 📦 4 · ⏱️ 24.04.2020):

     pip install hiddenlayer
    
Labml (🥉20 · ⭐ 360) - Monitor PyTorch & TensorFlow model training from your mobile phone. MIT
  • GitHub (👨‍💻 5 · 🔀 33 · 📦 20 · ⏱️ 15.01.2021):

     git clone https://github.com/lab-ml/labml
    
  • PyPi (📥 980 / month · ⏱️ 15.01.2021):

     pip install labml
    
MXBoard (🥉19 · ⭐ 330 · 💤) - Logging MXNet data for visualization in TensorBoard. Apache-2
  • GitHub (👨‍💻 9 · 🔀 46 · 📦 93 · 📋 31 - 51% open · ⏱️ 24.01.2020):

     git clone https://github.com/awslabs/mxboard
    
  • PyPi (📥 2.4K / month · 📦 1 · ⏱️ 22.05.2018):

     pip install mxboard
    
quinn (🥉19 · ⭐ 200) - pyspark methods to enhance developer productivity. Apache-2
  • GitHub (👨‍💻 6 · 🔀 26 · 📋 23 - 73% open · ⏱️ 29.07.2020):

     git clone https://github.com/MrPowers/quinn
    
  • PyPi (📥 110K / month · 📦 3 · ⏱️ 26.07.2020):

     pip install quinn
    
aim (🥉15 · ⭐ 780) - Aim a super-easy way to record, search and compare 1000s of ML training.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 45 · 📋 39 - 35% open · ⏱️ 13.01.2021):

     git clone https://github.com/aimhubio/aim
    
  • PyPi (📥 870 / month · 📦 2 · ⏱️ 07.01.2021):

     pip install aim
    
Show 6 hidden projects...
  • TensorBoard Logger (🥉20 · ⭐ 610 · 💀) - Log TensorBoard events without touching TensorFlow. MIT
  • SKLL (🥉17 · ⭐ 520) - SciKit-Learn Laboratory (SKLL) makes it easy to run machine.. ❗️BSD-1-Clause
  • datmo (🥉16 · ⭐ 330 · 💀) - Open source production model management tool for data scientists. MIT
  • steppy (🥉15 · ⭐ 130 · 💀) - Lightweight, Python library for fast and reproducible experimentation. MIT
  • ModelChimp (🥉14 · ⭐ 120) - Experiment tracking for machine and deep learning projects. BSD-2
  • traintool (🥉10 · ⭐ 9 · 🐣) - Train off-the-shelf machine learning models in one.. Apache-2

Model Serialization & Conversion

Back to top

Libraries to serialize models to files, convert between a variety of model formats, and optimize models for deployment.

onnx (🥇33 · ⭐ 9.6K) - Open standard for machine learning interoperability. Apache-2
  • GitHub (👨‍💻 190 · 🔀 1.8K · 📥 9.9K · 📦 2.4K · 📋 1.4K - 35% open · ⏱️ 15.01.2021):

     git clone https://github.com/onnx/onnx
    
  • PyPi (📥 310K / month · 📦 300 · ⏱️ 06.11.2020):

     pip install onnx
    
  • Conda (📥 180K · ⏱️ 13.01.2021):

     conda install -c conda-forge onnx
    
Core ML Tools (🥇26 · ⭐ 2.1K) - Core ML tools contain supporting tools for Core ML model.. BSD-3
  • GitHub (👨‍💻 110 · 🔀 320 · 📥 2.3K · 📦 450 · 📋 610 - 50% open · ⏱️ 08.01.2021):

     git clone https://github.com/apple/coremltools
    
  • PyPi (📥 39K / month · 📦 130 · ⏱️ 10.10.2020):

     pip install coremltools
    
TorchServe (🥈24 · ⭐ 1.5K) - Model Serving on PyTorch. Apache-2
  • GitHub (👨‍💻 61 · 🔀 220 · 📥 170 · 📦 26 · 📋 530 - 24% open · ⏱️ 13.01.2021):

     git clone https://github.com/pytorch/serve
    
  • PyPi (📥 1.7K / month · ⏱️ 17.12.2020):

     pip install torchserve
    
  • Conda (📥 6K · ⏱️ 17.12.2020):

     conda install -c pytorch torchserve
    
  • Docker Hub (📥 50K · ⭐ 3 · ⏱️ 18.12.2020):

     docker pull pytorch/torchserve
    
mmdnn (🥈22 · ⭐ 5.2K) - MMdnn is a set of tools to help users inter-operate among different deep.. MIT
  • GitHub (👨‍💻 85 · 🔀 920 · 📥 3.3K · 📦 47 · 📋 590 - 51% open · ⏱️ 14.08.2020):

     git clone https://github.com/Microsoft/MMdnn
    
  • PyPi (📥 550 / month · 📦 3 · ⏱️ 24.07.2020):

     pip install mmdnn
    
m2cgen (🥈22 · ⭐ 1.7K) - Transform ML models into a native code (Java, C, Python, Go, JavaScript,.. MIT
  • GitHub (👨‍💻 12 · 🔀 150 · 📦 4 · 📋 61 - 40% open · ⏱️ 07.10.2020):

     git clone https://github.com/BayesWitnesses/m2cgen
    
  • PyPi (📥 8.6K / month · ⏱️ 18.09.2020):

     pip install m2cgen
    
cortex (🥉21 · ⭐ 7.2K) - Run inference at scale. Apache-2
  • GitHub (👨‍💻 18 · 🔀 540 · 📋 870 - 21% open · ⏱️ 15.01.2021):

     git clone https://github.com/cortexlabs/cortex
    
  • PyPi (📥 690 / month · ⏱️ 06.01.2021):

     pip install cortex
    
Hummingbird (🥉20 · ⭐ 2.2K) - Hummingbird compiles trained ML models into tensor computation for.. MIT
  • GitHub (👨‍💻 18 · 🔀 150 · 📥 88 · 📦 12 · 📋 180 - 27% open · ⏱️ 13.01.2021):

     git clone https://github.com/microsoft/hummingbird
    
  • PyPi (📥 800 / month · ⏱️ 04.01.2021):

     pip install hummingbird-ml
    
pytorch2keras (🥉18 · ⭐ 660 · 💤) - PyTorch to Keras model convertor. MIT
  • GitHub (👨‍💻 11 · 🔀 110 · 📦 17 · 📋 100 - 37% open · ⏱️ 14.05.2020):

     git clone https://github.com/nerox8664/pytorch2keras
    
  • PyPi (📥 370 / month · 📦 1 · ⏱️ 14.05.2020):

     pip install pytorch2keras
    
tfdeploy (🥉16 · ⭐ 350) - Deploy tensorflow graphs for fast evaluation and export to.. BSD-3
  • GitHub (👨‍💻 4 · 🔀 38 · 📋 33 - 30% open · ⏱️ 08.01.2021):

     git clone https://github.com/riga/tfdeploy
    
  • PyPi (📥 48 / month · 📦 2 · ⏱️ 30.03.2017):

     pip install tfdeploy
    
Show 2 hidden projects...

Model Interpretability

Back to top

Libraries to visualize, explain, debug, evaluate, and interpret machine learning models.

shap (🥇33 · ⭐ 11K) - A game theoretic approach to explain the output of any machine learning model. MIT
  • GitHub (👨‍💻 140 · 🔀 1.6K · 📦 1.9K · 📋 1.4K - 64% open · ⏱️ 15.01.2021):

     git clone https://github.com/slundberg/shap
    
  • PyPi (📥 760K / month · 📦 140 · ⏱️ 04.11.2020):

     pip install shap
    
  • Conda (📥 360K · ⏱️ 16.12.2020):

     conda install -c conda-forge shap
    
Lime (🥇29 · ⭐ 8.3K) - Lime: Explaining the predictions of any machine learning classifier. BSD-2
  • GitHub (👨‍💻 57 · 🔀 1.3K · 📦 1K · 📋 480 - 7% open · ⏱️ 12.01.2021):

     git clone https://github.com/marcotcr/lime
    
  • PyPi (📥 170K / month · 📦 130 · ⏱️ 03.04.2020):

     pip install lime
    
  • Conda (📥 62K · ⏱️ 28.06.2020):

     conda install -c conda-forge lime
    
eli5 (🥇28 · ⭐ 2.3K · 💤) - A library for debugging/inspecting machine learning classifiers and.. MIT
  • GitHub (👨‍💻 14 · 🔀 280 · 📦 840 · 📋 240 - 53% open · ⏱️ 22.01.2020):

     git clone https://github.com/TeamHG-Memex/eli5
    
  • PyPi (📥 240K / month · 📦 96 · ⏱️ 29.08.2019):

     pip install eli5
    
  • Conda (📥 86K · ⏱️ 15.06.2020):

     conda install -c conda-forge eli5
    
pyLDAvis (🥇28 · ⭐ 1.4K) - Python library for interactive topic model visualization. Port of.. BSD-3
  • GitHub (👨‍💻 31 · 🔀 290 · 📦 1.7K · 📋 150 - 61% open · ⏱️ 02.12.2020):

     git clone https://github.com/bmabey/pyLDAvis
    
  • PyPi (📥 60K / month · 📦 99 · ⏱️ 05.06.2018):

     pip install pyldavis
    
  • Conda (📥 21K · ⏱️ 22.06.2018):

     conda install -c conda-forge pyldavis
    
arviz (🥇28 · ⭐ 930 · 📈) - Exploratory analysis of Bayesian models with Python. Apache-2
  • GitHub (👨‍💻 62 · 🔀 170 · 📥 98 · 📦 670 · 📋 520 - 21% open · ⏱️ 16.01.2021):

     git clone https://github.com/arviz-devs/arviz
    
  • PyPi (📥 110K / month · 📦 18 · ⏱️ 23.09.2020):

     pip install arviz
    
  • Conda (📥 170K · ⏱️ 24.09.2020):

     conda install -c conda-forge arviz
    
InterpretML (🥈27 · ⭐ 3.4K) - Fit interpretable models. Explain blackbox machine learning. MIT
  • GitHub (👨‍💻 23 · 🔀 430 · 📦 60 · 📋 160 - 27% open · ⏱️ 15.01.2021):

     git clone https://github.com/interpretml/interpret
    
  • PyPi (📥 9.8K / month · 📦 2 · ⏱️ 14.01.2021):

     pip install interpret
    
Model Analysis (🥈27 · ⭐ 1K) - Model analysis tools for TensorFlow. Apache-2
  • GitHub (👨‍💻 33 · 🔀 200 · 📋 54 - 40% open · ⏱️ 14.01.2021):

     git clone https://github.com/tensorflow/model-analysis
    
  • PyPi (📥 6.6M / month · 📦 10 · ⏱️ 16.12.2020):

     pip install tensorflow-model-analysis
    
yellowbrick (🥈26 · ⭐ 3.1K) - Visual analysis and diagnostic tools to facilitate machine.. Apache-2
  • GitHub (👨‍💻 95 · 🔀 460 · 📋 590 - 13% open · ⏱️ 15.01.2021):

     git clone https://github.com/DistrictDataLabs/yellowbrick
    
  • PyPi (📥 56K / month · 📦 62 · ⏱️ 09.10.2020):

     pip install yellowbrick
    
Captum (🥈25 · ⭐ 2.1K · 📉) - Model interpretability and understanding for PyTorch. BSD-3
  • GitHub (👨‍💻 66 · 🔀 190 · 📦 100 · 📋 180 - 24% open · ⏱️ 06.01.2021):

     git clone https://github.com/pytorch/captum
    
  • PyPi (📥 7.7K / month · ⏱️ 13.11.2020):

     pip install captum
    
dtreeviz (🥈25 · ⭐ 1.3K) - A python library for decision tree visualization and model interpretation. MIT
  • GitHub (👨‍💻 13 · 🔀 180 · 📦 99 · 📋 80 - 17% open · ⏱️ 25.11.2020):

     git clone https://github.com/parrt/dtreeviz
    
  • PyPi (📥 13K / month · 📦 4 · ⏱️ 25.11.2020):

     pip install dtreeviz
    
Lucid (🥈24 · ⭐ 4K) - A collection of infrastructure and tools for research in neural.. Apache-2
  • GitHub (👨‍💻 34 · 🔀 550 · 📦 400 · 📋 160 - 41% open · ⏱️ 24.11.2020):

     git clone https://github.com/tensorflow/lucid
    
  • PyPi (📥 510 / month · 📦 6 · ⏱️ 20.12.2018):

     pip install lucid
    
DoWhy (🥈24 · ⭐ 2.6K) - DoWhy is a Python library for causal inference that supports explicit.. MIT
  • GitHub (👨‍💻 37 · 🔀 360 · 📥 19 · 📦 24 · 📋 93 - 16% open · ⏱️ 24.12.2020):

     git clone https://github.com/Microsoft/dowhy
    
  • PyPi (📥 7.9K / month · ⏱️ 12.12.2020):

     pip install dowhy
    
  • Conda (📥 900 · ⏱️ 13.12.2020):

     conda install -c conda-forge dowhy
    
Fairness 360 (🥈24 · ⭐ 1.2K) - A comprehensive set of fairness metrics for datasets and.. Apache-2
  • GitHub (👨‍💻 40 · 🔀 380 · 📦 62 · 📋 81 - 43% open · ⏱️ 08.01.2021):

     git clone https://github.com/Trusted-AI/AIF360
    
  • PyPi (📥 2.4K / month · 📦 2 · ⏱️ 02.06.2020):

     pip install aif360
    
keras-vis (🥈23 · ⭐ 2.8K · 💤) - Neural network visualization toolkit for keras. MIT
  • GitHub (👨‍💻 10 · 🔀 580 · 📦 650 · 📋 210 - 53% open · ⏱️ 20.04.2020):

     git clone https://github.com/raghakot/keras-vis
    
  • PyPi (📥 2K / month · 📦 28 · ⏱️ 06.07.2017):

     pip install keras-vis
    
keract (🥈23 · ⭐ 840) - Activation Maps (Layers Outputs) and Gradients in Keras. MIT
  • GitHub (👨‍💻 15 · 🔀 180 · 📦 66 · 📋 76 - 1% open · ⏱️ 09.01.2021):

     git clone https://github.com/philipperemy/keract
    
  • PyPi (📥 1.4K / month · 📦 5 · ⏱️ 22.12.2020):

     pip install keract
    
TreeInterpreter (🥈23 · ⭐ 640) - Package for interpreting scikit-learn's decision tree.. BSD-3
  • GitHub (👨‍💻 10 · 🔀 130 · 📦 110 · 📋 28 - 89% open · ⏱️ 10.01.2021):

     git clone https://github.com/andosa/treeinterpreter
    
  • PyPi (📥 66K / month · 📦 18 · ⏱️ 10.01.2021):

     pip install treeinterpreter
    
random-forest-importances (🥈23 · ⭐ 400) - Code to compute permutation and drop-column.. MIT
  • GitHub (👨‍💻 14 · 🔀 88 · 📦 57 · 📋 29 - 13% open · ⏱️ 08.01.2021):

     git clone https://github.com/parrt/random-forest-importances
    
  • PyPi (📥 14K / month · 📦 2 · ⏱️ 25.11.2020):

     pip install rfpimp
    
Alibi (🥉22 · ⭐ 860) - Algorithms for monitoring and explaining machine learning models. Apache-2
  • GitHub (👨‍💻 11 · 🔀 100 · 📦 54 · 📋 140 - 46% open · ⏱️ 02.12.2020):

     git clone https://github.com/SeldonIO/alibi
    
  • PyPi (📥 2.1K / month · 📦 9 · ⏱️ 20.10.2020):

     pip install alibi
    
Explainability 360 (🥉22 · ⭐ 750) - Interpretability and explainability of data and machine.. Apache-2
  • GitHub (👨‍💻 27 · 🔀 160 · 📦 12 · 📋 44 - 59% open · ⏱️ 04.12.2020):

     git clone https://github.com/Trusted-AI/AIX360
    
  • PyPi (📥 440 / month · 📦 1 · ⏱️ 28.10.2020):

     pip install aix360
    
iNNvestigate (🥉21 · ⭐ 750) - A toolbox to iNNvestigate neural networks' predictions!. BSD-2
  • GitHub (👨‍💻 18 · 🔀 170 · 📦 43 · 📋 200 - 35% open · ⏱️ 16.10.2020):

     git clone https://github.com/albermax/innvestigate
    
  • PyPi (📥 320 / month · 📦 1 · ⏱️ 14.11.2020):

     pip install innvestigate
    
fairlearn (🥉21 · ⭐ 650) - A Python package to assess and improve fairness of machine.. MIT
  • GitHub (👨‍💻 29 · 🔀 160 · 📋 190 - 41% open · ⏱️ 12.01.2021):

     git clone https://github.com/fairlearn/fairlearn
    
  • PyPi (📥 3.3K / month · 📦 1 · ⏱️ 10.11.2020):

     pip install fairlearn
    
  • Conda (📥 8.3K · ⏱️ 11.11.2020):

     conda install -c conda-forge fairlearn
    
aequitas (🥉21 · ⭐ 340) - Bias and Fairness Audit Toolkit. MIT
  • GitHub (👨‍💻 15 · 🔀 62 · 📦 63 · 📋 53 - 62% open · ⏱️ 20.12.2020):

     git clone https://github.com/dssg/aequitas
    
  • PyPi (📥 210 / month · 📦 3 · ⏱️ 16.12.2020):

     pip install aequitas
    
checklist (🥉20 · ⭐ 1.2K) - Beyond Accuracy: Behavioral Testing of NLP models with CheckList. MIT
  • GitHub (👨‍💻 7 · 🔀 97 · 📦 5 · 📋 56 - 5% open · ⏱️ 16.12.2020):

     git clone https://github.com/marcotcr/checklist
    
  • PyPi (📥 240 / month · 📦 2 · ⏱️ 21.09.2020):

     pip install checklist
    
tf-explain (🥉20 · ⭐ 750) - Interpretability Methods for tf.keras models with Tensorflow 2.x. MIT
  • GitHub (👨‍💻 12 · 🔀 69 · 📦 59 · 📋 75 - 45% open · ⏱️ 05.01.2021):

     git clone https://github.com/sicara/tf-explain
    
  • PyPi (📥 950 / month · 📦 1 · ⏱️ 14.02.2020):

     pip install tf-explain
    
deeplift (🥉20 · ⭐ 500) - Public facing deeplift repo. MIT
  • GitHub (👨‍💻 11 · 🔀 120 · 📦 43 · 📋 75 - 38% open · ⏱️ 11.11.2020):

     git clone https://github.com/kundajelab/deeplift
    
  • PyPi (📥 450 / month · 📦 2 · ⏱️ 11.11.2020):

     pip install deeplift
    
sklearn-evaluation (🥉20 · ⭐ 290) - Machine learning model evaluation made easy: plots,.. MIT
  • GitHub (👨‍💻 5 · 🔀 25 · 📦 26 · 📋 35 - 22% open · ⏱️ 28.12.2020):

     git clone https://github.com/edublancas/sklearn-evaluation
    
  • PyPi (📥 210 / month · 📦 2 · ⏱️ 28.12.2020):

     pip install sklearn-evaluation
    
explainerdashboard (🥉19 · ⭐ 250) - Quickly build Explainable AI dashboards that show the inner.. MIT
  • GitHub (👨‍💻 8 · 🔀 37 · 📦 3 · 📋 55 - 16% open · ⏱️ 14.01.2021):

     git clone https://github.com/oegedijk/explainerdashboard
    
  • PyPi (📥 1.3K / month · ⏱️ 14.01.2021):

     pip install explainerdashboard
    
What-If Tool (🥉18 · ⭐ 420) - Source code/webpage/demos for the What-If Tool. Apache-2
  • GitHub (👨‍💻 18 · 🔀 89 · 📋 69 - 50% open · ⏱️ 14.01.2021):

     git clone https://github.com/PAIR-code/what-if-tool
    
  • PyPi (📥 2.3K / month · ⏱️ 28.06.2020):

     pip install witwidget
    
  • NPM (📥 1.6K / month · ⏱️ 13.01.2021):

     npm install wit-widget
    
fairness-indicators (🥉18 · ⭐ 170) - Tensorflow's Fairness Evaluation and Visualization.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 49 · 📋 21 - 76% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/fairness-indicators
    
  • PyPi (📥 480 / month · ⏱️ 17.12.2020):

     pip install fairness-indicators
    
DiCE (🥉17 · ⭐ 430) - Generate Diverse Counterfactual Explanations for any machine.. MIT
  • GitHub (👨‍💻 8 · 🔀 63 · 📋 30 - 40% open · ⏱️ 15.12.2020):

     git clone https://github.com/interpretml/DiCE
    
  • PyPi (📥 160 / month · ⏱️ 22.09.2020):

     pip install dice-ml
    
ExplainX.ai (🥉17 · ⭐ 160) - Explainable AI framework for data scientists. Explain & debug any.. MIT
  • GitHub (👨‍💻 4 · 🔀 28 · 📋 20 - 35% open · ⏱️ 07.01.2021):

     git clone https://github.com/explainX/explainx
    
  • PyPi (📥 460 / month · ⏱️ 07.12.2020):

     pip install explainx
    
LIT (🥉16 · ⭐ 2.3K · 🐣) - The Language Interpretability Tool: Interactively analyze NLP models.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 230 · 📋 54 - 50% open · ⏱️ 17.11.2020):

     git clone https://github.com/PAIR-code/lit
    
  • PyPi (📥 140 / month · ⏱️ 17.11.2020):

     pip install lit-nlp
    
FlashTorch (🥉16 · ⭐ 530 · 💤) - Visualization toolkit for neural networks in PyTorch! Demo --. MIT
  • GitHub (👨‍💻 2 · 🔀 66 · 📦 6 · 📋 27 - 22% open · ⏱️ 29.05.2020):

     git clone https://github.com/MisaOgura/flashtorch
    
  • PyPi (📥 170 / month · ⏱️ 29.05.2020):

     pip install flashtorch
    
tcav (🥉16 · ⭐ 420) - Code for the TCAV ML interpretability project. Apache-2
  • GitHub (👨‍💻 16 · 🔀 96 · 📦 6 · 📋 42 - 4% open · ⏱️ 14.07.2020):

     git clone https://github.com/tensorflow/tcav
    
  • PyPi (📥 36 / month · 📦 1 · ⏱️ 10.06.2019):

     pip install tcav
    
LOFO (🥉16 · ⭐ 290) - Leave One Feature Out Importance. MIT
  • GitHub (👨‍💻 3 · 🔀 40 · 📦 3 · 📋 11 - 9% open · ⏱️ 02.07.2020):

     git clone https://github.com/aerdem4/lofo-importance
    
  • PyPi (📥 180 / month · ⏱️ 02.07.2020):

     pip install lofo-importance
    
model-card-toolkit (🥉16 · ⭐ 150 · 🐣) - a tool that leverages rich metadata and lineage.. Apache-2
  • GitHub (👨‍💻 7 · 🔀 21 · 📦 1 · ⏱️ 14.01.2021):

     git clone https://github.com/tensorflow/model-card-toolkit
    
  • PyPi (📥 69 / month · ⏱️ 16.10.2020):

     pip install model-card-toolkit
    
Anchor (🥉14 · ⭐ 610) - Code for High-Precision Model-Agnostic Explanations paper. BSD-2
  • GitHub (👨‍💻 8 · 🔀 87 · 📋 54 - 16% open · ⏱️ 10.09.2020):

     git clone https://github.com/marcotcr/anchor
    
  • PyPi (📥 440 / month · ⏱️ 26.06.2020):

     pip install anchor_exp
    
Show 7 hidden projects...
  • scikit-plot (🥈23 · ⭐ 2K · 💀) - An intuitive library to add plotting functionality to scikit-.. MIT
  • Skater (🥉20 · ⭐ 960 · 💤) - Python Library for Model Interpretation/Explanations. ❗️UPL-1.0
  • DALEX (🥉18 · ⭐ 750) - moDel Agnostic Language for Exploration and eXplanation. ❗️GPL-3.0
  • XAI (🥉16 · ⭐ 560 · 💀) - XAI - An eXplainability toolbox for machine learning. MIT
  • imodels (🥉16 · ⭐ 140 · ➕) - Interpretable ML package for concise, transparent, and accurate.. MIT
  • contextual-ai (🥉13 · ⭐ 65) - Contextual AI adds explainability to different stages of.. Apache-2
  • Attribution Priors (🥉11 · ⭐ 71) - Tools for training explainable models using.. MIT

Vector Similarity Search (ANN)

Back to top

Libraries for Approximate Nearest Neighbor Search and Vector Indexing/Similarity Search.

🔗 ANN Benchmarks ( ⭐ 2K) - Benchmarks of approximate nearest neighbor libraries in Python.

Faiss (🥇29 · ⭐ 12K) - A library for efficient similarity search and clustering of dense vectors. MIT
  • GitHub (👨‍💻 72 · 🔀 2.1K · 📦 300 · 📋 1.3K - 7% open · ⏱️ 13.01.2021):

     git clone https://github.com/facebookresearch/faiss
    
  • PyPi (📥 6.7K / month · 📦 6 · ⏱️ 16.10.2020):

     pip install pymilvus
    
  • Conda (📥 21K · ⏱️ 12.12.2020):

     conda install -c conda-forge faiss
    
Annoy (🥇29 · ⭐ 8K) - Approximate Nearest Neighbors in C++/Python optimized for memory usage.. Apache-2
  • GitHub (👨‍💻 68 · 🔀 860 · 📦 1.4K · 📋 290 - 8% open · ⏱️ 03.12.2020):

     git clone https://github.com/spotify/annoy
    
  • PyPi (📥 220K / month · 📦 360 · ⏱️ 18.09.2020):

     pip install annoy
    
NMSLIB (🥈28 · ⭐ 2.3K) - Non-Metric Space Library (NMSLIB): An efficient similarity search.. Apache-2
  • GitHub (👨‍💻 44 · 🔀 330 · 📦 310 · 📋 360 - 12% open · ⏱️ 08.01.2021):

     git clone https://github.com/nmslib/nmslib
    
  • PyPi (📥 53K / month · 📦 52 · ⏱️ 08.01.2021):

     pip install nmslib
    
  • Conda (📥 10K · ⏱️ 08.01.2021):

     conda install -c conda-forge nmslib
    
Milvus (🥈25 · ⭐ 4.9K) - An open source embedding vector similarity search engine powered by.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 760 · 📋 2.1K - 10% open · ⏱️ 15.01.2021):

     git clone https://github.com/milvus-io/milvus
    
  • PyPi (📥 6.7K / month · 📦 6 · ⏱️ 16.10.2020):

     pip install pymilvus
    
  • Docker Hub (📥 220K · ⭐ 9 · ⏱️ 06.01.2021):

     docker pull milvusdb/milvus
    
hnswlib (🥈22 · ⭐ 1.3K) - Header-only C++/python library for fast approximate nearest neighbors. Apache-2
  • GitHub (👨‍💻 32 · 🔀 230 · 📋 180 - 43% open · ⏱️ 18.08.2020):

     git clone https://github.com/nmslib/hnswlib
    
  • PyPi (📥 15K / month · 📦 2 · ⏱️ 13.07.2020):

     pip install hnswlib
    
Magnitude (🥉21 · ⭐ 1.4K) - A fast, efficient universal vector embedding utility package. MIT
  • GitHub (👨‍💻 4 · 🔀 98 · 📦 170 · 📋 74 - 32% open · ⏱️ 17.07.2020):

     git clone https://github.com/plasticityai/magnitude
    
  • PyPi (📥 5.2K / month · 📦 14 · ⏱️ 25.05.2020):

     pip install pymagnitude
    
PyNNDescent (🥉21 · ⭐ 360) - A Python nearest neighbor descent for approximate nearest neighbors. BSD-2
  • GitHub (👨‍💻 10 · 🔀 43 · 📋 56 - 48% open · ⏱️ 16.01.2021):

     git clone https://github.com/lmcinnes/pynndescent
    
  • PyPi (📥 16K / month · 📦 3 · ⏱️ 19.11.2020):

     pip install pynndescent
    
  • Conda (📥 21K · ⏱️ 19.11.2020):

     conda install -c conda-forge pynndescent
    
NGT (🥉19 · ⭐ 610) - Nearest Neighbor Search with Neighborhood Graph and Tree for High-.. Apache-2
  • GitHub (👨‍💻 10 · 🔀 67 · 📋 74 - 10% open · ⏱️ 16.01.2021):

     git clone https://github.com/yahoojapan/NGT
    
  • PyPi (📥 2.9K / month · 📦 3 · ⏱️ 23.09.2020):

     pip install ngt
    
N2 (🥉19 · ⭐ 450) - TOROS N2 - lightweight approximate Nearest Neighbor library which runs fast.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 51 · 📦 12 · 📋 39 - 43% open · ⏱️ 21.10.2020):

     git clone https://github.com/kakao/n2
    
  • PyPi (📥 320 / month · 📦 2 · ⏱️ 16.10.2020):

     pip install n2
    
Show 2 hidden projects...
  • NearPy (🥉20 · ⭐ 660 · 💀) - Python framework for fast (approximated) nearest neighbour search in.. MIT
  • PySparNN (🥉11 · ⭐ 840 · 💀) - Approximate Nearest Neighbor Search for Sparse Data in Python!. BSD-3

Probabilistics & Statistics

Back to top

Libraries providing capabilities for probabilistic programming/reasoning, bayesian inference, gaussian processes, or statistics.

PyMC3 (🥇32 · ⭐ 5.5K) - Probabilistic Programming in Python: Bayesian Modeling and.. Apache-2
  • GitHub (👨‍💻 300 · 🔀 1.3K · 📥 140 · 📦 2K · 📋 2.1K - 7% open · ⏱️ 16.01.2021):

     git clone https://github.com/pymc-devs/pymc3
    
  • PyPi (📥 110K / month · 📦 290 · ⏱️ 07.12.2020):

     pip install pymc3
    
  • Conda (📥 240K · ⏱️ 07.12.2020):

     conda install -c conda-forge pymc3
    
tensorflow-probability (🥇31 · ⭐ 3.2K) - Probabilistic reasoning and statistical analysis in.. Apache-2
  • GitHub (👨‍💻 400 · 🔀 840 · 📦 1 · 📋 940 - 44% open · ⏱️ 16.01.2021):

     git clone https://github.com/tensorflow/probability
    
  • PyPi (📥 310K / month · 📦 250 · ⏱️ 29.12.2020):

     pip install tensorflow-probability
    
  • Conda (📥 29K · ⏱️ 13.03.2020):

     conda install -c conda-forge tensorflow-probability
    
hmmlearn (🥇29 · ⭐ 2.2K) - Hidden Markov Models in Python, with scikit-learn like API. BSD-3
  • GitHub (👨‍💻 31 · 🔀 600 · 📦 850 · 📋 340 - 18% open · ⏱️ 28.12.2020):

     git clone https://github.com/hmmlearn/hmmlearn
    
  • PyPi (📥 120K / month · 📦 210 · ⏱️ 12.09.2020):

     pip install hmmlearn
    
  • Conda (📥 58K · ⏱️ 01.11.2020):

     conda install -c conda-forge hmmlearn
    
GPyTorch (🥈28 · ⭐ 2.2K) - A highly efficient and modular implementation of Gaussian Processes.. MIT
  • GitHub (👨‍💻 63 · 🔀 310 · 📦 190 · 📋 820 - 24% open · ⏱️ 14.01.2021):

     git clone https://github.com/cornellius-gp/gpytorch
    
  • PyPi (📥 31K / month · 📦 8 · ⏱️ 30.11.2020):

     pip install gpytorch
    
Pyro (🥈27 · ⭐ 6.7K) - Deep universal probabilistic programming with Python and PyTorch. Apache-2
  • GitHub (👨‍💻 100 · 🔀 780 · 📦 360 · 📋 810 - 16% open · ⏱️ 16.01.2021):

     git clone https://github.com/pyro-ppl/pyro
    
  • PyPi (📥 6.3K / month · 📦 36 · ⏱️ 17.11.2020):

     pip install pyro-ppl
    
filterpy (🥈27 · ⭐ 1.6K) - Python Kalman filtering and optimal estimation library. Implements.. MIT
  • GitHub (👨‍💻 33 · 🔀 400 · 📦 680 · 📋 170 - 12% open · ⏱️ 04.01.2021):

     git clone https://github.com/rlabbe/filterpy
    
  • PyPi (📥 14K / month · 📦 210 · ⏱️ 10.10.2018):

     pip install filterpy
    
  • Conda (📥 56K · ⏱️ 05.05.2020):

     conda install -c conda-forge filterpy
    
GPflow (🥈27 · ⭐ 1.4K) - Gaussian processes in TensorFlow. Apache-2
  • GitHub (👨‍💻 68 · 🔀 390 · 📦 210 · 📋 690 - 12% open · ⏱️ 13.01.2021):

     git clone https://github.com/GPflow/GPflow
    
  • PyPi (📥 1.9K / month · 📦 17 · ⏱️ 01.12.2020):

     pip install gpflow
    
  • Conda (📥 8K · ⏱️ 06.11.2018):

     conda install -c conda-forge gpflow
    
pomegranate (🥉26 · ⭐ 2.6K) - Fast, flexible and easy to use probabilistic modelling in Python. MIT
  • GitHub (👨‍💻 61 · 🔀 460 · 📦 380 · 📋 570 - 5% open · ⏱️ 09.01.2021):

     git clone https://github.com/jmschrei/pomegranate
    
  • PyPi (📥 21K / month · 📦 56 · ⏱️ 09.01.2021):

     pip install pomegranate
    
  • Conda (📥 42K · ⏱️ 01.11.2020):

     conda install -c conda-forge pomegranate
    
pgmpy (🥉24 · ⭐ 1.7K) - Python Library for learning (Structure and Parameter) and inference.. MIT
  • GitHub (👨‍💻 88 · 🔀 560 · 📥 48 · 📦 180 · 📋 720 - 31% open · ⏱️ 10.01.2021):

     git clone https://github.com/pgmpy/pgmpy
    
  • PyPi (📥 23K / month · 📦 12 · ⏱️ 30.12.2020):

     pip install pgmpy
    
SALib (🥉24 · ⭐ 420) - Sensitivity Analysis Library in Python (Numpy). Contains Sobol, Morris,.. MIT
  • GitHub (👨‍💻 23 · 🔀 140 · 📋 220 - 18% open · ⏱️ 12.11.2020):

     git clone https://github.com/SALib/SALib
    
  • PyPi (📥 12K / month · 📦 44 · ⏱️ 19.11.2020):

     pip install salib
    
  • Conda (📥 57K · ⏱️ 24.10.2020):

     conda install -c conda-forge salib
    
bambi (🥉20 · ⭐ 540) - BAyesian Model-Building Interface (Bambi) in Python. MIT
  • GitHub (👨‍💻 15 · 🔀 50 · 📦 8 · 📋 150 - 16% open · ⏱️ 15.01.2021):

     git clone https://github.com/bambinos/bambi
    
  • PyPi (📥 220 / month · 📦 3 · ⏱️ 17.12.2020):

     pip install bambi
    
scikit-posthocs (🥉20 · ⭐ 170) - Pairwise Multiple Comparisons (Post Hoc) Tests in Python. MIT
  • GitHub (👨‍💻 7 · 🔀 18 · 📥 16 · 📦 100 · 📋 35 - 11% open · ⏱️ 19.10.2020):

     git clone https://github.com/maximtrp/scikit-posthocs
    
  • PyPi (📥 7.8K / month · 📦 3 · ⏱️ 12.10.2020):

     pip install scikit-posthocs
    
pyhsmm (🥉18 · ⭐ 480) - Bayesian inference in HSMMs and HMMs. MIT
  • GitHub (👨‍💻 13 · 🔀 150 · 📦 17 · 📋 94 - 36% open · ⏱️ 24.08.2020):

     git clone https://github.com/mattjj/pyhsmm
    
  • PyPi (📥 53 / month · 📦 4 · ⏱️ 10.05.2017):

     pip install pyhsmm
    
Funsor (🥉18 · ⭐ 160) - Functional tensors for probabilistic programming. Apache-2
  • GitHub (👨‍💻 9 · 🔀 15 · 📦 7 · 📋 110 - 49% open · ⏱️ 14.01.2021):

     git clone https://github.com/pyro-ppl/funsor
    
  • PyPi (📥 200 / month · ⏱️ 15.10.2020):

     pip install funsor
    
Baal (🥉17 · ⭐ 310) - Using approximate bayesian posteriors in deep nets for active learning. Apache-2
  • GitHub (👨‍💻 8 · 🔀 24 · 📦 6 · 📋 34 - 35% open · ⏱️ 11.12.2020):

     git clone https://github.com/ElementAI/baal
    
  • PyPi (📥 110 / month · ⏱️ 03.11.2020):

     pip install baal
    
Orbit (🥉17 · ⭐ 300) - Bayesian forecasting with object-oriented design and probabilistic.. Apache-2
  • GitHub (👨‍💻 7 · 🔀 13 · 📦 3 · 📋 160 - 13% open · ⏱️ 08.01.2021):

     git clone https://github.com/uber/orbit
    
  • PyPi (📥 62 / month · ⏱️ 15.11.2020):

     pip install orbit-ml
    
Show 5 hidden projects...
  • PyStan (🥈27 · ⭐ 910) - PyStan, the Python interface to Stan. ❗️GPL-3.0
  • patsy (🥈27 · ⭐ 730 · 💀) - Describing statistical models in Python using symbolic formulas. BSD-2
  • Edward (🥉24 · ⭐ 4.6K · 💀) - A probabilistic programming language in TensorFlow. Deep.. Apache-2
  • pingouin (🥉22 · ⭐ 620) - Statistical package in Python based on Pandas. ❗️GPL-3.0
  • ZhuSuan (🥉14 · ⭐ 2K · 💀) - A probabilistic programming library for Bayesian deep learning,.. MIT

Adversarial Robustness

Back to top

Libraries for testing the robustness of machine learning models against attacks with adversarial/malicious examples.

CleverHans (🥇25 · ⭐ 4.9K) - An adversarial example library for constructing attacks,.. MIT
  • GitHub (👨‍💻 110 · 🔀 1.2K · 📦 180 · 📋 450 - 16% open · ⏱️ 13.01.2021):

     git clone https://github.com/cleverhans-lab/cleverhans
    
  • PyPi (📥 520 / month · 📦 8 · ⏱️ 27.11.2018):

     pip install cleverhans
    
Foolbox (🥇25 · ⭐ 1.8K) - A Python toolbox to create adversarial examples that fool neural networks.. MIT
  • GitHub (👨‍💻 30 · 🔀 310 · 📦 160 · 📋 300 - 13% open · ⏱️ 20.12.2020):

     git clone https://github.com/bethgelab/foolbox
    
  • PyPi (📥 970 / month · 📦 18 · ⏱️ 26.09.2020):

     pip install foolbox
    
ART (🥈23 · ⭐ 2K) - Adversarial Robustness Toolbox (ART) - Python Library for Machine Learning.. MIT
  • GitHub (👨‍💻 59 · 🔀 560 · 📦 98 · 📋 400 - 18% open · ⏱️ 11.01.2021):

     git clone https://github.com/Trusted-AI/adversarial-robustness-toolbox
    
  • PyPi (📥 1.7K / month · ⏱️ 09.01.2021):

     pip install adversarial-robustness-toolbox
    
TextAttack (🥈23 · ⭐ 1.2K) - TextAttack is a Python framework for adversarial attacks, data.. MIT
  • GitHub (👨‍💻 31 · 🔀 130 · 📦 15 · 📋 110 - 23% open · ⏱️ 15.01.2021):

     git clone https://github.com/QData/TextAttack
    
  • PyPi (📥 840 / month · ⏱️ 27.12.2020):

     pip install textattack
    
robustness (🥉18 · ⭐ 450) - A library for experimenting with, training and evaluating neural.. MIT
  • GitHub (👨‍💻 11 · 🔀 80 · 📦 31 · 📋 54 - 9% open · ⏱️ 01.12.2020):

     git clone https://github.com/MadryLab/robustness
    
  • PyPi (📥 250 / month · 📦 1 · ⏱️ 05.08.2020):

     pip install robustness
    
AdvBox (🥉16 · ⭐ 1K) - Advbox is a toolbox to generate adversarial examples that fool neural.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 220 · 📋 32 - 18% open · ⏱️ 26.08.2020):

     git clone https://github.com/advboxes/AdvBox
    
  • PyPi (📥 19 / month · ⏱️ 05.12.2018):

     pip install advbox
    
Show 2 hidden projects...
  • advertorch (🥉17 · ⭐ 790 · 💤) - A Toolbox for Adversarial Robustness Research. ❗️GPL-3.0
  • Adversary (🥉13 · ⭐ 340 · 💀) - Tool to generate adversarial text examples and test machine.. MIT

GPU Utilities

Back to top

Libraries that require and make use of CUDA/GPU system capabilities to optimize data handling and machine learning tasks.

CuPy (🥇30 · ⭐ 4.8K) - A NumPy-compatible array library accelerated by CUDA. MIT
  • GitHub (👨‍💻 250 · 🔀 430 · 📥 5.6K · 📦 630 · 📋 1.2K - 29% open · ⏱️ 15.01.2021):

     git clone https://github.com/cupy/cupy
    
  • PyPi (📥 7.1K / month · 📦 190 · ⏱️ 25.12.2020):

     pip install cupy
    
  • Conda (📥 340K · ⏱️ 14.01.2021):

     conda install -c conda-forge cupy
    
  • Docker Hub (📥 48K · ⭐ 6 · ⏱️ 15.01.2021):

     docker pull cupy/cupy
    
PyCUDA (🥇27 · ⭐ 1.1K) - CUDA integration for Python, plus shiny features. MIT
  • GitHub (👨‍💻 68 · 🔀 210 · 📦 710 · 📋 160 - 24% open · ⏱️ 04.01.2021):

     git clone https://github.com/inducer/pycuda
    
  • PyPi (📥 13K / month · 📦 290 · ⏱️ 07.10.2020):

     pip install pycuda
    
gpustat (🥈26 · ⭐ 2.2K) - A simple command-line utility for querying and monitoring GPU status. MIT
  • GitHub (👨‍💻 10 · 🔀 180 · 📦 570 · 📋 67 - 29% open · ⏱️ 13.01.2021):

     git clone https://github.com/wookayin/gpustat
    
  • PyPi (📥 130K / month · 📦 58 · ⏱️ 02.01.2021):

     pip install gpustat
    
  • Conda (📥 9.2K · ⏱️ 24.11.2020):

     conda install -c conda-forge gpustat
    
Apex (🥈23 · ⭐ 5K) - A PyTorch Extension: Tools for easy mixed precision and distributed.. BSD-3
  • GitHub (👨‍💻 77 · 🔀 660 · 📦 420 · 📋 780 - 54% open · ⏱️ 17.12.2020):

     git clone https://github.com/NVIDIA/apex
    
  • Conda (📥 29K · ⏱️ 23.11.2020):

     conda install -c conda-forge nvidia-apex
    
ArrayFire (🥈22 · ⭐ 3.3K) - ArrayFire: a general purpose GPU library. BSD-3
  • GitHub (👨‍💻 80 · 🔀 450 · 📥 590 · 📋 1.5K - 17% open · ⏱️ 11.01.2021):

     git clone https://github.com/arrayfire/arrayfire
    
  • PyPi (📥 500 / month · 📦 2 · ⏱️ 18.10.2018):

     pip install arrayfire
    
py3nvml (🥈22 · ⭐ 160 · 💤) - Python 3 Bindings for NVML library. Get NVIDIA GPU status inside.. BSD-3
  • GitHub (👨‍💻 6 · 🔀 24 · 📦 220 · 📋 11 - 18% open · ⏱️ 23.04.2020):

     git clone https://github.com/fbcotter/py3nvml
    
  • PyPi (📥 93K / month · 📦 20 · ⏱️ 06.04.2020):

     pip install py3nvml
    
  • Conda (📥 11K · ⏱️ 10.10.2020):

     conda install -c conda-forge py3nvml
    
cuDF (🥉21 · ⭐ 3.6K) - cuDF - GPU DataFrame Library. Apache-2
  • GitHub (👨‍💻 200 · 🔀 470 · 📋 3.3K - 17% open · ⏱️ 15.01.2021):

     git clone https://github.com/rapidsai/cudf
    
  • PyPi (📥 600 / month · 📦 4 · ⏱️ 01.06.2020):

     pip install cudf
    
scikit-cuda (🥉21 · ⭐ 790) - Python interface to GPU-powered libraries. BSD-3
  • GitHub (👨‍💻 44 · 🔀 150 · 📦 110 · 📋 200 - 18% open · ⏱️ 13.07.2020):

     git clone https://github.com/lebedov/scikit-cuda
    
  • PyPi (📥 490 / month · 📦 31 · ⏱️ 27.05.2019):

     pip install scikit-cuda
    
DALI (🥉20 · ⭐ 3K) - A library containing both highly optimized building blocks and an.. Apache-2
  • GitHub (👨‍💻 53 · 🔀 360 · 📋 780 - 23% open · ⏱️ 15.01.2021):

     git clone https://github.com/NVIDIA/DALI
    
cuML (🥉19 · ⭐ 1.9K) - cuML - RAPIDS Machine Learning Library. Apache-2
  • GitHub (👨‍💻 120 · 🔀 300 · 📋 1.5K - 41% open · ⏱️ 16.01.2021):

     git clone https://github.com/rapidsai/cuml
    
  • PyPi (📥 280 / month · ⏱️ 11.04.2019):

     pip install cuml
    
BlazingSQL (🥉17 · ⭐ 1.4K) - BlazingSQL is a lightweight, GPU accelerated, SQL engine for.. Apache-2
  • GitHub (👨‍💻 39 · 🔀 130 · 📋 560 - 43% open · ⏱️ 13.01.2021):

     git clone https://github.com/BlazingDB/blazingsql
    
  • Conda (📥 910 · ⏱️ 11.11.2019):

     conda install -c blazingsql blazingsql-protocol
    
cuGraph (🥉16 · ⭐ 620) - cuGraph - RAPIDS Graph Analytics Library. Apache-2
  • GitHub (👨‍💻 54 · 🔀 130 · 📋 550 - 15% open · ⏱️ 15.01.2021):

     git clone https://github.com/rapidsai/cugraph
    
  • PyPi (📥 30 / month · ⏱️ 04.04.2019):

     pip install cugraph
    
SpeedTorch (🥉16 · ⭐ 590 · 💤) - Library for faster pinned CPU - GPU transfer in Pytorch. MIT
  • GitHub (👨‍💻 3 · 🔀 36 · 📦 3 · 📋 5 - 60% open · ⏱️ 21.02.2020):

     git clone https://github.com/Santosh-Gupta/SpeedTorch
    
  • PyPi (📥 66 / month · 📦 2 · ⏱️ 06.01.2020):

     pip install SpeedTorch
    
cuSignal (🥉15 · ⭐ 430) - GPU accelerated signal processing. Apache-2
  • GitHub (👨‍💻 28 · 🔀 57 · 📋 98 - 17% open · ⏱️ 15.01.2021):

     git clone https://github.com/rapidsai/cusignal
    
Show 3 hidden projects...
  • GPUtil (🥈23 · ⭐ 660 · 💀) - A Python module for getting the GPU status from NVIDA GPUs using.. MIT
  • nvidia-ml-py3 (🥉17 · ⭐ 60 · 💀) - Python 3 Bindings for the NVIDIA Management Library. BSD-3
  • ipyexperiments (🥉16 · ⭐ 120) - jupyter/ipython experiment containers for GPU and.. Apache-2

Tensorflow Utilities

Back to top

Libraries that extend TensorFlow with additional capabilities.

tensor2tensor (🥇32 · ⭐ 11K) - Library of deep learning models and datasets designed to.. Apache-2
  • GitHub (👨‍💻 230 · 🔀 2.7K · 📦 880 · 📋 1.2K - 45% open · ⏱️ 16.11.2020):

     git clone https://github.com/tensorflow/tensor2tensor
    
  • PyPi (📥 81K / month · 📦 82 · ⏱️ 17.06.2020):

     pip install tensor2tensor
    
tensorflow-hub (🥇32 · ⭐ 2.7K) - A library for transfer learning by reusing parts of.. Apache-2
  • GitHub (👨‍💻 66 · 🔀 1.4K · 📦 5K · 📋 540 - 8% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/hub
    
  • PyPi (📥 720K / month · 📦 310 · ⏱️ 06.01.2021):

     pip install tensorflow-hub
    
  • Conda (📥 48K · ⏱️ 24.08.2020):

     conda install -c conda-forge tensorflow-hub
    
TF Addons (🥈30 · ⭐ 1.2K) - Useful extra functionality for TensorFlow 2.x maintained by.. Apache-2
  • GitHub (👨‍💻 140 · 🔀 420 · 📦 2K · 📋 780 - 21% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/addons
    
  • PyPi (📥 210K / month · 📦 18 · ⏱️ 23.12.2020):

     pip install tensorflow-addons
    
TensorFlow Transform (🥈29 · ⭐ 850) - Input pipeline framework. Apache-2
  • GitHub (👨‍💻 27 · 🔀 160 · 📦 390 · 📋 150 - 21% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/transform
    
  • PyPi (📥 7.4M / month · 📦 84 · ⏱️ 16.12.2020):

     pip install tensorflow-transform
    
efficientnet (🥈26 · ⭐ 1.7K) - Implementation of EfficientNet model. Keras and.. Apache-2
  • GitHub (👨‍💻 8 · 🔀 370 · 📥 140K · 📦 490 · 📋 100 - 47% open · ⏱️ 15.09.2020):

     git clone https://github.com/qubvel/efficientnet
    
  • PyPi (📥 82K / month · 📦 2 · ⏱️ 15.09.2020):

     pip install efficientnet
    
TF Model Optimization (🥈26 · ⭐ 950) - A toolkit to optimize ML models for deployment for.. Apache-2
  • GitHub (👨‍💻 43 · 🔀 210 · 📦 530 · 📋 160 - 44% open · ⏱️ 13.01.2021):

     git clone https://github.com/tensorflow/model-optimization
    
  • PyPi (📥 50K / month · 📦 3 · ⏱️ 14.09.2020):

     pip install tensorflow-model-optimization
    
TensorFlow I/O (🥉25 · ⭐ 400) - Dataset, streaming, and file system extensions.. Apache-2
  • GitHub (👨‍💻 72 · 🔀 140 · 📦 91 · 📋 380 - 28% open · ⏱️ 14.01.2021):

     git clone https://github.com/tensorflow/io
    
  • PyPi (📥 44K / month · ⏱️ 15.12.2020):

     pip install tensorflow-io
    
TensorFlow Cloud (🥉23 · ⭐ 220) - The TensorFlow Cloud repository provides APIs that.. Apache-2
  • GitHub (👨‍💻 16 · 🔀 43 · 📦 14 · 📋 71 - 64% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/cloud
    
  • PyPi (📥 87K / month · ⏱️ 06.01.2021):

     pip install tensorflow-cloud
    
Neural Structured Learning (🥉22 · ⭐ 770) - Training neural models with structured signals. Apache-2
  • GitHub (👨‍💻 26 · 🔀 140 · 📦 21 · 📋 43 - 11% open · ⏱️ 15.01.2021):

     git clone https://github.com/tensorflow/neural-structured-learning
    
  • PyPi (📥 2K / month · ⏱️ 18.08.2020):

     pip install neural-structured-learning
    
TensorNets (🥉20 · ⭐ 970) - High level network definitions with pre-trained weights in.. MIT
  • GitHub (👨‍💻 6 · 🔀 180 · 📦 29 · 📋 56 - 25% open · ⏱️ 02.01.2021):

     git clone https://github.com/taehoonlee/tensornets
    
  • PyPi (📥 260 / month · 📦 2 · ⏱️ 31.03.2020):

     pip install tensornets
    
tffm (🥉19 · ⭐ 750 · 💤) - TensorFlow implementation of an arbitrary order Factorization Machine. MIT
  • GitHub (👨‍💻 10 · 🔀 170 · 📦 9 · 📋 37 - 45% open · ⏱️ 22.05.2020):

     git clone https://github.com/geffy/tffm
    
  • PyPi (📥 1.2K / month · 📦 1 · ⏱️ 27.01.2018):

     pip install tffm
    
Saliency (🥉16 · ⭐ 620) - TensorFlow implementation for SmoothGrad, Grad-CAM, Guided.. Apache-2
  • GitHub (👨‍💻 12 · 🔀 130 · 📦 13 · 📋 21 - 52% open · ⏱️ 01.10.2020):

     git clone https://github.com/PAIR-code/saliency
    
  • PyPi (📥 140 / month · ⏱️ 26.09.2019):

     pip install saliency
    
TF Compression (🥉16 · ⭐ 420) - Data compression in TensorFlow. Apache-2
  • GitHub (👨‍💻 9 · 🔀 150 · 📋 55 - 1% open · ⏱️ 12.01.2021):

     git clone https://github.com/tensorflow/compression
    
  • PyPi (📥 410 / month · 📦 1 · ⏱️ 06.01.2021):

     pip install tensorflow-compression
    

Sklearn Utilities

Back to top

Libraries that extend scikit-learn with additional capabilities.

imbalanced-learn (🥇30 · ⭐ 5K) - A Python Package to Tackle the Curse of Imbalanced.. MIT
  • GitHub (👨‍💻 51 · 🔀 1.1K · 📦 3.8K · 📋 450 - 10% open · ⏱️ 03.11.2020):

     git clone https://github.com/scikit-learn-contrib/imbalanced-learn
    
  • PyPi (📥 690K / month · 📦 280 · ⏱️ 09.06.2020):

     pip install imbalanced-learn
    
  • Conda (📥 110K · ⏱️ 14.06.2020):

     conda install -c conda-forge imbalanced-learn
    
MLxtend (🥇30 · ⭐ 3.3K) - A library of extension and helper modules for Python's data.. BSD-3
  • GitHub (👨‍💻 79 · 🔀 680 · 📦 2.4K · 📋 360 - 27% open · ⏱️ 09.01.2021):

     git clone https://github.com/rasbt/mlxtend
    
  • PyPi (📥 190K / month · 📦 95 · ⏱️ 26.11.2020):

     pip install mlxtend
    
  • Conda (📥 150K · ⏱️ 26.11.2020):

     conda install -c conda-forge mlxtend
    
category_encoders (🥈25 · ⭐ 1.6K) - A library of sklearn compatible categorical variable.. BSD-3
  • GitHub (👨‍💻 34 · 🔀 290 · 📋 200 - 32% open · ⏱️ 31.07.2020):

     git clone https://github.com/scikit-learn-contrib/category_encoders
    
  • PyPi (📥 190K / month · 📦 23 · ⏱️ 14.10.2018):

     pip install category_encoders
    
  • Conda (📥 91K · ⏱️ 29.04.2020):

     conda install -c conda-forge category_encoders
    
combo (🥈24 · ⭐ 470) - A Python Toolbox for Machine Learning Model Combination. BSD-2 xgboost
  • GitHub (👨‍💻 1 · 🔀 80 · 📦 250 · 📋 10 - 70% open · ⏱️ 23.12.2020):

     git clone https://github.com/yzhao062/combo
    
  • PyPi (📥 78K / month · ⏱️ 23.12.2020):

     pip install combo
    
sklearn-contrib-lightning (🥈23 · ⭐ 1.4K) - Large-scale linear classification, regression and.. BSD-3
  • GitHub (👨‍💻 16 · 🔀 190 · 📦 72 · 📋 85 - 57% open · ⏱️ 04.01.2021):

     git clone https://github.com/scikit-learn-contrib/lightning
    
  • PyPi (📥 490 / month · 📦 5 · ⏱️ 16.12.2020):

     pip install sklearn-contrib-lightning
    
  • Conda (📥 130K · ⏱️ 20.12.2020):

     conda install -c conda-forge sklearn-contrib-lightning
    
scikit-opt (🥈22 · ⭐ 1.8K) - Genetic Algorithm, Particle Swarm Optimization, Simulated.. MIT
  • GitHub (👨‍💻 8 · 🔀 420 · 📦 25 · 📋 82 - 25% open · ⏱️ 07.01.2021):

     git clone https://github.com/guofei9987/scikit-opt
    
  • PyPi (📥 870 / month · 📦 3 · ⏱️ 20.11.2020):

     pip install scikit-opt
    
fancyimpute (🥈22 · ⭐ 910) - Multivariate imputation and matrix completion algorithms.. Apache-2
  • GitHub (👨‍💻 11 · 🔀 140 · 📦 640 · 📋 110 - 3% open · ⏱️ 27.08.2020):

     git clone https://github.com/iskandr/fancyimpute
    
  • PyPi (📥 4.9K / month · 📦 21 · ⏱️ 16.07.2020):

     pip install fancyimpute
    
scikit-lego (🥈22 · ⭐ 380) - Extra blocks for scikit-learn pipelines. MIT
  • GitHub (👨‍💻 38 · 🔀 64 · 📦 15 · 📋 200 - 9% open · ⏱️ 14.01.2021):

     git clone https://github.com/koaning/scikit-lego
    
  • PyPi (📥 880 / month · ⏱️ 04.01.2021):

     pip install scikit-lego
    
  • Conda (📥 7.6K · ⏱️ 02.11.2020):

     conda install -c conda-forge scikit-lego
    
iterative-stratification (🥉19 · ⭐ 490) - scikit-learn cross validators for iterative.. BSD-3
  • GitHub (👨‍💻 4 · 🔀 44 · 📦 99 · 📋 11 - 9% open · ⏱️ 12.09.2020):

     git clone https://github.com/trent-b/iterative-stratification
    
  • PyPi (📥 2.7K / month · ⏱️ 12.08.2018):

     pip install iterative-stratification
    
scikit-tda (🥉19 · ⭐ 260) - Topological Data Analysis for Python. MIT
  • GitHub (👨‍💻 2 · 🔀 32 · 📦 18 · 📋 14 - 78% open · ⏱️ 09.01.2021):

     git clone https://github.com/scikit-tda/scikit-tda
    
  • PyPi (📥 4.7K / month · ⏱️ 09.01.2021):

     pip install scikit-tda
    
DESlib (🥉17 · ⭐ 290) - A Python library for dynamic classifier and ensemble selection. BSD-3
  • GitHub (👨‍💻 12 · 🔀 66 · 📦 13 · 📋 130 - 9% open · ⏱️ 01.01.2021):

     git clone https://github.com/scikit-learn-contrib/DESlib
    
  • PyPi (📥 130 / month · ⏱️ 18.02.2019):

     pip install deslib
    
skggm (🥉17 · ⭐ 170) - Scikit-learn compatible estimation of general graphical models. MIT
  • GitHub (👨‍💻 5 · 🔀 31 · 📦 6 · 📋 74 - 36% open · ⏱️ 24.12.2020):

     git clone https://github.com/skggm/skggm
    
  • PyPi (📥 92 / month · 📦 1 · ⏱️ 12.09.2018):

     pip install skggm
    
Show 5 hidden projects...
  • sklearn-crfsuite (🥈24 · ⭐ 360 · 💀) - scikit-learn inspired API for CRFsuite. MIT
  • scikit-multilearn (🥈22 · ⭐ 620 · 💀) - A scikit-learn based module for multi-label et. al... BSD-2
  • skope-rules (🥉20 · ⭐ 350) - machine learning with logical rules in Python. ❗️BSD-1-Clause
  • celer (🥉17 · ⭐ 100 · ➕) - Fast sklearn solvers for sparse problems (group Lasso, adaptive.. BSD-3
  • dabl (🥉16 · ⭐ 62) - Data Analysis Baseline Library. BSD-3

Pytorch Utilities

Back to top

Libraries that extend Pytorch with additional capabilities.

pretrainedmodels (🥇27 · ⭐ 7.6K · 💤) - Pretrained ConvNets for pytorch: NASNet, ResNeXt,.. BSD-3
  • GitHub (👨‍💻 22 · 🔀 1.6K · 📦 820 · 📋 170 - 49% open · ⏱️ 16.04.2020):

     git clone https://github.com/Cadene/pretrained-models.pytorch
    
  • PyPi (📥 28K / month · 📦 100 · ⏱️ 29.10.2018):

     pip install pretrainedmodels
    
pytorch-optimizer (🥇25 · ⭐ 1.6K) - torch-optimizer -- collection of optimizers for.. Apache-2
  • GitHub (👨‍💻 19 · 🔀 150 · 📦 100 · 📋 33 - 30% open · ⏱️ 14.01.2021):

     git clone https://github.com/jettify/pytorch-optimizer
    
  • PyPi (📥 11K / month · ⏱️ 01.01.2021):

     pip install torch_optimizer
    
torchdiffeq (🥇24 · ⭐ 3.3K) - Differentiable ODE solvers with full GPU support and.. MIT
  • GitHub (👨‍💻 19 · 🔀 570 · 📦 57 · 📋 120 - 9% open · ⏱️ 07.01.2021):

     git clone https://github.com/rtqichen/torchdiffeq
    
  • PyPi (📥 1.9K / month · ⏱️ 16.09.2020):

     pip install torchdiffeq
    
pytorch-summary (🥇24 · ⭐ 2.9K) - Model summary in PyTorch similar to `model.summary()` in.. MIT
  • GitHub (👨‍💻 11 · 🔀 340 · 📦 1.8K · 📋 140 - 73% open · ⏱️ 07.08.2020):

     git clone https://github.com/sksq96/pytorch-summary
    
  • PyPi (📥 33K / month · 📦 120 · ⏱️ 26.09.2018):

     pip install torchsummary
    
PML (🥇24 · ⭐ 2.6K) - The easiest way to use deep metric learning in your application. Modular,.. MIT
  • GitHub (👨‍💻 12 · 🔀 360 · 📦 45 · 📋 200 - 15% open · ⏱️ 15.01.2021):

     git clone https://github.com/KevinMusgrave/pytorch-metric-learning
    
  • PyPi (📥 3.5K / month · ⏱️ 15.01.2021):

     pip install pytorch-metric-learning
    
  • Conda (📥 1.1K · ⏱️ 12.01.2021):

     conda install -c metric-learning pytorch-metric-learning
    
SRU (🥇24 · ⭐ 1.9K) - Training RNNs as Fast as CNNs (https://arxiv.org/abs/1709.02755). MIT
  • GitHub (👨‍💻 21 · 🔀 280 · 📦 15 · 📋 100 - 47% open · ⏱️ 17.12.2020):

     git clone https://github.com/asappresearch/sru
    
  • PyPi (📥 1.9K / month · 📦 1 · ⏱️ 17.12.2020):

     pip install sru
    
EfficientNet-PyTorch (🥈23 · ⭐ 5.3K) - A PyTorch implementation of EfficientNet. Apache-2
  • GitHub (👨‍💻 21 · 🔀 1K · 📥 290K · 📋 220 - 46% open · ⏱️ 09.10.2020):

     git clone https://github.com/lukemelas/EfficientNet-PyTorch
    
  • PyPi (📥 30K / month · 📦 7 · ⏱️ 26.08.2020):

     pip install efficientnet-pytorch
    
EfficientNets (🥈22 · ⭐ 1.2K) - Pretrained EfficientNet, EfficientNet-Lite, MixNet,.. Apache-2
  • GitHub (👨‍💻 5 · 🔀 170 · 📦 28 · ⏱️ 30.11.2020):

     git clone https://github.com/rwightman/gen-efficientnet-pytorch
    
  • PyPi (📥 1.3K / month · ⏱️ 20.08.2020):

     pip install geffnet
    
Torchmeta (🥈21 · ⭐ 1.2K) - A collection of extensions and data-loaders for few-shot learning.. MIT
  • GitHub (👨‍💻 10 · 🔀 130 · 📦 24 · 📋 88 - 21% open · ⏱️ 25.11.2020):

     git clone https://github.com/tristandeleu/pytorch-meta
    
  • PyPi (📥 450 / month · ⏱️ 25.11.2020):

     pip install torchmeta
    
PyTorch Sparse (🥈21 · ⭐ 340) - PyTorch Extension Library of Optimized Autograd Sparse.. MIT
  • GitHub (👨‍💻 12 · 🔀 43 · 📋 85 - 41% open · ⏱️ 11.01.2021):

     git clone https://github.com/rusty1s/pytorch_sparse
    
  • PyPi (📥 4.2K / month · 📦 17 · ⏱️ 02.11.2020):

     pip install torch-sparse
    
reformer-pytorch (🥈20 · ⭐ 1.3K) - Reformer, the efficient Transformer, in Pytorch. MIT
  • GitHub (👨‍💻 9 · 🔀 160 · 📋 93 - 1% open · ⏱️ 11.12.2020):

     git clone https://github.com/lucidrains/reformer-pytorch
    
  • PyPi (📥 920 / month · ⏱️ 06.12.2020):

     pip install reformer-pytorch
    
torch-scatter (🥈20 · ⭐ 580) - PyTorch Extension Library of Optimized Scatter Operations. MIT
  • GitHub (👨‍💻 14 · 🔀 73 · 📋 170 - 34% open · ⏱️ 15.01.2021):

     git clone https://github.com/rusty1s/pytorch_scatter
    
  • PyPi (📥 4.9K / month · 📦 16 · ⏱️ 22.02.2020):

     pip install torch-scatter
    
Pytorch Toolbelt (🥉19 · ⭐ 880) - PyTorch extensions for fast R&D prototyping and Kaggle.. MIT
  • GitHub (👨‍💻 5 · 🔀 71 · 📋 17 - 17% open · ⏱️ 14.01.2021):

     git clone https://github.com/BloodAxe/pytorch-toolbelt
    
  • PyPi (📥 6.4K / month · 📦 1 · ⏱️ 14.01.2021):

     pip install pytorch_toolbelt
    
TabNet (🥉19 · ⭐ 740) - PyTorch implementation of TabNet paper :.. MIT
  • GitHub (👨‍💻 17 · 🔀 140 · 📋 130 - 24% open · ⏱️ 12.01.2021):

     git clone https://github.com/dreamquark-ai/tabnet
    
  • PyPi (📥 2.3K / month · ⏱️ 12.01.2021):

     pip install pytorch-tabnet
    
Higher (🥉18 · ⭐ 1K) - higher is a pytorch library allowing users to obtain higher order.. Apache-2
  • GitHub (👨‍💻 6 · 🔀 73 · 📦 27 · 📋 71 - 33% open · ⏱️ 28.10.2020):

     git clone https://github.com/facebookresearch/higher
    
  • PyPi (📥 660 / month · ⏱️ 14.07.2020):

     pip install higher
    
Lambda Networks (🥉17 · ⭐ 1.3K · 🐣) - Implementation of LambdaNetworks, a new approach to.. MIT
  • GitHub (👨‍💻 3 · 🔀 130 · 📋 23 - 34% open · ⏱️ 18.11.2020):

     git clone https://github.com/lucidrains/lambda-networks
    
  • PyPi (📥 1.2K / month · ⏱️ 20.10.2020):

     pip install lambda-networks
    
Tensor Sensor (🥉16 · ⭐ 500 · 🐣) - The goal of this library is to generate more helpful.. MIT
  • GitHub (👨‍💻 2 · 🔀 24 · 📦 1 · 📋 16 - 25% open · ⏱️ 07.01.2021):

     git clone https://github.com/parrt/tensor-sensor
    
  • PyPi (📥 220 / month · ⏱️ 03.12.2020):

     pip install tensor-sensor
    
Performer Pytorch (🥉16 · ⭐ 460 · 🐣) - An implementation of Performer, a linear attention-.. MIT
  • GitHub (👨‍💻 4 · 🔀 60 · 📦 7 · 📋 49 - 34% open · ⏱️ 17.01.2021):

     git clone https://github.com/lucidrains/performer-pytorch
    
  • PyPi (📥 880 / month · ⏱️ 17.01.2021):

     pip install performer-pytorch
    
Pywick (🥉16 · ⭐ 310) - High-level batteries-included neural network training library for.. MIT
  • GitHub (👨‍💻 4 · 🔀 35 · 📦 2 · 📋 11 - 18% open · ⏱️ 24.09.2020):

     git clone https://github.com/achaiah/pywick
    
  • PyPi (📥 210 / month · ⏱️ 20.01.2020):

     pip install pywick
    
tinygrad (🥉15 · ⭐ 4K · 🐣) - You like pytorch? You like micrograd? You love tinygrad!. MIT
  • GitHub (👨‍💻 42 · 🔀 440 · 📋 74 - 17% open · ⏱️ 05.01.2021):

     git clone https://github.com/geohot/tinygrad
    
torchsde (🥉15 · ⭐ 630 · 🐣) - Differentiable SDE solvers with GPU support and efficient.. Apache-2
  • GitHub (👨‍💻 4 · 🔀 54 · 📋 32 - 18% open · ⏱️ 05.01.2021):

     git clone https://github.com/google-research/torchsde
    
Tez (🥉14 · ⭐ 400 · 🐣) - Tez is a super-simple and lightweight Trainer for PyTorch. It.. Apache-2
  • GitHub (👨‍💻 1 · 🔀 46 · 📋 13 - 53% open · ⏱️ 15.01.2021):

     git clone https://github.com/abhishekkrthakur/tez
    
  • PyPi (📥 1K / month · ⏱️ 26.12.2020):

     pip install tez
    
micrograd (🥉13 · ⭐ 1.6K · 💤) - A tiny scalar-valued autograd engine and a neural net library.. MIT
  • GitHub (👨‍💻 2 · 🔀 110 · 📋 5 - 40% open · ⏱️ 18.04.2020):

     git clone https://github.com/karpathy/micrograd
    
  • PyPi (📥 20 / month · ⏱️ 18.04.2020):

     pip install micrograd
    
Torch-Struct (🥉13 · ⭐ 880) - Fast, general, and tested differentiable structured prediction.. MIT
  • GitHub (👨‍💻 12 · 🔀 69 · 📋 35 - 37% open · ⏱️ 16.01.2021):

     git clone https://github.com/harvardnlp/pytorch-struct
    
Show 3 hidden projects...

Database Clients

Back to top

Libraries for connecting to, operating, and querying databases.

🔗 best-of-python - DB Clients ( ⭐ 400) - Collection of database clients for python.


Others

Back to top

scipy (🥇40 · ⭐ 7.8K) - Ecosystem of open-source software for mathematics, science, and engineering. BSD-3
  • GitHub (👨‍💻 1.1K · 🔀 3.5K · 📥 300K · 📦 290K · 📋 7.2K - 21% open · ⏱️ 16.01.2021):

     git clone https://github.com/scipy/scipy
    
  • PyPi (📥 13M / month · 📦 87K · ⏱️ 22.12.2020):

     pip install scipy
    
  • Conda (📥 12M · ⏱️ 12.01.2021):

     conda install -c conda-forge scipy
    
SymPy (🥇36 · ⭐ 7.7K) - A computer algebra system written in pure Python. BSD-3
  • GitHub (👨‍💻 1K · 🔀 3.2K · 📥 410K · 📦 28K · 📋 11K - 35% open · ⏱️ 17.01.2021):

     git clone https://github.com/sympy/sympy
    
  • PyPi (📥 530K / month · 📦 6.4K · ⏱️ 12.12.2020):

     pip install sympy
    
  • Conda (📥 1.2M · ⏱️ 08.01.2021):

     conda install -c conda-forge sympy
    
PyOD (🥇29 · ⭐ 4K · 📈) - A Python Toolbox for Scalable Outlier Detection (Anomaly Detection). BSD-2
  • GitHub (👨‍💻 22 · 🔀 840 · 📦 460 · 📋 180 - 55% open · ⏱️ 12.01.2021):

     git clone https://github.com/yzhao062/pyod
    
  • PyPi (📥 79K / month · 📦 6 · ⏱️ 12.01.2021):

     pip install pyod
    
Keras-Preprocessing (🥇29 · ⭐ 900) - Utilities for working with image data, text data, and.. MIT
  • GitHub (👨‍💻 48 · 🔀 390 · 📋 190 - 48% open · ⏱️ 11.12.2020):

     git clone https://github.com/keras-team/keras-preprocessing
    
  • PyPi (📥 3.5M / month · 📦 2.7K · ⏱️ 14.05.2020):

     pip install keras-preprocessing
    
  • Conda (📥 770K · ⏱️ 15.01.2021):

     conda install -c conda-forge keras-preprocessing
    
Cython BLIS (🥇28 · ⭐ 160) - Fast matrix-multiplication as a self-contained Python library no.. BSD-3
  • GitHub (👨‍💻 9 · 🔀 22 · 📦 7.8K · 📋 21 - 28% open · ⏱️ 07.12.2020):

     git clone https://github.com/explosion/cython-blis
    
  • PyPi (📥 750K / month · 📦 390 · ⏱️ 07.12.2020):

     pip install blis
    
  • Conda (📥 390K · ⏱️ 07.12.2020):

     conda install -c conda-forge cython-blis
    
hdbscan (🥈27 · ⭐ 1.8K) - A high performance implementation of HDBSCAN clustering. BSD-3
  • GitHub (👨‍💻 64 · 🔀 330 · 📦 700 · 📋 360 - 59% open · ⏱️ 06.01.2021):

     git clone https://github.com/scikit-learn-contrib/hdbscan
    
  • PyPi (📥 110K / month · 📦 120 · ⏱️ 19.03.2020):

     pip install hdbscan
    
  • Conda (📥 510K · ⏱️ 02.11.2020):

     conda install -c conda-forge hdbscan
    
pyopencl (🥈27 · ⭐ 760) - OpenCL integration for Python, plus shiny features. MIT
  • GitHub (👨‍💻 82 · 🔀 200 · 📦 440 · 📋 260 - 19% open · ⏱️ 15.01.2021):

     git clone https://github.com/inducer/pyopencl
    
  • PyPi (📥 4.7K / month · 📦 240 · ⏱️ 20.11.2020):

     pip install pyopencl
    
  • Conda (📥 300K · ⏱️ 13.01.2021):

     conda install -c conda-forge pyopencl
    
Streamlit (🥈26 · ⭐ 13K) - Streamlit The fastest way to build data apps in Python. Apache-2
  • GitHub (👨‍💻 84 · 🔀 1K · 📦 49 · 📋 1.5K - 36% open · ⏱️ 15.01.2021):

     git clone https://github.com/streamlit/streamlit
    
  • PyPi (📥 100K / month · 📦 34 · ⏱️ 07.01.2021):

     pip install streamlit
    
carla (🥈26 · ⭐ 5.5K) - Open-source simulator for autonomous driving research. MIT
  • GitHub (👨‍💻 120 · 🔀 1.6K · 📦 54 · 📋 2.9K - 9% open · ⏱️ 22.12.2020):

     git clone https://github.com/carla-simulator/carla
    
  • PyPi (📥 490 / month · 📦 2 · ⏱️ 28.04.2019):

     pip install carla
    
Datasette (🥈26 · ⭐ 4.5K) - An open source multi-tool for exploring and publishing data. Apache-2
  • GitHub (👨‍💻 42 · 🔀 240 · 📥 24 · 📦 340 · 📋 950 - 22% open · ⏱️ 14.01.2021):

     git clone https://github.com/simonw/datasette
    
  • PyPi (📥 4.2K / month · 📦 70 · ⏱️ 11.12.2020):

     pip install datasette
    
agate (🥈26 · ⭐ 1K · 💤) - A Python data analysis library that is optimized for humans instead of.. MIT
  • GitHub (👨‍💻 47 · 🔀 130 · 📦 480 · 📋 640 - 8% open · ⏱️ 01.04.2020):

     git clone https://github.com/wireservice/agate
    
  • PyPi (📥 150K / month · 📦 92 · ⏱️ 11.03.2018):

     pip install agate
    
  • Conda (📥 59K · ⏱️ 19.08.2018):

     conda install -c conda-forge agate
    
pyclustering (🥈26 · ⭐ 780) - pyclustring is a Python, C++ data mining library. BSD-3
  • GitHub (👨‍💻 26 · 🔀 180 · 📥 280 · 📦 170 · 📋 640 - 8% open · ⏱️ 03.12.2020):

     git clone https://github.com/annoviko/pyclustering
    
  • PyPi (📥 17K / month · 📦 36 · ⏱️ 25.11.2020):

     pip install pyclustering
    
  • Conda (📥 12K · ⏱️ 30.11.2020):

     conda install -c conda-forge pyclustering
    
Trax (🥈25 · ⭐ 5.6K) - Trax Deep Learning with Clear Code and Speed. Apache-2
  • GitHub (👨‍💻 56 · 🔀 520 · 📦 19 · 📋 150 - 35% open · ⏱️ 15.01.2021):

     git clone https://github.com/google/trax
    
  • PyPi (📥 3.2K / month · ⏱️ 18.12.2020):

     pip install trax
    
DeepChem (🥈25 · ⭐ 2.7K) - Democratizing Deep-Learning for Drug Discovery, Quantum Chemistry,.. MIT
  • GitHub (👨‍💻 140 · 🔀 900 · 📦 35 · 📋 1.2K - 28% open · ⏱️ 16.01.2021):

     git clone https://github.com/deepchem/deepchem
    
  • PyPi (📥 1.5K / month · 📦 4 · ⏱️ 16.01.2021):

     pip install deepchem
    
causalml (🥈25 · ⭐ 1.6K) - Uplift modeling and causal inference with machine learning.. Apache-2
  • GitHub (👨‍💻 22 · 🔀 240 · 📦 11 · 📋 150 - 13% open · ⏱️ 13.01.2021):

     git clone https://github.com/uber/causalml
    
  • PyPi (📥 8.3K / month · ⏱️ 23.10.2020):

     pip install causalml
    
Pythran (🥈25 · ⭐ 1.5K) - Ahead of Time compiler for numeric kernels. BSD-3
  • GitHub (👨‍💻 47 · 🔀 130 · 📦 48 · 📋 640 - 15% open · ⏱️ 06.01.2021):

     git clone https://github.com/serge-sans-paille/pythran
    
  • PyPi (📥 4.3K / month · 📦 13 · ⏱️ 11.12.2020):

     pip install pythran
    
  • Conda (📥 120K · ⏱️ 15.12.2020):

     conda install -c conda-forge pythran
    
TabPy (🥈24 · ⭐ 1K) - Execute Python code on the fly and display results in Tableau visualizations:. MIT
  • GitHub (👨‍💻 38 · 🔀 370 · 📦 43 · 📋 250 - 3% open · ⏱️ 11.11.2020):

     git clone https://github.com/tableau/TabPy
    
  • PyPi (📥 1.8K / month · ⏱️ 11.11.2020):

     pip install tabpy
    
kmodes (🥈24 · ⭐ 800) - Python implementations of the k-modes and k-prototypes clustering.. MIT
  • GitHub (👨‍💻 16 · 🔀 340 · 📦 410 · 📋 120 - 15% open · ⏱️ 04.11.2020):

     git clone https://github.com/nicodv/kmodes
    
  • PyPi (📥 49K / month · 📦 13 · ⏱️ 25.02.2020):

     pip install kmodes
    
PennyLane (🥈24 · ⭐ 700) - PennyLane is a cross-platform Python library for differentiable.. Apache-2
  • GitHub (👨‍💻 39 · 🔀 210 · 📥 52 · 📋 270 - 18% open · ⏱️ 13.01.2021):

     git clone https://github.com/PennyLaneAI/PennyLane
    
  • PyPi (📥 1.9K / month · 📦 20 · ⏱️ 27.11.2020):

     pip install pennylane
    
pyjanitor (🥈24 · ⭐ 620) - Clean APIs for data cleaning. Python implementation of R package Janitor. MIT
  • GitHub (👨‍💻 87 · 🔀 120 · 📦 72 · 📋 370 - 24% open · ⏱️ 31.12.2020):

     git clone https://github.com/ericmjl/pyjanitor
    
  • PyPi (📥 1.4K / month · 📦 2 · ⏱️ 03.10.2020):

     pip install pyjanitor
    
  • Conda (📥 77K · ⏱️ 04.10.2020):

     conda install -c conda-forge pyjanitor
    
findspark (🥈24 · ⭐ 380 · 💤) - Find pyspark to make it importable. BSD-3
  • GitHub (👨‍💻 14 · 🔀 64 · 📦 1.5K · 📋 19 - 57% open · ⏱️ 08.06.2020):

     git clone https://github.com/minrk/findspark
    
  • PyPi (📥 530K / month · 📦 200 · ⏱️ 08.06.2020):

     pip install findspark
    
  • Conda (📥 480K · ⏱️ 06.07.2018):

     conda install -c conda-forge findspark
    
datalad (🥈24 · ⭐ 220) - Keep code, data, containers under control with git and git-annex. MIT
  • GitHub (👨‍💻 40 · 🔀 68 · 📋 2.9K - 24% open · ⏱️ 16.01.2021):

     git clone https://github.com/datalad/datalad
    
  • PyPi (📥 1.2K / month · 📦 26 · ⏱️ 14.12.2020):

     pip install datalad
    
  • Conda (📥 77K · ⏱️ 04.01.2021):

     conda install -c conda-forge datalad
    
PaddleHub (🥉23 · ⭐ 4.4K) - Awesome pre-trained models toolkit based on.. Apache-2
  • GitHub (👨‍💻 32 · 🔀 990 · 📥 430 · 📦 78 · 📋 640 - 41% open · ⏱️ 13.01.2021):

     git clone https://github.com/PaddlePaddle/PaddleHub
    
  • PyPi (📥 1.6K / month · ⏱️ 06.01.2021):

     pip install paddlehub
    
Gradio (🥉23 · ⭐ 1.9K · ➕) - Wrap UIs around any model, share with anyone. Apache-2
  • GitHub (👨‍💻 14 · 🔀 120 · 📦 59 · 📋 58 - 8% open · ⏱️ 23.12.2020):

     git clone https://github.com/gradio-app/gradio
    
  • PyPi (📥 1.5K / month · 📦 1 · ⏱️ 23.12.2020):

     pip install gradio
    
metric-learn (🥉23 · ⭐ 1.1K) - Metric learning algorithms in Python. MIT
  • GitHub (👨‍💻 19 · 🔀 200 · 📦 120 · 📋 160 - 30% open · ⏱️ 04.08.2020):

     git clone https://github.com/scikit-learn-contrib/metric-learn
    
  • PyPi (📥 15K / month · 📦 9 · ⏱️ 02.07.2020):

     pip install metric-learn
    
tensorly (🥉23 · ⭐ 950) - TensorLy: Tensor Learning in Python. BSD-2
  • GitHub (👨‍💻 38 · 🔀 190 · 📋 120 - 20% open · ⏱️ 03.01.2021):

     git clone https://github.com/tensorly/tensorly
    
  • PyPi (📥 2.5K / month · 📦 20 · ⏱️ 07.12.2020):

     pip install tensorly
    
  • Conda (📥 110K · ⏱️ 07.12.2020):

     conda install -c conda-forge tensorly
    
PySwarms (🥉23 · ⭐ 710) - A research toolkit for particle swarm optimization in Python. MIT
  • GitHub (👨‍💻 42 · 🔀 230 · 📦 87 · 📋 170 - 8% open · ⏱️ 03.01.2021):

     git clone https://github.com/ljvmiranda921/pyswarms
    
  • PyPi (📥 4.6K / month · ⏱️ 03.01.2021):

     pip install pyswarms
    
SUOD (🥉23 · ⭐ 230) - An Acceleration System for Large-scare Unsupervised Heterogeneous Outlier.. BSD-2
  • GitHub (👨‍💻 1 · 🔀 26 · 📦 240 · 📋 5 - 60% open · ⏱️ 15.01.2021):

     git clone https://github.com/yzhao062/SUOD
    
  • PyPi (📥 78K / month · ⏱️ 15.01.2021):

     pip install suod
    
Mars (🥉22 · ⭐ 2K) - Mars is a tensor-based unified framework for large-scale data computation.. Apache-2
  • GitHub (👨‍💻 17 · 🔀 240 · 📦 3 · 📋 740 - 15% open · ⏱️ 17.01.2021):

     git clone https://github.com/mars-project/mars
    
  • PyPi (📥 3.1K / month · ⏱️ 17.01.2021):

     pip install pymars
    
cleanlab (🥉22 · ⭐ 1.4K) - The standard package for machine learning with noisy labels and finding.. MIT
  • GitHub (👨‍💻 5 · 🔀 150 · 📦 10 · 📋 51 - 31% open · ⏱️ 10.01.2021):

     git clone https://github.com/cgnorthcutt/cleanlab
    
  • PyPi (📥 2.6K / month · ⏱️ 17.02.2020):

     pip install cleanlab
    
pycm (🥉22 · ⭐ 1K) - Multi-class confusion matrix library in Python. MIT
  • GitHub (👨‍💻 14 · 🔀 92 · 📦 80 · 📋 160 - 7% open · ⏱️ 26.10.2020):

     git clone https://github.com/sepandhaghighi/pycm
    
  • PyPi (📥 2K / month · 📦 10 · ⏱️ 26.10.2020):

     pip install pycm
    
modAL (🥉22 · ⭐ 1K) - A modular active learning framework for Python. MIT
  • GitHub (👨‍💻 14 · 🔀 170 · 📥 2 · 📦 44 · 📋 93 - 46% open · ⏱️ 07.01.2021):

     git clone https://github.com/modAL-python/modAL
    
  • PyPi (📥 960 / month · 📦 8 · ⏱️ 07.01.2021):

     pip install modAL
    
Prince (🥉22 · ⭐ 550) - Python factor analysis library (PCA, CA, MCA, MFA, FAMD). MIT
  • GitHub (👨‍💻 9 · 🔀 110 · 📦 100 · 📋 89 - 28% open · ⏱️ 11.01.2021):

     git clone https://github.com/MaxHalford/prince
    
  • PyPi (📥 3.9K / month · 📦 3 · ⏱️ 06.10.2020):

     pip install prince
    
AstroML (🥉21 · ⭐ 700) - Machine learning, statistics, and data mining for astronomy and.. BSD-2
  • GitHub (👨‍💻 29 · 🔀 250 · 📦 160 · 📋 130 - 37% open · ⏱️ 09.09.2020):

     git clone https://github.com/astroML/astroML
    
  • PyPi (📥 470 / month · 📦 29 · ⏱️ 23.03.2020):

     pip install astroML
    
  • Conda (📥 21K · ⏱️ 16.02.2020):

     conda install -c conda-forge astroml
    
BioPandas (🥉21 · ⭐ 320) - Working with molecular structures in pandas DataFrames. BSD-3
  • GitHub (👨‍💻 7 · 🔀 76 · 📦 47 · 📋 34 - 47% open · ⏱️ 01.01.2021):

     git clone https://github.com/rasbt/biopandas
    
  • PyPi (📥 200 / month · 📦 6 · ⏱️ 04.08.2020):

     pip install biopandas
    
  • Conda (📥 62K · ⏱️ 08.08.2020):

     conda install -c conda-forge biopandas
    
StreamAlert (🥉20 · ⭐ 2.4K) - StreamAlert is a serverless, realtime data analysis framework.. Apache-2
  • GitHub (👨‍💻 30 · 🔀 280 · 📋 340 - 26% open · ⏱️ 05.10.2020):

     git clone https://github.com/airbnb/streamalert
    
alibi-detect (🥉20 · ⭐ 520) - Algorithms for outlier and adversarial instance detection,.. Apache-2
  • GitHub (👨‍💻 7 · 🔀 53 · 📦 17 · 📋 71 - 42% open · ⏱️ 05.01.2021):

     git clone https://github.com/SeldonIO/alibi-detect
    
  • PyPi (📥 1K / month · 📦 1 · ⏱️ 08.10.2020):

     pip install alibi-detect
    
scikit-rebate (🥉20 · ⭐ 300) - A scikit-learn-compatible Python implementation of ReBATE, a.. MIT
  • GitHub (👨‍💻 13 · 🔀 55 · 📦 120 · 📋 32 - 46% open · ⏱️ 18.07.2020):

     git clone https://github.com/EpistasisLab/scikit-rebate
    
  • PyPi (📥 1.2K / month · 📦 31 · ⏱️ 18.07.2020):

     pip install skrebate
    
rrcf (🥉20 · ⭐ 280 · 💤) - Implementation of the Robust Random Cut Forest algorithm for anomaly.. MIT
  • GitHub (👨‍💻 4 · 🔀 65 · 📦 15 · 📋 33 - 48% open · ⏱️ 10.06.2020):

     git clone https://github.com/kLabUM/rrcf
    
  • PyPi (📥 2.1K / month · ⏱️ 10.06.2020):

     pip install rrcf
    
gplearn (🥉19 · ⭐ 900 · 💤) - Genetic Programming in Python, with a scikit-learn inspired API. BSD-3
  • GitHub (👨‍💻 10 · 🔀 150 · 📦 120 · 📋 140 - 19% open · ⏱️ 15.02.2020):

     git clone https://github.com/trevorstephens/gplearn
    
  • PyPi (📥 1K / month · 📦 7 · ⏱️ 01.06.2019):

     pip install gplearn
    
mlens (🥉19 · ⭐ 670 · 💤) - ML-Ensemble high performance ensemble learning. MIT
  • GitHub (👨‍💻 7 · 🔀 81 · 📦 99 · 📋 81 - 9% open · ⏱️ 25.02.2020):

     git clone https://github.com/flennerhag/mlens
    
  • PyPi (📥 1K / month · ⏱️ 30.10.2018):

     pip install mlens
    
baikal (🥉19 · ⭐ 570) - A graph-based functional API for building complex scikit-learn pipelines. BSD-3
  • GitHub (👨‍💻 1 · 🔀 28 · 📦 3 · 📋 15 - 26% open · ⏱️ 15.11.2020):

     git clone https://github.com/alegonz/baikal
    
  • PyPi (📥 31 / month · ⏱️ 15.11.2020):

     pip install baikal
    
  • Conda (📥 390K · ⏱️ 07.12.2020):

     conda install -c conda-forge cython-blis
    
Feature Engine (🥉19 · ⭐ 420) - Feature engineering package with sklearn like functionality. BSD-3
  • GitHub (👨‍💻 20 · 🔀 130 · 📋 98 - 27% open · ⏱️ 15.01.2021):

     git clone https://github.com/solegalli/feature_engine
    
  • PyPi (📥 13K / month · 📦 2 · ⏱️ 11.01.2021):

     pip install feature_engine
    
  • Conda (📥 1.2K · ⏱️ 11.01.2021):

     conda install -c conda-forge feature_engine
    
apricot (🥉18 · ⭐ 300) - apricot implements submodular optimization for the purpose of selecting.. MIT
  • GitHub (👨‍💻 2 · 🔀 29 · 📥 8 · 📦 12 · 📋 14 - 21% open · ⏱️ 30.09.2020):

     git clone https://github.com/jmschrei/apricot
    
  • PyPi (📥 410 / month · ⏱️ 28.09.2020):

     pip install apricot-select
    
River (🥉17 · ⭐ 1.3K) - Online machine learning in Python. BSD-3
  • GitHub (👨‍💻 57 · 🔀 160 · 📋 280 - 13% open · ⏱️ 06.01.2021):

     git clone https://github.com/online-ml/river
    
Show 7 hidden projects...
  • Autograd (🥇29 · ⭐ 5.1K · 💀) - Efficiently computes derivatives of numpy code. MIT
  • pysc2 (🥈24 · ⭐ 7.1K · 💀) - StarCraft II Learning Environment. Apache-2
  • minisom (🥉22 · ⭐ 760) - MiniSom is a minimalistic implementation of the Self Organizing.. ❗️CC-BY-3.0
  • impyute (🥉20 · ⭐ 260 · 💀) - Data imputations library to preprocess datasets with missing data. MIT
  • vecstack (🥉18 · ⭐ 570 · 💀) - Python package for stacking (machine learning technique). MIT
  • pandas-ml (🥉17 · ⭐ 260 · 💀) - pandas, scikit-learn, xgboost and seaborn integration. BSD-3
  • traingenerator (🥉9 · ⭐ 870 · 🐣) - A web app to generate template code for machine learning. MIT

Related Resources

Contribution

Contributions are encouraged and always welcome! If you like to add or update projects, choose one of the following ways:

  • Open an issue by selecting one of the provided categories from the issue page and fill in the requested information.
  • Modify the projects.yaml with your additions or changes, and submit a pull request. This can also be done directly via the Github UI.

If you like to contribute to or share suggestions regarding the project metadata collection or markdown generation, please refer to the best-of-generator repository. If you like to create your own best-of list, we recommend to follow this guide.

For more information on how to add or update projects, please read the contribution guidelines. By participating in this project, you agree to abide by its Code of Conduct.

License

CC0

best-of-ml-python's People

Contributors

aliabd avatar csinva avatar ehofesmann avatar github-actions[bot] avatar jankalkan avatar jasmcaus avatar lukasmasuch avatar mathurinm avatar raethlein avatar rbendias avatar tqtg avatar

Stargazers

 avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.