Olivier Houte's Projects
Credential Phish Analysis and Automation
Repository of teaching materials, code, and data for my data analysis and machine learning projects.
Snapshot of Item and Collection data for public domain materials in NYPL Digital Collections, as part of NYPL's January 2016 public domain release.
The Data Broker (DBR) is a distributed, in-memory container of key-value stores enabling applications in a workflow to exchange data through one or more shared namespaces. Thanks to a small set of primitives, applications in a workflow deployed in a (possibly) shared nothing distributed cluster, can easily share and exchange data and messages with a minimum effort. In- spired by the Linda coordination and communication model, the Data Broker provides a unified shared namespace to applications, which is independent from applications’ programming and communication model.
Ipython notebook presentations for getting starting with basic programming, statistics and machine learning techniques
Data Science at the Command Line
A curated list of data science blogs
code for Data Science From Scratch book
Continually updated data science Python notebooks: Deep learning (TensorFlow, Theano, Caffe), scikit-learn, Kaggle, big data (Spark, Hadoop MapReduce, HDFS), matplotlib, pandas, NumPy, SciPy, Python essentials, AWS, and various command lines.
An educational repo for students looking to learn about data structures in Python
Algorithmic operations of some common data structures
Click Security Data Hacking Project
Data Mining Virus Total for threat feed building
A Python tool that automatically cleans data sets and readies them for analysis.
PyData, The Complete Works of
common data analysis and machine learning tasks using python
Open Source Data Science Resources.
Datasets Euclidean to Hamming Conversion
Bin based rendering toolchain
A tool to perform various OSINT techniques, aggregate all the raw data, visualise it on a dashboard, and facilitate alerting and monitoring on the data.
Ansible Playbook for setting up Datasploit
Datumbox is an open-source Machine Learning framework written in Java which allows the rapid development of Machine Learning and Statistical applications.
DDoS attacks via other sites execution tool (DAVOSET) - it is command line tool for conducting DDoS attacks on the sites via Abuse of Functionality and XML External Entities vulnerabilities at other sites.
Fingerprints servers, finds exploits, scans WebDAV. May or may not also make coffee.
Advanced Web Shell
Writing a sqlite clone from scratch in C
Free universal database manager and SQL client
Defcon 27 "DaBomb!" badge.