danielhoadley
Name: Daniel Hoadley
Type: User
Bio: R&D and data science in the legal sector. Main focus on data related to litigation.
Twitter: danhlawreporter
Location: London
Blog: www.carrefax.com
A spaCy pipeline and model for NLP on unstructured legal text.
Collection of scripts used to pull metadata down from CanLII
Express.js application that uses regular expressions to match against legal case references in a hard-coded paragraph
Uses scikit-learn to predict the main subject matter of judgments
Converts JSON into CSV
Collaborative modeling for recommendation. Implements variational inference for collaborative topic models. These models recommend items to users based on item content and other users' ratings.
A library for probabilistic modeling, inference, and criticism. Deep generative models, variational inference. Runs on TensorFlow.
Family Court Report processor
Work on clustering court cases (in XML format), done for Jack Cushman's Free the Law Wintersession Sprint in January 2016.
Connect to an FTP site with Python
Builds inverted index of sentences across multiple files
Hierarchical Dirichlet processes. Topic models where the data determine the number of topics. This implements Gibbs sampling.
Highlights frequently cited paragraphs in CanLII cases
UK case law
🍇 Edit and execute code snippets in the browser using Jupyter kernels
Parses XML law reports files and extracts the case metadata and judgment portions of the file
Parses XML law reports files, identifies the zero-level catchword in the markup, and moves each file to a folder named after that catchword
This is a C implementation of variational EM for latent Dirichlet allocation (LDA), a topic model for text or other discrete data.
Source code for example at http://stackoverflow.com/questions/5871730/need-a-minimal-django-file-upload-example
Python API Wrapper for CanLII
Scrapes the daily cause list of cases listed in the Royal Courts of Justice published by the Ministry of Justice at https://www.justice.gov.uk/courts/court-lists/list-cause-rcj
Sets out a very simple example of using Scikit-learn to run supervised classification over your own corpus of text data
A simple Python script that uses NLTK to split documents into sentences
BAILII feed parser
💫 Industrial-strength Natural Language Processing (NLP) with Python and Cython
A platform for real-time streaming search
Module for automatic summarization of text documents and HTML pages.
Creates a browser of a corpus using a topic model; original TMVE implementation (static pages)