To find matching records in the government-organisation and local-authority-* registers:
- Run
R/download-organisations.R
to createlists/organisation.tsv
- Run
R/extract-public-bodies.R
to createlists/public-body.tsv
- Use my fork of csvdedupe to do the
probabalistic match.
csvlink lists/public-body.csv lists/organisation.csv \ --field_names name \ --output_file lists/auto-joined.csv \ --training_file training.json
- Manually check the matches in Google Sheets.