openelections / openelections-data-ks Goto Github PK
View Code? Open in Web Editor NEWPre-processed results for Kansas elections
Pre-processed results for Kansas elections
These two errors have been verified against the official results PDF.
==> 20161108__ks__general__rice__precinct.csv
ERROR: candidate total incorrect, line 183. 2979 != 2980
{'county': 'Rice', 'precinct': 'Total', 'office': 'U.S. Senate', 'district': '', 'party': 'R', 'candidate': 'Jerry Moran', 'votes': 2979}
ERROR: candidate total incorrect, line 270. 732 != 738
{'county': 'Rice', 'precinct': 'Total', 'office': 'U.S. House', 'district': 1.0, 'party': 'IND', 'candidate': 'Alan LaPolice', 'votes': 732}
I am looking at the precinct names in 2016/20161108__ks__general__douglas__precinct.csv
and I am trying to understand where they are coming from. The presidential results source file in https://github.com/openelections/openelections-sources-ks/blob/master/2016%20general%20precinct/2016%20General%20Election%20-%20President%20Results%20by%20Precinct.xlsx for example, has nothing like this line:
Douglas,Prec 67-78,President,,write-in,Evan McMullin,3
I can't correlate Prec 67-78
to any real precinct in Douglas county.
I can kind of squint and see how Lawrence Precinct 46 S3
might be shortened to Prec 46
but it is spelled out in the source xlsx file, so it makes me wonder: how was this data-ks/2016/...
file created in the first place?
There are discrepancies between the official abstract and the converted CSV file in a number of races.
If you want to work on a county, add a comment saying which one you'd like to work on or email [email protected]. You can either email us finished CSV files or submit a pull request, whatever is easiest.
The source files you'll be converting can be found here. Many of these are either PDFs or Excel files, and in some cases you'll need to convert a number of office-specific files. For electronic PDFs, we recommend using Tabula, which is free, to extract data. The goal is to create a single CSV file for each county, with the following headers:
county
, precinct
, office
, district
, party
, candidate
, votes
For the following offices: President, U.S. Senate, U.S. House, State Senate, State House. You can use the Kiowa County file as an example of how things should look.
Is this a typo in the transcription?
If you want to work on a county, add a comment saying which one you'd like to work on or email [email protected]. You can either email us finished CSV files or submit a pull request, whatever is easiest.
The source files you'll be converting can be found here. Many of these are either PDFs or Excel files, and in some cases you'll need to convert a number of office-specific files. For electronic PDFs, we recommend using Tabula, which is free, to extract data. The goal is to create a single CSV file for each county, with the following headers:
county
, precinct
, office
, district
, party
, candidate
, votes
For the following offices: President, U.S. Senate, U.S. House, State Senate, State House. You can use the Kiowa County file as an example of how things should look.
2014/20141104-totals.csv has a few problems:
There only look to be entries for her opponent in the general precinct file.
Odd I know, but seems to be missing Presidential vote in all precincts.
I am working through 2014 results and there are some systemic mistakes which make me suspicious of the whole.
For example, in Douglas county there are references to "Paul Davis" as the Democratic candidate for US Senate. He was not. He ran for Governor.
In the same file, for the Governor office the precinct vote counts are swapped for the Democrat and the Republican.
I have source files from the KS SOS and am going to reparse from source. I will try and preserve any totals in the originals per discussion in #39
The verifier is complaining because 20161108__ks__general__bourbon__precinct.csv
has 35 duplicate rows. While investigating this, I discovered that there are some precincts supposedly called "Fort Scott Ward 1 Exclave Halls" and the like. This precinct name appears in the 2012 general too. However, the PDF of the precinct-level results doesn't list these, and the vote totals are all 0. Probably they should be removed, but I can't find the original source of them, so I wasn't sure.
Also, the 2016 file seems to have the official precincts of "E Marion" and "W Marion" from the PDF combined into a single "Marion Township." Probably we should use the same divisions that the county used that year?
I'm seeing very similar lines in 20161108__ks__general__cherokee__precinct.csv
as well.
Example:
county,precinct,office,district,party,candidate,votes
Gove,Attorney General,Baker,,D,A J Kotich,73
Gove,Attorney General,Grainfield,,D,A J Kotich,37
It looks like precinct and office are swapped.
Not sure if that's an empty spot in the source files, or a missing 0
in the csv.
For 20181106__ks__general__precinct.csv, Wyandotte County uses a different naming convention for the parties compared to the rest of the counties. For example, is uses "DEM" instead of "Democratic". Standardizing this across the counties would be helpful.
placeholder - I am working on this.
Using files from the Kansas sources repo, parse precinct-level files from the following counties for the April 11, 2017 special election for U.S. House district 4:
A few weeks ago in 3adcc35, @karpet removed the TOTAL lines in a few files. I have been using them to do vote checksums (see total-checksum.py in OR). Especially in cases where the data is coming from hand keying, I've found lots of errors with the checksums in the past. But even without that, the checksum can still flag up duplicate or garbage lines, candidate name variations, and a whole bunch of other issues.
I realize I'm new to this state, so I didn't want to step on any toes. But I'm wondering if there's a particular reason to exclude them here?
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.