Git Product home page Git Product logo

openelections-data-ks's People

Contributors

brentajones avatar burace17 avatar dwillis avatar gvpeek avatar karpet avatar mikegaughan avatar nk9 avatar taylorterry3 avatar warwickmm avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

openelections-data-ks's Issues

Rice 2016 general results have 2 mistakes

These two errors have been verified against the official results PDF.

==> 20161108__ks__general__rice__precinct.csv
ERROR: candidate total incorrect, line 183. 2979 != 2980
{'county': 'Rice', 'precinct': 'Total', 'office': 'U.S. Senate', 'district': '', 'party': 'R', 'candidate': 'Jerry Moran', 'votes': 2979}
ERROR: candidate total incorrect, line 270. 732 != 738
{'county': 'Rice', 'precinct': 'Total', 'office': 'U.S. House', 'district': 1.0, 'party': 'IND', 'candidate': 'Alan LaPolice', 'votes': 732}

Precinct names

I am looking at the precinct names in 2016/20161108__ks__general__douglas__precinct.csv and I am trying to understand where they are coming from. The presidential results source file in https://github.com/openelections/openelections-sources-ks/blob/master/2016%20general%20precinct/2016%20General%20Election%20-%20President%20Results%20by%20Precinct.xlsx for example, has nothing like this line:

Douglas,Prec 67-78,President,,write-in,Evan McMullin,3

I can't correlate Prec 67-78 to any real precinct in Douglas county.

I can kind of squint and see how Lawrence Precinct 46 S3 might be shortened to Prec 46 but it is spelled out in the source xlsx file, so it makes me wonder: how was this data-ks/2016/... file created in the first place?

Convert 2014 general election files

If you want to work on a county, add a comment saying which one you'd like to work on or email [email protected]. You can either email us finished CSV files or submit a pull request, whatever is easiest.

The source files you'll be converting can be found here. Many of these are either PDFs or Excel files, and in some cases you'll need to convert a number of office-specific files. For electronic PDFs, we recommend using Tabula, which is free, to extract data. The goal is to create a single CSV file for each county, with the following headers:

county, precinct, office, district, party, candidate, votes

For the following offices: President, U.S. Senate, U.S. House, State Senate, State House. You can use the Kiowa County file as an example of how things should look.

  • Allen
  • Anderson
  • Atchison
  • Barber
  • Barton
  • Bourbon
  • Brown
  • Butler
  • Chase
  • Chautauqua
  • Cherokee
  • Cheyenne
  • Clark
  • Clay
  • Cloud
  • Coffey
  • Comanche
  • Cowley
  • Crawford
  • Decatur
  • Dickinson
  • Doniphan
  • Douglas
  • Edwards
  • Elk
  • Ellis
  • Ellsworth
  • Finney
  • Ford
  • Franklin
  • Geary
  • Gove
  • Graham
  • Grant
  • Gray
  • Greeley
  • Greenwood
  • Hamilton
  • Harper
  • Harvey
  • Haskell
  • Hodgeman
  • Jackson
  • Jefferson
  • Jewell
  • Johnson
  • Kearny
  • Kingman
  • Kiowa
  • Labette
  • Lane
  • Leavenworth
  • Lincoln
  • Linn
  • Logan
  • Lyon
  • McPherson
  • Marion
  • Marshall
  • Meade
  • Miami
  • Mitchell
  • Montgomery
  • Morris
  • Morton
  • Nemaha
  • Neosho
  • Ness
  • Norton
  • Osage
  • Osborne
  • Ottawa
  • Pawnee
  • Phillips
  • Pottawatomie
  • Pratt
  • Rawlins
  • Reno
  • Republic
  • Rice
  • Riley
  • Rooks
  • Rush
  • Russell
  • Saline
  • Scott
  • Sedgwick
  • Seward
  • Shawnee
  • Sheridan
  • Sherman
  • Smith
  • Stafford
  • Stanton
  • Stevens
  • Sumner
  • Thomas
  • Trego
  • Wabaunsee
  • Wallace
  • Washington
  • Wichita
  • Wilson
  • Woodson
  • Wyandotte

Convert 2016 general election files

If you want to work on a county, add a comment saying which one you'd like to work on or email [email protected]. You can either email us finished CSV files or submit a pull request, whatever is easiest.

The source files you'll be converting can be found here. Many of these are either PDFs or Excel files, and in some cases you'll need to convert a number of office-specific files. For electronic PDFs, we recommend using Tabula, which is free, to extract data. The goal is to create a single CSV file for each county, with the following headers:

county, precinct, office, district, party, candidate, votes

For the following offices: President, U.S. Senate, U.S. House, State Senate, State House. You can use the Kiowa County file as an example of how things should look.

  • Allen
  • Anderson
  • Atchison
  • Barber
  • Barton
  • Bourbon
  • Brown
  • Butler
  • Chase
  • Chautauqua
  • Cherokee
  • Cheyenne
  • Clark
  • Clay
  • Cloud
  • Coffey
  • Comanche
  • Cowley
  • Crawford
  • Decatur
  • Dickinson
  • Doniphan
  • Douglas
  • Edwards
  • Elk
  • Ellis
  • Ellsworth
  • Finney
  • Ford
  • Franklin
  • Geary
  • Gove
  • Graham
  • Grant
  • Gray
  • Greeley
  • Greenwood
  • Hamilton
  • Harper
  • Harvey
  • Haskell
  • Hodgeman
  • Jackson
  • Jefferson
  • Jewell
  • Johnson
  • Kearny
  • Kingman
  • Kiowa
  • Labette
  • Lane
  • Leavenworth
  • Lincoln
  • Linn
  • Logan
  • Lyon
  • McPherson
  • Marion
  • Marshall
  • Meade
  • Miami
  • Mitchell
  • Montgomery
  • Morris
  • Morton
  • Nemaha
  • Neosho
  • Ness
  • Norton
  • Osage
  • Osborne
  • Ottawa
  • Pawnee
  • Phillips
  • Pottawatomie
  • Pratt
  • Rawlins
  • Reno
  • Republic
  • Rice
  • Riley
  • Rooks
  • Rush
  • Russell
  • Saline
  • Scott
  • Sedgwick
  • Seward
  • Shawnee
  • Sheridan
  • Sherman
  • Smith
  • Stafford
  • Stanton
  • Stevens
  • Sumner
  • Thomas
  • Trego
  • Wabaunsee
  • Wallace
  • Washington
  • Wichita
  • Wilson
  • Woodson
  • Wyandotte

2014 results are wrong

I am working through 2014 results and there are some systemic mistakes which make me suspicious of the whole.

For example, in Douglas county there are references to "Paul Davis" as the Democratic candidate for US Senate. He was not. He ran for Governor.

In the same file, for the Governor office the precinct vote counts are swapped for the Democrat and the Republican.

I have source files from the KS SOS and am going to reparse from source. I will try and preserve any totals in the originals per discussion in #39

Bourbon, Cherokee 2016, 2012 have strange data

The verifier is complaining because 20161108__ks__general__bourbon__precinct.csv has 35 duplicate rows. While investigating this, I discovered that there are some precincts supposedly called "Fort Scott Ward 1 Exclave Halls" and the like. This precinct name appears in the 2012 general too. However, the PDF of the precinct-level results doesn't list these, and the vote totals are all 0. Probably they should be removed, but I can't find the original source of them, so I wasn't sure.

Also, the 2016 file seems to have the official precincts of "E Marion" and "W Marion" from the PDF combined into a single "Marion Township." Probably we should use the same divisions that the county used that year?

I'm seeing very similar lines in 20161108__ks__general__cherokee__precinct.csv as well.

Party naming consistency for 2018 General

For 20181106__ks__general__precinct.csv, Wyandotte County uses a different naming convention for the parties compared to the rest of the counties. For example, is uses "DEM" instead of "Democratic". Standardizing this across the counties would be helpful.

Candidate totals—keep?

A few weeks ago in 3adcc35, @karpet removed the TOTAL lines in a few files. I have been using them to do vote checksums (see total-checksum.py in OR). Especially in cases where the data is coming from hand keying, I've found lots of errors with the checksums in the past. But even without that, the checksum can still flag up duplicate or garbage lines, candidate name variations, and a whole bunch of other issues.

I realize I'm new to this state, so I didn't want to step on any toes. But I'm wondering if there's a particular reason to exclude them here?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.