Git Product home page Git Product logo

nes's Introduction

National Eutrophication Survey

DOI

This respository contains a digitally transcribed copy (nes_data.csv) of the National Eutrophication Survey (NES) dataset. The original data can be found in the archival_pdfs folder or by searching: https://www.epa.gov/nscep

Locations

Data

variable name description units
pdf pdf identifier (474 - 477) integer
pagenum page number of the pdf integer
storet_code identifier which links measurement to coordinate locations character
state state where the water body resides character
name name of the water body character
county county where the water body resides character
lake_type natural or impoundment character
drainage_area the total drainage area square kilometers
surface_area the area of the water surface sq km
mean_depth the volume of the water body divided by the surface area in square meters meters
total_inflow the mean of the inflows of all tributaries and the immediate drainage cubic meters per second
retention_time a mean value determined by dividing the lake volume, in cubic meters, by the mean annual outflow in cubic meters per unit cof time years or days
retention_time_units the units of time for each retention entry years or days
alkalinity alkalinity milligrams per liter
conductivity conductivity microohms
secchi secchi meters
tp total phosphorus milligrams per liter
po4 orthophosphate milligrams per liter
tin total inorganic nitrogen milligrams per liter
tn total nitrogen milligrams per liter
p_pnt_source_muni municipal point source phosphorus loading kilograms per year
p_pnt_source_industrial industrial point source phosphorus loading kilograms per year
p_pnt_source_septic septic point source phosphorus loading kilograms per year
p_nonpnt_source nonpoint source phosphorus loading kilograms per year
p_total total phosphorus loading kilograms per year
n_pnt_source_muni municipal point source nitrogen loading kilograms per year
n_pnt_source_industrial industrial point source nitrogen loading kilograms per year
n_pnt_source_septic septic point source nitrogen loading kilograms per year
n_nonpnt_source nonpoint source nitrogen loading kilograms per year
n_total total nitrogen loading kilograms per year
p_total_out total phosphorus outlet load kilograms per year
p_percent_retention percent phosphorus retention percent
p_surface_area_loading phosphorus surface area loading grams per square meter per year
n_total_out total nitrogen outlet load kilograms per year
n_percent_retention percent nitrogen retention percent
n_surface_area_loading nitrogen surface area loading grams per square meter per year
lat latitude decimal degrees
long longitude decimal degrees

Workflow

  1. Use the nesR package to generate the files in /02_raw_data.

  2. Hand check the files in /02_raw_data/merged_data against the pdfs in /01_archival_pdfs.

  3. Combine files with 05_analysis_scripts/PDF_Merge.R.

Contributing

We’ve combed the data to try and find all the transciption errors but it’s difficult to catch them all. If you find any errors please open an issue or submit a pull request against the files in /03_qa_data.

References

Stachelek, J., Ford, C., Kincaid, D., King, K., Miller, H. and Nagelkirk, R., 2018. The National Eutrophication Survey: lake characteristics and historical nutrient concentrations. Earth System Science Data, 10(1), pp.81-86. (https://doi.org/10.5194/essd-10-81-2018)

Stachelek, J. (2017). nesR: Scrape Data from National Eutrophication Survey archival PDFs. R package version 0.1. (https://github.com/jsta/nesR) (https://dx.doi.org/10.5281/zenodo.400258)

Stachelek, J. (2017). nesRdata: National Eutrophication Survey Data Package. R package version 0.1. https://github.com/jsta/nesRdata

nes's People

Contributors

cbahlai avatar chanse-ford avatar dustinkincaid avatar hkhorasani avatar jsta avatar kingka21 avatar mill2735 avatar nagelkirk avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

nes's Issues

Add EPA Methods

  • Add EPA methods and compare to current methods
  • Add blurb about current EPA surveys

Convert Google Doc to LaTeX

After we have the MS finalized as a Google Doc, I think it would be a good idea to prepare our submission in LaTeX rather than MS Word given our reproducible and open science focus.

Is anyone willing to do this who is either interested in learning LaTeX (or already knowledgeable in LaTeX)? I can but I don't want to take away the opportunity from someone interested in learning. The process can be done in RStudio (http://keziamanlove.com/latex-in-rstudio/).

Finish maps with labels and correct coordinates

  • Change coordinates to lat long
  • Create legends inside the plot
  • Move title inside the plot (moving these inside the plot allows them to be stacked in a 2x3 plot later.
  • log transform and plot the data

Bad storet code

The storet code for PDF 474, pg 93 was not correctly transcribed in the database. 27C! should be 27C1.

File organization

I think we are getting close to a point where we can finalize the file structure of our data product. My initial thought is to move the files to a structure resembling:

|
|__archival_pdfs
|
|__supporting_data
|
|__raw_data
|	|	
|	|____474
|	|    |__100.csv
|	|    |__101.csv
|	|    |__...
|	|
|	|____475
|	|    |__100.csv		
|
|__parsed_data
|	|	
|	|____474
|
|	    |__100_clean.csv
|__merged_data
|	|	
|	|____474
|	    |__res.csv
|
|__qa_data
|	|	
|	|____474
|	    |__res_review.csv
|
|__analysis_scripts
|
|__nes_data.csv
|
|__README.md

Does anyone have any thoughts or suggestions on this?

Research questions

Here are some potential research questions:

  • Are differences between the 2007 NLA and the NES related to specific factors?

  • How do depth estimates compare with more recent depth model predictions?

  • How does the NES compare with the 2012 NLA results?

Please add more below!

Create 3 Tables of Summary Statistics

Create tables of summary statistics (mean, sd, n) for each region and lake type. The tables will be separated by physical, chemical, and loading variables.

Syncing Problems

Hi all,

For some reason I'm having sync errors with csv's 476 and 477. I don't want to do anything with those, so I just shut down github desktop. Is anyone else having issues when the sync?

  • Ryan

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.