cont-limno / lagosne
Interface to the LAke multi-scaled GeOSpatial & temporal database :earth_americas:
Home Page: https://cont-limno.github.io/LAGOSNE/
@jsta -- I just tried to get Emily Stanley (@ehstanley) set up with the R package and she doesn't have access. How do I change this?
Tables "lakes4ha.buffer500m" and "lakes4ha.buffer100m" do not import with column names (column names in row 1).
This was present in the legacy code base.
Refactoring the code will need to be done carefully.
The vignettes as currently written require that the full LAGOS dataset is installed and available to lagos_load(). I think this is good because it allows us to describe the contents of the data product. However, automated build testing via CI services (and eventually CRAN) breaks because they don't (and probably should not) have the data available. The only solution I can come up with is to make the vignettes static: pre-build all the vignette figures and tables and display the code chunks without running them (eval = FALSE).
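A static chunk pair might look roughly like this; the table name, figure path, and lagos_load() arguments are assumptions for illustration, not the package's confirmed API:

```r
# Chunk 1: shown to the reader but never run on CI/CRAN (chunk header sets eval = FALSE)
library(LAGOS)
lg <- lagos_load(version = "1.054.1")  # requires the full dataset locally
head(lg$epi_nutr)

# Chunk 2: displayed with echo = FALSE; shows a figure pre-built by the maintainers
knitr::include_graphics("figures/epi_nutr_preview.png")
```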
Need to sanitize inputs
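A minimal sketch of what that could look like for user-supplied table names; the helper and the whitelist are hypothetical:

```r
# Validate a user-supplied table name against a known whitelist before using it
lagos_table <- function(lg, table_name) {
  valid <- c("epi_nutr", "lakes_limno", "locus")  # illustrative table names
  table_name <- match.arg(table_name, choices = valid)
  lg[[table_name]]
}
```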
In tables lakes4ha_buffer100m and lakes4ha_buffer500m, the column is named "lagoslakeid" after importing through the package, but the column names in geo v1.040 are "lakes4ha_buffer100m_lagoslakeid" and "lakes4ha_buffer500m_lagoslakeid".
See #15
Some functions have a title in the documentation and others do not. Consider following the rules at: http://style.tidyverse.org/code-documentation.html
Something like the first 10 lines of every table
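If lagos_load() returns a named list of data frames, a preview helper could be as small as this (function name is hypothetical):

```r
# Return the first n rows of every table in the compiled dataset
lagos_preview <- function(lg, n = 10) {
  lapply(lg, utils::head, n = n)
}
```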
the data compilation functions will need to be updated
The keywords in LAGOS:::keyword_partial_key() are not included in any of the user-facing documentation. For example, ?locus does not yield any results. Also: buffer100m.lulc, buffer500m.lulc, lakes.geo, lagos_source_program.
At the very least, make preprocessing functions operate on column names and not column numbers
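For example (the object and every column name other than lagoslakeid are illustrative):

```r
tbl <- geo_tables[["lakes4ha_buffer100m"]]

# fragile: breaks when columns are reordered or added upstream
# ids <- tbl[, 1]

# robust: indexes by name
ids <- tbl[, "lagoslakeid"]
lulc_cols <- grep("^nlcd", names(tbl), value = TRUE)
lulc <- tbl[, c("lagoslakeid", lulc_cols)]
```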
A lot of this code may have reinvented the wheel. It is likely that we can simplify a lot of this by wrapping the dplyr package.
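A hedged sketch of the kind of wrapper that could replace hand-rolled merge loops, assuming most of the preprocessing is joins and de-duplication on lagoslakeid:

```r
library(dplyr)

# Join a limno table to a geo table and drop duplicate rows
lagos_join <- function(limno, geo, by = "lagoslakeid") {
  limno %>%
    left_join(geo, by = by) %>%
    distinct()
}
```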
LAGOS:::lagos_compile(version = "1.054.1", format = "rds") fails with error message:
Error in gzfile(file, mode) : cannot open the connection
In addition: Warning message:
In gzfile(file, mode) : cannot open compressed file 'C:\Users\Samantha\AppData\Local\LAGOS\LAGOS/data_1.054.1.rds', probable reason 'No such file or directory'
sessionInfo()
R version 3.3.1 (2016-06-21)
Platform: x86_64-w64-mingw32/x64 (64-bit)
Running under: Windows 7 x64 (build 7601) Service Pack 1
locale:
[1] LC_COLLATE=English_United States.1252  LC_CTYPE=English_United States.1252
[3] LC_MONETARY=English_United States.1252 LC_NUMERIC=C
[5] LC_TIME=English_United States.1252
attached base packages:
[1] stats graphics grDevices utils datasets methods base
other attached packages:
[1] LAGOS_1.054.1
loaded via a namespace (and not attached):
[1] lazyeval_0.2.0 magrittr_1.5 R6_2.1.2 assertthat_0.1 rsconnect_0.4.3
[6] DBI_0.5-12 tools_3.3.1 dplyr_0.5.0 rappdirs_0.3.1 tibble_1.1
[11] Rcpp_0.12.7
rappdirs::user_data_dir("LAGOS")
"C:\Users\Samantha\AppData\Local\LAGOS\LAGOS"
In addition to the ability to select columns by name, allow the user to select pre-defined groups of columns within each table. For example, an "atmospheric deposition" group that may include multiple variables.
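A minimal sketch of how the groups might be wired up; the group names and member columns are placeholders, not the real LAGOS variable names:

```r
# Named groups of related columns
column_groups <- list(
  "atmospheric deposition" = c("no3_deposition", "totaln_deposition"),
  "land use"               = c("nlcd_pct_forest", "nlcd_pct_agriculture")
)

# Select a group from a table, keeping the lake identifier
lagos_select_group <- function(tbl, group) {
  cols <- intersect(column_groups[[group]], names(tbl))
  tbl[, c("lagoslakeid", cols), drop = FALSE]
}
```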
This is the bulk of the old LAGOS R package, but it could be much cleaner if functions were used to reduce redundancy rather than copy-paste.
Store data using https://github.com/hadley/rappdirs
Possibly implement some of the ideas in https://github.com/richfitz/datastorr/blob/master/vignettes/datastorr.Rmd
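A minimal sketch of the rappdirs side, consistent with the path shown in the bug report above; the file-naming scheme is an assumption:

```r
library(rappdirs)

# Path to the compiled dataset under the per-user data directory
lagos_path <- function(version = "1.054.1") {
  file.path(user_data_dir("LAGOS"), paste0("data_", version, ".rds"))
}

# After compiling: saveRDS(compiled, lagos_path())
# On subsequent loads:
lg <- readRDS(lagos_path())
```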
epi.nutr -> epi_nutr
lakes.limno -> lakes_limno
exactly which ones is an open question
sampling event, lagoslakeid, etc.
LAke multi-scaled GeOSpatial & temporal database -> Tools for Interacting with the Lake Multi-scaled Geospatial and Temporal Database
@jsta, where is the best place for this information? As a user, I could imagine wanting this in two places: 1) some sort of documentation listing each variable within each table, along with some metadata (units, plain English description, etc.). We could have documentation for each table, but where (in the package structure) would this go? 2) in a table format, similar to the info table that is currently imported with the rds file.
Add checks to see if data are already loaded. This is what datastorr/storr does well.
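A lightweight stand-in for that behaviour, caching the loaded object in a package-level environment so repeated calls skip the disk read (function and object names are illustrative):

```r
.lagos_cache <- new.env(parent = emptyenv())

lagos_load_cached <- function(version = "1.054.1") {
  key <- paste0("data_", version)
  if (!exists(key, envir = .lagos_cache, inherits = FALSE)) {
    path <- file.path(rappdirs::user_data_dir("LAGOS"),
                      paste0(key, ".rds"))
    assign(key, readRDS(path), envir = .lagos_cache)
  }
  get(key, envir = .lagos_cache, inherits = FALSE)
}
```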
Should this be done at the table level or the module level?
A nice feature of the package might be to compile ALL previously published LAGOS data (I think all of it is on the LTER portal). So the function could be something like lagos_published(paper = c("Oliver2015", "Lottig 2014"))
We might consider adding additional alias terms for the datasets. For example, we might list "chla", "colora", and "doc" as aliases for the epi.nutr table. This would enable searches like ??LAGOS::chla.
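One way to do this with roxygen2 is to add @aliases to a data documentation topic, so help searches on the variable names resolve to the table; the topic title and name below are illustrative:

```r
#' Epilimnion water chemistry (epi.nutr)
#'
#' @name epi_nutr
#' @aliases chla colora doc
#' @docType data
#' @keywords datasets
NULL
```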
See #22, and the output of devtools::test(). Need to throw helpful error messages when queries try to return data that does not exist.
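A sketch of the kind of message that would help, assuming a query helper that looks tables up by name (the helper is hypothetical):

```r
lagos_query <- function(lg, table_name) {
  if (!table_name %in% names(lg)) {
    stop("Table '", table_name, "' not found. Available tables: ",
         paste(names(lg), collapse = ", "), call. = FALSE)
  }
  lg[[table_name]]
}
```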
This was done to be consistent with the legacy loading scripts but can be a real pain for analysis. Is there any reason not to load as character strings instead?
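If the pain point is factor columns coming from the legacy read.table()/read.csv() calls (an assumption about how the raw files are read), the change is a single argument:

```r
# Keep text columns as character rather than factor when reading the raw files
tbl <- read.csv("epi_nutr.csv", stringsAsFactors = FALSE)
```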
One idea might be to use the units package
Inquiring minds want to know.
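A tiny sketch with the units package mentioned above; the variable and the unit choices are illustrative:

```r
library(units)

# Attach a measurement unit to a numeric vector
tp <- set_units(c(12.5, 30.1), mg/L)
tp * 2              # arithmetic carries the unit along
set_units(tp, g/L)  # unit conversion
```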