Git Product home page Git Product logo

cleanr's Introduction

cleanR

cleanR is data cleaning tool designed to simplify and enhance WFP data cleaning process. it enables users to load raw data and perform essential data quality checks including survey time completion, identification of missing data points, handling of non-standard responses, detection of outliers, pinpointing inconsistencies and validation of GPS coordinates. the package adopts variable naming standards from the WFP Survey Designer for consistency and compatibility. at the end, users can efficiently prepare their data for analysis. ensuring accuracy and reliability.

Installation

You can install the development version of cleanR from GitHub:

# install.packages("devtools")
remotes::install_github("axmedmaxamuud/cleanR")

Usage

Step 1: Create blank logbook

This is a basic example of creating a blank logbook that you can use it later to save the observation with issues.

library(cleanR)

# create the logbook
my_logbook <- logbook()

# inspect the head of the logbook
head(my_logbook)
#> [1] uuid          question.name issue         feedback      action       
#> [6] old.value     new.value    
#> <0 rows> (or 0-length row.names)

Step 2: Log issues in the logbook

Now you can inspect the data and use the log_issue() to write the observations of the data that you find with the issues in the logbook.

check_age <- survey_data %>% 
  filter(RESPAge < 15) %>% 
  log_sheet(question.name = "ki_age",
            issue = "respondent is under 15 years old",
            action = "delete")
# inspect the check_age data
head(check_age)
#> [1] uuid          question.name issue         feedback      action       
#> [6] old.value     new.value    
#> <0 rows> (or 0-length row.names)

now you can merge the check_age data with the logbook so you can maintain one logbook that can be exported at the end.

my_logbook <- rbind(my_logbook, check_age)

head(my_logbook)
#> [1] uuid          question.name issue         feedback      action       
#> [6] old.value     new.value    
#> <0 rows> (or 0-length row.names)

Feedback & Contributions

we welcome your feedback and contributions to make cleanR even more interactive and robust. feel free to report issues or contribute to the project by making pull request.

Reference

This work draws inspiration from the HighFrequencyChecks, IMPACT cleaninginspectoR and WFP DataBreiges. by building on the foundation laid by these packages, cleanR seeks to streamline the process and make it relevant to WFP programming.

cleanr's People

Contributors

axmedmaxamuud avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.