Git Product home page Git Product logo

data-wrangling's People

Contributors

andrewejaffe avatar avahoffman avatar carriewright11 avatar jewellsean avatar jtleek avatar muschellij2 avatar srenan avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

data-wrangling's Issues

factors?

I feel like we might want to mention this somewhere...

maybe in the summarize lecture? an example kinda like with the last lab question?

or do we want a module?

introduction

  • update the topics covered to remove genomics and move github to the bottom
  • add more about jargon? rows, columns, function, and package?
  • remove some slides that make me nervous haha

data IO

Lab 1

  • gah sorry I guess I messed this up by showing it programatically too early (somehow I thought I heard you say it but I don't think so).. maybe we should add directions to the lab about specifically using the GUI and going to file --> import dataset ect. (or have them do both if we add a tad more about how to do it programmatically in R in the lecture? - maybe even just copy and pasting from R when using the GUI?

Then part2 could then focus on using paths special imports and output?

Maybe for part one just use the urls if we did that

Q3 of the lab key has recode reversed

South_West <- ufo_clean %>% filter(state %in% c("tx", "nm", "ut")) %>%
  mutate(recode(state, "Texas" = "tx",
                        "New_Mexico" = "nm",
                        "Utah" = "ut"))
South_West

Need to reverse new and old names

data io lab part 2

seems to mostly be about naniar.. but now that will come later... so probably want to practice the data io stuff more?

Update slides

Split up some of Jeff's google slides into smaller pieces

cleaning lab changes

reshaping lab says to use pivot_wider instead of pivot_longer for Q2. For cleaning lab add hint and explanation about str_detect vs str_subset

cheatsheets?

what if we made a single cheatsheet for the class? (or we could make multiple cheatsheets.

This is just an icing on the cake idea :P

Duplication of both data sets when joing

data_As <- tibble(State = c("Alabama", "Alaska", "Alaska"),
                 state_bird = c("wild turkey", "willow ptarmigan", "puffin"))
data_cold <- tibble(State = c("Maine", "Alaska", "Alaska"),
                    vacc_rate = c("32.4%", "41.7%", "46.2%"),
                    month = c("April", "April", "May"))

missing data

equal sign for assignment on one of the first slides

Admin To do for 2023

  • Make a release for last year material
  • Zoom link
  • Edit welcome_email.txt - to be used as Slack message
  • Update index for software and any schedule changes
  • Update lecture_notes/setup
  • Send out welcome message with zoom link
  • Go through the index page and make sure the links are up to date (Ava)
  • Add disclaimer to the welcome message that materials may be updated between now and the first class session

data io part 1

we describe getting packages on anvil... do we want to?

Filtering for "4" when it's in a character type column

In the subsetting lab part 1:

mpg %>% filter(displ > 4 & drv == 4)
mpg %>% filter(displ > 4 & drv == "4")

Both of these yield the same results. Might be worth some explanation that as numbers can be quoted or not, regardless of the column data class

advanced data IO

Maybe this should happen after the functional programming module, where lists are introduced?

Also can maybe update to show how you could use purrr instead of the for loop

jhur?

I put the bike data in the missing data lecture... don't know if we introduce this package earlier or not

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.