Comments (14)
Draw examples from Kieran Healy's socviz package:
- opiates
- gss_sm
- gss_lon
from course-site.
Flagging @YinsuH on this. She's working as an RA for me this summer through SISRM.
This is the website I looked at, which has a few uses of scatterplots. I was not sure whether my concern was significant, so I decided to bring it up anyway :)
@YinsuH I think we'd be okay with the number of observations. But I agree with your concerns about having an appropriate number of categorical variables for some of the examples. I am thinking especially of the computer programming as problem solving page. Could you take a stab at rewriting the examples in the notes folder that currently use diamonds, substituting the penguins dataset?
I think the easiest workflow will be to fork the course-site repo, then edit the .Rmarkdown files directly. Note that if you try to build the entire site, you will need to knit all the R Markdown files in the repo, which will take some time (and probably require you to have additional packages installed). If it's easier, just write a fresh .R script for each page that uses diamonds and rework the code. I/we can update the written narrative later once we know the examples work.
Need to replace
- diamonds
- mtcars
- mpg
- Auto
Questionable datasets
- titanic
- flights
- gapminder
Deadest names - see #115
Still working on this, specifically with Movies and Snapchat data. (Repo is very minimal now.) Will also look into socviz, deadest names and police shootings.
Two other options, would love to hear what you think: palmer penguins instead of diamonds, and recent-ish O'Hare/Midway data using anyflights instead of flights.
EDIT: Also flagging Damon Jones' scrape of UCPD stops as a potential alternative to the WaPo Police shooting dataset.
Palmer penguins is supposed to be a good drop-in replacement for iris. Not sure if it contains sufficient variables to replace diamonds. We'd need to check how diamonds is used on the website to verify the penguins dataset contains appropriate variables.
Chicago flights data would be nice to replace nycflights13, though I think I only use it for one set of exercises for relational joins.
I have looked at the penguins dataset and the lecture notes. I would say the penguins data is viable for most of the operations we need; for instance, it could be used for practicing pipes and writing functions. However, one potentially significant problem is that penguins contains only 344 observations, while diamonds has more than 50,000. Also, the exercises use characteristics like cut and color, which have five and seven levels respectively, but the qualitative variables species and island in penguins each have only three. This reflects less variability in the penguins data, which might cause problems in modeling and make the visualizations less diverse than the figures produced with diamonds.
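For anyone who wants to check these counts directly, here is a minimal sketch. It assumes the ggplot2 and palmerpenguins packages are installed; count_factor_levels is a hypothetical helper written for this comparison, not something from the course site.

```r
# Assumption: ggplot2 and palmerpenguins are installed
library(ggplot2)        # provides the diamonds dataset
library(palmerpenguins) # provides the penguins dataset

# Hypothetical helper: number of levels in each factor column of a data frame
count_factor_levels <- function(df) {
  is_fct <- vapply(df, is.factor, logical(1))
  vapply(df[is_fct], nlevels, integer(1))
}

# Compare the number of observations
nrow(diamonds)
nrow(penguins)

# Compare the number of levels per categorical variable
count_factor_levels(diamonds)
count_factor_levels(penguins)
```

Running this side by side shows how many rows and how many categories per qualitative variable each dataset offers for the examples.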
I don't think we use diamonds in any modelling pages (feel free to correct me if I'm wrong, I just searched for diamonds on the website), so I don't think the sample size should be disqualifying. The lack of levels for categorical variables is definitely a valid concern, though.
@bensoltoff I have created a pull request for the course site. However, this is my first attempt at updating the website, and some of the work might still have problems. I will continue checking it over the next few days. I have also written up a few questions in the pull request post. Please have a look.
Household Pulse Survey - assess impact of COVID-19 on households
Need to replace
- diamonds
- mtcars
  - Need a fully numeric data frame to drop into iteration exercises. Or need to rewrite that exercise
- mpg
- Auto
Questionable datasets
- titanic
- flights
- gapminder
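On the mtcars point, one possible approach (a sketch only, assuming penguins ends up being the replacement and that the palmerpenguins package is installed) is to keep just the numeric columns, which gives a fully numeric data frame for the iteration exercises. The name penguins_num is made up for illustration.

```r
# Assumption: palmerpenguins is installed
library(palmerpenguins)

# Keep only the numeric columns to get a fully numeric data frame
penguins_num <- penguins[vapply(penguins, is.numeric, logical(1))]

# Example iteration: column means, ignoring missing values
col_means <- vapply(penguins_num, mean, numeric(1), na.rm = TRUE)
col_means
```

Note that this keeps year as a numeric column, which may or may not be wanted in an exercise about summarizing measurements.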