This is the course site for INFO 5940 - Computing for Information Science.
Add at least one day of coverage for spatial data analysis using R. Cover basic data structures and visualization methods. No spatial regression or other complex topics; stick to basics like mapmaking.
See Stat 545 for inspiration. Don't need all of this though.
With the tidyverse
one
Demo and practice flexdashboards
Write a brief guide for how to ask a question on GitHub. Include
Analysis of age as a factor in NASA astronaut selection and career landmarks. Includes reproducible dataset to predict which astronaut applicants are selected for NASA training.
ggplot2 objects to ggplotly() graphs
Need a shell tutorial
render.sh correctly converts Jupyter Notebooks to Markdown but does not include a document title, which makes the rendered HTML file look funny. Do notebooks come with titles that can be auto-rendered or forced via the command line, or do I need to convert the first chunk title to the document title?
Takes way too long to finish all 4 in class
Define a format for how to write R scripts (headers mainly). Add to hw00_homework_guidelines.Rmd
manifestoR
Page has now changed format. Probably try and find an easier example
Handwritten Digit Recognition (CLASSIFICATION PROBLEM)
Demonstrate the capabilities of statistical learning for a multi-class problem. Integrate into cm012
manifestoR
Something is hinky in stat005_resampling.Rmd.
These should be nearly identical, not such a large variation. Why?
Make clear that narratives matter in submission
Put under additional resources
Based on reprex. Add to lab01 or sometime in the first week of the course
Store in a new repo under uc-cfss. Linked to via https://uc-cfss.github.io/fall2016
Use Jenny's data from the repurrrsive package to write a practice lesson on parsing recursive lists in R
Worth covering? If so, stash in week 9
With the Datasaurus Dozen
Revise for each course module
Add POS taggers and other demo packages.
Rather than just software install support, structure it a la Jenny Bryan's intro to R basics. Need to cover:
.Rdata)
.R scripts and .Rmd R Markdown documents
Need to incorporate units on
Cannot cram this into a single module. How can I adjust the rest of the schedule to make it work?
Other ideas???
Replace 1:10 syntax with seq(from = 1, to = 10) syntax; much clearer to students
Rewrite the lesson to use a unique app. Find something using more social scientific data
In-class exercise idea for sentiment analysis: https://paulvanderlaken.com/2017/08/03/harry-plotter-celebrating-the-20-year-anniversary-with-tidytext-the-tidyverse-and-r/
Aka scoped verbs. See here for some documentation. Does it make sense to incorporate into conditional execution in cm008?
Add section on using random forests/SVMs to classify documents using text features
Needs more social scientific data examples to practice skills, not just datasets of convenience
Any relevance to text mining lessons?
x > 5
x[x > 5]
Shifted to a paid API with private keys. Not a good, simple example anymore. Need to find a replacement that is free