Git Product home page Git Product logo

sql-db-setup's Introduction

SQL & Relational Databases

Relational databases are the backbone of data science and the language that we use to communicate with them is called SQL. The SQL test is a common component for data-adjacenemnt jobs within industry, government and the education sector. It is a useful tool that some argue spawned the field of data science. Before Big Data was a thing, Knowledge Discovery in Databases (KDD) used simple SQL queries to investigate and understand the nature of the large amounts of data that were being collected by governments and companies. The humble SQL test now torments the budding data scientist as a right of passage in the job search process.

In this unit you will learn the basic ideas behind relational databases and SQL. You will setup a database in Amazon Web Services and then connect to it through RStudio. You will then load data into your database and run SQL queries on that data.

Goals for this Unit

  • Be able to discuss an overview of relational datanases and the purpose of SQL
  • Be able to spin up an AWS instance and load a SQL database into it
  • Be able to connect to the database through RStudio/R using the DBI package
  • Be able to run basic SQL commands in RStudio using the RMySQL package

Resources

Video 1

Introduction to Relational Databases & SQL

Transcript

Video Slide Deck

Video 2

AWS Setup Instructions

SQL Cheat Sheet

Overview of Amazon Web Services

AWS
AWS China

Project: AWS Database and SQL

Please fork and clone this repository to your computer. If you are unfamiliar with this process you must sign up for office hours.

Then you will need to create an account with AWS. This will require a credit card although we will only be using free services. If you already have an Amazpon account you can use that account.

Please create the account through the regional website for your location.

Once you have created an account follow the directions below, these steps are also shown in the video above.

Step by Step to Create MySQL Instance on Amazon Web Services

  • Log into your AWS Management Console
  • Locate RDS under the Databases heading
  • Within Amazon RDS click Create database
  • Under Choose a database creation method click Standard Create
  • Under Engine options choose MySQL
  • Under Templates choose Free tier
  • Under Settings name your DB instance identifier as database-1
  • Under Credential settings create a username and password combination and write it down (you will need it later)
  • Under Connectivity expand Additional connectivity configuration to show additional menu items and make sure that Publicly accessible is checked Yes
  • Expand the Additional configuration menu
  • Under Initial database name write oudb
  • Uncheck Automatic backups
  • Click Create database
  • Once the database is created, take a screenshot and add it to your repository

Modify Security Group

  • Under Security Groups click Inbound and then Edit
  • Add the rule SQL/Aurora on Port 3306 with the Connection of MyIP

RStudio

  • Open the sql-project.Rmd file in RStudio and follow the directions.

Submission

  • Once you have completed the project please commit and pull request the repository back to the main branch. Please be sure to include your Rmd file and your screenshot of the AWS console page. The due date for this project is January 27 by 5:00pm EDT. Don't forget to delete your AWS database so you don't get charged any money!

Knowledge Check

After you submit please complete the knowledge check quiz located here

sql-db-setup's People

Contributors

charles-lang avatar guoliangxu avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.