si507-finalproject's Issues
Get data from at least one complex source, in a way you have learned this semester, with caching.
- Get Tumblr post data from the Tumblr website via OAuth authentication, then process the data to extract the fields required to create the Post and Info data class instances.
- Store the raw data in files and set up a local file-based caching system that supports cache expiration.
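A file-based cache with expiration, as described above, could be sketched as follows. This is only an illustration: the class name `FileCache`, the `max_age_seconds` parameter, and the JSON file layout are assumptions, not details taken from the project.

```python
import json
import os
import tempfile
import time


class FileCache:
    """Hypothetical JSON-file cache; entries expire after max_age_seconds."""

    def __init__(self, path, max_age_seconds=3600):
        self.path = path
        self.max_age_seconds = max_age_seconds
        self._entries = self._load()

    def _load(self):
        # Start with an empty cache if the file has not been written yet.
        if not os.path.exists(self.path):
            return {}
        with open(self.path, "r", encoding="utf-8") as f:
            return json.load(f)

    def _save(self):
        with open(self.path, "w", encoding="utf-8") as f:
            json.dump(self._entries, f)

    def set(self, key, value, now=None):
        # Record the storage time so get() can decide whether the entry is stale.
        now = time.time() if now is None else now
        self._entries[key] = {"stored_at": now, "value": value}
        self._save()

    def get(self, key, now=None):
        now = time.time() if now is None else now
        entry = self._entries.get(key)
        if entry is None:
            return None
        if now - entry["stored_at"] > self.max_age_seconds:
            # Expired: drop the entry so the caller fetches fresh data.
            del self._entries[key]
            self._save()
            return None
        return entry["value"]


# Usage: cache a (made-up) response under a key chosen by the caller.
demo_path = os.path.join(tempfile.mkdtemp(), "cache.json")
demo_cache = FileCache(demo_path, max_age_seconds=60)
demo_cache.set("blog:example", {"total_posts": 3})
```

A caller would check `get()` first and only make the authenticated request when it returns `None`, then store the response with `set()`.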
Include a full test suite for your project.
- Implement the unit tests in a class derived from unittest.TestCase, with good test coverage of the project.
- Test as much as possible to ensure good code quality.
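A test class of the kind described above might look like this minimal sketch. The `Post` stub here is a hypothetical stand-in so the example runs on its own; the real tests would import the project's classes instead.

```python
import unittest


class Post:
    """Hypothetical stand-in for the project's Post class."""

    def __init__(self, summary):
        self.summary = summary

    def __contains__(self, word):
        # Case-insensitive substring check against the summary.
        return word.lower() in self.summary.lower()


class PostTests(unittest.TestCase):
    def test_contains_finds_word_in_summary(self):
        self.assertIn("cats", Post("Ten photos of cats"))

    def test_contains_ignores_case(self):
        self.assertIn("CATS", Post("Ten photos of cats"))

    def test_contains_rejects_missing_word(self):
        self.assertNotIn("dogs", Post("Ten photos of cats"))


def run_tests():
    # Load and run the test case explicitly, returning the result object.
    suite = unittest.defaultTestLoader.loadTestsFromTestCase(PostTests)
    return unittest.TextTestRunner(verbosity=0).run(suite)
```

In a standalone tests file, calling `unittest.main()` at the bottom is the more common way to run every test case in the module.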
Create a git repository on your computer to build your project, and push it to a GitHub repository of your own.
- Set up the git repo and push the code and README files to it.
- Initial README file.
- Initial Python files: SI507F17_finalproject.py and SI507F17_finalproject_tests.py. The finalproject.py file will cover the data models, the cache system, the website data scraping system, and the database connection utilities. The tests.py file will cover all required unit tests.
- A requirements.txt file from your virtual environment.
- Any .py or other file templates that we have to fill in, e.g. secret_data.py for the website API call credentials, .csv files for local caching, etc.
Details?
You've reflected all the requirements and shown which visualization system you are interested in, but all other details about the project are missing, which makes it impossible to give feedback!
Include some visual representation of your data that is clear
- Learn and use the Plotly library, which has a clear Python API for creating charts and graphs: https://plot.ly/python/
- Use the Plotly Dashboard API to visualize the Tumblr post data as a dashboard that shows the number of posts per blogger and compares posts and info across bloggers.
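As a rough sketch of the "number of posts per blogger" view, the counts can be computed with the standard library and handed to a Plotly bar chart. The function names and the `(blog_name, post_type)` input shape are assumptions for illustration, and this draws a single chart rather than a full dashboard.

```python
from collections import Counter


def count_posts_by_blog(posts):
    """Count posts per blog from (blog_name, post_type) pairs (assumed shape)."""
    return Counter(blog_name for blog_name, _post_type in posts)


def build_bar_figure(counts):
    """Build a bar chart of posts per blog; requires the plotly package."""
    # Imported lazily so the counting helper works without Plotly installed.
    import plotly.graph_objects as go

    names = sorted(counts)
    fig = go.Figure(data=[go.Bar(x=names, y=[counts[n] for n in names])])
    fig.update_layout(
        title="Posts per blog",
        xaxis_title="Blog",
        yaxis_title="Number of posts",
    )
    return fig


# Made-up sample data, only to show the expected input shape.
sample_posts = [("blog-a", "text"), ("blog-a", "photo"), ("blog-b", "text")]
sample_counts = count_posts_by_blog(sample_posts)
```

Calling `build_bar_figure(sample_counts).show()` would open the chart locally, assuming Plotly is installed.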
Implement the classes, functions, and models, and connect to the database
- Implement the Info and Post data classes to model the data.
- Each class should include a repr method that describes the Post/Info data.
- The Post class should also include a contains method that checks whether the post's summary contains given words.
- Use these class definitions for the database models in the program.
- Process the raw data to model the Post data as instances of the classes.
- Connect to the database and read the Info and Post table data.
- Set up Info and Post tables in a database, and store the data instances in them.
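The Post class with its repr and contains methods could be sketched as below. The constructor signature and the assumption that the raw dictionary keys match the column names are illustrative choices, not the project's confirmed design.

```python
class Post:
    """Sketch of a Post model built from one raw post dictionary."""

    def __init__(self, raw, info_id=None):
        # Assumption: raw keys match the column names used for the Post table.
        self.id = raw.get("id")
        self.date = raw.get("date")
        self.summary = raw.get("summary", "")
        self.format = raw.get("format")
        self.short_url = raw.get("short_url")
        self.can_like = raw.get("can_like")
        self.can_reply = raw.get("can_reply")
        self.type = raw.get("type")
        self.info_id = info_id  # links the post to its blog's Info row

    def __repr__(self):
        return "Post(id={!r}, type={!r}, summary={!r})".format(
            self.id, self.type, self.summary
        )

    def __contains__(self, word):
        # Supports `word in post`; case-insensitive match on the summary.
        return word.lower() in (self.summary or "").lower()


def posts_from_raw(raw_posts, info_id=None):
    """Turn a list of raw post dictionaries into Post instances."""
    return [Post(raw, info_id=info_id) for raw in raw_posts]
```

With this shape, `"tumblr" in post` reads naturally at the call site, and an Info class could follow the same pattern with its own fields.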
Database tables, with these columns in each:
Info: id, followed, likes, total_posts, title, url, ask_page_title, name
Post: id, date, summary, format, short_url, can_like, can_reply, type, info_id
- The Post table has a foreign key to the Info table, so we can run JOIN queries across the two tables.
- Set up the DB connection code to connect, insert, update, and query the tables.
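The two tables and the connect, insert, update, and JOIN operations can be sketched with the standard-library `sqlite3` module. SQLite is used here only as a stand-in because the issue does not name a database engine; the column types and helper function names are likewise assumptions.

```python
import sqlite3

# Column names follow the issue text; the types are assumed for illustration.
SCHEMA = """
CREATE TABLE IF NOT EXISTS Info (
    id INTEGER PRIMARY KEY,
    followed INTEGER, likes INTEGER, total_posts INTEGER,
    title TEXT, url TEXT, ask_page_title TEXT, name TEXT
);
CREATE TABLE IF NOT EXISTS Post (
    id INTEGER PRIMARY KEY,
    date TEXT, summary TEXT, format TEXT, short_url TEXT,
    can_like INTEGER, can_reply INTEGER, type TEXT,
    info_id INTEGER REFERENCES Info(id)
);
"""


def connect(path=":memory:"):
    """Open a connection and create both tables if they do not exist."""
    conn = sqlite3.connect(path)
    conn.execute("PRAGMA foreign_keys = ON")  # enforce Post.info_id -> Info.id
    conn.executescript(SCHEMA)
    return conn


def insert_info(conn, name, title, total_posts):
    cur = conn.execute(
        "INSERT INTO Info (name, title, total_posts) VALUES (?, ?, ?)",
        (name, title, total_posts),
    )
    conn.commit()
    return cur.lastrowid


def insert_post(conn, info_id, summary, post_type):
    cur = conn.execute(
        "INSERT INTO Post (info_id, summary, type) VALUES (?, ?, ?)",
        (info_id, summary, post_type),
    )
    conn.commit()
    return cur.lastrowid


def update_summary(conn, post_id, summary):
    conn.execute("UPDATE Post SET summary = ? WHERE id = ?", (summary, post_id))
    conn.commit()


def posts_with_blog(conn):
    """JOIN each post to its blog through the info_id foreign key."""
    rows = conn.execute(
        "SELECT Info.name, Post.summary FROM Post "
        "JOIN Info ON Post.info_id = Info.id ORDER BY Post.id"
    )
    return rows.fetchall()
```

Parameterized queries (the `?` placeholders) keep values from the scraped data out of the SQL text itself.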