Git Product home page Git Product logo

gutenberg_scifi's Introduction

Scifi Project

Analyze scifi books on Project Gutenberg

Goals

Collect, wrangle, analyze and present insights on a web app from freely available works of science fiction on Project Gutenberg.

To Set Up

- pip install requirements into a virtualenv called scifi
- pip install gutenberg

To Run

Run in terminal by admin_gender.py

To Do

  1. Check to see if csv file exists already. if yes, skip to step 4. if no, go to step 2 and 3 [done].
  2. Go to Project Gutenberg scifi bookshelf [done].
  3. Scrape form the bookshelf the names, title, and urls of all scifi books and add them to a csv file [done].
  4. Access and read the csv file [done].
  5. For reach url in the file access the book text using the gutenberg module [done].
  6. Read each text and identify the male-female gendered words ratio [done].
  7. Add the {author:{title, ratio} to json data source[done].
  8. Test code with larger data set.
  9. Build a web app with Flask.
  10. Create unit tests.
  11. Add more gendered words to gender_words.py.

gutenberg_scifi's People

Contributors

ndanielsen avatar samxj avatar skasim avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.