keyword-trends's People

Contributors

brezniczky

keyword-trends's Issues

Share chart x axis

Dates need to be fixed up, they're incorrect (they should cover the same date range as on the right chart).

Create configs for scraping

scrapers/ contains 4 files, one for each (source, document) combination. These could be replaced by a config file and a generic, configurable implementation.
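A minimal sketch of what such a config-driven scraper could look like. Everything named here is an assumption made for illustration: the `ScrapeConfig` fields, the example.com URL template and the count regex are hypothetical and do not describe the existing files in scrapers/.

```python
import re
from dataclasses import dataclass
from typing import List
from urllib.parse import quote_plus


@dataclass(frozen=True)
class ScrapeConfig:
    """One entry per (source, document) combination (hypothetical schema)."""
    source: str          # e.g. the site being scraped
    document: str        # e.g. the kind of page or listing on that site
    url_template: str    # "{keyword}" is replaced by the URL-encoded keyword
    count_pattern: str   # regex with one capture group holding the hit count


# Placeholder entry; real entries would mirror the 4 existing scrapers.
CONFIGS: List[ScrapeConfig] = [
    ScrapeConfig(
        source="example-source",
        document="example-document",
        url_template="https://example.com/search?q={keyword}",
        count_pattern=r"([\d,]+) results",
    ),
]


def build_url(config: ScrapeConfig, keyword: str) -> str:
    """Fill the keyword into the configured URL template."""
    return config.url_template.format(keyword=quote_plus(keyword))


def extract_count(config: ScrapeConfig, page_text: str) -> int:
    """Pull the hit count out of a fetched page using the configured regex."""
    match = re.search(config.count_pattern, page_text)
    if match is None:
        raise ValueError(f"no count found for {config.source}/{config.document}")
    return int(match.group(1).replace(",", ""))
```

Fetching is deliberately left out, so the same two functions can serve every (source, document) pair and be tested without network access.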

Add alpha-faded chart lines for less reliable regions

The current windowed filter needs data on both sides of each point, so the edges (half a year on each side) are cut off from the charts. Although the values there are less reliable, the relationships are interesting and may suggest important, albeit somewhat speculative, trend shifts. An opacity proportional to the filtering window size seems a feasible way to visualise these additional values.
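One way the opacity could be derived, as a sketch only: it assumes a centred window of `half_window` points on each side and takes the alpha as the fraction of the full window that is actually available at each position. The function name and the linear scaling are assumptions, not the project's current filter.

```python
from typing import List


def edge_alphas(n_points: int, half_window: int) -> List[float]:
    """Return one opacity per data point, in the range (0, 1].

    Points with a complete window get 1.0; points near either edge get
    an alpha proportional to the number of samples their truncated
    window still covers.
    """
    full_size = 2 * half_window + 1
    alphas = []
    for i in range(n_points):
        lo = max(0, i - half_window)             # first index inside the window
        hi = min(n_points - 1, i + half_window)  # last index inside the window
        alphas.append((hi - lo + 1) / full_size)
    return alphas
```

The resulting values can be passed per line segment to whatever plotting layer is in use, so the reliable middle stays fully opaque and the edges fade out.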

Add as a share of ... charts

Possibly it is the GitHub explosion (if there is one) that should be accounted for in the first place, i.e. the number of relevant repositories probably increases because the total number of users increases.
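The normalisation could be as simple as dividing each keyword count by the total for the same period. A sketch, assuming both series are keyed by the same period labels (the dict layout is an assumption):

```python
from typing import Dict


def as_share(keyword_counts: Dict[str, int],
             total_counts: Dict[str, int]) -> Dict[str, float]:
    """Express each period's keyword count as a fraction of the total.

    Periods with a missing or zero total are skipped rather than guessed.
    """
    shares = {}
    for period, count in keyword_counts.items():
        total = total_counts.get(period)
        if not total:
            continue
        shares[period] = count / total
    return shares
```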

Consistent color-coding across charts

Charts are becoming more telling and easier to grasp. The need for colour coding is growing, as it is now too easy to overlook subtle differences in the legend.
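A sketch of a shared colour map that every chart could look up. The palette values and the function name are assumptions; the point is only that the mapping is built once from the full set of series names, so the same series gets the same colour everywhere.

```python
from typing import Dict, Iterable

# Illustrative palette; none of these is white, so no series can vanish
# against a white chart background.
PALETTE = ["#1f77b4", "#ff7f0e", "#2ca02c", "#d62728", "#9467bd", "#8c564b"]


def assign_colours(series_names: Iterable[str]) -> Dict[str, str]:
    """Map each series name to a colour, independent of input order.

    Names are sorted first, so calling this with the same set of names
    always yields the same mapping. Colours repeat if there are more
    series than palette entries.
    """
    names = sorted(set(series_names))
    return {name: PALETTE[i % len(PALETTE)] for i, name in enumerate(names)}
```

The mapping stays stable only while the set of names is unchanged, so it should be built from all series across all charts rather than per chart.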

Compare against IEEE rankings

Need to dig into IEEE's methodology, and check out how much less robust the purely GitHub based observations are against the appearance of new courses and other (perhaps unknown) events.

Use GitHub API to retrieve data

Use the GitHub API, possibly with OAuth, to generate finer-grained results, still over 1-year windows to partially eliminate the effects of seasonality.
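The finer-grained, still yearly, windows could be generated as below: each window spans one year, and consecutive windows start one month apart. This is a sketch; the monthly step and the function names are assumptions. The output uses the `created:START..END` range syntax of GitHub search.

```python
from datetime import date, timedelta
from typing import List


def add_months(d: date, months: int) -> date:
    """Return the first day of the month `months` after d's month."""
    index = d.year * 12 + (d.month - 1) + months
    return date(index // 12, index % 12 + 1, 1)


def created_qualifiers(first_month: date, n_windows: int,
                       step_months: int = 1) -> List[str]:
    """Build 'created:' qualifiers for sliding 1-year windows."""
    qualifiers = []
    for i in range(n_windows):
        start = add_months(first_month, i * step_months)
        end = add_months(start, 12) - timedelta(days=1)  # last day of the window
        qualifiers.append(f"created:{start.isoformat()}..{end.isoformat()}")
    return qualifiers
```

Each qualifier can be appended to a search query; because every window covers a full year, month-to-month seasonality is averaged out while the monthly step gives the finer resolution.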

Refactor

(Because it's a big copy-paste haggis at the minute, obviously.)

Revise the entire text

The context is the new diagrams and potentially new findings due to SO usage info.

Don't forget to update the server document with the notion of StackOverflow.

Add JuPyTer notebook

Jupyter Notebook apparently does very well on GitHub, despite (or precisely because of) being, in a sense, a wrapper around other languages.

language:"Jupyter notebook"

White Ruby

Looks like Ruby is invisible and/or white on the Coursera chart.

Use a version trackable data storage format

Modifications to the .RData file cannot be verified by review.

It would be easier if the gathered data were stored in JSON files, or if we returned to the CSVs as they were cached earlier, perhaps adding a master document. The current volume of data allows for either.

(An alternative is to create an .RData plugin for e.g. Meld and then things become a bit better but that's a bit of sci-fi for this project I guess... there may be similar formats with native GitHub/Meld diff viewer support on the other hand - another approach to take.)
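A sketch of a diff-friendly CSV export. The column names (`source`, `keyword`, `period`, `count`) are assumptions about what the gathered data contains; the relevant part is the fixed column order and sorted rows, which keep diffs limited to the values that really changed.

```python
import csv
import io
from typing import Dict, List

FIELDS = ["source", "keyword", "period", "count"]  # hypothetical columns


def to_csv_text(rows: List[Dict[str, object]]) -> str:
    """Serialise rows to CSV with a stable column and row order."""
    ordered = sorted(rows, key=lambda r: (r["source"], r["keyword"], r["period"]))
    buffer = io.StringIO()
    # A fixed line terminator avoids platform-dependent line-ending diffs.
    writer = csv.DictWriter(buffer, fieldnames=FIELDS, lineterminator="\n")
    writer.writeheader()
    writer.writerows(ordered)
    return buffer.getvalue()
```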

Number of users as a measure

It is possible to query for the number of users by giving language:python and adding created:... filters.

Would be interesting to see how many have created Python/Ruby/etc. repositories over time.
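A sketch of how such a query URL could be built against the GitHub search API's user endpoint, whose response carries a `total_count` field. The date range in the usage below is purely illustrative. Note one assumption worth checking: for user search, `created:` refers to when the account was created, not when the user's repositories were.

```python
from urllib.parse import urlencode


def user_search_url(language: str, created_range: str) -> str:
    """Build a GitHub user-search URL for one language and date range.

    `created_range` is a GitHub search range such as
    '2015-01-01..2015-12-31' (example value only).
    """
    query = f"language:{language} created:{created_range}"
    # per_page=1 keeps the response small; only total_count is of interest.
    params = urlencode({"q": query, "per_page": 1})
    return "https://api.github.com/search/users?" + params
```

The request itself (and any OAuth token handling) is left out here; only the URL construction is shown.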

Explore total repository numbers

After scraping this data, it should become clear whether an increase in the total number of GitHub users appears to account for the changes in specific, e.g. "analysis"-related, repository numbers.
