Git Product home page Git Product logo

kaustubhgupta / blogathon-analysis Goto Github PK

View Code? Open in Web Editor NEW
8.0 2.0 2.0 1.96 MB

Analytics Vidhya Blogathon Data Analysis: Python Data Extraction with PowerBI dashboard

Home Page: https://www.analyticsvidhya.com/blog/2021/09/guide-for-data-analysis-from-data-extraction-to-dashboard/

Jupyter Notebook 100.00%
project python data-scraping pandas tqdm excel powerbi powerbi-report data-mining data-analysis

blogathon-analysis's Introduction

Analytics Vidhya Blogathon Data Analysis πŸ“ˆπŸ“‰πŸ“Š

blogathon-analysis

Blogathons are competitions that are conducted for over a month or so where, instead of coding, we need to create technical content on any relevant topic. In this Analysis, relevant data were extracted from blogathons, and then concluded some of the articles and blogathon trends, most popular and unexplored topics.

Step-by-Step Extraction and Analysis Article Link: Guide For Data Analysis: From Data Extraction to Dashboard

Final Dashboard πŸ“Š

Conclusions πŸ”₯

  • The views went from 280k in the 6th edition to 481k in the 7th edition. This happened because the team introduced a base price for all the articles published.
  • Though I don’t have the data for blogathon 8 articles categories, but, I believe that in comparison to blogathon 8, blogathon 9 had a huge surge of articles aligned towards advanced categories such as NLP. data engineering, computer vision as these categories were prized higher as compared to normal articles. As many as 108 advanced articles were published in blogathon 9.
  • In blogathon 9, a maximum of 300 blogs was– published which is the highest of all times with a total of 628k views. At the same time, the lowest views went to 13! That’s why in blogathon 10, a threshold of 500 views was set because they had to give prizes even for this many views too. It didn’t reduce the number of articles and in fact second-highest articles, 284 were posted. The shocking thing was that this edition, blogathon 10, recorded a total of 1 million views even after the threshold condition.
  • The 11th edition had a hard time with only 222k views and 123 articles. I think that’s the reason that a new category, guide, is introduced in blogathon 12.
  • May 2021 had the highest number of views of all time. peaking at 0.79M.
  • Python, Data, beginner. learning, and project are some of the most popular categories of all time.
  • Docker, R, Julia, Excel, and Deployment are some of the least explored categories.

Project Tech Stack 🏟

  • Python (Language)
  • Libraries
    • urllib.request (Making request to the website)
    • pandas (Data manipulation)
    • re (Rules for data extraction)
    • time (Handling requests)
    • numpy (Data manipulation)
    • tqdm (Progress bars for extraction process)
  • PowerBI (Data wrangling, visualization and dashboard creation)

blogathon-analysis's People

Contributors

kaustubhgupta avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.