Git Product home page Git Product logo

youtubenetworks's Introduction

Political YouTube

A visualization showing the relations and recommendation flows between political & cultural commentary on YouTube

Updates

14 Jan 2019: New channels and some updates. The network diagram looks quite different at first, but that's mostly cosmetic. The rotation and location of them are somewhat different but it clusters similarly. To see the older version matching the published article use this link

Data Collection Process

How channels are selected

Channels were included if the met the following criteria

  • 10k+ subscribers. If subscriber data is missing/lower, still include if video's average above 10k views
  • Significant focus (more than 30% of content) on US political or cultural news/commentary. I considered cultural commentary was anything from the ISideWith social issues list

There is no definitive list of YouTube channels, so a variety of techniques were used.

Over time, the ease of finding channels has diminished. I estimate the list as of 2018-12-13 is at least 2/3 of all channels that meet this criteria (in terms of views). Please email me if you have additional channel suggestions.

How channels political category was determined

For news:

For political/cultural commentary I considered all of the following:

  • Self identified political label, or support for a party
  • One sided content on divided political topics of the day (e.g. Kavanaugh, Migrant Caravan)
  • One sided content reacting to cultural events topics of the day (e.g. campus protests, trans activism )
  • Clearly matches Democrat (left) or Republican (right) views on issues in the ISideWith poll

If these considerations align in the same direction then the channel is left or right. If there was a mix then they are assigned the center/heterodox category.

Political Category FAQ

  • The classification of political category is just one persons subjective opinion, how can you trust the results when it is so subjective? There is some merit to this, you can't trust it as much as if I had manage to get all YouTubers to take a survey about their attitudes/content for example. It doesn't need to be a reason to dismiss my analysis for the following reasons:

    • I used respected sources of classification where possible (i.e. adfontesmedia.com and mediabiasfactcheck.com) which covered a large portion of the large mainstream channels.
    • The top 50 channels by video views make up 78% of all views. Download the channel data, then Go though the top 50 channels and check if you agree with the classification. If you generally accept the classifications then you can generally accept the results.
  • Why not perform a more procedural/quantifiable method for determining political category (e.g. like the process used for https://www.adfontesmedia.com)? I don't believe making this process more detailed and quantitative would give you a significantly more accurate/objective answer. .

  • Why is the apposition to Identity Politics/Social Justice considered "right" when it is not normally considered an important part of the standard political definition for left/right? I understand this, but I am confident this has changed. It is clear when evaluating YouTube content that this is a new and important divide.

  • The left/right dichotomy is not a good way to classify tribal politics, why do it that way?. I agree, and forcing it into this model creates many needless problems because it is not a natural category for this data. I was forced to use left/right because I wanted to use this data to evaluate the common narratives about YouTube radicalization which was already framed with this dichotomy.

How recommended video's are retrieved

The YouTube API is used to get all channel, video, recommendation data. For all seed channels, get a list of videos within the configured time range (Jan 1 2018 or later) For all videos, retrieve the top 10 recommended videos

Download Data

CSV

Updated 20th July 2019

Channels

Channel Relations

Daily Video Recommendations

Videos

Connect Directly to Azure Storage

Example Databricks notebook connecting to azure storage (public read access)

Visualization Information

Channel Relation Graph The "bubble" chart is a force directed graph. The area (not the radius) of each bubble corresponds to the number of views of a channels video's. The force/size of link line corresponds to the portion of recommendations between those channels.

NOTE:

  • When new data is added, the orientation and the final clustering of channels changes significantly. Unlike a principal component analysis (or similar) it does not display a quantifiable statistic by the location. The chart animates the simulated forces when opening to show this process.

Recommendation Flow Diagram The left boxes in the flow diagram show the number of views. it is broken down by split by channel (when one is selected) or political category otherwise. The right side shows the portion of those views (according to the number of times the other channels video's were in the recommended list) given to the category/channel.

Implementation

Data Analysis

A function runs each day and updates data from the YouTube API about the seed channels, their videos and recommendations.

  • Cached in cloud storage with history
  • Collected into a snapshot data as of each day into .parquet files in azure storage
  • Analysed using a databricks notebook into csv files (as listed in download data)

youtubenetworks's People

Contributors

markledwich2 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.