Git Product home page Git Product logo

word-art-from-youtube-captions's Introduction

▶️Word Cloud from YouTube captions

https://jonathanlawhh.com/works/nft/

Generating a word cloud based on captions from YouTubers video playlist.
Using WordCloud for Python by Andreas Muller!
This project may not work in the future if YouTube updated their website code.

🖥️ Setup

Install Python >3.8

$ git clone https://github.com/jonathanlawhh/word-art-from-youtube-captions.git
$ cd word-art-from-youtube-captions
$ pip3 install -r requirements.txt

📁 Directory

scrap.py
Running python scrap.py in console will start the program. Input the playlist URL accordingly and the script will run through every video in that playlist, attempt to extract the English caption, and save it in the export folder.
YouTube playlist URL should look like: https://www.youtube.com/playlist?list=XXX

create_wordcloud.py
Running python create_wordcloud.py will give you the option in order to:

  1. Select the extracted captions from scrap.py output in exports folder
  2. Select a mask for your word art. White will be the excluded area. Please refer to mask folder for sample.
  3. Generate the word art in SVG and PNG in the export folder with the dataset name.

export folder
All dataset of captions and generated word art will be kept here.

mask folder
The mask to use when generating the word art can be kept here and selected later.

Roboto-Regular.ttf
Font used in word art. Feel free to change.

word-art-from-youtube-captions's People

Contributors

jonathanlawhh avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.