Git Product home page Git Product logo

muskanverma2062 / youtube-transcript-summarizer Goto Github PK

View Code? Open in Web Editor NEW
0.0 2.0 0.0 76.11 MB

Youtube Transcript Summarizer is a Search Engine Extension that works with Youtube to extract the key points of a video and make them accessible to the user.

Python 96.69% CSS 0.22% JavaScript 0.30% HTML 0.30% Cython 1.12% C 0.97% C++ 0.21% Fortran 0.04% Makefile 0.01% MATLAB 0.01% TeX 0.10% PowerShell 0.05% Batchfile 0.01%
nlp gensim youtube-transcript-api extension

youtube-transcript-summarizer's Introduction

Youtube Transcript Summarizer

Objective: This project is a chrome/search engine extension that works with Youtube to extract the key points where it will perform NLP and respond with the summarized version of the Youtube transcript

Use case Scenario: YouTube has very large number of videos which has transcripts. Summarization would be especially helpful in the cases where videos are longer in length and different parts might have varying importance. In this sense, Summarization of the video might be useful in saving the viewer’s time. It will help in improving user productivity since they will focus only on the important text spoken in video.

🔽 Aim

By our project, we would be building functionality for summarizing those YouTube videos in which captions are added by their owner, to generate a summarized text response through various summarization techniques.

Our main goal is to save the time of the user who was wasting time on finding useful information about the topic which they are interested in and to save them from click baited videos. The client could be accessing this API from anywhere (say a Chrome extension) which will request our server, so that we would make summarizer accessible in many ways, and reducing user’s time and effort to get the text summary on the basis of their request.

🔽 External Libraries Needed

  • The summarization is done by first generating the transcript of the video for which if the video has already transcript then it is used with the help of a python library youtube-transcript-API.

  • Gensim is a Python library for topic modelling, document indexing and similarity retrieval with large corpora. Target audience is the natural language processing (NLP) and information retrieval (IR) community.

    Basic Requirements for extension:

    Extensions are small software programs that customize the browsing experience. They enable users to tailor Chrome functionality and behavior to individual preferences. They are built on web technologies such as HTML, CSS and JavaScript.

    • Create a chrome extension application directory containing essential files required as mentioned below:
    • The below diagram indicates the brief role of each of the files for building a chrome extension.

🔽 System Architecture

💻: User Interface

The image below shows the front-end of the extension of the summarizer.

💬 Future Look

  • We'll be working on this summarizer so that it can generate summary of the videos which do not have pre enabled subtitles.

✒️ References

Websites:

youtube-transcript-summarizer's People

Contributors

muskanverma2062 avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.