Light

jchu634 / subtitleproject Goto Github PK

View Code? Open in Web Editor NEW

0.0 1.0 0.0 36.92 MB

Home Page: https://www.hackster.io/jchu634/ryzen-ai-subtitling-5ead7f

Python 78.43% PowerShell 1.10% HTML 1.25% TypeScript 18.62% JavaScript 0.08% CSS 0.52%

subtitleproject's Introduction

Subtitle Project

This is a project for a windows GUI application for subtitling system audio in soft real-time using Whisper running on Ryzen AI.

Setup + Installation

Clone the repository
Build the frontend
```
.\buildFrontend.ps1
```
Install all backend dependencies Instruction in Backend README
Run the GUI
```
cd .\backend
python start_gui.py
```

Known Issues

Language Limitations:

The application is currently unable to transcribe audio in any language other than English due to model limitations.

Subtitle Transcription Accuracy:

The subtitle transcription for speaker loopback inputs is less accurate compared to direct microphone inputs.

Performance Issue After Stopping Transcription:

After clicking Stop Transcription, the application may become very slow due to ongoing backend transcription jobs. The application will return to normal speed after a short delay.
Note: The download subtitles button remains functional and works independently of the backend.

Stability Issues:

The application may crash if the user opens the settings window, selects the "Pin window to always stay on top" option, and then closes the settings window to quickly. (i.e. <2 seconds)
- This happens as the settings window has not been able to load the Settings javascript-python api bridge before it is closed.
- This issue can be mitigated by waiting at least 3 seconds after selecting the "Pin window to always stay on top" option before closing the settings window.

Acknowledgements

This application is built on top of the public domain work done at https://github.com/davabase/whisper_real_time.

subtitleproject's People

Contributors

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.