iThalay's Projects
Automatic Speech Recognition (ASR), Speaker Verification, Speech Synthesis, Text-to-Speech (TTS), Language Modelling, Singing Voice Synthesis (SVS), Voice Conversion (VC)
Buzz transcribes and translates audio offline on your personal computer. Powered by OpenAI's Whisper.
๐ฎ Word by word audio subtitle synchronisation tool and API. Developed under GSoC 2017 with CCExtractor.
GitHubโs official command line tool
Cloudflareโs documentation
The content behind MDN Web Docs
Ready-to-use OCR with 80+ supported languages and all popular writing scripts including Latin, Chinese, Arabic, Devanagari, Cyrillic and etc.
Browser extension for learning languages with watching movies and TV shows
Sign with Facebook and you can watch your all picture and can download Zip file
Download your facebook's albums on your computer with python and Facebook API
Python library and CLI tool to interface with Google Translate's text-to-speech API
A Node.js style checker and lint tool for Markdown/CommonMark files.
Mouseover Translate Any Language At Once - Chrome Extension: PDF Translator, EBOOK, EPUB, OCR, TTS, YOUTUBE DUAL SUBTITLES, GOOGLE DOCS, AI, VIEWER, GMAIL, WRITING, IMAGE, DUAL SUBS, MANGA, HOVER, DICTIONARY, WEBTOON, EDGE, JAPANESE, ENGLISH
An action to automatically extract keywords from images in issue bodies, making them searchable ๐
GitHub on steroids
TLS/SSL and crypto library
Pipe based dataframe manipulation library that can also transform data on SQL databases
A lightweight, dependency-free Python library (and command-line utility) for downloading YouTube Videos.
A single header simple, powerful and full blown srt subtitle parser written in C++.
MacOS system extension that allows applications to pass audio to other applications.
Speech to Text (Voice Recognition)
A wrapper to work with Tesseract OCR inside PHP.
Pure Javascript OCR for more than 100 Languages ๐๐๐ฅ
๐ธ๐ฌ - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Standalone plugin for VLC 2.x to support decoding of HEVC/H.265 using libde265.
Robust Speech Recognition via Large-Scale Weak Supervision
Port of OpenAI's Whisper model in C/C++