kunaldhawan Goto Github PK
Name: Kunal Dhawan
Type: User
Company: Carnegie Mellon University
Bio: Research Scientist @ NVIDIA; NLP | Conversational AI | LLMs | Multimodal ML
Twitter: KunalDhawan105
Location: San Jose, California
Name: Kunal Dhawan
Type: User
Company: Carnegie Mellon University
Bio: Research Scientist @ NVIDIA; NLP | Conversational AI | LLMs | Multimodal ML
Twitter: KunalDhawan105
Location: San Jose, California
The repository contains all the codes necessary for my project - Automatic Speech Recognition System in Hindi Language ( Project description is available at :- https://kunal-dhawan.weebly.com/asr-system-for-hindi-language-from-scratch.html) : It contains the code for the following systems - 1) Monophone-HMM system built using HTK toolkit , 2)Monophone-HMM system built using Kaldi toolkit, 3)Triphone-HMM system built using Kaldi toolkit and 4)DNN-HMM system built using Kaldi toolkit
COVID-19 detection from cough signal
The aim of this repository is to consolidate materials for all essential tools that are used in the lifecycle of any (python) project. This includes links to amazing documentations/tutorials for each of the tools and also a dummy self project to demonstrate their use.
A simple baseline for one-shot multi-object tracking
The Kick Ball Theory- Autonomous Football Playing Bot
A Digital Signal Processing course project aimed at building a system for effective musical instrument detection given a sound recording
NeMo: a toolkit for conversational AI
A toolkit for processing speech data and creating speech datasets
An i-vector based Non-Negative Matrix Factorization approach towards noise robust Automatic Speech Recognition
Developed a bot that could accurately follow a human in a given closed enclosure. Concepts used were: Image processing: to detect the human using a characteristic colour and to calculate the change in position of the subject and hence correspondingly calculate the next movement of the robot , PID (Proportional, Integral, Derivative): to minimize the error during motion of the bot and to ensure smooth motion even in case of sudden changes in subject’s location , ROS : to communicate between the python script which implemented image processing and the microcontroller(Arduino) which was responsible for bot motion .
Word embedding based on Phonetic features of the language (similar sounding vectors)
Phototron-Autonomous Ball Catching Bot
PyTorch image models, scripts, pretrained weights -- (SE)ResNet/ResNeXT, DPN, EfficientNet, MixNet, MobileNet-V3/V2, MNASNet, Single-Path NAS, FBNet, and more
website for autocompleting scene graphs for generating natural looking images
Built with the aim of providing freedom of movement to any individual irrespective of any disability. The speed of the wheelchair is controlled by the normal force exerted by the person against it’s backrest which is measured by a force sensor placed there and the direction is controlled by angular motion of user’s head with respect to vertical measured by a gyrometer placed on a cap worn by the user.
Pytorch implementation of Deepmind's WaveRNN model
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.