lilianemomeni / kws-net Goto Github PK
View Code? Open in Web Editor NEWSeeing Wake Words: Audio-visual Keyword Spotting
License: MIT License
Seeing Wake Words: Audio-visual Keyword Spotting
License: MIT License
Thanks for this amazing work!
May I know what is your GPU environment and how long it takes for training?
The implementation for audio-specific training and audio-visual combined training seems to be missing in this repository. Could you please point me to where I can access the same and reproduce the experiments from the paper.
Cheers!
Thanks for this amazing work!
The data pre-processing stages are a bit confusing for me.
Could you please kindly share the processed features? So I am avoid the results difference from the feature part.
First I have to say this is truly amazing work, I can see use cases beyond what you present.
I know you have a very extensive README, but I was wondering if you could make it a bit simpler to run, specifically, have a Google Colab notebook that:
apt
and pip
packagesSuch a notebook would be ideal for quick experimentations with your models, and for me, for example, allow testing additional languages in which I'm interested (besides English, French, and German).
Hi, thx for sharing this amazing work!
Actually, after finding that the directory does not contain audio-only KWS pre-trained model or model class, I've been trying to train audio-only KWS model for my own.
I extracted mel-spectrogram, and I'm wondering if you did any kind of normalization, and I would be much appreciated if you share what type of normalization or rough range of input mel-spectrogram data.
Hope you have a great day, bye!
A declarative, efficient, and flexible JavaScript library for building user interfaces.
๐ Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. ๐๐๐
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google โค๏ธ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.