
kws-net's People

Contributors

lilianemomeni

kws-net's Issues

Audio only, Audio-Visual implementation availability

The implementation of audio-only training and of combined audio-visual training seems to be missing from this repository. Could you please point me to where I can access it, so that I can reproduce the experiments from the paper?

Cheers!

The processed features

Thanks for this amazing work!

The data pre-processing stages are a bit confusing to me.
Could you please share the processed features? That way I can avoid differences in results caused by the feature extraction step.

Colab for easy replication

First, I have to say this is truly amazing work. I can see use cases beyond what you present.

I know you have a very extensive README, but I was wondering if you could make it a bit simpler to run. Specifically, could you provide a Google Colab notebook that:

  1. installs the necessary apt and pip packages
  2. downloads the pre-trained models
  3. allows inputting a video path/url and a list of words to spot
  4. runs the demo and spots these words

Such a notebook would be ideal for quick experimentation with your models. For me, for example, it would allow testing additional languages I'm interested in (besides English, French, and German).
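As a rough illustration of step 3 above, the input cell of such a notebook could validate what the user typed before anything is passed to the demo. This is only a sketch under assumptions: `parse_spotting_request` and the returned dictionary layout are hypothetical names made up for this example, not part of the kws-net code, and the actual demo entry point may expect something different.

```python
from urllib.parse import urlparse


def parse_spotting_request(video: str, words: str) -> dict:
    """Hypothetical helper: tidy up notebook inputs before running a demo.

    video: a local file path or an http(s) URL.
    words: a comma-separated string of keywords to spot.
    """
    # Treat the input as a URL only if it has an http(s) scheme;
    # everything else is assumed to be a local path.
    scheme = urlparse(video).scheme
    source = "url" if scheme in ("http", "https") else "path"

    # Split on commas, trim whitespace, lowercase, and drop empty or
    # repeated entries while preserving the order the user gave.
    keywords = []
    for word in words.split(","):
        word = word.strip().lower()
        if word and word not in keywords:
            keywords.append(word)

    if not keywords:
        raise ValueError("at least one keyword is required")

    return {"video": video, "source": source, "keywords": keywords}


request = parse_spotting_request("https://example.com/clip.mp4", "Hello, world ,hello")
print(request["source"], request["keywords"])
```

The example URL is a placeholder. Downloading the video, installing packages, and fetching the pre-trained models (steps 1, 2, and 4) depend on the repository's own instructions and are not sketched here.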

Audio only model preprocessing

Hi, thanks for sharing this amazing work!

After finding that the repository does not contain a pre-trained audio-only KWS model or the corresponding model class, I have been trying to train an audio-only KWS model on my own.

I extracted mel-spectrograms, and I am wondering whether you applied any kind of normalization. I would much appreciate it if you could share the type of normalization you used, or the rough value range of the input mel-spectrogram data.

Hope you have a great day, bye!
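For readers with the same question, one widely used option is log compression followed by per-bin mean and variance normalization over time. The sketch below shows that option only; the thread does not state what normalization the authors used, so this is an assumption and not their method. The function name `normalize_log_mel`, the `(n_mels, n_frames)` layout, and the epsilon values are choices made for this example.

```python
import numpy as np


def normalize_log_mel(mel_power: np.ndarray, eps: float = 1e-10) -> np.ndarray:
    """Log-compress a mel power spectrogram and standardize each mel bin.

    mel_power: non-negative array of shape (n_mels, n_frames).
    Assumed scheme for illustration, not the kws-net authors' pipeline.
    """
    # Log compression; eps avoids log(0) on silent frames.
    log_mel = np.log(mel_power + eps)

    # Per-bin statistics computed over the time axis of this utterance.
    mean = log_mel.mean(axis=1, keepdims=True)
    std = log_mel.std(axis=1, keepdims=True)

    # Small constant guards against division by zero for constant bins.
    return (log_mel - mean) / (std + 1e-8)


# Synthetic input standing in for a real mel spectrogram.
rng = np.random.default_rng(0)
mel = rng.random((40, 100)) + 0.1
features = normalize_log_mel(mel)
print(features.shape)
```

After this step each mel bin has roughly zero mean and unit variance within the utterance. Dataset-level statistics or a fixed decibel range are equally common alternatives, so the values should be checked against whatever the model was actually trained on.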
