Git Product home page Git Product logo

Comments (3)

hbredin avatar hbredin commented on September 23, 2024

The best performance I got so far on VoxCeleb is with ClopiNet architecture on top of MFCC features.
A tutorial and pre-trained model is available here

The issue with this model is that it relies on MFCC which are slow to compute: one cannot really precompute them because random noise is added on-the-fly to the audio file as a data augmentation step. Therefore, I have been trying recently to replace the MFCC features by trained SincNet features (computed from the waveform directly) but to no avail so far.

Yet, I'd rather have you work directly from the waveform (looks cleaner).
So, unless I manage to switch ClopiNet to waveform quickly, I suggest you use the architecture provided in the SincNet repo.

from similaritylearning.

juanmc2005 avatar juanmc2005 commented on September 23, 2024

@hbredin Got it. A priori I'll work on integrating SincNet

from similaritylearning.

juanmc2005 avatar juanmc2005 commented on September 23, 2024

SincNet is working using our custom loss modules, like arc margin or congenerous cosine (CoCo).
A better implementation will probably come once VoxCeleb is integrated.

from similaritylearning.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.