Git Product home page Git Product logo

pooya-mohammadi / audio-classification-pytorch Goto Github PK

View Code? Open in Web Editor NEW
36.0 1.0 4.0 830 KB

In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.

Python 21.55% Jupyter Notebook 78.45%
audio-classification deep-learning deep-utils python pytorch lstm transformers wav2vec2

audio-classification-pytorch's Introduction

Hi there, I'm Pooya Mohammadi ๐Ÿ‘‹

I'm an AI Researcher & Developer

  • ๐Ÿ”ญ I love research topics in Deep Learning, Medical science, and Deep Reinforcement Learning
  • ๐ŸŒฑ I'm currently learning about GNN, GAN, and Diffusion models
  • ๐Ÿ”จ Iโ€™m currently working on chatbots, voice assisstance systems, reinforcment learning models for optimization and also making youtube videos about python and AI.
  • ๐Ÿ‘ฏ I'm looking for a PhD position in Medical Science fields.
  • ๐Ÿ“ซ How to reach me:
  • Follow me on Youtube: www.youtube.com/PooyaMohammadiKazaj

Keywords: Speech, Deep Learning, CV, AI, NLP, Reinforcement Learning

Pooya Mohammadi's GitHub stats

Pooya Mohammadi's GitHub stats

Projects:

Deep Utils

This repository contains the most frequently used deep learning models and functions. In mainly constists utilities for:
1. vision:
  1. yolov5
  2. mtcnn
  3. ultralight
  4. deepsort
  5. torchvision-image-classification
2. nlp
  1. ner
    1. ner-stratify
    2. ner-augmentation -> replacement and removal augmentation
3. utils
  1. box-utils 
  2. point-utils
4. augmentation
  1. Cutmix-classification
  2. Cutmix-segmentation
...
Deep_Utils is still under heavy development...

Yolov5-gradcam

Visualizing Yolov5's layers using GradCam 

audio-classification-pytorch's People

Contributors

pooya-mohammadi avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

audio-classification-pytorch's Issues

IndexError: index -1 is out of bounds for dimension 1 with size 0

Could you please tell me how I should modify the training file when I replace the binary classification task with a multi-classification task? Thank you for your open source.

python train.py
Sanity Checking DataLoader 0: 0%| | 0/2 [00:00<?, ?it/s]Traceback (most recent call last):
File "/sty/audio/train.py", line 147, in
main()
File "/sty/audio/train.py", line 135, in main
trainer.fit(model=lit_model, train_dataloaders=train_loader, val_dataloaders=val_loader)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 608, in fit
call._call_and_handle_interrupt(
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/call.py", line 38, in _call_and_handle_interrupt
return trainer_fn(*args, **kwargs)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 650, in _fit_impl
self._run(model, ckpt_path=self.ckpt_path)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 1103, in _run
results = self._run_stage()
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 1182, in _run_stage
self._run_train()
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 1195, in _run_train
self._run_sanity_check()
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 1267, in _run_sanity_check
val_loop.run()
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/loops/loop.py", line 199, in run
self.advance(*args, **kwargs)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/loops/dataloader/evaluation_loop.py", line 152, in advance
dl_outputs = self.epoch_loop.run(self._data_fetcher, dl_max_batches, kwargs)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/loops/loop.py", line 199, in run
self.advance(*args, **kwargs)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/loops/epoch/evaluation_epoch_loop.py", line 137, in advance
output = self._evaluation_step(**kwargs)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/loops/epoch/evaluation_epoch_loop.py", line 234, in _evaluation_step
output = self.trainer._call_strategy_hook(hook_name, *kwargs.values())
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/trainer/trainer.py", line 1485, in _call_strategy_hook
output = fn(*args, **kwargs)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/pytorch_lightning/strategies/strategy.py", line 390, in validation_step
return self.model.validation_step(*args, **kwargs)
File "/sty/audio/train.py", line 62, in validation_step
return self.get_step_metrics(batch)
File "/sty/audio/train.py", line 36, in get_step_metrics
logits = self.model(mfcc)
File "/root/anaconda3/envs/audio/lib/python3.10/site-packages/torch/nn/modules/module.py", line 1194, in _call_impl
return forward_call(*input, **kwargs)
File "/sty/audio/model/transformer_cls.py", line 37, in forward
final_feature_map = encoded_features[:, -1, :]
IndexError: index -1 is out of bounds for dimension 1 with size 0

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.