Light

Banala Saritha photo

banalasaritha Goto Github PK

followers: 2.0 following: 24.0 repos: 105.0 gists: 0.0

Name: Banala Saritha

Type: User

Company: India

Bio: Speaker Recognition and Identification, Meta-learning, Few Shot Learning & Speech Processing, Speech-activity-detection , T-F Representations.

Location: National Institute of Technology

Banala Saritha's Projects

3dcnn

3D convolutional neural network for video classification

afrnn

aishell-2

kaldi-asr/kaldi is the official location of the Kaldi project.

as-pvad

AS-pVAD: A Real-time Personalized Voice Activity Detection Network With Attentive Score Loss

attention_ocr.pytorch

This repository implements the the encoder and decoder model with attention model for OCR

audio-classification-models

Audio classification is a popular topic, here I implement several models using TenserFlow and Keras.

audio-spectrogram-transformer

Code for the Interspeech 2021 paper "AST: Audio Spectrogram Transformer".

audio_visual_speech_enhancement

Face Landmark-based Speaker-Independent Audio-Visual Speech Enhancement in Multi-Talker Environments

audioset-for-meta-learning

Meta-Learning for Few Shot Learning

audiosignalprocessingforml

Code and slides of my YouTube series called "Audio Signal Proessing for Machine Learning"

augself-few-shot

best-notes-fewshot-learning

bi-lstm-crf-tensorflow

Bidirectional LSTM + CRF (Conditional Random Fields) in Tensorflow

build-an-avatar-with-asr-tts-transformer-omniverse-audio2face

cacrn-net

Channel Attention Convolutional Recurrent Neural Network for Few-Shot Speaker Identification

channel-attention-fullsubnet-with-complex-spectrograms-for-speech-enhancement-

The official PyTorch implementation of "FullSubNet+: Channel Attention FullSubNet with Complex Spectrograms for Speech Enhancement".

channel-attention-resnet

A modified ResNet with channel attention mechanism

chinese-speaker-identification

End-to-End Chinese Speaker Identification

clusteringdirectioncentrality

A novel Clustering algorithm by measuring Direction Centrality (CDC) locally. It adopts a density-independent metric based on the distribution of K-nearest neighbors (KNNs) to distinguish between internal and boundary points. The boundary points generate enclosed cages to bind the connections of internal points.

cnn-rnn-dilated-convolution

protein-sequence-based drug discovery; dilated convolutions, recurrent interpretation to time-distributed output

convolutional-rnn

TensorFlow code for the paper Convolutional RNN: an Enhanced Model for Extracting Features from Sequential Data (https://arxiv.org/abs/1602.05875)

coughvid-19-crnn-attention

Another project for classifying Covid and non-covid patients through cough sound. Using CRNN-Attention model with the sound data converted into image data

crnn

CRNN(Convolutional Recurrent Neural Network), with optional STN(Spatial Transformer Network), in Tensorflow, multi-gpu supported.

crnn-imp-with-keywordspotting

Keyword spotting on Arm Cortex-M Microcontrollers

crnn-ocr-lite

Lightweight CRNN for OCR (including handwritten text) with depthwise separable convolutions and spatial transformer module [keras+tf]

crnn_tensorflow

Convolutional Recurrent Neural Networks(CRNN) for Scene Text Recognition

deepkws

deeplearningforaudiowithpython

Code and slides for the "Deep Learning (For Audio) With Python" course on TheSoundOfAI Youtube channel.

diffres-python

Learning differentiable temporal resolution on time-series data.

dnd-sed

Sound event detection with depthwise separable and dilated convolutions.

1
2
3
4

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.