Git Product home page Git Product logo

algorithm's Introduction

🛠 Tech Stack 🛠

Techs that I've used at least once

             
           



💜 Career

Type Date Contents Org.
Education 2014. 3 ~ 2021.2 Department of Computer Engineering Ajou University
🏢Employment 2021. 6 ~ Backend Developer Sejong Telecom

💡 Current Interests

  • Nodejs
  • RTP Protocol
  • A new application using an existing model or task
  • Asterisk and FreePBX
  • SIP Stack
  • Machine Learning
  • Digital Signal Processing
  • linear algebra, calculus, Discrete mathematics

Anurag's GitHub stats

 


algorithm's People

Contributors

nusgnojkrap avatar

Watchers

 avatar

Forkers

libtv

algorithm's Issues

Machine Learning - STT algorithm : MFCC

The preparation process for MFCC.

  1. Preemphasis : y(t)=x(t)-αx(t-1)
    The preprocessing process that boosts the energy of high frequency components.

By reinforcing relatively low-energy high-frequency components, the raw voice signal has a relatively even energy distribution across the entire frequency range.

Prevention of normative problems that may occur during Fourier transform.

Signal-to-noise ratio (SNR) improvement.

  1. Framing
    The process of splitting the voice signal into small pieces in a short time.

Cut the frame into a certain time unit (frame_size=25 ms) but overlap the frame_stride=10 ms.

  1. Windowing
    It is a technique of smoothing boundaries by applying a specific function to each frame.
    w[n]=0.54-0.46cos(2πn/(N-1))

  2. Filter Banks : Mel Scale
    In data (pow_frames) with energy information for each frequency domain, the low frequency domain can be viewed in relatively detail compared to the high frequency domain.

  3. Log-Mel Spectrum : LOG-FILTER BANKS
    We are interested in extracting characteristics that are important for human speech recognition, so we perform log transformation on the mel spectrum.
    filter_banks = 20 * np.log10(filter_banks)

  4. MFCCs

Error analysis and handling in newSTT

The efficiency of the DFT algorithm is very low at n^2.
To improve this, we tried using the FFT algorithm.
The algorithm can achieve the efficiency of nlogn.

While implementing the above FFT algorithm, signal data sometimes flies away from the recursive part, and as a result, data is not accumulated in the array, so we are trying to figure it out.

locate : algorithm\machineLearning\newSTT/jongfft.js
function fft

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.