Git Product home page Git Product logo

fpts2l's Introduction

FPT S2L

Welcome to FPTs2l - the FPT Sign Language Translation app, a cutting-edge solution that draws inspiration from the research outlined in my paper Exploring MediaPipe Optimization Strategies for Real-Time Sign Language Recognition. This app is meticulously designed to bridge the communication gap for individuals with hearing impairments. By harnessing insights from your paper, FPTs2l empowers users with an innovative approach that facilitates seamless interaction and understanding through sign language translation.

Table of Content

Overview

The FPTs2l application is designed to provide enhanced communication support for individuals with hearing impairments. The core functionality of the application involves real-time sign language translation utilizing MediaPipe and LSTM network technology. Here's a high-level overview of how the application works.

Operational Principles

  1. Pose Estimation with MediaPipe: The application utilizes the power of Google's MediaPipe library to detect and track the skeletal structure of a single individual captured by the device's camera. This real-time pose estimation enables the identification of hand gestures and movements essential for sign language.

  2. Coordinate Input to LSTM Network: The coordinates of the detected skeletal keypoints are extracted from the MediaPipe output. These coordinates are then fed into an LSTM (Long Short-Term Memory) neural network. LSTM networks are particularly suited for sequence-to-sequence tasks like capturing the dynamics of sign language gestures over time.

  3. Words Recognition: The LSTM network processes the sequence of skeletal coordinates and attempts to recognize the sign language words being expressed by the user. The network has been trained on a dataset of sign language gestures to learn the mapping between input sequences and their corresponding words or phrases.

  4. Text Output: The recognized sign language words are then translated into text, which is displayed on the user interface. This enables not only the person using the application but also those interacting with them to understand the conveyed message.

The FPTs2l application aims to empower individuals with hearing impairments by providing them with an efficient tool to communicate through gestures and receive real-time translations. By leveraging the capabilities of both MediaPipe and LSTM networks, the application bridges the communication gap and fosters inclusivity in various situations.

Deploying the Product

Welcome to FPTs2l, an innovative application designed to facilitate seamless communication for individuals with hearing impairments. FPTs2l empowers users to bridge the communication gap by converting sign language gestures into comprehensible text, fostering enhanced understanding and interaction.

User Interface

The user interface of FPTs2l has been carefully crafted to provide an intuitive and user-friendly experience:

  • Feature: Sign to Words

The heart of the application lies in the "Sign to Words" feature. Users can use their device's camera to capture sign language gestures, and FPTs2l will process these gestures using a sophisticated LSTM network. The application then displays the recognized text, making it possible for both users and their counterparts to comprehend the conveyed message.

Configuration

To run the FPTs2l application, follow the steps below:

  1. Install Dependencies: Make sure you have the required dependencies installed in your environment. You can install them by running the following command:

    conda env create -f environment.yml
  2. Download the Models: Download the models folder from the following link and move to ./gui/

  3. Run the Application: Once the dependencies are installed, you can run the application using the following command:

    python main.py

fpts2l's People

Contributors

zeres-engel avatar

Stargazers

 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.