hieunguyen1053

followers: 26 following: 21 repos: 93 gists: 1

Name: Hieu Nguyen

Type: User

Company: @iconclub

Bio: From Vietnam with love

Location: 19 Nguyen Huu Tho, Ho Chi Minh City

Nguyen Trong Hieu (@hieunguyen1053) - AI Engineer

Email: [email protected]

LinkedIn: https://www.linkedin.com/in/hieunguyen1053

Huggingface: https://huggingface.co/hieunguyen1053

SUMMARY

I’m an AI Engineer with 2 years of experience in Natural Language Processing. I enjoy applying my NLP knowledge to everyday problems and creating the best experience for users. I also spend time reading scientific papers and learning new technologies and best practices to become a better engineer.

TECHNICAL SKILLS

Programming Languages: Python, Java, C++

Frameworks/Libraries:

PyTorch, TensorFlow
Transformers, Optimum
ONNX Runtime, TensorRT

Fields of research:

Machine Learning
Natural Language Processing
Computer Vision

PROFESSIONAL EXPERIENCE

Ademax (Vietnam) - AI Engineer (2021 - Now)

Project: Ademax OCR (5 members, 1 front-end, 3 back-end, 1 ML)

LINK DEMO

VIDEO DEMO

Description: A service that companies use to convert scanned documents into structured text documents, with support for metadata extraction.
Technologies:
- Front-end: VueJS
- Back-end: Django, Redis, PostgreSQL, Elasticsearch
- Machine learning: PyTorch, Transformers, Accelerate, Optimum
Responsibilities:
- Generate training data, train models on multiple GPUs (4 x Nvidia V100), and evaluate them against previous models.
- Optimize and quantize models so that inference runs faster on CPU or GPU.
- Write an API that executes multiple tasks concurrently and uses queues to avoid overloading.
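
The last responsibility above (concurrent tasks behind a queue) can be illustrated with a minimal sketch. This is not the project's actual code: it assumes Python's standard `asyncio`, and the names `handle_task`, `worker`, `run_all`, `num_workers`, and `max_pending` are hypothetical. A bounded queue makes producers wait when too many tasks are pending, which is one common way to avoid overloading.

```python
import asyncio


async def handle_task(task_id: int) -> str:
    # Hypothetical stand-in for real work, such as processing one document.
    await asyncio.sleep(0.01)
    return f"task-{task_id}-done"


async def worker(queue: asyncio.Queue, results: dict) -> None:
    # Each worker pulls tasks from the shared queue until it is cancelled.
    while True:
        task_id = await queue.get()
        try:
            results[task_id] = await handle_task(task_id)
        finally:
            queue.task_done()


async def run_all(task_ids, num_workers: int = 3, max_pending: int = 5) -> dict:
    # maxsize bounds the number of waiting tasks; put() blocks when it is full.
    queue = asyncio.Queue(maxsize=max_pending)
    results = {}
    workers = [
        asyncio.create_task(worker(queue, results)) for _ in range(num_workers)
    ]
    for task_id in task_ids:
        await queue.put(task_id)
    await queue.join()  # wait until every queued task has been processed
    for w in workers:
        w.cancel()
    await asyncio.gather(*workers, return_exceptions=True)
    return results


if __name__ == "__main__":
    print(asyncio.run(run_all(range(10))))
```

In this sketch `num_workers` caps how many tasks run at once and `max_pending` caps how many may wait; both values are arbitrary here.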

Project: Ademax Spelling (5 members, 1 front-end, 3 back-end, 1 ML)

VIDEO DEMO

Description: A service that checks the spelling of Vietnamese text. It is also integrated into an Office Word add-in and serves a wide range of customers, such as businesses, students, and ordinary users.
Technologies:
- Front-end: VueJS, AngularJS
- Back-end: Django, Redis, PostgreSQL, Elasticsearch
- Machine learning: PyTorch, Transformers, Accelerate, Optimum
Responsibilities:
- Generate training data, train models on multiple GPUs (8 x Nvidia A100), and evaluate them against previous models.
- Optimize and quantize models so that inference runs faster on CPU or GPU.
- Write an API that spell-checks Vietnamese text, classifies many kinds of typos, and suggests corrections. It supports executing multiple tasks concurrently and uses queues to avoid overloading.
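
To give a rough idea of what a "flag a word and suggest corrections" response might look like, here is a toy sketch. It is an assumption for illustration only: the service described above is built on Transformer models, whereas this example swaps in simple string similarity from Python's standard `difflib`. The `VOCABULARY` list, the `check_text` function, and the report fields are all hypothetical.

```python
import difflib

# Tiny hypothetical vocabulary; a real service would rely on a trained model.
VOCABULARY = ["document", "service", "spelling", "student", "business"]


def check_text(text: str, vocabulary=VOCABULARY, cutoff: float = 0.7) -> list:
    """Return one report per word that is not found in the vocabulary."""
    reports = []
    for position, word in enumerate(text.lower().split()):
        if word in vocabulary:
            continue
        # Rank vocabulary entries by similarity to the unknown word.
        suggestions = difflib.get_close_matches(
            word, vocabulary, n=3, cutoff=cutoff
        )
        reports.append(
            {"position": position, "word": word, "suggestions": suggestions}
        )
    return reports


if __name__ == "__main__":
    print(check_text("spelling servce"))
```

Each report carries the word's position, the word itself, and a ranked list of candidate corrections.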

EDUCATION

Ton Duc Thang University (Vietnam) 08/2018 - now

Studying Computer Science
GPA: 8.2
Member of the ICON Academic Club (@iconclub)

Hieu Nguyen's Projects

anime-girls-holding-programming-books

Anime Girls Holding Programming Books

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

bertsearch

Elasticsearch with BERT for advanced document search.

binance-trade-bot

Automated cryptocurrency trading bot

bloomz.cpp

C++ implementation for BLOOM

chat-ui

Open source codebase powering the HuggingChat app

chatbot-dialog-dataset

Dialogs for training or setting up a chatbot

clipper.js

HTML to Markdown converter and crawler.

codebert

CodeBERT

colossalai

Making big AI models cheaper, easier, and more scalable

corpus.viwiki

Vietnamese Wikipedia Corpus

cosmopedia

crawler_instagram

Scrapes images from Instagram

cryptopp-example

A few examples that use the Crypto++ library for hash functions, block ciphers, and public-key signature schemes.

d2l-vn

An interactive deep learning book with source code, math, and discussions. Covers many popular frameworks (TensorFlow, PyTorch & MXNet) and is used at 175 universities.

danes

DANeS is an open-source e-newspaper dataset created through a collaboration between DATASET JSC (dataset.vn) and AIV Group (aivgroup.vn).

detectron2

Detectron2 for Document Layout Analysis

dify

An Open-Source Assistants API and GPTs alternative. Dify.AI is an LLM application development platform. It integrates the concepts of Backend as a Service and LLMOps, covering the core tech stack required for building generative AI-native applications, including a built-in RAG engine.