Git Product home page Git Product logo

About

I am a PhD student in the SMU Computer Vision and Machine Learning (CVML) Lab since Aug 2023, greatly honored to be advised by Prof SUN Qianru. Prior to this, I worked as Research Associate in NTU S-Lab (2021-2023), supervised by esteemed Prof LU Shijian. Before that, I have been research intern at Tsinghua University BNSCT Lab (2018-2019), directly advised by Prof LIU Bin. I obtained my graduate and undergraduate degrees from NTU and BUPT, respectively.

Interests

  • Computer vision: Text recognition & detection, image generation, unsupervised learning.
  • Telecommunications: Firewall circumvention, trustable network, switching architecture.
  • Art: Logo design, industrial design, Chinese literature.
  • Music: Techno & trap (Magnusthemagnus, Rezz).

Research

I am working in image generation for Beyond Visible Vision (i.e., vision out of visible spectrums, like radar and medical imagery). Before that, I have been working on unsupervised domain adaptation and unsupervised pre-training in text detection/recognition.

  • Image generation for beyond visible vision [2023]
  • Self-supervised learning in hand-writing/scene text recognition [2022 - 2023]
  • Domain adaptation in scene text detection [2021 - 2022]

Pinned Projects

  • 🕹 OpenAI-Proxy: OpenAI GPT3/GPT4 api access distribution/control/tracking tool.

  • 🪪 LaTexCV: Academic C.V. template in Latex with extensive block examples.

  • 🛣 VPGNet-PyTorch: Pytorch implementation of VPG Net (Caffe) for lane and road mark detection.

  • 📝 NTU EEE Dissertation: Latex template for Nanyang Technological University M.Sc dissertation report.

  • 🩻 SLC: Medical image segmentation & classification framework based on Keras, convenient experiment tracking.

  • 🗣 A2SRT: Speech recognition tool for subtitle extracting. Good support for Indian and Singaporean English.

Contact

Research discussion via doem1997 {at} gmail.com

{+7}zfxfB.%RWQ;6.:c0f?k}'s Projects

audio_to_srt icon audio_to_srt

Generate .srt subtitle file automatically from audio file or video file. Features in Indian accent or Singalish recognition.

case_p icon case_p

CASE plus: A network measurement structure, and its C++ simulation framework

classbench_analyzer icon classbench_analyzer

This is the analysis module based on Python for analyze the classbench tool, which is a widely-used tool in generating virtual network routing table.

cp3 icon cp3

Classroom Presenter v3

detic icon detic

Code release for "Detecting Twenty-thousand Classes using Image-level Supervision".

detr icon detr

End-to-End Object Detection with Transformers

diffusers icon diffusers

🤗 Diffusers: State-of-the-art diffusion models for image and audio generation in PyTorch

distribrain icon distribrain

DistriBrain is a non-profit project for truly OPEN AI. By creating a decentralized AI system, we aim to ensure every human to own AI equally, trustworthy, and securely.

east icon east

PyTorch Re-Implementation of EAST: An Efficient and Accurate Scene Text Detector

ee6129-asn-02 icon ee6129-asn-02

NTU EE6129 Assignment 02: MIMO contribution to LTE and 5G wireless network

ee6223 icon ee6223

EE6223 Assignment 02: Enterprise Network Design

fake115upload icon fake115upload

模拟115客户端的上传功能,支持极速秒传和本地文件上传及文件批量导入和导出。

iscript icon iscript

各种脚本 -- 关于 虾米 xiami.com, 百度网盘 pan.baidu.com, 115网盘 115.com, 网易音乐 music.163.com, 百度音乐 music.baidu.com, 360网盘/云盘 yunpan.cn, 视频解析 flvxz.com, bt torrent ↔ magnet, ed2k 搜索, tumblr 图片下载, unzip

ivu-segment icon ivu-segment

Improved VGG-Unet model, which does segmentation and classification task of skin-lesion images.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.