Git Product home page Git Product logo

commonsense-modeling's Introduction

Hi there ๐Ÿ‘‹

I am an NLP algorithm engineer graduated from Xiamen University with bachelor degree (2015-2019) and Tianjin University with master degree (2019-2022).

My research interests include:

  • ๐Ÿ”ญ clustering analysis (fuzzy clustering theory and linguistic clustering)
  • ๐ŸŒฑ machine translation (text-only and multimodal machine translation)
  • ๐Ÿ‘ฏ multimodal learning (pretraining technology and reasoning)
  • ๐ŸŒฑ large language modeling (infra, multilingual pretrain and efficient universal sft)

I am passionate about specializing in algorithms and fit them into practical applications.

Experiences

  • ๐Ÿ“ซ 2023-09 - now : working on Foundational LLM Team, Alibaba Inc., towards the universal intelligence of LLM, especially on dialogue and searching.
  • ๐Ÿ“ซ 2022-04 - 2023-09: worked on ByteDance AI Lab in the fields of multimodal/multilingual machine translation and multilingual LLM.
  • ๐Ÿ“ซ 2021-07 - 2021-11: conducted research on semi-parametric MT as a NLP Research intern on Alibaba Damo Academy (One conference paper published).
  • ๐Ÿ“ซ 2020-11 - 2021-02: participated in early NLP Migration Project on HUAWEI Ascend, our work was reported as a markable practice [wiki].
  • ๐Ÿ“ซ 2020-05 - 2020-11: conducted research on translation quality estimation in corporation with OPPO Research (One paper under review).
  • ๐Ÿ“ซ 2020-04 - 2020-09: conducted research on vison & language multimodal machine translation (One conference paper published).
  • ๐Ÿค” 2019-09 - 2020-05: joined in TJUNLP lab and conducted research on vision & language commensense reasoning, finally stopped for the lack of computational resources.
  • ๐Ÿ‘ฏ 2018-03 - 2019-09: joined in Optimization Machine Learning Team and studied Fuzzy Clustering Theory (major) and Mainfold Learning (secondary) (One journal paper published and another two journal papers collaborated).
  • ๐Ÿ‘ฏ 2016-11 - 2018-09: joined the Drone Team in charge of the compute vision algorithm, won the second place in International Aerial Robotics Competition.

Representative Publications [google scholar]

  • Efficient Cluster-Based k-Nearest-Neighbor Machine Translation. ACL. 2022.
  • AdaST: Dynamically Adapting Encoder States in the Decoder for End-to-End Speech-to-Text Translation. ACL Findings. 2021.
  • Efficient Object-Level Visual Context Modeling for Multimodal Machine Translation: Masking Irrelevant Objects Helps Grounding. AAAI. 2021.
  • A Novel Fuzzy c-Means Clustering Algorithm Using Adaptive Norm. International Journal of Fuzzy Sytems. 2019.

GitHub Stats

commonsense-modeling's People

Contributors

probe2 avatar wonderseen avatar

Stargazers

 avatar

Watchers

 avatar

Forkers

probe2

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.