erenup's Projects
An open-source NLP research library, built on PyTorch.
Tool for visualizing attention in the Transformer model (BERT, GPT-2, and XLNet)
深度学习数学、模型结构和基础应用
DeepSeek Coder: Let the Code Write Itself
Config files for my GitHub profile.
🚪✊Knock Knock: Be notified when your training ends with only two additional lines of code
A framework for few-shot evaluation of autoregressive language models.
👾 A library of state-of-the-art pretrained models for Natural Language Processing (NLP)
This is the repo for the paper "Revealing the Importance of Semantic Retrieval for Machine Reading at Scale".
Code for our SIGKDD'22 paper Pre-training-Enhanced Spatial-Temporal Graph Neural Network For Multivariate Time Series Forecasting.
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
The Triton TensorRT-LLM Backend
Reading Wikipedia to Answer Open-Domain Questions