wind91725 Goto Github PK
Type: User
Type: User
A curated list of pretrained sentence and word embedding models
End-to-End recipes for pre-training and fine-tuning BERT using Azure Machine Learning Service
Repository for the paper "Optimal Subarchitecture Extraction for BERT"
A Large-scale Chinese Short-Text Conversation Dataset and Chinese pre-training dialog models
“万创杯”中医药天池大数据竞赛——中医文献问题生成挑战 决赛 第一名方案
中文公开聊天语料库
Large-scale Pre-training Corpus for Chinese 100G 中文预训练语料
A Span-Extraction Dataset for Chinese Machine Reading Comprehension (CMRC 2018)
自然语言处理,知识图谱相关语料。按照Task细分,欢迎PR。
Chinese Pre-Trained Language Models (CPM-LM) Version-I
在bert4keras下加载CPM_LM模型
Implementation / replication of DALL-E, OpenAI's Text to Image Transformer, in Pytorch
Poetry-related datasets developed by THUAIPoet (Jiuge) group.
DeepIE: Deep Learning for Information Extraction
DELTA is a deep learning based natural language and speech processing platform.
本项目将《动手学深度学习》(Dive into Deep Learning)原书中的MXNet实现改为PyTorch实现。
基于开源GPT2.0的初代创作型人工智能 | 可扩展、可进化
Code for AAAI2021 paper: Few-Shot Learning for Multi-label Intent Detection.
An implementation of model parallel GPT2& GPT3-like models, with the ability to scale up to full GPT3 sizes (and possibly more!), using the mesh-tensorflow library.
根据gpt2-ml中文模型finetune自己的数据集
GuwenBERT: 古文预训练语言模型 a Pre-trained Language Model for Classical Chinese (Literary Chinese)
[ICLR 2021] "InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective" by Boxin Wang, Shuohang Wang, Yu Cheng, Zhe Gan, Ruoxi Jia, Bo Li, Jingjing Liu
LightSeq: A High Performance Library for Sequence Processing and Generation
A minimal PyTorch re-implementation of the OpenAI GPT (Generative Pretrained Transformer) training
XiaoMi Natural Language Processing Toolkits
此项目是机器学习(Machine Learning)、深度学习(Deep Learning)、NLP面试中常考到的知识点和代码实现,也是作为一个算法工程师必会的理论基础知识。
基于mlm方式的带有纠错功能的拼音转汉字bert预训练模型,pinyin correcter,基于pytorch框架实现
Pre-trained and Reproduced Deep Learning Models (『飞桨』官方模型库,包含多种学术前沿和工业场景验证的深度学习模型)
a new mrc system for microsoft marco dataset.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.