seanzhang-zhichen Goto Github PK
Type: User
Type: User
pytorch
百川Dynamic NTK-ALiBi的代码实现:无需微调即可推理更长文本
BiLSTM-CRF-NER
基于bert的图书多分类
利用Chatgpt来做知识库问答
中文法律大模型
Chinese version of CLIP which achieves Chinese cross-modal retrieval and representation generation.
A one-stop data processing system to make data higher-quality, juicier, and more digestible for LLMs! 🍎 🍋 🌽 ➡️ ➡️🍸 🍹 🍷为大语言模型提供更高质量、更丰富、更易”消化“的数据!
扩充百川大模型词表,其他模型也类似
基于深度学习的FAQ式问答系统
使用fasttext二分类
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models.
使用Qdrant + cnclip + gradio 实现图文检索
Unify Efficient Fine-tuning of 100+ LLMs
Llama3-Chinese是以Meta-Llama-3-8B为底座,使用 DORA + LORA+ 的训练方法,在50w高质量中文多轮SFT数据 + 10w英文多轮SFT数据 + 2000单轮自我认知数据训练而来的大模型。
minhash + lsh 在海量数据集上去重
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2,GPT-4,LLaMa2, Qwen,GLM, Claude, etc) over 100+ datasets.
Pytorch Bert+BiLstm二分类
基于Bilstm + CRF的信息抽取模型
Qwen-WisdomVast is a large model trained on 1 million high-quality Chinese multi-turn SFT data, 200,000 English multi-turn SFT data, and 2,000 single-turn self-cognition data, using the training methods of DORA and LORA+ based on Qwen1.5-7B as the base. Compared to Qwen1.5-7B-Chat, it has improved mathematical abilities by 5.16%, 12.8% on the Human
SimCSE
文本分类器
TextCNN 文本分类(Pytorch)
Baichuan-13B-Base模型的继续预训练与微调
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
用子牙模型来做增量预训练,注入领域知识
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.