minglunhan Goto Github PK
Name: Mason
Type: User
Bio: My research interests include Multimodal LLM, Large Speech Foundation Models and ASR.
Location: Beijing, China.
Name: Mason
Type: User
Bio: My research interests include Multimodal LLM, Large Speech Foundation Models and ASR.
Location: Beijing, China.
chinese NLP corpus of chinese science fiction,chinese science fiction corpus : About 4675 Chinese science fiction novels 大约有4675本科幻小说,中文科幻小说自然语言处理语料库,中文科幻小说文本语料库,中文科幻小说文本数据库,科幻小说语料
AcadHomepage: A Modern and Responsive Academic Personal Homepage
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
Collection of resources on the applications of Large Language Models (LLMs) in Audio AI.
🏆 A ranked gallery of awesome streamlit apps built by the community
Book PDF
Free Books
基于ChatGLM-6B模型,进行下游具体任务微调,涉及Freeze、Lora、P-tuning等
Use ChatGPT to summarize the arXiv papers. 全流程加速科研,利用chatgpt进行论文全文总结+专业翻译+润色+审稿+审稿回复
chinese speech pretrained models
[ICASSP 2022] Improving End-to-End Contextual Speech Recognition with Fine-Grained Contextual Knowledge Selection
[INTERSPEECH 2023] Knowledge Transfer from Pre-trained Language Models to Cif-based Recognizers via Hierarchical Distillation
[ICASSP 2020] CIF: Continuous Integrate-and-Fire for End-to-End Speech Recognition (A PyTorch implementation of Continuous Integrate-and-Fire mechanism).
主题:计算认知科学(Computational Cognitive Science)。此仓库诞生背景为IA003结业BP,仍处于萌芽期,内容设置有待转正。下一次大规模更新估计在三四年之后。
收藏的一些经典的历史、政治、心理、哲学、数学、计算机方面电子书(约10万本)
ASR-TTS experiments based on espnet. recipe for librispeech available
Facebook AI Research Sequence-to-Sequence Toolkit written in Python.
An implementation of Microsoft's "FastSpeech 2: Fast and High-Quality End-to-End Text to Speech"
Optimizing the FastSpeech2 model using the LJSpeech Dataset
Intro to Reinforcement Learning (强化学习纲要)
Extract xvector and ivector under kaldi
可醉楼藏书
Using Low-rank adaptation to quickly fine-tune diffusion models.
Codes for processing meeting summarization datasets AMI and ICSI.
A simple and elegant Jekyll theme for an academic personal homepage
Command line utility for forced alignment using Kaldi
Enriching MS-COCO with Chinese sentences and tags for cross-lingual multimedia tasks
Official repository of OFA (ICML 2022). Paper: OFA: Unifying Architectures, Tasks, and Modalities Through a Simple Sequence-to-Sequence Learning Framework
A fast and lightweight python-based CTC beam search decoder for speech recognition.
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.