Comments (4)
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
I'm also interested in this.
This issue is stale because it has been open for 30 days with no activity. Remove stale label or comment or this will be closed in 7 days.
This issue was closed because it has been inactive for 7 days since being marked as stale.
Related Issues (20)
- Converting Script for Mamba2 Hybrid to HF/Pytorch
- stop using ModelFilter and DatasetFilter from huggingface_hub
- fix exp_manager.py to work on Windows
- be prepared when a user selects a file that isn't strictly mono and other file extensions as well
- fix where the transcription is saved please
- dim unmatch when doing sft with tensor parallel and sequence parallel and LoRA
- NeMo/tutorials/speaker_tasks/ASR_with_SpeakerDiarization needs confidence estimation
- [rank1]: AttributeError: 'NoneType' object has no attribute 'get' (finetuning Mamba Hybrid)
- fastconformer hybrid recipe reports strange val_WER with `nemo:24.07` and `nemo:dev`
- SFT training getting nan loss when using PP=4, TP=4 and model params > 7b
- ERROR: Could not find a version that satisfies the requirement triton (from nemo-toolkit) (from versions: none)
- Error in converting LLaMA3.1 nemo checkpoint into HF
- Nemo ASR: TypeError: ConfidenceConfig.__init__() got an unexpected keyword argument 'tdt_include_duration'
- Unusually high initial loss during continual pre-training of the Gemma2-2B model.
- Can't run basic inference
- Continual training error: FileNotFoundError: [Errno 2] No such file or directory: '/tmp/tmpbqgpune1/model_weights/model.decoder.layers.self_attention.linear_proj._extra_state/shard_0_16.pt'
- 00_NeMo_Primer.ipynb in Google Collab fail
- Convert Mamba2 Hybrid .nemo model to .safetensors / .bin
- RuntimeError: stack expects each tensor to be equal size (when using lhotse shar data sets)
- megatron.core.dist_checkpointing.core.CheckpointingException: Object shard /ckpt/model_weights/model.decoder.layers.self_attention.core_attention._extra_state/shard_0_80.pt not found