Comments (2)
Right so the parameter is actually named --max-tokens
now. This is just an upper limit on the maximum number of tokens. Usually the model stops when it encounters an end of sentence token before it hits the upper limit.
from mlx-examples.
Thanks a lot.
from mlx-examples.
Related Issues (20)
- Error when running inference on newly converted OpenELM MLX model, ValueError(f"Received parameters not in model: {extras}.") HOT 1
- LLMEvaluator : libc++abi: terminating due to uncaught exception of type std::invalid_argument: [matmul] Last dimension of first input with shape (1,916,2048) must match second to last dimension of second input with shape (256,32000)
- Unable to allocate memory
- Proposal: Add mypy to .pre-commit-config.yml HOT 2
- Fusing adapters with llama3 cause bad performances HOT 7
- Struggling to convert models to MLX HOT 2
- mlx_lm stops generating HOT 1
- lora resume error HOT 2
- Error loading GGUF Mixtral 8x7B Q_8 model HOT 1
- iterate_batches in mlx_lm's Lora trainer is discarding the remainder dataset items (modulo batch size) HOT 1
- 01-ai/Yi-6B-Chat got IndexError: list assignment index out of range HOT 2
- [Feature Request] Finetuning Scripts for Whisper Models HOT 1
- Feature Request - Beam Search Decoder
- Discrepancies in generations from the fine tuned models after and before converting them into GGUF. The output generations go into an infinite loop. HOT 12
- NameError: name 'resume_adapter_file' is not defined HOT 1
- Received parameters not in model: {extras}. HOT 1
- support for Gemma 2 HOT 1
- Model type deepseek_v2 not supported. HOT 18
- [Feature Request] Supports fine-tuning of Vision models HOT 1
- Feature Request - Generate function, which returns response and logprobs for the response? HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mlx-examples.