Comments (4)
It's quite useful to log validation loss for early stopping as well. It's easy to overfit, especially with large models and small datasets and the validation set is the best way to determine...
Feel free to open a discussion if you'd like more input on this.
from mlx-examples.
Member
I notice the Val loss will be printed every 200 iterations, how can I make it print more frequent?
from mlx-examples.
--steps-per-eval=100
for example
from mlx-examples.
python -m mlx_lm.lora --help
for all the options
from mlx-examples.
Related Issues (20)
- A simple enhancement, in dataset creation time HOT 1
- [Question]about creating the 'adapters.npz' file HOT 3
- [QUESTION] Is there a way to provide a Huggingface access token for downloading models that are private? HOT 1
- [Model Request] Add support for IBM's Granite model HOT 2
- [Feature] Export Lora Adapters as GGML HOT 3
- Error when running inference on newly converted OpenELM MLX model, ValueError(f"Received parameters not in model: {extras}.") HOT 1
- LLMEvaluator : libc++abi: terminating due to uncaught exception of type std::invalid_argument: [matmul] Last dimension of first input with shape (1,916,2048) must match second to last dimension of second input with shape (256,32000)
- Unable to allocate memory
- Proposal: Add mypy to .pre-commit-config.yml HOT 2
- Fusing adapters with llama3 cause bad performances HOT 7
- Struggling to convert models to MLX HOT 2
- mlx_lm stops generating HOT 1
- lora resume error HOT 2
- Error loading GGUF Mixtral 8x7B Q_8 model HOT 1
- iterate_batches in mlx_lm's Lora trainer is discarding the remainder dataset items (modulo batch size) HOT 1
- 01-ai/Yi-6B-Chat got IndexError: list assignment index out of range HOT 2
- [Feature Request] Finetuning Scripts for Whisper Models HOT 1
- Feature Request - Beam Search Decoder
- gpt-neox HOT 4
- [Feature] Export Lora Adapters as GGML HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from mlx-examples.