Comments (4)
I actually just tried to use convert on the model and I got this issue
python -m mlx_lm.convert --hf-path bigcode/starcoder2-3b:
File "/Users/awni/mlx-examples/llms/mlx_lm/utils.py", line 413, in fetch_from_hub
config = AutoConfig.from_pretrained(model_path)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/awni/miniconda3/lib/python3.11/site-packages/transformers/models/auto/configuration_auto.py", line 1130, in from_pretrained
raise ValueError(
ValueError: The checkpoint you are trying to load has model type `starcoder2` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
Which Transformers version are you using?
StarCoder2 needs Transformers built from master, which includes the model and configuration support. It is not in a release build yet.
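A quick way to see whether the installed Transformers already knows the `starcoder2` model type is to look it up in the registry that `AutoConfig` consults. This is only a sketch: the helper name `supports_model_type` is made up for illustration, and it assumes `CONFIG_MAPPING` is importable from `transformers.models.auto.configuration_auto` (the module shown in the traceback).

```python
import importlib.util


def supports_model_type(model_type: str):
    """Hypothetical helper: report whether Transformers knows `model_type`.

    Returns True or False when Transformers is importable, and None when
    it is not installed (or fails to import).
    """
    if importlib.util.find_spec("transformers") is None:
        return None
    try:
        # Assumed location of the model-type registry used by AutoConfig.
        from transformers.models.auto.configuration_auto import CONFIG_MAPPING
    except ImportError:
        return None
    return model_type in CONFIG_MAPPING


print(supports_model_type("starcoder2"))
```

If this prints `False`, installing Transformers from source (for example `pip install git+https://github.com/huggingface/transformers`) should pick up the architecture until a release includes it.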
from mlx-examples.
I actually just tried to use convert on the model and I got this issue
python -m mlx_lm.convert --hf-path bigcode/starcoder2-3b:
File "/Users/awni/mlx-examples/llms/mlx_lm/utils.py", line 413, in fetch_from_hub
config = AutoConfig.from_pretrained(model_path)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/awni/miniconda3/lib/python3.11/site-packages/transformers/models/auto/configuration_auto.py", line 1130, in from_pretrained
raise ValueError(
ValueError: The checkpoint you are trying to load has model type `starcoder2` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
Which Transformers version are you using?
StarCoder2 needs Transformers built from master, which includes the model and configuration support. It is not in a release build yet.
I am using the version built from source
from mlx-examples.
I actually just tried to use convert on the model and I got this issue
python -m mlx_lm.convert --hf-path bigcode/starcoder2-3b:
File "/Users/awni/mlx-examples/llms/mlx_lm/utils.py", line 413, in fetch_from_hub
config = AutoConfig.from_pretrained(model_path)
^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^^
File "/Users/awni/miniconda3/lib/python3.11/site-packages/transformers/models/auto/configuration_auto.py", line 1130, in from_pretrained
raise ValueError(
ValueError: The checkpoint you are trying to load has model type `starcoder2` but Transformers does not recognize this architecture. This could be because of an issue with the checkpoint, or because your version of Transformers is out of date.
Which Transformers version are you using?
from mlx-examples.
Fix is here #574
from mlx-examples.