Comments (6)
Thanks for the feature request! This sounds like a very useful feature and would be simple to add. We'll add it to our todo list.
from curated-transformers.
If there's an easy way to also support loading from a byte string, I'd see that as desirable too 🙂
I get that it might not be so easy though, depending on the way PyTorch does things.
We could definitely support that. But we still need to hash out how we want to serialize the model hyperparameters (outside TorchScript/ONNX/etc.). What we have in mind right now is generating configuration JSON that is compatible with HF transformers. We could then serialize the config + state dict to a byte string.
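As a rough illustration of the "config + state dict to a byte string" idea, here is a minimal sketch. It assumes a made-up container layout (a length-prefixed JSON config followed by opaque state-dict bytes); the function names `pack_checkpoint` and `unpack_checkpoint` and the layout itself are hypothetical and are not the format curated-transformers uses. The state-dict bytes are treated as an opaque payload, which could for instance be produced by `torch.save` writing into an `io.BytesIO` buffer.

```python
import json
import struct
from typing import Tuple

# Hypothetical layout (an assumption, not the library's actual format):
#   [4-byte big-endian config length][UTF-8 JSON config][state dict bytes]
_HEADER = struct.Struct(">I")


def pack_checkpoint(config: dict, state_bytes: bytes) -> bytes:
    """Combine a JSON-serializable config and raw state-dict bytes into one blob."""
    config_bytes = json.dumps(config, sort_keys=True).encode("utf-8")
    return _HEADER.pack(len(config_bytes)) + config_bytes + state_bytes


def unpack_checkpoint(blob: bytes) -> Tuple[dict, bytes]:
    """Split a blob made by pack_checkpoint back into (config, state_bytes)."""
    if len(blob) < _HEADER.size:
        raise ValueError("blob is too short to contain a header")
    (config_len,) = _HEADER.unpack_from(blob, 0)
    start = _HEADER.size
    end = start + config_len
    if end > len(blob):
        raise ValueError("config length exceeds blob size")
    config = json.loads(blob[start:end].decode("utf-8"))
    return config, blob[end:]


# Illustrative values only; the keys mimic HF-style config names.
example_config = {"hidden_size": 768, "num_hidden_layers": 12}
example_state = b"\x00\x01\x02"  # stand-in for serialized weights
example_blob = pack_checkpoint(example_config, example_state)
restored_config, restored_state = unpack_checkpoint(example_blob)
```

Keeping the config as plain JSON at the front of the blob means the hyperparameters can be read without deserializing the weights.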
@bilelomrani1 the main branch now supports this, so if you want to give it a spin:
Thank you @shadeMe @danieldk, this works exactly as intended! I'll leave the issue open in case you want to track the status of loading from a byte string here, but feel free to close it, as this solves my issue.
Thanks for the suggestion and for testing 👍.