Comments (5)
Yes, in the config, you can select it individually.
from aqt.
Thanks for the speedy reply. Would you mind pointing me to the config field that toggles this?
from aqt.
https://github.com/google/aqt/blob/main/aqt/common/aqt_config.py#L362
As you see lhs
and rhs
have completly separate quantization configs.
from aqt.
Yes, but from what i see in the underlying dot product code, if activation is not quantized, a float * float dot product is used? https://github.com/google/aqt/blob/main/aqt/jax/aqt_dot_general.py#L91 Is this a fake quantization or is this also the arithmetic in serving time?
from aqt.
Sorry that I missed your comment.
For the dot product to be accelerated, both sides need to have the same type.
You are probably most concerned about weight loading.
We will implement that in AQTv2 within a month.
from aqt.
Related Issues (20)
- Difference between jax and jax legacy? HOT 1
- Broken `aqtp-0.1.1` package: missing `aqt` package prefix HOT 1
- How to use this package to quantize a pretrained model in huggingface, such as BERT / Roberta? HOT 1
- Store quantized weights and calibration scales in the checkpoint. HOT 1
- Refactor config/code classes to follow Flax.
- Port static quantization from AQTv1 to AQTv2
- Implement FP8 Numerics. HOT 1
- Add functionality to allow QK cache quantization. HOT 1
- Implement backprop quantization for convolution.
- Does JAXv2 allow for arbitrary quantization? HOT 1
- Performance of MNIST example HOT 1
- Test HOT 2
- AqtEinsum 'not enough values to unpack'
- TypeError: dataclass() got an unexpected keyword argument 'frozen' HOT 2
- Can AQT be used to calculate qk score? HOT 2
- Binary quantization?
- Embedded inference or tflite compatibility on the roadmap?
- How to use it with jnp.einsum? HOT 1
- NormalFloat4 support
- generalized einsum or matmul api for pure jax
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from aqt.