Comments (1)
Added code to freeze all layers but the classification head of LLMs. When importing from transformers, differently named heads are created for various models and the layers to keep training need to be specified for each model.
When the model is allowed to tune fully, the training accuracy goes up to 0.9 but the val accuracy falls quickly after an initial spike as the model overfits and looses pretrained knowledge.
Training just the classification head is much more well behaved with the validation accuracy increasing up to a plateau. However, train accuracy is only 0.65 indicating underfitting.
TODO: Implement ability to add a custom classification head with specified complexity, and try training with more weights.
The BERT base was implemented in this branch, together with WIP code for RoBERTa and LLama-2. Turn off the latter as not implemented until finalized and merged. End scope creep here, and raise a separate issue to finalize.
from torch-control.
Related Issues (20)
- Implement sentiment chatbot using several public models
- Movie review sentiment analysis HOT 1
- Fine-tune most popular LLMs for movie sentiment analysis HOT 1
- Add the IMDB dataset to the sentiment analysis task data
- Drop augmented datapoints with variable labels
- Implement time series power consumption regression analysis HOT 1
- Implement Image Classification on dermaMNIST HOT 3
- Add MLOps HOT 3
- Add oversampling/augmentation to mitigate class imbalance
- Implement use of formal parameter search to log neatly with MLFlow HOT 1
- Implement customizable classification head for LLM fine tuning HOT 1
- Improve power consumption prediction model HOT 1
- Build docker container from mlflow and validate it HOT 1
- Upload Docker container to AWS as demo
- Implement production performance monitoring for power consumption model
- Build pipeline to extract structured text from images HOT 2
- Evaluate - Fine tuning the entire LLM network vs default classifier head vs bigger head HOT 5
- Summarize specific values from extracted text
- Detect bounding boxes containing text in images
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from torch-control.