- My name is João Lages
- Deep Learning Engineer @ Revolut
- I'm interested in everything about machine learning, with a focus on deep learning applied to text, images, tabular data, video, speech, time series, anything!
- How to reach me: [email protected]
- Blog posts:
  - Model Merging: MoE, Frankenmerging, SLERP, and Task Vector Algorithms - Deep dive on how LLM merging methods work (co-authored with Deci AI)
  - OpenAI JSON Mode vs Functions - Practical differences between these two ways of using the OpenAI API
  - Direct Preference Optimization (DPO) - A simplified explanation of the DPO algorithm applied to large language models, like Zephyr
  - Reinforcement Learning from Human Feedback (RLHF) - A simplified explanation of the RLHF algorithm applied to large language models, like ChatGPT
  - Transformers KV Caching Explained - A short write-up on how Key and Value states are cached in transformers for faster inference
  - Transformers Positional Encodings Explained - Positional encoding and how it limits the input size of language models
  - Mahalanobis for outlier detection - A simple demo on how to use the Mahalanobis distance for outlier detection
- Main open-source contributions:
  - Diffusers-Interpret 🤗🧨🕵️‍♂️ - Own package, a model explainability tool built on top of 🤗 Diffusers
  - Ecco - Major contributions to this package, which is used to explain, analyze, and visualize NLP language models
  - AI Reading Group - Co-author of an open AI reading group from 2019 to 2023
  - RATransformers - Own package, used to make transformer models relation-aware
  - 🤗 datasets - Implemented the Mahalanobis distance metric
joaolages / trec_webtrack
Relevance ranking for Ad-hoc Retrieval. This is a repository used to employ Machine Learning models on the TREC Web Track.
License: GNU General Public License v3.0