Light

João Lages photo

joaolages Goto Github PK

followers: 51.0 following: 18.0 repos: 21.0 gists: 3.0

Name: João Lages

Type: User

Company: Revolut

Bio: I live my life as a gradient descent algorithm: one step at a time to find local minimas that maximize my goals.

Hi there 👋

🙋‍♂️ My name is João Lages
👷‍ Deep Learning Engineer @ Revolut
🌱 I’m interested in everything about machine learning, with focus on deep learning applied to text, images, tabular data, video, speech, time-series, anything!
📫 How to reach me: [email protected]
✍️ Blog posts:
- Model Merging: MoE, Frankenmerging, SLERP, and Task Vector Algorithms 🧌 - Deep dive on how LLM merging methods work (co-authored with Deci AI)
- OpenAI JSON Mode vs Functions - Practical differences between these two ways of using OpenAI API
- Direct Preference Optimization (DPO) - A simplified explanation of the DPO algorithm applied to large language models, like Zephyr
- Reinforcement Learning from Human Feedback (RLHF) 🙋‍♂️ - A simplified explanation of the RLHF algorithm applied to large language models, like ChatGPT
- Transformers KV Caching Explained 💾 - A short writing on how Key and Value states are cached in transformers for faster inference
- Transformers Positional Encodings Explained 📝 - Positional encoding and how it limits the input size of language models
- Mahalanobis for outlier detection - A simple demo on how to use mahalanobis distance for outlier detection
⭐ Main open-source contributions:
- Diffusers-Interpret 🤗🧨🕵️‍♀️ - Own package, a model explainability tool built on top of 🤗 Diffusers
- Ecco - Major contributions to this package that is used to explain, analyze, and visualize NLP language models
- AI Reading Group - Co-author of an open AI reading group from 2019-2023
- RATransformers 🐭 - Own package, used to make transformer models relation-aware
- 🤗 datasets - implemented the mahalanobis distance metric

João Lages's Projects

advanced-computer-architectures

Mini-Projects I did for the course of Advanced Computer Architectures. Development of a processor in VHDL

agony-card-game

The Agony game, developed as an app running via WiFi-Direct.

blackjack-game

Blackjack game with a lot of options and a Swing GUI

cpd_game_of_life

Implementation of Game of Life in C++ done for the course of Distributed and Parallel Computing.

datasets

🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools

diffusers-interpret

Diffusers-Interpret 🤗🧨🕵️‍♀️: Model explainability for 🤗 Diffusers. Get explanations for your generated images.

ecco

Visualize and explore NLP language models. Ecco creates interactive visualizations directly in Jupyter notebooks explaining the behavior of Transformer-based language models (like GPT2).

general-search-algorithm

A General Search algorithm developed in Python for the course of Artificial Intelligence

image2video

This repository contains the source code for the paper First Order Motion Model for Image Animation

joaolages

lxmls-toolkit

Machine Learning applied to Natural Language Processing Toolkit used in the Lisbon Machine Learning Summer School

mysql-and-php-web-application

Mini-Project developed for the course of Database Systems

pixelsfund

Project @ PixelsCamp 2017

ratransformers

RATransformers 🐭- Make your transformer (like BERT, RoBERTa, GPT-2 and T5) Relation Aware!

satplan

Development of a SAT solver, following the SATPLAN, in Python for the course of Artificial Intelligence

temperature-control-of-multi-core-platforms

Optimization of core temperatures in a CPU using CVX

terminal-chat-in-c

Project done for a course in my university

test-suite-sql-eval

Semantic Evaluation for Text-to-SQL with Distilled Test Suites

tf_objectdetection_api

Tutorial on how to create your own object detection dataset and train using TensorFlow's API

transformers

🤗 Transformers: State-of-the-art Machine Learning for Pytorch, TensorFlow, and JAX.

trec_webtrack

Relevance ranking for Ad-hoc Retrieval. This is a repository used to employ Machine Learning models on the TREC Web Track.

1

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.