Git Product home page Git Product logo

transformers's Introduction

transformers course

make sense logo

πŸ‘‹ hello

I'm Peter, a software engineer embarking on a fresh journey into the world of transformers, and I invite you to join me! The course is a work in progressβ€”it’s free, open-source, and we’ll be building it together, step by step. We'll explore key concepts, tackle practical exercises, and dissect seminal papers, all while learning and growing together. Using YouTube videos for clarity and Jupyter notebooks for hands-on practice, we're set for our collaborative journey into the world of transformers. Let's dive in together! πŸš€

πŸš€ Course program

πŸ”‘ Key concepts

  • Encoder-decoder architecture
  • Self-attention
  • Multi-head attention
  • Positional encoding
  • Keys, queries, and values
  • Word embeddings
  • Dynamic padding
  • Tokenization

πŸ› οΈ Practical exercises

  • Implement self-attention from scratch
  • Implement multi-head attention from scratch
  • Build a simple transformer model for a sequence-to-sequence task
  • Fine-tune a pre-trained model like BERT or GPT-2 on a specific task
  • Use a pre-trained transformer like GPT-2 for text generation
  • Train ViT on custom dataset for image classification

πŸ—žοΈ Paper reviews

  • "Attention Is All You Need" (2017) [link]
  • "BERT: Pre-training of Deep Bidirectional Transformers for Language Understanding" (2018) [link]
  • "ViT: An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale" (2020) [link]
  • "DETR: End-to-End Object Detection with Transformers" (2020) [link]
  • "CLIP: Learning Transferable Visual Models From Natural Language Supervision" (2021) [link]
  • "GPT-3: Language Models are Few-Shot Learners" (2020) [link]

🎬 Upcoming videos

  • Introduction to the course (coming soon)
  • Self-attention (coming soon)
  • Multi-head attention (coming soon)
  • Paper review: "Attention Is All You Need" (coming soon)

🦸 Contribution

I would love your help in making this repository even better! Whether you want to correct a typo, add some new content, or if you have any suggestions for improvement, feel free to open an issue.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.