Git Product home page Git Product logo

cs-552-modern-nlp's Introduction

CS-552: Modern Natural Language Processing

Course Description

Natural language processing is ubiquitous in modern intelligent technologies, serving as a foundation for language translators, virtual assistants, search engines, and many more. In this course, we cover the foundations of modern methods for natural language processing, such as word embeddings, recurrent neural networks, transformers, and pretraining, and how they can be applied to important tasks in the field, such as machine translation and text classification. We also cover issues with these state-of-the-art approaches (such as robustness, interpretability, sensitivity), identify their failure modes in different NLP applications, and discuss analysis and mitigation techniques for these issues.

Quick access links:

Class

Platform Where & when
Lectures Wednesdays: 11:15-13:00 [STCC - Cloud C] & Thursdays: 13:15-14:00 [CE16]
Exercises Session Thursdays: 14:15-16:00 [CE11]
Project Assistance
(not every week)
Wednesdays: 13:15-14:00 [STCC - Cloud C]
QA Forum & Annoucements Ed Forum [link]
Grades Moodle [link]

All lectures will be given in person and live streamed on Zoom. The link to the Zoom is available on the Ed Forum (pinned post). Beware that, in the event of a technical failure during the lecture, continuing to accompany the lecture live via zoom might not be possible.

Recording of the lectures will be made available on SwitchTube. We will reuse some of last year's recordings and we may record a few new lectures in case of different lecture contents.

Lecture Schedule

Week Date Topic Instructor
Week 1 21 Feb
22 Feb
Introduction | Building a simple neural classifier [slides]
Neural LMs: word embeddings [slides]
Antoine Bosselut
Week 2 28 Feb
29 Feb
LM basics | Neural LMs: Fixed Context Models [slides]
Neural LMs: RNNs, Backpropagation, Vanishing Gradients; LSTMs [slides]
Antoine Bosselut
Week 3 6 Mar
7 Mar
Seq2seq + decoding + attention | Transformers
Transformers + Greedy Decoding; GPT
Antoine Bosselut
Week 4 13 Mar
14 Mar
Pretraining: ELMo, BERT, MLM, task generality | Transfer Learning: Introduction
Pretraining S2S: BART, T5
Antoine Bosselut
Week 5 20 Mar
21 Mar
Transfer Learning: Dataset Biases
Generation: Task
Antoine Bosselut
Week 6 27 Mar
28 Mar
Generation: Decoding and Training
Generation: Evaluation
Antoine Bosselut
EASTER BREAK
Week 7 10 Apr
11 Apr
In-context Learning - GPT-3 + Prompts | Instruction Tuning
Project Description
Antoine Bosselut
Week 8 17 Apr
18 Apr
Scaling laws | Model Compression
No class (Project work; A1 Grade Review Session)
Antoine Bosselut
Week 9 24 Apr
25 Apr
Ethics in NLP: Bias / Fairness and Toxicity, Privacy, Disinformation
No class (Project work; A1 Grade Review Session)
Anna Sotnikova
Week 10 1 May
2 May
Tokenization: BPE, WP, Char-based | Multilingual LMs
Guest Lecture: Kayo Yin
Negar Foroutan
Kayo Yin
Week 11 8 May
9 May
Syntactic and Semantic Tasks (NER) | Interpretability: BERTology
No class (Project work; A2 Grade Review Session)
Gail Weiss
Week 12 15 May
16 May
Reading Comprehension | Retrieval-augmented LMs
No class (Project work; A2 Grade Review Session)
Antoine Bosselut
Week 13 22 May
23 May
Multimodality: L & V
Looking forward
Syrielle Montariol
Antoine Bosselut
Week 14 29 May
30 May
No class (Project work; A3 Grade Review Session)

Exercise Schedule

Week Date Topic Instructor
Week 1 22 Feb Setup + Word embeddings [code] Mete Ismayilzada
Week 2 29 Feb Word embeddings review
Language and Sequence-to-sequence models [code]
Mete Ismayilzada
Badr AlKhamissi
Week 3 6 Mar Assignment 1 Q&A Mete Ismayilzada
Week 3 7 Mar Language and Sequence-to-sequence models review
Attention + Transformers
Assignment 1 Q&A
Badr AlKhamissi
Week 4 13 Mar Assignment 1 Q&A Simin Fan
Week 4 14 Mar Attention + Transformers review
Pretraining and Transfer Learning Pt. 1
Badr AlKhamissi
Simin Fan
Week 5 20 Mar Assignment 2 Q&A Simin Fan
Week 5 21 Mar Pretraining and Transfer Learning Pt. 1 review
Transfer Learning Pt. 2
Simin Fan
Week 6 28 Mar Assignment 2 Q&A Simin Fan
Week 6 28 Mar Transfer Learning Pt. 2 review
Text Generation
Simin Fan
Deniz Bayazit
EASTER BREAK
Week 7 10 Apr Assignment 3 Q&A Badr AlKhamissi
Deniz Bayazit
Week 7 11 Apr Text Generation review
In-context Learning
Deniz Bayazit
Mete Ismayilzada
Week 8 17 Apr Assignment 3 Q&A Badr AlKhamissi
Deniz Bayazit
Mete Ismayilzada
Week 8 18 Apr In-context Learning review Mete Ismayilzada
Week 9 24 & 25 Apr Project TA meetings on-demand
Week 10 1 & 2 May Project TA meetings on-demand
Week 11 8 & 9 May Project
Milestone 1 Feedback
TA meetings on-demand
Week 12 15 & 16 May Project TA meetings on-demand
Week 13 22 May A3 Grade Review Session Badr AlKhamissi
Deniz Bayazit
Week 13 23 May Project TA meetings on-demand
Week 14 30 May Project
Milestone 2 Feedback
TA meetings on-demand

Exercises Session format:

  • TAs will provide a small discussion over the last week's exercises, answering any questions and explaining the solutions. (10-15mins)
  • TAs will present this week's exercise. (5mins)
  • Students will be solving this week's exercises and TAs will provide answers and clarification if needed.

Note: Please make sure you have already done the setup prerequisites to run the coding parts of the exercises. You can find the instructions here.

Grading:

Your grade in the course will be computed according to the following guidelines.

Submission Format

Assignment and project release annoucements will be on Ed. Your work will be submitted as a repository created by GitHub classroom. Clicking the assignment link (announced on its release date) will automatically create a repository under your username (ensure it matches the one on the CS-552 GitHub registration form). Your last push to the repository will be considered as your final submission, with its timestamp determining any late days (see below for the policy).

All large files such as model checkpoints need to be pushed to the repository with Git LFS. Large files can take time to upload, therefore please avoid last-minute uploads that can create potential submission delays. We also propose to use Colab as a free GPU resource. You can find tutorials on all of these resources here.

Late Days Policy

All assignments and milestones are due at 23:59 on their due date. As we understand that circumstances can make it challenging to abide by these due dates, you will receive 6 late days over the course of the semester to be allocated to the assignments and project milestones as you see fit. No further extensions will be granted. The only exception to this rule is for the final report, code, and data. No extensions will be granted beyond June 14th.

Assignments (40%):

There will be three assignments throughout the course. They will be released and due according to the following schedule:

Assignment 1 (10%)

  • Released: 28 February 2024
  • Due: 17 March 2024
  • Grade released: 14 April 2024
  • Grade review sessions: 18 and 25 April 2024

Assignment 2 (15%)

  • Released: 20 March 2024
  • Due: 7 April 2024
  • Grade released: 5 May 2024
  • Grade review sessions: 9 and 16 May 2024

Assignment 3 (15%)

  • Released: 3 April 2024
  • Due: 21 April 2024
  • Grade released: 19 May 2024
  • Grade review sessions: 29 and 30 May 2024

Project (60%):

The project will be divided into 2 milestones and a final submission. Each milestone will be worth 15% of the final grade with the remaining 30% being allocated to the final report. Each team will be supervised by one of the course TAs or AEs.

More details on the content of the project and the deliverables of each milestone will be released at a later date.

Milestone 1:

  • Due: 5 May 2024

Milestone 2:

  • Due: 26 May 2024

Final Deliverable:

  • The final report, code, and date will be due on June 14th. Students are welcome to turn in their materials ahead of time, as soon as the semester ends.
  • Due: 14 June 2024

Contacts

Lecturer: Antoine Bosselut

Teaching assistants: Badr AlKhamissi, Deniz Bayazit, Beatriz Borges, Zeming (Eric) Chen, Simin Fan, Negar Foroutan Eghlidi, Silin Gao, Mete Ismayilzada

Please contact us for any organizational questions or questions related to the course content.

cs-552-modern-nlp's People

Contributors

bayazitdeniz avatar negar-foroutan avatar alibakly avatar mismayil avatar bkhmsi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.