Git Product home page Git Product logo

mipt-nlp2022's Introduction

mipt-nlp2022

NLP course, MIPT

Course instructors

Anton Emelianov ([email protected], @king_menin), Albina Akhmetgareeva ([email protected])

Videos here

Exam questions here

Mark

$$final_mark=sum_i (max_score(HW_i)) / count(HWs) * 10, i==1..3$$

Lecture schedule

Week 1

Week 2

Distributional semantics. Count-based (pre-neural) methods. Word2Vec: learn vectors. GloVe: count, then learn. N-gram (collocations) RusVectores. t-SNE.

Week 3

Neural Language Models: Recurrent Models, Convolutional Models. Text classification (architectures)

Week 4

Task description, methods (Markov Model, RNNs), evaluation (perplexity), Sequence Labelling (NER, pos-tagging, chunking etc.) N-gram language models, HMM, MEMM, CRF

Week 5

Basics: Encoder-Decoder framework, Inference (e.g., beam search), Eval (bleu). Attention: general, score functions, models. Bahdanau and Luong models. Transformer: self-attention, masked self-attention, multi-head attention.

Week 6

Bertology (BERT, GPT-s, t5, etc.), Subword Segmentation (BPE), Evaluation of big LMs.

Week 7

Lecture & Practical: How to train big models? Distributed training

Training Multi-Billion Parameter Language Models. Model Parallelism. Data Parallelism.

Week 8

Squads (one-hop, multi-hop), architectures, retrieval and search, chat-bots

Week 9

Week 10

Recommended Resources

En

На русском (и про русский, в основном)

Literature

  • Manning, Christopher D., and Hinrich Schütze. Foundations of statistical natural language processing. Vol. 999. Cambridge: MIT press, 1999.
  • Martin, James H., and Daniel Jurafsky. "Speech and language processing." International Edition 710 (2000): 25.
  • Cohen, Shay. "Bayesian analysis in natural language processing." Synthesis Lectures on Human Language Technologies 9, no. 2 (2016): 1-274.
  • Goldberg, Yoav. "Neural Network Methods for Natural Language Processing." Synthesis Lectures on Human Language Technologies 10, no. 1 (2017): 1-309.

mipt-nlp2022's People

Contributors

alenush avatar king-menin avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.