yandexdataschool / practical_dl

DL course co-developed by YSDA, HSE and Skoltech

License: MIT License

Jupyter Notebook 97.89% Python 1.96% Dockerfile 0.01% C++ 0.02% Cuda 0.12%

deep-learning course course-materials theano lasagne

practical_dl's Introduction

Deep learning course

This repo supplements the Deep Learning course taught in fall '23. For the previous iteration, visit the spring branch.

Lecture and practice materials for each week are in the ./week* folders. You can complete all assignments locally or in Google Colab (see the readme files in week*).

General info

Telegram chat room (Russian).
Deadlines & grading rules can be found at this page.
For any technical issues, ideas, bugs in course materials, or contribution ideas, add an issue or ask around in the chat.

Syllabus

week01 Intro to deep learning
- Lecture: Deep learning -- introduction, backpropagation algorithm, adaptive optimization methods
- Seminar: Neural networks in numpy
- Homework 1 is out!
- Please begin worrying about installing pytorch. You will need it next week!
week02 Catch-all lecture about deep learning tricks
- Lecture: Deep learning as a language, dropout, batch/layer normalization, other tricks, deep learning frameworks
- Homework 2 is out!
- Seminar: PyTorch basics
week03 Convolutional neural networks
- Lecture: Computer vision tasks, Convolution and Pooling layers, ConvNet architectures, Data Augmentation
- Seminar: Training your first ConvNet

(to be updated)

Contributors & course staff

Course materials by

Victor Lempitsky - main track lecture videos (1-11)
Victor Yurchenko - intro notebooks, admin stuff
Vadim Lebedev - notebooks, admin stuff
Dmitry Ulyanov - notebooks on generative models & autoencoders
Fedor Ratnikov - pytorch & nlp notebooks, one bonus lecture
Oleg Vasilev - notebooks, technical issue resolution
Arseniy Ashukha - image captioning materials
Mikhail Khalman - variational autoencoder materials
Many bugs were fixed and course materials were improved by students and volunteers; see PR authorship.


practical_dl's Issues

week9: broken image link

https://s24.postimg.org/cw4nognxx/gan.png

The link to a "cool visualization" in homework01/homework_main.ipynb is broken

The link leads to http://scs.ryerson.ca/~aharley/vis/, but the server does not respond.

Can you put lecture slides/notes here as well to make repo complete?

Will the answer to the assignment be released? sir

week5: tsne usage for nlp

t-SNE has no "transform" functionality, so it cannot compute the embedding of a new entry. As a result, the query's neighbors cannot be found in the embedded space (assuming the query is not in the training data).
(The task does not actually require t-SNE, but that is not obvious.)
The seminar notebook encourages using TSNE with verbose 100 or 1000, but the running time blows up from a few seconds to unknown (I lost my patience after 10 minutes). Maybe add a warning about this for students?
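One way around the missing "transform" is to search for neighbors in the original embedding space and keep t-SNE for plotting only. The sketch below assumes cosine similarity and uses random vectors as stand-ins for the real word embeddings; `nearest_neighbors`, `train_vecs` and `query_vec` are hypothetical names, not taken from the notebook.

```python
import numpy as np

def nearest_neighbors(query_vec, train_vecs, k=3):
    """Return indices of the k training vectors most similar to query_vec (cosine)."""
    # Normalize rows so that a dot product equals cosine similarity.
    train_norm = train_vecs / np.linalg.norm(train_vecs, axis=1, keepdims=True)
    query_norm = query_vec / np.linalg.norm(query_vec)
    similarities = train_norm @ query_norm
    # Sort by descending similarity and keep the first k indices.
    return np.argsort(-similarities)[:k]

# Stand-in data (assumption): 100 training embeddings of dimension 32.
rng = np.random.default_rng(0)
train_vecs = rng.normal(size=(100, 32))
# A query that is not in the training data but lies close to entry 7.
query_vec = train_vecs[7] + 0.01 * rng.normal(size=32)

neighbors = nearest_neighbors(query_vec, train_vecs, k=3)
print(neighbors)
```

This works for unseen queries because it never needs a 2-D projection of the query.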

I sent my homework from [email protected] at 21.57 on Wednesday, 28/09/2016. I sent another email (21/11/2016) when I didn't find myself in the grade sheet, but there was no answer. So I've posted an issue here.

Is the formula correct in homework 1 ex 4 ?

There may be an error in the statement of ex 4. I believe it should be ||X||_F^2 = tr(...) instead. You also use the Frobenius norm later in the second approach.
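For reference, the two norms in question can be written side by side. These are the standard textbook definitions, not copied from the homework statement, which is not reproduced in this thread:

```latex
\|X\|_F^2 = \sum_{i,j} X_{ij}^2 = \operatorname{tr}\left(X^\top X\right),
\qquad
\|X\|_2 = \sigma_{\max}(X) = \sqrt{\lambda_{\max}\left(X^\top X\right)}
```

Since $\operatorname{tr}(X^\top X)$ is the sum of all eigenvalues of $X^\top X$, it equals $\|X\|_2^2$ only when $X$ has rank at most one; in general the trace identity holds for the Frobenius norm.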

In exercise 4 you say:

The fact is not too obvious, as the 2-norm is defined in Jorge Nocedal's Numerical Optimization book as:

I found the following readings useful regarding this issue, but they did not fully clarify it (the rabbit hole runs deep):

Pedersen and Pedersen - The Matrix Cookbook : (262) page 30
Stackoverflow and mathpages with proofs about the characteristic polynomial

Installing dependencies

You can discuss any issues concerning installation in this thread.

We assume that you have a basic data science toolkit (sklearn, numpy/scipy/pandas). Basically whatever comes with the default Anaconda distribution.

Anaconda: https://www.continuum.io/downloads (or simply use python with numpy/sklearn)

Assignments require numpy, scipy, pandas, matplotlib, scikit-learn and sometimes tqdm to launch. Luckily, all those packages are either pre-installed or can be installed with pip install <name>.
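A quick way to see which of these packages are still missing is to ask Python whether each one is importable. This is only a sketch: `REQUIRED` and `find_missing` are illustrative names, and the list uses import names (scikit-learn is imported as `sklearn`).

```python
import importlib.util

# Import names for the packages listed above (scikit-learn imports as "sklearn").
REQUIRED = ["numpy", "scipy", "pandas", "matplotlib", "sklearn", "tqdm"]

def find_missing(modules):
    """Return the module names that cannot be found in this environment."""
    return [name for name in modules if importlib.util.find_spec(name) is None]

missing = find_missing(REQUIRED)
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("All listed packages are importable.")
```

Anything reported as missing can then be installed with pip install <name>, using the package name rather than the import name where the two differ.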

You will also need to install PyTorch:

Installing on Linux / Mac OS: http://pytorch.org/
Installing on Windows: https://anaconda.org/peterjc123/pytorch (CPU only)

If you don't/can't install that (e.g. you use Windows and installation is tricky), try Docker Container for CPU or nvidia-docker for GPU.

If you run into any trouble, feel free to post here, even if it's like "i don't know what the hell all these letters mean!!!".

week4: not valid "img" input for testing week

Original test line
assert embedding(torch.Tensor(img)).data.numpy().shape == (1, 2048), "your output for single image should have shape (1, 2048)"

After facing the same problems as below, I decided to reproduce them on the default model to be sure.

Problem 1

---> 17 assert model(img).data.numpy().shape == (1, 2048), "your output for single image should have shape (1, 2048)"
 
/usr/local/lib/python3.6/dist-packages/torchvision/models/inception.py in forward(self, x)
     94     def forward(self, x):
     95         if self.transform_input:
---> 96             x_ch0 = torch.unsqueeze(x[:, 0], 1) * (0.229 / 0.5) + (0.485 - 0.5) / 0.5
     97             x_ch1 = torch.unsqueeze(x[:, 1], 1) * (0.224 / 0.5) + (0.456 - 0.5) / 0.5
     98             x_ch2 = torch.unsqueeze(x[:, 2], 1) * (0.225 / 0.5) + (0.406 - 0.5) / 0.5

TypeError: unsqueeze(): argument 'input' (position 1) must be Tensor, not numpy.ndarray

I fixed it using torch.Tensor(img), but ...

Problem 2

---> 17 assert model(torch.Tensor(img)).data.numpy().shape == (1, 2048), "your output for single image should have shape (1, 2048)"

/usr/local/lib/python3.6/dist-packages/torch/nn/modules/conv.py in conv2d_forward(self, input, weight)
    338                             _pair(0), self.dilation, self.groups)
    339         return F.conv2d(input, weight, self.bias, self.stride,
--> 340                         self.padding, self.dilation, self.groups)
    341 
    342     def forward(self, input):

RuntimeError: Expected 4-dimensional input for 4-dimensional weight 32 3 3, but got 3-dimensional input of size [299, 3, 3] instead
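Both errors point at the input layout: the model wants a 4-dimensional float tensor in (batch, channels, height, width) order. The sketch below assumes the notebook's `img` is a single image stored as (height, width, channels), which is a guess based on the traceback; `to_batch` is a hypothetical helper, not part of the assignment.

```python
import numpy as np

def to_batch(img):
    """Turn one (H, W, C) image into a float32 (1, C, H, W) batch."""
    if img.ndim != 3:
        raise ValueError(f"expected a 3-D image, got shape {img.shape}")
    # Move channels first, then add a leading batch axis of size 1.
    chw = np.transpose(img, (2, 0, 1))
    return np.ascontiguousarray(chw[None], dtype=np.float32)

# Stand-in image (assumption): 299x299 RGB, as Inception-style models expect.
img = np.zeros((299, 299, 3))
batch = to_batch(img)
print(batch.shape)  # (1, 3, 299, 299)
```

With PyTorch installed, `torch.from_numpy(batch)` would then produce a tensor that can be passed to the model.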

week08:autoencoders_pytorch Pooling layers usage

MaxUnpool needs the "indices" returned by MaxPool, but the former is located in the decoder and the latter in the encoder. As a result, the starter code's direct calls to the encoder and decoder do not support passing the indices from Pool to Unpool.

Possible solutions:

do not use direct calls for computing the code/reconstruction
request both the code and the reconstruction with one function call
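The second option could look roughly like the sketch below: one `forward` call computes both the code and the reconstruction, so the pooling indices never have to leave the module. The class name and layer sizes are made up for illustration and are not the starter code's architecture.

```python
import torch
import torch.nn as nn

class PoolUnpoolAutoencoder(nn.Module):
    """Toy autoencoder (hypothetical) that keeps pool indices inside forward()."""

    def __init__(self):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
        # return_indices=True makes the pool layer return (output, indices).
        self.pool = nn.MaxPool2d(2, return_indices=True)
        self.unpool = nn.MaxUnpool2d(2)
        self.deconv = nn.ConvTranspose2d(8, 3, kernel_size=3, padding=1)

    def forward(self, x):
        hidden = torch.relu(self.conv(x))
        code, indices = self.pool(hidden)
        # The decoder side reuses the indices produced by the encoder side.
        restored = self.unpool(code, indices, output_size=hidden.shape)
        reconstruction = self.deconv(restored)
        return code, reconstruction

model = PoolUnpoolAutoencoder()
batch = torch.randn(4, 3, 32, 32)
code, reconstruction = model(batch)
print(code.shape, reconstruction.shape)
```

The trade-off is that the decoder can no longer be called on an arbitrary code by itself, since it depends on indices from a specific input.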

week5: viz with bokeh in Colab

(Figure is not printed)

Were the seminars recorded?

I would like to know whether the seminars (in Russian) were recorded for the 2019 course, or whether there will be recordings of the seminars for the 2020 course.

Theano+Lasagne installation

Any issues concerning installation can just as well be sent here.

In this course, we'll use the following technology stack for deep learning:

Theano (symbolic computation graphs)
Lasagne (neural networks)
Agentnet (deep reinforcement learning) - only if you decide to complete the deep reinforcement learning assignments.

A simple roadmap to installing them can be found here -

only theano and lasagne - pick bleeding edge version
all 3 of them
docker container with all 3

The frameworks can be easily installed on Mac OS and Linux. Windows installation is a bit tougher, so if you don't feel like it, try using Docker (e.g. the Kitematic GUI or the console on Windows).

If you run into any trouble, feel free to post here, even if it's like "i don't know what the hell all these letters mean!!!".

week3: Invalidated Inception family link

The hacktilldawn.com domain is completely dead:
http://hacktilldawn.com/2016/09/25/inception-modules-explained-and-implemented/

week4: Memory problem

Google Colab:

CPU

The 100th iteration kills the runtime with "unknown reason".

CUDA

 0%|          | 110/25000 [00:01<05:47, 71.73it/s]
---------------------------------------------------------------------------
RuntimeError                              Traceback (most recent call last)
<ipython-input-19-adef1f515998> in <module>()
     26 
     27         # use your embedding model to produce feature vector
---> 28         features = embedding(input_tensor) #<YOUR CODE>
     29 
     30         X.append(features)



3 frames

/usr/local/lib/python3.6/dist-packages/torch/nn/functional.py in _max_pool2d(input, kernel_size, stride, padding, dilation, ceil_mode, return_indices)
    485         stride = torch.jit.annotate(List[int], [])
    486     return torch.max_pool2d(
--> 487         input, kernel_size, stride, padding, dilation, ceil_mode)
    488 
    489 max_pool2d = boolean_dispatch(

RuntimeError: CUDA out of memory. Tried to allocate 28.00 MiB (GPU 0; 11.17 GiB total capacity; 10.70 GiB already allocated; 5.81 MiB free; 141.58 MiB cached)
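One possible cause, offered here as an assumption because the full loop is not shown, is that each `features` tensor stays on the GPU with its autograd graph attached when it is appended to `X`, so memory grows with every iteration. The sketch below disables gradient tracking and moves each result to the CPU before storing it; the tiny `embedding` network is a hypothetical stand-in for the assignment's model.

```python
import torch
import torch.nn as nn

# Hypothetical stand-in for the assignment's `embedding` network.
embedding = nn.Sequential(nn.Conv2d(3, 4, 3), nn.AdaptiveAvgPool2d(1), nn.Flatten())
embedding.eval()

device = "cuda" if torch.cuda.is_available() else "cpu"
embedding.to(device)

X = []
for _ in range(5):
    input_tensor = torch.randn(1, 3, 32, 32, device=device)
    # no_grad() skips building the autograd graph, so activations are freed.
    with torch.no_grad():
        features = embedding(input_tensor)
    # Store a CPU NumPy copy so no GPU memory is held by the list.
    X.append(features.cpu().numpy())

print(len(X), X[0].shape)
```

If memory still runs out with this pattern, the cause is probably elsewhere, for example in the input size or the batch size.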

It leads to http://research.microsoft.com/en-us/um/people/cmbishop/prml/index.htm which redirects to https://www.microsoft.com/en-us/research/people/cmbishop/?from=http%3A%2F%2Fresearch.microsoft.com%2Fen-us%2Fum%2Fpeople%2Fcmbishop%2Fprml%2Findex.htm, which is probably not intended.