
cvae's People

Contributors

shulgin-s, unnir


cvae's Issues

Normalizing loss over the entire dataset vs. normalizing over the number of batches

I've noticed that when you display the training loss:

    print('====> Epoch: {} Average loss: {:.4f}'.format(
          epoch, train_loss / len(train_loader.dataset)))

you normalize it by the length of the entire dataset. Couldn't you instead divide it by the number of batches processed in that epoch, to get an average loss per batch?
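For comparison, a sketch of the per-batch variant; it assumes train_loss is the loss summed over the epoch, and uses the fact that in PyTorch len(train_loader) gives the number of batches:

    print('====> Epoch: {} Average loss per batch: {:.4f}'.format(
          epoch, train_loss / len(train_loader)))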

A Contribution

I would like to contribute to this project; there are many aspects that can be improved. Here are the changes I propose for the code:

  1. Determine the training device automatically (see the sketch after this list).

  2. Pass arguments to the script instead of declaring them in the code (e.g., batch_size, latent_size, epochs).

  3. Compute the loss in the model's forward method for faster computation.

  4. Remove the unnecessary one-hot encoding of labels, since it is done automatically inside BCE.

  5. Add a progress bar for both training and evaluation.

  6. Modularize the code into multiple files to make it easier to explore and improve.
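A minimal sketch of items 1 and 2; the argument names and default values here are illustrative, not taken from the repository:

    import argparse

    import torch

    # Item 1: pick the training device automatically.
    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')

    # Item 2: read hyperparameters from the command line instead of hard-coding them.
    parser = argparse.ArgumentParser(description='cVAE training')
    parser.add_argument('--batch_size', type=int, default=128)
    parser.add_argument('--latent_size', type=int, default=2)
    parser.add_argument('--epochs', type=int, default=10)
    args = parser.parse_args()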
    

Additionally, there are several enhancements that could improve model training and usage:

  1. Use Automatic Mixed Precision (AMP) for faster training on GPUs (see the sketch after this list).

  2. Implement a learning rate scheduler.

  3. Add a loss-graph visualizer such as Weights & Biases (wandb).

  4. Add a method to save/load the model weights.

  5. Add a requirements.txt file.
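A self-contained sketch of items 1, 2, and 4; the linear model is a stand-in for the cVAE, and all hyperparameter values are illustrative:

    import torch
    import torch.nn as nn

    device = torch.device('cuda' if torch.cuda.is_available() else 'cpu')
    model = nn.Linear(784, 20).to(device)  # stand-in for the cVAE
    optimizer = torch.optim.Adam(model.parameters(), lr=1e-3)

    # Item 2: halve the learning rate every 10 epochs.
    scheduler = torch.optim.lr_scheduler.StepLR(optimizer, step_size=10, gamma=0.5)

    # Item 1: AMP only takes effect on CUDA devices.
    use_amp = torch.cuda.is_available()
    scaler = torch.cuda.amp.GradScaler(enabled=use_amp)

    for epoch in range(3):
        x = torch.randn(32, 784, device=device)  # placeholder batch
        optimizer.zero_grad()
        with torch.cuda.amp.autocast(enabled=use_amp):
            loss = model(x).pow(2).mean()  # placeholder loss
        scaler.scale(loss).backward()
        scaler.step(optimizer)
        scaler.update()
        scheduler.step()

    # Item 4: save and restore the weights.
    torch.save(model.state_dict(), 'cvae.pt')
    model.load_state_dict(torch.load('cvae.pt'))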
    

These changes, along with other smaller improvements, would meaningfully improve the project as a whole.

KLD term in loss_function()

In loss_function, instead of doing

    KLD = -0.5 * torch.sum(1 + logvar - mu.pow(2) - logvar.exp())
    return BCE + KLD

shouldn't we normalize the KLD term over the batch instead, for example with torch.mean()?
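One way to write that (a sketch, not the repository's code; it assumes mu and logvar have shape [batch_size, latent_size]):

    # Sum the KL term over the latent dimensions, then average over the batch.
    KLD = -0.5 * torch.mean(torch.sum(1 + logvar - mu.pow(2) - logvar.exp(), dim=1))
    return BCE + KLD

For the two terms to stay on a consistent scale, BCE would then also need to be averaged over the batch rather than summed.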
