
Comments (2)

tjtorres commented on September 26, 2024

The way the CLI is structured, it's a bit hard to change just one parameter rapidly (especially when moving to much higher resolutions). Going to higher resolutions often produces larger errors, and the batch size (or the learning rate) needs to be reduced at the start of training. Effectively, the randomly initialized weights can produce KL divergence terms that are quite large at the start, which can lead both to convergence on non-optimal minima and to error terms that overflow the float32 maximum (which is probably why you're getting the RuntimeWarning). If you want to go to higher resolutions, I'd recommend playing around with the parameters at the module level rather than using the command-line tool. I mostly included the CLI as a fun toy example so that people can get started quickly, but the customizability there is limited.
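For concreteness, here is a minimal NumPy sketch of the analytic KL term a VAE adds to its loss (illustrative only, not fauxtograph's actual code); if the randomly initialized encoder emits large means or log-variances, the exponential inside this term can overflow float32 and trigger exactly that kind of RuntimeWarning:

```python
import numpy as np

def kl_to_standard_normal(mu, log_var):
    """Analytic KL divergence of a diagonal Gaussian N(mu, exp(log_var))
    from the standard normal, summed over the latent dimensions."""
    return 0.5 * np.sum(np.exp(log_var) + mu ** 2 - 1.0 - log_var, axis=-1)

# With well-scaled encoder outputs the term stays modest...
mu = (np.random.randn(32, 100) * 0.1).astype(np.float32)
log_var = (np.random.randn(32, 100) * 0.1).astype(np.float32)
print(kl_to_standard_normal(mu, log_var).mean())        # small, O(1)

# ...but badly scaled initial weights (more likely with a huge input layer)
# can push log_var high enough that exp() overflows float32:
log_var_bad = np.full((32, 100), 100.0, dtype=np.float32)
print(kl_to_standard_normal(mu, log_var_bad).mean())     # inf + RuntimeWarning
```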

For higher resolutions, the right solution (and one I will likely end up implementing eventually) is to start with convolutional layers to reduce the representation size before moving to a fully connected system. Otherwise, the memory allocation and training time needed to avoid stepping down in layer size too rapidly after the input layer can be prohibitive.
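As a rough sketch of that direction (not code from the library; layer sizes are made up for illustration), a Chainer encoder could front-load a few strided convolutions before the fully connected mean/log-variance layers:

```python
import chainer
import chainer.functions as F
import chainer.links as L

class ConvEncoder(chainer.Chain):
    """Illustrative encoder: strided convolutions shrink the image before
    any fully connected layer, keeping the dense weight matrices small."""

    def __init__(self, latent_width=100):
        super().__init__()
        with self.init_scope():
            # Each stride-2 convolution halves the spatial resolution,
            # so a 128x128x3 image becomes 16x16x64 after three layers.
            self.c1 = L.Convolution2D(3, 16, ksize=4, stride=2, pad=1)
            self.c2 = L.Convolution2D(16, 32, ksize=4, stride=2, pad=1)
            self.c3 = L.Convolution2D(32, 64, ksize=4, stride=2, pad=1)
            # Only now go fully connected: 16*16*64 = 16384 inputs
            # instead of 128*128*3 = 49152 raw pixels.
            self.mu = L.Linear(16 * 16 * 64, latent_width)
            self.ln_var = L.Linear(16 * 16 * 64, latent_width)

    def __call__(self, x):
        h = F.relu(self.c1(x))
        h = F.relu(self.c2(h))
        h = F.relu(self.c3(h))
        return self.mu(h), self.ln_var(h)
```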

Try customizing the layer sizes, batch size, ratio between the reconstruction loss and the KL term, and latent width at the module level and see if that works. Additionally, this is a probabilistic method that starts from a random set of layer weights, so simply retrying the training step a few times may net you a better model. Sorry if that isn't more satisfying.
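Since each run starts from different random weights, one simple way to act on that last suggestion is to do a handful of restarts and keep whichever model ends up with the lowest validation loss. The `build_model` and `train` helpers below are hypothetical placeholders for whatever module-level setup you use, not fauxtograph functions:

```python
import numpy as np

def best_of_n_restarts(build_model, train, n_restarts=5):
    """Train several freshly initialized models and keep the one with the
    lowest validation loss. build_model() returns an untrained model;
    train(model) trains it and returns its final validation loss."""
    best_model, best_loss = None, np.inf
    for i in range(n_restarts):
        model = build_model()          # fresh random weights each restart
        valid_loss = train(model)
        print(f"restart {i}: validation loss {valid_loss:.4f}")
        if valid_loss < best_loss:
            best_model, best_loss = model, valid_loss
    return best_model, best_loss
```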


samim23 commented on September 26, 2024

I've been tuning the hyper-parameters a bit (not using the CLI, of course), but as you pointed out, it's tricky to get right. Using a convolutional autoencoder (something like https://github.com/mikesj-public/convolutional_autoencoder) does indeed sound promising, or even a Generative Adversarial Network. Exploring VAEs in depth is a good first step, though, to build intuition.

