Might I ask a stupid question: what is the difference between training autoenc and

Training autoenc and Training latent DDM about diffae

phizaz commented on August 11, 2024

Training autoenc and Training latent DDM

from diffae.

Comments (4)

phizaz commented on August 11, 2024

I can think of one way to use a VAE here: as a regularizer on the latent codes, so that the semantic codes become samples from a normal distribution. We have tried this. It was hard to strike a balance between the sample-ability (strong regularization) and the expressiveness (weak regularization) of the latent code. In terms of sample quality, it turned out to be better to learn another DDIM on top of the learned (and frozen) latent codes.
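
The trade-off described above can be made concrete with a small sketch. It assumes the usual VAE-style objective, reconstruction error plus a weighted KL term that pulls each code toward a standard normal. The weight `beta`, the function names, and the numbers are hypothetical illustrations and are not taken from the diffae code.

```python
import numpy as np

def kl_to_standard_normal(mu, logvar):
    """KL( N(mu, diag(exp(logvar))) || N(0, I) ), summed over code dimensions."""
    return 0.5 * np.sum(mu**2 + np.exp(logvar) - logvar - 1.0, axis=-1)

def regularized_loss(recon_error, mu, logvar, beta):
    # beta is a hypothetical knob, not a diffae setting:
    #   large beta -> codes pushed toward N(0, I), so they are easy to sample
    #   small beta -> codes free to carry more information about the image
    return recon_error + beta * kl_to_standard_normal(mu, logvar)

# A code that is far from N(0, I): shifted mean, very small variance.
mu = np.array([[2.0, -1.0]])
logvar = np.array([[-3.0, -3.0]])
recon_error = np.array([0.1])

weak = regularized_loss(recon_error, mu, logvar, beta=0.01)
strong = regularized_loss(recon_error, mu, logvar, beta=1.0)
```

With a small `beta` the penalty for an informative but non-Gaussian code is almost negligible, while a large `beta` makes the same code expensive. That is the balance the comment describes as hard to strike.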

phizaz commented on August 11, 2024

They are not trained at the same time.
You can train the autoencoder alone with only images (and you will only get the autoencoder).
Since it is still an autoencoder, you CANNOT sample novel images from it yet.
To generate new images, you need to be able to sample the "semantic code".
To do this, you need a generative model, called the latent DPM in this case, trained on a pool of semantic codes (obtaining that pool requires a trained autoencoder).
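
The two stages above can be sketched with a toy example. This is not the diffae code: PCA stands in for the diffusion autoencoder and a fitted Gaussian stands in for the latent DPM, purely to show the order of operations. All names (`encode`, `decode`, `sample_images`) and the toy data are assumptions made for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy "images": 500 points in 8-D that actually lie on a 2-D subspace.
basis = rng.normal(size=(2, 8))
images = rng.normal(size=(500, 2)) @ basis

# Stage 1: fit the autoencoder on images alone.
# (PCA is a stand-in; the real model is a diffusion autoencoder.)
mean = images.mean(axis=0)
_, _, vt = np.linalg.svd(images - mean, full_matrices=False)
components = vt[:2]  # frozen after stage 1

def encode(x):
    """Map images to their 2-D 'semantic codes'."""
    return (x - mean) @ components.T

def decode(z):
    """Map codes back to image space."""
    return z @ components + mean

# Stage 2: with the autoencoder frozen, encode the dataset and fit a
# generative model on the resulting pool of codes.
# (A Gaussian is a stand-in; the real model is the latent DPM.)
codes = encode(images)
code_mean = codes.mean(axis=0)
code_cov = np.cov(codes, rowvar=False)

def sample_images(n):
    """Sample new codes from the latent model, then decode them."""
    z = rng.multivariate_normal(code_mean, code_cov, size=n)
    return decode(z)

new_images = sample_images(4)
```

After stage 1 alone, only `encode` and `decode` exist, so the model can reconstruct existing images but has no way to produce a new code. `sample_images` becomes possible only once stage 2 has fitted a model on `codes`.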

hao-pt commented on August 11, 2024

Thank you for your response! I finally understand your points after re-checking Section 4 of your paper. By the way, have you experimented with using a latent sampled from a VAE in the generative process of the DPM? What I mean is stacking a VAE on top of the diffusion model to obtain the latent vector z for the decoding process.

fido20160817 commented on August 11, 2024

What affects the sample-ability and expressiveness? I am a beginner in neural networks.
