
Comments (2)

phizaz commented on September 15, 2024

Oh, not like that.

Allow me to make the jargon a bit clearer. The stochastic subcode, X_T, is meant to be the "inverted noise". To get it, you take an original image, X_0, and run the denoising process backward, which effectively adds noise to the image (see the DDIM paper for more details). Repeating this for T steps gives you the "exact" X_T that corresponds to that original image, X_0.
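To make the inversion concrete, here is a minimal NumPy sketch of running the deterministic DDIM update backward. It is only an illustration under stated assumptions: `make_alpha_bar`, `ddim_invert`, and the toy `eps_model` are hypothetical names rather than the diffae code, and the linear beta schedule is just a common default. A real model would be a trained network (with whatever extra conditioning it expects) in place of `eps_model`.

```python
import numpy as np

def make_alpha_bar(T, beta_start=1e-4, beta_end=2e-2):
    """Cumulative products of (1 - beta_t); index 0 is the clean image (alpha_bar = 1)."""
    betas = np.linspace(beta_start, beta_end, T)  # assumed linear schedule
    return np.concatenate([[1.0], np.cumprod(1.0 - betas)])

def ddim_invert(x0, eps_model, alpha_bar):
    """Run the deterministic DDIM step from t to t+1, for t = 0 .. T-1.

    No random noise is drawn anywhere, so the resulting x_T is fully
    determined by x0 and the noise predictor.
    """
    x = np.asarray(x0, dtype=np.float64)
    T = len(alpha_bar) - 1
    for t in range(T):
        eps = eps_model(x, t)  # predicted noise at the current step
        # Estimate of the clean image implied by the current x and eps.
        x0_pred = (x - np.sqrt(1.0 - alpha_bar[t]) * eps) / np.sqrt(alpha_bar[t])
        # Re-noise that estimate to the next (noisier) level with the same eps.
        x = np.sqrt(alpha_bar[t + 1]) * x0_pred + np.sqrt(1.0 - alpha_bar[t + 1]) * eps
    return x

# Toy stand-in for a trained noise predictor (hypothetical, for illustration only).
def toy_eps_model(x, t):
    return 0.1 * x

alpha_bar = make_alpha_bar(T=50)
x0 = np.linspace(-1.0, 1.0, 12).reshape(3, 4)  # pretend this is an image in [-1, 1]
x_T = ddim_invert(x0, toy_eps_model, alpha_bar)
```

Running `ddim_invert` twice on the same `x0` returns the same `x_T`, which is the sense in which this X_T is specific to the image. Note the cost: one model evaluation per step, T in total.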

This is not to be confused with a "usual" X_T, for which all you need to do is scale down the RGB values and add some noise. This usual noise is not specific to the original X_0 (there are many random noises you could add), whereas the stochastic subcode, acting as a latent code, is specific to the original image.
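For contrast, the "usual" X_T can be sketched in a couple of lines using the standard closed-form noising of a DPM. The function name `q_sample` and the example value of `alpha_bar_T` are assumptions for illustration, not taken from the diffae code.

```python
import numpy as np

def q_sample(x0, alpha_bar_t, rng):
    """Scale the image down and add fresh Gaussian noise (no model involved)."""
    eps = rng.standard_normal(x0.shape)
    x_t = np.sqrt(alpha_bar_t) * x0 + np.sqrt(1.0 - alpha_bar_t) * eps
    return x_t, eps

x0 = np.linspace(-1.0, 1.0, 12).reshape(3, 4)  # pretend image in [-1, 1]
alpha_bar_T = 1e-3  # assumed: very little signal left at the last step

# Two different random draws give two different "usual" X_T for the same x0.
x_T_a, _ = q_sample(x0, alpha_bar_T, np.random.default_rng(0))
x_T_b, _ = q_sample(x0, alpha_bar_T, np.random.default_rng(1))
```

Because the noise is drawn fresh each time, `x_T_a` and `x_T_b` differ even though they come from the same image, so neither can serve as a latent code for it.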

Training a diffusion autoencoder doesn't need the stochastic subcodes, which are costly to get. You only need the cheap usual X_T's. Training a DiffAE is hence just like training any other DPM; the only changes are in the architecture and in what is supplied to it. You need to obtain the stochastic subcode only when you want the best-quality reconstruction of the original image, not during training.
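As a rough sketch of why training stays cheap, the loss for one example needs only a random timestep and one fresh noise draw, with no inversion loop. This assumes the common noise-prediction (MSE) objective used by many DPMs; `dpm_training_loss` and `eps_model` are hypothetical names, and any extra inputs a real DiffAE network receives are left out.

```python
import numpy as np

def dpm_training_loss(x0, eps_model, alpha_bar, rng):
    """Noise-prediction loss for one image at one randomly chosen timestep."""
    T = len(alpha_bar) - 1
    t = int(rng.integers(1, T + 1))          # uniform over 1..T
    eps = rng.standard_normal(x0.shape)      # cheap "usual" noise
    x_t = np.sqrt(alpha_bar[t]) * x0 + np.sqrt(1.0 - alpha_bar[t]) * eps
    eps_pred = eps_model(x_t, t)             # a single model call per example
    return float(np.mean((eps_pred - eps) ** 2))

# Assumed linear beta schedule; index 0 corresponds to the clean image.
betas = np.linspace(1e-4, 2e-2, 50)
alpha_bar = np.concatenate([[1.0], np.cumprod(1.0 - betas)])

x0 = np.linspace(-1.0, 1.0, 12).reshape(3, 4)
untrained = lambda x_t, t: np.zeros_like(x_t)  # placeholder for a network
loss = dpm_training_loss(x0, untrained, alpha_bar, np.random.default_rng(0))
```

Compare this with obtaining a stochastic subcode, which takes T sequential model calls per image; that cost is only paid at reconstruction time.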

from diffae.

betterze commented on September 15, 2024

Thx a lot. I understand it now.

