Comments (2)
For 1: you need to scale the noisy latents.
However, the scale is only done on diffusion targets. Sharing the UNet doesn't necessarily mean that they need to be of the same scale.
For 2: Actually it is related to the latter part of 1. By default the SD VAE output needs to be rescaled by about 0.19 (vae.config.scaling_factor) before sending into diffusion; we skipped that step for the condition branch (so it is roughly scaled by a factor of 5).
This can look strange at a first glance, but it enhances the local conditioning signal and helps with final results empirically.
from zero123plus.
This is Moon R. Jung. Let add some comments in this thread.
Q1. eliphatfs said:
For 1: you need to scale the noisy latents.
However, the scale is only done on diffusion targets.
=> Does it mean that we need not scale the noisy latents but the diffusion targets? Then what does it mean?
from zero123plus.
Related Issues (20)
- training data HOT 1
- camera FOV HOT 1
- something about trainning detail.. HOT 2
- training resolution 320^2 instead of 512^2? HOT 2
- depth control for v1.2 ? HOT 1
- RuntimeError: "LayerNormKernelImpl" not implemented for 'Half' HOT 1
- About the object normalization in v1.2 HOT 2
- How to align the coordinate system of zero123 in threestudio? HOT 1
- Would it be feasible to provide more than one view angle as input to better the quality of the outputs.
- Can I get train code? HOT 1
- How to change Camera Parameters of output view? HOT 1
- Depth map size ratios for depth ControlNet HOT 1
- About dataset HOT 1
- img_to_mv.py without cuda HOT 1
- The difference between v1.2 and v1.1 in the network architecture HOT 1
- CUDA out of memory. Tried to allocate error HOT 2
- would you release the training code in the future? HOT 2
- Can I use non-empty text with zero123++? HOT 2
- DLL load failed while importing libtriton HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from zero123plus.