Comments (10)
when I rained with L1 loss, it was also big.
How many is your L1 loss?
from edvr.
All of three loss is big, what is the problem?
from edvr.
All of three loss is big, what is the problem?
Same to me. My loss is about "e+4". How about you?
from edvr.
All of three loss is big, what is the problem?
Same to me. My loss is about "e+4". How about you
My loss is about "e+4" too
from edvr.
All of three loss is big, what is the problem?
Same to me. My loss is about "e+4". How about you
My loss is about "e+4" too
Cause the loss is reduced by sum, and when you try to divide it by (GT_sizeGT_sizebatch_size) you'll find the loss is just the common case like 1e-2. You can replace the reduce function by 'mean'.
from edvr.
I have another problem, which is that the loss function doesn't go down, does anybody else have this problem?
from edvr.
@zzzzwj has pointed it out. CharbonnierLoss is in the sum
mode. For L1/L2 losses, there are also some modes like mean
and sum
(you can see them in PyTorch doc).
The key during the training is the gradient instead of the loss. So even if with larger losses, the training is OK under proper gradients.
When using different modes, mean
or sum
, you may need to adjust the learning rate. But the Adam optimizer can automatically adjust it to some extent.
@LI945 During the training, you may observe the loss decreases very slowly. But if you evaluate the checkpoints, the performance (PSNR) actually increases as the training goes.
from edvr.
I met the same problem as @LI945 mentioned. When I trained with my own datasets, the loss decreases very slow. When I train with SISR model (for example, EDSR), the psnr increases very fast which can reach almost the best value around 37.0 psnr in 20~30 epochs. However when I train with EDVR, using the raw training code, the psnr increases fast in first 10 epochs reaching ~33.0 psnr, then it's psnr value seems to be stable which means in next 20 epochs, the psnr value just inceases less than 1.0. So have you met the same problem when you train the REDS or Vimeo90K datasets? And can I have your training log? Hope for your reply @xinntao .
from edvr.
@zzzzwj I will upload a training log example tomorrow. Actually, 1) we use a different training scheme with restarts, which improves the performance. 2) We usually measure in iteration rather than epoch.
from edvr.
@zzzzwj I will upload a training log example tomorrow. Actually, 1) we use a different training scheme with restarts, which improves the performance. 2) We usually measure in iteration rather than epoch.
Well, thanks a lot.
from edvr.
Related Issues (20)
- RuntimeError: Error compiling objects for extension
- the problem of python setup.py develop HOT 2
- the problem of PCD_Align HOT 1
- TypeError: DCNv2Pack() takes no arguments HOT 3
- AssertionError for assert x.size()[-2:] == flow.size()[1:3]
- The way to compute GFlops of deformable convolution
- What can be reason why I get this error: TypeError: DCNv2Pack() takes no arguments ? HOT 2
- 关于测试问题
- colab implementation of EDVR HOT 1
- NameError: name 'deform_conv_ext' is not defined HOT 1
- Results in MSU Video Super Resolution Benchmark
- EDVR: num_out_ch completely useless
- train on vimeo90K HOT 1
- Is REDS4_test correct paper split or REDSofficial4_test HOT 1
- what is woTSA? HOT 4
- Ringing artifacts on the output
- EDVR TSA模块里的点积运算符为什么是* 不应该是@吗
- EDVR colab? HOT 5
- what is the mean of the mask
- No such file or directory: 'experiments/103_EDVR_L_x4_SR_REDS_woTSA_600k_B4G8_valREDS4_wandb/models/net_g_600000.pth'
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from edvr.