Comments (7)
Hmm.. I'll have to look into it. Maybe @izmailovpavel can be of help?
from swa_gaussian.
Hi Kirk,
You're reading the plot incorrectly: points beneath the blue line show that both SGD and SWAG are overconfident in that situation (confidence > accuracy). With that being said, I'm not sure if we ever checked calibration on the CIFAR5+5 task; I'll get back to you on that.
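To make the "confidence > accuracy" reading concrete, here is a minimal sketch of how a per-bin confidence/accuracy gap can be computed from predicted class probabilities. It is not the repository's plotting code; the function name `reliability_gaps`, the equal-width binning, and the default of 10 bins are assumptions made for illustration.

```python
import numpy as np

def reliability_gaps(probs, labels, n_bins=10):
    """Return (mean confidence, accuracy, confidence - accuracy) per non-empty bin.

    Hypothetical helper for illustration. A positive gap means the model is
    overconfident in that bin (confidence > accuracy).
    """
    probs = np.asarray(probs, dtype=float)
    labels = np.asarray(labels)

    confidences = probs.max(axis=1)      # confidence of the predicted class
    predictions = probs.argmax(axis=1)   # predicted class index
    correct = (predictions == labels).astype(float)

    # Equal-width bins on [0, 1]; interior edges map each sample to a bin id.
    edges = np.linspace(0.0, 1.0, n_bins + 1)
    bin_ids = np.digitize(confidences, edges[1:-1], right=True)

    rows = []
    for b in range(n_bins):
        mask = bin_ids == b
        if not mask.any():
            continue  # skip empty bins
        conf = confidences[mask].mean()
        acc = correct[mask].mean()
        rows.append((conf, acc, conf - acc))
    return rows
```

For example, four predictions made with confidence 0.9 of which only two are correct give a gap of 0.9 - 0.5 = 0.4, i.e. an overconfident bin.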
Thank you, Wesley!
Appreciate the prompt reply and clarification.
Just following up, I checked and we never seem to have run calibration on CIFAR 5+5, but it's not terribly surprising that both SGD and SWAG (somewhat less so) are overconfident here as well.
Hi Wesley,
thanks a lot for the follow up.
May I ask an additional question?
Why is the CIFAR-10 (5+5) split deterministic, i.e. predefined as split 0 (the first half of the classes) and split 1 (the remaining classes), where 0 = [0, 1, 2, 8, 9] and 1 = [3, 2, 4, 8, 1] are the labels?
Have you noticed that training on split 1 yields better out-of-distribution results than training on split 0?
I believe we sampled those randomly at one point, so it's a holdover from that.
No, I haven't noticed that.
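For anyone who wants to test whether the choice of split matters, a random but reproducible 5+5 class split can be sketched as below. This is an assumption-laden illustration, not the repository's data-loading code: the helper names `sample_class_split` and `filter_by_classes` and the `seed` parameter are hypothetical.

```python
import numpy as np

def sample_class_split(num_classes=10, seed=0):
    """Randomly partition class ids into two disjoint halves (hypothetical helper)."""
    rng = np.random.default_rng(seed)
    perm = rng.permutation(num_classes)
    half = num_classes // 2
    # Sort each half only for readability; the partition itself is random.
    return sorted(perm[:half].tolist()), sorted(perm[half:].tolist())

def filter_by_classes(labels, classes):
    """Return indices of the samples whose label belongs to `classes`."""
    labels = np.asarray(labels)
    return np.flatnonzero(np.isin(labels, classes))

# Usage: train on one half, treat the other half as out-of-distribution.
in_classes, out_classes = sample_class_split(num_classes=10, seed=1)
```

Repeating the experiment over several seeds would show whether the gap between the two predefined splits is specific to those class sets or a general effect.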
Thank you, Wesley!
Here's an example of the difference between SGD and SWAG when trained on 1 vs. 0. Basically, SWAG seems to perform worse than SGD when trained on 0. The left plots are trained on 1 and the right ones on 0.