Comments (10)
Could it be that this shows an underlying tension between two different things: "examples" that are meant to be read in order to learn, and a "model/SOTA zoo" that contains enough for someone to jump-start work toward the current SOTA?
Regarding the two models: WRN is very widely used, but PyramidNet with Shake-shake is (close to) SOTA. If I want to learn about the framework, I'm happy reading WRN without many tricks. But if I want to see whether my newest invention improves on SOTA, I'd like a small codebase that gets close to SOTA, where I can add my invention and get great numbers (or not).
from flax.
That link is now a 404. Where is the current CIFAR10 example code?
Do you still have a cifar10 example?
from flax.
Thanks for adding this issue Alexey, I agree with your observations.
> Can we just keep one or two of the above combinations?
> Which ones are more relevant / useful / educational?

Fine by me, I don't know much about the details of these architectures, but I would probably pick the one that is most common. I am not sure which one is most educational.

> Should we turn some other combinations into HOW-TOs? Which ones?

HOWTOs are based on diffs, so they are most useful if they are being used for some added functionality. It seems that WideResNet is implemented without shake-shake and with shake-shake. Is it possible to create a HOWTO that shows how to add shake-shake to WideResNet?
from flax.
Good point Lucas! I think we are still figuring out which examples belong in Flax exactly (I agree with the tension), but for this specific example I probably lean towards keeping the simpler one (WRN), with possibly one or more HOWTOs, and having the most complex ones in a separate repository.
from flax.
FYI I chatted about this with Alexey, and he told me that he'd like to hold off on finishing the regression testing until he has acted on this issue. So I've linked that PR here.
from flax.
We have recently removed our official CIFAR example and instead linked to a much better open source repository (https://github.com/google-research/google-research/tree/master/flax_models/cifar).
from flax.
That link is now a 404. Where is the current CIFAR10 example code?
from flax.
We removed the example because it was using the old API and we had nobody actively working on it and willing to port it.
You can see more examples in our examples README:
https://github.com/google/flax/tree/main/examples
Note that, for example, the vision_transformer codebase has example code to fine-tune a model on CIFAR10:
- https://github.com/google-research/vision_transformer
- https://colab.research.google.com/github/google-research/vision_transformer/blob/master/vit_jax.ipynb
from flax.
@andsteing Is there any example code showing a simple resnet running on CIFAR10 that you know of?
from flax.
You could switch the imagenet2012 dataset with cifar10 in our examples/imagenet, or you could download the pre-trained imagenet checkpoint and fine-tune it on cifar10.
I'm not aware of a repo that simply trains a resnet on cifar10.
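To make the dataset swap a bit more concrete, here is a minimal sketch of which dataset-dependent values would typically need to change together. The `DataConfig` class, its field names, and the helper functions are hypothetical and are not taken from examples/imagenet; only the dataset names and the dataset statistics (class counts, image sizes, training set sizes) are real.

```python
from dataclasses import dataclass, replace


@dataclass(frozen=True)
class DataConfig:
    """Hypothetical stand-in for the dataset-related fields of a training config."""
    dataset: str             # dataset name as used by TensorFlow Datasets
    num_classes: int         # size of the classifier's output layer
    image_size: int          # side length of the square input images
    num_train_examples: int  # used to derive the number of steps per epoch


# ImageNet-1k: 1000 classes, 1,281,167 training images, commonly 224x224 crops.
IMAGENET = DataConfig("imagenet2012", 1000, 224, 1_281_167)


def to_cifar10(cfg: DataConfig) -> DataConfig:
    """Return a copy of `cfg` with the dataset-specific fields set for CIFAR-10."""
    return replace(
        cfg,
        dataset="cifar10",
        num_classes=10,             # CIFAR-10 has 10 classes
        image_size=32,              # images are 32x32
        num_train_examples=50_000,  # size of the CIFAR-10 training split
    )


def steps_per_epoch(cfg: DataConfig, batch_size: int) -> int:
    """Number of full batches in one pass over the training split."""
    return cfg.num_train_examples // batch_size


CIFAR10 = to_cifar10(IMAGENET)
print(CIFAR10)
print(steps_per_epoch(CIFAR10, batch_size=128))
```

Beyond these values, the input pipeline would likely need attention too: ImageNet-style preprocessing crops and resizes to large images, and the usual ImageNet ResNet stem downsamples aggressively, which may be a poor fit for 32x32 inputs.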
from flax.