Comments (3)
Hi,
Masked Image Modeling (MIM) methods in general are well desiged for patch-based architectures such as ViTs.
There have been some attemps to extend MIM to CNNs, eg Architecture-Agnostic Masked Image Modeling -- From ViT back to CNN or ConvMAE: Masked Convolution Meets Masked Autoencoders.
Such approaches could most likely be successfully integrated into CroCo but we are planning to work on that in the future.
Best
Philippe
from croco.
Maybe 《MC-JEPA: A Joint-Embedding Predictive Architecture for Self-Supervised Learning of Motion and Content Features》can meet your requirements.
from croco.
Thansk for the reply!
from croco.
Related Issues (20)
- pre-training details HOT 12
- Is there any bug in the pytorch RoPE codes? HOT 2
- The result HOT 2
- Stereo code release time HOT 2
- About Submission to Spring. HOT 2
- Compile cuda kernels for RoPE Fail HOT 4
- Availability of using my own images on CrocoFlow HOT 2
- domain generalization of croco-stereo HOT 2
- Issue with building RoPE - CUDA MISMATCH HOT 1
- train croco-stereo with a dataset without disparity map HOT 1
- Question about .pth file setting HOT 7
- Data generation without metadata takes forever HOT 1
- MegaDepth does not contain images used in Crops dataset HOT 1
- The submission of MPI-sintel Dataset HOT 1
- Tiling-based Inference HOT 4
- Is it better to set the same crop size for both pretraining and downstream finetuning? HOT 2
- How long does it take to train Croco-Stereo? HOT 2
- [W reducer.cpp:320] Warning: Grad strides do not match bucket view strides. This may indicate grad was not created according to the gradient layout contract, or that the param's strides changed since DDP was constructed. This is not an error, but may impair performance. HOT 1
- ./data/crop_metadata does not exist
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from croco.