Comments (10)
Hi Graphormer authors, please do make sure that your results are easily reproducible on the master branch. You can continuously update your code on the non-master branch, and integrate that when you have finished checking the compatibility.
In short, please ensure easily reproducibility as soon as possible on the master branch. In the worst case, we will need to delete your leaderboard submissions (which I hope not happen.)
Weihua -- OGB Team
Hi Weihua, we ensure that our results are easily reproducible on every branch at anytime by just following the instructions in this repository and our paper. We definately willing to offer help if anyone is in trouble when following our reproducing insturctions.
from graphormer.
Considering that we plan to continuously upgrade Graphormer in this repository, it's very possible to fail when load an old checkpoint using the latest version of code. Hereby, we encourage you to prepare the pre-trained model following our paper by yourself.
from graphormer.
Hi Graphormer authors, please do make sure that your results are easily reproducible on the master branch. You can continuously update your code on the non-master branch, and integrate that when you have finished checking the compatibility.
In short, please ensure easily reproducibility as soon as possible on the master branch. In the worst case, we will need to delete your leaderboard submissions (which I hope not happen.)
Weihua -- OGB Team
from graphormer.
Hi authors, since it's been one week, may I know the rough time when the checkpoints and codes to reproduce the result can be ready?
Thank you.
from graphormer.
Hi authors, since it's been one week, may I know the rough time when the checkpoints and codes to reproduce the result can be ready?
Thank you.
Hi @Noisyntrain , all the codes to reproduce all the results in our paper have already been released in the main branch since our first time release. Please let us know if you have any question about any part of code or script. Currently we don't have plan to relaese the checkpoints.
from graphormer.
Close this issue due to inactivity for a long time. Feel free to reopen it if the problem still exist.
from graphormer.
Hi @Noisyntrain , we notice some comments recently posted on Chinese social media talking about the reproduction issue derived from this issue.
Therefore we wonder whether your problem still exist (size mismatch caused by loading wrong pre-trained model) after you load a correct pre-trained model descripted in our paper?
Also, please feel free to repoen this issue and provide the detailed information of the reproduction process, if you sucessfully execute the program but meet any reproducing problem, e.g., the test accuracy could not reach the number we report in our paper.
from graphormer.
Hi authors. I met a similar problem. I pretrained the modle on PCQM dataset then try to load the checkpoint to train on pcba task. However, I met this error.
RuntimeError: Error(s) in loading state_dict for Graphormer:
size mismatch for downstream_out_proj.weight: copying a param with shape torch.Size([1, 1024]) from checkpoint, the shape in current model is torch.Size([128, 1024]).
size mismatch for downstream_out_proj.bias: copying a param with shape torch.Size([1]) from checkpoint, the shape in current model is torch.Size([128]).
I was using the v1 code. I wonder how your procedure got through this. Could you help me with this?
from graphormer.
Hi authors. I met a similar problem. I pretrained the modle on PCQM dataset then try to load the checkpoint to train on pcba task. However, I met this error.
RuntimeError: Error(s) in loading state_dict for Graphormer: size mismatch for downstream_out_proj.weight: copying a param with shape torch.Size([1, 1024]) from checkpoint, the shape in current model is torch.Size([128, 1024]). size mismatch for downstream_out_proj.bias: copying a param with shape torch.Size([1]) from checkpoint, the shape in current model is torch.Size([128]).
I was using the v1 code. I wonder how your procedure got through this. Could you help me with this?
Thanks for using Graphormer. When fine-tuning the pretrained Graphormer, the last layer (out projection layer for different downstream tasks) should not be loaded, since the downstream tasks are not same as the pre-training one (128 binary classification (PCBA) v.s. 1 regression (PCQ)). Therefore, a re-initialized last layer should be applied.
Btw, v2 is recommended where the pre-trained models are well-prepared.
from graphormer.
Hi @zhengsx, is there an argument in the fairseq-train
command specified in the documentation here that can apply a re-initialized last layer? If not, may you please advise how to re-initialize the last layer and then pass to the Graphormer training loop? Thanks!
from graphormer.
Related Issues (20)
- Implementation of Graphormer based on pytorch geometric HOT 5
- Pretrained OCP20 Graphormer3D model HOT 1
- Feature Request: New Model
- python setup.py build_ext --inplace bug
- install.sh Issue, no access to sudo, using pip21.1 HOT 1
- OC20 checkpoint
- Own dataset problem while dataset.map()
- Unable to download pretrain model.HTTP error 409 HOT 1
- No support for custom datasets in ogb format? HOT 1
- Binary pretrained model can't train a multi-class classifier?
- no setup.py and --user-feature param HOT 1
- How to create a submission file for the graphormer model trained on the OC20 task
- Working Dependencies 2024?
- Custom dataset
- Can this model be used in the field of traffic prediction?
- Lack of diversity in the 1ake example prediction by DiG HOT 4
- Feature representations for new Proteins in DiG
- Missing file train_cli.sh HOT 1
- json.decoder.JSONDecodeError ??? HOT 1
- Missing training scripts for proteins and protein-ligands HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from graphormer.