Git Product home page Git Product logo

Comments (6)

AlexMRuch avatar AlexMRuch commented on May 20, 2024

I'd really appreciate this. For example, on https://aws-dglke.readthedocs.io/en/latest/train_user_data.html It's not super clear what should be in --data_path and --data_files.

For example, --data_path says "to specify the path to the knowledge graph dataset"; however, I presume this means "to specify the path to the folder containing the knowledge graph dataset".

Also, --data_files says "to specify the triplets of a knowledge graph as well as node/relation ID mapping"; however, it's not immediately clear the order of these files. For example, I would presume this would follow the order of the files listed under udd_[h|r|t]:

DGLBACKEND=pytorch dglke_train \
--data_path results_SXSW_2018 \
--data_files entities.tsv relations.tsv train.tsv valid.tsv test.tsv \
--format udd_hrt \
--model_name ComplEx \
--max_step 12000 --batch_size 1000 --neg_sample_size 200 --batch_size_eval 16 \
--hidden_dim 400 --gamma 19.9 --lr 0.25 --regularization_coef=1e-9 -adv \
--gpu 0 1 --async_update --force_sync_interval 1000 --log_interval 1000 \
--test

^^^ But the order isn't clear. It seems like entities.txt and relations.tsv should go at the end since if someone uses to raw_udd_[h|r|t] option this would keep the first three elements consistently for training, validation, and testing files.

Perhaps there should be --data_tuple_files and --data_mapping_files options?

UPDATE:
When I ran the code above, it gave me this output with FB_15k in the checkpoints, which doesn't seem right...

(dglke) amruch@wit:~/graphika/kg$ DGLBACKEND=pytorch dglke_train --data_path results_SXSW_2018 --data_files entities.tsv relations.tsv train.tsv valid.tsv test.tsv--format udd_hrt --model_name ComplEx --max_step 12000 --batch_size 1000 --neg_sample_size 200 --batch_size_eval 16 --hidden_dim 400 --gamma 19.9 --lr 0.25 --regularization_coef=1e-9 -adv --gpu 0 1 --async_update --force_sync_interval 1000 --log_interval 1000 --test
Using backend: pytorch
Logs are being recorded at: ckpts/ComplEx_FB15k_0/train.log
Reading train triples....

from dgl-ke.

zheng-da avatar zheng-da commented on May 20, 2024

Thank you very much for your feedback. We'll prioritize it and provide documentation of the argument options.

If you find the explanation from --help isn't clear, please post them here. We'll improve them. Thanks a lot for your help.

from dgl-ke.

AlexMRuch avatar AlexMRuch commented on May 20, 2024

from dgl-ke.

zheng-da avatar zheng-da commented on May 20, 2024

We need to clarify our documentation to address all of the questions in this issue: #84

from dgl-ke.

classicsong avatar classicsong commented on May 20, 2024

The docs for command line arguments was updated along with 0.1.1 release.

from dgl-ke.

AlexMRuch avatar AlexMRuch commented on May 20, 2024

from dgl-ke.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.