Git Product home page Git Product logo

agah's Introduction

Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval

Code for the paper Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval (ICMR 2019 Best Student Paper).

Requirements

  • Python >= 3.5
  • PyTorch >= 1.0.0

Usage

Help

python main.py help

You will get the following help information:

========================::HELP::=========================
    usage : python file.py <function> [--args=value]
    <function> := train | test | help
    example:
            python main.py train --lr=0.01
            python main.py help
    avaiable args (default value):
            load_model_path: None
            pretrain_model_path: ./data/imagenet-vgg-f.mat
            vis_env: None
            vis_port: 8097
            dataset: flickr25k
            data_path: ./data/FLICKR-25K.mat
            db_size: 18015
            num_label: 24
            tag_dim: 1386
            query_size: 2000
            training_size: 10000
            batch_size: 128
            emb_dim: 512
            valid: True
            valid_freq: 2
            max_epoch: 300
            bit: 64
            lr: 0.0001
            device: cuda:1
            alpha: 1
            beta: 0
            gamma: 0.001
            eta: 1
            mu: 1
            delta: 0.5
            lambd: 0.8
            margin: 0.3
            debug: False
            data_enhance: False
========================::HELP::=========================

Train & Test

Train and test:

python main.py train

For test only:

python main.py test

Datasets

Coming soon...

Framework

Result

Citing AGAH

@inproceedings{gu2019adversary,
  title={Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval},
  author={Gu, Wen and Gu, Xiaoyan and Gu, Jingzi and Li, Bo and Xiong, Zhi and Wang, Weiping},
  booktitle={Proceedings of the 2019 on International Conference on Multimedia Retrieval},
  pages={159--167},
  year={2019},
  organization={ACM}
}

agah's People

Contributors

wendellgul avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

agah's Issues

Modify model structure

hi.
Have you considered modifying the feature extraction structure of images and text.
Do you think you can use VIT(Visioni transformer) to replace it.

dataset

could you please tell me how to download the dataset,thanks!

Compared Method

Thanks for sharing your code!
I saw your paper "AGAH" using "CHN" as one of your compared methods. Could you tell me if you implement it by yourself? Or do you know where I can find the source code? Could you please share it to me?

dataset preprocessing

Dear Author,
Could you please share the code of generating the dataset file (nus-wide-tc21-iall.mat, for example)? thanks a lot.

question on loss for discriminator

Excuse me, when I read your code, I found the loss for the discriminator looks like this:

D_txt_real = -D_txt_real.mean()
D_txt_fake = D_txt_fake.mean()
loss_D_txt = D_txt_real - D_txt_fake

Does this mean you train both real and fake items to the same direction? Or I get it wrong?

The learning rate of Adamax in the NUS-WIDE dateset

Hello,thanks for the code.
I have a question about the learning rate of Adamax in the NUS-WIDE dataset. I am trying to use your code in the NUS-WIDE dataset, but I found that the result is so bad when using the same Adamax optimizer and learning rate as the MIRFLICKR-25K dataset.

parameters for 16bit/32bit?

Changing 64bit to 16bit can only reach about 74% map. What is the specific parameters of 16bit reaching 79% map?

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.