Light

wendellgul / agah Goto Github PK

View Code? Open in Web Editor NEW

36.0 2.0 11.0 566 KB

Source code for paper "Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval".

Python 100.00%

cross-modal-retrieval pytorch agah

agah's Introduction

Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval

Code for the paper Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval (ICMR 2019 Best Student Paper).

Requirements

Python >= 3.5
PyTorch >= 1.0.0

Usage

Help

python main.py help

You will get the following help information:

========================::HELP::=========================
    usage : python file.py <function> [--args=value]
    <function> := train | test | help
    example:
            python main.py train --lr=0.01
            python main.py help
    avaiable args (default value):
            load_model_path: None
            pretrain_model_path: ./data/imagenet-vgg-f.mat
            vis_env: None
            vis_port: 8097
            dataset: flickr25k
            data_path: ./data/FLICKR-25K.mat
            db_size: 18015
            num_label: 24
            tag_dim: 1386
            query_size: 2000
            training_size: 10000
            batch_size: 128
            emb_dim: 512
            valid: True
            valid_freq: 2
            max_epoch: 300
            bit: 64
            lr: 0.0001
            device: cuda:1
            alpha: 1
            beta: 0
            gamma: 0.001
            eta: 1
            mu: 1
            delta: 0.5
            lambd: 0.8
            margin: 0.3
            debug: False
            data_enhance: False
========================::HELP::=========================

Train & Test

Train and test:

python main.py train

For test only:

python main.py test

Datasets

Coming soon...

Framework

Result

Citing AGAH

@inproceedings{gu2019adversary,
  title={Adversary Guided Asymmetric Hashing for Cross-Modal Retrieval},
  author={Gu, Wen and Gu, Xiaoyan and Gu, Jingzi and Li, Bo and Xiong, Zhi and Wang, Weiping},
  booktitle={Proceedings of the 2019 on International Conference on Multimedia Retrieval},
  pages={159--167},
  year={2019},
  organization={ACM}
}

agah's People

Contributors

Stargazers

Watchers

Forkers

goodgood123 zhangcheng-007 chrisbyd tuding1994 tudingneau huangjh98 zzxihuanheixiu zcyueternal wqingzhao jinglin-byte tingkunyan

agah's Issues

Modify model structure

hi.
Have you considered modifying the feature extraction structure of images and text.
Do you think you can use VIT(Visioni transformer) to replace it.

dataset

could you please tell me how to download the dataset,thanks!

Compared Method

Thanks for sharing your code!
I saw your paper "AGAH" using "CHN" as one of your compared methods. Could you tell me if you implement it by yourself? Or do you know where I can find the source code? Could you please share it to me?

dataset preprocessing

Dear Author,
Could you please share the code of generating the dataset file (nus-wide-tc21-iall.mat, for example)? thanks a lot.

question on loss for discriminator

Excuse me, when I read your code, I found the loss for the discriminator looks like this:

D_txt_real = -D_txt_real.mean()
D_txt_fake = D_txt_fake.mean()
loss_D_txt = D_txt_real - D_txt_fake

Does this mean you train both real and fake items to the same direction? Or I get it wrong?

The learning rate of Adamax in the NUS-WIDE dateset

Hello，thanks for the code.
I have a question about the learning rate of Adamax in the NUS-WIDE dataset. I am trying to use your code in the NUS-WIDE dataset, but I found that the result is so bad when using the same Adamax optimizer and learning rate as the MIRFLICKR-25K dataset.

parameters for 16bit/32bit?

Changing 64bit to 16bit can only reach about 74% map. What is the specific parameters of 16bit reaching 79% map?

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.