Git Product home page Git Product logo

wmrn's Introduction

Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval

Code for the paper WMRN: Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval, ICIP 2021.

Author: Sunjae Yoon, Dahyun Kim, Ji Woo Hong, Junyeong Kim, Kookhoi Kim, Chang D. Yoo

This work was partly supported by Institute for Information communications Technology Planning Evaluation(IITP) grant funded by the Korea government(MSIT) (2021-0-01381, Development of Causal AI through Video Understanding and Reinforcement Learning, and Its Applications to Real Environments) and partly supported by LIG-Nex1 Co. through grant Y20-005

Installation

Requirements

We build our model on top of HERO and TVR. It is compatible with the requirements and quick start of HERO, if you have any problem please refer above link also.

Quick Start

  1. Pretrained model HERO: Run bash scripts/download_pretrained.sh $PATH_TO_STORAGE to get latest pretrained checkpoints. We use the HowTo100M pre-tasks pretrained model in HERO.

  2. Load dataset

    bash scripts/download_tvr.sh $PATH_TO_STORAGE
  3. We utilize the Docker from HERO, which gives command for pooling docker image below.

    # docker image should be automatically pulled
    source launch_container.sh $PATH_TO_STORAGE/txt_db $PATH_TO_STORAGE/video_db \
        $PATH_TO_STORAGE/finetune $PATH_TO_STORAGE/pretrained
  4. Train weakly-supervised video corpus moment retrieval

    # inside the container
    horovodrun -np 8 python train.py --config config/train-tvr-8gpu.json

if you do not use 'horovodrun', you can train the model with only one gpu. The model is saved to the file that you make.

  1. Eval weakly-supervised video corpus moment retrieval
    # inside the container
    horovodrun -np 8 python eval.py --query_txt_db /txt/tvr_val.db/ --split val \
        --vfeat_db /video/tv/ --sub_txt_db /txt/tv_subtitles.db/ \
        --output_dir /storage/tvr_default/ --checkpoint 4800 --fp16 --pin_mem
    
    The result file will be written at /storage/tvr_default/results_val/results_4800_all.json. Change to --query_txt_db /txt/tvr_test_public.db/ --split test_public for inference on test_public split. Please format the result file as requested by the evaluation server for submission. The leaderboard server tells the formatting of submission for uploading.

Citation

If you find this code useful for your research, please consider citing:

@inproceedings{yoon2021weakly,
  title={Weakly-Supervised Moment Retrieval Network for Video Corpus Moment Retrieval},
  author={Yoon, Sunjae and Kim, Dahyun and Hong, Ji Woo and Kim, Junyeong and Kim, Kookhoi and Yoo, Chang D},
  booktitle={2021 IEEE International Conference on Image Processing (ICIP)},
  pages={534--538},
  year={2021},
  organization={IEEE}
}

License

MIT

wmrn's People

Contributors

dbstjswo505 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.