Code and Dataset for CVPR2020 "Dynamic Refinement Network for Oriented and Densely Packed Object Detection"

License: GNU General Public License v2.0

drn_cvpr2020's Introduction

DRN and SKU110K-R

Xingjia Pan, Yuqiang Ren, Kekai Sheng, Weiming Dong, Haolei Yuan, Xiaowei Guo, Chongyang Ma, Changsheng Xu

Work in progress.

Dynamic Refinement Network for Oriented and Densely Packed Object Detection [Paper Link]

Figure 1. Overall framework of our Dynamic Refinement Network. The backbone network is followed by two modules, i.e., the feature selection module (FSM) and the dynamic refinement heads (DRHs). The FSM selects the most suitable features by adaptively adjusting receptive fields. The DRHs dynamically refine the predictions in an object-aware manner.

SKU110K-R

Figure 2. Some sample images from SKU110K. The images in the top row are annotated with horizontal bounding boxes, while the images in the bottom row are annotated with oriented bounding boxes.

To use SKU110K-R:

Download the original SKU110K dataset from the website and extract the images.
Generate SKU110K-R using our rotate augment script:

   python rotate_augment.py path/to/images

Download the annotations for SKU110K-R from the website.

The annotations are in COCO format.
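
Because the annotations follow the COCO layout, they can be inspected with the standard library alone. The sketch below assumes the usual top-level COCO keys (`images`, `annotations`, `categories`) and groups annotations by `image_id`. The helper name, the sample values, and the five-number `bbox` entries are illustrative assumptions; the actual layout of the rotated boxes should be checked against the downloaded file.

```python
import json
from collections import defaultdict


def group_annotations(coco):
    """Map each image id to the list of its annotation dicts."""
    per_image = defaultdict(list)
    for ann in coco.get("annotations", []):
        per_image[ann["image_id"]].append(ann)
    return dict(per_image)


# Tiny stand-in for a real annotation file. All values are made up, and the
# bbox layout used here is an assumption, not the confirmed SKU110K-R format.
sample_text = json.dumps({
    "images": [{"id": 1, "file_name": "example.jpg", "width": 100, "height": 80}],
    "annotations": [
        {"id": 10, "image_id": 1, "category_id": 1, "bbox": [10, 20, 30, 40, 15]},
        {"id": 11, "image_id": 1, "category_id": 1, "bbox": [50, 20, 30, 40, -5]},
    ],
    "categories": [{"id": 1, "name": "object"}],
})

# For a real file, read it with json.load on an opened file object instead.
coco = json.loads(sample_text)
grouped = group_annotations(coco)
print(len(coco["images"]), "images,", len(coco["annotations"]), "annotations")
print({image_id: len(anns) for image_id, anns in grouped.items()})
```

Grouping by image first makes it easy to spot images with no annotations or with unexpectedly many boxes before training or evaluation.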

Evaluation tools

cocoapi_ro

We provide a variant of cocoapi for evaluating rotated bounding boxes.

Install cocoapi_ro (similar to cocoapi)

   cd PythonAPI
   make

Replace pycocotools with pycocotools_ro

FROM

   import pycocotools.coco as coco
   from pycocotools.cocoeval import COCOeval

TO

   import pycocotools_ro.coco as coco
   from pycocotools_ro.cocoeval import COCOeval

Update the evaluation code.

FROM

   coco_eval = COCOeval(self.coco, coco_dets, "bbox")

TO

   coco_eval = COCOeval(self.coco, coco_dets, "rbbox")
   coco_eval.params.maxDets = [1, 10, 300]
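
COCO-style scoring of rotated boxes needs an overlap measure between oriented rectangles instead of axis-aligned ones. The following is not the pycocotools_ro implementation; it is a plain-Python sketch of one common way to compute rotated IoU (convex polygon clipping plus the shoelace area formula). It assumes each box is given as (cx, cy, w, h, angle in degrees), which may differ from the convention used in this repository, and all function names are hypothetical.

```python
import math


def box_to_polygon(cx, cy, w, h, angle_deg):
    """Return the four corners of a rotated box in counter-clockwise order."""
    a = math.radians(angle_deg)
    cos_a, sin_a = math.cos(a), math.sin(a)
    corners = [(-w / 2, -h / 2), (w / 2, -h / 2), (w / 2, h / 2), (-w / 2, h / 2)]
    return [(cx + x * cos_a - y * sin_a, cy + x * sin_a + y * cos_a) for x, y in corners]


def polygon_area(poly):
    """Shoelace formula for the area of a simple polygon."""
    area = 0.0
    for i in range(len(poly)):
        x1, y1 = poly[i]
        x2, y2 = poly[(i + 1) % len(poly)]
        area += x1 * y2 - x2 * y1
    return abs(area) / 2.0


def clip_polygon(subject, clip):
    """Sutherland-Hodgman: clip `subject` against a convex, counter-clockwise `clip`."""
    def inside(p, a, b):
        # True when p lies left of (or on) the directed edge a -> b.
        return (b[0] - a[0]) * (p[1] - a[1]) - (b[1] - a[1]) * (p[0] - a[0]) >= 0

    def intersect(p, q, a, b):
        # Intersection of the line through p, q with the line through a, b.
        dx1, dy1 = q[0] - p[0], q[1] - p[1]
        dx2, dy2 = b[0] - a[0], b[1] - a[1]
        denom = dx1 * dy2 - dy1 * dx2
        t = ((a[0] - p[0]) * dy2 - (a[1] - p[1]) * dx2) / denom
        return (p[0] + t * dx1, p[1] + t * dy1)

    output = list(subject)
    for i in range(len(clip)):
        a, b = clip[i], clip[(i + 1) % len(clip)]
        points, output = output, []
        for j in range(len(points)):
            p, q = points[j - 1], points[j]
            if inside(q, a, b):
                if not inside(p, a, b):
                    output.append(intersect(p, q, a, b))
                output.append(q)
            elif inside(p, a, b):
                output.append(intersect(p, q, a, b))
        if not output:
            break
    return output


def rotated_iou(box_a, box_b):
    """IoU of two boxes given as (cx, cy, w, h, angle_deg)."""
    poly_a, poly_b = box_to_polygon(*box_a), box_to_polygon(*box_b)
    inter_poly = clip_polygon(poly_a, poly_b)
    inter = polygon_area(inter_poly) if len(inter_poly) >= 3 else 0.0
    union = polygon_area(poly_a) + polygon_area(poly_b) - inter
    return inter / union if union > 0 else 0.0


print(round(rotated_iou((0, 0, 4, 2, 0), (0, 0, 4, 2, 90)), 4))
print(round(rotated_iou((0, 0, 2, 2, 0), (5, 5, 2, 2, 30)), 4))
```

Polygon clipping is exact for convex shapes, so it is a convenient reference when checking a faster compiled implementation.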

angle_nms

We provide angle_nms for non-maximum suppression (NMS) of rotated bounding boxes in post-processing.

   from angle_nms.angle_soft_nms import angle_soft_nms
   # Example
   result_after_nms = angle_soft_nms(all_dets, Nt=0.5, method=1, threshold=0.05)
   # all_dets: detection results
   # Nt: IoU threshold
   # method: 1, linear soft NMS; 2, Gaussian soft NMS; any other value, standard NMS
   # threshold: the minimum confidence value required to keep a detection bbox
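
To make the three `method` options concrete, here is a minimal sketch of how soft NMS usually rescales the score of a box that overlaps a higher-scoring box. It follows the standard Soft-NMS formulation and is not taken from angle_soft_nms itself; the function name `rescore` and the `sigma` parameter are assumptions that do not appear in the call shown above.

```python
import math


def rescore(score, iou, Nt=0.5, method=1, sigma=0.5):
    """Return the decayed score of a box overlapping a higher-scoring box by `iou`."""
    if method == 1:
        # Linear soft NMS: scale by (1 - IoU) once the overlap exceeds Nt.
        weight = 1.0 - iou if iou > Nt else 1.0
    elif method == 2:
        # Gaussian soft NMS: smooth decay for every overlap (sigma is assumed).
        weight = math.exp(-(iou * iou) / sigma)
    else:
        # Standard NMS: remove the box entirely once the overlap exceeds Nt.
        weight = 0.0 if iou > Nt else 1.0
    return score * weight


threshold = 0.05  # boxes whose decayed score falls below this are discarded
for method in (1, 2, 0):
    new_score = rescore(0.9, iou=0.7, Nt=0.5, method=method)
    status = "kept" if new_score >= threshold else "dropped"
    print(method, round(new_score, 4), status)
```

In densely packed scenes the soft variants keep heavily overlapping neighbours with a reduced score instead of deleting them, which is the usual motivation for preferring them over standard NMS.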

Rotation Conv Layer

To use the rotation conv layer, you need to install dcn_v2 first:

    # git clone -b pytorch_1.0.0 https://github.com/chengdazhi/Deformable-Convolution-V2-PyTorch.git
    # mv Deformable-Convolution-V2-PyTorch DCNv2
    cd DCNv2
    ./make.sh

Then modify the import path of DCNv2 in rotation_conv_utils.py:

from path\to\DCNv2.modules.modulated_deform_conv import ModulatedDeformConv
from path\to\DCNv2.functions.modulated_deform_conv_func import ModulatedDeformConvFunction

A simple example of using the rotation conv layer is provided in test_rcl.py.

Citation

If you find this project useful for your research, please use the following BibTeX entry.

@inproceedings{pan2020dynamic,
  title={Dynamic Refinement Network for Oriented and Densely Packed Object Detection},
  author={Xingjia Pan and Yuqiang Ren and Kekai Sheng and Weiming Dong and Haolei Yuan and Xiaowei Guo and Chongyang Ma and Changsheng Xu},
  booktitle={CVPR},
  pages={1--8},
  year={2020}
}

Contacts

If you have any questions about our work, please do not hesitate to contact us by email.
Xingjia Pan: [email protected]
Yuqiang Ren: [email protected]

drn_cvpr2020's Issues

Could you share the SKU110K-R dataset?

Hello.
I want to conduct an experiment on the SKU110K-R dataset.

Could you share the SKU110K-R dataset, which contains annotations for oriented bounding boxes?

python rotate_augment.py path/to/images?

python rotate_augment.py '/home/hs/hao/data/SKU-110K/SKU110K_fixed/images'
Premature end of JPEG file
Premature end of JPEG file
Corrupt JPEG data: premature end of data segment
Premature end of JPEG file
Premature end of JPEG file

What do these messages mean?
I hope to get your reply.
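
For context, these messages are warnings that libjpeg prints when image data ends before the end-of-image marker, which usually points to an incomplete download or extraction. Below is a minimal sketch for listing suspect files, assuming a well-formed JPEG starts with the bytes FF D8 and ends with FF D9 (a file with extra bytes after the end marker would be flagged too). The function names are hypothetical and not part of this repository.

```python
import os
import tempfile

SOI = b"\xff\xd8"  # JPEG start-of-image marker
EOI = b"\xff\xd9"  # JPEG end-of-image marker


def looks_truncated(path):
    """Return True if the file does not start with SOI and end with EOI."""
    with open(path, "rb") as f:
        data = f.read()
    return not (data.startswith(SOI) and data.endswith(EOI))


def find_truncated(image_dir):
    """List .jpg/.jpeg files in image_dir that fail the marker check."""
    names = sorted(os.listdir(image_dir))
    return [n for n in names
            if n.lower().endswith((".jpg", ".jpeg"))
            and looks_truncated(os.path.join(image_dir, n))]


# Self-contained demo with two fake files (marker bytes only, not real images).
with tempfile.TemporaryDirectory() as demo_dir:
    with open(os.path.join(demo_dir, "ok.jpg"), "wb") as f:
        f.write(SOI + b"payload" + EOI)
    with open(os.path.join(demo_dir, "cut.jpg"), "wb") as f:
        f.write(SOI + b"payload")
    suspects = find_truncated(demo_dir)
    print(suspects)
```

Files reported this way are candidates for re-downloading before running rotate_augment.py again.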

I check back every day. Why hasn't it been open-sourced yet? I am waiting to read the DRN source code.

A year has passed. How is the work going?

Thank you for your outstanding work!
When can we read the whole project?

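Does the released part of the code differ from the paper?

The attention map in the FSM module of the code seems to differ from the paper. In the paper, the attention map Ai has dimensions H×W×1, and concat + softmax is applied after the attention maps are obtained. In the code, the three branches are first summed, and AdaptiveAvgPool2d then reduces both the H and W dimensions of the attention map to 1 (so it looks like the paper describes spatial attention while the code implements channel attention?). Is my understanding wrong?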
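
The two variants contrasted in this question differ mainly in the shape of the weights they produce. The plain-Python sketch below only illustrates that shape difference and takes no position on which variant the repository or the paper actually uses. The function names are hypothetical, and the per-branch logit map is computed as a channel mean purely as a stand-in for a learned layer.

```python
import math


def softmax(values):
    m = max(values)
    exps = [math.exp(v - m) for v in values]
    total = sum(exps)
    return [e / total for e in exps]


def spatial_weights(branches):
    """Per-pixel softmax over branches: returns one H x W map per branch."""
    C, H, W = len(branches[0]), len(branches[0][0]), len(branches[0][0][0])
    # Stand-in for a learned layer: one H x W x 1 logit map per branch.
    logits = [[[sum(b[c][y][x] for c in range(C)) / C for x in range(W)]
               for y in range(H)] for b in branches]
    maps = [[[0.0] * W for _ in range(H)] for _ in branches]
    for y in range(H):
        for x in range(W):
            for i, w in enumerate(softmax([lg[y][x] for lg in logits])):
                maps[i][y][x] = w
    return maps


def channel_descriptor(branches):
    """Sum the branches, then average over H and W: returns C values."""
    C, H, W = len(branches[0]), len(branches[0][0]), len(branches[0][0][0])
    return [sum(b[c][y][x] for b in branches for y in range(H) for x in range(W)) / (H * W)
            for c in range(C)]


# Three branches, each with C=2 channels on a 2 x 3 grid.
branches = [[[[float(i + c)] * 3 for _ in range(2)] for c in range(2)] for i in range(3)]
maps = spatial_weights(branches)
desc = channel_descriptor(branches)
print(len(maps), len(maps[0]), len(maps[0][0]))  # branches, H, W
print(len(desc))                                 # C
```

Spatial weights vary across locations and are shared by all channels, whereas the pooled descriptor holds one value per channel and discards the location information.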

Would you release the CenterNet † mentioned in the "Experimental Results" section of your paper?

Hi,
Thank you for your great work! In your paper, you mentioned that you implemented "CenterNet †", which adds center pooling and DCN to the baseline. Would you release this part, including the training and test scripts? Thanks a lot!

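If you only release half-finished code, please do not claim in the paper that it is open source. Even people from the Academy of Sciences chase fame like this.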

Where is the train/test code?

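The author must be joking. I spent most of a day analyzing the paper and was about to download the source code to reproduce the experiments, and this is all the so-called open-source code provides? Can I also just post a link to a dataset-processing script and then announce that my code is open source?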

What is the specific structure of Gc?

In the paper, you said that Gc represents the dynamic filter generator and Kc are the learned example-wise kernel weights. Can you explain the specific structure of Gc? Thank you.

How about the inference time?

What is the inference time on a 1080Ti?

TypeError: modulated_deform_conv_forward(): incompatible function arguments. The following argument types are supported:

Hello, running your test code raises this error. How can I solve it?

How to train this model?

Hi, Pan! It seems the repository does not contain the complete steps for training this model. Could you describe these steps in the README?

Where is imgaug?

I cannot find imgaug. @Panxjia

Feature Selection Module

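Has anyone run this module? I ran into a problem: AttributeError: 'NoneType' object has no attribute 'data'. It is probably an issue with the gradient update during backpropagation. How can it be solved?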

Where are the feature selection module (FSM) and dynamic refinement heads (DRHs) in the code?

When can you publish the code?

First of all, thank you for your excellent work! I am also engaged in the direction of target detection. I am very interested in your article. When can you publish the code? Best wishes to you!

Why use NMS instead of max pooling to remove redundant detections?

Why did you choose NMS instead of max pooling to remove redundant detections? Does max pooling have any hidden disadvantages?

Number of training epochs

Thank you for your excellent work!
I am using your FSM in my project. Does your method need more epochs to fit the data? As you mentioned in your article, you need 140 epochs to train on DOTA and HRSC2016.
Thanks for your reply.

full code

Can you release the full code, including training and testing?

COCO-fashion evaluation code for oriented objects

Hello.

I want to evaluate the performance of an oriented object in COCO fashion.

I heard you evaluate the performance of the SKU110K-R dataset in the same way as COCO.

Did you calculate rotated IoU for performance evaluation?

Also, could you share the COCO-style evaluation code for rotated objects?

I look forward to your answer.