Git Product home page Git Product logo

segmentbasedaugmentation's Introduction

Segment Augmentation and Differentiable Ranking for Logo Retrieval

Code for ICPR2022 paper "Segment Augmentation and Differentiable Ranking for Logo Retrieval".

Segment Augmentation and Differentiable Ranking for Logo Retrieval

Abstract

Logo retrieval is a challenging problem since the definition of similarity is more subjective compared to image retrieval tasks and the set of known similarities is very scarce. To tackle this challenge, in this paper, we propose a simple but effective segment-based augmentation strategy to introduce artificially similar logos for training deep networks for logo retrieval. In this novel augmentation strategy, we first find segments in a logo and apply transformations such as rotation, scaling, and color change, on the segments, unlike the conventional image-level augmentation strategies. Moreover, we evaluate whether the recently introduced ranking-based loss function, Smooth-AP, is a better approach for learning similarity for logo retrieval. On the large scale METU Trademark Dataset, we show that (i) our segment-based augmentation strategy improves retrieval performance compared to the baseline model or image-level augmentation strategies, and (ii) Smooth-AP indeed performs better than conventional losses for logo retrieval.

Method

We perform segment level augmentation by following these steps:

  1. Logo segmentation
  2. Segment selection
  3. Segment transformation

Currently, our segment-level augmentation consists 3 methods: Color change, Segment Removal and Segment Rotation. For details, please visit the paper.

Presentation Video

You can access the short version of paper presentation from here.

Experimental Results

The Effect of Image-Level (H. Flip, V. Flip) and Segment-Level Augmentation:

Method NAR
Baseline(No augmentation) 0.102
Triplet Loss(No augmentation) 0.053
Triplet Loss(Image-level aug.) 0.051
Triplet Loss(S. Color, S. Removal) 0.046
Smooth-AP Loss(No augmentation) 0.046
Smooth-AP Loss(Image-level aug.) 0.044
Smooth-AP Loss(S. Color) 0.040

Visual Results

Dataset

In this repository, METU Trademark Dataset is used. The dataset is available per request only. If you are a researcher at a university (or a graduate student) interested in the dataset for research purposes, please contact Sinan Kalkan with your intention.

Dependencies

Dependencies can be found in requirements.txt.

Use pip3 install requirements.txt

Training the model

python3 Standard_Training.py --bs 512 --lr 0.001 --n_epochs 50

segmentbasedaugmentation's People

Contributors

blisgard avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.