MUNIT-keras

A Keras (TensorFlow) reimplementation of MUNIT: Multimodal Unsupervised Image-to-Image Translation

Xun Huang, Ming-Yu Liu, Serge Belongie, Jan Kautz

Deviations from the official implementation

  1. Use group normalization instead of layer normalization in upscaling blocks.
    • A model using group norm (group=8) failed to reconstruct edge images of the edges2shoes dataset.
  2. Use the mixup technique for training.
  3. Input/output size defaults to 128x128.
  4. Use only 3 res blocks (instead of 4) by default in the content encoder/decoder to reduce training time.
    • However, I worry that this shrinks the receptive field enough to degrade output quality.
  5. Upscaling blocks use conv2d with kernel_size = 3 instead of 4.
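
As a rough illustration of deviation 1, here is a framework-free sketch of group normalization on a single sample. It is not this repository's layer: the function name `group_norm`, the nested-list layout (a list of channels, each a flat list of spatial values), and the `eps` default are assumptions made for the example, and the learned scale and offset are left out. With `groups=1` the whole sample shares one mean and variance, which corresponds to normalizing over all channels and positions at once, as layer normalization would.

```python
def group_norm(x, groups, eps=1e-5):
    """Normalize one sample with group normalization.

    x      : list of C channels, each a flat list of spatial values (assumed layout)
    groups : number of channel groups; C must be divisible by it
    eps    : small constant added to the variance for numerical stability (assumed)
    """
    channels = len(x)
    if groups <= 0 or channels % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    size = channels // groups

    out = []
    for g in range(groups):
        block = x[g * size:(g + 1) * size]
        # Statistics are shared by every channel and position in the group.
        values = [v for channel in block for v in channel]
        mean = sum(values) / len(values)
        var = sum((v - mean) ** 2 for v in values) / len(values)
        scale = (var + eps) ** -0.5
        out.extend([[(v - mean) * scale for v in channel] for channel in block])
    return out
```

For example, `group_norm(sample, groups=8)` mirrors the group count mentioned above, while `group_norm(sample, groups=1)` gives the whole-sample normalization for comparison.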

Environment

Results

  • Edges2shoes (config. 1)

    • Cyclic reconstruction loss weight = 1 for the first 80k iters and 0.3 for the rest.
    • Input/output size: 64x64.
    • Training iterations: ~130k.
    • Optimization: Use the mixup technique for the first 80k iters.
  • Edges2shoes (config. 2)

    • Cyclic reconstruction loss weight = 10.
    • Input/output size: 64x64.
    • Training iterations: ~70k.
    • Optimization: Use the mixup technique for the entire training process.
    • The model performed better on guided translation (more detail and clearer edges), possibly because of the high reconstruction loss weight.
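
The schedule of config. 1 can be sketched as a pair of small helpers, together with a generic mixup step. This is only an illustration under stated assumptions: the names `cycle_loss_weight`, `use_mixup` and `mixup` are hypothetical, iterations are assumed to be counted from 0 (so "the first 80k iters" means `iteration < 80_000`), `alpha=0.2` is a placeholder value, and the README does not say which tensors the repository actually mixes.

```python
import random

SWITCH_ITER = 80_000  # config. 1 changes its settings after the first 80k iterations


def cycle_loss_weight(iteration):
    """Cyclic reconstruction loss weight for config. 1: 1 first, then 0.3."""
    return 1.0 if iteration < SWITCH_ITER else 0.3


def use_mixup(iteration):
    """Config. 1 applies mixup only during the first 80k iterations."""
    return iteration < SWITCH_ITER


def mixup(a, b, alpha=0.2, rng=random):
    """Blend two equally sized flat samples with a Beta(alpha, alpha) coefficient.

    alpha is an assumed placeholder; returns the mixed sample and the coefficient.
    """
    if len(a) != len(b):
        raise ValueError("samples must have the same length")
    lam = rng.betavariate(alpha, alpha)
    mixed = [lam * x + (1.0 - lam) * y for x, y in zip(a, b)]
    return mixed, lam
```

Config. 2 would correspond to a constant weight of 10 with mixup enabled at every iteration.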

Acknowledgement

The code is heavily inspired by the official MUNIT PyTorch implementation. Some code is also borrowed from eridgd and tjwei.

Contributors

shaoanlu
