Git Product home page Git Product logo

sickzil-machine's Introduction

SickZil-Machine

English | 한국어

Notice

SZMC 0.1.1 - Fixed link
Tutorials and Tips

Currently, I am still working on the SickZil-Machine. It is just laying the groundwork invisible to users. I'm building the system I need for more effective training data collection, more efficient deep learning model training, faster experimentation and analysis, model serving and deployment automation, etc.
I will go to the distance. I appreciate if you could wait a little longer. Thank you.

szmc-0.1.0 (source: manga109, © Kanno Hiroshi, © Okuda Momoko, © Kato Masaki)

SickZil-Machine automates texts removal during manga/comics translation(Scanlation) process.

SeisinkiVulnus_028

LoveHina_vol14_003

AkkeraKanjinchou_031 All of the above images were edited automatically without human intervention.
(source: manga109, © Shimazaki Yuzuru, © Akamatsu Ken, © Kobayashi Yuki)

How it works??

Model

szmc-structure-eng

SickZil-Machine finds out the texts in manga/comics and erases it naturally to match the background.
Both processes are completely automatic, without any additional human intervention.
Of course, if you want, you can also specify text area you want to erase.

By the way, SickZil is korean word 식질, slang of 식자(작업). 식자 means editing manga/comics according to the translation(from translator).

We applied U-net for SegNet and Deepfill v2 for ComplNet.

Data set

SickZil-Machine consists of two deep learning models, SegNet and ComplNet.

To learn SegNet, we need original manga images and
text component masks that cover all text area corresponding to the original images.

To learn ComplNet, we need manga images with text removed (ie output).
(I'm researching how an images with a small amount of text affects performance.
 manga images with no text at all are the ideal data.)

Version 0.1.1 was trained using 285 image-mask pairs and 31,497 manga images.
(11,464 of 31,497 manga images are images with text.)

If you'd like to contribute a dataset to SickZil-Machine, please send your data to email .
The dataset will only be used for research purposes.

Release

We released 0.1.1 pre-release version!
You can download SZMC here.
Tutorials and Tips here.

SickZil-Machine is not a perfect program. We need your help.
If you find a bug or have a suggestion, please open a Github issue or send us an email.

Run the code(for developers)

You need NVIDIA driver 410.x, CUDA 10.0, CUDNN (>= 7.4.1). (tensorflow 1.13.0 requirements)

  1. git clone https://github.com/KUR-creative/SickZil-Machine.git; cd SickZil-Machine
  2. Download one of release zip files from here.
  3. Unzip the release file and copy SickZil-Machine-0.1.1-pre0-win64-cpu-eng/resource/cnet and SickZil-Machine-0.1.1-pre0-win64-cpu-eng/resource/snet directories to SickZil-Machine/resource.
  4. pip install -r requirements.txt
  5. cd src; python main.py

Future works

  • Increase text segmentation performance
  • Open manga text segmentation mask dataset
  • Automate typesetting(calligraphy style learning)



sickzil-machine's People

Contributors

kur-creative avatar 0x00000ff avatar fanglangxinghai avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.