njean42 / kumiko Goto Github PK

View Code? Open in Web Editor NEW

131.0 5.0 36.0 2.32 MB

Kumiko, the Comics Cutter

License: Other

Python 79.61% JavaScript 15.22% CSS 5.17%

comics splitter panels image-processing python opencv

kumiko's Introduction

Introduction

Kumiko mascot by Hurluberlue, CC-BY-SA 4.0

Kumiko, the Comics Cutter is a set of tools to compute useful information about comic book pages, panels, and more. Its main strength is to find out the locations of panels within a comic's page (image file). Kumiko can also compile information about panels for all pages in a comic book, and present it as one piece of data (JSON-formatted object).

Kumiko makes use of the great (freely licensed) opencv library, which provides image processing algorithms of all sorts. Mainly, the contour detection algorithm is used to detect panels within an image.

Demo

TL;WR Too Long; Won't Read the whole doc?

A live demo is available here, where you can try Kumiko out and cut your own comic pages into panels.

Philosophy

Kumiko aims at being a functional library to extract information from comic pages / books. The goal is to provide a set of tools that is usable beforehand, to extract all needed information.

External programs can later use the generated information for different purposes: panel-by-panel viewing, actual splitting of an image down into panels, etc.

Panel-by-panel comic reading

Being able to jump from one panel to the next was the original idea behind Kumiko.

xkcd by Randall Munroe, #208, CC BY-NC 2.5

Comic viewers usually imply a very common page-by-page reading paradigm. You read a page, possibly zooming on it to be able to read speech bubbles, then click, tap, press a key or swipe to the next page.

With knowledge about panels locations, we can imagine a comic reader that also offers panel-by-panel reading. This is especially interesting for small screens, on which you probably can't read the texts if a whole page is displayed.

Just run kumiko -i /path/to/comicpage.jpg -b firefox on your comicpage.jpg file, and read it panel-by-panel in your browser!

Requirements

apt-get install python3-opencv will install the only necessary library needed: opencv.

This should do the trick for Debian distros and derivatives (Ubuntu, Linux Mint...). If you successfully use Kumiko on any other platform, please let us know!

Usage & Testing

See the usage doc for details on how to use the Kumiko tools.

Also check the testing doc if you want to test modified versions of the code.

Numbering

The numbering is left-to-right, or right-to-left if requested.

Here is an example of how Kumiko is going to number panels by default (numbers and red lines not in the original picture).

Pepper & Carott by David Revoy, episode 2, CC BY 4.0

Contributing

Feature requests and PR are welcome!

Kumiko python code if formatted with yapf. Config file is committed here.

To format all your code, simply run:

yapf3 --recursive --in-place .

Short- and longer-term features (roadmap)

Kumiko library

detect panels on a growing range of comic page layouts
- detect non-framed panels (without clear boundaries/borders)
- separate intertwined panels
~~be able to detect panel contours on pages with non-white, non-black background~~ done in v1.5

Back-office (validation / edition tool)

Let's face it: we probably can't ensure that Kumiko can perfectly find out the panels in any image. There is a huge diversity of panel boundaries, layouts and whatnot.

This is why there could be some kind of back-office / editing tool that lets a human editor:

validate pages
add, delete, move or resize incorrect panels
report bugs
...

Such a tool would edit the JSON file representing a comic book information, for later use by other programs relying on it.

kumiko's People

Contributors

Stargazers

Watchers

kumiko's Issues

Contact.

Hey.

I've found your repo and I'm working on something very similar (also working on bubbles, panel order/flow, face detection, character identification, association of character and bubbles, etc).
I wanted to reach out because I'm curious to learn why you were working on this, and I'd like to know if you're still working on this, and if you'd like to cooperate or discuss possible solutions etc.

Apologies if I'm disturbing you.

[email protected]

Cheers.

Excellent!

I would like to use your excellent tool to create on online comic library based on your demo front end (extending it of course).
Is this code also shared ?
Thanks a lot!

Batch process fails

Hello,
I am trying the batch process now, and it raises an error:

 /home/maxunger/ComicToVideo/kumiko/kumiko -i /var/www/ilurn.com/wp-content/uploads/comics_adventures/lesecretdelalicorne/ > ./lesecretdelalicorne.json
Traceback (most recent call last):
  File "/home/maxunger/ComicToVideo/kumiko/kumiko", line 27, in <module>
    info = k.parse_dir(file_or_folder)
  File "/home/maxunger/ComicToVideo/kumiko/kumikolib.py", line 39, in parse_dir
    return self.parse_images(filenames)
  File "/home/maxunger/ComicToVideo/kumiko/kumikolib.py", line 45, in parse_images
    infos.append(self.parse_image(filename))
  File "/home/maxunger/ComicToVideo/kumiko/kumikolib.py", line 53, in parse_image
    size = list(img.shape[:2])
AttributeError: 'NoneType' object has no attribute 'shape'

Can you please help ?

Can't figure out how to use kumiko

I am terribly sorry if I am simply misunderstanding something, but I can't seem to launch/use kumiko in command line at all. I open command line in the folder with kumiko in it, but any command, like kumiko --help, only produces:
"'kumiko' is not recognized as an internal or external command, operable program or batch file."
I've tried on Windows and in WSL, but the issue is the same. I am probably missing something fundamental, seeing as it works on many machines. I would appreciate any help

kumiko takes a couple of minutes to process some pages

Hi! Sometimes when the image is too big and sharp kumiko takes more than a minute to process a single page (instead of a second). Apparently this can be fixed with applying GaussianBlur or resizing the page in question. For example, by adding a line to the kumikolib.py file:

	def parse_image(self,filename,url=None):
		self.img = cv.imread(filename)
+		self.img = cv.GaussianBlur(self.img,(5,5),0)

The solution is from here

Run kumiko from python script for local files

Hello. I'm trying to execute Kumiko in the Python script on some saved manga images stored on my PC. I tried to follow one of the previous issues by downloading lib folder and kumikolib and running:
from kumikolib import Kumiko k= Kumiko() info = k.parse_url_list(["path to the image"]) panels = info[0]['panels']

but info turned out to be None.
What am I doing wrong? Is there even a way to run Kumiko like this?

Installation help

I'd really like to test out the software, but pip install kumiko doesn't work on my end.

ERROR: Could not find a version that satisfies the requirement kumiko
ERROR: No matching distribution found for kumiko

What could I do to make this work and can kumiko deal with .cbz/cbr files?

Python rather than command line

Hello, I was wondering how I could use kumiko in a python script rather than from the command line.

ZeroDivisionError

Kumiko throws a ZeroDivisionError when running the algorithm on this image as input. Resolution matters when creating this bug, so this image may not throw an error because of the compression while uploading it. If that's the case I can work with you to get the image to you at the correct resolution.

Here is an image of the error logging in the terminal. I've spent some time poking around, but didn't really get anywhere as I don't know python too well.

Ordering seems off in certain cases

In the following image, the 6th panel is not right positionned.

Original page can be found here : https://i.pinimg.com/originals/22/d3/60/22d360573d335e84f85c7c734e94b8af.jpg

For OpenCV 3.2 or higher version usage

kumikolib.py
change

contours, hierarchy = cv.findContours(thresh, cv.RETR_EXTERNAL, cv.CHAIN_APPROX_SIMPLE)

to
_ , contours, hierarchy = cv.findContours(thresh, cv.RETR_EXTERNAL, cv.CHAIN_APPROX_SIMPLE)

OpenCV docs

Problem of override default `eq` in Panel

	def __eq__(self, other):
		return all(
			[
				abs(self.x - other.x) < self.wt(),
				abs(self.y - other.y) < self.ht(),
				abs(self.r - other.r) < self.wt(),
				abs(self.b - other.b) < self.ht(),
			]
		)

Override __eq__ of Panel cause problem. Assume panel_a is __eq__ to panel_b and panel_b is __eq__ to panel_c, but it is may not true that panel_a is __eq__ to panel_c.
If panel_a, panel_b, panel_c are in panels array(order a b c) and we remove it with.order panel_b, panel_c, panel_a, then panel_a, panel_b will be remove in order, and error will occur when try to remove pane_a because panel_c is left in panels array. This may happens in merge_panes in page.py

Maybe we should avoid override default __eq__ (lt) etc and use explicit func with similar top_left etc

Did development stop?

Hey @njean42 ! Great project, man!

Is the development halted?

Kumiko can't detect panels within a page with black background

I tried the kumiko through the online demo. Some other pages with white background are working great, but then I tried this page with black color background and kumiko only detected it as a single panel.

Perhaps you have solution for this? Thanks!

Image:

Extract image files of panels

Would be a nice feature if you could extract png's or jpg's of the individual panels for additional processing such as image to text.

Parameter tuning

I notice that when cutting certain xkcd comics, the cutting is not perfect, which is expected for a general-purpose tool. For example, This xkcd creates a panel that starts at the left edge and goes halfway into the second panel
as so. Which parameters should I tune in order to avoid these? Contour detection? panel.split?

Another example finds this
this as a panel