
anki-jrp's Introduction

Japanese Readings and Pitch Accent Add-on

This Anki add-on automatically generates readings (furigana) and pitch accent information for an entire field with a single button press. It can also convert any number of cards in bulk.

The add-on is currently experimental and you'll probably encounter bugs or crashes. Aside from the specific notes being changed, data corruption is extremely unlikely (probably even impossible), but you should still make backups, especially when bulk converting notes, and enable Anki's built-in automatic backups if you haven't already.
See below for details on how to report issues.

Getting Started

  1. Make sure you have a supported version of Anki installed. See the AnkiWeb page for further information.
  2. On non-Windows platforms, install MeCab, since only a Windows executable is bundled with the add-on.
  3. Install the full version of the add-on from AnkiWeb.
    Alternatively, download one of the .ankiaddon files from the latest release and manually install it from the add-on menu.
    • The smallest file contains only the add-on itself, which means you will need to supply your own pitch accent/variant data and externally install MeCab as well as a dictionary for it to use.
    • The full file has everything except a MeCab dictionary, but includes a MeCab executable for Windows.
    • The ipadic file contains everything the add-on needs out of the box and is probably what most users will want to install initially.
      This is also the version shared on AnkiWeb.
  4. Restart Anki, open the Manage Note Types menu (Ctrl+Shift+N) and set up your templates as described here; a rough sketch of the markup follows this list.
  5. Open the add-on's preferences under Tools in Anki's main menu bar, go to the Note Types tab, then select and Add all note types you set up in the previous step.
    Click Remove MIA/Migaku for any note types you previously used the Migaku Japanese Add-on with to remove that add-on's script and styling; otherwise this add-on will not work properly.
  6. Save your preferences. You should now be able to preview or review any cards that contain reading/accent syntax with furigana and accent coloring / indicators.
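
The exact template markup is covered in the guide linked in step 4. As a rough sketch (Sentence is a placeholder field name, but data-jrp-generate is the attribute the add-on's front-end script actually looks for), the element wrapping the field on your card template carries a data-jrp-generate attribute, whose value can hold ;-separated options such as migaku for fields written in Migaku syntax:

  <div data-jrp-generate>{{Sentence}}</div>

  <!-- for a field containing Migaku-style syntax instead: -->
  <div data-jrp-generate="migaku">{{Sentence}}</div>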

Adding Readings and Pitch Accent Information

Individual

To generate or remove readings and accent info for a single note, select it in the Anki browser, focus the field you want to change and click one of the conversion buttons at the right of the editor toolbar or press the associated shortcut.

Bulk Conversion

The Notes entry in the browser's menu bar contains an action for converting notes in bulk.

Select any notes you want to change, choose the bulk conversion action from the menu bar, adjust the settings in the configuration dialog and click Convert. All notes need to be of the same note type, since it wouldn't be possible to determine the target field otherwise.

With the Default and Migaku conversion types, any notes that already contain reading/accent syntax will be converted directly to the closest equivalent in the target syntax without regenerating (unless the corresponding option is checked), or left unchanged if the existing syntax type is the same as the target.

If you're converting large numbers of notes, such as sentence mining decks containing hundreds or thousands of cards, make sure to back everything up before running the conversion. Exporting the deck(s) with scheduling information (but without media, since the conversion only changes the notes themselves) should be sufficient.

Syntax

The add-on supports its own fully-featured syntax and is also compatible with the syntax from the old Migaku Japanese Add-on.

Default

In the default syntax, readings are written in square brackets with kanji on the left and furigana on the right separated by a |, like [振|ふ]り[仮名|がな].
Words with pitch accent information are enclosed in curly braces that contain the word (including reading tags) itself and accent information after a semicolon: {[受|う]け[入|い]れる;Y4,0}.
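
Putting both together, a field might contain raw text like this (the accent numbers here are illustrative, not verified dictionary values):

  {[振|ふ]り[仮名|がな];0}を{[受|う]け[入|い]れる;Y4,0}

which renders as 振り仮名を受け入れる with furigana above each bracketed segment and accent coloring/indicators on both braced words.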

The accent information normally consists of a comma-separated list of numbers, each representing the accented mora, or 0 for unaccented (平板) words. The only special cases are unknown accents marked with a ?, which can only occur by converting from Migaku syntax, and split accents like 一目瞭然 (いち↓もく・りょ→うぜん) which are composed of several parts separated by dashes. Each part consists of the number of the accented mora, an @ sign, and the number of moras it applies to. For example: {[一目瞭然|いちもくりょうぜん];2@4-0@4,0}.
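
To illustrate how these tags decompose, here is a minimal Python sketch (the add-on itself is written in Python, but the names below are made up for illustration and are not its actual API):

  def parse_accent(tag: str, mora_count: int):
      # "?" marks an unknown accent (only produced by Migaku conversion)
      if tag == "?":
          return None
      parts = tag.split("-")
      if len(parts) == 1:
          return int(tag)  # ordinary accent: accented mora, 0 = unaccented
      result = []  # split accent: one (accented mora, moras covered) pair per part
      for i, part in enumerate(parts):
          if "@" in part:
              downstep, count = part.split("@")
              result.append((int(downstep), int(count)))
          elif i == len(parts) - 1:
              # only the last part may omit "@"; it covers the remaining moras
              result.append((int(part), mora_count - sum(c for _, c in result)))
          else:
              raise ValueError(f"only the last part may omit '@': {tag}")
      return result

  # {[一目瞭然|いちもくりょうぜん];2@4-0@4,0}: いちもくりょうぜん has 8 moras,
  # and each tag in the comma-separated list is parsed separately:
  parse_accent("2@4-0@4", 8)  # [(2, 4), (0, 4)]: いち↓もく, then りょ→うぜん
  parse_accent("0", 8)        # 0: the unaccented (平板) variant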

An ! and/or a Y can be placed before the actual pitch accent information.
Exclamation marks indicate ambiguous accents, such as for many kana-only words or certain words with multiple accents. The add-on automatically inserts ambiguity marks if it finds more than one possible set of accents with the same reading for a word. You should then manually look up the words in question and adjust the accent tags as necessary.
If present, a Y (from 用 as in 活用・用言) will cause all accents other than [0] to be displayed as the kifuku (起伏) pattern. Ys are added to all words identified as 動詞 (verbs) or 形容詞 (i-adjectives) by MeCab.

Base readings for conjugated words can be indicated in two ways: inline, like {[行|い][って=く];Y0}, or after the accent(s), like {[来|き]た;Y1|くる}.

Migaku

For backwards compatibility, Migaku-style syntax is also supported, but it has some limitations compared to the new syntax:

  • Only one reading per word is possible, leading to furigana frequently duplicating characters from the word, like この先このさき, 付き合つきあう or 変わり身かわりみ.
  • The base reading of conjugable words must always be included in full, even the part identical to the reading of the conjugated form. This also applies when the base reading and actual reading are the same, so the Migaku version of what would be {[陥|おとしい]れる;Y5,0} in default syntax is 陥[おとしい,おとしいれる;k5,h]れる (see the sketch after this list for how these pattern letters map onto accent numbers).
  • Since words are space-separated, regular ASCII spaces can't be represented properly and are replaced with en spaces (U+2002).
  • Ambiguous accents can't be marked and thus won't be highlighted.
  • Split accents can't be accurately represented and will be displayed as unknown.
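
The pattern letters in Migaku accent tags (as in k5,h above) appear to correspond to the traditional pattern names: h for 平板 (heiban), a for 頭高 (atamadaka), n for 中高 (nakadaka), o for 尾高 (odaka) and k for 起伏 (kifuku), with k and n carrying an explicit downstep number. Here is a hedged Python sketch of how they map onto the plain accent numbers of the default syntax, reconstructed from the add-on's display script rather than any official reference:

  def migaku_accent_to_number(tag: str, mora_count: int) -> int:
      letter, digits = tag[0], tag[1:]
      if letter == "h":
          return 0            # heiban: unaccented
      if letter == "a":
          return 1            # atamadaka: downstep after the first mora
      if letter in ("k", "n"):
          return int(digits)  # kifuku/nakadaka: explicit accented mora
      if letter == "o":
          return mora_count   # odaka: downstep after the last mora
      raise ValueError(f"invalid Migaku accent pattern: {tag}")

  # 陥[おとしい,おとしいれる;k5,h]れる: おとしいれる has 6 moras
  migaku_accent_to_number("k5", 6)  # 5, matching {[陥|おとしい]れる;Y5,0}
  migaku_accent_to_number("h", 6)   # 0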

Migrating from the Migaku Japanese Add-on

Follow the Getting Started guide above. You'll need to manually transfer your accent pattern colors to the note type style settings if you want to keep them, since the add-ons have different default colors.

You can easily migrate existing notes to the new syntax without losing any manual changes by selecting them in the browser and bulk converting them with the Default conversion type as long as Regenerate contents is disabled. Make sure to back up all notes in case there are bugs in the conversion algorithm.

Reporting Issues

If you encounter a bug or crash while using the add-on, please report it. You can either open a GitHub issue in this repository or contact me on Discord, where you can find me in the Refold community servers.

Explain what you were doing when the issue occurred as accurately as possible, ideally with a list of steps to reproduce it. If the problem is related to specific cards, consider including an .apkg export of those cards (Notes → Export Notes... in the Anki browser with the cards in question selected). If there was an error message, include its full contents as well, if possible as selectable text.


anki-jrp's Issues

Issues with Mecab

I'm using Arch Linux. I've tried both of the AUR packages for mecab and I've tried compiling the dictionaries myself, but trying to generate syntax for cards always fails.
When I try to bulk convert after compiling mecab myself, it says:
Mecab error, stopping conversion: invalid line: param.cpp(69) [ifs] no such file or directory: ./dicrc
When I try using the AUR package, it says:
Mecab error, stopping conversion: invalid line: param.cpp(69) [ifs] no such file or directory: /usr/lib/mecab/dic/ipadic/dicrc
I've tried copying the dicrc included with the add-on (which I downloaded from the Anki website) to the mecab directory, but that resulted in the same error.

Accent Audio Implementation?

Hello, I was just wondering if there are any plans to implement the feature of clicking on words to play audio, like Migaku Japanese did. It was a great feature.

The add-on menu goes beyond the screen vertically

Please make the main menu and the note type style menu take up less vertical screen space; I can't see the bottom of the menu window, which makes it unusable. The machine in question is a MacBook Air M1 with a resolution of 2560 x 1600. I have to switch my display settings to the most "spacious" setting to see the vertically long main menu in full, but even then the note type style menu goes off screen.

Maybe split the menu into 2 columns?

[screenshot]

Thank you for the add-on.

Allow generating into a different field

It would be handy to be able to keep one field as the original input, and generate the readings into a new field.

While duplicating a field is certainly possible, by default Anki doesn't make this too easy.

Thank you for this useful software!

Error when installing Release 0.1 on Anki 2.1.49 (Linux)

I tried to install release 0.1 (IPADIC version) from the .ankiaddon file, but it looks like a library is missing.
Here is the debug info:

Anki 2.1.49 (dc80804a) Python 3.8.1 Qt 5.15.1 PyQt 5.15.1
Platform: Linux
Flags: frz=True ao=True sv=2
Add-ons, last update check: 2022-07-20 08:49:15

Caught exception:
Traceback (most recent call last):
  File "aqt/main.py", line 1634, in onAppMsg
  File "aqt/main.py", line 1189, in installAddon
  File "aqt/addons.py", line 1595, in installAddonPackages
  File "aqt/addons.py", line 467, in processPackages
  File "aqt/addons.py", line 402, in install
  File "aqt/addons.py", line 439, in _install
  File "zipfile.py", line 1628, in extract
  File "zipfile.py", line 1698, in _extract_member
  File "zipfile.py", line 1569, in open
  File "zipfile.py", line 817, in __init__
  File "zipfile.py", line 718, in _get_decompressor
  File "zipfile.py", line 695, in _check_compression
RuntimeError: Compression requires the (missing) lzma module

I also tried using the source code; here is the traceback at launch:

⁨Traceback (most recent call last):
  File "aqt/addons.py", line 230, in loadAddons
  File "/home/didier/.local/share/Anki2/addons21/anki-jrp/__init__.py", line 1, in <module>
    from . import ankilib
  File "/home/didier/.local/share/Anki2/addons21/anki-jrp/ankilib/__init__.py", line 1, in <module>
    from . import hooks
  File "/home/didier/.local/share/Anki2/addons21/anki-jrp/ankilib/hooks.py", line 4, in <module>
    from . import browser, editor, global_vars, main_menu, updates
  File "/home/didier/.local/share/Anki2/addons21/anki-jrp/ankilib/browser.py", line 16, in <module>
    from . import global_vars as gv
  File "/home/didier/.local/share/Anki2/addons21/anki-jrp/ankilib/global_vars.py", line 3, in <module>
    from lzma import LZMAError
  File "/home/dae/venv/lib/python3.8/site-packages/PyInstaller-4.0.dev0+g2886519-py3.8.egg/PyInstaller/loader/pyimod03_importers.py", line 625, in exec_module
  File "lzma.py", line 27, in <module>
ModuleNotFoundError: No module named '_lzma'

I thought of a conflict with another add-on, but I'm getting the same result with all other add-ons disabled.

For some cards I get a "Cannot read properties of null" error, for others nothing happens

Hello,
I followed all the steps in the guide, but for some cards I get this error: "⚠ Error during parsing: Cannot read properties of null (reading 'length')",
and for others the add-on doesn't change anything. I'm on Windows 11.
These are the fields of the card type in my deck:
[screenshot]
And this is the syntax of the card front:

{{furigana:kanji_furigana}}
<script>(function(){ function every(itr, pre) { for (const e of itr) if (!pre(e)) return false; return true; } function some(itr, pre) { for (const e of itr) if (pre(e)) return true; return false; } function parse_int_exc(value) { const res = parseInt(value); if (isNaN(res)) { throw new ParsingError(`not a valid integer: ${value}`); } return res; } function maketrans(src, tgt) { return (chr) => { const pos = src.indexOf(chr); if (pos < 0) { return chr; } else return tgt[pos]; }; } function translate(val, tr) { const chars = []; for (const c of val) { chars.push(tr(c)); } return chars.join(""); } function parseHtml(text) { const parent = document.createElement("div"); parent.innerHTML = text; return Array.from(parent.childNodes); } const hira = "ぁあぃいぅうぇえぉおかがきぎくぐけげこごさざしじすずせぜそぞただちぢっつづてでとどなにぬねのはばぱひびぴふぶぷへべぺほぼぽまみむめもゃやゅゆょよらりるれろゎわゐゑをんゔゕゖゝゞ"; const kata = "ァアィイゥウェエォオカガキギクグケゲコゴサザシジスズセゼソゾタダチヂッツヅテデトドナニヌネノハバパヒビピフブプヘベペホボポマミムメモャヤュユョヨラリルレロヮワヰヱヲンヴヵヶヽヾ"; const to_hira_tbl = maketrans(kata, hira); const to_kata_tbl = maketrans(hira, kata); const non_script_chrs = "ー・"; const is_hira_set = new Set(hira + non_script_chrs); const is_kata_set = new Set(kata + non_script_chrs); function is_hira(kana) { return every(kana, c => is_hira_set.has(c)); } function to_hira(kana) { return translate(kana, to_hira_tbl); } function is_kata(kana) { return every(kana, (c) => is_kata_set.has(c)); } function to_kata(kana) { return translate(kana, to_kata_tbl); } function comp_kana(kana, ...args) { const first = to_kata(kana); return every(args, kstr => to_kata(kstr) === first); } const i_dan = ["キ", "ギ", "シ", "ジ", "チ", "ヂ", "ニ", "ヒ", "ビ", "ピ", "ミ", "リ"]; const e_comp = ["イ", "ウ", "キ", "ギ", "ク", "グ", "シ", "ジ", "チ", "ツ", "ニ", "ヒ", "ビ", "ピ", "フ", "ミ", "リ", "ヴ"]; function split_moras(reading, as_hira = false) { const conv_fn = as_hira ? to_hira : to_kata; const kana = to_kata(reading); const moras = []; for (let i = 0; i < kana.length; ++i) { const ck = kana[i]; const nk = i + 1 < kana.length ? kana[i + 1] : null; if (nk !== null) { if (i_dan.includes(ck) && ["ャ", "ュ", "ョ"].includes(nk) || nk === "ヮ" && ["ク", "グ"].includes(ck) || nk === "ァ" && ["ツ", "フ", "ヴ"].includes(ck) || nk === "ィ" && ["ク", "グ", "ス", "ズ", "テ", "ツ", "デ", "フ", "イ", "ウ", "ヴ"].includes(ck) || nk === "ゥ" && ["ト", "ド", "ホ", "ウ"].includes(ck) || nk === "ェ" && e_comp.includes(ck) || nk === "ォ" && ["ク", "グ", "ツ", "フ", "ウ", "ヴ"].includes(ck) || nk === "ャ" && ["フ", "ヴ"].includes(ck) || nk === "ュ" && ["テ", "デ", "フ", "ウ", "ヴ"].includes(ck) || nk === "ョ" && ["フ", "ヴ"].includes(ck)) { moras.push(conv_fn(ck + nk)); ++i; continue; } } moras.push(conv_fn(ck)); } return moras; } function generate_accent_nodes(reading, accents, is_yougen) { function graph_span(text, flat = false) { const span = document.createElement("span"); span.classList.add(flat ? "jrp-graph-bar-unaccented" : "jrp-graph-bar-accented"); span.append(text); return span; } function pattern_class(acc, mora_count, is_yougen) { if (acc === 0) { return "jrp-heiban"; } else if (is_yougen) { return "jrp-kifuku"; } else if (acc === 1) { return "jrp-atamadaka"; } else if (acc === mora_count) { return "jrp-odaka"; } else { return "jrp-nakadaka"; } } function patterns_for(acc, mora_count, is_yougen) { if (acc.value === null) { return ["jrp-unknown"]; } else { const iter = Array.isArray(acc.value) ? 
acc.value : [[acc.value, mora_count]]; return iter.map(([ds_mora, mc]) => pattern_class(ds_mora, mc, is_yougen)); } } function graph_for(acc, moras, is_yougen) { const graph_div = document.createElement("div"); if (acc.value === null) { const unk_span = document.createElement("span"); unk_span.classList.add("jrp-unknown"); unk_span.append(moras.join("")); graph_div.append(unk_span); return graph_div; } let last_mora = 0; const iter = Array.isArray(acc.value) ? acc.value : [[acc.value, moras.length]]; for (const [i, [ds_mora, mc]] of iter.entries()) { const part_moras = moras.slice(last_mora, last_mora + mc); last_mora += mc; function mora_slice(start, end) { return part_moras.slice(start, end).join(""); } const acc_span = document.createElement("span"); acc_span.classList.add(pattern_class(ds_mora, mc, is_yougen)); if (ds_mora === 1) { acc_span.append(graph_span(part_moras[0]), mora_slice(1)); } else if (ds_mora === 0) { acc_span.append(part_moras[0], graph_span(mora_slice(1), true)); } else { acc_span.append(part_moras[0], graph_span(mora_slice(1, ds_mora)), mora_slice(ds_mora)); } graph_div.append(acc_span); if (i < iter.length - 1) { graph_div.append("・"); } } return graph_div; } function indicator_for(acc, mora_count, is_yougen) { const acc_indicator = document.createElement("div"); acc_indicator.classList.add("jrp-indicator"); if (acc.value === null) { const unk_div = document.createElement("div"); unk_div.classList.add("jrp-unknown"); acc_indicator.append(unk_div); return acc_indicator; } const iter = Array.isArray(acc.value) ? acc.value : [[acc.value, mora_count]]; for (const [ds_mora, mc] of iter) { const acc_div = document.createElement("div"); acc_div.classList.add(pattern_class(ds_mora, mc, is_yougen)); acc_indicator.append(acc_div); } return acc_indicator; } const moras = split_moras(reading); const graph_div = document.createElement("div"); graph_div.classList.add("jrp-graph"); const indicator_div = document.createElement("div"); indicator_div.classList.add("jrp-indicator-container"); for (const acc of accents) { graph_div.appendChild(graph_for(acc, moras, is_yougen)); indicator_div.append(indicator_for(acc, moras.length, is_yougen)); } return [patterns_for(accents[0], moras.length, is_yougen), graph_div, indicator_div]; } class Accent { constructor(value) { this.value = value; } static from_str(val, mora_count = null) { function parse_part(v) { const split = v.split("@"); if (split.length === 1) { return [parse_int_exc(v), null]; } else if (split.length === 2) { return [parse_int_exc(split[0]), parse_int_exc(split[1])]; } else { throw new ParsingError(`invalid accent tag: ${val}`); } } if (val === "?") { return new Accent(null); } const part_strs = val.split("-"); if (part_strs.length > 1) { const parts = part_strs.map(parse_part); if (some(parts.slice(0, -1), ([_, mc]) => mc === null)) { throw new ParsingError(`only the last part of a compound accent can have no @: ${val}`); } const [ds_mora, raw_mc] = parts[parts.length - 1]; if (raw_mc === null) { if (mora_count === null) { throw new ParsingError("mora_count missing in Accent#from_str"); } else { const calc_count = mora_count - parts.slice(0, -1).map(p => p[0]).reduce((p, c) => p + c); parts[parts.length - 1] = [ds_mora, calc_count]; } } return new Accent(parts); } else { return new Accent(parse_int_exc(val)); } } } class Segment { constructor(text, reading = null) { this.text = text; this.reading = reading; if (reading !== null && reading.length > 0 && !comp_kana(text, reading)) { this.reading = reading; } else 
this.reading = null; } get_reading() { return (this.reading !== null ? this.reading : this.text); } } class Unit { constructor(segments, accents = [], is_yougen = false, uncertain = false, base_reading = null, was_bare = false) { this.segments = segments; this.accents = accents; this.is_yougen = is_yougen; this.uncertain = uncertain; this.base_reading = base_reading; this.was_bare = was_bare; } static bare(segments) { const u = new Unit(segments); u.was_bare = true; return u; } accent_reading() { if (this.base_reading === null) { return this.segments.map(s => s.get_reading()).join(""); } else return this.base_reading; } generate_dom_nodes(bare_empty = true) { const segment_nodes = this.segments.flatMap(s => { if (s.reading === null) { return parseHtml(s.text); } else { const rt = document.createElement("rt"); rt.append(...parseHtml(s.reading)); const ruby = document.createElement("ruby"); ruby.append(...parseHtml(s.text)); ruby.appendChild(rt); return [ruby]; } }); if (bare_empty && this.accents.length === 0) { return segment_nodes; } const text_span = document.createElement("span"); text_span.append(...segment_nodes); const unit_span = document.createElement("span"); unit_span.classList.add("jrp-unit"); unit_span.append(text_span); if (this.accents.length > 0) { const [pat_classes, graph, indicators] = generate_accent_nodes(this.accent_reading(), this.accents, this.is_yougen); unit_span.classList.add(pat_classes[0]); for (const [i, pc] of pat_classes.slice(1).entries()) { unit_span.classList.add(`${pc}-${i + 2}`); } if (this.uncertain) { unit_span.classList.add("jrp-uncertain"); } unit_span.append(indicators, graph); } return [unit_span]; } } class ParsingError extends Error { constructor(msg) { super(msg); } } const nbsp_re = /(?:%nbsp|\x0a)/; function replace_nbsp(val) { return val.replace(nbsp_re, " "); } function read_until(val, idx, stop) { const chars = []; for (let i = idx; i < val.length; ++i) { const c = val[i]; if (stop.includes(c)) { return [i, c, chars.join("")]; } else if (c === "\\") { if (++i < val.length) { chars.push(val[i]); } else throw new ParsingError("backslash at end of input"); } else chars.push(c); } return [val.length, null, chars.join("")]; } const mi_acc_re = /^([hkano])(\d*)$/; function parse_migaku_accents(val, reading) { function convert(tag, moras) { const m = tag.match(mi_acc_re); if (m === null) { return new Accent(null); } const pat_c = m[1]; switch (pat_c) { case "h": return new Accent(0); case "a": return new Accent(1); case "k": case "n": if (m[2].length === 0) { throw new ParsingError(`missing downstep number: ${tag}`); } return new Accent(parse_int_exc(m[2])); case "o": return new Accent(moras); default: throw new ParsingError(`invalid Migaku accent pattern: ${tag}`); } } const moras = split_moras(reading).length; return val.split(",").map(t => convert(t, moras)); } function parse_migaku(value) { let State; (function (State) { State[State["BASE_READING"] = 0] = "BASE_READING"; State[State["ACCENTS"] = 1] = "ACCENTS"; State[State["SUFFIX"] = 2] = "SUFFIX"; })(State || (State = {})); class Parser { constructor(val) { this.val = val; this.pos = 0; } skip_space() { while (this.pos < this.val.length && this.val[this.pos] === " ") { ++this.pos; } } read_text(stop) { const full_stop = stop.concat(["<"]); const parts = []; while (true) { const [stop_pos, stop_c, txt] = read_until(this.val, this.pos, full_stop); parts.push(txt); if (stop_c === null || stop.includes(stop_c)) { this.pos = stop_pos + 1; return [stop_c, parts.join("")]; } else { const 
[tag_end_pos, tag_end_c, tag_cont] = read_until(this.val, stop_pos, [">"]); if (tag_end_c === null) { throw new ParsingError(`Unclosed HTML tag: ${tag_cont}`); } else { this.pos = tag_end_pos + 1; parts.push(tag_cont); parts.push(">"); } } } } parse_unit() { let state; const [prfx_end_c, prefix] = this.read_text(["[", " "]); if (prfx_end_c === " " || prfx_end_c === null) { return Unit.bare([new Segment(prefix)]); } const [rdng_end_c, prefix_reading] = this.read_text([",", ";", "]"]); switch (rdng_end_c) { case ",": state = State.BASE_READING; break; case ";": state = State.ACCENTS; break; case "]": state = State.SUFFIX; break; default: throw new ParsingError(`unclosed Migaku tag: ${this.val}`); } let base_reading = ""; if (state === State.BASE_READING) { let bsfm_end_c; [bsfm_end_c, base_reading] = this.read_text([";", "]"]); switch (bsfm_end_c) { case ";": state = State.ACCENTS; break; case "]": state = State.SUFFIX; break; default: throw new ParsingError(`unclosed Migaku tag: ${this.val}`); } } let accent_str = ""; if (state === State.ACCENTS) { let acct_end, acct_c; [acct_end, acct_c, accent_str] = read_until(this.val, this.pos, ["]"]); switch (acct_c) { case "]": state = State.SUFFIX; break; default: throw new ParsingError(`closing ] missing: ${this.val}`); } this.pos = acct_end + 1; } let suffix = ""; if (state === State.SUFFIX) { [, suffix] = this.read_text([" "]); } const segments = [new Segment(prefix, prefix_reading)]; if (suffix.length > 0) { segments.push(new Segment(suffix)); } const is_yougen = base_reading.length > 0; let accents; if (accent_str.length > 0) { const reading = is_yougen ? base_reading : (prefix_reading + (suffix.length > 0 ? suffix : "")); accents = parse_migaku_accents(accent_str, reading); } else accents = []; return new Unit(segments, accents, is_yougen, false, is_yougen ? base_reading : null); } execute() { const units = []; while (this.pos < this.val.length) { this.skip_space(); units.push(this.parse_unit()); } return units; } } return new Parser(replace_nbsp(value)).execute(); } function parse_jrp(value) { function parse_segment(val, start_idx) { const [sep_idx, sep_c, seg_text] = read_until(val, start_idx + 1, ["|", "="]); if (sep_c !== null) { const [end_idx, end_c, reading] = read_until(val, sep_idx + 1, ["]"]); if (end_c === null) { throw new ParsingError(`segment is missing closing bracket: ${val}`); } return [end_idx + 1, sep_c === "=" ? 
[seg_text, reading] : new Segment(seg_text, reading)]; } throw new ParsingError(`invalid segment: ${val}`); } function parse_unit(val, start_idx) { const segments = []; const base_reading_parts = []; let pos = start_idx + 1; let broke_out = false; while (pos < val.length) { let last_c, txt; [pos, last_c, txt] = read_until(val, pos, ["[", ";", "}"]); if (txt.length > 0) { segments.push(new Segment(txt)); base_reading_parts.push(txt); } if (last_c === "[") { let srv; [pos, srv] = parse_segment(val, pos); if (srv instanceof Segment) { segments.push(srv); base_reading_parts.push(srv.get_reading()); } else { const [text, base_reading] = srv; segments.push(new Segment(text)); base_reading_parts.push(base_reading); } } else if (last_c === ";" || last_c === "}") { broke_out = true; break; } } if (!broke_out) { throw new ParsingError(`unclosed unit: ${val}`); } let accent_str = ""; let special_base = null; let uncertain = false; let is_yougen = false; if (val[pos] === ";") { ++pos; if (val[pos] === "!") { uncertain = true; ++pos; } if (val[pos] === "Y") { is_yougen = true; ++pos; } let end_idx, end_c; [end_idx, end_c, accent_str] = read_until(val, pos, ["|", "}"]); if (end_c === null) { throw new ParsingError(`unclosed unit: ${val}`); } if (end_c === "|") { let unit_end_idx, ec; [unit_end_idx, ec, special_base] = read_until(val, end_idx + 1, ["}"]); if (ec !== null) { pos = unit_end_idx; } else throw new ParsingError(`unclosed unit: ${val}`); } else { pos = end_idx; } } let base_reading = base_reading_parts.join(""); if (special_base !== null) { base_reading = special_base; } const unit = new Unit(segments, [], is_yougen, uncertain, base_reading); try { const mora_count = split_moras(unit.accent_reading()).length; unit.accents = accent_str.split(",").map(acc => { return Accent.from_str(acc.trim(), mora_count); }); } catch (e) { throw new ParsingError(`invalid accent: ${val}`); } return [pos + 1, unit]; } value = replace_nbsp(value); const units = []; let free_segments = []; let idx = 0; while (idx < value.length) { let c, text; [idx, c, text] = read_until(value, idx, ["[", "{"]); if (text.length > 0) { free_segments.push(new Segment(text)); } switch (c) { case "{": if (free_segments.length > 0) { units.push(Unit.bare(free_segments)); } free_segments = []; let u; [idx, u] = parse_unit(value, idx); units.push(u); break; case "[": let segment; [idx, segment] = parse_segment(value, idx); if (!(segment instanceof Segment)) { throw new ParsingError(`base form segment outside of unit: ${value}`); } free_segments.push(segment); break; } } if (free_segments.length > 0) { units.push(Unit.bare(free_segments)); } return units; } function generator_settings(attr_val) { const res = {}; let idx = 0; while (idx < attr_val.length) { let stop_c, text; [idx, stop_c, text] = read_until(attr_val, idx, [";"]); const split = text.split(":"); switch (split.length) { case 1: res[split[0]] = true; break; case 2: res[split[0]] = split[1]; break; default: throw new ParsingError(`invalid config attribute: ${attr_val}`); } ++idx; } return res; } function generate() { const root_elements = []; for (const e of document.querySelectorAll("[data-jrp-generate]")) { if (e.parentElement.closest("[data-jrp-generate]") === null) { let deepest_parent = e; while (deepest_parent.childNodes.length === 1 && deepest_parent.firstElementChild !== null) { deepest_parent = deepest_parent.firstElementChild; } root_elements.push(deepest_parent); } } const br_re = //; for (const root of root_elements) { const lines = root.innerHTML.split(br_re); 
while (root.firstChild !== null) { root.firstChild.remove(); } try { const settings = generator_settings(root.getAttribute("data-jrp-generate")); root.append(...lines.flatMap((line, index) => { const parser = settings["migaku"] ? parse_migaku : parse_jrp; const unit_nodes = parser(line).flatMap(u => { return u.generate_dom_nodes(settings["enclose-empty-units"]); }); return index > 0 ? [document.createElement("br"), ...unit_nodes] : unit_nodes; })); root.normalize(); } catch (e) { root.append(`⚠ Error during parsing: ${e.message}`); } } } generate(); })();</script>

Thank you for the help!

Mecab error, stopping conversion: executable not found

After following all the steps in Getting Started and running the bulk conversion, I got this error: "Mecab error, stopping conversion: executable not found". I'm on Mac, and I have MeCab installed through Homebrew.

FR: allow highlighting parts of a sentence

Hi, I'm in the habit of putting part of my sentence in bold so I can quickly identify which part I need to be looking at when reviewing.

[screenshot: SmartSelect_20221113-071209_AnkiDroid]

This way the part is highlighted when I review.

[screenshot: SmartSelect_20221113-071935_AnkiDroid]

It is also very useful for long monolingual definitions where I can highlight just one part of a very long definition that just "clicks".

But these HTML tags seem to be wiped out when using anki-jrp.

Any ideas for a workaround?

FR: option to only show furigana on hover

A nice feature of the old add-on was that you could have accent coloring without furigana by default; furigana could then be revealed on hover/click.

I mostly don't want any furigana, but always want accent colors.

Unable to install add-on in Anki

Hi, I was a user of the MIA/Migaku tools for a couple of years, until about a week ago when I updated my Linux Mint system and my old Anki could no longer start.

I came looking for an alternative and was pointed to your project - it looks very promising! But I am having an issue installing the addon.

I downloaded japanese-readings-and-pitch-accent_0.1_ipadic.ankiaddon, and when I try to install it in Anki's update manager I get this error:

Error
An error occurred. Please start Anki while holding down the shift key, which will temporarily disable the add-ons you have installed.
If the issue only occurs when add-ons are enabled, please use the Tools > Add-ons menu item to disable some add-ons and restart Anki, repeating until you discover the add-on that is causing the problem.
When you've discovered the add-on that is causing the problem, please report the issue on the add-on support site.
Debug info:
Anki 2.1.49 (dc80804a) Python 3.8.1 Qt 5.15.1 PyQt 5.15.1
Platform: Linux
Flags: frz=True ao=True sv=1
Add-ons, last update check: 2022-11-03 17:55:48

Caught exception:
Traceback (most recent call last):
  File "aqt/addons.py", line 893, in onInstallFiles
  File "aqt/addons.py", line 1595, in installAddonPackages
  File "aqt/addons.py", line 467, in processPackages
  File "aqt/addons.py", line 402, in install
  File "aqt/addons.py", line 439, in _install
  File "zipfile.py", line 1628, in extract
  File "zipfile.py", line 1698, in _extract_member
  File "zipfile.py", line 1569, in open
  File "zipfile.py", line 817, in __init__
  File "zipfile.py", line 718, in _get_decompressor
  File "zipfile.py", line 695, in _check_compression
RuntimeError: Compression requires the (missing) lzma module

I'm running anki-2.1.49-linux, which I downloaded from the Anki site.

I'm not that experienced with Python versions, but I suspect my Python setup is missing some lzma module. I tried googling around but don't know how I could add it; I don't seem to have Python 3.8.1 on my system, so maybe it came with the Anki download.
