
dbtxt2json_jp's Introduction

dblLineTXT2Json_jp

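A general-purpose script for converting between double-line text and json_jp.

  • Double-line text: a text format commonly used in translation work

  • json_jp: the input format accepted by GalTransl

Double-line text example: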

◇00000A7C◇清史郎
◆00000A7C◆清史郎

◇00000A84◇「ついっったぁ!!」<
◆00000A84◆「ついっったぁ!!」<

◇00000B40◇電車に揺られ数時間……。
◆00000B40◆電車に揺られ数時間……。
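The layout above can be read as pairs of lines that share a tag: the first line carries the original text between ◇ markers, the second carries the translation slot between ◆ markers. The following is a minimal sketch of a reader for that layout, not the code used by this repository. The function names `parse_tagged_line` and `parse_double_line` and the record keys `tag`, `source`, and `target` are hypothetical, chosen only for illustration.

```python
import re

# Assumed marker characters, taken from the sample above.
SRC_HEAD, SRC_TAIL = "◇", "◇"
DST_HEAD, DST_TAIL = "◆", "◆"


def parse_tagged_line(line, head, tail):
    """Return (tag, text) if the line looks like head+tag+tail+text, else None."""
    pattern = re.escape(head) + r"(.+?)" + re.escape(tail) + r"(.*)"
    match = re.fullmatch(pattern, line)
    return (match.group(1), match.group(2)) if match else None


def parse_double_line(text):
    """Pair each source line with the translation line that carries the same tag."""
    records = []
    pending = None  # last source line that has not been paired yet
    for line in text.splitlines():
        src = parse_tagged_line(line, SRC_HEAD, SRC_TAIL)
        if src:
            pending = src
            continue
        dst = parse_tagged_line(line, DST_HEAD, DST_TAIL)
        if dst and pending and dst[0] == pending[0]:
            records.append({"tag": pending[0], "source": pending[1], "target": dst[1]})
            pending = None
    return records


if __name__ == "__main__":
    sample = "◇00000A7C◇清史郎\n◆00000A7C◆清史郎\n"
    for record in parse_double_line(sample):
        print(record)
```

Blank lines and lines without a recognized marker are skipped, and a translation line is only accepted when its tag matches the preceding source line.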

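Usage example:

  1. Edit settings.py; the configuration can be worked out from the text example above:

    原文标签头部特征 = "◇"    # marker that opens the tag on an original-text line
    原文标签尾部特征 = "◇"    # marker that closes the tag on an original-text line

    译文标签头部特征 = "◆"    # marker that opens the tag on a translation line
    译文标签尾部特征 = "◆"    # marker that closes the tag on a translation line

    人名长度阈值 = 12            # name length threshold
    人名黑名单字符 = "。『」…!、"  # characters that rule a line out as a name

    file_encoding = "utf-8"

  2. Run 01_dump_nametable.py to produce nametable.txt

    Each line of nametable.txt has the form <name>[TAB]<occurrence count>
    Use your own judgment to delete the lines that are not character names

  3. Run 02_txt_jp2json_jp.py to extract json_jp

  4. Run 03_json_cn2txt_cn.py to import json_cn and produce txt_cn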
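As a rough illustration of what the name-table and extraction steps might do, the sketch below counts name candidates and then builds a json_jp-style list. It is not the repository's implementation. Several things here are assumptions: the function names are hypothetical, the threshold is treated as an inclusive upper bound, a name line is assumed to apply only to the next text line, and json_jp is assumed to be a JSON list of objects with a `message` key and an optional `name` key.

```python
import json
from collections import Counter

# Values mirror the settings shown in the steps above.
NAME_MAX_LEN = 12
NAME_BLACKLIST = "。『」…!、"


def is_name_candidate(text):
    """Assumed heuristic: short enough and free of blacklisted characters."""
    return 0 < len(text) <= NAME_MAX_LEN and not any(ch in NAME_BLACKLIST for ch in text)


def build_nametable(lines):
    """Render candidates as <name>[TAB]<count> lines, most frequent first."""
    counts = Counter(text for text in lines if is_name_candidate(text))
    return "".join(f"{name}\t{count}\n" for name, count in counts.most_common())


def to_json_jp(lines, names):
    """Attach each confirmed name line to the text line that follows it."""
    entries = []
    speaker = None
    for text in lines:
        if text in names:
            speaker = text
            continue
        if speaker is not None:
            entries.append({"name": speaker, "message": text})
            speaker = None
        else:
            entries.append({"message": text})
    return entries


if __name__ == "__main__":
    source_lines = ["清史郎", "「ついっったぁ!!」<", "電車に揺られ数時間……。"]
    print(build_nametable(source_lines), end="")
    confirmed = {"清史郎"}  # names kept after manually pruning the table
    print(json.dumps(to_json_jp(source_lines, confirmed), ensure_ascii=False, indent=2))
```

The manual pruning step matters because the heuristic only proposes candidates: any short line without blacklisted characters is counted, whether or not it is really a character name.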
