Git Product home page Git Product logo

Comments (7)

helloworld1973 avatar helloworld1973 commented on May 21, 2024 3

cannot agree more

from invoice.

guanshuicheng avatar guanshuicheng commented on May 21, 2024

更多区域的检测识别模型会在后期开放

from invoice.

simplew2011 avatar simplew2011 commented on May 21, 2024

您好,只检测了几个区域,是在yolo训练的时候只用了这几个区域的标注数据吧。

from invoice.

guanshuicheng avatar guanshuicheng commented on May 21, 2024

用了这几个区域的标注数据

对,只标注了指定的这五个区域

from invoice.

simplew2011 avatar simplew2011 commented on May 21, 2024

用了这几个区域的标注数据

对,只标注了指定的这五个区域

【您这五个区域,用yolo训练,最终loss能收敛到多少呢?】

我训练时的一些情况:
基于银行卡数据集https://github.com/cuishuaihua/Bankcard_OCR,
用chineseocr原作者训练代码,YOLO_Kmeans生成anchors = '8,46, 8,57, 8,66, 8,75, 8,84, 8,93, 8,103, 8,115, 8,129',因为只是银行卡数字,长度是一样的,只是原始图像尺寸上存在一些变化,所以聚类生成的anchors变化不大。
将chineseocr的文字检测模型作为resume加载,训练了10轮loss下降很慢
1606/1946 [=======================>......] - ETA: 3:28 - loss: 228.6397 - class_loss_loss: 0.0021 - xy_loss_loss: 79.8738 - wh_loss_loss: 0.2713 - confidence_loss_loss: 131.8300

from invoice.

Andy546 avatar Andy546 commented on May 21, 2024

用了这几个区域的标注数据

对,只标注了指定的这五个区域

请问下您的数据量大约多少呢?就是标注发票张数和生成发票张数

from invoice.

yanhaojie569 avatar yanhaojie569 commented on May 21, 2024

用了这几个区域的标注数据

对,只标注了指定的这五个区域

【您这五个区域,用yolo训练,最终loss能收敛到多少呢?】

我训练时的一些情况: 基于银行卡数据集https://github.com/cuishuaihua/Bankcard_OCR, 用chineseocr原作者训练代码,YOLO_Kmeans生成anchors = '8,46, 8,57, 8,66, 8,75, 8,84, 8,93, 8,103, 8,115, 8,129',因为只是银行卡数字,长度是一样的,只是原始图像尺寸上存在一些变化,所以聚类生成的anchors变化不大。 将chineseocr的文字检测模型作为resume加载,训练了10轮loss下降很慢 1606/1946 [=======================>......] - ETA: 3:28 - loss: 228.6397 - class_loss_loss: 0.0021 - xy_loss_loss: 79.8738 - wh_loss_loss: 0.2713 - confidence_loss_loss: 131.8300

我想问下这个是调模型就好了嘛

from invoice.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.