Comments (10)
We find that many people run into different problems when customizing the model on their own datasets. We will provide more examples of the regular datalist format.
from davar-lab-ocr.
@tucachmo2202 I created my dataset with the same git repository and converted it to Davar format by running DAVAR-Lab-OCR/blob/demo/table_recognition/lgpma/tools/convert_html_ann.py.
I ran into the same error as you, but I haven't followed this project for a long time. Sorry.
from davar-lab-ocr.
@qiaoliang6, I hope you can provide them soon. Thank you very much!
Could you please send us some of the generated images and their corresponding datalist (you may send them via email or share them through an online drive)? This would help us find the problem quickly.
from davar-lab-ocr.
@qiaoliang6, I hope you can provide them soon. Thank you very much!
@tucachmo2202 Thank you for providing samples. In these samples, we find that the original annotation in HTML has a mismatch problem, i.e., the number of bboxes does not match the number of "<td></td>" in the HTML. So in the latest update c85ca3f, we modified the conversion script to filter out the illegal samples. See demo/table_recognition/datalist/ReadMe.md for more details.
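For anyone who wants to check their own annotations before converting them, the mismatch described above can be sketched roughly as follows. This is not the project's conversion script. It assumes PubTabNet-style JSON lines in which `html.structure.tokens` holds the structure tokens and `html.cells` holds one entry per cell, and it assumes each cell is closed by exactly one `"</td>"` token. The function names `count_td_cells` and `split_by_consistency` and the sample data are made up for illustration.

```python
import json


def count_td_cells(structure_tokens):
    # Assumption: every cell, with or without colspan/rowspan attributes,
    # is closed by exactly one "</td>" token.
    return sum(1 for token in structure_tokens if token == "</td>")


def split_by_consistency(jsonl_lines):
    """Return (kept, dropped) filenames based on the <td> count vs. cell count."""
    kept, dropped = [], []
    for line in jsonl_lines:
        if not line.strip():
            continue  # skip blank lines
        sample = json.loads(line)
        html = sample["html"]
        n_td = count_td_cells(html["structure"]["tokens"])
        n_cells = len(html["cells"])
        (kept if n_td == n_cells else dropped).append(sample["filename"])
    return kept, dropped


# Two made-up annotations: the second has two <td> cells but only one cell entry.
lines = [
    json.dumps({"filename": "ok.png", "html": {
        "structure": {"tokens": ["<tr>", "<td>", "</td>", "<td>", "</td>", "</tr>"]},
        "cells": [{"tokens": ["a"], "bbox": [0, 0, 5, 5]},
                  {"tokens": ["b"], "bbox": [6, 0, 11, 5]}]}}),
    json.dumps({"filename": "bad.png", "html": {
        "structure": {"tokens": ["<tr>", "<td>", "</td>", "<td>", "</td>", "</tr>"]},
        "cells": [{"tokens": ["a"], "bbox": [0, 0, 5, 5]}]}}),
]

kept, dropped = split_by_consistency(lines)
print("kept:", kept)
print("dropped:", dropped)
```

A count mismatch of this kind would also be consistent with the IndexError reported in this thread (index 132 against an array of size 124), though that link is an inference rather than something the maintainers state.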
from davar-lab-ocr.
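Hello, I trained LGPMA on my own table dataset (in PubTabNet format) and got this error:
File "DAVAR-Lab-OCR/davarocr/davarocr/davar_table/core/mask/lp_mask_target.py", line 55, in get_lpmask_single
middle_x, middle_y = round(np.where(box_text == 1)[1].mean()), round(np.where(box_text == 1)[0].mean())
ValueError: cannot convert float NaN to integer
I prepared the training set by using DAVAR-Lab-OCR/demo/table_recognition/lgpma/tools/convert_html_ann.py to convert my dataset to Davar format. I made some changes to the html_to_davar function in convert_html_ann.py: "labels" uses 0 and 1 in place of "t-head" and "t-body", and content_ann returns only "bboxes", "cells" and "labels". The converted data has the same format as your published PubTabNet_train_datalist_all.json, but training raised the error above. Could you tell me which step might have gone wrong? Thank you.
Also: when I use your published PubTabNet_train_datalist_all.json as the training set, training runs normally.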
Could it be that gt_bboxes in this function contains cells with no text? In the lpma branch, the model is trained only on boxes that contain text.
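The NaN error is consistent with that explanation: if a box's text mask has no pixel equal to 1, the mean of an empty selection is NaN, and `round()` on NaN raises exactly this ValueError. Below is a minimal sketch of a guard, written with plain lists so it runs without NumPy. The helper name `text_center` and the choice to return `None` for empty boxes are assumptions for illustration, not the project's code, which uses `np.where(box_text == 1)`.

```python
import math


def text_center(box_text):
    """Return (middle_x, middle_y) of the pixels equal to 1, or None if there are none."""
    xs, ys = [], []
    for y, row in enumerate(box_text):
        for x, value in enumerate(row):
            if value == 1:
                xs.append(x)
                ys.append(y)
    if not xs:
        # Empty box: there is no text pixel to average, so skip it instead of
        # producing NaN.
        return None
    return round(sum(xs) / len(xs)), round(sum(ys) / len(ys))


# round() on NaN is what produces the message seen in the traceback.
try:
    round(math.nan)
except ValueError as exc:
    message = str(exc)
    print(message)

empty_box = [[0, 0, 0], [0, 0, 0], [0, 0, 0]]
text_box = [[0, 0, 0], [0, 1, 0], [0, 0, 0]]
print(text_center(empty_box))
print(text_center(text_box))
```

In a data pipeline, a `None` result could be used to drop empty cells from gt_bboxes before the mask targets are built. Whether dropping or keeping such cells is right depends on how the conversion script is expected to treat empty cells.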
from davar-lab-ocr.
Hi @le8888e,
I am stuck on this error too. Have you solved it yet?
from davar-lab-ocr.
@tucachmo2202 Not yet. The training process works smoothly with the PubTabNet dataset, but raises errors with my own data.
from davar-lab-ocr.
@tucachmo2202 Not yet. The training process works smoothly with the PubTabNet dataset, but raises errors with my own data.
How did you create your dataset? I customized this git repository to follow the PubTabNet format. When training with my dataset, it sometimes raises
row = [cellnp[i, 0], cellnp[i, 2]]
IndexError: index 132 is out of bounds for axis 0 with size 124
and sometimes it raises
ValueError: cannot convert float NaN to integer
from davar-lab-ocr.
@qiaoliang6,
I hope you can provide them soon. Thank you very much!
from davar-lab-ocr.
@qiaoliang6, I hope you can provide them soon. Thank you very much!
@tucachmo2202 Thank you for providing samples. In these samples, we find that the original annotation in HTML has a mismatch problem, i.e., the number of bboxes does not match the number of "<td></td>" in the HTML. So in the latest update c85ca3f, we modified the conversion script to filter out the illegal samples. See demo/table_recognition/datalist/ReadMe.md for more details.
Thank you very much for your help!
from davar-lab-ocr.