Comments (4)
Thanks for your attention to our work.
Q1: Are these invoices monolingual or multilingual? For resource-rich languages, for example, English, LiLT+English-Roberta often performs better than LiLT+InfoXLM. Furthermore, you don't need to train a completely fresh model from scratch. For example, if your invoices are English, you can load the pre-trained LiLT+English-Roberta weight and continue to pre-train it on the unlabeled 1 million samples for a while.
Q2: Less than a week for the experimental setup described in our paper.
Q3: You can refer to the layout diversity (task difficulty), number of samples, and the SOTA performances of the public academic datasets such as FUNSD, CORD, SROIE, EPHOIE, XFUND. Generally speaking, compared with these datasets, 5000 is already a relatively sufficient number. You can also refer to our provided fine-tuning strategies and the experimental setup described in our paper.
from lilt.
Thanks a lot, Currently monolingual but may be extended to other languages, I believe the LiLT+LN-Roberta method is much suited for my specific task. Is it really required to do the pre-train again with my own data from our end to achieve any better results or the fine-tuning is alone is sufficient.
First I want to see a few results before spending time and money on doing pretraining. That's why I am asking.
from lilt.
Generally, performing fine-tuning alone can achieve a satisfactory result. But when you really want to utilize the unlabeled "in-domain" samples, or you really want to further improve the performance, you can try the strategy of continuing pre-training.
from lilt.
@vincentAGNES HI, I recently came across your project. I have a few doubts and it'd be very helpful to me if you could please make some time and help me out.
from lilt.
Related Issues (20)
- Word or segment position embeddings? HOT 6
- Is LiLT-Large possible? HOT 1
- Pre-training code? HOT 5
- post custom dataset training ser on funsd model, inference issue HOT 1
- 代码运行问题 HOT 8
- Config error in Multi-task Semantic Entity Recognition on XFUND HOT 4
- Export model using distilroberta-base HOT 2
- Possibility to combine lilt-only-base with roberta-large HOT 2
- Usage with BigBird-Roberta-Base HOT 1
- Improve relation extraction HOT 7
- How we can use it for unstructured data HOT 1
- pip install -e . error
- how to train from scratch
- pre-processed data HOT 2
- How to decrease inference time of LiLT?
- LiLT can not make inference with the Half (float16) dtype on CPU
- Pretraining with other ROBERTa model HOT 1
- dataset format of FUNSD/XFUND
- Use LiLT / an alternative model with more than 512 tokens HOT 1
- RuntimeError: CUDA error: device-side assert triggered HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from lilt.