Can you please provide us with more information regarding "lilt-only-base" file and ho

Hi, you can read our original paper at <a href="https://aclanthology.org/2022.acl-

Hi, you can read our original paper at <a href="https://aclanthology.org/

How is "lilt-only-base" bin file is created about lilt HOT 3 CLOSED

jpwang commented on August 12, 2024

How is "lilt-only-base" bin file is created

from lilt.

Comments (3)

jpWang commented on August 12, 2024

Hi,
you can read our original paper at https://aclanthology.org/2022.acl-long.534/. As explained in it, LiLT-base+En-Roberta are pre-trained using English docs. And the provided "lilt-only-base" is exactly the pre-trained LiLT-base part. It can be used to combine different textual models to deal with docs in different languages during fine-tuning.

from lilt.

vibeeshan025 commented on August 12, 2024

Hi, you can read our original paper at https://aclanthology.org/2022.acl-long.534/. As explained in it, LiLT-base+En-Roberta are pre-trained using English docs. And the provided "lilt-only-base" is exactly the pre-trained LiLT-base part. It can be used to combine different textual models to deal with docs in different languages during fine-tuning.

I understand the usage. But I am very curious how the file "lilt-only-base" is created. As you have mentioned what is "pre-trained LiLT-base part" how that specific base part is created.
We all know how roberta-en is created and from your provided code how "gen_weight_roberta_like.py" generates the base + roberta model.

What does the base part contains.

from lilt.

jpWang commented on August 12, 2024

Pytorch uses a dict-like format to store weight name-value pairs in 'pytorch_model.bin' files. We just filter out the name-value pairs of the LiLT part by weight names from the pre-trained checkpoint to create "lilt-only-base".

from lilt.

Recommend Projects

How is "lilt-only-base" bin file is created about lilt HOT 3 CLOSED

Comments (3)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent