I am trying to train pointer generator network . After training for 10k iterations it

Sir the link to repo is below:- <a href="https://github.com/priyanks179/pointer-ge

<a class="user-mention notranslate" data-hovercard-type="user" data-hovercard-url="/us

pointer generator model starts overfitting about pointer_summarizer HOT 6 CLOSED

priyanks179 commented on June 13, 2024

pointer generator model starts overfitting

from pointer_summarizer.

Comments (6)

atulkum commented on June 13, 2024

make sure you turn off the coverage loss initially.

from pointer_summarizer.

priyanks179 commented on June 13, 2024

make sure you turn off the coverage loss initially.

bro i am not using coverage loss at all . And model that i am using is exactly same as yours . Just the thing is that i am fixing max_enc_steps to 200 and max_dec_steps to 30 .

from pointer_summarizer.

priyanks179 commented on June 13, 2024

sir pls help coz i am trying to solve this problem for a month and i am not able to figure it out .

from pointer_summarizer.

priyanks179 commented on June 13, 2024

Sir the link to repo is below:-
https://github.com/priyanks179/pointer-gen-network

from pointer_summarizer.

A-Rain commented on June 13, 2024

@atulkum Hello sir, I'm confuse why we should turn off coverage loss initially

from pointer_summarizer.

atulkum commented on June 13, 2024

If you turn on coverage initially the training won't converge.
Intuitively think about coverage as one more constraint on optimization which will restrict the space you can explore.
That is the reason first we explore the loss space without the restriction and once it converge a bit we apply the coverage.

from pointer_summarizer.

pointer generator model starts overfitting about pointer_summarizer HOT 6 CLOSED

Comments (6)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent