according to the paper and data_by_IFD.py line 103-105, my understanding mean_1 = los

a confusion about Instruction-Following Difficulty (IFD) scores about cherry_llm HOT 2 CLOSED

xiaohuoguohh commented on July 28, 2024

a confusion about Instruction-Following Difficulty (IFD) scores

from cherry_llm.

Comments (2)

MingLiiii commented on July 28, 2024 1

Thanks for asking~
We are sorry that we just mentioned it in the paper but forgot to explain.

Typically, the Conditioned Answer Score is always smaller than the Direct Answer Score for one specific data instance due to the intrinsic nature of the next token prediction. With the context given, the prediction for the latter tokens should be easier.
Then let's come back to the IFD score. If the IFD score is greater than 1, the Conditioned Answer Score is even larger than the Direct Answer Score, which means the given instruction provides no useful context for the prediction of the response. In this situation, we think there exists a misalignment between the instruction and the corresponding response. Thus we choose to filter these potentially misaligned samples.

More intuitively speaking, if the response is nothing related to the instruction, it will probably have a really high IFD score greater than 1, and we want to filter these samples out.

from cherry_llm.

xiaohuoguohh commented on July 28, 2024

It is claer, great! Thanks for answering～

from cherry_llm.

a confusion about Instruction-Following Difficulty (IFD) scores about cherry_llm HOT 2 CLOSED

Comments (2)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent