Describe the bug 我使用了 lmdeploy 部署 InternLM2 int4的pipeline进行推理用于英文翻

<a target="_blank" rel="noopener noreferrer" href="https://private-user-images.githubu

[Bug] InternLM2 int4 出现重复说话、重复前置内容（system prompt）现象 about internlm HOT 11 CLOSED

sanbuphy commented on September 25, 2024

[Bug] InternLM2 int4 出现重复说话、重复前置内容（system prompt）现象

from internlm.

Comments (11)

RangiLyu commented on September 25, 2024 1

我感觉可能是RLHF的时候有些过拟合了，导致模型变得过于helpful，一般表现为在回复的答案前后加过多额外的内容，没法严格遵循指令。
以及翻译名字变成书生浦语应该也是过拟合导致的，训练时候身份认知数据加太多导致“我的名字是”这几个token后面出现“书生浦语”的概率变得太高了。
chat模型实在纠正不过来的话，要不考虑换成没有rl过的chat-sft模型试试。不过我也不确定会不会变好。

from internlm.

sanbuphy commented on September 25, 2024

如果避免出现空格，似乎可以改善现象

from internlm.

RangiLyu commented on September 25, 2024

重复说translator_system_prompt的问题改用这种方式试试呢？system prompt放到system的role里面，另外再强化一下指令的要求：

prompts = [[
{
    'role': 'system',
    'content': '把下列文字翻译成中文，只返回给我翻译结果，不要输出任何额外内容'
},
{
    'role': 'user',
    'content': '待翻译的文本'
},]
response = self.model(prompts, gen_config)

from internlm.

sanbuphy commented on September 25, 2024

重复说translator_system_prompt的问题改用这种方式试试呢？system prompt放到system的role里面，另外再强化一下指令的要求：
prompts = [[
{
    'role': 'system',
    'content': '把下列文字翻译成中文，只返回给我翻译结果，不要输出任何额外内容'
},
{
    'role': 'user',
    'content': '待翻译的文本'
},]
response = self.model(prompts, gen_config)

仍然未改善哭泣，还是有类似现象

from internlm.

sanbuphy commented on September 25, 2024

有时候还会有这样的问题

from internlm.

sanbuphy commented on September 25, 2024

我感觉可能是RLHF的时候有些过拟合了，导致模型变得过于helpful，一般表现为在回复的答案前后加过多额外的内容，没法严格遵循指令。以及翻译名字变成书生浦语应该也是过拟合导致的，训练时候身份认知数据加太多导致“我的名字是”这几个token后面出现“书生浦语”的概率变得太高了。 chat模型实在纠正不过来的话，要不考虑换成没有rl过的chat-sft模型试试。不过我也不确定会不会变好。

感觉，得等下一版本？

from internlm.

github-actions commented on September 25, 2024

This issue is marked as stale because it has been marked as invalid or awaiting response for 7 days without any further response. It will be closed in 7 days if the stale label is not removed or if there is no further response.

from internlm.

github-actions commented on September 25, 2024

from internlm.

github-actions commented on September 25, 2024

This issue is closed because it has been stale for 7 days. Please open a new issue if you have similar issues or you have any new updates now.

from internlm.

hotmengmeng commented on September 25, 2024

请问你解决了么？是哪里有问题呀？我也出现同样的问题了

from internlm.

lvhan028 commented on September 25, 2024

hi, @hotmengmeng 请问用的是 lmdeploy 哪个版本？

from internlm.

[Bug] InternLM2 int4 出现重复说话、重复前置内容（system prompt）现象 about internlm HOT 11 CLOSED

Comments (11)

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent