Git Product home page Git Product logo

Comments (13)

jxg972 avatar jxg972 commented on May 25, 2024 1

在给python3安装rasa配套包的时候发现了问题,python包的默认安装目录\usr磁盘满了,增加了硬盘空间后,重装python2的rasa,这个问题解决了,可能是因为这个原因导致安装不完全?
rasa一直没报过错,所以一直没发现问题

from rasa_nlu_chi.

jxg972 avatar jxg972 commented on May 25, 2024 1

晕,订正一下,不是目录满的原因
是当前目录的原因,一直没注意到这个问题
我在随意目录下运行python -m rasa_nlu.train -c xxx/config_jieba_mitie_sklearn.json就会报错
必须先cd切换到git下来的rasa_nlu_chi目录下运行,就可以正常读取

from rasa_nlu_chi.

jxg972 avatar jxg972 commented on May 25, 2024 1

这个问题,你需要检查一下你的分词,你这样标注实体的话,你的分词必须是
['今天', '8点', '到', '9点45分', '有', '哪些', '闹钟']
但是如果使用结巴分词的话,默认的分词结果是
['今天', '8', '点到', '9', '点', '45', '分有', '哪些', '闹钟']

from rasa_nlu_chi.

crownpku avatar crownpku commented on May 25, 2024

相应这一段的你自己的标注json内容,能发出来看下吗?

from rasa_nlu_chi.

jxg972 avatar jxg972 commented on May 25, 2024

{
"text": "江铃E200VS东风风神AX7新能源",
"intent": "对比",
"entities": [
{
"start": 0,
"end": 6,
"value": "江铃E200",
"entity": "车系"
},
{
"start": 8,
"end": 18,
"value": "东风风神AX7新能源",
"entity": "车系"
}
]
}

from rasa_nlu_chi.

jxg972 avatar jxg972 commented on May 25, 2024

我有加入自定义词典

from rasa_nlu_chi.

crownpku avatar crownpku commented on May 25, 2024

"江铃E200"包含了汉字与英文数字字符,我感觉是python2的编码问题。
最简单的建议,你试下用python3跑试下?

from rasa_nlu_chi.

jxg972 avatar jxg972 commented on May 25, 2024

好,我去试试

from rasa_nlu_chi.

crownpku avatar crownpku commented on May 25, 2024

x_X

from rasa_nlu_chi.

crownpku avatar crownpku commented on May 25, 2024

Anyway,问题解决了就好:)

from rasa_nlu_chi.

BobCN2017 avatar BobCN2017 commented on May 25, 2024
  {
    "text": "今天8点到9点45分有哪些闹钟",
    "intent": "alarm_search",
    "entities": [
      {
        "start": 2,
        "end": 4,
        "value": "8点",
        "entity": "time"
      },
      {
        "start": 5,
        "end": 10,
        "value": "9点45分",
        "entity": "time"
      }
    ]
  },

2018-06-10 22:13:48 WARNING rasa_nlu.extractors.mitie_entity_extractor - Example skipped: Invalid entity {'start': 2, 'end': 4, 'value': '8点', 'entity': 'time'} in example '今天8点到9点45分有哪些闹钟': entities must span whole tokens. Wrong entity end.
2018-06-10 22:13:48 WARNING rasa_nlu.extractors.mitie_entity_extractor - Example skipped: Invalid entity {'start': 5, 'end': 10, 'value': '9点45分', 'entity': 'time'} in example '今天8点到9点45分有哪些闹钟': entities must span whole tokens. Wrong entity end.
我也报这个错误,类型都是这个前面一个数字后面一个中文。
尝试用python3运行,还是报同样的错误。
确认了目录,是在主目录下运行的。
python3是新装的,rasa相关的安装也在python3下装了一遍,也没有磁盘不足的问题。
有没有人碰到类似的问题?谢谢

from rasa_nlu_chi.

BobCN2017 avatar BobCN2017 commented on May 25, 2024

@jxg972 搞定了,确实是分词的问题,把8点什么的加入用户字典就行了

from rasa_nlu_chi.

zhoulijing01 avatar zhoulijing01 commented on May 25, 2024

您好,我定义的词典加载进去还是错误,没有进行训练rasa_rlu时候 自己亲自尝试使用使用该词典进行分词是正确的,您有好的建议吗

from rasa_nlu_chi.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.