Git Product home page Git Product logo

carrychang / customer_satisfaction_analysis Goto Github PK

View Code? Open in Web Editor NEW
400.0 13.0 126.0 118.39 MB

基于在线民宿 UGC 数据的意见挖掘项目,包含数据挖掘和NLP 相关的处理,负责数据采集、主题抽取、情感分析等任务。目的是克服用户打分和评论不一致,实时对在线民宿的满意度评测,包含在线评论采集和情感可视化分析。搭建了百度地图POI查询入口,可以进行自动化的批量查询 POI 信息的功能;构建了基于在线民宿语料的 LDA 自动主题聚类模型,利用主题中心词能找出对应的主题属性字典;以用户打分作为标注,然后 litNlp 自带的字符级 TextCNN 进行情感分析,将情感分类概率分布作为情感趋势,最后通过 POI 热力图的方式对不同地域的民宿满意度进行展示。软件版本请见链接。

Home Page: https://github.com/CarryChang/Real_Time_DataMining_Software

License: Apache License 2.0

Python 100.00%
customer-satisfaction-analysis nlp-machine-learning sentiment-analysis litnlp

customer_satisfaction_analysis's Introduction

996.icu

Customer_satisfaction_Analysis

Stargazers over time

结果整合
Demo 演示
基于用户 UGC 的在线民宿满意度挖掘,负责数据采集、主题抽取、情感分析等任务。开发的目的是克服用户打分和评论不一致,实现了在线评论采集和用户满意度分析。
主要功能包括在线原始评论采集、主题聚类、评论情感分析与结果可视化展示等四个模块,如下所示。
  1. 提取后的民宿地址和在线评论等信息如下。
  1. 搭建了百度地图 POI 查询入口,可以进行自动化的批量查询地理信息。
  1. 通过高频词可视化展示,归纳出评论主题。
  1. 构建了基于在线民宿语料的 LDA 自动化主题聚类模型,利用主题中心词能找出对应的主题属性字典,并使用用户打分作为标注,然后通过多种分类模型,选用最优模型对提出的评价主体 进行情感分析,针对主题属性表进行主题提取后的文本进行情感分析,分别得出当前主题对应的情感趋势,横坐标为所有关于主题为“环境”的情感得分,纵坐标为对应的情感的条数,可以起到纵观当前“环境”主题下的情感趋势,趋势往右代表当前主题评价较好,总共有{“交通”,“价格”,“体验”,“服务”,“特色”,“环境”,“设施”,“餐饮”}的主题,选取“环境”主题进行可视化之后的结果如下图所示。
  1. 通过POI热力图的方式对在线民宿满意度进行展示。
  1. 代码结构如下。
专利信息
新版本特性
  1. 使用 litNLP 深度情感推理
  2. 增加多进程提高多个 topic 下的文本匹配速度
  3. Project_Main.py 直接完成细粒度情感极性可视化操作

customer_satisfaction_analysis's People

Contributors

carrychang avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

customer_satisfaction_analysis's Issues

您好,导入项目运行报错,No such file or directory: 'model_saved/tokenizer.pickle',谢谢

Traceback (most recent call last):
File "Project_Main.py", line 48, in
model_train()
File "D:\python3.7.8\爬虫\民宿评论分析\Customer_Satisfaction_Analysis-master\s
a_model_train.py", line 18, in model_train
model.train(train_data, label, num_classes=2, batch_size=256, epochs=2, verb
ose=1, evaluate=True)
File "D:\python3.7.8\lib\site-packages\litNlp\train.py", line 41, in train
tokenizer = self.train_tk(train_data)
File "D:\python3.7.8\lib\site-packages\litNlp\train.py", line 32, in train_tk
with open(self.tokenize_path, 'wb') as tokenize:
FileNotFoundError: [Errno 2] No such file or directory: 'model_saved/tokenizer.p
ickle'
请问少了什么吗,怎么解决?谢谢

model/fina_analysis_9.py

您好,请问fina_analysis_9.py运行时报错,提示NameError: name 'stand_score' is not defined

Some Problems

大佬你好!实践过后生成了各角度情感极性图,想要试下画热力图!大佬能不能分享下直接爬虫下来的带民宿地址的csv文件呀?非常感谢!

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.