word-segment

A word segmentation tool based on HanLP.

Features

  1. NLP word segmentation
  2. Asynchronous word frequency counting and filtering
  3. Batch file import
  4. Export to Excel
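
As a rough illustration of the second feature, the sketch below counts word frequencies for several documents in parallel and then filters the result. It is not the project's code: the class WordFrequencySketch, its method names, and the minLength/minCount thresholds are all hypothetical, and it assumes the segmentation step (HanLP in this project) has already turned each document into a list of tokens.

```java
import java.util.HashMap;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.concurrent.CompletableFuture;
import java.util.stream.Collectors;

// Hypothetical helper, not taken from the project.
class WordFrequencySketch {

    // Counts the occurrences of each token in one already-segmented document.
    static Map<String, Long> count(List<String> tokens) {
        return tokens.stream()
                .collect(Collectors.groupingBy(t -> t, Collectors.counting()));
    }

    // Counts each document asynchronously on the common pool,
    // then merges the partial counts into one map.
    static Map<String, Long> countAllAsync(List<List<String>> documents) {
        List<CompletableFuture<Map<String, Long>>> futures = documents.stream()
                .map(doc -> CompletableFuture.supplyAsync(() -> count(doc)))
                .collect(Collectors.toList());
        Map<String, Long> total = new HashMap<>();
        for (CompletableFuture<Map<String, Long>> future : futures) {
            future.join().forEach((term, n) -> total.merge(term, n, Long::sum));
        }
        return total;
    }

    // Keeps terms that are at least minLength characters long and occur at
    // least minCount times, ordered by descending count, then by term.
    static Map<String, Long> filter(Map<String, Long> counts, int minLength, long minCount) {
        return counts.entrySet().stream()
                .filter(e -> e.getKey().length() >= minLength && e.getValue() >= minCount)
                .sorted(Map.Entry.<String, Long>comparingByValue().reversed()
                        .thenComparing(Map.Entry.<String, Long>comparingByKey()))
                .collect(Collectors.toMap(Map.Entry::getKey, Map.Entry::getValue,
                        (a, b) -> a, LinkedHashMap::new));
    }
}
```

Calling WordFrequencySketch.filter(WordFrequencySketch.countAllAsync(documents), 2, 2) would keep only terms of two or more characters that appear at least twice across all documents.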

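Usage

  1. Download data.zip and place its contents in the data directory at the project root:
.
├── LICENSE
├── README.md
├── data			# put the downloaded data here
├── docs
├── out.xlsx
├── pom.xml
├── src
├── summary.xlsx
├── target
└── word-segment.iml
  2. Edit src/main/resources/hanlp.properties: run pwd in the project directory and set root to the path it prints.
# Root directory for the paths in this configuration file; root + other path = full path (relative paths are supported, see https://github.com/hankcs/HanLP/pull/254)
# Windows users: always use / as the path separator
root=/Users/mac/project/PersonalRepository/word-segment
  3. Open the project in IDEA and run App.java.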
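
A wrong root value is an easy mistake to make in the configuration step. The sketch below is one way to check it before running the application. It is not part of the project: the class RootCheckSketch and its methods are hypothetical, and it assumes that the data directory is expected directly under root, as the directory tree above suggests.

```java
import java.io.IOException;
import java.io.Reader;
import java.nio.charset.StandardCharsets;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.Paths;
import java.util.Properties;

// Hypothetical helper, not part of the project.
class RootCheckSketch {

    // Reads the root property from a properties file; returns null if it is absent.
    static String readRoot(Path propertiesFile) throws IOException {
        Properties props = new Properties();
        try (Reader reader = Files.newBufferedReader(propertiesFile, StandardCharsets.UTF_8)) {
            props.load(reader);
        }
        String root = props.getProperty("root");
        return root == null ? null : root.trim();
    }

    // Returns true if <root>/data exists and is a directory
    // (assumption: the data directory sits directly under root).
    static boolean hasDataDirectory(String root) {
        return root != null && Files.isDirectory(Paths.get(root, "data"));
    }

    public static void main(String[] args) throws IOException {
        // Defaults to the path named in the usage steps; pass another path as the first argument.
        Path file = Paths.get(args.length > 0 ? args[0] : "src/main/resources/hanlp.properties");
        String root = readRoot(file);
        System.out.println("root = " + root);
        System.out.println(hasDataDirectory(root)
                ? "data directory found under root"
                : "data directory missing under root");
    }
}
```

Run from the project root, this prints the configured root and reports whether a data directory exists beneath it.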

Examples

[Screenshot: image-20190412204743502]

[Screenshot: image-20190412204715563]

References

  1. HanLP
  2. EasyExcel
