Git Product home page Git Product logo

huatwl's Projects

bayes icon bayes

用java实现的贝叶斯分类算法。用于大数据的分类。

chinesetextclassifier icon chinesetextclassifier

实现中文文本分类,支持文件、文本分类,基于多项式分布的朴素贝叶斯分类器。由于工作实际应用是二分类,加之考虑到每个分类属性都建立map存储词语向量可能引起的内存问题,所以目前只支持二分类。当然,直接复用这个结构扩展到多分类也是很容易。之所以自己写,主要原因是没有仔细研读mahout、weka等代码,不能灵活地进行中文分词、停用词过滤、词频统计、TF-IDF等,也就是向量化和特征提取没有自己手写相对灵活。

cs-notes icon cs-notes

:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计、Java、Python、C++

cws_evaluation icon cws_evaluation

Java开源项目cws_evaluation:中文分词器分词效果评估对比

emotionanalysis icon emotionanalysis

针对手机评论数据的情感挖掘与分析项目,基于依存句法分析和情感词库提取特征词,并对特征词做情感极性预测标注。

fnlp icon fnlp

中文自然语言处理工具包 Toolkit for Chinese natural language processing

fptree icon fptree

FPtree algorithm to mining frequent pattern

jcsprout icon jcsprout

👨‍🎓 Java Core Sprout : basic, concurrent, algorithm

sentiment-1 icon sentiment-1

基于情感词典和朴素贝叶斯算法实现中文文本情感分类

sparktextclassifier icon sparktextclassifier

使用Spark NaiveBayes 实现中文文本分类 use spark NaiveBayes for text classification

sso icon sso

单点跨域登录系统,同时搭配权限拦截器

tomcat-research icon tomcat-research

Tomcat源代码学习研究(包括代码注释、文档、用于代码分析的测试用例)

weibo-spider icon weibo-spider

新浪微博爬虫,采用Java语言开发,基于HTTPClient 4.0,采用MySQL存储爬取数据,支持多进程并发执行。功能包括:爬取微博、评论、转发、关注列表(层次)。根据数据需求,持续更新...

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.