Git Product home page Git Product logo

Hi there 👋

🔭 I currently work at home.

Shawn's Projects

996.icu icon 996.icu

Repo for counting stars and contributing. Press F to pay respect to glorious developers.

antfu-handle icon antfu-handle

A Chinese Hanzi variation of Wordle - 汉字 Wordle (无限版)

cnki_patent_svm icon cnki_patent_svm

文本分类是指在给定分类体系下 , 根据文本的内容自动确定文本类别的过程。首先我们根据scrapy爬虫根据**知网URL的规律,爬取70多万条2014年公开的发明专利,然后通过数据清洗筛选出了60多万条含标签数据。通过TF-IDF对60多万条本文进行词频提取,依照词频排序提取前3000个词语形成语义词典,然后根据观察设置停用词。然后再用TF-IDF的方式对每个摘要进行词频选取,通过布尔模型,对比语义词典生成文本向量。然后对标签进行数字化转换。取90%的文本为训练集,10%的文本为测试集。用有监督学习的SVM算法对文本进行分类,(人类生活必需品、作业运输、化学冶金、纺织造纸、固定建筑物、机械工程、物理学、电学)分成8类

dataflowkit icon dataflowkit

Extract structured data from web sites. Web sites scraping.

disgord icon disgord

Go module for interacting with the documented Discord's bot interface; Gateway, REST requests and voice

etl icon etl

数据基本清洗包括日期、时间、数值、字符串、字符、金钱、数据库(mysql、postgresql、mongodb、hbase、hdfs、memcached)、加解密(md5、sha、base64、aes、rsa)、文件、http服务、正则表达式等,后期会不断更新。

gpt-2 icon gpt-2

Code for the paper "Language Models are Unsupervised Multitask Learners"

hugo icon hugo

The world’s fastest framework for building websites.

libfaketime icon libfaketime

libfaketime modifies the system time for a single application

mentohust icon mentohust

Automatically exported from code.google.com/p/mentohust

pholcus icon pholcus

Pholcus is a distributed high-concurrency crawler software written in pure golang

qbot icon qbot

[🔥updating ...] 自动量化交易机器人 Qbot is an AI-oriented quantitative investment platform, which aims to realize the potential, empower AI technologies in quantitative investment. https://ufund-me.github.io/Qbot :news: qbot-mini: https://github.com/Charmve/iQuant

scrapy-examples icon scrapy-examples

Multifarious Scrapy examples. Spiders for alexa / amazon / douban / douyu / github / linkedin etc.

sentimentpolarityanalysis icon sentimentpolarityanalysis

情感极性分析repository1:基于情感词典、k-NN、Bayes、最大熵、SVM的情感极性分析。

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.