Light

joinn99 / stackoverflow_expertfinding Goto Github PK

View Code? Open in Web Editor NEW

1.0 1.0 0.0 19 KB

StackOverflow Expert Finding

Python 98.97% Shell 1.03%

stackoverflow_expertfinding's Introduction

StackOverflow潜在专家预测实验测试

环境配置

Ubuntu系统下，运行 setup.sh 下载数据，或使用数据生成部分自行生成数据。其他系统可根据requirements.txt安装依赖库，并生成或下载数据。
解压下载的".7z"压缩包到"Data/"。示例:7za x Data/StackExpert.7z -oData

运行

运行"main.py"，可选择以下几种功能：

网格搜索法搜索参数。

模型会随机生成符合条件的参数网格，并寻找最优参数。参数网格的限制可在"Data/params.json"中修改。实验使用了四种分类器（梯度提升树(GBDT)、极度随机森林、支持向量机、岭回归）和ADASYN过采样算法。 (实验会持续循环进行，可使用Ctrl+C 退出)
寻找最可能的潜在专家。

模型会根据网格搜索的结果使用最优参数，预测出潜在专家概率最大的20名用户并显示他们的ID。
显示最优参数。

根据网格搜索的结果筛选出最优参数，并直接显示出来。
模型性能分析。

模型会根据网格搜索的结果使用最优参数，绘制准确率-召回率曲线，并输出特征重要性的数值。

在运行足够时长的功能[1]后，才能在功能[2-4]中获得较为准确的结果。

数据

Expert data:

Baidu Yun (密码:7jqb)

stackoverflow_expertfinding's People

Contributors

Stargazers

Watchers

Recommend Projects

React

A declarative, efficient, and flexible JavaScript library for building user interfaces.
Vue.js

🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
Typescript

TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
TensorFlow

An Open Source Machine Learning Framework for Everyone
Django

The Web framework for perfectionists with deadlines.
Laravel

A PHP framework for web artisans
D3

Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

javascript

JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
web

Some thing interesting about web. New door for the world.
server

A server is a program made to process requests and deliver data to clients.
Machine learning

Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Visualization

Some thing interesting about visualization, use data art
Game

Some thing interesting about game, make everyone happy.

Recommend Org

Facebook

We are working to build community through open source technology. NB: members must have two-factor auth.
Microsoft

Open source projects and samples from Microsoft.
Google

Google ❤️ Open Source for everyone.
Alibaba

Alibaba Open Source for everyone
D3

Data-Driven Documents codes.
Tencent

China tencent open source team.