Git Product home page Git Product logo

textinfoexp's Introduction

构建实验环境

1 安装python(2.7) https://www.python.org/

2 安装pip:

2.1 下载pip https://pypi.python.org/pypi/pip/9.0.1

2.2 解压缩后,安装指令 python setup.py install

2.3 pip升级 python -m pip install --upgrade pip

2.4 pip安装扩展包 pip install jieba (这里以jieba包为例),如果速度较慢,可改为国内的阿里源, 即 pip install jieba -i http://mirrors.aliyun.com/pypi/simple/ --trusted-host mirrors.aliyun.com

4 使用GitHub获取代码

4.1 安装git https://git-scm.com/

4.2 登陆自己的GitHub账号,找到自己的项目,(别人的需要先fork过来,也可以直接 git clone xxx,或者直接下载zip包放进pycharm)

4.3 打开pycharm,首先设置git的位置及github账号,点击Test都通过后继续,依次在菜单栏点击 VCS checkout from version control GitHub,登陆自己的账号后选择相应的项目,得到代码。

4.4 (更新fork的项目到最新的版本)Syncing a fork https://help.github.com/articles/syncing-a-fork/

5 ipython交互式开发环境

5.1 安装ipython pip install ipython

5.2 安装jupyter(即notebook) pip install notebook

5.3 jupyter notebook 启动,打开浏览器即可(默认1224端口)

textinfoexp's People

Contributors

haibaoy avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

textinfoexp's Issues

遗漏的文件

楼主能把完整的文件发上来吗?运行的时候都缺少文本文件。

关于part2_text_classify

本章 获取数据和标记中代码如下:
data = pd.read_table('Art.txt', header=None, sep=',')
data2 = pd.read_table('Computer.txt', header=None, sep=',')
data3 = pd.read_table('Sports.txt', header=None, sep=',')
但是在代码和相关资源中并未发现art.txt等三个文件,请问这三个文件是否可以上传一下?谢谢

关于数据集

想问下采用的是哪里的搜狗数据集作为训练用的,谢谢

关于训练集的问题

part4 词向量训练的语料完全木有说明,语料方便的话你上传一下,不方便的话,你好歹说明一下啊,比如用的什么语料,下载连接之类的?

中文近义词库

hi, 你好

sogou的开放语料质量不错,wikidata也不错,下面是我做的一个word2vec模型。
https://github.com/huyingxi/Synonyms
欢迎对比和使用,一起优化,谢谢!

对此处给出的相似度计算方法:
https://github.com/Roshanson/TextInfoExp/tree/master/Part4_Word_Similarity/get_similarity

我们可以一起评测一下:
Synonyms使用https://github.com/fssqawj/SentenceSim/blob/master/train.txt 来寻找最佳的模型参数,然后在 https://github.com/fssqawj/SentenceSim/blob/master/dev.txt 达到了 88%的准确度。
详见:chatopera/Synonyms#6

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.