

tm4ss - Text Mining for Social Scientists and Digital Humanists

This course consists of 8 tutorials written in R Markdown. It is described in more detail in the paper cited under License & Citation.

You can use knitr to build the tutorial sheets as HTML notebooks from the R Markdown source code.

The rendered tutorials are available in the /docs folder.

Tutorials

  1. Web crawling and scraping
  2. Text data import in R
  3. Frequency analysis
  4. Key term extraction
  5. Co-occurrence analysis
  6. Topic models (LDA)
  7. Text classification
  8. Part-of-Speech tagging / Named Entity Recognition


Render from source

Clone the repository:

git clone https://github.com/tm4ss/tm4ss.github.io.git

Open the Tutorials.Rproj R project file and run:

rmarkdown::render_site(output_format = "html_document")
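
If you prefer to render from a terminal instead of opening the project file, the same call can be wrapped in a small shell function. This is only a sketch under assumptions: the function name `render_tutorials` is hypothetical, `Tutorials.Rproj` is assumed to sit at the repository root, and R with the rmarkdown package and pandoc is assumed to be installed and on your PATH.

```shell
# Hypothetical helper (not part of the repository): render the site
# non-interactively from a cloned copy of the repository.
render_tutorials() {
  # Directory of the clone; defaults to the name git clone creates.
  repo_dir="${1:-tm4ss.github.io}"

  # Assumption: Tutorials.Rproj lives at the repository root.
  if [ ! -f "$repo_dir/Tutorials.Rproj" ]; then
    echo "Tutorials.Rproj not found in '$repo_dir'; clone the repository first." >&2
    return 1
  fi

  # Run in a subshell so the caller's working directory is unchanged.
  # The R call is the one given in this README.
  (
    cd "$repo_dir" &&
      Rscript -e 'rmarkdown::render_site(output_format = "html_document")'
  )
}
```

After cloning, `render_tutorials tm4ss.github.io` would run the render step; without an argument the function looks for a `tm4ss.github.io` directory in the current folder.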

License & Citation

This course was created by Gregor Wiedemann and Andreas Niekler. It was released under GPLv3 in September 2017. If you use it, or parts of it, for your own teaching or analysis, please cite:

Wiedemann, Gregor; Niekler, Andreas (2017): [Hands-on: a five day text mining course for humanists and social scientists in R](http://ceur-ws.org/Vol-1918/wiedemann.pdf). Proceedings of the 1st Workshop Teaching NLP for Digital Humanities (Teach4DH@GSCL 2017), Berlin.


BibTeX

@inproceedings{WN17,
  author    = {Gregor Wiedemann and Andreas Niekler},
  title     = {Hands-On: {A} Five Day Text Mining Course for Humanists and Social Scientists in {R}},
  booktitle = {Proceedings of the Workshop on Teaching {NLP} for Digital Humanities
               ({Teach4DH@GSCL 2017}), Berlin, Germany, September 12, 2017.},
  pages     = {57--65},
  year      = {2017},
  crossref  = {DBLP:conf/gldv/2017teach4dh},
  url       = {http://ceur-ws.org/Vol-1918/wiedemann.pdf},
}
