PresDebates: US Presidential Debate Transcripts

This repository extends the debate transcript repository created by James Martherus, aiming for greater coverage of existing debate transcripts. In addition to the data provided in that repository, it includes primary debate transcripts from the 2000 presidential election onward.

This repository contains transcripts of presidential debates going back to 1960. It is not a complete collection: it includes only the debates hosted on The American Presidency Project. The data is organized by speaker and in chronological order. The dataset includes variables denoting whether a speaker is a candidate, the presidential election year, and the type of debate.

While I can make no guarantee that the transcripts are of good quality, please let me know if you find any issues with the data.

The data is also available as a .csv file in the 'data' folder.

Installation

To install PresDebates, use the install_github function from the devtools package:

library(devtools)
install_github("JosiahMcMillan/presdebates")

Then data can be accessed with the following commands:

library(PresDebates)
data(pres_debates)

Alternatively, you can download pres_debates.rda or pres_debates.csv directly from the data folder.
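
If you go the direct-download route, the CSV can be read with base R. The following is a minimal sketch that assumes the working directory is a local copy of the repository, so that the file sits at data/pres_debates.csv; adjust the path if you saved it elsewhere.

```r
# Assumption: the working directory is a local copy of the repository.
csv_path <- file.path("data", "pres_debates.csv")

if (file.exists(csv_path)) {
  pres_debates <- read.csv(csv_path, stringsAsFactors = FALSE)
  str(pres_debates)          # column names and types
  print(head(pres_debates))  # first few rows
} else {
  message("File not found: ", csv_path)
}
```

Inspecting the result with str() is a quick way to confirm the actual column names before writing any analysis code.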

Examples

Use cases can be found in the Examples folder. Currently only term frequency is shown, but other text mining methods such as topic modeling can be applied as well.
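
As a rough illustration of what a term-frequency count involves, here is a minimal base R sketch. It is not the code from the Examples folder. It runs on a small toy data frame, and the column names `speaker` and `text` are hypothetical placeholders; check names(pres_debates) for the real ones before adapting it.

```r
# Toy stand-in for the dataset. The column names `speaker` and `text`
# are hypothetical; the real dataset may use different names.
debates <- data.frame(
  speaker = c("A", "B", "A"),
  text = c(
    "The economy is strong.",
    "The economy needs work.",
    "Jobs and the economy matter."
  ),
  stringsAsFactors = FALSE
)

# Count how often each word occurs across a character vector of text.
term_frequency <- function(text) {
  # Lowercase, then split on anything that is not a letter or apostrophe.
  words <- unlist(strsplit(tolower(text), "[^a-z']+"))
  words <- words[nzchar(words)]  # drop empty strings left by the split
  sort(table(words), decreasing = TRUE)
}

freq <- term_frequency(debates$text)
print(head(freq, 5))
```

In practice you would likely remove common stop words such as "the" and "and" before counting, since they dominate the top of the table.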

Acknowledgments

  • The formatting of this repository and the files within came from James Martherus
  • Transcripts were scraped from The American Presidency Project Debate Archive
