Git Product home page Git Product logo

caa's Introduction

CAA Dissertation Project

This project deals with the College Art Association (CAA) dissertation roster, which has been published since 1963, first in print and then online only. This roster provides information about the changing shape of the field of art history over the past sixty years, through a collective profile of recent PhDs. The dissertation roster is now published by caa.reviews, with entries beginning in 2002, and is updated yearly.

Ken Chiu of Binghamton University wrote the script, caa.py, which was used to scrape the data for completed dissertations from 2002 to 2018 in caa.reviews. The script uses the Beautiful Soup Python library for scraping. If you have any questions about using the script, or would like help modifying it, please contact Ken ([email protected]) or Nancy ([email protected]).

Nancy Um ran this script on July 22, 2020, which generated caa.csv. Some entries failed to populate due to formatting errors. The failed entries were saved separately. NU cleaned caa.csv with OpenRefine, which resulted in the identification of a few more failed entries. NU generated a new file, which contained all of the failed entries, cleaned it, and then combined it with the entries in caa.csv. The file caaTOTAL_OR.csv contains all of the entries from 2002 to 2018, including those that were harvested computationally and those that had to be entered by hand.

The R markdown file, caa.Rmd, includes the scripts that were used to process the data, relying upon the tidyverse suite of packages, along with the tokenizers and tidytext packages. Plots were generated using ggplot. The file subjects.csv includes the coded categories that were used to generate figures 10, 11, and 12 of the article, based on CAA's standard breakdown. These materials are intended to be paired with the article, Nancy Um, "What Do We Know about the Future of Art History? Let’s Start by Looking at Its Past, Sixty Years of Dissertations," published as a special essay in caa.reviews, August 18, 2020, http://www.caareviews.org/reviews/3797#.X0E0RC2ZO3I.

caa's People

Contributors

kennethchiu avatar nancyum avatar

Stargazers

 avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.