elektrobohemian / culturalanalytics

Experiments and tutorials from the wide field of cultural analytics based on textual and multimodal corpora. Sample use cases range from image retrieval over sentiment analysis to network analysis. Based on Python, JavaScript, and D3.js.

License: Apache License 2.0

cultural-heritage digital-humanities visualization cultural-analytics

CulturalAnalytics

Experiments and tutorials from the wide field of cultural analytics based on textual and multimodal corpora.

Analyzing Unstructured Data

Superheroes teaser image

In this tutorial you will learn to:

  • scrape relatively unstructured data from Wikipedia (web/screen scraping)
  • transform unstructured data into tabular data to facilitate processing with Python
  • create graph data from your data to visualize your data as networks
  • export Python-created data to use it with JavaScript visualization libraries such as D3.js
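
The steps listed above can be sketched end to end with the Python standard library alone. This is a minimal illustration under stated assumptions, not the notebook's actual code: the HTML fragment, the column names `hero` and `team`, and the helpers `TableParser`, `html_to_records`, and `rows_to_graph` are all made up for this example. A real Wikipedia page would have to be downloaded and cleaned first.

```python
import json
from html.parser import HTMLParser

# Hypothetical stand-in for a scraped page fragment (not real Wikipedia markup).
SAMPLE_HTML = """
<table>
  <tr><th>hero</th><th>team</th></tr>
  <tr><td>Hero A</td><td>Team X</td></tr>
  <tr><td>Hero B</td><td>Team X</td></tr>
  <tr><td>Hero C</td><td>Team Y</td></tr>
</table>
"""


class TableParser(HTMLParser):
    """Collects the text of every <th>/<td> cell, grouped by table row."""

    def __init__(self):
        super().__init__()
        self.rows = []
        self._row = None
        self._in_cell = False

    def handle_starttag(self, tag, attrs):
        if tag == "tr":
            self._row = []
        elif tag in ("td", "th") and self._row is not None:
            self._in_cell = True
            self._row.append("")

    def handle_endtag(self, tag):
        if tag in ("td", "th"):
            self._in_cell = False
        elif tag == "tr" and self._row is not None:
            self._in_cell = False
            self.rows.append(self._row)
            self._row = None

    def handle_data(self, data):
        # Text outside of cells (whitespace between tags) is ignored.
        if self._in_cell:
            self._row[-1] += data.strip()


def html_to_records(html):
    """Turn the first row into column names and the rest into dict records."""
    parser = TableParser()
    parser.feed(html)
    header, *body = parser.rows
    return [dict(zip(header, row)) for row in body]


def rows_to_graph(records, source_key, target_key):
    """Build a node-link structure, a shape commonly used with D3 force layouts."""
    names = []
    for rec in records:
        for key in (source_key, target_key):
            if rec[key] not in names:
                names.append(rec[key])
    nodes = [{"id": name} for name in names]
    links = [{"source": r[source_key], "target": r[target_key]} for r in records]
    return {"nodes": nodes, "links": links}


records = html_to_records(SAMPLE_HTML)       # tabular data as a list of dicts
graph = rows_to_graph(records, "hero", "team")
graph_json = json.dumps(graph, indent=2)     # text that a JavaScript page could load
```

Writing `graph_json` to a file gives a JavaScript visualization something to load. The node-link layout is only one possible convention, and the notebook may use a different one.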

What you should already know:

  • a little Python 3
  • a little HTML
  • some JavaScript if you want to understand the web-based visualization at the end of the tutorial

This notebook comes with a requirements.txt file to facilitate the installation of package dependencies. To install the dependencies, launch the following command from the command line before you start the notebook:

pip3 install -r requirements.txt

Multimodal Analysis and Enrichment of a Library Metadata Corpus

This notebook eventually evolved into a TPDL publication. ATTENTION! The notebook is no longer maintained here. It has been moved to a separate repository.

  • In this tutorial, you will learn to read metadata from an OAI-PMH data provider and to convert the retrieved Dublin Core records into a pandas data frame.
  • Furthermore, you will carry out some basic data analysis on your data in order to find out if the data is corrupt or unclean. Based on an example, you will clean some aspects of your data using techniques borrowed from machine learning.
  • Finally, you will visualize data with the help of a network graph.
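
The first two steps above can be illustrated as follows. This is a sketch under stated assumptions, not the notebook's code: the XML is a made-up, heavily shortened `ListRecords` response with invented identifiers and titles, `dc_records` is a hypothetical helper, and only three Dublin Core fields are read. The namespace URIs are the ones defined by the OAI-PMH and Dublin Core specifications.

```python
import xml.etree.ElementTree as ET

# Namespaces defined by the OAI-PMH and Dublin Core specifications.
NS = {
    "oai": "http://www.openarchives.org/OAI/2.0/",
    "oai_dc": "http://www.openarchives.org/OAI/2.0/oai_dc/",
    "dc": "http://purl.org/dc/elements/1.1/",
}

# Hypothetical sample response; a real one would come from a data provider.
SAMPLE_RESPONSE = """
<OAI-PMH xmlns="http://www.openarchives.org/OAI/2.0/">
  <ListRecords>
    <record>
      <header><identifier>oai:example.org:1</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example title one</dc:title>
          <dc:creator>Example, Author</dc:creator>
          <dc:date>1900</dc:date>
        </oai_dc:dc>
      </metadata>
    </record>
    <record>
      <header><identifier>oai:example.org:2</identifier></header>
      <metadata>
        <oai_dc:dc xmlns:oai_dc="http://www.openarchives.org/OAI/2.0/oai_dc/"
                   xmlns:dc="http://purl.org/dc/elements/1.1/">
          <dc:title>Example title two</dc:title>
          <dc:creator>Sample, Writer</dc:creator>
        </oai_dc:dc>
      </metadata>
    </record>
  </ListRecords>
</OAI-PMH>
"""


def dc_records(xml_text):
    """Flatten each OAI-PMH record into a dict; missing fields become None."""
    root = ET.fromstring(xml_text.strip())
    rows = []
    for record in root.iterfind(".//oai:record", NS):
        row = {
            "identifier": record.findtext("oai:header/oai:identifier", namespaces=NS)
        }
        for field in ("title", "creator", "date"):
            row[field] = record.findtext(f".//dc:{field}", namespaces=NS)
        rows.append(row)
    return rows


rows = dc_records(SAMPLE_RESPONSE)

# A first, very basic cleanliness check: which records lack a date?
missing_dates = [r["identifier"] for r in rows if r["date"] is None]
```

If pandas is installed, `pandas.DataFrame(rows)` turns this list of dictionaries into a data frame. Dublin Core fields can repeat within one record, which this sketch ignores by reading only the first occurrence.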

Sentiment Analysis on the Berlin State Library Catalog and Amazon

In this tutorial, you will learn how to read from an unstructured and a structured dataset, create a dataframe from the raw data, and visualize characteristics of the data. The goal is to find out whether the titles of a research library are truly neutral from a sentiment analysis perspective and how they compare to a sample of books sold by Amazon.
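
To make the idea of comparing title sentiment across two collections concrete, here is a deliberately naive sketch. The text above does not say which sentiment method the notebook uses, so this word-list scorer is a swapped-in stand-in for illustration only. The lexicon, the titles, and the helper `title_score` are all made up.

```python
from statistics import mean

# Hypothetical mini-lexicon; a real analysis would rely on an established resource.
LEXICON = {
    "great": 1.0,
    "happy": 1.0,
    "love": 1.0,
    "terrible": -1.0,
    "war": -1.0,
    "death": -1.0,
}


def title_score(title):
    """Average polarity of the words found in the lexicon; 0.0 if none match."""
    words = [w.strip(".,:;!?").lower() for w in title.split()]
    hits = [LEXICON[w] for w in words if w in LEXICON]
    return mean(hits) if hits else 0.0


# Made-up titles standing in for the two corpora.
library_titles = ["Catalogue of Manuscripts", "A History of War and Death"]
shop_titles = ["The Great Happy Cookbook", "Love in a Terrible Year"]

# One summary value per collection; a score near 0.0 reads as neutral.
library_mean = mean(title_score(t) for t in library_titles)
shop_mean = mean(title_score(t) for t in shop_titles)
```

With real data, the per-title scores would normally go into a dataframe column so that their distributions, and not just their means, can be plotted side by side.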
