Git Product home page Git Product logo

vu_thesis_aspect_extraction_category_detection's Introduction

README

This repo contains the code used for my thesis project at the VU (Vrije Universiteit - Amsterdam).

Aspect Based Sentiment Analysis is an approach to sentiment analysis which analyses texts at the word/token level in order to determine whether or not it is or forms part of a phrase which is the target of a subjective expression. According to the parameters laid down in SemEval 2014, there are 4 subtasks:

  • aspect extraction (AE)
  • aspect polarity classification
  • aspect category detection (ACD)
  • category polarity classification

This thesis reviews studies involving both approaches and selected systems from both approaches to re-implement with the aim of evaluating the output to uncover qualitative differences. Minimally modified versions of 4 existing systems discovered in the literature (2 AE systems; 2 ACD systems) were implemented in order that output on an existing dataset could be inspected:

2 AE systems:

  • a system based on Qiu et al.'s Double Propogation algorithm (2011)
  • a system based on Xu et al.'s Double Embedding CNN system (2018)

2 ACD systems:

  • a system based on Kirtichenko et al.'s SemEval 2014 entry (2014)
  • a system based on Xue et al.'s Multi Task Neural Approach system (2019)

Resources:

The code and links to resources used are here for reference: Most of the code can be run from the terminal. Much of the data required is (temporarily) available in the data folder. This will be taken offline after review. Links to the original data include:

References:

  • S. Kiritchenko, X. Zhu, C. Cherry, and S. Mohammad. Nrc-canada-2014: Detecting aspects and sentiment in customer reviews. In Proceedings of the 8th international workshop on semantic evaluation (SemEval 2014), pages 437–442, 2014
  • G. Qiu, B. Liu, J. Bu, and C. Chen. Opinion word expansion and target extraction through double propagation. Computational linguistics, 37(1):9–27, 2011
  • H. Xu, B. Liu, L. Shu, and P. S. Yu. Double embeddings and cnn-based sequence labeling for aspect extraction.arXiv preprint arXiv:1805.04601, 2018
  • W. Xue, W. Zhou, T. Li, and Q. Wang. Mtna: a neural multi-task model for aspect category classification and aspect term extraction on restaurant reviews. In Proceed-ings of the Eighth International Joint Conference on Natural Language Processing (Volume 2: Short Papers), pages 151–156, 2017.

Thesis

A copy of the written part of the thesis is available here

vu_thesis_aspect_extraction_category_detection's People

Contributors

petercaine avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.