Git Product home page Git Product logo

nlp_hw2's Introduction

NLP_HW2

This repository holds a solution for Assignment 2 in the course Netural Language Processing held during semester 1, 2016, by Professor [Michael Elhadad] (https://www.cs.bgu.ac.il/~elhadad/)

Assignment

This assignment covers the topic of statistical distributions, regression and classification. The objective is:

  1. Experiment and evaluate classifiers for the tasks of named entity recognition and document classification.
  2. Explore the task of Document Classification, comparing email spam detection, SMS spam detection and news categorization.
  3. Explore the problem of domain adaptation by comparing the performance of classifiers trained in one domain when tested in another.
  4. Explore the task of Named Entity Recognition (NER), which features work for this task, and which classifier algorithms help: ..* -logistic regression, Naive Bayes and HMM. ..* -Use pre-trained word embeddings and measure whether they help for the task of NER.

Code Intructions

The solution for the Assignment is in the form of an iPython (Jupyter) notebook file (with extension ipynb). The main noteboos solution file is p1.ipynb

Most important code is also located in a .py file under/code/assignment2.py

The Spam Data set should be extracted to the folder data, as it is left empty.

Submitted Solution

An organized submitted solution is available under release

NLP161

The course is an introduction to Natural Language Processing. The main objective of the course is to learn how to develop practical computer systems capable of performing intelligent tasks on natural language: analyze, understand and generate written text. This task requires learning material from several fields: linguistics, machine learning and statistical analysis, and core natural language techniques. [course web site] (https://www.cs.bgu.ac.il/~elhadad/nlp16.html)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.