Git Product home page Git Product logo

ys19-project_1's Introduction

YΣ19 Artificial Intelligence II (Deep Learning for Natural Language Processing) Fall Semester 2021 Homework 1 20% of the course mark Announced: November 5, 2021 Due: November 25, 2021 before 23:59)

  1. In this exercise you will develop a vaccine sentiment classifier using softmax regression. The classifier will be trained on twitter data and will distinguish 3 classes (neutral, antivax and pro-vax). You will train your classifier using the datasets provided on e-class for this homework.1 The datasets are CSV files with the tweet texts and the labels 0 (neutral), 1 (anti-vax) or 2 (pro-vax). The first dataset is the training dataset and the second one is the validation dataset. Your model will be evaluated by us on a test set that will not be provided to you, but will be of the same format as the training and validation datasets. Your code should be written in such a way so that the model can be evaluated on the test simply by changing the name of the file to load. Start by reading about softmax regression from the slides of the course, the relevant chapters 4 and 5 of the “Speech and Language Processing” book of Jurafsky and Martin (http://web.stanford.edu/~jurafsky/slp3/) or any other relevant literature which you may find. 1Thanks to Dr. Saptarshi Ghosh and Soham Poddar from the Department of Computer Science and Engineering, IIT Kharagpur, India for providing us these datasets. 1You should use the toolkit Scikit-Learn for your implementation. You should plot learning curves that show that your models are not overfitting or underfitting. You should also evaluate your classifiers on the validation set using precision, recall and F-measure. Finally, we would like to stress that the goal of this exercise is not to simply create a classifier that can achieve a good score, but to get acquainted with the methodology of an NLP project, to experiment with different options and to deliver a report that presents your solution along with a brief performance comparison between the different options that you have tried. (9/10 marks) Note: You should hand in: (i) a pdf with the answer to question 1 and a detailed report of your solution to question 2, including citations to relevant literature that you might have used in developing your solutions. (ii) one Colab notebook (ipynb files using https://colab. research.google.com/) containing your code for question 2. You should use Python 3.6 or a later version. 2

ys19-project_1's People

Contributors

jtile avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.