Git Product home page Git Product logo

sentimentanalysisunisgml's Introduction

Sentiment Analysis of Pashto Text Using Machine Learning Techniques

Sentiment analysis has vast applications such as for political results predictions, decision-making related to different services and products, and recommendations of various items. People express their opinions on social media in the English language as well as their native languages. This project aims to carry out a sentiment analysis on one of the native languages called "Pashto". The Pashto language is the national language of Afghanistan, and it is spoken in many regions of Pakistan. We used online social networks generated corpus and annotated it into positive and negative by two different native and well-aware Pashto speakers. We performed binary classification using Supervised Learning algorithms including Support Vector Machine, Naive Bayes, decision Tree, Random Forest, and AdaBoost. The results are evaluated using the standard performance evaluation measures including Accuracy, F-measure, Precision, and Recall. The results show that the Naive Bayes achieved better accuracy than other ML algorithms.

Django Web app developed and deployed on 'pythonanawhere' server.
Project Live on : https://farhadmohmand66.pythonanywhere.com

About Corpus

The corpus of the Pashto language is generated from Facebook. The CSV file was created and stored every sentence according to the following fields: (i) ID (iii) Source (link) from where the comments were collected and topic of comments (iii) Pashto Text, this is the main text for SA (iv) English translation, (v) Annotator One and (vi) Annotator Two, these both annotators were a native speaker of Pashto language and well familiar of Pashto Text. The corpus belongs to three genres Politics, Sports, Dramas, and Movies combined. The corpus contains 300 rows of Politics, 150 rows of Sports, 150 rows of dramas and movies, and seven attributes. The link to the corpus is given below:

Corpus Link: https://www.kaggle.com/farhadkhan66/datasets

User view: image

๐Ÿ”— Contact:

linkedin twitter whatsApp

sentimentanalysisunisgml's People

Contributors

farhadmohmand66 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.