Git Product home page Git Product logo

My NLP Projects:

  • Created a tool that can detect the sentiment polarity (either positive or negative) of Book reviews written in Bengali Text.
  • Collected 1k book reviews from different online book shops as well as social media groups. Among these reviews 528 reviews are labelled as positve and 472 reviews are labelled as negative sentiment.
  • Extract Unigram, Bigram and Trigram features from the cleaned Text and use the TF-idf vectorizer as a feature extraction technique.
  • Employed different machine learning classifiers for the classification purpose. The used classifiers are Logistic Regression, Decision Tree, Multinomial Naive Bayes, Support Vector Machine and so on.
  • Evaluate the performance of the classification for every gram feature. Accuracy, Precision, Recall, F1-score, ROC curve and Precision-Recall curve used as evaluation metrics.
  • Finally, created a client facing API using Flask. App link
  • Publication: Link

book

  • Created a tool that can categorizes the Bengali news headlines into six category (National, Politics, International, Sports, Amusement, IT) using deep recurrent neural network.
  • A dataset of 0.13 Million news headlines is created. Chrome web scrapper used for scraping the news headlines from different Bengali online news portals such as Dainik Jugantor, Dainik Ittefaq, Dainik Kaler Kontho and so on.
  • Word embeeding feature represtations technique is used for extracting the semantic meaning of the words.
  • A deep learning model has been built by using a bidirectional gated recurrent network.
  • Finally, the model performance is evaluated using various evaluation measures such as confusion matrix, accuracy , precision, recall and f1-score.

headline

  • Created a tool that can identify the sentiment of a restaurant review written in Bengali Text. It classifies a review as positive or negative sentiment.
  • Collected 1.4k Bengali restaurant reviews from different social media groups of food or restaurant reviews. Among these reviews 630 reviews are labelled as positve and 790 reviews are labelled as negative sentiment.
  • Extract Unigram, Bigram and Trigram features from the cleaned Text and use the TF-idf vectorizer as a feature extraction technique.
  • Employed different machine learning classifiers for the classification purpose. The used classifiers are Logistic Regression, Decision Tree, Multinomial Naive Bayes, Support Vector Machine, Stochastic Gradient Descent and so on.
  • Evaluate the performance of the classification for every gram feature. Accuracy, Precision, Recall, F1-score, ROC curve and Precision-Recall curve used as evaluation metrics.
  • Finally, created a client facing API using Flask and deployed into cloud using Heroku. App Link
  • Publication: Link

rest

  • Developed a machine learning model that can classify the sentimental category (positive, negative and neutral) of a news comment written in Bangla Text.
  • For the implementation a publicly available dataset of 12k news comments have been used.
  • To create the system TF-idf feature extraction technique with n-gram features have been used.
  • Analysed the performance of different machine learning algorithms for n-gram feature by using various evaluation metrics such as accuracy, precision, recall and f1-score.

comment

  • Created a tool that can categorizes the Bengali news articles into 12 diffferent categories (Art, Politics, International, Sports, Science, Economics, Crime, Accident, Education, Entertainment, Environment, Opinion) using Deep Learning.
  • A publicly available dataset of 0.1 Million news articles is used to develop the system. The dataset consist 12 different categories news articles.
  • Word embeeding feature represtations technique is used for extracting the semantic meaning of the words.
  • A deep learning model has been built by using a Convolutional Neural Network and Long Short Term Memory.
  • The model performance is evaluated using various evaluation measures such as confusion matrix, accuracy , precision, recall and f1-score.
  • Finally, developed a client facing API using flask and heroku.
  • Here is the developed Flask App : Document Categorizer App

document

  • Created a word embedding model for Bangla text corpus.
  • Used Word2Vec algorithm.
  • Used a publicly availabe dataset of 0.1 Milion Bangla news articles.
  • Visualized the word similarity using t-sne plot.

word2vec

Eftekhar Hossain's Projects

al-folio icon al-folio

A beautiful, simple, clean, and responsive Jekyll theme for academics

awesome-pytorch-list icon awesome-pytorch-list

A comprehensive list of pytorch related content on github,such as different models,implementations,helper libraries,tutorials etc.

bangla-news-comments icon bangla-news-comments

Sentiment Analysis of Bangla news comments. This work is implemented on a publicly available Bengali news comments dataset.

bangla-news-headlines-categorization icon bangla-news-headlines-categorization

A Bengali Text Classification Task Using Deep Learning. It is Multi-class Text Classification Problem. Bengali news headlines used for the categorization.

bengali-book-reviews icon bengali-book-reviews

A task of sentiment analysis. Classify the Bengali book reviews into positive and negative sentiment.

bengali-hate-speech-dataset icon bengali-hate-speech-dataset

Dataset for identifying potential hates (e.g., political, religious, personal, gender abusive, geopolitical, etc.) for under-resourced Bengali language.

bengali-restaurant-reviews icon bengali-restaurant-reviews

[IEEE'19] [Dataset] Sentiment Analysis of Bengali Texts on Online Restaurant Reviews Using Multinomial Naive Bayes

constraint-aaai2021 icon constraint-aaai2021

Description of the system and its results that we developed as a part of our participation at CONSTRAINT shared task in AAAI-2021.

cuet_nlp-eacl_2021 icon cuet_nlp-eacl_2021

This repository contains the system description and the codes that we implemented for participating in EACL-2021 shared tasks.

memosen-lrec2022 icon memosen-lrec2022

[LREC'22] MemoSen: A Multimodal Dataset for Sentiment Analysis of Bengali Memes.

mute-aacl22 icon mute-aacl22

[AACL'22] A Multimodal Dataset for Bengali Hateful Memes Detection

nlp-projects icon nlp-projects

Hello, welcome to my portfolio. I am passionate about machine learning, natural language processing, computer vision and data science.

skbi_training icon skbi_training

This repository contain the notebooks and other materials of AI and Machine Learning Training

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.