Git Product home page Git Product logo

salihbout / headline_recommender Goto Github PK

View Code? Open in Web Editor NEW

This project forked from akhsassoualid/headline_recommender

0.0 1.0 0.0 33.71 MB

The project develops an application that suggests to the reader more similar articles to that he already read. It uses the embedding algorithms of headlines to create their own numerical representation, which allows to compute the similarity between headlines and get the most similar ones.

Python 97.97% Dockerfile 2.03%

headline_recommender's Introduction

A Recommender Engine for headlines articles using embedded words.

The project develop and application that suggest to reader more similar articles to the those they already read. It uses the embedding algorithms of headlines to create their own numerical representation, which allows to compute similarity between headlines and get the most similar ones.

For purpose of simplicity, we was satisfied only with headlines that concernes the year of 2018.

Steps of the project

We build the function "general_process" saved in the preprocessing.py file, to prepare the text data. Its output is the processed_data csv file, that contains the headlines after the preprocessing.

three algorithms are used to build a numerical representation of each headline, We talk about:

  • NMF and LDA factorization: We create a sparse matrix that composed of rows that represent each headlines and columns that represent each word in the entire vocabulary.
  • word2vec : A deeplearning approach, that uses an average word2vec of words composing that headline. those algorithms are exploited with the function "recommender_engine" developed in the recommender py file.

To excecute the app

Clone the repository in the commend line using the link : https://github.com/akhsassoualid/Headline_Recommender.

git clone https://github.com/akhsassoualid/Headline_Recommender.git

Install the necessary requirements :

pip install -r requirements.txt

Run the application savec in the app.py file

streamlit run app.py

Illustrate the application

A simple illustration of the App : Alt text

Special Thanks:

  • Google team of researchers for the Word2Vec trained model.
  • To the team of Streamlit for their open-source Python library to build applications.
  • To vikashrajluhaniwal for his tutorial about recommendation system.
  • To my friends Rachid and Salih for their help.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.