infosys / berguig


Berguig is an application that delivers a personalized feed of news, tweets, YouTube video links, and articles published by Gartner, based on the interests specified by the user. It assigns each item a score, produced by machine learning models, to highlight its relevance.

License: MIT License

Python 100.00%

berguig's Introduction

Berguig Application-Python

Berguig is a Python application that delivers a personalized feed of news, tweets, YouTube video links, and Gartner articles, based on the companies the user has marked as interests. The list of companies that interest each user is stored in XML files, which the Berguig program reads.

Example XML files

people.xml

Contains each person's name, mail ID, and the groups of companies or individual companies of interest.

companies.xml

Stores information about each company, such as its Twitter handle and YouTube channel, in the following form:

<companies_list>
  <c_id>c1</c_id>
  <c_name>company_name</c_name>
  channel_name
  <twitter_name>@twitter_account</twitter_name>
  #hashtag keyword1 keyword2
  <c_id>c1</c_id>
  <c_name>company_name</c_name>
  channel_name
  <twitter_name>@twitter_account</twitter_name>
  #hashtag keyword1 keyword2
</companies_list>
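As a rough illustration of how such a file could be read, here is a minimal sketch using Python's standard xml.etree.ElementTree module. It is not Berguig's actual loader. It assumes the entries sit in a flat list directly under companies_list and that each new c_id starts a new company. It uses only the three tags visible in the sample above (c_id, c_name, twitter_name). The function name parse_companies and the second sample entry (c2, other_company, @other_account) are hypothetical.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample limited to the tags visible in the README example.
SAMPLE = """<companies_list>
  <c_id>c1</c_id>
  <c_name>company_name</c_name>
  <twitter_name>@twitter_account</twitter_name>
  <c_id>c2</c_id>
  <c_name>other_company</c_name>
  <twitter_name>@other_account</twitter_name>
</companies_list>"""


def parse_companies(xml_text):
    """Group the flat child elements into one dict per company.

    Assumption: every company record begins with a <c_id> element.
    """
    root = ET.fromstring(xml_text)
    companies = []
    for child in root:
        if child.tag == "c_id":
            # A new c_id marks the start of the next company record.
            companies.append({})
        if companies:
            companies[-1][child.tag] = (child.text or "").strip()
    return companies


if __name__ == "__main__":
    for company in parse_companies(SAMPLE):
        print(company["c_id"], company["c_name"], company["twitter_name"])
```

If the real file wraps each company in its own element, the grouping step is unnecessary and each wrapper element can be turned into a dict directly.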

keywords_list.xml

Contains the list of keywords that the application uses to filter news articles from the Google News API.

log.xml

Contains the date on which the program was last executed.

Algo_4

Algo_4 is the Python program that trains the machine learning model. Its input is the scoring_v3.xlsx file. The user must score the articles generated by the application so that the model learns to classify news as relevant or non-relevant.

Scoring_v3.xlsx

This xlsx file holds the training data for the machine learning model. The user scores the articles generated by the application and stores them in this file with the following columns:

Category | Company | Source/User | Keywords | Subjects/Views | Title/Tweet | Date | Link | Useful | Relevance | Relevance_fig

The possible values of Useful are:

0,1,2 : Not Useful

3,4,5 : Useful

Relevance values, based on the Useful values:

0,1,2 : Irrelevant

3,4,5 : Relevant

Relevance_fig, based on the Relevance values:

Relevant : 1

Irrelevant : 0
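The mapping between the three scoring columns can be sketched in Python as follows. This is an illustration only and not code from the repository. It assumes that a Useful score of 3 or higher marks an item as useful and therefore Relevant, and that lower scores mark it as Irrelevant. The function name relevance_from_useful is hypothetical.

```python
def relevance_from_useful(useful):
    """Derive (Relevance, Relevance_fig) from a Useful score of 0 to 5.

    Assumption: scores 3-5 are useful and therefore relevant,
    scores 0-2 are not useful and therefore irrelevant.
    """
    if useful not in range(6):
        raise ValueError("Useful must be an integer from 0 to 5")
    relevance = "Relevant" if useful >= 3 else "Irrelevant"
    relevance_fig = 1 if relevance == "Relevant" else 0
    return relevance, relevance_fig


if __name__ == "__main__":
    for score in range(6):
        print(score, *relevance_from_useful(score))
```

Deriving both columns from the single Useful score keeps the three columns consistent when rows are filled in by hand.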
