Berguig is a python application to deliver personalized news feed, tweets, youtube video links and gartner articles based on the company interests specified by the user. The list of companies that interest the user are stored in XML files, which are then read by the Berguig program.
Contains the person's name, mail id, groups of comapnies or individual companiees of interest
The information about the companies like twitter handle, youtube channel is stored in companies.xml as below
<companies_list> <c_id>c1</c_id> <c_name>company_name</c_name> channel_name <twitter_name>@twitter_account</twitter_name> #hashtag keyword1 keyword2 <c_id>c1</c_id> <c_name>company_name</c_name> channel_name <twitter_name>@twitter_account</twitter_name> #hashtag keyword1 keyword2 </companies_list>
contains a list of keywords that the application will use to filter the news articles from google news API.
Contains the last executed date of the program.
The Algo_4 is used to train the machine learning model. The input for this python program is the scoring_v3.xlsx file. The user has to score the articles generated by the application to train it to classify between relevant and non-relevant news.
This xlsx file is used to train the machine learning model. The user has to score the articles generated by the application and store in this xlsx in the format shown below:
Category Company Source/User Keywords Subjects/Views Title/Tweet Date Link Useful Relevance Relevance_fig
The possible values of useful are
0,1,2 : Not Useful
3,4,5 : Useful
Relevance values based on useful values:
0,1,2 : Relevant
3,4,5 : Irrelevant
Relevance_fig based in the Relevance values:
Relevant : 1
Irrelevant : 0