ioepas / zippy
Final Year Project for BCT: "Ranking Emails Based On Priority"
The data set contains a lot of redundant data and needs to be cleaned before we explore it.
CodeFactor found an issue: Use of unsafe yaml load. Allows instantiation of arbitrary objects. Consider yaml.safe_load().
It's currently on:
src\utils\params.py:33
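A minimal sketch of the suggested fix, assuming the parameters are read from YAML text with PyYAML. The function name `load_params` and the sample keys are hypothetical and are not taken from `params.py`; only `yaml.safe_load` comes from the CodeFactor message.

```python
import yaml  # PyYAML


def load_params(text: str) -> dict:
    """Parse YAML text with the safe loader.

    safe_load only builds plain Python types (dict, list, str, int, ...),
    so tags that ask for arbitrary Python objects are rejected.
    """
    data = yaml.safe_load(text)
    return data or {}


# Hypothetical parameter file contents, for illustration only.
config = load_params("model:\n  name: baseline\n  max_features: 5000\n")

# A document that tries to instantiate an arbitrary object is refused.
malicious = "!!python/object/apply:os.system ['echo unsafe']"
try:
    load_params(malicious)
    rejected = False
except yaml.YAMLError:
    rejected = True
```

With the safe loader, ordinary parameter files parse as before, while a document carrying a Python object tag raises an error instead of executing anything.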
For storing data, we need a remote storage solution (AWS/Azure/GCP).
Configure Travis CI for:
Linting
Unit Tests (#27)
Split from #2.
After #8 is merged, we can enable stricter docs warnings and even enforce them in Travis CI.
The Parakweet data set consists of more than 3,000 labeled emails. Each email is labeled according to whether it requires action. Emails that require action by the user would be ranked as more important.
CodeFactor found an issue: Use of unsafe yaml load. Allows instantiation of arbitrary objects. Consider yaml.safe_load().
It's currently on:
scripts\utils.py:13
For a better ranking algorithm, we could analyze the intent of each email using speech acts. The intent would determine whether the email requires action. The BC3 Corpus consists of 40 email threads with sentences labeled with speech acts and subjectivity. You can check the dataset information here. The dataset is in XML.
For now, is anyone available to parse the XML to a proper dataframe?
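As a starting point for the parsing task, here is a minimal sketch that flattens annotated XML into one row per sentence using only the standard library. The element and attribute names (`thread`, `email`, `sentence`, `speech_act`) and the sample document are hypothetical placeholders; the real corpus schema is not described in this issue and would need to be checked first.

```python
import xml.etree.ElementTree as ET

# Hypothetical sample; the real corpus layout may differ.
SAMPLE_XML = """
<root>
  <thread id="t1">
    <email id="e1">
      <sentence id="1.1" speech_act="request">Can you send the report?</sentence>
      <sentence id="1.2" speech_act="statement">It is due Friday.</sentence>
    </email>
  </thread>
</root>
"""


def xml_to_rows(xml_text: str) -> list:
    """Return one dict per sentence, carrying its thread and email ids."""
    root = ET.fromstring(xml_text.strip())
    rows = []
    for thread in root.iter("thread"):
        for email in thread.iter("email"):
            for sentence in email.iter("sentence"):
                rows.append({
                    "thread_id": thread.get("id"),
                    "email_id": email.get("id"),
                    "sentence_id": sentence.get("id"),
                    "speech_act": sentence.get("speech_act"),
                    "text": (sentence.text or "").strip(),
                })
    return rows


rows = xml_to_rows(SAMPLE_XML)
```

A list of dicts like `rows` can be passed straight to `pandas.DataFrame(rows)` to get the dataframe, with each key becoming a column.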
Write docs for Enron dataset
The data we get from make data/raw/emails.csv is very raw and crude. We need to process it into cleaner versions.
We can do either of the following:
Things to do:
raw_headers and raw_message. Ensure following correctness:
raw_<header_name>
from and to headers. Test.
raw_datetime. Test.
is_reply / is_forward.
Created new issue (#49) for remaining tasks:
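To illustrate the processing steps listed in this issue, here is a minimal sketch that splits one raw message into headers and body with the standard `email` package. The column names `raw_headers`, `raw_message`, `raw_datetime`, `is_reply` and `is_forward` come from the task list; the function name `split_raw_email`, the `raw_from` / `raw_to` keys, the sample message, and the subject-prefix heuristic for replies and forwards are assumptions, not the project's actual implementation.

```python
from email import message_from_string
from email.utils import parsedate_to_datetime


def split_raw_email(raw: str) -> dict:
    """Split a raw RFC 822 style message into header and body fields."""
    msg = message_from_string(raw)
    subject = (msg.get("Subject") or "").strip().lower()

    date_header = msg.get("Date")
    try:
        sent_at = parsedate_to_datetime(date_header) if date_header else None
    except (TypeError, ValueError):
        # Unparseable dates are kept as None rather than failing the row.
        sent_at = None

    return {
        "raw_headers": dict(msg.items()),
        "raw_message": msg.get_payload(),
        "raw_from": msg.get("From"),
        "raw_to": msg.get("To"),
        "raw_datetime": sent_at,
        # Assumed heuristic: infer reply/forward from the subject prefix.
        "is_reply": subject.startswith("re:"),
        "is_forward": subject.startswith(("fw:", "fwd:")),
    }


# Hypothetical sample message, for illustration only.
sample = (
    "Date: Mon, 14 May 2001 16:39:00 -0700\n"
    "From: a@example.com\n"
    "To: b@example.com\n"
    "Subject: Re: Meeting\n"
    "\n"
    "See you at 3pm.\n"
)
row = split_raw_email(sample)
```

Keeping each derived field in its own key makes it straightforward to write one test per field, as the task list asks.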