wm-semeru / securereqnet

We present a novel approach, called SecureReqNet, for automatically identifying whether issues in bug or issue tracking systems describe security-related content that should be given careful attention. Our approach consists of a two-phase deep learning architecture that operates purely on the natural language descriptions of issues. The first phase uses an unsupervised learning process to learn high-dimensional sentence embeddings from hundreds of thousands of descriptions, drawn from software vulnerabilities listed in the CVE database and from issues in open-source projects. The second phase then uses this semantic ontology of embeddings to train a deep convolutional neural network capable of predicting whether a given issue contains security-related information.

License: Apache License 2.0

Python 0.64% Jupyter Notebook 99.35% Dockerfile 0.01% Shell 0.01% Makefile 0.01%

securereqnet's People

Contributors

danaderp, dependabot[bot], eward21, hoppe8, jasonlin45, jfsvo, rmclanton

securereqnet's Issues

I wonder where I could find the implementation of vectorize_sentence

Hi, danaderp. Many thanks for your work. I have a question: in the notebook "alpha_securereqnet.ipynb", there is a line from vectorize_sentence import Embeddings, but I cannot figure out where this module is supposed to come from. Is it possible that you did not upload this file? Thanks for your help! Best regards!

Testing for Alpha SecureReqNet

Determine and implement tests for Alpha SecureReqNet that verify the accuracy of the pretrained model and that training works properly.

Configure TFX endpoint

Set up a TensorFlow Extended (TFX) endpoint for internal use and as an API bridge to other team(s).

Remove or archive files without project functionality

02_Statistical Test and 03_Clustering seem to be tests of code needed to run SecureReqNet that is now used elsewhere. Whether these files still have any use should be verified with coverage tests or by asking the project manager.

Update index.ipynb

As the project is migrated to nbdev, continually update index.ipynb with documentation and code examples

Refactor settings.ini for WM_SEMERU/SecureReqNet

This is the last step, to be done at the end of development. While working on a development fork, it is necessary to change the path to the library documents. This change needs to be undone before or after merging back to the main project so that the documents generate properly.

Update project diagram

As functionality is added to securereqnet in each sprint, the project diagram should reflect the additions.

Setup settings.ini

The description, project name and keywords sections need to be uncommented. The official description should be added and the keywords for nbdev decided on. The project name must match the name that will be used for package installation. index.ipynb in particular needs this information to compile correctly.

Identify and correct problems with augmented_dataset_csv/issues

The ExampleGen component of the TFX pipeline officially supports only a few data formats for ingestion. As we have decided to go with CSV, the contents of augmented_dataset/ must be converted from .txt to .csv.

An initial conversion has been made in augmented_dataset_csv/, but beam_dag_runner.py rejects the files, even though they appear to be correctly formatted.

It would also be helpful to merge the rows in ground_truth.csv with their respective issues, but this crashes my machine for an unknown reason.
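
One way to attempt both the conversion and the merge without holding every issue in memory is to stream rows with Python's csv module. The following is a minimal sketch under stated assumptions, not the project's conversion script: it assumes each issue is a single .txt file whose file name (without extension) is the issue ID, and that ground_truth.csv contains rows of the form issue ID, label. That layout, the output column names, and the function name merge_issues_with_labels are all hypothetical.

```python
import csv
from pathlib import Path


def merge_issues_with_labels(txt_dir, ground_truth_csv, out_csv):
    """Write one CSV row per issue (issue_id, text, label) and return the row count.

    Assumes (hypothetically) one issue per .txt file named <issue_id>.txt
    and a ground-truth CSV whose rows look like: issue_id,label
    """
    # The label table is small compared with the issue texts, so keep it in a dict.
    with open(ground_truth_csv, newline="", encoding="utf-8") as f:
        labels = {row[0]: row[1] for row in csv.reader(f) if len(row) >= 2}

    written = 0
    with open(out_csv, "w", newline="", encoding="utf-8") as out:
        writer = csv.writer(out, quoting=csv.QUOTE_ALL)
        writer.writerow(["issue_id", "text", "label"])
        # Process one file at a time so memory use stays bounded.
        for path in sorted(Path(txt_dir).glob("*.txt")):
            text = path.read_text(encoding="utf-8", errors="replace")
            # Collapse line breaks and runs of whitespace into single spaces,
            # so that every record occupies exactly one physical line.
            text = " ".join(text.split())
            # Issues without a ground-truth entry get an empty label.
            writer.writerow([path.stem, text, labels.get(path.stem, "")])
            written += 1
    return written
```

Embedded line breaks and unquoted commas are common reasons for a CSV reader to reject a file that looks correct at a glance. Whether either is the cause of the rejection described here is not known, but quoting every field and keeping one record per line rules both out.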

Refactor utils/read_data.py

The get_test_and_training method in read_data does not load the data correctly. It needs to be modified to look inside the zipped data file. The path from the root to the content folder is ./data/augmented_dataset_augmented_dataset.zip
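
Reading directly from the archive can be done with the standard-library zipfile module, without extracting it first. The sketch below only illustrates that idea and is not the project's get_test_and_training: the helper name read_texts_from_zip and the assumption that the archive holds one UTF-8 .txt file per issue are hypothetical.

```python
import zipfile
from pathlib import PurePosixPath


def read_texts_from_zip(zip_path, suffix=".txt"):
    """Return {file stem: text} for every archive member whose name ends with suffix."""
    texts = {}
    with zipfile.ZipFile(zip_path) as archive:
        for info in archive.infolist():
            # Skip directory entries and files of other types.
            if info.is_dir() or not info.filename.endswith(suffix):
                continue
            with archive.open(info) as member:
                raw = member.read()
            # Member names inside a zip always use forward slashes.
            stem = PurePosixPath(info.filename).stem
            texts[stem] = raw.decode("utf-8", errors="replace")
    return texts
```

Called as read_texts_from_zip("./data/augmented_dataset_augmented_dataset.zip"), this would return the issue texts keyed by file name. Files with the same name in different folders of the archive would overwrite each other, so a real implementation might key on the full member path instead.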

Testing for Alex-SecureReqNet

Determine and implement tests for Alex-SecureReqNet that verify the accuracy of the pretrained model and that training works properly.

Refactor file paths in 08_alpha_securereqnet

Before 08_alpha_securereqnet is moved into nbdev, it needs changes that allow it to run with the current code architecture. This issue depends on the read_data issue being completed first.

Create prototype Flask backend for REST queries

Create an initial Flask backend prototype once port forwarding is complete. Prototype endpoints will mirror the input of the TFX Serving models. Output will be translated to True/False once Transform is complete.
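
The True/False translation step can be sketched independently of Flask. TensorFlow Serving's REST predict API returns a JSON object with a "predictions" list; the helper below turns such a response body into booleans. It is a minimal sketch, not the project's backend: the function name predictions_to_bools, the 0.5 threshold, and the assumption that index 1 holds the security-related class score are all hypothetical and would need to match the deployed model.

```python
import json


def predictions_to_bools(response_body, threshold=0.5, positive_index=1):
    """Map a serving response body to one True/False value per prediction.

    Assumes (hypothetically) that each prediction is either a single score
    or a list of per-class scores with the positive class at positive_index.
    """
    payload = json.loads(response_body)
    results = []
    for pred in payload["predictions"]:
        if isinstance(pred, (int, float)):
            score = pred  # bare score
        elif len(pred) == 1:
            score = pred[0]  # single-element list
        else:
            score = pred[positive_index]  # per-class scores
        results.append(score >= threshold)
    return results
```

A Flask route would forward the request to the serving model and pass the response text through this helper before replying, which keeps the translation logic testable without a running server.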

Testing for SecureReqNet deep

Determine and implement tests for SecureReqNet Deep that verify the accuracy of the pretrained model and that training works properly.
