Git Product home page Git Product logo

fraud_detection's Introduction

fraud_detection

Bank account fraud detection project. It will be presented at a technical interview for a Data Scientist position at the FIA.

The dataset can be found on Kaggle.

Requirements:

  • Develop 2 ML models to compare their performances.
  • Use Jupyter Notebook for the presentation.
  • Prepare a 10mins demo.
  • 5h assignement.
  • Take into account explainability of the model.
  • Develop further ideas as conclusion.

Prerequisites

Python Environment Setup

The python environement for this project is managed through Poetry. Poetry can be installed with the following command:

pip install poetry
poetry --version

Install the Poetry env with:

poetry install

The poetry virtual environment should be selectable as a kernel in a Jupyter Notebook.

Description

The project took about 5 hours to complete. Due to this time limit we had to prioritise certain tasks and make decisions about the development. We focused mainly on the initial exploration of the data, the comparison of two different models and their analysis in the context of explanatory power. We did not concentrate on optimising the visualisations or the hyperparameters of the models. The project is composed of two jupyter notebooks:

  • data_exploration: This notebook brings together the exploration of the dataset studied and any data cleansing. We focused mainly on understanding the features and their distribution conditional on the Fraud variable.
  • training: In this notebook, we have implemented two different models for fraud prediction: logistic regression and LightGBM. We have compared their performance in terms of the area under the ROC curve: In addition, we have compared the importance of each feature in decision making.

fraud_detection's People

Contributors

kruuzhaw avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.