Git Product home page Git Product logo

acm-hsg-autumn-coding-night's Introduction

ACM HSG Autumn Coding Night

Welcome to the ACM HSG Autumn Coding Night! This is a repository for the event. You can find the data, slides and some sample code here.

Setup

Before starting please make sure that you have the following installed:

  • Python
  • Pandas
  • Scikit-learn
  • Matplotlib
  • Seaborn

Task

The goal is to write a python script to detect fraudulent transactions. The group that detects the most amount of correct transactions wins. The script should take the filepath to the input file as an argument and write the transaction ids of the detected frauds to a text file in the required structure. The output should be one single output text file containing the ids of fraudulent transactions, one per line.

Suggested Approach

  1. Fork or download the repository
  2. Examine the training data (data structure, plots, etc.)
  3. Write one or multiple scripts (with or without ML) to detect frauds
  4. Make sure the script is able to import and export the data in the correct format
  5. Analyze the results to decide on a script
  6. Run the script on the competition data
  7. Submit the output file

Additional Challenge:

  • Make sure that the script runs as fast as possible
  • Try to lower the false-positive rate

๐Ÿ“š Resources

๐Ÿ“Š Training Data

You can find the training data in the training-data folder. There is a data set describing the terminals and one that contains all customer data. Then there are two transaction folders. One containing all the inputs and another one with the outputs. For each day there is one input and one output file. The input file contains all the transactions that happened on that day. The output file contains all the transaction ids that were marked as frauds.

To be able to start faster we also provide you the raw files for the training data set where for each transaction it is already stated in the dataframe whether it is a transaction or not. You can find the files in the simulated-training-data-raw folder.

๐Ÿ“ Code Samples

For the beginners there are code samples in the introduction folder. The code samples show you how to import from the data files, how to use pandas, how to analyze the dataframes with plots and how to use scikit-learn.

๐Ÿ“ Slides

You can find the slides in the resources/slides folder.

๐Ÿ“š Links

You can find some useful links in the resources/links.md file.

acm-hsg-autumn-coding-night's People

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.