FD_CFD_extraction

This repository contains the implementation of two algorithms, TANE and CTANE, corresponding to the following publications:

  1. "TANE: An Efficient Algorithm for Discovering Functional and Approximate Dependencies" (link: https://www.lri.fr/~pierres/donn%E9es/save/these/articles/lpr-queue/huhtala99tane.pdf)

  2. "Discovering Conditional Functional Dependencies" (link: http://homepages.inf.ed.ac.uk/fgeerts/pdf/CFDdiscovery.pdf)
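For readers new to the terminology, here is a minimal sketch of what it means for a functional dependency to hold exactly or approximately. It is not taken from tane.py: the function names `fd_holds` and `g3_error` and the sample rows are hypothetical, and the error measure is assumed to follow the g3 definition from the TANE paper (the smallest fraction of rows that must be removed for the dependency to hold).

```python
from collections import defaultdict


def fd_holds(rows, lhs, rhs):
    """Return True if the dependency lhs -> rhs holds exactly in rows.

    rows is a list of dicts, lhs a list of attribute names, rhs one name.
    """
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        # Two rows that agree on lhs must also agree on rhs.
        if seen.setdefault(key, row[rhs]) != row[rhs]:
            return False
    return True


def g3_error(rows, lhs, rhs):
    """Fraction of rows to remove so that lhs -> rhs holds (assumed g3)."""
    if not rows:
        return 0.0
    groups = defaultdict(lambda: defaultdict(int))
    for row in rows:
        groups[tuple(row[a] for a in lhs)][row[rhs]] += 1
    # In each lhs group, keep only the rows carrying the most common rhs value.
    kept = sum(max(counts.values()) for counts in groups.values())
    return 1 - kept / len(rows)


# Hypothetical sample data, not one of the CSV files in this repository.
sample = [
    {"zip": "10001", "city": "NYC", "name": "a"},
    {"zip": "10001", "city": "NYC", "name": "b"},
    {"zip": "94105", "city": "SF", "name": "c"},
    {"zip": "94105", "city": "Oakland", "name": "d"},
]
```

With this sample, `zip -> city` fails exactly because of the last row, but it would hold as an approximate dependency under any error threshold of at least 0.25.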

We have also provided several CSV files as test data.

This code was used in the following work: "Automatic Discovery of Functional Dependencies and Conditional Functional Dependencies: A Comparative Study" (link: https://cs.uwaterloo.ca/~nasghar/848.pdf)

## Running the code

To run tane.py on a particular CSV file (e.g. adult.csv), execute the following command in your terminal:

python tane.py adult.csv

To run ctane.py on the same data, execute:

python ctane.py adult.csv

To run ctane.py and obtain k-frequent CFDs, execute:

python ctane.py adult.csv k

where k is an integer of your choice, used as the minimum support threshold.
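To illustrate what k-frequent means, the sketch below counts the support of a CFD pattern and compares it with k. It is an assumption-laden illustration, not the logic inside ctane.py: the helper names (`matches`, `support`, `is_k_frequent`), the `"_"` wildcard convention, and the sample rows are all hypothetical. It assumes the definition from the CFD discovery paper, where the support of a CFD is the number of rows matching its pattern on all attributes involved.

```python
WILDCARD = "_"  # hypothetical marker for "any value" in a pattern


def matches(row, pattern):
    """True if row agrees with every constant in pattern."""
    return all(v == WILDCARD or row[a] == v for a, v in pattern.items())


def support(rows, pattern):
    """Number of rows that match the pattern."""
    return sum(1 for row in rows if matches(row, pattern))


def is_k_frequent(rows, pattern, k):
    """True if at least k rows match the pattern."""
    return support(rows, pattern) >= k


# Hypothetical sample data, not one of the CSV files in this repository.
people = [
    {"country": "UK", "zip": "EH8", "city": "Edinburgh"},
    {"country": "UK", "zip": "EH8", "city": "Edinburgh"},
    {"country": "UK", "zip": "W1", "city": "London"},
    {"country": "US", "zip": "10001", "city": "NYC"},
]

# Pattern for a CFD of the form ([country, zip] -> city) restricted to UK rows.
uk_pattern = {"country": "UK", "zip": WILDCARD, "city": WILDCARD}
```

Here three rows match `uk_pattern`, so the pattern is 3-frequent but not 4-frequent. Raising k generally prunes rarely supported patterns from the output.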
