Git Product home page Git Product logo

metanome's Introduction

Metanome

Build Status Coverage Status

The Metanome project is a joint project between the Hasso-Plattner-Institut (HPI) and the Qatar Computing Reserach Institute (QCRI). Metanome provides a fresh view on data profiling by developing and integrating efficient algorithms into a common tool, expanding on the functionality of data profiling, and addressing performance and scalabilities issues for Big Data. A vision of the project appears in SIGMOD Record: "Data Profiling Revisited".

The Metanome tool is supplied under Apache License. You can use and extend the tool to develop your own profiling algorithms. The profiling algorithms contained in our downloadable Metanome build have HPI copyright. You are free to use and distribute them for research purposes.

Building Metanome

Metanome is a maven project, which can be build by executing:

mvn verify

The verify phase should be executed as GWTTests are executed in this phase of the build.

Metanome can be packaged together with a jetty webserver and profiling algorithms. To speedup builds this package is not created in the default maven profile. The deployment package can be created by executing the build with the deployment profile:

mvn verify -P deployment

or by executing package on the deployment project directly (if metanome has not been installed dependencies will be retrieved online):

mvn -f deployment/pom.xml package

Downloads

Metanome releases can be found on the download page at:

https://hpi.de/naumann/sites/metanome/files

Documentation

The Metanome tool, information for algorithm developers and contributors to the project can be found in the github wiki.

Javadocs for the project can be found at https://hpi.de/naumann/sites/metanome/javadoc.

Development

The Metanome modules are continously deployed to sonatype and can be used by adding the repository:

<repositories>
    <repository>
        <id>snapshots-repo</id>
        <url>https://oss.sonatype.org/content/repositories/snapshots</url>
    </repository>
</repositories>
Coding style

The project follows the google-styleguide please make sure that all contributions adhere to the correct format. Formatting settings for common ides can be found at: http://code.google.com/p/google-styleguide/ All files should contain the apache copyright header. The header can be found in the COPYRIGHT_HEADER file.

metanome's People

Contributors

jakob-zwiener avatar tabergma avatar claudia-exeler avatar jens-ehrlich avatar dacry avatar lsgd avatar pmlanger avatar mandy-roick avatar fatschi avatar xchrdw avatar thorsten-papenbrock avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.