Git Product home page Git Product logo

diffblue-benchmarks / finraos-herd Goto Github PK

View Code? Open in Web Editor NEW

This project forked from finraos/herd

0.0 7.0 0.0 108.06 MB

Herd is a managed data lake for the cloud. The Herd unified data catalog helps separate storage from compute in the cloud. Manage petabytes of data and make it accessible for data processing and analytical purposes by any cloud compute platform.

Home Page: http://finraos.github.io/herd/

License: Apache License 2.0

JavaScript 0.90% CSS 1.07% HTML 0.17% Shell 0.04% Java 97.80% Batchfile 0.01% Dockerfile 0.01%

finraos-herd's Introduction

Overview Build Status

Herd is big data governance for the cloud. The herd unified data catalog helps separate compute from storage in the cloud. Herd job orchestration manages your ETL and analytics processes while tracking all data in the catalog. Here is a quick summary of features:

  • Unified Data Catalog A centralized, auditable catalog for operational usage and data governance.
  • Track Lineage Capture data ancestry for regulatory, forensic, and analytical purposes
  • Manage Clusters Create and launch clusters; load data into clusters from catalog entries
  • Orchestrate Jobs Orchestrate clusters and catalog services to automate processing jobs

Find out more about herd features on our GitHub project page

Quick Start

The best way to start learning about herd is through these links. The demo installation process is quick and easy - you can have herd up and running in AWS in 10-15 minutes and start registering data immediately afterwards.

Get Involved

We are actively seeking organizations and individuals that are interested in adopting herd and contributing to the development effort. Find out more in the contributions section of our GitHub project page. If you have any questions or discussion topics, post them on GitHub Issues or email us at [email protected].

License

Herd is licensed under Apache License 2.0

finraos-herd's People

Contributors

afelde avatar aniruddhadas9 avatar aniruddhadas9finra avatar davidbalash avatar foxsmart avatar jazhou avatar johnlbergqvist avatar jzhang80 avatar k26389 avatar kenisteward avatar kood1 avatar kusid avatar mchao47 avatar mona62 avatar nateiam avatar paulkennethkent avatar ryanbarwick avatar saisumughis avatar saisuryafinra avatar seoj avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.