Pari's Projects

big-data-rosetta-code

Code snippets for solving common big data problems on various platforms. Inspired by Rosetta Code.

brickflow

Pythonic Programming Framework to orchestrate jobs in Databricks Workflow

corda

Corda is an open source blockchain project, designed for business from the start. Only Corda allows you to build interoperable blockchain networks that transact in strict privacy. Corda's smart contract technology allows businesses to transact directly, with value.

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

delta

An open-source storage layer that brings scalable, ACID transactions to Apache Spark™ and big data workloads.

former2

Generate CloudFormation / Terraform / Troposphere templates from your existing AWS resources.

koalas

Koalas: pandas API on Apache Spark

ludwig

Ludwig is a toolbox built on top of TensorFlow that lets users train and test deep learning models without writing code.

metaflow

Build and manage real-life data science projects with ease.

minio

MinIO is a high performance object storage server compatible with Amazon S3 APIs

nakadi

A distributed event bus that implements a RESTful API abstraction on top of Kafka-like queues

nni

An open source AutoML toolkit for automating the machine learning lifecycle, including feature engineering, neural architecture search, model compression and hyper-parameter tuning.

quinn

pyspark methods to enhance developer productivity 📣 👯 🎉

redash

Make Your Company Data Driven. Connect to any data source, easily visualize, dashboard and share your data.

soda-sql

Metric collection, data testing and monitoring for SQL accessible data

spark-daria

Essential Spark extensions and helper methods ✨😲

spline

Data Lineage Tracking and Visualization tool for Apache Spark™

troposphere

troposphere - Python library to create AWS CloudFormation descriptions

zipkin

Zipkin is a distributed tracing system
