
yang040840219's Projects

airbyte

Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.

alluxio

Alluxio, data orchestration for analytics and machine learning in the cloud

amundsen

Amundsen is a metadata driven application for improving the productivity of data analysts, data scientists and engineers when interacting with data.

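ballcat

😸 A rapid-development scaffold for quickly building enterprise-grade back-office management systems, with a range of convenient starters for extending functionality. Main features include separation of front-office and back-office users, menu permissions, data permissions, scheduled tasks, access logs, operation logs, exception logs, unified exception handling, XSS filtering, SQL injection prevention, internationalization, and more.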

beam

Apache Beam is a unified programming model for Batch and Streaming data processing.

casbah

Officially supported Scala Driver for MongoDB

clickhouse

ClickHouse is a free analytics DBMS for big data

common

Common utilities library containing metrics, config and utils

cronex

Heavily unit-tested cron expression evaluation

cronhub

CronHub is a better crontab: a web application that can monitor the crontabs of a large number of machines and makes them easy to manage from a web page.

dataease

An open-source data visualization and analysis tool that anyone can use.

datagear

A data visualization and analysis platform for freely building any data dashboard you want.

dataspherestudio

DataSphereStudio is a one-stop data application development and management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.

dataworks-zeus

Ctrip Hadoop Job Scheduling System derived from https://github.com/alibaba/zeus

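datax

DataX is an offline data synchronization tool/platform widely used within Alibaba Group. It provides efficient data synchronization between heterogeneous data sources including MySQL, Oracle, HDFS, Hive, OceanBase, HBase, OTS, and ODPS.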

dbt-core

dbt enables data analysts and engineers to transform their data using the same practices that software engineers use to build applications.

dbt-spark

dbt-spark contains all of the code enabling dbt to work with Apache Spark and Databricks

deequ

Deequ is a library built on top of Apache Spark for defining "unit tests for data", which measure data quality in large datasets.

delta

An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs
