zhaomin1423 Goto Github PK
Name: Min Zhao
Type: User
Bio: learning how to code.
Location: Hangzhou, China
Name: Min Zhao
Type: User
Bio: learning how to code.
Location: Hangzhou, China
Airbyte is an open-source EL(T) platform that helps you replicate your data in your warehouses, lakes and databases.
ODPS SDK for Java Developers
Alluxio, data orchestration for analytics and machine learning in the cloud
剥离的模块,用于查看Spark SQL生成的语法树
Arctic is a streaming lake warehouse service open sourced by NetEase
Apache Arrow DataFusion Comet Spark Accelerator
AutoMQ is a cloud-native fork of Kafka by separating storage to S3. 10x cost-effective. Autoscale in seconds. Single-digit ms latency.
大数据入门指南 :star:
Bistoury是去哪儿网的java应用生产问题诊断工具,提供了一站式的问题诊断方案
BitSail is a distributed, high-performance data integration engine and provides global data integration solutions in batch, streaming, and incremental scenarios. At present, BitSail has been widely used and synchronizes hundreds of trillions data every day.
Blazing-fast query execution engine speaks Apache Spark language and has Arrow-DataFusion at its core.
一个采用Netty实现的RPC框架,适用于Spring Boot,Spring Cloud!
Mirror of Apache Calcite
Cch1996.github.io
:books: 技术面试必备基础知识、Leetcode、计算机操作系统、计算机网络、系统设计
DataSphereStudio is a one stop data application development& management portal, covering scenarios including data exchange, desensitization/cleansing, analysis/mining, quality measurement, visualization, and task scheduling.
DataX是阿里云DataWorks数据集成的开源版本。
Change data capture for a variety of databases. Please log issues at https://issues.redhat.com/browse/DBZ.
An open-source storage framework that enables building a Lakehouse architecture with compute engines including Spark, PrestoDB, Flink, Trino, and Hive and APIs for Scala, Java, Rust, Ruby, and Python.
Apache DolphinScheduler is a distributed and extensible workflow scheduler platform with powerful DAG visual interfaces, dedicated to solving complex job dependencies in the data pipeline and providing various types of jobs available out of box.
Apache Druid: a high performance real-time analytics database.
:elephant: Elasticsearch real-time search and analytics natively integrated with Hadoop
Exchangis is a lightweight,highly extensible data exchange platform that supports data transmission between structured and unstructured heterogeneous data sources
Apache Flink
Change Data Capture (CDC) Connectors for Apache Flink
基于开源的flink,对其实时sql进行扩展;主要实现了流与维表的join,支持原生flink SQL所有的语法
Based on Apache Flink. support data synchronization/integration and streaming SQL computation.
极客时间视频课程《玩转Spring全家桶》
A declarative, efficient, and flexible JavaScript library for building user interfaces.
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
An Open Source Machine Learning Framework for Everyone
The Web framework for perfectionists with deadlines.
A PHP framework for web artisans
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
Some thing interesting about web. New door for the world.
A server is a program made to process requests and deliver data to clients.
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
Some thing interesting about visualization, use data art
Some thing interesting about game, make everyone happy.
We are working to build community through open source technology. NB: members must have two-factor auth.
Open source projects and samples from Microsoft.
Google ❤️ Open Source for everyone.
Alibaba Open Source for everyone
Data-Driven Documents codes.
China tencent open source team.