Git Product home page Git Product logo

awesome-opensource-data-engineering's Introduction

Hello Stranger ๐Ÿ‘‹!

My name is Gunnar. I am an open-source software engineer, currently working at Decodable, involved with data streaming based on Apache Flink. Before joining Decodable, I spent ten years at Red Hat, where I led the Debezium project, a platform for change data capture.

I have been a long-time committer to multiple open-source projects, including Debezium, Hibernate, kcctl ๐Ÿงธ, JfrUnit, MapStruct and Deptective; I also serve as the spec lead for Bean Validation 2.0 (first at the JCP, now under the Jakarta EE umbrella at the Eclipse Foundation).

Named a Java Champion, I enjoy presenting at conferences. I have spoken at events such as QCon, JavaOne, Red Hat Summit, JavaZone, JavaLand, Kafka Summit, Current, Flink Forward, and Data Council. Here are some recordings from talks over the last few years:

Occasionally, I blog about topics related to software engineering. Here are some popular posts:

awesome-opensource-data-engineering's People

Contributors

antonmry avatar bsideup avatar chloejay avatar chrfoyer avatar chrjohn avatar dwiechert avatar geoand avatar gitcommitshow avatar gunnarmorling avatar iporollo avatar jogoodma avatar josep2 avatar joshmeek avatar justinpitts avatar maayan-s avatar martint avatar mustafa0taru avatar onecricketeer avatar senordeveloper avatar sergiojulio avatar sryza avatar tchiotludo avatar yaelriv avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

awesome-opensource-data-engineering's Issues

Suggestions for more tools

I suggest adding the following:

  1. Meltano by GitLab
  2. dbt by Fishtown Analytics
  3. Singer by StitchData (Talend)
  4. Airbyte
  5. ScyllaDB

Apache Kafka Is Not Messaging!

I know this may come as a surprise for many people but Apache Kafka is not a messaging infrastructure. It is simpler to label it as such because it has been known this way since the project was born many years ago; but the reality is that today Kafka is more than that. Kafka is a distributed streaming platform.

image

A streaming platform has three key capabilities:

  • Publish and subscribe to streams of records, similar to a message queue or enterprise messaging system.

  • Store streams of records in a fault-tolerant durable way.

  • Process streams of records as they occur.

Reference: https://kafka.apache.org/documentation/#gettingStarted

That means that Kafka should belong to the categories "Integration", "Messaging Infrastructure", and "Stream Processing" at the same time. Knowing that this would look odd on this list of open-source projects, I would suggest creating a new category called "Distributed Streaming Platforms" and put Kafka on it.

-- @riferrei

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.