Git Product home page Git Product logo

s3-kafka-connector's Introduction

s3-kafka-connector

Kafka Connector for AWS S3 file system

This connector copies Kafka messages into Amazon's S3 file system.

Quick Start:

Follow steps 1 thru 5 from the 'Quick Start' section in Kafka's documentation (http://kafka.apache.org/documentation.html#quickstart)

To summarize:

  1. Download & install

tar -xzf kafka_2.11-0.9.0.0.tgz cd kafka_2.11-0.9.0.0

  1. Start Zookeeper

bin/zookeeper-server-start.sh config/zookeeper.properties

  1. Open a new Terminal window & start the server

bin/kafka-server-start.sh config/server.properties

  1. Open a new Terminal windows & Create a topic

bin/kafka-topics.sh --create --zookeeper localhost:2181 --replication-factor 1 --partitions 1 --topic test

  1. In the same terminal window, start Producer & send a few messages

bin/kafka-console-producer.sh --broker-list localhost:9092 --topic test

This is a message

This is another message

  1. Open a new Terminal window & confirm that messages were received by Kafka server.

bin/kafka-console-consumer.sh --zookeeper localhost:2181 --topic test --from-beginning

  1. Get the source code for this project in a new Terminal window.

git clone https://github.com/dilteam/s3-kafka-connector

cd s3-kafka-connector

  1. Open connect-s3-sink.properties & add S3 related properties.

vi src/main/resources/connect-s3-sink.properties

Set these properties:

ACCESS_KEY=

SECRET_KEY=

BUCKET_NAME=

FOLDER_NAME=

  1. Compile. (You should have Maven installed.)

mvn clean package

  1. Start S3 Kafka Conector.

bin/startConnector.sh

  1. Now enter messages in the Producer window. They will show up on S3 at s3://BUCKET_NAME/FOLDER_NAME/

TODO:

  1. Allow users to partition data as per their need.

  2. Test performance. (Would we get SlowDown errors (503) errors?)

  3. Add more S3 configuration parameters.

  4. and more....

s3-kafka-connector's People

Contributors

dilteam avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.