
akramiot / iot-telemetry-data_aws_cloud_data_lake-


End-to-end architecture and implementation of a data ingestion pipeline in the AWS Cloud for building a data lake from MQTT-based IoT telemetry data coming from Cisco IoT gateways.

License: Apache License 2.0



IOT-Telemetry-data_AWS_Cloud_Data_Lake-

End-to-end architecture and implementation of a data ingestion pipeline in the AWS Cloud for building a data lake.

For the AWS IoT device-side Python SDK, clone the repository at the link below.

https://github.com/aws/aws-iot-device-sdk-python

Note the following when using the AWS IoT Core Python SDK and Boto3 with Python 3.x or Python 2.x in this project.

On 2021-01-15, deprecation for Python 2.7 was announced, and support was dropped on 2021-07-15. To avoid disruption, customers using Boto3 on Python 2.7 may need to upgrade their version of Python or pin the version of Boto3. For more information, see this blog post: https://aws.amazon.com/blogs/developer/announcing-end-of-support-for-python-2-7-in-aws-sdk-for-python-and-aws-cli-v1/

On 2022-05-30, support for Python 3.6 ended. This follows the Python Software Foundation's end of support for the runtime, which occurred on 2021-12-23. For more information, see this blog post: https://aws.amazon.com/blogs/developer/python-support-policy-updates-for-aws-sdks-and-tools/
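The version notices above can be turned into a small start-up guard. This is a minimal sketch: the `(3, 7)` floor is an assumption derived only from the dates quoted above (newer Boto3 releases may require a later interpreter), and `python_is_supported` is a hypothetical helper name, not part of this repository.

```python
import sys

# Assumed minimum: Python 3.6 support ended on 2022-05-30, so 3.7 is the
# oldest version not excluded by the notices above. Raise this floor if the
# Boto3 release you install requires a newer interpreter.
MIN_PYTHON = (3, 7)


def python_is_supported(version_info=None, minimum=MIN_PYTHON):
    """Return True if the (major, minor) interpreter version is at least `minimum`."""
    if version_info is None:
        version_info = sys.version_info
    return tuple(version_info[:2]) >= tuple(minimum)
```

A script could call `python_is_supported()` before importing Boto3 and exit with a clear message when it returns `False`.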

The illustration below shows the complete architecture of the AWS Cloud IoT data lake, with IoT telemetry data coming from MQTT clients running on Cisco IoT gateways.

A) Cisco IoT gateways of the IR1101, IR829, or IR1800 series can be used to run the MQTT client, or

B) A Linux machine or Raspberry Pi can be used to run the vanilla MQTT client (code present in this repository), or

C) Any Cisco Catalyst hardware supporting IOX_CAF and the AWS Greengrass SDK can also be used. See the reference link: https://developer.cisco.com/ecosystem/cpp/solutions/177022/
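As an illustration of option B, the following sketch publishes one telemetry reading to AWS IoT Core with the AWS IoT Device SDK for Python (`AWSIoTPythonSDK`). It is not the client shipped in this repository. The payload field names, the default client ID, and both function names are assumptions made for the example. The endpoint, certificate paths, and topic must come from your own AWS IoT Core setup.

```python
import json
import time


def build_telemetry_payload(device_id, temperature_c, humidity_pct, timestamp=None):
    """Serialize one telemetry reading as JSON (field names are illustrative)."""
    reading = {
        "device_id": device_id,
        "temperature_c": temperature_c,
        "humidity_pct": humidity_pct,
        "timestamp": int(time.time()) if timestamp is None else int(timestamp),
    }
    return json.dumps(reading, sort_keys=True)


def publish_telemetry(endpoint, root_ca, private_key, certificate, topic, payload,
                      client_id="iot-gateway-demo"):
    """Publish one payload to AWS IoT Core over MQTT with mutual TLS on port 8883."""
    # Imported lazily so the payload helper works without the SDK installed.
    from AWSIoTPythonSDK.MQTTLib import AWSIoTMQTTClient

    client = AWSIoTMQTTClient(client_id)
    client.configureEndpoint(endpoint, 8883)
    client.configureCredentials(root_ca, private_key, certificate)
    client.connect()
    try:
        client.publish(topic, payload, 1)  # QoS 1: at-least-once delivery
    finally:
        client.disconnect()
```

Keeping payload construction separate from the network call lets the message format be checked on a laptop before the client is deployed to a gateway.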

[Figure: AWS IoT cloud data lake architecture with MQTT-based IoT telemetry data]

Prerequisites for this Project:

  1. Setting up the AWS IoT Analytics channel, pipeline, and data store.
  2. Ingesting MQTT-based sample pub/sub data into AWS IoT Core, then into the IoT data pipeline channel, and eventually into the IoT data analytics and visualisation tools.
  3. Creating a dataset with dataset content delivery to different S3 buckets (see the illustration above).
  4. Integrating IoT data with other data stored in your data lake on AWS through the IoT data pipeline channel, either with Lake Formation tools, with AWS Glue Data Catalog logic, or by creating manual data pipelines.
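The channel, pipeline, data store, and dataset listed above can also be created with Boto3 instead of a template. The sketch below is one possible way to do it and is not taken from this repository's CloudFormation template. The resource name suffixes, the `SELECT *` query, and the S3 key layout are assumptions. Routing messages from AWS IoT Core into the channel needs a separate IoT rule, which is not shown here.

```python
def pipeline_activities(channel_name, datastore_name):
    """Minimal activity chain: read from the channel, write to the data store."""
    return [
        {"channel": {"name": "from_channel", "channelName": channel_name,
                     "next": "to_datastore"}},
        {"datastore": {"name": "to_datastore", "datastoreName": datastore_name}},
    ]


def s3_delivery_rule(bucket, key, role_arn):
    """Content delivery rule that writes dataset content to an S3 bucket."""
    return {"destination": {"s3DestinationConfiguration": {
        "bucket": bucket, "key": key, "roleArn": role_arn}}}


def create_resources(prefix, bucket, role_arn, region="us-east-1"):
    """Create channel, data store, pipeline, and dataset (names are hypothetical)."""
    import boto3  # imported lazily; needs AWS credentials when called

    client = boto3.client("iotanalytics", region_name=region)
    channel, datastore = prefix + "_channel", prefix + "_datastore"
    client.create_channel(channelName=channel)
    client.create_datastore(datastoreName=datastore)
    client.create_pipeline(
        pipelineName=prefix + "_pipeline",
        pipelineActivities=pipeline_activities(channel, datastore),
    )
    # Assumed key layout: one CSV per dataset content version.
    key = prefix + "/!{iotanalytics:scheduleTime}/!{iotanalytics:versionId}.csv"
    client.create_dataset(
        datasetName=prefix + "_dataset",
        actions=[{"actionName": "select_all",
                  "queryAction": {"sqlQuery": "SELECT * FROM " + datastore}}],
        contentDeliveryRules=[s3_delivery_rule(bucket, key, role_arn)],
    )
```

Delivering to more than one bucket would mean adding one rule per bucket to `contentDeliveryRules`.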

Account Access, IAM access and other Prerequisites:

To implement this use case in the AWS Cloud, make sure that you have access to the following resources:

  1. An AWS account, used in an AWS Region where AWS IoT Analytics is available. Placing IoT Analytics and the S3 data storage buckets in the same Region reduces the round-trip time (RTT) for moving the data.
  2. This project uses a CloudFormation YAML template which is in the US East (N. Virginia) Region.
  3. However, you can choose another AWS Region where AWS IoT Analytics is available.
  4. The AdministratorAccess policy granted to your AWS account (for production, we recommend restricting this further).
  5. The AWS CLI installed and configured for use with your AWS account, along with the privileges and PKI key pairs needed for CLI access to your AWS Cloud account.
  6. The AWS IoT Python SDK and all the modules defined in requirements.txt installed.
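Before deploying, you may want to check that your chosen Region offers AWS IoT Analytics. This is a minimal sketch: the helper names are hypothetical, and the `us-east-1` fallback is an assumption based on the template's Region noted above. The list comes from the endpoint data bundled with your installed Boto3, so it may lag behind the service.

```python
def choose_region(preferred, available, fallback="us-east-1"):
    """Return `preferred` if it is in `available`, otherwise the fallback Region."""
    return preferred if preferred in available else fallback


def iot_analytics_regions():
    """List Regions where the installed Boto3 knows an IoT Analytics endpoint."""
    import boto3  # imported lazily; no AWS credentials are needed for this lookup

    return boto3.Session().get_available_regions("iotanalytics")
```

For example, `choose_region("eu-west-1", iot_analytics_regions())` returns the preferred Region when it is listed and the fallback otherwise.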
