Git Product home page Git Product logo

taming-big-data-with-spark-streaming-and-scala-hands-on's Introduction

Taming Big Data with Spark Streaming and Scala – Hands On! [Video]

This is the code repository for Taming Big Data with Spark Streaming and Scala – Hands On! Video, published by Packt. It contains all the supporting project files necessary to work through the video course from start to finish.

About the Video Course

Businesses these days require constant, real-time analysis of large amounts of data, along with meaningful insights which influence business decisions. Apache Spark has emerged as the most popular tool in the Big Data market for efficient real-time analytics of Big Data. Spanning over 5 hours, this course will teach you the basics of Apache Spark and how to use Spark Streaming - a module of Apache Spark which involves handling and processing of Big Data on a real-time basis. You will learn how to create Spark applications with Scala to process streams of real-time data. Whether you want to analyze continuously incoming website traffic, analyze real-time streams of Twitter feeds or query your streaming data in real time, this course has got you covered. You will also learn how to use the MLlib module of Spark to train machine learning models with streaming data, and use those models to make real-time predictions. The course assumes some programming experience, and uses Scala to develop Spark applications. It includes a crash course in the Scala programming language in case you're new to it.

What You Will Learn

  • Process large amounts of real-time data using the Spark Streaming module
  • Create efficient Spark applications using the Scala programming language
  • Integrate Spark Streaming with various data sources
  • Integrate Spark Streaming with Spark SQL to query your data in real time
  • Train machine learning models with streaming data, and use them for making real-time predictions

Instructions and Navigation

Assumed Knowledge

To fully benefit from the coverage included in this course, you will need:

  • If you are a student who wants to learn how to use Apache Spark, or a Big Data professional who wants to process large amounts of data on real-time basis, this course is for you. Some basic programming and scripting experience is required to get the most out of the course.
  • taming-big-data-with-spark-streaming-and-scala-hands-on's People

    Contributors

    asif-packt avatar

    Recommend Projects

    • React photo React

      A declarative, efficient, and flexible JavaScript library for building user interfaces.

    • Vue.js photo Vue.js

      🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

    • Typescript photo Typescript

      TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

    • TensorFlow photo TensorFlow

      An Open Source Machine Learning Framework for Everyone

    • Django photo Django

      The Web framework for perfectionists with deadlines.

    • D3 photo D3

      Bring data to life with SVG, Canvas and HTML. 📊📈🎉

    Recommend Topics

    • javascript

      JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

    • web

      Some thing interesting about web. New door for the world.

    • server

      A server is a program made to process requests and deliver data to clients.

    • Machine learning

      Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

    • Game

      Some thing interesting about game, make everyone happy.

    Recommend Org

    • Facebook photo Facebook

      We are working to build community through open source technology. NB: members must have two-factor auth.

    • Microsoft photo Microsoft

      Open source projects and samples from Microsoft.

    • Google photo Google

      Google ❤️ Open Source for everyone.

    • D3 photo D3

      Data-Driven Documents codes.