Git Product home page Git Product logo

bigdataguide's Introduction

大数据学习指南

大数据学习指南,从零开始学习大数据开发,包含大数据学习各个阶段资汇总

概述

  1.大数据简介
  2.大数据相关岗位介绍

大数据学习路线

  1.大数据学习路线(包含自己看过的视频链接)

完全分布式集群搭建

  可以参考我的博客,按照顺序进行操作即可
  1.集群搭建

大数据框架组件

一、Hadoop

  1.Hadoop——分布式文件管理系统HDFS
  2.Hadoop——HDFS的Shell操作
  3.Hadoop——HDFS的Java API操作
  4.Hadoop——分布式计算框架MapReduce
  5.Hadoop——MapReduce案例
  6.Hadoop——资源调度器YARN
  7.Hadoop——Hadoop数据压缩

二、Zookeeper

  1.Zookeeper——Zookeeper概述

三、Hive

  1.Hive——Hive概述
  2.Hive——Hive数据类型

四、Flume

  1.Flume——Flume概述
  2.Flume——Flume实践操作
  3.Flume——Flume案例

五、Kafka

  1.Kafka——Kafka概述
  2.Kafka——Kafka深入解析
  3.Kafka——Kafka API操作实践
  3.Kafka——Kafka对接Flume实践

六、HBase

  1.HBase——HBase概述
  2.HBase——HBase数据结构
  3.HBase——HBase Shell操作
  4.HBase——HBase API实践操作

七、Spark

Spark基础

  1.Spark基础——Spark的诞生
  2.Spark基础——Spark概述
  3.Spark基础——Spark运行模式
  4.Spark基础——案例实践

Spark Core

  1.Spark Core——RDD概述

八、Flink

面试题

一、Hadoop

  1.Hadoop面试题总结(一)
  2.Hadoop面试题总结(二)——HDFS
  3.Hadoop面试题总结(三)——MapReduce
  4.Hadoop面试题总结(四)——YARN
  5.Hadoop面试题总结(五)——优化问题

二、Zookeeper

  1.Zookeeper面试题总结(一)

三、Hive

  1.Hive面试题总结(一)
  2.Hive面试题总结(二)

四、HBase

  1.HBase面试题总结(一)

五、Flume

  1.Flume面试题总结(一)

六、Kafka

  1.Kafka面试题总结(一)
  2.Kafka面试题总结(二)

七、Spark

  1.Spark面试题总结(一)
  2.Spark面试题总结(二)
  3.Spark面试题总结(三)
  4.Spark面试题总结(四)

Spark性能优化:
  5.Spark面试题总结(五)——几种常见的数据倾斜情况及调优方式
  6.Spark面试题总结(六)——Shuffle配置调优
  7.Spark面试题总结(七)——程序开发调优
  8.Spark面试题总结(八)——运行资源调优

交流群建好了,进群的小伙伴可以加我微信:MoRan1607,备注:GitHub

bigdataguide's People

Contributors

moran1607 avatar

Watchers

James Cloos avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.