hive_notes's Introduction

Apache Hive is used to read, write, and manage large datasets residing in distributed storage, and to query them using SQL syntax.

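Built on top of Apache Hadoop, Hive provides the following features:

  • Tools for accessing data via SQL, which covers data warehousing tasks such as ETL (extract/transform/load), reporting, and data analysis;
  • A mechanism for imposing structure on a variety of data formats;
  • Access to data stored in Apache HDFS or in other data storage systems such as Apache HBase;
  • Query execution via Apache Tez, Apache Spark, or MapReduce;
  • A procedural language through HPL-SQL;
  • Sub-second query retrieval via Hive LLAP, Apache YARN, and Apache Slider.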
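
The "imposing structure on a variety of data formats" point can be illustrated outside Hive. The following is a minimal Python sketch, not Hive code: the `SCHEMA` list, the `apply_schema` helper, and the sample rows are all hypothetical. It only shows the idea that one set of column definitions is applied at read time to data stored in different delimited formats.

```python
import csv
import io

# Hypothetical table definition: (column name, type converter).
SCHEMA = [("id", int), ("name", str), ("score", float)]

def apply_schema(text, delimiter):
    """Parse delimited text and cast each field according to SCHEMA."""
    reader = csv.reader(io.StringIO(text), delimiter=delimiter)
    rows = []
    for fields in reader:
        # Pair each raw string field with its column name and converter.
        rows.append({name: cast(value) for (name, cast), value in zip(SCHEMA, fields)})
    return rows

# The same schema is applied to comma-separated and tab-separated input.
csv_rows = apply_schema("1,alice,9.5\n2,bob,7.0\n", ",")
tsv_rows = apply_schema("1\talice\t9.5\n2\tbob\t7.0\n", "\t")
```

Both calls yield the same typed rows, because the structure lives in the schema rather than in the stored files.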

Hive provides standard SQL functionality, including many of the later SQL:2003, SQL:2011, and SQL:2016 features for analytics.

Hive's SQL can be extended with user code via UDFs (user-defined functions), UDAFs (user-defined aggregate functions), and UDTFs (user-defined table functions).
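
The three kinds of extension differ mainly in how many rows go in and come out. The sketch below is plain Python, not Hive's extension API (real Hive functions are typically written in Java and registered with Hive); the function names `upper_udf`, `sum_udaf`, and `split_udtf` are hypothetical and only illustrate the input/output shapes.

```python
def upper_udf(value):
    """UDF shape: one input value in, one output value out."""
    return value.upper()

def sum_udaf(values):
    """UDAF shape: many input rows in, one aggregated value out."""
    total = 0
    for v in values:
        total += v
    return total

def split_udtf(value, sep=","):
    """UDTF shape: one input row in, zero or more output rows out."""
    for part in value.split(sep):
        yield (part,)

names = [upper_udf(n) for n in ["hive", "tez"]]   # applied row by row
total = sum_udaf([1, 2, 3])                       # collapses a group of rows
exploded = list(split_udtf("a,b,c"))              # expands one row into several
```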

Hive supports multiple data formats. It ships with built-in connectors for CSV, TSV, Apache Parquet, Apache ORC, and other formats. Users can extend Hive with connectors for additional formats.

Hive is not designed for online transaction processing (OLTP) workloads. It is best suited to traditional data warehousing tasks.

Hive is scalable, high-performance, fault-tolerant, and loosely coupled with its input formats.

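Hive's components include HCatalog and WebHCat:

  • HCatalog: a table and storage management layer for Hadoop that lets users of different data processing tools, including Pig and MapReduce, read and write data more easily.
  • WebHCat: provides a service for running Hadoop MapReduce (or YARN), Pig, and Hive jobs. Hive metadata operations can also be performed through its HTTP interface.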
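
As a rough illustration of the HTTP interface, the sketch below only builds a WebHCat request URL and sends nothing. The host name, user name, and the `webhcat_url` helper are hypothetical. The port `50111`, the `templeton/v1` path prefix, the `ddl/database` resource, and the `user.name` query parameter are assumed to match a default WebHCat setup and should be checked against your own deployment.

```python
from urllib.parse import urlencode

def webhcat_url(host, resource, user, port=50111):
    """Build a WebHCat REST URL (assumed default port and path prefix)."""
    query = urlencode({"user.name": user})
    return f"http://{host}:{port}/templeton/v1/{resource}?{query}"

# Hypothetical host and user; "ddl/database" is the resource for listing databases.
url = webhcat_url("webhcat.example.com", "ddl/database", "hiveuser")
```

A GET request to such a URL would be the metadata operation itself; it is left out here so the sketch runs without a cluster.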
