huge-data

repos: 42 gists: 0

Name: huge-data

Type: Organization

Bio: Big data processing, including distributed search and distributed computing.

Location: Hefei, Anhui

huge-data's Projects

anthelion

Anthelion is a plugin for Apache Nutch to crawl semantic annotations within HTML pages

common

Common utility classes for connecting Kafka to HDFS, corresponding to confluent-2.0.1.

elasticsearch-rtf

A Chinese distribution of Elasticsearch that bundles plugins for Chinese-language text and includes a demo, so beginners can learn from it or use it directly in production.

galaxia

An operations monitoring and management framework covering infrastructure, applications, and microservices.

hadoop-guides

A Hadoop usage guide, continuously updated. A module on using YARN is planned.

kafka-connect-hdfs

Connects Kafka-0.9 to Hadoop-2.6 and transfers data in three formats (Avro, String, and Byte), corresponding to confluent-2.0.1.

nutch-ajax

Apache Nutch Plugins for AJAX page fetch, parse, index

nutcher

nutcher is Chinese-language documentation for Nutch, covering Nutch configuration and source code analysis. It is continuously updated.

openbdre

Bigdata Ready Enterprise Open Source Software

schema-registry

A schema registry service for the data formats used in Kafka-to-HDFS transfer, corresponding to confluent-2.0.1.

search-prod

A search service framework based on Solr and Zookeeper, with added distributed deployment scripts. Compatible with Solr-5.1.0 and Zookeeper-3.4.5.

snowplow

An enterprise-grade web and mobile event analytics framework. Components include Hadoop, Kinesis, Redshift, and Elasticsearch.
