Git Product home page Git Product logo

meepo's Introduction

Meepo是一个轻量级的数据迁移工具,主要针对Mysql、ParquetFile之间的数据交换场景。

当然也定制了一些扩展,比如Redis、ElasticSearch等。

Meepo主要是用来解决如下几个问题:

1、Mysql表的同步,持续读取原表的新增数据,写入一个定制化的新表,有一些简单的数据加工。

这个需求也有很多公司是基于otter、cannal来做的,meepo和datax原理差不多,基于JDBC。

2、快速复制一张Mysql表,以最快的速度完成一个镜像的拷贝,可适当取舍列,主要用于测试需求。

为了快速写入Mysql,Meepo还是做了很多细致的性能优化工作的,基本上可以满足绝大多数需求了。

3、将在线库的数据生成Parquet,并写入到HDFS上,或者生成本地文件,方便数据的传递。

跟Sqoop功能差不多,但是Sqoop基于Yarn的MR不太好控制,而且依赖有些重。

4、两张Mysql表的比对,目前只能针对主键ID,进行差异比较,找到丢失的数据行。

主要是基于Bitmap,在有限的内存空间里,快速比对数据差异。

5、基于Plugin的定制开发,Meepo默认提供了一些Plugin,也允许plugin组合使用和自定义。

默认提供的插件能自动处理字段类型的差异,能够完成简单的Join计算。

meepo's People

Contributors

peiliping avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar  avatar

meepo's Issues

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.