Comments (4)
细节方面的TODO:
- logging
- 用户需求,并行度自由选择:repartition 增加或减少并行度,输出文件个数。
- 统一的插件定义的文档格式支持丰富的udf, Text(markdown), 支持多语言
- 把plugin单独拎出来作为一个archifact,方便插件依赖。
- dstream/ spark streaming window operation: https://spark.apache.org/docs/latest/streaming-programming-guide.html#window-operations, #42
- Monitoring: StreamingListener/ App monitor
- 性能调优:input调优,如 dstream/receiver个数调优https://spark.apache.org/docs/latest/streaming-programming-guide.html, filter调优, output调优
- input,output协作完成
exactly once
语义:ack, 幂等, 事务 - 支持离线计算
from seatunnel.
BaseInput的getDStream
返回类型不是通用的,预计在实现input插件时将遇到问题。
abstract class BaseInput(config: Config) extends Plugin {
/**
* No matter what kind of Input it is, all you have to do is create a DStream to be used latter
* */
def getDStream: DStream[(String, String)]
/**
* Things to do after filter and before output
* */
def beforeOutput: Unit = {}
/**
* Things to do after output, such as update offset
* */
def afterOutput: Unit = {}
}
from seatunnel.
Spark Benchmark:
https://github.com/intel-hadoop/HiBench
from seatunnel.
2017年11月17日
(1)改为从command arguments读取:
--conf spark.driver.extraJavaOptions=-Dconfig.path=application.conf
(2)spark.submit.deployMode
val spark = SparkSession
.builder()
.appName("SparkApp")
.master("spark: //192.168.60.80:7077")
.config("spark.submit.deployMode","cluster")
.enableHiveSupport()
.getOrCreate()
from seatunnel.
Related Issues (20)
- [Bug] [dev branch] Version 2.3.5 of dev packages and runs on the server with an error message HOT 3
- 创建postgresql数据源测试连接失败,报错No suitable driver found HOT 1
- [Bug] [OSS Checkpoint] OSS checkpoint not working HOT 2
- [Feature][Hbase] Implement Hbase connector catalog and typeconverter
- Does synchronized block required for method "addMetric(String name, Metric metric)" in "AbstractMetricsContext" class?
- JVM Memory Increase in Seatunnel
- [Bug] [mysql-cdc] stop.mode = 'specific' Not Worked.
- Minio
- Using rest-api to submit job is failed.
- [Bug] [Kafka Source] format_error_handle_way = skip is Not effective
- Why email not have corresponding e2e test
- [Bug] [Connector-JDBC] Table structure synchronization HOT 1
- MongoDB - Sink - PluginIdentifier not found HOT 6
- [Feature][Module Name] Feature title HOT 1
- how to set tls_verify_certificate = false in JDBC connection
- [Feature][Engine] The name of the rest-api interface for returning job details was changed from running-job to job-info
- [Feature][sftp] Ignore error records HOT 1
- The main method caused an error: Plugin PluginIdentifier HOT 7
- [Bug] [CI] Fix FixSlotResourceTest Unstable issues
- [Feature][Connector-V2] Support Hive catalog for paimon sink HOT 1
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from seatunnel.