Comments (11)
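The article says "// the full code can be found in the appendix at the end of this article". Where can I find it?
The article says "the number of batches that can be processed at the same time is 10 / 2 = 5". I don't understand why it is 5.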
from coolplayspark.
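@zhengzhou-spark Because there are two outputs, each batch generates two jobs, and each job needs one thread to run. A single batch therefore occupies two threads, so 10 threads can process 5 batches concurrently.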
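The arithmetic above can be written out as a minimal sketch. The object and parameter names here are hypothetical and are not part of Spark; the only assumption is the one stated in this reply, namely that each output produces one job per batch and each job occupies one thread.

```scala
// Hypothetical helper, not a Spark API: it only restates the reasoning above.
object ConcurrentBatches {
  // poolThreads:  number of threads available for running jobs (10 in the article)
  // jobsPerBatch: number of jobs one batch produces, one per output (2 in the article)
  def concurrentBatches(poolThreads: Int, jobsPerBatch: Int): Int = {
    require(jobsPerBatch > 0, "a batch must produce at least one job")
    // Integer division: a batch whose jobs do not all fit is not counted.
    poolThreads / jobsPerBatch
  }
}
```

With the numbers from the article, `ConcurrentBatches.concurrentBatches(10, 2)` evaluates to 5.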
from coolplayspark.
Hello, the article mentions "// the full code can be found in the appendix at the end of this article". Where can I find it?
from coolplayspark.
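"// the full code can be found in the appendix at the end of this article"
The code has now been added to the appendix of the original article, thanks!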
from coolplayspark.
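Why does the article say "when a Spark Streaming program starts running at ssc.start(), a JobScheduler instance is created and then started via start()"? In the 2.10 version I looked at, scheduler is defined in StreamingContext as:
private[streaming] val scheduler = new JobScheduler(this)
There is no lazy, so it should already be created when the StreamingContext is initialized.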
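To illustrate the difference this question relies on, here is a minimal sketch in plain Scala. All names (`InitOrder`, `Scheduler`, `EagerContext`, `LazyContext`) are hypothetical stand-ins and not Spark classes; the sketch only shows that a `val` field is initialized when the enclosing object is constructed, while a `lazy val` is initialized on first access.

```scala
import scala.collection.mutable.ArrayBuffer

object InitOrder {
  // Records the order in which things happen.
  val log: ArrayBuffer[String] = ArrayBuffer.empty[String]

  // Stand-in for a scheduler; logs when it is constructed.
  class Scheduler(name: String) {
    log += s"$name created"
  }

  // Plain val: the Scheduler is built as part of constructing the context.
  class EagerContext {
    val scheduler = new Scheduler("eager")
    def start(): Scheduler = { log += "eager start()"; scheduler }
  }

  // lazy val: the Scheduler is built only when `scheduler` is first read.
  class LazyContext {
    lazy val scheduler = new Scheduler("lazy")
    def start(): Scheduler = { log += "lazy start()"; scheduler }
  }

  // Runs both variants and returns the recorded order of events.
  def demo(): List[String] = {
    log.clear()
    val eager = new EagerContext // "eager created" is logged here
    eager.start()
    val deferred = new LazyContext // nothing is logged here
    deferred.start()               // "lazy start()" first, then "lazy created"
    log.toList
  }
}
```

In `demo()`, the eager variant logs its creation before `start()` is called, whereas the lazy variant logs it only after `start()` touches the field. The definition quoted above has no `lazy`, so it corresponds to the eager case.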
from coolplayspark.
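The earlier wording was indeed wrong. As you say, there is no lazy, so the instance is created when the StreamingContext is initialized.
I'm fixing it -- thanks for pointing this out!
Also, if you haven't joined the Streaming discussion group yet, would you like to join?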
from coolplayspark.
OK, I've joined the group.
from coolplayspark.
Jobs are both generated and submitted on the driver side, so how are the computation tasks dispatched to the workers?
from coolplayspark.
Which of the following two notations used in the article is the right one?
`RDD` DAG
`RDD DAG`
This concerns 2.1 and 2.2.
from coolplayspark.
@lw-lin I added iRobot but never received a group invitation. Could you send me one?
from coolplayspark.
@lw-lin I couldn't find the spark.streaming.concurrentJobs parameter in the official documentation. Do you know where it is documented?
The relevant page I know of so far: http://spark.apache.org/docs/2.3.0/configuration.html#spark-streaming
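For reference, a minimal sketch of how such a key is usually passed to an application through `SparkConf`. The app name, master URL, batch interval, and the value 10 are illustrative assumptions; the sketch does not settle where the parameter is documented, and it needs the Spark Streaming dependency on the classpath.

```scala
import org.apache.spark.SparkConf
import org.apache.spark.streaming.{Seconds, StreamingContext}

object ConcurrentJobsConf {
  def main(args: Array[String]): Unit = {
    val conf = new SparkConf()
      .setAppName("concurrent-jobs-demo") // assumed name, for illustration only
      .setMaster("local[4]")              // assumed local master, for illustration only
      // The key asked about above; the value 10 mirrors the article's example.
      .set("spark.streaming.concurrentJobs", "10")

    // The configuration is handed to the StreamingContext at construction time.
    val ssc = new StreamingContext(conf, Seconds(5))

    // ... define input streams and outputs here before calling ssc.start() ...
    ssc.stop()
  }
}
```

The same key can also be supplied at submit time with `--conf spark.streaming.concurrentJobs=10`.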
from coolplayspark.