Comments (11)

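zhengzhou-spark commented on July 17, 2024

The article says "// the complete code can be found in the appendix at the end of this article". Where is that appendix?
The article also says "the number of batches that can be processed at the same time is 10 / 2 = 5". I don't understand why it is 5.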

from coolplayspark.

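TopSpoofer commented on July 17, 2024

@zhengzhou-spark Because there are two outputs, each batch produces two jobs, and each job needs its own thread to run. One batch of data therefore needs two threads, so 10 threads can process 5 batches concurrently.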
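
The arithmetic in this answer can be sketched in a few lines of Scala. This is a minimal illustration, not Spark code: `batchesInFlight` is a hypothetical helper, and it assumes every job occupies exactly one thread of the pool for as long as it runs.

```scala
object ConcurrentBatches {
  // Hypothetical helper, not part of Spark.
  // poolThreads:  number of threads available for running jobs (10 in the article's example)
  // jobsPerBatch: jobs generated per batch, one per output operation (2 in the example)
  def batchesInFlight(poolThreads: Int, jobsPerBatch: Int): Int = {
    require(poolThreads > 0 && jobsPerBatch > 0, "both arguments must be positive")
    // Integer division: a batch only counts if all of its jobs have a thread.
    poolThreads / jobsPerBatch
  }

  def main(args: Array[String]): Unit = {
    println(batchesInFlight(poolThreads = 10, jobsPerBatch = 2)) // prints 5
  }
}
```

With three outputs per batch the same pool would fit only `10 / 3 = 3` complete batches, with one thread left over.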

JudeLmin commented on July 17, 2024

Hello, the article mentions "// the complete code can be found in the appendix at the end of this article". Where can I find it?

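lw-lin commented on July 17, 2024

@zhengzhou-spark @JudeLmin

// the complete code can be found in the appendix at the end of this article

The code has now been added to the appendix of the original article, thanks!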

AntikaSmith commented on July 17, 2024

Why does the article say "when a Spark Streaming program starts running at ssc.start(), an instance of JobScheduler is created and then run via start()"? In the 2.10 version I looked at, scheduler is defined in StreamingContext as:
private[streaming] val scheduler = new JobScheduler(this)
There is no lazy, so it should be created when the StreamingContext is initialized, right?
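
The difference between `val` and `lazy val` that this question relies on can be checked with a small standalone sketch. The classes below (`Scheduler`, `EagerContext`, `LazyContext`) are hypothetical stand-ins, not Spark's `JobScheduler` or `StreamingContext`. They only record when the field initializer runs.

```scala
import scala.collection.mutable.ArrayBuffer

object EagerVsLazy {
  // Hypothetical stand-in for a scheduler: logs when it is constructed and started.
  class Scheduler(name: String, log: ArrayBuffer[String]) {
    log += s"$name scheduler created"
    def start(): Unit = log += s"$name scheduler started"
  }

  // Plain `val`: the initializer runs inside the constructor of EagerContext.
  class EagerContext(log: ArrayBuffer[String]) {
    val scheduler = new Scheduler("eager", log)
    def start(): Unit = scheduler.start()
  }

  // `lazy val`: the initializer is deferred until the field is first accessed.
  class LazyContext(log: ArrayBuffer[String]) {
    lazy val scheduler = new Scheduler("lazy", log)
    def start(): Unit = scheduler.start()
  }

  def main(args: Array[String]): Unit = {
    val log = ArrayBuffer.empty[String]
    val eager = new EagerContext(log)
    val deferred = new LazyContext(log)
    // Only the eager scheduler exists at this point.
    println(log.mkString(", "))
    eager.start()
    deferred.start()
    // The lazy scheduler is created only when start() first touches it.
    println(log.mkString(", "))
  }
}
```

With a plain `val`, as in the definition quoted above, the instance exists as soon as the enclosing object is constructed, and `start()` only runs it.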

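lw-lin commented on July 17, 2024

@AntikaSmith

The earlier wording was indeed wrong. As you say, there is no lazy, so the scheduler is created when the StreamingContext is initialized. I'm fixing it. Thanks for pointing this out!

Also, if you haven't joined the Streaming discussion group yet, please do.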

AntikaSmith commented on July 17, 2024

@lw-lin

OK, I've joined the group.

[unnamed user] commented on July 17, 2024

Jobs are generated and submitted on the driver side, so how are the computation tasks dispatched to the workers?

hangim commented on July 17, 2024

Which of the following two notations should the article use?

`RDD` DAG
`RDD DAG`

This affects 2.1 and 2.2.

allenlu1990 commented on July 17, 2024

@lw-lin I added iRobot but never received an invitation to the group. Could you send one?

MrYuMing commented on July 17, 2024

@lw-lin I couldn't find the spark.streaming.concurrentJobs parameter on the official website. Do you know where it is documented?
The most relevant page I know of so far: http://spark.apache.org/docs/2.3.0/configuration.html#spark-streaming
