Community Note Please vote on this issue by adding a 👍 <a

Suggested Architecture SHS will exist as a wholly separate d

Alternatively-- <a class="user-mention notranslate" data-hovercard-type="user" data-ho

[FEATURE] Manage Spark History Server as a deployment via the Spark Operator helm chart. about spark-operator HOT 8 OPEN

peter-mcclonski commented on July 21, 2024 5

[FEATURE] Manage Spark History Server as a deployment via the Spark Operator helm chart.

from spark-operator.

Comments (8)

KhASQ commented on July 21, 2024 1

Kindly make the spark history server part of the operator.
I think targeting this operator as single point for spark on K8s eco system will add much better momentum for the development.

For example integrating spark operator to manage an external shuffle service on K8s.

Sorry for interrupting but I am so excited about the new development on this operator

from spark-operator.

peter-mcclonski commented on July 21, 2024

Suggested Architecture

SHS will exist as a wholly separate deployment from spark-operator, as a disjoint chart.
In order to resolve the problem of dynamically pulling in dependencies/packages, an initcontainer shall be spun up which populates a volume with the union of the default $SPARK_HOME/jars and the result of java -Divy.cache.dir=$SPARK_HOME -Divy.home=$SPARK_HOME -jar $SPARK_HOME/jars/ivy-2.5.1.jar -dependency [PACKAGE]. This populated volume shall be mounted in the SHS container as $SPARK_HOME/jars
$SPARK_HOME/conf/spark.conf shall be mounted as a volume populated by a raw text block in the helm chart.
Log storage shall default to a PVC.
If SHS is enabled, that does not necessarily imply that logging is enabled in your spark job configuration.

from spark-operator.

peter-mcclonski commented on July 21, 2024

Did some initial work on this just to feel it out-- Got automatic resolution of packages working via initcontainers. It's a bit gross, but it works as a start.

Major TODO items:

Add arbitrary volume/volumeMount support
Add support for pulling jars, rather than solely packages
Add a clean mechanism for mounting spark-defaults.conf
Create an example that works out of the box-- The hard part being a zero-barrier-to-entry Volume accessible across nodes.
Docs updates
General cleanup / hardening

from spark-operator.

peter-mcclonski commented on July 21, 2024

Alternatively-- @yuchaoran2011 Do you think it would be worth reviving https://artifacthub.io/packages/helm/cloudnativeapp/spark-history-server and the associated chart and (potentially) having it live here, adjacent to but disconnected from the actual operator chart? I think the real problem here isn't so much that operator should be managing the history server directly, and more that history server, a valuable part of the spark ecosystem, doesn't have any good helm charts out in the wild. We're working on one as part of boozallen/aissemble#66 (https://github.com/boozallen/aissemble/pull/80/files), covered by our BAPL (not as permissive as, say, Apache) solely because we couldn't find an existing OSS solution that was up to date, maintained, and flexible.

from spark-operator.

yuchaoran2011 commented on July 21, 2024

I'm not sure if it's a good idea to have history server co-deployed with operator. A single history server can aggregate jobs managed by multiple Spark operator deployments across multiple k8s clusters

I think the real problem here isn't so much that operator should be managing the history server directly, and more that history server, a valuable part of the spark ecosystem, doesn't have any good helm charts out in the wild.

I agree. I haven't looked at the quality of https://artifacthub.io/packages/helm/cloudnativeapp/spark-history-server, but if it's something you have used, I'm for that idea

from spark-operator.

peter-mcclonski commented on July 21, 2024

I'm not sure if it's a good idea to have history server co-deployed with operator. A single history server can aggregate jobs managed by multiple Spark operator deployments across multiple k8s clusters

I think the real problem here isn't so much that operator should be managing the history server directly, and more that history server, a valuable part of the spark ecosystem, doesn't have any good helm charts out in the wild.

I agree. I haven't looked at the quality of https://artifacthub.io/packages/helm/cloudnativeapp/spark-history-server, but if it's something you have used, I'm for that idea

Sounds reasonable to me. Wrt the helm chart I linked, I wasn't sure if you had specific thoughts, given that you're listed as the maintainer on artifacthub

from spark-operator.

yuchaoran2011 commented on July 21, 2024

Ah upon a closer look, now I remember that I initially created this chart many years ago. I haven't used it for a long time though and won't count on it still being production ready

from spark-operator.

peter-mcclonski commented on July 21, 2024

I think there's both interest and clearly an unfilled need in the community for a production ready, standalone spark history chart that's well maintained. Would kubeflow and the spark operator maintainers be open to one being created in this repo, or would it be better housed somewhere totally separate?

from spark-operator.

[FEATURE] Manage Spark History Server as a deployment via the Spark Operator helm chart. about spark-operator HOT 8 OPEN

Comments (8)

Suggested Architecture

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent