Comments (3)
Could you please provide more background to this request? Typically CfnCluster is deployed on a per user, project or team and doesn't require dedicated submission hosts as seen with on-premise shared cluster architectures.
from aws-parallelcluster.
I'd like to see this too. I'm setting up a workflow management system on an ECS-based server which needs to be able to submit jobs via DRMAA to a HPC scheduler. That server needs to be a submit host to submit jobs to the cfncluster's master node. I am struggling to set this up manually. I failed with SLURM, and am having trouble with SGE. But I now realize the SGE problems are due to mismatched versions between the workflow server and HPC master, so perhaps that was causing SLURM to fail as well.
from aws-parallelcluster.
@chambm Not clear how an additional node in CfnCluster will help you configure an ECS Task that is able to submit to CfnCluster however happy to discuss offline how you can achieve this setup. Please email me dougalb at amazon dot com and we can discuss.
from aws-parallelcluster.
Related Issues (20)
- Lack of guidance on how to succeed with Service-Linked-Roles Using UI to create a FSX/Lustre S3 backed cluster
- FSX and HeadNode:Iam:AdditionalIamPolicies result in failed cluster creation due to policy name restrictions HOT 1
- (3.7.0‐3.8.0) ParallelCluster API Deployment fails due to IAM Policy size exceeding service limits HOT 1
- Cluster Stopped Appearing in Lists for CLI/UI HOT 4
- Enable QUIC transport protocol for DCV on head node HOT 1
- DCV on Ubuntu 22.04 ARM not possible w/ v3.8.0 despite availability of compatble DCV binaries HOT 3
- imagebuilder custom_script.yaml is incompatible with GovCloud regions (work-around provided) HOT 1
- Redundant step required for PCluster UI to AWS Identity Center integration
- Feature request: include user instance tags with job HOT 1
- Need to use parameters without cloudformation using the cli HOT 6
- FSx creation failure due to FSx Security Group creation race condition? HOT 2
- (3.3.0-3.9.0) Potential data loss issue when removing storage with update-cluster in AWS ParallelCluster 3.3 and above HOT 1
- Question: Slurm Accounting migration between ParallelCluster versions HOT 9
- Feature Request: Allow tags to be configured in the SharedStorage.EbsSettings configuration HOT 1
- Adding support for Ubuntu 24.04 images HOT 1
- Doc issue: Setting cookbook node attribute for Closed Source Nvidia drivers in version 3.9.1? HOT 2
- FSX Timeout when changing permission after mounting HOT 2
- (3.8.0 - 3.9.1) SharedStorageType: Efs not working on arm instances HOT 2
- (3.9.0-3.9.1) Default ThreadsPerCore Slurm setting causes reduced CPU utilization
- Can't build rocky9 AMIs due to hardcoded rocky8 URL/9.4 lustre issues HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from aws-parallelcluster.