Comments (12)
The way I have got a similar setup to work in the past is to do ssh tunneling:
ssh -fN your-login@scheduler-machine -L 33023:localhost:33023
Then I can just open http://localhost:33023 on my local machine and see the status page. Slightly cumbersome but I am afraid I don't know of a more convenient way of doing it. This approach (which is not dask-specific at all) is mentioned in the distributed doc.
from dask-jobqueue.
@jhamman happy to do so. Give me a few hours (busy till later) and I'll be happy to PR in.
from dask-jobqueue.
from dask-jobqueue.
Originally, I did the following:
cluster = SGECluster(queue='default.q', walltime="1500000", processes=10, memory='20GB')
This gave the client address:
http://172.16.23.102:8787/status
After setting the interface to eth0
:
cluster = SGECluster(queue='default.q', walltime="1500000", processes=10, memory='20GB', interface='eth0')
the client address is:
http://172.16.23.102:43906/status
This was not accessible (also timed-out).
I also tried setting the cluster interface to ib0
. The client address then became:
http://10.145.71.204:8787/status
This also timed-out.
Setting the eth1
or ib1
also gave errors (likely because these interfaces are not 'enabled' somehow).
from dask-jobqueue.
Is there something that's blocking access that I'm missing?
Maybe. It might be worth checking with your IT staff.
You might consider seeing if machines on the SGE cluster can see that address, perhaps by using requests
to download that page, both from the client machine, and possibly from one of the workers:
import requests
requests.get(addr) # does this work?
client.run(requests.get, addr) # does this work?
from dask-jobqueue.
@mrocklin thanks for the help! I tried the following:
import requests
addr = "http://cluster.server.ip.addr:port/status" # with appropriate modifications.
requests.get(addr) # times out
In the absence of any information from IT, my current hypothesis is that there is something blocking the opening of ports from node to node. I'll continue to keep tabs on this issue.
from dask-jobqueue.
@lesteve thank you for the excellent tip! I have added this to my TextExpander snippets 😄
from dask-jobqueue.
@ericmjl - would you be interested in adding a note in the dask-jobqueue documentation identifying this issue? I suspect other users will run into the port-forwarding issue.
from dask-jobqueue.
you may also be interested in this PR: pangeo-data/pangeo#317
from dask-jobqueue.
We also do the same thing as @lesteve has suggested.
At some point we may consider revitalizing the JupyterLab extension for Dask (especially as JupyterLab has become more stable), which would make adding the extra ssh tunnel unnecessary. AIUI JupyterLab developers are eager to have users that can give feedback on this sort of thing. So getting help shouldn't be too hard if someone has cycles/interest to pursue this.
ref: https://github.com/dask/dask-labextension
from dask-jobqueue.
from dask-jobqueue.
Issue ( dask/dask-labextension#15 ) had some discussion about how this might be done. Not sure if that advice is still current.
from dask-jobqueue.
Related Issues (20)
- Cluster keeps appending "interface" flag to job script HOT 7
- Release 0.8.1 HOT 2
- OARCluster implementation does not let OAR take into account the memory parameter HOT 4
- `JobQueueCluster` with local worker(s) HOT 3
- Restart cluster job on task completion HOT 3
- Add CI with more tests for OAR
- dask_jobqueue tries to import non-existent function dask.utils.ignoring HOT 3
- a direct way to specify the worker spec HOT 4
- Documentation bug: interface HOT 1
- documentation: document `worker_command` kwarg
- Strange Worker KeyError when using LSFCluster. HOT 6
- Update NERSC Cori to NERSC Perlmutter in docs HOT 3
- SLURMCluster doesn't spawn new workers when old ones timeout HOT 12
- conftest.py not included in PyPI source tarball HOT 1
- CI is currently failing HOT 4
- ConnectionRefusedError HOT 2
- ImportError on ignoring attribute from dask.utils when importing dask_jobqueue HOT 2
- Resource allocation on SLURM cluster HOT 9
- Add a `py.typed` marker HOT 1
- Unable to submit jobs to PBS queue HOT 2
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from dask-jobqueue.