Comments (12)

mrocklin commented on May 29, 2024

Oh, and if anyone has a better name, that would be welcome. I've copied over what was in the pangeo repository.

from dask-jobqueue.

jakirkham commented on May 29, 2024

Thanks @mrocklin for doing this and everyone involved in writing and sharing the code. :)

Had been thinking recently that it would be nice if there was something to construct shim shell scripts for getting dask-worker jobs launched on schedulers and connected back to a Distributed Client. Was about to raise an issue over at Distributed where it may or may not have been appropriate. Happy to see this emerge around the same time. :)
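To make the idea of a generated shim script concrete, here is a minimal sketch that renders a PBS job script starting one dask-worker and pointing it at a scheduler address. The function name, its parameters, and the placeholder address are hypothetical and are not the dask-jobqueue API; the `#PBS` directives assume a PBS Pro style `select` resource syntax, which may differ on other deployments.

```python
import shlex


def render_pbs_worker_script(scheduler_address, name="dask-worker", queue=None,
                             ncpus=4, memory="16GB", walltime="00:30:00",
                             death_timeout=60):
    """Return the text of a PBS job script that starts one dask-worker.

    Hypothetical helper for illustration only; not part of dask-jobqueue.
    """
    lines = ["#!/bin/bash", f"#PBS -N {name}"]
    if queue is not None:
        lines.append(f"#PBS -q {queue}")
    # Resource request syntax is an assumption (PBS Pro style).
    lines.append(f"#PBS -l select=1:ncpus={ncpus}:mem={memory}")
    lines.append(f"#PBS -l walltime={walltime}")

    # The worker connects back to the scheduler that the Client also uses.
    worker_cmd = [
        "dask-worker", scheduler_address,
        "--nthreads", str(ncpus),
        "--memory-limit", memory,
        "--death-timeout", str(death_timeout),
    ]
    lines.append("")
    lines.append(" ".join(shlex.quote(part) for part in worker_cmd))
    return "\n".join(lines) + "\n"


# Placeholder scheduler address and queue name, for illustration.
script = render_pbs_worker_script("tcp://10.0.0.1:8786", queue="batch")
print(script)
```

The resulting text could be written to a file and handed to `qsub`; submission is left out here so the sketch runs without a PBS installation.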

jhamman commented on May 29, 2024

I don't have any problem using the dask license or losing my commit history, remarkable as it was.

guillaumeeb commented on May 29, 2024

Hi everyone,

I've followed Dask and Pangeo for a few months now, and I was wondering where to start collaborating with you guys. I feel that this may be the right place for me, as it is the closest to my work environment (Dask on a PBS cluster).

I will follow the issues opened here, but do not hesitate to cc me if you need some tests on an alternative environment, and tell me if you think of a simple task for which you need some manpower.

Cheers!

mrocklin commented on May 29, 2024

Thanks @guillaumeeb ! I see three main directions of work short term:

  1. Build CI systems. This requires some understanding of docker and docker-compose. See #2
  2. Add new implementations for new job schedulers. Given that you are most familiar with PBS and that PBS already exists this might not be a good fit.
  3. Proceed through the actions of a beginning user and try to use PBSCluster on your cluster. Report back any confusion or bugs that you encounter. This code was developed on a couple of PBS instances, but I suspect that it is not yet completely general and will fail in interesting ways on other PBS deployments.

jedwards4b commented on May 29, 2024

I don't have any concerns with license or history - thanks for taking this.

jakirkham commented on May 29, 2024

To finish up item 2, have put up PR ( #9 ), which just copies Dask's License file and sets the copyright year to 2018. Sounds like we are in agreement on that. It also includes some stuff to package the license and some misc packaging things. Requested reviews from the original code owners. Hope that is ok. :)

guillaumeeb commented on May 29, 2024

@mrocklin
I can begin with point 3. Do you plan to add some beginner documentation on how to set up the module, or even on how to set up a Dask Python environment with a job queue, so that users have the prerequisites to run the example in the README (I could write it, or can we point to some equivalent documentation)? Should we add some cluster scheduler specific documentation on how to check that the Dask cluster has started (e.g. using qstat for PBS)?
Point 3 will also give me some insights to help on issue #7.

I will then be happy to help on point 1 (issue #2) by beginning to set up a PBS docker environment, if no one else has the bandwidth to do it! I hope I can find the time.
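On the question of checking that the Dask cluster has started, one scheduler-independent option is to poll the number of connected workers until it reaches the expected count. The sketch below is only an illustration: `wait_for_workers` and its parameters are hypothetical names, and the fake cluster stands in for a real one so the example runs anywhere.

```python
import time


def wait_for_workers(count_workers, expected, timeout=300.0, interval=5.0,
                     sleep=time.sleep, clock=time.monotonic):
    """Poll until at least `expected` workers are connected or time runs out.

    `count_workers` is any zero-argument callable returning the current
    number of connected workers. Returns True on success, False on timeout.
    """
    deadline = clock() + timeout
    while True:
        if count_workers() >= expected:
            return True
        if clock() >= deadline:
            return False
        sleep(interval)


class FakeCluster:
    """Stand-in for a real cluster: reports one more worker on each poll."""

    def __init__(self):
        self.workers = 0

    def count(self):
        self.workers += 1
        return self.workers


fake = FakeCluster()
ok = wait_for_workers(fake.count, expected=3, interval=0.0)
print(ok)
```

With a Distributed Client, the callable could be something like `lambda: len(client.scheduler_info()["workers"])`. On the PBS side, `qstat -u $USER` shows whether the jobs themselves are still queued or already running.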

mrocklin commented on May 29, 2024

I would install dask: http://dask.pydata.org/en/latest/install.html

I would then pip install from this repository:

pip install git+https://github.com/dask/dask-jobqueue

We don't have anything on PyPI or conda-forge yet. Having basic documentation on RTD would be nice in the future.

> Should we add some cluster scheduler specific documentation on how to check that the Dask cluster has started

Sure?

guillaumeeb commented on May 29, 2024

Okay, had never installed a Python module from git before, no idea it was so simple ^^'.
For the documentation, shall we initialize a docs folder with basic files for Sphinx and Read The Docs integration?

Sorry for the basic questions I am (and will be) asking; I'm a beginner at open source development.

guillaumeeb commented on May 29, 2024

So here is my first report on trying PBSCluster in our environment:

  • It works 👍, and it is really easy to use. Really good work, thanks!
  • I am concerned that there is no option for configuring the --local-directory of the dask-worker. We could use the extra configuration option, but I would prefer a named option, which ties in with the discussion in #7. In particular, our PBS configuration sets a $TMPDIR environment variable that should be used.
  • I could not find a simple cluster.stop() method. I tried to use cluster.scheduler.close() and take advantage of the death-timeout, but apparently it did not work. I think it would be good to have such a cluster.stop() method that stops every PBS job and the scheduler process.
  • One minor point (probably because I did not search long enough): I could not find a method to get the scheduler and bokeh ip:port bindings, like what can be found in a written scheduler.json file.
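To illustrate the second and third points, here is a rough sketch of what a named local-directory option and a stop() method might look like. Everything in it is an assumption for discussion: the `JobTracker` class, its methods, and the example job IDs are hypothetical and do not describe how PBSCluster is implemented. The `run` argument is injectable so the sketch can be exercised without a PBS installation.

```python
import subprocess


class JobTracker:
    """Hypothetical sketch: remembers submitted PBS job IDs so they can be stopped."""

    def __init__(self, local_directory="$TMPDIR", run=subprocess.check_call):
        self.local_directory = local_directory
        self.job_ids = []
        self._run = run  # replaced by a recorder below to avoid calling qdel

    def worker_command(self, scheduler_address):
        """Build the dask-worker command line, including --local-directory."""
        cmd = f"dask-worker {scheduler_address}"
        if self.local_directory:
            # Left unquoted on purpose so the job's shell expands $TMPDIR.
            cmd += f" --local-directory {self.local_directory}"
        return cmd

    def register(self, job_id):
        """Record the ID of a submitted job (for example, one printed by qsub)."""
        self.job_ids.append(job_id)

    def stop(self):
        """Cancel every tracked job with a single qdel call and forget the IDs."""
        if self.job_ids:
            self._run(["qdel"] + self.job_ids)
        self.job_ids = []


# Record the commands instead of executing them.
calls = []
tracker = JobTracker(run=calls.append)
print(tracker.worker_command("tcp://10.0.0.1:8786"))

tracker.register("1234.pbs-server")  # made-up job IDs
tracker.register("1235.pbs-server")
tracker.stop()
print(calls)
```

This only covers cancelling the worker jobs; shutting down the scheduler process, which the third point also asks for, is not addressed here.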

jhamman commented on May 29, 2024

I'm going to close this. There is a smattering of comments/issues that we may want to open specific issues for, but this issue isn't the right place to work on those.
