Git Product home page Git Product logo

Comments (5)

rongou avatar rongou commented on August 20, 2024

Thanks for the report! Searching around for this problem, there doesn't seem to be a definitive answer. In the kubernetes code base it's mostly set to minutes. Can you try something like kubeflowInformerFactory := informers.NewSharedInformerFactory(kubeflowClient, 10*time.Minute) to see if that solves your problem?

from mpi-operator.

rongou avatar rongou commented on August 20, 2024

By the way I'm really interested in where you are running these thousands of mpi jobs and for what purpose. :) Can I quote you guys as a user/customer of mpi-operator?

from mpi-operator.

answer3x avatar answer3x commented on August 20, 2024

@rongou

I have thousands of mpijob,most of them are successed or failed. If the resync period is short, mpi-operator cost many time to sync the finished mpijob which have nothing change.

I support our group's deep learning platform which has many many jobs.

Yes, you can quote me as a user/customer of mpi-operator

from mpi-operator.

rongou avatar rongou commented on August 20, 2024

Right, setting the resync period to 0 means to never resync, that should solve your problem.

from mpi-operator.

rongou avatar rongou commented on August 20, 2024

@answer3y do you mind telling me your group/company's name? If you don't feel comfortable sharing it on github, you can email me directly (rong dot ou at gmail dot com), or hit me up on the kubeflow slack. Thanks!

from mpi-operator.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.