
Comments (5)

abcdefgs0324 commented on July 20, 2024

Thanks for reporting this. It may be a bug on our side. To unblock you, could you temporarily set base_image to a local image name that does not contain /? Thanks.

from python-aiplatform.

shotarok commented on July 20, 2024

@abcdefgs0324 Thank you for the idea of renaming the image locally! It turned out that / was not the root cause. After I gave nvidia/cuda:11.1.1-devel-ubuntu20.04 a short name without /, the error still occurred in my environment. I'll get back to you once I can narrow down the cases where the error happens. Thanks.


zetyquickly commented on July 20, 2024

Hi @abcdefgs0324,

Is there any suggestion for how a user can set up base_image with NVIDIA/PyTorch on board?

Both options below produce the error described in the original post:

  • base_image="us-docker.pkg.dev/vertex-ai/prediction/pytorch-gpu.2-0:latest"
  • base_image="pytorch/pytorch:2.1.0-cuda12.1-cudnn8-runtime"


zetyquickly commented on July 20, 2024

What is the purpose of this image? Is it meant to be a base for CPR or just an example?


bloukanov commented on July 20, 2024

> @abcdefgs0324 Thank you for giving me the idea of renaming the image name locally! It turned out / was not the root cause. After I gave a short name without / to nvidia/cuda:11.1.1-devel-ubuntu20.04, the error happened to my environment. I'll get back to you once I can narrow down the cases where the error happens. Thanks

Did you end up figuring out what caused this? I am having the same issue. I wish there was a way to view the internal output from Docker to be able to debug this.

EDIT:
I was able to get the debug output locally by editing site-packages/google/cloud/aiplatform/docker_utils/local_util.py, line 59.

Change _logger.info(line) to print(line):

        for line in out:
            print(line)
            # _logger.info(line)

There is probably a way to instantiate a logger locally to catch these lines? I am not too familiar with logging, but I did notice that the logging does work when this runs in the cloud.
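One possible way to do this without editing the installed package is to configure a standard-library logger before calling the SDK. The sketch below is only that: it assumes the module's _logger is (or wraps) a logging.Logger whose name sits under "google.cloud.aiplatform", which I have not verified, and enable_info_logging is a hypothetical helper, not part of the SDK.

```python
import logging
import sys


def enable_info_logging(logger_name: str = "google.cloud.aiplatform") -> logging.Logger:
    """Attach a stdout handler at INFO level to the named logger and return it.

    The default logger name is an assumption about how the SDK names its loggers.
    """
    logger = logging.getLogger(logger_name)
    logger.setLevel(logging.INFO)
    # Avoid stacking duplicate handlers if this helper is called more than once.
    if not any(getattr(h, "_added_by_sketch", False) for h in logger.handlers):
        handler = logging.StreamHandler(sys.stdout)
        handler.setLevel(logging.INFO)
        handler.setFormatter(logging.Formatter("%(name)s: %(message)s"))
        handler._added_by_sketch = True
        logger.addHandler(handler)
    return logger


enable_info_logging()
# Loggers with dotted child names inherit the level and propagate records
# to the handler attached above, so this line is written to stdout.
logging.getLogger("google.cloud.aiplatform.docker_utils.local_util").info("example line")
```

If the Docker build output still does not appear after this, the SDK's logger probably does not follow the naming assumed here, and the print(line) workaround above remains the fallback.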

Anyway, my problem was just a problem with the requirements.txt file.

