Git Product home page Git Product logo

Comments (5)

loadams avatar loadams commented on May 29, 2024 1

Hi @vikram71198 - I assume we can close this issue. If not, please comment and we can re-open.

from deepspeed.

loadams avatar loadams commented on May 29, 2024

Hi @vikram71198 - it looks like DeepSpeed is installed, what you are seeing is that you have not pre-compiled any ops. That's fine, you don't need to, the ops can be JIT compiled just fine. You probably don't need to pre-compile, but you can read more about that here and decide if you need to. If you do, determine what ops you will need and you can pre-compile those. Some ops have other dependencies, async_io, cutlass kernels, etc, that's why you see some envs with those disabled.

from deepspeed.

vikram71198 avatar vikram71198 commented on May 29, 2024

Gotcha. I explicitly pip install torch == 2.2.1+cu118 (torch == 2.2.2+cu121 is the default torch which I attempt to override), so another part of ds_report that I find confounding is this:

DeepSpeed general environment info:
torch install path ............... ['/databricks/python3/lib/python3.10/site-packages/torch']
torch version .................... 2.2.2+cu121
deepspeed install path ........... ['/databricks/python3/lib/python3.10/site-packages/deepspeed']
deepspeed info ................... 0.14.2, unknown, unknown
torch cuda version ............... 12.1
torch hip version ................ None
nvcc version ..................... 11.8
deepspeed wheel compiled w. ...... torch 2.2, cuda 12.1
shared memory (/dev/shm) size .... 560.90 GB

Why do torch version, torch cuda version & deepspeed compiled wheel all indicate torch == 2.2.2+cu121 & not 2.2.1+cu118?

The Databricks Cluster Runtime I'm currently using has CUDA == 11.8.

And yes, I run the torch installation before the DeepSpeed installation.

from deepspeed.

vikram71198 avatar vikram71198 commented on May 29, 2024

Okay, I fixed this myself. Nvm.

from deepspeed.

loadams avatar loadams commented on May 29, 2024

So we can close this issue?

from deepspeed.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.