Comments (4)

dakinggg avatar dakinggg commented on May 29, 2024

Hi, Gaudi support is alpha, and uses DeepSpeed instead of FSDP. Most of the LLM Foundry repo is built around FSDP; in particular, that helper script assumes a single checkpoint file, which is different from what DeepSpeed produces. Unfortunately, we don't have an easy script for converting a DeepSpeed checkpoint into another format.

from llm-foundry.

greg-serochi avatar greg-serochi commented on May 29, 2024

Hi Daniel, thanks for the response, and to be clear, I am an Intel Gaudi employee. My goal here is to take what was documented in the blog: https://www.databricks.com/blog/llm-training-and-inference-intel-gaudi2-ai-accelerators and provide the specific instructions to run the MPT-1B model with 8 Gaudi cards.

Note that the mpt-1b-gaudi2.yaml on your GitHub page has FSDP commented out for Gaudi usage, so it's not being used.

So how is the blog executing the commands? You show how to run inference on 8 Gaudi cards using Hugging Face, and it seems like you have to run convert_composer_to_hf.py to get to Optimum Habana.

greg-serochi avatar greg-serochi commented on May 29, 2024

We can close this. Since the support is early, we'll stay focused on the training section only.

greg-serochi avatar greg-serochi commented on May 29, 2024

closed
