Comments (2)
Usage: openllm build [OPTIONS] {flan-t5|dolly-v2|chatglm|starcoder|falcon|stablelm|opt|mpt}

  Package a given model into a Bento.

  $ openllm build flan-t5 --model-id google/flan-t5-large

  > NOTE: To run a container built from this Bento with GPU support, make sure
  > to have https://github.com/NVIDIA/nvidia-container-toolkit installed locally.

Options:
  --model-id TEXT                 Optional model_id name or path for (fine-tuned) weights.
  -o, --output [json|pretty|porcelain]
                                  Output format. [env var: OPENLLM_OUTPUT; default: pretty]
  --overwrite                     Overwrite the existing Bento for the given LLM if it already exists.
  --workers-per-resource FLOAT    Number of workers per assigned resource. See
                                  https://docs.bentoml.org/en/latest/guides/scheduling.html#resource-scheduling-strategy
                                  for more information. By default, this is set to 1.

                                  NOTE: The workers value passed into 'build' determines how the LLM can be
                                  provisioned in Kubernetes as well as in a standalone container. This ensures
                                  it has the same effect as 'openllm start --workers ...'.

Optimisation options: [mutually_exclusive]
  --quantize [int8|int4|gptq]     Set the quantization mode for serving in deployment.
                                  GPTQ is currently a work in progress and will be available soon.

                                  NOTE: Quantization is only available for PyTorch models.
  --bettertransformer             Apply the BetterTransformer wrapper to serve the model. This is applied at
                                  serving time.
  --enable-features FEATURE[,FEATURE]
                                  Enable additional features for building this LLM Bento. Available: mpt,
                                  fine-tune, chatglm, agents, flan-t5, playground, starcoder, openai, falcon
  --adapter-id [PATH | [remote/][adapter_name:]adapter_id][, ...]
                                  Optional adapter IDs to be included within the Bento. Note that if you are
                                  using a relative path, '--build-ctx' must be passed.
  --build-ctx TEXT                Build context. This is required if --adapter-id uses a relative path.
  --model-version TEXT            Model version for this 'model-id' if it is a custom path.
  --dockerfile-template FILENAME  Optional custom Dockerfile template to use with this BentoLLM.

Miscellaneous options:
  -q, --quiet                     Suppress all output.
  --debug, --verbose              Print debug logs.
  --do-not-track                  Do not send usage info.
  -h, --help                      Show this message and exit.
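As a concrete illustration of combining the options above (the model ID and flag values here are placeholders taken from the help text's own example, not a prescribed configuration), a build invocation might look like:

```shell
# Build a Bento for flan-t5 from a fine-tuned checkpoint, with int8
# quantization and half a GPU per worker; emit machine-readable JSON output.
openllm build flan-t5 \
  --model-id google/flan-t5-large \
  --quantize int8 \
  --workers-per-resource 0.5 \
  --output json
```

Note that if you also pass `--adapter-id` with a relative path, `--build-ctx` must be supplied as well, per the help text above.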
from openllm.
Sorry for the late reply, but are there any updates on this? Feel free to reopen if you are still running into this issue.
I can build mpt with openllm build
(tested on Linux and macOS)
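Following the note in the help text about GPU support, a typical follow-up after a successful build is to containerize the Bento and run it with GPU access. The Bento tag below is a placeholder; `openllm build` prints the actual tag on success, and running with `--gpus all` assumes the nvidia-container-toolkit is installed:

```shell
# Containerize the Bento produced by `openllm build`
# (replace the tag with the one printed by the build).
bentoml containerize mpt-service:latest

# Run the resulting image with GPU access on port 3000;
# requires nvidia-container-toolkit on the host.
docker run --rm --gpus all -p 3000:3000 mpt-service:latest
```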
Related Issues (20)
- feat: support enforce_eager option from cli HOT 1
- bug: Cannot Run an OpenLLM server regardless of where I try to get it from or what model I use HOT 6
- bug: Attempting to invoke OpenLLM from Langchain results in error HOT 2
- Availability of the OpenAI /v1/completions API Endpoint ? HOT 3
- feat: Include starcoder2
- How to deploy a model using a single machine multi card approach? HOT 1
- Documentation HOT 1
- feat: add gemma2 HOT 1
- Can openllm support local path model? HOT 10
- feat: support Qwen1.5 HOT 2
- feat: any plan to support NPU HOT 1
- bug: An exception occurred while instantiating runner 'llm-mistral-runner' HOT 2
- bug: Not enough data for satisfy transfer length header HOT 3
- feat: Can you support llama3? HOT 3
- bug: WARNING: openllm 0.4.44 does not provide the extra 'gemma' HOT 1
- feat: support LMDeploy backend HOT 7
- bug: error coming up while install the vllm using pip install "openllm[vllm]" HOT 1
- For AMD/GPU, how to use multi GPUS in the api_server.py HOT 2
- bug: pip package version issues
- feat: Multimodal LLMs? HOT 1