Git Product home page Git Product logo

Comments (10)

pooruss avatar pooruss commented on August 13, 2024 1

Hi, we have released the lora version here. Tool-llama trained with full parameters will be released soon. Please keep following our updates.

from toolbench.

Tomsentable avatar Tomsentable commented on August 13, 2024

Thanks for your reply, and the another i want to ask you is "can the weight i fine_tune to replace the "lora_version toolllama weight " as detal weight "

from toolbench.

pooruss avatar pooruss commented on August 13, 2024

If you train a lora version of tool llama model, sure you can replace the released lora weights with your own weights. If you train with full parameters, you can replace the /path/to/ToolLLaMA/weights with your saved model path:
python toolbench/inference/inference_single_tool.py --tool_name weather --model_path /path/to/ToolLLaMA/weights

from toolbench.

Tomsentable avatar Tomsentable commented on August 13, 2024

thank you! Before infernece(i have finetuned a weight), should i perform this step(apply base weight and delta weight) to get tool_llama weight to inference? And i can not see the delta weight, the link you shared is data not a model.

from toolbench.

pooruss avatar pooruss commented on August 13, 2024

No, you do not need to apply any delta weight if you fine-tune a model yourself. Just replace the /path/to/ToolLLaMA/weights with your own model path~

from toolbench.

Tomsentable avatar Tomsentable commented on August 13, 2024

thank you for your reply

from toolbench.

Tomsentable avatar Tomsentable commented on August 13, 2024

Dear, the multi data, such as meta_file.json and the first conversation of it, some action look like not a tool use,i want to know what the mean and purpose(i guess that is just to teach LLM how to do, but i don know when inference LLM how to achieve that)
b97d9f5d81901772eeafe5c572db7e9

from toolbench.

pooruss avatar pooruss commented on August 13, 2024

Detailed information about the tools can be found in BMTools. In multi tools scenario, we select several tools from BMTools.

Specifically, in meta analysis scenario, chemical-prop/meta_analysis/Slides Making/Wikipedia/file_operation/Bing_search are selected. The actions shown above are from Slides Making.

from toolbench.

Tomsentable avatar Tomsentable commented on August 13, 2024

The weights saved bad(can not be loaded),which show cuda out of memory,how to do that?

from toolbench.

Tomsentable avatar Tomsentable commented on August 13, 2024

error occur when training : Failed to clone() tensor with name _fsdp_wrapped_module.model.layers.31.mlp.up_proj.weight on rank 0. This may mean that this state_dict entry could point to invalid memory regions after returning from state_dict() call if this parameter is managed by FSDP,

from toolbench.

Related Issues (20)

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.