Git Product home page Git Product logo

itselgie / opencodeinterpreter Goto Github PK

View Code? Open in Web Editor NEW

This project forked from opencodeinterpreter/opencodeinterpreter

0.0 0.0 0.0 5.95 MB

OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.

Home Page: https://opencodeinterpreter.github.io/

License: Apache License 2.0

Shell 4.00% Python 95.89% Dockerfile 0.12%

opencodeinterpreter's Introduction

OpenCodeInterpreter: Integrating Code Generation with Execution and Refinement

OpenCodeInterpreter

[🏠Homepage] | [πŸ› οΈCode]


🌟 Upcoming Features

  • πŸ’‘ Open Sourcing OpenCodeInterpreter-SC2 series Model (based on StarCoder2 base)

  • πŸ’‘ Open Sourcing OpenCodeInterpreter-GM-7b Model with gemma-7b Base

πŸ””News

πŸ› οΈ[2024-02-29]: Our official online demo is deployed on HuggingFace Spaces! Take a look at Demo Page!

πŸ› οΈ[2024-02-28]: We have open-sourced the Demo Local Deployment Code with a Setup Guide.

✨[2024-02-26]: We have open-sourced the OpenCodeInterpreter-DS-1.3b Model.

πŸ“˜[2024-02-26]: We have open-sourced the CodeFeedback-Filtered-Instruction Dataset.

πŸš€[2024-02-23]: We have open-sourced the datasets used in our project named Code-Feedback.

πŸ”₯[2024-02-19]: We have open-sourced all models in the OpenCodeInterpreter series! We welcome everyone to try out our models and look forward to your participation! πŸ˜†

Introduction

OpenCodeInterpreter is a suite of open-source code generation systems aimed at bridging the gap between large language models and sophisticated proprietary systems like the GPT-4 Code Interpreter. It significantly enhances code generation capabilities by integrating execution and iterative refinement functionalities.

Models

All models within the OpenCodeInterpreter series have been open-sourced on Hugging Face. You can access our models via the following link: OpenCodeInterpreter Models.

Data Collection

Supported by Code-Feedback, a dataset featuring 68K multi-turn interactions, OpenCodeInterpreter incorporates execution and human feedback for dynamic code refinement. For additional insights into data collection procedures, please consult the readme provided under Data Collection.

Evaluation

Our evaluation framework primarily utilizes HumanEval and MBPP, alongside their extended versions, HumanEval+ and MBPP+, leveraging the EvalPlus framework for a more comprehensive assessment. For specific evaluation methodologies, please refer to the Evaluation README for more details.

Demo

We're excited to present our open-source demo, enabling users to effortlessly generate and execute code with our LLM locally. Within the demo, users can leverage the power of LLM to generate code and execute it locally, receiving automated execution feedback. LLM dynamically adjusts the code based on this feedback, ensuring a smoother coding experience. Additionally, users can engage in chat-based interactions with the LLM model, providing feedback to further enhance the generated code.

To begin exploring the demo and experiencing the capabilities firsthand, please refer to the instructions outlined in the OpenCodeInterpreter Demo README file. Happy coding!

Quick Start

  • Entering the workspace:

    git clone https://github.com/OpenCodeInterpreter/OpenCodeInterpreter.git
    cd demo
  • Create a new conda environment: conda create -n demo python=3.10

  • Activate the demo environment you create: conda activate demo

  • Install requirements: pip install -r requirements.txt

  • Create a Huggingface access token with write permission here. Our code will only use this token to create and push content to a specific repository called opencodeinterpreter_user_data under your own Huggingface account. We cannot get access to your data if you deploy this demo on your own device.

  • Add the access token to environment variables: export HF_TOKEN="your huggingface access token"

  • Run the Gradio App:

    python3 chatbot.py --path "the model name of opencodeinterpreter model family. e.g., m-a-p/OpenCodeInterpreter-DS-6.7B"

Video

demo.mp4

Contact

If you have any inquiries, please feel free to raise an issue or reach out to us via email at: [email protected], [email protected]. We're here to assist you!

Star History

Star History Chart

opencodeinterpreter's People

Contributors

zheng0428 avatar magnetic2014 avatar anitaliu98 avatar cclauss avatar opencodeinterpreter avatar wenhuchen avatar rishubi avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    πŸ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. πŸ“ŠπŸ“ˆπŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❀️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.