STEPS

Paper

Interactive Text-to-SQL Generation via Editable Step-by-Step Explanations

Dataset

  1. Download the original Spider dataset.
  2. Generate the text-to-clause dataset using SQL2NL/SQL2NL.py:
  • Set instruction = 'trainingData'.
  • You can design your own explanation template within the method "parseSQL()".
  • Run python SQL2NL.py; the dataset is written to "dataset/structured/spider/train_spider.json".
  • Alternatively, download our raw text-to-clause dataset. Put it in the same directory as the original Spider dataset and include all the databases (for more information, see https://github.com/taoyds/spider).
  3. Paraphrase the text-to-clause dataset (optional).
  • You can paraphrase the dataset with Quillbot using our automated script, which is based on PyAutoGUI.
  • Check the script and all screenshots under here. The screenshots are used to position the cursor during the automation. Because of subtle resolution, theme, or version differences, the screenshots may not be recognized on your computer (even if a human can recognize them); in that case, take your own screenshots on your computer and replace ours manually.
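
To give a feel for what an explanation template in step 2 might look like, here is a minimal sketch that maps each clause of a flat SQL query to one natural-language step. It is not the code in SQL2NL.py: the function name explain_clauses, the TEMPLATES table, and the wording of each template are all hypothetical, and nested queries, joins, and set operations are deliberately not handled.

```python
import re

# Hypothetical templates, one per SQL clause keyword.
# These are illustrative only and are NOT the templates used in SQL2NL.py.
TEMPLATES = {
    "SELECT": "Return {body}.",
    "FROM": "Use table {body}.",
    "WHERE": "Keep the records where {body}.",
    "GROUP BY": "Group the records based on {body}.",
    "ORDER BY": "Sort the records based on {body}.",
    "LIMIT": "Return the top {body} records.",
}

# The capturing group makes re.split keep the matched keywords in the result.
CLAUSE_RE = re.compile(
    r"\b(SELECT|FROM|WHERE|GROUP BY|ORDER BY|LIMIT)\b", re.IGNORECASE
)


def explain_clauses(sql):
    """Split a flat SQL query into clauses and explain each one.

    Returns a list of (clause keyword, explanation) pairs in query order.
    Assumes a single, non-nested query with single spaces inside keywords.
    """
    parts = CLAUSE_RE.split(sql.strip().rstrip(";"))
    # parts looks like: [prefix, keyword1, body1, keyword2, body2, ...]
    steps = []
    for keyword, body in zip(parts[1::2], parts[2::2]):
        key = keyword.upper()
        steps.append((key, TEMPLATES[key].format(body=body.strip())))
    return steps


if __name__ == "__main__":
    for clause, text in explain_clauses("SELECT name FROM singer WHERE age > 30;"):
        print(f"{clause}: {text}")
```

Keeping the templates in a plain dictionary means the wording of one clause type can be changed without touching the splitting logic, which is the kind of customization the "parseSQL()" note above refers to.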

Models

  1. Our text-to-clause model is based on SmBoP; you can follow their environment and settings exactly.
  2. You can directly download and reuse our checkpoint (HuggingFace) and our configuration file.
  • Replace the original configuration file with ours!

To help you understand the project logic, we have also packaged most of the project folder, which you can download directly to check the configuration: Folder

This repository is currently being updated, and more details will be provided in the future. If you have any questions, please feel free to email [email protected]

Thanks!

steps's Issues

What is the "Text-to-SQL model"?

Hi there. Thanks for your marvelous work. I have some questions.

  1. In the Ask part of your system, a Text-to-SQL model is used to generate the initial SQL query. Which model do you use? Since you use SmBoP as your text-to-clause model, what do you mean in your experiment when you say the "EditSQL" model is used as the base model? Is EditSQL used as the text-to-SQL model and the fine-tuned SmBoP used as the text-to-clause model?
  2. I understand that your system aims to explore the upper bound with perfect human feedback. However, do you use ChatGPT or another LLM to act as the human providing feedback?
    Looking forward to your reply!