LLM Paper QA

LLM Paper QA is a tool that reads, parses, and summarizes scientific papers hosted on arXiv. It uses OpenAI's GPT-4 (gpt-4-32k) to generate the summaries, which makes it useful for quickly grasping the key points of a paper without reading the entire document.

Quick Start

Before running the script, set up your environment variables. The repository provides an example file, .env.example. Copy it to a new file named .env, then replace the placeholders with your own values.

cp .env.example .env
# edit .env
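
How the script reads .env is not described here; many Python projects use a helper library for it. As a rough illustration of what such a file contains, the following sketch parses simple KEY=VALUE lines using only the standard library. The function names and the EXAMPLE_KEY variable in the usage note are hypothetical and are not taken from the repository.

```python
import os
from pathlib import Path


def load_env_file(path=".env"):
    """Parse simple KEY=VALUE lines from a file and return them as a dict."""
    values = {}
    for raw_line in Path(path).read_text(encoding="utf-8").splitlines():
        line = raw_line.strip()
        # Skip blank lines and comments.
        if not line or line.startswith("#"):
            continue
        key, sep, value = line.partition("=")
        if not sep:
            continue  # not a KEY=VALUE line, ignore it
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def apply_env(values):
    """Copy parsed values into os.environ without overwriting existing ones."""
    for key, value in values.items():
        os.environ.setdefault(key, value)
```

For example, a line such as EXAMPLE_KEY="abc" in .env would be returned as {"EXAMPLE_KEY": "abc"}.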

After setting up your environment variables, install the required packages using pip:

pip install -r requirements.txt

Once the required packages are installed, you can run the script with the following command:

python src/read_paper.py [arxiv_id]

Replace [arxiv_id] with the ID of the paper from arXiv that you wish to summarize.
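
An arXiv ID is the identifier that appears in the paper's URL, such as 1706.03762. The sketch below shows one way to check that a string looks like an arXiv ID before passing it to the script. It is only an illustration: the helper name is hypothetical, and whether the script performs any such validation itself is not stated here.

```python
import re

# New-style IDs: YYMM.NNNN or YYMM.NNNNN, with an optional version suffix.
NEW_STYLE = re.compile(r"^\d{4}\.\d{4,5}(v\d+)?$")
# Old-style IDs: archive[.SUBJECT]/YYMMNNN, with an optional version suffix.
OLD_STYLE = re.compile(r"^[a-z\-]+(\.[A-Z]{2})?/\d{7}(v\d+)?$")


def is_arxiv_id(text):
    """Return True if text looks like a new-style or old-style arXiv ID."""
    return bool(NEW_STYLE.match(text) or OLD_STYLE.match(text))
```

With this check, 1706.03762, 1706.03762v5, and hep-th/9901001 are accepted, while a full URL is rejected.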

Options

The script supports several command-line options:

--dry-run: If True, don't actually generate a summary. The paper will still be downloaded and cached, but no calls will be made to the LLM.

--keep-ref: If True, keep the paper's references when summarizing.

--keep-app: If True, keep the paper's appendices when summarizing.

--keep-latex: If True, keep the LaTeX in the paper when summarizing; otherwise, it is converted to plain text.

--use-ar5iv: If True, use ar5iv instead of arxiv-vanity.

--force-refresh: If True, force the paper to be refreshed, ignoring the cache.
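
The options above could be declared roughly as follows with argparse. This is a sketch under assumptions, not the script's actual code: it treats every option as a plain on/off switch, whereas the "If True" wording suggests the real script may accept explicit boolean values or use a different command-line library. The build_parser name is hypothetical.

```python
import argparse


def build_parser():
    """Build a parser mirroring the documented options (assumed to be switches)."""
    parser = argparse.ArgumentParser(prog="read_paper.py")
    parser.add_argument("arxiv_id", help="ID of the arXiv paper to summarize")
    parser.add_argument("--dry-run", action="store_true",
                        help="download and cache the paper, but make no LLM calls")
    parser.add_argument("--keep-ref", action="store_true",
                        help="keep the references")
    parser.add_argument("--keep-app", action="store_true",
                        help="keep the appendices")
    parser.add_argument("--keep-latex", action="store_true",
                        help="keep LaTeX instead of converting it to plain text")
    parser.add_argument("--use-ar5iv", action="store_true",
                        help="use ar5iv instead of arxiv-vanity")
    parser.add_argument("--force-refresh", action="store_true",
                        help="ignore the cache and refresh the paper")
    return parser


if __name__ == "__main__":
    args = build_parser().parse_args()
    print(args)
```

argparse converts dashes to underscores, so --dry-run is read as args.dry_run, and any switch that is not passed defaults to False.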

Output

The script outputs a text file containing the generated summary of the paper. This summary includes the paper's metadata, number of tokens in the input prompt, number of tokens in the generated content, and the content itself.
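
To make the shape of that file concrete, here is a minimal sketch that assembles the four parts into one text report. The layout, the field labels, and the format_report name are all assumptions for illustration; the script's real output format may differ.

```python
def format_report(metadata, prompt_tokens, completion_tokens, content):
    """Join metadata, token counts, and the summary into a single text report."""
    # One "key: value" line per metadata field (field names are hypothetical).
    lines = [f"{key}: {value}" for key, value in metadata.items()]
    lines.append(f"prompt_tokens: {prompt_tokens}")
    lines.append(f"completion_tokens: {completion_tokens}")
    lines.append("")  # blank line between the header and the summary
    lines.append(content)
    return "\n".join(lines) + "\n"
```

Calling format_report({"title": "T"}, 10, 5, "Summary.") yields a header of three lines, a blank line, and then the summary text.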

If the CodiMD client is enabled and the script is not running in dry-run mode, the summary is also published to a CodiMD document, and its URL is printed to the console.

Known Issues

If a paper is not found, is still being rendered, or has failed to render, the script will print an error message and exit.
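
The three failure cases can be pictured as a small status check that runs before summarization. The sketch below is hypothetical: the RenderStatus enum, the message wording, the use of stderr, and the exit code of 1 are assumptions, since the text above only states that an error message is printed and the script exits.

```python
import sys
from enum import Enum


class RenderStatus(Enum):
    OK = "ok"
    NOT_FOUND = "not_found"
    RENDERING = "rendering"
    FAILED = "failed"


# Human-readable messages for each failure case (wording is illustrative).
MESSAGES = {
    RenderStatus.NOT_FOUND: "paper not found",
    RenderStatus.RENDERING: "paper is still being rendered, try again later",
    RenderStatus.FAILED: "paper failed to render",
}


def ensure_rendered(arxiv_id, status):
    """Return normally if the paper is ready; otherwise print an error and exit."""
    if status is RenderStatus.OK:
        return
    print(f"Error ({arxiv_id}): {MESSAGES[status]}", file=sys.stderr)
    sys.exit(1)
```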

Please report any issues you encounter on the project's issue tracker.

Contributing

Contributions to LLM Paper QA are welcome.

License

This project is licensed under the MIT License - see the LICENSE.md file for details.

(README.md generated using GPT-4.)
