Git Product home page Git Product logo

space-apps-2023's Introduction

Exacto | S.T.A.R. Terminal

image

High-Level Summary

We developed an innovative AI tool to help NASA scientists manage their technical documents. Using webscraping technologies we collect the current available NASA resources like new research, lessons learned and others, then automatically catalogue them into a database. Users can submit a document on our website, it automatically scans the contents, finds relevant information on the topic in our database, then uses a custom machine learning model to suggest improvements like fixing ambiguities, updating outdated info and incorporating new research. This tool would speed up the manual work that goes into keeping these documents up to date and speed up the review process.

Project Demo

https://failiem.lv/u/7pea35374b

Project Details

For our backend we used NodeJS (Express) and Python. For our frontend we used Next.js. For our database we used Pinecone and MongoDB.

We used Python library Selenium to webscrape NASA lessons learned. Then we generated embeddings for every piece of information contained in the lessons learned using OpenAI Embeddings API. We stored these embeddings in a Pinecone vector database and in MongoDB document database for backup.

When a user uploads a PDF document on our website, we send it to our NodeJS backend where it gets divided into parts for which we find improvements. We generate an embedding for each of these parts, then do a vector search in our Pinecone database to find relevant information, which we then provide directly into the context of a finetuned OpenAI Chat Completions API, which finds and suggests improvements to every part of the PDF text.

This embedding and vector database approach minimizes GPT hallucinations and provides more precise info, because all of the relevant information and sources fit into GPT’s context window, unlike traditional finetuned models, which even with finetuning are prone to forgetting or providing nonexistant info with incorrect sources.

Use of Artificial Intelligence

We used ChatGPT to help write simple parts of our code. We used PerplexityAI to help conduct research into what tools we could use for the job.

Space Agency Data

Link URL Link Text
https://standards.nasa.gov/all-standards NASA standards
https://llis.nasa.gov/ NASA Lessons Learned

References

Link URL Link Text
https://platform.openai.com/docs/api-reference OpenAI API
https://andriymulyar.com/blog/how-to-visualize-pinecone-vector-database Visualizing Pinecone vector databse

space-apps-2023's People

Contributors

btosty avatar piparkuucinsh avatar robzlegz avatar rudolfsrijkuris avatar

Stargazers

 avatar  avatar

Watchers

 avatar

Forkers

roblzs roblzs

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.