Git Product home page Git Product logo

vinay2k2 / improving-rag-systems-dhs2024 Goto Github PK

View Code? Open in Web Editor NEW

This project forked from dipanjans/improving-rag-systems-dhs2024

0.0 0.0 0.0 10.98 MB

This repository will contain the presentation and python jupyter notebooks for the DataHack Summit 2024 conference talk, Improving Real-world Retrieval Augmented Generation Systems, focusing on the key challenges and practical solutions of how to solve them

License: GNU General Public License v3.0

Jupyter Notebook 100.00%

improving-rag-systems-dhs2024's Introduction

Improving Real-World RAG Systems - Key Challenges and Practical Solutions

Everyone knows how to build RAG systems, but how do you improve them? Retrieval Augmented Generation (RAG) systems have quickly become among the industry's biggest successes for driving Generative AI use cases on custom enterprise data. However, with their success comes a whole list of pain points that can lead to failure or sub-optimal performance in RAG systems.

This session is inspired by the famous paper “Seven Failure Points When Engineering a Retrieval Augmented Generation System” by Barnett et al., which discusses some of the major challenges and points of failure in RAG Systems. However, clear solutions to these challenges are not mentioned in detail.

This session aims to bridge this gap where we will cover the major challenges and pain points when building real-world RAG systems, which include:

  1. Missing Content
  2. Missed the Top Ranked Documents
  3. Not in Context
  4. Not Extracted
  5. Wrong Format
  6. Incorrect Specificity
  7. Incomplete

Besides discussing the challenges, we will also discuss practical solutions of how we could address these challenges using the latest and best techniques, including:

  • Better data cleaning and prompting
  • More intelligent chunking
  • Better retrieval strategies like Reranking and Compression
  • Effect of embedding models and how can we fine-tune such models
  • Output parsers for better response format adherence
  • Query transformations
  • Latest advancements in RAG systems like GraphRAG, Agentic RAG, CRAG, RAFT, etc
  • Can long-context LLMs help?

The overall structure of the talk would involve discussing each challenge, discussing potential solutions, and also showcasing some of these with hands-on code leveraging popular frameworks like LangChain and LlamaIndex.

Key Takeaways:

  • Learn about the common challenges and pain points when building real-world RAG Systems
  • Understand practical solutions for tackling each pain point which can lead to failure in RAG Systems
  • Learn concepts and hands-on implementations of solutions, including data processing, chunking, reranking, embedding models, parsers, query transformers, and more
  • Discuss some of the latest advancements in Generative AI and RAG systems like Agentic RAG, CRAG, RAFT, and long context LLMs

improving-rag-systems-dhs2024's People

Contributors

dipanjans avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.