Bug Deion This is not really a bug. In the Title Extraction

[Bug]: Redundant calculation in the title extractor about llama_index HOT 2 CLOSED

erlebach commented on June 29, 2024

[Bug]: Redundant calculation in the title extractor

from llama_index.

Comments (2)

dosubot commented on June 29, 2024 2

To optimize the extract_titles() function and avoid redundant LLM calls when there is only one node per document, you can modify the function as follows:

async def extract_titles(self, nodes_by_doc_id: Dict) -> Dict:
    titles_by_doc_id = {}
    for key, nodes in nodes_by_doc_id.items():
        if len(nodes) == 1:
            # Directly use the title from the single node
            titles_by_doc_id[key] = await self.llm.apredict(
                PromptTemplate(template=self.node_template),
                context_str=cast(TextNode, nodes[0]).text,
            )
        else:
            title_candidates = await self.get_title_candidates(nodes)
            combined_titles = ", ".join(title_candidates)
            titles_by_doc_id[key] = await self.llm.apredict(
                PromptTemplate(template=self.combine_template),
                context_str=combined_titles,
            )
    return titles_by_doc_id

This modification checks if there is only one node for a document and directly uses the title from that node, avoiding the need to combine multiple titles. This approach leverages the node_template for single nodes and the combine_template for multiple nodes, as described in the TitleExtractor class ^[1]^[2].

Details

About Dosu
This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.

Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

To continue the conversation, mention @dosu.

from llama_index.

erlebach commented on June 29, 2024

My comment was meant for the maintainers of the repo.

from llama_index.

[Bug]: Redundant calculation in the title extractor about llama_index HOT 2 CLOSED

Comments (2)

Details

Related Issues (20)

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent