Comments (1)
Comments from our discussion:
Thanks for putting the notebooks together my main feedback points summarized would be:
- have summary / conclusion for each in readme (also findings, learnings)
- use same PDF documents for each example
- generate the same machine processable (JSON) output from each (plus the human baseline) for comparison and further analysis
- use dotenv in the notebooks to allow loading environment variables from an .env file for the notebook so we don't need to ensure to add / remove credentials
Better use JSON than CSV actually because then we can handle multiple properties for each entity, align it with the structure we get from diffbot -> nodes / relationships
Just comments, no action needed here:
for the triple ones (like rebel / llama-index) our challenge here is that we can't use the results out of the box, we would either have to:
- modify them to output property graph nodes or relationships
- or post-process the triples to aggregate all entity attribute triples into properties and only keep the triples that represent semantic relationships as such
- or do this during insertion of the data into the graph - aggregating when inserting, e.g. initially create/merge the nodes with their ID and subsequently merge on id + add property and for the relationships find start and end node with label and id and create relationship
for the Rebel one in: def create_triplets(tx, triplet) : if we want to look at this approach in the future we should see if we can carry the entity-type over, so we can use not just the generic :Node but in addition also a label for the type like :Person or :Organization
and then also do the attribute aggregation there
from llm-graph-builder.
Related Issues (20)
- Backend only run issue: missing fastapi_health module
- Failed to process PDFs, Youtube, or Wikipedia - error message clipped HOT 1
- Fail to run local LLM as GCP and GEMINI when set to false still errors on Google Authentication HOT 7
- Conditionally Including the GCS login flow in gcs as source
- Tour info
- Chatbot State HOT 1
- Settings modal to support generating the labels from the llm by using text given by user
- AttributeError: 'NoneType' object has no attribute 'upper'
- Error connecting Neo4j - Please check Backend connection
- Css handling of info modal
- Remove driver from Frontend
- Application Responsiveness ( Desktop,laptop,tablet)
- Need Support For ChatOpenAI when Using Local LLM
- Difficulty Deploying Project with Docker Compose and Neo4j Aura DB
- Trying to solve this error: TypeError: int() argument must be a string, a bytes-like object or a real number, not 'NoneType' HOT 2
- wikipedia URL input
- Speech state
- UI polishing
- 如何打开应用程序 HOT 1
- 如何打开应用程序 HOT 3
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from llm-graph-builder.