Chat with your Data (cwd) Benchmark Data

Introduction

This repository contains the data and metadata for the "Chat with your Data" benchmark. The project aims to provide a comprehensive set of test scenarios for language-to-query systems, specifically those that generate SQL and SPARQL.

It tests whether these systems can accurately convert natural language questions into valid, effective queries against various data sources.

Repository Structure

This repository is divided into multiple directories, each containing a specific type of data or metadata:

  • ontology/: This directory contains OWL file(s) representing the ontology data.

  • DDL/: This directory contains the DDL definitions for the database schema.

  • investigation/: Each Turtle (.ttl) file in this directory represents a complete benchmark investigation, which includes pointers to the dataset, metadata, and a set of inquiries.

  • data/: This directory contains the dataset(s) used for the benchmark. In addition to the CSV files, the data is represented in multiple formats to support a wide range of query languages.

  • RDF-View/: This directory contains an R2RML file that describes the mapping between the ontology and the data tables.
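As a quick sanity check after cloning, the layout described above can be verified with a short script. This is a minimal sketch, not part of the repository: the function name `missing_dirs` is hypothetical, and it assumes the five directories sit directly under the path you pass in, which may not match the actual layout (for example, they could live under a dataset folder).

```python
from pathlib import Path

# Directory names copied from the list above.
EXPECTED_DIRS = ["ontology", "DDL", "investigation", "data", "RDF-View"]


def missing_dirs(root):
    """Return the expected directory names that do not exist under root."""
    root = Path(root)
    return [name for name in EXPECTED_DIRS if not (root / name).is_dir()]
```

Calling `missing_dirs("path/to/checkout")` returns an empty list when every directory is present, and otherwise lists the ones that were not found.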

File Formats

  • OWL: Web Ontology Language, used for representing the ontology data.

  • DDL: Data Definition Language, used for defining and managing databases.

  • TTL: Turtle form of RDF, used to represent the complete benchmark investigation.

  • R2RML: RDB to RDF Mapping Language, written as a TTL file that describes the mapping between the ontology and the data tables.

  • CSV/TSV/etc.: Various data formats used for the benchmark dataset.

Setup (Virtuoso)

Install the tables into your Virtuoso instance using one of the following options.

Via Command Line

cd into {repo}/ACME_INSURANCE, and run one of the following options:

Option 1: Attached and Physical Tables

isql {port} {username} {password} full_setup.sql

Option 2: Attached Tables Only

isql {port} {username} {password} csv_attach.sql
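If you prefer to script the setup, the two options above can be wrapped in a small helper. This is a sketch under stated assumptions: it assumes the `isql` binary is on your PATH and that `cwd` points at {repo}/ACME_INSURANCE. The function names `isql_command` and `run_setup` are hypothetical and not part of the repository.

```python
import subprocess


def isql_command(port, username, password, script):
    """Build the argument list for: isql {port} {username} {password} {script}."""
    return ["isql", str(port), username, password, script]


def run_setup(port, username, password, attached_only=False, cwd="."):
    """Run Option 2 (csv_attach.sql) if attached_only is set, else Option 1 (full_setup.sql).

    Assumes isql is on PATH and cwd is the ACME_INSURANCE directory.
    Raises subprocess.CalledProcessError if isql exits with a non-zero status.
    """
    script = "csv_attach.sql" if attached_only else "full_setup.sql"
    return subprocess.run(
        isql_command(port, username, password, script), cwd=cwd, check=True
    )
```

Passing the arguments as a list avoids shell quoting problems if the password contains special characters.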

Contributors

bryonjacob, dallemang, danielhmills, kidehen
