OOJBench

A benchmark data generator for the order-oriented join, an implementation of the hinting join.

How to use

Requirement
- Python
  - Validated only with Python 2.7.
- Python packages: json, numpy (1.13.0), psycopg2 (2.7.1)
- PostgreSQL is assumed as the database.

Steps
1. Set parameters in conf.json
2. python dataGen.py
  - output: R.csv, R.sql, S.csv, S.sql, E.csv, and E.sql
3. createdb dbname
  - dbname must match the value of dbname in conf.json.
  - If already created, skip this step.
4. python bulkLoad2Postgres.py
  - Drops tables R, S, E, and G if they exist.
  - Creates tables R, S, and E.
  - Loads R.csv, S.csv, and E.csv into the corresponding tables.
  - Creates table G as a sample of E.

Component

conf.json: configuration file for the generator
- Parameters: k, l, s, t, primary, vocab, samples, dbname
  - k: The number of attributes on R.
  - l: The number of attributes on S.
  - s: The number of attributes for order-oriented join on R.
  - t: The number of attributes for order-oriented join on S.
  - primary: The number of tuples per join key value.
  - vocab: The size of the vocabulary for order-oriented join keys.
  - samples: The number of join key values for sample.
  - dbname: The name of database.
- An example is shown in the "Sample conf.json" section below.
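
As a minimal sketch of how these parameters might be read, the snippet below parses a configuration with the same shape as the sample and flattens each entry of the form {"value": ..., "description": ...} into a plain value. The helper name load_params is hypothetical and not part of the repository; dataGen.py may read the file differently.

```python
import json


def load_params(text):
    """Flatten a conf.json-style document into a plain dict of values."""
    raw = json.loads(text)
    params = {}
    for name, entry in raw.items():
        # Most entries are objects with "value" and "description";
        # dbname is a bare string, so it is kept as-is.
        params[name] = entry["value"] if isinstance(entry, dict) else entry
    return params


# A shortened configuration in the same shape as the sample conf.json.
example = """
{
  "k": {"value": 5, "description": "The number of attributes on R."},
  "dbname": "ooj"
}
"""
params = load_params(example)
print("k=%d dbname=%s" % (params["k"], params["dbname"]))
```

To read the real file, the text could be obtained with open("conf.json").read() and passed to the same helper.
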
dataGen.py: data generator
- Generates
  - Three CSV files (R.csv, S.csv, and E.csv) corresponding to relations R, S, and E.
    - All attributes are integers.
  - Three SQL files (R.sql, S.sql, and E.sql) containing the CREATE TABLE statements.
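
To illustrate what such a SQL file could contain, here is a sketch that builds a CREATE TABLE statement with all-integer columns. The column names (a0, a1, ...) and the helper create_table_sql are assumptions made for illustration; the names actually emitted by dataGen.py are not documented here.

```python
def create_table_sql(table, num_attrs):
    """Return a CREATE TABLE statement with num_attrs integer columns."""
    # Column names a0..a{n-1} are hypothetical placeholders.
    columns = ", ".join("a%d integer" % i for i in range(num_attrs))
    return "CREATE TABLE %s (%s);" % (table, columns)


print(create_table_sql("R", 3))
```
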
bulkLoad2Postgres.py: data loader to PostgreSQL database
- Executes the SQL files (R.sql, S.sql, and E.sql).
- Executes the COPY command to load the CSV files into the corresponding tables.
- Generates sample G of the joined results.
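
The loading step can be sketched with psycopg2's copy_expert, which streams a client-side file into a COPY ... FROM STDIN statement. This is an illustration under stated assumptions, not the code of bulkLoad2Postgres.py: the helper names copy_sql and load_csv are hypothetical, and the CSV files are assumed to have no header row.

```python
def copy_sql(table):
    """Build the COPY statement for loading CSV data from a client-side stream."""
    # Assumption: the CSV files carry no header row.
    return "COPY %s FROM STDIN WITH (FORMAT csv)" % table


def load_csv(dbname, table, csv_path):
    """Load csv_path into an existing table of database dbname."""
    # Imported lazily so copy_sql can be used without the database driver.
    import psycopg2

    conn = psycopg2.connect(dbname=dbname)
    try:
        cur = conn.cursor()
        with open(csv_path) as f:
            cur.copy_expert(copy_sql(table), f)
        conn.commit()
    finally:
        conn.close()


print(copy_sql("R"))
```

Calling load_csv("ooj", "R", "R.csv") would then load the file, provided the database and the table already exist.
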

Specifications of tables

Base join keys are the last columns of R and S.
- Herein, equi-join is assumed.
Order-oriented join keys are columns k-s-1 (resp. l-t-1) to k-1 (resp. l-1) of R (resp. S).
- k, l: the numbers of attributes of R and S
- s, t: the numbers of attributes for order-oriented join on R and S
Other attributes are filled with random values.
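
One way to read the column ranges above is as 0-based, half-open ranges, so that R has exactly s order-oriented key columns immediately before the base join key in the last column. This reading is an assumption, as is the helper name column_layout; the sketch below applies it to the sample values k=5, s=3 and l=10, t=1.

```python
def column_layout(num_attrs, num_ooj):
    """Split 0-based column indices into (random, order-oriented keys, base key)."""
    base_key = num_attrs - 1  # the last column holds the base join key
    first_ooj = num_attrs - num_ooj - 1
    ooj_keys = list(range(first_ooj, base_key))  # half-open: base_key excluded
    others = list(range(0, first_ooj))  # remaining columns hold random values
    return others, ooj_keys, base_key


print(column_layout(5, 3))   # R with k=5, s=3
print(column_layout(10, 1))  # S with l=10, t=1
```
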

Sample conf.json

{
"k": {
	"value": 5,
	"description": "The number of attributes on R."
},
"l": {
	"value": 10,
	"description": "The number of attributes on S."
},
"s": {
	"value": 3,
	"description": "The number of attributes for order-oriented join on R."
},
"t": {
	"value": 1,
	"description": "The number of attributes for order-oriented join on S."
},
"primary": {
	"value": 100,
	"description": "The number of values for base join key."
},
"vocab": {
	"value": 2,
	"description": "The size of vocabulary for order-oriented join keys."
},
"samples": {
	"value": 10,
	"description": "The number of key values for sample."
},
"dbname": "ooj"
}

