DBLP GraphDatabase

Loading

Generate CSV files from the DBLP XML dump using ThomHurks' dblp-to-csv tool (https://github.com/ThomHurks/dblp-to-csv) and store them in the "dblp_dump/" directory of this project:
python XMLToCSV.py dblp.xml dblp.dtd output.csv
Run a2_data_preparation_NugrohoSalmaan.py
Copy the output in the "out/" directory to the Neo4j import directory (by default /var/lib/neo4j/import/).
Run a2_loading_NugrohoSalmaan.py
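
The copy step above can also be scripted. The following is a minimal sketch, not part of the project: the function name copy_outputs is hypothetical, and it simply copies every regular file it finds in the source directory.

```python
import shutil
from pathlib import Path


def copy_outputs(src_dir, import_dir):
    """Copy every regular file in src_dir into import_dir.

    Returns the sorted list of copied file names.
    """
    src = Path(src_dir)
    dst = Path(import_dir)
    copied = []
    for path in sorted(src.iterdir()):
        if path.is_file():
            # copy2 also preserves timestamps and permission bits
            shutil.copy2(path, dst / path.name)
            copied.append(path.name)
    return copied
```

With the default paths from the steps above this would be called as copy_outputs("out", "/var/lib/neo4j/import"). Writing to that directory usually requires elevated permissions.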

Evolving

Run a3_prepare_evolving.py
Copy the generated file (out/authors_with_affiliation.csv) to the Neo4j import directory
Run a3_evolving_NugrohoSalmaan.py to start the evolving process

Running Queries

Run b_queries_NugrohoSalmaan.py with the -q parameter to specify the query number. For example, to run query 1 (the h-index query), run the following command:

python b_queries_NugrohoSalmaan.py -q 1
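
For reference, an author's h-index is the largest h such that h of their papers have at least h citations each. The sketch below illustrates that definition in plain Python. It is independent of how the query in the script computes the value, and the function name h_index is hypothetical.

```python
def h_index(citations):
    """Return the h-index for a list of per-paper citation counts."""
    ranked = sorted(citations, reverse=True)
    h = 0
    for rank, count in enumerate(ranked, start=1):
        if count >= rank:
            # the top `rank` papers all have at least `rank` citations
            h = rank
        else:
            break
    return h
```

For example, citation counts of 10, 8, 5, 4 and 3 give an h-index of 4: four papers have at least four citations, but there are not five papers with at least five.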

The fourth query requires an additional year parameter, specified with the -y option. For example:

python b_queries_NugrohoSalmaan.py -q 4 -y 1999
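
A command-line interface like the one above could be built with argparse. This is only a sketch of one possible approach, not the script's actual code: the function name parse_args and the destination names query and year are assumptions.

```python
import argparse


def parse_args(argv=None):
    """Parse -q (query number) and -y (year, needed only by query 4)."""
    parser = argparse.ArgumentParser(description="Run one of the DBLP queries")
    parser.add_argument("-q", dest="query", type=int, required=True,
                        help="query number")
    parser.add_argument("-y", dest="year", type=int,
                        help="year, required by query 4")
    args = parser.parse_args(argv)
    if args.query == 4 and args.year is None:
        # parser.error prints a usage message and exits with status 2
        parser.error("query 4 requires -y <year>")
    return args
```

Checking the -q/-y combination right after parsing reports a missing year before any database work starts.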

Running Graph Algorithm

Run c_graph_algorithm_NugrohoSalmaan.py with the -a parameter to specify the algorithm number. To run our first algorithm (ArticleRank), use -a 1; to run our second algorithm (Louvain community detection), use -a 2. For example:

python c_graph_algorithm_NugrohoSalmaan.py -a 1
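
To give an idea of what the first algorithm measures, here is a small pure-Python sketch of ArticleRank as it is commonly described: like PageRank, except that the score a node passes on is divided by its out-degree plus the average out-degree of the graph. It is an illustration on an in-memory edge list, not the code the script runs; the function name article_rank, the damping factor of 0.85 and the fixed iteration count are assumptions.

```python
def article_rank(edges, damping=0.85, iterations=20):
    """Compute ArticleRank scores for a list of (source, target) citation pairs.

    Returns a dict mapping each node to its score.
    """
    nodes = sorted({node for edge in edges for node in edge})
    if not nodes:
        return {}
    out_degree = {node: 0 for node in nodes}
    incoming = {node: [] for node in nodes}
    for source, target in edges:
        out_degree[source] += 1
        incoming[target].append(source)
    avg_out = sum(out_degree.values()) / len(nodes)

    scores = {node: 1.0 for node in nodes}  # arbitrary starting value
    for _ in range(iterations):
        # every node is updated from the previous iteration's scores
        scores = {
            node: (1 - damping) + damping * sum(
                scores[w] / (out_degree[w] + avg_out) for w in incoming[node]
            )
            for node in nodes
        }
    return scores
```

In a toy graph where papers "a" and "b" both cite "c", the call article_rank([("a", "c"), ("b", "c")]) gives "c" a higher score than either uncited paper.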

Running Recommender

Run d_recommender_NugrohoSalmaan.py with the following parameters:

-t homogeneity threshold (as a fraction) for a conference to be considered a community
-m minimum number of papers appearing in the top 100 to be considered a guru
-k keywords as a string representation of a list, e.g. '["database", "sql"]'

For example:

python d_recommender_NugrohoSalmaan.py -t 0.5 -m 1 -k '["database", "sql"]'

To get meaningful results, the following keyword list can be used as the -k argument:

'["database","index","indexing","querying","data","sql","olap","b+-trees","dbms","dbmss","views","databases","queries","postgres","picodbms","xml","tables","oodbms","relational"]'
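
The keyword strings shown above are valid JSON arrays, so one way to turn such an argument into a Python list is json.loads. This is a sketch under that assumption; the script itself may parse the value differently, and the function name parse_keywords is hypothetical.

```python
import json


def parse_keywords(raw):
    """Parse a string such as '["database", "sql"]' into a list of strings."""
    keywords = json.loads(raw)
    if not isinstance(keywords, list) or not all(isinstance(k, str) for k in keywords):
        raise ValueError("expected a JSON list of strings")
    return keywords
```

On the command line, wrapping the whole list in single quotes keeps the shell from stripping the inner double quotes.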
