Git Product home page Git Product logo

dblp_neo4j's Introduction

DBLP GraphDatabase

Loading

  1. Generate CSV file of the DBLP XML dump file using ThomHurks tools (https://github.com/ThomHurks/dblp-to-csv) and store it in the "dblp_dump/" directory in this project.
    python XMLToCSV.py dblp.xml dblp.dtd output.csv
  2. Run a2_data_preparation_NugrohoSalmaan.py
  3. Copy the output in the "out/" directory to neo4j import directory, by default the directory path is /var/lib/neo4j/import/ Run a2_loading_NugrohoSalmaan.py
  4. Run a2_loading_NugrohoSalmaan.py

Evolving

  1. Run a3_prepare_evolving.py
  2. Copy the the generated file (out/authors_with_affiliation.csv) to neo4j import directory
  3. Run a3_evolving_NugrohoSalmaan.py to start the evolving process

Running Queries

Run b_queries_NugrohoSalmaan.py using -q parameter specifying query number. For example to run query 1 (the h-index query), run the following command:

python b_queries_NugrohoSalmaan.py -q 1

The fourth query require additional year parameter that can be specified using -y option. For example:

python b_queries_NugrohoSalmaan.py -q 4 -y 1999

Running Graph Algorithm

Run c_graph_algorithm_NugrohoSalmaan using -a parameter specifying algorithm number. To run our first algorithm (article rank) use -a 1, and to run our second algorithm (louvain community detection) use -a 2. For example:

python c_graph_algorithm_NugrohoSalmaan.py -a 1

Running Recommender

Run d_recommender_NugrohoSalmaan.py with the following parameters:

  • -t (in a fraction) homogenity threshold for a conference to be considered as community
  • -m minimum paper appears in top 100 to be considered as guru
  • -k string representation of keywords, e.g:'["database", "sql"]'

For example:

python d_recommender_NugrohoSalmaan.py -t 0.5 -m 1 -k '["database", "sql"]' 

In order to get meaningful result, we can use the following keyword list as keywords argument:

'["database","index","indexing","querying","data","sql","olap","b+-trees","dbms","dbmss","views","databases","queries","postgres","picodbms","xml","tables","oodbms","relational"]'

dblp_neo4j's People

Contributors

dpanugroho avatar

Stargazers

Paritosh Katre avatar DCC avatar

Watchers

James Cloos avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.