Git Product home page Git Product logo

spherical-projection's Introduction

Cartography of COVID-19 Scientific Literature

Network visualizations are useuful instrument to have a distant view from a subject. This visualization was created to give an idea of the scientific community behind the COVID-19 subject. Who they are? When they began? With whom they work? Which are their specializationtions?

Method

Using an open-source database called COVID‑19 Open Research Dataset (CORD‑19) provided by the Allen Institute for AI, scientific article were collected by author and then analyzed with methods of Natural Language Processing. The result is a metric of lexical similarity between authors, which enabled to space out the scientifc community as a network; it means that two authors that make use of the same vocabulary are next to each other. (More information about the method on this article titled Mapping as a Contemporary Instrument for Orientation in Conferences.)

Use

Open the cartography at this URL. After a few seconds for stabilizing forces, a network composed by the major authors working on the COVID-19 subject takes form. The elevation map in yellow points out as peaks the most active areas of research. When two close authors have a common keyword, that is visible between them with a font size that reflects the edge's weigth. By moving over an author, the relative network of peers is highlighted as well as the keywords that define his/her profile. The left panel shows a few author's information with the most relevant keywords, the nationality of his/her community, and the publication years. On the top right a search function to find a specific author.

Acknowledgements

The project started during the Digital Humanities Conference 2019 with the spirit to create a cartography of the DH community. At that moment, it was priceless the contribution of Daniele Guido (University of Luxembourg), Philippe Rivière (Visiocarto), and Stephan Risi (MIT).

This version has been improved thanks to Eveline Wandl-Vogt from Austrian Academy of Sciences and Elian Carsenat NamSor. The data about nationality are generated through the NamSor algorithm; we understand that reasoning by nationality might introduce bias and errors in the interpretation of the visualization, but we still believe that their use can give further insigths on the literary production. Aggregating information is a way to protect the single author and show the data in a collective way. If someone might be unhappy with some error in the diaspora.csv, we are available to to modify it.

Install the code and run your data

Clone the repository, the install the libraries by typing npm install.

Run the local server by typing npm run start and open the visualization at this URL.

If you want to run your data, format the file ./data/authors.json properly, according to this structure:

	{
		"id": 1
		"name": "Dario Rodighiero",
		"variants": ["Dario Cesare Rodighiero", "Dario C. Rodighiero"],
		"docs": 3,
		"years": {
			"2019": 1,
			"2020": 2,
		},
		"peers": [
			2,
			3,
		],
		"text": "Mapping as a Contemporary Instrument for Orientation in Conferences",
	},

The authors.json_ is parsed using analysis.js, that runs text analysis to compute the lexical distance. To run the analysis type node analysis. This algorithm will produce two files, nodes.json and links.json that are the constituents of the network visualization.

You can now check your visualization at the localhost, or publish its static version using npm run build that create the build in the ./docs folder.

spherical-projection's People

Contributors

emidiob avatar rodighiero avatar

Watchers

 avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.