Git Product home page Git Product logo

citation_bias_ext's Introduction

Citation Bias Chrome Extension logo

The goal of this project is to create an extension for Google Chrome that will display probabilistic gender information about the first and last authors of papers on Google Scholar and PubMed's search pages.

Motivated from work by J. D. Dworkin, K. A. Linn, E. G. Teich, P. Zurn, R. T. Shinohara, and D. S. Bassett (2020). bioRxiv. doi: https://doi.org/10.1101/2020.01.03.894378

Instructions

This project is available on the Chrome Extension store here. To make and test changes not included in the published version, you must add it in "developer mode".

  1. Download this GitHub repository (click the green "clone or download button" followed by "Download as zip"). Unzip the folder.
  2. Open the Extension Management page by navigating to chrome://extensions.
  3. Enable Developer Mode by clicking the toggle switch in the top right corner next to Developer mode.
  4. Click the LOAD UNPACKED button and select the "extension" directory under "citation_bias_ext" (inside the unzipped folder that you just downloaded called "citation_bias_ext").
  5. You should now see a new, colorful Google Scholar icon on your chrome browser. Click this icon to be sure that the extension is enabled.

Caveats

  1. Typically, gender is thought of as a self-identity that individual expressed behaviorally. Since our extension uses only first names, we have limited ability to actually capture this definition of gender. It is more accurate to think of the probabilities displayed for gender as perceived gender, rather than an estimate of gender identity. Additionally, this extension only shows probabilities for male or female genders, and incorrectly assumes that all people fall into that binary.
  2. Gender is determined using genderize.io api, which supports names across many countries taken from social media data. This database was chosen because it can support a large number of queries and is not limited to the US, but is still subject to error. Comparative reports (1. Karimi, F., Wagner, C., Lemmerich, F., Jadidi, M. & Strohmaier, M. Inferring Gender from Names on the Web: A Comparative Evaluation of Gender Detection Methods. in Proceedings of the 25th International Conference Companion on World Wide Web 53โ€“54 (International World Wide Web Conferences Steering Committee, 2016). doi:10.1145/2872518.2889385) listed it's overall accuracy at 82%. While this database was chosen because it contained data from multiple countries, the countries that it performs worst on, in order, are China, South Korea, and Brazil.
  3. Gender bias is not the only type of bias present in citation practices, and the current extension does not account for any kind of intersectionality. We are currently working on adding probabilistic race information to the extension to help mitigate this.

Contributors (Alphabetically)

  • Ann Sizemore Blevins
  • David Lydon-Staley
  • Jennifer Stiso
  • Katharine Crooks
  • Matthew Schaff
  • Ursula Tooley

citation_bias_ext's People

Contributors

jastiso avatar utooley avatar

Stargazers

 avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar  avatar

Watchers

 avatar  avatar

citation_bias_ext's Issues

Potential co-author issue

Looking at several articles where a co-first author is mentioned, and several of them come up with authors that are not related to the article. However, it does not seem to be consistent

Examples of co-first authorship:
citation_bias
image

Examples of co-last / corresponding authors:
image

Google Scholar User Profile

In Google Scholar, when a search for a person is conducted and the person in question has their own Google Scholar account, the "User Profiles" section considers the first and last author to be Mike Kuniavsky (gender: male 99%). Error is found when there is only one option (see search for Jennifer Stiso below) or if there are several authors with a similar name that have a profile (see search for Matthew Sutherland).

Scholar_Mike3

Scholar_Mike2

Better title matching

Background: The extension currently works by grabbing the title of each paper on a search paper, searching for that title in Crossref, and trying to match the current title with the results. This process can be improved by including other information, such as the date, or DOI, in the crossref search.

Action Item: We would like to add code in the content.js file that pulls the date or DOI from each results on PubMed and Google scholar, and includes it in the Crossref query

Add user verified gender

Ideally, people would be able to verify if their gender was correct. This would increase the accuracy, but also allow people to identify as a non-binary gender.

We would like to add a feature where people can "verify" their gender by pulling their institution contact from google scholar's api

Add local SQL database

Background: The extension currently queries gender.api for each name to get the probabilistic gender. This could be a failure point if the API gets overwhelmed. A faster, and more robust solution would be to have a local database of common names, and their probabilistic genders that we can pull from first.

Action Items:

  • Convert .csv file with names into SQL database (I already have the .csv file)
  • Try to access the SQL database from content.js script before querying gender.api
  • Move SQL database to Heroku app

Package is invalid when installing from chrome store

I'm using Chromium 85.0.4183.102 (Official Build) on Manjaro Linux 20.1.

I get a pop-up when clicking "add to chrome:"

Package is invalid. Details: 'Could not load icon 'images/logo.PNG' specified in 'browser_action'.'.

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    ๐Ÿ–– Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. ๐Ÿ“Š๐Ÿ“ˆ๐ŸŽ‰

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google โค๏ธ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.