Git Product home page Git Product logo

fusion-jena / semantic-search-usability-analysis Goto Github PK

View Code? Open in Web Editor NEW
1.0 5.0 0.0 21.82 MB

Supplementary material for a usability evaluation of a semantic search for biological datasets

License: GNU Lesser General Public License v2.1

Jupyter Notebook 85.23% Python 8.17% Java 6.60%
user-study biodiversity dataset-search environmental-science evaluation life-sciences semantic-search user-interface jupyter-notebook

semantic-search-usability-analysis's Introduction

Supplementary material for a usability evaluation of a semantic search for biological datasets

DOI

We conducted a usability evaluation for a semantic dataset search with 20 biodiversity scholars in June and July 2022 in Germany. The research aim addressed two objectives:

  1. we explored two query inputs (A/B testing) and
  2. we studied two different explanations strategies in the search summary to examine whether users are confused or attracted by presented semantic information such as URIs and ontologies.

We developed a semantic search over biological datasets with two user interfaces (UI) with different characteristics. The search expands query terms on semantically related terms and allows a search over hierarchy relations. UI 1 (Biodiv 1) provides a category, form-based search input with no information on utilized ontologies. UI 2 (Biodiv 2) offers a classical one input field and in the search summary, it provides links to matched URIs and ontologies.

Following the TREC guidelines (https://www-nlpir.nist.gov/projects/t9i/spec.html), we setup eight user tasks and surveys with questionnaires to guide users through the evaluation.

  • the analysis folder contains a jupyter notebook to analyse a compiled csv
    • analysis/results16 contains the results for 16 users
    • analysis/results20 contains the results for 20 users
    • scripts to generate the complied csv and further instructions are available under analysis/preprocessing
  • data_corpus_preparation provides various small applications for the preparation and setup of the search index

Prerequites

Install Python (we developed and tested with version 3.9) and jupyter notebook (https://jupyter.org/). In a command line navigate to the root folder and run

jupyter notebook

Data

The survey templates, questionnaires and the original survey results are available at Zenodo: DOI

Search Tasks

  1. What data are in the repository for Foraminifera (forams, single-cell organisms) in the benthic zone (water layer in the ocean floor)?
  2. How variable is the oxygen concentration of sea water of the global ocean?
  3. What data exist for Poales (invasive grasses), e.g., Poaceae (grass family)?
  4. How high are sulfate reduction rates at cold seeps (cold vents, areas in the ocean floor where hydrocarbon-rich fluids are leaking)?
  5. What data are in the repository on ocean acidification or coral bleaching?
  6. What data exist in the repository for bacteria in the groundwater?
  7. What data exist for Lepidoptera (butterflies, moths) on oaks (Quercus)?
  8. What data in the repository contain samples from surface water?

License

The code in this project is distributed under the terms of the GNU LGPL v3.0.

Publication

Further information on this study can be obtained from our publication: Löffler, F., Shafiei, F., Witte, R., König-Ries, B. & Klan, F., (2023). Semantic Search for Biological Datasets: A Usability Study on Modes of Querying and Explaining Search Results. In: König-Ries, B., Scherzinger, S., Lehner, W. & Vossen, G. (Hrsg.), BTW 2023. Gesellschaft für Informatik e.V.. DOI: 10.18420/BTW2023-56

semantic-search-usability-analysis's People

Contributors

felicitasloeffler avatar

Stargazers

 avatar

Watchers

 avatar  avatar  avatar  avatar  avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.