AQUAS

Project documentation for AQUAS (Automatic Quality Assessment), an NLP approach for the semantic mapping of texts in the life sciences

Automatic Quality Assessment: NLP approach for the semantic mapping of texts in the life sciences


Project description

The growing incidence of deliberately spread misinformation poses a major challenge to our democratic society. Political interest groups increasingly spread it in order to shape public discourse, and recipients sometimes fail to recognize it as misinformation. Since disinformation can also be found in scientific information, this development affects scientists as well. In the medical applications of the life sciences (LeWi), it can have health-endangering effects.

The AQUAS project presented here will create the first German-language dataset on disinformation in the life sciences. On this basis, modern machine learning (ML) methods will be used to build a model that classifies, in a graded manner, the semantic proximity of unknown texts to three classes: scientific texts, popular science texts and disinforming texts. In addition, complementary information on the good scientific practice of the publications will be provided. By enriching and publishing this information (basic set and extended set of characteristics, respectively), AQUAS aims to support readers in making an informed assessment of the literature.
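As an illustration of what a graded classification could look like, the following minimal sketch turns raw per-class scores into proportions that sum to 1 using a softmax. It is not the AQUAS model: the function name `graded_proximity`, the class labels as strings and the example scores are assumptions made for this example only.

```python
import math

# Class names follow the project description above; everything else here
# (function name, scores) is a hypothetical illustration, not project code.
CLASSES = ("scientific", "popular science", "disinforming")


def graded_proximity(scores):
    """Convert one raw score per class into proportions that sum to 1."""
    if len(scores) != len(CLASSES):
        raise ValueError(f"expected {len(CLASSES)} scores, got {len(scores)}")
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return {name: e / total for name, e in zip(CLASSES, exps)}


if __name__ == "__main__":
    # Made-up scores for a single text.
    for name, share in graded_proximity([2.0, 1.0, 0.1]).items():
        print(f"{name}: {share:.2f}")
```

Reporting a proportion per class, rather than a single label, matches the stated goal of supporting the reader's own assessment instead of issuing a final verdict.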

AQUAS does not aim at a final reading recommendation or at censorship of the contents. Based on the developed enrichment methods, AQUAS will implement a service that can be accessed via an application programming interface (API). As a first central application, we will use this service through the ZB MED discovery system LIVIVO to make the described classification of literature available to the users of ZB MED. Initially, scientists, practitioners in the health care professions and students will benefit from the improved knowledge infrastructure at LIVIVO through AQUAS. The dataset, the model, the training workflow and the software for operating the service will be made openly available where possible, and thus usable for other subject areas as well.

AQUAS at ZB MED

Publications

Dataset

database schema

data set

  • 4 categories
  • retrieval modes: PDF scraped, HTML scraped, reused data set

Collecting enough data for the disinformation category is the bottleneck of the data set. If you would like to improve the data set, please let us know when you find an article that should be classified as disinformation by writing an email to [email protected]
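The data set properties above can be sketched as a small record type, together with a per-category count that makes an under-represented category visible. This is only an assumed shape: `Record`, `RetrievalMode` and `count_by_category` are hypothetical names, the actual fields are defined by the database schema, and the category labels are placeholders because the four categories are not listed here.

```python
from collections import Counter
from dataclasses import dataclass
from enum import Enum


class RetrievalMode(Enum):
    """The three retrieval modes named in the list above."""
    PDF_SCRAPED = "PDF scraped"
    HTML_SCRAPED = "HTML scraped"
    REUSED_DATASET = "reused data set"


@dataclass(frozen=True)
class Record:
    """Hypothetical record; the real schema may use different fields."""
    text: str
    category: str  # placeholder label for one of the four categories
    retrieval_mode: RetrievalMode


def count_by_category(records):
    """Count records per category to expose class imbalance."""
    return Counter(record.category for record in records)


if __name__ == "__main__":
    # Made-up example records.
    records = [
        Record("text a", "scientific", RetrievalMode.PDF_SCRAPED),
        Record("text b", "scientific", RetrievalMode.REUSED_DATASET),
        Record("text c", "disinforming", RetrievalMode.HTML_SCRAPED),
    ]
    for category, n in count_by_category(records).items():
        print(category, n)
```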

Code

See this repository: ./analysis

Press release

Deutsches Ärzteblatt: Künstliche Intelligenz soll Fake News bei medizinischen Informationen erkennen, 2022-12-27
B.I.T.-online: ZB MED sagt Falschinformationen den Kampf an, 2023-01-06
Fachbuchjournal: ZB MED sagt Falschinformationen den Kampf an, 2023
German Circle (privater Blog), 2023-01-14

Responsible

Eva Seidlmayer, Dr. phil., M.LIS
Data Sciences and Services, Research Fellow
ORCID: 0000-0001-7258-0532
Mastodon: @eta_kivilih | Bluesky: @etakivilih.bsky.social

ZB MED – Information Centre for Life Sciences
Gleueler Straße 60
50931 Cologne
Germany

www.zbmed.de
INFORMATION. KNOWLEDGE. LIFE.

Funding

DFG-LIS
FO 984/6-1

License

Copyright (c) 2024 Eva Seidlmayer, Lukas Galke, Konrad U. Förstner

Permission is hereby granted, free of charge, to any person obtaining a copy of this software and associated documentation files (the "Software"), to deal in the Software without restriction, including without limitation the rights to use, copy, modify, merge, publish, distribute, sublicense, and/or sell copies of the Software, and to permit persons to whom the Software is furnished to do so, subject to the following conditions:

The above copyright notice and this permission notice shall be included in all copies or substantial portions of the Software.

THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY, FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM, OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE SOFTWARE.
