
gu-datalab / misinformation-detection-demis


Resource for misinformation research on Twitter. Official resource of the paper "DeMis: Data-efficient Misinformation Detection using Reinforcement Learning", ECML-PKDD 2022

License: GNU General Public License v3.0

Python 97.05% Shell 2.95%
data-efficient deep-learning ecml-pkdd fake-news machine-learning misinformation pytorch reinforcement-learning twitter natural-language-processing nlp


DeMis: Data-efficient Misinformation Detection using RL

Resources for misinformation detection on Twitter. This repo is the official resource of the following paper.

  • DeMis: Data-efficient Misinformation Detection using Reinforcement Learning, ECML-PKDD 2022.
  • [Link][PDF][Slide]

[Figure: DeMis model overview]

📚 Data Sets

The data sets about COVID-19 misinformation on Twitter presented in our paper are available below.

  • COMYTH (weather & home-remedies) - [Datasets]
  • COVIDLies - [Paper]


🚀 Pre-trained Models

We release our misinformation detection models for Twitter, trained using the DeMis method. There are three models, each trained separately on one of three COVID-19 misinformation data sets. All trained models are available on my Google Drive 🤗, so you can download them, load them with PyTorch, and use them for prediction right away!

⚙️ Usage

You can use the code for any of the following purposes: (1) preprocessing tweets, (2) running the model for prediction, or (3) training the model. We tested with PyTorch v1.10.1 and transformers v4.18.0.
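Before running the scripts, it may help to compare your environment against the versions listed above. The following is a minimal sketch, not part of this repo: the helper name `check_versions` is hypothetical, and it assumes PyTorch is installed under its PyPI distribution name `torch`.

```python
from importlib.metadata import PackageNotFoundError, version

# Versions reported as tested in this README.
TESTED_VERSIONS = {"torch": "1.10.1", "transformers": "4.18.0"}


def check_versions(tested=None):
    """Return {package: (installed_version_or_None, tested_version)}.

    Hypothetical helper for illustration; it only reads package metadata
    and never imports the packages themselves.
    """
    tested = TESTED_VERSIONS if tested is None else tested
    report = {}
    for package, tested_version in tested.items():
        try:
            installed = version(package)
        except PackageNotFoundError:
            installed = None  # package is not installed in this environment
        report[package] = (installed, tested_version)
    return report


if __name__ == "__main__":
    for package, (installed, tested_version) in check_versions().items():
        if installed is None:
            status = "not installed"
        elif installed.split("+")[0] == tested_version:
            # Strip local build tags such as "+cu113" before comparing.
            status = "matches the tested version"
        else:
            status = "differs from the tested version"
        print(f"{package}: installed={installed}, tested={tested_version} ({status})")
```

A different version does not necessarily mean the code will fail; the check only tells you whether you are outside the tested configuration.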

Usage 1: Preprocessing tweets

Specify the input and output filepaths in the shell script run_tweet_preprocessing.sh and run the following command.

sh scripts/run_tweet_preprocessing.sh
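To give a rough idea of what tweet preprocessing typically involves, here is a minimal sketch. It is an assumption for illustration, not the logic of run_tweet_preprocessing.sh: the function names, the `HTTPURL` and `@USER` placeholder tokens, and the one-tweet-per-line file layout are all hypothetical.

```python
import html
import re

# Hypothetical normalization rules; the repo's script may differ.
URL_RE = re.compile(r"https?://\S+|www\.\S+")
MENTION_RE = re.compile(r"@\w+")
WHITESPACE_RE = re.compile(r"\s+")


def preprocess_tweet(text):
    """Normalize one tweet: unescape HTML entities, mask URLs and
    user mentions, and collapse runs of whitespace."""
    text = html.unescape(text)            # e.g. "&amp;" becomes "&"
    text = URL_RE.sub("HTTPURL", text)    # mask links
    text = MENTION_RE.sub("@USER", text)  # mask user handles
    return WHITESPACE_RE.sub(" ", text).strip()


def preprocess_file(input_path, output_path):
    """Read one tweet per line, write one cleaned tweet per line,
    and return the number of tweets processed."""
    count = 0
    with open(input_path, encoding="utf-8") as src, \
            open(output_path, "w", encoding="utf-8") as dst:
        for line in src:
            dst.write(preprocess_tweet(line) + "\n")
            count += 1
    return count


if __name__ == "__main__":
    raw = "@who says  masks work &amp; save lives https://t.co/abc123"
    print(preprocess_tweet(raw))
    # prints: @USER says masks work & save lives HTTPURL
```

Masking URLs and handles keeps the classifier from memorizing specific accounts or links; whether that matches the preprocessing used for the released models should be checked against the script itself.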

Usage 2: Run the detection model for classification

Specify the model path and the input and output filepaths in the shell script run_detector.sh. You can download the models from the section above and run them with the following command, or you can train a new detection model using DeMis as described in the next section.

sh scripts/run_detector.sh

Usage 3: Train detector using DeMis

Run the sample script to train on the COVID-weather data. See the README to learn more about how to train with your own data.

sh scripts/run_sample_entire_demis_pipeline.sh

✍️ Citation

If you find our paper and resources useful, please consider citing our work! 🙏

@inproceedings{kawintiranon2022demis,
  title     = {DeMis: Data-efficient Misinformation Detection using Reinforcement Learning},
  author    = {Kawintiranon, Kornraphop and Singh, Lisa},
  booktitle = {Proceedings of the European Conference on Machine Learning and Principles and Practice of Knowledge Discovery in Databases (ECML-PKDD)},
  year      = {2022},
  publisher = {Springer}
}

🛠 Troubleshooting

Create an issue here if you have any problems loading the models or data sets.
