Git Product home page Git Product logo

climatechange-expgen's Introduction

Automatic Explanation Generation For Climate Science Claims

This repository contains the source code for our ALTA 2022 paper: Automatic Explanation Generation For Climate Science Claims.

Abstract

Climate change is an existential threat to humanity, the proliferation of unsubstantiated claims relating to climate science is manipulating public perception, motivating the need for fact-checking in climate science. In this work, we draw on recent work that uses retrieval-augmented generation for veracity prediction and explanation generation, in framing explanation generation as a query-focused multi-document summarization task. We adapt PRIMERA to the climate science domain by adding additional global attention on claims. Through automatic evaluation and qualitative analysis, we demonstrate that our method is effective at generating explanations.

Requirements

Recommend proceed with creating a conda environment:

conda create --name <env_name> --file requirements.txt

Data

There are two key data components in our task:

  • An external knowledge source from which we retrieve documents, here we use Pubmed data and IPCC reports data.
  • Paired claim–explanation data, to serve as the input (claim) and output (explanation). We use Climate Fever.
  • After retrieval process via Binary Passage Retriever, the final data is used directly for model training and evaluation. Please feel free to download data from here.

Data format

Each instance in JSON file contains the following columns:

  • id - the unique identifier for the instance
  • ctxs - retrieved passages relating to the claim
    • title: the title of the passage
    • text: the content of the passage
  • question - the claim that needs to be fact-checked
  • answers - a list of evidences to support/refute the claim, they are extracted from original Climate Fever
  • target_lab - claim label, 0 - SUPPORTS, 1 - REFUTES, 2 - NOT_ENOUGH_INFO
  • task - not used, specific format for T5

Data example

{'id': 1411,
 'ctxs': [
   {
      'title': 'ipcc gw ch6',
      'text': 'set too low or too high (Loock et al., 2013)1192. Commitment strategies where people make a pledge to engage in climate actions can encourage mitigation behaviour...'},
      {'title': '...', 'text': '...'} // Other evidence passages
      ],
      'question': 'The Independent Climate Change Email Review found the CRU scientists were unhelpful and unsympathetic to information requesters and at times broke FoI laws.',
      'answers': ['The Science and Technology Select Committee report blamed the university for mishandling Freedom of Information requests and said it had "found ways to support the culture at CRU of resisting disclosure of information to climate change sceptics".'],
      'target_lab': 0,
      'task': 'evidence'
}

Citation

If you find this work useful, please cite the following paper: Automatic Explanation Generation For Climate Science Claims

@inproceedings{xing-etal-2022-automatic,
    title = "Automatic Explanation Generation For Climate Science Claims",
    author = "Xing, Rui  and
      Bhatia, Shraey  and
      Baldwin, Timothy  and
      Lau, Jey Han",
    booktitle = "Proceedings of the The 20th Annual Workshop of the Australasian Language Technology Association",
    month = dec,
    year = "2022",
    address = "Adelaide, Australia",
    publisher = "Australasian Language Technology Association",
    url = "https://aclanthology.org/2022.alta-1.16",
    pages = "122--129",
    abstract = "Climate change is an existential threat to humanity, the proliferation of unsubstantiated claims relating to climate science is manipulating public perception, motivating the need for fact-checking in climate science. In this work, we draw on recent work that uses retrieval-augmented generation for veracity prediction and explanation generation, in framing explanation generation as a query-focused multi-document summarization task. We adapt PRIMERA to the climate science domain by adding additional global attention on claims. Through automatic evaluation and qualitative analysis, we demonstrate that our method is effective at generating explanations.",
}

climatechange-expgen's People

Contributors

ruixing76 avatar

Watchers

 avatar

Recommend Projects

  • React photo React

    A declarative, efficient, and flexible JavaScript library for building user interfaces.

  • Vue.js photo Vue.js

    🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.

  • Typescript photo Typescript

    TypeScript is a superset of JavaScript that compiles to clean JavaScript output.

  • TensorFlow photo TensorFlow

    An Open Source Machine Learning Framework for Everyone

  • Django photo Django

    The Web framework for perfectionists with deadlines.

  • D3 photo D3

    Bring data to life with SVG, Canvas and HTML. 📊📈🎉

Recommend Topics

  • javascript

    JavaScript (JS) is a lightweight interpreted programming language with first-class functions.

  • web

    Some thing interesting about web. New door for the world.

  • server

    A server is a program made to process requests and deliver data to clients.

  • Machine learning

    Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.

  • Game

    Some thing interesting about game, make everyone happy.

Recommend Org

  • Facebook photo Facebook

    We are working to build community through open source technology. NB: members must have two-factor auth.

  • Microsoft photo Microsoft

    Open source projects and samples from Microsoft.

  • Google photo Google

    Google ❤️ Open Source for everyone.

  • D3 photo D3

    Data-Driven Documents codes.