automated-explanations's Introduction

Automated explanations

Explaining black box text modules in natural language with language models (arXiv 2023)

This repo contains code to reproduce the experiments in the SASC paper. SASC takes in a text module and produces a natural explanation for it that describes what it types of inputs elicit the largest response from the module (see Fig below).

SASC is similar to the nice concurrent paper by OpenAI, but simplifies explanations to describe the function rather than produce token-level activations. This makes it simpler/faster, and makes it more effective at describing semantic functions from limited data (e.g. fMRI voxels) but worse at finding patterns that depend on sequences / ordering.

For a simple scikit-learn interface to use SASC, use the imodelsX library. Install with pip install imodelsx then the below shows a quickstart example.

from imodelsx import explain_module_sasc
# a toy module that responds to the length of a string
mod = lambda str_list: np.array([len(s) for s in str_list])

# a toy dataset where the longest strings are animals
text_str_list = ["red", "blue", "x", "1", "2", "hippopotamus", "elephant", "rhinoceros"]
explanation_dict = explain_module_sasc(
    text_str_list,
    mod,
    ngrams=1,
)

Reference

See related fMRI experiments
Built from this template

@misc{singh2023explaining,
      title={Explaining black box text modules in natural language with language models}, 
      author={Chandan Singh and Aliyah R. Hsu and Richard Antonello and Shailee Jain and Alexander G. Huth and Bin Yu and Jianfeng Gao},
      year={2023},
      eprint={2305.09863},
      archivePrefix={arXiv},
      primaryClass={cs.AI}
}

automated-explanations's People

Contributors

Stargazers

Watchers

automated-explanations's Issues

name 'openai' is not defined

Hello, I am using the code:
import numpy as np
from imodelsx import explain_module_sasc

a toy module that responds to the length of a string

mod = lambda str_list: np.array([len(s) for s in str_list])

a toy dataset where the longest strings are animals

text_str_list = ["red", "blue", "x", "1", "2", "hippopotamus", "elephant", "rhinoceros"]
explanation_dict = explain_module_sasc(
text_str_list,
mod,
ngrams=1,
)
print(explanation_dict)
Errors will be encountered:
name 'openai' is not defined
How to solve this problem. thank you.

minor: removing the import CACHE_DIR

https://github.com/microsoft/automated-explanations/blob/6f3a8388e72a231005d910151496c45b0fa12c14/sasc/modules/dictionary_module.py#LL17C1-L17C34

tried to run

python3 sasc/modules/dictionary_module.py

and the above line gives an error.

    from sasc.config import CACHE_DIR
ModuleNotFoundError: No module named 'sasc.config'

can simply delete since CACHE_DIR is not used anywhere in that script.

This repo is missing important files

There are important files that Microsoft projects should all have that are not present in this repository. A pull request has been opened to add the missing file(s). When the pr is merged this issue will be closed automatically.

Microsoft teams can learn more about this effort and share feedback within the open source guidance available internally.

Merge this pull request

Recommend Projects

microsoft / automated-explanations Goto Github PK

automated-explanations's Introduction

Automated explanations

Reference

automated-explanations's People

Contributors

Stargazers

Watchers

Forkers

automated-explanations's Issues

name 'openai' is not defined

a toy module that responds to the length of a string

a toy dataset where the longest strings are animals

minor: removing the import CACHE_DIR

This repo is missing important files

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent