airbnb / artificial-adversary Goto Github PK

View Code? Open in Web Editor NEW

391.0 18.0 57.0 119 KB

🗣️ Tool to generate adversarial text examples and test machine learning models against them

License: MIT License

Python 95.11% Jupyter Notebook 4.89%

machine-learning classification python python3 python2 text text-mining adversarial-examples spam spam-filtering

artificial-adversary's People

Contributors

Stargazers

Watchers

artificial-adversary's Issues

Stop using whitespace as word separator

Currently this assumes words are separated by spaces. If this is not the case, or if an earlier attack removes a space, several words may be treated as a single word (such as send.me.money).

One way around this is explicitly to keep track of word indices in the original string (assume here that they are separated by whitespace), and then modify these as attacks modify words/text.

Add other attack mechanisms

Right now we assume no feedback between adversary and classifier.

What if the adversary has access to the labels? What if the adversary has access to the raw probabilities? What is the adversary has access to some observation that can be linked back to the label or probability?

These are very broad, and while some have been addressed in machine learning literature, there are many possible takes on this as it specifically applies to text classification.

Potential ideas (this list will grow):

Use Lime to identify words that are important to classification results and apply targeted attacks
Simulate a sequence of back-and-forths between classifier and adversary

Add more basic attacks

There are many possible attacks that have not yet been implemented.

Some of these include:

Phrase-level attacks
- Invert part-of-speech order
- Change tense
Replacing words with homonyms, or symbols that are pronounced as a homonym (ate -> eight, for -> 4)
Surrounding characters (or other alternating patterns) (bank -> (b)(a)(n)(k))

ModuleNotFoundError: No module named 'Adversary.adversary'

Because /anaconda3/lib/python3.6/site-packages/Adversary/Adversary.py
mv Adversary.py adversary.py

airbnb / artificial-adversary Goto Github PK

artificial-adversary's People

Contributors

Stargazers

Watchers

Forkers

artificial-adversary's Issues

Stop using whitespace as word separator

Add other attack mechanisms

Add more basic attacks

test accuracy of adversarial text examples

Add Support for Emojis

No module named 'Adversary.adversary'

Recommend Projects

React

Vue.js

Typescript

TensorFlow

Django

Laravel

D3

Recommend Topics

javascript

web

server

Machine learning

Visualization

Game

Recommend Org

Facebook

Microsoft

Google

Alibaba

D3

Tencent