Comments (4)
Hi @ysy970923 , thanks for submitting this question! You're right in that the feedback mechanism in this example is just the response from the target LLM.
More generally, we can envision cases where the score is very much required. Let's say our target LLM generates images (which we can't pass back into the LLM directly), then we need a score (or textual feedback) to be passed back to the Red Teaming LLM. As you can see, there are different kinds of setups, but we didn't want to overcomplicate the diagram either, so it just mentions feedback from the scoring engine.
I hope that helps! Please let us know if you have further thoughts on the topic otherwise we'll close the issue within the next 7 days.
from pyrit.
Thanks so much for the response 👍
Are there plans for adding examples for image generation?
If not can I contribute some examples for text to image models?
from pyrit.
Yes, that's definitely relevant.
I'm not sure to what extent you already have these or are planning to work on that, but you can certainly open a PR if you already have it and we can comment there. If you're just starting out it's probably faster and simpler to write up a short outline and share (in a new issue since the original question was answered and is unrelated) so that you can get quick feedback before spending too much time on it. What do you think?
from pyrit.
I did some work on this, so I made a pull request.
Feel free to give comments :)
Thank you
from pyrit.
Related Issues (20)
- Update WMDP Dataset HOT 1
- FEAT create scorer based on objective (without YAML)
- ImportError: cannot import name 'SelfAskTrueFalseScorer' from 'pyrit.score' HOT 1
- `AddTextImageConverter` not handling `font_size` properly HOT 1
- FEAT Leetspeak converter should have a deterministic option
- Using AzureMLChatTarget raises 404 and 405 errors HOT 3
- Gandalf example not working HOT 4
- bug: Azure SQL Tests Fail in MacOS M1 HOT 2
- Add adaptive jailbreaking
- Add fetch function for SecLists AI LLM Bias Testing datasets HOT 2
- Add fetch function for datasets from HarmBench
- Getting "404 Resource Not Found" HOT 3
- FEAT Metadata for datasets should allow fields as string OR list of strings
- gandalf example error (Failed to add request response to memory) HOT 2
- Got a new Jailbreak Prompt HOT 1
- FEAT add XSTest dataset
- FEAT add DecodingTrust dataset
- FEAT: Add Azure OpenAI GPT-4o target for pyrit
- Unable to get pyrit working after the change HOT 11
- [BUG] Black-pre-commit hook language-version binding causing installation issues.
Recommend Projects
-
React
A declarative, efficient, and flexible JavaScript library for building user interfaces.
-
Vue.js
🖖 Vue.js is a progressive, incrementally-adoptable JavaScript framework for building UI on the web.
-
Typescript
TypeScript is a superset of JavaScript that compiles to clean JavaScript output.
-
TensorFlow
An Open Source Machine Learning Framework for Everyone
-
Django
The Web framework for perfectionists with deadlines.
-
Laravel
A PHP framework for web artisans
-
D3
Bring data to life with SVG, Canvas and HTML. 📊📈🎉
-
Recommend Topics
-
javascript
JavaScript (JS) is a lightweight interpreted programming language with first-class functions.
-
web
Some thing interesting about web. New door for the world.
-
server
A server is a program made to process requests and deliver data to clients.
-
Machine learning
Machine learning is a way of modeling and interpreting data that allows a piece of software to respond intelligently.
-
Visualization
Some thing interesting about visualization, use data art
-
Game
Some thing interesting about game, make everyone happy.
Recommend Org
-
Facebook
We are working to build community through open source technology. NB: members must have two-factor auth.
-
Microsoft
Open source projects and samples from Microsoft.
-
Google
Google ❤️ Open Source for everyone.
-
Alibaba
Alibaba Open Source for everyone
-
D3
Data-Driven Documents codes.
-
Tencent
China tencent open source team.
from pyrit.