achrafElFaiq/ai-red-teaming-framework

RedTeaming Framework

RedTeaming Framework is a Python-based red teaming toolkit for evaluating chatbot and LLM targets through YAML campaigns.

It currently supports:

  • PyRIT for dataset, crescendo, and red-teaming style attacks
  • Garak for probe-based attacks
  • HTTP targets configurable through campaign YAML files
  • JSON reports plus a Streamlit dashboard for analysis

1. What the framework does

The framework lets you define and run a campaign like this:

  1. define the target to attack
  2. define a list of attack YAML files
  3. run the campaign with main.py
  4. execute PyRIT and/or Garak attacks against the target
  5. normalize outputs into a common report format
  6. save reports to reports/
  7. inspect the results in the dashboard

In practice, the flow is:

campaign YAML
→ target config
→ attack YAML files
→ PyRIT / Garak execution
→ normalized AttackResult JSON files
→ dashboard view

2. Project structure

.
├── main.py                  # CLI entrypoint
├── src/                     # source code
│   ├── settings.py          # runtime settings loaded from .env
│   ├── core/                # campaign loading, orchestration, reporting
│   └── frameworks/          # PyRIT and Garak integrations
├── examples/                # example campaigns, attacks, templates
├── config/                  # runtime config files (e.g. generated Garak config)
├── reports/                 # generated JSON reports
├── tests/                   # test suite
├── .env.example             # environment variable template
└── README.md

3. Prerequisites

You need:

  • Python 3
  • a virtual environment
  • a target HTTP endpoint to test
  • the Python packages used by the repo
  • PyRIT if you want to run PyRIT campaigns
  • Garak if you want to run Garak campaigns
  • Streamlit if you want to use the dashboard

The repository does not currently ship a pinned dependency manifest (such as a requirements.txt) at the root, so install the packages your environment needs manually.

At minimum, the codebase uses packages such as:

  • python-dotenv
  • pydantic
  • requests
  • PyYAML
  • streamlit
  • pytest
  • pyrit
  • garak
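Assuming the standard PyPI names for these packages (not verified against the repo), a one-shot install might look like:

```shell
pip install python-dotenv pydantic requests pyyaml streamlit pytest pyrit garak
```

Pin versions once you know which releases the codebase was developed against.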

4. Setup

Create and activate a virtual environment

python3 -m venv .venv
source .venv/bin/activate

Copy the environment template

cp .env.example .env

Then edit .env for your setup.

The main variables are:

  • PyRIT attacker LLM
    • PYRIT_ATTACKER_ENDPOINT
    • PYRIT_ATTACKER_MODEL
    • PYRIT_ATTACKER_API_KEY
  • PyRIT scorer LLM
    • PYRIT_SCORER_ENDPOINT
    • PYRIT_SCORER_MODEL
    • PYRIT_SCORER_API_KEY
  • PyRIT runtime
    • PYRIT_DB_PATH
  • Always required
    • DEFAULT_TARGET_URL
    • JSON_REPORTS_DIR
  • Garak
    • GARAK_REPORTS_DIR
    • GARAK_CONFIG_PATH

See .env.example for comments and defaults.
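A filled-in .env might look like the following. Every value here is a placeholder invented for illustration; .env.example remains the authoritative reference for names and defaults.

```
# PyRIT attacker LLM (placeholder values)
PYRIT_ATTACKER_ENDPOINT=https://api.openai.com/v1
PYRIT_ATTACKER_MODEL=gpt-4o-mini
PYRIT_ATTACKER_API_KEY=sk-replace-me

# PyRIT scorer LLM
PYRIT_SCORER_ENDPOINT=https://api.openai.com/v1
PYRIT_SCORER_MODEL=gpt-4o-mini
PYRIT_SCORER_API_KEY=sk-replace-me

# PyRIT runtime
PYRIT_DB_PATH=.pyrit/pyrit.db

# Always required
DEFAULT_TARGET_URL=http://localhost:8000/api/chat
JSON_REPORTS_DIR=reports

# Garak
GARAK_REPORTS_DIR=reports/garak
GARAK_CONFIG_PATH=config/garak_config.yaml
```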


5. Quick start

Run an example campaign:

python main.py "examples/campaigns/R1-Prompt Leakage/prompt_leakeage.yaml"

Useful variants:

python main.py "examples/campaigns/R1-Prompt Leakage/prompt_leakeage.yaml" --log-level DEBUG
python main.py "examples/campaigns/R1-Prompt Leakage/prompt_leakeage.yaml" --skip-checks
python main.py "examples/campaigns/R1-Prompt Leakage/prompt_leakeage.yaml" --no-dashboard

By default, after a campaign run:

  • reports are written to reports/
  • the Streamlit dashboard is launched automatically unless --no-dashboard is used

6. Campaign format

A campaign defines:

  • metadata
  • one target
  • an ordered list of attacks

Use examples/templates/campaign.yaml as the reference template.

Campaign skeleton

campaign:
  name: "My campaign"
  description: "What this campaign is testing"

target:
  name: "CustomerBot"
  model: "gpt-4.1-nano"
  architecture_type: "System Prompt + Context Injected"
  chat_url: "http://localhost:8000/api/chat"
  reset_memory_url: "http://localhost:8000/api/reset"
  input_field: "prompt"
  output_field: "response"

attacks:
  - examples/attacks/some_attack.yaml
  - examples/attacks/another_attack.yaml

Target fields

  • name: human-readable target name
  • model: model name shown in logs and dashboard
  • architecture_type: target architecture category shown in logs and dashboard
  • chat_url: endpoint that receives the input message
  • reset_memory_url: optional endpoint used to reset target memory/context
  • input_field: JSON field used to send the prompt to the target
  • output_field: JSON field read from the target response
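To make input_field / output_field concrete, here is a minimal sketch of how they map onto the HTTP exchange. The field names below are the example values from the campaign skeleton; the helper functions are hypothetical, not part of the framework.

```python
# Sketch of the input_field / output_field mapping for an HTTP chat target.
import json

def build_payload(input_field: str, message: str) -> dict:
    """Wrap the prompt in the JSON field the target expects."""
    return {input_field: message}

def extract_output(output_field: str, response_body: str) -> str:
    """Read the model's reply out of the target's JSON response body."""
    return json.loads(response_body)[output_field]
```

With the skeleton's values, `build_payload("prompt", "Hello")` produces the request body `{"prompt": "Hello"}`, and `extract_output("response", body)` pulls the reply out of the target's JSON.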

Attack paths

Each item in attacks: is a path to an attack YAML file.

Paths are expected relative to the project root.


7. Supported attack modes

PyRIT dataset

  • executes a list of prompts
  • treats prompts as independent objectives
  • can reset target memory between prompts when a reset endpoint (reset_memory_url) is configured

PyRIT crescendo

  • multi-turn attack
  • keeps conversational context across turns
  • should not be reset between turns

PyRIT red teaming

  • multi-turn objective-driven adversarial interaction
  • uses attacker and scorer LLMs

Garak

  • probe-based scanning
  • uses the configured REST generator against the target HTTP API

8. Outputs

The framework stores normalized JSON results in reports/.

Those reports are used by the dashboard and include metadata such as:

  • framework
  • attack name
  • campaign name
  • target URL
  • target model
  • target architecture type
  • timestamp

Depending on the framework, a result contains either:

  • prompts for Garak-style probe results
  • conversation for PyRIT conversation-style results
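Putting the two shapes side by side, a normalized report might look roughly like this. Field names are inferred from the lists above and the values are placeholders; treat it as a sketch, not the exact schema.

```python
# Rough sketch of normalized reports; the real schema lives in the
# framework's reporting layer and may differ in detail.
pyrit_report = {
    "framework": "pyrit",
    "attack": "crescendo_jailbreak",   # hypothetical attack name
    "campaign": "My campaign",
    "target_url": "http://localhost:8000/api/chat",
    "target_model": "gpt-4.1-nano",
    "target_architecture_type": "System Prompt + Context Injected",
    "timestamp": "2025-01-01T00:00:00Z",
    # PyRIT conversation-style results carry a multi-turn transcript:
    "conversation": [
        {"role": "attacker", "content": "..."},
        {"role": "target", "content": "..."},
    ],
}

# Garak-style probe results carry prompts instead of a conversation:
garak_report = {**pyrit_report, "framework": "garak", "attack": "probe_scan"}
garak_report.pop("conversation")
garak_report["prompts"] = [{"prompt": "...", "response": "...", "passed": True}]
```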

9. Dashboard

The dashboard is implemented in:

  • src/core/results/report_viewer.py

It provides three main views:

  • Overview
  • Campaigns
  • Attacks

The dashboard shows campaign-level information such as:

  • campaign name
  • breach rate
  • target model
  • target architecture type

10. Testing

Run the test suite with:

pytest

Or run a subset, for example:

pytest tests/frameworks/test_pyrit_runner.py
pytest tests/application/test_campaign_loader.py

11. Current assumptions and limitations

  • targets are currently modeled as HTTP chat endpoints
  • the framework expects text input/output fields in JSON
  • campaign attack paths are root-relative
  • the dashboard reads normalized JSON reports, not live campaign YAML files
  • PyRIT and Garak have different execution models, but both are normalized into the same reporting layer

12. Where to extend the project

  • add new example campaigns under examples/campaigns/
  • add new attacks under examples/attacks/
  • add framework logic under src/frameworks/
  • add orchestration or reporting logic under src/core/
  • add regression tests under tests/

13. Recommended reading order for a new contributor

If you are new to the repo, start with:

  1. README.md
  2. main.py
  3. examples/templates/campaign.yaml
  4. src/core/application/campaign_loader.py
  5. src/frameworks/pyrit/pyrit_runner.py
  6. src/frameworks/garak/garak_runner.py

That gives a good end-to-end view of how campaigns are loaded, executed, normalized, and reported.

About

Modular LLM red teaming framework for testing AI vulnerabilities through one-shot attacks via Garak or multi-turn attacks via PyRIT
