SCT - Scylla Cluster Tests

SCT tests are designed to test the Scylla database on physical/virtual servers under high read/write load. Currently, the tests are run using Python's built-in unittest framework. These tests automatically create:

Scylla clusters - Run Scylla database
Loader machines - used to run load generators like cassandra-stress
Monitoring server - uses official Scylla Monitoring repo to monitor Scylla clusters and Loaders

Quickstart

Option 1 - Config AWS using OKTA (preferred option)

https://www.notion.so/AWS-864b26157112426f8e74bab61001425d

Option 2 - Config AWS using AWS credentials

# install aws cli
sudo apt install awscli # Debian/Ubuntu
sudo dnf install awscli # Redhat/Fedora
# or follow amazon instructions to get it: https://docs.aws.amazon.com/cli/latest/userguide/getting-started-install.html

# Ask your AWS account admin to create a user and an access key for you, then configure AWS

> aws configure
AWS Access Key ID [****************7S5A]:
AWS Secret Access Key [****************5NcH]:
Default region name [us-east-1]:
Default output format [None]:

# if using OKTA, use any of the tools to create the AWS profile, then export it
# anywhere you are going to use the hydra command (replace DeveloperAccessRole with the name of your profile):
export AWS_PROFILE=DeveloperAccessRole

# Install hydra (docker holding all requirements for running SCT)
sudo ./install-hydra.sh

# if using podman, short-name enforcement must be disabled; without this, the monitoring stack won't run from within hydra
echo 'unqualified-search-registries = ["registry.fedoraproject.org", "registry.access.redhat.com", "docker.io", "quay.io"]
short-name-mode="permissive"
' > ~/.config/containers/registries.conf

Run a test

Disable Argus (if not needed for testing)

To disable Argus reporting during testing, set the following environment variable:

export SCT_ENABLE_ARGUS=false

Logs Location

All logs generated during test runs can be found in the ~/sct-results directory.

Example of running a test with Hydra, using the test-cases/PR-provision-test.yaml configuration file

Run test locally with AWS backend:

export SCT_SCYLLA_VERSION=5.2.1
# The test fails to report to Argus, so we need to disable it
export SCT_ENABLE_ARGUS=false
# The public-communication configuration is needed when running from a local development machine (by default, communication is via private addresses)
hydra run-test longevity_test.LongevityTest.test_custom_time --backend aws --config test-cases/PR-provision-test.yaml --config configurations/network_config/test_communication_public.yaml

# Run with IPv6 configuration
hydra run-test longevity_test.LongevityTest.test_custom_time --backend aws --config test-cases/PR-provision-test.yaml --config configurations/network_config/all_addresses_ipv6_public.yaml

Run test using SCT Runner with AWS backend:

hydra create-runner-instance --cloud-provider <cloud_name> -r <region_name> -z <az> -t <test-id> -d <run_duration>

export SCT_SCYLLA_VERSION=5.2.1
# To choose the correct network configuration, check the test's Jenkins pipeline.
# All predefined configurations are located under `configurations/network_config`
hydra --execute-on-runner <runner-ip|`cat sct_runner_ip`> "run-test longevity_test.LongevityTest.test_custom_time --backend aws --config test-cases/PR-provision-test.yaml"

Run test locally with GCE backend:

export SCT_SCYLLA_VERSION=5.2.1
export SCT_IP_SSH_CONNECTIONS="public"
hydra run-test longevity_test.LongevityTest.test_custom_time --backend gce --config test-cases/PR-provision-test.yaml

Run test locally with Azure backend:

export SCT_SCYLLA_VERSION=5.2.1
hydra run-test longevity_test.LongevityTest.test_custom_time --backend azure --config test-cases/PR-provision-test.yaml

Run test locally with docker backend:

If you wish to run the tests on a local machine, ensure you have Docker installed and running. If necessary, raise the asynchronous I/O limit (fs.aio-max-nr):

sudo nano /etc/sysctl.conf
# Edit or add the following line:
# fs.aio-max-nr=3000000
# Save and exit the file, then apply the changes:
sudo sysctl -p
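
Before editing /etc/sysctl.conf, it can help to check whether the current limit is already high enough. The following is a minimal sketch, not part of SCT: the function name `aio_limit_ok` is hypothetical, the 3000000 target is taken from the comment above, and it assumes a Linux host that exposes the value at /proc/sys/fs/aio-max-nr.

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of SCT: succeeds when the current
# fs.aio-max-nr value is at least the required one (default 3000000).
aio_limit_ok() {
    local current="$1" required="${2:-3000000}"
    [ "$current" -ge "$required" ]
}

# Read the live value on Linux; nothing is printed where the file is absent.
if [ -r /proc/sys/fs/aio-max-nr ]; then
    current_limit="$(cat /proc/sys/fs/aio-max-nr)"
    if aio_limit_ok "$current_limit"; then
        echo "fs.aio-max-nr=$current_limit is sufficient"
    else
        echo "fs.aio-max-nr=$current_limit is below 3000000; raise it in /etc/sysctl.conf"
    fi
fi
```

The check only reads the value; changing it still requires the sysctl steps shown earlier.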

The Docker backend is recommended for debugging simple nemesis logic, for example.

# **NOTE:** the user should be part of the sudo group and set up with passwordless access;
# see https://unix.stackexchange.com/a/468417 for an example of how to set this up

# example of running a specific Scylla version on the docker backend
export SCT_SCYLLA_VERSION=5.2.1
hydra run-test longevity_test.LongevityTest.test_custom_time --backend docker --config test-cases/PR-provision-test-docker.yaml

Run test with ScyllaDB Cloud (xcloud) backend:

export SCT_SCYLLA_VERSION=2025.3.0
export SCT_XCLOUD_PROVIDER=aws
export SCT_XCLOUD_ENV=lab

hydra run-test longevity_test.LongevityTest.test_custom_time --backend xcloud --config test-cases/PR-provision-test.yaml

For more details on the xcloud backend, see the xcloud backend documentation.

You can select a specific Scylla version with:

# Simple version (release)
export SCT_SCYLLA_VERSION=2025.1

# Branch version (nightly)
export SCT_SCYLLA_VERSION=master:latest

# Full version tag (specific build with commit hash)
export SCT_SCYLLA_VERSION=2024.2.5-0.20250221.cb9e2a54ae6d-1

For detailed information on full version tag support, see docs/full-version-tag-usage.md
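
To make the three forms above easier to tell apart, here is a small sketch that classifies a version string. It is not SCT's real parsing logic: the function name `classify_scylla_version` is hypothetical, and the patterns are inferred only from the three examples shown above, so other valid values may come out as "unknown".

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of SCT: guesses which of the three forms
# shown above a version string follows. The patterns are inferred only
# from those examples and may not match SCT's actual handling.
classify_scylla_version() {
    local version="$1"
    case "$version" in
        *:*)     echo "branch" ;;    # e.g. master:latest
        *-*.*-*) echo "full-tag" ;;  # e.g. 2024.2.5-0.20250221.cb9e2a54ae6d-1
        [0-9]*)  echo "release" ;;   # e.g. 2025.1 or 5.2.1
        *)       echo "unknown" ;;
    esac
}
```

For instance, `classify_scylla_version "$SCT_SCYLLA_VERSION"` can be called before starting a run to catch an obviously malformed value early.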

To debug a standard nemesis setup, you can use one of the default nemesis configurations. Use the YAML files referenced in https://github.com/scylladb/scylla-cluster-tests/blob/master/jenkins-pipelines/oss/nemesis/longevity-5gb-1h-AbortRepairMonkey-docker.jenkinsfile:

hydra run-test longevity_test.LongevityTest.test_custom_time --backend docker \
-c configurations/nemesis/longevity-5gb-1h-nemesis.yaml \
-c configurations/nemesis/AbortRepairMonkey.yaml \
-c configurations/nemesis/additional_configs/docker_backend.yaml
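
The command above stacks several configuration files by repeating the config flag. As a rough sketch of that pattern, the hypothetical function below (`build_run_test_cmd`, not part of SCT or hydra) only prints a command with one `--config` per file. It does not run hydra, and it assumes `-c` and `--config` are interchangeable, as their use in this document suggests.

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of SCT: prints a hydra run-test command
# with one --config flag per configuration file, in the order given.
build_run_test_cmd() {
    local backend="$1"
    shift
    local cmd="hydra run-test longevity_test.LongevityTest.test_custom_time --backend $backend"
    local cfg
    for cfg in "$@"; do
        cmd="$cmd --config $cfg"
    done
    echo "$cmd"
}

# Print the command for the nemesis example above without executing it.
build_run_test_cmd docker \
    configurations/nemesis/longevity-5gb-1h-nemesis.yaml \
    configurations/nemesis/AbortRepairMonkey.yaml \
    configurations/nemesis/additional_configs/docker_backend.yaml
```

Printing the command first makes it easy to review the order of the configuration files before running it.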

For debugging a specific nemesis setup you can edit the nemesis configuration to run. Change the relevant parameters in test-cases/PR-provision-test-docker.yaml like below:

test_duration: 60
stress_cmd: "cassandra-stress write cl=QUORUM duration=5m -schema 'replication(strategy=NetworkTopologyStrategy,replication_factor=3) ' -mode cql3 native -rate threads=10 -pop seq=1..100000 -log interval=5"
n_db_nodes: 4
nemesis_class_name: 'SisyphusMonkey'
nemesis_selector: 'DecommissionMonkey'  # Filter to run only DecommissionMonkey
nemesis_interval: 5

The nemesis_class_name specifies the runner (e.g. SisyphusMonkey), while nemesis_selector filters which nemesis classes to include, using boolean flag expressions or class names. For more details on nemesis architecture, flags, and configuration, see the Nemesis Developer Guide. For the nemeses supported on the docker backend, see the docker backend specifics.

You can also enter the containerized SCT environment using:
```bash
hydra bash
```

List resources being used by user:

# NOTE: Only use `whoami` if your local user is the same as your okta/email username
hydra list-resources --user `whoami`

Reuse already running cluster:

export SCT_REUSE_CLUSTER=$(cat ~/sct-results/latest/test_id)
hydra run-test longevity_test.LongevityTest.test_custom_time --backend aws --config test-cases/PR-provision-test.yaml --config configurations/network_config/test_communication_public.yaml

More details on reusing a cluster can be found in reuse_cluster
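
The export above fails quietly if no previous run exists, leaving SCT_REUSE_CLUSTER empty. A minimal sketch of a safer variant follows. The function name `latest_test_id` is hypothetical and not part of SCT; it relies only on the ~/sct-results/latest/test_id path used above.

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of SCT: prints the test id of the latest
# run, or fails with a message when the file is missing or empty.
latest_test_id() {
    local results_dir="${1:-$HOME/sct-results}"
    local id_file="$results_dir/latest/test_id"
    if [ ! -s "$id_file" ]; then
        echo "no test id found at $id_file" >&2
        return 1
    fi
    cat "$id_file"
}

# Export the variable only when an id was actually found.
if test_id="$(latest_test_id)"; then
    export SCT_REUSE_CLUSTER="$test_id"
fi
```

The same helper could feed the `--test-id` argument of `hydra clean-resources`.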

Clear resources:

hydra clean-resources --user `whoami`
# by default, it only cleans aws resources
# to clean other backends, specify manually
hydra clean-resources --user `whoami` -b gce

Clear resources being used by the last test run:

SCT_CLUSTER_BACKEND= hydra clean-resources --test-id `cat ~/sct-results/latest/test_id`

Install local development environment

Frequently Asked Questions (FAQ)

Contribution instructions

Supported backends

aws - the most commonly used backend; most longevity tests run on top of it
gce - most of the artifacts and rolling upgrade tests run on top of this backend
azure -
docker - should be used for local development
baremetal - can be used to run against an already set up cluster
xcloud - ScyllaDB Cloud managed clusters
k8s-eks -
k8s-gke -
k8s-local-kind - used to run k8s functional tests locally
k8s-local-kind-gce - used to run k8s functional tests locally on GCE
k8s-local-kind-aws - used to run k8s functional tests locally on AWS
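
A wrapper script can reject a mistyped backend name before any resources are provisioned. The sketch below is not part of SCT: the array and function names are hypothetical, and the backend names are copied from the list above, so they may go stale if that list changes.

```bash
#!/usr/bin/env bash
# Backend names copied from the list above.
SUPPORTED_BACKENDS=(aws gce azure docker baremetal xcloud k8s-eks k8s-gke
                    k8s-local-kind k8s-local-kind-gce k8s-local-kind-aws)

# Hypothetical helper, not part of SCT: succeeds when the given name is
# one of the backends in SUPPORTED_BACKENDS.
is_supported_backend() {
    local candidate="$1" backend
    for backend in "${SUPPORTED_BACKENDS[@]}"; do
        if [ "$candidate" = "$backend" ]; then
            return 0
        fi
    done
    return 1
}
```

For example, `is_supported_backend "$backend" || exit 1` could guard a script that later passes `--backend "$backend"` to hydra.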

Configuring test run configuration YAML

Take a look at the test-cases/PR-provision-test.yaml file. It contains a number of configurable test parameters, such as DB cluster instance types and AMI IDs. In this example, we're assuming that you have copied test-cases/PR-provision-test.yaml to test-cases/your_config.yaml.

All the test run configurations are stored in the test-cases directory.

Important: Some tests use custom hardcoded operations due to their nature, so those tests won't honor what is set in test-cases/your_config.yaml.
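
The copy step assumed above can be sketched as follows. The function name `copy_test_config` is hypothetical and not part of SCT. It copies the base file and refuses to overwrite an existing one, so local edits are not lost by accident.

```bash
#!/usr/bin/env bash
# Hypothetical helper, not part of SCT: copies a base test-case file to a
# new name and refuses to overwrite an existing file.
copy_test_config() {
    local src="$1" dst="$2"
    if [ ! -f "$src" ]; then
        echo "base config not found: $src" >&2
        return 1
    fi
    if [ -e "$dst" ]; then
        echo "refusing to overwrite: $dst" >&2
        return 1
    fi
    cp "$src" "$dst"
}

# Typical use from the repository root (commented out so the sketch has
# no side effects when sourced):
# copy_test_config test-cases/PR-provision-test.yaml test-cases/your_config.yaml
# hydra run-test longevity_test.LongevityTest.test_custom_time --backend aws --config test-cases/your_config.yaml
```

After copying, edit test-cases/your_config.yaml and pass it with `--config` as in the run examples earlier in this document.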

Configuration Documentation

SCT Configuration Guide - Comprehensive guide on how the configuration system works and how to add new options
Configuration Options Reference - Auto-generated list of all available configuration options

Development Plans

Active implementation plans for SCT are tracked in docs/plans/MASTER.md. For guidelines on creating new plans, see docs/plans/INSTRUCTIONS.md.

Types of Tests

Longevity Tests (TODO: write explanation for them)

Upgrade Tests (TODO: write explanation for them)

Performance Tests (TODO: write explanation for them)

Features Tests (TODO: write explanation for them)

Manager Tests (TODO: write explanation for them)
