test: nightly fuzz test job for bstm by aljo242 · Pull Request #26136 · cosmos/cosmos-sdk

aljo242 · 2026-03-19T20:58:58Z

Run a heavy fuzz test nightly and post the results to Slack.

greptile-apps · 2026-03-19T21:03:55Z

Greptile Summary

This PR adds a nightly GitHub Actions workflow that runs a 30-minute BlockSTM fuzz test and posts results to Slack, introduces a FuzzBlockSTMAppHashDeterminism differential fuzz test comparing sequential vs BlockSTM execution for app-hash determinism, and adds a safeFeePayer helper to prevent panicking FeePayer() implementations from crashing the pre-estimation path.

A full BaseApp harness (newFuzzAppHarness) is unnecessarily created inside the f.Fuzz(...) closure on every iteration solely to extract txConfig, which is constant across all iterations — this significantly reduces fuzzing throughput and should be lifted outside the loop.
safeFeePayer silently swallows all panics from FeePayer() without any logging; real bugs from production tx implementations will be invisible unless the crash is reproduced outside the estimation path.
The nightly workflow's failure notification fires on both failure and cancelled states (previously noted), and newly discovered corpus entries are not committed back to the repo (previously noted).
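The first point above, building loop-invariant setup once instead of on every fuzz iteration, can be sketched in isolation. This is a minimal illustration under stated assumptions, not the PR's code: `txConfig`, `newHarness`, `decode`, and `fuzzLoop` are hypothetical stand-ins (for the SDK's tx config, `newFuzzAppHarness`, the fuzz body, and `f.Fuzz` respectively), and the sketch assumes the hoisted value is never mutated by an iteration.

```go
package main

import "fmt"

// txConfig is a hypothetical stand-in for the constant value the fuzz body
// needs. It is not the SDK's client.TxConfig.
type txConfig struct{ signMode string }

// harnessBuilds counts how often the (assumed expensive) harness is built.
var harnessBuilds int

// newHarness stands in for newFuzzAppHarness: costly to build, but it is
// assumed to yield the same txConfig every time.
func newHarness() txConfig {
	harnessBuilds++
	return txConfig{signMode: "direct"}
}

// decode is a placeholder for whatever the fuzz body does with cfg and data.
func decode(cfg txConfig, data []byte) int {
	return len(cfg.signMode) + len(data)
}

// fuzzLoop stands in for f.Fuzz: it calls body once per generated input.
func fuzzLoop(inputs [][]byte, body func(data []byte)) {
	for _, in := range inputs {
		body(in)
	}
}

// perIteration builds the harness inside the body, once per input.
func perIteration(inputs [][]byte) int {
	harnessBuilds = 0
	fuzzLoop(inputs, func(data []byte) {
		cfg := newHarness()
		_ = decode(cfg, data)
	})
	return harnessBuilds
}

// hoisted builds the harness once; the closure captures cfg.
func hoisted(inputs [][]byte) int {
	harnessBuilds = 0
	cfg := newHarness()
	fuzzLoop(inputs, func(data []byte) {
		_ = decode(cfg, data)
	})
	return harnessBuilds
}

func main() {
	inputs := [][]byte{{0x01}, {0x02}, {0x03}}
	fmt.Println("per-iteration builds:", perIteration(inputs))
	fmt.Println("hoisted builds:", hoisted(inputs))
}
```

Hoisting is only valid when the captured value is read-only across iterations. If the harness carries per-run state, it has to stay inside the closure.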

Confidence Score: 3/5

Safe to merge with minor functional concerns — the wasteful per-iteration app setup will reduce fuzz throughput but won't cause incorrect test results.
The core differential testing logic (FinalizeBlock → Commit loop, app-hash and tx-result comparison, post-commit balance checks) is structurally sound. The safeFeePayer recovery pattern is correct and well-tested. The main concern is a performance bug in the fuzz harness that creates a full app per iteration just for txConfig, which will noticeably limit how many cases the 30-minute nightly run can cover. Several workflow-level issues from prior review threads (cancelled-job false alerts, no corpus persistence) remain open.
Files that warrant attention: tests/integration/blockstm/blockstm_fuzz_test.go (wasteful per-iteration app init) and .github/workflows/fuzz-nightly.yml (open workflow concerns from prior threads).
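The differential idea behind the fuzz test (run the same block through two executors and require identical resulting hashes) can be illustrated with a toy model. The sketch below is not BlockSTM and does not use BaseApp: `credit`, `stateHash`, `runSequential`, and `runParallel` are hypothetical names, the "parallel" executor is just goroutines behind a mutex, and the transactions are additive credits, so ordering cannot change the final state.

```go
package main

import (
	"crypto/sha256"
	"encoding/binary"
	"fmt"
	"sort"
	"sync"
)

// credit is a toy transaction that adds amount to an account's balance.
type credit struct {
	account string
	amount  uint64
}

// stateHash hashes balances in sorted key order so that map iteration
// order cannot influence the digest.
func stateHash(balances map[string]uint64) [32]byte {
	keys := make([]string, 0, len(balances))
	for k := range balances {
		keys = append(keys, k)
	}
	sort.Strings(keys)

	h := sha256.New()
	var buf [8]byte
	for _, k := range keys {
		h.Write([]byte(k))
		h.Write([]byte{0}) // separator between key and value
		binary.BigEndian.PutUint64(buf[:], balances[k])
		h.Write(buf[:])
	}
	var out [32]byte
	copy(out[:], h.Sum(nil))
	return out
}

// runSequential applies the transactions one after another.
func runSequential(txs []credit) [32]byte {
	balances := map[string]uint64{}
	for _, tx := range txs {
		balances[tx.account] += tx.amount
	}
	return stateHash(balances)
}

// runParallel applies the same transactions from concurrent goroutines,
// in whatever order the scheduler picks.
func runParallel(txs []credit) [32]byte {
	balances := map[string]uint64{}
	var mu sync.Mutex
	var wg sync.WaitGroup
	for _, tx := range txs {
		wg.Add(1)
		go func(tx credit) {
			defer wg.Done()
			mu.Lock()
			defer mu.Unlock()
			balances[tx.account] += tx.amount
		}(tx)
	}
	wg.Wait()
	return stateHash(balances)
}

func main() {
	block := []credit{{"alice", 5}, {"bob", 7}, {"alice", 3}}
	seq, par := runSequential(block), runParallel(block)
	fmt.Println("hashes match:", seq == par)
}
```

In a fuzz setting, the block would be derived from the fuzzer's input bytes and any hash mismatch would fail the test. The real test described above additionally compares tx results and post-commit balances.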

Important Files Changed

Filename	Overview
.github/workflows/fuzz-nightly.yml	New nightly workflow scheduling a 30-minute BlockSTM fuzz run with Slack success/failure notifications. Several pre-existing issues flagged in prior threads (cancelled-job notification, corpus persistence, duplicate concurrency group) remain open.
internal/blockstm/txnrunner.go	Added `safeFeePayer` helper that catches panics from `feeTx.FeePayer()` so that misbehaving tx implementations can't crash the pre-estimation phase. The approach is sound but silently discards panics without any logging.
internal/blockstm/txnrunner_test.go	Adds `mockPanicFeeTx` and a corresponding decoder, plus a test case asserting that a panicking `FeePayer()` implementation is handled gracefully. Coverage looks correct.
tests/integration/blockstm/blockstm_fuzz_test.go	New differential fuzz test comparing sequential vs BlockSTM execution for app-hash determinism. Core structure (newFuzzAppHarness → fundAccounts → runDifferentialBlockStream → FinalizeBlock+Commit loop) is correct, but a full app harness is unnecessarily spun up on every fuzz iteration purely to obtain `txConfig`, which can significantly reduce throughput.

Reviews (3): Last reviewed commit: "Merge branch 'main' into test/fuzz-bst"


aljo242 · 2026-03-19T21:15:32Z

@greptile re-review, those p2 issues are not real

codecov · 2026-03-19T21:18:13Z

Codecov Report

❌ Patch coverage is 60.00000% with 4 lines in your changes missing coverage. Please review.
✅ Project coverage is 61.47%. Comparing base (6c7bddd) to head (86c27ea).

Files with missing lines	Patch %	Lines
internal/blockstm/txnrunner.go	60.00%	4 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main   #26136      +/-   ##
==========================================
+ Coverage   61.44%   61.47%   +0.03%     
==========================================
  Files         967      951      -16     
  Lines       63380    62455     -925     
==========================================
- Hits        38943    38395     -548     
+ Misses      24437    24060     -377

Files with missing lines	Coverage Δ
internal/blockstm/txnrunner.go	`94.28% <60.00%> (+13.03%)`	⬆️

... and 22 files with indirect coverage changes



aljo242 · 2026-03-24T15:23:46Z

@greptile can you re-review and give me a new score?

greptile-apps · 2026-03-24T15:28:05Z

internal/blockstm/txnrunner.go

+func safeFeePayer(feeTx sdk.FeeTx) (addr sdk.AccAddress, ok bool) {
+	defer func() {
+		if recover() != nil {
+			addr = nil
+			ok = false
+		}
+	}()
+
+	return sdk.AccAddress(feeTx.FeePayer()), true
+}


recover() in production estimation path masks real bugs

safeFeePayer uses a blanket recover() that silently swallows any panic originating from feeTx.FeePayer(), including ones that would indicate genuine programming errors (e.g., a nil-receiver dereference or an index out-of-bounds in a well-formed tx implementation). The pre-estimation path skipping a tx is functionally correct, but swallowing an unexpected panic here means the issue will never surface in logs or metrics — the tx is just processed without an estimate, and the fuzz test will pass because both seq and stm behave identically.

Consider logging the recovered panic value (even at debug level) so that unexpected panics from real tx types are visible:

func safeFeePayer(feeTx sdk.FeeTx) (addr sdk.AccAddress, ok bool) {
	defer func() {
		if r := recover(); r != nil {
			// FeePayer() panicked; skip pre-estimation for this tx.
			// Log at debug level if a logger is available.
			addr = nil
			ok = false
		}
	}()
	return sdk.AccAddress(feeTx.FeePayer()), true
}

Emit a log line when FeePayer panics so pre-estimation skips remain visible during fuzz and production debugging. Made-with: Cursor

aljo242 added 3 commits March 19, 2026 16:44

gs

1e1697b

wf

2d13e32

update

e6ea111

github-project-automation bot added this to Interchain Public Works Mar 19, 2026

aljo242 requested a review from Eric-Warehime March 19, 2026 20:59

update

914a450

greptile-apps bot reviewed Mar 19, 2026


fix

3262c56

greptile-apps bot reviewed Mar 19, 2026


aljo242 added 3 commits March 19, 2026 17:23

dl

c351d7b

prove clanker wrong

16b382f

Merge branch 'main' into test/fuzz-bst

b32917f

aljo242 added the backport/v0.54.x Backport PR's to release/v0.54.x branch label Mar 20, 2026

aljo242 added 3 commits March 23, 2026 10:31

Merge branch 'main' into test/fuzz-bst

48fc932

Merge branch 'main' into test/fuzz-bst

9e40745

Merge branch 'main' into test/fuzz-bst

7b7b1ad

greptile-apps bot reviewed Mar 24, 2026


aljo242 added 3 commits March 24, 2026 11:49

Merge branch 'main' into test/fuzz-bst

aae0374

fix(blockstm): log recovered FeePayer panics in pre-estimation

3b7e872

Emit a log line when FeePayer panics so pre-estimation skips remain visible during fuzz and production debugging. Made-with: Cursor

Merge branch 'main' into test/fuzz-bst

86c27ea

aljo242 wants to merge 14 commits into main from test/fuzz-bst


1 participant
