Skip to content

cuda: Verify counter and throughput metrics are supported on each unique chip#597

Open
Treece-Burgess wants to merge 3 commits intoicl-utk-edu:masterfrom
Treece-Burgess:04-29-2026-cuda-filter-counter-and-throughput-metrics
Open

cuda: Verify counter and throughput metrics are supported on each unique chip#597
Treece-Burgess wants to merge 3 commits intoicl-utk-edu:masterfrom
Treece-Burgess:04-29-2026-cuda-filter-counter-and-throughput-metrics

Conversation

@Treece-Burgess
Copy link
Copy Markdown
Contributor

Pull Request Description

This PR filters the Perfworks counter and throughput metrics by verifying that they are supported on the unique chip and by doing so resolves Issue #542.

It is important to note that this change results in longer execution time.

Testing

Testing was done on Heimdall (1 * RTX5080) at Oregon with Cuda Toolkit 12.9 and Guyot (8 * A100) at ICL with Cuda Toolkit 12.9. The configure used is as follows:

./configure --prefix=$PWD/test-install --with-components="cuda" --with-debug=yes

Heimdall

  • PAPI build: ✅
  • PAPI utilities*: ✅
  • Cuda component tests: ✅
  • Issue 542: ✅
  • Wall clock time comparison of papi_component_avail
    • This PR: [2.464, 2.492, 2.488]
    • Master branch: [0.463, 0.398, 0.429]

Guyot

  • PAPI build: ✅
  • PAPI utilities*: ✅
  • Cuda component tests: ✅
  • Wall clock time comparison of papi_component_avail
    • This PR: [8.058, 8.011, 8.068]
    • Master branch: [5.004, 5.024, 4.998]

* - papi_component_avail, papi_native_avail, papi_command_line

Author Checklist

  • Description
    Why this PR exists. Reference all relevant information, including background, issues, test failures, etc
  • Commits
    Commits are self contained and only do one thing
    Commits have a header of the form: module: short description
    Commits have a body (whenever relevant) containing a detailed description of the addressed problem and its solution
  • Tests
    The PR needs to pass all the tests

@Treece-Burgess Treece-Burgess requested a review from dbarry9 April 29, 2026 16:55
@Treece-Burgess Treece-Burgess added component-cuda PRs and Issues related to the cuda component status-ready-for-review PR is ready to be reviewed type-maintenance Update code to keep it compatible, secure, modern. labels Apr 29, 2026
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

component-cuda PRs and Issues related to the cuda component status-ready-for-review PR is ready to be reviewed type-maintenance Update code to keep it compatible, secure, modern.

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant