ENH: Add exact Cramér-von Mises two-sample p-value gufunc by fbourgey · Pull Request #118 · scipy/xsf

fbourgey · 2026-04-03T17:53:44Z

Supersedes #108

#108 is a direct a C++ port of SciPy's _pval_cvm_2samp_exact and contains a detailed review explaining the algorithm.

The current PR tries to implement @steppi's comment for computing the exact p-value of the Cramér-von Mises two-sample test and making it CUDA compatible.

…ue calculation for Cramér-von Mises two-sample test

…test

fbourgey · 2026-04-03T17:55:18Z

+        test_case{8862.0, 14, 8, 0.2679738562091503, 1e-10},
+        test_case{3491.0000000000005, 14, 5, 0.34657722738218094, 1e-10},
+        test_case{12559.0, 5, 26, 0.11812654860485784, 1e-10},
+        test_case{8901.0, 23, 5, 0.9907610907610908, 1e-10} //, test_case{119376.0, 20, 21, 0.5716351061359124, 1e-10}


When m=20 and n=21, the test was extremely slow, likely due to it taking too much memory.

fbourgey · 2026-04-03T17:57:13Z

+    // Device-safe greatest common divisor (gcd) for 64-bit integers
+    XSF_HOST_DEVICE inline int64_t gcd(int64_t a, int64_t b) {
+        a = (a < 0) ? -a : a;
+        b = (b < 0) ? -b : b;
+        while (b != 0) {
+            int64_t t = a % b;
+            a = b;
+            b = t;
+        }
+        return a;
+    }
+
+    // Device-safe least common multiple (lcm) for 64-bit integers
+    XSF_HOST_DEVICE inline int64_t lcm(int64_t a, int64_t b) {
+        if (a == 0 || b == 0) {
+            return 0;
+        }
+        int64_t g = gcd(a, b);
+        int64_t res = (a / g) * b;
+        return (res < 0) ? -res : res;
+    }


It does not seem like we can call std::lcm anymore. This implements gcd and lcm. We can move those somewhere else so other methods can use them in the future.

It turns out thatcuda::std::numeric has it, so we're good on this front actually and don't need to reimplement.

https://github.com/NVIDIA/cccl/blob/8ca13c846326556e6571400068d808d24a215552/libcudacxx/include/cuda/std/__numeric/gcd_lcm.h#L12

…er usage of gcd and lcm

…2samp_exact test

steppi · 2026-04-09T16:59:39Z

+    int64_t zeta =
+        static_cast<int64_t>(std::floor((lcm * lcm * (m + n) * (6.0 * s - mn * (4.0 * mn - 1))) / (6.0 * mn * mn)));
+
+    detail::cvm_freq_table_all(m, n, a, b, gs, next_gs);


This isn't the right idea. cramer_von_mises_exact should be taking in an already generated freq_table not generating a new one from scratch inside the kernel. Table generation has no s dependence, and we don't want to have to regenerate the table for each scalar value of s.

steppi · 2026-04-09T17:07:56Z

+using cuda::std::gcd;
+using cuda::std::lcm;
+


Don't worry about the CUDA side. I still have to figure out cupy/cupy#9839 before we can even try these in CuPy. I'm pretty sure just adding using cuda::std::gcd won't work in all cases, and we actually need wrappers for stdlib functions like the other ones in this file. I recall I had suggested using using like this when Irwin first set this up, but there was a reason he had done things the way he did.

fbourgey added 2 commits April 3, 2026 13:36

ENH: Add device-safe gcd and lcm functions, and implement exact p-val…

ae15833

…ue calculation for Cramér-von Mises two-sample test

TST: Add unit tests for exact p-value in Cramér-von Mises two-sample …

9cc28d2

…test

fbourgey commented Apr 3, 2026

View reviewed changes

fbourgey mentioned this pull request Apr 6, 2026

ENH: Port null-distribution special functions from scipy.stats to xsf #98

Open

18 tasks

fbourgey added 3 commits April 7, 2026 15:10

ENH: Include <cuda/std/numeric> for numeric functions and ensure prop…

7f7ca4a

…er usage of gcd and lcm

REF: gcd and lcm implementations; use cuda lcm instead

ab4596d

REF/TST: Replace custom lcm implementation with std::lcm in pval_cvm_…

0f6742f

…2samp_exact test

steppi reviewed Apr 9, 2026

View reviewed changes

Comment thread include/xsf/stats.h Outdated

steppi reviewed Apr 9, 2026

View reviewed changes

ENH: use std::max

e28dbcb

fbourgey added the enhancement New feature or request label Apr 15, 2026

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

ENH: Add exact Cramér-von Mises two-sample p-value gufunc#118

ENH: Add exact Cramér-von Mises two-sample p-value gufunc#118
fbourgey wants to merge 6 commits intoscipy:mainfrom
fbourgey:cramer_von_mises_gufunc

fbourgey commented Apr 3, 2026

Uh oh!

fbourgey Apr 3, 2026

Uh oh!

fbourgey Apr 3, 2026

Uh oh!

steppi Apr 7, 2026

Uh oh!

steppi Apr 9, 2026

Uh oh!

Uh oh!

steppi Apr 9, 2026 •

edited

Loading

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Uh oh!

Conversation

fbourgey commented Apr 3, 2026

Uh oh!

fbourgey Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

fbourgey Apr 3, 2026

Choose a reason for hiding this comment

Uh oh!

steppi Apr 7, 2026

Choose a reason for hiding this comment

Uh oh!

steppi Apr 9, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

steppi Apr 9, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

steppi Apr 9, 2026 •

edited

Loading