Integer dtypes and units can cause a footgun #3735

jokasimr · 2025-07-30T07:40:33Z

jokasimr
Jul 30, 2025
Maintainer

Unit conversion preserves the dtype, which can produce surprising results:

 import scipp as sc

 sc.scalar(3, unit='deg').to(unit='rad')
# <scipp.Variable> ()      int64            [rad]  0
 sc.scalar(3.0, unit='deg').to(unit='rad')
# <scipp.Variable> ()    float64            [rad]  0.0523599

To work around it we can pass dtype='float64' as an extra argument in the unit conversion:

 sc.scalar(3, unit='deg').to(unit='rad', dtype='float64')

Our workflows perform many unit conversions like this so that they can handle user inputs in different units, and they are likely to fail silently, as above, if the user passes an integer instead of a float.
There are some cases when you might want to preserve dtype after a unit conversion, mainly when converting counts, for example converting counts/s to counts/min (although in the opposite direction you do want a change of dtype!).

The example above uses deg and rad because that's a common case, but the same problem can happen with other units.

If we don't want users having to be aware of this we could consider either:

1. Making all scipp variables dtype float64 by default unless the user explicitly specifies a dtype.
   - This is what I'm leaning towards. It won't solve the underlying problem but will probably reduce the number of times this causes bugs.
   - It's different from how NumPy behaves, but NumPy doesn't have units.
2. Making unit conversions automatically convert the dtype to float64 if the input is an integer type.
   - Not appropriate for some count-related unit conversions.

What are your thoughts about it?

SimonHeybrock · 2025-08-04T06:19:59Z

SimonHeybrock
Aug 4, 2025
Maintainer

Deviating from NumPy behavior is usually a bad idea, since it will be surprising for the majority of users and will require a lot of extra explanation and references in the docs. If you need the behavior you describe I suggest you create wrapper functions.

0 replies

jokasimr · 2025-09-24T10:47:53Z

jokasimr
Sep 24, 2025
Maintainer Author

Then what is your opinion on the second option, making unit conversions automatically convert the dtype to float?
I think that is almost always the correct behavior: the result of a unit conversion will only rarely be an integer, even if the input is.

0 replies

SimonHeybrock · 2025-09-25T03:54:35Z

SimonHeybrock
Sep 25, 2025
Maintainer

We could consider auto-converting in unit conversions when the factor is less than 1? And maybe add preserve_dtype: bool = False argument? Should have a wider discussion around this, and see if there are any pitfalls/downsides.

0 replies

jokasimr · 2025-09-25T07:17:05Z

jokasimr
Sep 25, 2025
Maintainer Author

> We could consider auto-converting in unit conversions when the factor is less than 1?

The problem is still there if the conversion factor is larger than 1. Consider converting foo to bar with a conversion factor of 1.5.
3 foo is currently converted to floor(1.5 * 3) = 4 bar, but of course the result should be 4.5 bar.

We could avoid converting the dtype when the scaling factor is an integer.
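To make the rule concrete, here is a toy model in plain Python. It is only a sketch of the idea, not scipp's implementation: the `convert` function is hypothetical, and the scale factor is represented as a `fractions.Fraction` so that "is an integer" has an exact meaning.

```python
from fractions import Fraction


def convert(value, scale: Fraction):
    """Toy model of the proposed rule; not scipp's implementation.

    Integer input stays an integer only when the scale factor is a whole
    number. Otherwise the result is promoted to float.
    """
    if isinstance(value, int) and scale.denominator == 1:
        return value * scale.numerator  # exact, stays an int
    return float(value * scale)  # promote instead of dropping the fraction


print(convert(3, Fraction(3, 2)))     # 3 foo with factor 1.5
print(convert(2, Fraction(60)))       # counts/s to counts/min, factor 60
print(convert(120, Fraction(1, 60)))  # counts/min to counts/s, factor 1/60
```

Under this rule the counts/s to counts/min case keeps its integer dtype, while the opposite direction and the factor-1.5 case are promoted to float.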

2 replies

SimonHeybrock Sep 25, 2025
Maintainer

👍 for integer scale factors.

YooSunYoung Sep 25, 2025
Maintainer

My vote:

- By default, convert integer variables to float64 when performing a unit conversion.
- Add a preserve_dtype=False argument so that users can explicitly keep the dtype without having to write to(unit='otherunit', dtype=foo.dtype).

jokasimr · 2025-09-29T09:40:00Z

jokasimr
Sep 29, 2025
Maintainer Author

A concrete example where this problem occurs, see here and here.

If simulation.distance or distance_resolution has an integer dtype and unit mm or cm, it will be rounded to the closest m after conversion if distance_unit is m.

There are multiple other examples involving angles in the essreflectometry package, for example here.

0 replies
