Fix reward scaler when run on varied episode lengths by opz · Pull Request #455 · takuseno/d3rlpy

opz · 2025-05-25T01:01:52Z

Fixes #454

When calling fit with a reward scaler on a dataset with varied episode lengths, the following error would be thrown in the fit_with_trajectory_slicer method:

ValueError: setting an array element with a sequence. The requested array has an
inhomogeneous shape after 1 dimensions.

This commit fixes the issue by flattening the rewards before calculating the mean and std.

When calling `fit` with a reward scaler on a dataset with varied episode lengths, the following error would be thrown in the `fit_with_trajectory_slicer` method: ``` ValueError: setting an array element with a sequence. The requested array has an inhomogeneous shape after 1 dimensions. ``` This commit fixes the issue by flattening the rewards before calculating the mean and std.

takuseno

LGTM. Thank you for your contribution!

takuseno approved these changes May 25, 2025

View reviewed changes

takuseno merged commit 4f0956b into takuseno:master May 25, 2025
4 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Fix reward scaler when run on varied episode lengths#455

Fix reward scaler when run on varied episode lengths#455
takuseno merged 1 commit intotakuseno:masterfrom
opz:fix/reward-scaler-with-inhomogeneous-episodes

opz commented May 25, 2025

Uh oh!

takuseno left a comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

opz commented May 25, 2025

Uh oh!

takuseno left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants