Why are tasks and subsequences randomly picked for a Chronos-2 training dataset? #469
Firstly, a batch is constructed by randomly selecting tasks (chronos-forecasting/src/chronos/chronos2/dataset.py, lines 646 to 649 in f951d9a). Then, the subsequences that go into the batch are also selected randomly (chronos-forecasting/src/chronos/chronos2/dataset.py, lines 557 to 558 in f951d9a). Could someone elaborate on the reasoning behind using randomization in these places?

Alternative approach

As an alternative to the randomization, I would hypothesize that selecting an equal number of samples from each task would result in more reliable outcomes. Similarly, the subsequences could be spread out across the available timespan of each task, so that the training steps cover as much of the training data as possible. This would also prevent accidental overfitting due to bad luck in selecting training datapoints. Has such an alternative been considered? Is it a deliberate choice to go with the current implementation that uses randomization, or is it left as an exercise for users of the chronos-forecasting package to implement themselves? Thanks in advance for any help on the matter!

PS: I am more than happy to contribute this alternative method for selecting training samples, if it seems useful :)
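To make the two sampling strategies concrete, here is a minimal sketch contrasting random window selection with the proposed evenly-spread alternative. The function names and signatures are hypothetical, not taken from the chronos-forecasting code:

```python
import numpy as np

def random_window_starts(series_len: int, window_len: int, n: int, rng) -> np.ndarray:
    """Randomized strategy: draw n window start indices uniformly at random.
    Different training runs (or epochs) may cover different parts of the series."""
    return rng.integers(0, series_len - window_len + 1, size=n)

def spread_window_starts(series_len: int, window_len: int, n: int) -> np.ndarray:
    """Proposed alternative: spread n window starts evenly across the series,
    so the windows deterministically cover the full available timespan."""
    return np.linspace(0, series_len - window_len, num=n, dtype=int)

rng = np.random.default_rng(0)
print(sorted(random_window_starts(1000, 100, 5, rng)))  # varies with the seed
print(spread_window_starts(1000, 100, 5).tolist())      # [0, 225, 450, 675, 900]
```

The even spread guarantees full coverage per pass, while the random draw makes each batch an unbiased sample of the data and avoids the model seeing windows at fixed phase offsets every epoch.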
Replies: 2 comments
@GuidoHeijden The time series slicing logic is based on our prior experience working with forecasting models and developing open source libraries like GluonTS and AutoGluon. That said, there are always trade-offs and this may not be the best possible setup for all situations. However, in our large scale benchmarking it works well across tasks.
Does your alternative work on your own dataset?