training and evaluating details

Hello! I noticed that the training and test datasets are identical when running bash examples/ttrl/Qwen2.5-Math/aime.sh. To ensure proper evaluation, the training data should be distinct from the test data. So I cannot understand.
<img width="2556" height="1476" alt="Image" src="https://github.com/user-attachments/assets/dffaa4e5-dcbb-4e48-803f-947cc63df693" />