Hi. I want to evaluate a released checkpoint. Which Python script should I run, and which parameters does it need?