Verl is the most popular open-source reinforcement learning framework for LLMs, supporting PPO, GRPO, and other algorithms.
Also see search-tooling/ and this blog for tool-augmented “search” workflows (Search-R1 style), including Google Search–backed inference and a Wikipedia FAISS retrieval service used for inference and training.
SkyPilot makes RL training easy and cost-effective:
- Get GPUs instantly across clouds and Kubernetes
- 3x cheaper with managed spot instances
- Zero setup - handles distributed Ray clusters automatically
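Each launch below points at a SkyPilot task YAML in the repo. As a rough sketch only, a minimal SkyPilot task for RL training could look like the following (field names follow SkyPilot's task YAML format, but every value here is an illustrative assumption, not the contents of `llm/verl/verl-ppo.yaml`):

```yaml
# Illustrative sketch of a SkyPilot task for verl training.
# The actual recipes in llm/verl/ will differ.
resources:
  accelerators: A100:8   # any cloud or Kubernetes cluster with this GPU qualifies
  use_spot: true         # assumption: spot instances for cost savings

num_nodes: 1

secrets:
  WANDB_API_KEY: null    # injected at launch time via --secret WANDB_API_KEY

setup: |
  # assumption: install verl and its dependencies
  pip install verl

run: |
  # assumption: a PPO entrypoint; the real recipe invokes verl's trainer
  python -m verl.trainer.main_ppo
```

SkyPilot resolves the `resources` request across clouds and Kubernetes and, for multi-node tasks, brings up the Ray cluster automatically.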
Launch single-node agent training:

```bash
# PPO
sky launch -c verl-ppo llm/verl/verl-ppo.yaml --secret WANDB_API_KEY --num-nodes 1 -y

# PPO, passing a Hugging Face token for gated models
sky launch -c verl-ppo llm/verl/verl-ppo.yaml --secret WANDB_API_KEY --secret HF_TOKEN --num-nodes 1 -y

# GRPO
sky launch -c verl-grpo llm/verl/verl-grpo.yaml --secret WANDB_API_KEY --num-nodes 1 -y

# GRPO, passing a Hugging Face token for gated models
sky launch -c verl-grpo llm/verl/verl-grpo.yaml --secret WANDB_API_KEY --secret HF_TOKEN --num-nodes 1 -y
```

Launch a 2-node RLHF training job on the cheapest available GPUs:
```bash
sky launch -c verl llm/verl/multinode.yaml
```

Monitor training progress:
```bash
sky logs verl
```

Training logs show PPO optimization progress with reward metrics.
Access the Ray dashboard:

```bash
sky status --endpoint 8280 verl
```

The Ray dashboard provides real-time monitoring of distributed training across all nodes.

