    Repositories list

    • P1-VL

      Public
      P1-VL: Bridging Visual Perception and Scientific Reasoning in Physics Olympiads
      11300 Updated Feb 11, 2026
    • FROM $f(x)$ AND $g(x)$ TO $f(g(x))$: LLMs Learn New Skills in RL by Composing Old Ones
      Python
      56020 Updated Jan 26, 2026
    • [ICLR 2026] SimpleVLA-RL: Scaling VLA Training via Reinforcement Learning
      Python
      831.4k451 Updated Jan 6, 2026
    • P1

      Public
      P1: Mastering Physics Olympiads with Reinforcement Learning
      47430 Updated Dec 29, 2025
    • TTRL

      Public
      [NeurIPS 2025] TTRL: Test-Time Reinforcement Learning
      Python
      72991160 Updated Sep 26, 2025
    • The Entropy Mechanism of Reinforcement Learning for Large Language Model Reasoning.
      Python
      1542020 Updated Jul 11, 2025
    • PRIME

      Public
      Scalable RL solution for advanced reasoning of language models
      Python
      1031.8k81 Updated Mar 18, 2025
    • Repo of paper "Free Process Rewards without Process Labels"
      Python
      11168120 Updated Mar 14, 2025