Skip to content
Change the repository type filter

All

    Repositories list

    • jrystal

      Public
      A JAX-based Differentiable Density Functional Theory Framework for Materials
      Python
      14552Updated Feb 15, 2026Feb 15, 2026
    • odc

      Public
      On demand communication
      Python
      23215Updated Feb 12, 2026Feb 12, 2026
    • Stable-RL

      Public
      Rethinking the Trust Region in LLM Reinforcement Learning
      Python
      33505Updated Feb 5, 2026Feb 5, 2026
    • oat

      Public
      🌾 OAT: A research-friendly framework for LLM online alignment, including reinforcement learning, preference learning, etc.
      Python
      5962861Updated Jan 29, 2026Jan 29, 2026
    • LifelongSafetyAlignment

      Public
      Python
      01110Updated Jan 13, 2026Jan 13, 2026
    • feedback-conditional-policy

      Public
      Code for "Language Models Can Learn from Verbal Feedback Without Scalar Rewards"
      Python
      25900Updated Jan 5, 2026Jan 5, 2026
    • InfNeRF

      Public
      InfNeRF: Towards Infinite Scale NeRF Rendering with O(log n) Space Complexity
      Python
      11210Updated Jan 3, 2026Jan 3, 2026
    • SkyLadder

      Public
      The official repository for SkyLadder: Better and Faster Pretraining via Context Window Scheduling
      Python
      6024210Updated Dec 29, 2025Dec 29, 2025
    • d4ft

      Public
      A JAX library for Density Functional Theory.
      Python
      554160Updated Nov 25, 2025Nov 25, 2025
    • Defeating the Training-Inference Mismatch via FP16
      Python
      1518240Updated Nov 14, 2025Nov 14, 2025
    • Precision-RL-verl

      Public
      Defeating the Training-Inference Mismatch via FP16
      Python
      3.2k500Updated Nov 14, 2025Nov 14, 2025
    • NDA

      Public
      Code for "Nonparametric Data Attribution for Diffusion Models"
      Jupyter Notebook
      01510Updated Nov 11, 2025Nov 11, 2025
    • tty-use

      Public
      C
      01400Updated Oct 13, 2025Oct 13, 2025
    • imperceptible-jailbreaks

      Public
      [ArXiv 2025] Imperceptible Jailbreaking against Large Language Models
      Python
      52400Updated Oct 7, 2025Oct 7, 2025
    • variational-reasoning

      Public
      Code for "Variational Reasoning for Language Models"
      Python
      15610Updated Sep 29, 2025Sep 29, 2025
    • autofd

      Public
      Automatic Functional Differentiation in JAX
      Python
      18160Updated Sep 18, 2025Sep 18, 2025
    • BanditSpec

      Public
      Python
      2500Updated Sep 2, 2025Sep 2, 2025
    • understand-r1-zero

      Public
      Understanding R1-Zero-Like Training: A Critical Perspective
      Python
      561.2k90Updated Aug 27, 2025Aug 27, 2025
    • Video-Next-Event-Prediction

      Public
      Python
      12030Updated Aug 9, 2025Aug 9, 2025
    • Optimizing Anytime Reasoning via Budget Relative Policy Optimization
      Python
      35100Updated Jul 15, 2025Jul 15, 2025
    • LongSpec

      Public
      LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
      Python
      37300Updated Jul 14, 2025Jul 14, 2025
    • [ICLR 2025] When Attention Sink Emerges in Language Models: An Empirical View (Spotlight)
      Python
      515510Updated Jul 8, 2025Jul 8, 2025
    • VeriFree

      Public
      Reinforcing General Reasoning without Verifiers
      Python
      69670Updated Jun 24, 2025Jun 24, 2025
    • Adan

      Public
      Adan: Adaptive Nesterov Momentum Algorithm for Faster Optimizing Deep Models
      Python
      7080750Updated Jun 8, 2025Jun 8, 2025
    • [CVPR 2025] TreeMeshGPT: Artistic Mesh Generation with Autoregressive Tree Sequencing
      Python
      1317950Updated May 22, 2025May 22, 2025
    • Python
      11910Updated May 20, 2025May 20, 2025
    • zero-bubble-pipeline-parallelism

      Public
      Zero Bubble Pipeline Parallelism
      Python
      3.6k449290Updated May 7, 2025May 7, 2025
    • Python
      714410Updated May 6, 2025May 6, 2025
    • Python
      23310Updated Apr 22, 2025Apr 22, 2025
    • The official implementation of "LightTransfer: Your Long-Context LLM is Secretly a Hybrid Model with Effortless Adaptation"
      Python
      02200Updated Apr 22, 2025Apr 22, 2025