@lmarena

LMArena

An open platform to evaluate, benchmark, compare, and test frontier AI models

Popular repositories

  1. arena-hard-auto (Public)

    Arena-Hard-Auto: An automatic LLM benchmark.

    Python · 988 stars · 143 forks

  2. copilot-arena (Public)

    TypeScript · 350 stars · 26 forks

  3. p2l (Public)

    Prompt-to-Leaderboard

    Python · 271 stars · 24 forks

  4. arena-rank (Public)

    Source code for the LMArena leaderboard methodology

    Python · 71 stars · 5 forks

  5. PPE (Public)

    Jupyter Notebook · 62 stars · 13 forks

  6. search-arena (Public)

    ⚔️ Official code of "Search Arena: Analyzing Search-Augmented LLMs".

    Jupyter Notebook · 49 stars · 7 forks

Repositories

Showing 10 of 11 repositories
