Skip to content
@langwatch

LangWatch

012d1688-24ae-4759-ae70-5f8f81a13c0e

Get started | Scenarios | Instrument with MCP | Documentation

Welcome to LangWatch, an open-source platform for building and shipping reliable LLM-powered agents.

LangWatch combines observability, evaluation, and scenario-based testing to help teams understand agent behavior across real workflows — from production traces to simulated failures.

It empowers domain experts to review and score conversations, developers to debug and evaluate agents end-to-end, and business teams to track quality, usage, and cost with custom analytics.

lwp_og.webm

You can sign up and already start the integration on our free tier by following the guides bellow:

🚀 Quick Start

Ship safer agents in minutes. Create a free account, then dive into these guides:

🔑 Key Projects

  • LangWatch The core open-source platform for observing, evaluating, and testing LLM-powered agents.

  • Scenarios End-to-end simulations for multi-step, tool-using agents across real workflows.

  • Better Agents A standard and tooling ecosystem for building production-grade AI agents.

  • Docs Source for LangWatch documentation.

🤝 Contributing

Open-source is at the heart of LangWatch. We welcome issues, pull requests, and discussions of new ideas.

Please read our Contribution Guidelines for details on our code of conduct and contribution process.

🛟 Support

Need help or want to get involved?

Popular repositories Loading

  1. langwatch langwatch Public

    The platform for LLM evaluations and AI agent testing

    TypeScript 2.8k 254

  2. better-agents better-agents Public

    Standards for building agents, better

    TypeScript 1.5k 155

  3. scenario scenario Public

    Agentic testing for agentic codebases

    TypeScript 785 56

  4. langevals langevals Public

    LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM…

    70 10

  5. kanban-code kanban-code Public

    Swift 69 7

  6. data-simulator data-simulator Public

    Synthetic Data Generation

    Jupyter Notebook 9 1

Repositories

Showing 10 of 30 repositories
  • langwatch Public

    The platform for LLM evaluations and AI agent testing

    langwatch/langwatch’s past year of commit activity
    TypeScript 2,846 254 241 (1 issue needs help) 122 Updated Mar 4, 2026
  • kanban-code Public
    langwatch/kanban-code’s past year of commit activity
    Swift 69 7 0 0 Updated Mar 4, 2026
  • scenario Public

    Agentic testing for agentic codebases

    langwatch/scenario’s past year of commit activity
    TypeScript 785 MIT 56 22 15 Updated Mar 3, 2026
  • bank-example Public
    langwatch/bank-example’s past year of commit activity
    Python 1 0 0 3 Updated Mar 2, 2026
  • docs Public

    Docs for LangWatch LLM Ops Platform

    langwatch/docs’s past year of commit activity
    MDX 3 3 0 5 Updated Mar 2, 2026
  • claude-resume Public
    langwatch/claude-resume’s past year of commit activity
    TypeScript 1 0 0 0 Updated Feb 28, 2026
  • better-agents Public

    Standards for building agents, better

    langwatch/better-agents’s past year of commit activity
    TypeScript 1,496 MIT 155 11 3 Updated Feb 22, 2026
  • langwatch-nebius Public

    LangWatch x Nebius: Comparing LLM models for AI agent quality using agent simulations

    langwatch/langwatch-nebius’s past year of commit activity
    Python 1 0 0 0 Updated Feb 18, 2026
  • claude-remote Public
    langwatch/claude-remote’s past year of commit activity
    Shell 4 MIT 1 0 0 Updated Feb 17, 2026
  • langevals Public

    LangEvals aggregates various language model evaluators into a single platform, providing a standard interface for a multitude of scores and LLM guardrails, for you to protect and benchmark your LLM models and pipelines.

    langwatch/langevals’s past year of commit activity
    70 10 3 (1 issue needs help) 15 Updated Feb 15, 2026

People

This organization has no public members. You must be a member to see who’s a part of this organization.