Agent skill for running Codex or Claude Code as an orchestrator over Symphony workers and Linear issues. Plans waves, dispatches workers, reviews and merges, and optionally pursues a goal across many waves under hard budget caps.
A lightweight, declarative agent harness — define multi-agent workflows as YAML, run them from Python or the CLI, and they get measurably better every run.
Memory that learns and keeps itself current. A six-layer memory stack for Claude Code plus a nightly learning loop (capture, consolidation, scouts, conductor) that promotes your lessons into rules and surfaces new tools that fit your stack. Free, MIT.
Continual agent skill evolution through persistent decision history. Whole-skill optimisation (SKILL.md + scripts + references) with every decision landing as a local Git issue / PR / wiki. Runs on any agentskills.io runtime — Claude Code, Codex, OpenClaw, Hermes.
Five runnable demos for building resilient AI agents with Strands Agents: chaos-test the agent's tools, gate memory writes to stop hallucinations, defend against memory poisoning and prompt injection, verify multi-step tasks against the backend, and let an agent write its own tools. Runs on OpenAI or Amazon Bedrock.
6 tiny, runnable AI-agent demos on the 2026 frontier: self-improving agents, memory, theory of mind, active inference, verifiable behavior, and agent skills.
Evolve a state-of-the-art OfficeQA agent with EvoSkill and submit it to Sentient Arena Challenge 1 — in minutes. Seeded with a Cohort-0 highest-accuracy playbook. Clone → run → submit.
Methodology skill + deterministic harness for agents that self-iterate any skill/repo/project. Anti-self-deception by design: an un-gameable acceptance gate ensures that 'accepted' means a real improvement, not a fake score-up curve. Built for the open-ended, no-ground-truth domains the self-improving-agent literature skips. 521 tests.