AI Intern — AI-Powered Coding Assistant

A terminal & web-based AI coding assistant built on the deepagents library. It can explore, plan, and modify code in any local repository through a modern chat interface.

Features

Multi-Provider LLM Support — Azure OpenAI, OpenAI, Google Gemini, and local Ollama models via a unified factory.
Deep Agent — Powered by deepagents with built-in planning (write_todos), filesystem tools (ls, read_file, write_file, edit_file, grep, glob), shell execution, and sub-agent spawning.
Model Context Protocol (MCP) — Native integration via langchain_mcp_adapters to connect standard MCP servers (e.g., Microsoft Docs) for extended context.
Custom Tooling Extensibility — Includes robust custom tools (such as think for deep reasoning) loaded directly into the agent mapping.
Semantic Code Search — ChromaDB-backed vector search lets the agent find code by concept ("Where is the auth logic?") without needing exact filenames or grep patterns. Indexed locally using Ollama embeddings (no API key) with OpenAI fallback.
Browser Interaction (Playwright + Edge) — Five built-in browser tools let the agent visually verify frontend work: take screenshots, capture JS console logs, read the DOM, click elements, and detect failed network requests — all driven headlessly through your installed Microsoft Edge.
Git Version Control — 12 built-in Git tools let the agent clone repos from a URL, read diffs and history, commit its own work, branch safely before risky changes, push/pull to remotes, and generate AI-written commit messages.
Chainlit Web UI — Real-time token streaming, custom aesthetic tool step indicators ("Editing...", "Thinking..."), inline visual Diff and Terminal blocks, and a dynamic Tasks sidebar.
Admin Dashboard — Integrated FastAPI dashboard for observability (token usage, tool stats, LOC) and real-time agent configuration (system prompt, iteration limits, per-tool enable/disable). All tools are tracked in all_known_tools so disabling a tool never removes it from the dashboard view.
Prompt Debug Logging — Set DEBUG_PRINT_PROMPT=true in .env to print the full prompt (per message block with char counts and token estimates) to stdout before every LLM call. Useful for diagnosing token waste.
In-Memory Persistence — Conversation memory is maintained across turns within a session via LangGraph's MemorySaver checkpointer with unique thread_ids.
CLI Mode — Lightweight terminal interface for quick interactions.

Project Structure

ai-intern/
├── app.py                   # Production entry point (Chat + Dashboard combined)
├── assistant_ui.py          # Chainlit web UI — streaming, tool rendering, auth
├── assistant_cli.py         # Lightweight CLI interface
├── init_db.py               # One-time DB schema setup
│
├── core/                    # Agent brain
│   ├── coding_assistant.py  # DeepAgent setup, system prompt, tool wiring
│   ├── llm_factory.py       # Multi-provider LLM initialization
│   └── mcp_client.py        # MCP server configuration & tool loading
│
├── tools/                   # All LangChain tools
│   ├── custom_tools.py      # think, read_package_source
│   ├── browser_tools.py     # Playwright browser tools (screenshot, DOM, console, network)
│   ├── git_tools.py         # Git tools (clone, diff, commit, push, branch, log, blame...)
│   └── vector_search.py     # Semantic code search (ChromaDB + Ollama/OpenAI embeddings)
│
├── dashboard/               # Admin dashboard
│   ├── api.py               # FastAPI routes (observability + config endpoints)
│   ├── db.py                # SQLite helpers (telemetry, config persistence)
│   └── static/              # Dashboard frontend (HTML/JS)
│
├── public/                  # Chainlit custom UI elements
│   └── elements/
│       ├── DiffViewer.jsx
│       └── TerminalOutput.jsx
│
├── agent_data/              # Runtime databases (gitignored)
├── .env.example             # Template for API keys
├── .ai-intern-rules.example # Template for project-level rules
├── requirements-simple.txt  # Core dependencies
└── requirements.txt         # Full pip freeze

Quick Start

1. Clone & Install

git clone <repo-url>
cd ai-intern
python -m venv env
env\Scripts\activate        # Windows
pip install -r requirements-simple.txt

2. Configure API Keys

cp .env.example .env
# Edit .env with your actual API keys

3. Run the Web UI

# First time only — create the SQLite schemas
python init_db.py

# Run the combined app (Chat at / and Dashboard at /dashboard)
uvicorn app:app --host 127.0.0.1 --port 8000

Open http://localhost:8000 and enter the name of a sibling project folder (e.g., my-project). The assistant will have full access to that repository.

4. (Alternative) Run the CLI

python assistant_cli.py

Admin Dashboard

The project includes a built-in dashboard for monitoring and configuration:

Observability: Track token usage (prompt vs completion), model distribution, and tool invocation stats.
Agent Configuration: Edit the system prompt, iteration limits, and toggle specific tools on/off in real-time.
Session History: View detailed logs of past conversations, including exact tool calls and durations.

Access it at http://localhost:8000/dashboard when running via app.py.

Browser Interaction Tools

The agent can interact with a running browser to verify frontend changes without any manual DevTools work. All tools are powered by Playwright and use your system-installed Microsoft Edge — no extra browser download needed.

Tool	What it does
`browser_screenshot`	Navigates to a URL and returns a screenshot rendered inline in the chat
`browser_get_console_logs`	Captures all `console.error`, `console.warn`, and `console.log` output during page load
`browser_get_dom`	Returns the `outerHTML` of a CSS selector or the full `body.innerHTML`
`browser_click_and_screenshot`	Clicks an element by CSS selector and screenshots the resulting state
`browser_get_network_errors`	Lists all HTTP 4xx/5xx responses and failed requests during page load

Setup — just install the Python package, no browser binary download required (If you have MS Edge, else download is required.):

pip install playwright
# playwright install msedge  ← only if Edge isn't already on your machine

External URL access — by default all tools allow navigation to any URL including cloud/public sites. To restrict to localhost only (e.g. for a sandboxed environment), pass allow_external=False when calling any browser tool, or flip the default in browser_tools.py:

# browser_tools.py — change the default on any tool signature
async def browser_screenshot(url: str, ..., allow_external: bool = False):  # locked to localhost
async def browser_screenshot(url: str, ..., allow_external: bool = True):   # allows all URLs (default)

Self-healing loop example — the agent can chain these tools automatically:

edit_file → execute (npm run dev) → browser_screenshot
         → browser_get_console_logs → [errors found] → think → edit_file → ...

Semantic Code Search

The agent can search the codebase by meaning, not just by keyword. Useful for unfamiliar or large codebases where you don't know the exact filename or function name.

Tool	What it does
`semantic_code_search`	Find relevant code chunks by natural language query
`rebuild_code_index`	Force a full re-index after large refactors

Example queries the agent can answer directly:

"Where is the authentication logic?"
"Find the database connection setup"
"Which file handles payment processing?"

The index is built automatically on first use and persisted in agent_data/chroma/. Subsequent sessions load it from disk instantly.

Embedding backends (in priority order):

Ollama nomic-embed-text — fully local, zero cost, zero API key
OpenAI text-embedding-3-small — automatic fallback if Ollama isn't running

Setup:

pip install langchain-chroma chromadb langchain-text-splitters

# For local embeddings (recommended — no API key needed):
ollama pull nomic-embed-text

# Or just set OPENAI_API_KEY and it falls back automatically

Git Tools

The agent has full Git awareness via 12 built-in tools backed by GitPython. Just share a remote URL and the agent can clone, explore, modify, commit, and push — all from the chat.

Cloning a repo

Tell the agent: "Clone https://github.com/user/repo and work on it" — it will call git_clone and immediately have the repo available as a workspace.

All available tools

Tool	Type	What it does
`git_clone`	Write	Clones a remote URL into the workspace parent directory
`git_status`	Read	Shows staged, modified, and untracked files
`git_diff`	Read	Returns unified diff of working tree or staged changes
`git_log`	Read	Returns recent commit history as structured JSON
`git_blame`	Read	Returns line-by-line authorship for a file
`git_commit`	Write	Stages files and creates a commit
`git_create_branch`	Write	Creates and checks out a new branch
`git_checkout`	Write	Switches branch or restores a file to HEAD
`git_push`	Write	Pushes current branch to remote (never force-pushes)
`git_pull`	Write	Pulls latest changes from remote
`git_stash`	Write	Stashes or restores uncommitted changes
`git_generate_commit_message`	AI	Reads staged diff and generates a conventional-commits message

Safe experimentation mode

Before making risky or large-scale changes, the agent automatically:

Checks git_status to confirm the repo is clean
Creates a new branch (ai-intern/<task-slug>) via git_create_branch
Makes all changes on that branch
Commits with an AI-generated message

If you dislike the result, just delete the branch — main is untouched.

Approval gates

git_commit, git_push, git_pull, and git_checkout trigger the existing human-in-the-loop approval prompt in the Chainlit UI before executing. Read-only tools (git_status, git_diff, git_log, git_blame) run without interruption.

Setup

pip install gitpython
# git must be installed and on PATH (standard on any dev machine)

How It Works

You provide a project folder name — the assistant maps it to a sibling directory alongside ai-intern/.
The agent plans first — it uses write_todos to create a task list before touching any code.
It explores efficiently — grep and glob find files & line numbers directly instead of navigating with ls.
It streams results in real-time — tokens appear word-by-word, tool calls show as collapsible steps, and the task sidebar updates live.
Memory persists across turns — within a session, the agent remembers everything you've discussed.

Configuration

Switching LLM Providers

Edit the provider in coding_assistant.py:

llm = get_llm(provider="azure")      # Azure OpenAI (default)
llm = get_llm(provider="openai")     # OpenAI
llm = get_llm(provider="google")     # Google Gemini
llm = get_llm(provider="ollama")     # Local Ollama

For models that require the OpenAI Responses API instead of Chat Completions (e.g. codex-mini-latest, o3, o4-mini with extended thinking), pass use_responses_api=True:

llm = get_llm(provider="openai", model_name="codex-mini-latest", use_responses_api=True)

Agent Middleware

The agent runs with middleware layers applied automatically:

Middleware	What it does
`SummarizationMiddleware`	Auto-summarises old messages when token count exceeds `SUMMARIZATION_TOKEN_TRIGGER` — keeps long sessions alive
`PIIMiddleware` (email, credit card)	Redacts PII before it reaches the LLM
`PIIMiddleware` (password, secret_key)	Redacts passwords, `client_secret`, API keys, and tokens found in code files
`ModelRetryMiddleware`	Retries transient LLM API failures (rate limits, 503s) with exponential backoff
`ToolRetryMiddleware`	Retries transient tool failures (browser, git, MCP network blips)

Tool Enable / Disable

Tools can be toggled on/off from the dashboard Settings page. The full list of known tools is always shown — disabling a tool removes it from the agent's active tool list but keeps it visible in the dashboard. New tools added to the codebase are automatically discovered on the next session start and added as enabled by default.

Prompt Debug Logging

To see exactly what is sent to the LLM on every call (useful for diagnosing token waste):

# .env
DEBUG_PRINT_PROMPT=true

Output shows each message block with char count, estimated tokens, and a note about tool schema overhead (tool schemas are sent separately by the API and account for ~200 tokens per tool).

Summarization Token Trigger

The summarization middleware uses absolute token counts (fractional limits require model profile metadata not available on all deployments):

# .env — set to ~75% of your model's context window
SUMMARIZATION_TOKEN_TRIGGER=100000   # gpt-4o / gpt-4.1 (128k)
SUMMARIZATION_KEEP_MESSAGES=30

Common values: gpt-5 / gpt-5.3-instant (128k) → 100000, gpt-5.3-codex (200k+) → 150000, gpt-5.4 (1M) → 750000.

Persistent Memory (Production)

Conversation history and agent state are persisted to SQLite automatically:

agent_data/chainlit_ui.db — Chainlit UI threads, messages, history sidebar
agent_data/checkpoints_lg.db — LangGraph agent state (tool calls, messages per thread)
/memories/ — Cross-session long-term memory via deepagents StoreBackend (agent writes here to remember things across conversations)

To use PostgreSQL in production, swap SQLAlchemyDataLayer conninfo and AsyncSqliteSaver for their Postgres equivalents.

Environment Variables

Variable	Required	Description
`AZURE_OPENAI_API_KEY`	For Azure	Azure OpenAI API key
`AZURE_OPENAI_ENDPOINT`	For Azure	Azure resource endpoint
`AZURE_OPENAI_API_VERSION`	For Azure	API version
`AZURE_OPENAI_DEPLOYMENT_NAME`	For Azure	Model deployment name
`OPENAI_API_KEY`	For OpenAI	OpenAI API key
`GOOGLE_API_KEY`	For Google	Google Gemini API key
`TAVILY_API_KEY`	For MCP	Tavily web search API key
`CHAINLIT_AUTH_SECRET`	Yes	Secret for Chainlit cookie auth
`CHAINLIT_USER`	Yes	Login username
`CHAINLIT_PASSWORD`	Yes	Login password
`DEBUG_PRINT_PROMPT`	No	Set `true` to print full prompt + token estimates before every LLM call
`SUMMARIZATION_TOKEN_TRIGGER`	No	Token count that triggers summarization (default `100000`)
`SUMMARIZATION_KEEP_MESSAGES`	No	Messages to keep after summarization (default `30`)
`OLLAMA_BASE_URL`	For Ollama	Ollama server URL (default `http://localhost:11434`)

ToDO

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
.chainlit		.chainlit
core		core
dashboard		dashboard
public/elements		public/elements
tests		tests
tools		tools
.ai-intern-rules.example		.ai-intern-rules.example
.env.example		.env.example
.gitignore		.gitignore
app.py		app.py
architecture.md		architecture.md
assistant_cli.py		assistant_cli.py
assistant_ui.py		assistant_ui.py
chainlit.md		chainlit.md
improvement_suggestions.md		improvement_suggestions.md
init_db.py		init_db.py
pytest.ini		pytest.ini
readme.md		readme.md
requirements-simple.txt		requirements-simple.txt
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

AI Intern — AI-Powered Coding Assistant

Features

Project Structure

Quick Start

1. Clone & Install

2. Configure API Keys

3. Run the Web UI

4. (Alternative) Run the CLI

Admin Dashboard

Browser Interaction Tools

Semantic Code Search

Git Tools

Cloning a repo

All available tools

Safe experimentation mode

Approval gates

Setup

How It Works

Configuration

Switching LLM Providers

Agent Middleware

Tool Enable / Disable

Prompt Debug Logging

Summarization Token Trigger

Persistent Memory (Production)

Environment Variables

ToDO

License

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

AI Intern — AI-Powered Coding Assistant

Features

Project Structure

Quick Start

1. Clone & Install

2. Configure API Keys

3. Run the Web UI

4. (Alternative) Run the CLI

Admin Dashboard

Browser Interaction Tools

Semantic Code Search

Git Tools

Cloning a repo

All available tools

Safe experimentation mode

Approval gates

Setup

How It Works

Configuration

Switching LLM Providers

Agent Middleware

Tool Enable / Disable

Prompt Debug Logging

Summarization Token Trigger

Persistent Memory (Production)

Environment Variables

ToDO

License

About

Topics

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages