An MCP server for AI-assisted Glean connector development. Add it to your IDE's MCP config and use Claude Code, Cursor, or any MCP-compatible AI assistant to scaffold, schema-map, generate, and test Glean connectors — without leaving your editor.
- Node.js ≥ 20
- Python + uv (for
run_connectorand Copier scaffolding) - A Glean API token
Add to .claude/mcp.json in your project (or ~/.claude/mcp.json globally):
{
"mcpServers": {
"local": {
"command": "npx",
"args": [
"-y",
"@gleanwork/connector-mcp"
],
"type": "stdio"
}
}
}For setup instructions for Cursor, VS Code, Windsurf, Goose, Codex, JetBrains, Gemini CLI, OpenCode, and more, see docs/setup.md.
These are set in the MCP server config, not in your connector project.
| Variable | Required | Description |
|---|---|---|
GLEAN_PROJECT_PATH |
No | Default project directory; overridden by create_connector |
GLEAN_CONNECTOR_TEMPLATE_PATH |
No | Path to a custom Copier template (defaults to copier-glean-connector in workspace) |
GLEAN_WORKER_COMMAND |
No | Command to start the Python worker (default: uv run python -m glean.indexing.worker) |
GLEAN_WORKER_REQUEST_TIMEOUT |
No | Max seconds to wait for a worker JSON-RPC response (default: 30) |
GLEAN_WORKER_SHUTDOWN_TIMEOUT |
No | Seconds to wait for graceful worker shutdown before SIGKILL (default: 5) |
GLEAN_INSTANCE and GLEAN_API_TOKEN belong in your connector project's .env file — create_connector generates a .env.example to get you started.
In your AI assistant, try:
"I want to build a Glean connector for our Salesforce Opportunities data. The API uses OAuth2 bearer tokens and returns paginated JSON. Let's start."
Or call get_started — the assistant will ask what you're connecting and walk you through the rest.
Six steps from zero to a running connector. The assistant guides you through each one.
| Step | What you're doing | Tool |
|---|---|---|
| 0 | Verify prerequisites | check_prerequisites |
| 1 | Scaffold the project | create_connector |
| 2 | Configure the data source | set_config |
| 3 | Define the schema (infer from a sample file or write it) | infer_schema + update_schema |
| 4 | Map fields and verify required Glean fields are covered | confirm_mappings + validate_mappings |
| 5 | Generate the Python connector code | build_connector |
| 5a | Implement real API calls in data_client.py | get_data_client + update_data_client |
| 6 | Run the connector and inspect results | run_connector + inspect_execution |
| Tool | Description |
|---|---|
get_started |
Open the guided workflow; the assistant asks what you're connecting |
check_prerequisites |
Verify uv, python, mise, copier, and Glean credentials are all configured |
create_connector |
Scaffold a new connector project and set the active session path |
list_connectors |
List all connector classes found in the project with their DataClient status |
set_config |
Write connector config (auth, endpoint, pagination) to .glean/config.json |
get_config |
Read .glean/config.json |
| Tool | Description |
|---|---|
infer_schema |
Parse a .csv, .json, or .ndjson file and return field analysis |
get_schema |
Read current .glean/schema.json |
update_schema |
Write field definitions to .glean/schema.json |
analyze_field |
Deep-dive on a single field: samples, type, Glean mapping suggestions |
| Tool | Description |
|---|---|
get_mappings |
Show source schema and Glean entity model side-by-side |
confirm_mappings |
Save field mapping decisions to .glean/mappings.json |
validate_mappings |
Check mappings for missing required Glean fields |
| Tool | Description |
|---|---|
get_data_client |
Read data_client.py and connector config — use before asking AI to write real API calls |
update_data_client |
Write a new data_client.py implementation (replaces the mock with real API calls) |
| Tool | Description |
|---|---|
build_connector |
Generate src/{module}/connector.py, models.py, mock_data.json from schema+mappings+config |
run_connector |
Start async connector execution; returns execution_id immediately |
inspect_execution |
Poll execution status; returns records, validation issues, logs |
manage_recording |
Record/replay/list/delete connector data recordings |
After create_connector, your project directory looks like:
my-connector/
├── src/
│ └── {module_name}/
│ ├── connector.py ← generated by build_connector
│ ├── models.py ← generated TypedDict for source data
│ └── mock_data.json ← sample data for local testing
├── CLAUDE.md ← workflow guidance (for Claude Code users)
└── .glean/
├── schema.json ← field schema
├── mappings.json ← field mappings to Glean entity model
├── config.json ← connector configuration
├── executions/ ← execution results (written on completion)
└── recordings/ ← captured API responses for replay
The server exposes a connector://workflow resource that returns the full authoring guide. Your AI assistant can fetch it at session start for workflow context.
- Single project per MCP session. The server tracks one active project at a time. To switch projects, set
GLEAN_PROJECT_PATHin the MCP server config or restart the server after runningcreate_connectorfor the new project. - Execution state is in-memory. Active execution history is lost when the MCP server restarts. Completed execution results written to
.glean/executions/persist on disk, but any in-progress executions must be re-run.
uv is not installed or is not on your PATH. The server requires uv to scaffold connector projects and to run the Python worker.
Install it following the official uv instructions, then verify:
uv --versionIf uv is installed but not on the PATH seen by your IDE, add it explicitly in the MCP server env config or set GLEAN_WORKER_COMMAND to the full path of an alternative command.
The server could not locate the copier-glean-connector template. By default it looks for the template alongside this package in the Glean workspace or clones it from github.com/gleanwork over SSH.
Set the GLEAN_CONNECTOR_TEMPLATE_PATH environment variable to the absolute path of a local checkout of the template:
{
"env": {
"GLEAN_CONNECTOR_TEMPLATE_PATH": "/path/to/copier-glean-connector"
}
}The Python worker process exited immediately. This usually means one of:
- You are not inside a Copier-scaffolded connector project. The
run_connectortool must be called aftercreate_connectorhas set up the project directory with the correctpyproject.tomland dependencies. - The
glean-indexing-sdkis not installed in the project's virtual environment. Runuv syncinside the connector project directory to install dependencies. - Wrong working directory. Ensure
GLEAN_PROJECT_PATHpoints to the connector project root, or runcreate_connectorfirst to set the active session path automatically.