CLAUDE.md

This file provides guidance to Claude Code (claude.ai/code) when working with code in this repository.

Project Overview

dacli (Documentation Access CLI) - Navigate and query large documentation projects. Supports AsciiDoc and Markdown with hierarchical, content-aware access. Available as CLI tool and MCP server for LLM integration.

Part of the docToolchain ecosystem.

Current State: Core implementation complete. AsciiDoc/Markdown parsers, Structure Index, CLI and MCP tools are working. Some tech-debt issues remain for advanced features.

Technology Stack

Language: Python 3.12+
Package Manager: uv (https://github.com/astral-sh/uv)
MCP Framework: FastMCP (https://github.com/jlowin/fastmcp)
MCP SDK: mcp[cli]

Fork-Based Development Workflow

Development happens on a fork to keep upstream/main stable for uv tool install.

Git Remotes:

origin → your fork (for development)
upstream → docToolchain/dacli (stable, production-ready)

Branches:

upstream/main - Stable, production-ready. Default branch (used by uv tool install).
Feature branches on fork - Use format feature/description-issue-number or fix/description-issue-number

Workflow:

Sync fork: git fetch upstream
Create feature branch: git checkout -b feature/xyz upstream/main
Implement changes with tests
Bump version (semantic versioning - IMPORTANT: Update all 3 locations!):
- MAJOR: Breaking changes (e.g., 1.0.0 → 2.0.0)
- MINOR: New features, backwards-compatible (e.g., 0.4.0 → 0.5.0)
- PATCH: Bug fixes only (e.g., 0.4.0 → 0.4.1)
Update these files:
1. pyproject.toml - line 3: version = "X.Y.Z"
2. src/dacli/__init__.py - line 7: __version__ = "X.Y.Z"
3. uv.lock - run uv sync to auto-update
Push to fork: git push origin feature/xyz
Create PR from fork to upstream/main (use Fixes #123 in PR body)
CI runs on PR, review, then merge
Issues auto-close when merged to main

After merge: Sync fork main with upstream to prevent duplicates:

git checkout main
git reset --hard upstream/main
git push origin main --force-with-lease

IMPORTANT:

Always create feature branches from upstream/main, never from fork main. This ensures clean, linear history and avoids duplicate commits.
Always bump the version number for every PR (even small bug fixes get a PATCH bump):
- Update all 3 files: pyproject.toml, src/dacli/__init__.py, and uv.lock (via uv sync)
- Missing any file causes --version to show incorrect version

Conventions

Documentation, issues, pull requests, etc. are always written in English.
Use responsible-vibe-mcp wherever suitable.

AsciiDoc Linting

The project uses asciidoc-linter as a pre-commit hook to catch AsciiDoc syntax errors.

Known Issues (as of 2026-02-11):

WS001 false positives: Linter incorrectly flags *bold* and **bold** syntax as list markers needing spaces
- These warnings can be safely ignored - the syntax is correct AsciiDoc
- See asciidoc-linter#41
- All ADR files (ADR-001 through ADR-013) trigger these false positives
Markdown tables NOT detected: Linter does not catch Markdown table syntax (|---|---|) in AsciiDoc files
- See asciidoc-linter#38
- Manual review required for table syntax

Workaround for commits:

When committing changes to .adoc files, the pre-commit hook may fail due to WS001 false positives:

# If only false positives (verify manually):
git commit --no-verify -m "your message"

# Otherwise, fix the errors the linter reports and commit normally
git commit -m "your message"

Real errors to watch for:

Incorrect table syntax (Markdown vs AsciiDoc)
Heading hierarchy issues (HEAD001-003)
Unterminated blocks (BLOCK001)

The linter runs on all .adoc and .asciidoc files in the repository.

Commands

# Install uv (if needed)
curl -LsSf https://astral.sh/uv/install.sh | sh

# Install dependencies
uv sync

# Run CLI
uv run dacli --help

# Run MCP server
uv run dacli-mcp --docs-root /path/to/docs

# Add dependencies
uv add <package-name>
uv add --dev <package-name>

# Run tests
uv run pytest

# Run tests with HTML report
uv run pytest --html=report.html --self-contained-html

# Run single test
uv run pytest tests/path/to/test.py::test_function_name

Architecture

Core Design Principles

In-Memory Index: On startup, parse all docs and build an in-memory structure index for fast lookups
File System as Truth: Stateless design - file system is the single source of truth, no database
Atomic Writes: File modifications use temp files + backup strategy (ADR-004)
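
The atomic-write principle can be illustrated as follows. This is a sketch under assumptions, not the implementation described in ADR-004: the function name and the `.bak` backup naming are hypothetical.

```python
import os
import shutil
import tempfile
from pathlib import Path


def atomic_write(path: Path, content: str, backup_suffix: str = ".bak") -> None:
    """Write content to path so readers never observe a half-written file."""
    path = Path(path)
    # Keep a copy of the previous version, if any (hypothetical ".bak" naming).
    if path.exists():
        shutil.copy2(path, path.with_name(path.name + backup_suffix))
    # Create the temp file next to the target so the rename stays on one file system.
    fd, tmp_name = tempfile.mkstemp(dir=path.parent, prefix=path.name, suffix=".tmp")
    try:
        with os.fdopen(fd, "w", encoding="utf-8") as tmp:
            tmp.write(content)
            tmp.flush()
            os.fsync(tmp.fileno())  # push the data to disk before renaming
        os.replace(tmp_name, path)  # replaces the target in a single step
    except BaseException:
        # On failure, remove the temp file; the original file is left untouched.
        if os.path.exists(tmp_name):
            os.unlink(tmp_name)
        raise
```

Writing to a sibling temp file and renaming it means a crash leaves either the old file or the new one, never a partial mix.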

Components (see `src/docs/arc42/chapters/05_building_block_view.adoc`)

Component	Responsibility
API Endpoints	FastAPI routes, request validation
Document Service	Business logic for navigation/manipulation
Document Parser	Parse AsciiDoc/Markdown, resolve includes, build AST with line numbers
Structure Index	In-memory hierarchical structure for fast lookups
File System Handler	Atomic read/write operations
Search Service	Full-text search across indexed content
Validation Service	Structure validation, circular include detection

Key Data Models

Section: Hierarchical document section with path (dot-notation), title, level, location
SourceLocation: File path + line range (line, end_line, resolved_from for includes)
Element: Typed content (code, table, image, list, plantuml, admonition) with location and index

Path Format Convention

Section paths use hybrid notation combining colons and dots:

Colon (:) - Separates document slug from first-level section
Dot (.) - Separates nested sections

Examples:

Level 0 (document root): "doc" (document slug only)
Level 1 (chapter): "doc:introduction" (document:section)
Level 2 (subsection): "doc:introduction.goals" (document:section.subsection)
Level 3 (nested): "doc:introduction.goals.technical" (document:section.subsection.detail)

Important: Use colon only once to separate document from section. Use dots for all nested levels.

Common mistakes:

❌ "doc:section:subsection" - Multiple colons (wrong)
✅ "doc:section.subsection" - Colon once, then dots (correct)
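
The convention can be expressed as a small parser. This is an illustrative sketch with a hypothetical function name, not the project's own path handling.

```python
def split_section_path(path: str) -> tuple[str, list[str]]:
    """Split "doc:section.subsection" into ("doc", ["section", "subsection"])."""
    if path.count(":") > 1:
        raise ValueError(f"use the colon only once: {path!r}")
    slug, sep, rest = path.partition(":")
    if not slug:
        raise ValueError(f"missing document slug: {path!r}")
    if sep and not rest:
        raise ValueError(f"missing section after colon: {path!r}")
    parts = rest.split(".") if rest else []
    if any(not part for part in parts):
        raise ValueError(f"empty section name in path: {path!r}")
    return slug, parts
```

The length of the returned list equals the section level, so a bare document slug yields an empty list (level 0).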

Documentation Structure

src/docs/
├── 50-user-manual/  # User documentation
│   ├── index.adoc           # User manual index
│   ├── 10-installation.adoc # Installation guide
│   ├── 20-mcp-tools.adoc    # MCP tools reference
│   └── 50-tutorial.adoc     # CLI tutorial
├── arc42/           # Architecture documentation (arc42 template)
│   └── chapters/    # Individual architecture chapters
└── spec/            # Specifications
    ├── 01_use_cases.adoc          # Use cases with activity diagrams
    ├── 02_api_specification.adoc  # OpenAPI-style API spec
    ├── 03_acceptance_criteria.adoc # Gherkin scenarios
    ├── 04_markdown_parser.adoc    # Markdown parser component spec
    └── 06_cli_specification.adoc  # CLI interface specification

Specification Conventions

Use Cases (`01_use_cases.adoc`)

PlantUML Activity Diagrams for each use case
Structure: Actors (Akteure), Preconditions (Vorbedingungen), Flow (Ablauf), Postconditions (Nachbedingungen), Error scenarios (Fehlerszenarien)

API Specification (`02_api_specification.adoc`)

OpenAPI-style in AsciiDoc format
Data models as JSON schemas with descriptions
Endpoints grouped by API category (Navigation, Content Access, Manipulation, Meta-Information)

Acceptance Criteria (`03_acceptance_criteria.adoc`)

Gherkin format (Given-When-Then)
Grouped by feature/use case
German language

Component Specifications (e.g., `04_markdown_parser.adoc`)

Scope and limitations (what it does NOT do)
Data models as Python dataclasses
Acceptance criteria with Gherkin scenarios
Interface definition

Architecture Decision Records (ADRs)

Nygard format: Status, Context, Decision, Consequences
Pugh Matrix for each decision comparing alternatives
Located in src/docs/arc42/chapters/09_architecture_decisions.adoc

Key Architecture Decisions (ADRs)

Located in src/docs/arc42/chapters/09_architecture_decisions.adoc:

ADR-001: File system as single source of truth (no database)
ADR-002: In-memory index for performance
ADR-003: Python/FastAPI stack
ADR-004: Atomic writes via temporary files
ADR-005: Custom parser for include resolution and source mapping
ADR-006: uv for Python package management

Parser Specifics

AsciiDoc Structure Parser (`AsciidocStructureParser`)

Resolves include::[] directives recursively
Tracks original file path and line numbers for every element
Builds AST with source-map information
Detects circular includes
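
Recursive include resolution with source tracking and cycle detection can be sketched as below. This is a simplified illustration, not the `AsciidocStructureParser` code: the function, the exception class, and the `read` callback are hypothetical, and include targets are used as given rather than resolved relative to the including file.

```python
import re
from typing import Callable

INCLUDE_RE = re.compile(r"^include::(?P<target>[^\[]+)\[[^\]]*\]\s*$")


class CircularIncludeError(Exception):
    """Raised when a file includes itself, directly or indirectly."""


def resolve_includes(
    name: str,
    read: Callable[[str], str],
    _stack: tuple[str, ...] = (),
) -> list[tuple[str, int, str]]:
    """Expand include directives; return (source_file, line_number, text) triples."""
    if name in _stack:
        raise CircularIncludeError(" -> ".join((*_stack, name)))
    resolved: list[tuple[str, int, str]] = []
    for number, text in enumerate(read(name).splitlines(), start=1):
        match = INCLUDE_RE.match(text)
        if match:
            # Recurse with the current file pushed onto the include stack.
            resolved.extend(resolve_includes(match["target"], read, (*_stack, name)))
        else:
            # Every emitted line keeps its original file and line number.
            resolved.append((name, number, text))
    return resolved
```

Passing the reader as a callback keeps the sketch testable with an in-memory dictionary instead of real files.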

Markdown Structure Parser (`MarkdownStructureParser`)

Folder hierarchy = document structure (no includes)
Sorting: index.md/README.md first, then alphabetic with numeric prefix support
Extracts: headings (structure), code blocks (with content), tables, images
YAML frontmatter support for metadata
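
The sorting rule can be written as a sort key. This sketch is an assumption about the details, not the parser's actual logic: it places files with a numeric prefix (compared by integer value) before files without one, and the function name is hypothetical.

```python
import re

_PREFIX_RE = re.compile(r"^(\d+)[-_. ]?(.*)$")
_FIRST = {"index.md", "readme.md"}


def markdown_sort_key(filename: str) -> tuple[int, int, str]:
    """Sort key: index/README first, then numeric prefixes, then alphabetic."""
    lower = filename.lower()
    if lower in _FIRST:
        return (0, 0, lower)
    match = _PREFIX_RE.match(lower)
    if match:
        # Compare the prefix as a number so "2-" sorts before "10-".
        return (1, int(match.group(1)), match.group(2))
    return (2, 0, lower)
```

For example, `sorted(names, key=markdown_sort_key)` orders `10-setup.md` after `2-intro.md`, which a plain string sort would not.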

Available MCP Tools

Tool	Description
`get_structure`	Get hierarchical document structure
`get_section`	Read content of a specific section
`get_sections_at_level`	Get all sections at a nesting level
`search`	Full-text search across documentation
`get_elements`	Get code blocks, tables, images, etc.
`get_metadata`	Get project or section metadata
`validate_structure`	Validate documentation structure
`update_section`	Update section content (with optimistic locking)
`insert_content`	Insert content before/after sections
`get_dependencies`	Get include tree for AsciiDoc documents

For detailed tool documentation, see src/docs/50-user-manual/.

C4 Diagram Quality Rules

When creating or modifying C4 diagrams in arc42 docs:

File systems and databases are ContainerDb elements inside the system boundary, NOT System_Ext
Component diagrams show internals of ONE container — keep consistent across diagrams
You can NOT zoom into components, only into containers
Use LAYOUT_WITH_LEGEND() consistently on all diagrams
Write abstraction type in Container_Boundary labels (e.g. "MCP Server [Container: Python, FastMCP]")
Review checklist: https://c4model.com/review

Risk Radar Assessment

Generated by /risk-assess on 2026-02-11

Module: dacli CLI

Dimension	Score	Level	Evidence
Code Type	2	Business Logic	Click commands, service layer orchestration (`src/dacli/cli.py`, `src/dacli/services/`)
Language	2	Dynamically typed	Python 3.12+ — 100% `.py` files
Deployment	1	Internal tool	Command-line tool for documentation teams (user confirmed)
Data Sensitivity	1	Internal business data	Operates on internal documentation (user confirmed)
Blast Radius	2	Data loss (recoverable)	Could corrupt docs, recoverable from git (user confirmed)

Tier: 2 — Extended Assurance (determined by max(Code Type=2, Language=2, Blast Radius=2))

Mitigations: dacli CLI (Tier 2)

Updated by /risk-mitigate on 2026-02-11

Measure	Status	Details
Tier 1 — Automated Gates
Linter & Formatter	✅ In place	Ruff (`pyproject.toml`, CI workflow)
Type Checking	N/A	Python without strict typing
Pre-Commit Hooks	✅ Set up	`.pre-commit-config.yaml` (commit 68d6ae4)
Dependency Check	✅ Set up	`pip-audit` in CI (commit fee56b6)
CI Build & Unit Tests	✅ In place	GitHub Actions (`.github/workflows/test.yml`), 713 tests with coverage
Tier 2 — Extended Assurance
SAST	✅ Set up	CodeQL security-extended (commit fead47e)
AI Code Review	✅ In place	`.github/workflows/claude-code-review.yml`
Property-Based Tests	✅ Set up	Hypothesis (`tests/test_property_based.py`, commit 87a965d)
SonarQube Quality Gate	✅ Set up	SonarCloud (`.github/workflows/sonarcloud.yml`, commit fb4c8ad)
Sampling Review (~20%)	✅ Set up	`.github/PR_REVIEW_POLICY.md` (commit efb868f)

Tier 1 completion: 4/4 (100%) | Tier 2 completion: 5/5 (100%) ✅

Security fixes applied: cryptography>=46.0.5, pip 26.0.1 (commit 7766e90)

Module: dacli-mcp

Dimension	Score	Level	Evidence
Code Type	2	Business Logic	MCP tools, service layer (`src/dacli/mcp_app.py`, `src/dacli/services/`)
Language	2	Dynamically typed	Python 3.12+ — 100% `.py` files
Deployment	1	Internal tool	MCP server for LLM integration in internal workflows (user confirmed)
Data Sensitivity	1	Internal business data	Operates on internal documentation (user confirmed)
Blast Radius	2	Data loss (recoverable)	Could corrupt docs, recoverable from git (user confirmed)

Tier: 2 — Extended Assurance (determined by max(Code Type=2, Language=2, Blast Radius=2))

Mitigations: dacli-mcp (Tier 2)

Updated by /risk-mitigate on 2026-02-11

Measure	Status	Details
Tier 1 — Automated Gates
Linter & Formatter	✅ In place	Ruff (`pyproject.toml`, CI workflow)
Type Checking	N/A	Python without strict typing
Pre-Commit Hooks	✅ Set up	`.pre-commit-config.yaml` (commit 68d6ae4)
Dependency Check	✅ Set up	`pip-audit` in CI (commit fee56b6)
CI Build & Unit Tests	✅ In place	GitHub Actions (`.github/workflows/test.yml`), 713 tests with coverage
Tier 2 — Extended Assurance
SAST	✅ Set up	CodeQL security-extended (commit fead47e)
AI Code Review	✅ In place	`.github/workflows/claude-code-review.yml`
Property-Based Tests	✅ Set up	Hypothesis (`tests/test_property_based.py`, commit 87a965d)
SonarQube Quality Gate	✅ Set up	SonarCloud (`.github/workflows/sonarcloud.yml`, commit fb4c8ad)
Sampling Review (~20%)	✅ Set up	`.github/PR_REVIEW_POLICY.md` (commit efb868f)

Tier 1 completion: 4/4 (100%) | Tier 2 completion: 5/5 (100%) ✅

Security fixes applied: cryptography>=46.0.5, pip 26.0.1 (commit 7766e90)

Note: Both modules share the same codebase and mitigations. Entry points differ (dacli.cli:cli vs dacli.main:main), but risk profile and protection measures are identical.

Architecture Decision Records:

link:src/docs/arc42/adr/ADR-011.adoc[ADR-011: Risk Classification - dacli CLI (Tier 2)]
link:src/docs/arc42/adr/ADR-012.adoc[ADR-012: Risk Classification - dacli-mcp (Tier 2)]
link:src/docs/arc42/adr/ADR-013.adoc[ADR-013: Security Mitigations - Tier 2 Implementation]
