A robust, automated pipeline that generates high-quality, strategically-sound LinkedIn posts using a multi-agent architecture powered by Google's Gemini AI models. This system handles everything from topic selection and research to content creation, review, and image generation.
This application orchestrates a series of specialized AI agents to create engaging LinkedIn content tailored to your field of expertise. Each agent performs a distinct task in a sequential workflow, ensuring consistent quality and strategic alignment while maintaining full auditability through comprehensive logging and artifact persistence.
- 🤖 Multi-Agent Architecture: Specialized agents for each stage of content creation
- 📊 Intelligent Topic Selection: Automatic topic generation with built-in variety tracking
- 🔍 Automated Research: Gathers relevant information and synthesizes findings
- ✍️ Strategic Content Creation: Uses "The Witty Expert" persona for engaging, memorable posts
- 🎨 AI-Generated Images: Automated image creation using Gemini's image generation capabilities
- 📝 Quality Assurance: Two-pass review system with character count validation
- 💾 Full Auditability: Every step logged and artifacts saved for debugging and improvement
- 🔄 Resilient Error Handling: Automatic retries, fallback strategies, and circuit breakers
┌─────────────────────────────────────────────────────────────────────────────┐
│ ORCHESTRATOR │
│ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ ┌─────────────┐ │
│ │ Config │ │ Retry │ │ Circuit │ │ Cost │ │
│ │ Loader │ │ Logic │ │ Breaker │ │ Tracker │ │
│ └──────┬──────┘ └──────┬──────┘ └──────┬──────┘ └──────┬──────┘ │
│ └────────────────┼────────────────┼────────────────┘ │
│ ▼ ▼ │
│ ┌───────────────────────────────────────────────────────────────────────┐ │
│ │ SEQUENTIAL PIPELINE │ │
│ │ │ │
│ │ ┌──────────┐ ┌──────────┐ ┌──────────┐ ┌──────────┐ │ │
│ │ │ Topic │──▶│ Research │──▶│ Prompt │──▶│ Writer │ │ │
│ │ │ Agent │ │ Agent │ │Generator │ │ Agent │ │ │
│ │ └──────────┘ └──────────┘ └──────────┘ └────┬─────┘ │ │
│ │ │ │ │ │ │ │
│ │ ▼ ▼ ▼ ▼ │ │
│ │ 10_topic.json 20_research 25_structured 40_draft.md │ │
│ │ .json _prompt.json │ │
│ │ │ │ │
│ │ ┌──────────┐ ┌──────────┐ ┌──────────┐ │ │ │
│ │ │ Image │◀──│ Image │◀──│ Reviewer │◀──────┘ │ │
│ │ │Generator │ │ Prompt │ │ Agent │ │ │
│ │ │ Agent │ │ Agent │ └────┬─────┘ │ │
│ │ └────┬─────┘ └────┬─────┘ │ │ │
│ │ │ │ ▼ │ │
│ │ ▼ ▼ 50_review.json │ │
│ │ 80_image.png 70_image_ │ │ │
│ │ prompt.txt ▼ │ │
│ │ 60_final_post.txt │ │
│ └───────────────────────────────────────────────────────────────────────┘ │
└─────────────────────────────────────────────────────────────────────────────┘
│
┌─────────────────┼─────────────────┐
▼ ▼ ▼
┌───────────┐ ┌───────────┐ ┌───────────┐
│ SQLite │ │ Memory │ │ Events │
│ topics.db │ │ Bank │ │ .jsonl │
└───────────┘ │ (RAG) │ └───────────┘
└───────────┘
- Core Logic & Text Agents: Gemini 2.5 Pro
- AI Image Generator: `gemini-2.5-flash-image-preview`
Each agent leverages different LLM capabilities depending on its role:
| Agent | Model | Purpose | Temperature | Est. Tokens/Call |
|---|---|---|---|---|
| Topic Agent | Gemini 2.5 Pro | Generate new topics when database exhausted | 0.7 | ~200 input, ~500 output |
| Research Agent | Gemini 2.5 Pro | Synthesize web search results into structured research | 0.5 | ~1000 input, ~800 output |
| Prompt Generator | Gemini 2.5 Pro | Transform raw topics into Strategic Content Architect prompts | 0.7 | ~800 input, ~600 output |
| Writer Agent | Gemini 2.5 Pro | Draft engaging posts using The Witty Expert persona | 0.8 | ~1000 input, ~1500 output |
| Reviewer Agent | Gemini 2.5 Pro | Perform contextual review and coherence checking | 0.3 | ~1500 input, ~1500 output |
| Image Prompt Generator | Gemini 2.5 Pro | Create descriptive visual prompts from post content | 0.7 | ~1500 input, ~300 output |
| Image Generator | Gemini 2.5 Flash Image | Generate AI images from text prompts | N/A | Fixed cost per image |
Total Estimated Cost per Run: $0.08 - $0.15 USD (varies by content complexity and iteration count)
Note: Strategic Type Agent was deprecated and removed from the pipeline. Writer Agent now receives structured prompts directly.
The application uses nine specialized agents orchestrated in a sequential workflow:
- Orchestrator Agent: Central controller managing the workflow, data flow, and error handling
- Topic Agent: Selects unique topics from the configured field, avoiding recent repeats
- Research Agent: Gathers and synthesizes information on the selected topic
- Prompt Generator Agent (Strategic Content Architect): Transforms raw technical topics into structured prompts with inferred audience insights and fresh analogies
- Strategic Type Agent (deprecated): Designed post structure using a RAG-enabled memory bank of proven content strategies; no longer part of the active pipeline (see note above)
- Writer Agent (The Witty Expert): Drafts engaging posts using a brilliant, witty persona
- Reviewer Agent: Performs contextual sense-checking and grammar/spelling corrections
- Image Prompt Generator Agent: Creates descriptive prompts for image generation
- AI Image Generator Agent: Generates relevant images for the post
The system features three specialized personas defined in system_prompts.md:
Strategic Content Architect (User Prompt Engineer): This persona acts as a prompt engineer, not a content writer. It transforms raw technical topics into structured prompts that fuel the Writer Agent.
Core Responsibilities:
- Infer target audience characteristics beyond surface-level descriptions
- Identify the audience's "Core Pain Point"—the human frustration behind technical challenges
- Create fresh, unexpected analogies (avoiding clichés like "distributed ledger" or "like a library")
- Structure output using a specific template format
Output Template:
**Topic:** [Clear, compelling title]
**Target Audience:** [Specific professional persona]
**Audience's Core Pain Point:** [Emotional/practical frustration]
**Key Metrics/Facts:** [Data points with fresh analogy]
**The Simple Solution/Code Snippet:** [Aha! moment framing]
The Witty Expert: A brilliant professor who hosts a late-night talk show—deeply knowledgeable but allergic to stuffiness.
Voice Characteristics:
- Intellectual sparkle: Makes complex topics genuinely delightful
- Dry wit: Humor that makes you smarter, not just entertained
- Rhythmic confidence: Short paragraphs, strategic emphasis, conversational flow
- Fresh analogies: Every post uses unexpected comparisons that reveal insight
LinkedIn Post Structure:
- Scroll-stopping hook (opens a curiosity gap)
- Relatable problem (addresses human frustration, not just technical issues)
- Elegant solution (code or concept that creates an "Aha!" moment)
- Quantifiable impact (concrete metrics when possible)
- Simple action (one thing the reader can try today)
- Memorable sign-off (reinforces the core insight)
Social Media Visual Strategist: Creates image prompts that capture the post's emotional and conceptual essence.
Key Constraints:
- Zero text: All prompts must explicitly forbid text/words/letters in the image
- Metaphorical: Prefer abstract representations over literal technical imagery
- Visual vocabulary: Subject, environment, lighting, mood must be specified
Each execution follows this sequence:
- Initialization: Creates unique run directory with timestamp
- Topic Selection: Queries database to select/generate a new topic
- Research: Gathers information and synthesizes findings
- Prompt Generation: Transforms topic into structured prompt with audience insights
- Strategy: Designs post structure using proven content strategies
- Writing: Drafts post using The Witty Expert persona
- Review: Two-pass refinement (contextual + grammar)
- Character Count Validation: Ensures post is under 3000 characters (loops back if needed)
- Image Prompt Generation: Creates descriptive prompt for image
- Image Generation: Generates relevant image
- Completion: Marks run as complete
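As a rough illustration of this flow, here is a minimal, self-contained sketch of the sequential run loop. All function names below are hypothetical stand-ins for the real agents; the actual orchestration lives in orchestrator.py.

```python
# Illustrative sketch of the run sequence above -- not the actual orchestrator code.
# select_topic() and write_post() are hypothetical stand-ins for the real agents.
import uuid
from datetime import date
from pathlib import Path


def select_topic() -> str:
    """Stand-in for the Topic Agent (database query or LLM fallback)."""
    return "How to detect data leakage in time-series pipelines"


def write_post(topic: str) -> str:
    """Stand-in for the Research -> Prompt -> Writer -> Reviewer chain."""
    return f"Draft LinkedIn post about: {topic}"


def run_once() -> Path:
    # 1. Initialization: unique run directory with date + short run id
    run_dir = Path("runs") / f"{date.today().isoformat()}-{uuid.uuid4().hex[:6]}"
    run_dir.mkdir(parents=True, exist_ok=True)

    # 2-7. Topic selection through review (collapsed into the two stubs above)
    topic = select_topic()
    post = write_post(topic)

    # 8. Character count validation: loop back to the writer until under 3000 chars
    while len(post) >= 3000:
        post = write_post(topic + " (condensed)")

    # 9-11. Image prompt, image generation, and completion are omitted in this sketch
    (run_dir / "60_final_post.txt").write_text(post, encoding="utf-8")
    return run_dir


print(run_once())
```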
/project-root
├── main.py # Main execution script
├── orchestrator.py # Orchestration logic
├── agents/ # Agent implementations
│ ├── __init__.py
│ ├── topic_agent.py
│ ├── research_agent.py
│ ├── prompt_generator_agent.py
│ ├── strategic_type_agent.py
│ ├── writer_agent.py
│ ├── reviewer_agent.py
│ ├── image_prompt_agent.py
│ └── image_generator_agent.py
├── core/ # Core utilities
│ ├── llm_clients.py
│ ├── rag_setup.py
│ └── ...
├── database/ # Data persistence
│ └── topics.db
├── runs/ # Run artifacts (created at runtime)
│ └── {YYYY-MM-DD}-{runId}/
│ ├── 00_config.json
│ ├── 10_topic.json
│ ├── 20_research.json
│ ├── 25_structured_prompt.json
│ ├── 30_strategy.json
│ ├── 40_draft.md
│ ├── 50_review.json
│ ├── 60_final_post.txt
│ ├── 70_image_prompt.txt
│ ├── 80_image.png
│ ├── prompts/
│ └── completions/
├── memory_bank/ # RAG corpus
│ └── ... (newsletter .txt files)
├── .env # API keys (not in repo)
├── .gitignore
├── requirements.txt
├── system_prompts.md # Agent persona definitions
├── project_spec.md # Detailed specifications
└── README.md
- Python 3.10 or higher
- Google AI Studio API key (Get one here)
- Clone the repository

  ```bash
  git clone https://github.com/lilfetz22/linkedIn-post-automation-multi-agent.git
  cd linkedIn-post-automation-multi-agent
  ```

- Create a virtual environment

  ```bash
  python -m venv venv

  # On Windows
  .\venv\Scripts\activate

  # On macOS/Linux
  source venv/bin/activate
  ```

- Install dependencies

  ```bash
  pip install -r requirements.txt
  ```

- Configure API keys

  Create a `.env` file in the project root:

  ```
  GOOGLE_API_KEY=your_google_ai_studio_api_key_here
  ```
- First-time setup

  On first run, you'll be prompted to enter your field of expertise. You can enter any field you specialize in:

  - Data Science (Optimizations & Time-Series Analysis)
  - Generative AI & AI Agents
  - Software Engineering (Cloud Architecture)
  - DevOps & Infrastructure
  - Machine Learning Operations
  - Or any custom field of your choice

  Your selection is saved in `config.json`.
Run the application from the command line with your virtual environment activated:
```bash
python main.py
```

Test your setup and estimate costs without making any API calls:

```bash
python main.py --dry-run
```

What dry-run mode does:
- ✅ Verifies configuration is valid
- ✅ Creates run directory with unique ID
- ✅ Tests database connectivity
- ✅ Estimates costs for each agent in the pipeline
- ✅ Shows what the first LLM call would be
- ❌ Does NOT make any API calls to Gemini
- ❌ Does NOT incur any costs
Reduce costs by ~85% by skipping image generation (text-only posts):
```bash
# Skip image generation during actual run
python main.py --no-image

# Test text-only cost estimate
python main.py --dry-run --no-image
```

Cost comparison:
- With image: $0.35 - $0.45 (includes fixed $0.30 image generation cost)
- Without image: $0.04 - $0.10 (text-only, varies by token usage)
Image generation costs a fixed $0.30 per image, making up the majority of the total cost. The range accounts for variations in token usage across different topics and iterations. Use --no-image when:
- Budget is limited
- Running multiple iterations for testing
- Post topic works better without visuals
- You plan to add images manually later
Generate multiple LinkedIn posts sequentially in a single command with automatic run numbering:
```bash
# Generate 3 posts sequentially
python main.py --runs 3

# Generate 5 posts without images (cost optimization)
python main.py --runs 5 --no-image

# Multi-run with custom field
python main.py --runs 3 --field "Data Science (Optimizations & Time-Series Analysis)"
```

Folder Naming Convention:

- Single run (default): `2026-01-09-c94d6e`
- Multi-run: `2026-01-09-1-c94d6e`, `2026-01-09-2-f72a1b`, `2026-01-09-3-e5d3c2`
The run number makes it easy to:
- Identify the order of runs on the same day
- Track which posts have already been generated
- Resume workflows without confusion
Behavior:
- Executes runs 1 through N sequentially
- Stops immediately on first failure (fail-fast pattern)
- Prints status summary after each run
- Each run creates an independent folder with all artifacts
Example Multi-Run Output:
LinkedIn Post Automation Multi-Agent System
==================================================
==================================================
Run 1 of 3
==================================================
Status : success
Run ID : 2026-01-09-1-a3f9d2
Run Path : runs/2026-01-09-1-a3f9d2/
...
==================================================
Run 2 of 3
==================================================
Status : success
Run ID : 2026-01-09-2-b4e1f3
Run Path : runs/2026-01-09-2-b4e1f3/
...
==================================================
Run 3 of 3
==================================================
Status : success
Run ID : 2026-01-09-3-c5d2a1
Run Path : runs/2026-01-09-3-c5d2a1/
...
Example dry-run output:
LinkedIn Post Automation Multi-Agent System
==================================================
============================================================
RUN SUMMARY
============================================================
Status : success
Run ID : 2025-12-07-a1b2c3
Run Path : runs/2025-12-07-a1b2c3/
Artifacts :
- runs/2025-12-07-a1b2c3/00_config.json
- runs/2025-12-07-a1b2c3/dry_run_summary.json
Estimated Cost: $0.36 USD
Cost Range: $0.35 - 0.45 (includes $0.30 image generation)
💡 Tip: Use --no-image flag to save $0.30 and skip image generation
Next LLM Call: topic_agent.select_topic()
Model: gemini-2.5-pro (temperature: 0.7)
Note: No API calls were made. Remove --dry-run to execute full pipeline.
Dry-run artifacts:
The dry-run creates a dry_run_summary.json file with detailed cost estimates:
```json
{
  "mode": "dry_run",
  "estimated_costs": {
    "topic_agent": {
      "description": "Select topic from database or generate new one",
      "estimated_tokens": {"input": 200, "output": 500},
      "estimated_cost_usd": 0.0063
    },
    "writer_agent": {
      "description": "Draft LinkedIn post",
      "estimated_tokens": {"input": 1000, "output": 1500},
      "estimated_cost_usd": 0.0163
    }
    // ... other agents
  },
  "total_estimated_cost_usd": 0.12,
  "next_steps": {
    "first_llm_call": "topic_agent.select_topic()",
    "model": "gemini-2.5-pro",
    "temperature": 0.7
  }
}
```

Each run generates:
- Text file: `60_final_post.txt` - The completed LinkedIn post
- Image file: `80_image.png` - AI-generated accompanying image (unless the `--no-image` flag is used)
- Artifacts: Complete audit trail in the run directory
Navigate to runs/{YYYY-MM-DD}-{runId}/ to inspect:
- All intermediate agent outputs (JSON files)
- Draft versions and revisions
- LLM prompts and completions
- Generated images
Below is a complete example from an actual run, showing the artifact structure and sample content:
runs/
└── 2025-11-23-57e8c3/
├── 00_config.json # Run configuration
├── 10_topic.json # Selected topic
├── 20_research.json # Research synthesis
├── 25_structured_prompt.json # Prompt Generator output
├── 30_strategy.json # Strategic structure (deprecated)
├── 40_draft.md # Writer Agent draft
├── 50_review.json # Reviewer Agent output
├── 60_final_post.txt # Approved LinkedIn post
├── 70_image_prompt.txt # Image generation prompt
└── 80_image.png # Generated image (1:1 ratio)
Sample 10_topic.json:
```json
{
  "topic": "How to detect data leakage in time-series pipelines"
}
```

Sample 25_structured_prompt.json:
```json
{
  "topic_title": "How To Detect Data Leakage In Time-Series Pipelines",
  "target_audience": "Senior engineers scaling AI/Data systems",
  "pain_point": "Hard to translate complexity into crisp narrative",
  "key_metrics": ["Latency reduction %", "Throughput", "Retrieval hit-rate"],
  "analogy": "Like tuning an orchestra so each instrument supports the melody without noise.",
  "solution_outline": "Stepwise breakdown + strategic framing + wit hooks"
}
```

Sample 70_image_prompt.txt:
High-resolution conceptual illustration reflecting: 'How To Detect Data Leakage
In Time-Series Pipelines'. Modern minimal style, subtle gradients, clean
typography accent, professional tone. Zero text or words in the image.
All system events are logged to events.jsonl at the project root. Each line is a JSON object containing:
- Timestamp
- Run ID
- Step name
- Attempt number
- Status (ok/error)
- Duration
- Token usage
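As a sketch of how such an append-only log can be produced, the helper below writes one JSON object per line with the fields listed above. The function name and exact field names are illustrative, not the project's actual logging API.

```python
# Illustrative JSONL event logger -- field names mirror the list above,
# but the helper itself is a hypothetical sketch, not the project's API.
import json
import time
from datetime import datetime, timezone


def log_event(run_id: str, step: str, attempt: int, status: str,
              duration_s: float, token_usage: dict | None = None,
              path: str = "events.jsonl") -> None:
    event = {
        "timestamp": datetime.now(timezone.utc).isoformat(),
        "run_id": run_id,
        "step": step,
        "attempt": attempt,
        "status": status,                  # "ok" or "error"
        "duration_s": round(duration_s, 3),
        "token_usage": token_usage or {},
    }
    with open(path, "a", encoding="utf-8") as f:
        f.write(json.dumps(event) + "\n")  # one JSON object per line


start = time.time()
log_event("2025-12-07-a1b2c3", "writer", 1, "ok",
          time.time() - start, {"input": 1234, "output": 567})
```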
The application maintains two tables:
previous_topics
- Tracks all topics used in previous posts
- Prevents repetition
potential_topics
- Stores curated topic ideas by field
- Used by Topic Agent for generation
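A minimal sketch of what these two tables might look like is below. Column names are assumptions for illustration only; the authoritative schema is created by database/init_db.py.

```python
# Hypothetical schema sketch for database/topics.db -- column names are assumptions.
import sqlite3
from pathlib import Path

Path("database").mkdir(exist_ok=True)
conn = sqlite3.connect("database/topics.db")
conn.executescript("""
CREATE TABLE IF NOT EXISTS previous_topics (
    id      INTEGER PRIMARY KEY AUTOINCREMENT,
    field   TEXT NOT NULL,
    topic   TEXT NOT NULL,
    used_at TEXT NOT NULL              -- timestamp of the run that used the topic
);
CREATE TABLE IF NOT EXISTS potential_topics (
    id      INTEGER PRIMARY KEY AUTOINCREMENT,
    field   TEXT NOT NULL,
    topic   TEXT NOT NULL UNIQUE       -- curated topic ideas, grouped by field
);
""")
conn.commit()
conn.close()
```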
The memory_bank/ directory contains the corpus for the Strategic Type Agent's RAG system:
- Newsletter content files (.txt)
- Proven content strategies
- Structural templates
The system includes robust error handling:
- Automatic Retries: Up to 3 attempts with exponential backoff for transient errors
- Fallback Strategies: Simplified prompts, alternative approaches when primary methods fail
- Circuit Breaker: Aborts run after 3 consecutive LLM failures
- Artifact Integrity: Immediate validation of all written files
- Failure Logging: Detailed `run_failed.json` created when a run is aborted
All agents return standardized responses:
```json
{
  "status": "ok" | "error",
  "data": { ... },
  "error": {
    "type": "ErrorType",
    "message": "...",
    "retryable": true
  },
  "metrics": { ... }
}
```

The system uses a structured error hierarchy defined in `core/errors.py`. Understanding these error types helps with debugging and extending the system.
| Error Type | Retryable | Description | Example Scenarios |
|---|---|---|---|
| `ValidationError` | ❌ No | Agent output fails validation checks | Character count ≥3000, missing required fields, invalid JSON schema |
| `DataNotFoundError` | ❌ No* | Expected data cannot be retrieved | Research finds no sources, empty database query, RAG yields no docs |
| `ModelError` | ✅ Yes | LLM API call failures | API timeout, rate limiting (429), service unavailable (503) |
| `CorruptionError` | ❌ No | Artifact persistence/parsing failures | JSON parse error after write, disk write failure, file corruption |
*DataNotFoundError triggers fallback strategies (e.g., topic pivot) rather than retries.
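The hierarchy could be sketched roughly as follows; the base class name and the `retryable` attribute are assumptions inferred from the table above, not the exact contents of core/errors.py.

```python
# Illustrative error hierarchy -- names beyond the four error types are assumptions.
class AgentError(Exception):
    """Hypothetical common base class for pipeline errors."""
    retryable: bool = False


class ValidationError(AgentError):    # output fails validation checks (not retryable)
    retryable = False


class DataNotFoundError(AgentError):  # triggers fallback strategies instead of retries
    retryable = False


class ModelError(AgentError):         # transient LLM/API failures (retryable)
    retryable = True


class CorruptionError(AgentError):    # artifact persistence/parsing failures
    retryable = False
```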
Error Handling Flow:
┌─────────────┐ ┌─────────────────┐ ┌─────────────────┐
│ Agent Call │────▶│ Error Occurred? │────▶│ Retryable? │
└─────────────┘ └────────┬────────┘ └────────┬────────┘
│ No │
▼ ▼
┌────────────────┐ ┌───────────────────┐
│ Return Success │ │ Yes: Retry with │
│ Envelope │ │ exponential backoff│
└────────────────┘ └─────────┬─────────┘
│
┌─────────▼─────────┐
│ Max retries (3)? │
└─────────┬─────────┘
│ Yes
┌─────────▼─────────┐
│ Abort run, create │
│ run_failed.json │
└───────────────────┘
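The retry branch in the diagram above can be sketched as a simple loop with exponential backoff. `ModelError` is defined locally here as a stand-in so the snippet is self-contained; the real retry logic lives in the orchestrator.

```python
# Illustrative retry-with-exponential-backoff loop -- not the orchestrator's actual code.
import time


class ModelError(Exception):  # local stand-in for the retryable error type
    pass


MAX_RETRIES = 3


def call_with_retries(agent_call, *args, **kwargs):
    for attempt in range(1, MAX_RETRIES + 1):
        try:
            return agent_call(*args, **kwargs)
        except ModelError:
            if attempt == MAX_RETRIES:
                raise                  # orchestrator then aborts and writes run_failed.json
            time.sleep(2 ** attempt)   # exponential backoff: 2s, 4s, ...
```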
Circuit Breaker Pattern:
The system implements a circuit breaker that opens after 3 consecutive LLM failures across any agents:
```python
# Circuit breaker state
consecutive_failures = 0  # Resets to 0 on any success
CIRCUIT_BREAKER_THRESHOLD = 3

# When breaker trips:
raise CircuitBreakerTrippedError("Circuit breaker tripped after 3 consecutive LLM failures")
```

The system includes comprehensive cost tracking and safety limits to prevent unexpected API expenses.
Token Counting: Before each LLM call, the system counts input tokens using Gemini's native token counter (fallback to heuristic: ~4 chars per token if offline).
Cost Calculation:
- Gemini 2.5 Pro: $1.25 per 1M input tokens + $10.00 per 1M output tokens
- Gemini 2.5 Flash Image: $0.30 per image (estimated)
Per-Call Tracking: Each agent call records:
```json
{
  "model": "gemini-2.5-pro",
  "input_tokens": 1234,
  "output_tokens": 567,
  "cost_usd": 0.0072
}
```

Aggregation: Orchestrator accumulates costs across all agents and persists the breakdown in run metrics.
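The per-call figure above can be reproduced directly from the published rates. The sketch below also shows the ~4-characters-per-token offline heuristic mentioned earlier; the function names are illustrative.

```python
# Reproduces the example above (1234 input + 567 output tokens on Gemini 2.5 Pro).
INPUT_RATE_PER_M = 1.25    # USD per 1M input tokens
OUTPUT_RATE_PER_M = 10.00  # USD per 1M output tokens


def estimate_tokens(text: str) -> int:
    """Offline fallback heuristic: roughly 4 characters per token."""
    return max(1, len(text) // 4)


def call_cost_usd(input_tokens: int, output_tokens: int) -> float:
    return (input_tokens / 1_000_000) * INPUT_RATE_PER_M \
         + (output_tokens / 1_000_000) * OUTPUT_RATE_PER_M


print(round(call_cost_usd(1234, 567), 4))  # -> 0.0072
```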
The system enforces two types of budget limits:
- API Call Limit: Maximum 25 API calls per run (default)
  - Prevents infinite retry loops
  - Configurable via `CostTracker(max_api_calls=N)`
- Cost Limit: Maximum $3.00 USD per run (default)
  - Proactive budget checking before each call
  - Configurable via `CostTracker(max_cost_usd=N)`
Budget Enforcement: If a call would exceed limits, raises ValidationError and aborts immediately.
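A minimal sketch of how a tracker could enforce both limits before each call is shown below. This illustrates the behavior described above, not the project's actual `CostTracker` implementation; the exception class here is a local stand-in for the ValidationError the system raises.

```python
# Hypothetical budget-enforcement sketch -- not the real CostTracker class.
class BudgetExceeded(Exception):  # stand-in for the ValidationError raised on a breach
    pass


class CostTracker:
    def __init__(self, max_api_calls: int = 25, max_cost_usd: float = 3.00):
        self.max_api_calls = max_api_calls
        self.max_cost_usd = max_cost_usd
        self.api_calls = 0
        self.cost_usd = 0.0

    def check_before_call(self, estimated_cost_usd: float) -> None:
        """Proactively abort if the next call would exceed either limit."""
        if self.api_calls + 1 > self.max_api_calls:
            raise BudgetExceeded(f"API call limit of {self.max_api_calls} reached")
        if self.cost_usd + estimated_cost_usd > self.max_cost_usd:
            raise BudgetExceeded(f"Cost limit of ${self.max_cost_usd:.2f} would be exceeded")

    def record_call(self, actual_cost_usd: float) -> None:
        self.api_calls += 1
        self.cost_usd += actual_cost_usd


tracker = CostTracker()
tracker.check_before_call(0.0072)
tracker.record_call(0.0072)
```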
Dry-Run Mode:
```bash
python main.py --dry-run
```

This will execute all logic up to the first LLM call, allowing you to:
- Verify configuration
- Test run directory setup
- Estimate costs without spending money
Real-time: Check events.jsonl for per-agent token usage:
{"step": "writer", "token_usage": {"input": 1234, "output": 567}, "cost_usd": 0.0072}Post-run: Review runs/{run_id}/metrics.json for complete breakdown:
```json
{
  "total_cost_usd": 0.12,
  "costs_by_agent": {
    "topic": 0.005,
    "research": 0.015,
    "writer": 0.045,
    "reviewer": 0.035,
    "image": 0.020
  }
}
```

Cost Alerts:
- Warning logged if single run exceeds $0.50
- Abort if single run exceeds configured `max_cost_usd`
Reduce API Calls:
- Use database-seeded topics (no LLM call for Topic Agent)
- Minimize character count loop iterations (better Writer prompts)
Optimize Token Usage:
- Keep research summaries concise
- Use lower temperatures where appropriate (Reviewer: 0.3)
- Pre-validate inputs to avoid failed calls
Edit config.json to change your field. You can use any field of expertise you wish:
```json
{
  "field": "Your custom field here (e.g., Data Science, Machine Learning, Cloud Architecture)"
}
```

Example fields:
"Data Science (Optimizations & Time-Series Analysis)""Generative AI & AI Agents""Software Engineering (Cloud Architecture)""DevOps & Infrastructure""Machine Learning Operations"- Any other field you specialize in
How field choice affects topic selection and cost
- The two pre-seeded fields, "Data Science (Optimizations & Time-Series Analysis)" and "Generative AI & AI Agents", use the `potential_topics` table in `database/topics.db` for topic selection. For these, the Topic Agent selects topics via SQLite queries only (no extra LLM call for topic selection).
- When you use any other custom field (for example, "Software Engineering (Cloud Architecture)" or your own niche), the Topic Agent typically will not find matching topics in the database and will fall back to LLM-based topic generation. This requires an additional LLM API call and therefore increases cost for each run that needs a new topic.
- If you want to use a custom field without paying the extra LLM cost for topic selection, you can seed your own topics into the database (for your custom field value) using `database/init_db.py` or a similar initialization step, as shown in the sketch below. Once seeded, your custom field behaves like the pre-seeded ones and uses database queries first.
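As a sketch of that seeding step (table and column names here are assumptions; align them with whatever database/init_db.py actually creates):

```python
# Hypothetical example of pre-seeding topics for a custom field.
# The potential_topics columns used here are assumptions, not the verified schema.
import sqlite3

FIELD = "Software Engineering (Cloud Architecture)"
TOPICS = [
    "Designing idempotent deployment pipelines",
    "When to choose queues over synchronous APIs",
    "Cutting cold-start latency in serverless workloads",
]

conn = sqlite3.connect("database/topics.db")
conn.executemany(
    "INSERT OR IGNORE INTO potential_topics (field, topic) VALUES (?, ?)",
    [(FIELD, t) for t in TOPICS],
)
conn.commit()
conn.close()
```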
The system enforces a 3000-character limit for LinkedIn posts. If a post exceeds this, it's automatically sent back to the Writer Agent for shortening.
Key libraries include:
- `google-generativeai` - Google's Gemini AI models
- `python-dotenv` - Environment variable management
- `sqlite3` - Topic database (built-in)
- RAG/vector embeddings: `langchain`, `llama-index`, or `chromadb`

See requirements.txt for the complete list.
This project enforces code quality standards using flake8. To maintain code quality:
Run linting checks:
```bash
python -m flake8 .
```

Check for bare except statements specifically:

```bash
python -m flake8 . --select=E722
```

Run tests (including linting checks):

```bash
pytest tests/
```

Bare `except:` statements (without specifying an exception type) are prohibited because they:
- Catch ALL exceptions, including SystemExit and KeyboardInterrupt
- Make debugging extremely difficult by silently swallowing errors
- Can mask serious bugs and make the application fail silently
Instead of:
```python
try:
    risky_operation()
except:  # ❌ This will cause linting to fail
    pass
```

Use specific exceptions:
```python
try:
    risky_operation()
except ValueError as e:  # ✅ Specific exception
    logger.error(f"Invalid value: {e}")
except Exception as e:  # ✅ Or catch Exception (but not bare)
    logger.error(f"Unexpected error: {e}")
```

The flake8 E722 rule ensures this standard is enforced across the codebase.
Planned enhancements:
- Scheduling capabilities for automatic daily posting
- Integration with LinkedIn API for direct posting
- Analytics dashboard for post performance tracking
- Multi-platform support (Twitter, Medium, etc.)
- Custom persona training and fine-tuning
Detailed agent personas and behavioral guidelines are maintained in system_prompts.md. Consult this file when:
- Understanding agent behavior
- Debugging output quality
- Extending or modifying agents
The system prompts in system_prompts.md are the core intellectual property of this system. They define agent personas and behavior patterns that directly impact output quality.
File Structure: system_prompts.md contains three main sections:
- Strategic Content Architect - User Prompt Engineer: Transforms raw topics into structured prompts
- The Witty Expert Persona: Drafts engaging, memorable LinkedIn posts
- Social Media Visual Strategist: Creates image prompts from post content
Editing Guidelines:
- Preserve Template Structures: Each persona includes exact templates (marked with `**Topic:**`, `**Target Audience:**`, etc.). These must remain intact—agents parse these fields programmatically.
- Maintain Persona Voice: When revising instructions:
  - Strategic Content Architect: Focus on inferring audience insights, not just summarizing
  - Witty Expert: Balance intellectual depth with conversational accessibility
  - Visual Strategist: Emphasize emotional hooks and zero-text constraints
- Update Analogy Guidelines: The "fresh analogy" requirement is critical. When adding examples:
  - Add new cliché phrases to avoid (e.g., "like a library", "distributed ledger")
  - Provide 2-3 new fresh analogy examples
  - Explain why the analogy works (unexpected + accurate)
- Atomic Changes: Update one persona at a time. Test before moving to the next.
Example Edit (adding a new cliché to avoid):
```
**Do not use stale or common analogies.**
❌ Avoid: "distributed ledger", "like a library", "like a recipe", "like a Swiss Army knife"
✅ Use: Unexpected, specific comparisons that reveal insight
```

Testing Workflow:
- Before Making Changes:

  ```bash
  # Run a baseline test to capture current behavior
  python -m scripts.smoke_test --field "Data Science" --max-cost 0.15

  # Save artifacts from runs/{run_id}/ for comparison
  ```

- After Making Changes:

  ```bash
  # Run with same topic to isolate prompt impact
  python -m scripts.smoke_test --field "Data Science" --max-cost 0.15
  ```
- Compare Outputs:
  - Prompt Generator: Check `25_structured_prompt.json` for template compliance
  - Writer: Review `40_draft.md` for persona fidelity (wit, analogies, structure)
  - Image Prompt: Verify `70_image_prompt.txt` specifies no text/words
Automated Persona Fidelity Tests:
The test suite includes persona compliance checks in tests/test_agents/:
```bash
# Run persona-specific tests
pytest tests/test_agents/test_prompt_generator_agent.py::test_persona_fidelity -v
pytest tests/test_agents/test_writer_agent.py::test_witty_expert_persona -v
```

What Tests Check:
pytest tests/test_agents/test_writer_agent.py::test_witty_expert_persona -vWhat Tests Check:
- Template sections present (Topic, Audience, Pain Point, etc.)
- Cliché detection (fails if blacklisted phrases appear)
- Structure validation (hook → problem → solution → impact format)
- Analogy freshness (heuristic: checks for unexpected word combinations)
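As an illustration, a cliché-detection check along these lines could look like the snippet below; the blacklist contents and test name are examples only, and the real checks live in tests/test_agents/.

```python
# Illustrative persona-compliance check -- the blacklist here is an example, not the real one.
CLICHE_BLACKLIST = [
    "distributed ledger",
    "like a library",
    "like a recipe",
    "like a swiss army knife",
]


def find_cliches(post: str) -> list[str]:
    text = post.lower()
    return [phrase for phrase in CLICHE_BLACKLIST if phrase in text]


def test_no_blacklisted_analogies():
    draft = "Think of backtesting like a wind tunnel for your trading model."
    assert find_cliches(draft) == []
```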
Manual Review Checklist:
- Template fields are all present and populated
- Analogies are fresh (not in cliché blacklist)
- Tone matches persona (witty but not silly, deep but not stuffy)
- Post structure follows LinkedIn framework (hook → problem → solution)
- Character count stays under 3000 (with reasonable iteration count)
- Image prompt contains visual keywords but no text instructions
Regression Testing:
```bash
# Run full integration test with mocked LLMs
pytest tests/test_integration/test_llm_pipeline.py -v

# Verify all agents still produce valid envelopes
pytest tests/test_agents/ -v
```

Rollback Strategy: If prompt changes degrade output quality:
pytest tests/test_agents/ -vRollback Strategy: If prompt changes degrade output quality:
- Use
git log system_prompts.mdto identify recent changes - Revert with
git checkout <commit-hash> -- system_prompts.md - Re-run tests to confirm restoration
Best Practices:
- Keep a changelog of prompt modifications in git commit messages
- Test with multiple topics to ensure consistency
- Monitor `events.jsonl` for increased retry rates (indicates prompt issues)
- Compare costs before/after (poorly-worded prompts may require more tokens)
- Get human review on drafts before deploying prompt changes to production
Run the full test suite:
```bash
pytest tests/
```

Run with verbose output:

```bash
pytest tests/ -v
```

Run specific test categories using markers:
```bash
# Unit tests only
pytest tests/ -m unit

# Integration tests only
pytest tests/ -m integration

# Persona compliance tests
pytest tests/ -m persona
```

For Windows users, a convenience script is provided:
```powershell
# Run all tests
.\run_tests.ps1

# Run with coverage
.\run_tests.ps1 -Coverage

# Generate HTML coverage report
.\run_tests.ps1 -CoverageHtml

# Run specific test category
.\run_tests.ps1 -Unit -Verbose

# Run specific file
.\run_tests.ps1 -File tests/test_error_handling.py

# Show help
.\run_tests.ps1 -Help
```

Run tests with coverage reporting:
Note: Coverage source paths are configured in `.coveragerc`. You do not need to specify `--cov=...` for each module; simply use `--cov` and pytest will pick up the correct configuration automatically.
```bash
# Run with coverage and show missing lines
pytest tests/ --cov --cov-report=term-missing

# Generate HTML coverage report
pytest tests/ --cov --cov-report=html

# Generate both HTML and XML reports (for CI)
pytest tests/ --cov --cov-report=html --cov-report=xml
```

View the HTML report by opening `htmlcov/index.html` in your browser.
| Metric | Current Target | Ultimate Goal |
|---|---|---|
| Overall Coverage | ≥75% | >85% |
| Branch Coverage | Enabled | Enabled |
| Core Module | ≥80% | >90% |
| Agents Module | ≥70% | >85% |
The test suite includes several specialized test files:
- `test_error_handling.py`: Tests for error classification, retry logic, circuit breaker behavior, and error propagation
- `test_artifact_persistence.py`: Tests for atomic file writes, JSON verification, corruption detection, and concurrent write safety
- `test_persona_compliance.py`: Tests that validate agent outputs against persona guidelines (Strategic Content Architect, The Witty Expert, Visual Strategist)
- `test_orchestrator.py`: Integration tests for the complete pipeline orchestration
- `test_agents/*.py`: Unit tests for each individual agent
Coverage is enforced via pytest-cov with a minimum threshold of 75%. To run the same checks as CI:
```bash
pytest tests/ --cov=agents --cov=core --cov=database --cov-fail-under=75
```

API Key Errors
- Verify `.env` file exists and contains a valid `GOOGLE_API_KEY`
- Check API key has proper permissions in Google AI Studio

Database Errors
- Ensure `database/` directory exists
- Check file permissions for `topics.db`

Run Failures
- Check `events.jsonl` for detailed error logs
- Review `run_failed.json` in the failed run directory
- Verify internet connection for API calls

Character Count Loop
- If posts consistently exceed 3000 characters, check Writer Agent prompt
- Review `40_draft.md` and `50_review.json` for excessive verbosity
This project is currently in active development. Contributions, issues, and feature requests are welcome!
This project follows the Conventional Commits specification for commit messages. This enables automated changelog generation and semantic versioning.
<type>(<scope>): <subject>
<body>
<footer>
- feat: A new feature
- fix: A bug fix
- docs: Documentation only changes
- style: Changes that don't affect code meaning (formatting, whitespace)
- refactor: Code change that neither fixes a bug nor adds a feature
- perf: Performance improvement
- test: Adding or correcting tests
- chore: Changes to build process or auxiliary tools
feat(agents): add prompt generator agent with Strategic Content Architect persona
fix(orchestrator): correct character count loop termination condition
docs(readme): add conventional commits documentation
test(agents): add persona fidelity tests for Writer Agent
For breaking changes, add BREAKING CHANGE: in the footer:
feat(api)!: change agent envelope structure
BREAKING CHANGE: Agent return format now requires 'metrics' field
Development progress is tracked in ROADMAP.md. The project follows a phased approach:
| Phase | Description | Status |
|---|---|---|
| Phase 0-3 | Core skeleton, infrastructure, database | ✅ Complete |
| Phase 4-6 | Agent implementations, orchestrator, main entry point | ✅ Complete |
| Phase 7 | LLM integration (critical) | ✅ Complete |
| Phase 8 | Testing infrastructure & coverage | ✅ Complete |
| Phase 9 | Dependency management | ✅ Complete |
| Phase 10 | Documentation enhancements | ✅ Complete |
| Phase 11 | CI/CD workflows | ✅ Complete |
| Phase 12 | Fallback & resilience edge cases | ✅ Complete |
| Phase 13-14 | Performance & future enhancements | 📋 Backlog |
- M1: Core skeleton + database ready
- M2: All agents + orchestrator functional (stubs)
- M3: LLM Integration - Real agent intelligence ✅
- M4: Memory bank + config + tests with high coverage ✅
- M5: Stable release + CI/CD + docs
- M6: Resilience edge cases + enhancements
For detailed task breakdown, see ROADMAP.md.
[License information to be added]
- Google's Gemini AI for powering the agent system
- "The Tech Audience Accelerator" newsletter for content strategy insights
For questions or support, please open an issue on GitHub.
Note: This project is designed for local execution and requires an active internet connection for API calls to Google's Gemini services.