Local AI load balancer for Ollama fleets — auto-discovery, smart routing, OpenAI-compatible API, zero config. Perfect for Mac Minis & Studios.
python load-balancer embeddings zero-config fleet-management multimodal fastapi edge-ai apple-silicon llm local-llm ollama ai-gateway ai-infrastructure self-hosted-ai openai-compatible ai-router inference-router ollama-cluster
Updated Apr 27, 2026 - Python